The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
We investigate deep learning autoencoders for the unsupervised recognition of phase transitions in physical systems formulated on a lattice. We focus our investigation on the 2-dimensional ferromagnetic Ising model and then test the application of the autoencoder on the anti-ferromagnetic Ising model. We use spin configurations produced for the 2-dimensional ferromagnetic and anti-ferromagnetic Ising model in zero external magnetic field. For the ferromagnetic Ising model, we study numerically the relation between one latent variable extracted from the autoencoder to the critical temperature Tc. The proposed autoencoder reveals the two phases, one for which the spins are ordered and the other for which spins are disordered, reflecting the restoration of the ℤ2 symmetry as the temperature increases. We provide a finite volume analysis for a sequence of increasing lattice sizes. For the largest volume studied, the transition between the two phases occurs very close to the theoretically extracted critical temperature. We define as a quasi-order parameter the absolute average latent variable z̃, which enables us to predict the critical temperature. One can define a latent susceptibility and use it to quantify the value of the critical temperature Tc(L) at different lattice sizes and that these values suffer from only small finite scaling effects. We demonstrate that Tc(L) extrapolates to the known theoretical value as L →∞ suggesting that the autoencoder can also be used to extract the critical temperature of the phase transition to an adequate precision. Subsequently, we test the application of the autoencoder on the anti-ferromagnetic Ising model, demonstrating that the proposed network can detect the phase transition successfully in a similar way. Graphical abstract
This study demonstrates that a virtual registry can be used for the preselection of participants for AD studies.
Current bioinformatics tools accomplish high accuracies in classifying allergenic protein sequences with high homology and generally perform poorly with low homology protein sequences. Although some homologous regions explained Immunoglobulin E (IgE) cross-reactivity in groups of allergens, no universal molecular structure could be associated with allergenicity. In addition, studies have showed that cross-reactivity is not directly linked to the homology between protein sequences. Therefore, a new homology independent method needs to be developed to determine if a protein is an allergen or not. The aim of this study is therefore to differentiate sets of allergenic and non-allergenic proteins using a signal-processing based bioinformatics approach. In this paper, a new method was proposed for characterisation and classification of allergenic protein sequences. For this method hydrophobicity amino acid index was used to encode proteins to numerical sequences and Discrete Fourier Transform to extract features for each protein. Finally, a classifier was constructed based on Support Vector Machines. In order to demonstrate the applicability of the proposed method 857 allergen and 1000 non-allergen proteins were collected from UniProt online database. The results obtained from the proposed method yielded: MCC: 0.752 ± 0.007, Specificity: 0.912 ± 0.005, Sensitivity: 0.835 ± 0.008 and Total Accuracy: 87.65% ± 0.004.
Despite the modern advances in the available computational resources, the length and time scales of the physical systems that can be studied in full atomic detail, via molecular simulations, are still limited. To overcome such limitations, coarse-grained (CG) models have been developed to reduce the dimensionality of the physical system under study. However, to study such systems at the atomic level, it is necessary to re-introduce the atomistic details into the CG description. Such an ill-posed mathematical problem is typically treated via numerical algorithms, which need to balance accuracy, efficiency, and general applicability. Here, we introduce an efficient and versatile method for backmapping multi-component CG macromolecules of arbitrary microstructures. By utilizing deep learning algorithms, we train a convolutional neural network to learn structural correlations between polymer configurations at the atomistic and their corresponding CG descriptions, obtained from atomistic simulations. The trained model is then utilized to get predictions of atomistic structures from input CG configurations. As an illustrative example, we apply the convolutional neural network to polybutadiene copolymers of various microstructures, in which each monomer microstructure (i.e., cis-1,4, trans-1,4, and vinyl-1,2) is represented as a different CG particle type. The proposed methodology is transferable over molecular weight and various microstructures. Moreover, starting from a specific single CG configuration with a given microstructure, we show that by modifying its chemistry (i.e., CG particle types), we are able to obtain a set of well equilibrated polymer configurations of different microstructures (chemistry) than the one of the original CG configuration.
Adaptive radiation and evolutionary stasis are characterized by very different evolution rates. The main aim of this study was to investigate if any genes have a special role to a high or low evolution rate. The availability of animal genomes permitted comparison of gene content of genomes of 24 vertebrate species that evolved through adaptive radiation (representing high evolutionary rate) and of 20 vertebrate species that are considered as living fossils (representing a slow evolutionary rate or evolutionary stasis). Mammals, birds, reptiles, and bony fishes were included in the analysis. Pathway analysis was performed for genes found to be specific in adaptive radiation or evolutionary stasis respectively. Pathway analysis revealed that DNA repair and cellular response to DNA damage are important (false discovery rate = 8.35 × 10−5; 7.15 × 10−6, respectively) for species evolved through adaptive radiation. This was confirmed by further genetic in silico analysis (p = 5.30 × 10−3). Nucleotide excision repair and base excision repair were the most significant pathways. Additionally, the number of DNA repair genes was found to be linearly related to the genome size and the protein number (proteome) of the 44 animals analyzed (p < 1.00 × 10−4), this being compatible with Drake’s rule. This is the first study where radiated and living fossil species have been genetically compared. Evidence has been found that cancer-related genes have a special role in radiated species. Linear association of the number of DNA repair genes with the species genome size has also been revealed. These comparative genetics results can support the idea of punctuated equilibrium evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.