We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.
second model, the two main conditions were parametrically modulated by the two categories, respectively (SOM, S5.1). The activation of the precuneus was higher for hard dominance-solvable games than for easy ones ( Fig. 4A and table S10). The activation of the insula was higher for the highly focal coordination games than for less focal ones ( Fig. 4B and table S11). Previous studies also found that precuneus activity increased when the number of planned moves increased (40, 41). The higher demand for memory-related imagery and memory retrieval may explain the greater precuneus activation in hard dominance-solvable games. In highly focal coordination games, the participants may have felt quite strongly that the pool students must notice the same salient feature. This may explain why insula activation correlates with NCI.Participants might have disagreed about which games were difficult. We built a third model to investigate whether the frontoparietal activation correlates with how hard a dominance-solvable game is and whether the activation in insula and ACC correlates with how easy a coordination game is. Here, the two main conditions were parametrically modulated by each participant's probability of obtaining a reward in each game (SOM, S2.2 and S5.2). We found a negative correlation between the activation of the precuneus and the participant's probability of obtaining a reward in dominance-solvable games ( Fig. 4C and table S12), which suggests that dominance-solvable games that yielded lower payoffs presented harder mental challenges. In a previous study on working memory, precuneus activity positively correlated with response times, a measure of mental effort (24). Both findings are consistent with the interpretation that subjective measures reflecting harder tasks (higher efforts) correlate with activation in precuneus. A positive correlation between insula activation and the participant's probability of obtaining a reward again suggests that coordination games with a highly salient feature strongly activated the "gut feeling" reported by many participants (Fig. 4D and table S13). A previous study found that the subjective rating of "chills intensity" in music correlates with activation of insula (42). Both findings are consistent with the interpretation that the subjective intensity of how salient a stimulus is correlates with activation in insula.As mentioned, choices were made significantly faster in coordination games than in dominancesolvable games. The results of the second and third models provide additional support for the idea that intuitive and deliberative mental processes have quite different properties. The "slow and effortful" process was more heavily taxed when the dominance-solvable games were harder. The "fast and effortless" process was more strongly activated when coordination was easy.
The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.