This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
The relationship of cell proliferation to the temporal expression of genes characterizing a developmental sequence associated with bone cell differentiation was examined in primary diploid cultures of fetal calvarial derived osteoblasts by the combined use of autoradiography, histochemistry, biochemistry, and mRNA assays of osteoblast cell growth and phenotypic genes. Modifications in gene expression define a developmental sequence that has 1) three principle periods--proliferation, extracellular matrix maturation, and mineralization--and 2) two restriction points to which the cells can progress but cannot pass without further signals--the first when proliferation is down-regulated and gene expression associated with extracellular matrix maturation is induced, and the second when mineralization occurs. Initially, actively proliferating cells, expressing cell cycle- and cell growth-regulated genes, produce a fibronectin/type I collagen extracellular matrix. A reciprocal and functionally coupled relationship between the decline in proliferative activity and the subsequent induction of genes associated with matrix maturation and mineralization is supported by 1) a temporal sequence of events in which there is an enhanced expression of alkaline phosphatase immediately following the proliferative period, and later, an increased expression of osteocalcin and osteopontin at the onset of mineralization; 2) increased expression of a specific subset of osteoblast phenotype markers, alkaline phosphatase and osteopontin, when proliferation is inhibited by hydroxyurea; and 3) enhanced levels of expression of the osteoblast markers as a function of ascorbic acid-induced collagen deposition, suggesting that the extracellular matrix contributes to both the shutdown of proliferation and the development of the osteoblast phenotype.
The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.
The major histocompatibility complex (MHC) is the most important region in the vertebrate genome with respect to infection and autoimmunity, and is crucial in adaptive and innate immunity. Decades of biomedical research have revealed many MHC genes that are duplicated, polymorphic and associated with more diseases than any other region of the human genome. The recent completion of several large-scale studies offers the opportunity to assimilate the latest data into an integrated gene map of the extended human MHC. Here, we present this map and review its content in relation to paralogy, polymorphism, immune function and disease.
Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.
second model, the two main conditions were parametrically modulated by the two categories, respectively (SOM, S5.1). The activation of the precuneus was higher for hard dominance-solvable games than for easy ones ( Fig. 4A and table S10). The activation of the insula was higher for the highly focal coordination games than for less focal ones ( Fig. 4B and table S11). Previous studies also found that precuneus activity increased when the number of planned moves increased (40, 41). The higher demand for memory-related imagery and memory retrieval may explain the greater precuneus activation in hard dominance-solvable games. In highly focal coordination games, the participants may have felt quite strongly that the pool students must notice the same salient feature. This may explain why insula activation correlates with NCI.Participants might have disagreed about which games were difficult. We built a third model to investigate whether the frontoparietal activation correlates with how hard a dominance-solvable game is and whether the activation in insula and ACC correlates with how easy a coordination game is. Here, the two main conditions were parametrically modulated by each participant's probability of obtaining a reward in each game (SOM, S2.2 and S5.2). We found a negative correlation between the activation of the precuneus and the participant's probability of obtaining a reward in dominance-solvable games ( Fig. 4C and table S12), which suggests that dominance-solvable games that yielded lower payoffs presented harder mental challenges. In a previous study on working memory, precuneus activity positively correlated with response times, a measure of mental effort (24). Both findings are consistent with the interpretation that subjective measures reflecting harder tasks (higher efforts) correlate with activation in precuneus. A positive correlation between insula activation and the participant's probability of obtaining a reward again suggests that coordination games with a highly salient feature strongly activated the "gut feeling" reported by many participants (Fig. 4D and table S13). A previous study found that the subjective rating of "chills intensity" in music correlates with activation of insula (42). Both findings are consistent with the interpretation that the subjective intensity of how salient a stimulus is correlates with activation in insula.As mentioned, choices were made significantly faster in coordination games than in dominancesolvable games. The results of the second and third models provide additional support for the idea that intuitive and deliberative mental processes have quite different properties. The "slow and effortful" process was more heavily taxed when the dominance-solvable games were harder. The "fast and effortless" process was more strongly activated when coordination was easy.
Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.[Supplemental material is available online at www.genome.org. Data sets and documentation are available in the CCDS database at http://www.ncbi.nlm.nih.gov/CCDS.]One key goal of genome projects is to identify and accurately annotate all protein-coding genes. The resulting annotations add functional context to the sequence data and make it easier to traverse to other rich sources of gene and protein information. Accurately annotating known genes, identifying novel genes, and tracking annotations over time are complex processes that are best achieved through a combination of large-scale computational analyses and expert curation. These methods must (1) process repetitive sequences in multiple categories including retrotransposons, segmental duplications, and paralogs; (2) process variation including copy number variation (CNV) (Feuk et al. 2006) and microsatellites; (3) distinguish functional genes and alleles from pseudogenes; (4) define alternate splice products; and (5) avoid erroneous interpretation based on experimental error.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.