By analyzing 1,780,295 5Ј-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by more than 500 bp and thus are very likely to constitute mutually distinct alternative promoters. To our surprise, at least 7674 (52%) human RefSeq genes were subject to regulation by putative alternative promoters (PAPs). On average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue-specific use of the PAPs was observed. The richest tissue sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. On average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, suggesting that a wide variety of transcripts can be achieved by this mechanism. Our findings suggest that use of alternate promoters and consequent alternative use of first exons should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.[Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to DDBJ under accession nos. DA000001-DA999999, DB000001-DB294747, DB294748-DB384947, BP192706-BP383670, AU279383-AU280837, and AU116788-U160826.]One of the most striking findings revealed by the Human Genome Project is that the human genome contains only 20,000-25,000 kinds of protein-coding genes (International Human Genome Sequencing Consortium 2004). This number is unexpectedly small compared with the total gene numbers in yeast, fly, and worm genomes, which are estimated to be 6,000, 14,000, and 19,000, respectively (Goffeau et al. 1996;C. elegans Sequencing Consortium 1998;Adams et al. 2000). It is supposed that there must be other factors in addition to mere gene numbers to satisfy the prerequisites that enable the human genome to fabricate such highly elaborated systems as the brain and immune systems. To explain this, it has been hypothesized that multifaceted use of the genes should play a pivotal role in functional
Experiences during brain development may influence the pathogenesis of developmental disorders. Thus, social isolation (SI) rearing after weaning is a useful animal model for studying the pathological mechanisms of such psychiatric diseases. In this study, we examined the effect of SI on neurogenesis in the hippocampal dentate gyrus (DG) relating to memory and emotion-related behaviors. When newly divided cells were labeled with 5-bromo-2'-deoxyuridine (BrdU) before SI, the number of BrdU-positive cells and the rate of differentiation into neurons were significantly decreased after 4-week SI compared with those in group-housed mice. Repeated treatment of fluoxetine prevented the SI-induced impairment of survival of newly divided cells and ameliorated spatial memory impairment and part of aggression in SI mice. Furthermore, we investigated the changes in gene expression in the DG of SI mice by using DNA microarray and real-time PCR. We finally found that SI reduced the expression of development-related genes Nurr1 and Npas4. These findings suggest that communication in juvenile is important in the survival and differentiation of newly divided cells, which may be associated with memory and aggression, and raise the possibility that the reduced expression of Nurr1 and/or Npas4 may contribute to the impairment of neurogenesis and memory and aggression induced by SI.
In this report, we first cloned a cDNA for a protein that is highly expressed in mouse kidney and then isolated its counterparts in human, rat hamster, and guinea pig by polymerase chain reaction-based cloning. The cDNAs of the five species encoded polypeptides of 244 amino acids, which shared more than 85% identity with each other and showed high identity with a human sperm 34-kDa protein, P34H, as well as a murine lungspecific carbonyl reductase of the short-chain dehydrogenase/reductase superfamily. In particular, the human protein is identical to P34H, except for one amino acid substitution. The purified recombinant proteins of the five species were about 100-kDa homotetramers with NADPH-linked reductase activity for ␣-dicarbonyl compounds, catalyzed the oxidoreduction between xylitol and L-xylulose, and were inhibited competitively by nbutyric acid. Therefore, the proteins are designated as dicarbonyl/L-xylulose reductases (DCXRs). The substrate specificity and kinetic constants of DCXRs for dicarbonyl compounds and sugars are similar to those of mammalian diacetyl reductase and L-xylulose reductase, respectively, and the identity of the DCXRs with these two enzymes was demonstrated by their co-purification from hamster and guinea pig livers and by protein sequencing of the hepatic enzymes. Both DCXR and its mRNA are highly expressed in kidney and liver of human and rodent tissues, and the protein was localized primarily to the inner membranes of the proximal renal tubules in murine kidneys. The results imply that P34H and diacetyl reductase (EC 1.1.1.5) are identical to Lxylulose reductase (EC 1.1.1.10), which is involved in the uronate cycle of glucose metabolism, and the unique localization of the enzyme in kidney suggests that it has a role other than in general carbohydrate metabolism.
DBTSS is a database of transcriptional start sites, based on our unique collection of precise, experimentally determined 5′-end sequences of full-length cDNAs. Since its first release in 2002, several major updates have been made. In this update, we expanded the human transcriptional start site dataset by 19 million uniquely mapped, and RefSeq-associated, 5′-end sequences, which were generated by a newly introduced Solexa sequencer. Moreover, in order to provide means for interpreting those massive TSS data, we implemented two new analytical tools: one for connecting expression information with predicted transcription factor binding sites; the other for examining evolutionary conservation or species-specificity of promoters and transcripts, which can be browsed by our own comparative genome viewer. With the expanded dataset and the enhanced functionalities, DBTSS provides a unique platform that enables in-depth transcriptome analyses. DBTSS is accessible at http://dbtss.hgc.jp/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.