We collected and completely sequenced 28,469 full-length complementary DNA clones from Oryza sativa L. ssp. japonica cv. Nipponbare. Through homology searches of publicly available sequence data, we assigned tentative protein functions to 21,596 clones (75.86%). Mapping of the cDNA clones to genomic DNA revealed that there are 19,000 to 20,500 transcription units in the rice genome. Protein informatics analysis against the InterPro database revealed the existence of proteins presented in rice but not in Arabidopsis. Sixty-four percent of our cDNAs are homologous to Arabidopsis proteins.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
We have developed a novel assay system for systematic analysis of protein-protein interactions (PPIs) that is characteristic of a PCR-mediated rapid sample preparation and a high-throughput assay system based on the mammalian two-hybrid method. Using gene-specific primers, we successfully constructed the assay samples by two rounds of PCR with up to 3.6 kb from the first-round PCR fragments. In the assay system, we designed all the steps to be performed by adding only samples, reagents, and cells into 384-well assay plates using two types of semiautomatic multiple dispensers. The system enabled us examine more than 20,000 assay wells per day. We detected 145 interactions in our pilot study using 3500 samples derived from mouse full-length enriched cDNAs. Analysis of the interaction data showed both several significant interaction clusters and predicted functions of a few uncharacterized proteins. In combination with our comprehensive mouse full-length cDNA clone bank covering a large part of the whole genes, our high-throughput assay system will discover many interactions to facilitate understanding of the function of uncharacterized proteins and the molecular mechanism of crucial biological processes, and also enable completion of a rough draft of the entire PPI panel in certain cell types or tissues of mouse within a short time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.