Isoform discovery by targeted cloning, 'deep-well' pooling and parallel sequencing

Salehi‐Ashtiani, Kourosh; Yang, Xinping; Derti, Adnan; Tian, Weidong; Hao, Tong; Lin, Chenwei; Makowski, Kathryn A.; Shen, Lei; Murray, Ryan R.; Szeto, David; Tusneem, Nadeem; Smith, Douglas R.; Cusick, Michael E.; Hill, David E.; Roth, Frederick P.; Vidal, Marc

doi:10.1038/nmeth.1224

Cited by 27 publications

(26 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Full-length cDNA sequences have complete sequences of transcripts including coding regions (CDSs) or all exons and untranslated regions (UTRs), and thus facilitate subsequent studies on genomic structure and functional analysis [25].It is a time consuming and laborious process using the traditional Sanger sequencing method to obtain large collections of full-length cDNAs. Fortunately, the NGS method used in this study and other studies has shown to be an efficient approach to obtain a large number of full-length cDNAs [26]–[27]. In addition, over 7 000 unigenes with 5′ or 3′ UTR are also useful for obtaining full-length cDNA sequences via the primer walking method.…”

Section: Resultsmentioning

confidence: 86%

Transcriptome Analysis of Crucian Carp (Carassius auratus), an Important Aquaculture and Hypoxia-Tolerant Species

Liao

Cheng

et al. 2013

PLoS ONE

View full text Add to dashboard Cite

The crucian carp is an important aquaculture species and a potential model to study genome evolution and physiological adaptation. However, so far the genomics and transcriptomics data available for this species are still scarce. We performed de novo transcriptome sequencing of four cDNA libraries representing brain, muscle, liver and kidney tissues respectively, each with six specimens. The removal of low quality reads resulted in 2.62 million raw reads, which were assembled as 127,711 unigenes, including 84,867 isotigs and 42,844 singletons. A total of 22,273 unigenes were found with significant matches to 14,449 unique proteins. Around14,398 unigenes were assigned with at least one Gene Ontology (GO) category in 84,876 total assignments, and 6,382 unigenes were found in 237 predicted KEGG pathways. The gene expression analysis revealed more genes expressed in brain, more up-regulated genes in muscle and more down-regulated genes in liver as compared with gene expression profiles of other tissues. In addition, 23 enzymes in the glycolysis/gluconeogenesis pathway were recovered. Importantly, we identified 5,784 high-quality putative SNP and 11,295 microsatellite markers which include 5,364 microsatellites with flanking sequences ≥50 bp. This study produced the most comprehensive genomic resources that have been derived from crucian carp, including thousands of genetic markers, which will not only lay a foundation for further studies on polyploidy origin and anoxic survival but will also facilitate selective breeding of this important aquaculture species.

show abstract

Section: Resultsmentioning

confidence: 86%

Transcriptome Analysis of Crucian Carp (Carassius auratus), an Important Aquaculture and Hypoxia-Tolerant Species

Liao

Cheng

et al. 2013

PLoS ONE

View full text Add to dashboard Cite

show abstract

“…First, full-length ORFs corresponding to known and novel isoforms are amplified by reverse transcription followed by PCR (RT-PCR) using gene-specific primers. Pools of resulting RT-PCR products are Gateway-cloned (Walhout et al, 2000) and individual ORFs are sequenced using an NGS-based deep-well approach (Salehi-Ashtiani et al, 2008). Second, Gateway-cloned full-length isoform ORFs are transferred into various expression vectors to allow systematic functional analyses such as binary protein-protein and protein-DNA interaction assays or measurement of enzymatic activities.…”

Section: Resultsmentioning

confidence: 99%

Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing

Yang

Coulombe-Huntington

Kang

et al. 2016

Cell

Self Cite

480

458

View full text Add to dashboard Cite

SUMMARY While alternative splicing is known to diversify the functional characteristics of some genes, the extent to which protein isoforms globally contribute to functional complexity on a proteomic scale remains unknown. To address this systematically, we cloned full-length open reading frames of alternatively spliced transcripts for a large number of human genes and used protein-protein interaction profiling to functionally compare hundreds of protein isoform pairs. The majority of isoform pairs share less than 50% of their interactions. In the global context of interactome network maps, alternative isoforms tend to behave like distinct proteins rather than minor variants of each other. Interaction partners specific to alternative isoforms tend to be expressed in a highly tissue-specific manner and belong to distinct functional modules. Our strategy, applicable to other functional characteristics, reveals a widespread expansion of protein interaction capabilities through alternative splicing and suggests that many alternative ‘isoforms’ are functionally divergent (i.e., ‘functional alloforms’).

show abstract

“…Recent studies (3)(4)(5)(6)(7) have shown that, by sequencing the mRNA content of cells, one can quantify the expression levels of known genes (by counting how often sequences from a given gene are observed) and refine their boundaries. For example, Nagalakshmi et al (3) studied the Saccharomyces cerevisiae transcriptome by mapping reads to the location of known genes to quantify expression, and to known splice sites to measure their occurrence.…”

mentioning

confidence: 99%

Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing

Yassour

Kaplan

Fraser

et al. 2009

Proc. Natl. Acad. Sci. U.S.A.

206

222

View full text Add to dashboard Cite

Defining the transcriptome, the repertoire of transcribed regions encoded in the genome, is a challenging experimental task. Current approaches, relying on sequencing of ESTs or cDNA libraries, are expensive and labor-intensive. Here, we present a general approach for ab initio discovery of the complete transcriptome of the budding yeast, based only on the unannotated genome sequence and millions of short reads from a single massively parallel sequencing run. Using novel algorithms, we automatically construct a highly accurate transcript catalog. Our approach automatically and fully defines 86% of the genes expressed under the given conditions, and discovers 160 previously undescribed transcription units of 250 bp or longer. It correctly demarcates the 5 and 3 UTR boundaries of 86 and 77% of expressed genes, respectively. The method further identifies 83% of known splice junctions in expressed genes, and discovers 25 previously uncharacterized introns, including 2 cases of condition-dependent intron retention. Our framework is applicable to poorly understood organisms, and can lead to greater understanding of the transcribed elements in an explored genome.computational biology ͉ RNAseq ͉ next generation sequencing ͉ transcriptome profiling ͉ Saccharomyces cerevisiae E xperimentally defining the complete transcriptome of eukaryotic organisms has traditionally been a challenging task, involving large, costly, and slow experimental efforts for sequencing of ESTs and full-length cDNA libraries. Unlike the genome, RNA transcripts are not present at equimolar concentrations, and are typically expressed in a context-specific manner. Thus, despite the fact that the genomes of Ͼ1,000 species have been sequenced, only few transcriptomes have been extensively characterized.Recent advances in massively parallel sequencing technology (1, 2) offer new and powerful approaches to the study of transcriptomes. Recent studies (3-7) have shown that, by sequencing the mRNA content of cells, one can quantify the expression levels of known genes (by counting how often sequences from a given gene are observed) and refine their boundaries. For example, Nagalakshmi et al. (3) studied the Saccharomyces cerevisiae transcriptome by mapping reads to the location of known genes to quantify expression, and to known splice sites to measure their occurrence. Similarly, Mortazavi et al. (5) studied the mouse transcriptome by mapping reads to known exons and known splice junctions, as well as to ''putative'' junctions between known exons. Thus, in both cases (and in additional studies, see refs. 4-7) the analysis critically depended on existing annotation.A more challenging problem is to define a transcriptome ab initio, based only on the unannotated genome sequence and millions of short reads from cDNA samples. Rapid and efficient methods to do so would transform our ability to define transcripts and study transcription in any genome. This ability would be particularly important in a new genome project involving phylogenetically isolated species ...

show abstract

Isoform discovery by targeted cloning, 'deep-well' pooling and parallel sequencing

Cited by 27 publications

References 18 publications

Transcriptome Analysis of Crucian Carp (Carassius auratus), an Important Aquaculture and Hypoxia-Tolerant Species

Transcriptome Analysis of Crucian Carp (Carassius auratus), an Important Aquaculture and Hypoxia-Tolerant Species

Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing

Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing

Contact Info

Product

Resources

About