2021
DOI: 10.1101/2021.05.21.445138
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

recount3: summaries and queries for large-scale RNA-seq expression and splicing

Abstract: We present recount3, a resource consisting of over 750,000 publicly available human and mouse RNA sequencing (RNA-seq) samples uniformly processed by our new Monorail analysis pipeline. To facilitate access to the data, we provide the recount3 and snapcount R/Bioconductor packages as well as complementary web resources. Using these tools, data can be downloaded as study-level summaries or queried for specific exon-exon junctions, genes, samples, or other features. Monorail can be used to process local and/or p… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
63
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 43 publications
(66 citation statements)
references
References 60 publications
0
63
0
Order By: Relevance
“…Alzheimer's disease, for example, was signi cantly associated with LV21 (FDR < 1e-18) and with LV5 (FDR < 0.01) (Supplementary Tables 31 and 33). LV21 was strongly expressed in a variety of soft tissue sarcomas, monocytes/macrophages (including microglia from cortex samples), and aortic valves (Supplementary Figure 30); as discussed previously, macrophages play a key role in the reverse cholesterol transport and thus atherogenesis [70]. LV5 was expressed in breast cancer and brain glioma samples, microglia (cortex), liver, and kidney, among other cell types (Supplementary Figure 31).…”
Section: Projections Reveal Trait Clusters With Shared Transcriptomic Propertiesmentioning
confidence: 61%
See 1 more Smart Citation
“…Alzheimer's disease, for example, was signi cantly associated with LV21 (FDR < 1e-18) and with LV5 (FDR < 0.01) (Supplementary Tables 31 and 33). LV21 was strongly expressed in a variety of soft tissue sarcomas, monocytes/macrophages (including microglia from cortex samples), and aortic valves (Supplementary Figure 30); as discussed previously, macrophages play a key role in the reverse cholesterol transport and thus atherogenesis [70]. LV5 was expressed in breast cancer and brain glioma samples, microglia (cortex), liver, and kidney, among other cell types (Supplementary Figure 31).…”
Section: Projections Reveal Trait Clusters With Shared Transcriptomic Propertiesmentioning
confidence: 61%
“…Also, the underlying factorization method rests on linear combinations of variables, which could miss important and more complex co-expression patterns. In addition, recount2, the training dataset used, has since been surpassed in size and scale by other resources [15,83]. The second approach we used in this study is TWAS, where we are only considering the hypothesis that GWAS loci affect traits via changes in gene expression, and other effects such as coding variants disrupting protein-protein interactions are not captured.…”
Section: Discussionmentioning
confidence: 99%
“…However, rapid adoption of full-length RNA sequencing (RNA-Seq) over the past decade has led to the public archival of datasets obtained from various cell types across multiple species. Furthermore, recent computational methods have been developed to comprehensively analyze patterns of alternative splicing across hundreds of thousands of publicly archived RNA-Seq datasets [27][28][29] . We have used these databases to identify many cell type-specific alternative exons that are suitable for use in AAV vectors.…”
Section: Introductionmentioning
confidence: 99%
“…Efforts have been made to simplify the access to public RNA-seq data by creating unified resources and databases. For example, the latest iteration of the recount database (recount3) uniformly processed >750 000 RNA-seq samples in humans and mice, enabling secondary analyses of RNA-seq datasets across different studies ( 5 ). RNA-seq data in the GTEx (Genotype-Tissue Expression) and TCGA (The Cancer Genome Atlas) have also been uniformly processed to provide normalized gene expression data ( 6 ).…”
Section: Introductionmentioning
confidence: 99%