Serghei Mangul scite author profile

The Genotype-Tissue Expression (GTEx) project was established to characterize genetic effects on the transcriptome across human tissues and to link these regulatory mechanisms to trait and disease associations. Here, we present analyses of the version 8 data, examining 15,201 RNA-sequencing samples from 49 tissues of 838 postmortem donors. We comprehensively characterize genetic associations for gene expression and splicing in cis and trans, showing that regulatory associations are found for almost all genes, and describe the underlying molecular mechanisms and their contribution to allelic heterogeneity and pleiotropy of complex traits. Leveraging the large diversity of tissues, we provide insights into the tissue specificity of genetic effects and show that cell type composition is a key factor in understanding gene regulatory mechanisms in human tissues.

show abstract

Intratumoral CD4+ T Cells Mediate Anti-tumor Cytotoxicity in Human Bladder Cancer

Kwek

Raju

et al. 2020

Cell

472

411

View full text Add to dashboard Cite

Summary Responses to anti-PD-1 immunotherapy occur but are infrequent in bladder cancer. The specific T cells that mediate tumor rejection are unknown. T cells from human bladder tumors and non-malignant tissue were assessed with single-cell RNA and paired T cell receptor (TCR) sequencing of 30,604 T cells from 7 patients. We find that the states and repertoires of CD8 + T cells are not distinct in tumors compared with non-malignant tissues. In contrast, single-cell analysis of CD4 + T cells demonstrates several tumor-specific states, including multiple distinct states of regulatory T cells. Surprisingly, we also find multiple cytotoxic CD4 + T cell states that are clonally expanded. These CD4 + T cells can kill autologous tumors in an MHC class II-dependent fashion and are suppressed by regulatory T cells. Further, a gene signature of cytotoxic CD4 + T cells in tumors predicts a clinical response in 244 metastatic bladder cancer patients treated with anti-PD-L1.

show abstract

Critical Assessment of Metagenome Interpretation: the second round of challenges

et al. 2022

View full text Add to dashboard Cite

Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI). The CAMI II challenge engaged the community to assess methods on realistic and complex datasets with long- and short-read sequences, created computationally from around 1,700 new and known genomes, as well as 600 new plasmids and viruses. Here we analyze 5,002 results by 76 program versions. Substantial improvements were seen in assembly, some due to long-read data. Related strains still were challenging for assembly and genome recovery through binning, as was assembly quality for the latter. Profilers markedly matured, with taxon profilers and binners excelling at higher bacterial ranks, but underperforming for viruses and Archaea. Clinical pathogen detection results revealed a need to improve reproducibility. Runtime and memory usage analyses identified efficient programs, including top performers with other metrics. The results identify challenges and guide researchers in selecting methods for analyses.

show abstract

Co-expression networks reveal the tissue-specific regulation of transcription and splicing

et al. 2017

View full text Add to dashboard Cite

Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues.

show abstract

Cell type–specific genetic regulation of gene expression across human tissues

Kim-Hellmuth

Aguet

Oliva

et al. 2020

Science

227

127

View full text Add to dashboard Cite

The Genotype-Tissue Expression (GTEx) project has identified expression and splicing quantitative trait loci in cis (QTLs) for the majority of genes across a wide range of human tissues. However, the functional characterization of these QTLs has been limited by the heterogeneous cellular composition of GTEx tissue samples. We mapped interactions between computational estimates of cell type abundance and genotype to identify cell type–interaction QTLs for seven cell types and show that cell type–interaction expression QTLs (eQTLs) provide finer resolution to tissue specificity than bulk tissue cis-eQTLs. Analyses of genetic associations with 87 complex traits show a contribution from cell type–interaction QTLs and enables the discovery of hundreds of previously unidentified colocalized loci that are masked in bulk tissue.

show abstract

Systematic benchmarking of omics computational tools

et al. 2019

View full text Add to dashboard Cite

Computational omics methods packaged as software have become essential to modern biological research. The increasing dependence of scientists on these powerful software tools creates a need for systematic assessment of these methods, known as benchmarking. Adopting a standardized benchmarking practice could help researchers who use omics data to better leverage recent technological innovations. Our review summarizes benchmarking practices from 25 recent studies and discusses the challenges, advantages, and limitations of benchmarking across various domains of biology. We also propose principles that can make computational biology benchmarking studies more sustainable and reproducible, ultimately increasing the transparency of biomedical data and results.

show abstract

Cell type specific genetic regulation of gene expression across human tissues

Kim-Hellmuth

Aguet

Oliva

et al. 2019

Preprint

114

View full text Add to dashboard Cite

The Genotype-Tissue Expression (GTEx) project has identified expression and splicing quantitative trait loci (cis-QTLs) for the majority of genes across a wide range of human tissues. However, the interpretation of these QTLs has been limited by the heterogeneous cellular composition of GTEx tissue samples. Here, we map interactions between computational estimates of cell type abundance and genotype to identify cell type interaction QTLs for seven cell types and show that cell type interaction eQTLs provide finer resolution to tissue specificity than bulk tissuecis-eQTLs. Analyses of genetic associations to 87 complex traits show a contribution from cell type interaction QTLs and enables the discovery of hundreds of previously unidentified colocalized loci that are masked in bulk tissue.One Sentence SummaryEstimated cell type abundances from bulk RNA-seq across tissues reveal the cellular specificity of quantitative trait loci.

show abstract

Estimation of alternative splicing isoform frequencies from RNA-Seq data

Nicolae

Mangul

Măndoiu

et al. 2011

Algorithms Mol Biol

149

View full text Add to dashboard Cite

BackgroundMassively parallel whole transcriptome sequencing, commonly referred as RNA-Seq, is quickly becoming the technology of choice for gene expression profiling. However, due to the short read length delivered by current sequencing technologies, estimation of expression levels for alternative splicing gene isoforms remains challenging.ResultsIn this paper we present a novel expectation-maximization algorithm for inference of isoform- and gene-specific expression levels from RNA-Seq data. Our algorithm, referred to as IsoEM, is based on disambiguating information provided by the distribution of insert sizes generated during sequencing library preparation, and takes advantage of base quality scores, strand and read pairing information when available. The open source Java implementation of IsoEM is freely available at http://dna.engr.uconn.edu/software/IsoEM/.ConclusionsEmpirical experiments on both synthetic and real RNA-Seq datasets show that IsoEM has scalable running time and outperforms existing methods of isoform and gene expression level estimation. Simulation experiments confirm previous findings that, for a fixed sequencing cost, using reads longer than 25-36 bases does not necessarily lead to better accuracy for estimating expression levels of annotated isoforms and genes.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Serghei Mangul

The GTEx Consortium atlas of genetic regulatory effects across human tissues

Intratumoral CD4+ T Cells Mediate Anti-tumor Cytotoxicity in Human Bladder Cancer

Critical Assessment of Metagenome Interpretation: the second round of challenges

Co-expression networks reveal the tissue-specific regulation of transcription and splicing

Cell type–specific genetic regulation of gene expression across human tissues

Systematic benchmarking of omics computational tools

Cell type specific genetic regulation of gene expression across human tissues

Estimation of alternative splicing isoform frequencies from RNA-Seq data

Contact Info

Product

Resources

About