The role of the human microbiome in health and disease is increasingly appreciated. We studied the composition of microbial communities present in blood across 192 individuals, including healthy controls and patients with three disorders affecting the brain: schizophrenia, amyotrophic lateral sclerosis, and bipolar disorder. By using high-quality unmapped RNA sequencing reads as candidate microbial reads, we performed profiling of microbial transcripts detected in whole blood. We were able to detect a wide range of bacterial and archaeal phyla in blood. Interestingly, we observed an increased microbial diversity in schizophrenia patients compared to the three other groups. We replicated this finding in an independent schizophrenia case–control cohort. This increased diversity is inversely correlated with estimated cell abundance of a subpopulation of CD8+ memory T cells in healthy controls, supporting a link between microbial products found in blood, immunity and schizophrenia.
Profiling immunoglobulin (Ig) receptor repertoires with specialized assays can be costineffective and time-consuming. Here we report ImReP, a computational method for rapid and accurate profiling of the Ig repertoire, including the complementary-determining region 3 (CDR3), using regular RNA sequencing data such as those from 8,555 samples across 53 tissues types from 544 individuals in the Genotype-Tissue Expression (GTEx v6) project. Using ImReP and GTEx v6 data, we generate a collection of 3.6 million Ig sequences, termed the atlas of immunoglobulin repertoires (TAIR), across a broad range of tissue types that often do not have reported Ig repertoires information. Moreover, the flow of Ig clonotypes and inter-tissue repertoire similarities across immune-related tissues are also evaluated. In summary, TAIR is one of the largest collections of CDR3 sequences and tissue types, and should serve as an important resource for studying immunological diseases.
High-throughput RNA-sequencing (RNA-seq) technologies provide an unprecedented opportunity to explore the individual transcriptome. Unmapped reads are a large and often overlooked output of standard RNA-seq analyses. Here, we present Read Origin Protocol (ROP), a tool for discovering the source of all reads originating from complex RNA molecules. We apply ROP to samples across 2630 individuals from 54 diverse human tissues. Our approach can account for 99.9% of 1 trillion reads of various read length. Additionally, we use ROP to investigate the functional mechanisms underlying connections between the immune system, microbiome, and disease. ROP is freely available at https://github.com/smangul1/rop/wiki.Electronic supplementary materialThe online version of this article (10.1186/s13059-018-1403-7) contains supplementary material, which is available to authorized users.
Alternative splicing (AS) is prevalent in cancer, generating an extensive but largely unexplored repertoire of novel immunotherapy targets. We describe I soform peptides from R NA splicing for I mmunotherapy target S creening (IRIS), a computational platform capable of discovering AS-derived tumor antigens (TAs) for T cell receptor (TCR) and chimeric antigen receptor T cell (CAR-T) therapies. IRIS leverages large-scale tumor and normal transcriptome data and incorporates multiple screening approaches to discover AS-derived TAs with tumor-associated or tumor-specific expression. In a proof-of-concept analysis integrating transcriptomics and immunopeptidomics data, we showed that hundreds of IRIS-predicted TCR targets are presented by human leukocyte antigen (HLA) molecules. We applied IRIS to RNA-seq data of neuroendocrine prostate cancer (NEPC). From 2,939 NEPC-associated AS events, IRIS predicted 1,651 epitopes from 808 events as potential TCR targets for two common HLA types (A*02:01 and A*03:01). A more stringent screening test prioritized 48 epitopes from 20 events with “neoantigen-like” NEPC-specific expression. Predicted epitopes are often encoded by microexons of ≤30 nucleotides. To validate the immunogenicity and T cell recognition of IRIS-predicted TCR epitopes, we performed in vitro T cell priming in combination with single-cell TCR sequencing. Seven TCRs transduced into human peripheral blood mononuclear cells (PBMCs) showed high activity against individual IRIS-predicted epitopes, providing strong evidence of isolated TCRs reactive to AS-derived peptides. One selected TCR showed efficient cytotoxicity against target cells expressing the target peptide. Our study illustrates the contribution of AS to the TA repertoire of cancer cells and demonstrates the utility of IRIS for discovering AS-derived TAs and expanding cancer immunotherapies.
Sequencing of RNA provides the possibility to study an individual's transcriptome landscape and determine allelic expression ratios. Single-molecule protocols generate multi-kilobase reads longer than most transcripts, allowing sequencing of complete haplotype isoforms. This allows partitioning the reads into two parental haplotypes. While the read length of the single-molecule protocols is long, the relatively high error rate limits the ability to accurately detect the genetic variants and assemble them into the haplotype-specific isoforms. In this paper, we present Haplotype-specific Isoform reconstruction (HapIso), a method able to tolerate the relatively high error rate of the single-molecule platform and partition the isoform reads into the parental alleles. Phasing the reads according to the allele of origin allows our method to efficiently distinguish between the read errors and the true biological mutations. HapIso uses a k -means clustering algorithm aiming to group the reads into two meaningful clusters maximizing the similarity of the reads within the cluster and minimizing the similarity of the reads from different clusters. Each cluster corresponds to a parental haplotype. We used the family pedigree information to evaluate our approach. Experimental validation suggests that HapIso is able to tolerate the relatively high error rate and accurately partition the reads into the parental alleles of the isoform transcripts. We also applied HapIso to novel clinical single-molecule RNA-Seq data to estimate allele-specific expression of genes of interest. Our method was able to correct reads and determine Glu1883Lys point mutation of clinical significance validated by GeneDx HCM panel. Furthermore, our method is the first method able to reconstruct the haplotype-specific isoforms from long single-molecule reads.
The MYC proto-oncogene contributes to the pathogenesis of more than half of human cancers. Malignant transformation by MYC transcriptionally up-regulates the core pre-mRNA splicing machinery and causes misregulation of alternative splicing. However, our understanding of how splicing changes are directed by MYC is limited. We performed a signaling pathway-guided splicing analysis to identify MYC-dependent splicing events. These included an HRAS cassette exon repressed by MYC across multiple tumor types. To molecularly dissect the regulation of this HRAS exon, we used antisense oligonucleotide tiling to identify splicing enhancers and silencers in its flanking introns. RNA-binding motif prediction indicated multiple binding sites for hnRNP H and hnRNP F within these cis-regulatory elements. Using siRNA knockdown and cDNA expression, we found that both hnRNP H and F activate the HRAS cassette exon. Mutagenesis and targeted RNA immunoprecipitation implicate two downstream G-rich elements in this splicing activation. Analyses of ENCODE RNA-seq datasets confirmed hnRNP H regulation of HRAS splicing. Analyses of RNA-seq datasets across multiple cancers showed a negative correlation of HNRNPH gene expression with MYC hallmark enrichment, consistent with the effect of hnRNP H on HRAS splicing. Interestingly, HNRNPF expression showed a positive correlation with MYC hallmarks and thus was not consistent with the observed effects of hnRNP F. Loss of hnRNP H/F altered cell cycle progression and induced apoptosis in the PC3 prostate cancer cell line. Collectively, our results reveal mechanisms for MYC-dependent regulation of splicing and point to possible therapeutic targets in prostate cancers.
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.