The polycomb group gene Bmi1 is required for maintenance of adult stem cells in many organs1, 2. Inactivation of Bmi1 leads to impaired stem cell self-renewal due to deregulated gene expression. One critical target of BMI1 is Ink4a/Arf, which encodes the cell cycle inhibitors p16ink4a and p19Arf3. However, deletion of Ink4a/Arf only partially rescues Bmi1 null phenotypes4, indicating that other important targets of BMI1 exist. Here, using the continuously-growing mouse incisor as a model system, we report that Bmi1 is expressed by incisor stem cells and that deletion of Bmi1 resulted in fewer stem cells, perturbed gene expression, and defective enamel production. Transcriptional profiling revealed that Hox expression is normally repressed by BMI1 in the adult, and functional assays demonstrated that BMI1-mediated repression of Hox genes preserves the undifferentiated state of stem cells. As Hox gene upregulation has also been reported in other systems when Bmi1 is inactivated1, 2, 5–7, our findings point to a general mechanism whereby BMI1-mediated repression of Hox genes is required for the maintenance of adult stem cells and for prevention of inappropriate differentiation.
SUMMARY Overlapping genes pose an evolutionary dilemma as one DNA sequence evolves under the selection pressures of multiple proteins. Here, we perform systematic statistical and mutational analyses of the overlapping HIV-1 genes tat and rev and engineer exhaustive libraries of non-overlapped viruses to perform deep mutational scanning of each gene independently. We find a “segregated” organization in which overlapped sites encode functional residues of one gene or the other, but never both. Furthermore, this organization eliminates unfit genotypes, providing a fitness advantage to the population. Our comprehensive analysis reveals the extraordinary manner in which HIV minimizes the constraint of overlapping genes and repurposes that constraint to its own advantage. Thus, overlaps are not just consequences of evolutionary constraints, but rather can provide population fitness advantages.
The intracellular pH (pHi) of most cancers is constitutively higher than that of normal cells and enhances proliferation and cell survival. We found that increased pHi enabled the tumorigenic behaviors caused by somatic arginine-to-histidine mutations, which are frequent in cancer and confer pH sensing not seen with wild-type proteins. Experimentally raising the pHi increased the activity of R776H mutant epidermal growth factor receptor (EGFR-R776H), thereby increasing proliferation and causing transformation in fibroblasts. An Arg-to-Gly mutation did not confer these effects. Molecular dynamics simulations of EGFR suggested that decreased protonation of His776 at high pH causes conformational changes in the αC helix that may stabilize the active form of the kinase. An Arg-to-His, but not Arg-to-Lys, mutation in the transcription factor p53 (p53-R273H) decreased its transcriptional activity and attenuated the DNA damage response in fibroblasts and breast cancer cells with high pHi. Lowering pHi attenuated the tumorigenic effects of both EGFR-R776H and p53-R273H. Our data suggest that some somatic mutations may confer a fitness advantage to the higher pHi of cancer cells.
Profiling immunoglobulin (Ig) receptor repertoires with specialized assays can be costineffective and time-consuming. Here we report ImReP, a computational method for rapid and accurate profiling of the Ig repertoire, including the complementary-determining region 3 (CDR3), using regular RNA sequencing data such as those from 8,555 samples across 53 tissues types from 544 individuals in the Genotype-Tissue Expression (GTEx v6) project. Using ImReP and GTEx v6 data, we generate a collection of 3.6 million Ig sequences, termed the atlas of immunoglobulin repertoires (TAIR), across a broad range of tissue types that often do not have reported Ig repertoires information. Moreover, the flow of Ig clonotypes and inter-tissue repertoire similarities across immune-related tissues are also evaluated. In summary, TAIR is one of the largest collections of CDR3 sequences and tissue types, and should serve as an important resource for studying immunological diseases.
High-throughput RNA-sequencing (RNA-seq) technologies provide an unprecedented opportunity to explore the individual transcriptome. Unmapped reads are a large and often overlooked output of standard RNA-seq analyses. Here, we present Read Origin Protocol (ROP), a tool for discovering the source of all reads originating from complex RNA molecules. We apply ROP to samples across 2630 individuals from 54 diverse human tissues. Our approach can account for 99.9% of 1 trillion reads of various read length. Additionally, we use ROP to investigate the functional mechanisms underlying connections between the immune system, microbiome, and disease. ROP is freely available at https://github.com/smangul1/rop/wiki.Electronic supplementary materialThe online version of this article (10.1186/s13059-018-1403-7) contains supplementary material, which is available to authorized users.
BackgroundVaccines dramatically affect an individual’s adaptive immune system and thus provide an excellent means to study human immunity. Upon vaccination, the B cells that express antibodies (Abs) that happen to bind the vaccine are stimulated to proliferate and undergo mutagenesis at their Ab locus. This process may alter the composition of B cell lineages within an individual, which are known collectively as the antibody repertoire (AbR). Antibodies are also highly expressed in whole blood, potentially enabling RNA sequencing (RNA-seq) technologies to query this diversity. Less is known about the diversity of AbR responses across individuals to a given vaccine and if individuals tend to yield a similar response to the same antigenic stimulus.MethodsHere we implement a bioinformatic pipeline that extracts the AbR information from a time-series RNA-seq dataset of five patients who were administered a seasonal trivalent influenza vaccine (TIV). We harness the detailed time-series nature of this dataset and use methods based in functional data analysis (FDA) to identify the Abs that respond to the vaccine. We then design and implement rigorous statistical tests in order to ask whether or not these patients exhibit a convergent AbR response to the same TIV.ResultsWe find that high-resolution time-series data can be used to help identify the Abs that respond to an antigenic stimulus and that this response can exhibit a convergent nature across patients inoculated with the same vaccine. However, correlations in AbR diversity among individuals prior to inoculation can confound inference of a convergent signal unless it is taken into account.ConclusionsWe developed a framework to identify the elements of an AbR that respond to an antigen. This information could be used to understand the diversity of different immune responses in different individuals, as well as to gauge the effectiveness of the immune response to a given stimulus within an individual. We also present a framework for testing a convergent hypothesis between AbRs; a hypothesis that is more difficult to test than previously appreciated. Our discovery of a convergent signal suggests that similar epitopes do select for antibodies with similar sequence characteristics.Electronic supplementary materialThe online version of this article (doi:10.1186/s13073-016-0314-z) contains supplementary material, which is available to authorized users.
Cancer can be viewed as a set of different diseases with distinctions based on tissue origin, driver mutations, and genetic signatures. Accordingly, each of these distinctions have been used to classify cancer subtypes and to reveal common features. Here, we present a different analysis of cancer based on amino acid mutation signatures. Non-negative Matrix Factorization and principal component analysis of 29 cancers revealed six amino acid mutation signatures, including four signatures that were dominated by either arginine to histidine (Arg>His) or glutamate to lysine (Glu>Lys) mutations. Sample-level analyses reveal that while some cancers are heterogeneous, others are largely dominated by one type of mutation. Using a non-overlapping set of samples from the COSMIC somatic mutation database, we validate five of six mutation signatures, including signatures with prominent arginine to histidine (Arg>His) or glutamate to lysine (Glu>Lys) mutations. This suggests that our classification of cancers based on amino acid mutation patterns may provide avenues of inquiry pertaining to specific protein mutations that may generate novel insights into cancer biology.
Assay-based approaches provide a detailed view of the adaptive immune system by profiling immunoglobulin (Ig) receptor repertoires. However, these methods carry a high cost and lack the scale of standard RNA sequencing (RNA-Seq). Here we report the development of ImReP, a novel computational method for rapid and accurate profiling of the immunoglobulin repertoire from regular RNA-Seq data. ImReP can also accurately assemble the complementary determining regions 3 (CDR3s), the most variable regions of Ig receptors. We applied our novel method to 8,555 samples across 53 tissues from 544 individuals in the Genotype-Tissue Expression (GTEx v6) project. ImReP is able to efficiently extract Ig-derived reads from RNA-Seq data. Using ImReP, we have created a systematic atlas of 3.6 million Ig sequences across a broad range of tissue types, most of which have not been studied for Ig receptor repertoires.We also compared the GTEx tissues to track the flow of Ig clonotypes across immune-related tissues, including secondary lymphoid organs and organs encompassing mucosal, exocrine, and endocrine sites, and we examined the compositional similarities of clonal populations between these tissues. The Atlas of Immunoglobulin Repertoires (The AIR), is freely available at https://github.com/smangul1/TheAIR/wiki , is one of the largest collection of CDR3 sequences and tissue types. We anticipate this recourse will enhance future immunology studies and advance the development of therapies for human diseases. ImReP is freely available at https://github.com/mandricigor/imrep/wiki Results Related workA number of tools have been developed to reconstruct the Ig receptor repertoire.Repertoire analysis from RNA-Seq data typically starts with mapping the reads to the germline V, D, and J genes obtained from the International ImMunoGeneTics (IMGT) database 12 . There are three possible read mapping scenarios: (1) the read is entirely mapped to the V gene; (2) the read is entirely mapped to the J gene; (3) the read is partially mapped to the V and J genes simultaneously. Existing methods consider only reads from category (3). These methods use different underlying algorithms to map reads to germline genes. For example, MiXCR 8 relies on an in-house alignment procedure, IgBlast 13 utilizes BLAST with an optimized set of parameters, and IMSEQ 14 uses in-house pairwise alignment between the read sequence and the germline V and J segment sequences.Following the alignment, MiXCR performs overlapping of previously aligned reads into contigs. The resulting contigs are re-aligned to the V, D, and J genes to verify that the significant portion of non-template N insertions is covered. In contrast to MiXCR, which simultaneously aligns reads to both V, D, and J genes, IgBlast separately aligns the query read to the databases comprised of V, D, and J genes. IgBlast uses a specific sequence to separately align; first, the program finds the best V gene hit. Then, IgBlast masks the aligned read region and performs an alignment to the J gene database. (In the ...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.