Background The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms. Results We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis. Conclusions Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.
Endothelial cells (ECs) comprise the lumenal lining of all blood vessels and are critical for the functioning of the cardiovascular system. Their phenotypes can be modulated by alternative splicing of RNA to produce distinct protein isoforms. To characterize the RNA and protein isoform landscape within ECs, we applied a long read proteogenomics approach to analyse human umbilical vein endothelial cells (HUVECs). Transcripts delineated from PacBio sequencing serve as the basis for a sample-specific protein database used for downstream mass-spectrometry (MS) analysis to infer protein isoform expression. We detected 53,863 transcript isoforms from 10,426 genes, with 22,195 of those transcripts being novel. Furthermore, the predominant isoform in HUVECs does not correspond with the accepted “reference isoform” 25% of the time, with vascular pathway-related genes among this group. We found 2,597 protein isoforms supported through unique peptides, with an additional 2,280 isoforms nominated upon incorporation of long-read transcript evidence. We characterized a novel alternative acceptor for endothelial-related gene CDH5, suggesting potential changes in its associated signalling pathways. Finally, we identified novel protein isoforms arising from a diversity of RNA splicing mechanisms supported by uniquely mapped novel peptides. Our results represent a high-resolution atlas of known and novel isoforms of potential relevance to endothelial phenotypes and function.
We have developed methods to achieve efficient CRISPR-Cas9–mediated gene knockout in ex vivo mouse embryonic salivary epithelial explants. Salivary epithelial explants provide a valuable model for characterizing cell signaling, differentiation, and epithelial morphogenesis, but research has been limited by a paucity of efficient gene perturbation methods. Here, we demonstrate highly efficient gene perturbation by transient transduction of guide RNA–expressing lentiviruses into Cas9-expressing salivary epithelial buds isolated from Cas9 transgenic mice. We first show that salivary epithelial explants can be cultured in low-concentration, nonsolidified Matrigel suspensions in 96-well plates, which greatly increases sample throughput compared to conventional cultures embedded in solidified Matrigel. We further show that salivary epithelial explants can grow and branch with FGF7 alone, while supplementing with insulin, transferrin, and selenium (ITS) enhances growth and branching. We then describe an efficient workflow to produce experiment-ready, high-titer lentiviruses within 1 wk after molecular cloning. To track transduced cells, we designed the lentiviral vector to coexpress a nuclear fluorescent reporter with the guide RNA. We routinely achieved 80% transduction efficiency when antibiotic selection was used. Importantly, we detected robust loss of targeted protein products when testing 9 guide RNAs for 3 different genes. Moreover, targeting the β1 integrin gene ( Itgb1) inhibited branching morphogenesis, which supports the importance of cell–matrix adhesion in driving branching morphogenesis. In summary, we have established a lentivirus-based method that can efficiently perturb genes of interest in salivary epithelial explants, which will greatly facilitate studies of specific gene functions using this system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.