Determining protein levels in each tissue and how they compare with RNA levels is important for understanding human biology and disease as well as regulatory processes that control protein levels. We quantified the relative protein levels from over 12,000 genes across 32 normal human tissues. Tissue-specific or tissue-enriched proteins were identified and compared to transcriptome data. Many ubiquitous transcripts are found to encode tissue-specific proteins. Discordance of RNA and protein enrichment revealed potential sites of synthesis and action of secreted proteins. The tissue-specific distribution of proteins also provides an indepth view of complex biological events that require the interplay of multiple tissues. Most importantly, our study demonstrated that protein tissue-enrichment information can explain phenotypes of genetic diseases, which cannot be obtained by transcript information alone. Overall, our results demonstrate how understanding protein levels can provide insights into regulation, secretome, metabolism, and human diseases.
Large-scale multi-ethnic cohorts offer unprecedented opportunities to elucidate the genetic factors influencing complex traits related to health and disease among minority populations. At the same time, the genetic diversity in these cohorts presents new challenges for analysis and interpretation. We consider the utility of race and/or ethnicity categories in genome-wide association studies (GWASs) of multi-ethnic cohorts. We demonstrate that race/ethnicity information enhances the ability to understand population-specific genetic architecture. To address the practical issue that self-identified racial/ethnic information may be incomplete, we propose a machine learning algorithm that produces a surrogate variable, termed HARE. We use height as a model trait to demonstrate the utility of HARE and ethnicity-specific GWASs.
Understanding of the RNA editing process has been broadened considerably by the next generation sequencing technology; however, several issues regarding this regulatory step remain unresolved – the strategies to accurately delineate the editome, the mechanism by which its profile is maintained, and its evolutionary and functional relevance. Here we report an accurate and quantitative profile of the RNA editome for rhesus macaque, a close relative of human. By combining genome and transcriptome sequencing of multiple tissues from the same animal, we identified 31,250 editing sites, of which 99.8% are A-to-G transitions. We verified 96.6% of editing sites in coding regions and 97.5% of randomly selected sites in non-coding regions, as well as the corresponding levels of editing by multiple independent means, demonstrating the feasibility of our experimental paradigm. Several lines of evidence supported the notion that the adenosine deamination is associated with the macaque editome – A-to-G editing sites were flanked by sequences with the attributes of ADAR substrates, and both the sequence context and the expression profile of ADARs are relevant factors in determining the quantitative variance of RNA editing across different sites and tissue types. In support of the functional relevance of some of these editing sites, substitution valley of decreased divergence was detected around the editing site, suggesting the evolutionary constraint in maintaining some of these editing substrates with their double-stranded structure. These findings thus complement the “continuous probing” model that postulates tinkering-based origination of a small proportion of functional editing sites. In conclusion, the macaque editome reported here highlights RNA editing as a widespread functional regulation in primate evolution, and provides an informative framework for further understanding RNA editing in human.
Supplementary data are available at Bioinformatics online.
Coronary artery disease (CAD) is a leading cause of death, yet its genetic determinants are not fully elucidated. We report a multi-ethnic genome-wide association study of CAD involving nearly a quarter of a million cases, incorporating the largest cohorts to date of Whites, Blacks, and Hispanics from the Million Veteran Program with existing studies including CARDIoGRAMplusC4D, UK Biobank, and Biobank Japan. We verify substantial and nearly equivalent heritability of CAD across multiple ancestral groups, discover 107 novel loci including the first nine on the X-chromosome, identify the first eight genome-wide significant loci among Blacks and Hispanics, and demonstrate that two common haplotypes are largely responsible for the risk stratification at the well-known 9p21 locus in most populations except those of African origin where both haplotypes are virtually absent. We identify 15 loci for angiographically derived burden of coronary atherosclerosis, which robustly overlap with the strongest and earliest loci reported to date for clinical CAD. Phenome-wide association analyses of novel loci and externally validated polygenic risk scores (PRS) augment signals from the insulin resistance cluster of risk factors and consequences, extend previously established pleiotropic associations of loci with traditional risk factors to include smoking and family history, and confirm a substantially reduced transferability of existing PRS to Blacks. Downstream integrative genomic analyses reinforce the critical role of endothelial, fibroblast, and smooth muscle cells within the coronary vessel wall in CAD susceptibility. Our study highlights the value of a multi-ethnic design in efficiently characterizing the genetic architecture of CAD across all human populations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.