The Genotype-Tissue Expression (GTEx) project was established to characterize genetic effects on the transcriptome across human tissues and to link these regulatory mechanisms to trait and disease associations. Here, we present analyses of the version 8 data, examining 15,201 RNA-sequencing samples from 49 tissues of 838 postmortem donors. We comprehensively characterize genetic associations for gene expression and splicing in cis and trans, showing that regulatory associations are found for almost all genes, and describe the underlying molecular mechanisms and their contribution to allelic heterogeneity and pleiotropy of complex traits. Leveraging the large diversity of tissues, we provide insights into the tissue specificity of genetic effects and show that cell type composition is a key factor in understanding gene regulatory mechanisms in human tissues.
Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.
Highlights d Unsupervised clustering revealed subtype with EMT and phosphoprotein signatures d Potential therapeutic vulnerabilities included survivin, NSD3, LSD1, and EZH2 d Rb phosphorylation nominated as a biomarker for trials with CDK4/6 inhibitors d Detailed immune landscape analysis highlighted targetable points of immuneregulation
PURPOSE Smoldering multiple myeloma (SMM) is a precursor condition of multiple myeloma (MM) with a 10% annual risk of progression. Various prognostic models exist for risk stratification; however, those are based on solely clinical metrics. The discovery of genomic alterations that underlie disease progression to MM could improve current risk models. METHODS We used next-generation sequencing to study 214 patients with SMM. We performed whole-exome sequencing on 166 tumors, including 5 with serial samples, and deep targeted sequencing on 48 tumors. RESULTS We observed that most of the genetic alterations necessary for progression have already been acquired by the diagnosis of SMM. Particularly, we found that alterations of the mitogen-activated protein kinase pathway ( KRAS and NRAS single nucleotide variants [SNVs]), the DNA repair pathway (deletion 17p, TP53, and ATM SNVs), and MYC (translocations or copy number variations) were all independent risk factors of progression after accounting for clinical risk staging. We validated these findings in an external SMM cohort by showing that patients who have any of these three features have a higher risk of progressing to MM. Moreover, APOBEC associated mutations were enriched in patients who progressed and were associated with a shorter time to progression in our cohort. CONCLUSION SMM is a genetically mature entity whereby most driver genetic alterations have already occurred, which suggests the existence of a right-skewed model of genetic evolution from monoclonal gammopathy of undetermined significance to MM. We identified and externally validated genomic predictors of progression that could distinguish patients at high risk of progression to MM and, thus, improve on the precision of current clinical models.
Current genomics methods are designed to handle tens to thousands of samples but will need to scale to millions to match the pace of data and hypothesis generation in biomedical science. Here, we show that high efficiency at low cost can be achieved by leveraging general-purpose libraries for computing using graphics processing units (GPUs), such as PyTorch and TensorFlow. We demonstrate > 200-fold decreases in runtime and ~ 5–10-fold reductions in cost relative to CPUs. We anticipate that the accessibility of these libraries will lead to a widespread adoption of GPUs in computational genomics.
Allele expression (AE) analysis robustly measures cis-regulatory effects. Here, we present and demonstrate the utility of a vast AE resource generated from the GTEx v8 release, containing 15,253 samples spanning 54 human tissues for a total of 431 million measurements of AE at the SNP level and 153 million measurements at the haplotype level. In addition, we develop an extension of our tool phASER that allows effect sizes of cis-regulatory variants to be estimated using haplotype-level AE data. This AE resource is the largest to date, and we are able to make haplotype-level data publicly available. We anticipate that the availability of this resource will enable future studies of regulatory variation across human tissues.
Antiviral therapeutics against SARS-CoV-2 are needed to treat the pandemic disease COVID-19. Pharmacological targeting of a host factor required for viral replication can suppress viral spread with a low probability of viral mutation leading to resistance. Here, we used a genome-wide loss of function CRISPR/Cas9 screen in human lung epithelial cells to identify potential host therapeutic targets. Validation of our screening hits revealed that the kinase SRPK1, together with the closely related SRPK2, were jointly essential for SARS-CoV-2 replication; inhibition of SRPK1/2 with small molecules led to a dramatic decrease (more than 100,000-fold) in SARS-CoV-2 virus production in immortalized and primary human lung cells. Subsequent biochemical studies revealed that SPRK1/2 phosphorylate the viral nucleocapsid (N) protein at sites highly conserved across human coronaviruses and, due to this conservation, even a distantly related coronavirus was highly sensitive to an SPRK1/2 inhibitor. Together, these data suggest that SRPK1/2-targeted therapies may be an efficacious strategy to prevent or treat COVID-19 and other coronavirus-mediated diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.