Tissue and cell-type identity lie at the core of human physiology and disease. Understanding the genetic underpinnings of complex tissues and individual cell lineages is crucial for developing improved diagnostics and therapeutics. We present genome-wide functional interaction networks for 144 human tissues and cell types developed using a data-driven Bayesian methodology that integrates thousands of diverse experiments spanning tissue and disease states. Tissue-specific networks predict lineage-specific responses to perturbation, reveal genes’ changing functional roles across tissues, and illuminate disease-disease relationships. We introduce NetWAS, which combines genes with nominally significant GWAS p-values and tissue-specific networks to identify disease-gene associations more accurately than GWAS alone. Our webserver, GIANT, provides an interface to human tissue networks through multi-gene queries, network visualization, analysis tools including NetWAS, and downloadable networks. GIANT enables systematic exploration of the landscape of interacting genes that shape specialized cellular functions across more than one hundred human tissues and cell types.
Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder with a strong genetic basis. Yet, only a small fraction of potentially causal genes—about 65 genes out of an estimated several hundred—are known with strong genetic evidence from sequencing studies. We developed a complementary machine-learning approach based on a human brain-specific gene network to present a genome-wide prediction of autism risk genes, including hundreds of candidates for which there is minimal or no prior genetic evidence. Our approach was validated in a large independent case–control sequencing study. Leveraging these genomewide predictions and the brain-specific network, we demonstrated that the large set of ASD genes converges on a smaller number of key pathways and developmental stages of the brain. Finally, we identified likely pathogenic genes within frequent autism-associated copy-number variants and proposed genes and pathways that are likely mediators of ASD across multiple copy-number variants. All predictions and functional insights are available at http://asd.princeton.edu.
We present SEEK (http://seek.princeton.edu), a query-based search engine across very large transcriptomic data collections, including thousands of human data sets from almost 50 microarray and next-generation sequencing platforms. SEEK uses a novel query-level cross-validation-based algorithm to automatically prioritize data sets relevant to the query and a robust search approach to identify query-coregulated genes, pathways, and processes. SEEK provides cross-platform handling, multi-gene query search, iterative metadata-based search refinement, and extensive visualization-based analysis options.
Adeno-associated virus (AAV) receptor (AAVR) is an essential receptor for the entry of multiple AAV serotypes with divergent rules; however, the mechanism remains unclear. Here, we determine the structures of the AAV1-AAVR and AAV5-AAVR complexes, revealing the molecular details by which PKD1 recognizes AAV5 and PKD2 is solely engaged with AAV1. PKD2 lies on the plateau region of the AAV1 capsid. However, the AAV5-AAVR interface is strikingly different, in which PKD1 is bound at the opposite side of the spike of the AAV5 capsid than the PKD2-interacting region of AAV1. Residues in strands F/G and the CD loop of PKD1 interact directly with AAV5, whereas residues in strands B/C/E and the BC loop of PKD2 make contact with AAV1. These findings further the understanding of the distinct mechanisms by which AAVR recognizes various AAV serotypes and provide an example of a single receptor engaging multiple viral serotypes with divergent rules.
The capping of mRNA and the proofreading plays essential roles in SARS-CoV-2 replication and transcription. Here, we present the cryo-EM structure of the SARS-CoV-2
R
eplication-
T
ranscription
C
omplex (RTC) in a form identified as Cap(0)-RTC, which couples a
C
o-transcriptional
C
apping
C
omplex (CCC) composed of nsp12 NiRAN, nsp9, the bifunctional nsp14 possessing a N-terminal exoribonuclease (ExoN) and a C-terminal N7-methyltransferase (N7-MTase), and nsp10 as a cofactor of nsp14. Nsp9 and nsp12 NiRAN recruit nsp10/nsp14 into the Cap(0)-RTC, forming the N7-CCC to yield cap(0) (
7Me
GpppA) at 5’ end of pre-mRNA. A dimeric form of Cap(0)-RTC observed by cryo-EM suggests an
in trans
backtracking mechanism for nsp14 ExoN to facilitate proofreading of the RNA in concert with polymerase nsp12. These results not only provide a structural basis for understanding co-transcriptional modification of SARS-CoV-2 mRNA, but also shed light on how replication fidelity in SARS-CoV-2 is maintained.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.