Knowledge of the molecular interactions of human proteins within tissues is important for identifying their tissue-specific roles and for shedding light on tissue phenotypes. However, many protein–protein interactions (PPIs) have no tissue-contexts. The TissueNet database bridges this gap by associating experimentally-identified PPIs with human tissues that were shown to express both pair-mates. Users can select a protein and a tissue, and obtain a network view of the query protein and its tissue-associated PPIs. TissueNet v.2 is an updated version of the TissueNet database previously featured in NAR. It includes over 40 human tissues profiled via RNA-sequencing or protein-based assays. Users can select their preferred expression data source and interactively set the expression threshold for determining tissue-association. The output of TissueNet v.2 emphasizes qualitative and quantitative features of query proteins and their PPIs. The tissue-specificity view highlights tissue-specific and globally-expressed proteins, and the quantitative view highlights proteins that were differentially expressed in the selected tissue relative to all other tissues. Together, these views allow users to quickly assess the unique versus global functionality of query proteins. Thus, TissueNet v.2 offers an extensive, quantitative and user-friendly interface to study the roles of human proteins across tissues. TissueNet v.2 is available at http://netbio.bgu.ac.il/tissuenet.
A longstanding puzzle in human genetics is what limits the clinical manifestation of hundreds of hereditary diseases to certain tissues, while their causal genes are expressed throughout the human body. A general conception is that tissue-selective disease phenotypes emerge when masking factors operate in unaffected tissues, but are specifically absent or insufficient in disease-manifesting tissues. Although this conception has critical impact on the understanding of disease manifestation, it was never challenged in a systematic manner across a variety of hereditary diseases and affected tissues. Here, we address this gap in our understanding via rigorous analysis of the susceptibility of over 30 tissues to 112 tissue-selective hereditary diseases. We focused on the roles of paralogs of causal genes, which are presumably capable of compensating for their aberration. We show for the first time at large-scale via quantitative analysis of omics datasets that, preferentially in the disease-manifesting tissues, paralogs are under-expressed relative to causal genes in more than half of the diseases. This was observed for several susceptible tissues and for causal genes with varying number of paralogs, suggesting that imbalanced expression of paralogs increases tissue susceptibility. While for many diseases this imbalance stemmed from up-regulation of the causal gene in the disease-manifesting tissue relative to other tissues, it was often combined with down-regulation of its paralog. Notably in roughly 20% of the cases, this imbalance stemmed only from significant down-regulation of the paralog. Thus, dosage relationships between paralogs appear as important, yet currently under-appreciated, modifiers of disease manifestation.
A longstanding puzzle in human genetics is what limits the clinical manifestation of hundreds of hereditary diseases to certain tissues or cell types, while their causal genes are present and expressed throughout the human body. Here we considered a possible role for paralogs of causal genes in affecting this tissue selectivity. It has been shown across organisms that paralogs can compensate for the loss of each other. We hypothesized that specifically in the disease tissue causal genes and their paralogs are imbalanced, leading to insufficient compensation and to the emergence of disease phenotypes. While demonstrated previously in the context of few specific diseases, this hypothesis was never assessed quantitatively at large-scale. For this, we analyzed functional relationships between causal genes and their paralogs associated with 112 tissue-selective hereditary diseases. To test our hypothesis we used several large-scale omics datasets, including RNA sequencing profiles of over 30 different human tissues. Indeed, the expression of causal genes and their paralogs was significantly imbalanced in their disease tissues compared to unaffected tissues. Imbalanced expression was evident across different disease tissues, and was common to causal genes with single or multiple paralogs. This imbalance was driven by significant upregulation of the causal gene in its disease tissue, often combined with significant down-regulation of a paralog. Nevertheless, in additional 20% of the causal genes, a paralog alone was significantly down-regulated in the disease tissue. Our results suggest that dosage relationships between paralogs affect the phenotypic outcome of germline aberrations, adding paralogs as important modifiers of disease manifestation.
Motivation The distinct functionalities of human tissues and cell types underlie complex phenotype-genotype relationships, yet often remain elusive. Harnessing the multitude of bulk and single-cell human transcriptomes while focusing on processes can help reveal these distinct functionalities. Results The Tissue-Process Activity (TiPA) method aims to identify processes that are preferentially active or under-expressed in specific contexts, by comparing the expression levels of process genes between contexts. We tested TiPA on 1,579 tissue-specific processes and bulk tissue transcriptomes, finding that it performed better than another method. Next, we used TiPA to ask whether the activity of certain processes could underlie the tissue-specific manifestation of 1,233 hereditary diseases. We found that 21% of the disease-causing genes indeed participated in such processes, thereby illuminating their genotype-phenotype relationships. Lastly, we applied TiPA to single-cell transcriptomes of 108 human cell types, revealing that process activities often match cell type identities and can thus aid annotation efforts. Hence, differential activity of processes can highlight the distinct functionality of tissues and cells in a robust and meaningful manner. Availability TiPA code is available in GitHub (https://github.com/moranshar/TiPA). In addition, all data are available as part of the Supplementary Information. Supplementary information Supplementary data are available at Bioinformatics online.
How do aberrations in widely expressed genes lead to tissue‐selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed “Tissue Risk Assessment of Causality by Expression” (TRACE), a machine learning approach to predict genes that underlie tissue‐selective diseases and selectivity‐related features. TRACE utilized 4,744 biologically interpretable tissue‐specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity‐related features, the most common of which was previously overlooked. Next, we created a catalog of tissue‐associated risks for 18,927 protein‐coding genes (https://netbio.bgu.ac.il/trace/). As proof‐of‐concept, we prioritized candidate disease genes identified in 48 rare‐disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Genetic studies of Mendelian and rare diseases face the critical challenges of identifying pathogenic gene variants and their modes-of-action. Previous efforts rarely utilized the tissue-selective manifestation of these diseases for their elucidation. Here we introduce an interpretable machine learning (ML) platform that utilizes heterogeneous and large-scale tissue-aware datasets of human genes, and rigorously, concurrently and quantitatively assesses hundreds of candidate mechanisms per disease. The resulting tissue-aware ML platform is applicable in gene-specific, tissue-specific, or patient-specific modes. Application of the platform to selected Mendelian disease genes pinpointed mechanisms that lead to tissue-specific disease manifestation. When applied jointly to diseases that manifest in the same tissue, the models revealed common known and previously underappreciated factors that underlie tissue-selective disease manifestation. Lastly, we harnessed our ML platform toward genetic diagnosis of tissue-selective rare diseases. Patient-specific models of candidate disease-causing genes from 50 patients successfully prioritized the pathogenic gene in 86% of the cases, implying that the tissue-selectivity of rare diseases aids in filtering out unlikely candidate genes. Thus, interpretable tissue-aware ML models can boost mechanistic understanding and genetic diagnosis of tissue-selective heritable diseases. A webserver supporting gene prioritization is available at https://netbio.bgu.ac.il/trace/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.