The identification of causal variants in sequencing studies remains a considerable challenge that can be partially addressed by new gene-specific knowledge. Here, we integrate measures of how essential a gene is to supporting life, as inferred from viability and phenotyping screens performed on knockout mice by the International Mouse Phenotyping Consortium and from essentiality screens carried out on human cell lines. We propose a cross-species gene classification across the Full Spectrum of Intolerance to Loss-of-function (FUSIL) and demonstrate that genes in the five mutually exclusive FUSIL categories have differing biological properties. Most notably, Mendelian disease genes, particularly those associated with developmental disorders, are highly overrepresented among genes that are non-essential for cell survival but required for organism development. After screening developmental disorder cases from three independent disease sequencing consortia, we identify potentially pathogenic variants in genes not previously associated with rare diseases. We therefore propose FUSIL as an efficient approach for disease gene discovery.
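The cross-species binning described above can be illustrated with a minimal sketch. The abstract does not name the five categories or give the exact rules, so the bin labels, the `fusil_bin` function, its parameters, and the handling of discordant cases below are all illustrative assumptions rather than the authors' implementation.

```python
from typing import Optional


def fusil_bin(cell_essential: bool,
              mouse_viability: str,
              has_phenotype: bool = False) -> Optional[str]:
    """Assign a gene to one of five mutually exclusive bins (hypothetical rules).

    cell_essential  -- True if the gene is essential in human cell-line screens
    mouse_viability -- "lethal", "subviable" or "viable" for the knockout mouse
    has_phenotype   -- True if a viable knockout shows a significant phenotype
    """
    if mouse_viability not in {"lethal", "subviable", "viable"}:
        raise ValueError(f"unknown viability call: {mouse_viability!r}")

    if mouse_viability == "lethal":
        # Lethal in mouse: split by whether human cells also need the gene.
        return "cellular_lethal" if cell_essential else "developmental_lethal"

    if cell_essential:
        # Assumption: discordant genes (cell-essential but not mouse-lethal)
        # fall outside the five bins in this sketch.
        return None

    if mouse_viability == "subviable":
        return "subviable"

    return "viable_with_phenotype" if has_phenotype else "viable_no_phenotype"


# Toy calls with made-up evidence, one per bin.
examples = [
    fusil_bin(True, "lethal"),
    fusil_bin(False, "lethal"),
    fusil_bin(False, "subviable"),
    fusil_bin(False, "viable", has_phenotype=True),
    fusil_bin(False, "viable"),
]
```

In this sketch, the "developmental_lethal" bin corresponds to the group the abstract highlights: genes non-essential for cell survival but required for organism development.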
Tuberculosis remains a major global health challenge, causing more than a million deaths annually. To develop new methods for detecting and combating the disease, it is necessary to characterise global host responses to infection. Several high-throughput omics studies have provided a rich resource, including lists of genes differentially regulated in tuberculosis. An integrated analysis of these studies is necessary to identify a unified response to the infection. Such integration faces several challenges, including platform dependency, patient heterogeneity, and variability in the extent of infection, which result in little overlap among datasets. Network-based approaches offer an alternative way to integrate and compare diverse data. In this study, we describe a meta-analysis of host whole-blood transcriptomic profiles that were integrated into a genome-scale protein–protein interaction network to generate response networks in active tuberculosis and to monitor their behaviour over treatment. We report the emergence of a highly active common core in disease, which shows partial reversal upon treatment. The core comprises 380 genes, among which STAT1, phospholipid scramblase 1 (PLSCR1), C1QB, OAS1, GBP2 and PSMB9 are prominent hubs. This network captures the interplay between several biological processes, including pro-inflammatory responses, apoptosis, complement signalling, cytoskeletal rearrangement, and enhanced cytokine and chemokine signalling. The common core is specific to tuberculosis and was validated on an independent dataset from an Indian cohort. A network-based approach thus enables the identification of common regulators that characterise the molecular response to infection, providing a platform-independent foundation for extracting maximum insight from available clinical data.
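The idea of mapping each dataset onto a shared interaction network and looking for a common core can be sketched as follows. This is a deliberately simplified stand-in, not the authors' pipeline: the abstract does not specify how response networks are scored or extracted, so the edge-intersection rule, the function names, and the toy gene identifiers (`G1` to `G6`) and interactions below are all assumptions made for illustration.

```python
from collections import Counter


def response_network(ppi_edges, active_genes):
    """Keep only interactions whose two endpoints are both active in a dataset."""
    active = set(active_genes)
    return {
        frozenset((a, b))
        for a, b in ppi_edges
        if a != b and a in active and b in active
    }


def common_core(networks):
    """Interactions present in every per-dataset response network."""
    if not networks:
        return set()
    core = set(networks[0])
    for net in networks[1:]:
        core &= net
    return core


def hubs(edges, top=3):
    """Rank genes by degree within an edge set (ties broken alphabetically)."""
    degree = Counter(gene for edge in edges for gene in edge)
    return sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))[:top]


# Hypothetical interaction network and per-dataset active gene sets.
ppi = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"), ("G5", "G6"), ("G3", "G6")]
dataset_a = {"G1", "G2", "G3", "G4", "G6"}
dataset_b = {"G1", "G2", "G3", "G5", "G6"}

core = common_core([
    response_network(ppi, dataset_a),
    response_network(ppi, dataset_b),
])
top_hubs = hubs(core, top=2)
```

Working at the level of interactions in a shared network, rather than comparing raw gene lists, is one way to obtain overlap across platforms even when the individual differential-expression lists agree poorly.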