Autism Spectrum Disorder (ASD) and Intellectual Disability (ID) are comorbid neurodevelopmental disorders with complex genetic architectures. Despite large-scale sequencing studies only a fraction of the risk genes were identified for both. Here, we present a novel network-based gene risk prioritization algorithm named DeepND that performs cross-disorder analysis to improve prediction power by exploiting the comorbidity of ASD and ID via multitask learning. Our model leverages information from gene coexpression networks that model human brain development using graph convolutional neural networks and learns which spatio-temporal neurovelopmental windows are important for disorder etiologies. We show that our approach substantially improves the state-of-the-art prediction power in both single-disorder and cross-disorder settings. DeepND identifies mediodorsal thalamus and cerebral cortex brain region and infancy to childhood period as the highest neurodevelopmental risk window for both ASD and ID. We observe that both disorders are enriched in transcription regulators. Despite tight regulatory links in between ASD risk genes, such is lacking across ASD and ID risk genes or within ID risk genes. Finally, we investigate frequent ASD and ID associated copy number variation regions and confident false findings to suggest several novel susceptibility gene candidates. DeepND can be generalized to analyze any combinations of comorbid disorders and is released at http://github.com/ciceklab/deepnd. # Equal contribution.*Correspondance: cicek@cs.bilkent.edu.tr been used to assess gene risk using excess genetic burden from case-control and family studies [35] which are recently extended to work with multiple traits [68]. Yet, these tools work with genes with observed disruptive mutations (mainly de novo). It is often of interest to use these as prior risk and obtain a posterior gene interaction network-adjusted risk which can also assess risk for genes with no prior signal. Network-based computational gene risk prediction methods come handy for (i) imputing the insufficient statistical signal and providing a genome-wide risk ranking, and (ii) finding out the affected cellular circuitries such as pathways and networks of genes [35,29,24,37,69,27,26,67,10]. While these methods have helped unraveling the underlying mechanisms, they have several limitations. First, by design, they are limited to work with a single disorder. In order to compare and contrast comorbid disorders such as ASD and ID using these tools, one approach is to bag the mutational burden observed for each disorder assuming two are the same. However, disorder specific features are lost as a consequence [29]. The more common approach is to perform independent analyses per disorder and intersect the results. Unfortunately, this approach ignores valuable source of information coming from the shared genetic architecture and lose prediction power as per-disorder analyses have less input (i.e., samples, mutation counts) and less statistical power [81,12,37]. Second, c...
The last decade has proven that amyotrophic lateral sclerosis (ALS) is clinically and genetically heterogeneous, and that the genetic component in sporadic cases might be stronger than expected. This study investigates 1,200 patients to revisit ALS in the ethnically heterogeneous yet inbred Turkish population. Familial ALS (fALS) accounts for 20% of our cases. The rates of consanguinity are 30% in fALS and 23% in sporadic ALS (sALS). Major ALS genes explained the disease cause in only 35% of fALS, as compared with ~70% in Europe and North America. Whole exome sequencing resulted in a discovery rate of 42% (53/127). Whole genome analyses in 623 sALS cases and 142 population controls, sequenced within Project MinE, revealed well‐established fALS gene variants, solidifying the concept of incomplete penetrance in ALS. Genome‐wide association studies (GWAS) with whole genome sequencing data did not indicate a new risk locus. Coupling GWAS with a coexpression network of disease‐associated candidates, points to a significant enrichment for cell cycle‐ and division‐related genes. Within this network, literature text‐mining highlights DECR1, ATL1, HDAC2, GEMIN4, and HNRNPA3 as important genes. Finally, information on ALS‐related gene variants in the Turkish cohort sequenced within Project MinE was compiled in the GeNDAL variant browser (http://www.gendal.org).
The cover image is based on the Data Article Revisiting the complex architecture of ALS in Turkey: Expanding genotypes, shared phenotypes, molecular networks, and a public variant database by Ceren Tunca et al., https://doi.org/10.1002/humu.24055.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.