Whole genome sequencing (WGS) is a promising strategy to unravel variants or genes responsible for human diseases and traits. However, there is a lack of robust platforms for a comprehensive downstream analysis. In the present study, we first proposed three novel algorithms, sequence gap-filled gene feature annotation, bit-block encoded genotypes and sectional fast access to text lines to address three fundamental problems. The three algorithms then formed the infrastructure of a robust parallel computing framework, KGGSeq, for integrating downstream analysis functions for whole genome sequencing data. KGGSeq has been equipped with a comprehensive set of analysis functions for quality control, filtration, annotation, pathogenic prediction and statistical tests. In the tests with whole genome sequencing data from 1000 Genomes Project, KGGSeq annotated several thousand more reliable non-synonymous variants than other widely used tools (e.g. ANNOVAR and SNPEff). It took only around half an hour on a small server with 10 CPUs to access genotypes of ∼60 million variants of 2504 subjects, while a popular alternative tool required around one day. KGGSeq's bit-block genotype format used 1.5% or less space to flexibly represent phased or unphased genotypes with multiple alleles and achieved a speed of over 1000 times faster to calculate genotypic correlation.
Genetic variants are implicated in the development of amyotrophic lateral sclerosis (ALS), but it is unclear whether the burden of rare variants in ALS genes has an effect on survival. We performed whole genome sequencing on 8 familial ALS (FALS) patients with superoxide dismutase 1 (SOD1) mutation and whole exome sequencing on 46 sporadic ALS (SALS) patients living in Hong Kong and found that 67% had at least 1 rare variant in the exons of 40 ALS genes; 22% had 2 or more. Patients with 2 or more rare variants had lower probability of survival than patients with 0 or 1 variant (p = 0.001). After adjusting for other factors, each additional rare variant increased the risk of respiratory failure or death by 60% (p = 0.0098). The presence of the rare variant was associated with the risk of ALS (Odds ratio 1.91, 95% confidence interval 1.03-3.61, p = 0.03), and ALS patients had higher rare variant burden than controls (MB, p = 0.004). Our findings support an oligogenic basis with the burden of rare variants affecting the development and survival of ALS.
Background Biliary atresia (BA) is the most common obstructive cholangiopathy in neonates, often progressing to end-stage cirrhosis. BA pathogenesis is believed to be multifactorial, but the genetic contribution, especially for nonsyndromic BA (common form: > 85%) remains poorly defined. Methods We conducted whole exome sequencing on 89 nonsyndromic BA trios to identify rare variants contributing to BA etiology. Functional evaluation using patients’ liver biopsies, human cell and zebrafish models were performed. Clinical impact on respiratory system was assessed with clinical evaluation, nasal nitric oxide (nNO), high speed video analysis and transmission electron microscopy. Findings We detected rare, deleterious de novo or biallelic variants in liver-expressed ciliary genes in 31.5% (28/89) of the BA patients. Burden test revealed 2.6-fold (odds ratio (OR) [95% confidence intervals (CI)]= 2.58 [1.15–6.07], adjusted p = 0.034) over-representation of rare, deleterious mutations in liver-expressed ciliary gene set in patients compared to controls. Functional analyses further demonstrated absence of cilia in the BA livers with KIF3B and TTC17 mutations, and knockdown of PCNT, KIF3B and TTC17 in human control fibroblasts and cholangiocytes resulted in reduced number of cilia. Additionally, CRISPR/Cas9-engineered zebrafish knockouts of KIF3B, PCNT and TTC17 displayed reduced biliary flow. Abnormally low level of nNO was detected in 80% (8/10) of BA patients carrying deleterious ciliary mutations, implicating the intrinsic ciliary defects. Interpretation Our findings support strong genetic susceptibility for nonsyndromic BA. Ciliary gene mutations leading to cholangiocyte cilia malformation and dysfunction could be a key biological mechanism in BA pathogenesis. Funding The study is supported by General Research Fund, HMRF Commissioned Paediatric Research at HKCH and Li Ka Shing Faculty of Medicine Enhanced New Staff Start-up Fund.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.