The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Summary Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of mRNA and miRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project – the first uniformly processed RNA-seq data from multiple human populations with high-quality genome sequences. We discovered extremely widespread genetic variation affecting regulation of the majority of genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on cellular mechanisms of regulatory and loss-of-function variation, and allowed us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.
The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.
Primary ciliary dyskinesia (PCD, MIM 242650) is characterized by recurrent infections of the respiratory tract due to reduced mucociliary clearance and by sperm immobility. Half of the affected offspring have situs inversus (reversed organs), which results from randomization of left-right (LR) asymmetry. We previously localized to chromosome 5p a PCD locus containing DNAH5, which encodes a protein highly similar to the Chlamydomonas gamma-dynein heavy chain. Here we characterize the full-length 14-kb transcript of DNAH5. Sequence analysis in individuals with PCD with randomization of LR asymmetry identified mutations resulting in non-functional DNAH5 proteins.
Nephronophthisis (NPHP) is the most frequent genetic cause of chronic renal failure in children. Identification of four genes mutated in NPHP subtypes 1-4 (refs. 4-9) has linked the pathogenesis of NPHP to ciliary functions. Ten percent of affected individuals have retinitis pigmentosa, constituting the renal-retinal Senior-Loken syndrome (SLSN). Here we identify, by positional cloning, mutations in an evolutionarily conserved gene, IQCB1 (also called NPHP5), as the most frequent cause of SLSN. IQCB1 encodes an IQ-domain protein, nephrocystin-5. All individuals with IQCB1 mutations have retinitis pigmentosa. Hence, we examined the interaction of nephrocystin-5 with RPGR (retinitis pigmentosa GTPase regulator), which is expressed in photoreceptor cilia and associated with 10-20% of retinitis pigmentosa. We show that nephrocystin-5, RPGR and calmodulin can be coimmunoprecipitated from retinal extracts, and that these proteins localize to connecting cilia of photoreceptors and to primary cilia of renal epithelial cells. Our studies emphasize the central role of ciliary dysfunction in the pathogenesis of SLSN.
Nephronophthisis (NPHP), a group of autosomal recessive cystic kidney disorders, is the most common genetic cause of progressive renal failure in children and young adults. NPHP may be associated with Leber congenital amaurosis, tapeto-retinal degeneration, cerebellar ataxia, cone-shaped epiphyses, congenital oculomotor apraxia and hepatic fibrosis. Loci associated with an infantile type of NPHP on 9q22-q31 (NPHP2), juvenile types of NPHP on chromosomes 2q12-q13 (NPHP1) and 1p36 (NPHP4) and an adolescent type of NPHP on 3q21-q22 (NPHP3) have been mapped. NPHP1 and NPHP4 have been identified, and interaction of the respective encoded proteins nephrocystin and nephrocystin-4 has been shown. Here we report the identification of NPHP3, encoding a novel 1,330-amino acid protein that interacts with nephrocystin. We describe mutations in NPHP3 in families with isolated NPHP and in families with NPHP with associated hepatic fibrosis or tapeto-retinal degeneration. We show that the mouse ortholog Nphp3 is expressed in the node, kidney tubules, retina, respiratory epithelium, liver, biliary tract and neural tissues. In addition, we show that a homozygous missense mutation in Nphp3 is probably responsible for the polycystic kidney disease (pcy) mouse phenotype. Interventional studies in the pcy mouse have shown beneficial effects by modification of protein intake and administration of methylprednisolone, suggesting therapeutic strategies for treating individuals with NPHP3.
Antenatal Bartter syndrome (aBS) comprises a heterogeneous group of autosomal recessive salt-losing nephropathies. Identification of three genes that code for renal transporters and channels as responsible for aBS has resulted in new insights into renal salt handling, diuretic action and blood-pressure regulation. A gene locus of a fourth variant of aBS called BSND, which in contrast to the other forms is associated with sensorineural deafness (SND) and renal failure, has been mapped to chromosome 1p. We report here the identification by positional cloning, in a region not covered by the human genome sequencing projects, of a new gene, BSND, as the cause of BSND. We examined ten families with BSND and detected seven different mutations in BSND that probably result in loss of function. In accordance with the phenotype, BSND is expressed in the thin limb and the thick ascending limb of the loop of Henle in the kidney and in the dark cells of the inner ear. The gene encodes a hitherto unknown protein with two putative transmembrane alpha-helices and thus might function as a regulator for ion-transport proteins involved in aBS, or else as a new transporter or channel itself.
Hailey-Hailey disease (HHD) is an autosomal dominant skin disorder characterized by suprabasal cell separation (acantholysis) of the epidermis. Previous genetic linkage studies localized the gene to a 5 cM interval on human chromosome 3q21. After reducing the disease critical region to <1 cM, we used a positional cloning strategy to identify the gene ATP2C1, which is mutated in HHD. ATP2C1 encodes a new class of P-type Ca(2+)-transport ATPase, which is the homologue for the rat SPLA and the yeast PMR1 medial Golgi Ca(2+)pumps and is related to the sarco(endo)plasmic calcium ATPase (SERCA) and plasma membrane calcium ATPase (PCMA) families of Ca(2+)pumps. The predicted protein has the same apparent transmembrane organization and contains all of the conserved domains present in other P-type ATPases. ATP2C1 produces two alternative splice variants of approximately 4.5 kb encoding predicted proteins of 903 and 923 amino acids. We identified 13 different mutations, including nonsense, frameshift insertion and deletions, splice-site mutations, and non-conservative missense mutations. This study demonstrates that defects in ATP2C1 cause HHD and together with the recent identification of ATP2A2 as the defective gene in Darier's disease, provide further evidence of the critical role of Ca(2+)signaling in maintaining epidermal integrity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.