Smoking is a leading cause of preventable death, causing approximately five million premature deaths world-wide each year 1, 2 . Evidence for genetic influence on smoking behaviour and nicotine dependence (ND) 3-8 has prompted a search for susceptibility genes. Furthermore, assessing the impact of sequence variants on smoking-related diseases is important for public health reasons 9, 10 . Smoking is the major risk factor for lung cancer (LC) [11][12][13][14] , and one of the main risk factors for peripheral arterial disease (PAD) [15][16][17] . We have identified a common variant in the nicotinic acetylcholine receptor gene cluster on chromosome 15q24 with an effect on smoking quantity, ND and the risk of two smoking-related diseases in populations of European descent. The variant has an effect on the number of cigarettes smoked per day in 15,771 smokers (P=6×10 −20 ). The same variant associated with ND in a previous genome-wide association study using low quantity smokers as controls (OR=1.3, P=1×10 −3 ) 18,19 , and with a similar approach we observe a highly significant association with ND (OR =1.40, P=7×10 −15 ). Comparison of LC (N=1,024) and PAD (N= 2,738) cases with about 30,000 population controls each showed that the variant confers risk of LC (OR=1.31, P=1.5×10 −8 ) and PAD (OR=1.19, P=1.4×10 −7 ). The findings highlight the role of nicotine addiction in the pathogenesis of other serious diseases and provide a case study of the role of active gene-environment correlation 20 in the pathogenesis of disease.To perform a genome-wide association (GWA) study of smoking quantity (SQ), we utilised questionnaire data limited to basic questions on smoking behaviour that were available for a large number of lifetime smokers. The GWA scan comprises 10,995 Icelandic smokers who Reprints and permissions information is available at www.nature.com/reprints.
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Hair, skin and eye colors are highly heritable and visible traits in humans. We carried out a genome-wide association scan for variants associated with hair and eye pigmentation, skin sensitivity to sun and freckling among 2,986 Icelanders. We then tested the most closely associated SNPs from six regions--four not previously implicated in the normal variation of human pigmentation--and replicated their association in a second sample of 2,718 Icelanders and a sample of 1,214 Dutch. The SNPs from all six regions met the criteria for genome-wide significance. A variant in SLC24A4 is associated with eye and hair color, a variant near KITLG is associated with hair color, two coding variants in TYR are associated with eye color and freckles, and a variant on 6p25.3 is associated with freckles. The fifth region provided refinements to a previously reported association in OCA2, and the sixth encompasses previously described variants in MC1R.
Key Points Whole-genome sequencing of 11 262 Icelanders reveals that clonal hematopoiesis is very common in the elderly. Somatic mutation of some genes is strongly associated with clonal hematopoiesis, but in most cases, no driver mutations were evident.
Here we describe the insights gained from sequencing the whole genomes of 2,636 Icelanders to a median depth of 20×. We found 20 million SNPs and 1.5 million insertions-deletions (indels). We describe the density and frequency spectra of sequence variants in relation to their functional annotation, gene position, pathway and conservation score. We demonstrate an excess of homozygosity and rare protein-coding variants in Iceland. We imputed these variants into 104,220 individuals down to a minor allele frequency of 0.1% and found a recessive frameshift mutation in MYL4 that causes early-onset atrial fibrillation, several mutations in ABCB4 that increase risk of liver diseases and an intronic variant in GNAS associating with increased thyroid-stimulating hormone levels when maternally inherited. These data provide a study design that can be used to determine how variation in the sequence of the human genome gives rise to human diversity.
Familial clustering studies indicate that breast cancer risk has a substantial genetic component. To identify new breast cancer risk variants, we genotyped approximately 300,000 SNPs in 1,600 Icelandic individuals with breast cancer and 11,563 controls using the Illumina Hap300 platform. We then tested selected SNPs in five replication sample sets. Overall, we studied 4,554 affected individuals and 17,577 controls. Two SNPs consistently associated with breast cancer: approximately 25% of individuals of European descent are homozygous for allele A of rs13387042 on chromosome 2q35 and have an estimated 1.44-fold greater risk than noncarriers, and for allele T of rs3803662 on 16q12, about 7% are homozygous and have a 1.64-fold greater risk. Risk from both alleles was confined to estrogen receptor-positive tumors. At present, no genes have been identified in the linkage disequilibrium block containing rs13387042. rs3803662 is near the 5' end of TNRC9 , a high mobility group chromatin-associated protein whose expression is implicated in breast cancer metastasis to bone.
The common sequence variants that have recently been associated with cancer risk are particular to a single, or at most two, cancer types. Following up on our genome-wide scan of basal cell carcinoma1, we identified rs401681(C) on chromosome 5p15.33 satisfying our threshold for genome-wide significance (OR=1.25, P=3.7×10−12). We tested rs401681 for association with sixteen additional cancer types in over 30,000 cancer cases and 45,000 controls and found association with lung cancer (OR=1.15, P=7.2×10−8) and urinary bladder, prostate and cervix cancer (ORs 1.07–1.31, all P<4×10−4). However, rs401681(C) appears to confer protection against cutaneous melanoma (OR=0.88, P=8.0×10−4). Interestingly, most of these cancer types have a strong environmental component to their risk. Investigation of the region led us to rs2736098(A), that showed stronger association with some cancer types. However, neither variant could fully account for the association of the other. Rs2736098 corresponds to A305A in the telomerase reverse transcriptase (TERT) protein while rs401681 is in an intron of the CLPTM1L gene.
Prostate cancer is the most prevalent noncutaneous cancer in males in developed regions, with African American men having among the highest worldwide incidence and mortality rates. Here we report a second genetic variant in the 8q24 region that, in conjunction with another variant we recently discovered, accounts for about 11%-13% of prostate cancer cases in individuals of European descent and 31% of cases in African Americans. We made the current discovery through a genome-wide association scan of 1,453 affected Icelandic individuals and 3,064 controls using the Illumina HumanHap300 BeadChip followed by four replication studies. A key step in the discovery was the construction of a 14-SNP haplotype that efficiently tags a relatively uncommon (2%-4%) susceptibility variant in individuals of European descent that happens to be very common (approximately 42%) in African Americans. The newly identified variant shows a stronger association with affected individuals who have an earlier age at diagnosis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.