Here we describe the SweGen data set, a comprehensive map of genetic variation in the Swedish population. These data represent a basic resource for clinical genetics laboratories as well as for sequencing-based association studies by providing information on genetic variant frequencies in a cohort that is well matched to national patient cohorts. To select samples for this study, we first examined the genetic structure of the Swedish population using high-density SNP-array data from a nation-wide cohort of over 10 000 Swedish-born individuals included in the Swedish Twin Registry. A total of 1000 individuals, reflecting a cross-section of the population and capturing the main genetic structure, were selected for whole-genome sequencing. Analysis pipelines were developed for automated alignment, variant calling and quality control of the sequencing data. This resulted in a genome-wide collection of aggregated variant frequencies in the Swedish population that we have made available to the scientific community through the website https://swefreq.nbis.se. A total of 29.2 million single-nucleotide variants and 3.8 million indels were detected in the 1000 samples, with 9.9 million of these variants not present in current databases. Each sample contributed with an average of 7199 individual-specific variants. In addition, an average of 8645 larger structural variants (SVs) were detected per individual, and we demonstrate that the population frequencies of these SVs can be used for efficient filtering analyses. Finally, our results show that the genetic diversity within Sweden is substantial compared with the diversity among continental European populations, underscoring the relevance of establishing a local reference data set.
The genetics underlying thyroid cancer dedifferentiation is only partly understood and has not yet been characterised using comprehensive pan-genomic analyses. We investigated a unique case with synchronous follicular thyroid carcinoma (FTC), poorly differentiated thyroid carcinoma (PDTC), and anaplastic thyroid carcinoma (ATC), as well as regional lymph node metastases from the PDTC and ATC from a single patient using whole-genome sequencing (WGS). The FTC displayed mutations in CALR, RB1, and MSH2, and the PDTC exhibited mutations in TP53, DROSHA, APC, TERT , and additional DNA repair genes-associated with an immense increase in sub-clonal somatic mutations. All components displayed an overrepresentation of C>T transitions with associated microsatellite instability (MSI) in the PDTC and ATC, with borderline MSI in the FTC. Clonality analyses pinpointed a shared ancestral clone enriched for mutations in TP53-associated regulation of DNA repair and identified important sub-clones for each tumour component already present in the corresponding preceding lesion. This genomic characterisation of the natural progression of thyroid cancer reveals several novel genes of interest for future studies. Moreover, the findings support the theory of a stepwise dedifferentiation process and suggest that defects in DNA repair could play an important role in the clonal evolution of thyroid cancer.
BackgroundHeritable factors are well known to increase the risk of cancer in families. Known susceptibility genes account for a small proportion of all colorectal cancer cases. The aim of this study was to identify the genetic background in a family suggested to segregate a dominant cancer syndrome with a high risk of rectal- and gastric cancer. We performed whole exome sequencing in three family members, 2 with rectal cancer and 1 with gastric cancer and followed it up in additional family members, other patients and controls.ResultsWe identified 12 novel non-synonymous single nucleotide variants, which were shared among 5 affected members of this family. The mutations were found in 12 different genes; DZIP1L, PCOLCE2, IGSF10, SUCNR1, OR13C8, EPB41L4B, SEC16A, NOTCH1, TAS2R7, SF3A1, GAL3ST1, and TRIOBP. None of the mutations was suggested as a high penetrant mutation. It was not possible to completely rule out any of the mutations as contributing to disease, although seven were more unlikely than the others. Neither did we rule out the effect of all thousands of intronic, intergenic and synonymous variants shared between the three persons used for exome sequencing.ConclusionsWe propose this family, suggested to segregate dominant disease, could be an example of complex inheritance.
Plasmodium falciparum genome has 81% A+T content. This nucleotide bias leads to extreme codon usage bias and culminates in frequent insertion of asparagine homorepeats in the proteome. Using recodonized GFP sequences, we show that codons decoded via G:U wobble pairing are suboptimal codons that are negatively associated to protein translation efficiency. Despite this, one third of all codons in the genome are GU wobble codons, suggesting that codon usage in P. falciparum has not been driven to maximize translation efficiency, but may have evolved as translational regulatory mechanism. Particularly, asparagine homorepeats are generally encoded by locally clustered GU wobble AAT codons, we demonstrated that this GU wobble-rich codon context is the determining factor that causes reduction of protein level. Moreover, insertion of clustered AAT codons also causes destabilization of the transcripts. Interestingly, more frequent asparagine homorepeats insertion is seen in single-exon genes, suggesting transcripts of these genes may have been programmed for rapid mRNA decay to compensate for the inefficiency of mRNA surveillance regulation on intronless genes. To our knowledge, this is the first study that addresses P. falciparum codon usage in vitro and provides new insights on translational regulation and genome evolution of this parasite.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.