Neoplasms are thought to progress to cancer through genetic instability generating cellular diversity and clonal expansions driven by selection for mutations in cancer genes. Despite advances in the study of molecular biology of cancer genes, relatively little is known about evolutionary mechanisms that drive neoplastic progression. It is unknown, for example, which may be more predictive of future progression of a neoplasm: genetic homogenization of the neoplasm, possibly caused by a clonal expansion, or the accumulation of clonal diversity. Here, in a prospective study, we show that clonal diversity measures adapted from ecology and evolution can predict progression to adenocarcinoma in the premalignant condition known as Barrett's esophagus, even when controlling for established genetic risk factors, including lesions in TP53 (p53; ref. 6) and ploidy abnormalities. Progression to cancer through accumulation of clonal diversity, on which natural selection acts, may be a fundamental principle of neoplasia with important clinical implications.
After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist 1,2. Here we present a human genome assembly that surpasses the continuity of GRCh38 2 , along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome 3 , we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes. Complete, telomere-to-telomere reference genome assemblies are necessary to ensure that all genomic variants are discovered and studied. At present, unresolved areas of the human genome are defined by multi-megabase satellite arrays in the pericentromeric regions and the ribosomal DNA arrays on acrocentric short arms, as well as regions enriched in segmental duplications that are greater than hundreds of kilobases in length and that exhibit sequence identity of more than 98% between paralogues. Owing to their absence from the reference, these repeat-rich sequences are often excluded from genetics and genomics studies, which limits the scope of association and functional analyses 4,5. Unresolved repeat sequences also result in unintended consequences; for example, paralogous sequence variants incorrectly being called as allelic variants 6 , and the contamination of bacterial gene databases 7. Completion of the entire human genome is expected to contribute to our understanding of chromosome function 8 , human disease 9 and genomic variation, which will improve technologies in biomedicine that use short-read mapping to a reference genome (for example, RNA sequencing (RNA-seq) 10 , chromatin immunoprecipitation followed by sequencing (ChIP-seq) 11 and assay for transposase-accessible chromatin using sequencing (ATAC-seq) 12). The fundamental challenge of reconstructing a genome from many comparatively short sequencing reads-a process known as genome assembly-is distinguishing the repeated sequences from one another 13. Resolving such repeats relies on sequencing reads that are long enough to span the entire repeat or accurate enough to distinguish each repeat copy on the basis of...
Duplex Sequencing (DS) is a next-generation sequencing methodology capable of detecting a single mutation among >1 × 107 wild-type nucleotides, thereby enabling the study of heterogeneous populations and very-low-frequency genetic alterations. DS can be applied to any double-stranded DNA sample, but it is ideal for small genomic regions of <1 Mb in size. The method relies on the ligation of sequencing adapters harboring random yet complementary double-stranded nucleotide sequences to the sample DNA of interest. Individually labeled strands are then PCR-amplified, creating sequence ‘families’ that share a common tag sequence derived from the two original complementary strands. Mutations are scored only if the variant is present in the PCR families arising from both of the two DNA strands. Here we provide a detailed protocol for efficient DS adapter synthesis, library preparation and target enrichment, as well as an overview of the data analysis workflow. The protocol typically takes 1–3 d.
DNA hypomethylation is a common trait of colorectal cancer. Studies in tumor cell lines and animal models indicate that genome-wide demethylation may cause genetic instability and hence facilitate or accelerate tumor progression. Recent studies have shown that DNA hypomethylation precedes genomic damage in human gastrointestinal cancer, but the nature of this damage has not been clearly established. Here, we show a thorough analysis of DNA methylation and genetic alterations in two series of colorectal carcinomas. The extent of DNA demethylation but not of hypermethylation (both analyzed by amplification of intermethylated sites in near 200 independent sequences arbitrarily selected) correlated with the cumulated genomic damage assessed by two different techniques (arbitrarily primed PCR and comparative genomic hybridization). DNA hypomethylation-related instability was mainly of chromosomal nature and could be explained by a genome-wide effect rather than by the concurrence of the most prevalent genetic and epigenetic alterations. Moreover, the association of p53 mutations with genomic instability was secondary to DNA hypomethylation and the correlation between DNA hypomethylation and genomic instability was observed in tumors with and without mutation in the p53 gene. Our data support a direct link between genome-wide demethylation and chromosomal instability in human colorectal carcinogenesis and are consistent with the studies in model systems demonstrating a role of DNA demethylation in inducing chromosomal instability. (Cancer Res 2006; 66(17): 8462-8)
These results suggest that the combined K-ras and p16 analyses may be of prognostic use in human colorectal cancer.
Background & Aims-Telomere shortening is implicated in cancer and aging, and might link these two biological events. We explored this hypothesis in ulcerative colitis, a chronic inflammatory disease that predisposes to colorectal cancer and where shorter telomeres have been associated with chromosomal instability and tumor progression.
Purpose: Leukocyte telomere length has gained attention as a marker of oxidative damage and age-related diseases, including cancer. We hypothesize that leukocyte telomere length might be able to predict future risk of cancer and examined this in a cohort of patients with Barrett's esophagus, who are at increased risk of esophageal adenocarcinoma and thus were enrolled in a long-term cancer surveillance program. Patients and Methods: In this prospective study, telomere length was measured by quantitative PCR in baseline blood samples in a cohort of 300 patients with Barrett's esophagus followed for a mean of 5.8 years. Leukocyte telomere length hazard ratios (HR) for risk of esophageal adenocarcinoma were calculated using multivariate Cox models. Results: Shorter telomeres were associated with increased esophageal adenocarcinoma risk (age-adjusted HR between top and bottom quartiles of telomere length, 3.45; 95% confidence interval, 1.35-8.78; P = 0.009). This association was still significant when individually or simultaneously adjusted for age, gender, nonsteroidal anti-inflammatory drug (NSAID) use, cigarette smoking, and waist-to-hip ratio (HR, 4.18; 95% confidence interval, 1.60-10.94; P = 0.004). The relationship between telomere length and cancer risk was particularly strong among NSAID nonusers, ever smokers, and patients with low waist-to-hip ratio. Conclusion: Leukocyte telomere length predicts risk of esophageal adenocarcinoma in patients with Barrett's esophagus independently of smoking, obesity, and NSAID use. These results show the ability of leukocyte telomere length to predict the risk of future cancer and suggest that it might also have predictive value in other cancers arising in a setting of chronic inflammation. (Cancer Epidemiol Biomarkers Prev 2007;16(12):2649-55)
Current sequencing methods are error-prone, which precludes the identification of low frequency mutations for early cancer detection. Duplex sequencing is a sequencing technology that decreases errors by scoring mutations present only in both strands of DNA. Our aim was to determine whether duplex sequencing could detect extremely rare cancer cells present in peritoneal fluid from women with highgrade serous ovarian carcinomas (HGSOCs). These aggressive cancers are typically diagnosed at a late stage and are characterized by TP53 mutations and peritoneal dissemination. We used duplex sequencing to analyze TP53 mutations in 17 peritoneal fluid samples from women with HGSOC and 20 from women without cancer. The tumor TP53 mutation was detected in 94% (16/17) of peritoneal fluid samples from women with HGSOC (frequency as low as 1 mutant per 24,736 normal genomes). Additionally, we detected extremely low frequency TP53 mutations (median mutant fraction 1/13,139) in peritoneal fluid from nearly all patients with and without cancer (35/37). These mutations were mostly deleterious, clustered in hotspots, increased with age, and were more abundant in women with cancer than in controls. The total burden of TP53 mutations in peritoneal fluid distinguished cancers from controls with 82% sensitivity (14/17) and 90% specificity (18/20). Age-associated, low frequency TP53 mutations were also found in 100% of peripheral blood samples from 15 women with and without ovarian cancer (none with hematologic disorder). Our results demonstrate the ability of duplex sequencing to detect rare cancer cells and provide evidence of widespread, low frequency, age-associated somatic TP53 mutation in noncancerous tissue.TP53 mutations | ultra-deep sequencing | ovarian cancer | clonal hematopoiesis | premalignant mutations T he detection of tumor-specific mutations in clinically accessible samples has enormous potential to transform cancer diagnostics, monitoring, and screening. However, a major limitation is insufficiently accurate sequencing methods. Conventional nextgeneration sequencing (NGS) technologies have a high false positive error rate, which precludes reliable detection of mutations at frequencies <1/100 (1). "Molecular tagging" of single-stranded DNA decreases the rate of false mutations to less than 1 per 10,000 sequenced nucleotides and has been successfully applied to the detection of mutant cancer DNA in a variety of clinical samples (2-6). However, this false positive error rate limits the specificity of this method in challenging situations in which ultra-deep sequencing is needed to detect extremely low frequency mutant molecules (e.g., <1/10,000), as is the case of ovarian cancer DNA in Pap smears (5). Because true mutations are indistinguishable from artifacts, compromised specificity leads to lower sensitivity and overall low diagnostic accuracy. Duplex sequencing is an NGS technology that employs molecular tagging of both strands of DNA independently. True mutations are defined as mutations that are present at the same...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.