Motivation: In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed to maximize the value of this promising technology.Results: We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes.Availability and implementation: Source code and sample data are freely available under GNU license (GPLv3) at http://contra-cnv.sourceforge.net/Contact: Jason.Li@petermac.orgSupplementary information: Supplementary data are available at Bioinformatics online.
Low grade serous ovarian tumours are a rare and under-characterised histological subtype of epithelial ovarian tumours, with little known of the molecular drivers and facilitators of tumorigenesis beyond classic oncogenic RAS/RAF mutations. With a move towards targeted therapies due to the chemoresistant nature of this subtype, it is pertinent to more fully characterise the genetic events driving this tumour type, some of which may influence response to therapy and/or development of drug resistance. We performed genome-wide high-resolution genomic copy number analysis (Affymetrix SNP6.0) and mutation hotspot screening (KRAS, BRAF, NRAS, HRAS, ERBB2 and TP53) to compare a large cohort of ovarian serous borderline tumours (SBTs, n = 57) with low grade serous carcinomas (LGSCs, n = 19). Whole exome sequencing was performed for 13 SBTs, nine LGSCs and one mixed low/high grade carcinoma. Copy number aberrations were detected in 61% (35/57) of SBTs, compared to 100% (19/19) of LGSCs. Oncogenic RAS/RAF/ERBB2 mutations were detected in 82.5% (47/57) of SBTs compared to 63% (12/19) of LGSCs, with NRAS mutations detected only in LGSC. Some copy number aberrations appeared to be enriched in LGSC, most significantly loss of 9p and homozygous deletions of the CDKN2A/2B locus. Exome sequencing identified BRAF, KRAS, NRAS, USP9X and EIF1AX as the most frequently mutated genes. We have identified markers of progression from borderline to LGSC and novel drivers of LGSC. USP9X and EIF1AX have both been linked to regulation of mTOR, suggesting that mTOR inhibitors may be a key companion treatment for targeted therapy trials of MEK and RAF inhibitors.
Despite intensive efforts using linkage and candidate gene approaches, the genetic etiology for the majority of families with a multi-generational breast cancer predisposition is unknown. In this study, we used whole-exome sequencing of thirty-three individuals from 15 breast cancer families to identify potential predisposing genes. Our analysis identified families with heterozygous, deleterious mutations in the DNA repair genes FANCC and BLM, which are responsible for the autosomal recessive disorders Fanconi Anemia and Bloom syndrome. In total, screening of all exons in these genes in 438 breast cancer families identified three with truncating mutations in FANCC and two with truncating mutations in BLM. Additional screening of FANCC mutation hotspot exons identified one pathogenic mutation among an additional 957 breast cancer families. Importantly, none of the deleterious mutations were identified among 464 healthy controls and are not reported in the 1,000 Genomes data. Given the rarity of Fanconi Anemia and Bloom syndrome disorders among Caucasian populations, the finding of multiple deleterious mutations in these critical DNA repair genes among high-risk breast cancer families is intriguing and suggestive of a predisposing role. Our data demonstrate the utility of intra-family exome-sequencing approaches to uncover cancer predisposition genes, but highlight the major challenge of definitively validating candidates where the incidence of sporadic disease is high, germline mutations are not fully penetrant, and individual predisposition genes may only account for a tiny proportion of breast cancer families.
BackgroundMucinous ovarian tumors are an unusual group of rare neoplasms with an apparently clear progression from benign to borderline to carcinoma, yet with a controversial cell of origin in the ovarian surface epithelium. They are thought to be molecularly distinct from other ovarian tumors but there have been no exome-level sequencing studies performed to date.MethodsTo understand the genetic etiology of mucinous ovarian tumors and assess the presence of novel therapeutic targets or pathways, we undertook exome sequencing of 24 tumors encompassing benign (5), borderline (8) and carcinoma (11) histologies and also assessed a validation cohort of 58 tumors for specific gene regions including exons 4–9 of TP53.ResultsThe predominant mutational signature was of C>T transitions in a NpCpG context, indicative of deamination of methyl-cytosines. As well as mutations in known drivers (KRAS, BRAF and CDKN2A), we identified a high percentage of carcinomas with TP53 mutations (52 %), and recurrent mutations in RNF43, ELF3, GNAS, ERBB3 and KLF5.ConclusionsThe diversity of mutational targets suggests multiple routes to tumorigenesis in this heterogeneous group of tumors that is generally distinct from other ovarian subtypes.Electronic supplementary materialThe online version of this article (doi:10.1186/s13073-015-0210-y) contains supplementary material, which is available to authorized users.
Mucinous carcinomas represent a distinct morphological subtype which can arise from several organ sites, including the ovary, and their genetic characteristics are largely under-described. Exome sequencing of 12 primary mucinous ovarian tumours identified RNF43 as the most frequently somatically mutated novel gene, secondary to KRAS and mutated at a frequency equal to that of TP53 and BRAF. Further screening of RNF43 in a larger cohort of ovarian tumours identified additional mutations, with a total frequency of 2/22 (9%) in mucinous ovarian borderline tumours and 6/29 (21%) in mucinous ovarian carcinomas. Seven mutations were predicted to truncate the protein and one missense mutation was predicted to be deleterious by in silico analysis. Six tumours had allelic imbalance at the RNF43 locus, with loss of the wild-type allele. The mutation spectrum strongly suggests that RNF43 is an important tumour suppressor gene in mucinous ovarian tumours, similar to its reported role in mucinous pancreatic precancerous cysts.
BackgroundUsing whole exome sequencing to predict aberrations in tumours is a cost effective alternative to whole genome sequencing, however is predominantly used for variant detection and infrequently utilised for detection of somatic copy number variation.ResultsWe propose a new method to infer copy number and genotypes using whole exome data from paired tumour/normal samples. Our algorithm uses two Hidden Markov Models to predict copy number and genotypes and computationally resolves polyploidy/aneuploidy, normal cell contamination and signal baseline shift. Our method makes explicit detection on chromosome arm level events, which are commonly found in tumour samples. The methods are combined into a package named ADTEx (Aberration Detection in Tumour Exome). We applied our algorithm to a cohort of 17 in-house generated and 18 TCGA paired ovarian cancer/normal exomes and evaluated the performance by comparing against the copy number variations and genotypes predicted using Affymetrix SNP 6.0 data of the same samples. Further, we carried out a comparison study to show that ADTEx outperformed its competitors in terms of precision and F-measure.ConclusionsOur proposed method, ADTEx, uses both depth of coverage ratios and B allele frequencies calculated from whole exome sequencing data, to predict copy number variations along with their genotypes. ADTEx is implemented as a user friendly software package using Python and R statistical language. Source code and sample data are freely available under GNU license (GPLv3) at http://adtex.sourceforge.net/.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-732) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.