Summary: More and more cancer studies use next-generation sequencing (NGS) data to detect various types of genomic variation. However, even when researchers have such data at hand, single-nucleotide polymorphism arrays have been considered necessary to assess copy number alterations and especially loss of heterozygosity (LOH). Here, we present the tool Control-FREEC that enables automatic calculation of copy number and allelic content profiles from NGS data, and consequently predicts regions of genomic alteration such as gains, losses and LOH. Taking as input aligned reads, Control-FREEC constructs copy number and B-allele frequency profiles. The profiles are then normalized, segmented and analyzed in order to assign genotype status (copy number and allelic content) to each genomic region. When a matched normal sample is provided, Control-FREEC discriminates somatic from germline events. Control-FREEC is able to analyze overdiploid tumor samples and samples contaminated by normal cells. Low mappability regions can be excluded from the analysis using provided mappability tracks.Availability: C++ source code is available at: http://bioinfo.curie.fr/projects/freec/Contact: freec@curie.frSupplementary information: Supplementary data are available at Bioinformatics online.
Neuroblastoma is a tumor of the peripheral sympathetic nervous system, derived from multipotent neural crest cells (NCCs). To define core regulatory circuitries (CRCs) controlling the gene expression program of neuroblastoma, we established and analyzed the neuroblastoma super-enhancer landscape. We discovered three types of identity in neuroblastoma cell lines: a sympathetic noradrenergic identity, defined by a CRC module including the PHOX2B, HAND2 and GATA3 transcription factors (TFs); an NCC-like identity, driven by a CRC module containing AP-1 TFs; and a mixed type, further deconvoluted at the single-cell level. Treatment of the mixed type with chemotherapeutic agents resulted in enrichment of NCC-like cells. The noradrenergic module was validated by ChIP-seq. Functional studies demonstrated dependency of neuroblastoma with noradrenergic identity on PHOX2B, evocative of lineage addiction. Most neuroblastoma primary tumors express TFs from the noradrenergic and NCC-like modules. Our data demonstrate a previously unknown aspect of tumor heterogeneity relevant for neuroblastoma treatment strategies.
Summary: We present a tool for control-free copy number alteration (CNA) detection using deep-sequencing data, particularly useful for cancer studies. The tool deals with two frequent problems in the analysis of cancer deep-sequencing data: absence of control sample and possible polyploidy of cancer cells. FREEC (control-FREE Copy number caller) automatically normalizes and segments copy number profiles (CNPs) and calls CNAs. If ploidy is known, FREEC assigns absolute copy number to each predicted CNA. To normalize raw CNPs, the user can provide a control dataset if available; otherwise GC content is used. We demonstrate that for Illumina single-end, mate-pair or paired-end sequencing, GC-contentr normalization provides smooth profiles that can be further segmented and analyzed in order to predict CNAs.Availability: Source code and sample data are available at http://bioinfo-out.curie.fr/projects/freec/.Contact: freec@curie.frSupplementary information: Supplementary data are available at Bioinformatics online.
During X chromosome inactivation (XCI), the Polycomb Repressive Complex 2 (PRC2) is thought to participate in the early maintenance of the inactive state. Although Xist RNA is essential for the recruitment of PRC2 to the X chromosome, the precise mechanism remains unclear. Here, we demonstrate that the PRC2 cofactor Jarid2 is an important mediator of Xist-induced PRC2 targeting. The region containing the conserved B and F repeats of Xist is critical for Jarid2 recruitment via its unique N-terminal domain. Xist-induced Jarid2 recruitment occurs chromosome-wide independently of a functional PRC2 complex, unlike at other parts of the genome, such as CG-rich regions, where Jarid2 and PRC2 binding are interdependent. Conversely, we show that Jarid2 loss prevents efficient PRC2 and H3K27me3 enrichment to Xist-coated chromatin. Jarid2 thus represents an important intermediate between PRC2 and Xist RNA for the initial targeting of the PRC2 complex to the X chromosome during onset of XCI.
The fusion between EWS and ETS family members is a key oncogenic event in Ewing tumors and important EWS-FLI1 target genes have been identified. However, until now, the search for EWS-FLI1 targets has been limited to promoter regions and no genome-wide comprehensive analysis of in vivo EWS-FLI1 binding sites has been undertaken. Using a ChIP-Seq approach to investigate EWS-FLI1-bound DNA sequences in two Ewing cell lines, we show that this chimeric transcription factor preferentially binds two types of sequences including consensus ETS motifs and microsatellite sequences. Most bound sites are found outside promoter regions. Microsatellites containing more than 9 GGAA repeats are very significantly enriched in EWS-FLI1 immunoprecipitates. Moreover, in reporter gene experiments, the transcription activation is highly dependent upon the number of repeats that are included in the construct. Importantly, in vivo EWS-FLI1-bound microsatellites are significantly associated with EWS-FLI1-driven gene activation. Put together, these results point out the likely contribution of microsatellite elements to long-distance transcription regulation and to oncogenesis.
Deciphering the ways in which somatic mutations and germline susceptibility variants cooperate to promote cancer is challenging. Ewing sarcoma is characterized by fusions between EWSR1 and members of the ETS gene family, usually EWSR1-FLI1, leading to the generation of oncogenic transcription factors that bind DNA at GGAA motifs1–3. A recent genome-wide association study4 identified susceptibility variants near EGR2. Here we found that EGR2 knockdown inhibited proliferation, clonogenicity and spheroidal growth in vitro and induced regression of Ewing sarcoma xenografts. Targeted germline deep sequencing of the EGR2 locus in affected subjects and controls revealed 291 Ewing-associated SNPs. At rs79965208, the A risk allele connected adjacent GGAA repeats by converting an interspaced GGAT motif into a GGAA motif, thereby increasing the number of consecutive GGAA motifs and thus the EWSR1-FLI1–dependent enhancer activity of this sequence, with epigenetic characteristics of an active regulatory element. EWSR1-FLI1 preferentially bound to the A risk allele, which increased global and allele-specific EGR2 expression. Collectively, our findings establish cooperation between a dominant oncogene and a susceptibility variant that regulates a major driver of Ewing sarcomagenesis.
Summary: We present SVDetect, a program designed to identify genomic structural variations from paired-end and mate-pair next-generation sequencing data produced by the Illumina GA and ABI SOLiD platforms. Applying both sliding-window and clustering strategies, we use anomalously mapped read pairs provided by current short read aligners to localize genomic rearrangements and classify them according to their type, e.g. large insertions–deletions, inversions, duplications and balanced or unbalanced inter-chromosomal translocations. SVDetect outputs predicted structural variants in various file formats for appropriate graphical visualization.Availability: Source code and sample data are available at http://svdetect.sourceforge.net/Contact: svdetect@curie.frSupplementary information: Supplementary data are available at Bioinformatics online.
Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.