Recent evidence shows that the disruption of constitutive insulated neighbourhoods might lead to oncogene dysregulation. We present here a systematic pan-cancer characterisation of the associations between constitutive boundaries and genome alterations in cancer. Specifically, we investigate the enrichment of somatic mutation, abnormal methylation, and copy number alteration events in the proximity of CTCF bindings overlapping with topological boundaries (junctions) in 26 cancer types. Focusing on CTCF motifs that are both inboundary (overlapping with junctions) and active (overlapping with peaks of CTCF expression), we find a significant enrichment of somatic mutations in several cancer types. Furthermore, mutated junctions are significantly conserved across cancer types, and we also observe a positive selection of transversions rather than transitions in many cancer types. We also analyzed the mutational signature found on the different classes of CTCF motifs, finding some signatures (such as SBS26) to have a higher weight within in-boundary than off-bounday motifs. Regarding methylation, we find a significant number of over-methylated active in-boundary CTCF motifs in several cancer types; similarly to somatic-mutated junctions, they also have a significant conservation across cancer types. Finally, in several cancer types we observe that copy number alterations tend to overlap with active junctions more often than in matched normal samples. While several articles have recently reported a mutational enrichment at CTCF binding sites for specific cancer types, our analysis is pancancer and investigates abnormal methylation and copy number alterations in addition to somatic mutations. Our method is fully replicable and suggests several follow-up tumourspecific analyses.
OPEN ACCESSCitation: Pinoli P, Stamoulakatou E, Nguyen A-P, Rodríguez Martínez M, Ceri S (2020) Pan-cancer analysis of somatic mutations and epigenetic alterations in insulated neighbourhood boundaries. PLoS ONE 15(1): e0227180. https://doi.org/ 10.neighboring domains [1,2]. TAD boundaries, revealed by high-throughput chromatin conformation capture (HiC) techniques [3], are enriched for the insulator binding protein CTCF, which plays a central role in multiple complex genomic processes, including transcription [4,5], imprinting [6,7], long-range chromatin interactions and subnuclear localization [8,9].TADs are hierarchical in nature [10,11] and usually contain smaller sub-TADs and individual loops that function as insulated neighborhoods [12,13]. While TADs are largely invariant features of genome organization and are mostly conserved across cell types and even across species, sub-TADs, loops, and insulated neighborhoods appear to differ, at least partially, between different cell lineages [14].Loss of CTCF-mediated insulation has been identified as a potential reason for domain disruption and spreading [15] and has been connected with multiple malignancies, such as cancer [16][17][18], intellectual disability [19,20] and developmental disorders [...