Abstract:We have developed an XmaI-RRBS method for rapid and affordable genome-wide DNA methylation analysis, with library preparation taking only 4 days and sequencing possible within 4 h. We have also addressed several challenges in order to further improve the RRBS technology. XmaI-RRBS may be performed on degraded DNA samples and is compatible with the bench-top next-generation sequencing machines.
“…Previous studies have tested the potential of other restriction enzymes and enzyme combinations to expand the range of CpG sites that can be targeted in a genome ( 8 – 11 , 28 , 30 , 56 , 57 ). However, to our knowledge, there is currently no computational method that systematically explores the capacity of all commercially-available restriction enzymes to generate ‘personalised’ reduced-representations of the genome whilst minimising the experimental cost ( Supplementary Figure S2E ).…”
Section: Resultsmentioning
confidence: 99%
“…XmaI-RRBS data generated on the Ion Torrent platform ( 30 ) and MspI&Taq α I-RRBS data generated on the Illumina HiSeq platform ( 31 ) were quality-trimmed using Trim Galore ( www.bioinformatics.babraham.ac.uk/projects/trim_galore/ ) and had base pairs removed from the 3′ end to avoid including filled-in nucleotides with artificial methylation states (the filled-in XmaI, MspI and Taq α I cut sites include the nucleotide sequence CCGG, CG and CG respectively). The data was then mapped to the human genome (for XmaI data, parameters: –non_directional) or the mouse genome (for MspI&Taq α I data, parameters: –directional) using Bismark (0.18.0) ( 55 ).…”
Section: Methodsmentioning
confidence: 99%
“…We assessed the performance of cuRRBS predictions in two independent experimental datasets ( 30 , 31 ) (see ‘Experimental validation of cuRRBS’ in Results and discussion). We ran cuRRBS fixing the theoretical size ranges tested to the ones reported in the publications ( 30 , 31 ) and we used as our sites of interest the CpGs that overlapped with CpG islands (CGI-CpGs) in the human ( 30 ) and the mouse genomes ( 31 ) respectively. From the cuRRBS output files, we recovered the IDs of the sites that should be theoretically sequenced.…”
Section: Methodsmentioning
confidence: 99%
“…From the cuRRBS output files, we recovered the IDs of the sites that should be theoretically sequenced. Moreover, using the experimental RRBS data ( 30 , 31 ), we could obtain the IDs of the sites that were actually sequenced (filtered by a given depth of coverage threshold). Afterwards, we calculated the following variables for each one of the datasets: True positives ( TP ): number of CGI-CpGs that cuRRBS predicted to be sequenced and were indeed found in the RRBS data.…”
Section: Methodsmentioning
confidence: 99%
“…Here, we have tested the enrichment ability of cuRRBS in several biological systems, with sites in both CpG and CHG contexts and multiple species, to showcase the generalisability and utility of the software ( 35 – 41 ). In addition, we take advantage of two recently published independent RRBS datasets to demonstrate the accuracy of the software predictions in both single and double enzyme experimental settings ( 30 , 31 ).…”
DNA methylation is an important epigenetic modification in many species that is critical for development, and implicated in ageing and many complex diseases, such as cancer. Many cost-effective genome-wide analyses of DNA modifications rely on restriction enzymes capable of digesting genomic DNA at defined sequence motifs. There are hundreds of restriction enzyme families but few are used to date, because no tool is available for the systematic evaluation of restriction enzyme combinations that can enrich for certain sites of interest in a genome. Herein, we present customised Reduced Representation Bisulfite Sequencing (cuRRBS), a novel and easy-to-use computational method that solves this problem. By computing the optimal enzymatic digestions and size selection steps required, cuRRBS generalises the traditional MspI-based Reduced Representation Bisulfite Sequencing (RRBS) protocol to all restriction enzyme combinations. In addition, cuRRBS estimates the fold-reduction in sequencing costs and provides a robustness value for the personalised RRBS protocol, allowing users to tailor the protocol to their experimental needs. Moreover, we show in silico that cuRRBS-defined restriction enzymes consistently out-perform MspI digestion in many biological systems, considering both CpG and CHG contexts. Finally, we have validated the accuracy of cuRRBS predictions for single and double enzyme digestions using two independent experimental datasets.
“…Previous studies have tested the potential of other restriction enzymes and enzyme combinations to expand the range of CpG sites that can be targeted in a genome ( 8 – 11 , 28 , 30 , 56 , 57 ). However, to our knowledge, there is currently no computational method that systematically explores the capacity of all commercially-available restriction enzymes to generate ‘personalised’ reduced-representations of the genome whilst minimising the experimental cost ( Supplementary Figure S2E ).…”
Section: Resultsmentioning
confidence: 99%
“…XmaI-RRBS data generated on the Ion Torrent platform ( 30 ) and MspI&Taq α I-RRBS data generated on the Illumina HiSeq platform ( 31 ) were quality-trimmed using Trim Galore ( www.bioinformatics.babraham.ac.uk/projects/trim_galore/ ) and had base pairs removed from the 3′ end to avoid including filled-in nucleotides with artificial methylation states (the filled-in XmaI, MspI and Taq α I cut sites include the nucleotide sequence CCGG, CG and CG respectively). The data was then mapped to the human genome (for XmaI data, parameters: –non_directional) or the mouse genome (for MspI&Taq α I data, parameters: –directional) using Bismark (0.18.0) ( 55 ).…”
Section: Methodsmentioning
confidence: 99%
“…We assessed the performance of cuRRBS predictions in two independent experimental datasets ( 30 , 31 ) (see ‘Experimental validation of cuRRBS’ in Results and discussion). We ran cuRRBS fixing the theoretical size ranges tested to the ones reported in the publications ( 30 , 31 ) and we used as our sites of interest the CpGs that overlapped with CpG islands (CGI-CpGs) in the human ( 30 ) and the mouse genomes ( 31 ) respectively. From the cuRRBS output files, we recovered the IDs of the sites that should be theoretically sequenced.…”
Section: Methodsmentioning
confidence: 99%
“…From the cuRRBS output files, we recovered the IDs of the sites that should be theoretically sequenced. Moreover, using the experimental RRBS data ( 30 , 31 ), we could obtain the IDs of the sites that were actually sequenced (filtered by a given depth of coverage threshold). Afterwards, we calculated the following variables for each one of the datasets: True positives ( TP ): number of CGI-CpGs that cuRRBS predicted to be sequenced and were indeed found in the RRBS data.…”
Section: Methodsmentioning
confidence: 99%
“…Here, we have tested the enrichment ability of cuRRBS in several biological systems, with sites in both CpG and CHG contexts and multiple species, to showcase the generalisability and utility of the software ( 35 – 41 ). In addition, we take advantage of two recently published independent RRBS datasets to demonstrate the accuracy of the software predictions in both single and double enzyme experimental settings ( 30 , 31 ).…”
DNA methylation is an important epigenetic modification in many species that is critical for development, and implicated in ageing and many complex diseases, such as cancer. Many cost-effective genome-wide analyses of DNA modifications rely on restriction enzymes capable of digesting genomic DNA at defined sequence motifs. There are hundreds of restriction enzyme families but few are used to date, because no tool is available for the systematic evaluation of restriction enzyme combinations that can enrich for certain sites of interest in a genome. Herein, we present customised Reduced Representation Bisulfite Sequencing (cuRRBS), a novel and easy-to-use computational method that solves this problem. By computing the optimal enzymatic digestions and size selection steps required, cuRRBS generalises the traditional MspI-based Reduced Representation Bisulfite Sequencing (RRBS) protocol to all restriction enzyme combinations. In addition, cuRRBS estimates the fold-reduction in sequencing costs and provides a robustness value for the personalised RRBS protocol, allowing users to tailor the protocol to their experimental needs. Moreover, we show in silico that cuRRBS-defined restriction enzymes consistently out-perform MspI digestion in many biological systems, considering both CpG and CHG contexts. Finally, we have validated the accuracy of cuRRBS predictions for single and double enzyme digestions using two independent experimental datasets.
Despite the advantages of neoadjuvant chemotherapy (nAct), associated toxicity is a serious complication that renders monitoring of the patients' response to nAct highly important. thus, prediction of tumor response to treatment is imperative to avoid exposure of potential non-responders to deleterious complications. We have performed genome-wide analysis of DnA methylation by XmaI-RRBS and selected CpG dinucleotides differential methylation of which discriminates luminal B breast cancer samples with different sensitivity to NACT. With this data, we have developed multiplex methylation sensitive restriction enzyme pcR (MSRe-pcR) protocol for determining the methylation status of 10 genes (SLC9A3, C1QL2, DPYS, IRF4, ADCY8, KCNQ2, TERT, SYNDIG1, SKOR2 and GRIK1) that distinguish BC samples with different NACT response. Analysis of these 10 markers by MSRE-PCR in biopsy samples allowed us to reveal three top informative combinations of markers, (1) IRF4 and C1QL2; (2) IRF4, C1QL2, and ADCY8; (3) IRF4, C1QL2, and DPYS, with the areas under Roc curves (AUCs) of 0.75, 0.78 and 0.74, respectively. A classifier based on IRF4 and C1QL2 better meets the diagnostic panel simplicity requirements, as it consists of only two markers. Diagnostic accuracy of the panel of these two markers is 0.75, with the sensitivity of 75% and specificity of 75%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.