The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
We have developed a new generation of genome-wide DNA methylation BeadChip which allows high-throughput methylation profiling of the human genome. The new high density BeadChip can assay over 480K CpG sites and analyze twelve samples in parallel. The innovative content includes coverage of 99% of RefSeq genes with multiple probes per gene, 96% of CpG islands from the UCSC database, CpG island shores and additional content selected from whole-genome bisulfite sequencing data and input from DNA methylation experts. The well-characterized Infinium® Assay is used for analysis of CpG methylation using bisulfite-converted genomic DNA. We applied this technology to analyze DNA methylation in normal and tumor DNA samples and compared results with whole-genome bisulfite sequencing (WGBS) data obtained for the same samples. Highly comparable DNA methylation profiles were generated by the array and sequencing methods (average R2 of 0.95). The ability to determine genome-wide methylation patterns will rapidly advance methylation research.
Supplementary data are available at Bioinformatics online.
Our results demonstrate a genome-wide scalable array-based methylation readout platform that is both highly reproducible and quantitative. In the near future, this platform should enable the analysis of hundreds of thousands to millions of CpG sites per sample.
SUMMARYInfinium methylation arrays are widely used to robustly measure methylation of DNA in humans. However, such arrays are not available for the vast majority of non-human mammals. Moreover, even if species-specific arrays were available, probe differences between them would confound cross-species comparisons. To address these challenges, we developed the Mammalian Methylation Array, a single custom Infinium array that measures cytosine methylation levels of over 35 thousand CpG sites that are well conserved across species within the mammalian class. By design, the probes on the array tolerate cross-species mutations. To design the array, we developed the Conserved Methylation Array Probe Selector (CMAPS) algorithm, which takes as input a multi-species sequence alignment and probe design constraints. A greedy search algorithm was used to identify oligonucleotide sequences (probes) with high coverage across different mammalian species. We annotate the probes on the array with respect to genes in 159 different species and provide details on the sequence context including CpG island status and chromatin states. Our calibration experiments demonstrate the high fidelity of this array in humans, rats, and mice. The mammalian methylation array has several strengths: it applies to all mammalian species even those that have not yet been sequenced, it provides deep coverage of specific cytosines facilitating the development of highly robust epigenetic biomarkers, and it covers highly conserved CpGs which greatly increases the probability that biological insights gained in one species will readily translate to others. The mammalian methylation array is expected to find many applications in preclinical studies, comparative biology, and epigenetic studies of aging and development.
We have developed a highly sensitive, specific and reproducible method for microRNA (miRNA) expression profiling, using the BeadArray™ technology. This method incorporates an enzyme-assisted specificity step, a solid-phase primer extension to distinguish between members of miRNA families. In addition, a universal PCR is used to amplify all targets prior to array hybridization. Currently, assay probes are designed to simultaneously analyse 735 well-annotated human miRNAs. Using this method, highly reproducible miRNA expression profiles were generated with 100–200 ng total RNA input. Furthermore, very similar expression profiles were obtained with total RNA and enriched small RNA species (R2 ≥ 0.97). The method has a 3.5–4 log (105–109 molecules) dynamic range and is able to detect 1.2- to 1.3-fold-differences between samples. Expression profiles generated by this method are highly comparable to those obtained with RT–PCR (R2 = 0.85–0.90) and direct sequencing (R = 0.87–0.89). This method, in conjunction with the 96-sample array matrix should prove useful for high-throughput expression profiling of miRNAs in large numbers of tissue samples.
Infinium methylation arrays are not available for the vast majority of non-human mammals. Moreover, even if species-specific arrays were available, probe differences between them would confound cross-species comparisons. To address these challenges, we developed the mammalian methylation array, a single custom array that measures up to 36k CpGs per species that are well conserved across many mammalian species. We designed a set of probes that can tolerate specific cross-species mutations. We annotate the array in over 200 species and report CpG island status and chromatin states in select species. Calibration experiments demonstrate the high fidelity in humans, rats, and mice. The mammalian methylation array has several strengths: it applies to all mammalian species even those that have not yet been sequenced, it provides deep coverage of conserved cytosines facilitating the development of epigenetic biomarkers, and it increases the probability that biological insights gained in one species will translate to others.
Comprehensive targeted methylation sequencing of 9223 CpG sites in plasma cfDNA from patients with common advanced cancers detects the presence of cancer and underlying cancer type with high accuracy. Methylation scores in plasma cfDNA correspond with treatment outcomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.