BackgroundThere has been a long-standing need in biomedical research for a method that quantifies the normally mixed composition of leukocytes beyond what is possible by simple histological or flow cytometric assessments. The latter is restricted by the labile nature of protein epitopes, requirements for cell processing, and timely cell analysis. In a diverse array of diseases and following numerous immune-toxic exposures, leukocyte composition will critically inform the underlying immuno-biology to most chronic medical conditions. Emerging research demonstrates that DNA methylation is responsible for cellular differentiation, and when measured in whole peripheral blood, serves to distinguish cancer cases from controls.ResultsHere we present a method, similar to regression calibration, for inferring changes in the distribution of white blood cells between different subpopulations (e.g. cases and controls) using DNA methylation signatures, in combination with a previously obtained external validation set consisting of signatures from purified leukocyte samples. We validate the fundamental idea in a cell mixture reconstruction experiment, then demonstrate our method on DNA methylation data sets from several studies, including data from a Head and Neck Squamous Cell Carcinoma (HNSCC) study and an ovarian cancer study. Our method produces results consistent with prior biological findings, thereby validating the approach.ConclusionsOur method, in combination with an appropriate external validation set, promises new opportunities for large-scale immunological studies of both disease states and noxious exposures.
Pulmonary damage caused by chronic colonization of the cystic fibrosis (CF) lung by microbial communities is the proximal cause of respiratory failure. While there has been an effort to document the microbiome of the CF lung in pediatric and adult patients, little is known regarding the developing microflora in infants. We examined the respiratory and intestinal microbiota development in infants with CF from birth to 21 months. Distinct genera dominated in the gut compared to those in the respiratory tract, yet some bacteria overlapped, demonstrating a core microbiota dominated by Veillonella and Streptococcus. Bacterial diversity increased significantly over time, with evidence of more rapidly acquired diversity in the respiratory tract. There was a high degree of concordance between the bacteria that were increasing or decreasing over time in both compartments; in particular, a significant proportion (14/16 genera) increasing in the gut were also increasing in the respiratory tract. For 7 genera, gut colonization presages their appearance in the respiratory tract. Clustering analysis of respiratory samples indicated profiles of bacteria associated with breast-feeding, and for gut samples, introduction of solid foods even after adjustment for the time at which the sample was collected. Furthermore, changes in diet also result in altered respiratory microflora, suggesting a link between nutrition and development of microbial communities in the respiratory tract. Our findings suggest that nutritional factors and gut colonization patterns are determinants of the microbial development of respiratory tract microbiota in infants with CF and present opportunities for early intervention in CF with altered dietary or probiotic strategies.
Genome-wide methylation arrays are powerful tools for assessing cell composition of complex mixtures. We compare three approaches to select reference libraries for deconvoluting neutrophil, monocyte, B-lymphocyte, natural killer, and CD4+ and CD8+ T-cell fractions based on blood-derived DNA methylation signatures assayed using the Illumina HumanMethylationEPIC array. The IDOL algorithm identifies a library of 450 CpGs, resulting in an average R2 = 99.2 across cell types when applied to EPIC methylation data collected on artificial mixtures constructed from the above cell types. Of the 450 CpGs, 69% are unique to EPIC. This library has the potential to reduce unintended technical differences across array platforms.Electronic supplementary materialThe online version of this article (10.1186/s13059-018-1448-7) contains supplementary material, which is available to authorized users.
BackgroundConfounding due to cellular heterogeneity represents one of the foremost challenges currently facing Epigenome-Wide Association Studies (EWAS). Statistical methods leveraging the tissue-specificity of DNA methylation for deconvoluting the cellular mixture of heterogenous biospecimens offer a promising solution, however the performance of such methods depends entirely on the library of methylation markers being used for deconvolution. Here, we introduce a novel algorithm for Identifying Optimal Libraries (IDOL) that dynamically scans a candidate set of cell-specific methylation markers to find libraries that optimize the accuracy of cell fraction estimates obtained from cell mixture deconvolution.ResultsApplication of IDOL to training set consisting of samples with both whole-blood DNA methylation data (Illumina HumanMethylation450 BeadArray (HM450)) and flow cytometry measurements of cell composition revealed an optimized library comprised of 300 CpG sites. When compared existing libraries, the library identified by IDOL demonstrated significantly better overall discrimination of the entire immune cell landscape (p = 0.038), and resulted in improved discrimination of 14 out of the 15 pairs of leukocyte subtypes. Estimates of cell composition across the samples in the training set using the IDOL library were highly correlated with their respective flow cytometry measurements, with all cell-specific R2>0.99 and root mean square errors (RMSEs) ranging from [0.97 % to 1.33 %] across leukocyte subtypes. Independent validation of the optimized IDOL library using two additional HM450 data sets showed similarly strong prediction performance, with all cell-specific R2>0.90 and RMSE<4.00 %. In simulation studies, adjustments for cell composition using the IDOL library resulted in uniformly lower false positive rates compared to competing libraries, while also demonstrating an improved capacity to explain epigenome-wide variation in DNA methylation within two large publicly available HM450 data sets.ConclusionsDespite consisting of half as many CpGs compared to existing libraries for whole blood mixture deconvolution, the optimized IDOL library identified herein resulted in outstanding prediction performance across all considered data sets and demonstrated potential to improve the operating characteristics of EWAS involving adjustments for cell distribution. In addition to providing the EWAS community with an optimized library for whole blood mixture deconvolution, our work establishes a systematic and generalizable framework for the assembly of libraries that improve the accuracy of cell mixture deconvolution.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-0943-7) contains supplementary material, which is available to authorized users.
The homogeneity of methylation classes for gliomas with IDH mutation, despite their histological diversity, suggests that IDH mutation is associated with a distinct DNA methylation phenotype and an altered metabolic profile in glioma.
Background: Fetal programming describes the theory linking environmental conditions during embryonic and fetal development with risk of diseases later in life. Environmental insults in utero may lead to changes in epigenetic mechanisms potentially affecting fetal development.Objectives: We examined associations between in utero exposures, infant growth, and methylation of repetitive elements and gene-associated DNA in human term placenta tissue samples.Methods: Placental tissues and associated demographic and clinical data were obtained from subjects delivering at Women and Infants Hospital in Providence, Rhode Island (USA). Methylation levels of long interspersed nuclear element-1 (LINE-1) and the Alu element AluYb8 were determined in 380 placental samples from term deliveries using bisulfite pyrosequencing. Genomewide DNA methylation profiles were obtained in a subset of 184 samples using the Illumina Infinium HumanMethylation27 BeadArray. Multiple linear regression, model-based clustering methods, and gene set enrichment analysis examined the association between birth weight percentile, demographic variables, and repetitive element methylation and gene-associated CpG locus methylation.Results: LINE-1 and AluYb8 methylation levels were found to be significantly positively associated with birth weight percentile (p = 0.01 and p < 0.0001, respectively) and were found to differ significantly among infants exposed to tobacco smoke and alcohol. Increased placental AluYb8 methylation was positively associated with average methylation among CpG loci found in polycomb group target genes; developmentally related transcription factor binding sites were overrepresented for differentially methylated loci associated with both elements.Conclusions: Our results suggest that repetitive element methylation markers, most notably AluYb8 methylation, may be susceptible to epigenetic alterations resulting from the intrauterine environment and play a critical role in mediating placenta function, and may ultimately inform on the developmental basis of health and disease.
Background: There is increasing epidemiologic evidence that arsenic exposure in utero, even at low levels found throughout much of the world, is associated with adverse reproductive outcomes and may contribute to long-term health effects. Animal models, in vitro studies, and human cancer data suggest that arsenic may induce epigenetic alterations, specifically by altering patterns of DNA methylation.Objectives: In this study we aimed to identify differences in DNA methylation in cord blood samples of infants with in utero, low-level arsenic exposure.Methods: DNA methylation of cord-blood derived DNA from 134 infants involved in a prospective birth cohort in New Hampshire was profiled using the Illumina Infinium Methylation450K array. In utero arsenic exposure was estimated using maternal urine samples collected at 24–28 weeks gestation. We used a novel cell mixture deconvolution methodology for examining the association between inferred white blood cell mixtures in infant cord blood and in utero arsenic exposure; we also examined the association between methylation at individual CpG loci and arsenic exposure levels.Results: We found an association between urinary inorganic arsenic concentration and the estimated proportion of CD8+ T lymphocytes (1.18; 95% CI: 0.12, 2.23). Among the top 100 CpG loci with the lowest p-values based on their association with urinary arsenic levels, there was a statistically significant enrichment of these loci in CpG islands (p = 0.009). Of those in CpG islands (n = 44), most (75%) exhibited higher methylation levels in the highest exposed group compared with the lowest exposed group. Also, several CpG loci exhibited a linear dose-dependent relationship between methylation and arsenic exposure.Conclusions: Our findings suggest that in utero exposure to low levels of arsenic may affect the epigenome. Long-term follow-up is planned to determine whether the observed changes are associated with health outcomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.