Comparison of Clustering Methods for Investigation of Genome-Wide Methylation Array Data

Clifford, Harry; Wessely, Frank; Pendurthi, Satish; Emes, Richard D.

doi:10.3389/fgene.2011.00088

Cited by 34 publications

(29 citation statements)

References 27 publications

(33 reference statements)

“…K-means clustering of the RNA data, using the Silhouette measure (Clifford et al, 2011) to identify the best k (Fig. S1B), revealed two distinct cell populations that were roughly equal in size (Fig.…”

Section: Results (mentioning)

confidence: 99%

“…K-means clustering was done in MATLAB using the squared Euclidean distance of normalized data (z-scores). To determine the optimal k, we applied every value from 2 to 20, assessed the average Silhouette value (Clifford et al, 2011) for each clustering result (Figure S1B), and selected k = 2, which gave the largest mean Silhouette value. Differentially expressed genes were identified using a two-sided Wilcoxon-Mann-Whitney rank sum test implemented in the “coin” package in R. Differences between populations were determined by subtracting median Ct values (equivalent to log2 expression levels).…”

Section: Methods (mentioning)

confidence: 99%
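
The selection of k described in the statement above (z-scored data, k-means with squared Euclidean distance, largest mean silhouette) can be sketched as follows. This is a minimal standard-library illustration, not the citing authors' MATLAB code: the function names, the toy data, the number of restarts, and the use of squared Euclidean distance inside the silhouette are all assumptions made for the example.

```python
import random
import statistics

def zscore_columns(rows):
    """Scale every column to mean 0 and standard deviation 1."""
    cols = list(zip(*rows))
    means = [statistics.fmean(c) for c in cols]
    sds = [statistics.pstdev(c) or 1.0 for c in cols]  # constant columns stay at 0
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in rows]

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, rng, n_init=5, n_iter=100):
    """Lloyd's algorithm with random restarts; keeps the lowest-inertia labelling."""
    best_labels, best_inertia = None, float("inf")
    for _ in range(n_init):
        centers = [list(p) for p in rng.sample(points, k)]
        labels = None
        for _ in range(n_iter):
            new = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
            if new == labels:
                break  # assignments stopped changing
            labels = new
            for c in range(k):
                members = [p for p, lab in zip(points, labels) if lab == c]
                if members:
                    centers[c] = [statistics.fmean(col) for col in zip(*members)]
        inertia = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
        if inertia < best_inertia:
            best_labels, best_inertia = labels, inertia
    return best_labels

def mean_silhouette(points, labels, dist=sq_dist):
    """Average of (b - a) / max(a, b) over all points; singletons score 0."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        return -1.0  # undefined for a single cluster; treated as worst case here
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue
        a = sum(dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(dist(points[i], points[j]) for j in idx) / len(idx)
            for other, idx in clusters.items() if other != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)

def choose_k(rows, k_values, seed=0):
    """Return the k with the largest mean silhouette, plus all scores."""
    rng = random.Random(seed)
    points = zscore_columns(rows)
    scores = {k: mean_silhouette(points, kmeans(points, k, rng)) for k in k_values}
    return max(scores, key=scores.get), scores

# Hypothetical toy data: two well-separated groups of five samples each.
rows = [[0, 0], [0, 1], [1, 0], [1, 1], [0.5, 0.5],
        [10, 10], [10, 11], [11, 10], [11, 11], [10.5, 10.5]]
best_k, scores = choose_k(rows, range(2, 6))
print(best_k, scores)
```

The cited study scanned k from 2 to 20; the toy data here has only ten samples, so the example scans a shorter range.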

Single-Cell Transcript Profiles Reveal Multilineage Priming in Early Progenitors Derived from Lgr5+ Intestinal Stem Cells

Kim, Saadatpour, Guo et al. 2016

Cell Reports

SUMMARY Lgr5+ intestinal stem cells (ISC) drive epithelial self-renewal, and their immediate progeny – intestinal bipotential progenitors – produce absorptive and secretory lineages via lateral inhibition. To define features of early transit from the ISC compartment, we used a microfluidics approach to measure selected stem- and lineage-specific transcripts in single Lgr5+ cells. We identified two distinct cell populations, one that expresses known ISC markers and a second, abundant population that simultaneously expresses markers of stem and mature absorptive and secretory cells. Single-molecule mRNA in situ hybridization and immunofluorescence verified expression of lineage-restricted genes in a subset of Lgr5+ cells in vivo. Transcriptional network analysis revealed that one group of Lgr5+ cells arises from the other and displays characteristics expected of bipotential progenitors, including activation of Notch ligand and cell-cycle inhibitor genes. These findings define the earliest steps in ISC differentiation and reveal multilineage gene priming as a fundamental property of the process.

“…Multiscale bootstrap values for each node determined by resampling of 1,000 replicates are shown. Canberra distance and Ward Linkage was determined to provide the greatest cluster separation based on mean silhouette width using the ClusterRank software (Clifford et al., 2011).…”

Section: Results (mentioning)

confidence: 99%
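
The idea in the statement above, ranking distance and linkage combinations by mean silhouette width, can be sketched as below. This is not the ClusterRank implementation: the function names, the toy beta values, and the candidate distances and linkages are assumptions for illustration. Ward linkage is applied to Canberra distances through the Lance-Williams update, as the quoted combination implies, although Ward's criterion is usually motivated for Euclidean distances.

```python
import math

def canberra(a, b):
    """Canberra distance; terms where both values are zero contribute 0."""
    return sum(abs(x - y) / (abs(x) + abs(y)) for x, y in zip(a, b) if x or y)

def agglomerate(dmat, n_clusters, linkage):
    """Bottom-up clustering on a distance matrix using Lance-Williams updates."""
    n = len(dmat)
    d = {(i, j): dmat[i][j] for i in range(n) for j in range(i + 1, n)}
    size = {i: 1 for i in range(n)}
    members = {i: [i] for i in range(n)}
    next_id = n
    while len(members) > n_clusters:
        (i, j), dij = min(d.items(), key=lambda kv: kv[1])  # closest pair
        for k in members:
            if k in (i, j):
                continue
            dik = d.pop((min(i, k), max(i, k)))
            djk = d.pop((min(j, k), max(j, k)))
            if linkage == "average":
                new = (size[i] * dik + size[j] * djk) / (size[i] + size[j])
            elif linkage == "complete":
                new = max(dik, djk)
            elif linkage == "ward":
                total = size[i] + size[j] + size[k]
                new = math.sqrt(((size[i] + size[k]) * dik ** 2
                                 + (size[j] + size[k]) * djk ** 2
                                 - size[k] * dij ** 2) / total)
            else:
                raise ValueError(f"unknown linkage: {linkage}")
            d[(k, next_id)] = new  # existing ids are always smaller than next_id
        del d[(i, j)]
        members[next_id] = members.pop(i) + members.pop(j)
        size[next_id] = size.pop(i) + size.pop(j)
        next_id += 1
    labels = [0] * n
    for lab, idx in enumerate(members.values()):
        for m in idx:
            labels[m] = lab
    return labels

def mean_silhouette(dmat, labels):
    """Mean silhouette width from a precomputed distance matrix."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # singletons score 0
        a = sum(dmat[i][j] for j in own) / (len(own) - 1)  # dmat[i][i] is 0
        b = min(sum(dmat[i][j] for j in idx) / len(idx)
                for other, idx in clusters.items() if other != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)

def rank_methods(rows, n_clusters, distances, linkages):
    """Score every distance/linkage pair; best mean silhouette first."""
    results = []
    for dname, dfunc in distances.items():
        dmat = [[dfunc(a, b) for b in rows] for a in rows]
        for link in linkages:
            labels = agglomerate(dmat, n_clusters, link)
            results.append((mean_silhouette(dmat, labels), dname, link))
    return sorted(results, reverse=True)

# Hypothetical methylation beta values: 6 samples x 4 CpG sites.
rows = [[0.10, 0.20, 0.15, 0.80], [0.12, 0.18, 0.17, 0.82], [0.11, 0.22, 0.14, 0.79],
        [0.70, 0.75, 0.80, 0.20], [0.72, 0.73, 0.78, 0.22], [0.69, 0.77, 0.82, 0.18]]
distances = {"euclidean": math.dist, "canberra": canberra}
ranking = rank_methods(rows, 2, distances, ["average", "complete", "ward"])
for score, dname, link in ranking:
    print(f"{score:.3f}  {dname}  {link}")
```

Silhouette values computed under different distance measures are not on a common scale, so a ranking like this is best read as a heuristic guide rather than a formal test.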

“…This enables the exploration of complex data without the need for a priori definition of groups that may be biased by experimenter expectations. The most appropriate algorithm for clustering was determined empirically using the ClusterRank software (Clifford et al., 2011). To identify statistically significant differentially methylated sites, normalized data were log2 transformed and a t-test with a false discovery rate (FDR) multiple hypothesis correction was conducted to compare the mean scores between identified clusters.…”

Section: Methods (mentioning)

confidence: 99%
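
The statement above does not say which FDR procedure was used. Assuming the common Benjamini-Hochberg step-up method, the correction applied to the per-site t-test p-values could look like the sketch below. The function name and the p-values are hypothetical, and the t-tests themselves are not reproduced here.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices, smallest p first
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is the
    # minimum of p * m / rank over this rank and all larger ranks.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical p-values for four CpG sites.
pvalues = [0.01, 0.04, 0.03, 0.005]
adjusted = benjamini_hochberg(pvalues)
significant = [q <= 0.05 for q in adjusted]
print(adjusted, significant)
```

Sites whose adjusted value falls at or below the chosen FDR threshold (0.05 in this example, also an assumption) would be reported as differentially methylated.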

Antiepileptic drugs and the fetal epigenome

et al. 2012

Self Cite

SUMMARY Antiepileptic drugs (AEDs) can lower maternal folate and increase maternal homocysteine levels, which are known to affect the methyl cycle and hence DNA methylation levels. The influence of in utero exposure to AEDs on fetal DNA methylation was investigated. Genome-wide fetal epigenomic profiles were determined using the Infinium 27K BeadArray from Illumina (San Diego, CA, U.S.A.). The Infinium array measures approximately 27,000 CpG loci associated with 14,496 genes at single-nucleotide resolution. Eighteen cord blood samples (nine samples from babies exposed to AEDs and nine controls) from otherwise uncomplicated pregnancies were compared. Unsupervised hierarchic clustering was used to compare the calculated methylation profiles. A clear distinction between the methylation profiles of samples from babies exposed to AEDs in utero compared with controls was detected. These data provide evidence of an epigenetic effect associated with antenatal AED and high-dose folate supplementation during pregnancy. The differences in fetal DNA methylation of those exposed to AEDs shows that a genomewide effect of methylation is evident. In addition, the epigenetic changes observed appear to be, in this limited sample, independent of extremes of birth weight centiles. These preliminary data highlight possible mechanisms by which AEDs might influence fetal outcomes and the potential of optimizing AED-specific folate supplementation regimens to offset these effects.

“…Another study used both techniques to cluster transcription factors [101]. Clifford et al [102] compared hierarchical clustering to other clustering techniques (k-means, k-medoids, and fuzzy clustering) to determine the most appropriate one for analyzing Illumina methylation data. Since no significant difference was found between the methods, a combination was proposed; the final output will be given by the method that achieves the best results in each case.…”

Section: Bioinformatics Of Personalized Epigenetics (mentioning)

confidence: 99%