Background: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. Results: To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state-of-the-art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. Conclusions: GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.
A plethora of candidate genes have been identified for complex polygenic disorders, but the underlying disease mechanisms remain largely unknown. We explored the pathophysiology of type 2 diabetes (T2D) by analyzing global gene expression in human pancreatic islets. A group of coexpressed genes (module), enriched for interleukin-1-related genes, was associated with T2D and reduced insulin secretion. One of the module genes that was highly overexpressed in islets from T2D patients is SFRP4, which encodes secreted frizzled-related protein 4. SFRP4 expression correlated with inflammatory markers, and its release from islets was stimulated by interleukin-1β. Elevated systemic SFRP4 caused reduced glucose tolerance through decreased islet expression of Ca²⁺ channels and suppressed insulin exocytosis. SFRP4 thus provides a link between islet inflammation and impaired insulin secretion. Moreover, the protein was increased in serum from T2D patients several years before the diagnosis, suggesting that SFRP4 could be a potential biomarker for islet dysfunction in T2D.
Long non-coding RNAs (lncRNAs) can act as scaffolds that promote the interaction of proteins, RNA, and DNA. There is increasing evidence of sequence-specific interactions of lncRNAs with DNA via triple-helix (triplex) formation. This process allows lncRNAs to recruit protein complexes to specific genomic regions and regulate gene expression. Here we propose a computational method called Triplex Domain Finder (TDF) to detect triplexes and characterize DNA-binding domains and DNA targets statistically. Case studies showed that this approach can detect the known domains of lncRNAs Fendrr, HOTAIR and MEG3. Moreover, we validated a novel DNA-binding domain in MEG3 by a genome-wide sequencing method. We used TDF to perform a systematic analysis of the triplex-forming potential of lncRNAs relevant to human cardiac differentiation. We demonstrated that the lncRNA with the highest triplex-forming potential, GATA6-AS, forms triple helices in the promoter of genes relevant to cardiac development. Moreover, down-regulation of GATA6-AS impairs GATA6 expression and cardiac development. These data indicate the unique ability of our computational tool to identify novel triplex-forming lncRNAs and their target genes.
There is a growing perception that long non-coding RNAs (lncRNAs) modulate cellular function. In this study, we analyzed the role of the lncRNA HOTAIR in mesenchymal stem cells (MSCs) with particular focus on senescence-associated changes in gene expression and DNA-methylation (DNAm). HOTAIR binding sites were enriched at genomic regions that become hypermethylated with increasing cell culture passage. Overexpression and knockdown of HOTAIR inhibited or stimulated adipogenic differentiation of MSCs, respectively. Modification of HOTAIR expression evoked only very moderate effects on gene expression, particularly of polycomb group target genes. Furthermore, overexpression and knockdown of HOTAIR resulted in DNAm changes at HOTAIR binding sites. Five potential triple helix forming domains were predicted within the HOTAIR sequence based on reverse Hoogsteen hydrogen bonds. Notably, the predicted triple helix target sites for these HOTAIR domains were also enriched in differentially expressed genes and close to DNAm changes upon modulation of HOTAIR. Electrophoretic mobility shift assays provided further evidence that HOTAIR domains form RNA–DNA–DNA triplexes with predicted target sites. Our results demonstrate that HOTAIR impacts on differentiation of MSCs and that it is associated with senescence-associated DNAm. Targeting of epigenetic modifiers to relevant loci in the genome may involve triple helix formation with HOTAIR.
Dendritic cells (DC) are professional antigen presenting cells that develop from hematopoietic stem cells through successive steps of lineage commitment and differentiation. Multipotent progenitors (MPP) are committed to DC restricted common DC progenitors (CDP), which differentiate into specific DC subsets, classical DC (cDC) and plasmacytoid DC (pDC). To determine epigenetic states and regulatory circuitries during DC differentiation, we measured consecutive changes of genome-wide gene expression, histone modification and transcription factor occupancy during the sequel MPP-CDP-cDC/pDC. Specific histone marks in CDP reveal a DC-primed epigenetic signature, which is maintained and reinforced during DC differentiation. Epigenetic marks and transcription factor PU.1 occupancy increasingly coincide upon DC differentiation. By integrating PU.1 occupancy and gene expression we devised a transcription factor regulatory circuitry for DC commitment and subset specification. The circuitry provides the transcription factor hierarchy that drives the sequel MPP-CDP-cDC/pDC, including Irf4, Irf8, Tcf4, Spib and Stat factors. The circuitry also includes feedback loops inferred for individual or multiple factors, which stabilize distinct stages of DC development and DC subsets. In summary, here we describe the basic regulatory circuitry of transcription factors that drives DC development.
Background: Primary cells enter replicative senescence after a limited number of cell divisions. This process needs to be considered in cell culture experiments, and it is particularly important for regenerative medicine. Replicative senescence is associated with reproducible changes in DNA methylation (DNAm) at specific sites in the genome. The mechanism that drives senescence-associated DNAm changes remains unknown: it may involve stochastic DNAm drift due to imperfect maintenance of epigenetic marks, or it may be directly regulated at specific sites in the genome. Results: In this study, we analyzed the reorganization of nuclear architecture and DNAm changes during long-term culture of human fibroblasts and mesenchymal stromal cells (MSCs). We demonstrate that telomeres shorten and shift towards the nuclear center at later passages. In addition, DNAm profiles, either analyzed by MethylCap-seq or by 450k Illumina BeadChip technology, revealed consistent senescence-associated hypermethylation in regions associated with H3K27me3, H3K4me3, and H3K4me1 histone marks, whereas hypomethylation was associated with chromatin containing H3K9me3 and lamina-associated domains (LADs). DNA hypermethylation was significantly enriched in the vicinity of genes that are either up- or downregulated at later passages. Furthermore, specific transcription factor binding motifs (e.g. EGR1, TFAP2A, and ETS1) were significantly enriched in differentially methylated regions and in the promoters of differentially expressed genes. Conclusions: Senescence-associated DNA hypermethylation occurs at specific sites in the genome and reflects functional changes in the course of replicative senescence. These results indicate that tightly regulated epigenetic modifications during long-term culture contribute to changes in nuclear organization and gene expression. Electronic supplementary material: The online version of this article (doi:10.1186/s13148-015-0057-5) contains supplementary material, which is available to authorized users.