BackgroundMethods for the integrative analysis of multi-omics data are required to draw a more complete and accurate picture of the dynamics of molecular systems. The complexity of biological systems, the technological limits, the large number of biological variables and the relatively low number of biological samples make the analysis of multi-omics datasets a non-trivial problem.Results and ConclusionsWe review the most advanced strategies for integrating multi-omics datasets, focusing on mathematical and methodological aspects.
PURPOSE Recurrently mutated genes and chromosomal abnormalities have been identified in myelodysplastic syndromes (MDS). We aim to integrate these genomic features into disease classification and prognostication. METHODS We retrospectively enrolled 2,043 patients. Using Bayesian networks and Dirichlet processes, we combined mutations in 47 genes with cytogenetic abnormalities to identify genetic associations and subgroups. Random-effects Cox proportional hazards multistate modeling was used for developing prognostic models. An independent validation on 318 cases was performed. RESULTS We identify eight MDS groups (clusters) according to specific genomic features. In five groups, dominant genomic features include splicing gene mutations ( SF3B1, SRSF2, and U2AF1) that occur early in disease history, determine specific phenotypes, and drive disease evolution. These groups display different prognosis (groups with SF3B1 mutations being associated with better survival). Specific co-mutation patterns account for clinical heterogeneity within SF3B1- and SRSF2-related MDS. MDS with complex karyotype and/or TP53 gene abnormalities and MDS with acute leukemia–like mutations show poorest prognosis. MDS with 5q deletion are clustered into two distinct groups according to the number of mutated genes and/or presence of TP53 mutations. By integrating 63 clinical and genomic variables, we define a novel prognostic model that generates personally tailored predictions of survival. The predicted and observed outcomes correlate well in internal cross-validation and in an independent external cohort. This model substantially improves predictive accuracy of currently available prognostic tools. We have created a Web portal that allows outcome predictions to be generated for user-defined constellations of genomic and clinical features. CONCLUSION Genomic landscape in MDS reveals distinct subgroups associated with specific clinical features and discrete patterns of evolution, providing a proof of concept for next-generation disease classification and prognosis.
The liver connects the intestinal portal vasculature with the general circulation, employing diverse immune cells to protect from gut-translocating pathogens 1 . In liver lobules, blood flows from portal triads (PTs) situated in peri-portal (PP) lobular regions to the central vein (CV) via a polarized sinusoidal network. Despite this asymmetry, liver resident immune cells are considered to be broadly dispersed across the lobule. This differs from lymphoid organs, in which immune cells adopt spatially biased positions to promote effective host defense 2,3 . Here we employed quantitative multiplex imaging, genetic perturbations, transcriptomics, infection-based assays,
In this paper we compared taxonomic results obtained by metataxonomics (16S rRNA gene sequencing) and metagenomics (whole shotgun metagenomic sequencing) to investigate their reliability for bacteria profiling, studying the chicken gut as a model system. The experimental conditions included two compartments of gastrointestinal tracts and two sampling times. We compared the relative abundance distributions obtained with the two sequencing strategies and then tested their capability to distinguish the experimental conditions. The results showed that 16S rRNA gene sequencing detects only part of the gut microbiota community revealed by shotgun sequencing. Specifically, when a sufficient number of reads is available, Shotgun sequencing has more power to identify less abundant taxa than 16S sequencing. Finally, we showed that the less abundant genera detected only by shotgun sequencing are biologically meaningful, being able to discriminate between the experimental conditions as much as the more abundant genera detected by both sequencing strategies.
The role of epigenetics in endothelial cell senescence is a cutting-edge topic in ageing research. However, little is known of the relative contribution to pro-senescence signal propagation provided by microRNAs shuttled by extracellular vesicles (EVs) released from senescent cells. Analysis of microRNA and DNA methylation profiles in non-senescent (control) and senescent (SEN) human umbilical vein endothelial cells (HUVECs), and microRNA profiling of their cognate small EVs (sEVs) and large EVs demonstrated that SEN cells released a significantly greater sEV number than control cells. sEVs were enriched in miR-21-5p and miR-217, which target DNMT1 and SIRT1. Treatment of control cells with SEN sEVs induced a miR-21/miR-217-related impairment of DNMT1-SIRT1 expression, the reduction of proliferation markers, the acquisition of a senescent phenotype and a partial demethylation of the locus encoding for miR-21. MicroRNA profiling of sEVs from plasma of healthy subjects aged 40-100 years showed an inverse U-shaped age-related trend for miR-21-5p, consistent with senescence-associated biomarker profiles. Our findings suggest that miR-21-5p/miR-217 carried by SEN sEVs spread pro-senescence signals, affecting DNA methylation and cell replication.
Changes in blood epigenetic age have been associated with several pathological conditions and have recently been described to anticipate cancer development. In this work, we analyze a publicly available leukocytes methylation dataset to evaluate the relation between DNA methylation age and the prospective development of specific types of cancer. We calculated DNA methylation age acceleration using five state-of-the-art estimators (three multi-site: Horvath, Hannum, Weidner; and two CpG specific: ELOV2 and FHL2) in a cohort including 845 subjects from the EPIC-Italy project and we compared 424 samples that remained cancer-free over the approximately ten years of follow-up with 235 and 166 subjects who developed breast and colorectal cancer, respectively. We show that the epigenetic age estimated from blood DNA methylation data is statistically significantly associated to future breast and male colorectal cancer development. These results are corroborated by survival analysis that shows significant association between age acceleration and cancer incidence suggesting that the chance of developing age-related diseases may be predicted by circulating epigenetic markers, with a dependence upon tumor type, sex and age estimator. These are encouraging results towards the non-invasive and perspective usage of epigenetic biomarkers.
Aging is a biological process characterized by the progressive functional decline of many interrelated physiological systems. In particular, aging is associated with the development of a systemic state of low-grade chronic inflammation (inflammaging), and with progressive deterioration of metabolic function. Systems biology has helped in identifying the mediators and pathways involved in these phenomena, mainly through the application of high-throughput screening methods, valued for their molecular comprehensiveness. Nevertheless, inflammation and metabolic regulation are dynamical processes whose behavior must be understood at multiple levels of biological organization (molecular, cellular, organ, and system levels) and on multiple time scales. Mathematical modeling of such behavior, with incorporation of mechanistic knowledge on interactions between inflammatory and metabolic mediators, may help in devising nutritional interventions capable of preventing, or ameliorating, the age-associated functional decline of the corresponding systems.
Motivation DNA methylation is a stable epigenetic mark with major implications in both physiological (development, aging) and pathological conditions (cancers and numerous diseases). Recent research involving methylation focuses on the development of molecular age estimation methods based on DNA methylation levels (mAge). An increasing number of studies indicate that divergences between mAge and chronological age may be associated to age-related diseases. Current advances in high-throughput technologies have allowed the characterization of DNA methylation levels throughout the human genome. However, experimental methylation profiles often contain multiple missing values that can affect the analysis of the data and also mAge estimation. Although several imputation methods exist, a major deficiency lies in the inability to cope with large datasets, such as DNA methylation chips. Specific methods for imputing missing methylation data are therefore needed. Results We present a simple and computationally efficient imputation method, metyhLImp, based on linear regression. The rationale of the approach lies in the observation that methylation levels show a high degree of inter-sample correlation. We performed a comparative study of our approach with other imputation methods on DNA methylation data of healthy and disease samples from different tissues. Performances have been assessed both in terms of imputation accuracy and in terms of the impact imputed values have on mAge estimation. In comparison to existing methods, our linear regression model proves to perform equally or better and with good computational efficiency. The results of our analysis provide recommendations for accurate estimation of missing methylation values. Availability and implementation The R-package methyLImp is freely available at https://github.com/pdilena/methyLImp. Supplementary information Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.