Computational Biology 2019
DOI: 10.15586/computationalbiology.2019.ch5
|View full text |Cite
|
Sign up to set email alerts
|

Multivariate Statistical Methods for High-Dimensional Multiset Omics Data Analysis

Abstract: This chapter covers the state-of-the-art multivariate statistical methods designed for high-dimensional multiset omics data analysis. Recent biotechnological developments have enabled large-scale measurement of various biomolecular data, such as genotypic and phenotypic data, dispersed over various omics domains. An emergent research direction is to analyze these data sources using an integrated approach to better model and understand the underlying biology of complex disease conditions. However, comprehensive… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 53 publications
(65 reference statements)
0
8
0
Order By: Relevance
“…To identify which variables were affected with multicollinearity, we tested the variance inflation factors (VIFs). VIFs were less than two, suggesting a moderate correlation, although not sufficiently strong to justify any remedial action [ 50 ]. Data processing and statistical calculations were conducted using IBM SPSS version 25 (1989–2018 by IBM Corp.©, Armonk, New York, NY, USA).…”
Section: Methodsmentioning
confidence: 99%
“…To identify which variables were affected with multicollinearity, we tested the variance inflation factors (VIFs). VIFs were less than two, suggesting a moderate correlation, although not sufficiently strong to justify any remedial action [ 50 ]. Data processing and statistical calculations were conducted using IBM SPSS version 25 (1989–2018 by IBM Corp.©, Armonk, New York, NY, USA).…”
Section: Methodsmentioning
confidence: 99%
“…Concatenating matrices will further amplify these differences. The concept of penalized multiblock analysis accompanied with feature selection is suitable for this situation. , It also manages the imbalance of signal scales between platforms. Deng et al reported efficient classification of sample groups in integrated LC-MS and NMR data with feature selection for multiblock PLS-DA .…”
Section: Chemoinformatics and Computational Modelingmentioning
confidence: 99%
“…To date, omics research in TBI is still in its infancy, and most studies approach different aspects of TBI pathophysiology, leaving little space for consolidation of data from multiple sources. Additionally, the statistical modeling of TBI poses a challenge in view of the heterogeneity of the disease and since data generated by high-throughput technology may be measured in thousands to millions per sample [22]. This high dimensionality carries statistical difficulties such as sparsity, multicollinearity, model complexity, and model overfitting [112].…”
Section: Statistical Challenges Artificial Intelligence and Machine/deep Learningmentioning
confidence: 99%
“…Therefore, no single omics modality can completely reflect the complexity of TBI, and a system biology approach is needed. To accomplish this, multiset techniques based on PLS and CCA are available [22]. In addition, network and enrichment analysis is valuable to identify molecules of pathophysiological significance and to understand the downstream flow of information from DNA to physiology [60].…”
Section: Statistical Challenges Artificial Intelligence and Machine/deep Learningmentioning
confidence: 99%