2023
DOI: 10.1007/978-981-19-8460-0_9
|View full text |Cite
|
Sign up to set email alerts
|

STREAMLINE: A Simple, Transparent, End-To-End Automated Machine Learning Pipeline Facilitating Data Analysis and Algorithm Comparison

Abstract: Objective: While machine learning (ML) includes a valuable array of tools for analyzing biomedical data with multivariate and complex underlying associations, significant time and expertise is required to assemble effective, rigorous, comparable, reproducible, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine Learning Pipeline (STREA… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 12 publications
(15 citation statements)
references
References 79 publications
(70 reference statements)
0
9
0
Order By: Relevance
“…Aside from the Youden’s J statistic-based threshold model for our dichotomous diagnostic test, a variety of statistical, probabilistic, and optimization techniques can be applied to genomic data using machine learning (ML) for binary classification. An automated end-to-end ML pipeline called STREAMLINE 47 was used for the binary classification of the breast tumor and paired normal DNA (as in Fig. 3 ).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Aside from the Youden’s J statistic-based threshold model for our dichotomous diagnostic test, a variety of statistical, probabilistic, and optimization techniques can be applied to genomic data using machine learning (ML) for binary classification. An automated end-to-end ML pipeline called STREAMLINE 47 was used for the binary classification of the breast tumor and paired normal DNA (as in Fig. 3 ).…”
Section: Resultsmentioning
confidence: 99%
“…Machine learning binary classification modeling and evaluation was conducted using version 0.2.5 of the Simple Transparent End-To-End Automated Machine Learning Pipeline (STREAMLINE) 44 applying mostly default pipeline run parameters (exceptions identified below). STREAMLINE is a recently developed AutoML tool that facilitates ML modeling while enforcing rigorous modeling algorithm comparisons, analysis transparency, reproducibility, and sensitivity to complex associations in data.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To investigate the classi cation performance of liver enzymes in differentiating amyloid status, we used a machine learning approach with penalized logistic regression using the STREAMLINE tool [30]. This approach proved advantageous in reducing the effect of multicollinearity on feature selection [31].…”
Section: Discussionmentioning
confidence: 99%
“…The ROC curves and mean AUC of machine learning approach using penalized logistic regression Sensitivity is on the y-axis and 1-speci city is on the x-axis. 30 Abbreviations: ALP, alkaline phosphatase; ALT, alanine aminotransferase; APOE, apolipoprotein E; AST, aspartate aminotransferase; AUC, area under the curve; ROC, receiver operating characteristic.…”
Section: Figurementioning
confidence: 99%