Comparing machine learning and logistic regression methods for predicting hypertension using a combination of gene expression and next-generation sequencing data

Held, E; Cape, Joshua; Tintle, Nathan L.

doi:10.1186/s12919-016-0020-2

Cited by 29 publications

(30 citation statements)

References 7 publications

(13 reference statements)

“…In a more direct way, Held et al [ 14 ] built support vector machine models to predict disease status from genes that simultaneously collapse genotype variants and use gene expression effects. Specifically, based on 637 individuals with a simulated hypertension phenotype, some or all of the first 150 simulated data sets were used for a selection of interesting genes (training), and three from the remaining 50 simulated data sets were used for classification (testing).…”

Section: Message #2: Exploiting the Information From Different Data T… (mentioning)

confidence: 99%

“…The required hyperparameters are derived from cross-validation. Held et al [ 14 ] find that the predictive performance is slightly higher for a support vector machine with a linear kernel than for the other methods. With logistic regression and use of a radial kernel, the performance decreases with a greater number of genes.…”

Section: Message #2: Exploiting the Information From Different Data T… (mentioning)

confidence: 99%

“…In a most general approach, rare variants are somehow collapsed into some kind of pseudo markers, as seen in the contributions by Sun et al [ 11 ] or Held et al [ 14 ]. This could easily be incorporated in other machine learning approaches as well.…”

Section: Message #2: Exploiting the Information From Different Data T… (mentioning)

confidence: 99%
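
The collapsing idea in the statement above can be illustrated with a minimal sketch. It assumes a simple burden count: for each gene, sum the minor-allele counts over variants whose minor allele frequency falls below a threshold. The function name, the 0.05 threshold, and the toy data are hypothetical, and this is not necessarily the collapsing scheme used by Held et al or Sun et al.

```python
def collapse_rare_variants(genotypes, maf, variant_gene, maf_threshold=0.05):
    """Collapse rare variants into one gene-level pseudo marker per individual.

    genotypes:    {individual: {variant: minor-allele count (0, 1 or 2)}}
    maf:          {variant: minor allele frequency}
    variant_gene: {variant: gene the variant belongs to}
    Returns {individual: {gene: summed minor-allele count over rare variants}}.
    """
    # Keep only variants below the (assumed) rarity threshold.
    rare = [v for v, freq in maf.items() if freq < maf_threshold]
    genes = sorted({variant_gene[v] for v in rare})

    burden = {}
    for person, calls in genotypes.items():
        scores = dict.fromkeys(genes, 0)
        for v in rare:
            # Missing calls are treated as 0 copies (an assumption).
            scores[variant_gene[v]] += calls.get(v, 0)
        burden[person] = scores
    return burden


# Toy data, invented for illustration only.
variant_gene = {"v1": "GENE_A", "v2": "GENE_A", "v3": "GENE_B"}
maf = {"v1": 0.01, "v2": 0.30, "v3": 0.02}
genotypes = {
    "ind1": {"v1": 1, "v2": 2, "v3": 0},
    "ind2": {"v1": 0, "v2": 1, "v3": 2},
}
burden = collapse_rare_variants(genotypes, maf, variant_gene)
```

The resulting gene-level scores can then serve as ordinary features for any classifier, which is the sense in which such pseudo markers carry over to other machine learning approaches.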

“…When focusing on the clinical validity, that is, whether a genetic test is able to predict or identify a disease of interest, both the strength of the association and diagnostic or prognostic value have to be measured. As an example for this, the area under the curve was reported in the contribution by Held et al [ 14 ].…”

Section: Message #3: Evaluating the Evidence From Machine Learning Me… (mentioning)

confidence: 99%
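
For readers unfamiliar with the area under the curve mentioned above, the following sketch computes it through its rank interpretation: the probability that a randomly chosen case receives a higher score than a randomly chosen control, with ties counted as one half. The scores and labels are invented for illustration and are not results from the cited paper.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise case/control comparisons."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")

    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0   # case ranked above control
            elif c == k:
                wins += 0.5   # tie counts as half
    return wins / (len(cases) * len(controls))


# Hypothetical predicted risks and true disease status (1 = case).
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
example_auc = auc(scores, labels)  # 8 of 9 case/control pairs are ordered correctly
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of cases from controls.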

Machine learning and data mining in complex genomic data—a review on the lessons learned in Genetic Analysis Workshop 19

et al. 2016

Self Cite

In the analysis of current genomic data, application of machine learning and data mining techniques has become more attractive given the rising complexity of the projects. As part of the Genetic Analysis Workshop 19, approaches from this domain were explored, mostly motivated from two starting points. First, assuming an underlying structure in the genomic data, data mining might identify this and thus improve downstream association analyses. Second, computational methods for machine learning need to be developed further to efficiently deal with the current wealth of data. In the course of discussing results and experiences from the machine learning and data mining approaches, six common messages were extracted. These depict the current state of these approaches in the application to complex genomic data. Although some challenges remain for future studies, important forward steps were taken in the integration of different data types and the evaluation of the evidence. Mining the data for underlying genetic or phenotypic structure and using this information in subsequent analyses proved to be extremely helpful and is likely to become of even greater use with more complex data sets.

“…At the second stage, SoftMax regression was applied to classify the health status of individuals using the learned features. Brain images have been obtained under various health conditions [24]. These images have constructed a (6)…”

Section: Proposed Methods (mentioning)

confidence: 99%

An intelligent Alzheimer’s disease diagnosis method using unsupervised feature learning

2019

In computer systems, especially with the advancement of the Internet and databases, big data is increasingly expanding and is advancing exponentially [1-4]. This is mostly true in medical big data and images. Therefore, the issue of exploding data shows the concept and power of the big data. In the field of medicine, especially magnetic resonance imaging (MRI) images, the issue of big data with high data dimensions is investigated [5]. As people grow older in the community, an untreated disease would be common, which is called Alzheimer's, and it has been proven that it has no treatment, but it can be prevented from development with timely diagnosis. Alzheimer is known as the most common disease among the various causes of dementia and with each passing decade, the number of people infected with the disease is almost doubled. For this reason, timely

Revisit of Machine Learning Supported Biological and Biomedical Studies

Wang

Zeng

2018

Methods in Molecular Biology

Generally, machine learning includes many in silico methods to transform the principles underlying natural phenomenon to human understanding information, which aim to save human labor, to assist human judge, and to create human knowledge. It should have wide application potential in biological and biomedical studies, especially in the era of big biological data. To look through the application of machine learning along with biological development, this review provides wide cases to introduce the selection of machine learning methods in different practice scenarios involved in the whole biological and biomedical study cycle and further discusses the machine learning strategies for analyzing omics data in some cutting-edge biological studies. Finally, the notes on new challenges for machine learning due to small-sample high-dimension are summarized from the key points of sample unbalance, white box, and causality.
