2016
DOI: 10.3389/fgene.2016.00136
|View full text |Cite
|
Sign up to set email alerts
|

Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications

Abstract: Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and delib… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 46 publications
(53 reference statements)
0
6
0
Order By: Relevance
“…Differentiating statistical from machine learning methods is somewhat arbitrary [ 18 , 51 ]. Typical examples of machine learning algorithms are artificial neural networks (ANNs) [ 10 ], SVMs, and random forests [ 52 ]. Okazaki and Ott [ 53 ] provided a good review on machine learning approaches to digenic inherence.…”
Section: Machine Learning Approachesmentioning
confidence: 99%
“…Differentiating statistical from machine learning methods is somewhat arbitrary [ 18 , 51 ]. Typical examples of machine learning algorithms are artificial neural networks (ANNs) [ 10 ], SVMs, and random forests [ 52 ]. Okazaki and Ott [ 53 ] provided a good review on machine learning approaches to digenic inherence.…”
Section: Machine Learning Approachesmentioning
confidence: 99%
“…Notably, the SVM provided the overall best performance for our dataset because it is a machine learning approach for binary classification and can handle the smaller datasets well ( 54 ). RT appeared to give the poorest performance, likely due to underfitting caused by smaller dataset ( 55 ). To further assess the performance of the top-ranking features, an independent urine sample was applied to validate the models.…”
Section: Resultsmentioning
confidence: 99%
“…Such precise positions can help prevent the pitfalls of annotation errors, making our analysis more robust. We speculate that such analysis can also aid in predicting the early onset of cancers [ 21 , 22 ]. Taken together, our analyses could provide early roads for prognosis where these genes could aid as key candidates.…”
Section: Resultsmentioning
confidence: 99%