Computational Optical Imaging and Artificial Intelligence in Biomedical Sciences 2024
DOI: 10.1117/12.3002539
|View full text |Cite
|
Sign up to set email alerts
|

Predicting gene families from human DNA sequences using machine learning: a logistic regression approach

Nkgaphe Tsebesebe,
Kelvin Mpofu,
Sphumelele Ndlovu
et al.

Abstract: Machine learning is a powerful technique for analysing large-scale data and learning patterns, which provides high accuracy and shorter processing times. In this work, a machine learning algorithm (multinomial logistic regression) is used to predict the gene families from a human DNA sequence. 4380 sequences were converted into overlapping k-mers of length 6 to produce 232 414 k-mers. The data set was split into 80/20 train and test datasets, and the multinomial logistic regression model achieved a 93.9% accur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 22 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?