Review on the Application of Machine Learning Algorithms in the Sequence Data Mining of DNA

Yang, Aimin; Zhang, Wei; Wang, Jiahao; Yang, Ke; Han, Yang; Zhang, Limin

doi:10.3389/fbioe.2020.01032

Cited by 100 publications

(55 citation statements)

References 39 publications

“…Classification is a very important method of data mining [ 27 ]. The concept of classification is to learn a classification function or construct a classification model on the basis of existing data, which is commonly referred to as a classifier.…”

Section: Basic Theoretical Knowledge Of Metalearning (mentioning)

confidence: 99%

[Retracted] Review on the Application of Metalearning in Artificial Intelligence

Zhang, Wang, et al. 2021

Computational Intelligence and Neuroscience

Self Cite

In recent years, artificial intelligence supported by big data has gradually become more dependent on deep reinforcement learning. However, the application of deep reinforcement learning in artificial intelligence is limited by prior knowledge and model selection, which further affects the efficiency and accuracy of prediction, and also fails to realize the learning ability of autonomous learning and prediction. Metalearning came into being because of this. Through learning the information metaknowledge, the ability to autonomously judge and select the appropriate model can be formed, and the parameters can be adjusted independently to achieve further optimization. It is a novel method to solve big data problems in the current neural network model, and it adapts to the development trend of artificial intelligence. This article first briefly introduces the research process and basic theory of metalearning and discusses the differences between metalearning and machine learning and the research direction of metalearning in big data. Then, four typical applications of metalearning in the field of artificial intelligence are summarized: few-shot learning, robot learning, unsupervised learning, and intelligent medicine. Then, the challenges and solutions of metalearning are analyzed. Finally, a systematic summary of the full text is made, and the future development prospect of this field is assessed.

“…Identifying a GMO in a food processed from several species means that the analysts have to manage a larger matrix than previously performed. Design of experiments (DOE), machine learning, artificial neural networks, fuzzy logic, or genetic algorithms are some of the available tools to manage the big data that knowledge matrices could become (Alley et al, 2020; Nielsen and Voigt, 2018; Sivarajah et al, 2017; Yang et al, 2020; Yin et al, 2017). The management of millions of SNPs used in genomic selection show it is easily manageable.…”

Section: Signatures and Scars In Processed Products (mentioning)

confidence: 99%

“…These big data could then be analysed by the species and mutagenesis category used to distinguish the similarities and differences, at least genetic, caused to species not mutated by man. Despite the numerous errors present in the sequence bases (Bertheau, 2019; Steinegger and Salzberg, 2020; Tang, 2020), the large number of sequences available, whether or not from GMOs, should, with a reasoned use of various statistical and DSS software and artificial intelligence, make it possible to distinguish scars and signatures (Alley et al, 2020; Block et al, 2013; Cadzow et al, 2014; Guillot et al, 2014; Interdonato et al, 2020; Koumakis, 2020; Nielsen and Voigt, 2018; Yang et al, 2020). Finally, some experiments should be enough to demonstrate the universality of the concept.…”

Section: Proof Of Concept (mentioning)

confidence: 99%

Advances in identifying GM plants: toward the routine detection of ‘hidden’ and ‘new’ GMOs

Bertheau 2021

Burleigh Dodds Series in Agricultural Science

In 2018 the Court of Justice of the European Union recalled that organisms with genomes modified by artifactual techniques should be considered GMOs under European regulations. GMOs derived from cultures of cells isolated in vitro or from new genomic techniques must therefore be traceable. This chapter reviews the various technical steps and characteristics of those techniques causing genomic and epigenomic scars and signatures. These intentional and unintentional traces, some of which are already used for varietal identification, and are being standardized, can be used to identify these GMOs and differentiate them from natural mutants. The chapter suggests a routine procedure for operators and control laboratories to achieve this without additional costs.

“…Their results show that the SVM is superior to the Fisher linear discrimination classifier based on 10-fold cross-validation by 14.8%. Yang et al (2020) provided a review that introduced sequencing technology development and explains the structure of DNA sequence data and sequence similarity. Second, they analyzed the necessary DM process, summarized several of the significant ML algorithms, and highlighted the future challenges faced by ML algorithms in extracting biological sequence data and possible future solutions.…”

Section: Different Applications (mentioning)

confidence: 99%

Genetic variations analysis for complex brain disease diagnosis using machine learning techniques: opportunities and hurdles

Ahmed, Alarabi, El-Sappagh, et al. 2021

PeerJ Computer Science

Background and Objectives: This paper presents an in-depth review of the state-of-the-art genetic variations analysis to discover complex genes associated with the brain’s genetic disorders. We first introduce the genetic analysis of complex brain diseases, genetic variation, and DNA microarrays. Then, the review focuses on available machine learning methods used for complex brain disease classification. Therein, we discuss the various datasets, preprocessing, feature selection and extraction, and classification strategies. In particular, we concentrate on studying single nucleotide polymorphisms (SNP) that support the highest resolution for genomic fingerprinting for tracking disease genes. Subsequently, the study provides an overview of the applications for some specific diseases, including autism spectrum disorder, brain cancer, and Alzheimer’s disease (AD). The study argues that despite the significant recent developments in the analysis and treatment of genetic disorders, there are considerable challenges to elucidate causative mutations, especially from the viewpoint of implementing genetic analysis in clinical practice. The review finally provides a critical discussion on the applicability of genetic variations analysis for complex brain disease identification highlighting the future challenges.

Methods: We used a methodology for literature surveys to obtain data from academic databases. Criteria were defined for inclusion and exclusion. The selection of articles was followed by three stages. In addition, the principal methods for machine learning to classify the disease were presented in each stage in more detail.

Results: It was revealed that machine learning based on SNP was widely utilized to solve problems of genetic variation for complex diseases related to genes.

Conclusions: Despite significant developments in genetic diseases in the past two decades of the diagnosis and treatment, there is still a large percentage in which the causative mutation cannot be determined, and a final genetic diagnosis remains elusive. So, we need to detect the variations of the genes related to brain disorders in the early disease stages.
