We proposed a categorized method of DNA sequences matrix by FCM (fuzzy cluster means). FCM avoided the errors caused by the reduction of dimensions. It further reached comprehensive machine learning. In our experiment, there are 40 training data which are artificial samples, and we verify the proposed method with 182 natural DNA sequences. The result showed the proposed method enhanced the accuracy of the classification of genes from 76% to 93%.
Abstract. This paper proposed a noise filter with L 2 -norm distance method to design a classification of RNA sequences for the species identification, included of the small sample size of the nucleic acid sequence. This method amended and expanded the study by Hu et al. in 2011 [1]. We verified this method with the biological sample "slipper orchids" and its hybrid for biological species identification test. The result is showed that after applied our method, we can distinguish the paternity of a hybrid among a set of samples of "slipper orchids".
Abstract-This paper proposes a noise filter with L 2 -norm distance method to design a classification of RNA sequences for the species identification, including of the small sample size of the nucleic acid sequence. This method amends and expands the study of Hu et al. in 2011 [1]. We verify this method with the biological sample "slipper orchids" and its hybrid for biological species identification test. The result shows that we can distinguish the paternity of a hybrid among a set of samples of "slipper orchids" by using this method.Index Terms-L 2 -norm distance; nucleic acid sequence; species identification I. INTRODUCTIONThis method is mainly based on L 2 -norm distance to classify the amino acid sequences, to do pre-processing filtering noise toward the non-A, U, C, G character analyzed through electrophoresis, and to check the progeny of hybrid. This study found that we can easily and efficiently differentiate the species relationships of "slipper orchids" samples by this method, which modified Hu et al.'s study [1]. They explored the sequence analysis but didn't mention the method which might fail. That is, A small sample size of RNA sequence may not be successfully classified by artificial intelligence methods with the mathematical calculation [1] [12]. Pre-processing and noise filtering can solve garbled electrophoresis and effectively resolve the problem of automated RNA sequencing analysis [14] [16] [17]. Consequently, further expansion of the species can truly be applied to biological classification, as Table 2-3.In the past, the "morphological" observation method [3] was widely adopted to make species identification toward animals and plants. However, the conditions necessary for such identification are very strict. There must be a complete animal and plant appearance or the characteristics parts of that type of animal and plant [2]. RNA records genetic characteristics of organisms, and various species have different genetic composition. Also, different individuals of the same species can be distinguished through RNA analysis.This study amends the classification of RNA sequences proposed by Hu et al. in 2011 [1], launching mathematical analysis to solve the garbled problem due to the small sample electrophoretic analysis of nucleic acid sequences. RNA electrophoresis analysis has the characteristics of negatively charged nucleic acids which cross the gel in the electric field and move towards the cathode. Because of different Manuscript received May 16, 2011; revised June 20, 2011. molecular weights, the gel pore size varies in the speed of movement, so as to separate the different sizes of nucleic acids. However, RNA sequencing generally employs vertical electrophoresis [13]. The range of gel electrophoresis analysis can analyze from several nucleotides to millions of chromosomal RNA of nucleotides. However, it has a resolving power within a certain range and can't analyze any RNA fragments of various sizes with a colloid. Therefore, to obtain excellent resolving power, we must explore the range ...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.