Abstract. MERS (Middle EastRespiratory Syndrome) is a worldwide disease these days. The number of infected people is 1038(08/03/2015) in Saudi Arabia and 186(08/03/2015) in South Korea. MERS is all over the world including Europe and the fatality rate is 38.8%, East Asia and the Middle East. The MERS is also known as a cousin of SARS (Severe Acute Respiratory Syndrome) because both diseases show similar symptoms such as high fever and difficulty in breathing. This is why we compared MERS with SARS. We used data of the spike glycoprotein from NCBI. As a way of analyzing the protein, apriori algorithm, decision tree, SVM were used, and particularly SVM was iterated by normal, polynomial, and sigmoid. The result came out that the MERS and the SARS are alike but also different in some way.
Abstract-EbolaVirus, which has high facilities up to 90%, is introduced into the human population through close contact with the blood, secretions, organs or other bodily fluids of infected animals. It is mostly occurred in Central and West Africa, near tropical rainforests and their host are bats. In this paper, we analyzed the DNA sequences of 5 Ebolavirus : Bundibugyo Ebolavirus, Reston Ebolavirus, Sudan Ebolavirus, TaiForest Ebolavirus, and Zaire Ebolavirus and investigated the difference between them based on the genus they are involved in. Furthermore, we look for the frequency of the amino acid and find the similarities between the Ebolaviruses.Index Terms-Ebolavirus, bundibugyo ebolavirus, reston ebolavirus, sudan ebolavirus, taiforest ebolavirus, zaire ebolavirus, decision tree, apriori algorithm.
H3N2, H5N1 and H5N8 virus were widespread epidemic in South Korea. Especially in 2014 Korea, the serious outbreak of avian influenza caused by H5N8 took place, effecting not only birds but also dogs. Antibody of H5N8 virus was found on a dog which differentiated the virus from existing H3N2 canine virus. At this point, we wanted to find out why H5N8 was self-medicated in dogs and whether H5N8 would cross species boundaries and be fatal to dogs or other species. While H5N1 is avian influenza like H5N8, many cases of fatal infections among dogs caused by H5N1 have been reported. Another kind of avian influenza, H3N2 is most common type of canine influenza in Asia. With the use of decision tree and apriori algorithm, we could find out characteristics of H5N8 by comparing it with H5N1 and H3N2.
Dengue fever, caused by the dengue virus, has been a widespread epidemic during the 21 st century. A mosquito-borne RNA virus, the dengue virus has four serotypes; all are able to cause the disease. Vaccination for the virus is arduous, since the vaccine must be able to immunize all four serotypes. In order to investigate genomic similarities and differences, we analyzed the genomes of the four serotypes: Dengue virus 1, Dengue virus 2, Dengue virus 3, and Dengue virus 4. We investigated the positions on each genome that had significant differences by using the decision tree. We also tried to find the similarities of the four viruses with the apriori algorithm. Through our experiment, we were able to investigate both the genomic similarities and the differences of each serotype, and were able to reach an interesting conclusion that the viruses, though they possess certain similarities, have an unusually large number of differences amongst themselves.
Unlike direct treatment in the past, nowadays, data mining of information of diseases is very useful to cure patients. Also, with prediction of DNA sequence of specific illnesses, lots of people can avoid them. Bioinformatics, study of union of life science, biology and informatics, becomes one of the most important subjects to the future medical industry. A number of scientists and engineers have developed this area and as a result, various methodologies in aligning DNA sequences such as hidden markov model, artificial neural networks and support vector machines were developed during the last few decades. Especially, Support Vector Machine(SVM) is used in Supervised Learning, finding the furthermost hyperplane that separates given data. Unlike other methods, we can get more sophisticated and accurate results with learning method. Because of using SVM that has little parameters, we can also simplify the complex pattern and it is so effective in data analysis that we can easily investigate elements which have an effect on results. Moreover, to improve exactitude our study, we search and use DNA sequence data about HIV from NCBI(National Center for Biotechnology Information), which have reliable and numerous data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.