Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under conventional flow of gene information, gene sequence laboratories send out raw and inferred information via Internet to several sequence libraries. DNA sequencing storage costs will be minimised by use of cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and design drugs. Three base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from National Center for Biotechnology Information genomic sequence database.
Substantial research and monetary aids to healthcare establishments are provided by cloud computing. A benign position to store and handle vast genome data is offered by cloud services. Labs for gene sequencing send out raw and contingent data over the Internet to multiple sequence collections under conservative flow of gene information. The use of cloud services also reduces the storage costs of deoxyribonucleic acid (DNA) sequencing. Here, an efficient and new bio-informatics genomic system is proposed by the use of cloud services from Amazon to access the stored gene data and process it. A key task in bio-informatics is to locate protein-coding sections in a gene sequence based on three base periodicity (TBP) is for disease diagnosis and design drugs. Here, a novel cloud-based adaptive exon predictor (AEP) using Amazon cloud services is proposed to improve the accuracy in exon finding ability as well as aimed at superior convergence. Noise in the input gene sequence given to the proposed AEPs is pre-processed using normalized LMS filtering. Computational complexity can be reduced using proposed data normalized form of least logarithmic absolute difference (NLLAD) algorithm and its error normalized variants. It was shown that sign regressor NLLAD (SRNLLAD) dependent AEP is efficient in exon forecast applications using different metrics for a performance like sensitivity 0.8037, precision 0.8052 along with specificity 0.8146 by different gene sequences considered from the National Center for Biotechnology Information (NCBI) databank. The proposed AEPs have shown upright performance than typical LMS and other AEPs in terms of exon prediction accuracy, convergence, and computational complexity. Their less computational complexity will be found attractive, and they are suitable to use in bio-informatics nano devices.INDEX TERMS Amazon cloud services, bio-informatics, convergence, deoxyribonucleic acid, National Center for Biotechnology Information, base three periodicity.
In the field of Bio-informatics, locating the exon fragments in a deoxyribonucleic acid (DNA) sequence is an important and vital work. Study of protein coding regions is a wide phenomenon in identification of diseases and design of drugs. The regions of DNA that have the protein coding information are termed as exons. Hence identifying the exon segments in a genomic sequence is a crucial job in bio-informatics. Three base periodicity (TBP) has been observed in the regions of DNA sequences can be easily determined by applying signal processing methods. Adaptive signal processing techniques found to be useful than other available methods. This is due to their unique capability to alter weight coefficients based on genomic sequence. We propose efficient adaptive exon predictors (AEPs) based on these considerations using Proportionate Normalized LMS (PNLMS) algorithm and Maximum Proportionate Normalized LMS (MPNLMS) algorithm to improve exon locating ability and better convergence. To ease the complexity of computations in the denominator during filtering process, proposed AEPs using PNLMS and its maximum variants are combined with signature algorithms. Hybrid variants of proposed AEPs include PNLMS, DCPNLMS, ECPNLMS, SSPNLMS, MPNLMS, MDCPNLMS, MECPNLMS and MSSPNLMS algorithms. It was shown that the AEP based on MDCPNLMS is superior in applications of exon identification depending on performance measures with Sensitivity 0.7346, Specificity 0.7483 and precision 0.7325 for a genomic sequence with accession AF009962 at a threshold of 0.8. Finally the capability of several AEPs in predicting exon locations is verified using different DNA sequences found in National Center for Biotechnology Information (NCBI) gene database.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.