Increasing use of therapeutic peptides for treating cancer has received considerable attention of the scientific community in the recent years. The present study describes the in silico model developed for predicting and designing anticancer peptides (ACPs). ACPs residue composition analysis revealed the preference of A, F, K, L and W. Positional preference analysis revealed that residue A, F and K are preferred at N-terminus and residue L and K are preferred at C-terminus. Motif analysis revealed the presence of motifs like LAKLA, AKLAK, FAKL, LAKL in ACPs. Prediction models were developed using various input features and implementing different machine learning classifiers on two datasets main and alternate dataset. In the case of main dataset, ETree Classifier based model developed using dipeptide composition achieved maximum MCC of 0.51 and 0.83 AUROC on the training dataset. In the case of alternate dataset, ETree Classifier based model developed using amino acid composition performed best and achieved the highest MCC of 0.80 and AUROC of 0.97 on the training dataset. Models were trained and tested using five-fold cross validation technique and their performance was also evaluated on the validation dataset. Best models were implemented in the webserver AntiCP 2.0, freely available at https://webs.iiitd.edu.in/raghava/anticp2. The webserver is compatible with multiple screens such as iPhone, iPad, laptop, and android phones. The standalone version of the software is provided in the form of GitHub package as well as in docker technology.
EGFRIndb gathers biological and chemical information on EGFR inhibitors from the literature. It is hoped that it will serve as a useful resource in drug discovery and provide data for docking, virtual screening and Quantitative structure-activity relationship (QSAR) model development to the cancer researchers.
BackgroundLiver Hepatocellular Carcinoma (LIHC) is the second major cancer worldwide, responsible for millions of premature deaths every year. Prediction of clinical staging is vital to implement optimal therapeutic strategy and prognostic prediction in cancer patients. However, to date, no method has been developed for predicting stage of LIHC from genomic profile of samples.ResultsIn current study, in silico models have been developed for classifying LIHC patients in early and late stage using RNA expression and DNA methylation data. The Cancer Genome Atlas (TCGA) dataset contains 173 early and 177 late stage samples of LIHC, was extensively analysed to identify differentially expressed RNA transcripts and methylated CpG sites that can discriminate early and late stages of LIHC samples with high precision. Naive Bayes model developed using 51 features that combine 21 CpG methylation sites and 30 RNA transcripts achieved maximum MCC 0.58 with accuracy 78.87% on validation dataset. Further, we also analysed genomics and epigenomics profiles of normal and LIHC samples and developed model to classify LIHC samples with AUROC 0.99. In addition, multiclass models developed for classifying samples in normal, early and late stage of cancer and achieved accuracy of 76.54% and AUROC of 0.86.ConclusionOur study reveals stage prediction of LIHC samples with high accuracy based on genomics and epigenomics profiling is a challenging task in comparison to classification of LIHC and normal samples. Comprehensive analysis, differentially expressed RNA transcripts, methylated CpG sites in LIHC samples and prediction models are available from CancerLSP ( http://webs.iiitd.edu.in/raghava/cancerlsp/).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.