Membrane proteins are vital mediating molecules responsible for the interaction of a cell with its surroundings. These proteins are involved in different functionalities such as ferrying of molecules and nutrients across membrane, recognizing foreign bodies, receiving outside signals and translating them into the cell. Membrane proteins play significant role in drug interaction as nearly 50% of the drug targets are membrane proteins. Due to the momentous role of membrane protein in cell activity, computational models able to predict membrane protein with accurate measures bears indispensable importance. The conventional experimental methods used for annotating membrane proteins are time-consuming and costly and in some cases impossible. Computationally intelligent techniques have emerged to be as a useful resource in the automation of prediction and hence the annotation process. In this study, various techniques have been reviewed that are based on different computational intelligence models used for prediction process. These techniques were formulated by different researchers and were further evaluated to provide a comparative analysis. Analysis shows that the usage of support vector machine-based prediction techniques bears more assiduous results.
The most expedient unit of the human body is its cell. Encapsulated within the cell are many infinitesimal entities and molecules which are protected by a cell membrane. The proteins that are associated with this lipid based bilayer cell membrane are known as membrane proteins and are considered to play a significant role. These membrane proteins exhibit their effect in cellular activities inside and outside of the cell. According to the scientists in pharmaceutical organizations, these membrane proteins perform key task in drug interactions. In this study, a technique is presented that is based on various computationally intelligent methods used for the prediction of membrane protein without the experimental use of mass spectrometry. Statistical moments were used to extract features and furthermore a Multilayer Neural Network was trained using backpropagation for the prediction of membrane proteins. Results show that the proposed technique performs better than existing methodologies.
For many biological functions membrane proteins (MPs) are considered crucial. Due to this nature of MPs, many pharmaceutical agents have reflected them as attractive targets. It bears indispensable importance that MPs are predicted with accurate measures using effective and efficient computational models (CMs). Annotation of MPs using in vitro analytical techniques is time-consuming and expensive; and in some cases, it can prove to be intractable. Due to this scenario, automated prediction and annotation of MPs through CM based techniques have appeared to be useful. Based on the use of computational intelligence and statistical moments based feature set, an MP prediction framework is proposed. Furthermore, the previously used dataset has been enhanced by incorporating new MPs from the latest release of UniProtKB. Rigorous experimentation proves that the use of statistical moments with a multilayer neural network, trained using back-propagation based prediction techniques allows more thorough results.
Cancerlectins are significantly important group of lectins that have an inhibitory effect on cancer cells with respect to their growth. They have a vital role in various tumor cell interactions like adhesion, growth, metastasis, differentiation and mainly in cellular infection. The investigations associated with cancerlectins are applicable to relevant studies in laboratories, diagnostics and therapy in clinical applications, and drug discoveries in targeting cancers. Prediction of cancerlectins is considered a helpful task due to the fact that they are specifically useful in dissecting cancers. Although, several Bioinformatics tools have been developed to predict cancerlectins, however, the need for improvement in the quality of its prediction model requires enhancements in the annotation and determination process of cancerlectins. In this study, a new model is proposed that builds on statistical moments based features to distinguish cancerlectins from non-cancerlectins. The currently proposed model achieved an accuracy of 88.36% using jackknife test which is better than current state-of-the-art models. These outcomes suggest that the use of statistical moments could bear more effective and efficient results. For the accessibility of the scientific community, a user-friendly web server has been developed which will associate the researchers in medical science. Web server is freely accessible at https://www.biopred.org/canlect. INDEX TERMS Cancerlectins, Hahn moments, lectins, moment invariants, PRIM.
Sumoylation is the post-translational modification that is involved in the adaption of the cells and the functional properties of a large number of proteins. Sumoylation has key importance in subcellular concentration, transcriptional synchronization, chromatin remodeling, response to stress, and regulation of mitosis. Sumoylation is associated with developmental defects in many human diseases such as cancer, Huntington’s, Alzheimer’s, Parkinson’s, Spin cerebellar ataxia 1, and amyotrophic lateral sclerosis. The covalent bonding of Sumoylation is essential to inheriting part of the operative characteristics of some other proteins. For that reason, the prediction of the Sumoylation site has significance in the scientific community. A novel and efficient technique is proposed to predict the Sumoylation sites in proteins by incorporating Chou’s Pseudo Amino Acid Composition (PseAAC) with statistical moments-based features. The outcomes from the proposed system using 10 fold cross-validation testing are 94.51%, 94.24%, 94.79% and 0.8903% accuracy, sensitivity, specificity and MCC, respectively. The performance of the proposed system is so far the best in comparison to the other state-of-the-art methods. The codes for the current study are available on the GitHub repository using the link: https://github.com/csbioinfopk/iSumoK-PseAAC.
Enhancers regulate gene expression, by playing a crucial role in the synthesis of RNAs and proteins. They do not directly encode proteins or RNA molecules. In order to control gene expression, it is important to predict enhancers and their potency. Given their distance from the target gene, lack of common motifs, and tissue/cell specificity, enhancer regions are thought to be difficult to predict in DNA sequences. Recently, a number of bioinformatics tools were created to distinguish enhancers from other regulatory components and to pinpoint their advantages. However, because the quality of its prediction method needs to be improved, its practical application value must also be improved. Based on nucleotide composition and statistical moment-based features, the current study suggests a novel method for identifying enhancers and non-enhancers and evaluating their strength. The proposed study outperformed state-of-the-art techniques using fivefold and tenfold cross-validation in terms of accuracy. The accuracy from the current study results in 86.5% and 72.3% in enhancer site and its strength prediction respectively. The results of the suggested methodology point to the potential for more efficient and successful outcomes when statistical moment-based features are used. The current study's source code is available to the research community at https://github.com/csbioinfopk/enpred.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.