Características biológicas e morfológicas de cepas brasileiras de Schistosoma mansoni em Mus musculus

Extracellular matrix (ECM) proteins play an important role in a series of biological processes of cells. The study of ECM proteins is helpful to further comprehend their biological functions. We propose ECMP-RF (extracellular matrix proteins prediction by random forest) to predict ECM proteins. Firstly, the features of the protein sequence are extracted by combining encoding based on grouped weight, pseudo amino-acid composition, pseudo position-specific scoring matrix, a local descriptor, and an autocorrelation descriptor. Secondly, the synthetic minority oversampling technique (SMOTE) algorithm is employed to process the class imbalance data, and the elastic net (EN) is used to reduce the dimension of the feature vectors. Finally, the random forest (RF) classifier is used to predict the ECM proteins. Leave-one-out cross-validation shows that the balanced accuracy of the training and testing datasets is 97.3% and 97.9%, respectively. Compared with other state-of-the-art methods, ECMP-RF is significantly better than other predictors.

show abstract

Fertility-LightGBM: A fertility-related protein prediction model by multi-information fusion and light gradient boosting machine

Wang

Yue

Yang

et al. 2021

Biomedical Signal Processing and Control

View full text Add to dashboard Cite

Elementary Transformation and its Applications for Split Quaternion Matrices

Wang

Yue

Liu

2019

Adv. Appl. Clifford Algebras

View full text Add to dashboard Cite

The Real Representation of Canonical Hyperbolic Quaternion Matrices and Its Applications

Wang¹,

Yue²,

Xu³

et al. 2019

AJAMS

View full text Add to dashboard Cite

show abstract

Fertility-LightGBM: A fertility-related protein prediction model by multi-information fusion and light gradient boosting machine

Yue

Wang

Yang

et al. 2020

Preprint

View full text Add to dashboard Cite

The identification of fertility-related proteins plays an essential part in understanding the embryogenesis of germ cell development. Since the traditional experimental methods are expensive and time-consuming to identify fertility-related proteins, the purposes of predicting protein functions from amino acid sequences appeared. In this paper, we propose a fertility-related protein prediction model. Firstly, the model combines protein physicochemical property information, evolutionary information and sequence information to construct the initial feature space ‘ALL’. Then, the least absolute shrinkage and selection operator (LASSO) is used to remove redundant features. Finally, light gradient boosting machine (LightGBM) is used as a classifier to predict. The 5-fold cross-validation accuracy of the training dataset is 88.5%, and the independent accuracy of the training dataset is 91.5%. The results show that our model is more competitive for the prediction of fertility-related proteins, which is helpful for the study of fertility diseases and related drug targets.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lingling Yue

Prediction of Extracellular Matrix Proteins by Fusing Multiple Feature Information, Elastic Net, and Random Forest Algorithm

Fertility-LightGBM: A fertility-related protein prediction model by multi-information fusion and light gradient boosting machine

Elementary Transformation and its Applications for Split Quaternion Matrices

The Real Representation of Canonical Hyperbolic Quaternion Matrices and Its Applications

Fertility-LightGBM: A fertility-related protein prediction model by multi-information fusion and light gradient boosting machine

Contact Info

Product

Resources

About