2020
DOI: 10.1016/j.omtn.2020.08.022
|View full text |Cite
|
Sign up to set email alerts
|

An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP

Abstract: Recent studies have increasingly shown that the chemical modification of mRNA plays an important role in the regulation of gene expression. N 7 -methylguanosine (m7G) is a type of positively-charged mRNA modification that plays an essential role for efficient gene expression and cell viability. However, the research on m7G has received little attention to date. Bioinformatics tools can be applied as auxiliary methods to identify m7G sites in transcriptomes. In this study, we develop a no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
67
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 105 publications
(68 citation statements)
references
References 36 publications
0
67
0
Order By: Relevance
“…It uses a sparsity-aware learning algorithm to process sparse data and weighted quantile sketch to approximate tree learning [41]. Since the decision tree is a simple classifier composed of hierarchically organized dichotomous determinations, its structure also demonstrates good interpretability [48][49][50]. In addition, the model can deal with missing values well.…”
Section: Prediction Modelmentioning
confidence: 99%
“…It uses a sparsity-aware learning algorithm to process sparse data and weighted quantile sketch to approximate tree learning [41]. Since the decision tree is a simple classifier composed of hierarchically organized dichotomous determinations, its structure also demonstrates good interpretability [48][49][50]. In addition, the model can deal with missing values well.…”
Section: Prediction Modelmentioning
confidence: 99%
“…It is a powerful tool to peer inside black box models and understand how they arrive at a particular decision. Multiple recent studies used only SHAP values for variable selection 35 – 39 , however, only the variables with the greatest impact as defined by average absolute SHAP value were chosen. This is in contrast to this study, in which it was shown that variables with the greatest contribution are not necessarily robust (Fig.…”
Section: Discussionmentioning
confidence: 99%
“…In the application of biomedicine, Bi et al (2020) [23] developed a new interpretive machine learning approach using the XGBoost algorithm and six different types of sequential encoding schemes to distinguish m7G sites, with cross-validation showing that their approach was more accurate than other models. Mahmud et al (2019) [24] validated the reliability and superiority of the XGBoost classifier for the determination of drug-target interactions (DTI).…”
Section: Extreme Gradient Boosting (Xgboost) Algorithm Applicationsmentioning
confidence: 99%