2018
DOI: 10.1002/cmdc.201800533
|View full text |Cite
|
Sign up to set email alerts
|

In Silico Prediction of Blood–Brain Barrier Permeability of Compounds by Machine Learning and Resampling Methods

Abstract: The blood-brain barrier (BBB) as a part of absorption protects the central nervous system by separating the brain tissue from the bloodstream. In recent years, BBB permeability has become a critical issue in chemical ADMET prediction, but almost all models were built using imbalanced data sets, which caused a high false-positive rate. Therefore, we tried to solve the problem of biased data sets and built a reliable classification model with 2358 compounds. Machine learning and resampling methods were used simu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
137
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 133 publications
(147 citation statements)
references
References 45 publications
(81 reference statements)
0
137
1
Order By: Relevance
“…To solve this problem, the above‐mentioned researchers have attempted many approaches, such as combining data, applying various thresholds for dividing data class, and using different oversampling ratios. The results have clearly demonstrated that these approaches are not suitable for such research (Wang et al, ).…”
Section: Introductionmentioning
confidence: 99%
“…To solve this problem, the above‐mentioned researchers have attempted many approaches, such as combining data, applying various thresholds for dividing data class, and using different oversampling ratios. The results have clearly demonstrated that these approaches are not suitable for such research (Wang et al, ).…”
Section: Introductionmentioning
confidence: 99%
“…The work of Shen et al developed SVM models using 1593 compounds (1283 BBB + and 310 BBB − ) by using different pattern selection methods and obtained the overall accuracy of 98.2% [80]. Both methods have the limitation of unbalanced datasets (where the number of BBB+ is higher than the BBB− within the training set), which was addressed on the work of Wang et al by using resampling methods coupled with the machine-learning techniques, to achieve accuracy rates of 0.919 in external test data [81]. Wang and collaborators compiled a dataset of 439 unique molecules, which were employed to generate a diverse set of QSAR models and consensus (R 2 = 0.504 for external dataset prediction).…”
Section: Permeability and The Use Of Cellular And Noncellular Modelsmentioning
confidence: 99%
“…While such models assign speci c logBB/PR values for each drug, binary models have so far reached a higher prediction accuracy and provide a preliminary insight regarding the behavior of candidate drugs which is su cient in early drug discovery stages. Predominantly, binarization of drug permeability across the BBB is performed by setting empirical thresholds to logBB values [5][6][7][8][9]. However, S. Kunwittaya et al [6] have shown that varying logBB thresholds lead to a difference in the prediction accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…In this context, different types of classi ers were trained in the literature including Support Vector Machines (SVM) [6,8,11,12], Linear Discriminant Analysis (LDA) [13], Arti cial Neural Networks (ANN) [6] and Multi-Layer Perceptron (MLP) [8,9], k-Nearest Neighbors (k-NN) [8], Decision Trees (DT) [6,7] and Random Forests (RF) [5,8,9]. Other studies apply consensus models, by training and combining multiple classi ers [8,9]. While consensus models mitigate the over tting problem of single classi ers, they naturally require high computational power, especially when dealing with high dimensional data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation