2019
DOI: 10.1021/acs.jcim.9b00867
|View full text |Cite
|
Sign up to set email alerts
|

Feature Selection for Polymer Informatics: Evaluating Scalability and Robustness of the FS4RVDD Algorithm Using Synthetic Polydisperse Data Sets

Abstract: The feature selection (FS) process is a key step in the Quantitative Structure-Property Relationship (QSPR) modeling of physicochemical properties in Cheminformatics. In particular, the inference of QSPR models for polymeric material properties constitutes a complex problem because of the uncertainty introduced by the polydispersity of these materials. The main challenge is how to capture the polydispersity information from the molecular weight distribution (MWD) curve to achieve a more effective computational… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 50 publications
(90 reference statements)
0
6
0
Order By: Relevance
“…After comparing 79 model setups using 3 different polymer representations, 7 feature representations, and 8 ML algorithms, we reveal the pros and cons behind different options. Such a systematic benchmark study provides valuable guidance for the model selection in polymer’s T g prediction, and it would also benefit other polymer informatics tasks such as mechanical, electronic, or optical properties. …”
Section: Introductionmentioning
confidence: 99%
“…After comparing 79 model setups using 3 different polymer representations, 7 feature representations, and 8 ML algorithms, we reveal the pros and cons behind different options. Such a systematic benchmark study provides valuable guidance for the model selection in polymer’s T g prediction, and it would also benefit other polymer informatics tasks such as mechanical, electronic, or optical properties. …”
Section: Introductionmentioning
confidence: 99%
“…Several researchers focused on developing a method for analyzing the performance of several polymers including CPE. That are all can be depicted below, Cravero et al (2019) [8] examine the FS4RVDD algorithm's scalability and reliability. The measurement of the data bank, the modulo of the aspect subsets, noise in data, and kind of correlation were all varied and combined to create synthetic data (linear and nonlinear).…”
Section: Related Workmentioning
confidence: 99%
“…Many groups have conducted experiments and used polymer informatics to research polymers actively and to enhance their performance or discover new polymers. Recently, machine learning has been applied to polymer informatics, and this has produced remarkable results [1][2][3][4][5]. Thus, the progress of machine learning algorithms is important for polymer informatics; however, the availability of systematic datasets for machine learning is more important [6].…”
Section: Introductionmentioning
confidence: 99%