2021
DOI: 10.1021/acs.est.1c04326

“pySiRC”: Machine Learning Combined with Molecular Fingerprints to Predict the Reaction Rate Constant of the Radical-Based Oxidation Processes of Aqueous Organic Contaminants

Abstract: We developed a web application built on machine learning and molecular fingerprint algorithms for the automatic calculation of the reaction rate constant of the oxidative processes of organic pollutants by •OH and SO4•− radicals in the aqueous phase: the pySiRC platform. The model development followed the OECD principles: internal and external validation, applicability domain, and mechanistic interpretation. Three machine learning algorithms combined with molecular fingerprints were evaluated, and all t…

Cited by 60 publications (44 citation statements)
References: 73 publications
“…Bioinformatics tools play an important role in monitoring SARS-CoV-2 and provide non-computational users with the possibility to analyze data and advanced knowledge related to COVID-19 (Hufsky et al 2021). This is probably part of the reason that although several studies have been conducted to monitor SARS-CoV-2, and estimate the number of infected people from wastewater samples, they are mostly used by a small group of researchers (Sanches-Neto et al 2021). Therefore, in addition to studies of SARS-CoV-2 monitoring by experimental techniques such as RT-qPCR and genomic sequencing, it is extremely important to develop web applications to automate the SARS-CoV-2 detection combined with COVID-19 monitoring (Pérez-Cataluña et al 2022).…”
Section: Introduction (mentioning)
confidence: 99%
“…Mean and max Tanimoto similarity between compounds in the test set and the training set were used to assess the applicability domain as proposed in ref . The chosen threshold to include or remove a molecule from the applicability domain was found by incrementally varying the Tanimoto similarity from 0 to 0.4 and 0.05 for the max and mean, respectively.…”
Section: Results (mentioning)
confidence: 99%
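
The applicability-domain check described in the statement above can be illustrated with a minimal sketch. It is not the citing authors' code. It assumes fingerprints are represented as sets of on-bit indices, and that a molecule counts as in-domain only when both its mean and its max similarity to the training set reach their thresholds (the statement does not say how the two criteria are combined). The names `tanimoto`, `similarity_to_training`, and `in_domain`, and the example fingerprints, are hypothetical.

```python
from typing import FrozenSet, List, Tuple

# A fingerprint is modelled as the set of its "on" bit indices (an assumption
# made for this sketch; real fingerprints would come from a cheminformatics toolkit).
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto similarity: shared on-bits divided by on-bits in either set."""
    union = len(a | b)
    if union == 0:
        return 0.0  # convention chosen here for two empty fingerprints
    return len(a & b) / union


def similarity_to_training(query: Fingerprint,
                           training: List[Fingerprint]) -> Tuple[float, float]:
    """Return (mean, max) Tanimoto similarity of a query to the training set."""
    sims = [tanimoto(query, t) for t in training]
    return sum(sims) / len(sims), max(sims)


def in_domain(query: Fingerprint, training: List[Fingerprint],
              mean_threshold: float, max_threshold: float) -> bool:
    """Assumed rule: both the mean and the max similarity must reach their thresholds."""
    mean_sim, max_sim = similarity_to_training(query, training)
    return mean_sim >= mean_threshold and max_sim >= max_threshold


# Hypothetical toy data, not taken from any paper.
training_set = [frozenset({1, 2, 3, 4}), frozenset({2, 3, 4, 5}), frozenset({10, 11})]
query_near = frozenset({1, 2, 3})   # overlaps with two training molecules
query_far = frozenset({20, 21})     # shares no bits with the training set

# Upper ends of the scan ranges quoted above: 0.4 for max, 0.05 for mean.
print(in_domain(query_near, training_set, mean_threshold=0.05, max_threshold=0.4))
print(in_domain(query_far, training_set, mean_threshold=0.05, max_threshold=0.4))
```

Scanning the thresholds, as the statement describes, would amount to calling `in_domain` over a grid of `mean_threshold` and `max_threshold` values and tracking how test-set error changes with the molecules retained.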
“…Methods like SHAP (SHapley Additive exPlanations, a game theoretic approach to explain outputs of machine learning models) could be applied to add physical meaning and intuition. This was applied recently for rate constant predictions where important variables from fixed fingerprints were identified. However, this is not possible in our case since we have learned representations that do not exactly correspond to physical features.…”
Section: Results (mentioning)
confidence: 99%
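
To make the game-theoretic idea behind SHAP concrete, the sketch below computes exact Shapley values by brute-force enumeration of feature subsets. It does not use the SHAP library and is not the method of either paper. The toy model, its coefficients, and the function names `shapley_values` and `toy_log_k` are hypothetical, and "absent" features are assumed to take a baseline value of 0 (bit off).

```python
from itertools import combinations
from math import factorial
from typing import Callable, List, Sequence


def shapley_values(model: Callable[[Sequence[float]], float],
                   x: Sequence[float],
                   baseline: Sequence[float]) -> List[float]:
    """Exact Shapley attribution of model(x) - model(baseline) to each feature.

    Cost grows as 2**n, so this is only practical for a handful of features.
    """
    n = len(x)

    def value(present: frozenset) -> float:
        # Features in `present` take their real value; the rest take the baseline.
        z = [x[i] if i in present else baseline[i] for i in range(n)]
        return model(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for combo in combinations(others, size):
                s = frozenset(combo)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_log_k(bits: Sequence[float]) -> float:
    """Hypothetical model of log k on three fingerprint bits, with one interaction term."""
    return 9.0 + 0.5 * bits[0] - 0.3 * bits[1] + 0.2 * bits[0] * bits[2]


x = [1, 1, 1]          # molecule with all three bits on
baseline = [0, 0, 0]   # reference molecule with all bits off
phi = shapley_values(toy_log_k, x, baseline)
print(phi)
```

The attributions sum to `toy_log_k(x) - toy_log_k(baseline)`, and the interaction term is split evenly between the two bits involved. This per-bit reading is only meaningful when each input is a fixed, interpretable fingerprint bit, which is the limitation the statement raises for learned representations.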