2021
DOI: 10.1021/acs.jchemed.1c00142
|View full text |Cite
|
Sign up to set email alerts
|

A Gentle Introduction to Machine Learning for Chemists: An Undergraduate Workshop Using Python Notebooks for Visualization, Data Processing, Analysis, and Modeling

Abstract: Machine learning, a subdomain of artificial intelligence, is a widespread technology that is molding how chemists interact with data. Therefore, it is a relevant skill to incorporate into the toolbox of any chemistry student. This work presents a workshop that introduces machine learning for chemistry students based on a set of Python notebooks and assignments. Python, one of the most popular programming languages, is open source, free to use, and has plenty of learning resources. The workshop is designed for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
57
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 62 publications
(65 citation statements)
references
References 28 publications
0
57
0
Order By: Relevance
“…Figure 6 shows the scheme to obtain the polynomial equation from dataframe. The library allows for the treatment of large data sets in the convenient structure of dataframes and various tools for their handling; 52 this is especially useful when students manipulate large data sets.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 6 shows the scheme to obtain the polynomial equation from dataframe. The library allows for the treatment of large data sets in the convenient structure of dataframes and various tools for their handling; 52 this is especially useful when students manipulate large data sets.…”
Section: Discussionmentioning
confidence: 99%
“… 43 Lafuente et al carried out an interesting study directed to Machine Learning for Chemists through Jupyter notebooks. 52 Inside the Open Chemistry Project, Hanwell et al. presented the JupyterLab for use in Quantum chemistry, and the project was developed using Open Source Initiative.…”
Section: Introductionmentioning
confidence: 99%
“…However, we have found two recent publications from 2020 and 2021 where this approach is suggested. For chemistry students, Lafluente et al present an introductory workshop focusing on utilising Python and visualisation/ML libraries [66]. The example Jupyter notebooks 33 lead students through an introduction to 33 https://github.com/ML4chemArg/ Intro-to-Machine-Learning-in-Chemistry Python, basic statistics, exploratory data analysis, classification, and regression.…”
Section: Cs Pytorchmentioning
confidence: 99%
“…Recently there have been articles in multiple domains walking a domain expert through the best tools and techniques available to employ ML [52,75,111,66,92]. For example, Nakhle and Harfouche provide four detailed Jupyter notebooks 52 walking domain experts in phenomics (plant sciences) through four steps of a ML task [82].…”
Section: Guiding the Domain Expertmentioning
confidence: 99%
“…In Table S1 to S3, the data set explored is a collection of white and red wine from the "Vinho Verde" wine region in Portugal. [28][29][30] The data set for red wine consists of 1599 samples (Table S1) and the one for white wine consists of 4898 samples (Table S2). Each sample has 11 physicochemical variables: Fixed acidity (g(tartaric acid)/dm 3 ), Volatile acidity (g(acetic acid)/dm 3 ), Citric acid (g/dm 3 ), Residual sugar (g/dm 3 ), Chlorides (g(sodium chloride)/dm 3 ), Free sulfur dioxide (mg/dm 3 ), Total sulfur dioxide (mg/dm 3 ), Density (g/cm 3 ), pH, Sulphates (g(potassium sulphate)/dm 3 ), Alcohol (vol.%) and one qualitative variable: quality.…”
Section: Data Setmentioning
confidence: 99%