2020
DOI: 10.1002/wcms.1458
|View full text |Cite
|
Sign up to set email alerts
|

ChemML: A machine learning and informatics program package for the analysis, mining, and modeling of chemical and materials data

Abstract: ChemML is an open machine learning (ML) and informatics program suite that is designed to support and advance the data‐driven research paradigm that is currently emerging in the chemical and materials domain. ChemML allows its users to perform various data science tasks and execute ML workflows that are adapted specifically for the chemical and materials context. Key features are automation, general‐purpose utility, versatility, and user‐friendliness in order to make the application of modern data science a vi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
9

Relationship

2
7

Authors

Journals

citations
Cited by 48 publications
(30 citation statements)
references
References 40 publications
0
25
0
1
Order By: Relevance
“…In terms of feature representation, systematic development of new descriptors, standardization of their evaluation, and easier accessibility via user interfaces (e.g., Python libraries) are necessary to establish their long-term development. 60 A transparent and sustained study of feature representation would involve researchers with variety of domain knowledge and expertise to accelerate future developments.…”
Section: Llmentioning
confidence: 99%
See 1 more Smart Citation
“…In terms of feature representation, systematic development of new descriptors, standardization of their evaluation, and easier accessibility via user interfaces (e.g., Python libraries) are necessary to establish their long-term development. 60 A transparent and sustained study of feature representation would involve researchers with variety of domain knowledge and expertise to accelerate future developments.…”
Section: Llmentioning
confidence: 99%
“…16 The Bigger Picture Scarce and sparse chemical datasets. The number of unique small molecules is practically infinite, with number estimates of possible synthesizable small molecules ranging from 10 24 -10 60 . Machine learning could expand coverage of chemical space-for instance, in the design, synthesis, and development stages of drugsthat traditionally are resourceintensive tasks.…”
Section: Introductionmentioning
confidence: 99%
“…Initial molecular geometries for all 962 P-molecules were obtained from SMILES codes through a Python script utilizing the RDKit (RDK, 2000), ChemML (Haghighatlari et al, 2020), and ChemCoord (Weser, 2017) libraries.…”
Section: Methodsmentioning
confidence: 99%
“…Los 28 textos que abordan la CD y la CI y recibieron citas, han sido elaborados de manera colectiva (67.2 %) o individual (32.8 %). Por un lado, aquellos artículos con más integrantes participantes y citas recibidas son: "Geospatial Data Management Research: Progress and Future Directions" (Breunig et al, 2020) con nueve participantes y 1 cita recibida; "ChemML: A machine learning La informetría y el análisis del discurso aplicados a la producción científica en la Ciencia de Datos y Ciencia de la información Celso Martínez Musiño 9 and informatics program package for the analysis, mining, and modeling of chemical and materials data" (Haghighatlari et al, 2020), con ocho investigadores; 5 citas recibidas, y; "Reproducible research and GIScience: an evaluation using AGILE conference papers" (Nüst et al, 2018) Morriello (2020), redactados en portugués, francés, japonés e italiano. Aunque hay un predominio de la lengua inglesa como medio natural de la comunicación científica, se observa la presencia de idiomas distintos que cumplen con las exigencias y requerimientos normativos de las revistas y las bases de datos especializadas.…”
Section: Autoría E Idiomaunclassified