2023
DOI: 10.1038/d41586-023-03316-8
|View full text |Cite
|
Sign up to set email alerts
|

Garbage in, garbage out: mitigating risks and maximizing benefits of AI in research

Brooks Hanson,
Shelley Stall,
Joel Cutcher-Gershenfeld
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 13 publications
0
4
0
Order By: Relevance
“…The accuracy of the values predicted by an ANN model is relatively high for many cases ( Figure 2 ) [ 69 ]. However, if the training experimental data contain considerable error, the trained ANN model becomes inaccurate, for which it is said garbage in , garbage out [ 77 ]. The interpretability of an ANN model is relatively low because an ANN model is very complex from the viewpoint of simple causality [ 78 ].…”
Section: Machine Learning (Ml)mentioning
confidence: 99%
“…The accuracy of the values predicted by an ANN model is relatively high for many cases ( Figure 2 ) [ 69 ]. However, if the training experimental data contain considerable error, the trained ANN model becomes inaccurate, for which it is said garbage in , garbage out [ 77 ]. The interpretability of an ANN model is relatively low because an ANN model is very complex from the viewpoint of simple causality [ 78 ].…”
Section: Machine Learning (Ml)mentioning
confidence: 99%
“…In practice, however, this astranomical task is computationally impossible, which explains partly why this article puts forward just only a structural biophysics-based prototype (i.e., semaGIBAC), instead of a real GIBAC. Therefore, this article here again proposes an AI-based open strategy [95] to make it possible and conceivable to generate a vast amount of data to train a real GIBAC with reasonable accuracy and precision, as openness in data acquisition and generation, and AI algorithms is essential for promoting transparency, reproducibility, and collaboration within the community of drug discovery & design, and for facilitating continued improvement of the performance (accuracy, precision and efficiency) of GIBAC in precise drug discovery & design [35,36] in future. As mentioned above, this article puts forward a set of in silico steps of structural and biophysical data generation towards a paradigm shift in precise drug discovery and design [59,60].…”
Section: In Silico Generation Of Structural and Biophysical Data With...mentioning
confidence: 99%
“…This has led to missing values in multi‐source data integration (garbage in, garbage out). The academic community has formed a consensus regarding the significant issues of similar data quality [29,30] . The issue of missing data serves as a common problem among multi‐source data, [31] as demonstrated in research fields such as medicine to reduce model performance [32,33] .…”
Section: Introductionmentioning
confidence: 99%