2016
DOI: 10.1016/j.str.2015.12.010

Safeguarding Structural Data Repositories against Bad Apples

Abstract: Structural biology research generates large amounts of data, some deposited in public databases/repositories, but a substantial remainder never becoming available to the scientific community. Additionally, some of the deposited data contain less or more serious errors that may bias the results of data mining. Thorough analysis and discussion of these problems is needed in order to ameliorate this situation. This note is an attempt to propose some solutions and encourage both further discussion and action on th…

Cited by 37 publications (56 citation statements)
References 25 publications
“…The field of Zn coordination by macromolecules is sufficiently well grounded on the previous, competent studies, such as those presented in Refs. As another conclusion we reiterate the opinion that the field of structural biology should continue to be on the lookout and safeguard itself against fallacious data‐mining meta‐analyses, as well as against individual bad apples (macromolecular structures, especially with small‐molecule components) that contaminate our repositories …”
Section: Discussion (mentioning)
confidence: 71%
“…In order to perform a meaningful meta‐analysis, it is essential to ensure proper quality control of the input data. It has been pointed out more than once that one bad apple can spoil a whole bushel of decent data points. All that the authors seem to have done in this respect was to limit the resolution of the input structures to better than 3 Å, and examine the B factors of the ligands involved in unusual angles and to compare them to the average B factor for all the ligands.…”
Section: Discussion (mentioning)
confidence: 99%
“…The importance of retaining raw diffraction data has been emphasized numerous times (Jones et al, 1996; Androulakis et al, 2008; Baker et al, 2008; Jovine et al, 2008; Rupp, 2012; Domagalski et al, 2014; Minor et al, 2016). The International Union of Crystallography (IUCr) responded by forming the IUCr Diffraction Data Deposition Working Group (DDDWG) in 2011.…”
Section: Introduction (mentioning)
confidence: 99%
“…1/3) 6,7 . Such errors can easily become viral 24 and may confuse, among others, untrained scientists that lack time to fix stereochemical issues.…”
mentioning
confidence: 99%
“…Therefore, we suggest that in the absence of clear experimental or stereochemical evidence, it may be more suitable to not assign any ions in biomolecular structures with insufficient resolutions since those assignments may subsequently be taken for granted by others with the risk to create inappropriate prior knowledge affecting the outcome of database surveys 24 .…”
mentioning
confidence: 99%