2022
DOI: 10.1107/s2053273322007483
|View full text |Cite
|
Sign up to set email alerts
|

Towards a machine-readable literature: finding relevant papers based on an uploaded powder diffraction pattern

Abstract: A prototype application for machine-readable literature is investigated. The program is called pyDataRecognition and serves as an example of a data-driven literature search, where the literature search query is an experimental data set provided by the user. The user uploads a powder pattern together with the radiation wavelength. The program compares the user data to a database of existing powder patterns associated with published papers and produces a rank ordered according to their similarity score. The prog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 35 publications
(31 reference statements)
0
2
0
Order By: Relevance
“…To give more perspective, modern X-ray detectors can generate as many as 1,000,000 images per day, and data post-processing, analysis, and interpretation of the experiments can take over a year [Doucet et al 2020;Wang et al 2017b]. The ability of machine learning methods to process big data, and find patterns in complex data, and the computer vision algorithms for the autonomous detection of images can play a significant role in accelerating the existing workflows at beamline user facilities by providing immediate feedback during the experiments [Wang et al 2017b;Doucet et al 2020;Sullivan et al 2019;Yanxon et al 2023;Wang et al 2017b;Banko et al 2021;Özer et al 2022;Venderley et al 2022].…”
Section: Materials Characterizationmentioning
confidence: 99%
“…To give more perspective, modern X-ray detectors can generate as many as 1,000,000 images per day, and data post-processing, analysis, and interpretation of the experiments can take over a year [Doucet et al 2020;Wang et al 2017b]. The ability of machine learning methods to process big data, and find patterns in complex data, and the computer vision algorithms for the autonomous detection of images can play a significant role in accelerating the existing workflows at beamline user facilities by providing immediate feedback during the experiments [Wang et al 2017b;Doucet et al 2020;Sullivan et al 2019;Yanxon et al 2023;Wang et al 2017b;Banko et al 2021;Özer et al 2022;Venderley et al 2022].…”
Section: Materials Characterizationmentioning
confidence: 99%
“…A CSP protocol uses similarity indices to identify duplicate structures, 25 but they are also useful in other contexts such as classification and database searching. [26][27][28][29][30] Similiarity indices based on the comparison of powder diffraction patterns (in the following, "powder-based indices") are attractive because they can be used for both duplicate identification and to compare the candidate structures with experimental XRPD patterns. A further advantage of powder-based indices is that they identify structurally related compounds (conformational phases, isomorphous systems) as similar.…”
Section: Introductionmentioning
confidence: 99%