2019
DOI: 10.1371/journal.pbio.3000416
|View full text |Cite
|
Sign up to set email alerts
|

Predicting translational progress in biomedical research

Abstract: Fundamental scientific advances can take decades to translate into improvements in human health. Shortening this interval would increase the rate at which scientific discoveries lead to successful treatment of human disease. One way to accomplish this would be to identify which advances in knowledge are most likely to translate into clinical research. Toward that end, we built a machine learning system that detects whether a paper is likely to be cited by a future clinical trial or guideline. Despite the noisi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
86
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 63 publications
(94 citation statements)
references
References 34 publications
(45 reference statements)
0
86
0
Order By: Relevance
“…Similar monitoring activities have been carried out using different strategies. For example, with specific regard to citation (biblio)metrics, Hutchins et al built a machine learning system suitable for use to detect whether a paper is likely to be cited by a future clinical trial or guideline, in an effort to predict translational research progress [186].…”
Section: Discussionmentioning
confidence: 99%
“…Similar monitoring activities have been carried out using different strategies. For example, with specific regard to citation (biblio)metrics, Hutchins et al built a machine learning system suitable for use to detect whether a paper is likely to be cited by a future clinical trial or guideline, in an effort to predict translational research progress [186].…”
Section: Discussionmentioning
confidence: 99%
“…We developed a prototype machine learning pipeline, described below, to identify, parse, and resolve references from such full-text articles for inclusion in the NIH-OCC. Finally, once citations are resolved, these are entered into our data processing pipelines for calculating downstream metrics like the Relative Citation Ratio [10] and the Approximate Potential to Translate [23].…”
Section: Descriptionmentioning
confidence: 99%
“…With the release of this NIH-OCC for biomedicine at large, the subsequent work that draws upon NIH-supported discoveries is now visible as well, and barriers to entry for scientometric studies will be reduced. The science-of-science community has illustrated the high value of link-level citation data (as opposed to aggregated citation measures), e.g., using such information to discover principles of citation dynamics [1921], quantify the influence of model organism research on human studies [22], and predict the transmission of knowledge from basic research into clinical studies [23]. Thus, comprehensive open citation data can both enable the attribution of scientific progress and convey foreknowledge that research discoveries will culminate in downstream applications.…”
Section: Introductionmentioning
confidence: 99%
“…After merging, we used the resulting 606.6 million document-document links to cluster the documents into 28,889 clusters using the Leiden algorithm 10 . The second major step is to characterize each cluster using the document level metadata along with US patent reference data, US National Institutes of Health (NIH) and National Science Foundation (NSF) project data from Star Metrics, paper-to-project link tables from NIH ExPORTER, and additional metrics from the NIH iCite2.0 database 11,12 . We also created a visual map of the clusters using the OpenOrd layout routine 13 and cluster-level relatedness.…”
Section: Background and Summarymentioning
confidence: 99%