2020
DOI: 10.1101/2020.05.15.20103044
Preprint
Trialstreamer: a living, automatically updated database of clinical trial reports

Abstract: Objective Randomized controlled trials (RCTs) are the gold standard method for evaluating whether a treatment works in healthcare, but can be difficult to find and make use of. We describe the development and evaluation of a system to automatically find and categorize all new RCT reports. Materials and Methods Trialstreamer continuously monitors PubMed and the WHO International Clinical Trials Registry Platform (ICTRP), looking for new RCTs in humans using a validated classifier. We combine machine learning a…


Cited by 6 publications (13 citation statements)
References 19 publications
“…27 Further metrics included under 'Other' were odds ratios, 37 normalised discounted cumulative gain, 29 'sentences needed to screen per article' in order to find one relevant sentence, 38 McNemar test, 36 C-statistic (with 95% CI) and Brier score (with 95% CI). 39 Real-life evaluations, such as the percentage of outputs needing human correction, or time saved per article, were reported by one publication, 30 and an evaluation as part of a wider screening system was done in another. 40 There were several approaches and justifications for using macro- or micro-averaged precision, recall, or F1 scores in the included publications.…”
Section: Reported Performance Metrics Used For Evaluation
confidence: 99%
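The distinction between macro- and micro-averaged scores mentioned in the statement above can be made concrete with a small self-contained sketch (toy labels only; not data from any of the cited systems). Macro-averaging computes F1 per class and averages the results, weighting every class equally; micro-averaging pools true positives, false positives, and false negatives across classes before computing a single F1, so frequent classes dominate.

```python
from collections import Counter

def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0 when there are no positives."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def macro_micro_f1(y_true, y_pred):
    """Return (macro_f1, micro_f1) for a multi-class prediction.

    Macro: average the per-class F1 scores (every class weighted equally).
    Micro: pool TP/FP/FN over all classes first, then compute one F1.
    """
    classes = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in classes) / len(classes)
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    return macro, micro

# Toy example: a frequent class and a rare class the model always misses.
y_true = ["rct", "rct", "rct", "rct", "other"]
y_pred = ["rct", "rct", "rct", "rct", "rct"]
macro, micro = macro_micro_f1(y_true, y_pred)
# macro = 4/9 ≈ 0.44 (the missed rare class drags it down)
# micro = 0.8 (dominated by the frequent class)
```

The gap between the two numbers is exactly why the cited publications had to justify which average they reported: on imbalanced data the choice can change the headline score substantially.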
“…One, for example, applied their system to new, unlabelled data and reported that classifying the whole of PubMed takes around 20 hours using a graphics processing unit (GPU). 39 In another example, the authors reported using Google Colab GPUs, along with estimates of computing time for different training settings. 68 3.4.2.4 Is the source code available?…”
Section: Is There a Description Of The Hardware Used?
confidence: 99%
“…We developed a machine learning system which maintains a live database of annotated RCT reports, named Trialstreamer. We have described the computational methods and accuracy of the system components in detail elsewhere, 11 and summarise the key points relevant to the current study below.…”
Section: Methods
confidence: 99%
“… 14 These machine learning models are trained 1 on 280 000 abstracts manually labelled as being RCTs or not by Cochrane Crowd ( https://crowd.cochrane.org ), a collaborative citizen science project. We do not rely on the Publication Type index alone, as we have previously found it to miss a substantial proportion of the 5–7 most recent years of articles 11 (due to delay in manual indexing after publication). We next removed any RCTs that were not conducted in humans (eg, animal or agricultural studies) using an SVM model.…”
Section: Methods
confidence: 99%
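The methods statement above describes a two-stage filter: first classify whether an article is an RCT, then remove RCTs not conducted in humans. A minimal sketch of that pipeline shape, with trivial keyword stand-ins in place of the real trained models (the function names and keyword lists here are illustrative assumptions, not the actual Trialstreamer classifiers):

```python
def looks_like_rct(abstract: str) -> bool:
    # Stand-in for the RCT classifier trained on Cochrane Crowd labels.
    text = abstract.lower()
    return "randomized" in text or "randomised" in text

def is_human_study(abstract: str) -> bool:
    # Stand-in for the SVM that removes animal/agricultural studies.
    text = abstract.lower()
    return not any(word in text for word in ("mice", "rats", "crop"))

def filter_new_articles(abstracts):
    """Apply both stages in sequence: keep only RCTs in humans."""
    return [a for a in abstracts
            if looks_like_rct(a) and is_human_study(a)]

articles = [
    "A randomized trial of drug X in adults with hypertension.",
    "A randomised study of feed additives in rats.",
    "Observational cohort study of statin users.",
]
kept = filter_new_articles(articles)
# Only the first abstract passes both stages: the second fails the
# human-study filter, the third fails the RCT filter.
```

Staging the filters this way means the cheaper, broader RCT check runs first and the human-study model only sees articles that already look like trials.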