Automating document classification with distant supervision to increase the efficiency of systematic reviews: A case study on identifying studies with HIV impacts on female sex workers

Li, Xiaoxiao; Zhang, Amy; Al-Zaidy, Rabah A.; Rao, Amrita; Baral, Stefan; Bao, Le; Giles, C. Lee

doi:10.1371/journal.pone.0270034

Cited by 2 publications

(1 citation statement)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As an input to machine learning models, most often bag-of-words (BOW) text representations were applied ( N = 30/89, 33.7%) [ 32 , 41 , 52 , 54 – 56 , 59 , 61 , 68 , 72 , 82 , 84 , 85 , 87 , 89 , 92 , 93 , 95 , 96 , 100 , 106 , 108 , 110 , 112 , 114 , 115 , 119 – 122 ], followed by term-frequency/inverse document frequency (TF-IDF) ( N = 16/89, 18.0%) [ 45 , 53 , 57 , 60 , 63 , 66 , 68 , 73 , 76 , 83 , 91 , 109 , 115 , 116 , 122 , 123 ], topic models ( N = 10/89, 11.2%) [ 45 , 60 , 84 , 86 , 91 , 93 , 104 , 107 , 109 , 115 , 123 ], keywords ( N = 9, 10.1%) [ 52 , 75 , 76 , 91 , 98 , 100 , 117 , 123 , 127 ], standardized terms such as Medical Subject ...…”

Section: Resultsmentioning

confidence: 99%

Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed

Tóth,

Berek,

Gulácsi

et al. 2024

Syst Rev

View full text Add to dashboard Cite

Background The demand for high-quality systematic literature reviews (SRs) for evidence-based medical decision-making is growing. SRs are costly and require the scarce resource of highly skilled reviewers. Automation technology has been proposed to save workload and expedite the SR workflow. We aimed to provide a comprehensive overview of SR automation studies indexed in PubMed, focusing on the applicability of these technologies in real world practice. Methods In November 2022, we extracted, combined, and ran an integrated PubMed search for SRs on SR automation. Full-text English peer-reviewed articles were included if they reported studies on SR automation methods (SSAM), or automated SRs (ASR). Bibliographic analyses and knowledge-discovery studies were excluded. Record screening was performed by single reviewers, and the selection of full text papers was performed in duplicate. We summarized the publication details, automated review stages, automation goals, applied tools, data sources, methods, results, and Google Scholar citations of SR automation studies. Results From 5321 records screened by title and abstract, we included 123 full text articles, of which 108 were SSAM and 15 ASR. Automation was applied for search (19/123, 15.4%), record screening (89/123, 72.4%), full-text selection (6/123, 4.9%), data extraction (13/123, 10.6%), risk of bias assessment (9/123, 7.3%), evidence synthesis (2/123, 1.6%), assessment of evidence quality (2/123, 1.6%), and reporting (2/123, 1.6%). Multiple SR stages were automated by 11 (8.9%) studies. The performance of automated record screening varied largely across SR topics. In published ASR, we found examples of automated search, record screening, full-text selection, and data extraction. In some ASRs, automation fully complemented manual reviews to increase sensitivity rather than to save workload. Reporting of automation details was often incomplete in ASRs. Conclusions Automation techniques are being developed for all SR stages, but with limited real-world adoption. Most SR automation tools target single SR stages, with modest time savings for the entire SR process and varying sensitivity and specificity across studies. Therefore, the real-world benefits of SR automation remain uncertain. Standardizing the terminology, reporting, and metrics of study reports could enhance the adoption of SR automation techniques in real-world practice.

show abstract

Section: Resultsmentioning

confidence: 99%

Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed

Tóth,

Berek,

Gulácsi

et al. 2024

Syst Rev

View full text Add to dashboard Cite

show abstract

Automation of systematic reviews of biomedical literature: a systematic review of studies indexed in PubMed

Tóth,

Berek,

Gulácsi

et al. 2023

Preprint

View full text Add to dashboard Cite

Background The demand for high quality systematic literature reviews (SLRs) is growing for evidence-based medical decision making. SLRs are costly and require the scarce resource of highly skilled reviewers. Automation technology has been proposed to save workload and expedite the SLR workflow. Objectives We aimed to provide a comprehensive overview of SLR automation studies indexed in PubMed, focusing on the applicability of these technologies in real world practice. Methods In November 2022, we ran a combined search syntax of four published SLRs on SLR automation. Full-text English peer-reviewed articles were included if they reported Studies on SLR Automation Methods (SSAM), or Automated SLRs (ASLR). Bibliographic analyses and knowledge-discovery studies were excluded. Record screening was performed by single reviewers, the selection of full text papers was performed in duplicate. We summarized the publication details, automated review stages, automation goals, applied tools, data sources, methods, results and Google Scholar citations of SLR automation studies. Results From 5321 records screened by title and abstract, we included 123 full text articles, out of which 108 were SSAMs and 15 ASLRs. Automation was applied for search, record screening, full-text selection, data extraction, risk of bias assessment, evidence synthesis, assessment of evidence quality and reporting in 19 (15.4%), 89 (72.4%), 6 (4.9%), 13 (10.6%), 9 (7.3%), 2 (1.6%), 2 (1.6%), and 2 (1.6%) studies, respectively. Multiple SLR stages were automated by 11 (8.9%) studies. The performance of automated record screening varied largely across SLR topics. In published ASLRs we found examples of automated search, record screening, full-text selection and data extraction. In some ASLRs automation complemented fully manual reviews to increase sensitivity rather than to save workload. Reporting of automation details were often incomplete in ASLRs. Conclusions Automation techniques are being developed for all SLRs stages, but with limited real-world adoption. Most SLR automation tools target single SLR stages, with modest time savings for the entire SLR process and varying sensitivity and specificity across studies. Therefore, the real-world benefits of SLR automation remain uncertain. Standardizing the terminology, reporting, and metrics of study reports could enhance the adoption of SLR automation techniques in real-world practice.

show abstract

Automating document classification with distant supervision to increase the efficiency of systematic reviews: A case study on identifying studies with HIV impacts on female sex workers

Cited by 2 publications

References 39 publications

Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed

Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed

Automation of systematic reviews of biomedical literature: a systematic review of studies indexed in PubMed

Contact Info

Product

Resources

About