Proceedings of the 21st Workshop on Biomedical Language Processing 2022
DOI: 10.18653/v1/2022.bionlp-1.34
|View full text |Cite
|
Sign up to set email alerts
|

DISTANT-CTO: A Zero Cost, Distantly Supervised Approach to Improve Low-Resource Entity Extraction Using Clinical Trials Literature

Abstract: PICO recognition is an information extraction task for identifying participant, intervention, comparator, and outcome information from clinical literature. Manually identifying PICO information is the most time-consuming step for conducting systematic reviews (SR), which is already labor-intensive. A lack of diversified and large, annotated corpora restricts innovation and adoption of automated PICO recognition systems. The largest-available PICO entity/span corpus is manually annotated which is too expensive … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Vocabularies are structured, standardized data sources that do not capture writing variations from clinical literature and custom-built ReGeX are restricted by either task or entity type. 35 , 36 We used distant supervision dictionaries created from the structured fields of clinicaltrials.gov (CTO) as described by Dhrangadhariya and Müller 22 Principal investigators of the clinical study manually enter data in CTO, thereby incorporating large-scale writing variations. 37 …”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Vocabularies are structured, standardized data sources that do not capture writing variations from clinical literature and custom-built ReGeX are restricted by either task or entity type. 35 , 36 We used distant supervision dictionaries created from the structured fields of clinicaltrials.gov (CTO) as described by Dhrangadhariya and Müller 22 Principal investigators of the clinical study manually enter data in CTO, thereby incorporating large-scale writing variations. 37 …”
Section: Methodsmentioning
confidence: 99%
“…One of these approaches only explores distant supervision for intervention extraction using a single labelling source. 22 The other approach studies weak supervision for PICO span extraction but still utilizes some supervised annotation signals about whether a sentence includes PICO information. 23 …”
Section: Introductionmentioning
confidence: 99%