The Fourth Workshop on Insights From Negative Results in NLP 2023
DOI: 10.18653/v1/2023.insights-1.3
|View full text |Cite
|
Sign up to set email alerts
|

A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets

Iva Bojic,
Josef Halim,
Verena Suharman
et al.

Abstract: Low-quality data can cause downstream problems in high-stakes applications. Data-centric approach emphasizes on improving dataset quality to enhance model performance. Highquality datasets are needed for general-purpose Large Language Models (LLMs) training, as well as for domain-specific models, which are usually small in size as it is costly to engage a large number of domain experts for their creation. Thus, it is vital to ensure high-quality domain-specific training data. In this paper, we propose a framew… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
references
References 29 publications
0
0
0
Order By: Relevance