A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets

Bojic, Iva; Halim, Josef; Suharman, Verena; Tar, Sreeja; Ong, Qi Chwen; Phung, Duy; Ravaut, Mathieu; Joty, Shafiq; Car, Josip

doi:10.18653/v1/2023.insights-1.3

The Fourth Workshop on Insights From Negative Results in NLP 2023


Abstract: Low-quality data can cause downstream problems in high-stakes applications. The data-centric approach emphasizes improving dataset quality to enhance model performance. High-quality datasets are needed for training general-purpose Large Language Models (LLMs), as well as for domain-specific models, whose training datasets are usually small because it is costly to engage a large number of domain experts for their creation. Thus, it is vital to ensure high-quality domain-specific training data. In this paper, we propose a framew…


Cited by 1 publication

References 29 publications


A pilot randomised controlled trial exploring the feasibility and efficacy of a human-AI sleep coaching model for improving sleep among university students

Liu, Ito, Ngo et al. 2024

DIGITAL HEALTH

Self Cite


Objective: Sleep quality is a crucial concern, particularly among youth. Integrating health coaching with question-answering (QA) systems has the potential to foster behavioural change and enhance health outcomes. This study proposes a novel human-AI sleep coaching model, combining health coaching by peers with a QA system, and assesses its feasibility and efficacy in improving university students’ sleep quality. Methods: In a four-week unblinded pilot randomised controlled trial, 59 university students (mean age: 21.9; 64% male) were randomly assigned to the intervention condition (health coaching and QA system; n = 30) or the control condition (QA system only; n = 29). Outcomes included the efficacy of the intervention on sleep quality (Pittsburgh Sleep Quality Index; PSQI), objective and self-reported sleep measures (obtained from Fitbit and sleep diaries), and the feasibility of the study procedures and the intervention. Results: Analysis revealed no significant difference in sleep quality (PSQI) between the intervention and control groups (adjusted mean difference = −0.51, 95% CI: [−1.55, 0.77], p = 0.40). Compared to the control group, the intervention group demonstrated significant improvements in Fitbit measures of total sleep time (adjusted mean difference = 32.5, 95% CI: [5.9, 59.1], p = 0.02) and time in bed (adjusted mean difference = 32.3, 95% CI: [2.7, 61.9], p = 0.03), although differences in other sleep measures were not significant. Adherence was high, with the majority of the intervention group attending all health coaching sessions. Most participants completed baseline and post-intervention self-report measures, completed all diary entries, and consistently wore Fitbits during sleep. Conclusions: The proposed model showed improvements in specific sleep measures for university students and demonstrated the feasibility of the study procedures and intervention. Future research may extend the intervention period to see substantive improvements in sleep quality.

