SleepQA: A health coaching dataset on sleep for extractive question answering

Bojic I., Ong QC., Thakkar M., Kamran E., Le Shua IY., Pang JRE., Chen J., Nayak V., Joty S., Car J.

Question Answering (QA) systems can support health coaches in facilitating clients' lifestyle behavior changes (e.g., in adopting healthy sleep habits). In this paper, we design a domain-specific QA pipeline for sleep coaching. To this end, we release SleepQA, a dataset created from 7,005 passages comprising 4,250 training examples with single annotations and 750 examples with 5-way annotations1. We fine-Tuned different domain-specific BERT models on our dataset and perform extensive automatic and human evaluation of the resulting end-To-end QA pipeline. Comparisons of our pipeline with baseline show improvements in domain-specific natural language processing on realworld questions. We hope that this dataset will lead to wider research interest in this important health domain.

Type

Conference paper

Publication Date

2022-01-01T00:00:00+00:00

Volume

193

Pages

199 - 217

Total pages

18

Permalink Close