SleepQA: A health coaching dataset on sleep for extractive question answering
Bojic I., Ong QC., Thakkar M., Kamran E., Le Shua IY., Pang JRE., Chen J., Nayak V., Joty S., Car J.
Question Answering (QA) systems can support health coaches in facilitating clients' lifestyle behavior changes (e.g., in adopting healthy sleep habits). In this paper, we design a domain-specific QA pipeline for sleep coaching. To this end, we release SleepQA, a dataset created from 7,005 passages comprising 4,250 training examples with single annotations and 750 examples with 5-way annotations1. We fine-Tuned different domain-specific BERT models on our dataset and perform extensive automatic and human evaluation of the resulting end-To-end QA pipeline. Comparisons of our pipeline with baseline show improvements in domain-specific natural language processing on realworld questions. We hope that this dataset will lead to wider research interest in this important health domain.
