Abstract
We present a corpus of simulated counselling sessions consisting of speech- and text-based dialogs in Cantonese. Consisting of 152K Chinese characters, the corpus labels the dialog act of both client and counsellor utterances, segments each dialog into stages, and identifies the forward and backward links in the dialog. We analyze the distribution of client and counsellor communicative intentions in the various stages, and discuss significant patterns of the dialog flow.
Original language | English |
---|---|
Title of host publication | Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022) |
Publisher | European Language Resources Association (ELRA) |
Pages | 5723-5730 |
ISBN (Electronic) | 979-10-95546-72-6 |
Publication status | Published - Jun 2022 |
Event | 13th Conference on Language Resources and Evaluation (LREC 2022) - Palais du Pharo, Marseille, France Duration: 20 Jun 2022 → 25 Jun 2022 https://lrec2022.lrec-conf.org/en/ |
Publication series
Name | 2022 Language Resources and Evaluation Conference, LREC 2022 |
---|
Conference
Conference | 13th Conference on Language Resources and Evaluation (LREC 2022) |
---|---|
Country/Territory | France |
City | Marseille |
Period | 20/06/22 → 25/06/22 |
Internet address |
Research Keywords
- dialog act
- dialog corpus
- counselling
- Cantonese
Publisher's Copyright Statement
- This full text is made available under CC-BY-NC 4.0. https://creativecommons.org/licenses/by-nc/4.0/deed.en