NTCIR-14 STC-3

NTCIR-14 STC-3 (Short Text Conversation Task)

Test Collection

Dialogue Quality and Nugget Detection (DQND) Subtasks

Recently, many reserachers are trying to build automatic helpdesk systems. However, there are very few methods to evaluate such systems.
In STC-3 NDDQ subtasks, we aim to explore methods to evaluate customer-helpdesk dialogues automatically. This dataset have the following features:

Chinese customer-helpdesk dialogues carwled from Weibo.
English dialgoues: manually translated from a subset of the Chinese dialgoues.
Nugget type annotatoins for each turn: indicate whether the current turn is useful to accomplish the task.
Quality annotation for each dialogue.
- task accomplishment
- customer satisfcation
- dialogue effectiveness

In NTCIR-14 STC3-NDDQ, we consider annotations ground truth, and participants are required to predict nugget type for each turn (Nugget Detection, or ND) and dialogue quality for each dialogue (Dialogue Quality, or DQ).

Task data