11. RIKEN Word Processor Operation Dialogue Speech Corpus (RIKEN-DLG)
Laboratory for Language-based Intelligent Systems, Brain Science Institute, RIKEN
a. Dialogues of request for document making
- A professional word processor operator creates documents on a computer while listening to the user's requirements (dialogues among a user, a secretary, an assistant, and a professional).
- The professional explains his work while watching a video recording of the document-making session (monologues by the professional).
b. Question-answer dialogues
- A user who creates documents by himself/herself asks the professional questions about operating the word processor (dialogues between a user and a professional).
Vols. 1-3: Speech data, transcribed text and database with morpheme tags
Vol. 4: Transcribed text and database with morpheme tags*1
Vol. 1: Dialogues of request for document making (9 dialogues and 9 monologues); no more than two hours per dialogue.
Vol. 2: Question-answer dialogues 2002-1 (18 dialogues); no more than one hour per dialogue.
Vol. 3: Question-answer dialogues 2002-2 (18 dialogues); no more than one hour per dialogue.
Vol. 4: Question-answer dialogues 2001 (15 dialogues); no more than two hours per dialogue.*1
A total of 129 speakers participated in the recording.
Speech file format
RAW format (16 kHz, 16 bit, Stereo, LittleEndian)*2
Vols. 1-3: 1 DVD each
Vol. 4: 1 CD-ROM
For research purposes only
*1 Vol. 4 does not contain speech data.
*2 Some of the monologues in Vol. 1 were recorded at 32 kHz, 16 bit, Mono.
Recording levels vary by recording year.
A table of sampling frequencies, numbers of channels, and recording conditions is included in Vol. 1.
All documents are written in Japanese.
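Because the speech files are headerless RAW data, a reader has to supply the format parameters listed above when decoding them. The following is a minimal Python sketch of one way to do that, not a tool shipped with the corpus. It assumes signed 16-bit samples and, for stereo files, interleaved channels (left, right, left, right, ...); neither detail is stated in this description, and the function names are hypothetical. The per-file parameters should be checked against the table included in Vol. 1.

```python
import array
import sys


def decode_raw_pcm(data: bytes, channels: int = 2) -> list:
    """Split headerless 16-bit little-endian PCM into one sample array per channel.

    Assumption: samples are signed and, for stereo, interleaved per frame.
    """
    frame_bytes = 2 * channels
    # Ignore a trailing partial frame, if any.
    usable = len(data) - (len(data) % frame_bytes)
    samples = array.array("h")  # signed 16-bit integers
    samples.frombytes(data[:usable])
    if sys.byteorder == "big":
        # The data is little-endian; swap bytes on big-endian hosts.
        samples.byteswap()
    return [samples[c::channels] for c in range(channels)]


def duration_seconds(num_bytes: int, sample_rate: int = 16000, channels: int = 2) -> float:
    """Duration implied by a file size for 16-bit PCM at the given rate."""
    return num_bytes / (sample_rate * channels * 2)
```

For the default 16 kHz stereo condition, each second of audio occupies 16000 x 2 channels x 2 bytes = 64,000 bytes. For the Vol. 1 monologues recorded at 32 kHz mono, the same functions would be called with `sample_rate=32000` and `channels=1`.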
Speech sample for test listening
Dialogues of request for document making
- 0238 R:
- 0239 R:
- 0240 R:
- 0241 R:
- 0242 R: