2. University of Tsukuba Multilingual Speech Corpus (UT-ML)
Data DOI
https://doi.org/10.32130/src.UT-ML
Producer, Project
Machine Intelligence and Biomedical Engineering Laboratory, University of Tsukuba
Special Research Project for the Typological Investigation of Languages and Cultures of the East and West, University of Tsukuba, 1998-2002
Contents
Isolated words (50 items)
- Digits (14 items)
- Names of the calendar months (12 items)
- Days of the week (7 items)
- Words about weather (4 items)
- Phrases of greeting (6 items)
- Words of reply (3 items)
- Words about time (4 items)
Continuous speech
- Aesop's Fables "The North Wind and The Sun"
Speaker
98 people from 11 countries
Language | Speakers |
---|---|
Arabic | 5 people (3 males and 2 females) |
Chinese | 14 people (7 males and 7 females) |
English | 8 people (4 males and 4 females) |
French | 8 people (5 males and 3 females) |
German | 8 people (4 males and 4 females) |
Indonesian | 8 people (6 males and 2 females) |
Japanese | 13 people (7 males and 6 females) |
Korean | 10 people (6 males and 4 females) |
Russian | 5 people (3 males and 2 females) |
Spanish | 8 people (3 males and 5 females) |
Thai | 14 people (7 males and 7 females) |
Recording environment
Soundproof room
Speech file format
WAV format (16 kHz, 16 bit, Mono)
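A minimal sketch of how a user might confirm that a speech file matches the format stated above, using Python's standard `wave` module. The function name `matches_corpus_format` and the constant names are hypothetical and are not part of the corpus distribution; the check assumes uncompressed PCM WAV files.

```python
import wave

# Expected parameters, taken from the format line above.
EXPECTED_RATE_HZ = 16000         # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bit
EXPECTED_CHANNELS = 1            # mono


def matches_corpus_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling `matches_corpus_format("sample.wav")`, where `sample.wav` is a placeholder file name, returns `True` only when all three parameters agree with the stated format.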
Distribution media
1 CD-ROM
Licensing
For research purposes only
Price
No fee
Further information
Note
All documents are written in Japanese.
Speech samples for test listening
- Arabic
- Chinese
- English
- French
- German
- Indonesian
- Japanese
- Korean
- Russian
- Spanish
- Thai