2. University of Tsukuba Multilingual Speech Corpus (UT-ML)

Producer, Project

Machine Intelligence and Biomedical Engineering Laboratory, University of Tsukuba

Special Research Project for the Typological Investigation of Languages and Cultures of the East and West, University of Tsukuba, 1998-2002

Contents

Isolated word (50 items)

  1. Digits (14 items)
  2. Month names of the calendar month (12 items)
  3. Seven day of the week (7 items)
  4. Words on weather (4 items)
  5. Phrases of greeting (6 items)
  6. Words of reply (3 items)
  7. Words of time(4 items)

Continuous speech

  1. Aesop's Fables "The North Wind and The Sun"

Speaker

98 people from 11 countries

Arabic 5 people (3 males and 2 females)
Chinese 14 people (7 males and 7 females)
English 8 people (4 males and 4 females)
French 8 people (5 males and 3 females)
German 8 people (4 males and 4 females)
Indonesian8 people (6 males and 2 females)
Japanese13 people (7 males and 6 females)
Korean 10 people (6 males and 4 females)
Russian 5 people (3 males and 2 females)
Spanish 8 people (3 males and 5 females)
Thai 14 people (7 males and 7 females)

Recording environment

Soundproof room

Speech file format

WAV format (16 kHz, 16 bit, Mono)

Distribution media

1 CD-ROM

Licensing

For research purpose only

Price

No fee

Further information

PDF file

Note

All documents are written in Japanese.

Speech sample for test listening

Go to corpora list