18. Chinese MULTEXT Corpus (MULTEXT-C)

Data DOI

https://doi.org/10.32130/src.MULTEXT-C

Producer, Project

Assoc. Prof. Masahiko Komatsu, Health Sciences University of Hokkaido (now, Kanagawa University)

Contents

The Chinese version of Multilingual Text Tools and Corpora (MULTEXT).

The speakers were asked to read aloud the 40 passages (each passage includes 5 - 6 sentences) as naturally as possible.

Speaker

10 native speakers of Chinese (5 males and 5 females)

1 speaker read all 40 passages, and each of the other 9 speakers read 15 passages (each passage was read by 4 or 5 speakers).

Recording environment

Soundproof room

Speech file format

WAV format (22 050 Hz, 16 bit, Mono)

Distribution media

1 CD-ROM

Licensing

For research purpose only

Price

No fee

Speech sample for test listening

Chinese texts translated from the MULTEXT corpus and the Japanese MULTEXT corpus

「我刚到伦敦,而我的行李却去了罗马。因为我有糖尿病,明天必须拿到我的行李,请拜托有关负责人尽快帮我去找。这期间,我还需要一些应急药,麻烦您帮我联系一下儿。」

Go to corpora list