34. Corpus of Connecting Nihongo Utterance and Text (Coco-Nut)

Data DOI

https://doi.org/10.32130/src.Coco-Nut

Producer, Project

Aya Watanabe and Prof. Shinnosuke Takamichi, The University of Tokyo

Contents

This corpus consists of Japanese speech, their transcriptions, and their characteristics prompts (free-form descriptions that express characteristics of speech).

The speech was gathered from the YouTube as 24kHz mp3 and converted into 44.1kHz wav. This corpus contains 7,330, 8-hour (in total) speech. The characteristics prompts were collected through crowdsourcing. The number of prompts is 1 per utterance in training data, and 5 per utterance in validation/test data.

The characteristics prompts are provided in the creator's github repository. NII-SRC provides the speech data and their transcriptions.

Speaker

7,330 speakers in total

Speech file format

WAV format (44.1 kHz, 16 bit, Stereo)

Distribution media

1 DVD(DL)

Licensing

For research purpose only

Price

No fee

Further information

https://sites.google.com/site/shinnosuketakamichi/research-topics/coconut_corpus

Speech sample for test listening

Go to corpora list