NTT Infant Speech Database (INFANT)
Data DOI
https://doi.org/10.32130/src.INFANT
Producer, Project
Shigeaki AMANO, Tadahisa KONDO, and Kazumi KATO, NTT Communication Science Laboratories
Contents
Completely spontaneous speech of 5 infants from 3 families, all native speakers of Japanese. Each child was recorded for one hour or more per month, from birth up to the age of 5 years.
This database includes the following data:
- Speech wave (16 kHz, 16 bit, Mono)
- Transcribed text (mixed kanji-kana text and katakana text)
- Utterance attributes (Speaker gender, utterance environment, speech volume, etc.)
- Utterance time information
- Comments (paralinguistic information, etc.)
- Voiced/unvoiced flag
- Fundamental frequency (F0) information
- Phoneme labels
Speaker
Native speakers of Japanese: the children (2 boys and 3 girls from 3 families) and their parents.
Speaker ID | Age (months) | Period (months) | Time (hours) | Repetitions | Frequency |
---|---|---|---|---|---|
A | 0–30 | 25 | 161 | 316 | ≥ 1 hour/month |
B | 0–54 | 50 | 140 | 720 | ≥ 1 hour/month |
C | 0–60 | 61 | 68 | 398 | 15 minutes/week |
D | 0–60 | 61 | 66 | 290 | 15 minutes/week |
E | 0–59 | 50 | 106 | 691 | ≥ 1 hour/month |
Total | | 247 | 541 | 2415 | |
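As a quick consistency check on the table above, the following Python sketch sums the per-speaker columns and compares them with the Total row. The figures are copied from the table; the variable names are illustrative only and are not part of the database.

```python
# Per-speaker figures copied from the table above:
# (period in months, recorded time in hours, repetitions)
speakers = {
    "A": (25, 161, 316),
    "B": (50, 140, 720),
    "C": (61, 68, 398),
    "D": (61, 66, 290),
    "E": (50, 106, 691),
}

# Sum each column across the five speakers.
total_period = sum(period for period, _, _ in speakers.values())
total_hours = sum(hours for _, hours, _ in speakers.values())
total_repetitions = sum(reps for _, _, reps in speakers.values())

print(total_period, total_hours, total_repetitions)  # 247 541 2415
```

The three sums agree with the Total row, which confirms that its values belong to the Period, Time, and Repetitions columns.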
Speech file format
WAV format (16 kHz, 16 bit, Mono)
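A minimal sketch of how a file could be checked against this format, using only Python's standard `wave` module. The function names are hypothetical, and the layout and naming of files on the distribution media are not described here, so any path passed in is the user's own assumption.

```python
import wave


def matches_infant_format(path):
    """Return True if the WAV file is 16 kHz, 16-bit, mono."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == 16000   # 16 kHz sampling rate
            and wav.getsampwidth() == 2   # 16 bit = 2 bytes per sample
            and wav.getnchannels() == 1   # mono
        )


def duration_seconds(path):
    """Return the duration of the WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

At this format, uncompressed audio takes 16,000 × 2 = 32,000 bytes per second, or about 115 MB per hour.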
Distribution media
3 BD-DL
Licensing
For research and development purposes only
Price
11,000 yen (plus consumption tax for domestic orders)
Speech sample for test listening
Girl / Recording period: 0–60 months
- At the age of 0 months
  - アーアー 。
- At the age of 12 months
  - アイ 。
- At the age of 24 months
  - ちょうだいの カウ 。
- At the age of 36 months
  - チュイティーモーダ 。
- At the age of 48 months
  - あの 人 そうでしょう 。
- At the age of 59 months
  - なんか 暑くなってきた 。