9. IPSJ SIG-SLP Corpora and Environments for Noisy Speech Recognition

9-f. Reverberant Speech Recognition Evaluation Environment (CENSREC-4)

Producer, Project

Noisy Speech Recognition Evaluation Working Group, Special Interest Group on Spoken Language Information Processing, Information Processing Society of Japan (IPSJ)


A common platform for independently evaluating speech recognition accuracy and speech interval detection in noisy environments.

CENSREC-4 targets the evaluation of distant-talking speech recognition in various reverberant environments. As in CENSREC-1, the data consist of connected digit utterances.

Recording environment

In-car, Office, Meeting room, Lounge, Elevator hall, Living room, Japanese style room, Japanese style bath

(Real environment data: In-car, Office, Meeting room, Lounge)


Speech file format

RAW format (16 kHz, 16-bit, mono, big-endian)
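
Because RAW files carry no header, the sampling rate, sample width, and byte order stated above have to be supplied by whoever reads them. The following is a minimal Python sketch of one way to decode such a file. It assumes the samples are signed integers (usual for 16-bit PCM, but not stated here); the function names are illustrative and not part of the CENSREC-4 distribution.

```python
import struct

SAMPLE_RATE_HZ = 16000   # 16 kHz, taken from the format description
BYTES_PER_SAMPLE = 2     # 16-bit samples


def decode_raw_pcm(data: bytes) -> list[int]:
    """Decode headerless 16-bit big-endian mono PCM into integers.

    Assumption: samples are signed (the format line does not say so).
    """
    if len(data) % BYTES_PER_SAMPLE != 0:
        raise ValueError("byte count is not a multiple of the sample size")
    n_samples = len(data) // BYTES_PER_SAMPLE
    # ">" selects big-endian byte order; "h" is a signed 16-bit integer.
    return list(struct.unpack(f">{n_samples}h", data))


def duration_seconds(samples: list[int]) -> float:
    """Length of a mono recording in seconds at the stated sampling rate."""
    return len(samples) / SAMPLE_RATE_HZ


def read_raw_file(path: str) -> list[int]:
    """Read a whole RAW file; the path is supplied by the caller."""
    with open(path, "rb") as f:
        return decode_raw_pcm(f.read())
```

Byte order matters on little-endian machines: the two bytes 0x00 0x01 decode to 1 as big-endian but would read as 256 if interpreted as little-endian, which typically makes the audio sound like noise.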

Distribution media



For research and development purposes only


No fee

Speech sample for test listening

Basic data set: digit strings as in CENSREC-1 under reverberant conditions

utterance        | clean | test set A    | test set B
1 (/ichi/)       | clean | Office        | Lounge
1 0 (/ichizero/) | clean | Elevator hall | Japanese style room
5 (/go/)         | clean | In-car        | Meeting room
3 (/saN/)        | clean | Living room   | Japanese style bath
