9. IPSJ SIG-SLP Corpora and Environments for Noisy Speech Recognition

9-c. Audio-Visual Speech Recognition Evaluation Environment (CENSREC-1-AV)

Noisy Speech Recognition Evaluation Working Group,

Special Interest Group on Spoken Language Information Processing,

Information Processing Society of Japan (IPSJ)

Common platform for evaluating independently speech recognition accuracy and speech interval detection under noisy environment.

An evaluation corpus for audio-visual speech recognition of continuously spoken single digits in Japanese.

The digit sequence of each utterance and the pronunciation of Japanese digits are the same as the CENSREC-1 (AURORA-2J) database.

Including color and infrared mouth images, which were recorded simultaneously with speech.

2 DVDs

For research and development purposes only

No fee

Digit strings same as CENSREC-1

Color and infrared mouth images recorded simultaneously with speech

Examples of color and infrared pictures (3 frame/sec, upper: color, lower: infrared)