5. Real World Computing Project (RWCP) Speech Corpora

5-c. RWCP News Speech Corpus (RWCP-SP99)

Data DOI

https://doi.org/10.32130/src.RWCP-SP99

Producer, Project

RWCP (Real World Computing Partnership)

RWCP Intellectual Resources Working group

Contents

Professional announcers read broadcast news articles.

A professional broadcast reporter wrote each draft based on an actual event. The announcer first read the draft silently once, then read it aloud once while imagining an actual broadcast.

Each speaker read about 40 news articles (30 independent articles + 10 articles common to all speakers) and set A (50 sentences) of ATR's 503 phonetically balanced sentences.

Speaker

Speech file format

RAW format (16 kHz, 16 bit, Mono, BigEndian)

WAV format (16 kHz, 16 bit, Mono), from the Aug. 2009 renewal
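Because the RAW files are headerless, a reader has to supply the parameters listed above (16 kHz, 16 bit, mono, big-endian) itself. The following is a minimal sketch of decoding such data and writing it back out as WAV, using only the Python standard library. The function names are hypothetical and are not part of the corpus distribution, and the sketch assumes the samples are signed integers, which the listing does not state explicitly.

```python
import array
import sys
import wave

# Parameters taken from the format description above.
SAMPLE_RATE_HZ = 16000
SAMPLE_WIDTH_BYTES = 2
CHANNELS = 1


def raw_be_to_samples(raw_bytes):
    """Decode headerless big-endian 16-bit PCM into native signed integers.

    Assumption: samples are signed 16-bit values.
    """
    if len(raw_bytes) % SAMPLE_WIDTH_BYTES != 0:
        raise ValueError("byte count is not a multiple of the sample width")
    samples = array.array("h")
    samples.frombytes(raw_bytes)
    # frombytes() interprets bytes in host order, so swap on little-endian hosts.
    if sys.byteorder == "little":
        samples.byteswap()
    return samples


def write_wav(samples, wav_path):
    """Write samples as a 16 kHz, 16-bit, mono WAV file."""
    out = array.array("h", samples)
    # WAV stores PCM little-endian, so swap on big-endian hosts.
    if sys.byteorder == "big":
        out.byteswap()
    with wave.open(wav_path, "wb") as wav_file:
        wav_file.setnchannels(CHANNELS)
        wav_file.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav_file.setframerate(SAMPLE_RATE_HZ)
        wav_file.writeframes(out.tobytes())


def duration_seconds(samples):
    """Return the duration implied by the sample count at 16 kHz."""
    return len(samples) / SAMPLE_RATE_HZ
```

In use, one would read a RAW file in binary mode, pass its bytes to `raw_be_to_samples`, and hand the result to `write_wav`. The actual file names and directory layout on the DVD are not described here, so any path is a placeholder.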

Distribution media

1 DVD

Licensing

For research purposes only

Price

No fee

Further information

PDF file

Note

All documents are written in Japanese.

Speech sample for test listening
