Prospectus

Speech and language data are widely acknowledged to be indispensable to promoting speech and language research. Such data need to be of much variety. Recently, large amounts of data have become needed for use in speech and language processing systems, the most common of which use statistical methods. It has become possible to evaluate the performance and set new research objectives based on the obtained results using the real data, as the studies on speech and language processing technologies have developed recently. Moreover, it is necessary to objectively compare the performance of various methods to promote research and development of speech and language processing systems. The best way to conduct such comparisons, given our current knowledge, is to use each method or system to process a common body of data and to then compare the results.

To enable such endeavors, it is necessary to collect and keep large amounts of speech and language data of various kinds. These resources must be open to the public so that they may be utilized for research and development and system performance assessment. A collection of data to be used for this purpose is called a speech/language database or a speech/language corpus, as it is well known. Recently, the necessity and the significance of speech and language corpora have been acknowledged widely, but in the past, individual researchers recorded speech data or collected language data, storing and using them as needed. Each research institute has collected more or less similar speech and language data, though doing so separately at each institute cost much time and money. Preparing a common framework has come to be considered necessary so as to create, collect, store, distribute, and share speech and language data in order to develop speech and language studies and related areas.

With this background, the Linguistic Data Consortium (LDC) was established in 1992. LDC is an open consortium of universities, companies, and government research laboratories. Over 100 institutions have joined the consortium; most of them are from the U.S. The consortium creates, collects, and distributes speech and text databases, lexicons, and other resources for research and development purposes. The European Language Resources Association (ELRA) was established as a non-profit organization in 1995. It is the driving force to make available the language resources for language engineering and to evaluate language engineering technologies.

It has become possible worldwide to utilize speech and language data of English and European languages owing to the establishment of LDC and ELRA. However, the domestic supply system of spoken and written Japanese data has not been established, and only a small amount of this data is available for use from overseas. Not only domestic but also overseas researchers are interested in Japanese speech and language data, but the requests from abroad for available speech and language data cannot be met in the present state of affairs.

Although the necessity of shared speech data has long been acknowledged, their realization has been slow to develop in Japan. Owing to the need to prepare a systematic, common framework for collecting, creating, storing, distributing, and sharing speech and language data in order to secure progress in future research, the Linguistic Resources Sharing Initiative (LRSI) was launched in 1994 and later GSK (Gengo Shigen Kyookai, Language Resources Association) was established in 1999. However, these efforts were not able to function as expected. GSK was renovated as an NPO in 2003; a 3-year project was adopted in 2005 for financially supporting its activity. The association plans to concentrate mostly on text corpora.

The National Institute of Informatics (NII), both as the national center of informatics and as one of the inter-university research institutes of the Inter-University Research Institute Corporation, aims to deepen the field of informatics, to create future value by informatics, to construct an infrastructure for scientific information based on a scientific information network as well as the contents of that network, and to contribute to the scientific community as a whole. As a part of promoting these missions, NII has decided to initiate the Speech Resources Consortium (SRC) toward creation of future value in information media, especially speech media. NII will promote this consortium together with GSK.

Objective

SRC aims at collection, distribution, investigation, research, and standardization of electronic data and software tools that are necessary for the development of science, education, and industry concerning speech. The consortium will contribute to the development of information society through these activities.

Activities

Investigating present speech resources and making their catalogues.
Requesting research institutions to offer the present speech resources to SRC.
Publicity, distribution, and promotion of speech resources.
Standardizing speech resources.
Preparing standardized contract form for collection and distribution of speech resources.
Additional production of frequently requested speech resources.
Creating and redistributing revised versions of already distributed resources.
Analyzing and processing speech resources.
Designing and creating new speech resources.
Investigating and researching speech resources.
Cooperating with similar overseas organizations.
Other services necessary to perform the objectives of SRC.

Organization

Associate Professor: Junichi YAMAGISHI
Project Researcher: Tomoko OHSUGA
Staff: Ayako NOZAWA
Adviser: Shuichi ITAHASHI
: Nobutaka ONO
: Yuichi ISHIMOTO

Committee

To promote the development of SRC, a committee called the Speech Resources Promotion Committee is constituted in the SRC. Around 20 members are invited from the fields of speech processing, linguistics, acoustics, speech and language corpus creation, speech and language resource provision, and the judicial circle.

Prof. Shigeaki AMANO: Aichi Syukutoku University
Dr. Shoko ARAKI: NTT Communication Science Laboratories
Prof. Hitoshi ISAHARA: Otemon Gakuin University
Prof. Yuichi ISHIMOTO: Institute of Technologists
Dr. Kiyotaka UCHIMOTO: National Institute of Information and Communications Technology
Dr. Tomoko OHSUGA: National Institute of Informatics
Prof. Nobutaka ONO: Tokyo Metropolitan University
Dr. Takehiko KAGOSHIMA: Corporate Research & Development Center, Toshiba Corporation
Prof. Hideaki KIKUCHI: Waseda University
Prof. Norihide KITAOKA: Toyohashi University of Technology
Prof. Hanae KOISO: National Institute for Japanese Language and Linguistics
Prof. Shoichi KOYAMA: National Institute of Informatics
Prof. Sakriani SAKTI: Nara Institute of Science and Technology
Prof. Koiti HASIDA: Institute of Physical and Chemical Research
Prof. Satoru HAYAMIZU: Waseda University
Prof. Tomoko MATSUI: The Institute of Statistical Mathematics
Prof. Nobuaki MINEMATSU: The University of Tokyo
Prof. Junichi YAMAGISHI: National Institute of Informatics
Dr. Hitoshi YAMAMOYO: Data Science Research Laboratory, NEC Corporation

(update: 2024-05-01)

Go to top of page