Prospectus

Speech and language data are widely acknowledged to be indispensable to speech and language research, and such data need to be highly varied. In recent years, large amounts of data have become necessary for speech and language processing systems, the most common of which use statistical methods. As speech and language processing technologies have developed, it has become possible to evaluate performance on real data and to set new research objectives based on the results. Moreover, promoting research and development of speech and language processing systems requires an objective comparison of the performance of various methods. The best way to conduct such comparisons, given our current knowledge, is to have each method or system process a common body of data and then compare the results.

To enable such endeavors, it is necessary to collect and maintain large amounts of speech and language data of various kinds. These resources must be open to the public so that they can be used for research and development and for system performance assessment. A collection of data used for this purpose is commonly called a speech/language database or a speech/language corpus. The necessity and significance of speech and language corpora are now widely acknowledged, but in the past individual researchers recorded speech data or collected language data themselves, storing and using them as needed. Research institutes each collected more or less similar speech and language data, and doing so separately cost each institute much time and money. A common framework for creating, collecting, storing, distributing, and sharing speech and language data has therefore come to be considered necessary for developing speech and language studies and related areas.

Against this background, the Linguistic Data Consortium (LDC) was established in 1992. LDC is an open consortium of universities, companies, and government research laboratories. Over 100 institutions have joined the consortium, most of them from the U.S. The consortium creates, collects, and distributes speech and text databases, lexicons, and other resources for research and development purposes. The European Language Resources Association (ELRA) was established as a non-profit organization in 1995. It serves as the driving force in making language resources available for language engineering and in evaluating language engineering technologies.

Owing to the establishment of LDC and ELRA, speech and language data for English and European languages can now be used worldwide. However, no domestic supply system for spoken and written Japanese data has been established, and only a small amount of such data is available for use from overseas. Researchers both in Japan and overseas are interested in Japanese speech and language data, but at present requests from abroad for such data cannot be met.

Although the necessity of shared speech data has long been acknowledged, such sharing has been slow to develop in Japan. To prepare a systematic, common framework for collecting, creating, storing, distributing, and sharing speech and language data, and thereby secure progress in future research, the Linguistic Resources Sharing Initiative (LRSI) was launched in 1994, and GSK (Gengo Shigen Kyookai, Language Resources Association) was later established in 1999. However, these efforts did not function as expected. GSK was reorganized as an NPO in 2003, and a 3-year project to support its activities financially was adopted in 2005. The association plans to concentrate mostly on text corpora.

The National Institute of Informatics (NII) is both the national center of informatics and one of the inter-university research institutes of the Inter-University Research Institute Corporation. It aims to deepen the field of informatics, to create future value through informatics, to construct an infrastructure for scientific information based on a scientific information network and the contents of that network, and to contribute to the scientific community as a whole. As part of these missions, NII has decided to initiate the Speech Resources Consortium (SRC) to create future value in information media, especially speech media. NII will promote this consortium together with GSK.

Objective

SRC aims to collect, distribute, investigate, research, and standardize the electronic data and software tools necessary for the development of science, education, and industry concerning speech. Through these activities, the consortium will contribute to the development of the information society.

Activities

  1. Surveying existing speech resources and compiling catalogues of them.
  2. Requesting research institutions to provide their existing speech resources to SRC.
  3. Publicity, distribution, and promotion of speech resources.
  4. Standardizing speech resources.
  5. Preparing a standardized contract form for the collection and distribution of speech resources.
  6. Additional production of frequently requested speech resources.
  7. Creating and redistributing revised versions of already distributed resources.
  8. Analyzing and processing speech resources.
  9. Designing and creating new speech resources.
  10. Investigating and researching speech resources.
  11. Cooperating with similar overseas organizations.
  12. Other services necessary to achieve the objectives of SRC.

Organization

Associate Professor
Junichi YAMAGISHI
Project Researcher
Tomoko OHSUGA
Staff
Marika HORIUCHI
Adviser
Shuichi ITAHASHI
Nobutaka ONO
Yuichi ISHIMOTO

Committee

To promote the development of SRC, a committee called the Speech Resources Promotion Committee has been constituted within SRC. Around 20 members are invited from the fields of speech processing, linguistics, acoustics, speech and language corpus creation, and speech and language resource provision, as well as from the legal profession.

Dr. Masami AKAMINE
Corporate Research & Development Center, Toshiba Corporation
Prof. Shigeaki AMANO
Aichi Syukutoku University
Dr. Shoko ARAKI
NTT Communication Science Laboratories
Prof. Hitoshi ISAHARA
Toyohashi University of Technology
Dr. Yuichi ISHIMOTO
The National Institute for Japanese Language
Prof. Shuichi ITAHASHI
Tsukuba University
Dr. Kiyotaka UCHIMOTO
National Institute of Information and Communications Technology
Dr. Tomoko OHSUGA
National Institute of Informatics
Prof. Nobutaka ONO
Tokyo Metropolitan University
Prof. Noriko KANDO
National Institute of Informatics
Prof. Hideaki KIKUCHI
Waseda University
Prof. Kazuya TAKEDA
Nagoya University
Prof. Satoshi NAKAMURA
Nara Institute of Science and Technology
Prof. Koiti HASIDA
The University of Tokyo
Dr. Ken HANAZAWA
Biometrics Research Laboratories, NEC Corporation
Prof. Satoru HAYAMIZU
Gifu University
Prof. Kikuo MAEKAWA
The National Institute for Japanese Language
Prof. Tomoko MATSUI
The Institute of Statistical Mathematics
Prof. Nobuaki MINEMATSU
The University of Tokyo
Prof. Junichi YAMAGISHI
National Institute of Informatics

(update: 2018-04-01)
