The 7th NTCIR Workshop
NTCIR-7 MOAT Evaluation Agreement Forms - Xinhua English Text and Xinhua Chinese Text

[NTCIR-7 User Ugreement Forms]


# This page is available English only.

1. INTRODUCTION

"Xinhua English Text (1998-2001) and "Xinhua Annotated English Text", or (LDC Catalog #)LDC2006E106 and LDC2006E108 NTCIR Opinion Annotation Pilot Task Evaluation Corpus for research purposes, is available for the participants of NTCIR-7 MOAT Task.

"Xinhua Chinese Text (1998-2001)(Simplified Chinese Text)", or  LDC2008E48 NTCIR Multilingual Opinion Annotation Task,  is available for the participants of the NTCIR-7 MOAT Task only for the purposes of the NTCIR Workshop. It is free of charge for the registered participants of NTCIR-7 MOAT.

Xinhua Chinese Text (1998-2001) is also included in either of the following LDC corpus:
LDC2003T09: Chinese Gigaword First Edition, which released on May 22, 2003.*
LDC2005T14: Chinese Gigaword Second Edition, which released on Aug 17, 2005.
LDC2007T38: Chinese Gigaword Third Edition, which released on Aug 17, 2007.
If you have one of the above three, you do not need to newly obtain the corpus.

*For the documents included in Chinese Gigaword First Edition, different format of Doc ID is used.
Please convert the DocID if you use that edition.

This is only a portion of the data for the NTCIR-7 MOAT Task. The rest of the data can be obtained directly from NTCIR after filling out and sending the two forms below.

2. HOW TO OBTAIN THE DATA

(1) Register to participate in the MOAT task at NTCIR-V
The LDC will grant the license to the registered participants.
(2) Download the LDC's "NTCIR-7 MOAT Evaluation Agreement".
(3) Complete and sign the LDC agreement.
(4) Fax a signed agreement to the Linguistic Data Consortium (LDC).
Only one signed form by fax is necessary for the LDC.
(5) The document data will be provided to you by the LDC via their internet server for download.
Contacting LDC:
Linguistic Data Consortium
3600 Market Street
Suite 810
Philadelphia, PA, 19104-2653, USA
Pone:+1(215)898-0464
Fax:+1(215)573-2175
Email: ldc@ldc.upenn.edu
ATTN: Ms Ilya Ahtaridis, Membership Coordinator

3. SCOPE OF THE LICENSE

The license for use will be valid for NTCIR-7 Participants until their participation in the NTCIR-7 Evaluation has ended, or after research in Opinion Analysis using the data has ended.

4. CONVERSION OF LDC DOCUMENT DATA INTO NTCIR FORMAT

The data distributed by the LDC is the complete Xinhua corpus. For the Opinion Annotation Pilot Task we have pre-segmented the files. The files will be made available by the LDC after signing the agreement forms.

[NTCIR-7 User Ugreement Forms]

contact; ntc-admin
2008-08-29