NTCIR Project
Tools
xin2ntc-new.pl

[JAPANESE] [NTCIR Home] [NTCIR Tools Home]


xin2ntc-new.pl

This Document conversion script is script file that can convert the ducuments included in the provided Xinhua Chinese News Article Data into the NTCIR standard document format.

1 To obtain Xinhua Chinese News Article Data

For the NTCIR-8 ACLIA and MOAT participants:

For the non-participants, Xinhua Chinese News Article Data (1998-2005) for NTCIR Test Collection is available for research purpose use from the Linguistic Data Consortium (the LDC).

the Linguistic Data Consortium (the LDC):http://www.ldc.upenn.edu/

2 To convert the documents into the NTCIR standard document format

The documents in the obtained Corpus shall be converted into the NTCIR standard document format by the script xin2ntc-new.pl.

Script and README
http://research.nii.ac.jp/ntcir/tools/xin2ntc-new.pl_txt (updated on 2009-08-10)