[JAPANESE] [NTCIR Home] [NTCIR Tools Home]
This Document conversion script is script file that can convert the ducuments
included in the provided Xinhua Chinese News Article Data into the NTCIR
standard document format.
1 To obtain Xinhua Chinese News Article Data
For the NTCIR-8 ACLIA and MOAT participants:
For the non-participants, Xinhua Chinese News Article Data (1998-2005) for NTCIR Test Collection is available for research purpose use from the Linguistic Data Consortium (the LDC).
the Linguistic Data Consortium (the LDC):http://www.ldc.upenn.edu/
2 To convert the documents into the NTCIR standard document format
The documents in the obtained Corpus shall be converted into the NTCIR
standard document format by the script xin2ntc-new.pl.
Script and README
http://research.nii.ac.jp/ntcir/tools/xin2ntc.1.pl_txt (updated on 2009-08-10)