This document conversion script converts the documents included in
the provided New York Times News Article Data (English) into
the NTCIR standard document format.
1 To obtain New York Times News Article Data
For the NTCIR-8 GeoTime and MOAT participants:
For non-participants, the New York Times News Article Data (2002-2005) for the NTCIR Test Collection is available for research use from the Linguistic Data Consortium (LDC).
Linguistic Data Consortium (LDC): http://www.ldc.upenn.edu/
2 To convert the documents into the NTCIR standard record format
The documents in the obtained corpus must be converted into the NTCIR
standard document format using the script nyt2ntc.pl.
Script
http://research.nii.ac.jp/ntcir/tools/nyt2ntc.pl
README
http://research.nii.ac.jp/ntcir/tools/README-for-nyt2ntcScript.txt
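For orientation only, the kind of conversion described in section 2 can be sketched in Python as follows. This is not the nyt2ntc.pl script and does not reproduce its logic. The input layout (a <DOC id="..."> record with <HEADLINE>, <TEXT> and <P> elements), the output tag names (<DOCNO>, <LANG>, <HEADLINE>, <DATE>, <TEXT>), the date format, and the function name gigaword_to_ntcir are all assumptions made for illustration; the actual field mapping is defined by nyt2ntc.pl and its README.

```python
import re


def gigaword_to_ntcir(doc: str, lang: str = "EN") -> str:
    """Rewrite one source <DOC> record as an NTCIR-style record.

    Hypothetical helper: tag names and layout are assumptions,
    not taken from nyt2ntc.pl.
    """
    # Assumed input: the document ID is stored in the id attribute of <DOC>.
    doc_id = re.search(r'<DOC\s+id="([^"]+)"', doc).group(1)

    # Collapse internal whitespace so the headline fits on one line.
    headline_match = re.search(r"<HEADLINE>(.*?)</HEADLINE>", doc, re.S)
    headline = " ".join(headline_match.group(1).split()) if headline_match else ""

    # Assumed ID layout: SOURCE_LANG_YYYYMMDD.NNNN, so the date is read from the ID.
    date_match = re.search(r"_(\d{4})(\d{2})(\d{2})\.", doc_id)
    date = "-".join(date_match.groups()) if date_match else ""

    # Each <P>...</P> block becomes one whitespace-normalized paragraph.
    paragraphs = [" ".join(p.split()) for p in re.findall(r"<P>(.*?)</P>", doc, re.S)]

    lines = [
        "<DOC>",
        f"<DOCNO>{doc_id}</DOCNO>",
        f"<LANG>{lang}</LANG>",
        f"<HEADLINE>{headline}</HEADLINE>",
        f"<DATE>{date}</DATE>",
        "<TEXT>",
    ]
    lines += [f"<P>{p}</P>" for p in paragraphs]
    lines += ["</TEXT>", "</DOC>"]
    return "\n".join(lines)


# Made-up sample record, used only to exercise the sketch.
sample = (
    '<DOC id="NYT_ENG_20020101.0001" type="story">\n'
    "<HEADLINE>\nSample headline\n</HEADLINE>\n"
    "<TEXT>\n<P>\nFirst paragraph.\n</P>\n"
    "<P>\nSecond\nparagraph.\n</P>\n</TEXT>\n</DOC>"
)
converted = gigaword_to_ntcir(sample)
print(converted)
```

For real conversions, use nyt2ntc.pl itself and follow the README linked above; the sketch only shows the general shape of a record-to-record rewrite.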