NTCIR Project
Tools
nyt2ntc.pl

[JAPANESE] [NTCIR Home] [NTCIR Tools Home]


nyt2ntc.pl

This Document conversion script is script file that can convert the ducuments included in the provided New York Times News Article Data (English) into the NTCIR standard document format.

1 To obtain New York Times News Article Data

For the NTCIR-8 GeoTime and MOAT participants:

For the non-participants, New York Times News Article Data (2002-2005) for NTCIR Test Collection is available for research purpose use from the Linguistic Data Consortium (the LDC).

the Linguistic Data Consortium (the LDC):http://www.ldc.upenn.edu/

2 To convert the documents into the NTCIR standard record format

The documents in the obtained Corpus shall be converted into the NTCIR standard document format by the script nyt2ntc.pl.

Script
http://research.nii.ac.jp/ntcir/tools/nyt2ntc.pl
README
http://research.nii.ac.jp/ntcir/tools/README-for-nyt2ntcScript.txt