README for Conversion Scripts for NewYorkTimes 2002 to 2005
-------------------------------------------------------------------------------

Data conversion script "nyt2ntc.pl":

  "nyt2ntc.pl" is a script that converts document data from the
  "NewYorkTimes" format to the "NTCIR" format.

USAGE:

  % perl nyt2ntc.pl <input_file>

  <input_file> is a file of "NewYorkTimes" documents.
  The converted data is written to standard output.

Example: (converting a single file)

  % perl nyt2ntc.pl nyt_eng_200201 > nyt_eng_200201-ntc

Example: (converting all 2002 files, using foreach in csh/tcsh)

  % foreach i ( nyt_eng_2002?? )
  foreach? perl nyt2ntc.pl $i > $i-ntc
  foreach? end

-------------------------------------------------------------------------------
Daisuke Ishikawa (NTCIR Project Researcher)
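
Note for sh/bash users: the foreach example above is csh/tcsh syntax. A rough Bourne-shell equivalent might look like the sketch below. It is an illustration only, not part of the original distribution, and it assumes that nyt2ntc.pl sits in the current directory and that the input files are named nyt_eng_2002?? as in the example.

```shell
# Assumed sh/bash equivalent of the csh/tcsh foreach example.
# Assumes nyt2ntc.pl and the nyt_eng_2002?? files are in the current directory.
for i in nyt_eng_2002??; do
    # If the pattern matches nothing, the shell leaves it unexpanded;
    # skip it so no stray output file is created.
    [ -f "$i" ] || continue
    perl nyt2ntc.pl "$i" > "$i-ntc"
done
```

Quoting "$i" keeps the loop safe if a file name ever contains spaces, and the -f check makes the loop do nothing when no input files are present.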