[JAPANESE] [NTCIR Home] [NTCIR Data Home]
![]()
NTCIR-9 RITE test collection can be used for experiments of Recognizing Inference in Text, for Chinese
(Simplified (CS), Traditional (CT)), and Japanese (JA).
| Collection | Subtask | Task Data | |||
| Language | Text Pairs | Relevance judgment (gold / gold standard data) |
|||
| Development Data | Test Data | ||||
| # | |||||
| NTCIR-9 RITE |
BC | Cs | 407 | 407 | 2-way (Y/N) |
| Ct | 421 | 900 | |||
| J | 500 | 500 | |||
| MC | Cs | 407 | 407 | Five-way iF/R/B/C/I) | |
| Ct | 421 | 900 | |||
| J | 440 | 440 | |||
| Entrance Exam * | J | 499 | 442 | 2-way (Y/N) | |
| RITE4QA | Cs | - | 964 | 2-way (Y/N) | |
| Ct | - | 682 | |||
| J | - | 682 | |||
J: Japanese, E: English, C: Chinese (Cs: simplified Chinese, Ct:traditional Chinese)
* Entrance Exam data for NTCIR-9 RITE is available for task
participants
in NTCIR-9 workshop, only.
Permission to use the Data is under negotiation. We will announce when it is available.
README
<dataset type="bc">
<pair id="1">
<t1>Ί_“‡‚ÍA“~‚Å‚àƒnƒCƒrƒXƒJƒX‚ªç‚«—‚ê‚éŠy‰€‚¾B</t1>
<t2>Ί_“‡‚Ì“~‚Ì‹C‰·‚Í‚‚¢B</t2>
</pair>
<pair id="2">
: : :
</dataset>
<dataset type="bc">
<pair id="1" label="Y">
<t1>Ί_“‡‚ÍA“~‚Å‚àƒnƒCƒrƒXƒJƒX‚ªç‚«—‚ê‚éŠy‰€‚¾B</t1>
<t2>Ί_“‡‚Ì“~‚Ì‹C‰·‚Í‚‚¢B</t2>
</pair>
<pair id="2" label="N">
: : :
</dataset>
![]()
How to obtain Document Data;
https://research.nii.ac.jp/ntcir/permission/perm-en-DocumentData.html
Contact us : idr-ntcir
- Application Form [txt]
- User agreement form (sent by email)
Reference

Notice
The test collection was constructed and used for the NTCIR project. It is usable only for research purposes.
The document collection included in the test collection was made available
to NII for use in the NTCIR project free of charge or for a fee. The providers
of the document data understand the importance of such test collections
in research on information access technologies and have kindly given their
permission to use the data for research purposes. Please remember that
the document data in the NTCIR test collection is copyrighted and has commercial
value as data. To maintain a good relationship with the data producers/provider,
we researchers must be reliable partners and use the data only for research
purposes under the user agreement, and we must use the data carefully so
as not to violate copyright.