[JAPANESE] [NTCIR Home] [NTCIR DATA Home]
This test collection is intended to evaluate three different techniques
(subtasks) related to patent information processing: document retrieval,
passage retrieval, and classification. In the document retrieval, a claim
in a patent application is used as a search topic to search for the patents
that can invalidate the demand in the topic patent. In the passage retrieval,
the
paragraphs (passages) in a document retrieved for the document retrieval are sorted according to the degree to which a passage provides grounds to judge whether the document is relevant.
In the classification, patent applications are categorized according to
the F-term classification system. The document collection includes unexamined
Japanese patent applications published in 1993-2002 and Patent Abstracts
of Japan published in 1993-2002. The entire collection is provided by NII
for research purposes.
Collection | Task | Documents | Task data | |||||||
Genre | Filename | Lang. | Year | # of docs | Size | Topic/ | Relevance judge |
|||
Lang. | # | |||||||||
NTCIR-5 PATENT | IR | patent full-text | Publication of unexamined Japanese patent applications (kkh) |
J | 1993-2002 | 3,496,252 | 94.5GB | JE | Document Retrieval 1,223 Passage Retrieval 356 Classification - Theme 2,008 Classification - F term 500 |
4 3 1 |
patent abstract | Patent Abstracts of Japan (paj) |
E | 1993-2002 | 3,496,252 | 5,482MB |
*The entire collection is provided by NII.
(1) Document Retrieval Subtask
The followings are the procedures to obtain the test collection. The test collection and data available from NII are free of charge.
Application Form [txt]
User agreement Form (sent by email)
The test collection has been constructed and used for the NTCIR. They are
usable only for the research purpose use.
Task Overview of NTCIR 5 Patent
Overview of Patent Retrieval Task at NTCIR-5
Overview of Classification Subtask at NTCIR-5 Patent Retrieval
Task
NTCIR Project (Rm.1309)
National Institute of Informatics
2-1-2 Hitotsubashi Chiyoda-ku, Tokyo
102-8430, JAPAN
PHONE: +81-3-4212-2750
FAX: +81-3-4212-2751
Email: ntc-secretariat
The release of the new test collections and correction information shall
be announced through the ntcir Mailing list
The documents collection included in the test collection were provided
to NII for used in NTCIR free of charge or for a fee. The providers of
the document data kindly understand the importance of the test collection
in the research on information access technologies and then granted the
use of the data for research purpose. Please remember that the document
data in the NTCIR test collection is copyrighted and has commercial value
as data. It is important for our continued reliable and good relationship
with the data producers/providers that we researchers must behave as a
reliable partners and use the data only for research purpose under the
user agreement and use them carefully not to violate any rights for them
.
[JAPANESE] [NTCIR Home] [Top of this page]
[NTCIR DATA
Home]
ntc-admin