Fourth International Workshop on SCIentific DOCument Analysis
associated with JSAI International Symposia on AI 2020 (IsAI-2020)

Workshop: November 15, 2020 (rescheduled from November 16-17, 2020)

SCIDOCA2020 will be held as an online conference.

New Information


The time schedule of the conference has been decided.


The conference date has been changed to the 15th of November. The conference will still be held online.


  • The submission deadline has been extended to the 10th of September.
  • Due to the worldwide spread of COVID-19, we have decided to hold the conference online.


SCIDOCA-2020 will be a one-day or two-day workshop during 16-17 November, 2020. Due to the current COVID-19 situation, there is a possibility that the workshop will be an online meeting.

Aims and Scope

The recent proliferation of scientific papers and technical documents has become an obstacle to the efficient acquisition of new information in various fields. It is almost impossible for individual researchers to check and read all related documents, and even retrieving relevant documents is becoming increasingly difficult. This workshop gathers researchers and experts who work on scientific document analysis from various perspectives, and invites technical paper presentations and system demonstrations that cover any aspect of scientific document analysis.

Important Dates

Paper submission deadline: September 10, 2020 (AOE; extended from August 31, 2020)
Notification: October 1, 2020
Camera-ready due: October 8, 2020 (AOE)
Camera-ready due: November 8, 2020
Workshop: November 15, 2020 (rescheduled from November 16-17, 2020)


Please register for the workshop at the registration page of JSAI International Symposia on AI 2020.


Relevant topics include, but are not limited to, the following:

  • text analysis
  • document structure analysis
  • logical structure analysis
  • figure and table analysis
  • citation analysis of scientific and technical documents
  • scientific information assimilation
  • summarization and visualization
  • knowledge discovery/mining from scientific papers and data
  • similar document retrieval
  • entity and relation linking between documents and knowledge base
  • survey generation
  • resources for scientific document analysis
  • document understanding in general
  • NLP systems targeting scientific documents, including tagging, parsing, coreference, etc.

Invited Speaker

Pierre Zweigenbaum, Université Paris-Saclay

Title: Scientific document analysis for life sciences: recent work at LIMSI


There are two classes of submissions:
  • Long paper on original and completed work, including concrete evaluation and analysis wherever appropriate; and
  • Short paper on a small, focused contribution, work in progress, a negative result, or an opinion piece.

Long papers may be up to 14 pages including references, and short papers up to 7 pages including references. (Reviewers will be told that there is no penalty for writing a shorter submission.)

All submissions should be written in English and formatted as a PDF according to the Springer-Verlag LNCS style, which can be obtained from here. The paper should be anonymized. If you use a Word file, please follow the formatting instructions, convert the file to PDF, and submit it at the paper submission page.

For both classes, in addition to original unpublished work, we also accept papers that have already been published or presented at other venues. Such submissions should also be anonymized and will be reviewed by the program committee.

In general, the accepted papers will not be archived. The papers are distributed to the participants of the workshop on a USB flash drive. If the authors wish to make their paper publicly available, we will also provide a link to the PDF on this webpage. Otherwise, we will not upload the papers to the web. Unpublished submissions in both the long and short paper tracks are considered candidates for the LNAI post-proceedings (the authors may decline the invitation if they wish). The papers will be archived only through these post-proceedings.

You can submit your paper at . If you are unable to submit your paper through the EasyChair system, please send an email to "nomura[at]"

If a paper is accepted, at least one author of the paper must register for the workshop and present the paper. Please register for the workshop at the registration page.

Post Proceedings


SCIDOCA2020 Program

Presentation time
Long paper: 20 min + 5 min (QA)
Short paper: 10 min + 5 min (QA)
  • 9:20: Opening
  • 9:30-10:40: Named Entity Recognition
    • (L) Preliminary Experiments of Span-based Distant Supervision for Biomedical NER
      Takayoshi Shibahara, Ikuya Yamada, Noriki Nishida, Shanshan Liu, Kouji Kozaki, Taro Watanabe and Yuji Matsumoto
    • (S) Named Entity Recognition on COVID-19 Scientific Papers
      An Dao and Akiko Aizawa
    • (S) Word-Level Chemical Named Entity Recognition Based on Subword Sequence Analysis
      Kojiro Machi and Masaharu Yoshioka
    • (S) Coordination Identification for Composite Named Entity Normalization
      Yuya Sawada, Hiroki Teranishi and Yuji Matsumoto
  • 10:50-12:00: Search and Prediction
    • (L) Detecting Hypothetical Triples Based on Formal Concept Analysis
      Yoshiaki Okubo and Ken Satoh
    • (S) CovRelex: A COVID-19 Retrieval System with Relation Extraction
      Vu Tran, Hien Van Tran, Phuong Minh Nguyen, Chau Minh Nguyen, Ken Satoh, Yuji Matsumoto and Minh Le Nguyen
    • (S) Citation Network Analysis of the COVID-19 Open Research Dataset
      Junichiro Mori
    • (S) Suggesting the gene to be investigated next in cellular biological experiments by text mining
      Kei Ito, Yoshimasa Tsuruoka and Daiju Kitagawa
  • 13:30-14:45: Document Analysis
    • (L) Document Structure Extraction: An Exploratory Study
      Logan Markewich, Yubin Xing, Hao Zhang, Zhexin Jiang, Navid Lambert-Shirzad, Roy Ka-Wei Lee, Zhi Li and Seok-Bum Ko
    • (L) An Empirical Study of Proxy-labeling for Adaptation of Discourse Dependency Parsing
      Noriki Nishida and Yuji Matsumoto
    • (L) Dataset Creation for Grounding of Formulae
      Takuto Asakura, Andre Greiner-Petter, Akiko Aizawa and Yusuke Miyao
  • 15:00-16:05: Information Extraction
    • (L) Brain Function and Coordinate Extraction from Neuroscience Full Text Papers
      Mako Akeda and Yoshinobu Kano
    • (L) Enhancing Biomedical Relation Extraction with Indirect Evidences
      Qin Dai, Naoya Inoue, Ryo Takahashi and Kentaro Inui
    • (S) Extraction of the Material Synthesis Procedure
      Shanshan Liu, Matsunori Uenuma, Hiroyuki Shindo and Yuji Matsumoto
  • 16:20-17:20: Invited Talk
    • Scientific document analysis for life sciences: recent work at LIMSI
      Pierre Zweigenbaum

Workshop Chairs

Yuji Matsumoto, Riken, Japan
Shoshin Nomura, NII, Japan

Program Committee Members (To be confirmed)

Takeshi Abekawa, NII, Japan
Akiko Aizawa, NII, Japan
Naoya Inoue, Tohoku University, Japan
Kentaro Inui, Tohoku University, Japan
Yoshinobu Kano, Shizuoka University, Japan
Yusuke Miyao, University of Tokyo, Japan
Junichiro Mori, University of Tokyo, Japan
Hidetsugu Nanba, Hiroshima City University, Japan
Ken Satoh, NII, Japan
Hiroyuki Shindo, NAIST, Japan
Yoshimasa Tsuruoka, University of Tokyo, Japan
Minh Le Nguyen, JAIST, Japan
Pontus Stenetorp, University College London, UK

Please send any inquiries concerning the workshop to "nomura[at]"

SCIDOCA 2020 home page
