Presentation 2019-05-31
Proposal for Automatic Extraction Framework of Superconductors Related Information from Scientific Literature
Luca Foppiano, Thaer M. Dieb, Akira Suzuki, Masashi Ishii
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The automatic collection of materials information from research papers using Natural Language Processing (NLP) is in high demand for rapid materials development using big data, namely materials informatics (MI). The difficulty of this automatic collection is mainly caused by the variety of expressions in the papers, so a robust system that tolerates such variety needs to be developed. In this paper, we report an ongoing interdisciplinary work to construct a system for the automatic collection of superconductor-related information from scientific literature using text mining techniques. We focus on the identification of superconducting material names and their key property, the critical temperature (Tc). We discuss the construction of a prototype for extraction and linking using machine learning (ML) techniques for collecting this physical information. From an evaluation using 500 sample documents, we define a baseline and a direction for future improvements.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) material informatics / superconductors / machine learning / nlp / tdm
Paper # SC2019-1
Date of Issue 2019-05-24 (SC)

Conference Information
Committee SC
Conference Date 2019/5/31(2days)
Place (in Japanese) (See Japanese page)
Place (in English) National Institute for Materials Science
Topics (in Japanese) (See Japanese page)
Topics (in English) Science Service Platform, Data Service and Machine Learning, etc.
Chair Masahide Nakamura(Kobe Univ.)
Vice Chair Shinji Kikuchi(National Institute for Materials Science) / Yoji Yamato(NTT)
Secretary Shinji Kikuchi(Tokyo University of Technology) / Yoji Yamato(Fujitsu Lab.)
Assistant

Paper Information
Registration To Technical Committee on Service Computing
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Proposal for Automatic Extraction Framework of Superconductors Related Information from Scientific Literature
Sub Title (in English)
Keyword(1) material informatics
Keyword(2) superconductors
Keyword(3) machine learning
Keyword(4) nlp
Keyword(5) tdm
1st Author's Name Luca Foppiano
1st Author's Affiliation National Institute for Materials Science(NIMS)
2nd Author's Name Thaer M. Dieb
2nd Author's Affiliation National Institute for Materials Science(NIMS)
3rd Author's Name Akira Suzuki
3rd Author's Affiliation National Institute for Materials Science(NIMS)
4th Author's Name Masashi Ishii
4th Author's Affiliation National Institute for Materials Science(NIMS)
Date 2019-05-31
Paper # SC2019-1
Volume (vol) vol.119
Number (no) SC-66
Page pp.1-5(SC)
#Pages 5
Date of Issue 2019-05-24 (SC)