論文 Proposal for Automatic Extraction Framework of Superconductors Related Information from Scientific Literature

FOPPIANO, Luca ORCID ; Thaer, M. Dieb SAMURAI ORCID ; SUZUKI, Akira ORCID ; ISHII, Masashi SAMURAI ORCID

コレクション

引用
FOPPIANO, Luca, Thaer, M. Dieb, SUZUKI, Akira, ISHII, Masashi. Proposal for Automatic Extraction Framework of Superconductors Related Information from Scientific Literature.

説明:

(abstract)

The automatic collection of materials information from research papers using Natural Language Processing (NLP) is highly required for rapid materials development using big data, namely materials informatics (MI). The difficulty of this automatic collection is mainly caused by the variety of expressions in the papers, a robust system with tolerance to such variety is required to be developed. In this paper, we report an ongoing interdisciplinary work to construct a system for automatic collection of superconductor-related information from scientific literature using text mining techniques. We focused on the identification of superconducting material names and their critical temperature (Tc) key property. We discuss the construction of a prototype for extraction and linking using machine learning (ML) techniques for the physical information collection. From the evaluation using 500 sample documents, we define a baseline and a direction for future improvements.

権利情報:

キーワード: TDM, NLP, Machine Learning, Material Informatics, Superconductors

刊行年月日: 2019-05-31

出版者:

掲載誌:

研究助成金:

原稿種別: 著者最終稿 (Accepted manuscript)

MDR DOI:

公開URL:

関連資料:

その他の識別子:

連絡先:

更新時刻: 2022-10-03 01:50:55 +0900

MDRでの公開時刻: 2021-08-14 03:55:01 +0900

ファイル名 サイズ
ファイル名 SC2019-1(nims-1).pdf (サムネイル)
application/pdf
サイズ 756KB 詳細