論文 実験的熱電特性のデータベース化に向けた論文データ収集WebシステムStarry dataの開発

桂 ゆかり SAMURAI ORCID (National Institute for Materials Science) ; 熊谷 将也 ; 郡司 咲子 ; 今井 庸二 ; 木村 薫

コレクション

引用
桂 ゆかり, 熊谷 将也, 郡司 咲子, 今井 庸二, 木村 薫. 実験的熱電特性のデータベース化に向けた論文データ収集WebシステムStarry dataの開発. 粉体および粉末冶金. 2017, 64 (8), 467-470. https://doi.org/10.48505/nims.4832
SAMURAI

説明:

(abstract)

Although numerous papers are published each year, most of the experimental data reported in those papers are only available as two-dimensional plot images. Data-driven materials science using the machine learning technologies will be accelerated by gathering those published experimental data into a database. By taking thermoelectric materials as a test case, we attempted to optimize the processes of collection of papers, extraction of numeric data from plot images, and sample-based data storage into a database. By searching with a keyword “thermoelectric”, we obtained a list of 47,936 papers. Among these papers, we selected 18,471 papers as possible papers with thermoelectric properties, and succeeded to download 14,835 full-text PDF files. We developed a web system named “Starry data”, to assist the sequential data extraction from the images contained in those PDF files. This system also assists materials scientists to annotate experimental samples efficiently, to develop a descriptive database that can be used for machine-learning of the complex, sample-dependent materials properties.

権利情報:

キーワード: materials informatics, materials database, data curation, thermoelectric materials

刊行年月日: 2017-08-30

出版者: Japan Society of Powder and Powder Metallurgy

掲載誌:

  • 粉体および粉末冶金 vol. 64 issue. 8 p. 467-470

研究助成金:

  • 日本学術振興会 16K14379 (科学研究費補助金(挑戦的萌芽研究)文献データ収集支援システムの開発による大規模高次元物性データベースの構築 )

原稿種別: 著者最終稿 (Accepted manuscript)

MDR DOI: https://doi.org/10.48505/nims.4832

公開URL: https://doi.org/10.2497/jjspm.64.467

関連資料:

その他の識別子:

連絡先:

更新時刻: 2024-10-10 16:30:55 +0900

MDRでの公開時刻: 2024-10-10 16:30:55 +0900

ファイル名 サイズ
ファイル名 粉体粉末冶金協会特集号記事5(著者版).pdf (サムネイル)
application/pdf
サイズ 688KB 詳細