Dataset Supercon 2 Dataset

Foppiano, Luca ORCID

Collection

Citation
Foppiano, Luca. Supercon 2 Dataset. https://doi.org/10.48505/nims.3735

Description:

(abstract)

The automatic extraction of materials and related properties from the scientific literature is gaining attention in data-driven materials science (Materials Informatics). In this paper, we discuss Grobid-superconductors, our solution for automatically extracting superconductor material names and respective properties from text. Built as a Grobid module, it combines machine learning and heuristic approaches in a multi-step architecture that supports input data as raw text or PDF documents. Using Grobid-superconductors, we built SuperCon2, a database of 40324 materials and properties records from 37700 papers. The material (or sample) information is represented by name, chemical formula, and material class, and is characterised by shape, doping, substitution variables for components, and substrate as adjoined information. The properties include the Tc superconducting critical temperature and, when available, applied pressure with the Tc measurement method.

Data origin type: informatics_and_data_science

Rights:

Keyword: tdm, materials science , superconductors, machine learning, dataset

Date published: 2022-09-10

Publisher: National Institute for Materials Science

Journal:

Funding:

Manuscript type: Not a journal article

MDR DOI: https://doi.org/10.48505/nims.3735

First published URL: https://github.com/lfoppiano/supercon

Related item:

Other identifier(s):

Contact agent:

Updated at: 2022-10-18 10:11:01 +0900

Published on MDR: 2022-11-10 15:27:13 +0900

Measurement method / 計測法

Description / 説明 :

Category / カテゴリ :

Category description / カテゴリの説明 :

Analysis field / 解析分野 :

Analysis field description / 解析分野の説明 :

Measurement environment / 計測環境 :

Standarized procedure / 標準手順 :

Measured at / 計測時刻 :

Filename Size
Filename supercon2_1203_papers.csv (Thumbnail)
text/csv
Size 2.18 MB Detail
Filename supercon2_v22.12.03.csv
text/csv
Size 13.9 MB Detail