Akira Suzuki
(Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science)
Description:
(abstract) In this study, we developed a new information extraction method using a material dictionary database (MDDB), which parses scientific articles and collects related phrases from various expressions. We used magnetic properties as an illustrative case to show how the proposed system works. Structured terms comprising sub-phrases, tagged words, and their relationships enabled automatic annotation and information extraction. The MDDB was constructed on a pre-built knowledge base that includes information categories and related keywords. These categories can be hierarchically structured and flexibly updated to extract a wide range of information on the associations between magnetic materials and properties, along with the measurement systems used, structural analyses performed, and theoretical foundations. Herein, we propose preliminary rule-based phrase collection methods and label pattern extraction for phrases, which make it easy to add new structured terms. Label pattern matching yielded 1,136 new phrases, which allowed more related expressions to be retrieved from the text and improved the accuracy of information extraction. Approximately 350 relationships among material types, properties, and values were extracted from the manually modified annotations of 40 articles on permanent magnets. Our method can be applied to other research domains, where it can be used to build knowledge bases for topics in those fields.
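The dictionary-based annotation and label pattern idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `TERM_LABELS` entries, the label names, the `VALUE_RE` unit list, and the functions `tag` and `label_pattern` are hypothetical and are not taken from the MDDB or its repository.

```python
import re

# Hypothetical mini-dictionary (surface term -> label); NOT the actual MDDB content.
TERM_LABELS = {
    "coercivity": "PROPERTY",
    "remanence": "PROPERTY",
    "Nd-Fe-B": "MATERIAL",
    "sintered magnet": "MATERIAL",
}

# Assumed pattern for numeric values followed by a few example units.
VALUE_RE = re.compile(r"\d+(?:\.\d+)?\s*(?:kA/m|T|K)\b")


def tag(sentence):
    """Return non-overlapping (start, end, text, label) spans in text order."""
    candidates = []
    for term, label in TERM_LABELS.items():
        for m in re.finditer(re.escape(term), sentence, flags=re.IGNORECASE):
            candidates.append((m.start(), m.end(), m.group(), label))
    for m in VALUE_RE.finditer(sentence):
        candidates.append((m.start(), m.end(), m.group(), "VALUE"))

    # Resolve overlaps greedily: longer spans win, ties go to the earlier span.
    candidates.sort(key=lambda s: (-(s[1] - s[0]), s[0]))
    chosen = []
    for span in candidates:
        if all(span[1] <= c[0] or span[0] >= c[1] for c in chosen):
            chosen.append(span)
    return sorted(chosen)


def label_pattern(sentence):
    """Reduce a sentence to the ordered sequence of labels found in it."""
    return tuple(label for _, _, _, label in tag(sentence))


if __name__ == "__main__":
    text = "The coercivity of the Nd-Fe-B sintered magnet was 1200 kA/m."
    for start, end, phrase, label in tag(text):
        print(f"{label:8s} {phrase!r} [{start}:{end}]")
    print(label_pattern(text))
```

In a sketch like this, sentences that share a label sequence with already annotated sentences could be flagged as candidates for new phrases; how the published method actually defines and matches label patterns is described in the associated work, not here.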
Data origin type: informatics_and_data_science
Rights:
Creative Commons BY Attribution 4.0 International
Keyword: database, magnetic property, materials informatics, natural language processing, ontology, information extraction
Date published:
Publisher: NIMS
Journal:
Funding:
Manuscript type: Not a journal article
MDR DOI: https://doi.org/10.48505/nims.3857
First published URL: https://github.com/suzuki-akira3/MDDB.git
Related item:
Other identifier(s):
Contact agent:
Updated at: 2023-01-31 22:30:50 +0900
Published on MDR: 2023-02-03 11:36:20 +0900
Description:
Category: https://matvoc.nims.go.jp/entity/Q21
Category description:
Calculated at:
| Filename | Type | Size |
|---|---|---|
| MDDB.owl | application/rdf+xml | 2.37 MB |
| Table_2_all_data.csv | text/csv | 3.89 KB |
| Table_3_all_data.csv | text/csv | 38 KB |
| Table_5_all_data.csv | text/csv | 100 KB |
| Table_6_all_data.csv | text/csv | 4.97 KB |
| Table_7_all_data.csv | text/csv | 3.3 KB |
| Table_8_all_data.csv | text/csv | 43.8 KB |
| Table_9_all_data.csv | text/csv | 102 KB |
| Table_10_all_data.csv | text/csv | 24.2 KB |