# A material dictionary database to extract information on permanent magnets from scientific articles

https://mdr.nims.go.jp/datasets/20a8b35a-6c97-4670-bdf3-c157aeb3d470

## File

- [MDDB.owl](https://mdr.nims.go.jp/filesets/d5d8d31b-4585-4f2d-8e22-d7896b03f41d/download) ([Detail](https://mdr.nims.go.jp/filesets/d5d8d31b-4585-4f2d-8e22-d7896b03f41d.md))
- [Table_2_all_data.csv](https://mdr.nims.go.jp/filesets/82ca4844-86fc-4b3b-8ee2-44c90d467f7a/download) ([Detail](https://mdr.nims.go.jp/filesets/82ca4844-86fc-4b3b-8ee2-44c90d467f7a.md))
- [Table_3_all_data.csv](https://mdr.nims.go.jp/filesets/9d7c229e-2fa2-41f2-918f-1cf61753afd4/download) ([Detail](https://mdr.nims.go.jp/filesets/9d7c229e-2fa2-41f2-918f-1cf61753afd4.md))
- [Table_5_all_data.csv](https://mdr.nims.go.jp/filesets/8c93ce80-7a25-4cf6-a43a-be8acbe1460c/download) ([Detail](https://mdr.nims.go.jp/filesets/8c93ce80-7a25-4cf6-a43a-be8acbe1460c.md))
- [Table_6_all_data.csv](https://mdr.nims.go.jp/filesets/0bec9532-6322-42e7-911a-557d990f80c9/download) ([Detail](https://mdr.nims.go.jp/filesets/0bec9532-6322-42e7-911a-557d990f80c9.md))
- [Table_7_all_data.csv](https://mdr.nims.go.jp/filesets/e7955aad-99ca-4100-9752-cf7935ad6425/download) ([Detail](https://mdr.nims.go.jp/filesets/e7955aad-99ca-4100-9752-cf7935ad6425.md))
- [Table_8_all_data.csv](https://mdr.nims.go.jp/filesets/5bf5808b-f3fd-4c5c-a053-a66e5355a7d9/download) ([Detail](https://mdr.nims.go.jp/filesets/5bf5808b-f3fd-4c5c-a053-a66e5355a7d9.md))
- [Table_9_all_data.csv](https://mdr.nims.go.jp/filesets/006abab2-e9ba-44e6-9a6b-1cc8b1a60141/download) ([Detail](https://mdr.nims.go.jp/filesets/006abab2-e9ba-44e6-9a6b-1cc8b1a60141.md))
- [Table_10_all_data.csv](https://mdr.nims.go.jp/filesets/bf8ffc47-f3a7-4853-b42d-9ab8c1488065/download) ([Detail](https://mdr.nims.go.jp/filesets/bf8ffc47-f3a7-4853-b42d-9ab8c1488065.md))

## Id

20a8b35a-6c97-4670-bdf3-c157aeb3d470

## Local identifier



## Visibility

open_to_public

## State

published

## Created at

2023-01-29T23:31:04.703595Z

## Updated at

2023-01-31T13:30:50.462414Z

## Published at

2023-02-03T02:36:20.491023Z

## Doi

https://doi.org/10.48505/nims.3857

## First published url

https://github.com/suzuki-akira3/MDDB.git

## Date published



## Recorded date published



## Resource type

dataset

## Manuscript type

na

## Collection



## Title

- title: A material dictionary database to extract information on permanent magnets
    from scientific articles
  title_type: original
  lang: en

## Description

- description: In this study, we developed a new information extraction method using
    a material dictionary database (MDDB), which parses scientific articles and collects
    related phrases from various expressions. We used magnetic properties as an illustrative
    case to analyze how the proposed system works. Structured terms comprising
    sub-phrases, tagged words, and their relationships enabled automatic annotation
    and information extraction. The MDDB was constructed on a pre-built knowledge
    base that includes information categories and related keywords. These categories
    can be hierarchically structured and flexibly updated to extract a wide range
    of information on the associations between magnetic materials and properties along
    with the measurement systems used, structural analyses performed, and theoretical
    foundations. Herein, we propose preliminary rule-based phrase collection methods
    and label pattern extraction for phrases, with which new structured terms can be
    added easily. We found 1,136 new phrases by label pattern matching, which enabled more
    related expressions to be retrieved from the text and improved the accuracy of information
    extraction. Approximately 350 relationships among the material types, properties,
    and values were extracted from the manually modified annotations of 40 articles
    on permanent magnets. Our method can be applied to other research domains, where
    it can be used to build knowledge bases for topics in those fields.
  description_type: abstract
  lang: en
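
The abstract describes annotating text with dictionary phrases and then reading off the sequence of labels. The sketch below illustrates that general idea only and is not the MDDB implementation. The `DICTIONARY` entries, the category names `Property` and `Material`, and the helper names `annotate` and `label_pattern` are all hypothetical and are not taken from `MDDB.owl`.

```python
import re

# Hypothetical phrase -> category dictionary (keys must be lowercase).
# These entries are illustrative and are NOT taken from MDDB.owl.
DICTIONARY = {
    "coercivity": "Property",
    "remanence": "Property",
    "nd-fe-b": "Material",
    "sintered magnet": "Material",
}


def annotate(text, dictionary=DICTIONARY):
    """Return (start, end, phrase, label) tuples for dictionary phrases in text."""
    # Put longer phrases first so multi-word terms win over their sub-phrases.
    phrases = sorted(dictionary, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(p) for p in phrases) + r")(?!\w)",
        re.IGNORECASE,
    )
    return [
        (m.start(), m.end(), m.group(0), dictionary[m.group(0).lower()])
        for m in pattern.finditer(text)
    ]


def label_pattern(annotations):
    """Reduce annotations to their ordered sequence of category labels."""
    return [label for _, _, _, label in annotations]


if __name__ == "__main__":
    sentence = "The coercivity of the Nd-Fe-B sintered magnet increased."
    found = annotate(sentence)
    for start, end, phrase, label in found:
        print(f"{start:3}-{end:3} {label:9} {phrase}")
    print(label_pattern(found))
```

A real dictionary would presumably be loaded from the ontology file rather than hard-coded; that loading step is left out here because it depends on the structure of `MDDB.owl`.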

## Creator

- name: Akira Suzuki
  role: author
  orcid: https://orcid.org/0000-0002-8167-0414
  organization: National Institute for Materials Science
  department: Research and Services Division of Materials Data and Integrated System
  ror: https://ror.org/026v1ze26

## Contact agent



## Publisher

organization: NIMS

## Managing organization



## Keyword

- subject: database
  schema: not_defined
- subject: magnetic property
  schema: not_defined
- subject: materials informatics
  schema: not_defined
- subject: natural language processing
  schema: not_defined
- subject: ontology
  schema: not_defined
- subject: information extraction
  schema: not_defined

## Rights

- description: Creative Commons BY Attribution 4.0 International
  identifier: https://creativecommons.org/licenses/by/4.0/

## Other identifier(s)



## Data origin

- data_origin_type: informatics_and_data_science

## Embargo



## Journal



## Conference



## Related item



## Funding

- identifier: JPMXP1122715503
  funder_name: Ministry of Education, Culture, Sports, Science and Technology (MEXT)
  description: Data Creation and Utilization-Type Material Research and Development
    Project (Digital Transformation Initiative Center for Magnetic Materials)

## Instrument



## Instrument operator



## Instrument managing organization



## Measurement method



## Specimen



## Chemical composition



## Structure for specimen



## Structural feature for specimen



## Specific property for specimen



## Process for specimen treatment



## Computational method

- category_vocabulary: https://matvoc.nims.go.jp/entity/Q21

## Energy level/transition state



## Software



## Custom property



## Fileset

- id: d5d8d31b-4585-4f2d-8e22-d7896b03f41d
  filename: MDDB.owl
  content_type: application/rdf+xml
  size: 2481077
  md5: 48751068f034bc6686c4fc31cca4dd63
- id: 82ca4844-86fc-4b3b-8ee2-44c90d467f7a
  filename: Table_2_all_data.csv
  content_type: text/csv
  size: 3987
  md5: 2d30342279a7455d989be12ab0d2df55
- id: 9d7c229e-2fa2-41f2-918f-1cf61753afd4
  filename: Table_3_all_data.csv
  content_type: text/csv
  size: 38877
  md5: c9e833b4ce8aaa88a38ef6e80d4e9933
- id: 8c93ce80-7a25-4cf6-a43a-be8acbe1460c
  filename: Table_5_all_data.csv
  content_type: text/csv
  size: 102838
  md5: 397b846d1fff88a3c49060961ec7c243
- id: 0bec9532-6322-42e7-911a-557d990f80c9
  filename: Table_6_all_data.csv
  content_type: text/csv
  size: 5086
  md5: f02853840f886092a67a13c054a90453
- id: e7955aad-99ca-4100-9752-cf7935ad6425
  filename: Table_7_all_data.csv
  content_type: text/csv
  size: 3379
  md5: f6baeb4642b62b2ca6f4d9382ca5f212
- id: 5bf5808b-f3fd-4c5c-a053-a66e5355a7d9
  filename: Table_8_all_data.csv
  content_type: text/csv
  size: 44878
  md5: ed3b7862863722ab5c6d5ad6553c11ce
- id: 006abab2-e9ba-44e6-9a6b-1cc8b1a60141
  filename: Table_9_all_data.csv
  content_type: text/csv
  size: 104031
  md5: 7762ee60319e149dde55cd5b41197e62
- id: bf8ffc47-f3a7-4853-b42d-9ab8c1488065
  filename: Table_10_all_data.csv
  content_type: text/csv
  size: 24790
  md5: c416f387c7e8baff0b0b8edf183e8fca
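
The `md5` values above can be used to check downloaded copies of the files. The following is a minimal sketch, assuming the files were saved under their listed filenames in a single directory. The helper names `md5_of` and `verify` are illustrative and are not part of any NIMS tooling.

```python
import hashlib
from pathlib import Path

# Expected MD5 digests, copied from the Fileset entries in this record.
EXPECTED_MD5 = {
    "MDDB.owl": "48751068f034bc6686c4fc31cca4dd63",
    "Table_2_all_data.csv": "2d30342279a7455d989be12ab0d2df55",
    "Table_3_all_data.csv": "c9e833b4ce8aaa88a38ef6e80d4e9933",
    "Table_5_all_data.csv": "397b846d1fff88a3c49060961ec7c243",
    "Table_6_all_data.csv": "f02853840f886092a67a13c054a90453",
    "Table_7_all_data.csv": "f6baeb4642b62b2ca6f4d9382ca5f212",
    "Table_8_all_data.csv": "ed3b7862863722ab5c6d5ad6553c11ce",
    "Table_9_all_data.csv": "7762ee60319e149dde55cd5b41197e62",
    "Table_10_all_data.csv": "c416f387c7e8baff0b0b8edf183e8fca",
}


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory, expected=EXPECTED_MD5):
    """Return {filename: 'ok' | 'mismatch' | 'missing'} for each expected file."""
    results = {}
    for name, want in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        else:
            results[name] = "ok" if md5_of(path) == want else "mismatch"
    return results


if __name__ == "__main__":
    # Assumes the downloads sit in the current working directory.
    for name, status in verify(".").items():
        print(f"{status:9} {name}")
```

MD5 is used here only because it is the digest this record publishes; it detects corrupted or truncated downloads but is not a safeguard against deliberate tampering.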

## Thumbnail

fileset_id: d5d8d31b-4585-4f2d-8e22-d7896b03f41d
filename: MDDB.owl