論文 CSIML: a cost-sensitive and iterative machine-learning method for small and imbalanced materials data sets

Shengzhou Li SAMURAI ORCID ; Ayako Nakata SAMURAI ORCID

コレクション

引用
Shengzhou Li, Ayako Nakata. CSIML: a cost-sensitive and iterative machine-learning method for small and imbalanced materials data sets. Chemistry Letters. 2024, 53 (5), . https://doi.org/10.1093/chemle/upae090
SAMURAI

説明:

(abstract)

Materials science research benefits from the powerful machine-learning (ML) surrogate models, but it is also limited by the implicit requirement for sufficiently big and balanced data distribution for ML. In this paper, we propose a model to obtain more credible results for small and imbalanced materials data sets as well as chemical knowledge. Taking 2 bandgaps imbalanced data sets as instances, we demonstrate the usability and performance of our model compared with common ML models with normal sampling and resampling methods.

権利情報:

キーワード: cost-sensitive, iterative machine-learning method, small and imbalanced materials data sets, chemical knowledge, CSIML

刊行年月日: 2024-05-02

出版者: Oxford University Press (OUP)

掲載誌:

  • Chemistry Letters (ISSN: 03667022) vol. 53 issue. 5

研究助成金:

  • JSPS JP20H05883
  • JSPS JP20H05878
  • JST PRESTO JPMJPR20T4

原稿種別: 出版者版 (Version of record)

MDR DOI:

公開URL: https://doi.org/10.1093/chemle/upae090

関連資料:

その他の識別子:

連絡先:

更新時刻: 2024-11-28 16:30:28 +0900

MDRでの公開時刻: 2024-11-28 16:30:29 +0900

ファイル名 サイズ
ファイル名 upae090 (1).pdf (サムネイル)
application/pdf
サイズ 3.35MB 詳細