Ai Koizumi
;
Guillaume Deffrennes
;
Kei Terayama
;
Ryo Tamura
Description:
(abstract)Obtaining a fine approximation of a black-box function is important for understanding and evaluating innovative materials. Active learning aims to improve the approximation of black-box functions with fewer training data. In this study, we investigate whether active learning based on uncertainty sampling enables the efficient approximation of black-box functions in regression tasks using various material databases. In cases where the inputs are provided uniformly and defined in a relatively low-dimensional space, the liquidus surfaces of the ternary systems are the focus. The results show that uncertainty-based active learning can produce a better black-box function with higher prediction accuracy than that by random sampling. Furthermore, in cases in which the inputs are distributed discretely and unbalanced in a high-dimensional feature space, datasets extracted from materials databases for inorganic materials, small molecules, and polymers are addressed, and uncertainty-based active learning is occasionally inefficient. Based on the dependency on the material descriptors, active learning tends to produce a better black-box functions than random sampling when the dimensions of the descriptor are small. The results indicate that active learning is occasionally inefficient in obtaining a better black-box function in materials science.
Rights:
Keyword: active learning, black-box function, materials informatics
Date published: 2024-11-06
Publisher: Springer Science and Business Media LLC
Journal:
Funding:
Manuscript type: Publisher's version (Version of record)
MDR DOI:
First published URL: https://doi.org/10.1038/s41598-024-76800-4
Related item:
Other identifier(s):
Contact agent:
Updated at: 2024-11-21 16:35:14 +0900
Published on MDR: 2024-11-21 16:35:15 +0900
| Filename | Size | |||
|---|---|---|---|---|
| Filename |
s41598-024-76800-4 (1).pdf
(Thumbnail)
application/pdf |
Size | 2.28 MB | Detail |