Publication
Performance of uncertainty-based active learning for efficient approximation of black-box functions in materials science
Obtaining a fine approximation of a black-box function is important for understanding and evaluating innovative materials. Active learning aims to improve the approximation of black-box functions with fewer training data. In this study, we investigate whether active learning based on uncertainty sampling enables the efficient approximation of black-box functions in regression tasks using various material databases. In cases where the inputs are provided uniformly and defined in a relatively low-dimensional space, the liquidus surfaces of the ternary systems are the focus. The results show that uncertainty-based active learning can produce a better black-box function with higher prediction accuracy than that by random sampling. Furthermore, in cases in which the inputs are distributed discretely and unbalanced in a high-dimensional feature space, datasets extracted from materials databases for inorganic materials, small molecules, and polymers are addressed, and uncertainty-based active learning is occasionally inefficient. Based on the dependency on the material descriptors, active learning tends to produce a better black-box functions than random sampling when the dimensions of the descriptor are small. The results indicate that active learning is occasionally inefficient in obtaining a better black-box function in materials science.
- First published at
- Creator
- Keyword
- Resource type
- Publisher
- Date published
- 06/11/2024
- Rights statement
- License description
- Journal
- Manuscript type
- Version of record (Published version)
- Language
- Funding reference
Items
Thumbnail | Title | Date Uploaded | Size | Visibility | Actions |
---|---|---|---|---|---|
|
s41598-024-76800-4 (1).pdf | 2.28 MB | MDR Open |
|