作者
Bhavya Kailkhura, Brian Gallagher, Sookyung Kim, Anna Hiszpanski, T Yong-Jin Han
发表日期
2019/11/14
期刊
npj Computational Materials
卷号
5
期号
1
页码范围
108
出版商
Nature Publishing Group UK
简介
Despite ML’s impressive performance in commercial applications, several unique challenges exist when applying ML in materials science applications. In such a context, the contributions of this work are twofold. First, we identify common pitfalls of existing ML techniques when learning from underrepresented/imbalanced material data. Specifically, we show that with imbalanced data, standard methods for assessing quality of ML models break down and lead to misleading conclusions. Furthermore, we find that the model’s own confidence score cannot be trusted and model introspection methods (using simpler models) do not help as they result in loss of predictive performance (reliability-explainability trade-off). Second, to overcome these challenges, we propose a general-purpose explainable and reliable machine-learning framework. Specifically, we propose a generic pipeline that employs an ensemble of …
引用总数
20192020202120222023202451735353321
学术搜索中的文章