查看文章

aaai.org 中的 [PDF]

Reglo: Provable neural network repair for global robustness properties

作者

Feisi Fu, Zhilu Wang, Weichao Zhou, Yixuan Wang, Jiameng Fan, Chao Huang, Qi Zhu, Xin Chen, Wenchao Li

发表日期

2024/3/24

期刊

Proceedings of the AAAI Conference on Artificial Intelligence

卷号

期号

页码范围

12061-12071

简介

We present REGLO, a novel methodology for repairing pretrained neural networks to satisfy global robustness and individual fairness properties. A neural network is said to be globally robust with respect to a given input region if and only if all the input points in the region are locally robust. This notion of global robustness also captures the notion of individual fairness as a special case. We prove that any counterexample to a global robustness property must exhibit a corresponding large gradient. For ReLU networks, this result allows us to efficiently identify the linear regions that violate a given global robustness property. By formulating and solving a suitable robust convex optimization problem, REGLO then computes a minimal weight change that will provably repair these violating linear regions. Experimental results demonstrate that REGLO can significantly improve global robustness and individual fairness while maintaining performance across a wide variety of benchmarks.

引用总数

被引用次数：6

202320244 1

学术搜索中的文章

Reglo: Provable neural network repair for global robustness properties

F Fu, Z Wang, W Zhou, Y Wang, J Fan, C Huang, Q Zhu… - Proceedings of the AAAI Conference on Artificial …, 2024

被引用次数：6 相关文章所有 2 个版本