Reglo: Provable neural network repair for global robustness properties

F Fu, Z Wang, W Zhou, Y Wang, J Fan… - Proceedings of the …, 2024 - ojs.aaai.org
Proceedings of the AAAI Conference on Artificial Intelligence, 2024ojs.aaai.org
We present REGLO, a novel methodology for repairing pretrained neural networks to satisfy
global robustness and individual fairness properties. A neural network is said to be globally
robust with respect to a given input region if and only if all the input points in the region are
locally robust. This notion of global robustness also captures the notion of individual fairness
as a special case. We prove that any counterexample to a global robustness property must
exhibit a corresponding large gradient. For ReLU networks, this result allows us to efficiently …
We present REGLO, a novel methodology for repairing pretrained neural networks to satisfy global robustness and individual fairness properties. A neural network is said to be globally robust with respect to a given input region if and only if all the input points in the region are locally robust. This notion of global robustness also captures the notion of individual fairness as a special case. We prove that any counterexample to a global robustness property must exhibit a corresponding large gradient. For ReLU networks, this result allows us to efficiently identify the linear regions that violate a given global robustness property. By formulating and solving a suitable robust convex optimization problem, REGLO then computes a minimal weight change that will provably repair these violating linear regions. Experimental results demonstrate that REGLO can significantly improve global robustness and individual fairness while maintaining performance across a wide variety of benchmarks.
ojs.aaai.org
以上显示的是最相近的搜索结果。 查看全部搜索结果