查看文章

arxiv.org 中的 [PDF]

Augmented Fairness: An Interpretable Model Augmenting Decision-Makers' Fairness

作者

Tong Wang, Maytal Saar-Tsechansky

发表日期

2020/11/17

期刊

arXiv preprint arXiv:2011.08398

简介

We propose a model-agnostic approach for mitigating the prediction bias of a black-box decision-maker, and in particular, a human decision-maker. Our method detects in the feature space where the black-box decision-maker is biased and replaces it with a few short decision rules, acting as a "fair surrogate". The rule-based surrogate model is trained under two objectives, predictive performance and fairness. Our model focuses on a setting that is common in practice but distinct from other literature on fairness. We only have black-box access to the model, and only a limited set of true labels can be queried under a budget constraint. We formulate a multi-objective optimization for building a surrogate model, where we simultaneously optimize for both predictive performance and bias. To train the model, we propose a novel training algorithm that combines a nondominated sorting genetic algorithm with active learning. We test our model on public datasets where we simulate various biased "black-box" classifiers (decision-makers) and apply our approach for interpretable augmented fairness.

引用总数

被引用次数：12

20212022202320241 5 5 1

学术搜索中的文章

Augmented Fairness: An Interpretable Model Augmenting Decision-Makers' Fairness

T Wang, M Saar-Tsechansky - arXiv preprint arXiv:2011.08398, 2020

被引用次数：12 相关文章所有 2 个版本