Empirical analysis of multi-task learning for reducing model bias in toxic comment detection

Authors

Ameya Vaidya, Feng Mai, Yue Ning

Publication date

2019/9/21

Journal

arXiv preprint arXiv:1909.09758

Description

With the recent rise of toxicity in online conversations on social media platforms, using modern machine learning algorithms for toxic comment detection has become a central focus of many online applications. Researchers and companies have developed a variety of models to identify toxicity in online conversations, reviews, or comments, with mixed success. However, many existing approaches have learned to incorrectly treat non-toxic comments that contain certain trigger words (e.g., gay, lesbian, black, muslim) as potentially toxic. In this paper, we evaluate several state-of-the-art models with a specific focus on reducing model bias towards these commonly attacked identity groups. We propose a multi-task learning model with an attention layer that jointly learns to predict the toxicity of a comment as well as the identities present in the comment in order to reduce this bias. We then compare our model to an array of shallow and deep-learning models using metrics designed specifically to test for unintended model bias within these identity groups.
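
The joint objective described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the encoder, the dimensions, or how the two losses are combined, so the attention pooling, the random weights, the three identity labels, and the `aux_weight` parameter below are all assumptions chosen for illustration. It shows only the general shape of the idea, where one shared attention layer feeds both a toxicity head and a set of identity heads, and the two losses are summed.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attention_pool(token_vectors, query):
    """Softmax-weighted average of token vectors (a simple attention layer)."""
    scores = [dot(vec, query) for vec in token_vectors]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(token_vectors[0])
    pooled = [
        sum(w * vec[i] for w, vec in zip(weights, token_vectors))
        for i in range(dim)
    ]
    return pooled, weights


def binary_cross_entropy(p, y, eps=1e-9):
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))


def multitask_forward(token_vectors, params, toxic_label, identity_labels,
                      aux_weight=0.5):
    """Shared attention encoder with a toxicity head and one head per identity.

    aux_weight is a hypothetical knob for weighting the identity loss.
    """
    pooled, weights = attention_pool(token_vectors, params["query"])
    p_toxic = sigmoid(dot(pooled, params["toxic_w"]))
    p_identity = [sigmoid(dot(pooled, w)) for w in params["identity_w"]]

    loss_toxic = binary_cross_entropy(p_toxic, toxic_label)
    loss_identity = sum(
        binary_cross_entropy(p, y) for p, y in zip(p_identity, identity_labels)
    ) / len(p_identity)

    return {
        "p_toxic": p_toxic,
        "p_identity": p_identity,
        "attention": weights,
        "loss": loss_toxic + aux_weight * loss_identity,
    }


def demo():
    """Run one forward pass on random toy data (untrained weights)."""
    rng = random.Random(0)
    dim, n_tokens, n_identities = 4, 5, 3

    def rand_vec():
        return [rng.uniform(-1, 1) for _ in range(dim)]

    params = {
        "query": rand_vec(),
        "toxic_w": rand_vec(),
        "identity_w": [rand_vec() for _ in range(n_identities)],
    }
    tokens = [rand_vec() for _ in range(n_tokens)]
    # A non-toxic comment that mentions the first identity group.
    return multitask_forward(tokens, params, toxic_label=0,
                             identity_labels=[1, 0, 0])
```

Calling `demo()` returns the toxicity probability, the per-identity probabilities, the attention weights over tokens, and the combined loss for a single toy comment. In a trained model, gradients from both loss terms would update the shared attention parameters, which is the mechanism multi-task setups of this kind rely on.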

Total citations

Cited by 19

Citations per year: 2020: 8, 2021: 5, 2022: 4, 2023: 1, 2024: 1
