Learning from the worst: Dynamically generated datasets to improve online hate detection

B Vidgen, T Thrush, Z Waseem, D Kiela - arXiv preprint arXiv:2012.15761, 2020 - arxiv.org
We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …

Developing an online hate classifier for multiple social media platforms

J Salminen, M Hopf, SA Chowdhury, S Jung… - … -centric Computing and …, 2020 - Springer
The proliferation of social media enables people to express their opinions widely online.
However, at the same time, this has resulted in the emergence of conflict and hate, making …

A survey of race, racism, and anti-racism in NLP

A Field, SL Blodgett, Z Waseem, Y Tsvetkov - arXiv preprint arXiv …, 2021 - arxiv.org
Despite inextricable ties between race and language, little work has considered race in NLP
research and development. In this work, we survey 79 papers from the ACL anthology that …

CONAN--COunter NArratives through Nichesourcing: a multilingual dataset of responses to fight online hate speech

YL Chung, E Kuzmenko, SS Tekiroglu… - arXiv preprint arXiv …, 2019 - arxiv.org
Although there is an unprecedented effort to provide adequate responses in terms of laws
and policies to hate content on social media platforms, dealing with hatred online is still a …

Racism is a virus: Anti-asian hate and counterhate in social media during the covid-19 crisis

C Ziems, B He, S Soni, S Kumar - 2020 - europepmc.org
The spread of COVID-19 has sparked racism, hate, and xenophobia in social media
targeted at Chinese and broader Asian communities. However, little is known about how …

Vulnerable community identification using hate speech detection on social media

Z Mossie, JH Wang - Information Processing & Management, 2020 - Elsevier
With the rapid development in mobile computing and Web technologies, online hate speech
has been increasingly spread in social network platforms since it's easy to post any …

Thou shalt not hate: Countering online hate speech

B Mathew, P Saha, H Tharad, S Rajgaria… - Proceedings of the …, 2019 - aaai.org
Hate content in social media is ever increasing. While Facebook, Twitter, Google have
attempted to take several steps to tackle the hateful content, they have mostly been …

Just say no: Analyzing the stance of neural dialogue generation in offensive contexts

A Baheti, M Sap, A Ritter, M Riedl - arXiv preprint arXiv:2108.11830, 2021 - arxiv.org
Dialogue models trained on human conversations inadvertently learn to generate toxic
responses. In addition to producing explicitly offensive utterances, these models can also …

Anatomy of online hate: developing a taxonomy and machine learning models for identifying and classifying hate in online news media

J Salminen, H Almerekhi, M Milenković… - Proceedings of the …, 2018 - ojs.aaai.org
Online social media platforms generally attempt to mitigate hateful expressions, as these
comments can be detrimental to the health of the community. However, automatically …

A just and comprehensive strategy for using NLP to address online abuse

D Jurgens, E Chandrasekharan, L Hemphill - arXiv preprint arXiv …, 2019 - arxiv.org
Online abusive behavior affects millions and the NLP community has attempted to mitigate
this problem by developing technologies to detect abuse. However, current methods have …