Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis

N Lee, C Jung, J Myung, J Jin… - Proceedings of the …, 2024 - aclanthology.org
Warning**: this paper contains content that may be offensive or upsetting.* Most hate speech
datasets neglect the cultural diversity within a single language, resulting in a critical …

Crehate: Cross-cultural re-annotation of english hate speech dataset

N Lee, C Jung, J Myung, J Jin, J Kim, A Oh - arXiv preprint arXiv …, 2023 - arxiv.org
English datasets predominantly reflect the perspectives of certain nationalities, which can
lead to cultural biases in models and datasets. This is particularly problematic in tasks …

Eyes don't lie: Subjective hate annotation and detection with gaze

Ö Alaçam, S Hoeken, S Zarrieß - Proceedings of the 2024 …, 2024 - aclanthology.org
Hate speech is a complex and subjective phenomenon. In this paper, we present a dataset
(GAZE4HATE) that provides gaze data collected in a hate speech annotation experiment …

Exploring Amharic hate speech data collection and classification approaches

AA Ayele, SM Yimam, TD Belay, T Asfaw… - Proceedings of the …, 2023 - aclanthology.org
In this paper, we present a study of efficient data selection and annotation strategies for
Amharic hate speech. We also build various classification models and investigate the …

AU_NLP at SemEval-2023 task 10: Explainable detection of online sexism using fine-tuned RoBERTa

A Das, N Raychawdhary, T Bhattacharya… - Proceedings of the …, 2023 - aclanthology.org
Social media is a concept developed to link people and make the globe smaller. But it has
recently developed into a center for sexist memes that target especially women. As a result …

Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models

MD Bui, K von der Wense, A Lauscher - arXiv preprint arXiv:2411.03888, 2024 - arxiv.org
Warning: this paper contains content that may be offensive or upsetting Hate speech
moderation on global platforms poses unique challenges due to the multimodal and …

Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset

J Goldzycher, P Röttger, G Schneider - arXiv preprint arXiv:2403.19559, 2024 - arxiv.org
Hate speech detection models are only as good as the data they are trained on. Datasets
sourced from social media suffer from systematic gaps and biases, leading to unreliable …

[PDF][PDF] The end of the rehydration era-the problem of sharing harmful twitter research data

D Assenmacher, I Sen… - 2nd Workshop …, 2023 - workshop-proceedings.icwsm.org
Social media research is currently confronted with a datasharing problem, as social media
platforms prohibit full data distribution in their terms of service. Until recent changes to the …

Disinformation detection: Knowledge infusion with transfer learning and visualizations

M Schütz - European Conference on Information Retrieval, 2023 - Springer
The automatic detection of disinformation has gained an increased focus by the research
community during the last years. The spread of false information can be an issue for political …

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

SM Yimam, D Dementieva, T Fischer… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite regulations imposed by nations and social media platforms, such as recent EU
regulations targeting digital violence, abusive content persists as a significant challenge …