Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge …
S Dev, J Goyal, D Tewari, S Dave… - Advances in Neural …, 2024 - proceedings.neurips.cc
With rapid development and deployment of generative language models in global settings, there is an urgent need to also scale our measurements of harm, not just in the number and …
Recent studies have shown that Text-to-Image (T2I) model generations can reflect social stereotypes present in the real world. However, existing approaches for evaluating …
Current datasets for unwanted social bias auditing are limited to studying protected demographic features such as race and gender. In this work, we introduce a comprehensive …
Maarten Sap - Publications Maarten Sap Publications Contact About Me CV Notes/Blogposts Applying to grad school Giving feedback for talks Notes from my 2020 Academic Job Search …
A Leidinger, R Rogers - Proceedings of the AAAI/ACM Conference on AI …, 2024 - ojs.aaai.org
With the widespread availability of LLMs since the release of ChatGPT and increased public scrutiny, commercial model development appears to have focused their efforts …
S Wang, T Hu, H Xiao, Y Li, C Zhang… - … Journal of Digital …, 2024 - Taylor & Francis
The launch of large language models (LLMs) like ChatGPT in late 2022 and the anticipated arrival of future GPT-x iterations have marked the beginning of the generative artificial …
This paper introduces the concept of actionability in the context of bias measures in natural language processing (NLP). We define actionability as the degree to which a …
Large-scale deployment of large language models (LLMs) in various applications, such as chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …