Bias out-of-the-box: An empirical analysis of intersectional occupational biases in popular generative language models HR Kirk, Y Jun, F Volpin, H Iqbal, E Benussi, F Dreyer, A Shtedritski, ... Advances in Neural Information Processing Systems 34, 2611-2624, 2021 | 152 | 2021 |
Memes in the wild: Assessing the generalizability of the hateful memes challenge dataset HR Kirk, Y Jun, P Rauba, G Wachtel, R Li, X Bai, N Broestl, M Doff-Sotta, ... arXiv preprint arXiv:2107.04313, 2021 | 26 | 2021 |
Trusted source alignment in large language models V Bashlovkina, Z Kuang, R Matthews, E Clifford, Y Jun, WW Cohen, ... arXiv preprint arXiv:2311.06697, 2023 | 3 | 2023 |
Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models Y Asano, E Benussi, F Dreyer, H Iqbal, HR Kirk, A Shtedritski, F Volpin, ... 4San Diego, CANeural Information Processing Systems Foundation, 2022 | | 2022 |