Improving alignment of dialogue agents via targeted human judgements A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022 | 338 | 2022 |
Representation in AI Evaluations AS Bergman, LA Hendricks, M Rauh, B Wu, W Agnew, M Kunesch, I Duan, ... Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023 | 13 | 2023 |