Making it possible for the auditing of ai: A systematic review of ai audits and ai auditability

Y Li, S Goel - Information Systems Frontiers, 2024 - Springer
Artificial intelligence (AI) technologies have become the key driver of innovation in society.
However, numerous vulnerabilities of AI systems can lead to negative consequences for …

Evaluating the social impact of generative ai systems in systems and society

I Solaiman, Z Talat, W Agnew, L Ahmad… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI systems across modalities, ranging from text, image, audio, and video, have
broad social impacts, but there exists no official standard for means of evaluating those …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

Designing responsible ai: Adaptations of ux practice to meet responsible ai challenges

Q Wang, M Madaio, S Kane, S Kapania… - Proceedings of the …, 2023 - dl.acm.org
Technology companies continue to invest in efforts to incorporate responsibility in their
Artificial Intelligence (AI) advancements, while efforts to audit and regulate AI systems …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Investigating Practices and Opportunities for Cross-functional Collaboration around AI Fairness in Industry Practice

WH Deng, N Yildirim, M Chang, M Eslami… - Proceedings of the …, 2023 - dl.acm.org
An emerging body of research indicates that ineffective cross-functional collaboration–the
interdisciplinary work done by industry practitioners across roles–represents a major barrier …

Birds, bats and beyond: Evaluating generalization in bioacoustics models

B Van Merriënboer, J Hamer, V Dumoulin… - Frontiers in Bird …, 2024 - frontiersin.org
In the context of passive acoustic monitoring (PAM) better models are needed to reliably
gain insights from large amounts of raw, unlabeled data. Bioacoustics foundation models …

Against predictive optimization: On the legitimacy of decision-making algorithms that optimize predictive accuracy

A Wang, S Kapoor, S Barocas… - ACM Journal on …, 2024 - dl.acm.org
We formalize predictive optimization, a category of decision-making algorithms that use
machine learning (ML) to predict future outcomes of interest about individuals. For example …

Evaluation gaps in machine learning practice

B Hutchinson, N Rostamzadeh, C Greer… - Proceedings of the …, 2022 - dl.acm.org
Forming a reliable judgement of a machine learning (ML) model's appropriateness for an
application ecosystem is critical for its responsible use, and requires considering a broad …

Open problems in technical ai governance

A Reuel, B Bucknall, S Casper, T Fist, L Soder… - arXiv preprint arXiv …, 2024 - arxiv.org
AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …