Data and its (dis) contents: A survey of dataset development and use in machine learning research

A Paullada, ID Raji, EM Bender, E Denton, A Hanna - Patterns, 2021 - cell.com
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …

A primer on artificial intelligence in plant digital phenomics: embarking on the data to insights journey

AL Harfouche, F Nakhle, AH Harfouche… - Trends in Plant …, 2023 - cell.com
Artificial intelligence (AI) has emerged as a fundamental component of global agricultural
research that is poised to impact on many aspects of plant science. In digital phenomics, AI …

Evaluating the social impact of generative ai systems in systems and society

I Solaiman, Z Talat, W Agnew, L Ahmad… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI systems across modalities, ranging from text, image, audio, and video, have
broad social impacts, but there exists no official standard for means of evaluating those …

On the genealogy of machine learning datasets: A critical history of ImageNet

E Denton, A Hanna, R Amironesei, A Smart… - Big Data & …, 2021 - journals.sagepub.com
In response to growing concerns of bias, discrimination, and unfairness perpetuated by
algorithmic systems, the datasets used to train and evaluate machine learning models have …

Assessing the fairness of ai systems: Ai practitioners' processes, challenges, and needs for support

M Madaio, L Egede, H Subramonyam… - Proceedings of the …, 2022 - dl.acm.org
Various tools and practices have been developed to support practitioners in identifying,
assessing, and mitigating fairness-related harms caused by AI systems. However, prior …

Studying up machine learning data: Why talk about bias when we mean power?

M Miceli, J Posada, T Yang - Proceedings of the ACM on Human …, 2022 - dl.acm.org
Research in machine learning (ML) has argued that models trained on incomplete or biased
datasets can lead to discriminatory outputs. In this commentary, we propose moving the …

The participatory turn in ai design: Theoretical foundations and the current state of practice

F Delgado, S Yang, M Madaio, Q Yang - … of the 3rd ACM Conference on …, 2023 - dl.acm.org
Despite the growing consensus that stakeholders affected by AI systems should participate
in their design, enormous variation and implicit disagreements exist among current …

Crowdworksheets: Accounting for individual and collective identities underlying crowdsourced dataset annotation

M Díaz, I Kivlichan, R Rosen, D Baker… - Proceedings of the …, 2022 - dl.acm.org
Human annotated data plays a crucial role in machine learning (ML) research and
development. However, the ethical considerations around the processes and decisions that …

Designing responsible ai: Adaptations of ux practice to meet responsible ai challenges

Q Wang, M Madaio, S Kane, S Kapania… - Proceedings of the …, 2023 - dl.acm.org
Technology companies continue to invest in efforts to incorporate responsibility in their
Artificial Intelligence (AI) advancements, while efforts to audit and regulate AI systems …

Exploring how machine learning practitioners (try to) use fairness toolkits

WH Deng, M Nagireddy, MSA Lee, J Singh… - Proceedings of the …, 2022 - dl.acm.org
Recent years have seen the development of many open-source ML fairness toolkits aimed
at helping ML practitioners assess and address unfairness in their systems. However, there …