“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

A review on human–AI interaction in machine learning and insights for medical applications

M Maadi, H Akbarzadeh Khorshidi… - International journal of …, 2021 - mdpi.com
Objective: To provide a human–Artificial Intelligence (AI) interaction review for Machine
Learning (ML) applications to inform how to best combine both human domain expertise and …

Jury learning: Integrating dissenting voices into machine learning models

ML Gordon, MS Lam, JS Park, K Patel… - Proceedings of the …, 2022 - dl.acm.org
Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks
ranging from online comment toxicity to misinformation detection to medical diagnosis …

“Dave... I can assure you... that it's going to be all right...” A definition, case for, and survey of algorithmic assurances in human-autonomy trust relationships

BW Israelsen, NR Ahmed - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
People who design, use, and are affected by autonomous artificially intelligent agents want
to be able to trust such agents—that is, to know that these agents will perform correctly, to …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Improving fairness in machine learning systems: What do industry practitioners need?

K Holstein, J Wortman Vaughan, H Daumé III… - Proceedings of the …, 2019 - dl.acm.org
The potential for machine learning (ML) systems to amplify social inequities and unfairness
is receiving increasing popular and academic attention. A surge of recent work has focused …

The disagreement deconvolution: Bringing machine learning performance metrics in line with reality

ML Gordon, K Zhou, K Patel, T Hashimoto… - Proceedings of the …, 2021 - dl.acm.org
Machine learning classifiers for human-facing tasks such as comment toxicity and
misinformation often score highly on metrics such as ROC AUC but are received poorly in …

The future of human-AI collaboration: a taxonomy of design knowledge for hybrid intelligence systems

D Dellermann, A Calma, N Lipusch, T Weber… - arXiv preprint arXiv …, 2021 - arxiv.org
Recent technological advances, especially in the field of machine learning, provide
astonishing progress on the road towards artificial general intelligence. However, tasks in …

Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions

M Parmar, S Mishra, M Geva, C Baral - arXiv preprint arXiv:2205.00415, 2022 - arxiv.org
In recent years, progress in NLU has been driven by benchmarks. These benchmarks are
typically collected by crowdsourcing, where annotators write examples based on annotation …

Labelling instructions matter in biomedical image analysis

T Rädsch, A Reinke, V Weru, MD Tizabi… - Nature Machine …, 2023 - nature.com
Biomedical image analysis algorithm validation depends on high-quality annotation of
reference datasets, for which labelling instructions are key. Despite their importance, their …