The disagreement deconvolution: Bringing machine learning performance metrics in line with reality

ML Gordon, K Zhou, K Patel, T Hashimoto… - Proceedings of the …, 2021 - dl.acm.org
Machine learning classifiers for human-facing tasks such as comment toxicity and
misinformation often score highly on metrics such as ROC AUC but are received poorly in …

Eliciting and learning with soft labels from every annotator

KM Collins, U Bhatt, A Weller - Proceedings of the AAAI conference on …, 2022 - ojs.aaai.org
The labels used to train machine learning (ML) models are of paramount importance.
Typically for ML classification tasks, datasets contain hard labels, yet learning using soft …

Efficient elicitation approaches to estimate collective crowd answers

JJY Chung, JY Song, S Kutty, S Hong, J Kim… - Proceedings of the …, 2019 - dl.acm.org
When crowdsourcing the creation of machine learning datasets, statistical distributions that
capture diverse answers can represent ambiguous data better than a single best answer …

Online spatial crowdsensing with expertise-aware truth inference and task allocation

X Wang, R Jia, L Fu, H Jin, X Tian… - IEEE Journal on …, 2021 - ieeexplore.ieee.org
Emerging crowdsensing paradigm enables a large number of sensing applications, where
much attention is drawn to the fundamental problems of data collection and truth inference …

Learning personalized decision support policies

U Bhatt, V Chen, KM Collins, P Kamalaruban… - arXiv preprint arXiv …, 2023 - arxiv.org
Individual human decision-makers may benefit from different forms of support to improve
decision outcomes. However, a key question is which form of support will lead to accurate …

Eliciting confidence for improving crowdsourced audio annotations

AE Méndez Méndez, M Cartwright, JP Bello… - Proceedings of the ACM …, 2022 - dl.acm.org
In this work we explore confidence elicitation methods for crowdsourcing" soft" labels, eg,
probability estimates, to reduce the annotation costs for domains with ambiguous data …

Crowdsourcing system for numerical tasks based on latent topic aware worker reliability

Z Shi, S Jiang, L Zhang, Y Du… - IEEE INFOCOM 2021 …, 2021 - ieeexplore.ieee.org
Crowdsourcing is a widely adopted way for various labor-intensive tasks. One of the core
problems in crowdsourcing systems is how to assign tasks to most suitable workers for better …

Implementing active learning in cybersecurity: Detecting anomalies in redacted emails

MH Chung, L Wang, S Li, Y Yang, C Giang… - arXiv preprint arXiv …, 2023 - arxiv.org
Research on email anomaly detection has typically relied on specially prepared datasets
that may not adequately reflect the type of data that occurs in industry settings. In our …

On the efficiency of data collection for crowdsourced classification

E Manino, L Tran-Thanh, N Jennings - 2018 - eprints.soton.ac.uk
The quality of crowdsourced data is often highly variable. For this reason, it is common to
collect redundant data and use statistical methods to aggregate it. Empirical studies show …

Trustworthy Machine Learning: From Algorithmic Transparency to Decision Support

U Bhatt - 2024 - repository.cam.ac.uk
Developing machine learning models worthy of decision-maker trust is crucial to using
models in practice. Algorithmic transparency tools, such as explainability and uncertainty …