Bias in data‐driven artificial intelligence systems—An introductory survey

E Ntoutsi, P Fafalios, U Gadiraju… - … : Data Mining and …, 2020 - Wiley Online Library
Artificial Intelligence (AI)‐based systems are widely employed nowadays to make decisions
that have far‐reaching impact on individuals and society. Their decisions might affect …

A brief introduction to weakly supervised learning

ZH Zhou - National science review, 2018 - academic.oup.com
Supervised learning techniques construct predictive models by learning from a large
number of training examples, where each training example has a label indicating its ground …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Label poisoning is all you need

R Jha, J Hayase, S Oh - Advances in Neural Information …, 2023 - proceedings.neurips.cc
In a backdoor attack, an adversary injects corrupted data into a model's training dataset in
order to gain control over its predictions on images with a specific attacker-defined trigger. A …

Some like it hoax: Automated fake news detection in social networks

E Tacchini, G Ballarin, ML Della Vedova… - arXiv preprint arXiv …, 2017 - arxiv.org
In recent years, the reliability of information on the Internet has emerged as a crucial issue of
modern society. Social network sites (SNSs) have revolutionized the way in which …

Data programming: Creating large training sets, quickly

AJ Ratner, CM De Sa, S Wu… - Advances in neural …, 2016 - proceedings.neurips.cc
Large labeled training sets are the critical building blocks of supervised learning methods
and are key enablers of deep learning techniques. For some applications, creating labeled …

Truth inference in crowdsourcing: Is the problem solved?

Y Zheng, G Li, Y Li, C Shan, R Cheng - Proceedings of the VLDB …, 2017 - dl.acm.org
Crowdsourcing has emerged as a novel problem-solving paradigm, which facilitates
addressing problems that are hard for computers, eg, entity resolution and sentiment …

Robustness of conditional gans to noisy labels

KK Thekumparampil, A Khetan… - Advances in neural …, 2018 - proceedings.neurips.cc
We study the problem of learning conditional generators from noisy labeled samples, where
the labels are corrupted by random noise. A standard training of conditional GANs will not …

Adversarial learning targeting deep neural network classification: A comprehensive review of defenses against attacks

DJ Miller, Z Xiang, G Kesidis - Proceedings of the IEEE, 2020 - ieeexplore.ieee.org
With wide deployment of machine learning (ML)-based systems for a variety of applications
including medical, military, automotive, genomic, multimedia, and social networking, there is …

Quality control in crowdsourcing: A survey of quality attributes, assessment techniques, and assurance actions

F Daniel, P Kucherbaev, C Cappiello… - ACM Computing …, 2018 - dl.acm.org
Crowdsourcing enables one to leverage on the intelligence and wisdom of potentially large
groups of individuals toward solving problems. Common problems approached with …