Challenges in deploying machine learning: a survey of case studies

A Paleyes, RG Urma, ND Lawrence - ACM Computing Surveys, 2022 - dl.acm.org
In recent years, machine learning has transitioned from a field of academic research interest
to a field capable of solving real-world business problems. However, the deployment of …

A survey on federated learning systems: Vision, hype and reality for data privacy and protection

Q Li, Z Wen, Z Wu, S Hu, N Wang, Y Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
As data privacy increasingly becomes a critical societal concern, federated learning has
been a hot research topic in enabling the collaborative training of machine learning models …

“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI

N Sambasivan, S Kapania, H Highfill… - Proceedings of the …, 2021 - dl.acm.org
AI models are increasingly applied in high-stakes domains like health and conservation.
Data quality carries an elevated significance in high-stakes AI due to its heightened …

Data collection and quality challenges in deep learning: A data-centric AI perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

Towards accountability for machine learning datasets: Practices from software engineering and infrastructure

B Hutchinson, A Smart, A Hanna, E Denton… - Proceedings of the …, 2021 - dl.acm.org
Datasets that power machine learning are often used, shared, and reused with little visibility
into the processes of deliberation that led to their creation. As artificial intelligence systems …

A survey on data collection for machine learning: a big data-AI integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Collaboration challenges in building ML-enabled systems: Communication, documentation, engineering, and process

N Nahar, S Zhou, G Lewis, C Kästner - Proceedings of the 44th …, 2022 - dl.acm.org
The introduction of machine learning (ML) components in software projects has created the
need for software engineers to collaborate with data scientists and other specialists. While …

Assuring the machine learning lifecycle: Desiderata, methods, and challenges

R Ashmore, R Calinescu, C Paterson - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Machine learning has evolved into an enabling technology for a wide range of highly
successful applications. The potential for this success to continue and accelerate has placed …

A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …

The effects of data quality on machine learning performance

L Budach, M Feuerpfeil, N Ihde, A Nathansen… - arXiv preprint arXiv …, 2022 - arxiv.org
Modern artificial intelligence (AI) applications require large quantities of training and test
data. This need creates critical challenges not only concerning the availability of such data …