Automating exploratory data analysis via machine learning: An overview

T Milo, A Somech - Proceedings of the 2020 ACM SIGMOD international …, 2020 - dl.acm.org
Exploratory Data Analysis (EDA) is an important initial step for any knowledge discovery
process, in which data scientists interactively explore unfamiliar datasets by issuing a …

Software engineering for AI-based systems: a survey

S Martínez-Fernández, J Bogner, X Franch… - ACM Transactions on …, 2022 - dl.acm.org
AI-based systems are software systems with functionalities enabled by at least one AI
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …

On the design of ai-powered code assistants for notebooks

AM McNutt, C Wang, RA Deline… - Proceedings of the 2023 …, 2023 - dl.acm.org
AI-powered code assistants, such as Copilot, are quickly becoming a ubiquitous component
of contemporary coding contexts. Among these environments, computational notebooks …

Improving fairness in machine learning systems: What do industry practitioners need?

K Holstein, J Wortman Vaughan, H Daumé III… - Proceedings of the …, 2019 - dl.acm.org
The potential for machine learning (ML) systems to amplify social inequities and unfairness
is receiving increasing popular and academic attention. A surge of recent work has focused …

Human-AI collaboration in data science: Exploring data scientists' perceptions of automated AI

D Wang, JD Weisz, M Muller, P Ram, W Geyer… - Proceedings of the …, 2019 - dl.acm.org
The rapid advancement of artificial intelligence (AI) is changing our lives in many ways. One
application domain is data science. New techniques in automating the creation of AI, known …

How do data science workers collaborate? roles, workflows, and tools

AX Zhang, M Muller, D Wang - Proceedings of the ACM on Human …, 2020 - dl.acm.org
Today, the prominence of data science within organizations has given rise to teams of data
science workers collaborating on extracting insights from data, as opposed to individual data …

How ai developers overcome communication challenges in a multidisciplinary team: A case study

D Piorkowski, S Park, AY Wang, D Wang… - Proceedings of the …, 2021 - dl.acm.org
The development of AI applications is a multidisciplinary effort, involving multiple roles
collaborating with the AI developers, an umbrella term we use to include data scientists and …

A large-scale study about quality and reproducibility of jupyter notebooks

JF Pimentel, L Murta, V Braganholo… - 2019 IEEE/ACM 16th …, 2019 - ieeexplore.ieee.org
Jupyter Notebooks have been widely adopted by many different communities, both in
science and industry. They support the creation of literate programming documents that …

What's wrong with computational notebooks? Pain points, needs, and design opportunities

S Chattopadhyay, I Prasad, AZ Henley… - Proceedings of the …, 2020 - dl.acm.org
Computational notebooks-such as Azure, Databricks, and Jupyter-are a popular, interactive
paradigm for data scientists to author code, analyze data, and interleave visualizations, all …

Wrex: A unified programming-by-example interaction for synthesizing readable code for data scientists

I Drosos, T Barik, PJ Guo, R DeLine… - Proceedings of the 2020 …, 2020 - dl.acm.org
Data wrangling is a difficult and time-consuming activity in computational notebooks, and
existing wrangling tools do not fit the exploratory workflow for data scientists in these …