How do data science workers collaborate? Roles, workflows, and tools

AX Zhang, M Muller, D Wang - Proceedings of the ACM on Human …, 2020 - dl.acm.org
Today, the prominence of data science within organizations has given rise to teams of data
science workers collaborating on extracting insights from data, as opposed to individual data …

What's wrong with computational notebooks? Pain points, needs, and design opportunities

S Chattopadhyay, I Prasad, AZ Henley… - Proceedings of the …, 2020 - dl.acm.org
Computational notebooks, such as Azure, Databricks, and Jupyter, are a popular, interactive
paradigm for data scientists to author code, analyze data, and interleave visualizations, all …

Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems

J Drozdal, J Weisz, D Wang, G Dass, B Yao… - Proceedings of the 25th …, 2020 - dl.acm.org
We explore trust in a relatively new area of data science: Automated Machine Learning
(AutoML). In AutoML, AI methods are used to generate and optimize machine learning …

How data scientists use computational notebooks for real-time collaboration

AY Wang, A Mittal, C Brooks, S Oney - … of the ACM on Human-Computer …, 2019 - dl.acm.org
Effective collaboration in data science can leverage domain expertise from each team
member and thus improve the quality and efficiency of the work. Computational notebooks …

Designing ground truth and the social life of labels

M Muller, CT Wolf, J Andres, M Desmond… - Proceedings of the …, 2021 - dl.acm.org
Ground-truth labeling is an important activity in machine learning. Many studies have
examined how crowdworkers apply labels to records in machine learning datasets …

B2: Bridging code and interactive visualization in computational notebooks

Y Wu, JM Hellerstein, A Satyanarayan - Proceedings of the 33rd Annual …, 2020 - dl.acm.org
Data scientists have embraced computational notebooks to author analysis code and
accompanying visualizations within a single document. Currently, although these media …

Documentation matters: Human-centered AI system to assist data science code documentation in computational notebooks

AY Wang, D Wang, J Drozdal, M Muller, S Park… - ACM Transactions on …, 2022 - dl.acm.org
Computational notebooks allow data scientists to express their ideas through a combination
of code and documentation. However, data scientists often pay attention only to the code …

Forgetting practices in the data sciences

M Muller, A Strohmayer - Proceedings of the 2022 CHI Conference on …, 2022 - dl.acm.org
HCI engages with data science through many topics and themes. Researchers have
addressed biased dataset problems, arguing that bad data can cause innocent software to …

Telling stories from computational notebooks: AI-assisted presentation slides creation for presenting data science work

C Zheng, D Wang, AY Wang, X Ma - … of the 2022 CHI Conference on …, 2022 - dl.acm.org
Creating presentation slides is a critical but time-consuming task for data scientists. While
researchers have proposed many AI techniques to lift data scientists' burden on data …

How much automation does a data scientist want?

D Wang, QV Liao, Y Zhang, U Khurana… - arXiv preprint arXiv …, 2021 - arxiv.org
Data science and machine learning (DS/ML) are at the heart of the recent advancements of
many Artificial Intelligence (AI) applications. There is an active research thread in AI, AutoAI, …