Workflow analysis of data science code in public GitHub repositories

D Ramasamy, C Sarasua, A Bacchelli… - Empirical Software …, 2023 - Springer
Despite the ubiquity of data science, we are far from rigorously understanding how coding in
data science is performed. Even though the scientific literature has hinted at the iterative and …

From data to insight: work practices of analysts in the enterprise

E Kandogan, A Balakrishnan… - IEEE computer …, 2014 - ieeexplore.ieee.org
With greater availability of data, businesses are increasingly becoming data-driven
enterprises, establishing standards for data acquisition, processing, infrastructure, and …

Interactive analysis of big data

J Heer, S Kandel - XRDS: Crossroads, The ACM Magazine for Students, 2012 - dl.acm.org
Interactive analysis of big data Page 1 XRDS • fall 2012 • Vol.19 • No.1 50 Interactive Analysis
of Big Data Big data is all the rage. Computer scientists in databases, distributed systems …

Where do stories come from? examining the exploration process in investigative data journalism

D Showkat, EPS Baumer - Proceedings of the ACM on Human-Computer …, 2021 - dl.acm.org
Investigative data journalists work with a variety of data sources to tell a story. Though prior
work has indicated that there is a close relationship between journalists' data work practices …

Table scraps: an actionable framework for multi-table data wrangling from an artifact study of computational journalism

S Kasica, C Berret, T Munzner - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
For the many journalists who use data and computation to report the news, data wrangling is
an integral part of their work. Despite an abundance of literature on data wrangling in the …

A qualitative interview study of distributed tracing visualisation: A characterisation of challenges and opportunities

T Davidson, E Wall, J Mace - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Distributed tracing tools have emerged in recent years to enable operators of modern
internet applications to troubleshoot cross-component problems in deployed applications …

Video big data analytics in the cloud: A reference architecture, survey, opportunities, and open research issues

A Alam, I Ullah, YK Lee - IEEE Access, 2020 - ieeexplore.ieee.org
The proliferation of multimedia devices over the Internet of Things (IoT) generates an
unprecedented amount of data. Consequently, the world has stepped into the era of big …

[PDF][PDF] Data visualization: enhancing big data more adaptable and valuable

AS Fiaz, N Asha, D Sumathi, AS Navaz - International Journal of …, 2016 - academia.edu
The main focus of this paper is on Big Data and Data Visualization techniques which
together make the usage of data analytics more efficient and valuable. The term 'Big Data', is …

Progressive data science: Potential and challenges

C Turkay, N Pezzotti, C Binnig, H Strobelt… - arXiv preprint arXiv …, 2018 - arxiv.org
Data science requires time-consuming iterative manual activities. In particular, activities
such as data selection, preprocessing, transformation, and mining, highly depend on …

[PDF][PDF] Self-Service Data Preparation: Research to Practice.

JM Hellerstein, J Heer, S Kandel - IEEE Data Eng. Bull., 2018 - scholar.archive.org
It is widely accepted that the majority of time in any data analysis project is devoted to
preparing the data [25]. In 2012, noted data science leader DJ Patil put the fraction of time …