Computational reproducibility of Jupyter notebooks from biomedical publications

S Samuel, D Mietchen - GigaScience, 2024 - academic.oup.com
Background Jupyter notebooks facilitate the bundling of executable code with its
documentation and output in one interactive environment, and they represent a popular …

Distilkaggle: A distilled dataset of kaggle jupyter notebooks

M Mostafavi Ghahfarokhi, A Asgari… - Proceedings of the 21st …, 2024 - dl.acm.org
Jupyter notebooks have become indispensable tools for data analysis and processing in
various domains. However, despite their widespread use, there is a notable research gap in …

Static analysis driven enhancements for comprehension in machine learning notebooks

APS Venkatesh, S Sabu, M Chekkapalli… - Empirical Software …, 2024 - Springer
Jupyter notebooks have emerged as the predominant tool for data scientists to develop and
share machine learning solutions, primarily using Python as the programming language …

TypeEvalPy: A Micro-benchmarking Framework for Python Type Inference Tools

AP Shivarpatna Venkatesh, S Sabu, J Wang… - Proceedings of the …, 2024 - dl.acm.org
In light of the growing interest in type inference research for Python, both researchers and
practitioners require a standardized process to assess the performance of various type …

The Emergence of Large Language Models in Static Analysis: A First Look through Micro-Benchmarks

APS Venkatesh, S Sabu, AM Mir, S Reis… - Proceedings of the 2024 …, 2024 - dl.acm.org
The application of Large Language Models (LLMs) in software engineering, particularly in
static analysis tasks, represents a paradigm shift in the field. In this paper, we investigate the …

Predicting the Understandability of Computational Notebooks through Code Metrics Analysis

MM Ghahfarokhi, A Asadi, A Asgari… - arXiv preprint arXiv …, 2024 - arxiv.org
Computational notebooks have become the primary coding environment for data scientists.
However, research on their code quality is still emerging, and the code shared is often of …

Bloat beneath Python's Scales: A Fine-Grained Inter-Project Dependency Analysis

GP Drosos, T Sotiropoulos, D Spinellis… - Proceedings of the …, 2024 - dl.acm.org
Modern programming languages promote software reuse via package managers that
facilitate the integration of inter-dependent software libraries. Software reuse comes with the …

A Flexible Cell Classification for ML Projects in Jupyter Notebooks

M Perez, S Aydin, H Lichter - arXiv preprint arXiv:2403.07562, 2024 - arxiv.org
Jupyter Notebook is an interactive development environment commonly used for rapid
experimentation of machine learning (ML) solutions. Describing the ML activities performed …

A Roadmap for Enriching Jupyter Notebooks Documentation with Kaggle Data

M Mostafavi Ghahfarokhi, H Jahantigh… - Proceedings of the …, 2024 - dl.acm.org
Recent advancements in AI and data science have led to the increased use of Jupyter
notebooks. As such, various AI-Based automated tools have been also developed to …

Hidden Gems in the Rough: Computational Notebooks as an Uncharted Oasis for IDEs

S Titov, K Grotov, A Prasad S. Venkatesh - Proceedings of the 1st ACM …, 2024 - dl.acm.org
In this paper, we outline potential ways for the further development of computational
notebooks in Integrated Development Environments (IDEs). We discuss notebooks …