To seed or not to seed? an empirical analysis of usage of seeds for testing in machine learning projects

S Dutta, A Arunachalam… - 2022 IEEE Conference on …, 2022 - ieeexplore.ieee.org
Many Machine Learning (ML) algorithms are in-herently random in nature-executing them
using the same inputs may lead to slightly different results across different runs. Such …

Flex: fixing flaky tests in machine learning projects by updating assertion bounds

S Dutta, A Shi, S Misailovic - Proceedings of the 29th ACM Joint Meeting …, 2021 - dl.acm.org
Many machine learning (ML) algorithms are inherently random–multiple executions using
the same inputs may produce slightly different results each time. Randomness impacts how …

Balancing effectiveness and flakiness of non-deterministic machine learning tests

CS Xia, S Dutta, S Misailovic… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Testing Machine Learning (ML) projects is challenging due to inherent non-determinism of
various ML algorithms and the lack of reliable ways to compute reference results …

Tera: Optimizing stochastic regression tests in machine learning projects

S Dutta, J Selvam, A Jain, S Misailovic - Proceedings of the 30th ACM …, 2021 - dl.acm.org
The stochastic nature of many Machine Learning (ML) algorithms makes testing of ML tools
and libraries challenging. ML algorithms allow a developer to control their accuracy and run …

Do Automatic Test Generation Tools Generate Flaky Tests?

M Gruber, MF Roslan, O Parry, F Scharnböck… - Proceedings of the 46th …, 2024 - dl.acm.org
Non-deterministic test behavior, or flakiness, is common and dreaded among developers.
Researchers have studied the issue and proposed approaches to mitigate it. However, the …

Detecting flaky tests in probabilistic and machine learning applications

S Dutta, A Shi, R Choudhary, Z Zhang, A Jain… - Proceedings of the 29th …, 2020 - dl.acm.org
Probabilistic programming systems and machine learning frameworks like Pyro, PyMC3,
TensorFlow, and PyTorch provide scalable and efficient primitives for inference and training …

De-flake your tests: Automatically locating root causes of flaky tests in code at google

C Ziftci, D Cavalcanti - 2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org
Regression testing is a critical part of software development and maintenance. It ensures
that modifications to existing software do not break existing behavior and functionality. One …

What is the vocabulary of flaky tests? an extended replication

BHP Camara, MAG Silva, AT Endo… - 2021 IEEE/ACM 29th …, 2021 - ieeexplore.ieee.org
Software systems have been continuously evolved and delivered with high quality due to the
widespread adoption of automated tests. A recurring issue hurting this scenario is the …

A study on the lifecycle of flaky tests

W Lam, K Muşlu, H Sajnani… - Proceedings of the ACM …, 2020 - dl.acm.org
During regression testing, developers rely on the pass or fail outcomes of tests to check
whether changes broke existing functionality. Thus, flaky tests, which nondeterministically …

Understanding flaky tests: The developer's perspective

M Eck, F Palomba, M Castelluccio… - Proceedings of the 2019 …, 2019 - dl.acm.org
Flaky tests are software tests that exhibit a seemingly random outcome (pass or fail) despite
exercising unchanged code. In this work, we examine the perceptions of software …