Reliable testing: Detecting state-polluting tests to prevent test dependency

A Gyori, A Shi, F Hariri, D Marinov - Proceedings of the 2015 …, 2015 - dl.acm.org
Writing reliable test suites for large object-oriented systems is complex and time consuming.
One common cause of unreliable test suites are test dependencies that can cause tests to …

Refactoring test smells with JUnit 5: Why should developers keep up-to-date?

E Soares, M Ribeiro, R Gheyi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Test smells are symptoms in the test code that indicate possible design or implementation
problems. Previous research demonstrated their harmfulness and the developers' …

BugSwarm: Mining and continuously growing a dataset of reproducible failures and fixes

DA Tomassi, N Dmeiri, Y Wang… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Fault-detection, localization, and repair methods are vital to software quality; but it is difficult
to evaluate their generality, applicability, and current effectiveness. Large, diverse, realistic …

A qualitative study on the sources, impacts, and mitigation strategies of flaky tests

S Habchi, G Haben, M Papadakis… - … IEEE Conference on …, 2022 - ieeexplore.ieee.org
Test flakiness forms a major testing concern. Flaky tests manifest non-deterministic
outcomes that cripple continuous integration and lead developers to investigate false alerts …

How effective are mutation testing tools? An empirical analysis of Java mutation testing tools with manual analysis and real faults

M Kintis, M Papadakis, A Papadopoulos… - Empirical Software …, 2018 - Springer
Mutation analysis is a well-studied, fault-based testing technique. It requires testers to
design tests based on a set of artificial defects. The defects help in performing testing …

When life gives you oranges: detecting and diagnosing intermittent job failures at Mozilla

J Lampel, S Just, S Apel, A Zeller - Proceedings of the 29th ACM Joint …, 2021 - dl.acm.org
Continuous delivery of cloud systems requires constant running of jobs (build processes,
tests, etc.). One issue that plagues this continuous integration (CI) process are intermittent …

A large-scale study of test coverage evolution

M Hilton, J Bell, D Marinov - Proceedings of the 33rd ACM/IEEE …, 2018 - dl.acm.org
Statement coverage is commonly used as a measure of test suite quality. Coverage is often
used as a part of a code review process: if a patch decreases overall coverage, or is itself …

A replication study on the usability of code vocabulary in predicting flaky tests

G Haben, S Habchi, M Papadakis… - 2021 IEEE/ACM 18th …, 2021 - ieeexplore.ieee.org
Industrial reports indicate that flaky tests are one of the primary concerns of software testing
mainly due to the false signals they provide. To deal with this issue, researchers have …

Modeling and ranking flaky tests at Apple

E Kowalczyk, K Nair, Z Gao, L Silberstein… - Proceedings of the …, 2020 - dl.acm.org
Test flakiness---inability to reliably repeat a test's Pass/Fail outcome---continues to be a
significant problem in Industry, adversely impacting continuous integration and test …

FLEX: fixing flaky tests in machine learning projects by updating assertion bounds

S Dutta, A Shi, S Misailovic - Proceedings of the 29th ACM Joint Meeting …, 2021 - dl.acm.org
Many machine learning (ML) algorithms are inherently random–multiple executions using
the same inputs may produce slightly different results each time. Randomness impacts how …