[HTML][HTML] A/B testing: a systematic literature review

F Quin, D Weyns, M Galster, CC Silva - Journal of Systems and Software, 2024 - Elsevier
A/B testing, also referred to as online controlled experimentation or continuous
experimentation, is a form of hypothesis testing where two variants of a piece of software are …

Large language models for software engineering: Survey and open problems

A Fan, B Gokkaya, M Harman, M Lyubarskiy… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper provides a survey of the emerging area of Large Language Models (LLMs) for
Software Engineering (SE). It also sets out open research challenges for the application of …

Exploring the effectiveness of large language models in generating unit tests

ML Siddiq, J Santos, RH Tanvir, N Ulfat, FA Rifat… - arXiv preprint arXiv …, 2023 - arxiv.org
A code generation model generates code by taking a prompt from a code comment, existing
code, or a combination of both. Although code generation models (eg, GitHub Copilot) are …

Predicting flaky tests categories using few-shot learning

A Akli, G Haben, S Habchi, M Papadakis… - arXiv preprint arXiv …, 2022 - arxiv.org
Flaky tests are tests that yield different outcomes when run on the same version of a
program. This non-deterministic behaviour plagues continuous integration with false signals …

Natural Language Processing-based Software Testing: A Systematic Literature Review

M Boukhlif, M Hanine, N Kharmoum, AR Noriega… - IEEE …, 2024 - ieeexplore.ieee.org
New approaches to software testing are required due to the rising complexity of today's
software applications and the rapid growth of software engineering practices. Among these …

What made this test flake? pinpointing classes responsible for test flakiness

S Habchi, G Haben, J Sohn, A Franci… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Flaky tests are defined as tests that manifest non-deterministic behaviour by passing and
failing intermittently for the same version of the code. These tests cripple continuous …

[HTML][HTML] Test flakiness' causes, detection, impact and responses: A multivocal review

A Tahir, S Rasheed, J Dietrich, N Hashemi… - Journal of Systems and …, 2023 - Elsevier
Flaky tests (tests with non-deterministic outcomes) pose a major challenge for software
testing. They are known to cause significant issues, such as reducing the effectiveness and …

Practical flaky test prediction using common code evolution and test history data

M Gruber, M Heine, N Oster… - … IEEE Conference on …, 2023 - ieeexplore.ieee.org
Non-deterministically behaving test cases cause developers to lose trust in their regression
test suites and to eventually ignore failures. Detecting flaky tests is therefore a crucial task in …

Investigating developers' perception on software testability and its effects

T Sharma, S Georgiou, M Kechagia, TA Ghaleb… - Empirical Software …, 2023 - Springer
The opinions and perspectives of software developers are highly regarded in software
engineering research. The experience and knowledge of software practitioners are …

The importance of discerning flaky from fault-triggering test failures: a case study on the Chromium CI

G Haben, S Habchi, M Papadakis, M Cordy… - arXiv preprint arXiv …, 2023 - arxiv.org
Flaky tests are tests that pass and fail on different executions of the same version of a
program under test. They waste valuable developer time by making developers investigate …