Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

A survey of flaky tests

O Parry, GM Kapfhammer, M Hilton… - ACM Transactions on …, 2021 - dl.acm.org
Tests that fail inconsistently, without changes to the code under test, are described as flaky.
Flaky tests do not give a clear indication of the presence of software bugs and thus limit the …

State of practical applicability of regression testing research: A live systematic literature review

R Greca, B Miranda, A Bertolino - ACM Computing Surveys, 2023 - dl.acm.org
Context: Software regression testing refers to rerunning test cases after the system under
test is modified, ascertaining that the changes have not (re-)introduced failures. Not all …

Empirically evaluating readily available information for regression test optimization in continuous integration

D Elsner, F Hauer, A Pretschner, S Reimer - Proceedings of the 30th ACM …, 2021 - dl.acm.org
Regression test selection (RTS) and prioritization (RTP) techniques aim to reduce testing
efforts and developer feedback time after a change to the code base. Using various …

Exploring better black-box test case prioritization via log analysis

Z Chen, J Chen, W Wang, J Zhou, M Wang… - ACM Transactions on …, 2023 - dl.acm.org
Test case prioritization (TCP) has been widely studied in regression testing, which aims to
optimize the execution order of test cases so as to detect more faults earlier. TCP has been …

Scalable and accurate test case prioritization in continuous integration contexts

AS Yaraghi, M Bagherzadeh, N Kahani… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Continuous Integration (CI) requires efficient regression testing to ensure software quality
without significantly delaying its CI builds. This warrants the need for techniques to reduce …

ChatGPT: A study on its utility for ubiquitous software engineering tasks

G Sridhara, S Mazumdar - arXiv preprint arXiv:2305.16837, 2023 - arxiv.org
ChatGPT (Chat Generative Pre-trained Transformer) is a chatbot launched by OpenAI on
November 30, 2022. OpenAI's GPT-3 family of large language models serve as the …

Test-case prioritization for configuration testing

R Cheng, L Zhang, D Marinov, T Xu - Proceedings of the 30th ACM …, 2021 - dl.acm.org
Configuration changes are among the dominant causes of failures of large-scale software
system deployment. Given the velocity of configuration changes, typically at the scale of …

Identifying similar test cases that are specified in natural language

M Viggiato, D Paas, C Buzon… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Software testing is still a manual process in many industries, despite the recent
improvements in automated testing techniques. As a result, test cases (which consist of one …

The impact of flaky tests on historical test prioritization on Chrome

E Fallahzadeh, PC Rigby - … of the 44th International Conference on …, 2022 - dl.acm.org
Test prioritization algorithms prioritize probable failing tests to give faster feedback to
developers in case a failure occurs. Test prioritization approaches that use historical failures …