Evolutionary computation in the era of large language model: Survey and roadmap

X Wu, S Wu, J Wu, L Feng, KC Tan - arXiv preprint arXiv:2401.10034, 2024 - arxiv.org
Large Language Models (LLMs), built upon Transformer-based architectures with massive
pretraining on diverse data, have not only revolutionized natural language processing but …

A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges

M Zakeri-Nasrabadi, S Parsa, M Ramezani… - Journal of Systems and …, 2023 - Elsevier
Measuring and evaluating source code similarity is a fundamental software engineering
activity that embraces a broad range of applications, including but not limited to code …

Codamosa: Escaping coverage plateaus in test generation with pre-trained large language models

C Lemieux, JP Inala, SK Lahiri… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Search-based software testing (SBST) generates high-coverage test cases for programs
under test with a combination of test case generation and mutation. SBST's performance …

Codexglue: A machine learning benchmark dataset for code understanding and generation

S Lu, D Guo, S Ren, J Huang, A Svyatkovskiy… - arXiv preprint arXiv …, 2021 - arxiv.org
Benchmark datasets have a significant impact on accelerating research in programming
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …

Codet: Code generation with generated tests

B Chen, F Zhang, A Nguyen, D Zan, Z Lin… - arXiv preprint arXiv …, 2022 - arxiv.org
The task of generating code solutions for a given programming problem can benefit from the
use of pre-trained language models such as Codex, which can produce multiple diverse …

Large language models are zero-shot fuzzers: Fuzzing deep-learning libraries via large language models

Y Deng, CS Xia, H Peng, C Yang, L Zhang - Proceedings of the 32nd …, 2023 - dl.acm.org
Deep Learning (DL) systems have received exponential growth in popularity and have
become ubiquitous in our everyday life. Such systems are built on top of popular DL …

An empirical evaluation of using large language models for automated unit test generation

M Schäfer, S Nadi, A Eghbali… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Unit tests play a key role in ensuring the correctness of software. However, manually
creating unit tests is a laborious task, motivating the need for automation. Large Language …

No more manual tests? evaluating and improving chatgpt for unit test generation

Z Yuan, Y Lou, M Liu, S Ding, K Wang, Y Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Unit testing is essential in detecting bugs in functionally-discrete program units. Manually
writing high-quality unit tests is time-consuming and laborious. Although traditional …

Studying the usage of text-to-text transfer transformer to support code-related tasks

A Mastropaolo, S Scalabrino, N Cooper… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Deep learning (DL) techniques are gaining more and more attention in the software
engineering community. They have been used to support several code-related tasks, such …

Unsupervised translation of programming languages

B Roziere, MA Lachaux… - Advances in neural …, 2020 - proceedings.neurips.cc
A transcompiler, also known as source-to-source translator, is a system that converts source
code from a high-level programming language (such as C++ or Python) to another …