Towards an understanding of large language models in software engineering tasks

Z Zheng, K Ning, Q Zhong, J Chen, W Chen… - Empirical Software …, 2025 - Springer
Abstract Large Language Models (LLMs) have drawn widespread attention and research
due to their astounding performance in text generation and reasoning tasks. Derivative …

A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges

M Zakeri-Nasrabadi, S Parsa, M Ramezani… - Journal of Systems and …, 2023 - Elsevier
Measuring and evaluating source code similarity is a fundamental software engineering
activity that embraces a broad range of applications, including but not limited to code …

Protecting intellectual property of large language model-based code generation apis via watermarks

Z Li, C Wang, S Wang, C Gao - Proceedings of the 2023 ACM SIGSAC …, 2023 - dl.acm.org
The rise of large language model-based code generation (LLCG) has enabled various
commercial services and APIs. Training LLCG models is often expensive and time …

An empirical study on fine-tuning large language models of code for automated program repair

K Huang, X Meng, J Zhang, Y Liu… - 2023 38th IEEE/ACM …, 2023 - ieeexplore.ieee.org
The advent of large language models (LLMs) has opened up new opportunities for
automated program repair (APR). In particular, some recent studies have explored how to …

Cctest: Testing and repairing code completion systems

Z Li, C Wang, Z Liu, H Wang, D Chen… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Code completion, a highly valuable topic in the software development domain, has been
increasingly promoted for use by recent advances in large language models (LLMs). To …

Split and merge: Aligning position biases in large language model based evaluators

Z Li, C Wang, P Ma, D Wu, S Wang, C Gao… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have shown promise as automated evaluators for assessing
the quality of answers generated by AI systems. However, these LLM-based evaluators …

Vectorizing program ingredients for better jvm testing

T Gao, J Chen, Y Zhao, Y Zhang, L Zhang - Proceedings of the 32nd …, 2023 - dl.acm.org
JVM testing is one of the most widely-used methodologies for guaranteeing the quality of
JVMs. Among various JVM testing techniques, synthesis-based JVM testing, which …

Ircoder: Intermediate representations make language models robust multilingual code generators

I Paul, G Glavaš, I Gurevych - arXiv preprint arXiv:2403.03894, 2024 - arxiv.org
Code understanding and generation have fast become some of the most popular
applications of language models (LMs). Nonetheless, research on multilingual aspects of …

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org
Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

Refining decompiled c code with large language models

WK Wong, H Wang, Z Li, Z Liu, S Wang, Q Tang… - arXiv preprint arXiv …, 2023 - arxiv.org
AC decompiler converts an executable into source code. The recovered C source code,
once re-compiled, is expected to produce an executable with the same functionality as the …