Unleashing the power of compiler intermediate representation to enhance neural program embeddings

Z Zheng, K Ning, Q Zhong, J Chen, W Chen… - Empirical Software …, 2025 - Springer

Abstract Large Language Models (LLMs) have drawn widespread attention and research
due to their astounding performance in text generation and reasoning tasks. Derivative …

被引用次数：58 相关文章所有 2 个版本

[PDF] arxiv.org

A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges

M Zakeri-Nasrabadi, S Parsa, M Ramezani… - Journal of Systems and …, 2023 - Elsevier

Measuring and evaluating source code similarity is a fundamental software engineering
activity that embraces a broad range of applications, including but not limited to code …

被引用次数：40 相关文章所有 4 个版本

Protecting intellectual property of large language model-based code generation apis via watermarks

Z Li, C Wang, S Wang, C Gao - Proceedings of the 2023 ACM SIGSAC …, 2023 - dl.acm.org

The rise of large language model-based code generation (LLCG) has enabled various
commercial services and APIs. Training LLCG models is often expensive and time …

被引用次数：41 相关文章

[PDF] github.io

An empirical study on fine-tuning large language models of code for automated program repair

K Huang, X Meng, J Zhang, Y Liu… - 2023 38th IEEE/ACM …, 2023 - ieeexplore.ieee.org

The advent of large language models (LLMs) has opened up new opportunities for
automated program repair (APR). In particular, some recent studies have explored how to …

被引用次数：67 相关文章所有 3 个版本

[PDF] arxiv.org

Cctest: Testing and repairing code completion systems

Z Li, C Wang, Z Liu, H Wang, D Chen… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org

Code completion, a highly valuable topic in the software development domain, has been
increasingly promoted for use by recent advances in large language models (LLMs). To …

被引用次数：59 相关文章所有 5 个版本

[PDF] arxiv.org

Split and merge: Aligning position biases in large language model based evaluators

Z Li, C Wang, P Ma, D Wu, S Wang, C Gao… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) have shown promise as automated evaluators for assessing
the quality of answers generated by AI systems. However, these LLM-based evaluators …

被引用次数：30 相关文章所有 3 个版本

[PDF] github.io

Vectorizing program ingredients for better jvm testing

T Gao, J Chen, Y Zhao, Y Zhang, L Zhang - Proceedings of the 32nd …, 2023 - dl.acm.org

JVM testing is one of the most widely-used methodologies for guaranteeing the quality of
JVMs. Among various JVM testing techniques, synthesis-based JVM testing, which …

被引用次数：18 相关文章所有 4 个版本

[PDF] arxiv.org

Ircoder: Intermediate representations make language models robust multilingual code generators

I Paul, G Glavaš, I Gurevych - arXiv preprint arXiv:2403.03894, 2024 - arxiv.org

Code understanding and generation have fast become some of the most popular
applications of language models (LMs). Nonetheless, research on multilingual aspects of …

被引用次数：9 相关文章所有 2 个版本

[PDF] acm.org

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org

Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

被引用次数：12 相关文章所有 5 个版本

[PDF] arxiv.org

Refining decompiled c code with large language models

WK Wong, H Wang, Z Li, Z Liu, S Wang, Q Tang… - arXiv preprint arXiv …, 2023 - arxiv.org

AC decompiler converts an executable into source code. The recovered C source code,
once re-compiled, is expected to produce an executable with the same functionality as the …

被引用次数：15 相关文章所有 2 个版本

高级搜索

QQ 群