Large language models for code (ie, code LLMs) have shown strong code understanding and generation capabilities. To evaluate the capabilities of code LLMs in various aspects …
L Ma, S Liu, L Bu, S Li, Y Wang, Y Liu - arXiv preprint arXiv:2409.12866, 2024 - arxiv.org
Large Language models have achieved impressive performance in automated software engineering. Extensive efforts have been made to evaluate the abilities of code LLMs in …
The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and …
Large Language Models (LLMs) applied to code-related applications have emerged as a prominent field, attracting significant interest from both academia and industry. However, as …
Reasoning about asynchronous plans is challenging since it requires sequential and parallel planning to optimize time costs. Can large language models (LLMs) succeed at this …
Y Chen, R Jabbarvand - Proceedings of the 33rd ACM SIGSOFT …, 2024 - dl.acm.org
Test flakiness, a non-deterministic behavior of builds irrelevant to code changes, is a major and continuing impediment to deliver-ing reliable software. The very few techniques for the …
X Lian, S Wang, J Ma, F Liu, X Tan, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Code generation, the task of producing source code from prompts, has seen significant advancements with the advent of pre-trained large language models (PLMs). Despite these …
S Mishra, VS Jadhav, S Karande… - Fourth Workshop on …, 2024 - ceur-ws.org
Heat exchangers (HEs) are essential in process industries for efficient thermal energy transfer. Their design and optimization are crucial for improving energy efficiency, reducing …
Y Yang, Y Nie, Z Wang, Y Tang, W Guo, B Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing works have established multiple benchmarks to highlight the security risks associated with Code GenAI. These risks are primarily reflected in two areas: a model …