In this paper, we consider contamination by code generation test sets, in particular in their use in modern large language models. We discuss three possible sources of such …
Commit messages provide descriptions of the modifications made in a commit using natural language, making them crucial for software maintenance and evolution. Recent …
M Stallone, V Saxena, L Karlinsky, B McGinn… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper introduces long-context Granite code models that support effective context windows of up to 128K tokens. Our solution for scaling context length of Granite 3B/8B code …
Z Ma, S An, Z Lin, Y Zou, B Xie - arXiv preprint arXiv:2412.19031, 2024 - arxiv.org
Language models have been applied to various software development tasks, but the performance varies according to the scale of the models. Large Language Models (LLMs) …
Z Yang - arXiv preprint arXiv:2409.06338, 2024 - arxiv.org
We argue that there are two major distinct capabilities in long context understanding: retrieval and holistic understanding. Understanding and further improving LLMs' long …
Y Li, H Jiang, Q Wu, X Luo, S Ahn, C Zhang, AH Abdi… - neurips2024-enlsp.github.io
Abstract Long-context Large Language Models (LLMs) have unlocked numerous possibilities for downstream applications, many of which involve multiple requests sharing …