Qwen2.5-Coder Technical Report

B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this report, we introduce the Qwen2.5-Coder series, a significant upgrade from its
predecessor, CodeQwen1.5. This series includes six models: Qwen2.5-Coder-(0.5B/1.5 …

MdEval: Massively Multilingual Code Debugging

S Liu, L Chai, J Yang, J Shi, H Zhu, L Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Code large language models (LLMs) have made significant progress in code debugging by
directly generating the correct code based on the buggy code snippet. Programming …

Evaluating and Aligning CodeLLMs on Human Preference

J Yang, J Yang, K Jin, Y Miao, L Zhang, L Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Code large language models (codeLLMs) have made significant strides in code generation.
Most previous code-related benchmarks, which consist of various programming exercises …

FullStack Bench: Evaluating LLMs as Full Stack Coders

S Liu, H Zhu, J Liu, S Xin, A Li, R Long, L Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
As the capabilities of code large language models (LLMs) continue to expand, their
applications across diverse code intelligence domains are rapidly increasing. However …