[PDF] from nsf.gov

Is the cure worse than the disease? overfitting in automated program repair

Authors

Edward K Smith, Earl T Barr, Claire Le Goues, Yuriy Brun

Publication date

2015/8/30

Book

Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering

Pages

532-543

Description

Automated program repair has shown promise for reducing the significant manual effort debugging requires. This paper addresses a deficit of earlier evaluations of automated repair techniques caused by repairing programs and evaluating generated patches' correctness using the same set of tests. Since tests are an imperfect metric of program correctness, evaluations of this type do not discriminate between correct patches and patches that overfit the available tests and break untested but desired functionality. This paper evaluates two well-studied repair tools, GenProg and TrpAutoRepair, on a publicly available benchmark of bugs, each with a human-written patch. By evaluating patches using tests independent from those used during repair, we find that the tools are unlikely to improve the proportion of independent tests passed, and that the quality of the patches is proportional to the coverage of the test suite …

Total citations

Cited by 364

Citations per year: 2015: 6; 2016: 21; 2017: 29; 2018: 49; 2019: 55; 2020: 42; 2021: 44; 2022: 44; 2023: 41; 2024: 26

Scholar articles

Is the cure worse than the disease? overfitting in automated program repair

EK Smith, ET Barr, C Le Goues, Y Brun - Proceedings of the 2015 10th Joint Meeting on …, 2015

Cited by 363 · Related articles · All 17 versions

Is the cure worse than the disease? A large-scale analysis of overfitting in automated program repair*

EK Smith, ET Barr, C Le Goues, Y Brun - 2015

Cited by 1 · Related articles · All 3 versions