查看文章

A study on identifying code author from real development

作者

Siyi Gong, Hao Zhong

发表日期

2022/11/7

图书

Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

页码范围

1627-1631

简介

Identifying code authors is important in many research topics, and various approaches have been proposed. Although these approaches achieve promising results on their datasets, their true effectiveness is still in question. To the best of our knowledge, only one large-scale study was conducted to explore the impacts of related factors (e.g., the temporal effect and the distribution of files per author). This study selected Google Code Jam programs as their subjects, but such programs are quite different from the source files that programmers write in daily development. To understand their effectiveness and challenges, we replicate their study and use their approach to analyze source files that are retrieved from real projects. The prior study claims that the temporal effect and the distribution of files per author have only minor impacts on their trained models. In the contrast, we find that in 85.48% pairs of training and …

引用总数

被引用次数：3

2022202320241 2

学术搜索中的文章

A study on identifying code author from real development

S Gong, H Zhong - Proceedings of the 30th ACM Joint European Software …, 2022

被引用次数：3 相关文章所有 5 个版本