A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges

M Zakeri-Nasrabadi, S Parsa, M Ramezani… - Journal of Systems and …, 2023 - Elsevier
Measuring and evaluating source code similarity is a fundamental software engineering
activity that embraces a broad range of applications, including but not limited to code …

A systematic literature review on bad smells–5 w's: which, when, what, who, where

EV de Paulo Sobrinho, A De Lucia… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Bad smells are sub-optimal code structures that may represent problems needing attention.
We conduct an extensive literature review on bad smells relying on a large body of …

A novel neural source code representation based on abstract syntax tree

J Zhang, X Wang, H Zhang, H Sun… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Exploiting machine learning techniques for analyzing programs has attracted much
attention. One key problem is how to represent code fragments well for follow-up analysis …

Sourcerercc: Scaling code clone detection to big-code

H Sajnani, V Saini, J Svajlenko, CK Roy… - Proceedings of the 38th …, 2016 - dl.acm.org
Despite a decade of active research, there has been a marked lack in clone detection
techniques that scale to large repositories for detecting near-miss clones. In this paper, we …

Bigcloneeval: A clone detection tool evaluation framework with bigclonebench

J Svajlenko, CK Roy - 2016 IEEE international conference on …, 2016 - ieeexplore.ieee.org
Many clone detection tools have been proposed in the literature. However, our knowledge of
their performance in real software systems is limited, particularly their recall. We previously …

Benchmarks for software clone detection: A ten-year retrospective

CK Roy, JR Cordy - 2018 IEEE 25th International Conference …, 2018 - ieeexplore.ieee.org
There have been a great many methods and tools proposed for software clone detection.
While some work has been done on assessing and comparing performance of these tools …

Comparison and evaluation of clone detection techniques with different code representations

Y Wang, Y Ye, Y Wu, W Zhang… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
As one of bad smells in code, code clones may increase the cost of software maintenance
and the risk of vulnerability propagation. In the past two decades, numerous clone detection …

The mutation and injection framework: Evaluating clone detection tools with mutation analysis

J Svajlenko, CK Roy - IEEE Transactions on Software …, 2019 - ieeexplore.ieee.org
An abundant number of clone detection tools have been proposed in the literature due to the
many applications and benefits of clone detection. However, there has been difficulty in the …

Bigclonebench

J Svajlenko, CK Roy - Code Clone Analysis: Research, Tools, and …, 2021 - Springer
Many clone detection tools and techniques have been created to tackle various use-cases,
including syntactical clone detection, semantic clone detection, inter-project clone detection …

A machine learning based framework for code clone validation

G Mostaeen, B Roy, CK Roy, K Schneider… - Journal of Systems and …, 2020 - Elsevier
A code clone is a pair of code fragments, within or between software systems that are
similar. Since code clones often negatively impact the maintainability of a software system …