Source-code similarity detection and detection tools used in academia: a systematic review

M Novak, M Joy, D Kermek - ACM Transactions on Computing Education …, 2019 - dl.acm.org
Teachers deal with plagiarism on a regular basis, so they try to prevent and detect
plagiarism, a task that is complicated by the large size of some classes. Students who cheat …

“Low-resource” text classification: A parameter-free classification method with compressors

Z Jiang, M Yang, M Tsirlin, R Tang… - Findings of the …, 2023 - aclanthology.org
Deep neural networks (DNNs) are often used for text classification due to their high
accuracy. However, DNNs can be computationally intensive, requiring millions of …

[图书][B] An introduction to Kolmogorov complexity and its applications

M Li, P Vitányi - 2008 - Springer
Ming Li Paul Vitányi Fourth Edition Page 1 An Introduction to Kolmogorov Complexity and Its
Applications Ming Li Paul Vitányi Fourth Edition Texts in Computer Science Page 2 Texts in …

The similarity metric

M Li, X Chen, X Li, B Ma… - IEEE transactions on …, 2004 - ieeexplore.ieee.org
A new class of distances appropriate for measuring similarity relations between sequences,
say one type of similarity per distance, is studied. We propose a new" normalized …

Clustering by compression

R Cilibrasi, PMB Vitányi - IEEE Transactions on Information …, 2005 - ieeexplore.ieee.org
We present a new method for clustering based on compression. The method does not use
subject-specific features or background knowledge, and works as follows: First, we …

Juxtapp: A scalable system for detecting code reuse among android applications

S Hanna, L Huang, E Wu, S Li, C Chen… - Detection of Intrusions and …, 2013 - Springer
Mobile application markets such as the Android Marketplace provide a centralized
showcase of applications that end users can purchase or download for free onto their mobile …

A state of art on source code plagiarism detection

M Agrawal, DK Sharma - 2016 2nd International Conference …, 2016 - ieeexplore.ieee.org
Plagiarism is becoming a serious problem for intellectual community. The detection of
plagiarism at various levels is a major issue. The complexity of the problem increases when …

[PDF][PDF] Automated Assessment of Programming Assignments.

V Pieterse - CSERC, 2013 - academia.edu
In this paper I explain the design of our own assessment software and discuss our
experience of using it in relation to the above-mentioned factors and concerns. My reflection …

A source code similarity system for plagiarism detection

Z Đurić, D Gašević - The Computer Journal, 2013 - academic.oup.com
Source code plagiarism is an easy to do task, but very difficult to detect without proper tool
support. Various source code similarity detection systems have been developed to help …

Input–output maps are strongly biased towards simple outputs

K Dingle, CQ Camargo, AA Louis - Nature communications, 2018 - nature.com
Many systems in nature can be described using discrete input–output maps. Without
knowing details about a map, there may seem to be no a priori reason to expect that a …