[图书][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Intrinsic plagiarism analysis

B Stein, N Lipka, P Prettenhofer - Language Resources and Evaluation, 2011 - Springer
Research in automatic text plagiarism detection focuses on algorithms that compare
suspicious documents against a collection of reference documents. Recent approaches …

State-of-the-art in detecting academic plagiarism

N Meuschke, B Gipp - International Journal for Educational Integrity, 2013 - ojs.unisa.edu.au
The problem of academic plagiarism has been present for centuries. Yet, the widespread
dissemination of information technology, including the internet, made plagiarising much …

PDLK: Plagiarism detection using linguistic knowledge

A Abdi, N Idris, RM Alguliyev, RM Aliguliyev - Expert Systems with …, 2015 - Elsevier
Plagiarism is described as the reuse of someone else's previous ideas, work or even words
without sufficient attribution to the source. This paper presents a method to detect external …

Do not crawl in the DUST: Different URLs with similar text

Z Bar-Yossef, I Keidar, U Schonfeld - ACM Transactions on the Web …, 2009 - dl.acm.org
We consider the problem of DUST: Different URLs with Similar Text. Such duplicate URLs
are prevalent in Web sites, as Web server software often uses aliases and redirections, and …

Finding similar files in large document repositories

G Forman, K Eshghi, S Chiocchetti - Proceedings of the eleventh ACM …, 2005 - dl.acm.org
Hewlett-Packard has many millions of technical support documents in a variety of
collections. As part of content management, such collections are periodically merged and …

Citation pattern matching algorithms for citation-based plagiarism detection: greedy citation tiling, citation chunking and longest common citation sequence

B Gipp, N Meuschke - Proceedings of the 11th ACM symposium on …, 2011 - dl.acm.org
Plagiarism Detection Systems have been developed to locate instances of plagiarism eg
within scientific papers. Studies have shown that the existing approaches deliver reasonable …

Near similarity search and plagiarism analysis

B Stein, SM Zu Eissen - From Data and Information Analysis to Knowledge …, 2006 - Springer
Existing methods to text plagiarism analysis mainly base on “chunking”, a process of
grouping a text into meaningful units each of which gets encoded by an integer number …

Sentence-based natural language plagiarism detection

DR White, MS Joy - Journal on Educational Resources in Computing …, 2004 - dl.acm.org
With the increasing levels of access to higher education in the United Kingdom, larger class
sizes make it unrealistic for tutors to be expected to identify instances of peer-to-peer …

[PDF][PDF] Effective clone detection without language barriers

M Rieger - 2005 - scg.unibe.ch
Duplication is detected by comparing features of source fragments. The main problem for the
detection is that source code is rarely copied exactly. The detection process must be able to …