A high-performance plagiarism detection system

N Cooke; L Gillam; P Wrobel; H Cooke; F Al-Obaidli

Conference presentation

Open access

PLEF 2011 Notebook papers

PAN at CLEF 2011 (Amsterdam, 19/09/2011 - 22/09/2011)

2011

Abstract

In this paper we report on our high-performance plagiarism detection system, which processes the PAN plagiarism corpus for the external plagiarism detection task in relatively short timescales compared with the previously reported state of the art, while still achieving a reasonable level of detection performance (PAN 11: 4th place, PlagDet=0.2467329, Recall=0.1500480, Precision=0.7106536, Granularity=1.0058894). At the core of our system is a simple method that avoids hash-type approaches; we are unable to disclose many details of it because a patent application is in progress. We optimised performance using the PAN 10 collection and used the best-performing parameters for the final submission. We anticipated broadly similar performance at PAN 11, modulo changes to the plagiarism cases, and our 4th place this year put us between the participants who had placed 5th and 6th at PAN 10.
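The four figures quoted in the abstract are related through the PlagDet measure used in the PAN evaluations, which divides the harmonic mean of precision and recall by a logarithmic granularity penalty. The sketch below shows that relationship only. It says nothing about the detection method itself, which the abstract does not disclose. The function names `f1_score` and `plagdet` are illustrative and do not come from the paper, and the formula is the standard PAN definition rather than something stated in this abstract.

```python
import math


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def plagdet(precision: float, recall: float, granularity: float) -> float:
    """PlagDet as commonly defined for PAN: F1 / log2(1 + granularity).

    Granularity is at least 1; a value of exactly 1 leaves F1 unchanged.
    """
    if granularity < 1:
        raise ValueError("granularity must be >= 1")
    return f1_score(precision, recall) / math.log2(1 + granularity)


# Figures quoted in the abstract for the PAN 11 submission.
score = plagdet(precision=0.7106536, recall=0.1500480, granularity=1.0058894)
print(f"PlagDet = {score:.4f}")
```

Under these assumptions the computed value agrees with the reported PlagDet to about four decimal places. Because recall (about 0.15) is much lower than precision (about 0.71), the harmonic mean is pulled towards recall, while the granularity of about 1.006 costs very little.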

URL: http://clef2011.org/index.php?page=pages/proceedings.php

Published (Version of record)

Details

Title: A high-performance plagiarism detection system
Creators: N Cooke; L Gillam; P Wrobel; H Cooke; F Al-Obaidli
Publication Details: PLEF 2011 Notebook papers
Conference: PAN at CLEF 2011 (Amsterdam, 19/09/2011 - 22/09/2011)
Date published: 2011
Date submitted: 11/01/2012
Identifiers: 99514158502346
Academic Unit: Department of Computer Science
Resource Type: Conference presentation
