Improving the Reliability of the Plagiarism Detection System

Authors

KASPRZAK Jan, BRANDEJS Michal

Year of publication 2010
Type Article in Proceedings
Conference Notebook Papers of CLEF 2010 LABs and Workshops
MU Faculty or unit

Faculty of Informatics

Citation
Web http://www.uni-weimar.de/medien/webis/events/pan-10/pan10-web/about.html
Field Informatics
Keywords plagiarism; document similarity; external plagiarism; intrinsic plagiarism
Description In this paper we describe our approach to the PAN 2010 plagiarism detection competition. We refer to the system we used at PAN'09, then present the improvements we have tried since that competition and their impact on the results on the development corpus. We describe and evaluate our experiments with intrinsic plagiarism detection. Finally, we discuss the computational cost of each step of our implementation, including performance data from two different computers.