
Automated System for Detecting Python Code Plagiarism

Student: Bubnova Mariia

Supervisor: Stanislav A. Bober

Faculty: HSE Tikhonov Moscow Institute of Electronics and Mathematics (MIEM HSE)

Educational Programme: Computer Systems and Networks (Master)

Final Grade: 9

Year of Graduation: 2020

This thesis presents a new approach to detecting plagiarism in program code written in Python. The object of study is a similarity detection system that combines several approaches to text and program analysis. The system was tested on programs written by third-year students of the “Applied Mathematics” and “Computer Security” educational programmes as assignments in the courses “Computer Practicum” and “Python Programming Language”. The report gives an overview of existing anti-plagiarism systems for program code and considers methods for finding similarities in texts and program code. The most effective of these methods were chosen for implementation in the developed system, which eliminates most of the disadvantages found during a critical analysis of comparable systems. The plagiarism checking system is written in Python and can be used by a teacher to find clusters of students whose code is similar with a certain probability. Two import methods are implemented: files can be loaded from either cloud or local storage.
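The summary does not say which similarity methods the thesis selected, so the following is only a minimal illustrative sketch of the general idea of grouping similar submissions, not the system described above. It assumes one simple structural technique: each program is reduced to its sequence of AST node types (so renaming variables has no effect), pairs are scored with the standard library's `difflib.SequenceMatcher`, and submissions whose score reaches a threshold are merged into clusters. The function names `node_types`, `similarity`, and `find_clusters`, and the threshold value of 0.9, are hypothetical choices made for this example.

```python
import ast
import difflib
from itertools import combinations


def node_types(source):
    """Reduce a program to its AST node type names (identifiers and literals are ignored)."""
    tree = ast.parse(source)
    return [type(node).__name__ for node in ast.walk(tree)]


def similarity(source_a, source_b):
    """Score two programs between 0.0 and 1.0 by comparing their node type sequences."""
    matcher = difflib.SequenceMatcher(
        None, node_types(source_a), node_types(source_b), autojunk=False
    )
    return matcher.ratio()


def find_clusters(submissions, threshold=0.9):
    """Group submission names whose pairwise similarity reaches the threshold.

    `submissions` maps a student name to source code. The threshold is an
    assumed value for illustration, not one taken from the thesis.
    """
    # Union-find structure: every submission starts in its own group.
    parent = {name: name for name in submissions}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]  # path compression
            name = parent[name]
        return name

    for a, b in combinations(submissions, 2):
        if similarity(submissions[a], submissions[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in submissions:
        groups.setdefault(find(name), set()).add(name)
    # Only groups with at least two members indicate possible copying.
    return [members for members in groups.values() if len(members) > 1]


if __name__ == "__main__":
    example = {
        "alice": "def f(x):\n    return x + 1\n",
        "bob": "def g(y):\n    return y + 1\n",  # same structure, renamed identifiers
        "carol": "for i in range(10):\n    print(i)\n",
    }
    print(find_clusters(example))
```

In this sketch, two submissions that differ only in identifier names produce identical node type sequences and end up in the same cluster, while a structurally different program stays on its own. A practical checker would likely combine several measures and would need to handle files that fail to parse.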

Full text (added May 21, 2020)

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.
