• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Relevant Documents Search System

Student: Polushin Gleb

Supervisor: Alexander Sukhov

Faculty: Faculty of Economics, Management, and Business Informatics

Educational Programme: Software Engineering (Bachelor)

Final Grade: 10

Year of Graduation: 2016

The focus of the work is on the development of relevant document search system. Relevancy implies the similarity of documents topics. The developed system will make it possible to reduce complexity of literature search operations. The object of research is a process of relevant literature search. The subject of research is keywords extraction methods and algorithms and automated search based on keywords. The paper consists of introduction, three chapters, conclusion, bibliography with 36 titles, and three appendices. The work is in 41 pages, containing 14 illustrations and 7 tables. Introduction describes the research topic, indicates major problems, gives research topicality, states aims and objectives. In the first chapter the overview of existing solutions in the area of relevant documents search is presented. Keywords extraction approaches and scientific electronic resources allowing users to perform search based on keywords are also demonstrated. The second chapter includes the subject area analysis, describes the stages of system design and main algorithms. The process of components implementation and integration is given in the third chapter. The conclusion summarizes obtained results and future research perspectives. Appendix A shows sequence diagrams describing system behavior. Appendix Б presents flowchart diagram containing keywords selection algorithm. The source code of the developed system is given in appendix В.

Full text (added May 30, 2016)

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses