• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Corpora Storage Subsystem Development for Linguistic Research System

Student: Zaitsev Konstantin

Supervisor: Viacheslav Lanin

Faculty: Faculty of Economics, Management, and Business Informatics

Educational Programme: Software Engineering (Bachelor)

Year of Graduation: 2021

This paper is devoted to the process of developing a web service for working with corpora storage. The paper allows developers of linguistic systems to use a web service to store and perform operations on corpora by connecting to a specific corpus storage. The paper contains an introduction, three chapters, a conclusion, a bibliography and appendices. The first chapter contains an analysis of the existing system of the linguistic system and transformation subsystem, identification of their problems, analysis and comparison of natural language processing tools and non-relational data management systems, and the formation of system requirements. The second chapter is devoted to the design of the system using diagrams in UML notation. A description of the interaction of web services, REST methods of services, data models for storage, system architecture and interface is given. The third chapter describes the process of developing services using a stack of technologies C #, ASP.NET, MongoDB, MongoDB Atlas. The result of the work is a client-service application with REST API architecture with the ability to manage data in the corpus storage. The main work consists of 54 A4 pages, includes 28 illustrations, 6 tables and 6 appendices.

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses