Development and Implementation of Anaphora Resolution Models in Chinese Based on User-generated Content in Medical Forums

Student: Alena Tsvetkova

Supervisor: Irina Efimenko

Faculty: Faculty of World Economy and International Affairs

Educational Programme: Asian Studies (Bachelor)

Final Grade: 9

Year of Graduation: 2020

The purpose of this study is to create a computer program for resolving pronominal anaphora in Chinese text. The types of anaphora in Chinese were examined, and a dictionary of pronouns was compiled from a linguistic perspective. Using methods of computational linguistics, three models for anaphora resolution in Chinese texts were developed in the course of the study. The SpanBERT + BERT-Chinese model achieves an accuracy of 68.5% on the Chinese portion of the OntoNotes 5.0 corpus. The practical significance of the study is that the resulting models can be used for the automatic analysis of medical texts in Chinese. The model with the highest score was tested on messages from the Chinese medical forum haodf.com.

Full text (added May 14, 2020)
