• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Neural Text Classification Using Discourse Structure of the Text

Student: Chernyavskiy Aleksandr

Supervisor: Dmitry Ilvovsky

Faculty: Faculty of Computer Science

Educational Programme: Applied Mathematics and Information Science (Bachelor)

Final Grade: 10

Year of Graduation: 2019

This paper considers the text classification problem. We verify quality for two tasks: fact-checking task and categorization of quotes from an Internet forum corresponding to argumentation. Attention to the automatic verification of facts has significantly enhanced after the "Pizzagate" scandal in the US in 2016. A strong model for solving these and analogous problems still does not exist. The aforementioned tasks require the style-based approach to achieve the best results because of the restriction on using additional data. The proposed model effectively encodes parsed discourse trees due to the recursive neural net. The similar method had the finest performance for the sentiment analysis and semantic relatedness of two sentences task before. In the paper, the comparison with popular methods is given. Experiments illustrate that the suggested model is effective and reaches top results for the assigned tasks. In addition, it confirms that discourse analysis helps to obtain a much higher quality for the classification of longer texts in the case of Internet argumentation analysis.

Full text (added May 19, 2019)

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses