• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
Master 2023/2024

Linguistic Data: Quantitative Analysis and Visualisation

Area of studies: Fundamental and Applied Linguistics
Delivered by: School of Linguistics
When: 2 year, 2 module
Mode of studies: distance learning
Online hours: 16
Open to: students of all HSE University campuses
Instructors: Daria Popova
Master’s programme: Linguistic Theory and Language Description
Language: English
ECTS credits: 3
Contact hours: 32

Course Syllabus

Abstract

First year: The course is devoted to modern methods of data analysis, as applied to linguistic data, including methods of statistical inference and explanatory data analysis with visualizations. We begin with theoretical background in mathematical statistics and discuss limitations of statistical methods and their applicability to linguistical problems. From practical point of view, we use R system to do actual analysis with real datasets. We also discuss different visualization techniques using popular library ggplot2. Second year: Preprocessing of linguistic data in Python is designed to further the students’ knowledge of natural language processing and to polish their programming skills. The course aims to provide the students with the programming and natural language processing knowledge and competencies necessary to plan and conduct research projects of their own leading to the M.Sc. dissertation and scientific publications.