
Academic Supervisor

Daria Ryzhova

Manager

Tatiana Vladislavovna Simonova



Our courses: Preprocessing and analyzing linguistic data in Python

The course introduces the main tools for preprocessing linguistic data for subsequent statistical analysis. Topics include cleaning data, working with word arrays, compiling frequency dictionaries, using formats compatible with RStudio, and working with the pandas and NumPy libraries. The course also covers important sources of linguistic data, methods for mining them, and common data formats (XML, JSON, CSV) and how to parse them.
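As a rough illustration of the kind of task described above, the following sketch compiles a frequency dictionary from a small text sample and serializes it as CSV, a format that R can read. It is not taken from the course materials: the function names (`tokenize`, `frequency_dictionary`, `to_csv`), the toy corpus, and the simple regex tokenizer are all assumptions made for the example, and only the Python standard library is used.

```python
import csv
import io
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its word tokens (letters, digits, underscore)."""
    return re.findall(r"\w+", text.lower())


def frequency_dictionary(texts):
    """Count how often each token occurs across an iterable of texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts


def to_csv(counts):
    """Serialize counts as CSV text, most frequent first, ties broken alphabetically."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["word", "frequency"])
    for word, freq in sorted(counts.items(), key=lambda item: (-item[1], item[0])):
        writer.writerow([word, freq])
    return buffer.getvalue()


# Hypothetical toy corpus, used only for illustration.
corpus = ["The cat sat on the mat.", "The dog sat too."]
freqs = frequency_dictionary(corpus)
csv_text = to_csv(freqs)
```

The resulting `csv_text` could be written to a file and loaded in R with `read.csv`. A real pipeline would likely need a more careful tokenizer (for example, one that handles hyphenated words or language-specific orthography).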

Master’s Programme 'Linguistic Theory and Fieldwork'
