• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Our courses: Preprocessing and analyzing linguistic data in Python

The course introduces the main instruments of linguistic data preprocessing for the consequent statistical analysis. This includes distilling data, working with word arrays, frequency dictionary compilation, using formats compatible with R-Studio and using statistical libraries Pandas and Numpy. The course also covers important source of linguistic data and mining methods, formats (XML, json, csv) and their parsing.