Svetlana Toldova
- Senior Research Fellow, Laboratory Head: Faculty of Humanities / Laboratory of Formal Models in Linguistics
- Associate Professor: Faculty of Humanities / School of Linguistics
- Svetlana Toldova has been at HSE University since 2013.
Education and Degrees
Lomonosov Moscow State University
Thesis Title: Discourse structure and focusing as the important factors for the nomination of a particular referent
Lomonosov Moscow State University
Lomonosov Moscow State University
According to the International Standard Classification of Education (ISCED) 2011, Candidate of Sciences belongs to ISCED level 8 - "doctoral or equivalent", together with PhD, DPhil, D.Lit, D.Sc, LL.D, Doctorate or similar. Candidate of Sciences allows its holders to reach the level of the Associate Professor.
Continuing education / Professional retraining / Internships / Study abroad experience
The 9th Russian Summer School in Information Retrieval (RuSSIR 2015). August 24-28, 2015 in St. Petersburg, Russia. Сo-organized by the National Research University Higher School of Economics and the Russian Information Retrieval Evaluation Seminar (ROMIP).
Courses (2024/2025)
- Linguistic Approaches to Discourse (Bachelor’s programme; Faculty of Humanities; 3 year, 2 module)Rus
- Master Classes of Guest Professors (Master’s programme; Faculty of Humanities; 2 year, 3 module)Rus
- Master Classes of Guest Professors (Master’s programme; Faculty of Humanities; 1 year, 2 module)Rus
- Models and methods in language description (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Eng
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Past Courses
Courses (2023/2024)
- Corpus Linguistics (Mago-Lego; 3, 4 module)Rus
- Corpus Linguistics II (Master’s programme; Faculty of Humanities; 2 year, 1, 2 module)Rus
- Corpus Linguistics II (Mago-Lego; 1, 2 module)Rus
- Linguistic Approaches to Discourse (Optional course (faculty); 4 module)Rus
- Master Classes of Guest Professors (Master’s programme; Faculty of Humanities; 2 year, 3 module)Rus
- Models and methods in language description (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Eng
- Research and Design Seminar (Master’s programme; Faculty of Humanities field of study Fundamental and Applied Linguistics, field of study Fundamental and Applied Linguistics; 2 year, 1-3 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Research and Design seminar "Linguistic projects" (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Eng
- Research seminar "Uralic languages" (Bachelor’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
Courses (2022/2023)
- Corpus Linguistics (Mago-Lego; 3, 4 module)Rus
- Corpus Linguistics (Master’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- Corpus Linguistics (Master’s programme; HSE Tikhonov Moscow Institute of Electronics and Mathematics (MIEM HSE); 1 year, 3, 4 module)Rus
- Linguistic Approaches to Discourse (Minor; Faculty of Humanities; 2 module)Rus
- Master Classes of Guest Professors (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Models and methods in language description (Master’s programme; Faculty of Humanities; 1 year, 2-4 module)Eng
- Research and Design Seminar (Master’s programme; Faculty of Humanities field of study Fundamental and Applied Linguistics, field of study Fundamental and Applied Linguistics; 1 year, 1-4 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Research seminar "Uralic languages" (Bachelor’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
- State of the Art NLP Technologies (Master’s programme; Faculty of Humanities; 2 year, 1 module)Rus
Courses (2021/2022)
- Master Classes of Guest Experts (Master’s programme; Faculty of Humanities; 2 year, 2, 3 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Research seminar "Uralic languages" (Bachelor’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
Courses (2020/2021)
- Linguistic Approaches to Discourse (Minor; Faculty of Humanities; 2 module)Rus
- Master Classes of Guest Experts (Master’s programme; Faculty of Humanities; 1 year, 2, 3 module)Rus
- Master Classes of Guest Professors (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Research Seminar "Information Structure, Theory and Typology" (Bachelor’s programme; Faculty of Humanities; 4 year, 1, 2 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 1 year, 1-4 module)Rus
- Research and Design Seminar (Master’s programme; Faculty of Humanities; 2 year, 1-3 module)Rus
- Research seminar "Uralic languages" (Bachelor’s programme; Faculty of Humanities; 1 year, 3, 4 module)Rus
Grants
Recent research projects
Linguistics
2022-2-23 "Variation in the discourse and lexicon: an investigation of closely related languages with digital methods" (RSF)
2021-2023 "The morphology of agreement" (RFBR)
2019-2021. "Syntax and Semantics of Uralic and Altaic languages: converging functional typological and formal perspectives" (RFBR, participant)
Natural Language Processing
2017-2019 "Models and methods of discourse and narrative parsing for text mining, text understanding and dialogue systems", RFBR research project №17-29-07033, under supervision of I. Smirnov
2020-2022. Automated methods for sentiment analysis of coherent texts with multiple attitudes based on Russian sentiment frames. RFBR research project №20-07-01059, under supervision of N. Lukashevich
Previous projects
2016-2018 "Four Grammars of Languages of Multilingual Russia", RSF 16-18-02081, under supervision of S.Tatevosov
2016-2018 "Syntax-Semantics Interface in Uralic and Altaic Languages" RFBI 16-06-00536, supervisor
Conferences
- 2023
Малые языки в большой лингвистике 2023 (Москва). Presentation: Глаголы со значением изменения/мены в кадарском и в литературном даргинском
Linguistic forum 2023. Language functioning in remote areas: the arctic and beyond (Москва). Presentation: Совместно с Сумбатовой Н.Р. Труднодоступность и морфологическая сложность (на примере языков даргинской группы)
13-я конференция «Типология морфосинтаксических параметров» (Москва). Presentation: Person-number asymmetry: Agreement of passive miratives in Kazym Khanty
- 2022
12-я конференция «Типология морфосинтаксических параметров» (Москва). Presentation: Относительный порядок дативного аргумента в дитранзитивных конструкциях в русском языке: корпусные и экспериментальные исследования
Малые языки в большой лингвистике (Москва). Presentation: Рефлексивная посессивность в уйльтинском и эвенском
Syntax of Uralic Languages 4 (Санкт-Петербург). Presentation: The mirative construction in Kazym Khanty
The Second International Conference ANATOLIA-THE CAUCASUS-IRAN (Ереван). Presentation: The system of Reflexive Pronouns in Dargwa languages.
Международная научная конференция «Современная лингвистика: от теории к практике» («Contemporary linguistics: theory and practice») (Казань). Presentation: Standard Dargwa Corpus
28-я МЕЖДУНАРОДНАЯ КОНФЕРЕНЦИЯ по компьютерной лингвистике и интеллектуальным технологиям "Диалог-2022". Presentation: Non-canonical constructions with reflexive possessives in Russian: u-possessor constructions
- 2021
27-ая Международная конференция по компьютерной лингвистике и интеллектуальным технологиям «Диалог-2021» (Москва). Presentation: The order of objects in Russian: a corpus study
11-я конференция «Типология морфосинтаксических параметров» (Москва). Presentation: Possessive pronouns in Russian: A corpus and experimental study
Workshop on the Structure of Uralic Languages (Pécs). Presentation: The morphosyntax of non-finite clauses in Kazym Khanty and some of its puzzles
- 2020
26-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям (Москва). Presentation: Discourse Features of Blogs in Subcorpus of Russian RST-treebank
Интернет и современное общество (IMS-20) (он-лайн). Presentation: Формирование набора отношений для корпуса с дискурсивной разметкой текста
53rd Annual Meeting of the Societas Linguistica Europaea. Presentation: Kazym Khanty non-finite forms: Multifunctionality and variability in the amount of structure
- 2019
Formal Approaches to Russian Linguistics 3 (Москва). Presentation: “Fig/Hren” and their semantic interpretation in Russian Or: Wondering what the f*** fig (tebe) is?
3rd SOUL - Syntax of Uralic Languages (Tartu). Presentation: Syntax of DO-encoding patterns in Moksha
Диалог (25-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям) (Москва). Presentation: Classification Models for Rst Discourse Parsing of Texts In Russian
Диалог (25-я международная конференция по компьютерной лингвистике и интеллектуальным технологиям) (Москва). Presentation: Contrast and Comparison Relations in RST Framework: the Case of Russian
Descriptive grammars and typology (Хельсинки). Presentation: Competing motion-cumpurpose strategies in Northern Selkup: a corpus study
Workshop on Discourse Relation Parsing and Treebanking 2019 (Миннеаполис). Presentation: Towards the Data-driven System for Rhetorical Parsing of Russian Texts
Congreso Internacional CORE 2019 (Мехико). Presentation: Discourse analysis: a Rethoric structure theory approach
52nd Annual Meeting of the Societas Linguistica Europaea (Лейпциг). Presentation: Dedicated possessive reflexives in languages with head marking of a possessor
52nd Annual Meeting of the Societas Linguistica Europaea (Лейпциг). Presentation: Evidential and epistemic semantics of modal particles in Northern Selkup
25 Международная конференция по компьютерной лингвистике и интеллектуальным технологиям «Диалог» (Москва). Presentation: Discourse Features of Blogs in Subcorpus of Russian RST-treebank
25 Международная конференция по компьютерной лингвистике и интеллектуальным технологиям «Диалог» (Москва). Presentation: Discourse Features of Blogs in Subcorpus of Russian RST-treebank
- 2018
Computational Methods for Endangered Language Documentation and Description (Париж). Presentation: The possible re-usage of fieldwork data for automated morphological parsing (the case of Moksha)
DGfS 2018: 40th Annual Conference of the German Linguistic Society (Штутгарт). Presentation: Properties of definite declension in Moksha
The 18th International Morphology Meeting. Workshop 3. Morphological aspects of Uralic and Turkic languages (Budapest). Presentation: The split in nominal paradigms and the size of extended nominal projection in Moksha
3-й Колмогоровский семинар по компьютерной лингвистике и наукам о языке (Москва). Presentation: Extraction of multi-word ‘Cause-Effect’ connectives
51st Annual Meeting of the Societas Linguistica Europaea (Таллинн). Presentation: Differential object marking, word order and verb adjacency in Komi
Linguistic diversity, minority languages and digital research infrastructures (Hamburg). Presentation: Towards semi-automated shallow syntax using FLEx data
Concort-2018 (Нижний Новгород). Presentation: Корпусное исследование оборотов с местоимением с предикативным антецедентом
Concort-2018 (Нижний Новгород). Presentation: Корпусное исследование порядка слов в северном диалекте селькупского языка
Язык, история, культура бесермян: состояние и перспективы исследований (Глазов). Presentation: Категория притяжательности в языке бесермян и других пермских языках
Языки народов России в контакте с русским языком: явления морфосинтаксической и семантической интерференции (Москва). Presentation: Порядок слов в северных диалектах селькупского языка: к вопросу об изменении порядка слов под влиянием русского языка
4th Workshop on Languages of the Volga-Kama Sprachbund (Москва). Presentation: Grammatical and lexical case distinction in Moksha and Hill Mari
- 2017
XLVI Международная филологическая научная конференция (Санкт-Петербург). Presentation: Семантика глагола и выбор оформления прямого дополнения в мокшанском языке
Чтения памяти А. И. Кузнецовой (Москва). Presentation: Посессивный показатель в коми-зырянском языке как маркер пресуппозиции
Чтения памяти А. И. Кузнецовой (Москва). Presentation: Посессивный показатель в коми-зырянском языке как маркер пресуппозиции
Компьютерная лингвистика и интеллектуальные технологии: Диалог 2017 (Москва). Presentation: Порядок следования прилагательных разных семантических классов в русском языке в свете корпусных данных
Компьютерная лингвистика и интеллектуальные технологии: Диалог 2017 (Москва). Presentation: Coreferenсe resolution for Russian: the impact of semantic features
Conference on the Syntax Of Uralic Languages (Будапешт). Presentation: The NP/DP-structure in Moksha language
The 50th Annual Meeting of the Societas Linguistica Europaea (Цюрих). Presentation: Looking for a D-layer in Moksha
The 50th Annual Meeting of the Societas Linguistica Europaea (Цюрих). Presentation: The interaction of possessive and definite noun declensions in Moksha
Малые языки в большой лингвистике (Москва). Presentation: Possessive markers in Komi-Zyrian: topic, presupposition, or discourse markers
Малые языки в большой лингвистике (Москва). Presentation: Синтаксические, семантические и прагматические свойства посессивных рефлексивных местоимений (на материале финно-угорских и тунгусо-маньчжурских языков)
ConCort (Москва). Presentation: Выбор референциальных средств разграничения эпизодов на примере текстов корпуса Russian CliPS
INLG 2017, 6th Workshop on Recent Advances in RST and Related Formalisms (Santiago de Compostela). Presentation: Rhetorical relations markers in Russian RST Treebank
- 2016
The 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2016) (Konya). Presentation: Features for discourse-new referent detection in Russian
Рабочее совещание, посвящённое дифференцированному маркированию актантов (Москва). Presentation: Кодирование прямого дополнения в мокшанском языке: определенность vs. топикальность
6-я тематическая конференция серии «Типология морфосинтаксических параметров» (Москва). Presentation: Структурная позиция прямого дополнения и его коммуникативный статус
The 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2016) (Konya). Presentation: Features for discourse-new referent detection in Russian
Information structure and discourse in the minority languages of the Russian Federation (2016). Presentation: Topicality and Differential Object Marking in Moksha
Grammar and Corpora (Мангейм). Presentation: Multiple prenominal adjectives ordering in Russian: a corpus study
Компьютерная лингвистика и наука о языке (Москва). Presentation: Identification of singleton mentions in Russian
Компьютерная лингвистика и наука о языке (Москва). Presentation: Coreference in Russian oral movie retellings (the experience of coreference relations annotation in Russian CliPS corpora)
- 2015
Dialogue - 2015 (international conference for computational linguistics) (Москва). Presentation: Coreference Chains in Czech, English and Russian: Preliminary Findings.
Congressus Duodecimus Internationalis Fenno-Ugristarum. (Оулу). Presentation: Differential object marking in Moksha language
- 2014
CILC 2014 : 6th International Conference on Corpus Linguistics (Лас Пальмас). Presentation: Coreference Corpus in Russian
CILC 2014 : 6th International Conference on Corpus Linguistics (Лас Пальмас). Presentation: Corpora Acquisition for Machine Learning Web Query Intent Classification
Искусственный интеллект и естественный язык (AINL) (Сколково). Presentation: Форум по оценке методов автоматической обработки текстов: распознавание анафорических и кореферентных связей
Supervisor of the following Doctoral theses
- 1I. Azerkovich Evaluating the input of ontological information in coreference resolution for Russian language, 2021
- 2D. Alexeevsky Methods for automatic wordnet relation extraction from dictionary definitions, 2018
- 3Automatic Discourse and Pragmatics Analysis of Casual Conversations
- 4Reflections of syntactic structures in transformer neural networks
Employment history
1987 – 2017 Research fellow, Department of theoretical and applied linguistics, Faculty of Philology, Moscow State University
2000 – 2013 Associate Professor, State Academic University for the Humanities, Institute of Linguistics
2013 - 2013 Senior Fellow, Institute of World Cultures, Lomonosov State University
1991 – 2002 Engineer, Russian Research Institute for Artificial Intelligence;
2007 – 2010 Linguist, Medialogia (NLP consultant);
2011 – 2014 Leading Research Fellow, Center for Semantic Technologies, National Research University “Higher School of Economics”
2010-2011 - News360
Results of Competition for Research and Teaching Laboratories Announced
This year, HSE has supported the founding of 10 new Research and Teaching Laboratories (RTLs) in various fields, from cognitive psychology and computer modeling, to international justice and economics of sports.