'When We Find Something That Doesn’t Work Well Enough, We Replace It in Order to Develop a More Effective Approach'

Photo by Yandex

The winners of the third annual Ilya Segalovich Award were recently announced in Moscow. Established by Yandex, the award promotes the scientific endeavours of young researchers from Russia, Belarus, and Kazakhstan in the field of Computer Science. Among this year’s winners were three HSE students, including Alexander Grishin, a Doctoral student of the Big Data and Information Retrieval School of the HSE Faculty of Computer Science. Alexander spoke to us about his work, research challenges, and why he was surprised to receive the award.

HSE University and My Research Work

I started doing research work as an undergraduate in the Faculty of Biological and Medical Physics at the Moscow Institute of Physics and Technology (MIPT). In my third and fourth years there, I used machine learning to analyse DNA nucleotide sequences. It wasn’t an easy task—I had never studied machine learning before and I didn’t have enough knowledge of biology. I was keen on the subject, but I felt I lacked hard skills.

I realized that I was more interested in the technical side of things than the biology side, and I started looking for a place where I could gain those skills. That’s how I ended up in HSE University’s ‘Data Science’ Master’s Programme at the School of Data Analysis. The first year required a lot of intensive study, so I had no time for research. Then I met Professor Dmitry Vetrov, and we’ve been doing research together ever since.

My work on Professor Vetrov’s research team started with the biology side of things. I joined Insilico Medicine, a company that synthesizes pharmaceuticals. Alongside my academic advisor Daniil Polykovsky, I worked on forecasting the properties of molecules. This would enable us to generate molecules with the same properties for medicinal purposes.

Solving a difficult task requires solving easier ones first, so we started by trying to determine whether or not a molecule had any specific properties. We never published the method we developed, as two very similar articles were released around that time and the authors of those articles used a slightly better approach. That’s when we knew that our work was sound and up to date, but that we just hadn’t been quick enough.

After that, I began looking for another area of research. I wanted to work with Pavel Shvechikov and luckily he was able to take me on. I wanted to explore a new field, and that ended up being reinforcement learning. Pavel and I worked together for a few years. Our team gave an oral presentation at a NeurIPS workshop and wrote an article for ICML.

Our team doesn’t specialize in reinforcement learning, so we sometimes lack the level of expertise we’d like. But this is common, as not many people work on reinforcement learning at a serious academic level. We decided to put together a team dedicated to the field. As for the commercial sector, companies have always been interested in this area, but it remains underdeveloped.

The Prize-Winning Research

The things we’re doing are quite simple. Rather than focus on specific tasks, we develop general algorithms. We take an academic approach and try to develop more effective algorithms that can cover a wide range of tasks. We also study existing algorithms and the way they work in order to uncover potential problems. To put it simply, we analyze each individual element and when we find something that doesn’t work well enough, we replace it in order to develop a more effective approach.

We received the Ilya Segalovich Award for our article about overestimation bias in reinforcement learning. After an action is performed, we try to figure out whether it will have good or bad consequences, and these estimates tend to come out too optimistic. Previously, we relied on very primitive tools, such as deliberate underestimation or artificial suppression, to reduce overestimation bias. But we’ve developed a more flexible and effective tool to avoid this kind of overestimation. Samsung has recognized our research too, which was a surprise to me. I thought the company would be more interested in practical results, while our work has the academic focus of enhancing the effectiveness of reinforcement learning.
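To make the idea of overestimation bias concrete, here is a minimal Python sketch. It is a toy illustration and not the method from the award-winning article: every action is assumed to have a true value of exactly zero, the estimates are assumed to carry Gaussian noise, and the function names and parameters are hypothetical. Picking the largest of several noisy estimates gives a number that is too optimistic on average, even though each individual estimate is unbiased.

```python
import random
import statistics


def max_of_noisy_estimates(num_actions, noise_std, rng):
    """Return the largest of several noisy estimates whose true value is 0."""
    return max(rng.gauss(0.0, noise_std) for _ in range(num_actions))


def average_bias(num_actions=10, noise_std=1.0, trials=10_000, seed=0):
    """Average the maximum over many trials; the true maximum is exactly 0,
    so any positive result is pure overestimation."""
    rng = random.Random(seed)
    return statistics.fmean(
        max_of_noisy_estimates(num_actions, noise_std, rng)
        for _ in range(trials)
    )


# With ten candidate actions, the maximum is biased clearly above zero.
bias_many_actions = average_bias(num_actions=10)

# With a single action there is nothing to maximise over, so the bias vanishes.
bias_single_action = average_bias(num_actions=1)

print(f"10 actions: {bias_many_actions:.2f}, 1 action: {bias_single_action:.2f}")
```

In this toy setting the bias grows with the number of actions and with the noise level, which is why a fixed, hand-tuned correction can be too blunt an instrument.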

Are there traditional tasks in reinforcement learning? Yes, and they’re used to test algorithms and compare them. One example is Atari games, in which the agent sees a picture (an image of the game screen) and performs a certain action to control the game. These games vary both visually and mechanically.

We tested our algorithms on locomotion tasks, a set of tasks in a physical simulator that models different kinds of robots, from simple robots with a couple of joints to complex humanoid ones. The agent receives signals from sensors informing it of the position, angle, and speed of each limb. Every 15 milliseconds, the agent must decide what forces to exert in order to change the position of the body. By exerting these forces step after step, it has to perform a given task, such as running as fast as it can without falling.

Reinforcement learning involves working with states (what the agent sees), actions (what the agent does depending on the state it finds itself in), and rewards (what it receives for performing the right actions or sequences of actions). The great thing about it is that the specifics of these states (where they come from, what they look like, how they change) don’t really matter. This is because learning algorithms are universal and the task is always the same: to teach the agent the best approach that produces the biggest reward. From a theoretical standpoint, it doesn’t matter what the reward represents, be it the speed of the robot’s movements, efficient power consumption, a click from a user, or money made at the stock exchange. The methods we used to teach robots how to run (or rather, to learn how to run themselves) could be used to distribute power generated by a nuclear power plant or advertise goods to consumers. Reinforcement learning has limitless applications.
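The point about universality can be sketched in a few lines of Python. The example below uses tabular Q-learning, a standard textbook algorithm chosen purely for illustration (it is not the algorithm from our article), and a hypothetical toy environment called `chain_step`. Note that the learning function never looks at what a state means: it only receives state indices and reward numbers, so any other environment with the same interface could be plugged in.

```python
import random


def chain_step(state, action):
    """Hypothetical toy environment: five positions in a row, goal at position 4.

    Action 0 moves left, action 1 moves right; reaching the goal pays 1.
    """
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == 4
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def q_learning(step, num_states, num_actions, episodes=200,
               alpha=0.5, gamma=0.9, epsilon=0.1, max_steps=1000, seed=0):
    """Tabular Q-learning that sees only state indices and reward numbers."""
    rng = random.Random(seed)
    q = [[0.0] * num_actions for _ in range(num_states)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                # Explore: try a random action.
                action = rng.randrange(num_actions)
            else:
                # Exploit: pick the best-looking action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(num_actions) if q[state][a] == best]
                )
            next_state, reward, done = step(state, action)
            # Move the estimate towards reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = q_learning(chain_step, num_states=5, num_actions=2)

# Best action in each non-goal position (0 = left, 1 = right).
greedy_policy = [row.index(max(row)) for row in q_table[:4]]
print(greedy_policy)
```

Swapping `chain_step` for a function that simulates a robot, a power grid, or an advertising system would leave `q_learning` untouched, as long as the states and actions can be enumerated. Real tasks of that kind need function approximation rather than a table, but the interface stays the same.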

The Yandex Prize and My Hobby

I’m delighted we received the award for a number of reasons. Firstly, personal motivation and external recognition often don’t go hand in hand in academia. Everyone has had their articles turned down more than once. So an award is something extraordinary, and it drives you to do better. Another important aspect is that the winners of this award receive substantial financial support, access to Yandex Toloka, and an invitation to a conference. These are great motivators too.

Secondly, I didn’t think I had much of a chance of winning because I’ve had very few things published, so the award came as a pleasant surprise. The organizers must have seen me as a researcher capable of creating something interesting and useful. Evidently, the number of publications wasn’t the deciding factor for them.

I was also delighted to tell Professor Vetrov (he wasn’t eligible for the award, although I’m sure he would have won if he had been) and my friends. Even if they don’t know much about science or research, they think it’s cool and they’re happy for me.

The award ceremony took place almost at the same time as the first concert of the ensemble I’m in, which is made up of six cellists and a pianist. We’ve got quite a wide repertoire, so there’s bound to be something for everyone to enjoy. We play everything from Shostakovich waltzes to the Game of Thrones theme. That’s another achievement I’m proud of.

Date

6 July 2021

Topics

Community

Keywords

achievements, master's programmes, doctoral programmes

About

Big Data and Information Retrieval School, Faculty of Computer Science, Master's Programme in Data Science

About persons

Dmitry Vetrov
