• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Transformer-Based Architectures for Tabular Data

Student: Khrushkov Pavel

Supervisor: Evgeny Sokolov

Faculty: Faculty of Computer Science

Educational Programme: Data Science (Master)

Year of Graduation: 2021

Currently, the methods of deep learning show state-of-the-art quality for a wide range of tasks. Over the years of research, various task-specific or even field-specific models have emerged, i.e., recurrent neural networks or Transformers for natural language processing and convolutional neural networks for computer vision tasks. One of the fields that lack such specialized architectures is tabular data processing, for which fully-connected neural networks are still in use, and the standard methods are non-differentiable models, e.g., gradient boosting decision tree. This fact, along with a prevailing tendency towards using Transformers out of the scope of natural language processing, raises the question of the applicability of Transformers for tabular data. This work studies this possibility and the use of Transformer-specific architecture modifications to force a proper inductive bias. We propose T-Transformer, which performs better than CatBoost, TabNet, or fully-connected networks on some datasets. T-Transformer does not require many parameters, yet resulting in competitive quality. We also study the interpretability of the proposed method.

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses