Mining Financial Transactions for Customers Behavior Prediction

Student: Nekhaenko Aleksandr

Supervisor: Sergei Kuznetsov

Faculty: Faculty of Computer Science

Educational Programme: Data Science (Master)

Year of Graduation: 2018

This thesis considers an algorithm that supports a business process of an international service offering a loyalty system tied to bank cards. The main idea of the study is to find a way to recover missing information in customer transactions. Each transaction contains textual information about the payment (an SMS from the bank), and the study investigates whether this text is related to other features of the data set. The text was represented as vectors using the bag-of-words, TF-IDF, Word2Vec, and FastText approaches, and logistic regression, Random Forest, and FastText models were trained on these representations. The effect of text cleaning (lowercasing, removing punctuation, removing numbers) on classification quality was also analyzed: removing punctuation improved quality, whereas the other cleaning steps worsened the result. The models reached acceptable quality. The best results, about 95% accuracy, were shown by the Word2Vec and FastText models on the test sample, which makes up 25% of the entire data set.
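The cleaning and vectorization steps named in the abstract can be illustrated with a minimal sketch. It is not the code from the thesis: the function names, the sample messages, and the unsmoothed `tf * log(N / df)` weighting are assumptions chosen for illustration, and only the Python standard library is used.

```python
import math
import re
import string
from collections import Counter


def clean_text(text, lower=False, strip_punct=False, strip_digits=False):
    """Apply the optional cleaning steps named in the abstract (hypothetical helper)."""
    if lower:
        text = text.lower()
    if strip_punct:
        # Delete every ASCII punctuation character.
        text = text.translate(str.maketrans("", "", string.punctuation))
    if strip_digits:
        text = re.sub(r"\d+", "", text)
    # Collapse any whitespace left behind by the removals.
    return " ".join(text.split())


def bag_of_words(text):
    """Count whitespace-separated tokens."""
    return Counter(text.split())


def tf_idf(docs):
    """Return one {token: weight} dict per document, with weight = tf * log(N / df)."""
    counts = [bag_of_words(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each token occurs.
    df = Counter(tok for c in counts for tok in c)
    return [
        {tok: (n / sum(c.values())) * math.log(n_docs / df[tok]) for tok, n in c.items()}
        for c in counts
    ]


# Invented SMS-like messages, not data from the thesis.
messages = [
    "Purchase 250.00 RUB, CAFE-MOSCOW. Balance: 1200.50",
    "Purchase 99.90 RUB, BOOKSHOP. Balance: 1100.60",
]
cleaned = [clean_text(m, strip_punct=True) for m in messages]
weights = tf_idf(cleaned)
```

In this sketch, tokens that occur in every message (such as "Purchase") receive zero weight, while tokens specific to one message keep a positive weight. Vectors of this kind could then be passed to a classifier such as logistic regression.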

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.
