• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Standardized Approach to Automatization of Data Quality Control

Student: Zdorov Mikhail

Supervisor: Tina Berezhnaya

Faculty: Faculty of Creative Industries

Educational Programme: Data Journalism (Master)

Year of Graduation: 2020

Data is an inexhaustible resource, which potential significantly exceeds the potential of minerals. It is used to make effective management decisions, create new products or improve existing ones. The opposite effects arise from the use of low quality data: it is misleading, organizations suffer reputational and financial losses. They can be avoided if the data is checked on time and the low quality is revealed. Stable speed and quality of data quality control can be ensured only by the automatization. Currently, there is no standardized approach to data quality control that satisfies business conditions – criteria of speed, quality and versatility. Organizations that monitor the quality of their data use unique automatization solutions based on proprietary technologies. They cannot be duplicated, their development and implementation require large financial costs. In this work, such a standardized approach was proposed. It includes 7 indicators for checking the quality of the data, the methodology for their calculation and recommendations for automatization solution. It is based on an analysis of existing approaches for their compliance with the criteria of speed, quality and versatility. The compliance of this standardized approach with the business conditions was proved experimentally by successful testing of MVP (minimum viable product) that automatizes calculation of the proposed indicators.

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses