• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
  • HSE University
  • Student Theses
  • Construction of Local Approximators and Prediction Accuracy Estimation for Prediction of Physicochemical Properties of Molecules

Construction of Local Approximators and Prediction Accuracy Estimation for Prediction of Physicochemical Properties of Molecules

Student: Matveev Albert

Supervisor: Evgeny V. Burnaev

Faculty: Faculty of Computer Science

Educational Programme: Mathematical Methods of Optimization and Stochastics (Master)

Final Grade: 9

Year of Graduation: 2017

Chemical informatics focuses on storing, indexing, searching, retrieving, and applying information about chemical compounds. Chemical informatics helps chemists investigate new problems and organize and analyze scientific data to develop novel compounds, materials, and processes through the application of information technology. Mathematical modeling and modern machine learning algorithms are the main research tools for approximation of physicochemical properties of molecules. Distinctive features of such problems are the large dimensionality of the input space, significant noise in the experimental data, heterogeneity of the sample. These features require additional analysis and development of new approaches. The aim of this work is to develop a system of interrelated methods (develop new methods and modify existing ones) that would allow to consider features of the problem and provide the needs described above. The main results include the following. A new conformal metric was proposed for construction of conformal predictors for the case of classification problem. A method of the input space partitioning with construction of regression models was used for prediction of physicochemical properties of molecules, a regression method based on classification algorithms was developed. Described methods were applied to the practical problem and numerical simulations were carried out on the melting point dataset and lipophilicity dataset.

Full text (added May 26, 2017)

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses