Random Forest Visualization

Student: Kuzneczova Natal`ya

Supervisor: Irina A. Lomazova

Faculty: School of Software Engineering

Educational Programme: Master

Year of Graduation: 2014

<p style="text-align: justify;">Classification is the process of assigning a class label to an observation based on its properties or attributes. A classification algorithm is applied to a data set, producing a model. By studying the model, insights about the structure of the data set can be gained. The benefits that a model can bring depend on the model. In this work, a Random Forest model is used for the analysis of data, and the model is explored by means of visualization. The results include this report and the prototype of a visualization analysis tool. The tool, named ReFINE (Random Forest INspEctor), consists of several visualizations of a Random Forest model. ReFINE provides visualizations of the Random Forest components (trees) and of its special features: proximity measure, variable importance, interactions and prototypes. Each of these aspects is presented with a different visualization technique; all the visualizations are integrated to show the connections between them and to allow a user to discover patterns in data sets. The effectiveness of the approach is validated with various data sets, including generated and real data. As a result, ReFINE allows a user to investigate data: its most important variables, their split points, the connections between instances, and their distribution.</p>

Student theses at HSE must be completed in accordance with the University rules and the regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.
