• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Ontologies Manager Implementation for the Real Estate Agency Information System

Student: Glukhov Roman

Supervisor: Lyudmila N. Lyadova

Faculty: Faculty of Economics, Management, and Business Informatics

Educational Programme: Bachelor

Year of Graduation: 2014

<p align="center">Аннотация к выпускной квалификационной работе на тему:</p><p align="center">&laquo;Ontologies Manager Implementation for the Real Estate Agency Information System&raquo;</p><p>Студента группы БИ-10-2 <strong>Глухов Романа Игоревича</strong></p><p>In this paper, we consider the problem of automatic search of the real estate advertisements on the Internet. The main aim is to develop an application that search and retrieve information from the proposals on the real estate market on the Internet.</p><p>Today there are varieties of services designed to find and extract structured information. Services use a variety of approaches, such as retrieving data with their subsequent processing and loading into the database. The advantages of this approach include the fact that downloading data is structured. This allows to quickly communicating to the already loaded data. The disadvantages of this approach is that the processing of information portals with information on real estate, in particular the free message boards (organized databases), it does not always give the correct classification, and hence, the request can meet the irrelevant information. It should be noted that this approach is most useful for extracting data from the sites where the information is clearly structured.</p><p>This work was based on a method that accesses the data sources on the Internet with regard to their ways with their submission-accepted classification, etc. Free search portals on adding new ad is required to fill the main and additional fields on your ad in order to find it quickly, that is, the database is flagged &quot;classify&quot; information. The request for a particular Ad is generated by its consistent complication, i.e. adding new screening criteria (ad type (sale, rent, purchase), location (region, district, and exact address), type of house (Stalin, Khrushchev, etc. etc.), material (brick, panel, etc.), area, price range). After that, &quot;script&quot; processes the request and extends relevant to the query proposals from the base. Then in the address bar appears header, each of which - it previously specified filter.</p><p>During the analyzed websites sources &quot;Avito&quot; and &quot;Hand in Hand&quot;, the analysis highlighted the main sections and filters for Real Estate, based on them has been developed ontology. Ontology can describe as triples (subject, predicate, subject) different judgments about instances of domain objects. Due to this, same instance ontology concepts can be described in different ways , for example, &quot; Avito &quot; subject &laquo;Rooms&raquo;, predicate &laquo;One-rooms&raquo;, &quot;subject / 1 -komnatnye&raquo;; for &quot; Hand in Hand &quot; subject &laquo;Rooms&raquo;, predicate &laquo;One-rooms&raquo;, subject &laquo;/ rooms = 1 .&quot; The method is based on the sequential construction of the address bar to which the request will be performed by retrieving data from the ontology corresponding to a user-specified filter. At this stage, the&nbsp; prototype of application allows you to build a valid request to the website source &quot;Avito&quot;, and in the future we plan to implement this to &quot;Hand in Hand&quot; portal. The positive side of this approach is that we get the relevant request of the user data; the negative side is that at each request must be handled by the internet source.</p><p>The problem of defining proposals for the real estate market is divided into several stages. At the first stage, the extract necessary information from the ontology. The second step is the formation of the address bar of the values ​​that we have received from the ontology. The third stage is a request to the website and download information on xpath-queries.</p><p>&nbsp;During the work, the library OwlDotNetApi is able to process data from the ontology has been studied. Service &quot;IVS&quot; using this library to solve the problem of information retrieval was also analyzed. Algorithms based on service described algorithms to solve the problem. In turn directly, to extract data from html-pages was dismantled HtmlAgilityPack library and were described algorithms to extract information from HTML-documents.</p><p>The paper contains a Windows Forms application developed in Visual Studio environment, which search real estate offers for the requested criteria. Prototype application allows you to receive relevant ads with search site &quot;Avito.&quot;</p>

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses