A Query System for UNESCO's World Heritage at the WWW

Elena Vinuela Diaz

AbstractThe Web is probably the largest and richest information repository available today. Search engines are the common access routes to this valuable source but as they only do a keyword search sometimes their role is limited to the retrieval of potentially relevant documents but they cannot "understand" their semantic.

The aim of this thesis is to develop a semantic query system for World Heritage sites on the WWW. Web pages are usually semi-structured, and therefore very difficult to query for information. Advanced extraction processes of the information needs to be performed. This study evaluates an ontology driven approach for extracting reliable information from web pages about World Heritage.

An ontology that models the important concepts has been constructed, based on the analysis of the World Heritage domain. The ontology language DAML+OIL has been chosen for the ontology.

A prototype web-based application has been developed to perform the whole process, from getting the user s query to presenting the results of the semantic extraction processes.
KeywordsInformation Extraction, Information Retrieval, annotations, ontology, DAML+OIL, Knowledge base, Internet, World Heritage
TypeMaster's thesis [Academic thesis]
Year2004
PublisherInformatics and Mathematical Modelling, Technical University of Denmark, DTU
AddressRichard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby
SeriesIMM-Thesis-2004-29
Note
Electronic version(s)[pdf]
BibTeX data [bibtex]
IMM Group(s)Computer Science & Engineering