Integrated Retrieval from Web of Documents and Data

Title: Integrated Retrieval from Web of Documents and Data
Publication Type: Book Chapter
Year of Publication: 2009
Authors: Krishnaprasad Thirunarayan, Trivikram Immaneni
Keywords: Data Retrieval, Hybrid Query Language, Hypertext Web, Information Retrieval, Semantic Web, Unified Web
Abstract

The Semantic Web is evolving into a property-linked web of data, conceptually different from but contained in the Web of hyperlinked documents. Data Retrieval techniques are typically used to retrieve data from the Semantic Web, while Information Retrieval techniques are used to retrieve documents from the Hypertext Web. We present a Unified Web model that integrates the two webs and formalizes the connection between them. We then present an approach to retrieving documents and data that captures the best of both worlds. Specifically, it improves recall for legacy documents and provides keyword-based search capability for the Semantic Web. We specify the Hybrid Query Language that embodies this approach, and the prototype system SITAR that implements it. We conclude with areas of future work.

Full Text

Krishnaprasad Thirunarayan and Trivikram Immaneni, 'Integrated Retrieval from Web of Documents and Data', Advances in Data Management, Studies in Computational Intelligence, Ras, Zbigniew W. and Dardzinska, Agnieszka (Eds.), vol. 223 (2009), pp. 25-48, ISBN: 978-3-642-02189-3, DOI: 10.1007/978-3-642-02190-9_2
pages: 25-48
publisher: Springer Berlin / Heidelberg
year: 2009
hasEditor: Z. W. Ras and A. Dardzinska
related resource url: http://knoesis.org/resources/library-resources/files/download/IRWDD.pdf and http://dx.doi.org/10.1007/978-3-642-02190-9_2
hasBookTitle: Advances in Data Management