CMIC at INEX 2007: Book Search track

Walid Magdy, Kareem Darwish

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

With massive book digitization efforts underway, the need for effective retrieval of books and pages in books is an important problem. This paper describes our submissions to the INEX 2007 Book Search track. We explored using book specific features such as table of content and index pages and headers along with non-book specific features. Our results show that indexing the entire contents of books and headers provided the most effective retrieval strategy.

Original languageEnglish
Title of host publicationFocused Access to XML Documents - 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Revised and Selected Papers
Pages175-182
Number of pages8
DOIs
Publication statusPublished - 22 Sep 2008
Event6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007 - Dagstuhl Castle, Germany
Duration: 17 Dec 200719 Dec 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4862 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007
CountryGermany
CityDagstuhl Castle
Period17/12/0719/12/07

    Fingerprint

Keywords

  • Book search
  • OCR retrieval

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Magdy, W., & Darwish, K. (2008). CMIC at INEX 2007: Book Search track. In Focused Access to XML Documents - 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Revised and Selected Papers (pp. 175-182). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4862 LNCS). https://doi.org/10.1007/978-3-540-85902-4_16