Searching for tables in digital documents

Ying Liu, Kun Bai, Prasenjit Mitra, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Citations (Scopus)

Abstract

Tables are ubiquitous. In scientific documents, tables are widely used to present experimental results or statistical data in a condensed fashion. Current search engines do not allow the end-user to search for relevant tables. In this paper, we describe TableSeer, an automatic table extraction and search engine system. TableSeer crawls scientific documents, identifies documents with tables, extracts tables from documents, indexes them, and enables end-users to search for tables. We also propose an extensive set of medium-independent metadata for table representation. Given a query, TableSeer ranks the returned results using an innovative ranking algorithm, TableRank. Our results show that TableSeer outperforms popular search engines, such as Google Scholar, when the end-user searches for tables.

Original language: English
Title of host publication: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
Pages: 934-938
Number of pages: 5
Volume: 2
DOIs: https://doi.org/10.1109/ICDAR.2007.4377052
Publication status: Published - 2007
Externally published: Yes
Event: 9th International Conference on Document Analysis and Recognition, ICDAR 2007 - Curitiba
Duration: 23 Sep 2007 - 26 Sep 2007

Other

Other: 9th International Conference on Document Analysis and Recognition, ICDAR 2007
City: Curitiba
Period: 23/9/07 - 26/9/07

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition

Cite this

Liu, Y., Bai, K., Mitra, P., & Giles, C. L. (2007). Searching for tables in digital documents. In Proceedings of the International Conference on Document Analysis and Recognition, ICDAR (Vol. 2, pp. 934-938). [4377052] https://doi.org/10.1109/ICDAR.2007.4377052