TableRank

A ranking algorithm for table search and retrieval

Ying Liu, Kun Bai, Prasenjit Mitra, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.

Original languageEnglish
Title of host publicationProceedings of the National Conference on Artificial Intelligence
Pages317-322
Number of pages6
Volume1
Publication statusPublished - 2007
Externally publishedYes
EventAAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference - Vancouver, BC
Duration: 22 Jul 200726 Jul 2007

Other

OtherAAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
CityVancouver, BC
Period22/7/0726/7/07

Fingerprint

Search engines
Vector spaces
Websites

ASJC Scopus subject areas

  • Software

Cite this

Liu, Y., Bai, K., Mitra, P., & Giles, C. L. (2007). TableRank: A ranking algorithm for table search and retrieval. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 317-322)

TableRank : A ranking algorithm for table search and retrieval. / Liu, Ying; Bai, Kun; Mitra, Prasenjit; Giles, C. Lee.

Proceedings of the National Conference on Artificial Intelligence. Vol. 1 2007. p. 317-322.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Liu, Y, Bai, K, Mitra, P & Giles, CL 2007, TableRank: A ranking algorithm for table search and retrieval. in Proceedings of the National Conference on Artificial Intelligence. vol. 1, pp. 317-322, AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference, Vancouver, BC, 22/7/07.
Liu Y, Bai K, Mitra P, Giles CL. TableRank: A ranking algorithm for table search and retrieval. In Proceedings of the National Conference on Artificial Intelligence. Vol. 1. 2007. p. 317-322
Liu, Ying ; Bai, Kun ; Mitra, Prasenjit ; Giles, C. Lee. / TableRank : A ranking algorithm for table search and retrieval. Proceedings of the National Conference on Artificial Intelligence. Vol. 1 2007. pp. 317-322
@inproceedings{46bb3258d89b43a08d64485451ac177f,
title = "TableRank: A ranking algorithm for table search and retrieval",
abstract = "Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.",
author = "Ying Liu and Kun Bai and Prasenjit Mitra and Giles, {C. Lee}",
year = "2007",
language = "English",
isbn = "1577353234",
volume = "1",
pages = "317--322",
booktitle = "Proceedings of the National Conference on Artificial Intelligence",

}

TY - GEN

T1 - TableRank

T2 - A ranking algorithm for table search and retrieval

AU - Liu, Ying

AU - Bai, Kun

AU - Mitra, Prasenjit

AU - Giles, C. Lee

PY - 2007

Y1 - 2007

N2 - Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.

AB - Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm - TableRank to improve the performance of our table search engine TableSeer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine outperforms existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.

UR - http://www.scopus.com/inward/record.url?scp=36349024712&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36349024712&partnerID=8YFLogxK

M3 - Conference contribution

SN - 1577353234

SN - 9781577353232

VL - 1

SP - 317

EP - 322

BT - Proceedings of the National Conference on Artificial Intelligence

ER -