Generating synopses for document-element search

Sumit Bhatia, Shibamouli Lahiri, Prasenjit Mitra

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Citations (Scopus)

Abstract

Scientists often search for document-elements like tables, figures, or algorithm pseudo-codes. Domain scientists and researchers report important data, results and algorithms using these document-elements; readers want to compare the reported results with their findings. Some document-element search engines have been proposed (especially to search for tables and figures) to make this task easier. While searching for document-elements today, the end-user is presented with the caption of the document-element and a sentence in the document text that refers to the document-element. Oftentimes, the caption and the reference text do not contain enough information to interpret the document-element. In this paper, we present the first set of methods to extract this useful information (synopsis) related to document-elements automatically. We also investigate the problem of choosing the optimum synopsis-size that strikes a balance between information content and size of the generated synopses.

Original languageEnglish
Title of host publicationInternational Conference on Information and Knowledge Management, Proceedings
Pages2003-2006
Number of pages4
DOIs
Publication statusPublished - 2009
Externally publishedYes
EventACM 18th International Conference on Information and Knowledge Management, CIKM 2009 - Hong Kong
Duration: 2 Nov 20096 Nov 2009

Other

OtherACM 18th International Conference on Information and Knowledge Management, CIKM 2009
CityHong Kong
Period2/11/096/11/09

Fingerprint

Search engine
Information content
End users

Keywords

  • Classification
  • Document-element
  • Summarization
  • Synopses

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

Bhatia, S., Lahiri, S., & Mitra, P. (2009). Generating synopses for document-element search. In International Conference on Information and Knowledge Management, Proceedings (pp. 2003-2006) https://doi.org/10.1145/1645953.1646287

Generating synopses for document-element search. / Bhatia, Sumit; Lahiri, Shibamouli; Mitra, Prasenjit.

International Conference on Information and Knowledge Management, Proceedings. 2009. p. 2003-2006.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bhatia, S, Lahiri, S & Mitra, P 2009, Generating synopses for document-element search. in International Conference on Information and Knowledge Management, Proceedings. pp. 2003-2006, ACM 18th International Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, 2/11/09. https://doi.org/10.1145/1645953.1646287
Bhatia S, Lahiri S, Mitra P. Generating synopses for document-element search. In International Conference on Information and Knowledge Management, Proceedings. 2009. p. 2003-2006 https://doi.org/10.1145/1645953.1646287
Bhatia, Sumit ; Lahiri, Shibamouli ; Mitra, Prasenjit. / Generating synopses for document-element search. International Conference on Information and Knowledge Management, Proceedings. 2009. pp. 2003-2006
@inproceedings{cead818239604f32860caf7d2aa1c75b,
title = "Generating synopses for document-element search",
abstract = "Scientists often search for document-elements like tables, figures, or algorithm pseudo-codes. Domain scientists and researchers report important data, results and algorithms using these document-elements; readers want to compare the reported results with their findings. Some document-element search engines have been proposed (especially to search for tables and figures) to make this task easier. While searching for document-elements today, the end-user is presented with the caption of the document-element and a sentence in the document text that refers to the document-element. Oftentimes, the caption and the reference text do not contain enough information to interpret the document-element. In this paper, we present the first set of methods to extract this useful information (synopsis) related to document-elements automatically. We also investigate the problem of choosing the optimum synopsis-size that strikes a balance between information content and size of the generated synopses.",
keywords = "Classification, Document-element, Summarization, Synopses",
author = "Sumit Bhatia and Shibamouli Lahiri and Prasenjit Mitra",
year = "2009",
doi = "10.1145/1645953.1646287",
language = "English",
isbn = "9781605585123",
pages = "2003--2006",
booktitle = "International Conference on Information and Knowledge Management, Proceedings",

}

TY - GEN

T1 - Generating synopses for document-element search

AU - Bhatia, Sumit

AU - Lahiri, Shibamouli

AU - Mitra, Prasenjit

PY - 2009

Y1 - 2009

N2 - Scientists often search for document-elements like tables, figures, or algorithm pseudo-codes. Domain scientists and researchers report important data, results and algorithms using these document-elements; readers want to compare the reported results with their findings. Some document-element search engines have been proposed (especially to search for tables and figures) to make this task easier. While searching for document-elements today, the end-user is presented with the caption of the document-element and a sentence in the document text that refers to the document-element. Oftentimes, the caption and the reference text do not contain enough information to interpret the document-element. In this paper, we present the first set of methods to extract this useful information (synopsis) related to document-elements automatically. We also investigate the problem of choosing the optimum synopsis-size that strikes a balance between information content and size of the generated synopses.

AB - Scientists often search for document-elements like tables, figures, or algorithm pseudo-codes. Domain scientists and researchers report important data, results and algorithms using these document-elements; readers want to compare the reported results with their findings. Some document-element search engines have been proposed (especially to search for tables and figures) to make this task easier. While searching for document-elements today, the end-user is presented with the caption of the document-element and a sentence in the document text that refers to the document-element. Oftentimes, the caption and the reference text do not contain enough information to interpret the document-element. In this paper, we present the first set of methods to extract this useful information (synopsis) related to document-elements automatically. We also investigate the problem of choosing the optimum synopsis-size that strikes a balance between information content and size of the generated synopses.

KW - Classification

KW - Document-element

KW - Summarization

KW - Synopses

UR - http://www.scopus.com/inward/record.url?scp=74549222997&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=74549222997&partnerID=8YFLogxK

U2 - 10.1145/1645953.1646287

DO - 10.1145/1645953.1646287

M3 - Conference contribution

SN - 9781605585123

SP - 2003

EP - 2006

BT - International Conference on Information and Knowledge Management, Proceedings

ER -