Can back-of-the-book indexes be automatically created?

Zhaohui Wu, Zhenhui Li, Prasenjit Mitra, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

10 Citations (Scopus)

Abstract

Creating back-of-the-book indexes remains one of the few manual tasks in publishing. Inspired by how human indexers create back-of-the-book indexes, we present a new domain-independent, corpus-free, and training-free approach to automating the task. Given a book, index terms are selected sequentially according to an indexability score that combines the structural information in the book with a novel context-aware measure of term informativeness, which draws on web knowledge bases such as Wikipedia. Through extensive experiments on books from various domains, we show that our approach is more effective and practical than previous approaches based on keyword extraction and supervised learning.
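The abstract does not give the scoring formula, so the following is only a minimal sketch of the general idea: blend a structure signal with an informativeness value and pick terms one at a time by the resulting score. The `Candidate` fields, the 0.7/0.3 weights, the `alpha` parameter, and the function names are all hypothetical, and the informativeness value is a plain number supplied by the caller rather than anything computed from Wikipedia.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    term: str
    in_heading: bool        # assumed structure signal: term occurs in a chapter or section title
    body_count: int         # occurrences in the running text
    informativeness: float  # placeholder in [0, 1]; NOT the paper's knowledge-base measure


def indexability(c: Candidate, alpha: float = 0.5) -> float:
    """Blend a structure signal with informativeness (illustrative weights only)."""
    heading_part = 0.7 if c.in_heading else 0.0
    frequency_part = 0.3 * min(c.body_count, 10) / 10  # cap so frequent filler words do not dominate
    return alpha * (heading_part + frequency_part) + (1 - alpha) * c.informativeness


def select_index_terms(candidates, k, alpha=0.5):
    """Pick up to k terms sequentially, always taking the best remaining candidate."""
    remaining = list(candidates)
    selected = []
    while remaining and len(selected) < k:
        best = max(remaining, key=lambda c: indexability(c, alpha))
        selected.append(best.term)
        remaining.remove(best)
    return selected


if __name__ == "__main__":
    book_terms = [
        Candidate("hash table", True, 12, 0.9),
        Candidate("example", False, 40, 0.05),
        Candidate("amortized analysis", False, 3, 0.8),
    ]
    print(select_index_terms(book_terms, k=2))
```

In this toy input, a frequent but uninformative word such as "example" scores below a rarer technical phrase, which is the behaviour the abstract attributes to combining structure with informativeness. A real system would need a concrete informativeness estimate and candidate extraction, neither of which is specified here.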

Original language: English
Title of host publication: International Conference on Information and Knowledge Management, Proceedings
Pages: 1745-1750
Number of pages: 6
DOI: https://doi.org/10.1145/2505515.2505627
Publication status: Published - 2013
Externally published: Yes
Event: 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 - San Francisco, CA
Duration: 27 Oct 2013 to 1 Nov 2013

Fingerprint

Informativeness
Knowledge base
Information structure
Keyword extraction
Automation
Wikipedia
Context-aware
Experiment
World Wide Web

Keywords

  • Back-of-the-book index
  • Book index
  • Term informativeness

ASJC Scopus subject areas

  • Business, Management and Accounting (all)
  • Decision Sciences (all)

Cite this

Wu, Z., Li, Z., Mitra, P., & Giles, C. L. (2013). Can back-of-the-book indexes be automatically created? In International Conference on Information and Knowledge Management, Proceedings (pp. 1745-1750) https://doi.org/10.1145/2505515.2505627

@inproceedings{56cedc772a2c40fea90b33e3f7375092,
title = "Can back-of-the-book indexes be automatically created?",
keywords = "Back-of-the-book index, Book index, Term informativeness",
author = "Zhaohui Wu and Zhenhui Li and Prasenjit Mitra and Giles, {C. Lee}",
year = "2013",
doi = "10.1145/2505515.2505627",
language = "English",
isbn = "9781450322638",
pages = "1745--1750",
booktitle = "International Conference on Information and Knowledge Management, Proceedings",

}
