ESTER: Efficient search on text, entities, and relations

Holger Bast, Alexandru Chitea, Fabian Suchanek, Ingmar Weber

Research output: Chapter in Book/Report/Conference proceedingConference contribution

70 Citations (Scopus)

Abstract

We present ESTER, a modular and highly efficient system for combined full-text and ontology search. ESTER builds on a query engine that supports two basic operations: prefix search and join. Both of these can be implemented very efficiently with a compact index, yet in combination provide powerful querying capabilities. We show how ESTER can answer basic SPARQL graph-pattern queries on the ontology by reducing them to a small number of these two basic operations. ESTER further supports a natural blend of such semantic queries with ordinary full-text queries. Moreover, the prefix search operation allows for a fully interactive and proactive user interface, which after every keystroke suggests to the user possible semantic interpretations of his or her query, and speculatively executes the most likely of these interpretations. As a proof of concept, we applied ESTER to the English Wikipedia, which contains about 3 million documents, combined with the recent YAGO ontology, which contains about 2.5 million facts. For a variety of complex queries, ESTER achieves worst-case query processing times of a fraction of a second, on a single machine, with an index size of about 4 GB.

Original languageEnglish
Title of host publicationProceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
Pages671-678
Number of pages8
DOIs
Publication statusPublished - 30 Nov 2007
Externally publishedYes
Event30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 - Amsterdam, Netherlands
Duration: 23 Jul 200727 Jul 2007

Other

Other30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
CountryNetherlands
CityAmsterdam
Period23/7/0727/7/07

Fingerprint

Ontology
Query
Semantics
Query processing
Prefix
User interfaces
Engines
SPARQL
Wikipedia
Single Machine
Query Processing
User Interface
Join
Text
Engine
Likely
Graph in graph theory
Interpretation

Keywords

  • Interactive
  • Ontologies
  • Proactive
  • Semantic search
  • Wikipedia

ASJC Scopus subject areas

  • Information Systems
  • Software
  • Applied Mathematics

Cite this

Bast, H., Chitea, A., Suchanek, F., & Weber, I. (2007). ESTER: Efficient search on text, entities, and relations. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 (pp. 671-678) https://doi.org/10.1145/1277741.1277856

ESTER : Efficient search on text, entities, and relations. / Bast, Holger; Chitea, Alexandru; Suchanek, Fabian; Weber, Ingmar.

Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07. 2007. p. 671-678.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bast, H, Chitea, A, Suchanek, F & Weber, I 2007, ESTER: Efficient search on text, entities, and relations. in Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07. pp. 671-678, 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07, Amsterdam, Netherlands, 23/7/07. https://doi.org/10.1145/1277741.1277856
Bast H, Chitea A, Suchanek F, Weber I. ESTER: Efficient search on text, entities, and relations. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07. 2007. p. 671-678 https://doi.org/10.1145/1277741.1277856
Bast, Holger ; Chitea, Alexandru ; Suchanek, Fabian ; Weber, Ingmar. / ESTER : Efficient search on text, entities, and relations. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07. 2007. pp. 671-678
@inproceedings{43f188a7522c4338b3f0ad07cac3e584,
title = "ESTER: Efficient search on text, entities, and relations",
abstract = "We present ESTER, a modular and highly efficient system for combined full-text and ontology search. ESTER builds on a query engine that supports two basic operations: prefix search and join. Both of these can be implemented very efficiently with a compact index, yet in combination provide powerful querying capabilities. We show how ESTER can answer basic SPARQL graph-pattern queries on the ontology by reducing them to a small number of these two basic operations. ESTER further supports a natural blend of such semantic queries with ordinary full-text queries. Moreover, the prefix search operation allows for a fully interactive and proactive user interface, which after every keystroke suggests to the user possible semantic interpretations of his or her query, and speculatively executes the most likely of these interpretations. As a proof of concept, we applied ESTER to the English Wikipedia, which contains about 3 million documents, combined with the recent YAGO ontology, which contains about 2.5 million facts. For a variety of complex queries, ESTER achieves worst-case query processing times of a fraction of a second, on a single machine, with an index size of about 4 GB.",
keywords = "Interactive, Ontologies, Proactive, Semantic search, Wikipedia",
author = "Holger Bast and Alexandru Chitea and Fabian Suchanek and Ingmar Weber",
year = "2007",
month = "11",
day = "30",
doi = "10.1145/1277741.1277856",
language = "English",
isbn = "1595935975",
pages = "671--678",
booktitle = "Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07",

}

TY - GEN

T1 - ESTER

T2 - Efficient search on text, entities, and relations

AU - Bast, Holger

AU - Chitea, Alexandru

AU - Suchanek, Fabian

AU - Weber, Ingmar

PY - 2007/11/30

Y1 - 2007/11/30

N2 - We present ESTER, a modular and highly efficient system for combined full-text and ontology search. ESTER builds on a query engine that supports two basic operations: prefix search and join. Both of these can be implemented very efficiently with a compact index, yet in combination provide powerful querying capabilities. We show how ESTER can answer basic SPARQL graph-pattern queries on the ontology by reducing them to a small number of these two basic operations. ESTER further supports a natural blend of such semantic queries with ordinary full-text queries. Moreover, the prefix search operation allows for a fully interactive and proactive user interface, which after every keystroke suggests to the user possible semantic interpretations of his or her query, and speculatively executes the most likely of these interpretations. As a proof of concept, we applied ESTER to the English Wikipedia, which contains about 3 million documents, combined with the recent YAGO ontology, which contains about 2.5 million facts. For a variety of complex queries, ESTER achieves worst-case query processing times of a fraction of a second, on a single machine, with an index size of about 4 GB.

AB - We present ESTER, a modular and highly efficient system for combined full-text and ontology search. ESTER builds on a query engine that supports two basic operations: prefix search and join. Both of these can be implemented very efficiently with a compact index, yet in combination provide powerful querying capabilities. We show how ESTER can answer basic SPARQL graph-pattern queries on the ontology by reducing them to a small number of these two basic operations. ESTER further supports a natural blend of such semantic queries with ordinary full-text queries. Moreover, the prefix search operation allows for a fully interactive and proactive user interface, which after every keystroke suggests to the user possible semantic interpretations of his or her query, and speculatively executes the most likely of these interpretations. As a proof of concept, we applied ESTER to the English Wikipedia, which contains about 3 million documents, combined with the recent YAGO ontology, which contains about 2.5 million facts. For a variety of complex queries, ESTER achieves worst-case query processing times of a fraction of a second, on a single machine, with an index size of about 4 GB.

KW - Interactive

KW - Ontologies

KW - Proactive

KW - Semantic search

KW - Wikipedia

UR - http://www.scopus.com/inward/record.url?scp=36448932681&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36448932681&partnerID=8YFLogxK

U2 - 10.1145/1277741.1277856

DO - 10.1145/1277741.1277856

M3 - Conference contribution

AN - SCOPUS:36448932681

SN - 1595935975

SN - 9781595935977

SP - 671

EP - 678

BT - Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07

ER -