Supporting the automatic construction of entity aware search engines

Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Citations (Scopus)

Abstract

Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

Original languageEnglish
Title of host publicationInternational Conference on Information and Knowledge Management, Proceedings
Pages149-156
Number of pages8
DOIs
Publication statusPublished - 1 Dec 2008
Externally publishedYes
Event10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08 - Napa Valley, CA, United States
Duration: 26 Oct 200830 Oct 2008

Other

Other10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
CountryUnited States
CityNapa Valley, CA
Period26/10/0830/10/08

Fingerprint

Search engine
World Wide Web
Prototype
Semantic web
Experiment
Web sites

Keywords

  • Entity aware search engines
  • Resource discovery
  • Web exploration

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

Blanco, L., Crescenzi, V., Merialdo, P., & Papotti, P. (2008). Supporting the automatic construction of entity aware search engines. In International Conference on Information and Knowledge Management, Proceedings (pp. 149-156) https://doi.org/10.1145/1458502.1458526

Supporting the automatic construction of entity aware search engines. / Blanco, Lorenzo; Crescenzi, Valter; Merialdo, Paolo; Papotti, Paolo.

International Conference on Information and Knowledge Management, Proceedings. 2008. p. 149-156.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Blanco, L, Crescenzi, V, Merialdo, P & Papotti, P 2008, Supporting the automatic construction of entity aware search engines. in International Conference on Information and Knowledge Management, Proceedings. pp. 149-156, 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08, Napa Valley, CA, United States, 26/10/08. https://doi.org/10.1145/1458502.1458526
Blanco L, Crescenzi V, Merialdo P, Papotti P. Supporting the automatic construction of entity aware search engines. In International Conference on Information and Knowledge Management, Proceedings. 2008. p. 149-156 https://doi.org/10.1145/1458502.1458526
Blanco, Lorenzo ; Crescenzi, Valter ; Merialdo, Paolo ; Papotti, Paolo. / Supporting the automatic construction of entity aware search engines. International Conference on Information and Knowledge Management, Proceedings. 2008. pp. 149-156
@inproceedings{f152fd8ebcdc4243b0c8526fc3132def,
title = "Supporting the automatic construction of entity aware search engines",
abstract = "Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.",
keywords = "Entity aware search engines, Resource discovery, Web exploration",
author = "Lorenzo Blanco and Valter Crescenzi and Paolo Merialdo and Paolo Papotti",
year = "2008",
month = "12",
day = "1",
doi = "10.1145/1458502.1458526",
language = "English",
isbn = "9781605582603",
pages = "149--156",
booktitle = "International Conference on Information and Knowledge Management, Proceedings",

}

TY - GEN

T1 - Supporting the automatic construction of entity aware search engines

AU - Blanco, Lorenzo

AU - Crescenzi, Valter

AU - Merialdo, Paolo

AU - Papotti, Paolo

PY - 2008/12/1

Y1 - 2008/12/1

N2 - Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

AB - Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

KW - Entity aware search engines

KW - Resource discovery

KW - Web exploration

UR - http://www.scopus.com/inward/record.url?scp=77951136761&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77951136761&partnerID=8YFLogxK

U2 - 10.1145/1458502.1458526

DO - 10.1145/1458502.1458526

M3 - Conference contribution

SN - 9781605582603

SP - 149

EP - 156

BT - International Conference on Information and Knowledge Management, Proceedings

ER -