Web data reconciliation: Models and experiences

Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceeding › Chapter

2 Citations (Scopus)

Abstract

An increasing number of web sites offer structured information about recognizable concepts relevant to many application domains, such as finance, sports, and commercial products. However, web data is inherently imprecise and uncertain, and different web sources can provide conflicting values. Characterizing the uncertainty of web data is an important issue, and several models have recently been proposed in the literature. This chapter illustrates state-of-the-art Bayesian models to evaluate the quality of data extracted from the Web and reports the results of an extensive application of the models to real-life web data. Experimental results show that for some applications even simple approaches can provide effective results, while sophisticated solutions are needed to obtain a more precise characterization of the uncertainty.

Original language: English
Title of host publication: Search Computing
Subtitle of host publication: Broadening Web Search
Editors: Stefano Ceri, Marco Brambilla
Pages: 1-15
Number of pages: 15
DOI: https://doi.org/10.1007/978-3-642-34213-4_1
Publication status: Published - 1 Dec 2012

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7538
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Blanco, L., Crescenzi, V., Merialdo, P., & Papotti, P. (2012). Web data reconciliation: Models and experiences. In S. Ceri & M. Brambilla (Eds.), Search Computing: Broadening Web Search (pp. 1-15). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7538). https://doi.org/10.1007/978-3-642-34213-4_1