Solomon

Seeking the truth via copying detection

Xin Luna Dong, Yifan Hu, Laure Berti-Equille, Divesh Srivastava

Research output: Chapter in Book/Report/Conference proceeding › Chapter

23 Citations (Scopus)

Abstract

We live in the Information Era, with access to a huge amount of information from a variety of data sources. However, data sources are of different qualities, often providing conflicting, out-of-date and incomplete data. Data sources can also easily copy, reformat and modify data from other sources, propagating erroneous data. These issues make the identification of high-quality information and sources non-trivial. We demonstrate the SOLOMON system, whose core is a module that detects copying between sources. We demonstrate that we can effectively detect copying relationships between data sources, leverage the results in truth discovery, and provide a user-friendly interface to help users identify the sources that best suit their information needs.

Original language: English
Title of host publication: Proceedings of the VLDB Endowment
Pages: 1617-1620
Number of pages: 4
Volume: 3
Edition: 2
Publication status: Published - Sep 2010
Externally published: Yes

Fingerprint

Copying
User interfaces

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science (all)

Cite this

Dong, X. L., Hu, Y., Berti-Equille, L., & Srivastava, D. (2010). Solomon: Seeking the truth via copying detection. In Proceedings of the VLDB Endowment (2 ed., Vol. 3, pp. 1617-1620).

@inbook{a45ab30f983c49e4b9089deff1332eeb,
title = "Solomon: Seeking the truth via copying detection",
abstract = "We live in the Information Era, with access to a huge amount of information from a variety of data sources. However, data sources are of different qualities, often providing conflicting, out-of-date and incomplete data. Data sources can also easily copy, reformat and modify data from other sources, propagating erroneous data. These issues make the identification of high quality information and sources non-trivial. We demonstrate the SOLOMON system, whose core is a module that detects copying between sources. We demonstrate that we can effectively detect copying relationship between data sources, leverage the results in truth discovery, and provide a user-friendly interface to facilitate users in identifying sources that best suit their information needs.",
author = "Dong, {Xin Luna} and Yifan Hu and Laure Berti-Equille and Divesh Srivastava",
year = "2010",
month = "9",
language = "English",
volume = "3",
pages = "1617--1620",
booktitle = "Proceedings of the VLDB Endowment",
edition = "2",

}

TY - CHAP

T1 - Solomon

T2 - Seeking the truth via copying detection

AU - Dong, Xin Luna

AU - Hu, Yifan

AU - Berti-Equille, Laure

AU - Srivastava, Divesh

PY - 2010/9

Y1 - 2010/9

AB - We live in the Information Era, with access to a huge amount of information from a variety of data sources. However, data sources are of different qualities, often providing conflicting, out-of-date and incomplete data. Data sources can also easily copy, reformat and modify data from other sources, propagating erroneous data. These issues make the identification of high quality information and sources non-trivial. We demonstrate the SOLOMON system, whose core is a module that detects copying between sources. We demonstrate that we can effectively detect copying relationship between data sources, leverage the results in truth discovery, and provide a user-friendly interface to facilitate users in identifying sources that best suit their information needs.

UR - http://www.scopus.com/inward/record.url?scp=84865557019&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84865557019&partnerID=8YFLogxK

M3 - Chapter

VL - 3

SP - 1617

EP - 1620

BT - Proceedings of the VLDB Endowment

ER -