Data fusion: Resolving conflicts from multiple sources

Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called data fusion. This paper describes a novel approach that finds true values from conflicting information when there are a large number of sources, among which some may copy from others. We present a case study on real-world data showing that the described algorithm can significantly improve accuracy of truth discovery and is scalable when there are a large number of data sources.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages64-76
Number of pages13
Volume7923 LNCS
DOIs
Publication statusPublished - 16 Jul 2013
Externally publishedYes
Event14th International Conference on Web-Age Information Management, WAIM 2013 - Beidaihe, China
Duration: 14 Jun 201316 Jun 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7923 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other14th International Conference on Web-Age Information Management, WAIM 2013
CountryChina
CityBeidaihe
Period14/6/1316/6/13

Fingerprint

Data Fusion
Data fusion
Information management
Industry
Web Portal
Data Quality
Data Management
Conflict
Resolve
Sharing

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Dong, X. L., Berti-Equille, L., & Srivastava, D. (2013). Data fusion: Resolving conflicts from multiple sources. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7923 LNCS, pp. 64-76). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7923 LNCS). https://doi.org/10.1007/978-3-642-38562-9-7

Data fusion : Resolving conflicts from multiple sources. / Dong, Xin Luna; Berti-Equille, Laure; Srivastava, Divesh.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7923 LNCS 2013. p. 64-76 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7923 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Dong, XL, Berti-Equille, L & Srivastava, D 2013, Data fusion: Resolving conflicts from multiple sources. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 7923 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7923 LNCS, pp. 64-76, 14th International Conference on Web-Age Information Management, WAIM 2013, Beidaihe, China, 14/6/13. https://doi.org/10.1007/978-3-642-38562-9-7
Dong XL, Berti-Equille L, Srivastava D. Data fusion: Resolving conflicts from multiple sources. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7923 LNCS. 2013. p. 64-76. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-38562-9-7
Dong, Xin Luna ; Berti-Equille, Laure ; Srivastava, Divesh. / Data fusion : Resolving conflicts from multiple sources. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7923 LNCS 2013. pp. 64-76 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{8f1adb94fcba42fdbb5cfe9c3671f871,
title = "Data fusion: Resolving conflicts from multiple sources",
abstract = "Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called data fusion. This paper describes a novel approach that finds true values from conflicting information when there are a large number of sources, among which some may copy from others. We present a case study on real-world data showing that the described algorithm can significantly improve accuracy of truth discovery and is scalable when there are a large number of data sources.",
author = "Dong, {Xin Luna} and Laure Berti-Equille and Divesh Srivastava",
year = "2013",
month = "7",
day = "16",
doi = "10.1007/978-3-642-38562-9-7",
language = "English",
isbn = "9783642385612",
volume = "7923 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "64--76",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Data fusion

T2 - Resolving conflicts from multiple sources

AU - Dong, Xin Luna

AU - Berti-Equille, Laure

AU - Srivastava, Divesh

PY - 2013/7/16

Y1 - 2013/7/16

N2 - Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called data fusion. This paper describes a novel approach that finds true values from conflicting information when there are a large number of sources, among which some may copy from others. We present a case study on real-world data showing that the described algorithm can significantly improve accuracy of truth discovery and is scalable when there are a large number of data sources.

AB - Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical to resolve conflicts and discover values that reflect the real world; this task is called data fusion. This paper describes a novel approach that finds true values from conflicting information when there are a large number of sources, among which some may copy from others. We present a case study on real-world data showing that the described algorithm can significantly improve accuracy of truth discovery and is scalable when there are a large number of data sources.

UR - http://www.scopus.com/inward/record.url?scp=84880031204&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84880031204&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-38562-9-7

DO - 10.1007/978-3-642-38562-9-7

M3 - Conference contribution

AN - SCOPUS:84880031204

SN - 9783642385612

VL - 7923 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 64

EP - 76

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -