EXPLAINER: Entity resolution explanations

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Entity Resolution is a fundamental data cleaning and integration problem that has received considerable attention in the past few decades. While rule-based methods have been used in many practical scenarios and are often easy to understand, machine-learning-based methods provide the best accuracy. However, the state-of-the-art classifiers are very opaque. There has been some work towards understanding and debugging the early stages of the entity resolution pipeline, e.g. blocking and generating features (similarity scores). However, there are no such efforts for explaining the model or its predictions. In this demo, we propose ExplainER, a tool to understand and explain entity resolution classifiers with different granularity levels of explanations. Using several benchmark datasets, we will demonstrate how ExplainER can handle different scenarios for a variety of classifiers.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019
PublisherIEEE Computer Society
Pages2000-2003
Number of pages4
ISBN (Electronic)9781538674741
DOIs
Publication statusPublished - 1 Apr 2019
Event35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, China
Duration: 8 Apr 201911 Apr 2019

Publication series

NameProceedings - International Conference on Data Engineering
Volume2019-April
ISSN (Print)1084-4627

Conference

Conference35th IEEE International Conference on Data Engineering, ICDE 2019
CountryChina
CityMacau
Period8/4/1911/4/19

Fingerprint

Classifiers
Learning systems
Cleaning
Pipelines

Keywords

  • Data cleaning
  • Data cleaning explanations
  • Data integration
  • Entity resolution
  • Explainable ai

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Information Systems

Cite this

Ebaid, A., Thirumuruganathan, S., Aref, W. G., Elmagarmid, A., & Ouzzani, M. (2019). EXPLAINER: Entity resolution explanations. In Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019 (pp. 2000-2003). [8731597] (Proceedings - International Conference on Data Engineering; Vol. 2019-April). IEEE Computer Society. https://doi.org/10.1109/ICDE.2019.00224

EXPLAINER : Entity resolution explanations. / Ebaid, Amr; Thirumuruganathan, Saravanan; Aref, Walid G.; Elmagarmid, Ahmed; Ouzzani, Mourad.

Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society, 2019. p. 2000-2003 8731597 (Proceedings - International Conference on Data Engineering; Vol. 2019-April).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ebaid, A, Thirumuruganathan, S, Aref, WG, Elmagarmid, A & Ouzzani, M 2019, EXPLAINER: Entity resolution explanations. in Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019., 8731597, Proceedings - International Conference on Data Engineering, vol. 2019-April, IEEE Computer Society, pp. 2000-2003, 35th IEEE International Conference on Data Engineering, ICDE 2019, Macau, China, 8/4/19. https://doi.org/10.1109/ICDE.2019.00224
Ebaid A, Thirumuruganathan S, Aref WG, Elmagarmid A, Ouzzani M. EXPLAINER: Entity resolution explanations. In Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society. 2019. p. 2000-2003. 8731597. (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2019.00224
Ebaid, Amr ; Thirumuruganathan, Saravanan ; Aref, Walid G. ; Elmagarmid, Ahmed ; Ouzzani, Mourad. / EXPLAINER : Entity resolution explanations. Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society, 2019. pp. 2000-2003 (Proceedings - International Conference on Data Engineering).
@inproceedings{214e52ed168e47f1b4d9531e53757608,
title = "EXPLAINER: Entity resolution explanations",
abstract = "Entity Resolution is a fundamental data cleaning and integration problem that has received considerable attention in the past few decades. While rule-based methods have been used in many practical scenarios and are often easy to understand, machine-learning-based methods provide the best accuracy. However, the state-of-the-art classifiers are very opaque. There has been some work towards understanding and debugging the early stages of the entity resolution pipeline, e.g. blocking and generating features (similarity scores). However, there are no such efforts for explaining the model or its predictions. In this demo, we propose ExplainER, a tool to understand and explain entity resolution classifiers with different granularity levels of explanations. Using several benchmark datasets, we will demonstrate how ExplainER can handle different scenarios for a variety of classifiers.",
keywords = "Data cleaning, Data cleaning explanations, Data integration, Entity resolution, Explainable ai",
author = "Amr Ebaid and Saravanan Thirumuruganathan and Aref, {Walid G.} and Ahmed Elmagarmid and Mourad Ouzzani",
year = "2019",
month = "4",
day = "1",
doi = "10.1109/ICDE.2019.00224",
language = "English",
series = "Proceedings - International Conference on Data Engineering",
publisher = "IEEE Computer Society",
pages = "2000--2003",
booktitle = "Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019",

}

TY - GEN

T1 - EXPLAINER

T2 - Entity resolution explanations

AU - Ebaid, Amr

AU - Thirumuruganathan, Saravanan

AU - Aref, Walid G.

AU - Elmagarmid, Ahmed

AU - Ouzzani, Mourad

PY - 2019/4/1

Y1 - 2019/4/1

N2 - Entity Resolution is a fundamental data cleaning and integration problem that has received considerable attention in the past few decades. While rule-based methods have been used in many practical scenarios and are often easy to understand, machine-learning-based methods provide the best accuracy. However, the state-of-the-art classifiers are very opaque. There has been some work towards understanding and debugging the early stages of the entity resolution pipeline, e.g. blocking and generating features (similarity scores). However, there are no such efforts for explaining the model or its predictions. In this demo, we propose ExplainER, a tool to understand and explain entity resolution classifiers with different granularity levels of explanations. Using several benchmark datasets, we will demonstrate how ExplainER can handle different scenarios for a variety of classifiers.

AB - Entity Resolution is a fundamental data cleaning and integration problem that has received considerable attention in the past few decades. While rule-based methods have been used in many practical scenarios and are often easy to understand, machine-learning-based methods provide the best accuracy. However, the state-of-the-art classifiers are very opaque. There has been some work towards understanding and debugging the early stages of the entity resolution pipeline, e.g. blocking and generating features (similarity scores). However, there are no such efforts for explaining the model or its predictions. In this demo, we propose ExplainER, a tool to understand and explain entity resolution classifiers with different granularity levels of explanations. Using several benchmark datasets, we will demonstrate how ExplainER can handle different scenarios for a variety of classifiers.

KW - Data cleaning

KW - Data cleaning explanations

KW - Data integration

KW - Entity resolution

KW - Explainable ai

UR - http://www.scopus.com/inward/record.url?scp=85067950714&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85067950714&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2019.00224

DO - 10.1109/ICDE.2019.00224

M3 - Conference contribution

AN - SCOPUS:85067950714

T3 - Proceedings - International Conference on Data Engineering

SP - 2000

EP - 2003

BT - Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019

PB - IEEE Computer Society

ER -