NADEEF/ER

Generic and interactive entity resolution

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)

Abstract

Entity resolution (ER), the process of identifying and eventually merging records that refer to the same real-world entities, is an important and long-standing problem. We present NADEEF/ER, a generic and interactive entity resolution system, which is built as an extension over our open-source generalized data cleaning system NADEEF. NADEEF/ER provides a rich programming interface for manipulating entities, which allows generic, efficient and extensible ER. In this demo, users will have the opportunity to experience the following features: (1) Easy specification - Users can easily define ER rules with a browser-based specification, which will then be automatically transformed to various functions, treated as black boxes by NADEEF; (2) Generality and extensibility - Users can customize their ER rules by refining and fine-tuning the above functions to achieve both effective and efficient ER solutions; (3) Interactivity - We also extended the existing NADEEF dashboard with summarization and clustering techniques to facilitate understanding problems faced by the ER process as well as to allow users to influence resolution decisions.
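
The idea of ER rules expressed as separate, replaceable functions can be illustrated with a minimal sketch. It is not the NADEEF/ER programming interface, which this record does not describe: the names `block_key`, `is_match`, `merge` and `resolve`, the record fields, and the 0.85 similarity threshold are all hypothetical, and field values are assumed to be strings.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def block_key(record):
    """Hypothetical blocking function: group records by normalized ZIP code."""
    return record["zip"].strip().lower()


def is_match(a, b, threshold=0.85):
    """Hypothetical pairwise rule: names are similar enough to be one entity."""
    ratio = SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()
    return ratio >= threshold


def merge(records):
    """Hypothetical merge rule: keep the longest value seen for each field."""
    merged = {}
    for rec in records:
        for field, value in rec.items():
            if len(value) > len(merged.get(field, "")):
                merged[field] = value
    return merged


def resolve(records):
    """Block, compare pairs within each block, cluster with union-find, merge."""
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Only records sharing a block key are ever compared.
    blocks = defaultdict(list)
    for idx, rec in enumerate(records):
        blocks[block_key(rec)].append(idx)

    for members in blocks.values():
        for i, j in combinations(members, 2):
            if is_match(records[i], records[j]):
                parent[find(j)] = find(i)

    # Collect records by cluster root and merge each cluster into one entity.
    clusters = defaultdict(list)
    for idx in range(len(records)):
        clusters[find(idx)].append(records[idx])
    return [merge(group) for group in clusters.values()]
```

Because each step is an ordinary function, swapping in a different blocking key, similarity rule or merge policy leaves `resolve` unchanged, which is the kind of refinement the abstract describes users performing on the generated functions.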

Original language: English
Title of host publication: Proceedings of the ACM SIGMOD International Conference on Management of Data
Publisher: Association for Computing Machinery
Pages: 1071-1074
Number of pages: 4
ISBN (Print): 9781450323765
DOIs: https://doi.org/10.1145/2588555.2594511
Publication status: Published - 1 Jan 2014
Event: 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014 - Snowbird, UT, United States
Duration: 22 Jun 2014 - 27 Jun 2014

Other

Other: 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014
Country: United States
City: Snowbird, UT
Period: 22/6/14 - 27/6/14

Keywords

  • Entity resolution
  • Generic
  • Interactive
  • NADEEF

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Elmagarmid, A., Ilyas, I. F., Ouzzani, M., Quiane Ruiz, J. A., Tang, N., & Yin, S. (2014). NADEEF/ER: Generic and interactive entity resolution. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 1071-1074). Association for Computing Machinery. https://doi.org/10.1145/2588555.2594511

@inproceedings{b9d465123e6349eaa4f3b640336e178d,
title = "NADEEF/ER: Generic and interactive entity resolution",
abstract = "Entity resolution (ER), the process of identifying and eventually merging records that refer to the same real-world entities, is an important and long-standing problem. We present NADEEF/ER, a generic and interactive entity resolution system, which is built as an extension over our open-source generalized data cleaning system NADEEF. NADEEF/ER provides a rich programming interface for manipulating entities, which allows generic, efficient and extensible ER. In this demo, users will have the opportunity to experience the following features: (1) Easy specification - Users can easily define ER rules with a browser-based specification, which will then be automatically transformed to various functions, treated as black boxes by NADEEF; (2) Generality and extensibility - Users can customize their ER rules by refining and fine-tuning the above functions to achieve both effective and efficient ER solutions; (3) Interactivity - We also extended the existing NADEEF dashboard with summarization and clustering techniques to facilitate understanding problems faced by the ER process as well as to allow users to influence resolution decisions.",
keywords = "Entity resolution, Generic, Interactive, NADEEF",
author = "Ahmed Elmagarmid and Ilyas, {Ihab F.} and Mourad Ouzzani and {Quiane Ruiz}, {Jorge Arnulfo} and Nan Tang and Si Yin",
year = "2014",
month = "1",
day = "1",
doi = "10.1145/2588555.2594511",
language = "English",
isbn = "9781450323765",
pages = "1071--1074",
booktitle = "Proceedings of the ACM SIGMOD International Conference on Management of Data",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - NADEEF/ER

T2 - Generic and interactive entity resolution

AU - Elmagarmid, Ahmed

AU - Ilyas, Ihab F.

AU - Ouzzani, Mourad

AU - Quiane Ruiz, Jorge Arnulfo

AU - Tang, Nan

AU - Yin, Si

PY - 2014/1/1

Y1 - 2014/1/1

AB - Entity resolution (ER), the process of identifying and eventually merging records that refer to the same real-world entities, is an important and long-standing problem. We present NADEEF/ER, a generic and interactive entity resolution system, which is built as an extension over our open-source generalized data cleaning system NADEEF. NADEEF/ER provides a rich programming interface for manipulating entities, which allows generic, efficient and extensible ER. In this demo, users will have the opportunity to experience the following features: (1) Easy specification - Users can easily define ER rules with a browser-based specification, which will then be automatically transformed to various functions, treated as black boxes by NADEEF; (2) Generality and extensibility - Users can customize their ER rules by refining and fine-tuning the above functions to achieve both effective and efficient ER solutions; (3) Interactivity - We also extended the existing NADEEF dashboard with summarization and clustering techniques to facilitate understanding problems faced by the ER process as well as to allow users to influence resolution decisions.

KW - Entity resolution

KW - Generic

KW - Interactive

KW - NADEEF

UR - http://www.scopus.com/inward/record.url?scp=84904327514&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84904327514&partnerID=8YFLogxK

U2 - 10.1145/2588555.2594511

DO - 10.1145/2588555.2594511

M3 - Conference contribution

SN - 9781450323765

SP - 1071

EP - 1074

BT - Proceedings of the ACM SIGMOD International Conference on Management of Data

PB - Association for Computing Machinery

ER -