Supporting annotations on relations

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Mourad Ouzzani, Yasin N. Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

22 Citations (Scopus)

Abstract

Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

Original languageEnglish
Title of host publicationProceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
Pages379-390
Number of pages12
DOIs
Publication statusPublished - 21 Sep 2009
Externally publishedYes
Event12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 - Saint Petersburg, Russian Federation
Duration: 24 Mar 200926 Mar 2009

Other

Other12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
CountryRussian Federation
CitySaint Petersburg
Period24/3/0926/3/09

Fingerprint

Processing

ASJC Scopus subject areas

  • Computer Science Applications
  • Software

Cite this

Eltabakh, M. Y., Aref, W. G., Elmagarmid, A. K., Ouzzani, M., & Silva, Y. N. (2009). Supporting annotations on relations. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 (pp. 379-390) https://doi.org/10.1145/1516360.1516405

Supporting annotations on relations. / Eltabakh, Mohamed Y.; Aref, Walid G.; Elmagarmid, Ahmed K.; Ouzzani, Mourad; Silva, Yasin N.

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. p. 379-390.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Eltabakh, MY, Aref, WG, Elmagarmid, AK, Ouzzani, M & Silva, YN 2009, Supporting annotations on relations. in Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. pp. 379-390, 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09, Saint Petersburg, Russian Federation, 24/3/09. https://doi.org/10.1145/1516360.1516405
Eltabakh MY, Aref WG, Elmagarmid AK, Ouzzani M, Silva YN. Supporting annotations on relations. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. p. 379-390 https://doi.org/10.1145/1516360.1516405
Eltabakh, Mohamed Y. ; Aref, Walid G. ; Elmagarmid, Ahmed K. ; Ouzzani, Mourad ; Silva, Yasin N. / Supporting annotations on relations. Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. pp. 379-390
@inproceedings{cb1820221ca74604936d9d10eee1f0e9,
title = "Supporting annotations on relations",
abstract = "Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70{\%} saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.",
author = "Eltabakh, {Mohamed Y.} and Aref, {Walid G.} and Elmagarmid, {Ahmed K.} and Mourad Ouzzani and Silva, {Yasin N.}",
year = "2009",
month = "9",
day = "21",
doi = "10.1145/1516360.1516405",
language = "English",
isbn = "9781605584225",
pages = "379--390",
booktitle = "Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09",

}

TY - GEN

T1 - Supporting annotations on relations

AU - Eltabakh, Mohamed Y.

AU - Aref, Walid G.

AU - Elmagarmid, Ahmed K.

AU - Ouzzani, Mourad

AU - Silva, Yasin N.

PY - 2009/9/21

Y1 - 2009/9/21

N2 - Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

AB - Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

UR - http://www.scopus.com/inward/record.url?scp=70349101933&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70349101933&partnerID=8YFLogxK

U2 - 10.1145/1516360.1516405

DO - 10.1145/1516360.1516405

M3 - Conference contribution

SN - 9781605584225

SP - 379

EP - 390

BT - Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

ER -