HandsOn DB: Managing Data Dependencies Involving Human Actions

Mohamed Eltabakh, Walid Aref, Ahmed Elmagarmid, Mourad Ouzzani

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Consider two values, x and y, in the database, where y = F(x). To maintain the consistency of the data, whenever x changes, F needs to be executed to re-compute y and update its value in the database. This is straightforward in the case where F can be executed by the DBMS, e.g., SQL or C function. In this paper, we address the more challenging case where F is a human action, e.g., conducting a wet-lab experiment, taking manual measurements, or collecting instrument readings. In this case, when x changes, y remains invalid (inconsistent with the current value of x) until the human action involved in the derivation is performed and its output result is reflected into the database. Many application domains, e.g., scientific applications in biology, chemistry, and physics, contain multiple such derivations and dependencies that involve human actions. In this paper, we propose HandsOn DB, a prototype database engine for managing dependencies that involve human actions while maintaining the consistency of the derived data. HandsOn DB includes the following features: (1) semantics and syntax for interfaces through which users can register human activities into the database and express the dependencies among the data items on these activities; (2) mechanisms for invalidating and revalidating the derived data; and (3) new operator semantics that alert users when the returned query results contain potentially invalid data, and enable evaluating queries on either valid data only, or both valid and potentially invalid data. Performance results are presented that study the overheads associated with these features and demonstrate the feasibility and practicality in realizing HandsOn DB.

Original languageEnglish
Article number6560053
Pages (from-to)2193-2206
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume26
Issue number9
DOIs
Publication statusPublished - 1 Sep 2014

Fingerprint

Semantics
Physics
Engines
Experiments

Keywords

  • Data dependencies
  • Database design
  • manipulation languages
  • modeling and management
  • Query design and implementation languages
  • Query processing

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Information Systems
  • Computer Science Applications

Cite this

HandsOn DB : Managing Data Dependencies Involving Human Actions. / Eltabakh, Mohamed; Aref, Walid; Elmagarmid, Ahmed; Ouzzani, Mourad.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 26, No. 9, 6560053, 01.09.2014, p. 2193-2206.

Research output: Contribution to journalArticle

@article{658cf78a755f44ea9d35e6db9fc693e2,
title = "HandsOn DB: Managing Data Dependencies Involving Human Actions",
abstract = "Consider two values, x and y, in the database, where y = F(x). To maintain the consistency of the data, whenever x changes, F needs to be executed to re-compute y and update its value in the database. This is straightforward in the case where F can be executed by the DBMS, e.g., SQL or C function. In this paper, we address the more challenging case where F is a human action, e.g., conducting a wet-lab experiment, taking manual measurements, or collecting instrument readings. In this case, when x changes, y remains invalid (inconsistent with the current value of x) until the human action involved in the derivation is performed and its output result is reflected into the database. Many application domains, e.g., scientific applications in biology, chemistry, and physics, contain multiple such derivations and dependencies that involve human actions. In this paper, we propose HandsOn DB, a prototype database engine for managing dependencies that involve human actions while maintaining the consistency of the derived data. HandsOn DB includes the following features: (1) semantics and syntax for interfaces through which users can register human activities into the database and express the dependencies among the data items on these activities; (2) mechanisms for invalidating and revalidating the derived data; and (3) new operator semantics that alert users when the returned query results contain potentially invalid data, and enable evaluating queries on either valid data only, or both valid and potentially invalid data. Performance results are presented that study the overheads associated with these features and demonstrate the feasibility and practicality in realizing HandsOn DB.",
keywords = "Data dependencies, Database design, manipulation languages, modeling and management, Query design and implementation languages, Query processing",
author = "Mohamed Eltabakh and Walid Aref and Ahmed Elmagarmid and Mourad Ouzzani",
year = "2014",
month = "9",
day = "1",
doi = "10.1109/TKDE.2013.117",
language = "English",
volume = "26",
pages = "2193--2206",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "9",

}

TY - JOUR

T1 - HandsOn DB

T2 - Managing Data Dependencies Involving Human Actions

AU - Eltabakh, Mohamed

AU - Aref, Walid

AU - Elmagarmid, Ahmed

AU - Ouzzani, Mourad

PY - 2014/9/1

Y1 - 2014/9/1

N2 - Consider two values, x and y, in the database, where y = F(x). To maintain the consistency of the data, whenever x changes, F needs to be executed to re-compute y and update its value in the database. This is straightforward in the case where F can be executed by the DBMS, e.g., SQL or C function. In this paper, we address the more challenging case where F is a human action, e.g., conducting a wet-lab experiment, taking manual measurements, or collecting instrument readings. In this case, when x changes, y remains invalid (inconsistent with the current value of x) until the human action involved in the derivation is performed and its output result is reflected into the database. Many application domains, e.g., scientific applications in biology, chemistry, and physics, contain multiple such derivations and dependencies that involve human actions. In this paper, we propose HandsOn DB, a prototype database engine for managing dependencies that involve human actions while maintaining the consistency of the derived data. HandsOn DB includes the following features: (1) semantics and syntax for interfaces through which users can register human activities into the database and express the dependencies among the data items on these activities; (2) mechanisms for invalidating and revalidating the derived data; and (3) new operator semantics that alert users when the returned query results contain potentially invalid data, and enable evaluating queries on either valid data only, or both valid and potentially invalid data. Performance results are presented that study the overheads associated with these features and demonstrate the feasibility and practicality in realizing HandsOn DB.

AB - Consider two values, x and y, in the database, where y = F(x). To maintain the consistency of the data, whenever x changes, F needs to be executed to re-compute y and update its value in the database. This is straightforward in the case where F can be executed by the DBMS, e.g., SQL or C function. In this paper, we address the more challenging case where F is a human action, e.g., conducting a wet-lab experiment, taking manual measurements, or collecting instrument readings. In this case, when x changes, y remains invalid (inconsistent with the current value of x) until the human action involved in the derivation is performed and its output result is reflected into the database. Many application domains, e.g., scientific applications in biology, chemistry, and physics, contain multiple such derivations and dependencies that involve human actions. In this paper, we propose HandsOn DB, a prototype database engine for managing dependencies that involve human actions while maintaining the consistency of the derived data. HandsOn DB includes the following features: (1) semantics and syntax for interfaces through which users can register human activities into the database and express the dependencies among the data items on these activities; (2) mechanisms for invalidating and revalidating the derived data; and (3) new operator semantics that alert users when the returned query results contain potentially invalid data, and enable evaluating queries on either valid data only, or both valid and potentially invalid data. Performance results are presented that study the overheads associated with these features and demonstrate the feasibility and practicality in realizing HandsOn DB.

KW - Data dependencies

KW - Database design

KW - manipulation languages

KW - modeling and management

KW - Query design and implementation languages

KW - Query processing

UR - http://www.scopus.com/inward/record.url?scp=84959469454&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959469454&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2013.117

DO - 10.1109/TKDE.2013.117

M3 - Article

AN - SCOPUS:84959469454

VL - 26

SP - 2193

EP - 2206

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 9

M1 - 6560053

ER -