AUDIT

approving and tracking updates with dependencies in collaborative databases

Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref

Research output: Contribution to journalArticle

Abstract

Collaborative databases such as genome databases, often involve extensive curation activities where collaborators need to interact to be able to converge and agree on the content of data. In a typical scenario, a member of the collaboration makes some updates and these become visible to all collaborators for possible comments and modifications. At the same time, these updates are usually pending the approval or rejection from the data custodian based on the related discussion and the content of the data. Unfortunately, the approval and authorization of updates in current databases is based solely on the identity of the user, e.g., via the SQL GRANT and REVOKE commands. In this paper, we present a scalable cloud-based collaborative database system to support collaboration and data curation scenarios. Our system is based on an Update Pending Approval model. In a nutshell, when a collaborator updates a given data item, it is marked as pending approval until the data custodian approves or rejects the update. Until then, any other collaborator can view and comment on the data, pending its approval. We fully realized our system inside HBase, a cloud-based platform. We also conducted extensive experiments showing that the system scales well under different workloads.

Original languageEnglish
Pages (from-to)81-119
Number of pages39
JournalDistributed and Parallel Databases
Volume36
Issue number1
DOIs
Publication statusPublished - 1 Mar 2018

Fingerprint

Genes
Data base
Experiments
Scenarios
Authorization
Experiment
Workload

Keywords

  • Big data
  • Cloud computing
  • Collaborative databases
  • Data dependency
  • Multiversion data
  • Update authorization

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture
  • Information Systems and Management

Cite this

AUDIT : approving and tracking updates with dependencies in collaborative databases. / Mershad, Khaleel; Malluhi, Qutaibah M.; Ouzzani, Mourad; Tang, Mingjie; Gribskov, Michael; Aref, Walid G.

In: Distributed and Parallel Databases, Vol. 36, No. 1, 01.03.2018, p. 81-119.

Research output: Contribution to journalArticle

Mershad, Khaleel ; Malluhi, Qutaibah M. ; Ouzzani, Mourad ; Tang, Mingjie ; Gribskov, Michael ; Aref, Walid G. / AUDIT : approving and tracking updates with dependencies in collaborative databases. In: Distributed and Parallel Databases. 2018 ; Vol. 36, No. 1. pp. 81-119.
@article{78181082e747441e8401bf8c1dd593d4,
title = "AUDIT: approving and tracking updates with dependencies in collaborative databases",
abstract = "Collaborative databases such as genome databases, often involve extensive curation activities where collaborators need to interact to be able to converge and agree on the content of data. In a typical scenario, a member of the collaboration makes some updates and these become visible to all collaborators for possible comments and modifications. At the same time, these updates are usually pending the approval or rejection from the data custodian based on the related discussion and the content of the data. Unfortunately, the approval and authorization of updates in current databases is based solely on the identity of the user, e.g., via the SQL GRANT and REVOKE commands. In this paper, we present a scalable cloud-based collaborative database system to support collaboration and data curation scenarios. Our system is based on an Update Pending Approval model. In a nutshell, when a collaborator updates a given data item, it is marked as pending approval until the data custodian approves or rejects the update. Until then, any other collaborator can view and comment on the data, pending its approval. We fully realized our system inside HBase, a cloud-based platform. We also conducted extensive experiments showing that the system scales well under different workloads.",
keywords = "Big data, Cloud computing, Collaborative databases, Data dependency, Multiversion data, Update authorization",
author = "Khaleel Mershad and Malluhi, {Qutaibah M.} and Mourad Ouzzani and Mingjie Tang and Michael Gribskov and Aref, {Walid G.}",
year = "2018",
month = "3",
day = "1",
doi = "10.1007/s10619-017-7208-y",
language = "English",
volume = "36",
pages = "81--119",
journal = "Distributed and Parallel Databases",
issn = "0926-8782",
publisher = "Springer Netherlands",
number = "1",

}

TY - JOUR

T1 - AUDIT

T2 - approving and tracking updates with dependencies in collaborative databases

AU - Mershad, Khaleel

AU - Malluhi, Qutaibah M.

AU - Ouzzani, Mourad

AU - Tang, Mingjie

AU - Gribskov, Michael

AU - Aref, Walid G.

PY - 2018/3/1

Y1 - 2018/3/1

N2 - Collaborative databases such as genome databases, often involve extensive curation activities where collaborators need to interact to be able to converge and agree on the content of data. In a typical scenario, a member of the collaboration makes some updates and these become visible to all collaborators for possible comments and modifications. At the same time, these updates are usually pending the approval or rejection from the data custodian based on the related discussion and the content of the data. Unfortunately, the approval and authorization of updates in current databases is based solely on the identity of the user, e.g., via the SQL GRANT and REVOKE commands. In this paper, we present a scalable cloud-based collaborative database system to support collaboration and data curation scenarios. Our system is based on an Update Pending Approval model. In a nutshell, when a collaborator updates a given data item, it is marked as pending approval until the data custodian approves or rejects the update. Until then, any other collaborator can view and comment on the data, pending its approval. We fully realized our system inside HBase, a cloud-based platform. We also conducted extensive experiments showing that the system scales well under different workloads.

AB - Collaborative databases such as genome databases, often involve extensive curation activities where collaborators need to interact to be able to converge and agree on the content of data. In a typical scenario, a member of the collaboration makes some updates and these become visible to all collaborators for possible comments and modifications. At the same time, these updates are usually pending the approval or rejection from the data custodian based on the related discussion and the content of the data. Unfortunately, the approval and authorization of updates in current databases is based solely on the identity of the user, e.g., via the SQL GRANT and REVOKE commands. In this paper, we present a scalable cloud-based collaborative database system to support collaboration and data curation scenarios. Our system is based on an Update Pending Approval model. In a nutshell, when a collaborator updates a given data item, it is marked as pending approval until the data custodian approves or rejects the update. Until then, any other collaborator can view and comment on the data, pending its approval. We fully realized our system inside HBase, a cloud-based platform. We also conducted extensive experiments showing that the system scales well under different workloads.

KW - Big data

KW - Cloud computing

KW - Collaborative databases

KW - Data dependency

KW - Multiversion data

KW - Update authorization

UR - http://www.scopus.com/inward/record.url?scp=85029671084&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85029671084&partnerID=8YFLogxK

U2 - 10.1007/s10619-017-7208-y

DO - 10.1007/s10619-017-7208-y

M3 - Article

VL - 36

SP - 81

EP - 119

JO - Distributed and Parallel Databases

JF - Distributed and Parallel Databases

SN - 0926-8782

IS - 1

ER -