Query optimization for differentially private data management systems

Shangfu Peng, Yin Yang, Zhenjie Zhang, Marianne Winslett, Yong Yu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.

Original languageEnglish
Title of host publicationICDE 2013 - 29th International Conference on Data Engineering
Pages1093-1104
Number of pages12
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event29th International Conference on Data Engineering, ICDE 2013 - Brisbane, QLD, Australia
Duration: 8 Apr 201311 Apr 2013

Other

Other29th International Conference on Data Engineering, ICDE 2013
CountryAustralia
CityBrisbane, QLD
Period8/4/1311/4/13

Fingerprint

Information management
Experiments

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Information Systems

Cite this

Peng, S., Yang, Y., Zhang, Z., Winslett, M., & Yu, Y. (2013). Query optimization for differentially private data management systems. In ICDE 2013 - 29th International Conference on Data Engineering (pp. 1093-1104). [6544900] https://doi.org/10.1109/ICDE.2013.6544900

Query optimization for differentially private data management systems. / Peng, Shangfu; Yang, Yin; Zhang, Zhenjie; Winslett, Marianne; Yu, Yong.

ICDE 2013 - 29th International Conference on Data Engineering. 2013. p. 1093-1104 6544900.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Peng, S, Yang, Y, Zhang, Z, Winslett, M & Yu, Y 2013, Query optimization for differentially private data management systems. in ICDE 2013 - 29th International Conference on Data Engineering., 6544900, pp. 1093-1104, 29th International Conference on Data Engineering, ICDE 2013, Brisbane, QLD, Australia, 8/4/13. https://doi.org/10.1109/ICDE.2013.6544900
Peng S, Yang Y, Zhang Z, Winslett M, Yu Y. Query optimization for differentially private data management systems. In ICDE 2013 - 29th International Conference on Data Engineering. 2013. p. 1093-1104. 6544900 https://doi.org/10.1109/ICDE.2013.6544900
Peng, Shangfu ; Yang, Yin ; Zhang, Zhenjie ; Winslett, Marianne ; Yu, Yong. / Query optimization for differentially private data management systems. ICDE 2013 - 29th International Conference on Data Engineering. 2013. pp. 1093-1104
@inproceedings{00889daf7e9f4a91824ccfe2df190e41,
title = "Query optimization for differentially private data management systems",
abstract = "Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.",
author = "Shangfu Peng and Yin Yang and Zhenjie Zhang and Marianne Winslett and Yong Yu",
year = "2013",
doi = "10.1109/ICDE.2013.6544900",
language = "English",
isbn = "9781467349086",
pages = "1093--1104",
booktitle = "ICDE 2013 - 29th International Conference on Data Engineering",

}

TY - GEN

T1 - Query optimization for differentially private data management systems

AU - Peng, Shangfu

AU - Yang, Yin

AU - Zhang, Zhenjie

AU - Winslett, Marianne

AU - Yu, Yong

PY - 2013

Y1 - 2013

N2 - Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.

AB - Differential privacy (DP) enables publishing statistical query results over sensitive data, with rigorous privacy guarantees, and very conservative assumptions about the adversary's background knowledge. This paper focuses on the interactive DP framework, which processes incoming queries on the fly, each of which consumes a portion of the user-specified privacy budget. Existing systems process each query independently, which often leads to considerable privacy budget waste. Motivated by this, we propose Pioneer, a query optimizer for an interactive, DP-compliant DBMS. For each new query, Pioneer creates an execution plan that combines past query results and new results from the underlying data. When a query has multiple semantically equivalent plans, Pioneer automatically selects one with minimal privacy budget consumption. Extensive experiments confirm that Pioneer achieves significant savings of the privacy budget, and can answer many more queries than existing systems for a fixed total budget, with comparable result accuracy.

UR - http://www.scopus.com/inward/record.url?scp=84881342820&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84881342820&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2013.6544900

DO - 10.1109/ICDE.2013.6544900

M3 - Conference contribution

SN - 9781467349086

SP - 1093

EP - 1104

BT - ICDE 2013 - 29th International Conference on Data Engineering

ER -