Distribution-based microdata anonymization

Nick Koudas, Ting Yu, Divesh Srivastava, Qing Zhang

Research output: Chapter in Book/Report/Conference proceedingChapter

8 Citations (Scopus)

Abstract

Before sharing to support ad hoc aggregate analyses, microdata often need to be anonymized to protect the privacy of individuals. A variety of privacy models have been proposed for microdata anonymization. Many of these models (e.g., t -closeness) essentially require that, after anonymization, groups of sensitive attribute values follow specified distributions. To support such models, in this paper we study the problem of transforming a group of sensitive attribute values to follow a certain target distribution with minimal data distortion. Specifically, we develop and evaluate a novel methodology that combines the use of sensitive attribute permutation and generalization with the addition of fake sensitive attribute values to achieve this transformation. We identify metrics related to accuracy of aggregate query answers over the transformed data, and develop efficient anonymization algorithms to optimize these accuracy metrics. Using a variety of data sets, we experimentally demonstrate the effectiveness of our techniques.

Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
Pages958-969
Number of pages12
Volume2
Edition1
Publication statusPublished - 2009
Externally publishedYes

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Koudas, N., Yu, T., Srivastava, D., & Zhang, Q. (2009). Distribution-based microdata anonymization. In Proceedings of the VLDB Endowment (1 ed., Vol. 2, pp. 958-969)

Distribution-based microdata anonymization. / Koudas, Nick; Yu, Ting; Srivastava, Divesh; Zhang, Qing.

Proceedings of the VLDB Endowment. Vol. 2 1. ed. 2009. p. 958-969.

Research output: Chapter in Book/Report/Conference proceedingChapter

Koudas, N, Yu, T, Srivastava, D & Zhang, Q 2009, Distribution-based microdata anonymization. in Proceedings of the VLDB Endowment. 1 edn, vol. 2, pp. 958-969.
Koudas N, Yu T, Srivastava D, Zhang Q. Distribution-based microdata anonymization. In Proceedings of the VLDB Endowment. 1 ed. Vol. 2. 2009. p. 958-969
Koudas, Nick ; Yu, Ting ; Srivastava, Divesh ; Zhang, Qing. / Distribution-based microdata anonymization. Proceedings of the VLDB Endowment. Vol. 2 1. ed. 2009. pp. 958-969
@inbook{1894eeec66b04d6099b504401c5b12ab,
title = "Distribution-based microdata anonymization",
abstract = "Before sharing to support ad hoc aggregate analyses, microdata often need to be anonymized to protect the privacy of individuals. A variety of privacy models have been proposed for microdata anonymization. Many of these models (e.g., t -closeness) essentially require that, after anonymization, groups of sensitive attribute values follow specified distributions. To support such models, in this paper we study the problem of transforming a group of sensitive attribute values to follow a certain target distribution with minimal data distortion. Specifically, we develop and evaluate a novel methodology that combines the use of sensitive attribute permutation and generalization with the addition of fake sensitive attribute values to achieve this transformation. We identify metrics related to accuracy of aggregate query answers over the transformed data, and develop efficient anonymization algorithms to optimize these accuracy metrics. Using a variety of data sets, we experimentally demonstrate the effectiveness of our techniques.",
author = "Nick Koudas and Ting Yu and Divesh Srivastava and Qing Zhang",
year = "2009",
language = "English",
volume = "2",
pages = "958--969",
booktitle = "Proceedings of the VLDB Endowment",
edition = "1",

}

TY - CHAP

T1 - Distribution-based microdata anonymization

AU - Koudas, Nick

AU - Yu, Ting

AU - Srivastava, Divesh

AU - Zhang, Qing

PY - 2009

Y1 - 2009

N2 - Before sharing to support ad hoc aggregate analyses, microdata often need to be anonymized to protect the privacy of individuals. A variety of privacy models have been proposed for microdata anonymization. Many of these models (e.g., t -closeness) essentially require that, after anonymization, groups of sensitive attribute values follow specified distributions. To support such models, in this paper we study the problem of transforming a group of sensitive attribute values to follow a certain target distribution with minimal data distortion. Specifically, we develop and evaluate a novel methodology that combines the use of sensitive attribute permutation and generalization with the addition of fake sensitive attribute values to achieve this transformation. We identify metrics related to accuracy of aggregate query answers over the transformed data, and develop efficient anonymization algorithms to optimize these accuracy metrics. Using a variety of data sets, we experimentally demonstrate the effectiveness of our techniques.

AB - Before sharing to support ad hoc aggregate analyses, microdata often need to be anonymized to protect the privacy of individuals. A variety of privacy models have been proposed for microdata anonymization. Many of these models (e.g., t -closeness) essentially require that, after anonymization, groups of sensitive attribute values follow specified distributions. To support such models, in this paper we study the problem of transforming a group of sensitive attribute values to follow a certain target distribution with minimal data distortion. Specifically, we develop and evaluate a novel methodology that combines the use of sensitive attribute permutation and generalization with the addition of fake sensitive attribute values to achieve this transformation. We identify metrics related to accuracy of aggregate query answers over the transformed data, and develop efficient anonymization algorithms to optimize these accuracy metrics. Using a variety of data sets, we experimentally demonstrate the effectiveness of our techniques.

UR - http://www.scopus.com/inward/record.url?scp=78649333205&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78649333205&partnerID=8YFLogxK

M3 - Chapter

AN - SCOPUS:78649333205

VL - 2

SP - 958

EP - 969

BT - Proceedings of the VLDB Endowment

ER -