Abstract
In many real world situations, data are updated and released over time. In each release, the attributes are fixed but the number of records may vary, and the attribute values may be modified. Privacy can be compromised due to the disclosure of information when one combines different release versions of the data. Preventing information disclosure becomes more difficult when the adversary possesses two kinds of background knowledge: correlations among sensitive attribute values over time and compromised records. In this paper, we propose a Bayesian-based anonymization framework to protect against these kinds of background knowledge in a continuous data publishing setting. The proposed framework mimics the adversary’s reasoning method in continuous release and estimates her posterior belief using a Bayesian approach. Moreover, we analyze threat deriving from the compromised records in the current release and the following ones. Experimental results on two datasets show that our proposed framework outperforms JS-reduce, the state of the art approach for continuous data publishing, in terms of the adversary’s information gain as well as data utility and privacy loss.
Original language | English (US) |
---|---|
Pages (from-to) | 197-225 |
Number of pages | 29 |
Journal | Transactions on Data Privacy |
Volume | 12 |
Issue number | 3 |
State | Published - Dec 2019 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Software
- Statistics and Probability