Title
Copula-Based Multi-Dimensional Crowdsourced Data Synthesis and Release with Local Privacy.
Abstract
Various paradigms, based on differential privacy, have been proposed to release a privacy-preserving dataset with statistical approximation. Nonetheless, most existing schemes are limited when facing highly correlated attributes, and cannot prevent privacy threats from untrusted servers. In this paper, we propose a novel Copula-based scheme to efficiently synthesize and release multi-dimensional crowdsourced data with local differential privacy. In our scheme, each participant's (or user's) data is locally transformed into bit strings based on a randomized response technique, which guarantees a participant's privacy on the participant (user) side. Then, Copula theory is lever-aged to synthesize multi-dimensional crowdsourced data based on univariate marginal distribution and attribute dependence. Univariate marginal distribution is estimated by the Lasso-based regression algorithm from the aggregated privacy-preserving bit strings. Dependencies among attributes are modeled as multivariate Gaussian Copula, of which parameter is estimated by Pearson correlation coefficients. We conduct experiments to validate the effectiveness of our scheme. Our experimental results demonstrate that our scheme is effective for the release of multi-dimensional data with local differential privacy guaranteed to distributed participants.
Year
Venue
Keywords
2017
IEEE Global Communications Conference
Multi-dimensional crowdsourced data,Copula functions,local differential privacy,data synthesis and release
Field
DocType
ISSN
Data mining,Pearson product-moment correlation coefficient,Differential privacy,Computer science,Copula (linguistics),Lasso (statistics),Real-time computing,Multivariate normal distribution,Information privacy,Univariate,Marginal distribution
Conference
2334-0983
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Xinyu Yang169565.61
Teng Wang233642.78
Xuebin Ren3143.79
Wei Yu41338118.61