Strong Hardness of Privacy from Weak Traitor Tracing

A central problem in differential privacy is to accurately answer a large family Q of statistical queries over a data universe X. A statistical query on a dataset \(D \in X^n\) asks “what fraction of the elements of D satisfy a given predicate p on X?” Ig

  • PDF / 1,389,930 Bytes
  • 31 Pages / 439.37 x 666.142 pts Page_size
  • 1 Downloads / 162 Views

DOWNLOAD

REPORT


Columbia University, New York, USA [email protected] Northeastern University, Boston, USA 3 MIT, Cambridge, USA Princeton University, Princeton, USA

Abstract. A central problem in differential privacy is to accurately answer a large family Q of statistical queries over a data universe X. A statistical query on a dataset D ∈ X n asks “what fraction of the elements of D satisfy a given predicate p on X?” Ignoring computational constraints, it is possible to accurately answer exponentially many queries on an exponential size universe while satisfying differential privacy (Blum et al., STOC’08). Dwork et al. (STOC’09) and Boneh and Zhandry (CRYPTO’14) showed that if both Q and X are of polynomial size, then there is an efficient differentially private algorithm that accurately answers all the queries. They also proved that if Q and X are both exponentially large, then under a plausible assumption, no efficient algorithm exists. We show that, under the same assumption, if either the number of queries or the data universe is of exponential size, then there is no differentially private algorithm that answers all the queries. Specifically, we prove that if one-way functions and indistinguishability obfuscation exist, then: ˜ 7 ) queries on a data universe X 1. For every n, there is a family Q of O(n d of size 2 such that no poly(n, d) time differentially private algorithm takes a dataset D ∈ X n and outputs accurate answers to every query in Q. 2. For every n, there is a family Q of 2d queries on a data universe ˜ 7 ) such that no poly(n, d) time differentially private X of size O(n algorithm takes a dataset D ∈ X n and outputs accurate answers to every query in Q. In both cases, the result is nearly quantitatively tight, since there is an ˜ 2 ) queries on efficient differentially private algorithm that answers Ω(n an exponential size data universe, and one that answers exponentially ˜ 2 ). many queries on a data universe of size Ω(n Our proofs build on the connection between hardness of differential privacy and traitor-tracing schemes (Dwork et al., STOC’09; Ullman, STOC’13). We prove our hardness result for a polynomial size query set (resp., data universe) by showing that they follow from the existence of a special type of traitor-tracing scheme with very short ciphertexts (resp., secret keys), but very weak security guarantees, and then constructing such a scheme. The full version of this work appears on the IACR Crypto ePrint [26]. c International Association for Cryptologic Research 2016  M. Hirt and A. Smith (Eds.): TCC 2016-B, Part I, LNCS 9985, pp. 659–689, 2016. DOI: 10.1007/978-3-662-53641-4 25

660

1

L. Kowalczyk et al.

Introduction

The goal of privacy-preserving data analysis is to release rich statistical information about a sensitive dataset while respecting the privacy of the individuals represented in that dataset. The past decade has seen tremendous progress towards understanding when and how these two competing goals can be reconciled, including surprisingly powerful differentially private algorithms as well