Revisit Randomized Response

15
Lecture 4 How to define Privacy Differential Privacy Revisit Randomized Response Laplace Mechanism optional Announcement Canvas HW 0 solution pwolfted be online HW 1 coming Waitlist

Transcript of Revisit Randomized Response

Page 1: Revisit Randomized Response

Lecture 4

How to define Privacy

Differential Privacy

Revisit Randomized Response

Laplace Mechanism optional

Announcement Canvas

HW 0 solution pwolftedbeonline

HW 1 coming

Waitlist

Page 2: Revisit Randomized Response

How to define privacyApproaches

Arm Race i Think of possible attacks Defense against theseattacks

Example K anonymityagainst Linkage attack Think netfix attack

4 IMDB data

Formulate General Criteria

Page 3: Revisit Randomized Response

K anonymityInput Table Output Table

Generalization

Replace a single value with a set ofpossible values28 I 30

male 1 7 female male

Table is K anonymous

if each row matches withat least k 1 other rows in

the nonsensitive attributes

Page 4: Revisit Randomized Response

Seems to resist Linkage attacksCan't identify a record uniquelySeem hard to link other sources of info

What can go wrongEveryone in their 30s has cancer

Rule out other info

Page 5: Revisit Randomized Response

Composition

Crossreferencing a

28 yearsold

tied.in ai sasaa ha'sash

Page 6: Revisit Randomized Response

K anonymity issues

Specifies a set of acceptable output k anonymous tablesDoes not specify the algorithmic processFlexibility may leak info

Page 7: Revisit Randomized Response

Differential Privacy Dwork McSherry Nissim Smith2006

Algorithmic Property

Rigorous guarantees against arbitrary external infoResists known attacks

Page 8: Revisit Randomized Response

Data domain X e.g Eotgd RdData set X Xz Xz Xn C X

Randomized Algorithm AAtx is a randomvariable

Xz

x i A AceXn a

Data setwhatproperty

Page 9: Revisit Randomized Response

Thought Experiment

XzXz Xz

X i A Acx x

X i s A AuXn

XnDataSet DataSet

X is a neighbor of xif they differ in one data point

Idea of Dpi Neighboring data sets induceStability close output distributions

Page 10: Revisit Randomized Response

Definition Differential privacyX X'TIX datasets

It is E differentially private iffor all neighbors X and X hypothetical

for all subsets Em of outputsPEACH c EI e ee BEACHEET

bX E is small

Acxy e'Elite

Acid outputs

ie

Page 11: Revisit Randomized Response
Page 12: Revisit Randomized Response
Page 13: Revisit Randomized Response
Page 14: Revisit Randomized Response

Basic Proof Strategy

for all neighbors X and Xfor all subsets E of outputs

PEACH c EI e ee PEACHEE

PEACH y s.eePEACxD Y

Page 15: Revisit Randomized Response