Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images....

8

1 © 2019 FUJITSU Ulfar Erlingsson Senior Staff Research Scientist at Google Heads a team within Google Brain doing research on privacy and security for machine learning. Previously, he has been a researcher at Microsoft Research, Silicon Valley and an Associate Professor at Reykjavik University, Iceland.

Upload
others
Category

Documents
view
8
download
0

Embed Size (px):

Transcript of Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images....

Page 1: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

1 © 2019 FUJITSU

Ulfar Erlingsson

Senior Staff Research Scientist at Google

Heads a team within Google Brain doing research on privacy and security for machine learning.

Previously, he has been a researcher at Microsoft Research, Silicon Valley and an Associate Professor at Reykjavik University, Iceland.

Page 2: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Is privacy an obstacle?Where does it raise the biggest challenge?

Page 3: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Metaphor for Privacy(randomized response)

Page 4: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Microdata: An Individual’s Report

Page 5: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Microdata: An Individual’s Report

Each bit is flipped with probability

25%

Page 6: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Big Picture Remains!

Page 7: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

Two common data sets in Machine Learning

MNIST dataset: 70,000 images

28⨉28 pixels each

CIFAR-10 dataset: 60,000 color images

32⨉32 pixels each

Page 8: Ulfar Erlingsson - FujitsuTwo common data sets in Machine Learning MNIST dataset: 70,000 images. 28⨉28 pixels each. CIFAR-10 dataset: 60,000 color images. 32⨉32 pixels each

What are the utility benefits / costs of ML privacy ?

Training ML models with privacy works and ensures strong generalization… and may help with data retention & removal concerns

But...Training with privacy means the MLmodel cannot “see” unique outliers

Model can’t learn about truly weird data

Utility of privacy-preserving ML models may always be worse on real outliers

Low-level Software Security: Attacks and Defenses Ulfar ... · Low-level Software Security: Attacks and Defenses Ulfar Erlingsson Microsoft Research, Silicon Valley and Reykjav k

Low-level Software Security: Attacks and Defenses Ulfar ... · Low-level Software Security: Attacks and Defenses Ulfar Erlingsson Microsoft Research, Silicon Valley and Reykjav k

History of Pixels

History of Pixels

Neighborhood pixels

Neighborhood pixels

François Martin, Pixels Award 2014 - Pixels Festival S01E01

François Martin, Pixels Award 2014 - Pixels Festival S01E01

Print has DOTS Screens have PIXELS · Print has DOTS Screens have PIXELS Picture Elements Pix + Els = PIXELS. 2 Resolution Vocabulary Print uses Dots Per Inch (DPI) Screens use Pixels

Print has DOTS Screens have PIXELS · Print has DOTS Screens have PIXELS Picture Elements Pix + Els = PIXELS. 2 Resolution Vocabulary Print uses Dots Per Inch (DPI) Screens use Pixels

Pixels not papers

Pixels not papers

Makalah Tugas Antena Reflector Ulfar 12221789 REV Ver03

Makalah Tugas Antena Reflector Ulfar 12221789 REV Ver03

“ Pixels that Sound ” Find pixels that correspond (correlate !?) to sound

“ Pixels that Sound ” Find pixels that correspond (correlate !?) to sound

John McHugh, Ulfar Erlingsson Data Structures for IPv6 Network Traffic Analysis Using Sets and Bags.

John McHugh, Ulfar Erlingsson Data Structures for IPv6 Network Traffic Analysis Using Sets and Bags.

BU505M/BU302M Series Users Guide - TOSHIBA TELI...20 : 2.0 mega pixels 23 : 2.3 mega pixels 30 : 3.0 mega pixels 40 : 4.0 mega pixels 50 : 5.0 mega pixels 60 : 6.0 mega pixels 65 :

BU505M/BU302M Series Users Guide - TOSHIBA TELI...20 : 2.0 mega pixels 23 : 2.3 mega pixels 30 : 3.0 mega pixels 40 : 4.0 mega pixels 50 : 5.0 mega pixels 60 : 6.0 mega pixels 65 :

Teamviewer Ulfar Hairunnisa - Jartel

Teamviewer Ulfar Hairunnisa - Jartel

Digital HD Video Camera Recorder 4 HVR-HD1000P · MAX. 6.1M (2848 x 2136) (4:3) Approx. 3200K pixels Approx. 2280K pixels Approx. 1710K pixels Approx. 2280K pixels Approx. 3040K pixels

Digital HD Video Camera Recorder 4 HVR-HD1000P · MAX. 6.1M (2848 x 2136) (4:3) Approx. 3200K pixels Approx. 2280K pixels Approx. 1710K pixels Approx. 2280K pixels Approx. 3040K pixels

$CSC 311: Introduction to Machine Learningbonner/courses/2020f/csc311/lectures/... · 2020. 11. 5. · Given image, construct \dataset" of pixels represented by their RGB pixel intensities$

CSC 311: Introduction to Machine Learningbonner/courses/2020f/csc311/lectures/... · 2020. 11. 5. · Given image, construct \dataset" of pixels represented by their RGB pixel intensities

Augmented pixels full

Augmented pixels full

DataSet Pro DataSet Pro Vous présente Introduction.

DataSet Pro DataSet Pro Vous présente Introduction.

194989591 Makalah Tugas Antena Reflector Ulfar 12221789 REV Ver03

194989591 Makalah Tugas Antena Reflector Ulfar 12221789 REV Ver03

Limited Penetration of Governance and Conflict in Sub ... · The starting step of constructing our dataset is to divide the SSA continent into pixels of 1 degree of latitude × 1

Limited Penetration of Governance and Conflict in Sub ... · The starting step of constructing our dataset is to divide the SSA continent into pixels of 1 degree of latitude × 1

Languages

Pages

Legal

Copyright © 2022 FDOCUMENTS