Superfície Design de DESIGN GRÁFICO – SATC | Prof. Diego Piovesan Medeiros, Esp. Têxtil.
PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by...
-
Upload
roy-lewis-poole -
Category
Documents
-
view
218 -
download
0
Transcript of PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by...
![Page 1: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/1.jpg)
PRIVACY TOOLS FOR SHARING RESEARCH DATA
NSF site visitOctober 19, 2015
Salil Vadhan
Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program, the Sloan Foundation, and Google.
![Page 2: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/2.jpg)
Computational Social Science
The potential: massive new sources of data and ease of sharing will revolutionize social science.
The problem: protecting the privacy of individual subjects
privacy open data
e.g. NYT 5/21/12 “Troves of Personal Data, Forbidden to Researchers”
privacy
utilitytraditional approaches(e.g. “stripping PII”)
![Page 3: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/3.jpg)
Our Goal
computerscience
socialscience
statistics
law &policy
privacyopen data
privacy
utility
Achieve: &
Via:
Chong Vadhan
Gasser Sweeney
King Crosas
Airoldi
Dwork (MSR) Altman
(MIT)
Nissim (BGU)
Smith (PSU)
Kantarcioglu (UTD)
Gaboardi(Dundee)
Honaker
O’Brien Hurley
![Page 4: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/4.jpg)
4
Harvard Dataverse Repository:1274 dataverses with 59,265 datasetsand 1,415,241 downloadsLargest social science repository in the world
Dataverse Repositories around the world: 12 repositories in production with research data~10 under construction
Use Case: Data Repositories
![Page 5: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/5.jpg)
Datasets are restricted due to privacy concerns
Goal: enable wider sharing while protecting privacy
![Page 6: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/6.jpg)
Challenges for Sharing Sensitive Data
Complexity of Law• Thousands of privacy laws in the US alone, at federal,
state and local level, usually context-specific: HIPAA, FERPA, CIPSEA, Privacy Act, PPRA, ESRA, ….
Difficulty of Deidentification• Stripping “PII” usually provides
weak protections and/or poor utility
Inefficient Process for Obtaining Restricted Data• Can involve months of negotiation between institutions,
original researchers
Goal: make sharing easier for researcher without expertise in
privacy law/cs/stats
Sweeney `97
![Page 7: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/7.jpg)
Vision: Integrated Privacy Tools
Risk Assessment and
De-Identification
Differential Privacy
Customized & Machine-
ActionableTerms of Use
Data Tag Generator
Data Set
Query Access
Restricted Access
Tools we are working on
Consent from
subjects
Open Access toSanitized Data Set
IRB proposal & review
Policy Proposals and Best Practices
Database of Privacy Laws
& Regulations
Deposit in repository
![Page 8: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/8.jpg)
DataTags Ecosystem with Collaborations
![Page 9: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/9.jpg)
This Site Visit: Depth over Breadth
• Short presentations of specific works to illustrate:• Cross-disciplinary collaboration• Involvement team members from PIs to students• Knowledge transfer and outreach
• No attempt to survey everything we are doing• E.g. papers in FOCS, SODA, COLT, CSF, ICALP, …• See annual report and project website.• Please ask if you’re wondering!
![Page 10: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/10.jpg)
• Privacy Tools for Social ScienceGary King (IQSS)
• A Differentially Private Curator Tool& Supporting Theoretical Work
James Honaker (IQSS)Kobbi Nissim (CRCS)
• DataTags: The Vision & Implementationin Technology Science
Latanya Sweeney (Data Privacy Lab, IQSS)
• Logic Programming for Data TaggingStephen Chong (CRCS)
Agenda I
CS Soc Sci Stats Law Policy
CS Soc Sci Stats Law Policy
![Page 11: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/11.jpg)
Agenda II
• Education & OutreachSalil Vadhan (CRCS)Urs Gasser (Berkman)
• Lunch & Poster Session with Students & Postdocs
• Modern Framework for Privacy Analysis & Government Open Data
David O’Brien (Berkman)Alexandra Wood (Berkman)
• Bridging Notions of Privacy in CS, Law, Social ScienceKobbi Nissim (CRCS)
CS Soc Sci Stats Law Policy
CS Soc Sci Stats Law Policy
CS Soc Sci Stats Law Policy
![Page 12: PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,](https://reader035.fdocuments.net/reader035/viewer/2022062423/5697bfe31a28abf838cb52eb/html5/thumbnails/12.jpg)
Agenda III
• Summary & Future PlansSalil Vadhan (CRCS)
• Transition to PracticeMerce Crosas (IQSS)
• NSF Private Discussion
• Feedback
CS Soc Sci Stats Law Policy
CS Soc Sci Stats Law Policy