ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online...

19
UNIVERSITY OF GRANADA ISP-Enabled Behavioral Ad Targeting without Deep Packet Inspection Gabriel Maciá-Fernández 1 , Yong Wang 2 , Rafael Rodríguez-Gómez 1 , Aleksandar Kuzmanovic 3 1 University of Granada (Spain) 2 University of Electronic Science and Technology (China) 3 Northwestern University (USA)

Transcript of ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online...

Page 1: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

ISP-Enabled Behavioral Ad Targeting without Deep Packet

Inspection

Gabriel Maciá-Fernández1, Yong Wang2,

Rafael Rodríguez-Gómez1, Aleksandar Kuzmanovic3

1University of Granada (Spain)2University of Electronic Science and Technology (China)

3Northwestern University (USA)

Page 2: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Google, Yahoo, AOL dominate the online advertising market

Online advertisers v.s. ISPs

ISP started deploying deep packet inspection techniques to track and collect user browsing behavior

Background

“Online advertising is a $20 billion industry that is growing rapidly…”

2/19

Page 3: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Federal Wiretap Act states a simple prohibition: “thou shalt not intercept the contents of communications…Violations can result in civil and criminal penalties”

This prohibition has clearly been violated by deep packet inspection techniques.

Electronic Communications Privacy Act states that any provider can hand-over non-content records to anyone except the government

Background

3/19

Page 4: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Is it possible torecover userbrowsing patternsonly from thelimited informationprovided by TCP headers?

How accurately?

How scalable wouldthis approach be?

Our challenge

4/19

Page 5: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Our Approach

Trace analysis: in a tapping point , extract web browsing

communication features from traces

Detection: Correlate the information from the two sources to detect the web

pages actually accessed by clients

Profile websites: collect information about web

pages from websites

5/19

Page 6: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Website profiling (crawling websites):

For every web page in a site, we record:

• Size in bytes (plain/compressed) of root file and all embedded objects

• Location of objects (internal / external)

• List of embedded objects

• List of links

The whole process (I)

6/19

Page 7: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Web browsing features analysis from traces:

Obtain traces in a tapping point

Filter and separate web traffic from every source IP to any destination

Estimate the size and the location of thedownloaded objects:

• Web pages delimited by a time threshold: 1 second

• Downloaded objects delimited by PUSH flag

The whole process (II)

7/19

Page 8: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Detection algorithm basics:

Find the web page in the website profile that bestmatches the sizes and locations of the objectsdetected in the trace

Details:Unique objects or root files lead to direct detection

Separate comparison for root files and objects

Ambiguities are clarified by selecting pages with:• Higuest percentage of detected objects

• Consistent navigation pattern (Link analysis)

The whole process (III)

8/19

Page 9: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Sources of error:

Estimation of the objects size: cookies, chunk sizeinformation

Dynamic website behavior

Browsing behavior: pipelining, caching, parallelbrowsing

Spurious requests

The whole process (IV)

9/19

Page 10: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Experimental setup:6 different websites for web profiling

We crawl a subset of 2000 pages for each website

We generate quasi-random walks on each websitewith 100 pages and obtain TCP level traces

Experimental evaluation

List of websites (URL)

New York Times (www.nytimes.com)

FC. Barcelona (www.fcbarcelona.com)

IKEA (www.ikea.com)

Toyota (www.toyota.com)

University 1 (www.northwestern.edu)

University 2 (ceres.ugr.es)

10/19

Page 11: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Site uniqueness results:

Uniqueness detection is a powerful feature

Experimental evaluation

Page

s[%

]

100

120

80

60

40

20

Nytimes Barcelona IKEA Toyota Univ1 Univ2

Pages with unique objectsPages with unique root files

Pages with unique objects/root files

11/19

Page 12: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Basic performance results:

Mean success rate = 86%

False positives rate < 5%

Experimental evaluation

Perc

enta

ge[%

]

Nytimes Barcelona IKEA Toyota Univ1 Univ2

100

80

60

40

20

0

Succ. rateFalse positives

12/19

Page 13: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Sensibility to outdated profiles or traces:

Experimental evaluation

TRACES ARE OLD

PROFILES ARE OLD

Days

Ch

ange

rate

[%]

13/19

Page 14: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Sensibility to outdated profiles or traces:

Experimental evaluation

TRACES ARE OLD

PROFILES ARE OLD

Perc

enta

ge[%

]Pe

rcen

tage

[%]

Days

Days14/19

Page 15: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Different browsing scenarios:

Experimental evaluation

ScenarioSuccess

ratesFalse

positives

PipeliningDisabled 89% 4%

Enabled 88% 4%

CacheDisabled 90% 4%

Enabled 89% 4%

Type of navigation

Sequential 89% 4%

Parallel-two 74% 7%

Parallel-four 63% 8%

15/19

Page 16: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Scaling the website profile:

From 2000 to 9200+ web pages crawled in Toyota

78% of pages have either unique size objects orunique root files

The success rate reduces from 89% to 81%

The false positives increase from 4% to 8%

Experimental evaluation

16/19

Page 17: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Experiments in the wild:

Logged visited URIs and timestamps for 17 volunteers

User navigation replayed and traces saved

Top 41 websites crawled

Success rate of 85%

False positive ratio is 9%

Experimental evaluation

17/19

Page 18: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

We are able to recover web browsingpatterns without inspecting payload

Our detection algorithm achievesdetection rates around 86% with low false positives (<5%)

The methodology is also scalable and resilient to a wide number of error sources: outdated information (profiles ortraces), pipelining, caching, different typesof navigation, etc.

Conclusions

18/19

Page 19: ISP-Enabled Behavioral Ad Targeting without Deep Packet€¦ · advertising market Online advertisers v.s. ISPs ISP started deploying deep packet inspection techniques to track and

UNIVERSITY OF

GRANADA

Thank you for your attention

19/19