The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP...

59
Online Advertising: The Good, The Bad, and The Ugly Yi-Min Wang Group Manager & Principal Researcher Cybersecurity & Systems Management Group Microsoft Research, Redmond

Transcript of The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP...

Page 1: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Online Advertising: The Good, The Bad, and The Ugly

Yi-Min WangGroup Manager & Principal Researcher

Cybersecurity & Systems Management Group

Microsoft Research, Redmond

Page 2: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

The Traffic-to-Money Converter &

The STC Generation

• The Traffic-to-Money Converter

Traffic-to-Money

ConverterTraffic Money

• STC = Search, Type, and Click

– The STC generation collectively generates a

lot of web traffic

• Traffic-to-money converter for the web

– Mass-market ads syndication programs

– Mass-market exploit affiliate programs

Page 3: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

The STC Traffic

Search Type Click

MerchantsNon-Merchants

Search

Engine

Target web pages

Mass-Market

Advertisement

Syndication

Program #1

Search

Ads

Advertisement

Syndication or

Exploit Affiliate

Programs

Spam

Ads-Portal

Page

Typo Domain

Ads-Portal

Page

Ads by

G/Y/MSpyware

Vendors

The Good

The Bad

The Ugly

Hacked

Ads

Page 4: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Web Analytics &Advertising Syndication

MerchantsNon-Merchants

Target web pages

Mass-Market

Advertisement

Syndication

Program #1

Ads by

G/Y/M

The Good

Page 5: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

<img alt="" border="0" name="DCSIMG" width="1" height="1"

src="http://statse.webtrendslive.com/DCSArO55rNH8I36lrbe6wexE5_5B8I/njs.gif?

dcsuri=/nojavascript&amp;WT.js=No"/>

Page 6: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Where’s The Bug?

1x1 transparent-gif web bug magnified

Page 7: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Web Bug

Page 8: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

http://WhatIsMyIP.com/

Page 9: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

http://ip-address.domaintools.com/

Page 10: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Web Analytics: Example #1

Primary URL on Primary Domain

Secondary URL on

Third-Party Domain

Primary URL

statse.webtrendslive.com/.../dcs.gif

Show nothing

Page 11: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Web Analytics: Example #2

Primary URL: http://www.aidsmeds.com/

Secondary URL on

Third-Party Domain

Primary URL

google-analytics.com/__utm.gif

ssl.google-analytics.com/urchin.js

Page 12: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

http://www.aidsmeds.com/

Secondary URL on

Third-Party Domain

Primary URL

google-analytics.com/__utm.gif

ssl.google-analytics.com/urchin.js

Page 13: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Advertising Syndication

Primary URL: http://www.aids.org/factSheets/

Secondary URL on

Third-Party Domain

Primary URL

pagead2.googlesyndication.com

/pagead/show_ads.js

Before ads are

displayed; even

without clicking

any ads

Show small ads

Page 14: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Potential Security and Privacy Concerns

• Scripts executed without user permission– Redirection to third-party domains happened without

user knowledge

• Not all URLs get recorded in browser history– Don’t know what, when, and why

• Many consumer machines have fixed IP addresses– Like car license plate number for information highway

• Corporate proxy IP addresses necessarily identify the company– Like company logo on company vans

• Redirected-to third-party domains can set cookies– Can ID <IP address, account> pair

Page 15: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

1.8% 1.4% 1.1% 1.0% 1.0%

13.0%

3.7%3.0%

0.0%

2.0%

4.0%

6.0%

8.0%

10.0%

12.0%

14.0%

Googlesyndication.com

Doubleclick.net

Atdmt.c

om

Fastclick.net

Amazon.com

Advertising.com

Casalemedia.com

Overture.com

Top Syndication Servers

Co

ve

rag

e o

f T

op

On

e M

illi

on

UR

Ls

Traffic Cameras for the Information Highway

One camera in every 8th street corner

Page 16: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Domain Parking &Typo-squatting

Type

MerchantsNon-Merchants

Target web pages

Advertisement

Syndication or

Typo Domain

Ads-Portal

Page

The Ugly

Page 17: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Secondary URL on

Third-Party Domain

Primary URL

apps5.oingo.com/apps/domainp

ark/domainpark.cgi

Zero content

Show full-page ads

Domain Parking

Primary URL: http://VictorasSecret.com/

Page 18: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

It used to be much uglier… (oingo.com)

Page 19: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

DomainSponsor.com

Page 20: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Internet Real Estate Business

• Rule of thumb: every unique visitor is worth 5 cents on average– $7.00 / 365 / $0.05 = 0.38 unique visitors/ day

• How to attract traffic:– Generic name domains

• Sex.com ($12 million), Diamond.com ($7.5 million), Business.com ($7.5 million in 1999), Sweatpants.com ($8,500)

– Typo-squatting domains• http://VictorasSecret.com/

– Trademark domains• http://www.MicrosoftPowerpoint.com/

Page 21: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

When typo of

slashdot got

slashdotted…

Page 22: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Strider Typo-Patrol

• Typo generation algorithm

– Missing-dot typos• wwwSouthwest.com

– Character-omission typos• MarthStewart.com

– Character-permutation typos• NYTiems.com

– Character-replacement typos• DidneyWorld.com

– Character-insertion typos• WashingtonPoost.com

Page 23: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Top Typo-squatting Domain Parking Servers

19%

14%

3.30% 3.30% 3.10%2.20%

0%2%4%6%8%

10%12%14%16%18%20%

Oingo.com

Domainsponsor.com

Sedoparking.com

Qsrch.com

Hitfarm

.com

Netster.com

Top Domain Parking Servers

% o

f 2,2

33 A

cti

ve T

yp

os

Page 24: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Strider URL Tracer with Typo-Patrolhttp://research.microsoft.com/URLTracer

One in every six active typo domains

was owned by Unasi/Domaincar

Overall, one in every four active typo

domains was parked with oingo.com

Page 26: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Search Spam

Search

MerchantsNon-Merchants

Search

Engine

Target web pages

Advertisement

SyndicationSpam

Ads-Portal

Page

The Ugly

Page 27: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Google search “coach handbag”

Page 28: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Spam Doorway: http://coach-handbag-

top.blogspot.com/ topsearch10.com

Page 29: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Content

Page 30: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Links

Page 31: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Redirection Spam

Primary URL: http://coach-handbag-top.blogspot.com/

Secondary URL on

Third-Party Domain

Primary URL

http://www.topsearch10.com/se

arch.php?aid=56979...

Redirect to full-page ads; cloaking

Page 32: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Spam Detection

• Content-based approach

– Information retrieval-based ranking

– Applied to too many fake pages that are never shown to any users (i.e., cloaking)

• Behavior-based approach

– Strider SearchMonkeys: mimicking human browsing in full fidelity

– Comment-spam hunting, cloaking detection, tracking redirection to known-spammer domain, etc.

– Turn search spam problem into system security problem

Page 33: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Strider Search Ranger System

Spam

Hunters

Search Monkeys

running actual browsers

Search

engines

Primary-

URL page

Third-party

domain page

Strider URL Tracer

Third-party

domain page

Spammed

forums

Redirection

report

Known-

Bad URLs

Unclassified URLs

grouped & ranked

by redirection

domains

Spam Verifier

Redirection Spam

Analyzer Known-bad

signatures Spam suspects URLs

Confirmed spam URLs

& redirection domains

1 2

3

4

Page 34: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Spammer-Targeted Categories

30.8%

14.2%

8.9%

27.5%

2.7%

7.6% 7.8%3.3% 3.9%

9.7% 11.6%

0%

5%

10%

15%

20%

25%

30%

35%

Dru

gs

Adult

Gam

bling

Rin

gtone

Money

Acc

essorie

s

Travel

Car

s

Furnitu

re

Musi

c

Ave

rage

Spammer-targeted Categories

Per-

Cate

go

ry S

pam

Perc

en

tag

e

Density DCG/Max

Page 35: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Top Spam Doorway Domains

493

396

296242 225 218 207 178 172 150 131 124 123 110

0

100

200

300

400

500

600

blo

gspot.co

mnet

scap

e.co

m

hom

eto

wn.a

ol.com

hom

eto

wn.a

ol.de

oas

.org

xoom

er.a

lice.

it

hom

e.a

ol.co

mfr

eew

ebs.

com

blo

gstu

dio

.com

max

pag

es.c

om

usa

id.g

ov

blo

gshari

ng.c

om

sitegtr

.com

foro

space

.com

blo

g.h

lx.c

om

# o

f S

pa

m A

pp

ea

ran

ce

3,882

Page 36: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Spam Percentages

77% 74%84%

91%78% 77%

95%

52%

99%

81% 85%93%

100% 95% 100%

0%

20%

40%

60%

80%

100%

blo

gsp

ot.co

mnet

scap

e.co

m

hom

etow

n.a

ol.c

om

hom

etow

n.a

ol.d

eoas

.org

xoom

er.a

lice.

it

hom

e.ao

l.com

free

web

s.co

m

blo

gst

udi

o.c

om

max

pag

es.c

om

usa

id.g

ov

blo

gsh

arin

g.co

msi

tegt

r.co

mfo

rosp

ace.

com

blo

g.h

lx.c

om

% U

RL

s D

ete

cte

d a

s S

pa

m

At least 3 out of 4 were spam!

Page 37: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Top .gov/.edu Doorway Domains

150

6354

35 34 32 27 25 24 22 18 17 16 15 13

0

2040

6080

100

120140

160

usa

id.g

ov

mit.e

du

gat

ech.e

du

ucs

d.e

du

tsm

u.e

du

cuden

ver

.edu

uco

nn.e

du

eva

nsvi

lle.e

du

har

vard

.edu

virgin

ia.e

du

apu.e

du

neu

.edu

dot.gov

uch

icag

o.e

du

was

hin

gto

n.e

du

# o

f S

pa

m A

pp

ea

ran

ce

s

Page 38: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Universal Redirectors

• www.google.com/url?q=http://evetamthes.the.forthnet.gr/login.htm

• rds.yahoo.com/_ylt=/*http://frft.networkforbestever.org/ps/

• store.adobe.com/cgi-bin/redirect/n=14630?http://rme19-funny-

ringtones.blogspot.com

• www.usaid.gov/cgi-bin/goodbye?http://xanax.anothervision.info

• www.ihs.gov/PublicInfo/Publications/Kids/safety/IHS_DisclaimerKids_prod.cfm?link_out=http://waypossible.com/dr/cas

ino

• www.fmcsa.dot.gov/redirect.asp?page=http://maxpages.com/tro

ctrocbas

• serifos.eecs.harvard.edu/proxy/http://pharmacy-goods.com/r/tramadol

• www.library.drexel.edu/cgi-bin/r.cgi?url=http://replica-

watches.20six.co.uk

Page 39: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Top Redirection Domains

1156

1022

879

649543

398 356 334 326 309 308 289 266 260 258

0

200

400

600

800

1000

1200

1400

pays

efe

ed.n

et

topse

arc

h10.c

om

topm

eds10

.com

them

p3dir

ect.co

m

sea

rchad

v.c

om

six

xx.in

fo

rightf

inder.

net

vip

-onlin

e-s

earc

h.in

foa3b

4.in

fo

topm

obile

10.c

om

yourf

astf

ind.o

rgare

arate

.com

find-m

ore

.biz

yourf

reevid

s.c

om

webre

sours

es.

info

# o

f S

pa

m A

pp

ea

ran

ce

s

Page 40: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Malicious Websites

Search Type

MerchantsNon-Merchants

Search

Engine

Target web pages

Exploit Affiliate

Programs

Spam

Ads-Portal

Page

Typo Domain

Ads-Portal

Page

Spyware

Vendors

The Bad

Hacked

Ads

Page 41: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Google search “pain killer”

Page 42: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Malicious Spam

Primary URL: http:// www. blogigo. de/ pain_killer

Vulnerability exploits;

Sometimes window closed

http:// biopharmasite. info/

directory.php

Secondary URL on

Third-Party Domain

Primary URL

Page 43: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150
Page 44: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150
Page 45: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150
Page 46: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Exploit failed… Click here to install…

Page 47: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

TheRegister.com Malicious Banner

Page 48: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

MySpace.com Malicious Banner

Page 49: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Where Did This Come From?

Page 50: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

IDS

Firewall

Honeypot versus HoneyMonkey

Honeypot

Server

Process

Server-Side

Vulnerability

Server

Process

Malicious

or Hacked

Client

Malicious

Network

Packets

Client-Side

Vulnerability

Browser

Malicious

or Hacked

Web Server

HTTP

Request

Malicious

HTTP

Response

Blacklist

Honey

Monkey

Browser

= Spider Crawler

Takedown

Page 51: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

FDR

Sandbox

Virtual

Machine

URL

Tracer

Other

Content

Provider

HoneyMonkey Blackbox Exploit Detection

Browser

Content

Provider

Obfuscated

Java Scripts

Exploit

Provider

Third-Party URLs

Malware Installation

Malicious Scripts

Page 52: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Density of Malicious Websites

Suspicious List Popular List

# URLs scanned 16,190 1,000,000

# Exploit URLs 207 (1.28%) 710 (0.071%)

# Exploit URLs

After Redirection

(Expansion Factor)

752

(263%)

1,036

(46%)

# Exploit Sites 288 470

SP2-to-SP1 Ratio 204/688 = 0.30 131/980 = 0.13

Page 53: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Infection Rate Heavily Depends on Patch Level(May~June 2005)

# Exploit URLs # Exploit Sites

Total 752 288

WinXP SP1-UP(UP=UnPatched)

688 268

WinXP SP2-UP 204 115

WinXP SP2-PP(PP=Partially Patched)

17 10

WinXP SP2-FP(FP=Fully Patched)

0 0

Page 54: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

0

5

1015

20

25

30

0 50 100 150 200 250

Site ranking based on the number of hosted exploit URLs

Nu

mb

er

of

ho

ste

d

ex

plo

it U

RL

sSite Ranking by Number of Hosted Exploit URLs

toolbarpartner.com

.edu: hacked course bulletin board

Page 55: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Exploit Pages Organized by Account Names

http://ToolbarPartner.com

/adverts

/romas /west /0MhNSYFE

/x-web

/index.html

/index2.html /page1.htm

/index.html /index2.html /page1.htm

/index.html

/index2.html /page1.htm

/index.html

/index2.html /page1.htm

Pretend to be an

Advertisement

Syndicator

Page 56: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Zero-Day Exploit DetectionVulnerabilities exploited before patch was released

• Used to be an ad-hoc & manual process that relied heavily on external finders

• HoneyMonkey turned it into a systematic & automatic process that allows Microsoft to lead the battle– HoneyMonkey running on fully patched WinXP SP2 VM

constantly scanning the 752 exploit URLs

• The Javaprxy.dll zero day– Early July, 2005: detected the first zero-day exploit URL within

2.5 hours of scanning confirmed to be the first in-the-wild exploit URL reported to MSRC

– 26 of the 752 URLs “upgraded” to the javaprxy exploit

• 25 of them generated third-party URLs to an unknown exploit provider site: hxxp://82.179.166.2/[8 random chars]/test2/iejp.htm

– Takedown notices sent most, but not all, of the 25 URLs stopped exploiting javaprxy

Page 57: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Observations

• Monitoring easy-to-find exploit URLs is effective

– Zero-day exploits need to connect to popular pages

• Monitoring content providers with well-known

URLs is effective (because they cannot move)

– Exploit providers can move and randomize URLs

• Monitoring highly ranked and advanced exploit

URLs is effective

– First detected zero-day exploit URL belongs to the #9

site

– 7 of the top 10 sites upgraded (by connection counts)

– Nearly half of the SP2-PP exploit URLs upgraded

Page 58: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

HoneyMonkey Anti-Exploit Process

Search

Engine

Crawler

O(1010) pages

on the Web

HoneyMonkey Network

of O(102) PCs

running unpatched VMs

Top O(108)

URLs(ranked by

click-through

counts, etc.)

O(104)

exploit

URLs

HoneyMonkey

Network of O(101) PCs

running partially- or

fully-patched VMs

O(101)

zero-day

exploit

URLs

Anti-Spyware

& Anti-Virus

Security

Response

Center

Legal

Takedown

Corporate

Proxy

Blocking

ISP

Blocking

Browser

Blocking

Search

Result

Blocking

SPAM

URLs

SPIM

URLsSpam URLs from Strider Search Ranger

Page 59: The STC Generation & The “Casino Business Model” · 2018-01-04 · •Corporate proxy IP addresses necessarily identify the company ... ringtones.blogspot.com ... 0 50 100 150

Summary

• A common redirection-based framework for analyzing:– Web Bugs

– Advertising Syndication

– Typo-squatting

– Redirection Spam

– Malicious websites

• Automated web patrol with Strider monkeys– Analyzing individual web pages with known-bad

signatures

– Analyzing groups of web pages to discover new signatures