Part 1: Introduction Importance of geolocation Finding compromised accounts (prevent security...

Find Me If You Can: Improving Geographical Prediction with Social and Spatial Proximity

Page 1:

Find Me If You Can: Improving Geographical Prediction with Social and Spatial Proximity

Page 2:

Part 1: Introduction

Page 3:

Importance of geolocation

Finding compromised accounts (prevent security breaches).

Personalization of information based on location.

Page 4:

Motivation

• IP addresses typically provide accuracy at the city level.

• Results are inconsistent.

• Geo-IP databases require constant maintenance.

• Other geolocation strategies are non-transparent.

Page 5:

Part 1: Introduction

Part 2: Relationships

Part 3: Use of observations in a predictive model

Page 6:

Paper outline

• Study the relationship between geography and friendship.

• The authors use the Facebook social network to study this relationship.

• Some users provide their addresses.

• This yields 30.6 million edges between individuals with known locations.

Page 7:

Factors that affect relationships

• Social norms.

• Distance.

• Communication technologies.

Page 8:

Supplying addresses on Facebook

• Males are significantly more likely to share their address information than females.

• Users who share their addresses tend to have many more friends.

Page 9:

No bias problem.

Page 10:

Low density: power law with exponent −1.37.

High density: power law with exponent −3.07.

96% of people live in areas before the transition point, where the exponent is −1.37.

Page 11:

The curves increase linearly only for a small distance.

As we increase the radius, we expect the population to increase.

On the other hand, we also move further away from urban centers into rural areas.

Page 12:

• We can get a good fit to a curve of the form [formula not preserved]. The exponent is very close to c = −1.
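
An exponent like this one can be estimated with a straight-line regression in log-log space. The following is a minimal sketch on synthetic data, not the authors' fitting procedure. The pure power-law form `a * d**c`, the function name `fit_power_law`, and the sample values are all assumptions made for illustration, since the curve's actual form is not preserved in this transcript.

```python
import math

def fit_power_law(distances, probabilities):
    """Least-squares fit of log(p) = log(a) + c * log(d); returns (a, c)."""
    xs = [math.log(d) for d in distances]
    ys = [math.log(p) for p in probabilities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope of the regression line in log-log space is the exponent c.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    c = sxy / sxx
    # Intercept is log(a).
    a = math.exp(mean_y - c * mean_x)
    return a, c

# Synthetic points generated from p = 0.002 * d**-1 (illustrative values only).
distances = [1.0, 2.0, 5.0, 10.0, 50.0, 100.0]
probabilities = [0.002 * d ** -1.0 for d in distances]
a, c = fit_power_law(distances, probabilities)
print(f"a = {a:.4f}, c = {c:.2f}")
```

On real data the points would be binned friendship probabilities at each distance, and a fitted slope near −1 corresponds to the exponent reported on this slide.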

Page 13:

For short distances, the probability of friendship is higher in lower-density areas.

At about 50 miles the three curves converge.

At long distances, people in high-density areas are more likely to be friends.

Page 14:

rank_u(v) := |{w : d(u, w) < d(u, v)}|

We again see a smooth curve, with an exponent close to −1.
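
The rank definition above can be computed directly by counting. This is a minimal sketch, assuming locations are planar (x, y) points and d is Euclidean distance. The names `rank` and `locations` and the sample coordinates are hypothetical, and excluding u itself from the count is also an assumption.

```python
import math

def rank(u, v, locations):
    """rank_u(v): number of people w strictly closer to u than v is."""
    d_uv = math.dist(locations[u], locations[v])
    return sum(
        1
        for w, loc in locations.items()
        # Assumption: u is not counted as being closer to itself.
        if w != u and math.dist(locations[u], loc) < d_uv
    )

# Hypothetical coordinates for four people.
locations = {"u": (0.0, 0.0), "a": (1.0, 0.0), "b": (0.0, 2.0), "v": (3.0, 4.0)}
print(rank("u", "v", locations))  # a and b are both closer to u than v is
print(rank("u", "a", locations))  # nobody is closer to u than a is
```

Rank measures how many people live closer, rather than raw distance, so it adjusts for population density around u.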

Page 15:

All the curves have an exponent of about −1.

The probability is higher at low ranks for people in less dense areas, and higher at high ranks for people in more dense areas (cosmopolitan effect).

Page 16:

Part 1: Introduction

Part 2: Relationships

Part 3: Use of observations in a predictive model

Page 17:

[Equation not preserved.] The fitted constant is 0.0019 (taken from slide 12).

Page 18:

Page 19:

Attempt to recover the addresses of 75% of individuals.

Iterate, using the newly guessed locations as input alongside the locations provided by users.
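
The iterative scheme described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' algorithm. `predict_location` is a hypothetical stand-in predictor that picks the located friend's position with the highest summed log-probability under an assumed 1/(1 + d) friendship curve. The graph, the coordinates, and the number of rounds are made up.

```python
import math

def friendship_score(distance):
    # Assumed decreasing curve with exponent -1; not the fitted curve from the slides.
    return 1.0 / (1.0 + distance)

def predict_location(friends, known):
    """Pick the located friend's position that best explains all located friends."""
    candidates = [known[f] for f in friends if f in known]
    if not candidates:
        return None

    def log_likelihood(point):
        return sum(math.log(friendship_score(math.dist(point, c))) for c in candidates)

    return max(candidates, key=log_likelihood)

def iterate_predictions(graph, provided, rounds=3):
    """Repeatedly guess hidden locations, feeding earlier guesses back in as input."""
    guessed = {}
    for _ in range(rounds):
        known = {**guessed, **provided}  # user-provided locations take precedence
        new_guesses = {}
        for user, friends in graph.items():
            if user in provided:
                continue  # never overwrite a location the user supplied
            location = predict_location(friends, known)
            if location is not None:
                new_guesses[user] = location
        guessed = new_guesses
    return guessed

# Hypothetical friendship graph; only ann and cat have provided a location.
graph = {
    "ann": ["bob", "cat"],
    "bob": ["ann", "cat", "dan"],
    "cat": ["ann", "bob"],
    "dan": ["bob", "eve"],
    "eve": ["dan"],
}
provided = {"ann": (0.0, 0.0), "cat": (0.0, 1.0)}
print(iterate_predictions(graph, provided, rounds=3))
```

In this toy graph, eve has no located friends at first, so she only receives a guess once dan's location has been guessed in an earlier round. That is the benefit of feeding guesses back in as input.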

Page 20:

Prediction performance as a function of friend count.

Page 21:

A good trade-off is 5+blend.

Page 22:

Benefits

• Information about relationships with greater accuracy and in greater depth.

• The new algorithm.

Page 23:

Part 1: Introduction

Part 2: Relationships

Part 3: Use of observations in a predictive model

Page 24:

Future work

• Future work can improve the accuracy even further.

• Using social gatherings.

Page 25:

• Attaching timestamps to data.

• Giving more weight to new friendships than to old ones.