Yelper Helper Concept

20
Personalized Review Engine for Yelp Users Yelper Helper Alex Ruiz- Euler 08/2014

description

A Personalized Review Engine for Yelp Users.

Transcript of Yelper Helper Concept

Page 1: Yelper Helper Concept

•Personalized Review Engine for Yelp Users

•Yelper Helper

•Alex Ruiz-Euler•08/2014

Page 2: Yelper Helper Concept

•MVP•MVP

Page 3: Yelper Helper Concept

Ye •Yelper Helper

•PROBLEM •SOLUTION

Page 4: Yelper Helper Concept

•MVP•MVP•Yelper Helper: Overview.

•Determine usefulness of new reviews

•Compute user similarity

•User making query

Page 5: Yelper Helper Concept

•MVP•MVP•Yelper Helper: Overview.

•Determine usefulness of new reviews

•Compute user similarity

•User making query

Page 6: Yelper Helper Concept

•MVP•MVP•Yelp Reviews

•Useful tags

Page 7: Yelper Helper Concept

•Review

•User

•Review attribute

s

•User attributes

•Business attribute

s

•Useful

tags

•1 •Abe

• Vocabulary richness, stars...

• no. reviews, average rating...

•Average rating...

•3

•MVP•MVP •Predicting Number of “Useful” Tags

•Data structure (Las Vegas):

•363,691 reviews

•112,702 users

•3,536 businesses

• (source: Yelp Academic Dataset)

Page 8: Yelper Helper Concept

•MVP•MVP •Validation: Poisson regression / Comparing AIC.

• Feature Selection

•Model Selection

Page 9: Yelper Helper Concept

•MVP•MVP•Yelper Helper: Overview.

• Predict •usefulness

of new reviews

•Compute user similarity

Page 10: Yelper Helper Concept

•MVP•MVP•Yelper Helper: Overview.

• Predict •usefulness

of new reviews

•Compute user similarity

Page 11: Yelper Helper Concept

•MVP•MVP •Use-taste matrix / Restaurant-category matrix

•U: Ratings (stars)

• Rest 1

• Rest 2

• Rest 3

• Rest 4

•User 1

•1 •3 •2•User

2 •2 •4 •1•User

3•2 •1

•User 4 •1 •2 •1

• Hipster

• Divey

• Upscale

• Intimate

• Touristy

• Classy

• Romantic

•Rest 1

•1 •1•Rest

2•1 •1

•Rest 3

•1 •1 •1•Rest

4•1 •1 •1

•V: Restaurant profile

Page 12: Yelper Helper Concept

•2

•MVP•MVP •User profile matrix

• Hipster

• Divey

• Upscale

• Intimate

• Touristy

• Classy

• Romantic

• User 1

•3 •1 •33 •1• User

2•2

• User 3

•1 •1 •1• User

4•3

•1•3 •2 •1 •3 •1•5 •4 •4 •5

•2 •3•1 •2 •3

•1•3

Page 13: Yelper Helper Concept

•MVP•MVP •Similarity Matrix – Euclidean Distance Over UV.

•User 1 •User 2 •User 3

•User 4

•User 1 •0

•User 2 •1.5 •0

•User 3 •2 •3.4 •0

•User 4 •7.2 •1 •2 •0

Page 14: Yelper Helper Concept

•MVP•MVP •About Me – Alex Ruiz-Euler (PhD Political Science, 2014)

Page 15: Yelper Helper Concept

•MVP•MVP

•Thank You.

Page 16: Yelper Helper Concept

•MVP•MVP

Page 17: Yelper Helper Concept

•MVP•MVP

Page 18: Yelper Helper Concept

•MVP•MVP

Page 19: Yelper Helper Concept

•MVP•MVP•Problem: ~75% of Yelp reviews have 0 “useful” tags*.

• (* Las Vegas sample.)

Page 20: Yelper Helper Concept

•Issues with data

• For similarity:

Attributes of users in Yelp are about activity, not preferences.

→ Uncover taste preferences with collaborative filtering.

• For prediction:

Prediction of usefulness of review:

a) Too many zeros (zero-inflated!). Weird results (null vs. full).

→ Zero-inflated Poisson model.