Panos Ipeirotis (with Anindya Ghose and Beibei Li) Leonard N. Stern School of Business New York...

39
Panos Ipeirotis (with Anindya Ghose and Beibei Li) Leonard N. Stern School of Business New York University Towards a Theory Model for Product Search

description

Towards a Theory Model for Product Search. Panos Ipeirotis (with Anindya Ghose and Beibei Li) Leonard N. Stern School of Business New York University . How can I find the best hotel in New York City?. Recommender Systems? . Problem : Low purchase frequency for many product types - PowerPoint PPT Presentation

Transcript of Panos Ipeirotis (with Anindya Ghose and Beibei Li) Leonard N. Stern School of Business New York...

Page 1: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Panos Ipeirotis(with Anindya Ghose and Beibei Li)

Leonard N. Stern School of BusinessNew York University

Towards a Theory Model for Product Search

Page 2: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

How can I find the best hotel in New York City?

Page 3: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Recommender Systems?

Problem:• Low purchase frequency for many product

types• Cold start for new consumers and new

products• Privacy and data availability: Individual-level

purchase history to derive personal preference.

Page 4: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Facet Search?

Problem: - Likely to miss a deal;- Still need to rank the results!

Page 5: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Skyline?

Problem:- Feasibility diminishes as # of product characteristics increases.

Skyline: Identify the “Pareto optimal“ set of results.

Page 6: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

IR-based approaches?

Problem:• Finding relevant documents is not the same as

choosing a product• We can “consume” many relevant documents, but only

seek a single product…

Perhaps IR theory should not be driving product search

Page 7: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Theoretical Background

• Economic Surplus: Quantify gains from exchanging goods.

How to define “Best Value”?

Everything has its utility: e.g., products, money.Buying a product involves the exchange of utilities in-between.

• Utility Theory: Measure the satisfaction from consumption of various goods and services.

How to measure “Surplus”?

How does “Utility” work?

To measure “Utility”: Utility of Products, Utility of Money.

Page 8: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Theoretical Background: Utility Theory

How to define “Best”?

Utility: Quantify the happiness.

Utility of Product

Get Hotel(happy)

Pay Money(unhappy)

Utility of Money

>=?=<

Page 9: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Utility of Money

Utility of Money – The utility that the consumer will lose by paying the price for that product.

1M 1M+1000 100

( )mU p p

Page 10: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Utility of Product

Utility of Product : The utility that the consumer will gain from buying the product.

Simplest case: Use a linear combination:1( ) ( ,..., ) .

KK k k

p pU X U x x x

Latent Consumer PreferencesObserved Product

Characteristics

Unobserved ProductCharacteristics

Page 11: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Utility Surplus

Utility Surplus for a consumer is the gain in the utility of product minus the loss in the utility of money.

The higher the surplus, the higher “value” from a product

Rank high the products that generate high surplus!Key Challenge: How to estimate the

preferences?

Utility of Product Utility of Money Stochastic ErrorUtility Surplus

1

Kk k

k

x

p

Page 12: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

From Utility Surplus to Market Shares and Back: Logit Model (McFadden 1974)

Utility is fundamentally private: We can never observe it!

But we can observe actions driven by utility!

Assumption: Everyone has similar preferences, together with has a personal component of choice, the error term ε

Observed Market Share_j = Pr(Consumers choose j over everything else) = Pr(Surplus_j > Surplus_everything else)

Estimation Strategy: is proportional to the market share.

( )ijP choice

Page 13: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Logit Model - Estimation

,

,

exp( ( ))( ) .

1 exp( ( ))ji

jljl

VP choice

V

( ) .obsji

jtotal

dP choice market share

d

observed demand for j

observed total demand

1

ln( ) . K

obs k kj j j j

k

pd x

Estimation Strategy: is proportional to the market share.

( )ijP choice

Solution by logistic regression!

Notice: Logistic Regression is a direct derivation from a theory-driven user behavior model, not a heuristic (McFadden, Nobel Prize in 2000)

Log of demand equalshotel utility! Estimate

α,β parameters by regression

Page 14: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

14

But, consumers have different preferences

Page 15: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Overall Preference

BLP Model (Berry, Levinsohn, and Pakes 1995)

BLP Model: All consumers are not the same; Consumers belong to groups with different

preferences; Group preference defined through consumer

demographics, income, purchase purpose, …, etc.

Type 1Type 2 Type 3

Problem: We do NOT know T for individual consumers

Overall Preference

T = [age, gender, income, purpose, …] Preference = f (T)

Page 16: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

BLP Model (Berry, Levinsohn, and Pakes 1995)

Basic Idea: Monitor demand for products in different markets.

differences in demand different demographics

What do we know? Demographic distributions! Demographic differences in different markets! Overall demand in different markets!

Page 17: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

BLP Model - Example

Table A: 80% Indians, 20% Americans; - Lamb: 80% gone, Chicken: 20% gone.Table B: 10% Indians, 90% Americans; - Lamb: 10% gone, Chicken: 90% gone.

Example 2: Lunch BuffetLamb: Stewed lamb with chillies;Chicken: Flat pasta cooked with cream and cheese;

Indians favor lamb, and Americans favor chicken!BLP: Aggregate Demand Individual

Preference

Page 18: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

BLP Model – In Hotel Context

Miami: 70% Couples, 30% Business; - Spa: 80% demand, Conference: 20% demand.New York: 30% Couples, 70% Business; - Spa: 40% demand, Conference: 60% gone.

Example: Estimate preferences based on demographics Hotels with spa and pool

Couples favor spas, and business favor conference!BLP: Aggregate Demand Individual

Preference

Hotels with conference centers

Page 19: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Surplus-based Ranking

Personalized Ranking: ask for consumer demographics and purchase context and estimate surplus using BLP for personalized ranking

Basic Idea: • Compute the surplus for each product based on

consumer preferences (average across consumers)• Rank products accordingly• Top-ranked product provides “best value”

Again, people are different…

Page 20: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Surplus-based Ranking

Best Value: Cyber Apartments

Best Value: Novotel

PhD StudentsLousy, Distant, Tiny,…

Cheapest!!!

ProfessorsFancy, Convenient,

Comfortable,… Costly

Example 3: Hotel Search at a conference

Page 21: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Hotel Search Experiment - Data

Service Characteristics: TripAdvisor & Travelocity

Location Characteristics: Social geo-tags Geo-Mapping Search Tools Image Classification

Text Mining: “Subjectivity,” “Readability”Stylistic Characteristics for the quality of word-of-mouth:

Demand Data: Travelocity, 2117 hotels, 11/2008-1/2009.Demographics: TripAdvisor, “Travel purpose,” “Age group” for travelers in different cities

Page 22: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Result (1) - Economic Marginal Effects

Characteristics Marginal EffectPublic transportation 18.09%

Beach 18.00%Interstate highway 7.99%

Downtown 4.70%Hotel class (Star

rating)3.77%

External amenities 0.08%Internal amenities 0.06%Annual Crime Rate - 0.27%

Lake/River - 12.94%

Page 23: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Result (2) – Preference Deviations Based on Different Travel Purposes

Consumers with different travel purposes show different preferences towards the same set of hotel characteristics.

Page 24: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Result (2) - Sensitivity to Online Rating Based on Different Age Groups

Age 18-34 pay more attention to reviews than other age groups.

Page 25: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Ranking Evaluation - User Study (1)

Experiment 1: Blind pair-wise, 200 MTurk users, 6 cities, 10 baselines.

User Explanations: Diversity; Price not the only factor; Multi-dimensional preferences.

Our reasoning: Our economic-based model introduces “diversity” naturally.

Finding: Our surplus-based ranking is overwhelmingly preferred in any single comparison! (p=0.05, sign test, in all comparisons)

Page 26: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Ranking Evaluation - User Study (1)

26

Page 27: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Ranking Evaluation - User Study (2)

77%

23%

BusinessPersonalizedAverage

81%

19%

FamilyPersonalizedAverage

In all cases, the personalized approach is preferred

Experiment 2: Blind pair-wise, 200 MTurk users.

Page 28: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Model Comparisons: Better Utility Models, Better Ranking

28

Extended Model with

Demographics

Hybrid Model BLP PCM

Nested Logit OLS

RMSE 0.0347 0.0881 0.1011 0.1909 0.2399 0.3215MSE 0.0012 0.0078 0.0102 0.0364 0.0576 0.1034MAD 0.0100 0.0276 0.0362 0.0524 0.1311 0.2673

Economic Models of Discrete Choice:Logit (McFadden 1974), BLP (1995), PCM (2007), Hybrid (2011)

Predictive Power:Out-of-Sample prediction, training set size 5669, test set size 2430.

Page 29: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Model Comparisons: Better Utility Models, Better Ranking

29

Economic Models of Discrete Choice:Logit (McFadden 1974), BLP (1995), PCM (2007), Hybrid (2011)Ranking Performance:

Hybrid City

BLP PCM Nested Logit

Logit

New York 68% 64% 61% 67%Los Angeles 70% 71% 67% 73%

SFO 68% 73% 78% 74%Orlando 72% 65% 76% 70%

New Orleans 70% 66% 68% 69%Salt Lake City 64% 69% 62% 65%

Significance LevelP=0.05≥ 59%

P=0.01≥ 62%

P=0.001≥ 66%

(Sign Test, N=100)

We observed that better utility models improve ranking

Page 30: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

30

Hotel Search Engine Experiments

Page 31: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Impact of Search Engine Design on Consumer Behavior

31

Research Question• What is the impact of different ranking

mechanisms on consumer online behavior?

Page 32: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Impact of Search Engine Design on Consumer Behavior

32

Randomized Experiments

• 890 unique user responses, two-week period • Hotel search engine designed using Google App

Engine• Online behavior tracking system

• Subjects recruited online via AMT• Prescreening spammers (95% approval, <1 min)

Page 33: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

33

Randomized Experiments

• Hotel Search Engine Application (http://nyuhotels.appspot.com)

Search Context

RankingMethods

Impact of Search Engine Design on Consumer Behavior

Page 34: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Impact of Search Engine Design on Consumer Behavior

34

Page 35: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Impact of Search Engine Design on Consumer Behavior

• Ranking Experiment Design (Mixed):

35

Randomized Experiments

(Within-Subject)

New York City Los Angeles

(Be-tween-

Sub-ject)

Treatment Group 1 BVR BVR

Treatment Group 2 Price Price

Treatment Group 3 Travelocity User Rating Travelocity User Rating

Treatment Group 4TripAdvisor User Rating TripAdvisor User Rat-

ing

• Hotel Search Engine Application (http://nyuhotels.appspot.com)

Page 36: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Main Results – Ranking Experiment

36

• Surplus-based ranking outperforms the other three in motivating online engagement and purchase.

Purchase Propensity

(NYC)

Purchase Propensity

(LA)BVR 0.80 0.92Price 0.62 0.75

Travelocity 0.55 0.43TripAdvisor 0.61 0.57

Group mean over subjects.Significant (p<0.05), Post Hoc ANOVA.

Page 37: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Robustness Tests

37

• Users may change ranking and purchase under others.

- < 5% users changed ranking methods; hold after excluding those users.• Users with “planned purchases” favor “value” more.

- Allow users to leave without purchase.

• Users in BVR group are more likely to convert? - Randomized assignment; - Ask users about their online hotel shopping behavior (how

often do they search/purchase, price range, etc.), no significant

difference.

• Users didn’t buy from the top ranked, but lower ranked.

- BVR leads to significantly higher # purchases on top-3 positions, compared to the other ranking methods.

Page 38: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Conclusion & Future Work

Major Contributions:• Inter-disciplinary approach• Captures consumer decision making process• Privacy-preserving: Aggregate data Personal

preferences

• Product bundles• Integrate search into choice model

• Using consumer browsing info• Integrated utility maximization model

Future Directions:

A New Ranking System for Product Search

Economic utility theory, “Best Value” ranked on Top;

Validated with user study with +15000 users, 6 cities.

Page 39: Panos Ipeirotis (with  Anindya Ghose  and  Beibei  Li) Leonard N. Stern School of Business New York University

Demo: http://nyuhotels.appspot.com/

Q & A

Thank you!