Overlapping Quality Adjustment using Online Data

51
Overlapping Quality Adjustment using Online Data Preliminary Results Alberto Cavallo MIT

Transcript of Overlapping Quality Adjustment using Online Data

Page 1: Overlapping Quality Adjustment using Online Data

Overlapping Quality Adjustment using Online Data

Preliminary Results

Alberto CavalloMIT

Page 2: Overlapping Quality Adjustment using Online Data

2

STATE STREET ASSOCIATES

Outline

• Scraped Online Data• Introduction• Advantages

• Overlapping Quality Adjustments• CPI data vs Online Prices• Example: US Televisions

Page 3: Overlapping Quality Adjustment using Online Data

3

STATE STREET ASSOCIATES

Goods & services online

• Food and Beverages• Electronics• Household Products• Apparel• Health, Beauty & Drugs• Transportation (airline, trains, taxis)• Gas/Fuel/Electricity• Real Estate• Labor

~ 60% of CPI weights can now be covered with online data

Page 4: Overlapping Quality Adjustment using Online Data

4

STATE STREET ASSOCIATES

Online prices behave like offline prices

In some countries (eg US) online and offline markets are closely integrated   similar price levels & pricing strategiesOften online prices have a markup over offline data, but it tends to be constant over time implies similar inflation rates In countries where online markets are very small, online and offline prices can be identical because firms do not have a differentiated pricing strategy online (they simply show their offline database in their website)

Page 5: Overlapping Quality Adjustment using Online Data

Online prices as a new source of Price Index data

Disadvantages:• Relatively few retailers• No quantity data  expenditure weights?• Only ~60% CPI categories (most services are not yet available online)

Advantages:• Low cost data collection• Prices are available in real‐time• Data can be collected remotely in ~70 countries• Many product details:  brand, package size, sale indicator, price control, etc.• High‐frequency (daily)• Information on all products sold in each retailer ( i.e. census within retailer)• New products sampled automatically from the day they are introduced until 

they disappear from the store• Retailers group products into narrow caregories or “urls”  close 

substitutes are easy to identify

Page 6: Overlapping Quality Adjustment using Online Data

Billion Prices Project at MIT• Joint work with Roberto Rigobon• Started collecting online prices in 2007• Daily prices from ~1000 retailers in 50 countries 

Page 7: Overlapping Quality Adjustment using Online Data

Daily Price Indices

Available for 21 countries: USA, Argentina, Australia, Canada, Chile, China, Colombia, France, Germany,  Greece, Ireland, Italy, Japan, Korea, Netherlands, Russia, South Africa, Spain, UK, Uruguay, and Venezuela

US and Argentina indices are publicly available onlineDaily series published with a 3‐day lagUse standard CPI techniques and weights, but we don’t make explicit quality adjustments. 

Page 8: Overlapping Quality Adjustment using Online Data

8

STATE STREET ASSOCIATESUS Daily Price Index

online

Source: BPP – PriceStats – BLS (CPI‐U, US city‐average, all items, NSA). Updated until 11/20/2012.

Page 9: Overlapping Quality Adjustment using Online Data

9

STATE STREET ASSOCIATESUS Annual Inflation

online

Source: BPP – PriceStats – BLS (CPI‐U, US city‐average, all items, NSA)

Page 10: Overlapping Quality Adjustment using Online Data

10

STATE STREET ASSOCIATES

US Monthly Inflation

online

Source: BPP – PriceStats – BLS (CPI‐U, US city‐average, all items, NSA)

Page 11: Overlapping Quality Adjustment using Online Data

11

STATE STREET ASSOCIATESFrance ‐Monthly Inflation

Source: BPP – PriceStats – BLS (CPI‐U, US city‐average, all items, NSA)

Page 12: Overlapping Quality Adjustment using Online Data

12

STATE STREET ASSOCIATES

Greece – Monthly Inflation

Source: BPP – PriceStats – BLS (CPI‐U, US city‐average, all items, NSA)

Page 13: Overlapping Quality Adjustment using Online Data

The Problem of Quality Adjustment

82

84

86

88

90

92

94

96

98

100

102

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17

Price

Month

Model 1 Model 2 Model 3

Traditional CPI Data

Ohta and Griliches (76):  “..what the hedonic approach attempted was to provide a tool for estimating ‘missing’ prices “Turvey (1999):  Advocates a chained geometric index of matched observations, incorporating entrys and exits  of products inmediately, with a sample size that varies over time. 

Page 14: Overlapping Quality Adjustment using Online Data

Overlapping Quality Adjustment

From the ILO/IMF/OECD/… CPI Manual:

“When there is overlap, simple linking…may provide an acceptable solution to the problem of … quality change”. 

“in practice, however, this method is not used very extensively because the requisite data are seldom available”

“the information needed for this…will never be available if price collectors … introduce a new quality when an old one is dropped”

Page 15: Overlapping Quality Adjustment using Online Data

Traditional CPI data 

82

84

86

88

90

92

94

96

98

100

102

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17

Price

Month

Model 1 Model 2 Model 3

Page 16: Overlapping Quality Adjustment using Online Data

Online Data

82

84

86

88

90

92

94

96

98

100

102

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17

Price

Month

Model 1 Model 2 Model 3

Page 17: Overlapping Quality Adjustment using Online Data

Online Data

82

87

92

97

102

107

112

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19

Price

Month

Model 1 Model 2 Model 3 Model 5 Model 6 Model 7

Page 18: Overlapping Quality Adjustment using Online Data

How online data can help

• New products are immediately priced in the data

• Large number of overlapping products• Retailers automatically identify close substitutes (via url)

• High frequency:– Detect new products sooner – Easier to identify sales / spikes / anomalies

Page 19: Overlapping Quality Adjustment using Online Data

Literature on Matched‐Model vsHedonic Indices

• Silver & Heravi (99):  Scanner data. “Matched approach as a special case of the theoretically based SEHI approach. Caution is however advised when the loss of data in matching is severe. “

• Aizcorbe, Corrado & Doms (2003): matched indices can yield similar numerical results to dummy‐variable hedonics when :• Panel of prices with constant quality• “High frequency” to minimize importance of “unmatched” models

Page 20: Overlapping Quality Adjustment using Online Data

Example With US TVs• Largest electronics retailer in the US in 2009• Daily panel data for 544 LCD televisions• Sold from April 2008 to November 2009 (577 days)

Page 21: Overlapping Quality Adjustment using Online Data

Product Life (duration)

Note: Excluding products lasting less than 30 days (refurbished or open box)

Mean Product life : 178 days (median 154)

Page 22: Overlapping Quality Adjustment using Online Data

Changes per Product

Mean Price Changes per id: 6  (median 4)

Page 23: Overlapping Quality Adjustment using Online Data

Share of “Exits”

Page 24: Overlapping Quality Adjustment using Online Data

Alternative Price Indices

• CPI – Televisions (BLS, with hedonic adjustment)• Online Index – with “traditional CPI sampling”– Small sample (few varieties)– Censored price spells (missing introduction prices)

1. Comparable (treated as same good ) all P change)2. Non‐Comparable (treated as different goods)  al Q 

change )

• Online Index  ‐ Small sample • Online Index • Online Index with daily data

Page 25: Overlapping Quality Adjustment using Online Data

Online Index ‐ Comparables

• Un‐weighted geometric mean of price relatives (Jevons index)

• Monthly data (1st day of each month)• Censored Spells• Comparables: start with all products available on the first month, 

then replace them as they disappear with the closest model (price and characteristics). 

• Treated as same good all price change 

• Small sample: only “Samsung” brand (64 TVs)

Page 26: Overlapping Quality Adjustment using Online Data

020

0040

0060

00pr

ice

01apr2008 01oct2008 01apr2009 01oct2009date

SAMSUNG TVs

Page 27: Overlapping Quality Adjustment using Online Data

Example of a matched model30

035

040

045

050

0pr

ice

01nov2008 01feb2009 01may2009 01aug2009 01nov2009date

id = 16889102226 id = 16889102248

Page 28: Overlapping Quality Adjustment using Online Data

010

0020

0030

0040

00pr

ice

01apr2008 01oct2008 01apr2009 01oct2009date

Samsung TVs with substitutionMatched models

Page 29: Overlapping Quality Adjustment using Online Data

Online Index – Comparables60

7080

9010

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Price Change US CPI − Televisions

Price Index

Page 30: Overlapping Quality Adjustment using Online Data

Online Index – Comparables−

30.0

−25

.0−

20.0

−15

.0−

10.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Price Change US CPI − Televisions

Annual Inflation

Page 31: Overlapping Quality Adjustment using Online Data

Online Index – Comparables−

20.0

−10

.00.

010

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Price Change US CPI − Televisions

Monthly Index

Page 32: Overlapping Quality Adjustment using Online Data

Online Index with Non‐comparables

• Un‐weighted geometric mean of price relatives (Jevons)• Monthly data (1st day of each month)• Censored Spells• Non‐comparables: start with all products available on the first 

month, then substitute them as they disappear with the closest model (price and characteristics). 

• Treated as different goodsall quality change 

• Small sample: only “Samsung” brand (64 TVs)

Page 33: Overlapping Quality Adjustment using Online Data

Online Index with Non‐comparables60

7080

9010

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Price Change US CPI − Televisions

Price Index

Page 34: Overlapping Quality Adjustment using Online Data

Online Index with Non‐comparables−

40.0

−30

.0−

20.0

−10

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Quality US CPI − Televisions

Annual Inflation

Page 35: Overlapping Quality Adjustment using Online Data

Online Index with Non‐comparables−

20.0

−10

.00.

010

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Substitutions − All Quality US CPI − Televisions

Monthly Index

Page 36: Overlapping Quality Adjustment using Online Data

Online – Small Sample

• Un‐weighted geometric mean (Jevons Index)• Monthly data• Complete times series for each good• Small sample: – Only “Samsung” brand ‐ 64 tvs– Randomly select 10% of ids

Page 37: Overlapping Quality Adjustment using Online Data

Online – Small Sample60

7080

9010

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Small Sample US CPI − Televisions

Price Index

Page 38: Overlapping Quality Adjustment using Online Data

Online – Small Sample−

30.0

−25

.0−

20.0

−15

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Small Sample US CPI − Televisions

Annual Inflation

Page 39: Overlapping Quality Adjustment using Online Data

Online Index – Small Sample−

6.0

−4.

0−

2.0

0.0

2.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Small Sample US CPI − Televisions

Monthly Index

Page 40: Overlapping Quality Adjustment using Online Data

Online Index

• Un‐weighted geometric mean (Jevons)• Monthly data• Complete times series for each good• Large sample: all products available (577)

Page 41: Overlapping Quality Adjustment using Online Data

Online Index60

7080

9010

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index US CPI − Televisions

Price Index

Page 42: Overlapping Quality Adjustment using Online Data

Online Index−

30.0

−28

.0−

26.0

−24

.0−

22.0

−20

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index US CPI − Televisions

Annual Inflation

Page 43: Overlapping Quality Adjustment using Online Data

Online Index−

8.0

−6.

0−

4.0

−2.

00.

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index US CPI − Televisions

Monthly Index

Page 44: Overlapping Quality Adjustment using Online Data

Do we need daily data?

• Online index with daily prices

Page 45: Overlapping Quality Adjustment using Online Data

Online Index – Daily Data60

7080

9010

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Daily Online Index

Price Index

Page 46: Overlapping Quality Adjustment using Online Data

Online Index – Daily Data−

35.0

−30

.0−

25.0

−20

.0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Daily Online Index

Annual Inflation

Page 47: Overlapping Quality Adjustment using Online Data

Online Index – Daily Data−

8.0

−6.

0−

4.0

−2.

00.

0

04−01−2008 10−01−2008 04−01−2009 10−01−2009date

Online Index − Daily Online Index

Monthly Index

Page 48: Overlapping Quality Adjustment using Online Data

Preliminary Results

• An online index with simple overlapping quality adjustments can approximate the BLS TV index with hedonic adjustments

• Requires both uncensored price spells and a large number of closely comparable goods

• Daily data is not required, but it provides anticipation and can be used to detect price anomalies

Page 49: Overlapping Quality Adjustment using Online Data

Other Advantages

• No “new good bias”: Silver (98)  the launch of a new model affects the behavior of the old model

82

84

86

88

90

92

94

96

98

100

102

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17

Price

Month

Model 1 Model 2 Model 3

Page 50: Overlapping Quality Adjustment using Online Data

Next Steps

• Expand the analysis to other retailers, sectors, and countries

• What happens in sectors like Apparel, where there is little overlap?

Page 51: Overlapping Quality Adjustment using Online Data

51

STATE STREET ASSOCIATES