Predicting Purchase Behavior from Stated Intentions: A ...pages.stern.nyu.edu/~mkt/wps/MARK 05 04...

Predicting Purchase Behavior from Stated Intentions: A Unified Model

Baohong Sun and Vicki G. Morwitz1

1 Baohong Sun is Assistant Professor of Marketing at Kenan-Flagler Business School, University of North Carolina. Tel: 919-962-9579, Fax: 919-962-7186, Email: [email protected]. Vicki G. Morwitz is Associate Professor of Marketing at Leonard N. Stern School of Business, New York University. Tel: 212-998-0618, Fax: 212-995-4006, Email: [email protected]. We would like to thank seminar participants at the University of Chicago and Marketing Science Conference for valuable comments.

MARK-05-04

2

Predicting Purchase Behavior from Stated Intentions: A Unified Model

Abstract

Stated intentions are imperfect measures of actual purchasing. Intentions data can contain systematic biases, intentions can change over time and the correlation between intentions and actual purchase can be imperfect. Ignoring any of the discrepancies between intentions and purchasing can result in biased estimates of variable coefficients and biased forecasts of future demand. Previous models are either aggregate level models or only take into account a subset of these discrepancies. Thus, these models are limited in their ability to aid managers in targeting the right consumers with the right marketing offer. In this paper, we develop a unified model of the relationship between intentions and purchasing that (1) takes into account all possible sources of discrepancies between intentions and purchasing; (2) forecasts purchasing probability at the individual level by linking explanatory variables (e.g., socio-demographics, product attributes and promotion variables) and intentions with actual purchasing; (3) considers multiple levels of purchase decisions rather than the simple purchase / no-purchase decision. We empirically demonstrate that this model provides more accurate individual level purchase predictions and is therefore more useful for guiding targeting efforts.

MARK-05-04

3

INTRODUCTION Self-reported purchase intentions, because of their flexibility, ease-of-use, and

inexpensiveness, are one of the most widely used proxy measures of purchasing in sales

forecasts (Chandon, Morwitz and Reinartz, in press; Infosino 1986; Wittink 2001; Verhoefl and

Hans Franses 2002; Whitlark, Geurts and Swenson 1993) and new product tests (Jamieson and

Bass 1989; Silk and Urban 1978; Urban and Hauser 1980; Urban and Katz 1983). Intentions are

also used to segment markets and evaluate the effectiveness of promotions for different

individuals (Sewall 1978).

The widespread use of intentions to forecast actual purchasing relies on the strong

assumption that intentions are good indicators of individuals’ purchase behavior. A key question

then is whether self-reported intentions are reliable indicators of individuals’ subsequent

purchasing? If not, how should marketers combine stated intentions measures with other

available data to forecast respondents’ probability of purchase? Past research has shown that the

predictive validity of intentions is questionable (e.g. Belk 1985; Clawson 1971). Individual-level

purchase behavior differs from stated intentions and those individual-level discrepancies do not

cancel in the aggregate. This results in a discrepancy between the overall mean stated purchase

intention and the subsequent proportion of buyers. Moreover, models ignoring this discrepancy

will not only lead to inaccurate forecasts, but also provide biased estimates of the relationship

between correlates of intentions (e.g., socio-demographics, product attributes and promotional

variables) and purchasing (Hsiao and Sun 1999; Young, Morwitz and DeSarbo). Thus,

segmentation and promotion strategies made based on these models can be sub-optimal.

Previous research in marketing and psychology has identified three main reasons why

stated intentions and actual purchasing differ: (i) systematic intentions biases (systematic biases

MARK-05-04

4

in the reporting of stated intentions at the time of the survey, e.g., social desirability bias)

(Granois and Summers 1975; Hsiao and Sun 1999; Kalwani and Silk 1982; Morrison 1979); (ii)

changes in true intentions over time (changes in true intentions between the time of the survey

and the time of the purchasing associated with changes in explanatory variables, e.g., price

increase) (Manski 1990; Morrison 1979); (iii) an imperfect correlation between true intentions

and purchasing (e.g, the psychological difference between intentions and behavior) (Bagozzi and

Dholakia 1999; Gollwitzer 1999a,b).

[Insert Table 1 here]

Table 1 summarizes previous research on the intentions-behavior relationship and

highlights our contribution to this research stream. In an influential paper, Morrison (1979)

develops a modified version of the beta-binomial model that allows for the impact of exogenous

events on true intentions. Kalwani and Silk (1982) further analyze and apply Morrison's model to

different product categories. Bemmaor (1995) extends Morrison's (1979) model to allow for

heterogeneous switching probabilities. Infosino (1986) interprets intentions as a monotonic

transformation of a latent value (willingness-to-pay minus price) and examines the effect of

promotions on the probability of subsequent purchase. The purpose of most of these studies was

to identify rather than explain discrepancies between stated intentions and actual purchasing. In

addition, since all of these are aggregate-level models, they forecast the same purchase

probabilities for all respondents with the same stated level of intentions, and therefore they are

only useful for forecasting aggregate sales and not for identifying which individuals are more

likely to buy.

In recent years, a stream of models has emerged that examines the relationship between

intentions and behavior at the respondent or segment level. These models incorporate some of

MARK-05-04

5

the possible sources of discrepancies between these two variables. In a descriptive study,

Morwitz and Schmittlein (1992) segment respondents using demographic information, and

develop separate forecasts for each segment. Using binary and multinomial choice models,

Young et al (1998) and Hsiao and Sun (1999) demonstrate how to recover true intentions from

stated intentions when systematic intentions biases are present. Both models do not take into

account the possibility that true intentions may change over time nor the imperfect correlation

between true intentions and purchasing. The way we model systematic intentions biases is an

extension of Young et. al. (1998) and Hsiao and Sun (1999). However, there are substantive

differences between our model and theirs. First, their studies focus on modeling systematic

response biases and still rely on the assumption that true intentions equal purchasing. Second,

their models are only based on intentions survey data and the derived intention-purchase

relationship is not validated using purchase data. Third, their models assume intentions without

bias are equivalent to subsequent purchasing and therefore the purchase model is the same as the

intentions model.

Moreover, most existing aggregate and disaggregate models only consider the binary

purchase/no purchase decision. However, more and more companies are offering multiple levels

of products and services and managers are interested in predicting purchases for these different

levels, in addition to predicting simple purchase/ no purchase decisions. For example, in the

cellular phone market, an intentions survey data might ask: “Which usage plan to intend to sign

up for?”, with response options, “high usage plan,” “medium usage plan,” “low usage plan,” and

“no plan.” Alternatively, we observe marketers asking consumers about their purchase timing

decisions, with questions such as: “When do you intend to purchase the product?“ with response

MARK-05-04

6

options, “within six months,” “7-12 months,” or “not within a year.”2 These data cannot be

handled by conventional intentions models such as Morrison’s (1979) which were designed to

only consider purchase / no-purchase decisions.

In summary, to better use self-reported intentions for sales forecasting, new product

testing, promotion evaluation and targeting, there is a need for a unified individual level model

that (1) takes into account all possible sources of discrepancies between intentions and

purchasing; (2) forecasts purchasing probability at the individual level by linking explanatory

variables (e.g., socio-demographics, product attributes and promotion variables) and intentions

with actual purchasing; (3) considers multiple levels of purchase decisions rather than being

restricted to binary purchase / no-purchase decisions.

In this paper, we develop a conceptual framework of the intentions - purchase

relationship, and propose a unified individual-level intentions model that corrects for systematic

intentions biases, adjusts for changes in true intentions over time associated with changes in

related explanatory variables, allows for an imperfect correlation between intentions and

purchasing, and allows for multiple levels of purchase choices. We then develop a forecasting

method to predict individual purchase probabilities using intentions survey data and demonstrate

that this model predicts purchase probabilities at the individual level that are more accurate and

useful for guiding targeting effort than other models currently used in marketing. Thus, our

paper is the first paper that proposes a unified statistical framework to take into account the three

possible causes of discrepancies between stated intentions and purchasing and allows individual-

level correlates and intentions to explain purchasing. It unifies stated intentions and purchasing

by allowing them to be both directly and indirectly connected.

2 Morwitz and Schmittlein (1992) treat these alternatives as ordered choices. This is a reasonable assumption since consumers who state they intend to purchase within 6 months are also more likely to purchase than those who state intentions to purchase within 7-12 months and so on.

MARK-05-04

7

In the next section, we develop a conceptual framework linking explanatory variables,

stated intentions and actual purchasing. We then propose a unified statistical model to model the

discrepancies between stated intentions and actual purchasing. We next estimate the unified

purchase-forecasting model using intentions survey data related to automobiles and personal

computers. Finally we discuss managerial implications and opportunities for future research.

CONCEPTUAL FRAMEWORK

Systematic Intentions Bias

It is well known that stated intentions may not perfectly reveal respondents' true

intentions. Respondents may not reveal their true intentions because they (1) do not understand

the nature of their own preferences (Balasubramanian and Kamakura 1989; Kahneman and Snell

1992); (2) want to impress the interviewer; (3) try to guess the correct answer (e.g., what the

sponsor or interviewer would prefer them to say); or (4) simply misunderstand the question

(Andersen 1988, chapter 9). These effects can lead to a difference between an individual’s stated

and true intentions. The random component of measurement error in individual reports cancels

out but the systematic component persists at the aggregate level (Kalwani and Silk 1982). We

use the term systematic intentions bias, to refer to systematic differences between reported and

true intentions at the time of the survey.

The direction of the systematic intentions bias can be two-sided or one-sided. For

instance, when asked their intentions to buy a newly developed high-tech product, respondents

may not have sufficient knowledge and may construct impromptu answers in order to avoid the

embarrassment of not having an answer for the question. Thus, they may provide erroneous

answers that could be biased in either direction and the resultant stated intentions are

contaminated with two sided intentions bias. Alternatively, there could also be a “social

MARK-05-04

8

desirability bias” (Bagozzi 1994; Bagozzi, Yi and Lynch 1999) and the respondent may over-

report “desirable” behaviors and under-report “undesirable” ones (Sherman 1980). Thus, the

stated intentions are always skewed downward or upward. In such cases, the resultant stated

intentions are contaminated with one-sided intentions bias. 3

Change in true intentions over time

Even if respondents state their true intentions, it is very likely that, at the time of the

survey, they may not possess all the relevant information related to an intended future purchase.

Many unexpected events can occur between the time of the survey and the time of the purchase,

leading true intentions to change over time (Manski 1990; Morrison 1979). For example,

respondents’ true intentions might shift with an unanticipated change of price or improvement of

the product. Infosino (1986) suggests that events such as promotions shift the entire intentions

distribution. The unexpected promotion leads to increased purchasing because more of the

distribution is above the threshold required for purchase.

Imperfect correlation between true intentions and purchase

Even if there are no systematic intentions biases and true intentions do not change over

time, respondents' true intentions at the time of purchase may still differ from their actual

purchasing behavior. We attribute this difference to all unobservable factors that cause true

intentions to be an imperfect representation of actual purchasing. For example, this may be

related to the notion in psychology that there is a fundamental psychological difference between

forming an intention to perform a goal and achieving a goal (Bagozzi and Dholakia 1999;

Gollwitzer 1999a, b). In a purchase context, this may be due to unobservable factors; e.g. the

3 Of course, there might exist other forms of intentions bias, for example, mixtures of two-sided and one-sided biases where respondents randomly select an answer that is close to their true intention. In this paper, we only consider two-sided and one-sided intentions biases. We do not aim to model all forms of intentions biases.

MARK-05-04

9

intended product is out of stock (Bagozzi and Dholakia 1999) or the under-estimated cost of

searching for the right product.

STATISTICAL MODEL

In this section we propose a unified model to analyze categorical purchase intentions

data.4 Since it has become fairly common to collect intentions data measured in a graded order,

we assume that a respondent is confronted with a set of ordered levels j=0, …,M. The set of

respondents is denoted by n=1, 2, …, N.

Intentions Model

The basic framework we shall follow is the conventional random utility model (e.g.

McFadden 1974). Let Un be the (indirect) utility for the nth respondent. We assume that it can be

decomposed into two components.

nn

nnn

XU

ξαξµ+=

+= (2)

where µn is the deterministic part of the latent utility that determines true intentions, and Xn

denotes some socio-demographics, product attributes and marketing variables, e.g. product

properties, price, age, gender, present job category, etc. The coefficients of these variables are

defined by α. ξn represents all other unobservable random factors and is assumed to follow an

i.i.d. normal distribution.

Let the observed stated intention level yjn be defined as: 4 Note that conventionally intentions are measured on 5-point likelihood scales or 11-point probability scales and purchases are measured as binary (i.e., the product is purchased or not). We adopt a multinomial ordered probit model of intentions and purchasing for the following reasons: First, it is becoming more common for marketers to measure intentions with multiple levels or responses (e.g., usage categories, timing levels, etc.) Second, we want to develop a more general model that can incorporate situations with more than two intentions and purchase levels. The binary buy / no buy decision model is nested within our general model. Third, we can distinguish two-sided intentions bias and one-sided intentions bias models in the multinomial case while, in binary case, these two types of response bias models collapse to the same model. Our model, thus, is applicable to general (ordered) discrete choices of multi levels.

(1)

MARK-05-04

10

=.otherwise,0

,respondent by thechosen isintention level theif,1 nthjthy jn (3)

For notational convenience, we let *jny be an indicator denoting latent true intentions.

≤<

=.otherwise,0

if,1 ,1* jnj-jn

lUly (4)

where jl is the threshold for intention level j. Then we have the ordered probit model for true

intention, which is based on utility maximization.

)()()1( 1*

njnjjn llyP µµ −Φ−−Φ== − , (5)

for j=1, .., M-1 . Φ is the normal distribution function derived from the density function of ξn.

)()1( 0*0 nn lyP µ−Φ== and )(1)1( 1

*nMMn lyP µ−Φ−== − . We define )1( * == jnjn yPF , which is

the probability of having j as the true purchase intention.

The first model we consider corresponds to the case when respondents report their stated

intentions based on their true preference as defined by equation (2). That is,

jnjnjn FyPyP ==== )1()1( * (6)

We call the model in the above equation the random utility model. It corresponds to the case

when stated intentions are perfect indicators of true intentions.

In the second model, we allow for the existence of two-sided intentions bias. We assume

that with probability πj for j=0, …, M the nth respondent chooses the jth intention level

irrespective of his or her underlying preference, and that with probability 1-∑ =

M

j j0π , the

respondent provides his or her true intention, consistent with utility maximization. Therefore, the

probability of observing the nth individual choosing the jth level is given by

(7) ,)1()1(0∑ =

−+==M

j jnjjjn FyP ππ

MARK-05-04

11

for j=0, …, M. We call this second model the two-sided intentions bias model.5 Intuitively, with

the two-sided intentions bias model, the probability of observing stated intention j is the sum of

the probability of randomly choosing the jth intention and the joint probability of not giving a

random answer and choosing the jth intention based on the utility function.6

In the third model, we consider the case in which the intentions bias can be either positive

or negative. In the analysis to follow, we assume that respondents' stated intentions always over-

report their true intentions, however under-reporting can be modeled similarly. A respondent has

a probability πj to state random intentions j for j=0 …, M. The respondent always chooses the jth

intention level if the jth intention level is the higher of his or her random intention and true

intention.

∑∑==

−+==M

jijni

j

iinjjn FFyP ]1[][)1(

0ππ , (8)

for j=1, … M and

∑=

=−==M

jjnn yPyP

10 ).1(1)1( (9)

We call this the one-sided intentions bias model. Intuitively, respondents always report the

higher of their random intention and their true intention, and thus their reported intention can be

higher than their true intention. In other words, this model allows respondents to exaggerate their

stated intentions.

5 Hsiao and Sun (1999) provides a more detailed explanation on how to model intentions bias. Murthi and Srinivasan (1999) also develop a similar model to address different issues. 6 Note our focus is not to provide a process-level explanation on how respondents offer biased answers. Rather we adopt a statistical approach to take into consideration intentions bias. While equations (6), (7) and (8-9) represent

statistical assumptions we made in the model, we do not claim they represent the process by which respondents provide biased intentions.

MARK-05-04

12

Note that when πj is zero for all j, both the two-sided intentions bias model and the one-

sided intentions bias model become the random utility model as defined in equation (6). Thus the

random utility model is nested in both the two-sided and the one-sided intentions bias models.

This implies that, given an intentions survey, we can always assume the data are contaminated

with (constant) intentions biases and start with the two intentions bias models. If intentions

biases do not exist in the data, the estimates of πj for all j will not be significantly different from

zero. The usual model selection criteria such as AIC and BIC can be used to choose between the

two-sided and the one-sided intentions bias models.

Instead of assuming every respondent has the same probability πj of giving biased

intentions,, we allow πj to vary across respondents by making πj a function of Wn, a vector of

observable explanatory variables. We define

)( njjn Wγπ Φ= . (10)

)( njWγΦ takes a multinomial probit form for all j in π. Wn include variables representing the

types of respondents that are associated with providing biased responses to the intentions

question (e.g., product knowledge, education, gender). γj measures the effect of Wn on the

probability of stating biased intentions.

Using models (7) and (8-9), the underlying true intentions can be derived by

disentangling the systematic intentions bias from stated intentions. We next consider the portion

of the unified model that describes the relationship between intentions and actual purchase. We

note that this component was ignored in Young et. al (1998) and Hsiao and Sun (1999).

Purchasing Model

MARK-05-04

13

We assume that the purchase decision also follows a conventional random utility model.

Let 'nU be the (indirect) utility level for the nth respondent. Louviere et. al. (1999) note that the

fundamental latent constructs of preference and utility are quite stable and that, after controlling

for context effects, intentions can be used to predict actual choices. Hsiao, Sun and Morwitz

(2002) show that purchasing is a function of intentions. Based on these empirical findings, we

assume that the purchase decision depends on both latent true intentions at the time of the survey

Un and changes in explanatory variables Xn between the time of survey and the time of purchase

( nX∆ ). Thus, we have the following equations,

'

'

'

)'('

''

nnn

nnnn

nnn

nnn

XX

eXXeXU

eU

ξλα

βξλβαλβ

µ

+∆+=

++∆+=+∆+=

+=

where 'nµ is the deterministic part of latent purchase utility and '

ne denotes all the unobservable

factors that affect the purchase decision. We assume 'ne is normally distributed. The inclusion of

nX∆ captures changes in the explanatory variables which lead to a shift in the purchase utility

and hence can affect the subsequent purchase decision. Replacing Un by its expression as in

equation (2), give us equation (13). Letting βαα =' and nnn e'' += βξξ , we obtain Equation (14).

With appropriate normalization, we normalize the variance of 'nξ to one so that '

nξ is standard

normal. Since α can be estimated from the intentions model, we only need to estimate

parameters β and λ. Coefficient β measures the direct effect of latent true intentions and λ

measures the effect of changing explanatory variables on purchasing.

(11)

(12)

(13)

(14)

MARK-05-04

14

Let zjn be an indicator variable to denote that the nth respondent purchase the jth

alternative:

≤≤

=.otherwise,0

''' if,1 ,1 jnj-jn

lUlz (15)

where jl ' is the threshold for purchasing j. Then the probability of purchasing the jth

alternative is:

)''()'()1( 1'

njnjjn llzP µµ −Φ−−Φ== − (16)

for j=1,..,M-1 and Φ is the normal distribution function derived from the density function of 'nξ .

)'()1( '00 nn lzP µ−Φ== and )''(1)1( 1 nMMn lzP µ−Φ−== − . We define )1(' == jnjn zPF , which

is the probability of purchasing choice j.

There are unforeseeable events or unobservable factors that can lead to an imperfect

relationship between true intentions and purchasing. This is captured by the correlation between

nξ and 'nξ , which we define as ρ. We assume that

''2' 1 nnn ξρξρξ −+= (17)

where nξ and ''nξ are i.i.d. nξ captures unobserved events that occur at the time of survey. '

nξ

captures the new events that affect consumers' actual purchasing. ρ measures the persistence of

nξ between the time of the survey and the time of purchase. In one extreme case ρ=0,

0),cov( ' =nn ξξ which implies such events at the time of survey and at the time of purchase are

independent. In another extreme case ρ=1, nn ξξ =' , which implies events at the time of survey

and at the time of purchase are the same. In our model, we consider the general case without

putting any constraints on ρ .

MARK-05-04

15

Assuming that there are no systematic intentions biases, the joint probability that true

intentions equal j and actual purchasing equals j' at the time of the purchase is given by

njjnjjn FjjGjjGjjGjjGzyP '''''

'* '),1()1,()1,1(),()1,1( =−−−−−−+=== (18)

The function G(j,j') denotes the joint probability of njn l µξ −< and njn l ''' µξ −< .

Unified Model

The above equation gives the joint probability that true intentions equals j and actual

purchasing equals j' at the time of purchase. However, to complete our unified model, we also

need to consider the possibility of systematic intentions biases at the time of the survey. The joint

probabilities of stating intention j and purchasing 'j when two-sided biases are present are given

by:

∑ =−+===

M

j njjjnnjjnnjjn FFzyP0

''

''' )1()1,1( ππ (19)

Similarly, the joint probabilities of stating intention j and purchasing 'j when one-sided biases

are present are given by:

∑∑=

=−+===

M

jinjjin

j

i njjjnnjjn FFzyP ''0

''' ]1[][)1,1( ππ (20)

where ''njF and '

'njjF are given by equations (16) and (18). We call (19) and (20) the unified

models. We call these unified models since they integrate systematic intentions biases, changes

in true intentions over time and the imperfect correlation between true intentions and actual

purchasing in the same model. They also relate intentions with purchasing both directly and

indirectly.

The log-likelihood function is given by

MARK-05-04

16

∑ ∑ ===n j njjnnjjn zyPzyL

j

)]1,1(log[logmax '',,,, ρλγβα (21)

Note that at the heart of the unified model is a bi-variate probit model. For identification

purposes, we follow the convention and normalize one of the constants in both the intentions

and purchase utility functions to be zero, that is 00 =l and 0'0 =l . We also normalize the

variance of nξ and 'nξ to be 1.

PURCHASE PREDICTION USING INTENTIONS DATA

In the above model development, we assumed both stated intentions and purchase data

are available. However, firms are more interested in using intentions data to forecast individual

purchase probabilities in situations when actual purchase data are not yet available (e.g., prior to

a new product launch or before a change in the marketing program is made). Specifically, it is 'α

(or α given β) and forecasts of actual purchasing, after adjusting for discrepancies between

intentions and purchasing, that managers can use to make marketing and targeting decisions

before purchasing data are available. In practice, we assume we have a historical sample for

which both intentions and purchase data are available and apply our unified model to the

historical data to obtain estimates of α, β, γj, λ, and ρ. Once we have reliable estimates of β, λ

and ρ from the historical data that characterize the relationship between true intentions and

purchasing, we use them together with the new intentions survey to obtain new estimates of α

and γj to forecast future purchasing. To determine the probability that respondent n purchases at

level j' conditional on stated intention j, we use Bayes Law:

)1Pr()1,1Pr(

)1|1Pr( ''' =

======

jn

njjnjnnjnjj y

zyyzA (22)

MARK-05-04

17

where njjA ' denotes the conditional probability that the nth respondent purchases the jth

alternative conditional on his or her stated intention j. The term )1,1Pr( ' == njjn zy can be

obtained by calculating the joint probability of purchasing the jth alternative and stating the jth

intention alternative (equation 19 or 20 using known values of β, λ and ρ and the prediction

intentions data). The term )1Pr( =jny can be obtained by calculating the probability of stating

the jth intention (using equation 7 or 8-9 with prediction intentions data). Thus, the probability

that the nth respondent purchases the jth alternative ( njA ' ) is given by

)]1Pr([0

'' == ∑=

jn

M

jnjjnj yAA (23)

In general, the unified model can be applied to intentions survey data to forecast

purchasing by following a four-step procedure. (1) First, we estimate various systematic

intentions bias models (equations 7, 8-9) to determine whether any systematic intentions biases

exist in the stated intentions and, if so, in which direction. (2) Once we find out the direction and

magnitude of any systematic intentions biases, we then apply the corresponding unified model

(equation 19 or 20) to a historical sample of intentions and purchase to obtain estimates of β, λ

and ρ . (3) Next, we input β, λ and ρ together with the prediction intentions data and any other

available information such as demographic variables to the unified model again to obtain more

accurate estimates of α and γj. (4) Finally, we predict the individual-level purchase probabilities

using equation (23) and obtain more accurate estimates of 'α .

Note, in the historical sample, we have Xn measured both at the time of survey and at the

time of purchase. However, in the prediction sample, Xn is only measured at the time of the

survey. Fortunately, in practice, many variables that may cause true intentions to systematically

change over time are product and promotion related and may be known or even planned by

MARK-05-04

18

management (Infosino 1988). For example, management may plan to introduce a 30 percent

price cut for a personal computer a few months after the intentions survey. The price drop will

not affect the true intention measured at the time of the survey but will increase the demand at

the time of purchase. If management is interested in predicting sales one year after the survey,

they should take into account the price drop, or nX∆ , to obtain a more accurate forecast.

Since the same β, λ and ρ from the historical sample are used for prediction, we are

assuming the intentions-purchasing relationship we observed in the historical sample carries over

to the prediction sample. Thus the choice of the historical sample has to be consistent with the

above assumption. If historical data are not available, there are other ways to obtain a calibration

sample. For example, a two-stage study can be done. In the first stage, a pilot study can be

conducted where intentions and purchasing data are collected from a small number of

respondents. We then estimate β, λ and ρ from the pilot sample. In the second stage, we collect

intentions from a large sample of respondents and then use the prediction intentions data

combined with β, λ and ρ from the pilot study to forecast sales.

EMPIRICAL APPLICATION

Data Description

To demonstrate the predictive power of our unified model, we apply the model to two

different intentions survey datasets involving automobiles and personal computers. Multiple

waves of surveys were conducted to measure intentions to buy an automobile or a home personal

computer using two different but similar U.S. consumer mail panels. For the focal prediction

samples, we used stated intentions collected from a randomly selected 2000 households in the

fourth quarter of 1989 for the automobile data , and those from 3315 households collected in

January 1987 for the personal computer data. For the historical samples for which actual

MARK-05-04

19

purchase information is also available, we used stated intentions collected from 1000 households

in the fourth quarter of 1988 for the automobile data, and from 1105 households collected in

January 1986 for the personal computer data.

For the automobile data the intentions question asked: “When will the next new (not

used) CAR (not truck or van) be purchased by someone in your household?” with response

alternatives: “6 months or less, 7-12 months, 13-24 months, 25-36 months, Over 36 months, and

Never.” For the personal computer data, the intentions question asked: “Do you or does anyone

in your household plan to acquire a (another) personal computer in the future for use at home?”

with response alternatives: “Yes, in the next 6 months, Yes, in the next 7 to 12 months, Yes, in

the next 13 to 24 months, Yes, sometime, but not within 24 months, and No, will not acquire

one.” Whether an automobile was purchased was directly asked in each survey wave. For the

personal computer data, we used respondents' reports of product ownership to infer whether a

household purchased a computer in a given time period. We assume a household bought a

personal computer if they switched from being a non-owner to an owner from one wave to the

next.8

For the automobile data, we define the purchase intentions j (purchase 'j ) as 1 if the nth

consumer intends to purchase (actually purchases) an automobile within 12 months and 0 if he or

she does not intend to purchase ( does not actually purchase) within 12 months. For the personal

computer data, purchase intentions j and purchase 'j are constructed as follows: 9 10

8 For the personal computer data, we have excluded repeat purchasing since we cannot determine when a repeat purchase occurred with these data. Thus, we only include respondents who intend to purchase their first personal computer. 9 Note the major purpose of our study is to establish that our unified model, which is developed for general ordered discrete choices, provides more accurate purchase predictions at the individual level than existing forecasting models. The empirical applications are only for demonstration purposes. Our goal is not to provide a detailed accounting of consumers automobile and personal computer purchase timing decisions, but rather is to describe the unified model, and illustrate how it can be used. We treat timed purchase intentions as ordered purchase intentions.

MARK-05-04

20

=

.months 12 within occurs)(actually intended is purchase no if,0months 12-7ithin computer w personal a purchase (actually) tointendsconsumer nth theif,1

months 6within computer personal a purchase (actually) tointendsconsumer nth theif,2)'( jj

In addition to the intentions and ownership questions, extensive demographic information

was also collected including the size of household, annual household income, age of the head of

household, marital status, home ownership, household stage of life, occupation, education of the

head of household, race, number of cars owned, regional dummy variables, ownership of a

cellular phone, etc.

In the automobile data, we only have a single measurement of the demographic variables

and most of these demographic variables are unlikely to change over time. Thus, we assume no

change in the demographic variables. Fortunately, for the personal computer data, we do have

demographic information collected at each survey wave. In order to demonstrate how changes in

explanatory variables can lead true intentions to change over time, we analyze the impact of

changes in one particular demographic variable, the number of cars owned.

[Insert Table 2 Here]

Table 2 lists all the variables used in the estimation and provides some descriptive

statistics. Comparing the percent of respondents who stated an intention to buy with the

corresponding percent that actually purchased, we see that for both the historical and prediction

samples, intentions overstate purchasing. For example, in the personal computer prediction

This follows Morwitz and Schmittlein (1992) who use the same data set and treat the alternatives as ordered choices. As long as the intentions or purchases are discrete and ordered, our model should be applicable. 10 The reason we aggregate the multiple intention levels to a binary choice for the automobile data is that the data only allow us to construct binary purchase information (buy within a year or otherwise). Similarly, we aggregate intentions and purchase categories to 3 levels for PC data because the ownership information collected at each survey wave only allow us to construct 3 levels discrete purchase information. It is important to note that the categorization is only due to data limitation. It is not required for the application of our unified model. How the categorization affects the magnitude of our parameter estimates is another research topic, which we leave for future research.

MARK-05-04

21

sample, 15.26 percent of the respondents stated they would purchase a computer within 6-12

months but only 5.67 percent actually bought. This demonstrates that respondents exaggerated

their demand to buy in the future, suggesting that direct use of stated intentions as a proxy for

purchasing would lead to inaccurate sales forecasts.

RESULTS

Before we use the unified model to describe the relationship between intentions and

purchasing, we need first to determine whether any systematic intentions biases exist in the

reported intentions, and if so, in which direction. We estimate three models, namely, the random

utility model, the two-sided intentions bias model and the one-sided intentions bias model using

the historical data assuming constant πj. Loglikelihood ratio tests reject the hypothesis that πj =0.

We then compare log-likelihood values, AIC and BIC between the two competing intentions bias

models. We find that the one-sided intentions bias model provides a better fit indicating that

respondents were overstating their intentions of purchasing an automobile as well as a personal

computer at the time of the surveys. This is consistent with the existing literature that shows that

respondents often exaggerate their future demand for socially desirable and high-tech new

products. For the remainder of the paper, we will assume the existence of one-sided intentions

bias.

To examine whether admitting individual level explanatory variables, adding the

purchase model and considering the various discrepancies indeed improve model fit and

predictive accuracy, we estimate seven models. The first model is Morrison’s (1979) model

estimated using the MLE method proposed by Kalwani and Silk (1982). The second model is

Bemmaor’s (1995) model which represents the most recent development in aggregate intentions

models. Since these two aggregate models can only predict purchase / no-purchase decisions, we

MARK-05-04

22

modify our unified model by defining buying within 12 months as a purchase and all other cases

as a non-purchase within a year. The third model is our unified model ignoring purchase

information (λ=0 and ρ=1). It is similar to the most recent development in disaggregate models,

i.e., Hsiao and Sun (1999) for the multinomial case and Young et. al. (1999) for the binary case.

These models only focus on modeling response biases. Their objective is to derive *jny which is

defined by equation (2). The fourth model is our unified model ignoring systematic intentions

biases (πj=0 for all j). It still assumes intentions are directly and indirectly related to purchasing

and forecasts purchase based on Equation (23). The fifth is our unified model without changes in

true intensions over time (λ=0). The sixth model is our unified model assuming perfect

correlations between true intentions and purchasing (ρ=1). The seventh model is our unified

model, which nests models 3 to 6. The comparison of our unified model with these four nested

models shows the relative importance of each component in explaining the data. Note, for the

automobile data, since we do not have information on changes in explanatory variables to

explain changes in true intentions over time, we do not estimate model 5.

[Insert Table 3 here]

In Table 3 we report the model fit statistics of the competing models estimated using the

historical data. For discrete choices, research has shown that percentage correctly predicted (CP)

and Efron’s R2 are more powerful criteria for selecting the best fitting models than are AIC and

BIC (Amemiya 1985). 11 We report CP, Efron’s R2 and the simulated numbers of purchases for

the total sample (equation 23) as well as for each level of stated intentions (equation 22). We

11 Efron’s R-Square is calculated as ∑ ∑∑ ∑

= =

= =

−

−− N

n

M

j jnjn

N

n

M

j jnjn

YY

FY

1 02

1 02

)(

)ˆ(1 . It conveys the proportion of the variance of the

dependent variable explained by the independent variable and is a more reliable model selection criterion for discrete choice models (Amemiya 1985). Hence, the higher Efron’s R2, the more desirable the model.

MARK-05-04

23

compare them with the observed numbers from the sample. It is important to note that i) Models

1 and 2 are not directly applicable to the automobile data which has binary intentions;12 ii)

Models 1 and 2 only offer probabilities of binary purchase / no-purchase decisions at each

intentions level. To obtain the number of simulated purchases, we use the total number of

respondents times the simulated percentages. This is different from what we do for the

disaggregate models, for which we simply count the number of consumers who are simulated to

purchase.

We first compare Models 1 and 2 with the unified model (Model 7). Both CP and Efron’s

R2 indicate that the unified model outperforms Models 1 and 2. This suggests that model fit can

be improved by further disaggregating the data from the intentions level to the individual level.

We note that aggregate models can also be quite accurate in predicting aggregate sales. This is

because these models are designed to fit aggregate sales instead of modeling individual behavior.

The advantage of disaggregate models is that they predict purchase probability at the individual

level which is more useful for guiding marketing and targeting efforts.

Model 3 represents existing disaggregate models (it is the same as Young et. al. for the

automobile data and the same as Hsiao and Sun for the personal computer data). CP, Efron’s R2

and the predicted total purchases indicate that our unified model improves existing disaggregate

models. The unified model outperforms Models 4 to 6 suggesting that it is important to consider

all three discrepancies when characterizing the relationship between intentions and purchasing.

Comparisons of the improvement of model selection criteria for Models 3, 4, 5, and 6 with

Model 7 suggest that adding the purchase model and considering systematic intentions bias are

the most important elements for improving data fit, followed by incorporating changes in true

12Bemmaor’s model is applicable to data with more than 2 intentions levels. For binary intentions, Bemmaor’s model can help specify upper and lower bounds for aggregate proportion of purchasers.

MARK-05-04

24

intentions over time and incorporating an imperfect intentions-purchasing correlations.13 Since

Model 7 is the best fitting model, we focus just on Model 7 when we discuss the estimation and

prediction results in the following discussion.


In Table 4 we report the estimation results using the historical data for all the competing

models. Assuming one-sided intentions bias, education has a negative impact on the probability

of stating biased intentions indicating that higher educated respondents state more accurate

intentions. Perhaps more educated respondents can better understand the question and therefore

can more accurately formulate their answers. We calculated the average probability of giving

biased intentions. For example, for the personal computer data, π1 and π2 are calculated to be 3.1

and 11.9, respectively, indicating that there is a 3.1 percent probability that respondents state an

intention to buy a computer within 6 months while their true intention is to either to buy within 7

to 12 months or not buy within a year. There is an 11.9 percent chance that respondents state an

intention to buy within 7 to 12 months while their true intention is not to buy within a year.

For the personal computer data, in order to demonstrate the impact of changes in

explanatory variables over time, we looked at how changes in the number of cars owned affected

the probability of purchasing a personal computer. Assuming these changes are foreseeable, we

found that buying additional cars in the recent past decreases the probability of acquiring a

personal computer in the near future (λ=-0.021). We think this happens because purchasing a car

reduces a household’s disposable income, which in turn is likely to lower their propensity to buy

an expensive product like a personal computer. Ideally, one would look at how changes in

product and promotion related variables systematically alter respondents’ true intentions over

13 We caution against generalizing the results on the relative importance of including these components in improving data fit because they are likely to be product specific.

MARK-05-04

25

time. However, in our data sets we only had demographic variables available and the number of

cars owned was the only variable that changed over a year and had a significant impact on

purchasing. We believe that the inclusion of product and promotion related variables would

enhance the results.

ρ is estimated to be 0.732 (t=3.12) for the automobile data and 0.548 (t=22.42) for the

personal computer data. The log likelihood ratio tests reject the hypothesis that ρ=1, indicating

that true intentions and actual purchasing are positively but not perfectly correlated even after we

take into account systematic intentions biases and changes in true intentions over time.14 This

suggests that unobserved shocks in the environment affect both true intentions and actual

purchasing in the same direction but not identically in magnitude.

In the intentions model, all the demographic variables are significant and have the

expected signs. Variables such as marital status, occupation, home ownership, number of cars,

family size, education level, gender of head of household, social status and income are all

significantly related to purchase intentions for both products.

In the purchase model, parameter β is estimated to be 1.109 (t=5.44) for the automobile

and 1.497 (t=4.01) for the personal computer data, implying that latent true intentions

significantly and positively affect purchasing. This confirms Hsiao, Sun and Morwitz (2002) that

intentions can be used to predict purchase. It is important to note that 'α , which is equal to α

multiplied by a scalar β, denotes the vector of parameters representing the effect of descriptor

variables on purchase (i.e., in our case socio-demographic variables, but in general this could

include product and promotion related variables as well). It is the more accurate estimates of

14 The significance level is under-stated because ρ was estimated as a bounded variable.

MARK-05-04

26

these variables that can help managers target the right consumers more efficiently with a more

effective marketing strategy.


After calibrating the unified model using historical data and obtaining estimates of

β, λ and ρ, we then re-apply the unified model to the focal prediction sample (where only

intentions data are available) to obtain individual sales forecasts and more accurate estimates of

'α . In Table 5, we report CP, Efron’s R2 and the predicted number of purchases. The comparison

results of prediction power across the seven competing models are similar to those described

earlier for model fit. All the prediction criteria show that the unified model predicts actual

purchasing more accurately than the other models. The relative importance of including the

various components in our model in improving prediction power as we go from Model 3 to 7

follows the same trend as we saw when we discussed data fit. Thus, by recognizing the different

kinds of discrepancies between intentions and purchasing and by allowing intentions and

purchasing to be directly and indirectly connected, the unified model is able to provide more

accurate purchase predictions.

Since our random utility framework allows purchasing to depend on individual

characteristics, we can examine what type of consumers are more likely to purchase and

calculate individual probabilities of purchase using equation (23). To concisely present the

prediction results, we do not report purchase probabilities for each respondent but instead report

gains charts for the disaggregate models (Models 3 to 7) in Figure 1.15 To obtain the gains charts,

we begin by selecting the 10 percent of the respondents from the calibration sample who,

according to the predicted purchase probabilities from the model, are most likely to make a

15 Those interested in reading more about the gains chart methodology are referred to Drozdenko and Drake (2002).

MARK-05-04

27

purchase. Next we compute the number of accurately predicted purchases from this group,

relative to the total number of purchases in the entire sample. This percentage is the gain due to

using the model and represents the value of sorting the predicted purchase probability based on

the model for more precise targeting. We repeat this for each model. Analogous values are also

computed for each percentile of the holdout sample (top 20%, top 30% etc.). The “No Model”

line depicts a situation in which customers are grouped randomly. So, if we have a randomly

drawn group that constitutes 10% of the sample it will on average contain 10% of the overall

number of purchases etc. The greater the difference between the gains curve and the baseline

model, the better the model in guiding targeting efforts.

The results of this analysis show that the accuracy of predicting individual purchases for

our unified model is superior to that of the baseline case and the competing disaggregate models.

The results are robust at each percentile. Models 1 and 2 are aggregate models that cannot give

individual purchase probabilities. We therefore cannot include them in the gains chart to show

their value in guiding targeting efforts. While aggregate models can be useful for forecasting

aggregate sales and helping to make go / no-go new product launch decisions, disaggregate

models are useful in guiding marketing and targeting efforts in addition to offering more accurate

overall sales forecasts.

GENERAL DISCUSSION

“For any curious human being, asking questions is easy. But for a professional

researcher, it can be a daunting challenge fraught with innumerable chances to destroy a study's

validity (Anderson 1988).” Marketing decisions made based on raw, unadjusted self-reported

intentions can be sub-optimal. In this paper, we propose a unified model that simultaneously

takes into account systematic intentions biases, changes in true intentions over time, and the

MARK-05-04

28

imperfect correlation between true intentions and actual purchasing. It also unifies stated

intentions and purchasing by allowing them to be both directly and indirectly connected.

Specifically, the unified model allows us to (1) recover underlying true intentions from biased

stated intentions, (2) obtain more accurate estimates of marketing and socio-demographic

coefficients, which characterize which consumers are more likely to purchase and how

marketing strategies affect their purchase decisions, and (3) derive individual multi-level

forecasts of actual purchase probabilities. The proposed model outperforms models existing in

the literature that only consider a subset of the intentions-purchase discrepancies discussed in

this paper. This model helps to make purchase intentions a better-understood and more useful

tool for forecasting actual purchasing and targeting the individual consumers.

Our results offer implications for managers who rely on intentions to forecast sales. First,

managers should be aware that intentions surveys are usually contaminated with systematic

intentions biases, especially for new products for which consumers do not have enough

knowledge, durable products for which consumers do not have enough experience, or socially

sensitive products for which consumers may want to hide their true intentions to comply with

social norms. To minimize the likelihood that respondents report biased intentions, care should

be taken in survey design to provide as much information about the product or privacy to

respondents as possible. Second, managers should know that the longer the period between the

intentions survey and time period of interest for measuring behavior, the less accurately stated

intentions will reflect subsequent purchasing. Managers can either reduce the time between these

two events or closely track the possible factors that may lead true intentions to change over time.

Third, managers should consider factors that affect the correlation between true intentions and

purchasing. For example, if it is significantly harder for consumers to actually pay for and buy a

MARK-05-04

29

product than it is to hold a true intention to buy, managers should develop marketing programs

that ease the burden of purchasing and facilitate the transaction, e.g., no-payment-no-interest-for-

a-year programs for expensive products and education programs for new and technically

complex products. Fourth, given limited resources, managers should focus their marketing

efforts on the individual customers or segments of consumers who are predicted to be most likely

to purchase the product.

Our model is limited by several simplifying assumptions, which in turn, offer avenues for

future research. First, more general model can be developed to nest both one-sided and two-

sided intentions bias model. Future extensions can also be made to consider unordered choices.

We should note, however, that the one-sided intentions bias model is no longer meaningful if the

choices are not ordered. Second, we can modify the model to accommodate brand choices. Third,

we can incorporate the finding that asking consumers purchase intent questions has an impact on

their actual purchase incidence (Chandon, Reinartz and Morwitz; Fitzsimons and Morwitz 1996;

Morwitz, Johnson and Schmittlein 1993; Louviere et. al. 1999; Sherman 1980). Fourth, we can

better take into account consumer heterogeneity by allowing the model parameters to be

heterogeneous across consumers (Hutchinson, Kamakura and Lynch 2000, Jedidi, Jagpal and

DeSarbo 1997). Fifth, the purchase timing contained in the data sets can also be formally

modeled. Sixth, if one has multiple measurements of consumers’ latent true intentions, latent trait

or itemized response models can be used to study the relationship between intentions and

purchasing (Bagozzi, Yi and Nassen 1999). Finally, it would be interesting to apply the unified

model to multiple product categories and study how the nature of the systematic intentions bias,

changes in true intentions over time and the imperfect correlation between true intentions and

purchasing varies across different types of product categories.

MARK-05-04

30

REFERENCES

Amemiya, Takeshi (1985), Advanced Econometrics, Harvard University Press, Cambridge, MA. Anderson, Alan R. (1988), Cheap But Good Marketing Research, Dow Jones, Irwin, 117-192. Armstrong, J. Scott, Vicki G. Morwitz, and V. Kumar (2000), “Sales Forecasts for Existing Consumer Products and Services: Do Purchase Intentions Contribute to Accuracy?” International Journal of Forecasting, 16, 383-397. Bagozzi, Richard (1994), “Measurement in Market Research: Basic Principles of Questionnaire Design,” Principles of Marketing Research,edited by Richard Bagozzi, Blackwell Business, 1-49. Bagozzi, Richard and Utpal Dholakis (1999), “Goal Setting and Goal Striving in Consumer Behavior,” Journal of Marketing, 63 (Special Issue), 19-32. Bagozzi, R.P., Y. Yi and K.D. Nassen (1999), “Representation of Measurement Error in Marketing Variables: Review of Approaches and Extension to Three Facet Designs.” Journal of Econometrics, 89, 399-421. Balasubramanian, Siva and Wagner A. Kamakura (1989), “Measuring Consumer Attitudes Towards the Marketplace with Tailored Interviews,” Journal of Marketing Research, August, 311-26. Baumgartner Haas and Jan-Benedict E. M. Steenkamp (2000), “Response Styles in Marketing Research: A Cross-National Investigation”, Journal of Marketing Research, 34, 219-232. Belk, Russell W. (1985), “Issues in the Intention-Behavior Discrepancy,” Research in Consumer Behavior, Vol. 1, edited by Jagdish N. Sheth, Greenwich, CT: JAI Press, 1-34. Bemmaor, Albert C.(1995), “Predicting Behavior From Intention-to-Buy Measures: The Parametric Case,” Journal of Marketing Research, 32, 176-191. Clawson, C. Joseph (1971), “How Useful Are 90-Day Purchase Probabilities?” Journal of Marketing, 35, 43-47. Chandon, Pierre, Vicki G. Morwitz, and Werner J. Reinartz (forthcoming), “The Short- and Long-Term Effects of Measuring Intent to Repurchase” Journal of Consumer Research Fishbein, Morris and Icek Ajzen (1975), “Belief, Attitude, Intention, and Behavior,” Reading, MA: Addison-Wesley Publishing Company. Fitzsimons, Gavan J. and Vicki G. Morwitz (1996), “The effect of measuring intent on brand-level purchase behavior,” Journal of Consumer Research, 23, 1-11.

MARK-05-04

31

Ganster, Daniel C., Harry W. Hennessey and Fred Luthans (1983), “Social Desirability Response Effects: Three Alternative Models,” Academy of Management Journal, 26, 321-331. Gollwitzer, Peter M. (1999a), “Implementation Intentions,” American Psychologist, 54(July), 493-503. Gollwitzer, Peter M. (1999b), “Action Phases and Mind-Sets,” in E.T. Higgins and R.M. Sorrentino (Eds.), Handbook of Motivation and Cognition (Vol. 2), New York: Guilford, 53-92. Granbois, Donald H. and John O. Summers (1975), “Primary and Secondary Validity of Consumer Purchase Probabilities,” Journal of Consumer Research, 1, 31-38. Haaijer, Marinus E. Wagner A. Kamakura and Michel Wedel (1999) “The Information Content of Response Latencies in Conjoint Choice Experiments,” Journal of Marketing Research 37(3) 376-82. Hsiao, Cheng and Baohong Sun (1999), “Modeling Response Bias with an Application to High-tech Product Survey Data,” Journal of Econometrics, 89, 1-2, 15-39. Hutchinson, J. Wesley, Wagner Kamakura and John Lynch (2000), “Unobserved Heterogeneity as An Alternative Explanation for “Reversal” Effect in Behavior Research.” Journal of Consumer Research, 27, December, 324-344. Infosino, William J. (1986), “Forecasting New Product Sales From Likelihood of Purchase Ratings,” Marketing Science, 5(Fall), 372-384. Jamieson, Linda F. and Frank M. Bass (1989), “Adjusting Stated Intention Measures to Predict Trial Purchase of New Products: A Comparison of Models and Methods,” Journal of Marketing Research, 26(August), 336-345. Jedidi, Kamel, Harsharanjeet S. Jagpal and Wayne S. DeSarbo (1997), “Finite-Mixture Structural Equation Models for Response-Based Segmentation and Unobserved Heterogeneity.” Marketing Science, 16 (1), 39-59. Kahneman, Daniel and Jackie Snell (1992), “Predicting a Changing Task: Do People Know What They Will Like?” Journal of Behavioral Decision Making, 5, 187-200. Kalwani, Manohar U. and Alvin J. Silk (1982), “On the Reliability and Predictive Validity of Purchase Intention Measures,” Marketing Science, 1(Summer), 243-286. Klein, Roger and Robert Sherman (1997), “Estimating New Product Demand from Biased Survey Data,” Journal of Econometrics, 76, 1-2(Jan/Feb), 53-76. Lock, Edwin A. and Gary P. Latham (1990), A Theory of Goal Setting and Task Performance, Englewood Cliffs, NJ: Prentice Hall.

MARK-05-04

32

Louviere Jordan, Robert Meyer, David S. Bunch, Richard Carson, Benedict Dellaert, W. Michael Hanemann, David Hensher and James Irwin (1999), “Combining Sources of Preference Data for Modeling Complex Decision Processes,” Marketing Letters, 10:3, 205-217. Manski, Charles F. (1990), “The Use of Intentions Data to Predict Behavior: A Best Case Analysis,” Journal of the American Statistical Association, 85(December), 934-940. McFadden, Daniel (1974). “Conditional Logit Analysis of Qualitative Choice Behavior,” Frontiers in Econometrics, edited by P. Zarembka, New York: Academic Press, 105-142. Morrison, Donald G. (1979), “Purchase Intentions and Purchase Behavior,” Journal of Marketing, 43(Spring), 65-74. Morwitz, Vicki G. (1997) “It Seems Like Only Yesterday: The Nature and Consequences of Telescoping Errors in Marketing Research,” Journal of Consumer Psychology, 6(1), 1-30. Morwitz, Vicki G. and David Schmittlein, (1992), “Using Segmentation to Improve Sales Forecasts Based on Purchase Intent: Which ‘Intenders’ Actually Buy?” Journal of Marketing Research, 29, 391-405. Morwitz, Vicki G., Eric Johnson and David Schmittlein, (1993), “Does Measuring Intent Change Behavior?” Journal of Consumer Research, 22, 46-61. Murthi, B.P.S. and Kannan Srinivasan (1999), “A Study of Consumers' Extent of Evaluation in Brand Choice Decision: A Multi-State Evaluation Choice Model”, Journal of Business, 72, 2, 229-256. Orbell, Sheina, Sarah Hodgkins, and Paschal Sheeran (1997), “Implementation Intentions and the Theory of Planned Behavior”, Personality and Social Psychology Bulletin, 23, 945-954. Sewall, Murphy A. (1978), “Market Segmentation Based on Consumer Ratings of Proposed Product Design,” Journal of Marketing Research, 15(November), 557-564. Sherman, Steven J. (1980), “On the Self-erasing Nature of Errors of Prediction,” Journal of Personality and Social Psychology, 39 (August), 2111-221. Silk, Alvin J. and Glen L. Urban (1978), “Pre-Test Market Evaluation of New Packaged Goods: A Model and Measurement Methodology,” Journal of Marketing Research, 14 (January), 1-29. Urban, L. Glen and John R. Hauser (1980), Design and Marketing of New Products, Englewood Cliffs, NJ: Prentice-Hall, Inc. Urban, L. Glen and Gerald M. Katz (1983), “Pre-Test-Market Models: Validation and Managerial Implications,” Journal of Marketing Research, 20(August), 221-234.

MARK-05-04

33

Verhoef1, Peter C. and Philip Hans Franses (2002), “On combining revealed and stated preferences to forecast customer behavior: three case studies,” working paper, Erasmus University Rotterdam. Wittink D.R. (2001), Forecasting with conjoint analysis, Principles of Forecasting: A Handbook for Researchers and Practitioners (ed. Armstrong J.S.), Boston: Kluwer, 147-167 Whitlark, David B. Michael D. Geurts and Michael J. Swenson (1993), “New Product Forecasting with a Purchase Intention Survey,” Journal of Business Forecasting, 12, 3(Fall), 18-21. Young, Martin, Wayne S. DeSarbo, and Vicki G. Morwitz (1998), “The Stochastic Modeling of Purchase Intentions and Behavior,” Management Science, 44, 2 (February), 188-202.

MARK-05-04

34

Table 1. Comparison of Our Model to Past Intentions Models

Sources of Discrepancy Level of

Modeling Systematic intentions

bias

Changes in true

intentions over time

Imperfect correlation between

true intentions

and purchasing

Modeling Approach

Allow for Explanatory

Variables

Data Application

Results

Morrison (79) Aggregate at intentions level

Y

Y Y Beta-binomial

N (aggregate)

Auto Linear relation between stated intentions and true intentions

Kalwani and Silk (82)

Aggregate at intentions level

Y Y Y Beta-binomial

N (aggregate)

Durable and packaged goods

Heterogeneous piecewise linear relation across product categories

Bemmaor (95) Aggregate at intentions level

Y Y Y Beta-binomial

N (aggregate)

Durable goods, services, and

other activities

Heterogeneous switching probabilities; Upper and lower bounds

Infosino (86) Aggregate at intentions level

Y N Y Probit N (aggregate)

Service Systematic promotion effects

Morwitz and Schmittlein (92)

Aggregate at intentions level

N N N Segment Y (individual)

Durable goods Forecast sales based on segmentation

Young et. al. (98) Individual Y N N Bayesian Y (individual)

Durable goods

Improved estimates αa and better purchase prediction

Hsiao and Sun (99) Individual Y N N Logit Y (individual)

Telecomm Improved estimates of α and better purchase prediction

This Paper Individual Y Y Y Multivariate Ordered Probit

Y (individual)

Automobile Personal

Computer

More accurate estimates of α’ and more accurate forecasts of individual purchase probabilities

a. α refers to the estimated coefficients of explanatory variables, e.g. socio-demographics, product attributes and marketing variables.

MARK-05-04

35

Table 2: Descriptive Statistics

Automobile

Mean or Percentage

Variable Name

Variable Descriptions Historical Sample

Prediction Sample

Number of observations 2000 1000 INTENTION 1 0 PURCHASE 1a 0

intend to purchase automobile within 12 months do not intend to purchase automobile within a year purchase automobile within12 months after survey do not purchase automobile within a year after survey

9.87% 90.13% 5.01%

94.99%

9.54% 90.46% 5.13%b 94.87%

MARITAL1 FEMPL OCC1 OCC5 OCC6 EDU LIVE3 LIVE4 OWN1 INCOME YEAR NUMCARS

Married couple Female is full-time or part-time employed Household head education – manager, professional Household head education – craft and repair Household head education – operator and laborer Education level of household head (0=less than grade school; 1=Grade school; 2=Grad grade school; …; 7=post graduate university) Type of residence – mobile home Type of residence – condominium Own home Income (1=<10000; 2=10000-19999; 3=20000-29999; 4=30000-44900) Years of car currently in household Number of cars currently owned by the household

57.95% 31.1%

23.01% 7.0%

8.22% 4.11

6.13% 2.14% 7.58% 2.86

2.84 2.02

55.32% 29.4%

25.14% 6.88% 8.53% 4.27

5.99% 2.06% 7.36% 2.71

2.99 1.86

MARK-05-04

36

Personal Computer Mean or Percentage

Variable Name

Variable Descriptions

Historical Sample Prediction Sample

Number of observations 3315 1105 INTENTION 2 1 0 PURCHASE 2a 1 0

intend to purchase PC within 6 months intend to purchase PC within 7 to12 months do not intend to purchase PC within a year purchase PC within within 6 months after survey purchase PC within 7 to12 months after survey do not purchase PC within a year after survey

7.52% 13.97% 78.51% 6.69% 5.51%

87.80%

7.38% 15.26% 77.36% 6.42%b 5.67%

87.91%

CARS BABY YOUNG EDUCATION LGSIZE NEW-HOUSEHOLD UPSCALE MIDAGE-NO KIDS PROFESSIONAL CLERICAL WORKING-HOURS MALE-HEAD WHITE-COLLAR INCOME

number of cars family has children under 6 year-old age <= 30 householder education(0=less than grade school; 1=Grade school; 2=Grad grade school;… ;7=post graduate university) large size family with more than 6 members household's life cycle - new household household's life cycle - upscale families household's life cycle - mid aged without children householder occupation -- professional householder occupation -- clerical number of working hours of householder household head is male household head is white collar household income

1.62 16.95% 31.78%

4.19b

0.093% 14.85% 22.80% 23.52% 24.80% 28.20%

2.50 79.44% 34.65% $38,868

1.67 16.54% 32.61%

4.25

0.117% 14.37% 21.46% 22.65% 25.37% 29.61%

2.61 76.79% 35.42% $37,932

a. For the personal computer data, purchase is inferred from a change in self reported ownership. For the automobile data, purchase is directly measured. b. Note the purchase information for the prediction sample is for validating predictive accuracy. In applying the unified model to the focal prediction sample, we

assume this information is not available.

MARK-05-04

37

Table 3. Comparison of Data Fittinga Automobileb

(1) Morrison’s

Model

(2) Bemmaor’s

Model

(3) Ignore

Purchase Information

(4) Ignore

Intentions Bias

(6) Ignore

Imperfect Correlation

(7) Unified Model

Model Selection -Log-L value AIC BIC

1731.0 1735.0 1746.2

NA

895.8 908.8 945.2

1722.5 1735.5 1771.9

1684.3 1697.3 1750.5

1664.5 1677.5 1734.5

CP Intenders Nonintenders Effron’s R2

Intenders Nonintenders Derived Purchases Purchases 1 0 Intentions 1 197 87 110 0 1803 13 1790

0.870 0.715 0.887

0.086 0.077 0.087

NA NA

0.900 0.818 0.909

0.086 0.068 0.088

NA NA

0.871 0.824 0.875

0.081 0.044 0.085

93 106 16 1785

0.863 0.841 0.862

0.079 0.042 0.083

94 106

16 1787

0.912 0.864 0.915

0.086 0.049 0.090

91 106

15 1788

0.963 0.887 0.968

0.092 0.064 0.095

89 108

14 1789 Personal Computer

a. The results in Tables 3 and 4 are obtained using historical samples in which both intentions and purchase data are available. b. There are 2000 households in the historical sample for the automobile data. There are 3315 households in the historical sample for the personal computer data. c. 151 is the first entry of the 3x3 matrix. It indicates that out of the 249 respondents stating an intention of purchasing within 6 months (intention 2), 151 actually made a purchase within 6 months. d. Since Model 1 and 2 can only predict purchase and no purchase, we treat purchase within 12 months as purchase. In addition, these models can only predict percentage of purchase without

knowing who are more likely to purchase. We obtain 164 using 249x66%. 164 indicate that out of the 249 respondents stating intention 2, 66% or 164 actually made a purchase within 12 months. This number should be compared with 151+28 in the sample, 154+30 in the unified model, and so on.

(1) Morrison’s

Model

(2) Bemmaor’s

Model

(3) Ignore


(4) Ignore

Intentions Bias

(5) Ignore true

intention shift

(6) Ignore


(7) Unified Model

Model Selection -Log-L value AIC BIC

2244.5 2248.5 2260.7

NA

1045.9 1062.9 1114.8

2238.6 2254.6 2303.4

2166.5 2186.5 2247.6

2150.0 2169.0 2227.0

2131.1 2150.1 2208.1

CP Intenders 1 Intenders 2 Nonintenders Effron’s R2

Intenders 1 Intenders 2 Nonintenders Derived Purchases Purchases 2, 1 0 Intentions 2 249 151c 28 70 1 463 28 145 290 0 2603 43 10 2550

0.890 0.710 0.841 0.916

0.082 0.060 0.077 0.085

66% (164d) 35% (162)

3% (78)

0.910 0.788 0.852 0.932

0.084 0.072 0.079 0.086

69% c(172) 37% (171) 5% (130)

0.88 0.846 0.842 0.890

0.075 0.070 0.072 0.076

159 34 56 34 149 280 38 13 2552

0.87 0.841 0.801 0.885

0.073

0.0644 0.072 0.074

156 33 60 33 148 282 40 13 2550

0.91 0.863 0.851 0.925

0.082 0.072 0.082 0.083

156 31 62 33 147 283 41 11 2551

0.92 0.871 0.862 0.935

0.084 0.0754 0.083 0.085

155 30 64 31 148 284 40 12 2551

0.95 0.897 0.894 0.965

0.088 0.0776 0.088 0.089

154 30 65 30 146 287 41 11 2551

MARK-05-04

38

Table 4. Maximum Likelihood Estimates of Competing Models

Automobile

Variables

(3) Ignore Purchase

Information

(4) Ignore Intentions

Bias

(6) Ignore Imperfect

Correlation

(7) Unified Model

INTENTION MODEL α THRESHOLD1 l1 MARITAL1 FEMPL OCC1 OCC5 OCC6 LIVE3 LIVE4 OWN1 INCOME YEAR π constant EDU PURCHASE MODEL β ρ

0.505(2.27)a -1.601 (2.52) 0.129(2.23) 0.305(2.40) -0.480(1.97) -0.288(2.65) 0.731(2.43) 0.952(2.79) -0.194(1.99) 0.166(2.06) 0.118(2.14)

0.0514

-2.206(2.42) -0.183(2.06)

0.426(2.21) -1.612 (2.43) 0.133(3.04) 0.309(2.42) -0.492(2.38) -0.296(2.88) 0.711(2.74) 0.988(3.05) -0.174(1.91) 0.166(2.00) 0.111(2.43)

0.834(4.17)

0.541(2.11)

0.427(3.45) -1.627 (2.72) 0.140(3.50) 0.311(2.60) -0.480(2.45) -0.301(2.90) 0.710(3.24) 1.061(3.53) -0.160(2.37) 0.165(2.21) 0.111(2.97)

0.0465

-2.320(2.64) -0.188(2.41)

1.057(4.09)

0.817(4.03) -1.732 (3.02) 0.139(3.48) 0.295(3.01) -0.477(2.55) -0.310(2.88) 0.712(3.22) 1.055(3.15) -0.159(2.38) 0.177(2.30) 0.101(2.99)

0.0423

-2.331(2.01) -0.192(2.35)

1.109(5.44) 0.732(3.12)

MARK-05-04

39

Personal Computer

a. T-statistics are reported in the parenthesis.

Variables

(3) Ignore Purchase

Information

(4) Ignore Intentions

Bias

(5) Ignore changes in

intentions

(6) Ignore Imperfect

Correlation

(7) Unified Model

INTENTION MODEL α THRESHOLD1 l2 THRESHOLD1 l1 CARS BABY LGSIZE EDUCATION NEW-HOUSEH. PROFESSIONAL WORKING-HOUR MALE-HEAD WHITE-COLLAR INCOME CELLULAR π1

CONSTANT EDUCATION π2

CONSTANT EDUCATION PURCHASE MODEL β λ CHANGE OF CARS ρ

0.937(15.21) 1.238(20.73) 0.059(1.49) -0.280(3.92) 0.288(1.29) 0.030(1.68) -0.213(2.64) 0.091(3.22) -0.029(0.57) 0.230(0.38) 0.150(2.61) 0.230(2.38) 0.309(2.55)

0.027 -2.765(1.63) -0.401(2.43)

0.101 -2.022(2.09) -0.371(3.07)

0.624(10.72) 0.996(8.83) 0.063(1.30)b -0.262(3.10) 0.279(1.39) 0.030(2.02) -0.203(2.70) 0.098(2.69) -0.024(0.85) 0.236(1.31) 0.098(2.55) 0.229(2.41) 0.328(2.23)

0.852(2.09)

-0.011(1.90)

0.301(20.55)

1.325(17.45) 1.756(52.91) 0.053(15.44) -0.304(20.17) 0.297(2.86) 0.031(1.97) -0.232(3.85) 0.103(2.65) -0.030(2.15) 0.241(2.74) 0.075(2.91) 0.212(3.03) 0.328(2.95)

0.029 -2.752(1.63) -0.401(2.43)

0.115 -2.022(2.09) -0.371(3.07)

1.337(3.49)

0.479(20.33)

1.338(19.18) 1.892(53.92) 0.058(18.76) -0.293(32.99) 0.310(2.73) 0.028(2.00) -0.237(3.94) 0.097(2.71) -0.026(2.20) 0.238(2.72) 0.077(2.86) 0.215(3.21) 0.330(2.74)

0.029 -2.749(1.85) -0.341(2.75)

0.117 -2.064(2.31) -0.369(3.12)

1.328(3.54)

-0.024(1.93)

1.450(19.52) 1.889(63.86) 0.054(2.78) -0.294(4.36) 0.319(2.69) 0.028(2.11) -0.221(4.12) 0.109(3.03) -0.029(1.99) 0.238(2.36) 0.075(2.79) 0.218(3.30) 0.307(2.73)

0.031 -2.725(1.92) -0.353(2.81)

0.119 -2.059(2.41) -0.358(3.23)

1.497(4.01)

-0.021(2.03)

0.548(22.42)

MARK-05-04

40

Table 5. Comparison of Predictive Accuracya Automobileb

(1) Morrison’s

Model

(2) Bemmaor’s

Model

(3) Ignore Purchase

Information

(4) Ignore

Intentions Bias

(6) Ignore


(7) Unified Model

CP Intenders Nonintenders

Effron’s R2

Intenders Nonintenders Predicted Total Purchases Purchases 1 0 Intentions 1 95 42 53 0 905 9 896

0.860 0.708 0.876

0.084 0.074 0.085

NA NA

0.87 0.756 0.882

0.086 0.077 0.087

NA NA

0.86 0.841 0.862

0.070 0.0416 0.073

46 49 13 882

0.85 0.803 0.855

0.070

0.0416 0.073

44 51 11 884

0.88 0.823 0.886

0.084 0.0461 0.088

44 51 10 885

0.92 0.863 0.926

0.089

0.0501 0.093

43 52

9 886

Personal Computer

(1) Morrison’s

Model

(2) Bemmaor’s

Model

(3) Ignore


(4) Ignore

Intentions Bias

(5) Ignore true

intention shift

(6) Ignore


(7) Unified Model

CP Intenders 1 Intenders 2 Nonintenders Effron’s R2

Intenders 1 Intenders 2 Nonintenders Predicted Total Purchasesc Purchases 2 1 0 Intentions 1 82 50 8 23c 2 168 11 53 104 0 858 10 3 842

0.820 0.642 0.739 0.853

0.068 0.040 0.063 0.072

61% (50) 35% (59) 3.4% (29)

0.840 0.695 0.743 0.873

0.070 0.049 0.065 0.073

64% (52) 37% (62) 5.3% (45)

0.84 0.745 0.774 0.862

0.066

0.0617 0.0669 0.0635

58 7 17 16 59 93 14 5 839

0.82 0.726 0.839 0.769

0.060

0.0593 0.0598 0.0601

56 8 18 14 58 96 13 5 840

0.85 0.767 0.809 0.866

0.073

0.0612 0.0721 0.0743

53 9 20 13 56 99 12 4 842

0.87 0.861 0.849 0.875

0.077

0.0678 0.0769 0.0779

53 9 20 12 56 100 102 4 852

0.900 0.887 0.901 0.901

0.079 0.0712 0.0772 0.0801

52 9 22 10 55 103 12 3 843

a. The results in Tables 5 are obtained using the prediction intentions sample in which only stated intentions are available. b. There are 1000 households in the prediction sample for the automobile data. There are 1105 households for the personal computer data. c. This is the actual number purchasing from the prediction intentions data. This information is not used for estimation and prediction. It is only used as a benchmark for examining the

predictive accuracy of the different models.

MARK-05-04

41

Figure 1. Gain’s Chart

Persaonal Computer

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%

Percentage of Consumers

Cum

ulat

ive

Gai

ns

no-modelModel 3Model 4Model 5Model 6Model 7

Automobile

0%

20%

40%

60%

80%

100%

0% 20%

40%

60%

80%

100%

Percentage of Consumers

Cum

ulat

ive

Gai

ns no-modelModel 3Model 4Model 6Model 7

MARK-05-04

Predicting Purchase Behavior from Stated Intentions: A ...pages.stern.nyu.edu/~mkt/wps/MARK 05 04...

Documents

Transcript of Predicting Purchase Behavior from Stated Intentions: A ...pages.stern.nyu.edu/~mkt/wps/MARK 05 04...