Lifetime Value in the Bank Industry - Centre de … ·  · 2015-09-28Lifetime Value in the Bank...

Post on 21-Apr-2018

231 views 3 download

Transcript of Lifetime Value in the Bank Industry - Centre de … ·  · 2015-09-28Lifetime Value in the Bank...

Lifetime Value in the Bank Industry

Problem submitted by the National Bank of

Canada

Sixth Montreal Industrial Problem Solving Workshop

August 17–21, 2015

National Bank of Canada 1 / 26

General Context

Acquiring and retaining profitable customers is anever-growing challenge for banks.

Customer demographics, buying behavior and needsare evolving rapidly.

The competition is aggressive and adapting rapidly.

Banks now need a 360-degree view of each customerin order to focus their resources efficiently.

National Bank of Canada 2 / 26

National Bank of Canada

These challenges are pushing the National Bank ofCanada to gain a better understanding of the driversbehind its Customer Lifetime Value (CLV).

The Client Intelligence & Modelization Team hasbeen asked to develop methods to evaluate the CLV.

The CLV will help the National Bank define,understand, and predict the relationship between agiven client and the Bank.

National Bank of Canada 3 / 26

The Problem

The CLV is a simple concept in itself, but difficult toimplement in a complex business context.

A very large amount of data must be taken intoaccount, for example:

I Notions of client acquisition and attrition;

I Holding of diverse banking products and services, theirvolume, usage, and profitability;

I Other clients’ characteristics such as geographical,demographic, and market data.

National Bank of Canada 4 / 26

The Problem

This data often needs a lot of cleaning andmanipulation to be usable and meaningful.

The profiles, products, or services of the Bank’sclients vary greatly; so do their behaviours andexpectations.

The team’s goal for this workshop was to explore andtest different approaches to compute the CustomerLifetime Value.

National Bank of Canada 5 / 26

Data Description: Commercial Data

Aggregated monthly data from 2013 to 2015.

“VECT IN” binary vector of length 24, which standsfor the products: savings (3), loans (5), transactions(16). Contains 1479 distinct vectors.

VECT IN NB VECT IN VALUE000001011010000001100000 27240 189.2424000001011010000001110000 21855 204.9733

National Bank of Canada 6 / 26

Data Description: Retail Data

Original data: 4831 retail clients (rows), 6130 attributes(columns).

Cleaned data: 4831 rows and 88 columns.

Data for a particular month.

Main attributes are:

Demographic attributes (age, gender, marital status, . . . ).Product information (tenure, holding, balance).EFT (Electronic Fund Transfers), AML (Anti-MonetaryLaundering), Aggregated fees.

More values for each product could improve the reliability of thesuggested solution.

National Bank of Canada 7 / 26

Relevant Literature

1 We focused on exploring an indicator calledCustomer Lifetime Value (CLV).

2 There exists several studies focusing on CLV in theretail banking industry.

3 The approaches of [3], [2], and [1] informed ouranalysis and approach to modelling the CLV.

4 We want a model or algorithm for estimating andpredicting the CLV in order to drive marketingdecisions and customer relationship management.

National Bank of Canada 8 / 26

Relevant Literature

1 [3] proposed a customer valuation model based on acombination of Markov chains and classification andregression trees (CART).

2 Use German bank data.

3 Focus on parsimonious model based on homogeneouscustomer segments.

4 Four main factors: age, demographics and lifestyle,type and intensity of product usage, and activity level.

5 Factors measured by multiple indicators used aspredictor variables for profit contribution.

National Bank of Canada 9 / 26

Relevant Literature

1 CART analysis to cluster the client base into severalhomogeneous sub-groups.

2 Each homogeneous sub-group was considered a stateof a Markov chain.

3 Moves between states over time with transitionprobabilities.

4 Probabilities estimated by counting the number ofcustomers who moved between two states anddividing by the total number of customers.

National Bank of Canada 10 / 26

Relevant Literature

1 [2] uses CLV for the modelling and prediction of acertain type of customer behaviour.

2 Denote the usage of a product j , during time periodt, by customer i by the variable xi ,j ,t .

3 CLV for customer i from time t to t + h is defined as

CLVi ,t =h∑

k=1

q∑j=1

1

(1 + r)kCFi ,j ,t+k . (1)

National Bank of Canada 11 / 26

Relevant Literature

1 CFi ,j ,t denotes the net cash flows yielded by thetransaction on product j .

2 r > 0 is a relevant discount rate for the time period[t, t + 1).

3 πj , for the j-th product during the time period,denotes the average marginal profit per unit ofproduct usage.

4 CFi ,j ,t = πjxi ,j ,t

National Bank of Canada 12 / 26

Relevant Literature

1 [2] performs a classification of customers.

2 Goal of avoiding misclassification based on aCLV-sensitive loss function.

3 Methods (including logistic regression, decision trees,and neural networks) were used and compared for theclassification procedure.

National Bank of Canada 13 / 26

Objectives

Aim: Develop a model for Client Lifetime Value

Objectives:

1 Identify and segment relevant variables.

2 Develop CLV model.

3 Develop a predictive model to help determine futuremarketing strategies based on CLV.

National Bank of Canada 14 / 26

First objective

Identify and segment relevant variables

Volume of data

Understand drivers of CLV

Approaches used

Generalized linear models (GLM)

Stepwise regression

Classification and regression trees (CART)

Principal components analysis (PCA)

Self-organizing maps (SOM)

National Bank of Canada 15 / 26

GLM

Express CLVi as a function of

Products held: 0-1 vector, e.g., (1, 0, 1, 1, 0, 1);

Demographics: age, gender, income, zip code region;

Client profile and behavior: credit score, averagebalance in checking account, number of transactions.

CLVi = f (β0i + β1iξ1i + β2iξ2i + . . . + εi)

National Bank of Canada 16 / 26

Classification and regression trees (CART)

National Bank of Canada 17 / 26

PCA

I Dimension reduction

The vector of explanatory variables

(ξ1, ξ2, ξ3, . . . , ξn)

is mapped into(n∑

i=1

w(1)i ξi ,

n∑i=1

w(2)i ξi , . . . ,

n∑i=1

w(n)i ξi

).

I A few components explain the variability in the data

Line of creditPersonal loans (e.g., car loan, student loan)Mastercard

National Bank of Canada 18 / 26

Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Think of different colors as different customers having different set of attributes (R, G, B's)
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text
Mehr
Typewritten Text

The output clustering of theSOM, retail bankinginformation is cash flow, demographics and producholdings

SOM HeatMap

SOM extract clusters reduce dimensionality simpler Markov chains with fewer transition probabilities.

Gender vs. Fees Avg Total 12

GENDER

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

0

0.2

0.4

0.6

0.8

1FEES_AVG_A_TOTAL_12

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

0

100

200

300

400

2 / 1

Language vs. Fees Avg Total 12

LANGUAGE

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

0

0.2

0.4

0.6

0.8

1FEES_AVG_A_TOTAL_12

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

0

100

200

300

400

3 / 1

Second objective

Develop CLV model

Literature review

Static: GLM

Dynamic: modelling transition of states

Highlights

Importance of segmentation

(Semi-)Markov chain

Logistic regression

National Bank of Canada 19 / 26

Markov chains

Vector of product holdings (xt,1, . . . , xt,q)

Example: (1,0,1,1,0,1)

q products: 2q possible vectors

Importance of segmentation

P =

P1,1 P1,2 · · · P1,q

P2,1 P2,2 · · · P2,q...

... . . . ...Pq,1 Pq,2 · · · Pq,q

National Bank of Canada 20 / 26

Markov chains

I Segmentation of transitions by client characteristics.

I Clients spend a lot of time in a given state.

A Markov chain implies a geometric distribution for theamount of time spent in a given state.

Pr(T = n) = pn−1(1− p), n = 1, 2, . . .

I Modelling constraint

National Bank of Canada 21 / 26

Markov chains

I Possible solution: model separately the time spent ina state and the transitions.

I Semi-Markov chain

P =

x P1,2 · · · P1,q

P2,1 x · · · P2,q...

... . . . ...Pq,1 Pq,2 · · · x

National Bank of Canada 22 / 26

Logistic regression

To model the transition probabilities as a function ofexplanatory variables, we can use a logistic regression.

aj ,k = β0,jk + β1,jkξ1 + β2,jkξ2 + . . .

Pr(xt+1 = k | xt = j , ξ) =exp(aj ,k)∑qj=1 exp(aj ,k)

National Bank of Canada 23 / 26

Third objective

Develop a predictive model to help determinefuture marketing strategies based on CLV

National Bank of Canada 24 / 26

Thank you!

National Bank of Canada 25 / 26

References

[1] Y. Ekinci, F. Ulengin, N. Uray, and B. Ulengin. Analysis ofcustomer lifetime value and marketing expenditure decisionsthrough a Markovian-based model. European Journal ofOperational Research, 237:278–288, 2014.

[2] N. Glady, B. Baesens, and C. Croux. Modeling churn usingcustomer lifetime value. European Journal of OperationalResearch, 197:402–411, 2009.

[3] M. Haenlein, A. M. Kaplin, and A. J. Beeser. A model todetermine customer lifetime value in a retail banking context.European Management Journal, 25(3):221–234, 2007.

National Bank of Canada 26 / 26