Business Intelligence Using SAS Final Presentation

48
Bank Marketing Project Group 7: Zhaodi Liu Preete Dixit Nandini Naik Rashmi Nadubeedi Ramesh Pravin Kumar Prem Kumar

description

Business Intelligence Using SAS Final Presentation

Transcript of Business Intelligence Using SAS Final Presentation

Page 1: Business Intelligence Using SAS Final Presentation

Bank Marketing Project

Group 7:Zhaodi Liu

Preete Dixit

Nandini Naik

Rashmi Nadubeedi Ramesh

Pravin Kumar Prem Kumar

Page 2: Business Intelligence Using SAS Final Presentation

Agenda

• Project Motivation• Data Description• Our BI Models• Experimental Results• Association Rule mining• Managerial Implications• Challenges• Conclusion

Page 3: Business Intelligence Using SAS Final Presentation

Project Motivation

• Direct marketing targets customers directly with a personalized message as opposed to Mass marketing

• The primary benefit to businesses: – Increased lead generation– Increase sales volume – Increased customer base

– Minimize losses

• Focus on generating more "qualified" leads

Page 4: Business Intelligence Using SAS Final Presentation

Impact of Data Mining

• Can be very effective for direct marketing• Use of sophisticated algorithms generate

rules, determine the most useful attributes and predict future outcome

• Our goal is to predict the probability of a client subscribing to the term deposit

• In the interest:– To boost sales to existing customers– Increase customer loyalty– Recapture old customers and generate new

business

Page 5: Business Intelligence Using SAS Final Presentation

Data Description

• The data is related with direct marketing campaigns (phone calls) of a Portuguese banking institution

Data Set Characteristics:

Multivariate Number of Instances: 45211 Area: Business

Attribute Characteristics:

Real Number of Attributes: 17 Date Donated 2012-02-14

BANK CLIENT DATA

Serial No Name Description Data Type 1. age Client’s age numeric 2. job type of job categorical 3. marital marital status categorical 4. education level of education categorical 5. default has credit in default? binary 6. balance average yearly balance in euros numeric 7. housing has housing loan? binary 8. loan has personal loan? binary

Page 6: Business Intelligence Using SAS Final Presentation

DATA RELATED WITH THE LAST CONTACT OF THE CURRENT CAMPAIGN Serial No Name Description Data Type

9. contact contact communication type categorical 10. day last contact day of the month numeric 11. month last contact month of year categorical 12. duration last contact duration, in seconds numeric 13. campaign number of contacts performed

during this campaign and for this client

numeric

14. pdays Number of days that passed by after the client was last contacted

from a previous campaign

numeric

15. previous number of contacts performed before this campaign and for this

client

numeric

16. poutcome outcome of the previous marketing campaign

categorical

OUTPUT VARIABLE (DESIRED TARGET) Serial No Name Description Data Type

17. y has the client subscribed a term deposit

binary

Page 7: Business Intelligence Using SAS Final Presentation

BI Model - With Target Profile

Page 8: Business Intelligence Using SAS Final Presentation

Profit/Loss Matrix

Page 9: Business Intelligence Using SAS Final Presentation

Decision Tree

Page 10: Business Intelligence Using SAS Final Presentation
Page 11: Business Intelligence Using SAS Final Presentation
Page 12: Business Intelligence Using SAS Final Presentation

Regression

Page 13: Business Intelligence Using SAS Final Presentation

Regression Node: To convert categorical values to interval value using dummy variables concept.To group logically related categories in order to reduce the number of independent variables in regression equation.

Page 14: Business Intelligence Using SAS Final Presentation
Page 15: Business Intelligence Using SAS Final Presentation

Neural Network

Page 16: Business Intelligence Using SAS Final Presentation

The model converged after 70 iterations and contains 124 weights.

Page 17: Business Intelligence Using SAS Final Presentation

Neural Network with Input Selection

Page 18: Business Intelligence Using SAS Final Presentation

• Reducing the number of modelling inputs reduces the number of modelling weights as well as computational costs and possibly improves the model performance.

• The useful inputs are selected by connecting the Neural Network Node to the Regression Node.

Page 19: Business Intelligence Using SAS Final Presentation

The model converged after 42 iterations and contains 46 weights.

Page 20: Business Intelligence Using SAS Final Presentation

BI Model - Without Target Profile

Page 21: Business Intelligence Using SAS Final Presentation

Decision Tree

Page 22: Business Intelligence Using SAS Final Presentation

Regression

Page 23: Business Intelligence Using SAS Final Presentation

Neural Network The model converged after 70 iterations and contains 124 weights.

Page 24: Business Intelligence Using SAS Final Presentation

Neural Network with Input Selection The model converged after 76 iterations and contains 85 weights.

Page 25: Business Intelligence Using SAS Final Presentation

Model Assessment and Scoring results with Target Profile Model

– The performance of the four models are compared based on the average profit using the model comparison node.

Page 26: Business Intelligence Using SAS Final Presentation

Fit Statistics

Page 27: Business Intelligence Using SAS Final Presentation

ROC Plots

Page 28: Business Intelligence Using SAS Final Presentation

Confusion Matrix

Page 29: Business Intelligence Using SAS Final Presentation

Scoring

– Scoring is used to implement the model deemed best by the model comparison node for predicted the outcome for a new case/observation for which the outcome is unknown.

Replaced Variables:– Job – management

– Education – Secondary– Contact – Cellular

Rejected Variables:– Poutcome– Target y

Page 30: Business Intelligence Using SAS Final Presentation

Scoring

Page 31: Business Intelligence Using SAS Final Presentation

Scoring

Actual Data Scores:– Percentage No = 88.476%– Percentage Yes = 11.524%

There is a slight difference of 1.725% in the prediction model outcome and the actual outcome.

Page 32: Business Intelligence Using SAS Final Presentation

Model Assessment and Scoring results without Target Profile Model

– The performance of the four models are compared based on the misclassification rate using the model comparison node.

Page 33: Business Intelligence Using SAS Final Presentation

Fit Statistics

Page 34: Business Intelligence Using SAS Final Presentation

ROC Plots

Page 35: Business Intelligence Using SAS Final Presentation

Confusion Matrix

Page 36: Business Intelligence Using SAS Final Presentation

Scoring

Page 37: Business Intelligence Using SAS Final Presentation

Scoring

Page 38: Business Intelligence Using SAS Final Presentation

Scoring

Actual Data Scores:– Percentage No = 88.476%– Percentage Yes = 11.524%

There is a slight difference of 1.725% in the prediction model outcome and the actual outcome.

Page 39: Business Intelligence Using SAS Final Presentation

Association Rule Mining

Page 40: Business Intelligence Using SAS Final Presentation
Page 41: Business Intelligence Using SAS Final Presentation

Data Pre-processing

• Default : D(Yes), D(No)

• Housing : H(Yes) , H(No)

• Personnel Loan : PL(Yes),PL(No)

• Age : 20- 40, 40-60, 60-90, and 90-100

Page 42: Business Intelligence Using SAS Final Presentation

Results & Interpretation

Page 43: Business Intelligence Using SAS Final Presentation

Managerial Implicationsif pdays < 19.5 or MISSING

AND month IS ONE OF: MAY, JUN, JUL, AUG, NOV, JAN or MISSING AND duration < 348.5 or MISSING

AND age < 60.5 or MISSING then Predicted: y=YES = 0.02 Predicted: y=NO = 0.98 A total of 100 customers who and the cost of calling a customer is $12 then there will be a saving $1200 just by not contacting these set of customers.

if pdays < 19.5 or MISSING

AND month IS ONE OF: FEB AND housing IS ONE OF: NO

AND duration < 466.5 or MISSING AND day < 20.5 AND day >= 9.5

AND age < 60.5 or MISSING then Predicted: y=YES = 0.75 Predicted: y=NO= 0.25 A total of 100 customers and cost of calling a customer is $12 and if the profit is $100 then the Bank could generate revenue of $10,000.

Page 44: Business Intelligence Using SAS Final Presentation

Decision Tree Model – Our best-fit model for maximizing profits

Decision Tree

Predicted

Positive Negative

Actual Positive 1314 1330

Negative 863 19098

Predicted

Positive Negative

Actual Positive $15,768 $0

Negative -$10,356 $0

Page 45: Business Intelligence Using SAS Final Presentation

Regression Model• If the pdays increase by 1-unit then it has

absolutely no impact on the odds of not subscribing to the term deposit.

��.���𝟖𝟗� ≈ �

Page 46: Business Intelligence Using SAS Final Presentation

Challenges

• To implement the Profit/Loss matrix

• Absence of ROC plot in the result of Model Assessment

• Non-convergence of Neural Network

• Scoring

Page 47: Business Intelligence Using SAS Final Presentation

Conclusion

• Successfully implemented 2 predictive analysis models to predict the outcome of term deposit subscription

• Decision Tree best fit-model based on Profit/Loss• Decision Tree best-fit model based on

Misclassification Rate• Using the Decision rules

– Results in a saving of $1200 – Generates a revenue of $10,000

• Using Profit/Loss Matrix– Profit of $15,768– Savings of $10,356

Page 48: Business Intelligence Using SAS Final Presentation

Q&A