Financial Econometrics

37
Financial Econometrics – 2014, Dr. Kashif Saleem (LUT) 1 Financial Econometrics Dr. Kashif Saleem Associate Professor (Finance) Lappeenranta School of Business

description

Financial Econometrics. Dr. Kashif Saleem Associate Professor (Finance) Lappeenranta School of Business. Lecture 1 : Agenda. Introduction What is econometrics? Types of data Steps involved in formulating an econometric model - PowerPoint PPT Presentation

Transcript of Financial Econometrics

Page 1: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)1

Financial Econometrics

Dr. Kashif Saleem

Associate Professor (Finance)

Lappeenranta School of Business

Page 2: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)2

Lecture 1 : Agenda

IntroductionWhat is econometrics? Types of data Steps involved in formulating an econometric model Points to consider when reading articles in empirical finance

A brief overview of the classical linear regression model What is a regression model? Regression versus correlation Simple regression The assumptions underlying the classical linear regression model Properties of the OLS estimator Precision and standard errors

Introduction and Empirical data exercises with Eviews

Page 3: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)3

The Nature and Purpose of Econometrics

• What is Econometrics?

• Literal meaning is “measurement in economics”.

• Definition of financial econometrics:

The application of statistical and mathematical techniques to problems in finance.

Page 4: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)4

Examples of the kind of problems that may be solved by an Econometrician

Financial econometrics can be useful for:

• testing theories in finance• determining asset prices or returns• testing hypotheses concerning the relationships between

variables• examining the effect on financial markets of changes in

economic conditions• forecasting future values of financial variables • financial decision-making

Page 5: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)5

Types of Data and Notation

• There are 3 types of data which econometricians might use for analysis:1. Time series data2. Cross-sectional data3. Panel data, a combination of 1. & 2.

• The data may be quantitative (e.g. exchange rates, stock prices, number of shares outstanding), or qualitative (e.g. day of the week).

• Time series data: Data that have been collected over a period of time on one or more variables

Page 6: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)6

Types of Data and Notation

• Examples of time series dataSeries FrequencyGNP or unemployment monthly, or quarterlygovernment budget deficit annuallymoney supply weeklyvalue of a stock market index as transactions occur

• Examples of Problems that Could be Tackled Using a Time Series Regression

- Relationship between country’s stock index macroeconomic variables

- Relationship between company’s stock price its dividend payment.

- Relationship between different macroeconomic and financial variables

Page 7: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)7

Types of Data and Notation (cont’d)

• Cross-sectional data: are data on one or more variables collected at a single point in time, e.g.

- gross annual income for each of 1000 randomly chosen households in Helsinki City for the year 2000.

- A poll of usage of internet stock broking services

- Cross-section of stock returns on the New York Stock Exchange

- A sample of bond credit ratings for UK banks

Page 8: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)8

Types of Data and Notation (cont’d)

• Panel Data has the dimensions of both time series and cross-sections, it is mult- dimensional data.

• Panel data contains observations on multiple phenomena observed over multiple time periods for the same firms or individuals

• e.g. the daily prices of a number of blue chip stocks over two years.

• Examples:

– Annual unemployment rates of each state over several years

– Quarterly sales of individual stores over several quarters

– Wages for the same worker, working at several different jobs

Page 9: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)9

Cross-sectional, Time series and panel data (example):

• Consider, for example, a set of 1000 households randomly chosen from all households of Lappeenranta

• The observed variable is the gross annual income. • A set of 1000 annual income values for year 1995 for each household is an

example of cross-sectional data. • From such data one could derive information on how income was distributed

among households in Lappeenranta in 1995. • A series of 10 values of the average annual income of ALL 1000 households

for each year since 1991 to 2000 is an example of Time series data. • From such data one could derive information on how average income changed

during the decade from 1991-2000 • A set of 1000x10 = 10,000 values of the annual income for each household for

each year from 1991 to 2000 is an example of Panel data.

Page 10: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)10

• It is preferable not to work directly with asset prices, so we usually convert the raw prices into a series of returns. There are two ways to do this:Simple returns or log returns

 Formulas on Board

 

Returns in Financial Modelling

Page 11: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)11

Steps involved in the formulation of econometric models

Economic or Financial Theory (Previous Studies)

Formulation of an Estimable Theoretical Model

Collection of Data

Model Estimation

Is the Model Statistically Adequate?

No Yes

Reformulate Model Interpret Model

Use for Analysis

Page 12: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)12

1. Does the paper involve the development of a theoretical model or is it

merely a technique looking for an application, or an exercise in data

mining?

2. Is the data of “good quality”? Is it from a reliable source? Is the size of

the sample sufficiently large for asymptotic theory to be invoked?

3. Have the techniques been validly applied? Have diagnostic tests for

violations of been conducted for any assumptions made in the estimation

of the model?

Some Points to Consider when reading papers in the academic finance literature

Page 13: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)13

4. Have the results been interpreted sensibly? Is the strength of the results

exaggerated? Do the results actually address the questions posed by the

authors?

5. Are the conclusions drawn appropriate given the results, or has the

importance of the results of the paper been overstated?

Some Points to Consider when reading papers in the academic finance literature (cont’d)

Page 14: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)14

A brief overview of the

classical linear regression model

Page 15: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)15

Regression

• Regression is probably the single most important tool at the econometrician’s disposal.

But what is regression analysis?

• It is concerned with describing and evaluating the relationship between a given variable (usually called the dependent variable) and one or more other variables (usually known as the independent variable(s)).

• More specifically, regression is an attempt to explain movements in a variable by reference to the movements in one or more other variables.

Page 16: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)16

Some Notation

• Denote the dependent variable by y and the independent variable(s) by x1, x2, ... , xk where there are k independent variables.

y (whose movements the regression seeks to explain)

x ( the variables which are used to explain those variations)

• Some alternative names for the y and x variables:

dependent variable independent variables

regressand regressors

effect variable causal variables

explained variable explanatory variable

• Note that there can be many x variables but we will limit ourselves to the case where there is only one x variable to start with. In our set-up, there is only one y variable.

Page 17: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)17

Regression is different from Correlation

• The correlation between two variables measures the degree of linear association between them.

• Thus, it is not implied that changes in x cause changes in y

• Rather, it is simply stated that there is evidence for a linear relationship between the two variables, and that movements in the two are on average related to an extent given by the correlation coefficient.

• In regression, the dependent variable (y) and the independent variable(s) (xs) are treated very differently.

• The y variable is assumed to be random or ‘stochastic’ in some way, i.e. to have a probability distribution.

• The x variables are, however, assumed to have fixed (‘non-stochastic’) values in repeated samples.

• Regression as a tool is more flexible and more powerful than correlation.

Page 18: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)18

Simple Regression

• For simplicity, say k=1. This is the situation where y depends on only one x variable.

• Examples of the kind of relationship that may be of interest include:

– How asset returns vary with their level of market risk

– Measuring the long-term relationship between stock prices and dividends.

Page 19: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)19

Simple Regression: An Example

• Suppose that we have the following data on the excess returns on a fund manager’s portfolio (“fund XXX”) together with the excess returns on a market index:

• We have some intuition that the beta on this fund is positive, and we therefore want to find whether there appears to be a relationship between x and y given the data that we have. The first stage would be to form a scatter plot of the two variables.

Year, t Excess return= rXXX,t – rft

Excess return on market index= rmt - rft

1 17.8 13.72 39.0 23.23 12.8 6.94 24.2 16.85 17.2 12.3

Page 20: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)20

Finding a Line of Best Fit

• We can use the general equation for a straight line,

y=a+bx

to get the line that best “fits” the data.

• However, this equation (y=a+bx) is completely deterministic.

• Is this realistic? No. So what we do is to add a random disturbance term, u into the equation.

yt = + xt + ut

where t = 1,2,3,4,5

Page 21: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)21

Why do we include a Disturbance term?

• The disturbance term can capture a number of features:

- We always leave out some determinants of yt (effect of other variables on Y which are not in the model)

- There may be errors in the measurement of yt that cannot be

modelled.

- Random outside influences on yt which we cannot model (unexpected

events)

Page 22: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)22

Determining the Regression Coefficients

• So how do we determine what and are?

• Choose and so that the (vertical) distances from the data points to the fitted lines are minimised (so that the line fits the data as closely as possible): y

x

Page 23: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)23

Ordinary Least Squares

• The most common method used to fit a line to the data is known as OLS (ordinary least squares).

• What we actually do is take each distance and square it (i.e. take the area of each of the squares in the diagram) and minimise the total sum of the squares (hence least squares).

• Tightening up the notation, let

yt denote the actual data point t

denote the fitted value from the regression line

denote the residual, yt - tyty

tu

Page 24: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)24

OLS

Page 25: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)25

Actual and Fitted Value

y ix x

iy

iy

iu

Page 26: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)26

Deriving the OLS Estimator

ON BOARD

Page 27: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)27

What do We Use and For?

• If “x” increases by by 1 unit, “y” will be expected, every thing else being equal, to increase by the value of . of course, if beta had been negative, a rise in “x” would on average cause a fall in “y”

• Alpha is known as intercept coefficient estimate, can be interpreted as the value that would be taken by the dependent variable “y” if the independent variable “x” took a value of zero

Page 28: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)28

What do We Use and For?

• In the CAPM example, plugging the 5 observations in to the formulae derived would lead to the estimates

= xxx and = xxx. -- on board

• Question: If an analyst tells you that she expects the market to yield a return 20% higher than the risk-free rate next year, what would you expect the return on fund XXX to be?

• Solution: on board

Page 29: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)29

The Population and the Sample

• The population is the total collection of all objects or people to be studied, for example,

• Interested in Population of interest

predicting outcome the entire electorate

of an election

• A sample is a selection of just some items from the population.

• A random sample is a sample in which each individual item in the population is equally likely to be drawn.

Page 30: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)30

Linearity

• In order to use OLS, we need a model which is linear in the parameters ( and ). It does not necessarily have to be linear in the variables (y and x).

• Linear in the parameters means that the parameters are not multiplied together, divided, squared or cubed etc.

• Some models can be transformed to linear ones by a suitable substitution or manipulation, e.g. the exponential regression model

• Then let yt=ln Yt and xt=ln Xt

ttt uxy

tttu

tt uXYeXeY t lnln

Page 31: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)31

Estimator or Estimate?

• Estimators are the formulae used to calculate the coefficients

• Estimates are the actual numerical values for the coefficients.

Page 32: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)32

The Assumptions Underlying the Classical Linear Regression Model (CLRM)

• The model which we have used is known as the classical linear regression model.

• We observe data for xt, but since yt also depends on ut, we must be specific about how the ut are generated.

• We usually make the following set of assumptions about the ut’s (the unobservable error terms):

• On Board

Page 33: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)33

BLUE

• If assumptions 1. through 4. hold, then the estimators and determined by OLS are known as Best Linear Unbiased Estimators (BLUE).

What does the acronym stand for?

• “Estimator” - is an estimator of the true value of .

• “Linear”- is a linear estimator

• “Unbiased” - On average, the actual value of the and ’s will be equal to the true values.

• “Best” - means that the OLS estimator has minimum variance among the class of linear unbiased estimators.

The Gauss-Markov theorem proves that the OLS estimator is best.

Page 34: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)34

Properties of the OLS Estimator

• ConsistentThe least squares estimators and are consistent. That is, the estimates will converge to their true values as the sample size increases to infinity. Need the assumptions E(xtut)=0 and Var(ut)=2 < to prove this.

• UnbiasedThe least squares estimates of and are unbiased. That is E( )= and E( )=Thus on average the estimated value will be equal to the true values. To prove this also requires the assumption that E(ut)=0. Unbiasedness is a stronger condition than consistency.

• EfficiencyAn estimator of parameter is said to be efficient if it is unbiased and no other unbiased estimator has a smaller variance. If the estimator is efficient, we are minimising the probability that it is a long way off from the true value of .

Page 35: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)35

Precision and Standard Errors

• Any set of regression estimates of and are specific to the sample used in their estimation.

• Recall that the estimators of and from the sample parameters ( and ) are given by

• What we need is some measure of the reliability or precision of the estimators( and ). The precision of the estimate is given by its standard error. Given assumptions 1 - 4 above, then the standard errors can be shown to be given by

• Formulas on Board

xy

xTx

yxTyx

t

tt ˆˆandˆ22

Page 36: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)36

Estimating the Variance of the Disturbance Term (cont’d)

Some Comments on the Standard Error Estimators

1. Both SE( ) and SE( ) depend on s2 (or s). The greater the variance s2, then the more dispersed the errors are about their mean value and therefore the more dispersed y will be about its mean value.

2. The sum of the squares of x about their mean appears in both formulae.

The larger the sum of squares, the smaller the coefficient variances.

3. The larger the sample size, T, the smaller will be the coefficient variances.

Page 37: Financial Econometrics

Financial Econometrics – 2014, Dr. Kashif Saleem (LUT)37

Example: How to Calculate the Parameters and Standard Errors

• Assume we have the following data calculated from a regression of y on a single variable x and a constant over 22 observations.

• Data:

• Calculations:

on board

6.130,3919654

,65.86,5.416,22,8301022

RSSx

yxTyx

t

tt