A Theory of Family Business Groups and Pyramidal …...A Theory of Family Business Groups and...

A Theory of Family Business Groups

and Pyramidal Ownership*

Heitor AlmeidaNew York [email protected]

Daniel WolfenzonNew York [email protected]

(This Draft: August 14, 2003 )

Abstract

Existing theories of pyramidal business groups cannot explain recent empirical …ndings about theprevalence of these groups and their characteristics. In this paper, we propose a simple theory ofpyramids that helps us understand the empirical evidence. Our theory explicitly recognizes that afull understanding of pyramidal business groups necessitates answering two di¤erent questions in anintegrated framework: why business groups arise, and what determines their ownership structure,that is, why …rms are owned through pyramidal chains. In our model business groups arise tosubstitute for missing …nancial markets, and families choose pyramidal ownership structures whenthe bene…t of using retained earnings that belong to existing …rms in the group is large. Our modelmakes a number of empirical predictions that are consistent with available evidence and suggestsother implications that can be tested in future research.

Key words: pyramids, business groups, investor protection, ownership structure.JEL classi…cation:...

* We thank... for valuable comments.

1 Introduction

Many …rms in the world have a controlling shareholder, usually a family or the State (La Porta,

Lopez-de-Silanes and Shleifer, 1999). In fact, in several countries a few wealthy families control a

large number of …rms and consequently a vast amount of capital through the creation of business

groups.1 This concentration of control is usually accomplished with the use of pyramidal ownership

structures. A typical pyramidal business group consists of a collection of legally independent …rms,

controlled by the family through a chain of ownership stakes. Because the family does not need

to retain 100% of the shares to retain control at each chain of the pyramid, pyramidal ownership

structures allow for separation of cash ‡ow and control rights, and can facilitate family control.

There is a large economics and …nance literature that attempts to explain the existence of

pyramidal business groups, and the role they play in the economy. While some authors argue that

pyramidal business groups arise to substitute for missing or poorly functioning markets (Le¤, 1978,

Khanna and Palepu, 1997),2 others believe that pyramids are used by families to retain control over

a country’s corporate sector with a low cash ‡ow stake.3 In particular, because of the potential

separation of cash ‡ow from control rights engendered by pyramids, their existence can exacerbate

agency problems between controlling and minority shareholders.

The fact that pyramidal business groups are more common in countries with poor investor

protection (La Porta, Lopez-de-Silanes and Shleifer, 1999), appears to be consistent with a role

of business groups as substitutes for poorly developed …nancial markets. However, there is also

recent evidence that a …rm’s group membership increases the scope for expropriation of the …rm’s

minority shareholders by the controlling one (Bertrand, Mehta and Mullanaithan, 2002, Bae, Jang

and Kim, 2002). This evidence appears to be more consistent with the view that pyramids are a

mechanism to retain control and extract private bene…ts. Proponents of this view argue that the

reason why pyramids are more common in countries with poorly developed …nancial markets is

that in those countries private bene…ts of control are large (Bebchuk, 1999). Thus, while families

might bene…t from the concentrated control achieved through pyramids, the resulting allocation of1See, among others, Claessens, Djankov, and Lang (2002) for the evidence on East Asia, Faccio and Lang (2002)

for Western Europe, Khanna (2000) for emerging markets, and Morck, Strangeland and Yeung (2000) for Canada.2Khanna and Palepu’s value-enhancing argument is really about all types of business groups, not only about

pyramidal business groups. We come back to this important issue below.3This argument goes back at least to Berle and Means (1932). In their classic book they describe the use of

pyramids in the railroad industry in early 20th century America.

1

corporate control may become sub-optimal for the country (Morck, Strangeland and Yeung, 2000).

At …rst glance, the agency view of pyramids appears to be broadly consistent with the data, and

one can argue that it has become the traditional view of pyramids in the academic literature.4

Nevertheless, a more detailed examination of the available data on pyramidal ownership struc-

tures reveals some facts that cannot be adequately explained by this traditional view. Notice that

the traditional story revolves around the notion of control of voting rights. If pyramids are simply

a way of minimizing the monetary cost of achieving control, one should never observe cash ‡ow

stakes in individual …rms that are larger than the treshold that is required to achieve control. A

major problem for the conventional story is the substantial empirical evidence that in many coun-

tries families do hold large cash ‡ow stakes in the …rms that they control through pyramids. For

example, Franks and Mayer (2002) …nd that in Germany, for a large majority of the …rms con-

trolled through pyramids the controlling shareholder could have achieved the same level of control

by simply holding shares directly in the …rm. The authors conclude from the evidence they compile

that, in Germany, pyramids are simply not used as a device to achieve control. Similar …ndings

have been reported for other countries such as Brazil (Valadares and Leal, 2001), Chile (Lefort and

Walker, 1999), and Canada (Attig, Fischer and Gadhoum, 2003).

Another problem with the traditional story is that pyramids are not always the only available

method to separate cash ‡ow rights from voting rights. For example, if there are no restrictions

to the use of dual-class shares, any degree of separation is possible by the use of dual-class shares.

In such a case, why would a family choose to control a …rm through a pyramid instead of issuing

dual-class shares? Nevertheless, the evidence compiled by La Porta, Lopez-de-Silanes and Shleifer

(1999) suggests that pyramids are much more common throughout the world than the direct use

of shares with superior voting rights. Of course, in many countries there are restrictions to the use

of dual class shares. If these restrictions exist, there will be an upper bound to the deviation that

can be achieved by a combination of direct holdings and dual-class shares, and the conventional

story would predict that pyramids would appear when the desired deviation from one-share-one-

vote is above the maximum allowed by the law. Empirically, however, we do observe the use of

pyramids to separate cash ‡ows and votes below the upper bound achieved by a combination of

4However, this traditional story does not explain why shareholders agree to buy into business groups that theyknow will expropriate them (see, e.g., Bertrand, Mehta and Mullanaithan, 2002).

2

direct holding and dual-class shares (see Bianchi, Bianco, and Enriques, 2001, and Volpin, 2002, for

Italian evidence). All this evidence suggests that considerations other than control rights motivate

the creation of pyramidal groups.

Our goal in this paper is to propose a simple theory of business groups and pyramidal ownership

that is consistent with the empirical evidence. Our theory blends elements of the value-enhancing

view with the agency view to build a positive theory that explains when pyramids should arise,

and the properties that pyramidal ownership structures should have.

We use Khanna and Palepu (1997)’s argument that managerial talent is a scarce commodity,

and thus ownership of several …rms by the same family may become a viable allocation of control

in some countries. However, it is unlikely that families are always the best possible managers for

all the …rms they end up controlling. In order to explain how families can end up controlling a

large number of …rms in countries with poor investor protection, we embed Khanna and Palepu

(1997)’s argument in a framework that builds upon Shleifer and Wolfenzon (2002)’s model of the

relationship between investor protection, equity markets and the allocation of capital. In that

model poor investor protection limits the amount of capital that talented entrepreneurs can raise

in …nancial markets. This limitation allows wealthy families to own …rms even when they are not

the best possible managers for these …rms, because they own another scarce resource - their wealth.

These considerations imply that families should own more …rms in countries with poor investor

protection, and thereby explains why business groups are more likely to arise in those countries.

This part of our model can be thought of as a simple formalization of the value-enhancing view

of business groups - they arise to substitute for missing markets, in our particular case …nancial

markets that are endogenously curtailed by poor investor protection.

However, this argument on its own cannot explain why pyramids arise. Even when families

have scarce resources that make them the constrained-e¢cient owners of a …rm, this does not mean

that the ownership structure has to be pyramidal. In fact, the family has a choice of owning the

…rm through a horizontal structure where the …rm will be controlled directly, or placing the …rm

in a pyramid where it will be controlled by the family but legally owned by another …rm that is

also controlled by the family. This simple observation underscores an important point that has

been neglected in previous literature. The questions of why business groups arise and their optimal

3

ownership structure are two di¤erent questions.5 Clearly, a model that explains pyramidal business

groups must tackle both questions in an integrated framework.

We argue that there are three main di¤erences between the pyramidal and the horizontal struc-

ture. In the pyramidal structure, the family shares the security bene…ts (that is, the fraction of the

bene…ts that is not diverted by the family) of new projects with existing shareholders. Furthermore,

because pyramidal …rms are owned through a chain of control diversion of cash ‡ows tend to be

higher in the pyramid. However, the pyramidal structure also generates a potential bene…t for the

controlling family. Suppose the family already owns a …rm A. If …rm A has been successful in the

past, it will have accumulated a stock of retained earnings that can be used to make investments

in new …rms. Crucially, the family will not be able to use the whole cash that belongs to …rm A

(and thus to the existing shareholders) to set up a new …rm using a horizontal structure, because

the new …rm in this case would belong only to the family (and to new shareholders who make fresh

investments in the new …rm). However, the new project can be undertaken using the whole cash

when the family forms a pyramid, because in this case the new project belongs to all the existing

shareholders.

We allow the family to choose freely whether it will undertake new projects in a pyramid (with

access to the entire stock of cash but sharing the NPV with the existing shareholders), or using a

horizontal structure. Because the family must share the security bene…ts of the new project when

it uses a pyramid, it will have greater incentives to divert the cash ‡ows of the new project from

security holders to its own pockets. This e¤ect tends to create more diversion in pyramids than

in horizontal structures. Because the family internalizes the costs of diversion every time it must

issue new shares, it has an incentive to keep its ownership of cash ‡ow rights in the new project

as high as possible and minimize the issuance of new shares. This intuition might explain why we

observe high cash ‡ow stakes in …rms that are owned through pyramids.

Still, the potential for high levels of diversion in the pyramid remains. We show that pyramids

are chosen over horizontal structures only if the family expects to divert enough cash ‡ows of new

projects away from existing shareholders. E¤ectively, the new investments that are undertaken

through pyramids always become negative NPV investments for existing shareholders, in an ex-

5Again, the exception is Khanna and Palepu (1997), who say explicitly that their theory is about all types ofbusiness groups, irrespective of their ownership structures.

4

post sense. In other words, existing shareholders would prefer that the cash stock was paid out

as a dividend, but the family may choose to use the cash to invest in new projects. Importantly,

this does not mean that the pyramidal structure is ine¢cient, because using the whole cash (and

investing through pyramids) might be the only way to undertake positive NPV investments when

the project cannot be …nanced using only part of the cash.6 Thus, the excessive diversion that

occurs in pyramids may be a “necessary evil” when the availability of cash is important enough,

and when the family has some talent to manage the project. We believe these observations help

conciliate the two seemingly disparate views of pyramids that have been proposed in the literature.

In addition, our model generates a number of more speci…c empirical implications. Some of them

are consistent with available data and anecdotal evidence, while others have not been systematically

tested. We discuss our implications in detail in section 6. For example, our theory suggests that

pyramids are more likely to arise following successful performance of the existing …rms in the group

(because of the crucial role of the stock of cash). Firms that are owned through pyramids will also

tend to be more capital intensive, and less pro…table (because these …rms are harder to …nance

as stand-alones or in horizontal structures). Finally, the model predicts that the prevalence of

pyramids will be higher when investor protection is lower. There are two reasons for this. First,

the family is more likely to own several …rms in countries with poor investor protection, because

high investor protection will decrease the comparative advantage that families have in undertaking

new projects. Second, families are more likely to choose pyramids over horizontal structures when

investor protection is lower, because pyramids are only chosen when there is enough diversion and

the costs of diversion are lower in countries with poor investor protection. In fact, we show that the

same conditions that are conducive to the appearance of business groups are also conducive to the

appearance of pyramidal structures. This result might explain why most business groups appear

to be organized as pyramids.

In the next section (section 2), we discuss the related literature on pyramids and business

groups. We introduce our model in section 3. In this section we consider a family who already

owns and controls an established …rm and can set up a new …rm. We assume that the second …rm

can only be set up by the family, e¤ectively forming a business group. In section 4 we relax this last

6Notice also that the negative NPV of ex-post investments for existing shareholders does not mean that theycannot break even ex-ante.

5

assumption and explore the issue of when a business group is formed. Section 5 analyzes the origin

of the business group and discusses the optimality of committing to a particular organizational

structure. Section 6 discusses the empirical implications of the model, and section 7 concludes the

paper.

2 Related Literature

The term ‘business group’ is used in the literature to describe many di¤erent types of organizations.

It is used for a collection of legally independent …rms controlled by a single family, such as the

family groups in Western Europe, Latin America, and East Asia. However, the term has also

been used for organizations with a lower degree of central control achieved through interlocking

directorates, common ethnicity, school ties, etc. Example of these groups are the Japanese keiretsu

in which individual managers have a great degree of autonomy in their …rms but coordinate their

activities through the President Council and a common Main Bank (Hoshi and Kashyap, 2003).

Another example of these loose associations are the horizontal …nancial-industrial groups in Russia,

‘which are more properly industry alliances’ (Perotti and Gelfer, 2001, p. 1604). In this paper we

concentrate on family business groups.7

There is a vast economics’ literature on family business groups (see Khanna, 1999, for a survey).

The literature has suggested a number of arguments about why business groups exist, and why they

use di¤erent ownership structures. To achieve control of the various member …rms, a family can

hold shares directly in each one (‘horizontal structure’) or, alternatively, it can control some of

the …rms through a chain of ownership relations (‘pyramidal ownership’). We consider …rst the

literature on the …rst issue (why business groups exist), and then we discuss the arguments about

ownership structure in business groups. Our goal is not only to review the existing theoretical and

empirical literature, but also to point out to some important questions that we believe are not

well explained by existing theories of pyramidal business groups. We conclude this section with a

brief discussion of the relationship of our approach to other related strands of theory in corporate

…nance, especially the literature on internal capital markets.

7Khanna (1999) also argues that the degree of central control in business groups varies across countries.

6

2.1 Why business groups?

Most theories explain the presence of business groups as an e¢cient organizational form that adds

value to member …rms. Le¤ (1978) argued that business groups arise to substitute for missing

markets (e.g. labor and …nancial markets).8 The modern reincarnation of this argument is in

Khanna and Palepu (1997, 1999), who argue that business groups provide an important substitute

for active external capital markets in developing countries. Some of the empirical evidence in

La Porta, Lopez-de-Silanes and Shleifer (1999) can be interpreted as being consistent with this

argument. Indeed, La Porta, Lopez-de-Silanes and Shleifer show that pyramids (one type of business

groups) are more common in countries with low investor protection, which are also countries with

less developed …nancial markets (La Porta et al., 1997).9

An alternative value-enhancing argument suggests that business groups add value to member

…rms because they are better positioned to lobby the governments for favors (Pagano and Volpin,

2001). These political economy theories suggest that business groups should be more prevalent

in countries where the government has a stronger role in allocating resources, or in more corrupt

countries.

There is mixed empirical evidence on the direct valuation e¤ects of group membership. Khanna

and Rivking (1999) study 15 countries, and …nd that in only 3 of them a¢liation with a business

group adds value to member …rms. They …nd, however, that a¢liation does not decrease value

in any of the cases. Khanna and Palepu (2000) analyze the performance of business groups in

India and …nd that only the members of the largest groups have valuations higher than those

of independent …rms. Member …rms of medium-sized Indian groups have valuations below their

independent counterparts. Claessens et al. (2002) …nd evidence that …rms in business groups

organized as pyramids (especially those in which the divergence between votes and cash ‡ows is

the greatest) have lower Tobin’s Q. Similar results are found by Lemmons and Lins (2003), Lins

(2003), Mitton (2002) and Joh (2001). Finally, for a sample of East Asian countries, Claessens,

Djankov, and Lang (2002) …nd that group a¢liation has on average no e¤ect on …rm valuation.

8For example, business groups might act as a risk sharing mechanism (Aoki, 1984). Khanna and Yafeh (2001)test the risk-sharing hypothesis and …nd no evidence for it.

9However, this is not necessarily evidence that business groups are more common in countries with low investorprotection, because La Porta, Lopez-de-Silanes and Shleifer (1999) measure the proportion of pyramids as a fractionof the total number of …rms in their sample, some of which could belong to non-pyramidal business groups.

7

They …nd, however, that there are bene…ts and costs for di¤erent types of member …rms: The

oldest and slowest-growing …rms in the group bene…t at the expense of younger, fastest-growing

…rms.

In contrast to the value-enhancing view, a recent string of papers argues that business groups

exacerbate agency con‡icts and are thus detrimental to …rm value. The main argument is that when

…rms belong to a group, the scope for expropriation of minority shareholders by the controlling

shareholder might increase. Bertrand, Mehta, and Mullanaithan (2002) …nd evidence consistent

with expropriation in Indian business groups, from …rms in which the controlling shareholder holds a

small equity stake to …rms in which the controlling shareholder holds a large stake. Bae, Kang, and

Kim (2002) …nd that Korean chaebols use M&A transactions between member …rms to expropriate

shareholders of the bidder …rm and bene…t the controlling family. These …ndings raise important

questions. Why are business groups so common if they exacerbate agency con‡icts? And if business

groups do increase agency costs, why is it that they arise precisely in countries with low investor

protection, where these costs are high to start with?

2.2 Why pyramidal ownership?

Pyramidal ownership structures are very prevalent around the world, especially in countries with

low investor protection (La Porta, Lopez-de-Silanes and Shleifer, 1999, Claessens, Djankov, and

Lang, 2000, Faccio and Lang, 2002). While these empirical …ndings might appear to be consistent

with the view that groups arise to replace missing markets, they also raise a puzzling question.

One of the key di¤erences between a pyramidal structure and alternative organizational structures

such as a horizontal one is that the pyramid exacerbates the potential for agency con‡icts between

minority shareholders and the family who retains control. One example is suggested by the evidence

in Bertrand, Mehta, and Mullanaithan (2002). Since controllers have incentives to tunnel cash ‡ows

from …rms where they have low cash ‡ow stakes to …rms where they have higher stakes, pyramids

will automatically create an incentive for the controlling family to expropriate shareholders of …rms

that are at the bottom of the pyramid. But if this is the case, then why is the pyramid created

in the …rst place? Even if there is some value-enhancing reason for the …rm to be controlled by

the family, why can’t the family control the …rm directly through a horizontal structure that will

presumably reduce the incentives to steal?

8

The conventional wisdom for the existence of pyramids gives up value-enhancing explanations

altogether. The traditional view starts from the assumption that families value control because

of the large private bene…ts associated with it. With a pyramidal structure, a family can obtain

control of a …rm with a small equity stake. Furthermore, a family might want to reduce its equity

holdings while retaining control, in order to maximize the amount of capital under its control

(Hilferding, 1910, and Berle and Means, 1932). An alternative, but similar argument is that when

private bene…ts of control are large families might elect to maintain a lock on control by separating

cash ‡ow from voting rights (Bebchuk, 1999, and Bebchuk, Kraakman, and Triantis, 2000). The

common element in these stories is that pyramids are cost-e¢cient mechanisms used by families to

retain control of voting rights. In particular, even though the literature does not explicitly make

this point, pyramids could be more e¢cient than horizontal structures because a group of …rms can

be controlled in a pyramid with a lower aggregate cash ‡ow stake.

Some authors have argued that this traditional view of pyramids can potentially explain why

pyramids are observed in countries with low investor protection, if these are also countries where

private bene…ts of control are particularly large (Bebchuk, Kraakman, and Triantis, 2000). The

evidence in Bertrand, Mehta, and Mullanaithan (2002) can also be consistent with this view of

pyramids. If they are set up to extract private bene…ts of control, the fact that there is tunneling

in pyramids is not particularly surprising.

However, there are several reasons why a more complete theory of pyramids is needed. Notice

that the traditional story revolves around the notion of control of voting rights. If pyramids are

simply a way of minimizing the monetary cost of achieving control, one should never observe cash

‡ow stakes in individual …rms that are larger than the treshold that is required to achieve control.

This treshold is at most 50%, but will in general be lower than that.10 Thus, one can only argue

that pyramids enhance family control above their cash ‡ow stake when the family’s cash ‡ow stake

in individual …rms is su¢ciently low.

A major problem for the conventional story is the substantial empirical evidence that in many

countries families do hold large cash ‡ow stakes in the …rms that they control through pyramids.

For example, Franks and Mayer (2002) …nd that in 2/3 of the …rms controlled through a pyramid,

10For example, La Porta, Lopez-de-Silanes and Shleifer (1999) use empirical control thresholds of 10% and 20%.Deviations from one-share-one-vote reduce these thresholds even further.

9

the cash ‡ow and voting stake of the controlling shareholder does not straddle a control treshold.

That is, the controlling shareholder could have achieved the same level of control by simply holding

shares directly in the …rm (see also Emmons and Schimd 1998). Valadares and Leal (2001) conclude

that, in Brazil, pyramids are not used to deviate from one-share-one-vote. In a study of ownership

and control of Chilean …rms, Lefort and Walker (1999) …nd that the controlling shareholder owns

more cash ‡ow than necessary to achieve control. They compute the ultimate cash ‡ow ownership

of the controlling shareholder in all the members of a pyramidal group and …nd this ‘integrated’

ownership to be 57%. Thus, the separation of ownership and control achieved through pyramids

is minimal. Attig, Fischer and Gadhoum (2003) …nd that, in Canada, the cash ‡ow stake of the

controlling shareholder in a pyramid is, on average, 31.78% while the controlling stake is only a bit

higher, 41.68%.

Another problem for the traditional story is that pyramids are not always the only available

method to separate cash ‡ow rights from voting rights. For example, in the extreme case in which

there are no restrictions to the use of dual-class shares, any degree of separation is possible by a

combination of direct holding and dual-class shares. In such a case, why would a family choose

to control a …rm through a pyramid instead of issuing dual-class shares? In fact, if the issue is to

control the maximum possible number of …rms with the minimum amount of capital, why waste

valuable capital in the form of high capital investments in individual …rms instead of issuing non-

voting shares? Nevertheless, the evidence compiled by La Porta, Lopez-de-Silanes and Shleifer

(1999) suggests that pyramids are much more common throughout the world than the direct use of

shares with superior voting rights. This evidence suggests that considerations other than control

rights motivate the creation of pyramidal groups.

In many countries there are restrictions to the use of dual class shares. If these restrictions

exist, there will be an upper bound to the deviation that can be achieved by a combination of

direct holdings and dual-class shares, and the conventional story would predict that pyramids

would appear when the desired deviation from one-share-one-vote is above the maximum allowed

by the law. Empirically, however, we do observe the use of pyramids to separate cash ‡ows and

votes below the upper bound achieved by a combination of direct holding and dual-class shares.

For example, Bianchi, Bianco, and Enriques (2001) measure the ultimate ownership in each …rm

of a pyramid and compute the number of units of capital that the controlling shareholder controls

10

with one unit of his own capital. Then they average this ratio for all the …rms in a pyramid. As a

benchmark, consider a family who holds directly 50% of the cash ‡ows and votes in a …rm. In this

case the ratio is 2. The family can increase this ratio because Italian law allows the issuance of

50% of the …rm’s capital in non-voting shares (savings shares or azioni di risparmio). If the family

uses the maximum fraction of dual class shares and retains 50% of the voting shares (i.e., 25% of

the total capital), it can achieve a ratio of 4. Bianchi, Bianco and Enriques …nd that, while some

pyramids allow the controlling shareholder to control a large amount of capital (e.g., the ratio for

the De Benedetti group is 10.33 and that for the Agnelli group is 8.86), the ratio for other groups

is below 4, and sometimes even below 2 (e.g., for the Berlusconi group, it is 3.66, and for the Pirelli

group it is only 1.95). Volpin (2002) also …nds that, although in the level 3 and above in a pyramid

the separation of ownership and control is very large (CF=6%, V=42.6%), in the second level of a

pyramid the separation (CF=22.5%, V=52.5%) is not high relative to what it could be achieved by

a combination of direct holding and dual class shares. Again, this evidence suggests that there must

other reasons for the existence of pyramids, which are not captured by the traditional argument.11

A partial explanation based on regulatory or tax considerations might help explain some of this

empirical evidence.12 Indeed, regulatory and tax considerations such as taxes on inter-company

dividends do seem to a¤ect the incidence of pyramidal structures (Morck, 2003). Other countries

impose explicit restrictions to prevent some type of …rms (e.g. banks, pension funds, etc.) from

obtaining a controlling interest in others (see Roe, 1991, for the case of the US). These are not

complete theories as the bene…t of pyramids is not spelled out. A related argument suggested in

the literature is that pyramidal structures can be used as an elusive tool to hide the identity of the

ultimate owner from either the market or the state (Bianchi, Bianco, and Enriques, 2001). Our

theory in this paper will not rely on tax and regulatory considerations, even though we believe

these to be important in the real world.

11 In addition, Claessens, Djankov and Lang (2000) and Faccio and Lang (2002) …nd that pyramids are used moreoften than dual class shares even in countries (such as Italy, Brazil and Sweden) where the restrictions on dual classshares do not always bind.12Gomes (2000) also suggests a rationale for why separation of cash ‡ow and voting rights might appear in countries

with poor investor protection in a model where reputation e¤ects are an important governance mechanism. However,and similarly to the traditional view of pyramids, his model cannot explain why pyramids appear to be more commonthan direct issuance of dual-class shares.

11

2.3 Related theory

The argument that business groups arise as a substitute for missing …nancial markets is directly

related to the idea that internal capital markets in conglomerates mitigate the e¤ect of external

…nancing constraints on the conglomerate’s divisions. There is a vast literature in corporate …nance

which models and tests this idea (see Stein, 2001, for a survey).13 While some aspects of our theory

can be thought of as applications of these ideas to the question of pyramidal business groups, we

believe that there are also important di¤erences in the arguments. The most important di¤erence is

that the literature on internal capital markets does not focus on the question of optimal ownership

structure, which is so crucial for the analysis of pyramidal business groups. In other words, the

literature on internal capital markets considers only the question of whether a division should be

a part of a conglomerate with other divisions, or a stand-alone …rm. This corresponds to one of

the questions that we need to answer when building a model of pyramidal business groups - why

business groups arise. However, the existing literature on internal capital markets has little to say

about the second question that we analyze in this paper - why business groups are organized as

pyramids, as opposed to a horizontal structure.

This second question may seem to be more directly related to the literature on the optimal

design of organizations (see Harris and Raviv, 2002, and Stein, 2002, for recent contributions

that include surveys of this literature). One might conjecture, for example, that a horizontal

structure is a “‡atter” organizational structure than a pyramid, which looks more hierarchical.14

In this literature it might matter whether organizations are ‡atter or more hierarchical because,

for example, of frictions in the ‡ow of information among decision-makers. While we do not deny

the possibility that these considerations might have some explanatory power for the structure of

pyramidal business groups, our approach in this paper follows a very di¤erent route. In fact, we

assume that …rm productivity (gross of diversion costs) is independent of whether business groups

are organized as pyramids or horizontal structures. We believe this is a reasonable assumption,

especially when the controlling family e¤ectively controls and manages all the di¤erent …rms in

13Stein (1997), Fluck and Lynch (1999), and Inderst and Muller (2003) are a few theoretical papers that are mostclosely related to our approach.14 In particular, a pyramid might represent a hierarchy in which managers at the lower levels of the pyramid are

subordinates of managers at upper levels.

12

pyramids and horizontal structures.15 In such a case, it is not clear whether there are any signi…cant

di¤erences in the ‡ow of information (for example) between the pyramidal and the horizontal cases.

Furthermore, because the empirical evidence indicates very strongly that new …rms are always set

up at the bottom of the pyramid, in order to explain pyramidal ownership as a hierarchy one would

require an organization theory that predicts that managers of new …rms are always subordinated

to the previous managers.

3 Pyramidal and horizontal structures

We introduce now our basic model. Consider the following set up, which has three dates. At date

0 a family (F) sets up a …rm A, keeping a fraction ® of its shares. At date 1, …rm A generates

a free cash ‡ow equal to c, and the opportunity to set up another …rm (…rm B) arises. In this

section we assume that ® and c are given, and that the family is the only possible owner of …rm

B. The latter assumption e¤ectively means that …rm B must belong to a business group that is

controlled by family F. Later we consider the e¤ect of a potential competitor that can own …rm

B, and discuss properties of the optimal contract at date 0, especially whether the family wishes

to rule out pyramids or not. We also assume that the family has all its wealth invested in …rm

A. We analyze the optimal ownership choice of …rm B, that is, whether B will be owned by the

shareholders of …rm A (the pyramidal case), or whether it will be owned directly by the family (the

horizontal structure). In both cases the family can issue new shares in B if it chooses to.

Firm B requires an investment of i and generates revenue of r at date 2. We assume that r > i,

that is, the project is positive NPV. F decides whether to set up …rm B and which structure to use

(pyramidal or horizontal). In a pyramidal structure, F sets up …rm B as a subsidiary of …rm A and

thus can use the cash c in …rm A to set up …rm B. To raise more funds, F can sell additional shares

of …rm B. In an horizontal structure, F sets up …rm B himself. He has access only to his personal

wealth of ®c: In this case, he can also sell a stake in …rm B to raise additional funds.

We assume that F always retains control of …rm B, irrespective of its cash ‡ow stake in B.

This would be the case, for example, if there was no legal limit to the deviation from one-share-

one-vote that can be achieved with dual class shares. This assumption is meant to rule out the

15La Porta, Lopez-de-Silanes and Shleifer. (1999) have direct evidence that the controlling family participatesdirectly in the management of most …rms controlled through pyramids.

13

advantage that the conventional argument have attributed to pyramids, namely, that they allow

a larger deviation from one-share-one-vote than dual class shares and hence allow a family to

control a …rm with a smaller cash ‡ow stake. Without legal restrictions on dual-class shares, F can

control …rm B in a horizontal structure holding as little a stake as it wants. Thus, in our setting,

the conventional argument is mute as to the choice of structure. While counterfactual in some

countries, this assumption allows us to concentrate on the trade-o¤ that we want to emphasize. We

introduce legal limits to dual class shares in section ?? (to be done).

Control allows F to divert cash from …rm B into his pocket. At date 2, and regardless of the

structure, F can divert funds from …rm B at a cost. We assume that this cost depends on the

amount diverted and on the level of investor protection. In particular, when F diverts a fraction d

of the cash ‡ows he pays a cost (one can think of this as waste involved in the diversion process)

that equals a fraction kd2=2 of cash ‡ow of the …rm. Along the lines of Shleifer and Wolfenzon

(2002), we interpret the parameter k as the level of investor protection: a high cost of diversion

(high k) corresponds to good investor protection.16

The crucial trade-o¤ that we analyze is the following. In the pyramidal structure, F has more

internal cash (the entire c) to invest. However, even though the potential NPV of the project is the

same in the two structures (r ¡ i), the actual NPV that is captured by the family can be di¤erentbecause of two reasons. First, the existing shareholders of …rm A will be entitled to a share (1¡®)of the non-diverted fraction of the payo¤ (fraction 1¡ d). If there is no diversion, for example, theexisting shareholders will get (1¡ ®)(r ¡ i) out of the new project. However, in general there will16We assume that the diversion opportunities are the same regardless of the structure F uses. In a more complete

model, this assumption could be derived from the assumption that F retains control. Control confers F the right totake a number of actions not ruled out by a previous contract or the law. If F retains control regardless of structure,his set of feasible actions and hence his diversion opportunities should be the same. A di¤erent assumption, implicit indiscussion about pyramids (perhaps inspired by the structures’ ‘geometry’) is that, while in the horizontal structurediversion from …rm B goes to the F’s pocket, in the pyramidal structure the diverted must go to …rm A. First,empirically, it is not always the case that diversion goes to the …rms at the top of the pyramids. In some instances,controlling families divert resources from …rms in the pyramid to themselves (in the form of high salaries, for example).It is also the case that families divert from …rms in the pyramid to other …rms they hold that are not part of thepyramid (quote see FT on Turkey). In any case, empirical evidence on actual diversion (an equilibrium outcome) cannot shed light on the diversion opportunities, which is what our assumption is about. As an example assume thatcontrol confers F the right to sell the output of …rm B to any …rm at any price. Our assumption simply states that,since F controls …rm B regardless of the structure he chooses, he also has this right in both structures. Consider nowthe actual diversion level. Since in a pyramid the stake of F in …rm B is smaller, F will have an incentive to sell theoutput of B to …rm A at below market prices (a form of diversion). In an horizontal structure, however, the relativesize of F’s stake in …rm A and …rm B is not mechanically determined. If for example, the stakes are of roughly thesame size, F will have no incentives to price the output of …rm B either above or below the market price. Thus, thefact that we observe actual diversion from …rm B to …rm A in a pyramid but not in an horizontal structure does nottell us anything about diversion opportunities.

14

be diversion in equilibrium, and the amount of diversion will depend on the ownership structure.

In particular, for a given direct ownership level that the family has in …rm B diversion will be

greater in the pyramid, because of the chain of control - if direct ownership in …rm B is equal to ¯,

the ultimate ownership in the pyramidal case is ®¯. Thus, diversion will tend to be greater in the

pyramid.

To sum up, the pyramidal structure increases the availability of internal funds but it may

decrease the NPV of the project and the fraction of the security bene…ts that is captured by the

family. The horizontal structure decreases …nancing capacity,17 but it also decreases diversion

(increasing the NPV net of diversion costs), and ensures that the family will capture the entire

NPV of the project.

We solve the model by working backwards.

3.1 Date 2

At date 2, F can divert from …rm B to its pocket. Assuming the “ultimate” (taking into account

the chain of control) cash ‡ow stake of F is x, then at date 2 F maximizes:

maxd2[0;1]

x(1¡ d)r + dr ¡ kd2

2r;

The FOC is: ¡x+ 1¡ kd ¸ 0 (with strict inequality only if x = 1) and so:

d(x) =

½1¡xk if k ¸ 1¡ x1 k < 1¡ x : (1)

The level of diversion decreases with ultimate ownership concentration and with the level of investor

protection. For future reference, we de…ne NPV (x) = r ¡ i¡ kd(x)2

2 r as the total value generated

by …rm B when the ultimate ownership concentration is x: Note that NPV 0(x) > 0 because higher

ownership concentration reduces diversion and consequently reduces the total cost of diversion.

17 If F could costlessly divert the cash of …rm A and use the entire c to set up his own …rm, the horizontal structurewould clearly dominate. However, at date 0, it would be impossible for F to sell any shares of …rm A as investor wouldexpect to get nothing in return. As a result all …rms would be …nanced entirely by F, with no outside shareholders.We could consider an intermediate case in which F could divert at a cost some of the cash of …rm A. In this case,however, the results would be similar to those we have, since the same trade-o¤ would appear. In a pyramidalstructure, F would have access to the entire cash c; and in an horizontal structure, F would have access to less cash(possibly, however, more than ®c due to diversion).

15

3.2 Date 1

We next analyze the choice of ownership structure at date 1. For each structure (horizontal or

pyramidal), we calculate F’s payo¤ and also whether the structure is feasible (that is, whether F

can raise su¢cient funds to set up …rm B). We then characterize the region of the parameter space

over which each structure is chosen.

3.2.1 Horizontal

F has cash of ®c available from his ownership in …rm A. He sells 1 ¡ ¯H of …rm B to the public

and keeps a fraction ¯H (his ultimate ownership). Investors expect F to divert a fraction d(¯H) of

the cash ‡ows. The total cash available to F for investment is:

RH = ®c+ (1¡ ¯H)(1¡ d(¯H))r: (2)

There are two opposing e¤ects of increasing ¯H : On the one hand, the direct e¤ect is that fewer

shares are sold, reducing the amount collected. On the other hand, with higher ownership con-

centration, diversion is lower and as a result the price per share is higher, increasing the amount

collected. We show in the appendix that RH is concave, …rst increasing with ¯H and decreasing.

The maximum …rm size that can be …nanced in an horizontal structure is given by

RH = max¯H2[0;1]

RH =

½®c+ k

4r if k · 2®c+ (1¡ 1

k )r if k > 2:

The function RH attains its maximum when

¯H =

½1¡ k

2 if k · 20 if k > 2

:

Conditional on RH ¸ i; F’s payo¤ at the end of date 2 is given by:

UH = ®c+ r ¡ i¡ kd(¯¤H)

2

2r (3)

= ®c+NPV (¯¤H)

where ¯¤H is the optimal ownership concentration.18 F gets the entire NPV of the project and incurs

the costs associated with diversion. To minimize diversion (i.e., maximize his payo¤), he chooses

18We use an upper bar for variables associated with maximizing the amount raised, and an asterix to denotevariables associated with maximizing F’s payo¤.

16

the highest level of ¯H that satis…es RH(¯H) ¸ i: By continuity of RH(); ¯¤H is the maximum

solution to

RH(¯¤H) = i (4)

3.2.2 Pyramidal

There is a stock of cash equal to c available in …rm A. Firm A sells ¯P of …rm B to raise more cash

and retains 1¡ ¯P : The ultimate ownership of F in …rm B is then ®¯P and so diversion is given

by d(®¯P ): The total cash available to F for investment is:

RP = c+ (1¡ ¯P )(1¡ d(®¯P ))r: (5)

As in the case of RH ; RP …rst increases and then decreases with ¯P :

The maximum amount that can be raised in a pyramidal structure is given by

RP = max¯P2[0;1]

RP =

8<:c if k < 1¡ ®

c+ (®¡1+k)24®k r if 1¡ ® · k · 1 + ®

c+ (1¡ 1k )r if k > 1 + ®

The corresponding values of ¯P that maximize RP are:

¯P =

8<:1 if k < 1¡ ®

1+®¡k2® if 1¡ ® · k · 1 + ®0 if k > 1 + ®

Finally, conditional on RP > i; F’s payo¤ is

UP = ®¯¤P (1¡ d(®¯¤P ))r + d(®¯¤P )r ¡

kd(®¯¤P )2

2r (6)

= ®c+ ® [(1¡ d(®¯¤P ))r ¡ i] +·d(®¯¤P )¡

kd(®¯¤P )2

2

¸i (7)

= ®c+NPV (®¯¤P )¡ (1¡ ®)[(1¡ d(®¯¤P ))r ¡ i]

where ¯¤P is the optimal direct ownership concentration when the pyramidal structure is chosen.

As in the case of the horizontal structure, ¯¤P is the maximum solution to:

RP (¯¤P ) = i (8)

The second line of the equation 6 follows by using the above equation.

The expression in equation 6 makes it clear what are the di¤erences in payo¤ between the

pyramid and the horizontal structures. In a pyramidal structure the family gets the NPV of the

17

Pyramidal structure Horizontal structure

Stake Contribution Stake Contribution

Family ®¯P ®c Family ®¯P ®cNon-familyshareholders of A

(1¡ ®)¯P (1¡ ®)c (1¡ ®)¯P (1¡ ®)¯PMVNew directshareholders

1¡ ¯P (1¡ ¯P )MV New directshareholders

+ 1¡ ¯P + (1¡ ¯P )MV1¡ ®¯P (1¡ ®¯P )MV

Table 1: Types of shareholders in …rm B according to structure

project but, since …rm B is set up as a subsidiary of …rm A, F shares part of the value with the

shareholders of …rm A. Since shareholders of …rm A do not participate in the diversion proceeds,

they just get their share (1¡®) of the security bene…ts (i.e, bene…ts after expropriation) created by…rm B. Notice that these bene…ts are not necessarily positive when diversion occurs in equilibrium.

Furthermore, the NPV of the project net of diversion costs is endogenous, and depends on the

ultimate ownership stake held by the family. Of course, the family’s payo¤ in equation 6, as

well as in equation 3, are conditional on the project being taken. The choice between pyramidal

and horizontal structures will also be driven by di¤erences in the …nancing capacity in the two

structures, RH and RP .

3.2.3 Choice of structure

We …nd the optimal structure by comparing F’s payo¤ for all feasible structures. We …rst state

a number of results and then fully characterize the region of the parameter space for which each

structure is chosen.

Result 1 When the pyramid is chosen, shareholders of …rm A realize a negative return. That is,

when the pyramid is chosen the market value of …rm B (1¡d(®¯¤P ))r; is smaller than the investmenti:

The proof of this result, as well as all other proofs, is in the appendix.

The intuition behind this result lies in the shareholder distribution in a horizontal and a pyrami-

dal structure. Table 1 shows this distribution assuming that the ultimate ownership of the family in

…rm B is the same in both structures, that is, ¯H = ®¯P . This assumption makes the comparison

relevant, as it keeps the value of …rm B constant across structures.

18

When …rm B is a pyramid, it has three types of shareholders: the family, non-family shareholders

of …rm A, and new direct shareholders. New direct shareholders hold a fraction 1¡ ¯P of …rm B

and pay a fair price for their shares, (1¡¯P )MV , where MV = (1¡ d(®¯P ))r is the market valueof …rm B. Non-family shareholders of …rm A hold a stake (1¡®)¯P of …rm B. Since F uses the cashc in …rm A to set up the …rm, these shareholders, in e¤ect, pay (1¡ ®)c for their stake. Similarly,the family holds a fraction ®¯P of …rm B and pays ®c for it.

In an horizontal structure there are only two types of shareholders in …rm B: the family and

new direct shareholders. New shareholders buy a stake of 1 ¡ ¯H = 1 ¡ ®¯P in …rm B and pay

a fair price for it, (1¡ ®¯P )MV: To allow easy comparison, table 1 decomposes the stake of newdirect investors in two stakes of (1¡ ®)¯P and 1¡ ¯P : The family retains a fraction ¯H = ®¯P of…rm B and contributes all his wealth to this …rm, i.e., ®c:

Comparing line by line in the table, it can be seen that the di¤erence between structures is

that while in horizontal structure the stake (1 ¡ ®)¯P is sold at market prices, in the pyramidalstructure, this stake is e¤ectively sold to non-family shareholders of …rm A at a price of (1¡ ®)c:

The result above can now be explained. Suppose the price paid by non-family shareholders of

…rm A is below the market price (that is, c < ¯PMV ). These will be the only cases when the

existing shareholders will end up with a positive gain in the pyramidal structure. We will show

here that in such a case, the family always chooses the horizontal structure. First, notice that the

family prefers the horizontal structure in this case because it does not need to share the positive

non-diverted NPV with non-family shareholders of …rm A. In addition, the family can also invest

more with the horizontal structure, because new shareholders will pay more for the stake (1¡®)¯Pthan do non-family shareholders of …rm A (since ¯PMV > c). Thus, every time the non-family

shareholders of A gain in the pyramidal structure, the horizontal structure is both feasible and

better for the controlling shareholder.19

Next we describe the e¤ect of the parameters in the choice of structure.

Result 2 The pyramidal structure is less likely to be optimal when

² Firm B generates high revenues

19This intuitive proof implicitly assumes that the family issues new shares in both cases. However, the result holdsmore generally even when the family does not need to issue new shares.

19

² Firm B requires a small investment

² Firm A generates higher cash ‡ows

² Investor protection is good

The intuition for this result can be derived from result 1. If, when set up in a pyramid, the

market value of …rm B is above the necessary investment (that is, (1 ¡ d(®¯¤P ))r ¸ i), then our

previous result shows that the pyramidal structure will not be chosen because the family prefers

to keep the entire NPV of the project by using a horizontal structure. Now, if the revenue of …rm

B is high, the inequality 1¡ d(®¯¤P ))r ¸ i will be relaxed for two reasons. First, keeping diversionconstant higher revenues translate into higher cash ‡ows for …rm B’s shareholders. Second, when

revenues are high the family needs to sell a smaller cash ‡ow stake in …rm B. This increases

ownership concentration and reduces diversion also contributing to the increase in market value.

When the required investment in …rm B decreases, it is more likely that the market value of

…rm B is above the investment required for two reasons. The …rst is the obvious decrease in the

investment. The second is that when the investment required goes down the family needs to sell

a smaller cash ‡ow stake to raise the necessary funding. As a result, ownership concentration

increases and so does the market value.

The e¤ect of higher c on the availability of internal funds is stronger in the pyramidal case. As

a result the pyramidal structure becomes more attractive.

Finally, when investor protection increases diversion is smaller for any level of ownership con-

centration. As a result, the market value of B is higher and so it is less likely that the value is

below its investment requirements.

Figure 1 shows the regions of the parameter space over which each structure is chosen. The

pyramidal structure is chosen for low levels of r and high levels of i. When investor protection

increases (not shown in the …gure), the curve that separates the regions where horizontal and

pyramidal structures are chosen shifts down, e¤ectively reducing the area over which the pyramidal

structure is chosen.

This …gure also shows a region over which the pyramidal structure is the only feasible structure.

This region exists when the stock of cash c is high enough. Still in this range existing shareholders

of …rm A must lose when the family makes the new investment. However, the existence of such a

20

region makes it possible that the pyramid is e¢cient for the “coalition” of the controlling family

and the minority shareholders. This happens because the full NPV of the project, net of diversion

costs, might be positive and so the total value generated by the pyramidal investment is positive.

(we haven’t proved yet that the total NPV, net of diversion costs, can really be positive in this

region but we are almost sure this will hold).

In addition to the choice of structure, our model has implications for the ultimate ownership in

each structure, when it is chosen in equilibrium. Of course, whether a structure is chosen or not

is a function of the underlying parameters of the model. Here we focus on variation in ownership

choices induced by the revenue of …rm B (r).

Result 3 Suppose there are a number of families that di¤er only with respect to the parameter

measuring …rm B’s revenue, r, and investment required. Variation in r and i leads to corresponding

variation in ownership structures, with di¤erent ownership concentrations. Ultimate ownership

concentration in any of the resulting pyramidal structures is lower than that of any of the resulting

horizontal structures.

Note that this result is stronger than simply comparing the ultimate ownership for given param-

eter values. It says that the observed ultimate ownership should be lower in a pyramidal structure.

Note also that this result is not trivial. In the pyramidal structure the chain of control mechani-

cally reduces the ultimate ownership. However, the fact that the pyramid has more funds to invest

(compared to the horizontal structure) allows for a higher direct ownership in …rm B. The above

results show that the …rst e¤ect dominates.

It should also be the case that direct ownership will be larger in the pyramid, but we haven’t

proved it yet. In both structures you need to raise i. The horizontal structure has greater …nancing

needs, and thus from this e¤ect we get that direct ownership in the horizontal structure is lower.

However, there is a countervailing e¤ect. The pyramidal needs less external money but it is worse

at raising money than the horizontal (since the chain of control reduces share prices). When c is

large, it is very likely that the direct ownership in the pyramidal is higher.

21

4 Business groups

We have de…ned a business group as an organization in which a family owns and controls more

than one …rm. In the last section we assumed that F is the only party with the ability to set up

…rm B. This e¤ectively means that we assumed the existence of a business group. In this section

we investigate the conditions under which a business groups arises.

We modify the model by assuming that at date 1 there is an entrepreneur with the ability to set

up …rm B. The set up cost of …rm B for the entrepreneur is also i: In order to capture the possibility

that the family may not be the best owner of …rm B, we assume that the entrepreneur is a better

manager than the family, so that under his control revenues of …rm B are (1 + t)r with t > 0.20

Assume also that the entrepreneur’s wealth is zero. This assumption intends to capture the idea

that the family has higher …nancing capacity than the entrepreneur, because of the accumulation

of internal funds in existing …rms owned by the family. Recall that we also assumed that the

family’s outside wealth is equal to zero, so this assumption really means that the entrepreneur is

less wealthy than the family. We believe this to be the empirically relevant case.

We assume that the market in question only allows for one …rm, and that if the entrepreneur can

raise su¢cient funds he will be the only one to enter the market because of his higher productivity.

If he cannot raise the necessary funds, then the family sets up …rm B using any of the two structures

as described in the last section.

Result 4 Business groups are less likely when investor protection is good and when the pro…tability

of …rm B is high.

The comparative advantage of the family is that they have accumulated wealth, and thus need

to rely less on external capital markets. As investor protection improves, the comparative advantage

of the family eventually disappears and the entrepreneur is able to set up his …rm. The entrepreneur

is also better able to set up the …rm when the …rm is very pro…table, since the higher the revenues

the more the entrepreneur can raise in the external market.

Recall that in section 3 we argued that the horizontal structure is chosen when …rm B is

pro…table and when investor protection is higher. However, it is precisely in these situations that

20The case when the family is the best owner is trivial, because the family also has more …nancing capacity and sowill always own …rm B.

22

the entrepreneur can set up the …rm. As a result, competition from an entrepreneur reduces the

region of the parameter space over which a business group uses an horizontal structure.

Result 5 Whenever they arise, business groups are more likely to use pyramidal structures than

horizontal ones.

Figure 2 shows the region of the parameter space over which the entrepreneur sets up a …rm,

and the region over which the business group is formed. As the talent t of the entrepreneur,

investor protection or the revenue r increase, the region over which the entrepreneur arises grows

and that over which the business group is formed is reduced. One can also see from the …gure that

the area that used to correspond to the horizontal structure starts to disappear …rst. Thus, the

model suggests that the conditions that are conducive to the formation of business groups are also

conducive to the formation of pyramids.

5 Ruling out pyramidal structures

Section 3 shows that whenever the pyramidal structure is chosen, the shareholders of …rm A lose.

In addition, the ultimate ownership structure in a pyramidal structure is lower than that in an

horizontal structure. This leads to higher diversion levels in a pyramid. These results are consistent

with a growing empirical literature showing signi…cant expropriation of investors in …rms that

belong to pyramidal structures. On the other hand, these results beg the question of why pyramids

are chosen in the …rst place. After all, rational investors who anticipate expropriation discount the

price of the shares they buy. Even more problematic is the fact that pyramidal structures arise in

countries with poor investor protection. It is precisely in these countries that the family would like

to reduce expropriation.

The …rst response that we have to these questions is that families do not set up all their …rms

at one point in time, but rather do it over time.21 Thus, they may face a time inconsistency

problem. Once their …rst …rm is set up and they have sold some of the shares, they will be facing

the type of incentives described in section 3. When projects with low pro…tability or high …nancing

requirements arise, they will prefer to set them up in a pyramidal structure. If their existing

…rms have accumulated enough internal funds, they will be able to …nance them. Absent a good

21This also seems to be the case in the data, as we discuss in section 6.

23

commitment mechanism the family will end up using pyramids. In addition, as shown in section 3,

they will be more likely to do so in poor investor protection countries.

The family can solve the time inconsistency problem by committing not to set up a pyramid.

There is, however, a cost of doing this. In particular, as shown in …gure 1, there is a region of

the parameter space over which the only feasible way to set up the …rm is by using the pyramidal

structure (need to prove NPV can be positive in this range). Thus, committing not to use a pyramid

will raise the price of shares of …rm A, but at the expense of a reduced probability of undertaking

…rm B.

6 Empirical Implications

Our theory generates a number of empirical implications, which we list and discuss here. We also

mention empirical and anecdotal evidence that is consistent with our theory, and suggest some

additional implications that can be tested in the future.

1. Family business groups will be more prevalent in countries with poor investor

protection

This implication is a direct consequence of the “value-enhancing” part of our theory (business

groups arise to substitute for missing …nancial markets). We believe there is no systematic evidence

about this, but there is some scattered and anecdotal evidence. For example, La Porta, Lopez-de-

Silanes and Shleifer (1999) show that pyramids (though not necessarily all types of business groups)

are more prevalent in countries with poor investor protection. Claessens et al. (2002) show that in

several developing Asian countries more than 50% of the …rms in their sample are a¢liated with

business groups.

2. Business groups are more likely to be organized as pyramids

In our model, the conditions that are conducive for the choice of pyramids over horizontal

structures (such as low investor protection), are also the same conditions that make business groups

more likely. Thus when business groups tend to arise, they are likely to be organized as pyramids.

Again, there is no systematic evidence on this point. Perhaps the best evidence that groups are

organized as pyramids is the fact that researchers have treated these two terms as synonymous.

24

3. It is possible to observe pyramids where the controlling family has high cash

‡ow stakes in member …rms, and thus the separation between ownership and control

is not large

In our model the controlling family has an incentive to issue as few new shares as possible when

undertaking new investments, in order to minimize diversion. This is true both when horizontal

structures or pyramids are chosen. It is also true for una¢liated …rms. However, direct ownership

in pyramidal …rms tend to be particularly high because the pyramid needs to raise the least amount

of capital.

Thus, our model is consistent with the substantial empirical evidence that in many countries

families hold large cash ‡ow stakes in the …rms that they control through pyramids (see also the

discussion in sections 1 and 2). Furthermore, it is not surprising to …nd that pyramids arise even

when families did not exhaust the possibility of issuing dual-class shares to separate ownership and

control. In our model, pyramids are not used to separate ownership and control.

4. Diversion is higher in …rms placed at the bottom levels of the pyramid, than in

…rms controlled directly by the family through horizontal structure and …rms at the

top of the pyramid

In our model, the amount of diversion is determined by ultimate ownership of cash ‡ow rights

(as opposed to direct ownership), taking into account the e¤ect of the pyramid’s chain of control.

We show that every time the pyramid is chosen the ultimate ownership of the family in new …rms

will be lower than in horizontal structures, and thus diversion is higher. Despite the fact that

pyramids replace missing markets in our model, they are associated with high levels of diversion.

This prediction is consistent with empirical …ndings that pyramids increase incentives to expropriate

(Bertrand, Mehta, and Mullanaithan 2002).

One caveat is that the comparison with stand-alone …rms is not entirely clear cut, because these

…rms need to raise more funds in order to invest, and thus may be pushed towards lower ultimate

ownership.

5. Performance will tend to be lower in …rms that are owned through pyramids,

than in una¢liated …rms or horizontal structures

Our model predicts that projects of lower pro…tability will be undertaken inside pyramids.

25

Thus, even when the pyramid does not have a direct negative e¤ect on performance, one could

observe a negative relationship between measures of …rm value such as Tobin’s Q and pyramidal

membership. As we discussed in section 2, there is evidence in the literature that …rms in business

groups organized as pyramids have lower Tobin’s Q (Claessens et al., 2002). Similar results are

found by Lemmons and Lins (2003), Lins (2003), Mitton (2002) and Joh (2003). There is also

evidence that low Q predicts pyramidal membership (Attig, Fischer and Gadhoum, 2003), which

is consistent with the idea that pyramids undertake lower pro…tability projects.

However, the evidence on the relationship between pyramidal ownership and value is not unam-

biguous. Some papers …nd little evidence of a correlation between membership in business groups

and value (Claessens et al., 2002), while others …nd a positive e¤ect for some …rms (Khanna and

Palepu, 2000). Our model suggests that the di¤erence in performance between pyramids and un-

a¢liated …rms should become more negative when the divergence between votes and cash ‡ows

is greater in pyramids (the chain of control e¤ect). This prediction is consistent with …ndings in

Claessens et al. (2002).

6. Firms in pyramids are larger or they are in capital intensive industries

As the required investment increases, projects are more likely to be undertaken inside pyramids

(as opposed to horizontal structures and una¢liated …rms). Attig, Fischer and Gadhoum (2003)

…nd evidence consistent with this implication, using Canadian data. Claessens et al. (2002) also

…nd that in East Asia group …rms tend to be larger than una¢liated …rms. Bianchi, Bianco and

Enriques (2001) …nd similar evidence for Italy.

7. Pyramids are created dynamically

Our model suggests that pyramids are created as families use retained earnings from existing

…rms to invest in new ones. Thus, pyramids evolve over time, as a function of the performance of

the existing …rms in the pyramid. There is some anecdotal evidence for this. Khanna and Palepu

(2000) claim that one of the most important role of groups is to set up new …rms in which the

family and the member …rms acquire equity stakes (p. 869). Aganin and Volpin (2003) describe

the evolution of the Pesenti group in Italy, and show that this group was created by adding new

subsidiaries to the …rms the Pesenti family already owned. Claessens et al. (2002) …nd that …rms

with the highest separation of votes and ownership (those at the bottom of the pyramid) are younger

26

than those with less separation (those at the top).

8. Pyramids are more likely to be created following good performance of existing

family …rms

In our model previous good performance of the existing group increases the chance that pyramids

arise, because of the crucial role of internal funds in the model. Again, there is some anecdotal

evidence for this. Aganin and Volpin (2003) suggest that, in Italy, business groups expand through

acquisitions when they are big and have signi…cant cash resources.

7 Concluding Remarks

(to be written...)

27

References

Aganin, A. and P. Volpin, 2003, History of Corporate Ownership in Italy, mimeo.

Aoki, M. (editor), 1984, The economic analysis of the Japanese …rm, North Holland.

Attig, N., K. Fischer and Y. Gadhoum, 2003, On the determinants of pyramidal ownership:Evidence on expropriation of minority interests, working paper, Laval University.

Bae, K. H., J. Kang, and J. M. Kim, 2002, Tunneling or value added? Evidence from mergers byKorean business groups, Journal of Finance, 2695-2740.

Bebchuk, L., 1999, A rent protection theory of corporate ownership and control, NBER workingpaper 7203.

Bebchuk, L., R. Kraakman, and G. Triantis, 2000, Stock pyramids, cross-ownership, and dual classequity, in Randall K. Morck, ed.: Concentrated Corporate Ownership (University of ChicagoPress, Chicago, IL).

Berle, A. and G. Means, 1932, The modern corporation and private property, McMillan.

Bertrand, M., P. Mehta and S. Mullanaithan, 2002, Ferreting out tunneling: An application toIndian business groups, Quarterly Journal of Economics, 121-148.

Bianchi, M., M. Bianco, and L. Enriques, 2001, The separation between ownership and control inItaly, mimeo, Bank of Italy.

Claessens, S., S. Djankov, J. Fan, and L. Lang, 2002, Disentangling the incentive and entrenchmente¤ects of large shareholdings, Journal of Finance 57(6), 2741-2771.

Claessens, S., S. Djankov, and L. Lang, 2000, The separation of ownership and control in EastAsian Corporations, Journal of Financial Economics 58, 81–112.

Claessens, S., S. Djankov, and L. Lang, 2002, The bene…ts of group a¢liation: Evidence fromEast Asia Working paper University of Amsterdam.

Emmons, W. and F. Schmid, 1998, Universal banking, control rights and corporate …nance inGermany, Federal Reserve Bank of St. Louis Review July/August, 19-42

Faccio, M., and L. Lang, 2002, The ultimate ownership of Western European Corporations, Journalof Financial Economics 65, 365-395.

Fluck, Z., and A. Lynch, 1999, Why do …rms merge and then divest? A theory of …nancial synergy,Journal of Business 72, 319-46.

Franks, J., and C. Mayer, 2001, Ownership and control of German corporations, Review of Fi-nancial Studies 14(4), 943-977.

Gomes, A., 2000, Going public without governance: managerial reputation e¤ects, Journal ofFinance 55, p. 615-646.

Harris, M. and A. Raviv, 2002, Organization design, Management Science 48, p. 852-865.

Inderst, R., and H. Mueller, 2003, Internal vs. external …nancing: An optimal contracting ap-proach, forthcoming, Journal of Finance.

28

Joh, S. W., 2001, Corporate governance and pro…tability: evidence fromKorea before the economiccrisis, forthcoming, Journal of Financial Economics.

Khanna, T., 1999, Business groups and social welfare in emerging markets: existing evidence andunanswered questions, forthcoming, European Economic Review.

Khanna, T., and K. Palepu, 1997, Why focused strategies may be wrong for emerging countries,Harvard Business Review 75, 41-51.

Khanna, T., and K. Palepu, 1999, The right way to restructure conglomerates in emerging markets,Harvard Business Review 77, 125-134.

Khanna, T., and K. Palepu, 2000, Is group a¢liation pro…table in emerging markets? An analysisof diversi…ed Indian business groups, Journal of Finance 55(2), 867-891.

Khanna, T., and J. Rivking, 1999, Estimating the performance e¤ects of groups in emergingmarkets, Harvard Business School working paper.

Khanna, T., and Y. Yafeh, 2001, Business groups and risk sharing around the world, HarvardBusiness School Working Paper.

La Porta, R., F. Lopez-De-Silanes, and A. Shleifer, and R. Vishny, 1997, Legal determinants ofexternal …nance, Journal of Finance 52, 1131-1150.

La Porta, Rafael, Florencio Lopez-de-Silanes, and Andrei Shleifer, 1999, Corporate Ownershiparound the world, Journal of Finance 54(2), 471-517.

Lefort, Fernando, and EduardoWalker, 1999, Ownership and capital structure of Chilean conglom-erates: Facts and hypotheses for governance, Working paper Ponti…cia Universidad Catolicade Chile.

Lemmon, Michael L., and Karl V. Lins, 2003, Ownership structure, corporate governance and…rm value: Evidence from the East Asian …nancial crisis, Journal of Finance xx, xx-xx.

Lins, Karl V., 2003, Equity ownership and …rm value in emerging markets, Journal of Financialand Quantitative Analysis xx, xx-xx.

Mitton, Todd, 2002, A cross-…rm analysis of the impact of corporate governance on the East Asian…nancial crisis, Journal of Financial Economics 64, 215-241.

Morck, Randall K., 2003, Why some double taxation might make sense: The special case ofinter-corporate dividends, Working paper University of Alberta.

Morck, Randal K., David A. Strangeland, and Bernard Yeung, 2000, Inherited wealth, corporatecontrol and economic growth: The Canadian disease, in Randall Morck ed.: ConcentratedCorporate Ownership (University of Chicago Press, Chicago, IL).

Pagano, M. and P. Volpin, 2001, The Political Economy of Finance, Oxford Review of EconomicPolicy.

Perotti, Enrico C., and Stanislav Gelfer, 2001, Red barons or robber barons? Governance andinvestment in Russian …nancial-industrial groups, European Economic Review 45, 1601-1617

Stein, J., 1997, Internal capital markets and the competition for corporate resources, Journal ofFinance 52, 111-133.

29

Stein, J., 2001, Agency, information and corporate investment, forthcoming in the Handbook ofEconomics and Finance, edited by G. Constantinides, M. Harris and R. Stulz.

Stein, J., 2002, Information production and capital allocation. Decentralized versus hierarchical…rms, Journal of Finance 57, p. 1891-1821.

Valadares, Silvia, and Ricardo P. C. Leal, Ownership and control structure of Brazilian companies,Working paper Universidade Federal do Rio de Janeiro.

Volpin, Paolo, 2002, Governance with poor investor protection: Evidence from top executiveturnover in Italy, Journal of Financial Economics 64, 61-90.

30

Appendix

Proof of Result 1Suppose not, that is, the pyramid is chosen and

(1¡ d(®¯¤P ))r ¸ i: (9)

Recall that ¯¤H and ¯¤P are de…ned (see equations 4 and 8) by RH(¯¤H) = i and RP (¯

¤P ) = i; respectively. From the

last equality

RP (¯¤P ) = c+ (1¡ ¯¤P )(1¡ d(®¯¤P ))r = i (10)

¯¤P (1¡ d(®¯¤P ))r = c+ (1¡ d(®¯¤P ))r ¡ i¯¤P (1¡ d(®¯¤P ))r ¸ c

Where the second line is obtained by rearranging terms and the last line follows from equation 9. Now,

RH ¸ RH(®¯¤P ) = ®c+ (1¡ ®¯¤P )(1¡ d(®¯¤P ))r= ®c+ (1¡ ¯¤P )(1¡ d(®¯¤P ))r + (1¡ ®)¯¤P (1¡ d(®¯¤P ))r¸ ®c+ (1¡ ¯¤P )(1¡ d(®¯¤P ))r + (1¡ ®)c= RP (¯

¤P ) ¸ i

where the …rst inequality follows because RH is the maximum value of RH(¯H), the second line follows from algebraicmanipulations, the third line uses equation 10, and the last line is the de…nition of RP : Two conclusions can be drawnfrom the above. First, RH ¸ i, that is the horizontal structure is feasible. Second, since RH(®¯¤P ) ¸ i andRH(¯

¤H) = i, then it follows from the fact that RH is decreasing that ¯¤H > ®¯

¤P :

Finally,

UH(¯¤H)¡ UP (¯¤P ) = NPV (¯¤H)¡NPV (®¯¤P ) + (1¡ ®)[(1¡ d(®¯¤P ))r ¡ i] > 0:Since ¯¤H > ®¯¤P then NPV (¯¤H) > NPV (®¯¤P )and (1 ¡ d(®¯¤P ))r ¡ i > 0 by equation 9. This is a contradictionsince the horizontal structure is feasible and yields a higher payo¤ to F.¥

Proof of Result 2In the text.¥

Proof of Result 3We show the result for variation in r: The proof for i is almost identical. Suppose that r is the treshold level

such that for r ¸ r the horizontal structure is chosen and for r · r the pyramidal structure is chosen (this is shownin Result 2). We let ¯¤H(r) and ¯

¤P (r) be the direct ownership chosen when revenue of …rm B is r in the horizontal

and in the pyramidal structure, respectively. Note that when r increases so do ¯¤H(r) and ¯¤P (r) since fewer shares

need to be sold to …nance a given investment.We …rst show that ¯¤H(r) ¸ ®¯¤P (r): By de…nition of r; F is indi¤erent between the two structures, that is

UH ´ ac+NPV (¯¤H(r)) = ®c+NPV (®¯¤P (r))¡ (1¡ ®)[(1¡ d(®¯¤P (r))r ¡ i] ´ UP :By Result 1, it must that [(1¡ d(®¯¤P (r))r¡ i] · 0 which implies that NPV (¯¤H(r)) ¸ NPV (®¯¤P (r)): Since NPV ()is increasing, then ¯¤H(r) ¸ ®¯¤P (r):

Take any set of parameters values (indexed by 1) for which the horizontal structure is chosen, and a di¤erentset of parameter values (indexed by 2) for which the pyramidal structure is chosen. By Result 2, it must be thatr1 ¸ r ¸ r2: Finally, ¯¤H(r1) ¸ ¯¤H(r) ¸ ®¯¤P (r) ¸ ®¯¤P (r2);where the …rst and third inequalities follow because both¯¤H(r) and ¯

¤P (r) are increasing and the second inequality was proved above.¥

31

Figure 1: Optimal structure in the (r,i) parameter space

r

i

r=iNo structure isfeasible

Horizontal not feasible

Pyramidal

Horizontal

Pyramidal not feasible

r

i

r=iNo structure isfeasible

Business group(Pyramidal)

Business group(Horizontal)

Entrepreneur enters

No business group

Improvement in investorprotection

Figure 2: Business group formation

A Theory of Family Business Groups and Pyramidal …...A Theory of Family Business Groups and...

Documents

Transcript of A Theory of Family Business Groups and Pyramidal …...A Theory of Family Business Groups and...