SCIENCE EUROPEAN JOURNAL OF OPERATIONAL...

18
ELSEVIER - Abstract Thispaperusestherelationshipsbetweenthreebasic,fundamentalandprovenconceptsinmanufacturing(resource commitmenttoimprovementprograms,flexibilitytochangesinoperations,andcustomerdeliveryperformance)asthe empiricalcontextforreviewingandcomparingtwocasualmodelingapproaches(structuralequationmodelingand Bayesiannetworks) .Specifically,investmentsintotalqualitymanagement(TQM),processanalysis,andemployee participationprogramsareconsideredasresourcecommitments .Thepaperbeginswiththecentralissueofthere- quirementsforamodelofassociationstobeconsideredcausal .Thisphilosophicalissueisaddressedinreferenceto probabilisticcausationtheory .Then,eachmethodisreviewedinthecontextofaunifiedcausalmodelingframework consistentwithprobabilisticcausationtheoryandappliedtoacommondataset .Thecomparisonsincludeconcept representation,distributionandfunctionalassumptions,samplesizeandmodelcomplexityconsiderations,measure- mentissues,specificationsearch,modeladequacy,theorytestingandinferencecapabilities .Thepaperconcludeswitha summaryofrelativeadvantagesanddisadvantagesofthemethodsandhighlightsthefindingsrelevanttotheliterature onTQMandon-timedeliveries . ©2003ElsevierB .V .Allrightsreserved . Keywords : Manufacturing ;TQM ;Deliveryperformance ;Causalmodeling ;Structuralequationmodeling ;Bayesiannetworks Availableonlineat www.sciencedirect .co m SCIENCE DIRRCT® EuropeanJournalofOperationalResearch156(2004)92-109 Production,ManufacturingandLogistics EUROPEAN JOURNAL OFOPERATIONAL RESEARCH www .elsevier .com/locate/ds w Causalmodelingalternativesinoperationsresearch : Overviewandapplication RonaldD .Anderson a,1, GyulaVastag b,* a KelleySchoolofBusiness,IndianaUniversity,801WestMichiganStreet,BS4053,Indianapolis,IN46202-5151,USA b KelleySchoolofBusiness,IndianaUniversity,801WestMichiganStreet,BS4027,Indianapolis,IN46202-5151,USA Received 1July 2002;accepted26November2002 1 .Introduction those in thephysicalsciences .Afundamentalap- pealofcausalmodelingistheabilitytocombine Interestincausalmodelingmethodologiesin cause-effectinformation,basedontheoretical thesocialsciencesstemsfromthedesiretoestab- construction,withstatisticaldatatoprovidea lishpatternsofregularitiesorlawsanalogousto quantitativeassessmentofrelationshipsamongthe studiedvariables .Thepurposesforemploying causalmodelinginthestudyofoperationsareto developanexplanationofrelationshipsandto `Correspondingauthor .Tel . :+1-317-278-3681 ;fax:+1-317- provideabasisforinference .Theportrayal,eval- 274-3312/278-3312. E-mailaddresses: r anders@iupui .ed u (R.D . Anderson), uationandsummarizationofassumedcausal re- gvastag@iupui.edu (G.Vastag) . lationshipsarethecomponentsofexplanation . ' Tel . :+1-317-274-2446 ;fax :+1-317-274-3312 . Theserelationshipsarethenusedtodevelop 0377-2217/$ - seefrontmatter®2003ElsevierB .V.Allrightsreserved . doi :10 .1016/50377-2217(02)009049

Transcript of SCIENCE EUROPEAN JOURNAL OF OPERATIONAL...

Page 1: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

ELSEVIER

- Abstract

This paper uses the relationships between three basic, fundamental and proven concepts in manufacturing (resourcecommitment to improvement programs, flexibility to changes in operations, and customer delivery performance) as theempirical context for reviewing and comparing two casual modeling approaches (structural equation modeling andBayesian networks). Specifically, investments in total quality management (TQM), process analysis, and employeeparticipation programs are considered as resource commitments . The paper begins with the central issue of the re-quirements for a model of associations to be considered causal . This philosophical issue is addressed in reference toprobabilistic causation theory. Then, each method is reviewed in the context of a unified causal modeling frameworkconsistent with probabilistic causation theory and applied to a common dataset . The comparisons include conceptrepresentation, distribution and functional assumptions, sample size and model complexity considerations, measure-ment issues, specification search, model adequacy, theory testing and inference capabilities . The paper concludes with asummary of relative advantages and disadvantages of the methods and highlights the findings relevant to the literatureon TQM and on-time deliveries .© 2003 Elsevier B .V. All rights reserved.

Keywords: Manufacturing; TQM; Delivery performance ; Causal modeling; Structural equation modeling; Bayesian networks

Available online at www.sciencedirect.comSCIENCE

DIRRCT®

European Journal of Operational Research 156 (2004) 92-109

Production, Manufacturing and Logistics

EUROPEANJOURNAL

OF OPERATIONALRESEARCH

www.elsevier .com/locate/ds w

Causal modeling alternatives in operations research :Overview and application

Ronald D. Anderson a,1, Gyula Vastag b,*a Kelley School of Business, Indiana University, 801 West Michigan Street, BS4053, Indianapolis, IN 46202-5151, USAb Kelley School of Business, Indiana University, 801 West Michigan Street, BS4027, Indianapolis, IN 46202-5151, USA

Received 1 July 2002; accepted 26 November 2002

1. Introduction

those in the physical sciences . A fundamental ap-peal of causal modeling is the ability to combine

Interest in causal modeling methodologies in cause-effect information, based on theoreticalthe social sciences stems from the desire to estab- construction, with statistical data to provide alish patterns of regularities or laws analogous to

quantitative assessment of relationships among thestudied variables. The purposes for employingcausal modeling in the study of operations are todevelop an explanation of relationships and to`Corresponding author . Tel . : +1-317-278-3681 ; fax: +1-317-

provide a basis for inference . The portrayal, eval-274-3312/278-3312.E-mail addresses: randers@iupui .ed u (R.D . Anderson), uation and summarization of assumed causal re-

[email protected] (G. Vastag) .

lationships are the components of explanation .' Tel . : +1-317-274-2446; fax : +1-317-274-3312 . These relationships are then used to develop

0377-2217/$ - see front matter ® 2003 Elsevier B .V. All rights reserved .doi :10 .1016/50377-2217(02)009049

Page 2: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

93

inferences for diagnostic reasoning from effects tocauses and for the prediction of outcomes thatwould follow from a policy or procedure inter-vention. Available modeling methods offer differ-ing functional advantages and limitations .However, any method should have potentialmanagerial usefulness by providing outputs withclear interpretation and the capability to assess theimpact of potential changes in the modeled pro-cess .

Ideally, a causal study would take the form of arandomized controlled experiment conducted overan appropriate time period . Such a research designwould minimize construct, internal, external, andstatistical threats to validity (Cook and Campbell,1979), and allow the possibility of causal conclu-sions to be reached . Unfortunately, randomizedcontrolled experiments can seldom, if ever, beutilized to provide causal knowledge for strategyand policy issues . Thus, causal modeling methodsfor non-experimental data are of interest .

Bayesian networks and structural equationmodels (SEM) are the causal modeling methodsfor non-experimental data reviewed and comparedin this paper. The paper begins with the centralissue of the requirements for a model of associa-tions to be considered causal . This philosophicalissue is addressed in reference to probabilisticcausation theory . Then, each method is reviewedin the context of a unified causal modelingframework consistent with probabilistic causationtheory, and applied to a common dataset. Thecomparisons include concept representation, dis-tribution and functional assumptions, sample sizeand model complexity, measurement, specificationsearch, model adequacy, theory testing and infere-nce capabilities . The paper concludes with a sum-mary of the relative advantages and disadvantagesof the methods .

2. Probabilistic causation theory

The area of causation has been extremely activeover the past twenty years with numerous inter-actions between the fields of philosophy, statisticsand computer science. This activity has spawnspirited controversy on a wide variety of concep-

tual and methodological issues (McKim andTurner, 1997). Causality, as a theoretical postu-late, has been the subject of highly contested dis-cussions since the reductive account offered byHume (1969). Hume characterized causation bythe regularity of constantly conjoined pairs ofevents (Effect = f (Cause)), under conditions oftemporal priority (a cause must precede an effect),and contiguity (a cause is temporally adjacent toan effect). However, Hume's account does notprovide for imperfect regularities nor does it havethe ability to distinguish between a genuine causalrelation and a spurious association . These weak-nesses motivated development of theories of cau-sation that cast causal relationships betweengeneral events in terms of stochastic descriptions(Supper, 1970) .

The key feature of probabilistic causation is aparadigm switch from the absolute determinationof an effect due to the occurrence of a cause to theoccurrence of a cause increasing the probability ofan effect . An assumption underlying this perspec-tive is that incomplete knowledge of causes resultsin uncertain cause-effect relationships . This con-ceptualization, labeled as pseudo-indeterminism(Spirtes et al., 1993), assumes that specified causesdo not alone determine an effect, but do so inconjunction with unspecified unobserved causes .Thus, pseudo-indeterminism assumes that sets ofindependent specified causes and unspecified cau-ses are the direct causes (-+) of an effect: specifiedcauses -+ effect E- unspecified causes .

Cause-effect relationships, under the assump-tion of pseudo-ihdeterminism, may be encodedinto a graphical structure known as a directedacyclic graph or simply a DAG . Each arrow in aDAG depicts causal dependence and the absenceof a connecting arrow indicates causal indepen-dence. The encoded structure is characterized asdirected, since two-headed arrows depicting non-causal association are not allowed and as acyclic,since feedback loops (e.g ., X -+ Y ---+ X) are notallowed .

The common cause principle states if two vari-ables in a population are associated and neither isa cause of the other, they must share a commoncause (Reichenbach, 1956) . The term association isused, in reference to probabilistic dependence

Page 3: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

94

R. D. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

p(X, Y) > p(X)p(Y), rather than the narrowerterm correlation that implies a measure of linear-ity. When an event C is a common cause of X andY, X i- C -+ Y, or is intermediate to X and Y,X - C -* Y, C is said to screen off values of Xfrom values of Y resulting in probabilistic inde-pendence p(X, YIC) = p(XIC)p(YIC) . Thus, acausal system with asymmetric relations is as-sumed to always exist in reference to populationassociations, but usually is not known with cer-tainty and must be inferred from sample data .

The common cause principle is the justificationfor the causal Markov condition that states everyeffect variable, conditional on its direct causes, isindependent of all variables that are not its effects(Spirtes et al ., 1993) . The causal Markov conditioncan also be expressed in probability statements : ifX does not cause Y, then p(X I Y & direct causes ofX) =p(XIdirect causes of X). The notion of anunderlying causal mechanism generating observa-tions implies the converse of the causal Markovcondition . That is, empirical regularities of con-ditional independence relations observed from apopulation are due to a causal structure not co-incidence (Scheines, 1997) . This assumption, la-beled as the stability condition (Pearl, 2000) or thefaithfulness condition (Spirtes et al ., 1993), statesprobabilistic independencies are a stable result ofcausal structure and not due to happenstance orspecific parameter values . Therefore, the jointpopulation probability distribution over a definedvariable set is assumed stable or faithful to theunderlying causal structure as specified in theDAG. Lastly, if the variable set includes all rele-vant common causes, it is said to be causally suf-ficient . Assertions portrayed by a DAG areassumed to be causal when combined with causalsufficiency, the causal Markov and faithfulnessconditions, and independence of specified andunspecified causes .

SEM and Bayesian networks are viewed ascausal models when the above conditions are sat-isfied. Druzdzel and Simon (1993) demonstrate aBayesian network can be represented by a simul-taneous equation model with hidden variables andaddress causal interpretation in the context of theunderlying principles of SEM . Pearl (1995) pro-vides a detailed exposition of causal diagrams in

empirical research and emphasizes inferences reston causal assumptions .

The above causation account is not universallyaccepted in the philosophy of science, even bythose that adopt a probabilistic viewpoint (Cart-wright, 1997) . There is an often-cited exception tothe common cause principle and the causal Mar-kov condition in the microscopic world of quan-tum mechanics (Hausman and Woodward, 1999) .At issue in this environment is the behavior ofcomplementary pairs of particles that are notprobabilistically dependent on one another or theeffects of a common cause. Yet under manipula-tion of one particle, the other instantly behaves inan identical manner. Various elements of thisparadox have been debated in physics and phi-losophy since publication of the Einstein-Podol-sky-Rosen thought experiment (Einstein et al .,1935). The implications of these debates are thatthe probabilistic behavior of some phenomena donot conform to the pseudo-indeterminism pers-pective, but is due to inherently stochastic prop-erties in Nature .

2.1. A causal model

A causal model may be expressed asM = {S, Os}, where S is the structure of the causalassertions of the variable set V portrayed by aDAG and Os is a set of parameters compatiblewith S. A DAG can always be translated into a setof recursive structural equations with independenterrors that satisfies the causal Markov condition(Kiiveri and Speed, 1982) . A set of recursivestructural equations that describe the data-gener-ating process of a DAG is specified byY = f(Parents(V), Ut )

( 1)

where Y is a consequence variable linked by afunction f to a configuration set of direct causes,Parents(V), and U;, an error term (Pearl, 2000) .Each V is represented by an individual equationthat corresponds to a distinct causal mechanismwhere the function f is invariant over a range ofvalues of Parents(V) and U; . The error terms areassumed to represent mutually independent un-observed variables, each with a probability distri-bution function p(U,). Eq. (1) reflects the view that

Page 4: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

Nature possesses stable causal mechanisms des-cribed byf that are deterministic functional rela-tionships between variables, while p( U,) reflectsincomplete causal knowledge associated withpseudo-indeterminism : Parents(Y) -+ Y F- U; .The specification of Eq . (1) can accommodate non-linear relationships, expressed in terms of the es-timated probability of an effect, as well as thetypical linear specification.

The parameters in O s assign a function f toeach Y and a probability measure p(U,) to each U,.A structure S is taken to portray a set of condi-tional independence assertions among the m vari-ables under study, which permits the factorizationof the associated joint probability distribution as

p(V,, V, . . ., Vm ) = p(V,IParents(V,))px (V2IParents(V2)) . . .px (VmlParents(Vm))

(2)

The factorability condition follows from thecausal Markov condition (Hausman and Wood-ward, 1999) .

3 . Structural equation models (SEM)

The true score model (Spearman, 1904), com-mon factor analysis (Thurstone, 1935), and sta-tistical factor analysis (Lawley and Maxwell, 1963)provide the foundation to represent the within-concept measurement model in SEM . This repre-sentation assumes a measured variable MV ; iscaused by two unrelated latent variables ; a speci-fied common cause representing concept j, LV1 ,and an error term, e,, for unspecified causes :LV1 --p MV; E- ei . The general measurementequation of an indicator, a specialization of Eq .(1), is given byMV; = 2;1LV1 + ei

(3)

where Ai1 is a path coefficient linking commoncause j to measure variable i and e, is the mea-surement error .

Path analysis (Wright, 1934) is generally ac-knowledged as the common parent of SEM(Joreskog, 1973) and Bayesian networks (Pearl,1988). Wright's method of path coefficients con-

95

lists of developing a graphical representation as-sumed to describe a causal process, thendecomposing the correlation coefficient for eachpair of variables (V,•, Y) into a sum of products ofpath coefficients and residual correlations . Theconcept equations are an application of pathanalysis that specializes Eq . (1) into the standardlinear equations :LV1 = EbjkLVk + u;

(4)where bbk is a path coefficient, uj is the disturbance,and the index k ranges over all parents of LV1 .Thus, the variable set for SEM contains a set ofmeasured variables and two sets of latent vari-ables, V = {MV, LV, U}, where U = {e;, of} .

The distribution assumption for SEM is a nor-mal probability density under maximum likelihoodestimation, which results in normal conditionalprobability densities for each L1; and a multivari-ate normal joint distribution for the measuredvariables. Estimation by generalized least-squaresor asymptotic distribution free methods are notconstrained by the normality assumption, but im-plementation problems has resulted in sparseusage. The set of matrix parameters consistentwith Eqs. (3) and (4) is Os = {A, B, Y',,, Oe }, whereA is a matrix of measurement coefficients, B is amatrix of concept path coefficients, 'I'„ is a co-variance matrix of concept disturbances, and Oe isa covariance matrix of measurement errors . Themeasured variable matrix equation is establishedby substituting Eq. (4) into Eq. (3),MV = A(I - B)u + e. The parameter matrices ofOs can be combined to form the covariance of themultivariate normal joint distribution of the mea-sured variables, E = E[MVMV'] = A(I - B)'I'„ (I -B')A' + Oe .

The objective of the parameter estimation,usually based on the maximum likelihood crite-rion, is to reproduce the sample covariance asclosely as possible with a covariance matrix im-plied by the structure of measurement and struc-tural assertions . The overall fit of the model isassessed by significance testing of the discrepancyfunction formed from the differences between theimplied covariance matrix and the sample covari-ance matrix, and by descriptive indexes. Also,unconstrained matrix elements of O s can be tested

Page 5: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

96 R. D. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

for significance. Complete details are provided inBollen (1989) .

4. Bayesian networks

An influence diagram provides a graphicalscheme for representing conditional dependenciesin a decision-making framework (Shachter, 1986) .Such diagrams model uncertain knowledge, deci-sions and utilities to assess the actions that willyield the highest expected utility . Bayesian net-works are a subset of influence diagrams that focuson the uncertain knowledge component. A discreteBayesian network (Pearl, 1988) is a specializationofM = {S, Os}, where the structure S implies a setof conditional probability distributions

Os = {p(V, IParents(V, ), 6 1 ),p

x (V2IParents(V2),02 ), . . .,px (V,„IParents(V,n),0,„)} .

Each variable has c t discrete values or states,and each Bt is assumed to be a collection of mul-tinomial distributions, one for each parent con-figuration. The probability assignment may besubjective or based on frequency ratios froma database or a combination of both . The associ-ated joint probability distribution of the networkvariables follows Eq . (2) as the product of theconditional probabilities in Os, p(vi, . . . , v,„) =Ilnp( lj l Parents(Vt ), 6 . ) . Conceptually, the frame-work for Bayesian networks is applicable to con-tinuous or discrete variables or both . However,applications have been concerned with discreteBayesian networks where the vast majority ofcomputational work has been focused .

The Bayesian network methodology has beenclosely associated with causation in philosophy(McKim and Turner, 1997) artificial intelligence(Pearl, 1988) and knowledge discovery (Spirteset al., 1993) . In addition, applications have beenmade in agriculture, computer imaging, computersoftware, education, information retrieval, medi-cine, space research and weather forecasting (Jensen, 1996) . Cooper (1999) and Heckerman (1999)provide technical reviews of Bayesian networks .

Two approaches have been employed to evalu-ate the independencies and dependencies in thegraphical structure S : an independence-testingmethod (Spirtes et al ., 1993) and a Bayesianscoring method (Cooper and Herskovits, 1992) .The independence-testing approach is consistentwith traditional statistical methods utilizing like-lihood ratio significance tests and maximum like-lihood estimation constrained to satisfy toconditional independencies implied by the data.Whereas, the Bayesian approach employs a scor-ing metric based on the probability of S in thecontext of conjugate analysis (Bernardo andSmith, 2000). Both approaches allow incorpora-tion of prior knowledge with the search algorithmsto specify S .

4.1. Independence-testing approach

The frequentist viewpoint, where a parameter01 = p(U,) is assumed as an unknown fixed quan-tity and the estimator of 6 is a random variable,underlies the independence-testing approach . Ingeneral, classical test statistics can be utilizedunder the assumption the data results from mul-tinomial sampling . The validity of the claim that Xcauses Y, with no screening off a common cause orintermediate variable, implies the populationprobability statement p(YIX) # p(Y) . This causalassertion can be evaluated by sample tests of in-dependence employing the maximum likelihoodchi-square test statistic, G2. Rejection of the nullhypothesis of independence ofX and Y,

Ho : p(YIX) =p(Y),provides evidence that p(YIX) 54 p(Y) in supportof the causal claim that X --+ Y . When an commoncause or intermediate screening variable is postu-lated as a claim that X and Y are independentconditional on C, i.e ., C screens off X from Y, asample test of conditional independence of X andY given C will provide evidence for support orrejection of the claim .

The approach is a two-step procedure imple-mented by the PC algorithm of TETRAD II(Scheines et al ., 1994), which first tests conditionalindependence relationships in a saturated structureor a structure constrained by prior knowledge . A

Page 6: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

97

sequential testing procedure is employed to elimi-nate dependency linkages at a specified significancelevel. Thus, a reject or retain decision of condi-tional independence is obtained for each testedlinkage. The rejected independence relationshipsare taken as dependency linkages and assembled toprovide the graphical structure. The output of thisassessment is a pattern that represents a set ofequivalent DAGs, rather than a unique set of di-rectional dependencies, that entail the same inde-pendence relations consistent with user-providedbackground knowledge . Search algorithms otherthan the PC algorithm are available for evaluatingcausal assertions (Spirtes et al ., 1993) .

In the second step, maximum likelihood esti-mates of the conditional probabilities O s areconstrained to conform to the structure of the firststep. When a database is used for estimation, theconditional probabilities are relative frequencyratios that are calculated as the joint frequency ofY and its parents divided by the frequency of theparents .

4.2. Bayesian scoring approach

In the alternative Bayesian approach, the esti-mator of Oi is assumed fixed, conditional on thedata D, and the parameter 0; is viewed as a randomvariable. The prior probability distribution of 0,,p(O i), describes what is known about 0 i withoutknowledge of the data. The posterior probabilitydistribution of 0,, p(Oi ID), describes what is knownabout 0. with knowledge of the data . The rela-tionship between the posterior distribution and theprior distribution is provided by Bayes' theorem(Bayes, 1763), p(0;ID) = p(D10i)p(0i)/p(D), wherep(D10 i) is the likelihood of the data given 0; .

The Bayesian approach assumes that the struc-ture S and the conditional probabilities in O s arestochastic variables with prior probability distri-butions provided from prior knowledge . Bayes'theorem provides the mechanism to revise theprior distribution of a model M, given the data D,to the posterior distribution, p(MID) = p(M, D)/p(D) =p(M)p(D1M)/p(D) . The p(D) doesnot change over the values of M, and can beviewed as a normalizing constant needed to scalep(M)p(DIM) to sum to unity over all outcomes of

M. Thus, the posterior distribution of M isproportional to the product of the prior distribu-tion ofM and the likelihood of the data D givenM, i.e ., Posterior Distribution cc Prior Distribu-tion x Likelihood of the Data or p(MID) ocp(M)p(DI M) .

Prior knowledge of Os is usually very minimalor non-existent and the estimation of the condi-tional probability distributions relies primarily onavailable data. The Bayesian approach uses theproportionality relationship, Posterior Distribu-tion oc Prior Distribution x Likelihood of theData, to estimate the maximum posterior proba-bility for variable i in state k for parent configu-ration j. The estimation task is greatly simplifiedwhen the prior and posterior distributions have thesame functional form, which is termed as beingconjugate. The family of Dirichlet distributions isconjugate for multinomial sampling and providesthe basis for prior-to-posterior analysis in manycomputational programs (Cowell et al ., 1999, ap-pendix A). A Dirichlet prior distribution of 0iwhen combined with the data frequency countsyields a Dirichlet posterior distribution (Ramoniand Sebastiani, 1999) .

When the situation of initial ignorance exists,the prior probability of 0i is generally taken as auniform distribution (Geiger and Herkerman,1997). Parameter specification of the prior distri-bution provides a summary of prior experience,and is specified by a quantity called the prior dis-tribution precision . The value of the precisionnecessary for uniform priors is judgmental, andconceptually considered the number of cases rep-resenting prior experience, often termed as theequivalent sample size (Winkler, 1967) . An equiv-alent sample size of I indicates the lowest level ofconfidence in prior estimation .

4.3. Combining independence-testing and Bayesianscoring methods

Although the philosophical positions underly-ing the independence-testing and Bayesian scoringmethods are very different, there has been an effortto combine the two approaches in developingBayesian networks . The PC algorithm, with largesamples, will recover all causal relationship from

Page 7: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

98 RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

observational data assuming the causal Markovand faithfulness conditions, causal sufficiency, in-dependent sources of variation, and valid statisti-cal testing (Cooper, 1999) . Scheines (1999)adopted a hybrid approach of first using inde-pendence-testing for construction of causal modelstructure S . Then he employed the Bayesian scor-ing method, using the multinomial-Dirichlet dis-tribution to estimate Os . This hybrid approach isadopted in the following application .

5 . A manufacturing application: Concepts, objec-tives and data

A study of the relationships between threeconcepts (resource commitment to improvementprograms, flexibility to changes in operations andcustomer delivery performance) provides the em-pirical context for reviewing and comparing theBayesian network and SEM methodologies . Thecomparison and tests of the two approaches useddata from the second round survey of the GlobalManufacturing Research Group (GMRG) . Why-bark and Vastag (1993) have the questionnaireused in the survey and it provides details of thisglobal data gathering effort primarily focusing ontwo industries: small machine tools and non-fashion textile manufacturing. A brief overview ofthis project is provided in Appendix A .

The variables associated with resource com-mitment are programs in which the respondingcompany had invested money, time and humanresources in the past two years . Three specificprograms were measured on five-point scales(1 = none to 5 =a large amount) : Total qualitymanagement (labeled TQM), process analysis (la-beled Analysis) and employee participation (la-beled Involve). The concept flexibility to changewas measured by flexibility to change product(labeled Product) and flexibility to change outputvolume (labeled Volume) . Customer delivery per-formance was measured by delivery speed (labeledSpeed) and delivery as promised (labeled On-Time). The flexibility and performance variableswere measured on five-point scales in reference tocompetitors (1 = far worse than competitors to3 = about the same as competitors to 5 = far

better than competitors) . However, scale responses1 and 2 were combined due to extreme sparsenessin the lowest scale category .

The general goals of causal modeling, explana-tion and inference, are translated into three ob-jectives specific to the application . The firstobjective is to provide an adequate explanation ofhow resource allocation for performance im-provement influences flexibility and delivery per-formances . The concepts are assumed to have thefollowing temporal ordering: resource commit-ment precedes flexibility to change and deliveryperformance, and flexibility to change precedesdelivery performance. Potential explanations, atthe concept level, include a common effect modelwhere resource commitment is a direct cause offlexibility to change and delivery performance, andflexibility to change is a direct cause of deliveryperformance. An alternative is a mediation modelwhere flexibility to change is a mediating variablebetween resource commitment and delivery per-formance .

The second objective is to utilize the selectedmodel to develop predictions of observable per-formance outcomes given assumed resource allo-cation levels in the TQM program . The finalobjective is to provide diagnostics from the se-lected model for assessment of relative changesin variables when an intervention manipulatesthe state of the On-Time variable to certainty . Thelast two objectives require a modeling methodto have the capability to translate the explana-tory content at the conceptual level to inferentialcontent at the empirical level . The methods ofSEM and Bayesian networks are applied below todata .

5 .1. Structural equation model

The concepts of resource commitment, flexibil-ity to change and delivery performance are repre-sented by the latent variables labeled asPROGRAMS, FLEXIBLE and DELIVERY. Theset of model variables for the SEM application isdefined by V = {PROGRAMS, FLEXIBLE, DE-LIVERY, TQM, Analysis, Involve, Product, Vol-ume, Speed, On-time} and U={uj,u2,u3,e,,e2,e3, e4, e5, e6, e7} . The hypothesized within-concept

Page 8: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

99

structures are the basis for the following mea-surement equations :

TQM = .,PROGRAMS + e l ,Analysis = A2PROGRAMS + e2,Involve =1 3PROGRAMS + e3,Product = .i4FLEXIBLE + e4 ,Volume =1SFLEXIBLE + e5,Speed = 26DELIVERY + e6,On-time = 27DELIVERY + e7-

5.1 .1. Distribution assumptionsThe analysis began by testing the assump-

tion that the joint probability distribution of themeasured variables is multivariate normal . Exces-sive kurtosis is known to yield a maximum likeli-hood test statistic that is a poor approximationof a chi-square variate (Babakus et al ., 1987 ;Johnson and Creech, 1983) . If multivariate kur-tosis is extreme, the Satorra-Bentler scaled chi-square can be used to adjust the inflated chi-squaregoodness-of-fit statistic, with robust standarderror estimates for model parameters (Bentler,1995). However, the Mardia standardized coeffi-cient of multivariate kurtosis (Mardia, 1970) in-dicated that deviation from multivariate normalitywas minor (Z = 3 .29) .

5.1.2. Unidimensionality, convergent validity anddiscriminant validity

Confirmatory factor analysis was applied toassess the measurement properties of unidimensi-onality, convergent validity and discriminant va-lidity . Unidimensionality, which assumes each setof indicators reflect a single concept, was evaluatedby the goodness-of-fit of the congeneric factormodel (Joreskog, 1971) and was supported(x2 = 13.45, df = 11, p = 0 .27) . The congenericfactor model was also employed to evaluate con-vergent validity, the degree of agreement amongindicators used to measure the same concept . Theminimum requirement for convergent validity isevidence of significant link coefficients betweeneach indicator and the common factor (Pedhazurand Schmelkin, 1991). The ratio of the coefficientestimate to the estimated standard error, which is

approximately distributed as a normal variate,provides the test statistic for the null hypothesisthe coefficient is equal to zero . Each A-coefficient inthe congeneric model was at least fifteen standarderrors from the hypothesized value of zero, pro-viding evidence of convergent validity .

Discriminant validity, the degree to which alatent variable and its indicators differ from otherlatent variables and their indicators, was assessedby fixing the correlation between each pair ofconcepts to 1 .0, then estimating the constrainedmodel, and comparing the resulting value of chi-square with the unconstrained model chi-square(Pedhazur and Schmelkin, 1991) . The null hy-pothesis of a unit (perfect) correlation between twoconcepts was rejected each of the pairs of latentvariables, at the 0.05 significance level, thus pro-viding evidence of discriminant validity .

5.1.3. Tau-equivalent measures and reliabilitySince the congeneric model was acceptable, tau-

equivalent confirmatory factor analyses wasapplied to determine if the within-concept 2-coef-ficients are equal (Joreskog and Sorbom, 1993) . Thetau-equivalent model was found to have an ac-ceptable fit (x2 = 20.82, df = 15, p = 0.14). Fur-ther, in comparison with the congeneric model, thecomposite hypothesis that (A, =12 = 23, 24 = 2s,26 = 2 7 ) was supported (0x2 = 7.37, idf = 4,p = 0.12). Thus, each LVI was assumed to con-tribute equally to each measured variable withineach concept and the three models of concept re-lationships were constrained to tau-equivalentmeasurement.

Tau-equivalent or parallel measures are re-quired for appropriate assessment of internalconsistency by Cronbach's alpha coefficient of re-liability (Bollen, 1989, p . 217) . The construct reli-abilities for PROGRAMS, FLEXIBLE andDELIVERY are 0.75, 0.71, and 0.81, respectively,and are in the acceptable range .

5.1.4. Structural modelsThe common effect model and the mediation

model are potential explanations of the relation-ships between the assumed latent variables . Thesestructures are specified as

Page 9: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

100

Common Effect Model :

PROGRAMS = u l

FLEXIBLE = b iPROGRAMS + u2

DELIVERY = b 2PROGRAMS

+ b3FLEXIBLE + U3 ;

Mediation Model :

PROGRAMS = u,

FLEXIBLE = b,PROGRAMS + u2

DELIVERY = b 3 FLEXIBLE + u3 .

The common effect model assumes that FLEX-IBLE and DELIVERY are the common effects ofPROGRAMS. The model had an adequate statis-tical fit (X2 = 20.82, df = 15,p = 0.14) and excel-lent descriptive fit indexes (CFI = 0 .99, AGFI =0.98, RMSEA = 0 .03) . The mediation model alsoshowed an acceptable statistical fit (X 2 = 21 .36,df = 16, p = 0 .16) and descriptive fit (CFI =0.99, AGFI = 0.98, RMSEA = 0 .02). The differ-ence chi-square comparing the common effectmodel to the mediated model and evaluating thehypothesis b2 = 0 was not significant (X2 = 0.54,df = l, p = 0 .46) . Each concept equation coeffi-cient for the mediation model was significant (b, =0.31, s .e . = 0 .04 ; b3 = 0.63, s .e . = 0 .05). Thus, themore complex common effect model was rejectedin favor of the parsimonious mediation model,PROGRAMS FLEXIBLE DELIVERY.

The standardized parameters for the mediationmodel are displayed in Fig . 1 .

R. D . Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

Fig. 1 . SEM mediation model: standardized parameter esti-mates.

Each set of standardized measurement coeffi-cients show the relative influence of a conceptvariable and an error variable on a measuredvariable. The square of a standardized coefficientshows the proportion of observed variance at-tributed to specified causes, the LV variable, andunspecified causes, the error term. For example,PROGRAMS contributes 49% (0.702 ) of the unitvariance of TQM and the unspecified causes at-tribute 51% (0.71 2 ) . The standardized concept co-efficients similarly show the relative influence of aspecified concept variable cause and unspecifiedcauses, the u-variables. Thus, PROGRAMS vari-ance is entirely due to unspecified causes, 15%(0.38 2) of the FLEXIBLE variance is attributed toPROGRAMS and 85% (0 .922 ) to unspecifiedcauses, and 36% (0.60 2) of the DELIVERY vari-ables is accounted for by FLEXIBLE, with 64%(0.802 ) assigned to unspecified -causes . The pathmodel interpretation of Fig . 1 shows a one stan-dard deviation change in FLEXIBLE is expectedto result in a 0.60 standard deviation change inDELIVERY. A one standard deviation change inPROGRAMS is expected to result in a 0 .38 stan-dard deviation change in FLEXIBLE, and a 0 .23(= 0.38 x 0 .60) standard deviation change in DE-LIVERY.

The prediction of the impacts of changes inTQM or On-Time on other observed variables isnot possible unless the latent variables can bemeasured. However, measurement of latent vari-ables is indeterminate (Acito and Anderson, 1986) .Thus, prediction and diagnostics of measuredvariables is not a capability of SEM .

5.2. A Bayesian network

The set of model variables for the Bayesiannetwork application is defined by V = {TQM,Analysis, Involve, Product, Volume, Speed,On-Time} and U = {u,, u2 i u3, u4, u5, u6, u'7} . Ahybrid combination of the independence testingand Bayesian approaches, as discussed above, wasadopted for developing the network structure andestimating the conditional probabilities. Indepen-dence testing was employed to build the structureand a Dirichlet distribution conjugate analysis wasused for probability estimation .

Page 10: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag /European Journal of Operational Research 156 (2004) 92-109

5.2.1. Building the network structureThe process of constructing the graphical

structure started with a within-concept temporalordering. The TQM is the most general improve-ment program, and is assumed to be a commoncause of process analysis and employee involve-ment programs (Analysis 4- TQM --> Involve).Further, a process analysis program is assumed tobe a cause of employee involvement programs(Analysis --* Involve) . In terms of flexibility,product flexibility is assumed to be a direct causeof volume flexibility (Product -4 Volume). Fordelivery, speed of delivery is viewed as a directcause of on-time delivery (Speed -+ On-Time) .

The PC algorithm, which permits the incorpo-ration of variable order specification and restric-tion of the parent relationships, was used forconformation of the within-concept structure andto search for between-concept relationships . Thewithin-concept structure was confirmed by rejec-tion of the hypothesis of independence for eachrelationship (p < 0 .001). Additionally, four be-tween-concept relationships were deemed to besignificant. Independence between TQM andproduct flexibility was rejected (p < 0.001), andindependence between employee involvementprograms and product flexibility was rejected(p < 0.005). These results imply dependencies ofTQM -+ Product and Involve --+ Product. Fur-ther, the independence testing results implied thedependencies of Product --* On-Time andVolume -+ Speed (p < 0.001) .

The complete structure of the Bayesian networkis shown in Fig. 2 .

Fig . 2 . Bayesian network structure .

101

The structure implies the following equations :

TQM = ft(ui),Analysis = f2(TQM, u2),

Involve= f3(TQM, Analysis, u3),

Product= f4 (TQM, Involve, u4),

Volume = f5(Product, u 5 ),

Speed = f6 (Volume, U6),

On-Time = f7(Product, Speed, U7)-

5.2.2. Conditional probability estimationThe Bayesian approach was used to estimate

the conditional probabilities for the network . Eachp( U,) = 0, was assumed to be a collection of mul-tinomial distributions, one for each parent con-figuration . The precision of the prior estimationwas set at an equivalent sample size of 1 . TheKnowledge Discoverer program (Sebastiani andRamoni, 2000) was used to estimate the posteriorprobabilities for the sets of parents of each con-sequence variable .

5.2.3. Probabilistic inferenceProbabilistic inference is concerned with revis-

ing probabilities for a variable or set of variables,called the query, when an intervention fixes thevalues of another variable or set of variables,called the evidence . Any of the variables in theBayesian network can serve as a query or as evi-dence, thus allowing forward inference from cau-ses to effects (prediction) or backward inferencefrom effects to causes (diagnostics) . The simplesttype of intervention is one where a single variableis forced to take on a fixed value . The interventionreplaces the functional mechanism for the evidencevariable, Ei = f(pail , u i ) with a fixed value, E, = ei ,assumed known with certainty, in all equations .This creates a new model by removal of the net-work arrows from the parent set of the instanti-ated variable, which represents the system'sbehavior under the intervention . The relativemagnitude of an influence from an interventionmay be measured as the percent change from pre-intervention probabilities to post-interventionprobabilities of the query for evidence in differentstates .

Page 11: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

102

R D. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

1

Speed6

5

4

3

2

I

On Time

I

I

L

'

0 ' 0-45

-15

15

45-45

-15

15

45-45

-15

15

45 -45

-15

15

45 -45

-15

15

45-45

-15

15

45Percentage from Base Percent Change from Base Percent Change from Base Percent Change from Bane Patent Change from Base Percent Change from Base

Fig. 3. Summary of changes due to TQM interventions .

t

i 6

s

4

3

2

I

Volume

5.2.4. PredictionThe objective of predicting the impact of po-

tential changes in TQM allocation levels on flexi-bility and delivery variables was accomplished byTQM interventions as the evidence . The magni-tude of the impacts of the interventions wasgreatest on the direct effects of Involve, Analysis,and Product, as would be expected from theBayesian network. The indirect influences onVolume, Speed and On-Time were relatively small .

The distributions resulting from the TQM in-terventions can be summarized by expected valuesand compared to the expected values of pre-inter-vention distributions . Fig. 3 provides a summary ofexpected changes given TQM interventions .

Fig. 3 follows the basic principles of Trellisdisplays (Cleveland, 1993) . The variables from theleft to the right show an increasing impact of TQMintervention . The level of TQM intervention ran-ges from 1 = None (at the bottom of the graphs)to 5 = Large Amount (at the top) . Over this rangeof interventions, the customer delivery perfor-mance measures (Speed and On-Time) show theleast variation . Investments in employee partici-pation and process analysis (Involve and Analysis)show the greatest variation . The pre-interventionexpected value of Involve was 2 .72 (where 1 = noallocation and 5 = large allocation) . The expectedvalue of Involve with the TQM intervention set tono allocation was 1 .64 or an expected decrease of39.7%, as displayed in Fig . 3. In a similar fashion,the TQM intervention at the largest allocation

Product6

s

4

2

1

AnalyisI I

I

Me-

I

6

5

4

3

2

1

Involvei

level showed an expected increase in Involve of34.2%. Each expected percent change in Fig . 3 hasa similar interpretation .

These results are consistent with our currentunderstanding of TQM programs and systems .There is a saying attributed to Kaoru Ishikawathat summarizes the people aspect of this programsuccinctly: "TQM starts with education and endswith education." Furthermore, process improve-ments and continuous improvement programshave always been essential parts of quality mana-gement philosophies .

The expected changes in the delivery variableswere small. The range of expected changes inSpeed of delivery was from -3 .4% for no TQMallocations to 1 .5% for the highest TQM allocationlevel. The On-Time delivery variable also dis-played a narrow range of expected percent changesover the TQM interventions . Similar small impactswould be obtained from interventions on Analysisand Involve since the influences are mediated bythe flexibility variables .

i

I

5.2.5. DiagnosticsThe application of Bayes' theorem allows

"backward" probabilistic inference with interven-tions of On-Time delivery . The expected percentchanges in the causes of On-Time delivery result-ing from On-Time interventions are summarized inFig. 4. As with Fig . 3, the expected changes arebased on comparing pre-intervention and post-intervention expected values .

Page 12: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

SAnalysis

RD. Anderson, G. Vastag I European Journal of Operational Research 156 (2004) 92-109

Involve TQM Product

The expected changes in Fig . 4 may be viewedas measures of diagnostic importance for On-Timedelivery. The highest level of Speed of delivery isthe most important cause of improvement in On-Time delivery. The indirect cause of Volume flexi-bility is second in importance, followed by thedirect cause of Product flexibility. Expected chan-ges in the allocation levels of the improvementprograms were minor .

These findings are consistent with the literatureon JIT systems, including papers on the linkagesbetween quick set-ups (and thus increased manu-facturing flexibility), manufacturing lead-time re-duction, and delivery speed improvements(Schmenner, 1988; Vastag and Whybark, 1993 ;Vastag and Montabon, 2001) .

5.3. Specific comparisons

Both the Bayesian network and SEM, based onthe results of traditional significance testing, sup-ported the model of flexibility of change mediatingresource commitment programs and delivery per-formance. However, SEM provided the moreparsimonious explanation of the mediation modelat the concept level. SEM relationships, describedin the dimensionless regression metric of stan-dardized values, are appropriate since the latentvariables are not measured, and thus have noempirical metric . Although the latent variables arenot measured, a great deal of emphasis is placedon the measurement sub-model relating observed

S

4

3

2

1

Fig . 4 . Summary of changes due to On-Time intervention .

VolumeI 1

I

it1

0I

1

i

r

i n

I

I 0

'

r

'

045

-15

15

45 45

-15

15

45 45

-15

15

45-45

-15

13

45 45

-15

15

45 -45

-15

1s

45Pasoerd Change from Base Percent Change hum Bae

PacentChmgefromBase Percent Change from Base

Parent C angefumBaee Percent Chmp from But:

S

4

3

2

1

Speed

I

I

onI

isI

103

variables to the hypothetical latent variables . Thestrength of measurement by tau-equivalent evalu-ations and alpha reliability coefficients of internalconsistency play a central role in the SEM analy-sis. However, the measurement evaluations andthe fit of relationships between latent variablesprovide only an abstract causal description, sinceprediction and diagnostics are not possible . Thus,SEM could only achieve the explanation objectiveof the application .

The Bayesian network approach, using onlymeasured variables, allows interventions on theTQM and On-Time network variables that resultsin predictions and diagnostics . The distributionchanges from pre-intervention to post-interventionwere summarized to expected values and then des-cribed by expected percent changes, as shown inFigs. 3 and 4. Probabilistic Inference could also beconducted with interventions on other networkvariables.

In summary, SEM provides a parsimoniousdescription of observed and hypothetical variablesrich in support by psychometric indexes . Whereas,the Bayesian network provides predictions de-scribed in terms of probabilities and percents . Ifthe objective were only a description of theoreticalconstructs with no interest in current or futureinference to observable variables, then SEM islikely be the preferred method . If objectives in-cluded prediction and diagnostics of observedvariables, then the Bayesian network approachshould be selected .

4-III

''I I

3- ~~

- 3- ~a

- 3-

o

-

2- I -1 I

1- - 1- -

1 1 I

Page 13: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

6. General comparisons

Both SEM and Bayesian networks are specifiedby Eq . (1), and each coveys the causal assertions ofa model M = {S, Os} using a DAG to portray thestructure of assumed functional relationships .However, fundamental differences exist betweenmethods for the structure S and the set of param-eters Os compatible with S . Concept representa-tion, distribution and functional assumptions,sample size, model complexity, measurement,specification search, model adequacy, theory test-ing and inference capabilities are the characteristicsdiscussed below for each of reviewed methods .Table 1 provides a generalized summary of thesecomparisons .

6.1. Concept representation

Bayesian networks are often labeled as Baye-sian belief networks . However, SEM also portraysa belief structure based on a conceptualization ofperceived reality derived from existing theory,empirical evidence and speculation . The primarybelief in SEM is an observable variable MV t iscaused by two unrelated unobservable hypotheti-cal variables .

The level of abstraction entailed by the modelstructure varies across the methods in relation tothe form of within-concept representation . Con-

cept representation in SEM is highly abstract andparsimonious; with the simple methodologicallyform of linear regression . SEM portrays covaria-tions between measured variables as the result of acommon cause, LVi , which results in the mea-surement model of e t -* MV t f- LV;. Finally,SEM greatly simplifies the DAG since only theLV; variables require temporal ordering .

6.2. Function, distribution and measurement as-sumptions

The functional form for discrete Bayesian net-works is non-parametric and non-linear. Dirichletdistributions are assumed to result from multino-mial sampling, and a subjective estimate of preci-sion is required for the prior distributions . SEMmethods are parametric in function and distribu-tion, assuming normality and linearity . Althoughmultivariate normality is usually assumed in SEM,the assumption is often not tested (Breckler, 1990)nor supported in applications (Micceri, 1989) .

Continuous measurement is generally assumedfor SEM due to the normality assumption . How-ever, SEM offers considerable flexibility in han-dling non-normal continuous variables (Westet al., 1995), ordinal data (Coenders et al ., 1997)and categorical data (Muthen and Muthen, 1998) .Bayesian networks currently require categoricalmeasurement . Scale compression, such as imple-

104

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

Table ISummary of model comparisons

Comparison SEM Bayesian network

Concept representationParsimony High LowMeasurement assumption Continuous DiscreteRelationship assumption Common cause and causally sufficient Causally sufficient

Operating characteristicsFunctional form Linear Non-linearDistribution assumption Normal MultinomialConcept network structure Parsimonious ComplexProcess modeling capability Limited Large

Assessment of adequacyGoodness-of-fit Global statistical tests Tests of independence

Model and measurement fit indexesSignificance tests for measurement andconcept coefficients

Inference Latent variable prediction Observed variable prediction

Page 14: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

mented in the above application, will result insome loss of information in a Bayesian network .The tradeoff is parsimonious intuitive measure-ment, which is consistent with an emphasis on theanalysis of actions, versus full measurement utili-zation of SEM. If the tenuous assumption ofnormality for quasi-continuous scales holds inSEM, statistical testing will have greater powerthan significance testing in a discrete Bayesiannetwork. However, the magnitude of power dif-ferences is not known .

6.3. Sample size and model complexity

There is some general agreement that the mini-mum sample size for SEM should at least 100(Guadognoli and Velicer, 1988) . Minimum samplesize requirement for a Bayesian network is an openissue, but we speculate that 100 cases would be areasonable minimum requirement. However, theissue of minimum sample size for a Bayesian net-work would seem to be highly related to thenumber of states in the categorical variables andthe maximum number of parents of the variablesin the modeled system .

In general, SEM is best suited for modelingrather small processes, as Bentler and Chou (1987)have recommended an analysis be limited to 20 orfewer measured variables. This recommendation isnot too restrictive when item parcels employingsingle indicator SEM are used to represent con-cepts (Little et al ., 2002). Although SEM assumesa linear process, SEM can handle latent variableinteractions and quadratic effects (Ping, 1996) .However, branching patterns in modeled processthat create missing values require separate groupsbe formed for analysis. Bayesian networks havethe capability to be applied to very large processes,with potentially thousands of variables (Spiegel-halter et al ., 1993) . Further, Bayesian network canaccommodate a non-linear process, resulting frombranching patterns .

6.4. Specification search

Specification search in causal models is a pri-mary area of algorithm research, as well as asource of philosophical controversy (McKim and

105

Turner, 1997). The controversy centers on ac-ceptable ways of knowledge building and thepossibility of inferring causation from association(Glymour and Cooper, 1999, part 3) . The magni-tude of a search effort for a DAG without tem-poral order specification is enormous . Forexample, given the objective to identify the "true"model of relationships between ten unorderedvariables, 4, 175, 098, 976, 430, 596, 143 DAGsshould be searched and evaluated in reference toan appropriate fit criterion. When the backgroundknowledge gives complete variable order for aBayesian network, casual direction will availablein the pattern . However in applications, estab-lishing variable order is often an issue . Geneticalgorithms based on permutation assessmentshave been developed to assist in with this problem(Larranaga et al ., 1996) .

SEM methods also generally employ modelspecification searches in applications . Joreskogand Sorbom (1993) describe three approaches forSEM construction and development : strictly con-firmatory, model generation and model compari-son. MacCallum (1995) observes that the strictlyconfirmatory approach in SEM is very rare inapplications, with model modification being verycommon. Model revisions in 'SEM are usuallybased on a local modification index, such as theLagrange multiplier test, to assess the improve-ment in fit if a selected subset of fixed parameterswere converted into free parameters . Model com-parison posits a small number of feasible repre-sentations and selects the most acceptable model,based on evidence from nested significance testingor fit indexes . The SEM application in this paperprovides an example of specification search bymodel comparison .

6.5. Model adequacy

Bayesian networks employ significance tests ofconditional independence for a statistical assess-ment of structure adequacy. A maximum scoringapproach is also available, where log likelihoodscores for various parent sets are computed foreach measured variable . SEM uses a global sta-tistical test of goodness-fit, significance tests ofmeasurement and concept coefficients, and

Page 15: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

106

descriptive model and equation indexes to supportthe adequacy of a given model .

6.6. Theory testing

The qualitative tasks of concept definition,concept representation, temporal ordering andspecification of causal relationships provide themajor explanations of an investigated theorywhether the concepts are portrayed by latentvariables or data-dependent composites or directmeasured variables. The explanation contributionsof a quantitative method, regardless whether aregression or probability metric is employed, arelimited to assessment of the model's adequacy re-flected by significance testing, descriptive fit in-dexes, and estimation of the magnitude of causalinfluences .

Neither a Bayesian network, nor a SEM offers aunique model of reality; rather a class of observ-ationally equivalent models is portrayed thatcannot be distinguished by statistical analysis(Verma and Pearl, 1990 ; MacCallum et al ., 1993).Further, Cooper (1999) points out that the inter-pretation of a Bayesian network must be maderelative to the categorical variables represented bythe nodes in the DAG . The existence of equivalentmodels is similar to confounding in experimentaldesign, where different parameterizations cannotbe distinguished in reference to fit to the data(Stelzl, 1986) .

6.7. Inference

Science typically views theory validation ascoming from predictive verification of expectedtheoretical results based on empirical evidence . Acausal model should provide an explanatory des-cription of causal relationships, plus manipulationcapabilities for diagnosing the key changes neces-sary for system improvement and for predictingthe impacts of potential change actions. Since eachequation, implied by a structure S, represents adistinct causal mechanism, it is conceptually pos-sible to set any represented variable to a specificvalue, i .e ., impose an intervention .

Bayesian networks permit both forward andbackward inference, allowing diagnostics and

R D. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

prediction based on any set of selected measuredvariables as illustrated in the above application . Asnoted above, SEM has prediction capabilities onlyat the hypothetical variable level . Even when allaspects of a SEM representation are completelysupported, prediction of observable consequencesfrom potential managerial action is not attainable .Simply, the model assumes that every observablevariable is a function of two unobservable vari-ables, a common cause and an unspecified causethat cannot be subject to managerial interventions .The inability to translate knowledge gained fromthe theoretical model evaluation into a basis formanagerial actions is the major weakness SEM .

7. Summary

The focus of this paper is on methods that viewcausation from a graphical viewpoint . The notionof a DAG is utilized to represent the conditionalindependence between any two variables, whichimplies the absence of a direct causal relationship .Further, assuming the nodes of a DAG representrandom variables, the joint probability distribu-tion of these variables can be factored in a causalmeaningful manner. These general results aresummarized in the specification of a causal modelas a structure S conforming to a DAG and a set ofassociated parameters O s consistent with thestructure. SEM and Bayesian networks are por-trayed as specializations of the general causalmodel specification, M = {S, Os } .

The emphasis of causal modeling applied inoperations management has been primarily con-centrated on providing increased process under-standing by emphasizing psychometric andstatistical support for theory-based explanations .SEM have been the most frequently used methodfor quantifying and evaluating an assumed causalprocess. The primary objective, under this ap-proach, is to assess whether a postulated theoret-ical network is a reasonable approximation of theprocess that generated the study data . Indexessupporting construct validity, measurement reli-ability, parameter significance, model fit, andcausal effects tend to dominate the reported re-sults. Since the analysis is usually based on a high

Page 16: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

level of theoretical and knowledge domain sup-port, the confirmatory findings typically supportthe large majority of hypothesized relationships .Thus, conclusions tend to provide value by incre-mental extension of existing conceptualizationsthat cannot be extended to prediction of observedoutcomes .

Bayesian networks, in contrast to SEM, assumethe main role of causal modeling is to facilitate theanalysis of potential and actual actions, ratherthan focus on theory confirmation . Indeed, Baye-sian networks offer the capabilities to explain sys-tem relationships and to predict the impacts ofpotential actions as alternative structures that canbe evaluated by traditional tests of significance orby posterior probabilities or both, as demon-strated in the above application . The probabilitymetric provides non-linear detailed relationshipinformation that should be easily consumable bythe managers as well as academics . The modelingeffort is concerned with observable variables, nothypothetical concepts . Thus, it is possible to in-troduce a conceptual intervention and evaluate theexpected observable changes . More specifically,the posterior probabilities resulting from the in-tervention can be compared to the pre-interven-tion probabilities to provide a quantitativemeasure of expected change .

Appendix A. Description of the global manufactur-ing research group survey

In the late 1980s and in the mid 1990s, theGlobal Manufacturing Research Group carriedout two worldwide surveys focusing on smallmachine tools and no-fashion textile manufactur-ing. Data were collected in about 30 countriesrepresenting market, transitional and plannedeconomies .

In each country, directories of trade associationmembers were used to select a random sample offirms. A manufacturing executive from each of theselected companies was contacted by telephone .The executive was presented evidence of the tradeassociation support and was invited to participatein the study . As an incentive, firms were told that ifthey participated in the survey, the average res-

ponses for firms in their industry would be pro-vided to them. Follow-up telephone calls weremade to solicit responses and remind the partici-pants to complete and return the questionnaires . Ifnecessary, a second round of mailing was done toincrease the sample size. In this paper, we selecteda subset of the GMRG database, countries withlarge number of responses .

In the subset of data we used in this paper, wehad 588 respondents from the second survey . Therespondents were from Canada (88), Japan (77),Mexico (93), Russia (92) and the United States(238). Industry representation was classified asmachine tool (209), textile (207) and others (172) .

References

Acito, F., Anderson, R.D., 1986. Factor score indeterminacy :Correlates and implications for marketing research . Journalof Marketing Research 20 (2), 111-118 .

Babakus, E., Ferguson, C.E., Joreskog, K.G., 1987 . Thesensitivity of confirmatory maximum likelihood factoranalysis to violations of measurement scale and distributionassumptions. Journal of Marketing Research 37, 72-141 .

Bayes, T ., 1763 . An essay toward solving a problem in thedoctrine of chance. Philosophical Transactions of the RoyalSociety 53, 370-418.

Bentler, P.M ., 1995. EQS Structural Equations ProgramManual. Multivariate Software, Encino, CA .

Bentler, P.M ., Chou, C.P., 1987. Practical issues in structuralmodeling. Sociological Methods and Research 16, 78-117 .

Bernardo, J.M ., Smith, A.F.M ., 2000. Bayesian Theory . Wiley,New York .

Bollen, K.A ., 1989 . Structural Equations with Latent Variables .Wiley, New York .

Breckler, S .J., 1990. Application of covariance structure mod-eling in psychology: Cause for concern? PsychologicalBulletin 107, 260-273 .

Cartwright, N ., 1997. What is a causal structure? In : McKim,V.R ., Turner, P.T . (Eds.), Causality in Crisis?. University ofNotre Dame Press, Notre Dame, IN, pp. 343-357 .

Cleveland, W .S ., 1993. Visualizing Data. Hobart Press, Sum-mit, NJ .

Coenders, G ., Satorra, A ., Saris, W.E., 1997. Alternativeapproaches to structural modeling of ordinal data . Struc-tural Equation Modeling 4 (4), 261-282 .

Cook, T.D., Campbell, D.T ., 1979. Quasi-Experimentation :Design and Analysis Issues for Field Settings . RandMcNally, Chicago .

Cooper, G.F ., 1999. An overview of the representation anddiscovery of causal relationships using bayesian networks .In: Glymour, C., Cooper, G .F . (Eds .), Computation,

107

Page 17: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

108

R D. Anderson, G. Vastag l European Journal of Operational Research 156 (2004) 92-109

Causation and Discovery. MIT Press, Menlo Park, CA, pp.3-62 .

Cooper, G.F, Herskovits, E., 1992 . A Bayesian method for theinduction of probabilistic network from data . MachineLearning 9, 309-347.

Cowell, R.G., Dawid, A.P., Lauritzen, S .L., Spiegelhalter, D.J .,1999. Probabilistic Networks and Expert Systems . Springer-Verlag, New York .

Druzdzel, M .J ., Simon, H.A ., 1993 . Causality in Bayesian beliefnetworks . In : Proceedings of Ninth Annual Conference inArtificial Intelligence. Morgan Kaufmann Publishers, SanFrancisco, pp . 3-11 .

Einstein, A., Podolsky, B ., Rosen, N ., 1935 . Can quantum-mechanical description of physical reality be consideredcomplete? Physical Review 41, 777 .

Geiger, D., Herkerman, D ., 1997 . A characterization ofDirichlet distributions through local and global indepen-dence . Annals of Statistics 25, 1344-1368 .

Glymour, C., Cooper, G.F., 1999 . Computation, Causationand Discovery. MIT Press, Cambridge, MA .

Guadognoli, E ., Velicer, W.F., 1988 . Relationship of samplesize to the stability of component patterns . PsychologicalBulletin 103, 265-275.

Hausman, D.M ., Woodward, J ., 1999 . Independence, invari-ance and the causal Markov condition . British Journal forthe Philosophy of Science 50, 521-583 .

Heckerman, D ., 1999 . A tutorial on learning with Bayesiannetworks . In : Jordan, M.I. (Ed .), Learning in GraphicalModels. MIT Press, Cambridge, MA, pp. 301-354 .

Hume, D., 1969 . A Treatise of Human Nature. Penguin Books,London .

Jensen, F., 1996 . An Introduction to Bayesian Network.University College London Press, London .

Johnson, D.R ., Creech, J.C., 1983 . Ordinal measures inmultiple indicator models. American Sociological Review48, 398-407 .

Joreskog, K.G., 1971 . Statistical analysis of set of congenerictest. Psychometrika 36, 109-133 .

Joreskog, K.G., 1973. A general method for estimating a linearstructural equation system. In : Goldberger, A.S ., Duncan,O.D . (Eds .), Structural Equation Models in the SocialSciences. Academic Press, New York, pp . 85-112 .

Joreskog, K.G., Sorbom, D., 1993. LISREL 8, Analysis ofLinear Structural Relations by the Method of MaximumLikelihood. International Education Services, Chicago .

Kiiveri, H .T ., Speed, T.P., 1982. Structural analysis of multi-variate data : A review . In : Leinhardt, S . (Ed.), SociologicalMethodology. Jossy Bass, San Francisco, pp . 209-289.

Larranaga, P., Poza, M., Yurramendi, Y., Murga, R.H .,Kuijpers, C.M.H., 1996. Structure learning of Bayesiannetworks by genetic algorithms : A performance analysis ofcontrol parameters. IEEE Journal on Pattern Analysis andMachine Intelligence 18 (9), 912-926.

Lawley, D.N ., Maxwell, A.E., 1963. Factor Analysis as aStatistical Method . Butterworth, London .

Little, T.D ., Cunningham, W.A ., Shahar, G ., Widaman, K.F.,2002. To parcel or not to parcel: Exploring the question,

weighing the merits. Structural Equation Modeling 9, 151-173 .

MacCallum, R.C ., 1995. Model specification: Procedures,strategies, and related issues. In : Hoyle, R .H . (Ed .), Struc-tural Equation Modeling : Concepts, Issues and Applica-tions. Sage, Thousand Oaks, CA, pp . 16-36.

MacCallum, R.C., Wegener, D .T., Uchino, B.M ., Fabrigar,L.R ., 1993. The problem of equivalent models in applica-tions of covariance structure analysis . Psychological Bulle-tin 114, 185-199.

Mardia, K.V., 1970. Measures of multivariate skewness andkurtosis with applications. Biometrika 57, 519-530 .

McKim, V.R ., Turner, S.P., 1997 . Causality in Crisis? Univer-sity of Notre Dame Press, Notre Dame, IN .

Micceri, T., 1989. The unicorn, the normal curve, and otherimprobable creatures. Psychological Bulletin 105, 156-166 .

Muthen, B.O ., . Muthen, L.K ., 1998. Mplus: User's Guide .Muthen and Muthen, Los Angeles, CA .

Pearl, J ., 1988. Probabilistic Inference in Intelligent Systems .Morgan Kaufmann, San Mateo, CA .

Pearl, J ., 1995 . Causal diagrams for empirical research .Biometrika 82 (4), 669-710 .

Pearl, J ., 2000 . Causality: Models, Reasoning and Inference .Cambridge University Press, Cambridge, UK .

Pedhazur, E.J ., Schmelkin, L ., 1991 . Measurement, Design, andAnalysis . Lawrence Erlbaum Associates, Hillsdale, NJ .

Ping, R ., 1996 . Latent variable interaction and quadratic effectestimation: A two-step technique using structural equationanalysis . Psychological Bulletin 119, 166-175 .

Ramoni, M., Sebastiani, P., 1999 . Bayesian methods forintelligent data analysis . In: Bethold, M., Hand, D .J.(Eds .), An Introduction to Intelligent Data Analysis.Springer-Verlag, New York, pp . 129-166.

Reichenbach, Hans., 1956. The Direction of Time . Universityof California Press, Berkeley .

Scheines, R ., 1997 . An introduction to causal inference . In:McKim, V.R., Turner, P.T. (Eds.), Causality in Crisis?.University of Notre Dame Press, Notre Dame, IN, pp . 185-199 .

Scheines, R ., 1999 . Estimating causal influences : TETRAD IImodel selection and Bayesian parameter estimation . In :Glymour, C., Cooper, G.F. (Eds .), Computation, Causationand Discovery . MIT Press, Menlo Park, CA, pp . 425-437.

Scheines, R., Spirtes, P., Glymour, C., Meek, C., 1994 .TETRAD II: User's Manual . Lawrence Erlbaum, Hillsdale,NJ.

Schmenner, R.W., 1988. The merit of making things fast . SloanManagement Review 30 (1), 11-17 .

Sebastiani, P., Ramoni, M ., 2000 . Bayesian inference withmissing data using bound and collapse . Journal of Compu-tational and Graphical Statistics 9 (4), 779-800.

Shachter, R.D., 1986. Evaluating influence diagrams. Opera-tions Research 34 (6), 871-882.

Spearman, C ., 1904. General intelligence, objectively deter-mined and measured . American Journal of Psychology 15,201-293 .

Page 18: SCIENCE EUROPEAN JOURNAL OF OPERATIONAL …home.kelley.iupui.edu/gvastag/Papers/EJOR_2004_CausalModeling.pdf · This paper uses the relationships between three basic, ... (Spirtes

RD. Anderson, G. Vastag / European Journal of Operational Research 156 (2004) 92-109

Spiegelhalter, D.J ., Dawid, A.P., Lauritzen, S.L ., Cowell, R.G .,1993 . Bayesian analysis with expert systems (with discus-sion). Statistical Science 8, 219-283 .

Spirtes, P., Glymour, C ., Schienes, R., 1993. Causation,Prediction, and Search. Springer-Verlag, New York .

Stelzl, I ., 1986 . Changing causal relationships withoutchanging the fit: Some rules for generating equivalentLISREL-models . Multivariate behavioral Research 21,309-331 .

Suppes, P ., 1970 . A Probabilistic Theory of Causality . North-Holland Publishing, Amsterdam .

Thurstone, L.L ., 1935. Vectors of the Mind . Chicago UniversityPress, Chicago .

Vastag, G., Whybark, D.C., 1993 . Global relations betweeninventory, manufacturing lead time and delivery datepromises. International Journal of Production Economics30-31,563-569 .

Vastag, G., Montabon, F ., 2001 . Linkages among manufactur-ing concepts, inventories, delivery service and competitive-

109

ness. International Journal of Production Economics 71,195-204.

Verma, T., Pearl, J ., 1990. Equivalence and synthesis of causalmodels. In : Proceedings, Sixth Workshop on Uncertainty inArtificial Intelligence . North Holland, Amsterdam, pp. 220-227 .

West, S.G ., Finch, J .F., Curran, P.J ., 1995. Structural equationmodels with nonnormal variables : Problems and remedies .In: Hoyle, R.H. (Ed .), Structural Equation Modeling :Concepts, Issues and Applications. Sage, Thousand Oaks,CA, pp . 56-75 .

Wright, S., 1934. The method of path coefficients . Annals ofMathematical Statistics 5, 161-215 .

Whybark, D.C., Vastag, G. (Eds.), 1993. Global Manufactur-ing Practices: A Worldwide Survey of Practices and Manu-facturing Planning and Control . Elsevier, Amsterdam .

Winkler, R.L ., 1967. The assessment of prior distributions inBayesian analysis . Journal of American Statistical Associ-ation 62, 776-800 .