Optimal Verification Testing with Graphical Effects Analysis

24
Optimal Verification Testing with Graphical Effects Analysis 1 Optimal Verification Testing with Graphical Effects Analysis Presented by Mark J. Anderson, PE, CQE, MBA Stat-Ease, Inc., Minneapolis, Minnesota [email protected] |Standardized Effect| 0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88 0 10 20 30 50 70 80 90 95 99 Anecdote: Verifying tooth fairy

Transcript of Optimal Verification Testing with Graphical Effects Analysis

Page 1: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

1

Optimal Verification Testing with Graphical Effects Analysis

Presented by Mark J. Anderson, PE, CQE, MBA Stat-Ease, Inc., Minneapolis, Minnesota

[email protected]

|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88

0102030

50

70

80

90

95

99

Anecdote: Verifying tooth fairy

Page 2: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

2

Statistical Design of Experiments (DOE)

Process

Controllable Factors (X)

Responses (Y)

Uncontrolled Factors

“A series of tests, in

which purposeful changes

are made to input factors,

to identify causes

for significant changes

in the output responses.”

This talk will detail state-of-the-art tools for the design and analysis of verification tests that include both numeric and categoric factors.

Page 3: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

3

Multi-Factor Testing (VS OFAT)(Bearing life from accelerated test)

"To make knowledge work productive will be the great management task of this century."

-- Peter Drucker

A- A+B-

B+

C-

C+

17

19

26

16

25

21

85

128

Relative efficiency = 16/8

2 to 1!

Factors:A. Heat TreatmentB. OsculationC. Material**Categorical

Page 4: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis 4

Strategy of ExperimentationFocus on Verification

yes

Factor effectsand interactions

ResponseSurfaceMethods

Curvature?

Confirm?

KnownFactors

UnknownFactors

Screening

Backup

Celebrate!

no

no

yes

Trivialmany

Vital few

Screening

Characterization

Optimization

Verification

Factor effectsand interactions

ResponseSurfaceMethods

Curvature?

Confirm?

KnownFactors

UnknownFactors

Screening

Backup

Celebrate!

no

no

yes

Trivialmany

Vital few

Screening

Characterization

Optimization

Verification

Page 5: Optimal Verification Testing with Graphical Effects Analysis

Factorial Design Planning Process

1. State objective in terms of measurable responses. For each:

a. Define the effect (response difference ∆y) that is important to detect for each response. (This is the signal, at a minimum, you are listening for.)

b. Estimate experimental error (response variation σ) for each response. (The noise.)

2. Select the input factors to study and establish their ranges.(Wider the better for creating effects exceeding ∆y.)

3. Select a design and evaluate it for:

Resolution of effects (beware of aliasing).

Power based on its signal to noise ratio (∆y/σ).(For verification aim high: > 90 % for every response.)

Examine all runs for unsafe factor combinations.(Pre-test any that may not work and/or create hazards.)

5Optimal Verification Testing with Graphical Effects Analysis

Page 6: Optimal Verification Testing with Graphical Effects Analysis

Requisites of a Good Test Design*

The test design matches the test objectives

All the important responses are measured

Factor ranges are practical

Replication – measures experimental error

Randomization – counteracts lurking variables

Blocking – filters systematic variation

Everyone is involved (teamwork)

Optimal Verification Testing with Graphical Effects Analysis

6

*(Adapted from “Proven Cost Savings Using Modern Design Of Experiments (MDOE)” presented by William B. Line, DOES (Design of Experiments) Institute, to the

American Institute of Aeronautics and Astronautics Aerospace Sciences Meeting,Orlando, Florida January 4, 2010)

Page 7: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

7

Verification Designswith Categorical Factors

Ideally these test matrices can accommodate any number of levels for any many factors, some or all of which might be categorical.

Typical problem*: “We have 6 factors – 5 at 3 levels each and 1 at 2 levels. Each

run is ~$600 (ouch). Our budget will likely support ~60 experimental runs, if needed – but I’d like to conduct this experiment in less runs and save costs.”There are 486 possible combinations, which would cost almost $300,000 to perform. Is there a way to run only a fraction?

*(Correspondence to author on 2/8/10)

Page 8: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

8

Optimal Design to Minimize Runs

1. Specify a polynomial that you think is needed to get a decent approximation of the actual mechanism.

Do not overlook two-factor interactions (“2FIs”).

2. Select minimal points to estimate all coefficients in your design-for model.(Computer-based exchange algorithm.)

3. Consider augmenting the design with points for:

– Replicates: To estimate pure error.

– Lack-of-fit: To test how well the model represents actual behavior in our region of interest.

Page 9: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

9

D-optimal

Criterion: D-optimal design minimizes the determinant of the (X'X)-1 matrix. This minimizes the volume of the confidence ellipsoid for the coefficients and maximizes information about the polynomial coefficients.

β2

β1

Correlated Coefficients

β1

β2

Uncorrelated Coefficients

Statisticaldetail

Page 10: Optimal Verification Testing with Graphical Effects Analysis

Importance vs SignificanceVerification

Many are unclear on this difference!

Optimal Verification Testing with Graphical Effects Analysis

10

Significant

No YesIm

port

ant No

Yes?

\!/

For example, let’s look at a two-level factorial verification test on a system that must be must not exceed 35 units of response due to factors varied within ranges that may be encountered in the field.

Page 11: Optimal Verification Testing with Graphical Effects Analysis

Edme section 1

Half Normal Probability PaperSorting the vital few effects from the trivial many.

11

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 5.38 10.75 16.13 21.50

0102030

50

70

80

90

95

B

C

BC

Significant effects:The model terms!

Negligible effects: The error estimate!

Before looking at the two-level factorial verification test case, this vital tool for analysis must be explained.

Page 12: Optimal Verification Testing with Graphical Effects Analysis

Verified :Not Significant, nor Important

12

Is anything statistically significant?

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 7.13 14.25 21.38 28.50

0102030

50

70

80

90

95

99 Warning! No terms are selected.

No

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 7.13 14.25 21.38 28.50

0102030

50

70

80

90

95

99 Warning! No terms are selected.

No!∆ of 28.5

Could largest effect be important?

Optimal Verification Testing with Graphical Effects Analysis

Must not exceed 35 units of response due to factors varied

within ranges that may be encountered in the field.

Page 13: Optimal Verification Testing with Graphical Effects Analysis

Verification Failed:Not Significant, but Important (?)

13

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 10.69 21.38 32.06 42.75

0102030

50

70

80

90

95

99 Warning! No terms are selected.

Is anything statistically significant?

Could largest effect be important?

Yes!∆ of 42.75H

alf-N

orm

al %

Pro

babi

lity

|Standardized Effect|

0.00 10.69 21.38 32.06 42.75

0102030

50

70

80

90

95

99 Warning! No terms are selected.

Optimal Verification Testing with Graphical Effects Analysis

Must not exceed 35 units

Page 14: Optimal Verification Testing with Graphical Effects Analysis

Verified (Qualified) :Significant, but not Important

14

Is anything statistically significant?

Could largest effect be important?

Yes!

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 5.50 11.00 16.50 22.00

0102030

50

70

80

90

95

99

F

No!∆ of 22H

alf-N

orm

al %

Pro

babi

lity

|Standardized Effect|

0.00 5.50 11.00 16.50 22.00

0102030

50

70

80

90

95

99

F

Optimal Verification Testing with Graphical Effects Analysis

Page 15: Optimal Verification Testing with Graphical Effects Analysis

Verification Failed:Significant and Important

15

Is anything statistically significant?

Could largest effect be important?

\!/

Yes!

Hal

f-Nor

mal

% P

roba

bilit

y

|Standardized Effect|

0.00 11.16 22.31 33.47 44.63

0102030

50

70

80

90

95

99

F

Yes!∆ of

44.62H

alf-N

orm

al %

Pro

babi

lity

|Standardized Effect|

0.00 11.16 22.31 33.47 44.63

0102030

50

70

80

90

95

99

F

Optimal Verification Testing with Graphical Effects Analysis

Must not exceed 35 units

Page 16: Optimal Verification Testing with Graphical Effects Analysis

Case Study:Hydraulic Gear Pump*

Optimal Verification Testing with Graphical Effects Analysis

16

Hydraulic gear pumps are vital for many machines including vehicles and airplanes. However, they tend to lose efficiency due to internal leakage. OEM engineers must verify that one such device will remain within a specified range of performance regardless of normal production variations. They settle on nine factors, primarily categorical -- shown at right with number of levels each.

*(Loosely based on “Experiments for derived factors with application to hydraulic gear pumps,” by C. J. Sexton , S. M. Lewis & C. P. Please, Journal of the Royal Statistical Society: Series C (Applied Statistics), 2002, V50, Part 2, pp 155-170)

A. Flange (3)

B. Cover (3)

C. Float (3)

D. Bearing (2)

E. Involute (2)

F. Lead edge (2)

G. Side Gap (2)

H. Pressure (3)

I. Speed (3)

Page 17: Optimal Verification Testing with Graphical Effects Analysis

Pump CaseSpecifying the Experiment Design

Optimal Verification Testing with Graphical Effects Analysis

17

Based on subject-matter knowledge, the engineers are most concerned about interactions involving the float (factor C).

Customized pumps are costly, so a minimal-run design is desired.

Choose a d-optimal design for a reduced 2FI model: intercept, 9 main effects and 8 two-factor interactions (those involving C).

39 model points (builds) must be picked from the 3888 (3x3x3x2x2x2x2x3x3) possible combinations, in other words, approximately a 1/10th fraction.

To estimate lack-of-fit pick 5 more unique combinations

From these 44 builds, select 4 to replicated for pure error.

48 pumps will be built in total.

Differences in leak-back of 2 units are of interest. It must not exceed 10 units overall.

Page 18: Optimal Verification Testing with Graphical Effects Analysis

Pump CaseWill the design provided proper power?

1.Define the change (∆y) that is important. (Signal)

2.Estimate experimental error (σ). (Noise)

3.From signal-to-noise ratio (∆y/σ) estimate power.If runs suffice, the averaging provided by the matrix will cut the grass (noise) to reveal the snake (effect)!

Optimal Verification Testing with Graphical Effects Analysis

18

Yes, power at 5% risk to see signal exceeds the 90% guideline for verification testing.

Page 19: Optimal Verification Testing with Graphical Effects Analysis

Pump CaseResults

Optimal Verification Testing with Graphical Effects Analysis

19

Design-Expert® SoftwareLeak-back

Shapiro-Wilk testW-value = 0.702p-value = 0.000A: FlangeB: CoverC: FloatD: BearingE: InvoluteF: Lead edgeG: Side GapH: Pressure J: Speed

Half-Normal Plot

Hal

f-Nor

mal

% P

roba

bilit

y

|Normal Effect|

0.00 2.31 4.62 6.93 9.25

0102030

50

70

80

90

95

99

C

D

E

H

J

CH

Significant effects, but is the largest less than 10 leak-back units? We cannot tell with this graph because the effects are in a

“normal” scale similar to a Z score.

This is not a two-level factorial.

Some trickery is required (see Ref.

#2) to plot multiple contrasts on a

comparative scale.

Page 20: Optimal Verification Testing with Graphical Effects Analysis

Pump CaseEffects Plots

Optimal Verification Testing with Graphical Effects Analysis

20

Three significant main effects (one-factor), plus one two-factor interaction. =>

Does the predicted response range more than 10 units? If so, it fails verification.

E: Involute

Leak

-bac

k

One Factor

Form10 Form20

6

8

10

12

14

16

D: BearingLe

ak-b

ack

One Factor

N4 W5

6

8

10

12

14

16

J: Speed

Leak

-bac

k

One Factor

Slo Mid Fast

6

8

10

12

14

16

H: Pressure

SB SC SD

Interaction

C: Float

Leak

-bac

k

6

8

10

12

14

16

H1

H2

H3

Page 21: Optimal Verification Testing with Graphical Effects Analysis

Pump CaseNumerical Optimization

Optimal Verification Testing with Graphical Effects Analysis

21

MinimumLeak-Back

MaximumLeak-Back

Difference < 10 units

Predicted response ranges less than 10 units, so it passes verification.

Page 22: Optimal Verification Testing with Graphical Effects Analysis

Optimal Verification Testing with Graphical Effects Analysis

22

Take-Home Messages

By way of a case study on verification testing of an hydraulic gear pump, this presentation on design of experiments (DOE) provided insights into statistically-optimal designs involving many categorical factors at multiple levels.

Upon collection of the response data, an innovative new graphical approach to assessing effects – the half-normal plot – revealed at-a- glance the likely significance and, for two-level factorials the importance of the signal generated by the experiment. (General factorials require special scaling per Ref #2 shown on following slide.)

Multifactorial DOEs like this are more efficient than traditional one-factor-at-a-time (OFAT) testing, which would never reveal an interaction such as the one that came to light in this case.

|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88

0102030

50

7080

90

95

99

Page 23: Optimal Verification Testing with Graphical Effects Analysis

Further Reading for More Detail on Methodology

1. Mark J. Anderson and Patrick J. Whitcomb, DOE Simplified –Practical Tools for Effective Experimentation, 2nd Edition, Productivity Press, NY, NY, 2007.

2. Patrick J. Whitcomb and Gary W. Oehlert, “Graphical Selection of Effects in General Factorials,” 2007 Fall Technical Conference of the American Society for Quality (ASQ) and the American Statistical Association (ASA).

23

|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88

0102030

50

7080

90

95

99

Optimal Verification Testing with Graphical Effects Analysis

Page 24: Optimal Verification Testing with Graphical Effects Analysis

Your thoughts welcomed

For a copy of updated “Optimal Verification Testing with Graphical Effects Analysis,” e-mail [email protected]. Feel free to provide comments and suggestions!

Thanks for listening!

24

“It may happen that small differences in the initial conditionsproduce great ones in the final phenomena.”

- Henri Poincare

Optimal Verification Testing with Graphical Effects Analysis