Optimal Verification Testing with Graphical Effects Analysis
Transcript of Optimal Verification Testing with Graphical Effects Analysis
Optimal Verification Testing with Graphical Effects Analysis
1
Optimal Verification Testing with Graphical Effects Analysis
Presented by Mark J. Anderson, PE, CQE, MBA Stat-Ease, Inc., Minneapolis, Minnesota
|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88
0102030
50
70
80
90
95
99
Anecdote: Verifying tooth fairy
Optimal Verification Testing with Graphical Effects Analysis
2
Statistical Design of Experiments (DOE)
Process
Controllable Factors (X)
Responses (Y)
Uncontrolled Factors
“A series of tests, in
which purposeful changes
are made to input factors,
to identify causes
for significant changes
in the output responses.”
This talk will detail state-of-the-art tools for the design and analysis of verification tests that include both numeric and categoric factors.
Optimal Verification Testing with Graphical Effects Analysis
3
Multi-Factor Testing (VS OFAT)(Bearing life from accelerated test)
"To make knowledge work productive will be the great management task of this century."
-- Peter Drucker
A- A+B-
B+
C-
C+
17
19
26
16
25
21
85
128
Relative efficiency = 16/8
2 to 1!
Factors:A. Heat TreatmentB. OsculationC. Material**Categorical
Optimal Verification Testing with Graphical Effects Analysis 4
Strategy of ExperimentationFocus on Verification
yes
Factor effectsand interactions
ResponseSurfaceMethods
Curvature?
Confirm?
KnownFactors
UnknownFactors
Screening
Backup
Celebrate!
no
no
yes
Trivialmany
Vital few
Screening
Characterization
Optimization
Verification
Factor effectsand interactions
ResponseSurfaceMethods
Curvature?
Confirm?
KnownFactors
UnknownFactors
Screening
Backup
Celebrate!
no
no
yes
Trivialmany
Vital few
Screening
Characterization
Optimization
Verification
Factorial Design Planning Process
1. State objective in terms of measurable responses. For each:
a. Define the effect (response difference ∆y) that is important to detect for each response. (This is the signal, at a minimum, you are listening for.)
b. Estimate experimental error (response variation σ) for each response. (The noise.)
2. Select the input factors to study and establish their ranges.(Wider the better for creating effects exceeding ∆y.)
3. Select a design and evaluate it for:
Resolution of effects (beware of aliasing).
Power based on its signal to noise ratio (∆y/σ).(For verification aim high: > 90 % for every response.)
Examine all runs for unsafe factor combinations.(Pre-test any that may not work and/or create hazards.)
5Optimal Verification Testing with Graphical Effects Analysis
Requisites of a Good Test Design*
The test design matches the test objectives
All the important responses are measured
Factor ranges are practical
Replication – measures experimental error
Randomization – counteracts lurking variables
Blocking – filters systematic variation
Everyone is involved (teamwork)
Optimal Verification Testing with Graphical Effects Analysis
6
*(Adapted from “Proven Cost Savings Using Modern Design Of Experiments (MDOE)” presented by William B. Line, DOES (Design of Experiments) Institute, to the
American Institute of Aeronautics and Astronautics Aerospace Sciences Meeting,Orlando, Florida January 4, 2010)
Optimal Verification Testing with Graphical Effects Analysis
7
Verification Designswith Categorical Factors
Ideally these test matrices can accommodate any number of levels for any many factors, some or all of which might be categorical.
Typical problem*: “We have 6 factors – 5 at 3 levels each and 1 at 2 levels. Each
run is ~$600 (ouch). Our budget will likely support ~60 experimental runs, if needed – but I’d like to conduct this experiment in less runs and save costs.”There are 486 possible combinations, which would cost almost $300,000 to perform. Is there a way to run only a fraction?
*(Correspondence to author on 2/8/10)
Optimal Verification Testing with Graphical Effects Analysis
8
Optimal Design to Minimize Runs
1. Specify a polynomial that you think is needed to get a decent approximation of the actual mechanism.
Do not overlook two-factor interactions (“2FIs”).
2. Select minimal points to estimate all coefficients in your design-for model.(Computer-based exchange algorithm.)
3. Consider augmenting the design with points for:
– Replicates: To estimate pure error.
– Lack-of-fit: To test how well the model represents actual behavior in our region of interest.
Optimal Verification Testing with Graphical Effects Analysis
9
D-optimal
Criterion: D-optimal design minimizes the determinant of the (X'X)-1 matrix. This minimizes the volume of the confidence ellipsoid for the coefficients and maximizes information about the polynomial coefficients.
β2
β1
Correlated Coefficients
β1
β2
Uncorrelated Coefficients
Statisticaldetail
Importance vs SignificanceVerification
Many are unclear on this difference!
Optimal Verification Testing with Graphical Effects Analysis
10
Significant
No YesIm
port
ant No
Yes?
\!/
For example, let’s look at a two-level factorial verification test on a system that must be must not exceed 35 units of response due to factors varied within ranges that may be encountered in the field.
Edme section 1
Half Normal Probability PaperSorting the vital few effects from the trivial many.
11
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 5.38 10.75 16.13 21.50
0102030
50
70
80
90
95
B
C
BC
Significant effects:The model terms!
Negligible effects: The error estimate!
Before looking at the two-level factorial verification test case, this vital tool for analysis must be explained.
Verified :Not Significant, nor Important
12
Is anything statistically significant?
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 7.13 14.25 21.38 28.50
0102030
50
70
80
90
95
99 Warning! No terms are selected.
No
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 7.13 14.25 21.38 28.50
0102030
50
70
80
90
95
99 Warning! No terms are selected.
No!∆ of 28.5
Could largest effect be important?
Optimal Verification Testing with Graphical Effects Analysis
Must not exceed 35 units of response due to factors varied
within ranges that may be encountered in the field.
Verification Failed:Not Significant, but Important (?)
13
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 10.69 21.38 32.06 42.75
0102030
50
70
80
90
95
99 Warning! No terms are selected.
Is anything statistically significant?
Could largest effect be important?
Yes!∆ of 42.75H
alf-N
orm
al %
Pro
babi
lity
|Standardized Effect|
0.00 10.69 21.38 32.06 42.75
0102030
50
70
80
90
95
99 Warning! No terms are selected.
Optimal Verification Testing with Graphical Effects Analysis
Must not exceed 35 units
Verified (Qualified) :Significant, but not Important
14
Is anything statistically significant?
Could largest effect be important?
Yes!
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 5.50 11.00 16.50 22.00
0102030
50
70
80
90
95
99
F
No!∆ of 22H
alf-N
orm
al %
Pro
babi
lity
|Standardized Effect|
0.00 5.50 11.00 16.50 22.00
0102030
50
70
80
90
95
99
F
Optimal Verification Testing with Graphical Effects Analysis
Verification Failed:Significant and Important
15
Is anything statistically significant?
Could largest effect be important?
\!/
Yes!
Hal
f-Nor
mal
% P
roba
bilit
y
|Standardized Effect|
0.00 11.16 22.31 33.47 44.63
0102030
50
70
80
90
95
99
F
Yes!∆ of
44.62H
alf-N
orm
al %
Pro
babi
lity
|Standardized Effect|
0.00 11.16 22.31 33.47 44.63
0102030
50
70
80
90
95
99
F
Optimal Verification Testing with Graphical Effects Analysis
Must not exceed 35 units
Case Study:Hydraulic Gear Pump*
Optimal Verification Testing with Graphical Effects Analysis
16
Hydraulic gear pumps are vital for many machines including vehicles and airplanes. However, they tend to lose efficiency due to internal leakage. OEM engineers must verify that one such device will remain within a specified range of performance regardless of normal production variations. They settle on nine factors, primarily categorical -- shown at right with number of levels each.
*(Loosely based on “Experiments for derived factors with application to hydraulic gear pumps,” by C. J. Sexton , S. M. Lewis & C. P. Please, Journal of the Royal Statistical Society: Series C (Applied Statistics), 2002, V50, Part 2, pp 155-170)
A. Flange (3)
B. Cover (3)
C. Float (3)
D. Bearing (2)
E. Involute (2)
F. Lead edge (2)
G. Side Gap (2)
H. Pressure (3)
I. Speed (3)
Pump CaseSpecifying the Experiment Design
Optimal Verification Testing with Graphical Effects Analysis
17
Based on subject-matter knowledge, the engineers are most concerned about interactions involving the float (factor C).
Customized pumps are costly, so a minimal-run design is desired.
Choose a d-optimal design for a reduced 2FI model: intercept, 9 main effects and 8 two-factor interactions (those involving C).
39 model points (builds) must be picked from the 3888 (3x3x3x2x2x2x2x3x3) possible combinations, in other words, approximately a 1/10th fraction.
To estimate lack-of-fit pick 5 more unique combinations
From these 44 builds, select 4 to replicated for pure error.
48 pumps will be built in total.
Differences in leak-back of 2 units are of interest. It must not exceed 10 units overall.
Pump CaseWill the design provided proper power?
1.Define the change (∆y) that is important. (Signal)
2.Estimate experimental error (σ). (Noise)
3.From signal-to-noise ratio (∆y/σ) estimate power.If runs suffice, the averaging provided by the matrix will cut the grass (noise) to reveal the snake (effect)!
Optimal Verification Testing with Graphical Effects Analysis
18
Yes, power at 5% risk to see signal exceeds the 90% guideline for verification testing.
Pump CaseResults
Optimal Verification Testing with Graphical Effects Analysis
19
Design-Expert® SoftwareLeak-back
Shapiro-Wilk testW-value = 0.702p-value = 0.000A: FlangeB: CoverC: FloatD: BearingE: InvoluteF: Lead edgeG: Side GapH: Pressure J: Speed
Half-Normal Plot
Hal
f-Nor
mal
% P
roba
bilit
y
|Normal Effect|
0.00 2.31 4.62 6.93 9.25
0102030
50
70
80
90
95
99
C
D
E
H
J
CH
Significant effects, but is the largest less than 10 leak-back units? We cannot tell with this graph because the effects are in a
“normal” scale similar to a Z score.
This is not a two-level factorial.
Some trickery is required (see Ref.
#2) to plot multiple contrasts on a
comparative scale.
Pump CaseEffects Plots
Optimal Verification Testing with Graphical Effects Analysis
20
Three significant main effects (one-factor), plus one two-factor interaction. =>
Does the predicted response range more than 10 units? If so, it fails verification.
E: Involute
Leak
-bac
k
One Factor
Form10 Form20
6
8
10
12
14
16
D: BearingLe
ak-b
ack
One Factor
N4 W5
6
8
10
12
14
16
J: Speed
Leak
-bac
k
One Factor
Slo Mid Fast
6
8
10
12
14
16
H: Pressure
SB SC SD
Interaction
C: Float
Leak
-bac
k
6
8
10
12
14
16
H1
H2
H3
Pump CaseNumerical Optimization
Optimal Verification Testing with Graphical Effects Analysis
21
MinimumLeak-Back
MaximumLeak-Back
Difference < 10 units
Predicted response ranges less than 10 units, so it passes verification.
Optimal Verification Testing with Graphical Effects Analysis
22
Take-Home Messages
By way of a case study on verification testing of an hydraulic gear pump, this presentation on design of experiments (DOE) provided insights into statistically-optimal designs involving many categorical factors at multiple levels.
Upon collection of the response data, an innovative new graphical approach to assessing effects – the half-normal plot – revealed at-a- glance the likely significance and, for two-level factorials the importance of the signal generated by the experiment. (General factorials require special scaling per Ref #2 shown on following slide.)
Multifactorial DOEs like this are more efficient than traditional one-factor-at-a-time (OFAT) testing, which would never reveal an interaction such as the one that came to light in this case.
|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88
0102030
50
7080
90
95
99
Further Reading for More Detail on Methodology
1. Mark J. Anderson and Patrick J. Whitcomb, DOE Simplified –Practical Tools for Effective Experimentation, 2nd Edition, Productivity Press, NY, NY, 2007.
2. Patrick J. Whitcomb and Gary W. Oehlert, “Graphical Selection of Effects in General Factorials,” 2007 Fall Technical Conference of the American Society for Quality (ASQ) and the American Statistical Association (ASA).
23
|Standardized Effect|0.00 0.70 1.39 2.09 2.79 3.48 4.18 4.88
0102030
50
7080
90
95
99
Optimal Verification Testing with Graphical Effects Analysis
Your thoughts welcomed
For a copy of updated “Optimal Verification Testing with Graphical Effects Analysis,” e-mail [email protected]. Feel free to provide comments and suggestions!
Thanks for listening!
24
“It may happen that small differences in the initial conditionsproduce great ones in the final phenomena.”
- Henri Poincare
Optimal Verification Testing with Graphical Effects Analysis