Representing and Analyzing Uncertainty in Large-Scale Complex System Models
description
Transcript of Representing and Analyzing Uncertainty in Large-Scale Complex System Models
Partnership for AiR Transportation Noise and Emission Reduction Partnership for AiR Transportation Noise and Emission Reduction
An FAA/NASA/TC-sponsored Center of Excellence
Representing and Analyzing Representing and Analyzing Uncertainty in Large-Scale Uncertainty in Large-Scale Complex System ModelsComplex System Models
Doug Allaire
2
Outline
1. Modeling Uncertainty
2. Research Objectives
3. The Aviation Environmental Portfolio Management Tool
4. Global Sensitivity Analysis
5. Future Work
3
Modeling Uncertainty
• Uncertainties in modeling are unavoidable
• Uncertainty should be properly represented– Estimates and predictions– Risk analysis– Cost-benefit analysis– Furthering model development
• Uncertainty in models can be represented with three different methodologies– Probabilistic (aleatory)– Fuzzy sets (epistemic)– Fuzzy Randomness (both)
4
Large-Scale Complex System Models
• Multiple disciplines– Economics, aerodynamics, atmospheric science, …
• Many inputs, many outputs
• Systems of models of different character– Physics-based, empirical
• Many assumptions– Subjective, objective
• Computationally intensive
• Characterizing, representing, quantifying, and accounting for uncertainty is key– To both development and application of models
5
Uncertainty in Large-Scale Complex System Models
Goals of a formal uncertainty analysis:
• Further the development of a model– Identify gaps in functionality that significantly impact the
achievement of model requirements, leading to the identification of high-priority areas for further development.
– Rank inputs based on contributions to output variability to inform future research
• Inform decision-making– Provide quantitative evaluation of the performance of the model
relative to fidelity requirements for various analysis scenarios.– Properly represent different types of uncertainty in the model
Especially important for complex systems that comprise multiple models of different disciplines, different character, different assumptions
6
What exists today
• Probabilistic and fuzzy random uncertainty models
• Rigorous model independent sensitivity analysis method – Expensive
• Several model dependent sensitivity analysis methods– Inexpensive but difficult to ensure proper use
• Limited research into the use of surrogate models in uncertainty analysis– Critical for models with long runtimes
• Numerous sampling techniques for improving runtimes of probabilistic analyses
7
Research Objectives
• For specific large-scale, complex system models, identify how to properly represent and analyze uncertainty.
• Systematically develop surrogate models for situations where proper representation and analysis of uncertainty is computationally prohibitive. Limitations and uncertainty associated with the use of a surrogate should be identified.
• Use appropriate sampling techniques to improve computational costs in uncertainty analysis. Limitations and uncertainty associated with the use of chosen sampling methods should be identified.
8
Aviation Environmental Portfolio Management Tool (APMT)
APMT
Policy scenarios•Certification stringency•Market-based measures•Land-use controls•Sound insulation
Market scenarios•Demand•Fuel prices•Fleet
Environmental scenarios•CO2 growth
Technology and operational advances•CNS/ATM, NGATS•Long term technology forecasts
Cost-effectiveness•$/kg NOx reduced•$/# people removed from 65dB DNL •$/kg PM reduced•$/kg CO2 reduced
Benefit-cost•Health and welfare impacts•Change in societal welfare ($)
Distributional analyses•Who benefits, who pays•Consumers•Airports•Airlines•Manufacturers•People impacted by noise and pollution•Special groups•Geographical regions
Global, Regional, Airport-local
inp
uts
ou
tpu
ts
9
APMT PARTIAL EQUILIBRIUM BLOCK
NOISE IMPACTS
LOCAL AIR QUALITY IMPACTS
CLIMATE IMPACTS
APMT COSTS & BENEFITS
New Aircraft
Emissions
Noise
APMT BENEFITS VALUATION BLOCK
Monetized Benefits
CollectedCosts
Emissions
Emissions & Noise
Policy and Scenarios
AEDT
Fares
DEMAND (Consumers)
SUPPLY (Carriers)
Operations
Schedule &
Fleet
Airport-levelEmissions
Tools
GlobalNoise
Assessment
Airport-levelNoiseTools
GlobalEmissionsInventories
EDS VehicleNoise
Design Tools
VehicleEmissions
Design Tools
Technology
ImpactForecastin
gVehicle
CostAssessme
nt
DesignTools
Interface
APMT approach
10
APMT Uncertainty Analysis Goals
• AQ1. What are the key assumptions employed within the module? How do these assumptions translate into quantifiable uncertainty in module outputs?
• AQ2. What are the key assumptions employed within the module databases? How do these assumptions translate into quantifiable uncertainty in module outputs?
• AQ3. How do assumptions/limitations in modeling and databases impact the applicability of the module for certain classes of problems? What are the implications for future development efforts?
• AQ4. How do uncertainties in module inputs propagate to uncertainties in module outputs? Further, what are the key inputs that contribute to variability in module outputs?
• AQ5. For assumptions, limitations and inputs where effects cannot be quantified, what are the expected influences (qualitatively) on module outputs
• AQ6. How do assessment results translate into guidelines for use?
11
Answering AQ4 Probabilistically
• AQ4. How do uncertainties in module inputs propagate to uncertainties in module outputs? Further, what are the key inputs that contribute to variability in module outputs?
• Assign factor distributions
• Propagate uncertainty
• Apportion output variance to model factors
12
Assigning Factor Distributions
• Example: APMT Aircraft Emissions Module
EINOx: ( ) ( )( )
17.36 .24 , .24 0
17.36 .24 , 0 .24EI
x for xP x
x for x
+ − ≤ ≤⎧⎪=⎨
− ≤ ≤⎪⎩
( ) ( )( )
82.64 .11 , .11 0
82.64 .11 , 0 .11T
x for xP x
x for x
+ − ≤ ≤⎧⎪=⎨
− ≤ ≤⎪⎩Temperature:
( ) ( )( )
1111.11 .03 , .03 0
1111.11 .03 , 0 .03P
x for xP x
x for x
+ − ≤ ≤⎧⎪=⎨
− ≤ ≤⎪⎩Pressure:
( ) ( )( )
34.60 .17 , .17 0
34.60 .17 , 0 .17H
x for xP x
x for x
+ − ≤ ≤⎧⎪=⎨
− ≤ ≤⎪⎩Rel. Humidity:
( ) 10, .05 .05FFP x for x= − ≤ ≤Fuel Flow:
.24-.24 0
4.17
-.11 0 .11
9.09
-.03 0 .03
33.33
-.17 0 .17
5.88
-.05 0 .05
10
13
Propagate Uncertainty
gT g(T)
Assume a model:
o.w.
• Would like to compute the mean value of g(T)
( )2m−Ο• Trapezoidal Rule Error goes as and 1/ 1sm N= −
[2][1]
14
Propagate Uncertainty: Monte Carlo Simulation
• Assume is a random variable with a probability density function fX(x).
• Monte Carlo mean estimate (for any number of factors)
• Error goes probabilistically as– Strong law of large numbers– Central limit theorem
( ) ( ) ( ) ( )1
1
s
N
nXnI
g X g x f x d x g xN =
Ε = ≈⎡ ⎤⎣ ⎦ ∑∫
X
( )1/ 2N −Ο
• Compute mean value of g(X) using random samples
• X is any factor (temperature, pressure, etc.)gX g(X)
Assume same model:
15
Partitioning Output Variance
Analysis of Variance (ANOVA)– Assumes a linear statistical model– Requires specification of factors and factor interactions total to
be comprehensive, where S is the number of factors
Vary-all-but-one methods– Calculate factor variance contributions by fixing a factor and performing a
Monte Carlo simulation. – Requires specification of where to fix a factor– N(S+1) model evaluations
Global Sensitivity Analysis– All factors varying– Computes total sensitivity indices for each factor in N(S+1) model
evaluations using Monte Carlo simulation– Variance contributions take into account underlying factor distributions
( )2 1S −
16
Global Sensitivity Analysis
• What is the goal?
• ANOVA Decomposition
• Partitioning output variance
• Monte Carlo Estimates
• Total Sensitivity Indices
17
The goal of Global Sensitivity Analysis
• Ranking model factors on the basis of contribution to output variability– Useful for model development– Useful for understanding model outputs
• The goal is to partition output variance amongst the factors of the model– Main effects– Interaction effects
Factor 1
Factor 2
Factor 1, Factor 2 Interaction
Output Variance
18
ANOVA Decomposition
• High-Dimensional Model Representation of f(x) [4]
• Example HDMR for a function of three parameters
19
ANOVA Decomposition
• High-Dimensional Model Representation of f(x) [4]
• Example HDMR for a function of three parameters
Mean value
20
ANOVA Decomposition
• High-Dimensional Model Representation of f(x) [4]
• Example HDMR for a function of three parameters
Mean value Main effects
21
ANOVA Decomposition
• High-Dimensional Model Representation of f(x) [4]
• Example HDMR for a function of three parameters
Mean value Main effects
First-order interactions
22
ANOVA Decomposition
• High-Dimensional Model Representation of f(x) [4]
• Example HDMR for a function of three parameters
Mean value Main effects
First-order interactions
Second-order interaction
23
Example of ANOVA Decomposition
• Consider a function
• Then the ANOVA decomposition is:
24
Calculating Variances
• Given a function f(x) that is square integrable,
• Similarly,
• Example for a function of two parameters:
25
The goal of global sensitivity analysis
• The goal is to partition output variance amongst the factors of the model– Main effects– Interaction effects
Factor 1
Factor 2
Factor 1, Factor 2 Interaction
Output Variance
D D1
D12
D2
26
Monte Carlo Estimates
• The expected value of f(x) may be estimated from:
27
Monte Carlo Estimates
• The expected value of f(x) may be estimated from:
• The variance of f(x) may then be estimated from:
28
Monte Carlo Estimates
• The expected value of f(x) may be estimated from:
• The variance of f(x) may then be estimated from:
• For the one indexed terms the variance may be estimated from:
29
Total Sensitivity Indices
• Sensitivity indices:
• Total Sensitivity indices:
30
APMT ResultsGlobal Sensitivity Analysis
0
0.2
0.4
0.6
0.8
HousingGrowth Rate
DiscountRate
Quiet Level ContourUncertainty
NDI
Total Global Sensitivity Index
Total Sensitivity Indices
0.69 0.670.67
0.360.300.28
0.110.05 0.010.02 0.02
0
0.2
0.4
0.6
0.8
1.0
1.2
IntegratedTemperature Change
Damage NPV
OthersRF* short-livedclimate sensitivityDamage coefficient
BVB-Noise Module Analysis
BVB-Climate Module Analysis
31
Future WorkUncertainty Model
Selection
Problem Statement
Define model Define methods
Compare methods
Test functions
Surrogate Reqs. Sampling Reqs.
Application
Large-scale model (APMT Module)
Probabilistic
Problem Statement
Define model Define methods
Compare methods
Test functions
Application
Large-scale model (APMT Module)
Fuzzy Random
Surrogate Reqs. Sampling Reqs.
Compare
32
Questions?
33
References
[1]http://www.krellinst.org/UCES/archive/modules/potential/trap/index.html
[2]http://www.statisticalengineering.com/curse_of_dimensionality.htm
[3] T. Homma, A. Saltelli, “Importance Measures in Global Sensitivity Analysis of Nonlinear Models.” Reliability Engineering and System Safety, 52(1996) pp1-17.
[4] I.M. Sobol’, “Global Sensitivity Indices for Nonlinear Mathematical Models and their Monte Carlo Estimates,” Mathematics and Computation in Simulation 55(2001) 271-280.
34
Aviation Environmental Portfolio Management Tool (APMT): Motivation
• Aviation benefits and environmental effects result from a complex system of interdependent technologies, operations, policies and market conditions
• Community responses, policy and R&D options typically considered in a limited context
– only noise, only local air quality, only climate change
– only partial economic effects
• Actions in one domain may produce unintended negative consequences in another
• Tools and processes do not support recommended practice
– NPV of benefits-costs is recommended basis for informing policy decisions in U.S., Canada and Europe
35
ANOVA Decomposition (Cont.)
• A high-dimensional model representation is unique if
where and
• The individual functions are orthogonal
• The expected value of each individual function is zero
e.g.
• The expected value of f(x) is then
1 1
1
,...,
0
( ,..., ) 0s si i i i kf x x dx =∫ 1,..., sk i i= 1,2,...,s n=
1 1 1 1( , , ) ( , , ) 0
s s t ti i i i j j j jf x x f x x dx =∫ K KK K
1 1 1( ) 0f x dx =∫
0( )f x dx f=∫
36
Improving Runtime: Surrogate Modeling
• A surrogate is a less expensive, (often) lower-fidelity model that represents the system input/output behavior
• Surrogates are often classified into three categories:1. Data fit, e.g. response surface models, Kriging models EDS2. Reduced-order models, derived using
system mathematical structure3. Hierarchical models,
e.g. coarser grid, neglected physics, aggregation AEDT
• Why surrogates?– If one analysis takes 1 minute, then a Monte Carlo simulation
with 10,000 samples takes 10,000 minutes = 7 days– A global sensitivity analysis requiring a Monte Carlo simulation
for each factor would take 7(S+1) days, where S is the number of factors
• Relating uncertainty analysis results done with a surrogate model to results from the full model is critical
37
Surrogate Modeling Approach for APMT
• Adaptive approach for Aircraft Emissions Module– Monte Carlo Simulation of a single day of flights– Build up from a small number of flights to representative set
• Statistical approach for Aircraft Performance Module– Determine critical factors using expert opinion
• Fuel burn per meter• Emissions per meter• Segment level thrust distribution• …
– Group aircraft/engine pairs based on statistical similarities– Select representative from each group
• Important questions to ask– How do these surrogates impact Monte Carlo results?– How do these surrogates impact global sensitivity results?
38
Improving Runtime: Sampling Methods
• Some well-known methods:– Brute-force Monte Carlo Methods
• Use pseudorandom numbers on [0,1] and the inversion method• The probabilistic error bound is .
– Quasi-Monte Carlo Methods• Use well-chosen deterministic points rather than random samples• Low discrepancy sequences (e.g. Halton, Hammersley)• Deterministic error bound
– Stratification -- Latin Hypercube Sampling• Stratify on all factor dimensions simultaneously• Leads to lower variance in integral estimates than independent,
identically distributed samples• Probabilistic error bound .
• Things to keep in mind– How bias in sampling impacts analysis metrics– Obtaining more samples if a metric has not converged sufficiently
( )1/ 2N −Ο
((log ) / )sN NΟ
( )1/ 2N −Ο