[]_Geostatistics_in_12_Lessons.doc
-
Upload
hektor-ektropos -
Category
Documents
-
view
215 -
download
0
Transcript of []_Geostatistics_in_12_Lessons.doc
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
1/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
2/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
3/201
Preface to Geostatistics in 12 Lessons
Introduction
This web page is a set of companion notes to accompany the twelve lectures presented in the summer
of 1777 8or #andmark raphics in !ustin Teas The lectures are intended to be an informal training
seminar for those employees involved in the development, documentation, and testing of software that
implement geostatistics
Key Tasks
There are some key tasks that will be accomplished by the end of the summer These include"
o un informal training seminars with members of the testing, documentation, and
development groups
o 9evelop a web-based training resource for testing, documentation, and development
groups
o 9evelop a glossary of geostatistical terms and key concepts
o :ork with the testing manager to develop a test plan for the testing of geostatistical
components
o 9evelop workflow specifications for shared earth modeling
o 9ocument procedure and assumptions * underlying techni$ues in the eostat 9;
0raining #e.inars
The training seminars will be presented in 12 lectures #ectures will be presented on Tuesday, and
Thursday at 1" am until noon The lectures will be presented in varying depth 5n depth seminars
will be held on Tuesday and Thursday, and light seminars will be held on :ednesday The :ednesday
seminars will be a high-level overview of the Tuesday and Thursday seminars sers of the web site will have the
opportunity to submit their $ui66es by email for marking or allow for self-marking
Glossar! of 0er.s
! glossary of terms will be provided to assist the user :ords that appear in the training resource that
also appear in the glossary will be hot linked
0esting
8amiliarity with geostatistical concepts is essential in any testing strategy 8amiliarity will come as a
result of the seminars and the training resource 'ommon mistakes in the construction of a
3
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
4/201
geostatistical model and items that indicate problems will be outlined 5n depth pages will tackle theory
and provide tools for proving algorithms by hand
orflo Diagra.sThe training resource will provide a diagrammatic illustration of a workflow for
reservoir modeling The training resource will be hot linked to the workflow model =ot linking the
workflow diagram will allow the user to go to specific lectures instead of having to browse through the
entire training resource
Docu.entation of ssu.ptions / Procedures
The construction of a geostatistical model re$uires sometimes confusing procedures and assumptions
The training resource and the seminars will clearly state and eplain all assumptions and procedures
5n discussion with
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
5/201
o eservoir planning
o
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
6/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
7/201
4arts of this website are patterned after the book Geostatistical Reservoir Modeling, a currently
unpublished book authored by 9r 'layton 9eutsch 9r 9eutsch is aware and has granted permission
for the use of material
0
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
8/201
Lecture 1: Purpose / Motivation for Geostatistics
Bualitative easoning
eservoir 4lanning
ncertainty 4ortfolio &anagement
The #ife 'ycle of a 4roect
#ecture 1 Bui6
Introduction
5n view of #andmarkEs core business, ie reservoir characteri6ation, geostatistics can be defined as a
collection of tools for $uantifying geological information leading to the construction of 39 numerical
geological models to be used for assessment and prediction of reservoir performance
eostatistics deals with spatially distributed and spatially correlated phenomena eostatistics allows
$uantification of spatial correlation and uses this to infer geological $uantities from reservoir data at
locations where there are no well data (through interpolation and etrapolation) 5n addition, the main
benefits from geostatistics are" (1) modeling of reservoir heterogeneity, (2) integrating different types
of data of perhaps different support and different degrees of reliability, and (3) assessing and
$uantifying uncertainty in the reservoir model
This course was not developed as a cookbook recipe for geostatistics
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
9/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
10/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
11/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
12/201
$ualified decision making ! distribution of uncertainty is generated, and using a loss function the risk
is assessed and an optimal estimate is determined, the estimate that incurs the least loss 9ifferent loss
functions can be used for pessimistic and optimistic estimates
8igure 13, !n illustration showing the concept of risk $ualified decision making Dote that the loss function is scenariospecific, and that the histogram of possible costs are in addition to those costs if the estimate were correct
ortfolio Management
4ortfolio management re$uires that the best possible decisions be made in the face of uncertainty
ome of these decisions include"
67ploration License Bidding:using limited seismic and well data, decide which 6ones to
focus on and * or commit resources to
67ploration Drilling:given a few wells (1-3), decide whether or not the field warrants further
investigation
Drilling Ca.paign: decide how many wells to drill, placement of wells, and timing of
enhanced oil recovery tactics
Develop.ent Planning:decide how large support facilities must be, negotiate pipeline or salesagreements and contractor commitments
Mature field:decide on the value of infill drilling or the implementation of enhanced oil
recovery schemes such as flooding and steam inection
*andon.ent / #ale:timing of environmentally and economically sound closing of facilities
These decisions are being made with less data and greater uncertainty for proects that are marginally
profitable ound estimates backed by rigorous mathematical methods secures investors and fosters
good economic relations
12
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
13/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
14/201
Lecture 1: Purpose / Motivation for Geostatistics, The Quiz
Question )
#ist three maor benefits that geostatistics offers, and describe what each mean and eplain why they
are important
Question *
9ecision making in presence of uncertainty is important :hat are the two steps for risk-$ualified
decision makingK
Question +
5n general terms, eplain the link between spatial variability (heterogeneity) and uncertainty
Question
5n your own words describe the information effect and how it relates to uncertainty
Question -
eostatistics is useful at every point in the life cycle of a reservoir, but where is it most useful and
whyK
solutions
Lecture 1: Purpose / Motivation for Geostatistics, The Quiz
Question )
#ist three maor benefits that geostatistics offers, and describe what each mean and eplain why they
are important
Buantification of uncertainty" summari6es our lack of knowledge for better decision making
igorous mathematics" means that there are sound mathematical laws applied for repeatability
9ata 5ntegration" data of many types can be integrated using geostatistical tools
Question *
9ecision making in presence of uncertainty is important :hat are the two steps for risk-$ualified
decision makingK
Buantification of uncertainty and then $uantification of risk
Question +
5n general terms, eplain the link between spatial variability (heterogeneity) and uncertainty
1+
http://longhorn.zycor.lgc.com/geostats/lec1/quiz1sol.htmhttp://longhorn.zycor.lgc.com/geostats/lec1/quiz1sol.htm -
7/26/2019 []_Geostatistics_in_12_Lessons.doc
15/201
!s spatial variability increases heterogeneity increases and hence uncertainty increases
Question
5n your own words describe the information effect and how it relates to uncertainty
The information effect is the result of increased available information which leads to less uncertainty
Question -
eostatistics is useful at every point in the life cycle of a reservoir, but where is it most useful and
whyK
eostatistics is most important in the early stages of the life cycle because it makes intelligent use of
limited data and allows for decision making that is tempered with a knowledge and understanding of
the uncertainty inherent in the numerical-geological model
Auly /, 1777
1
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
16/201
Lecture 2: Basic Concepts
tatistical Tools
=istograms
4robability 9istribution
'ategorical Cariables
'omparing =istograms 9ata Transformation
&onte 'arlo imulation
%ootstrap
eostatistical, and .ther ;ey 'oncepts
Dumerical 8acies &odeling
'ell %ased &odeling
.bect %ased &odeling
#ecture 2
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
17/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
18/201
and
(23)
Median
The midpoint of the ranked (ie sorted from smallest to largest) data 5f there were 2 data, the median
would be the 13th value 5t also represents the th percentile in a cumulative histogram
Mode
The mode is the most commonly occurring data value in the data set
8ariance
The variance is a measure of spread 5t can be thought of as the average s$uared-distance of the data
from the mean 5t can be found using the e$uation below"
(2+)
#tandard Deviation
The standard deviation is the s$uare root of the variance 5t is sometimes the preferred measure of
spread because it has the same units as the mean whereas the variance has s$uared units
(2)
Coefficient of #eness
The coefficient of skewness is the average cubed difference between the data values and the mean 5f a
distribution has many small values and a long tail of high values then the skewness is positive, and the
distribution is said to be positively skewed 'onversely, if the distribution has a long tail of small
values and many large values then it is negatively skewed 5f the skewness is 6ero then the distribution
is symmetric 8or most purposes we will only be concerned with the sign of the coefficient and not its
value
(2/)
Coefficient of 8ariation
The coefficient of variation is the ratio of the variance and the mean :hile the standard deviation and
the variance are measures of absolute variation from the mean the coefficient of variation is a relativemeasure of variation and gives the standard deviation as a percentage of the mean 5t is much more
fre$uently used than the coefficient of skewness ! coefficient of variation ('C) greater than 1 often
indicates the presence of some high erratic values (outliers)
1?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
19/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
20/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
21/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
22/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
23/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
24/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
25/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
26/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
27/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
28/201
8ig 27 &onte 'arlo simulation consists of drawing a normally distributed number and recording the appropriate valuefrom the cdf
&onte-'arlo simulation is the foundation of all stochastic simulation techni$ues &uch care should be
taken to ensure that the parent cdf is a representative distribution, as any biases will be translated intothe results during the transformation
Bootstra#
The bootstrap is a method of statistical resampling that allows uncertainty in the data to be assessed
from the the data themselves The procedure is as follows"
1 draw nvalues from the original data set with replacement
2 calculate the re$uired statistic The re$uired statistic could be any of the common summary
statistics 8or eample we could calculate the uncertainty in the mean from the first set of n
values3 repeat 1times to build up a distribution of uncertainty about the statistic of interest 8or the
eample above we would find the mean of the n values 1 times yielding a distribution of
uncertainty about the mean
2?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
29/201
8ig 21 The bootstrap is used to determine the uncertainty in the data itself This diagram shows how the uncertainty in
the mean is found 8irst randomly draw nvalues from the data set and calculate the mean epeat this many times, and thedistribution of the mean $uantifies the uncertainty about the mean
Geostatistical, and 2t$er Key Conce#ts
Petrop-!sical Properties
There are three principle petrophysical properties discussed in this course" (1) lithofacies type, (2)
porosity, and (3) permeability =ard datameasurements are the lithofacies assignments porosity and
permeability measurements taken from core (perhaps log) !ll other data types including well logs and
seismic data are called soft dataand must be calibrated to the hard data H9eutsch, 177?I
Modeling #cale
5t is not possible nor optimal to model the reservoir at the scale of the hard core data The core data
must be scaled to some intermediate resolution (typical geological modeling cell si6e" 1 ft N 1 ft N
3 ft) &odels are built to the intermediate scale and then possibly further scaled to coarser resolutions
for flow simulation
27
http://longhorn.zycor.lgc.com/geostats/glossary.html#hard_datahttp://longhorn.zycor.lgc.com/geostats/glossary.html#soft_datahttp://longhorn.zycor.lgc.com/geostats/glossary.html#hard_datahttp://longhorn.zycor.lgc.com/geostats/glossary.html#soft_data -
7/26/2019 []_Geostatistics_in_12_Lessons.doc
30/201
>ni?ueness; #.oot-ing; and @eterogeneit!
'onventional mapping algorithms were devised to create smooth maps that reveal large scale geologic
trendsG for fluid flow problems, however the etreme high and low values have been diluted and will
often have a large impact on the flow response (eg time to breakthrough would be systematically
under-estimated during a water flood) These algorithms remove the inherent variability of the
reservoirG they remove the heterogeneity within the reservoir 8urthermore, they only provide one
uni$ue representation of the reservoir
nalogue Data
There are rarely enough data to provide reliable statistics, especially hori6ontal measures of continuity
8or this reason analogue data from outcrops and similar more densely drilled reservoirs are used to
help infer spatial statistics that are impossible to calculate from the available subsurface reservoir data
D!na.ic Reservoir C-anges
eostatistical models provide static descriptions of the petrophysical properties Time dependent
processes such as changes in pressure and fluid saturation are best modeled with flow simulatorsbecause they take into account physical laws such as conservation of mass and so on
Data 0!pes
The following list represents the most common types of data used in the modeling of a reservoir"
'ore data ( andby lithofacies)
:ell log data (stratigraphic surfaces, faults, measurements of petrophysical
properties)
eismic derived structural data (surface grids * faults, velocities)
:ell test and production data (interpreted , thickness, channel widths,
connected flow paths, barriers) e$uence stratigraphic interpretations * layering (definition the continuity and
the trends within each layer of the reservoir)
3
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
31/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
32/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
33/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
34/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
35/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
36/201
Lecture !: Geo"oica" Princip"es for $eservoir Mo%e"in
eservoir Types and
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
37/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
38/201
sheets
%9rrentl ver im.ortant
approaches
Car*onate Reservoirs
%y definition carbonate (limestone) rocks are those have greater than O carbonate material Thecarbonate material is either derived from organisms that secrete carbonate as skeletal material or as
fecal matter, or precipitated out of solution #imestone is chemically unstable and is easily converted
to dolomite when hydrothermal fluids rich in magnesium pass through it #imestone is also easily
metamorphised into other rock types such as marble &ost (7O) of carbonate reservoirs can be
modeled using cell based indicator simulation to model limestone * dolomite conversion
9olomiti6ation often has a directional trend 8luid flow in rock is almost always directional and the
magnesium re$uired for dolomit6ation is carried by hydrothermal fluids The fluid flows through the
rock and magnesium replaces calcium creating dolomite The trends can be seen with seismic
(dolomiti6ed limestone has different acoustic properties than limestone) %ecause there are at least two
rock types (limestone and dolostone) we must use estimation methods that make use of multiple
variables Trends such as the conversion of limestone to dolostone may also show up in the geologic
contour maps from the wells
5n other cases, reefs may sometimes be modeled as obects, and there may be areal trends associated
with these as well (! change in sea level may cause a reef to die out and another to form further in or
out)
Table 32 Table for 'arbonate reservoirs
Reservoir0!pe
C-aracteristic #-apes 67a.ples / .portance Modeling0ec-ni?ue
'arbonate
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
39/201
37
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
40/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
41/201
8igure 3+, ! location map of a sample data set
The assessment of the sampling scheme was correct, there is a northerly bias in the sampling scheme
5t is useful to draw a contour map of the data ! contour map helps gain some insight to the nature of
the data, and can sometimes reveals important trends The map below shows that most of the sampling
occurred in areas of high potential The map in 8igure 3 does not reveal any trends but illustrates thevalue of a contour map
8igure 3, ! contour map using the sample data set The accuracy of the map is not critical 5ts purpose is to simplyillustrate trends
The contour map illustrates that any areas of high potential (red areas) are heavily sampledG a biased
sampling procedure The contour map also illustrates that we may want to etend the map in the east
direction
5t is common practice to use the 'artesian coordinate system and corner-point grids for geological
modeling The corner-point grid system is illustrated in 8igure 3/
+1
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
42/201
8ig 3/ The standard grid system used for geological modeling
Dotice that theZdimension $in 8igure 3/ is not the same as the dimension ain the areal grid, but the
XY dimension for both the areal and vertical grids are the same 8or the sake of computational
efficiency the stacked areal grids are aligned with Zais, but for fleibility theZais need not be of
the same dimensions as the areal grid This techni$ue proves valuable for"
1 modeling the hydrocarbon bearing formation as a stack of stratigraphic layers" 5t is intuitively
obvious that a model should be built layer by layer with each layer derived from a homogenous
depositional environment !lthough each depositional environment occurred over a large span
of time in our contet the depositional environment actually occurred for only a brief period of
geological time and for our purposes can be classified as a homogenous depositional
environment
2 volume calculations" The model must conform to the stratigraphic thickness as closely as
possible &odeling the formation as a @sugar cube@ model leads to poor estimates
3 flow calculations" 8low nets must have e$uipotential across facies ! @sugar cube@ modelwould yield erroneous results
This permits modeling the geology in stratigraphic layers The stratigraphic layers are modeled as 2-9
surface maps with a thickness and are then stacked for the final model Thus having a non regular grid
in theZdirection allows for conformity to thickness permitting accurate volume calculations, also
allows for flow nets (must be e$uipotential across any face)
eological events are rarely oriented with longitude and latitude There is usually some a6imuth, dip,
or plunge to the formation 5f the angle between the formation and the coordinate ais is large there
will be error a'in the cell dimensions as indicated by 8igure 30 !lso, it is confusing to have to deal
with the angles associated with a6imuth, dip, and plunge, so we remove them and model in some moreeasily understood coordinate system
+2
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
43/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
44/201
8igure 3? 5llustrates the process of rotating the coordinate ais to be aligned with the maor ais of the reservoir 8irst theais are rotated about the 6 ais to accommodate the the a6imuth of there reservoir, second ais are rotated about the y aisto accommodate dip in the reservoir
The rotations can be removed using the the transforms indicated in &atri 33 and 3+
(33)
(3+)
ometimes fluvial channels can be difficult to model because they deviate significantly 5n this case it
is possible to straighten the channel using the transform in
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
45/201
8igure 3?, Transforming a twisty channel into a straight channel
"tratigra#$ic Coordinates
eservoirs often consist of stratigraphic layers separated by a surfaces that correspond to somese$uence of geologic time events, much like growth rings in a tree The bounding surfaces that
differentiate the strata are the result of periods of deposition or periods of deposition followed by
erosion The surfaces are named according to these geologic events"
Proportional:The strata conform to the eisting top and base The strata may vary in thickness
due to differential compaction, lateral earth pressures, different sedimentation rates, but there is
no significant onlap or erosion (9eutsch, 1777)
0runcation" The strata conform to an eisting base but have been eroded on top The
stratigraphic elevation in this case is the distance up from the base of the layer
)nlap" The strata conform the eisting top (no erosion) but have @filled the eistingtopography so that a base correlation grid is re$uired
Co.*ination" The strata neither conform to either the eisting top or bottom surfaces Two
additional grids are re$uired
+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
46/201
8igure 37 5llustrates proportional, truncation, onlap, and combination type correlation surfaces
The stratigraphic layers must me be moved so that they conform to a regular grid This is done by
transforming the 6 coordinate to a relative elevation using"
(3)
8igure 31 shows how the strata is moved to a regular grid Dote that features remain intact, ust theelevation has been altered to a relative elevation
+/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
47/201
8igure 31 5llustrates the result of transferring the 6 coordinate to a regular grid using formula 37
5t is important to note that these are coordinate transformsG the data is transformed to a modeling spaceand transformed back to reality There are no property or distance changes here, ust the movement
from reality to some virtual space then back to reality
Cell #iAe
The cell si6e used in the model is a serious issue 5f the cell si6e is to small an enormous number of
cells will be re$uired to populate the model Too many cells holds the conse$uence of having a model
that is too difficult to manipulate and very taing on the '4> 5f the cells are too large then important
geological features will be removed from the model !s processing power increases model si6e is of
lesser and lesser importance, but with todayEs computers models that range from 1 million cells to million cells are appropriate
&or' ("o)
The specific process employed for 3-9 model building will depend on the data available, the time
available, the type of reservoir, and the skills of the people available 5n general, the following maor
steps are re$uired"
1 9etermine the areal and vertical etent of the model and the geological modeling cell si6e
2
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
48/201
f) enerate 3-9 permeability models
g) &erge and translate back to real coordinates
+ Cerify the model
'ombine 6ones into a single model
8igure 311 illustrates the modeling concepts discussed in this lecture"
8ig 311 ! flow chart showing the geostatistical work flow for #ecture 3
Auly 7, 1777
+?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
49/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
50/201
Question *
:hat does a distribution look like when the coefficient of skewness isG positiveK, negativeK, 6eroK 'an
you provide sets petrophysical properties that would present" a positive coefficient of skewnessK a
negative coefficient of skewnessK a low coefficient of skewnessK
, a) permeability, b) watersaturation, c) porosity
Question +
9efine the coefficient of variation :hat does it mean when the coefficient of variation is greater than
oneK
there are a lot of outliers
Question
4rovide 2 geophysical properties that are positively correlated, negatively correlated, and not
correlated
+ve: porosity and permeability
-ve: impedance and porosity
not: permeability and the weather
Question -
=ow would you interpret a $uantile-$uantile plot whose plotted points deviated from the reference
lineG in a parallel fashionK at an angle from the +th$uantileK
a parallel deviation indicates a dierence in the mean,and a change in slope indicates a dierence in
variance
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
51/201
Auly /, 1777
1
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
52/201
Lecture : -ata .na"*sis
9ata !nalysis
5nference
9eclustering
Trends
econciliation of 9ata oft 9ata 9eclustering
:ork 8low
#ecture + Bui6
Data 5nalysis
9ata analysis is the gathering, display, and summary of the data 9ata analysis is an important step for
building reliable numerical models 5mportant features of the data are reali6ed, erroneous data, and
outliers are revealed The issues addressed by a 9ata !nalysis are"
>nderstanding and cleaning the data
lobal resources assessment" first order * back-of-the-envelop calculations of volumetrics,
facies*6one summary statistics, used to estimate and confirm magnitude of epected results
5dentification of geological populations
5dentification of geological trends
econciliation of different data types" eg transform log-derived porosity measurements to
match the core-derived data
5nference of representative statistics * distributions
'alibration of soft data
Inference
5n recent years the growth of eostatistics has made itself felt more in the petroleum industry than any
other, and an important feature of this growth is the shift in philosophy from deterministic response to
stochastic inference tochastic inference concerns generali6ations based on sample data, and beyond
sample data The inference process aims at estimating the parameters of the random function model
from sample information available over the study area The use of sample statistics as estimates of the
population parameters re$uires that the samples be volume * areally representative of the underlying
population ampling schemes can be devised to ensure statistical representativity, but they are rarely
used in reality 5t is up to the geoscientist to repair the effect of biased sampling, integrate data of
different types, cope with trends, etc, and in general ensure that the data truly is representative of the
population
)utliers and 6rroneous Data
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
53/201
earch the location map for the outliers 9o they appear to be all in the same locationK 9o they
appear to be inappropriateK
how a cross plot of the local averages versus the data (every point is mapped versus the
average of the surrounding data)
8igure +a, a mean that differs from the mode significantly, a maimum that is significantly higher than the mean, or evena posting that sticks out indicates the presence of outliers
8igure +1 shows some of the things that indicate an outlier ! mean that deviates significantly from
the median, a maimum that deviates significantly from the mean or the median, or a single posting
way out in the middle of nowhere There are three possible solutions for coping with outliers (1) we
can decide to leave them as they are, (2) we can remove them from the data set, (3) or we alter the
value to something more appropriate to the surrounding data
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
54/201
8igure +b, hould two distributions be separatedK 5t depends on the study
9ecisions made in the geostatistical study must be backed up by sound practice and good udgement
The strategies indicated only serve as an aid in ustifying your decisions you must also document your
decisionsG you know when you may need to ustify them
Declustering
nfortunately this
sampling practice leads to location biased sampling 8igure +1, the location map of the sample data
illustrates location biased sampling The areas of low potential are not as well represented as the areas
of high potential
+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
55/201
8igure +1, Dote that all areas are not sampled unbiasedly ome are areas are heavily sampled and others are poorlysampled
9eclustering corrects the distribution for the effect of location-biased sampling 9eclustering assigns a
weight to each data and calculates the summary statistics using each weighted data
8igure +2, The single data in !rea 1 informs a much larger area than the data in !rea 2
8igure +2 illustrates location-biased sampling The single data in !rea 1 informs a larger area than the
data of !rea 2 5ntuitively one would weight each of the data in !rea 2 by one fifth and the data in
!rea 1 by one 'alculating the weights this way is effective but a more efficient way is to overlay a
grid and weight each data relative to the number of data in the cell area defined by the grid using
8unction +1 below"
(+1)
where /i(c)is the weight, niis the number of data appearing in the cell, and 1+is total number of cells
with data 8igure +3 shows how geometric declustering, or cell declustering works
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
56/201
8igure +3 The declustering weights for three different cells has been calculated, all other declustering weights are found inthe same way
The declustering algorithm can be summari6ed by"
1 choose an initial declustering cell si6e that includes about one datum per cell and calculate the
mean The goal to finding the optimal declustering cell si6e is to find the lowest mean for datasets where the high potential areas have been over sampled or the largest mean for data sets
where the low potential areas have been over sampled, and there are two parameters that can
effect the declustered mean (1) the declustering cell si6e and the location of the grid origin To
ensure that the optimal declustering cell si6e is chosen several cell si6es and grid origin
locations should be used to calculate the mean The cell si6e that yields the lowest * largest
mean is chosen as the optimal declustering cell si6e 8igure ++ shows the declustered mean
and declustering cell si6e using 2+ different cell si6es between 1 and 2 units and grid offsets
The posted declustered mean is the average of the mean calculated from grid offsets
2 8igure ++, The scatter plot shows the declusteredmean for a variety of grid si6es The minimum declustered mean is the lowest mean The declustering cell si6e is a cell yielding a declustered mean of 2+2
3 >sing the optimal declustering cell si6e decluster the data ecall that step 1 only determines
the optimal declustering cell si6e, it does not decluster the data 8igure + shows histograms
before and after declustering Dotice that the low values (the poorly represented values) are
now more represented and the high value (the overly represented values) are less represented in
the declustered histogram !lso notice that the declustered mean and variance are lower than
the clustered statistics 8igure +/ is a map showing the magnitude of weight applied to each
data Dote the clustered data are under weighted, the sparse data are over weighted, and the
well spaced data are not weighted
/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
57/201
+ 8igure+, =istograms of the data before and after declustering 4oints to note are the clustered mean is larger than the
declustered mean, which is the epected result, and the clustered variance is larger than the declustered variance,also an epected result
8igure +/, ! map of the magnitude of weight Dotice that sparsely sampled areas are up weighted, clustered areasare down weighted, and well spaced are not weighted
#oft Data Declustering
eometric declustering is the preferred method of declustering, but what should be done when there
are too few data to decluster, or when there is systematic preferential drillingK .ne method is to use a
soft data set to develop a distribution for the hard data in locations where there is no hard data, only
soft data 9ata is weighted by the value of the secondary variable at its location oft data declustering
re$uires"
ridded secondary variable, that is defined over the area of study
'alibration cross-plot
&athematical manipulation
0
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
58/201
'onsider a reservoir that has a porosity known to be negatively correlated with depth, such as that
indicated in 8igure +0 The wells were preferentially drilled along the crest of the structure, and a
model of the reservoir is to be built 'learly the well data will not represent the reservoir unbiasedly,
but seismic data has been collected over the entire area of interest
8igure +0, ! location map showing the locations of well and seismic data The well data is not representative of the entirereservoir and there is very little data to decluster, thus soft data declustering is used
The protocol for soft data declustering is"
&ap the secondary variable over the area of interest
9evelop a bivariate relation between the primary and secondary data, such as that in 8igure
+?a The calibration is critical to this operation and great care must be eercised in developing
it
construct a representative distribution by accumulating all of the conditional distributions
8igure +?b illustrates the concept of adding distributions to create a single distribution
8igure +?a, ! scatter plot of the hard porosity data versus the seismic depth data The inferred correlation between the hardand soft data is shown The relation is inferred from analogue data and the data themselves
?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
59/201
Trends
Cirtually all natural phenomena ehibit trends ravity worksG vertical profiles of permeability and
porosity fine upward within each successive strata (9eutsch, 1777) ince almost all natural
phenomena ehibit a trend it is not always appropriate to model using a stationary C educing the
si6e of the study area to a si6e where the assumption of stationarity is appropriate or reducing the
assumption of stationarity to the search radius are two methods for coping with trends >niversal
kriging, an adaptation of ordinary kriging system produces good local estimates in the presence of
trend >niversal kriging can also be used to calculate a trend automatically, but its use should be
tempered with good udgement and sound reasoning instead of ust accepting the result The best
method for coping with a trend is to determine the trend (as a deterministic process) subtract it fromthe observed local values and estimate the residuals and add the trend back in for the final estimate
(&ohan, 17?7) .ften it is possible to infer areal or vertical trends in the distribution of rock types
and*or petrophysical properties, and inect this deterministic information into the model
8igure +?a, Trend mapping is important for at least two reasons" (1) it is wise to inect all deterministic features and atrend is a deterministic feature, and estimation re$uires stationarity and a trend implies that there is no stationary mean
Thus, remove the trend and estimate the residual
7
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
60/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
61/201
&or' ("o)
Auly 1, 1777
/1
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
62/201
Lecture : -ata .na"*sis, The Quiz
Question )
%riefly eplain the principle of declusteringK :hy is declustering important in geostatistical oil
reservoir characteri6ationK
Question *
hould all outliers be automatically be removedK :hy or why notK
Question +
:hat tools you use to split data sets into different faciesK =ow would you proceedK
Question
:hy bother using soft data to decluster hard dataK
Question -
:hen we speak of inference in the contet of data analysis what are we striving forK 5s it importantK
:hyK
/2
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
63/201
Lecture : -ata .na"*sis, The Quiz +o"utions
Question )
%riefly eplain the principle of declusteringK :hy is declustering important in geostatistical oil
reservoir characteri6ationK
:eighting the data so that each is volume representative of the volume of interest 5t is important
because we re$uire that the data used is as representative of the reservoir as possible arbage in
garbage out
Question *
hould all outliers be automatically be removedK :hy or why notK
Do, not all outliers are bad Fou must refer to the location of the data 5s it surrounded by other high
ranking dataK 5s it sitting out in the middle of nowhereK 5s it surrounded by data that would imply thatthere should be a low * high value instead of what is thereK
Question +
:hat tools you use to split data sets into different faciesK =ow would you proceedK
=istograms, scatterplots, $uantile-$uantile plots, location (base maps), contour maps tart with a
location map and a contour map eparate those data that appear to show trends or directional
continuity look at the histogram, do the trends and the peaks (indicating different distributions)
correspond to those locations on the mapsK once separated, look at the scatterplots !re the data setswell correlatedK
Question
:hy bother using soft data to decluster hard dataK
:e know that the data is not representative and it must be declustered 5f we ignore the problem then
we are building models that are not accurate, and there is no way to tell how inaccurate oft data
declustering may seem crude but it is much better than ignoring the problem
Question -
:hen we speak of inference in the contet of data analysis what are we striving forK 5s it importantK
:hyK
:e are striving for summary statistics that are as representative as possible 5t is very important The
entire estimation * simulation process relies on an accurate representation of the reservoir
/3
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
64/201
Lecture : +patia" -ata .na"*sis
1 Cariograms
2 Cariogram 5nterpretation and &odeling
3 #ecture Bui6
+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
65/201
8igure 2 Two variograms and the corresponding maps The variogram on the right shows no spatial correlation and theresulting map is random The variogram on the right is very continuous showing etensive spatial correlation and therelevant map shows good spatial correlation
8igure 2 shows a map that was made using the variogram 5&2a The variogram indicates that the data
have no correlation at any distance, and hence image a is a random map 5mage 2$was made using
variogram 2$ Cariogram 2$indicates that the data are well correlated at long distances and image
2$shows some correlation at long distance 8igure 3 shows a close up the two images in 8igure2 Dotice that in figure 3$the colors gradually change from blue to green to orange then red, and
that this is not the case in figure 3$ 5n 8igure 3athe colors change randomly with no correlation
from one piel to the net ! piel in image 3ais not well correlated to a neighboring piel, whereas
in image 3$neighboring piels are well correlated
8igure 3 ! close up of an area on each map shows that map a using the variogram having no spatial correlation israndom whereas the map on the right which used a variogram that is continuos and thus the map shows spatial correlation
'orrelation is the characteristic of having linear interdependence between random variables or between
sets of numbers %etween what variables is the variogram measuring correlationK 5n the variograms
presented so far, correlation is being measured between the same variable, but separated by a distance
approimately e$ual toh 8igure + shows conceptually how an eperimental variogram is calculated
The lag distance or distance h is decided upon by the practitioner The two variables are (1) the data at
the head of the vector, and (2) the data at the tail of the vector The data tail of the vector (the circled
end in figure +) is calledz(u)(the random variable at location u)and the data at the head of the vector
is called z(u+h)(the random variable at location u+h) tarting with the smallest lag distance the
algorithm visits each data and determines if there are any data approimately one lag away 5f there
are, the algorithm computes variogram value for one lag !fter each data has been visited, the
/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
66/201
algorithm doubles the lag distance and repeats the calculation 5n this way the eperimental variogram
$uantifies the spatial correlation of the data
8igure + The variogram is not calculated from one single point over varying distances -, rather it moves from point to
point and calculates the variogram for each distance -at each data location
Com#onents of t$e 0ariogram
There are a few parameters that define some important properties of the variogram"
1 ill" the sill is e$ual to the variance of the data (if the data are normal score the
sill will be one)2 ange" the range is the distance at which the variogram reaches the sill
3 Dugget
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
67/201
8igure a The components of the variogramG the sill is the variance of the variable under study, the range is the distanceat which the variogram plateaus, the nugget effect is the short scale variability
The nugget effect is a measure of short scale variability, any error in the measurement value or the
location assigned to the measurement contributes to the nugget effect The range shows the etent of
correlation, and the sill indicates the maimum variability, or the variance of the data 8igure b
shows what happens when we change two parameters the nugget effect and the range ecall that the
sill is a fied value 5t is the variance of the data 5mages a $ and c in 8igure b shows the effect of
different ranges ! variogram with no range is shown in image a image $has an intermediate range,
and image chas a long range 5mages d eandfshow the effect of increasing nugget effect 5mage d
shows the effect of no nugget effect, or no short scale variability, image e shows an intermediate
amount of nugget effect, and imagefshows pure nugget effect, or complete short scale variability
/0
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
68/201
/?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
69/201
8igure b ome variogram eamples to show the effect of different parameters
Qualitative "#atial Data 5nalysis
5n probabilistic notation the variogram is written"
(1)
:hich says the variogram is the epected value of the s$uared difference of Z(u) andZ(u#h)& The
semivariogram is defined"
(2)
To be precise the semivariogram is one half the variogram 5n this lecture we will assume that the
variogram and the semivariogram are synonymous
%efore the variogram is calculated some data preparation must be performed 9ata must be free from
outliers and systematic trends, as well, since geostatistical simulation re$uires normally distributed
data, the data must be transformed into normal space
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
70/201
8igure / !n illustration of the the lag, lag tolerance, a6imuth, a6imuth tolerance and bandwidth parameters for variogrammodeling
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
71/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
72/201
8igure 7a
There is some discussion as to whether or not the sill is or is not e$ual to the variance The concern is
that the variance is susceptible to outliers, this is why it is important weed out outliers before the
variogram analysis another concern is the use of declustering weights using declustering weights
reduces the variance so which variance do we useK for now we will use the apriori variance the final
issue concerns the dispersion variance 5 will leave the issue of dispersion variance for the reader to
investigate
5nisotro#y
5f a petrophysical property has a range of correlation that is dependent on direction then the
petrophysical property is said to ehibit geometric anisotropy if the petrophysical property reaches the
sill in one direction and not in another it is said to ehibit 6onal anisotropy 5n other words, a
variogram ehibits 6onal anisotropy when the variogram does not reach the epected sill&ost
reservoir data ehibit both geometric and 6onal anisotropy 8igure 7 first geometric anisotropy,
second 6onal anisotropy, and lastly both forms of anisotropy
figure 7b
Sonal anisotropy can be the result of two different reservoir features" (1) layering, the hori6ontal
variogram does not reach the epected sill because there are layer like trends that eist and variogram
is not reaching full variabilityG and (2) areal trends, the vertical variogram does not reach the epected
sill due to a significant difference in the average value in each well
Cyclicity
eological phenomenon often formed in repeating cycles, that is similar depositional environments
occurring over and over ! variogram will show this feature as cyclicity as the variogram measures the
02
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
73/201
spatial correlation it will pass through regions that bear positive then negative correlation while still
trending to no correlation ! cyclic variogram can be seen in 8igure 1
8igure 1 ray scale image of an
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
74/201
(arge "cale Trends
virtually all geological processes impart a trend in the petrophysical property distribution
9olomiti6ation is the result of hydrothermal fluid flow, upward fining of clastics, and so on, are largescale trends 8igure 11 shows how large scale trends affect the histogram Trending causes the
variogram to climb up and beyond the sill of the variogram
0+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
75/201
0ariogram modeling
!ll directional variograms must be considered simultaneously to understand the 39 spatial correlation
1 'ompute and plot eperimental variograms in what are believed to be the principal directions
of continuity based on a-priori geological knowledge
2 4lace a hori6ontal line representing the theoretical sill >se the value of the eperimental(stationary) variance for continuous variables (1 if the data has been transformed to normal
score) and p(l Pp) for categorical variables where p is the global proportion of the category of
interest 5n general, variograms are systematically fit to the theoretical sill and the whole
variance below the sill must be eplained in the following steps
3 5f the eperimental variogram clearly rises above the theoretical sill, then it is very likely that
there eists a trend in the data The trend should be removed as detailed above, before
proceeding to interpretation of the eperimental variogram
+ 5nterpretation"
o #-ort=scale variance: the nugget effect is a discontinuity in the variogram at the origin
corresponding to short scale variability 5t must be chosen as to be e$ual in all
directionsG pick from the directional eperimental variogram ehibiting the smallestnugget !t times, one may chose to lower it or even set it to
o nter.ediate=scale variance:geometric anisotropy corresponds to a phenomenon with
different correlation ranges in different directions
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
76/201
8igure 13
2 The spherical model The spherical model is the most common variogram model type The
spherical model is mathematically defined by formula 1+, and 8igure 1+ shows a spherical
type model
(+)
8igure 1+
3 The eponential model The eponential model is similar to the spherical model but it
approaches the sill asymptotically 5t is mathematically defined by formula and shown as a
variogram in 8igure 1
()
8igure 1
0/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
77/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
78/201
8igure 1?
/ The last model type is known as the dampened hole effect because it includes a damping
function in its mathematical formula (formula 7) The model variogram is shown in 8igure
17
(7)
8igure 17
!4am#les
The variogram models in the principal directions (maor hori6ontal, minor hori6ontal, and vertical)
must be consistent, ie, same nugget effect and same number and type of structures This is re$uired so
that we can compute variogram values at angles not aligned with the principle ais in off-diagonal
directions and between eperimental values The responsibility for a licit variogram model is left to
practitioner, current software does not help very much
%asic idea is to eplain the total variability by a set of nested structures where each nested structure
each having different range parameters in different directions"
(1)
where his the distance, h, hy, h6are the direction specific distance parameters, and a, ay, a6, are the
directional range parameters derived from the variogram models The range parameters a , ay, a6can
approach or positive infinity
=ow do we ensure a legitimate modelK
0?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
79/201
1 pick a single (lowest) isotropic nugget effect
2 choose the same number of variogram structures for all directions based on most comple
direction
3 ensure that the same sill parameter is used for all variogram structures in all directions
+ allow a different range parameter in each direction
model a 6onal anisotropy by setting a very large range parameter in one or more of the
principal directions
8igure 2 shows some simple 19 eperimental variograms, the respective models and parameters
8or each eample the sill is 1
8igure 2, The top eample shows a 19 eample with two nested structures 4oints to note are (1) the sum of the sill
contributions is e$ual to one, (2) the total range of the variogram is the sum of the component ranges The middle eampleshows a power model variogram, note that neither the eperimental or the the model variogram reach a sill The bottomvariogram is a simple eponential model
&ost beginning practitioners over model the eperimental variogram That is most beginners apply to
many nested structures and try very hard to catch each point in the model There should never be more
than three nested structure (not including the nugget effect), and the model need only be accuratewithin the range of its use eferring to the middle variogram of 8igure 2, if the variogram is not
needed beyond 12 units there is no need to model beyond 12 units 8igure 21 shows some more
07
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
80/201
comple eamples Dote that they are all 29 eamples &odeling 39 variograms is a simple etension
of the same principles used to model 29 5n short only model what you need to
8igure 21, The top left and bottom right eamples model 6onal anisotropy The bottom left eample shows a model ofgeometric anisotropy Th top right eample is $uite comple, re$uiring the need of a hole effect model to correctly modelthe spatial correlation
6ork &lo1
&odeling the spatial correlation is the most difficult and important step in the geostatistical modeling
process reat care should be taken
?
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
81/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
82/201
Lecture : +patia" -ata .na"*sis, The Quiz
Question )
The scatterplot is an ecellent visual tool to display correlation The correlation coefficient is an
intuitive measure of two point correlation 8or every point on a variogram plot ((-) versus -) there is
a scatterplot of z(u) and z(u-) values 9raw a scatterplot and give an approimate correlation
coefficient for the three points labeled !, %, and ' on the above figure
Question *
!n analytical variogram model is fit to eperimental points ive three reasons why such variogram
modeling is necessary
Question +
:hat is the difference between the variogramandsemivariogramK :hat is the difference between an
eperimental and a model semivariogramK
Question
'omplete the shorthand notation for the following variogram model"
Question -
=ow does geometric anisotropy differ from 6onal anisotropyK :hat is happening in an eperimental
variogram ehibits 6onal anisotropyK
?2
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
83/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
84/201
Question -
=ow does geometric anisotropy differ from 6onal anisotropyK :hat is happening in an eperimental
variogram ehibits 6onal anisotropyK
a) eometric" the range in one direction differs than the range in another Sonal" anges are the same
but one direction does not reach the sill
b) The eperimental variogram in the direction that is reaching the sill is achieving full variability, and
the other is not The could be a result of either an areal trend, or layering in the reservoir
?+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
85/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
86/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
87/201
8igure /2 The kriging weights must consider redudancy of the data, the closeness of the data, and the
direction and magnitude of continuity
There is one other goal when estimating the unknown attribute" minimi6e the error variance 5f the
error variance is minimi6ed then the estimate will be the best estimate The error variance is theepected value of the difference between the known and the estimate and is defined by"
(/+)
where6N(u) is the estimator, andz(u) is the true value .ne obvious $uestion raised by this e$uation is
how can we determine the error if we do not know the true valueK True, we do not know the true value,
but we can choose weights that do minim6e the error To minimi6e the estimation variance take the
partial derivative of the error variance (e$uation /+) and set to , but before taking the derivative
e$uation /+ is epanded"
(/)
The result is an e$uation that refers to the covariance between the data points %(u;u), and the data
and estimator%(u;u) ! first this may seem like a problem because we have not dicussed the
covariance between the data and the estimator, but we did discuss the variogram, and the variogram
and the covariance are related ecall that the variogram is defined by"
and note that the covariance is defined by (the covariance is not the s$uared difference whereas the
variogram is)"
The link between the variogram and the covariance is"
?0
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
88/201
so the variogram and the covariance are linked by"
(//)
where (-) is the variogram, '() is the variance of the data, and '(-) is the covariance
This makes it possible to perform kriging in terms of the variogram instead of the covariance
'ontinuing with the derivation of the kriging e$uations, we know that formula / must be minimi6ed
by taking the partial derivative with respect to the weights and set to 6ero"
setting to 6ero
(/0)
The result of the derivation in terms of the variogram is the same because both the variogram and the
covariance measure spatial correlation, mathematically"
(/?)
and the system of e$uatiuons in terms of the variogram is"
(/7)
This is known as simple kriging There are other types of kriging but they all use the same fundamental
concepts derived here
??
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
89/201
Discussion
There are a couple of motivations behind deriving kriging e$uaitons in terms of the covariance"
1 5ts easier olving the kriging e$uations in terms of the variogram re$uires that the mean be
carried throughout the derivation 5t is easier to simplify in terms of covariance
2 5t is possible to have the variance at hL be 6ero with a variogram, this makes the matri veryunstable The covariance is defined as the epected vaue of the difference, not the s$uared
difference therefore the value of the covariance at hL is always large and hence the main
diagonal in the matri will always be large &atrices that have small main diagonal elements
such as when using the variogram are difficult for solution algorithms to solve due to truncation
errors and so on
3 it is eay to convert the variogram to covariance
Im#limenting Kriging
.nce again, consider the problem of estimating the value of an attribute at any unsampled location u,
denoted 6N(u), using only sample data collected over the study area A, denoted byz(un) as illustrated in
8igure /3 8igure /3 shows the estimator (the cube), and the data (z(un)) To perform kriging ust fill
in the matrices 8or eample, filling in the left hand matri, entry 1,1, consider the variogram between
points 1 and 1 The distance between a point and itself is , and thus the first entry would be the nugget
effect entry number 1,2, consider the distance h between points 1 and 2, read the appropriate
variogram measure and enter it into the matri repeat for the all of the variogram entreis and solve for
the weights n
8igure /3
?7
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
90/201
8igure /+
8igure /
The estimate is then calculated as"
(/1)
The result is an estimate of the true value and the error associated with the estimate, as 8igure //
illustrates
8igure //
;riging provides the best estimate but there are some issues"
T$e ros and Cons of Kriging
ros7 Cons7
The best linear unbiased estimator
Smooths
7
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
91/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
92/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
93/201
(/10)
The se$uential simulation workflow is as follows"
1 Transform the original S data to a standard normal distribution (all work will be done in
@normal@ space) :e will see later why this is necessary
2 o to a location u and perform kriging to obtain kriged estimate and the corresponding kriging
variance"
9raw a random residual (u) that follows a normal distribution with mean of and varianceof 2;(u)
!dd the kriged estimate and residual to get simulated value"
Dote that ZN (u) could be e$uivalently obtained by drawing from a normal distribution with
meanZN(u) and variance 2;(u)
!dd F? (u) to the set of data to ensure that the covariance with this value and all futurepredictions is correct !s stated above, this is the key idea of se$uential simulation, that is, to
consider previously simulated values as data so that we reproduce the covariance between all of
the simulated values
Cisit all locations in random order (to avoid artefacts of limited search)
%ack-transform all data values and simulated values when model is populated
'reate another e$uiprobable reali6ation by repeating with different random number seed
(9eutsch, 1777)
6$y a Gaussian 8%ormal9 Distriution:
The key mathematical properties that make se$uential aussian simulation work are not limited to the
aussian distribution The covariance reproduction of kriging holds regardless of the data distribution,
the correction of variance by adding a random residual works regardless of shape of the residual
distribution (the mean must be 6ero and the variance e$ual to the variance that must be re-introduced),
and the covariance reproduction property of kriging holds when data are added se$uentially from any
distribution There is one very good reason why the aussian distribution is used" the use of any other
distribution does not lead to a correct distribution of simulated values The mean may be correct, thevariance is correct, the variogram of the values taken all together is correct, but the @shape@ will not be
5n @aussian space@ all distributions are aussian H13I There is a second, less important, reason why
73
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
94/201
the aussian distribution is used" the central limit theorem tells us that the se$uential addition of
random residuals to obtain simulated values leads to a aussian distribution The construction of
kriging estimates is additive %y construction, the residuals are independent and there are many of
them The only caveat of the central limit theorem that we could avoid is the use of the same shape of
distribution, that is, we may avoid multivariate aussianity if the shape of the residual distribution was
changed at different locations The challenge would be to determine what the shape should be
(9eutsch, 1777)
T$e ros and Cons of "imulation
ros7 Cons7
Honors the histogram ulti!le images
Honors the variogram
"once!tually difficult to
understand
#uantifies globaluncertainty
$ot locally accurate
a%es avaiable multi!lereali&ations
&acies "imulation
8acies are considered an indicator (categorical) variable and simulating facies re$uires indicator
simulation
7+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
95/201
Lecture 6: Geostatistica !a""i#$ %o#ce"ts& he uiz
9uestion 1
;riging is often referred to as a $est linear 9n$iased estimatorat an unsampled location
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
96/201
9uestion 3
:hat is &onte 'arlo simulationK =ow is &onte 'arlo simulation performed from a cdf :(x);
draw a random number read the appropriate $uantile
9uestion 4
:hy is decl9steringused prior to stochastic simulation of porosityK
:e use the distribution in se$uential aussian simulation so the distribution must be as accurate as
possible
9uestion "
:hat are the features of a simulated reali6ation that make it preferable to a ;riging map for oil
reservoir evaluationK :hat features are not appropriate for aussian simulationK
models the heterogeneity whereas the kriging map smooths out the heterogeneity
features that are connected like permeability are not appropriate for simulation
7/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
97/201
Lecture : +tructura" Mo%e"in
Celocity >ncertainty
urface %ased &odeling
urface 8lapping 9istribution
8ault =andling #ecture 0 Bui6
Introduction
5t is estimated that hydrocarbon reserves recoverable through improved reservoir management eceed
new reserves that can be added through eploration 5ncreasingly, it is being recogni6ed that 39
seismic data analysis is a critical reservoir management technology and plays a key role in reservoir
detection, delineation, characteri6ation, and monitoring =owever, 39 seismic alone is inade$uate for
many applications due to (1) limited resolution, and (2) the indirect and*or weak relationships between
critical reservoir parameters such as permeability, porosity, and water saturation !s a result, it isgenerally recogni6ed by reservoir scientists that proper reservoir description and monitoring re$uire
full integration of 39 seismic with engineering, geological (including geochemical and geostatistical),
petrophysical, and borehole geophysical methods
eismic is very good at many things such as resolving large scale structural features, recovering
information from 7777O of the volume of interest, but it is not good at resolving fine scale features
such as resolving petrophysical properties, this task is left to geostatistics .ther more intimate
petrophysical property sensing tools such as logs offer fine scale measures of petrophysical properties
but offer no insight into what lies beyond the tools range, this task is left to geostatistics 'ore data is
an even more finely scaled measure of petrophysical properties, but it range of insight is even less than
that of log data eostatistics
eostatistics uses the coarse-scale structural information offered by seismic, the mid-scale information
offered by electric logs and the fine scale information of core data to generate high resolution models
of oil reservoirs ! reservoir model is built starting with the large scale features, the structural features,
first This lecture will discuss the uncertainty in interpreted seismic surfaces, and in the event there is
no reliable seismic data, how to simulate the surfaces that would define structure and fault handling
0elocity 'ncertainty
The geophysical branch of the eploration science is primarily concerned with defining subsurface
geological features through the use of seismic techni$ues, or the study of energy wave transmissionsthrough rock eismic techni$ues can be used to make subsurface maps similar to those developed by
standard geological methods
The three main rock properties that the geophysicist studies are 1) elastic characteristics, 2) magnetic
properties, and 3) density !lthough studies of density and magnetic properties provide useful
information, elastic characteristics are considerably more important since they govern the transmission
of energy waves through rock 5t is this elastic characteristic that is studied in seismic surveys The
word seismic pertains to earth vibrations which result from either earth$uakes or artificially induced
disturbances
eflection seismic surveys record the seismic waves that return or reflect from subsurface formation
interfaces after a seismic shock wave has been created on the surface %y measuring the time re$uired
70
http://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#velocity_uncertaintyhttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#velocity_uncertaintyhttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#surface_flappinghttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#fault_handlinghttp://longhorn.zycor.lgc.com/geostats/lec7/quiz7.htmhttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#velocity_uncertaintyhttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#velocity_uncertaintyhttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#surface_flappinghttp://longhorn.zycor.lgc.com/geostats/lec7/lec7.html#fault_handlinghttp://longhorn.zycor.lgc.com/geostats/lec7/quiz7.htm -
7/26/2019 []_Geostatistics_in_12_Lessons.doc
98/201
for different waves to be reflected from different formations, the geophysicist can identify structural
variations of the formations 8igure 01 illustrates this process in a typical land survey operation
8igure 01, a typical land survey operation
The obective of seismic work is to develop maps that indicate structures which might form traps for
oil or gas from the data provided on the record cross sections The geophysicist makes maps bycalibrating the seismic attribute to core data, well log data, and analogue data !ny feature that causes
a change in propagation of sound in rock shows up in the seismic survey 'hanges in dip, different
rock types, possible faults, and other geological features that are some of the features indicated in the
sections These features are not immediately evident in the seismic data everal steps are necessary to
convert seismic data into useful structural and stratigraphic information
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
99/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
100/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
101/201
8igure 0+ The parametric surface after @undulation@ has been added
The model is built surface by surface and each surface is deposited on top of the eisting surfaces (if
there are any, and there wont be at first) using the following protocol"
1 The central location of the new surface (x+ +) is selected stochastically The distribution used
for selection of the location is derived from the distribution of possible locations given the
thickness of the reservoir !t the beginning of the simulation all surfaces have the same
probability of selection, but as the simulation continues the reservoir builds up thickness and
there fewer permissible surfaces that will comply to the selection and conditioning criteria
2 The length of the surface Xis randomly selected from a triangular pdf with the minimum and
maimum parameters being user selected
3 The inner and outer widths, the height of the surface and the orientation of the surface are
selected from a triangular distribution with parameters provided by the user
+ The surface is @dropped@ onto the reservoir !ny eisting surfaces will truncate the new
surface 8igure 0 shows the dropping principle in action
8igure 0 The dropping principle used in the simulation of surfaces
'ondition the surface to the data !ll of the surfaces are dropped to the same datum as
indicated in figure 0 There are two solutions if the surface does not conform to the
intersections provided by the well data (1) raise the surface to meet the intersection, and (2)
lower the surface to meet the intersection 5f the surface is raised it could be reected if it too
short and will not be truncated by eisting surfaces, instead, the surface is lowered to the
intersection as in 8igure 0/
11
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
102/201
8igure 0/ 'onditioning of the surface to a single well data
/ epeat until the reservoir is fully populated
8igure 00 shows an eample of simple parametric surface simulation
12
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
103/201
"urface &la##ing
urface flapping is an acronym for surface uncertainty 8igure 0? shows surface flapping The pink
vertical lines are well data that the surface must be conditioned to The light blue hori6ontal line is the
gas oil contact and the pink hori6ontal line is the oil water contact The dark blue lines are top surface
reali6ations and the green lines are bottom surface reali6ations There is uncertainty in the true location
of the top surface and the bottom surfaces everywhere ecept at the wells The blue and green lines
illustrate the etent of uncertainty about these surfaces The green lines do not flap as wildly as the
blue lines There is sound reasoning for this urface uncertainty cannot be assessed independently,
once the uncertainty in the top surface with respect to the present-day surface has been established allother remaining surfaces will have less uncertainty The uncertainty in remaining surfaces is accounted
for in the uncertainty with respect to the distance between layers .ne could imagine considering each
of the surface uncertainties modeled independently but in doing so negative volumes could be created
(the surface lines could cross) !lso the distribution of thickness would be ignored 5n those locations
where the top and bottom surfaces cross there might be at minimum 6ero thickness, but the seismic
derived distribution of thickness might suggest that there is a very low possibility of 6ero thickness
This is why we model the top surface uncertainty first conditioned to the uncertainty of the surface
with respect to the present-day surface and all remaining surfaces are modeled conditional to the
uncertainty of the thickness between surfaces !n important point that must be pointed out is velocity
uncertainty and how this uncertainty relates to depth * surface determination is not considered here,
this is a simplified model meant only to illustrate surface uncertainty with respect to the well data The
uncertainty in the surfaces can be modeled using se$uential aussian simulation
13
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
104/201
8igure 0? ! diagram of surface uncertainty
!ssessing the uncertainty in surfaces is important for the determination of pore volume and hence
predicted oil in place volumes 8or eample consider the calculation of the gross pore volume"
The net-to-gross ratio and net porosity are inferred from the well data, available seismic data and
geological interpretations There are uncertainties eisting in the determination of the net-to-gross
ratio and the net porosity due to limited well data and uncertainty in the calibration of soft seismic and
geological data >ncertainties in all factors propagate to uncertainty in the final calculation of pore
volume The uncertainty in pore volume is a function of the multivariate distribution of the three
contributing factors" C, net-to-gross ratio, and net porosity 5nference of this multivariate
distribution is difficult due to the poorly known dependencies such as the relationship between
porosity and surface interpretation ! particular model of this multivariate distribution can be built
assuming that the three factors are independent :e will adopt such a model The distributions of
uncertainty in the three controlling variables must be determined
The top and bottom surfaces will be stochastically modeled to $uantify the distribution of uncertainty
in the C This modeling is guided by well data and the best estimate of the surface from seismic
The uncertainty of the average net-to-gross ratio and the net porosity are determined by bootstrap
resampling from the best distribution that can be inferred from limited well data and supplementary
seismic and geologic data The gross roc vol9me is the reservoir volume above the oil*water contact
(.:') constrained by the top and bottom surfaces of reservoir ! gas-oil contact is needed for
reservoirs with gas 8igure 07 shows a cross section view of a reservoir
1+
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
105/201
8igure 07 ! cross-section of a hypothetical oil reservoir
The reservoir is constrained by a top and bottom surfaces (black curves) The .:' is represented by a
red hori6ontal line and the gas-oil contact G
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
106/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
107/201
Lecture : +tructura" Mo%e"in, The Quiz
Question )
:e are not eplicitly trying to model surfaces that define geological structures :hat are we trying to
modelK :hyK
Question *
:hat kind of data is geophysics usefull for (in the contet of this lecture)K :hat is the place of
geostatistics, and how does it integrate with geophysical data to provide accurate modelsK
Question +
:hy model the surface closest to present-day surface firstK :hy not model each surface with the same
uncertaintyK :hat does this techni$ue preventK
Question
Dame two sources of velocity uncertainty, and suggest how these sources can impact the reservoir
model
Question -
mall scale faults are often ingnored in the reservoir model, whyK
solutions
Auly 31, 1777
10
http://longhorn.zycor.lgc.com/geostats/lec7/quiz7sol.htmhttp://longhorn.zycor.lgc.com/geostats/lec7/quiz7sol.htm -
7/26/2019 []_Geostatistics_in_12_Lessons.doc
108/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
109/201
Question -
mall scale fratures are often ingnored in the reservoir model, whyK
5t is assumed that small scale fractures cam be approimately handled by effective flow properties
Auly 31, 1777
17
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
110/201
Lecture : Ce"" Base% (acies Mo%e"in
ðodology
e$uential 5ndicator ðods
Truncated aussian ðods
'leaning 'ell %ased eali6ations
#ecture ? Bui6
Introduction
eservoir simulation and decision making re$uires 39 distributions of petrophysical attribute like
porosity, permeability and saturation functions There is no direct need for lithofacies models
Devertheless lithofacies are considered important because petrophysical properties are often highly
correlated within the lithofacies types ;nowledge of the lithofacies types serves to constrain the range
of variability to the distribution of properties within the lithofacies #ithofacies are distinguished by
different grain si6e diagenetic alteration, or any other distinguishing feature 8or eample shale is
different than sandstone which is different than limestone which is in turn different that dolomite 5tshould be born in mind however that we are first and foremost concerned with making realistic
distributions for the purpose of decision making, not making pretty pictures
Met$odology
%efore even beginning to model lithofacies one should ask if it is a worthwhile venture &odeling
lithofacies may not always yield improved prediction of reservoir performance To make this decision
easier consider the following"
1 The lithofacies types must have significant control over the petrophysical properties
ignificant control is a contentious issue, however a useful guide would be to consider adifference in the mean, variance, and shape of at least 3O !s well the saturation function
should not overlap between lithofacies, and here there should also be a difference in the overall
average of 3O between lithofacies
2 9istinct lithofacies must be easily discerned in well log data and core data
3 the lithofacies must be at least as easy to model as the petrophysical properties implicity rules
here .verly elaborate models will be detrimental to the model
=ow many lithofacies should be modeledK The number of lithofacies to be modeled is a decision that
must be made at the time of modeling, however, there are some pieces of advice to be offered Two
@net@ and one @non-net@ (net means oil bearing) lithofacies often provide sufficient detail for most
reservoirs The limit of workability is three net and three non-net lithofacies %eyond this limit modelsbecome nearly unworkable
:ith the lithofacies distinguished select the modeling techni$ue"
1 cell based modeling (purely stochastic)
2 obect based modeling (partly stochastic and partly deterministic)
3 deterministic modeling (purely deterministic)
Dote that deterministic modeling is always preferred :hy leave things to chance when know the
responseK
'ell based modeling is by far the most preferred method for modeling lithofacies ome of the reasons
include"
11
http://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#methodologyhttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#sequential_indicator_methodshttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#truncated_gaussian_methodshttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#Cleaning_cell_based_realizationshttp://longhorn.zycor.lgc.com/geostats/lec8/quiz8.htmhttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#methodologyhttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#sequential_indicator_methodshttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#truncated_gaussian_methodshttp://longhorn.zycor.lgc.com/geostats/lec8/lec8.html#Cleaning_cell_based_realizationshttp://longhorn.zycor.lgc.com/geostats/lec8/quiz8.htm -
7/26/2019 []_Geostatistics_in_12_Lessons.doc
111/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
112/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
113/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
114/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
115/201
8igure ?+
Cleaning Cell Based Realisually the smaller proportions suffer
This is a product of the order relations carried out in the indicator kriging algorithm (correcting for
negative kriging weights)
The easiest way out of this dilemma is to post process the reali6ations to honor the target proportions
.ne useful method for cleaning is $uantile transformation Buantile transformation works well when
there is a natural nesting order to the lithofacies categories, such as a continuous variable transformed
to a categorical variable, however, artifacts can occur when dealing with more than two unorderedlithofacies
Ma7i.u. Posteriori #election
&aimum a posteriori selection (&!4) replaces the lithofacies type at each location uby the most
probable lithofacies type based on a local neighborhood The probability of each lithofacies type is
based on the following criteria"
1 'loseness of the data in the window to the location u
2 whether the data is a conditioning data
3 mismatch from the target proportion
11
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
116/201
8igure ? The &!4 algorithm at work
8ormally, consider an indicator reali6ation,
where the proportions of each lithofacies type in the reali6ation are
(the probability of the indicator being state is between and 1) with the sum of all proportions being
e$ual to one, and the target proportions of each lithofacies bound by the same properties
Dow consider these steps to cleaning the reali6ation
and bringing the the proportions
closer to the target proportions
!t each of the=locations for al of the locations within the area of study, calculate the probability >(u)
for each indicator based on a weighted combination of surrounding indicator values"
11/
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
117/201
-
7/26/2019 []_Geostatistics_in_12_Lessons.doc
118/201
gLweight to ensure that the new lithofacies are closer to the target global proportionsG
specifically, increase the proportion if it is less than the target proportion, and decrease it if it is
too high"
where . are the target proportions and .()
are the reali6ation proportions
!s one would epect, the si6e of the window ?(uE) and the distance weighting have significant impact
on the @cleanliness@ of the result c(uE) has the effect of cleaning in favour of conditioning datagLdoes not impose the targ