Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
104 views15 pages

Journel1985 PDF

Uploaded by

Ariel Zúñiga
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
104 views15 pages

Journel1985 PDF

Uploaded by

Ariel Zúñiga
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Mathematical Geology, Vol. 17, No.

1, 1985

The Deterministic Side of Geostatistics I

A. G. Journel 2

The probabilistic approach is but one language used by geostatisticians to characterize spatial
variability and to express a very simple criterion for goodness of estimation. Notions such as
stationarity and ergodicity are important for the consistency of the probabilistic language
but are irrelevant to the real problem, that of estimating a well-defined deterministic spatial
average. The kriging algorithm is established without any recourse to probabilistic modeling
or notation.

KEY WORDS: geostatistics, stationaritY, ergodicity, spatial average, deterministic kriging.

INTRODUCTION
The probabilistic interpretation of a natural phenomenon known to be unique
and the related hypothesis of stationarity do not always appeal to geologists'
intuition, even though the positive aspects of geostatistical applications may be
convincing. In fact the probabilistic approach and modeling are but one language
(and possibly not the simplest) to express very simple deterministic criteria for
estimation. Stationarity is not an intrinsic property of the deposit but is a prop-
erty of the probabilistic model. A model choice and its properties must be judged
on its efficiency in capturing and solving the problem at hand. Engineers are
correct in judging geostatistics by efficiency and practical records rather than by
the intricacy of its theoretical developments; if there existed a simpler language
leading to the same algorithms and results there is little doubt that it would be
adopted.
Probabilistic hypotheses such as stationarity and isotropy are shown to be
equivalent to an experimenter's decision to look at average spatial characteristics
over particular subareas and/or directions. Kriging is shown to be equivalent to a
deterministic process of averaging errors of a same type over a predefined field
where such averaging makes physical (geological) sense.

1Manuscript received 15 June 1983; revised 14 November 1983.


2App/ied Earth Sciences Department, Stanford University, Stanford, California 94305
U.S.A.
1
0020-5958 85/0100-0001504.50~0 © 1985 Plenum Publishing Corporation
2 Journel

Stationarity: A Model Choice

The probabilistic inference of a phenomenon requires some repetitiveness


of the phenomenon observed. Natural phenomena are rarely as repetitive as the
rolling of a die could be. In the earth sciences repetitiveness is usually obtained
by moving within a certain area or volume. For example, consider a copper de-
posit D, z(x) being the copper grade at point x E D. The population D includes
several heterogeneous subpoputations: oxide and sulfide mineralizations, each
with various degrees of alterations, rock types, etc. If the mean grade over D is
required, repetitiveness is considered over the whole deposit D, care being given
to represent evenly each subpopulation or mineralization. The grade assessment
may also be done per type of mineralization; repetitiveness is then obtained by
moving the location x within that type.
The definition of the population within which repetitiveness is obtained is
a decision of the experimenter, for no two individuals are exactly identical and
the initial large population can be split into as many subgroups as there are in-
dividuals. That decision of the experimenter is guided by various considerations:

(i) The chemistry (or physics) of the phenomenon studied: oxide and sulfide
Cu mineralizations should be considered separately not only because they will
require different milling processes, but also because they correspond to clearly
different geological geneses. Conversely, for sulfide mineralization, it is not
essential to separate bornite from chalcopyrite mineralization.
(ii) The amount and type of information available: if many populations are con-
sidered, many more data are needed to characterize each of them. At an early
stage of exploration, the geologist-geostatistician has no choice but to consider
a single population; as more information becomes available, he/she can start
separating mineralizations. The distinction between bornite (very high Cu
grade) and chalcopyrite may always remain inaccessible if data are defined
on a large support, such as a core of 5 m length; moreover such distinction is
irrelevant because no mining or milling will be able to separate these two
sulfide mineralizations.

In probabilistic terms, the area or subpopulation within which repetitive-


ness is assumed corresponds to a "stationary" area or subpopulation. More pre-
cisely, the spatial variability of the phenomenon z(x), where x is in the area A,
is modeled by a stationary random function (Z(x), x E A}. A random function
Z(x) is a stationary if its multivariate distribution is invariant by translation
within the space or area A, entailing that all its moments are also invariant by
translation; see Doob (1953).
Stationarity allows statistical inference, for example

(1) since Prob (Z(x)~< z} = F(z), for all x ~ A, the distribution function F(z)
can be estimated by the cumulative histogram of data z(x'cOtaken at differ-
ent locations x~ ~ A.
The Deterministic Side of Geostatistics 3

(2) the variogram 27(h) = E([Z(x + h)- Z(x)] 2} can be estimated by pooling
together pairs of data z(x~ + h), z(xa) taken at different locations x~ C A.
Stationarity is a property of the probabilistic model, hence it is a decision
(a choice) of the experimenter, not an intrinsic property of the real phenomenon
studied. As indicated before, the choice of a stationary model may change de-
pending on the scale of the observation made and the amount of data available.
A statement such as "the sulfide mineralization is stationary" is incorrect and
should be reworded as "given the scale of the study, which does not require nor
allow differentiation of sulfide types, a stationary random function modelZ(x)
is considered to represent the spatial variability of copper grade within the pool
of all sulfide mineralizations."
Stationarity being a property of a model chosen by the experimenter and
not an intrinsic property of reality, it cannot be proved or rejected from data.
But as with any model choice, stationarity can be validated a posteriori by judg-
ing whether this choice has proven to be efficient in solving the particular prob-
lem at hand.

Spatial Averages and Moments


Consider the spatial phenomenon (z(x), x C D} of copper grade at locations
x within the deposit D. Within the area A C D, this spatial phenomenon is mod-
eled by a stationary random function (in short RF)model (ZA(x), x CA). Sta-
tistical inference about the RF model ZA(X ) is made by sampling within A.
Consider for example a sample of size N randomly distributed within A.
The following experimental moments are defined.
experimental mean
1 N
N z(x )

experimental variance
,, 1 N
O~=l

experimental cumulative distribution function (in short cdf)


1 N ~1, if:z(xa)~z
F ( z ) = ~ : ~ i(x~;z) with i(xa;z)= [ (1)
Iv = 1 O, if not
The limit of these experimental values, when N ~ oo i.e., when an exhaustive
sampling of area A is available, are the following spatial averages
mean over A
4 Journel

variance within A :

s~ = ~ [z(x) - real 2" dx

proportion of grades <~z within A

FA(Z)=~A}fAi(x;z)'dx with (A}= fAdx:measureofA (2)

These spatial averages have a clear physical meaning and the corresponding mo-
ments of the stationary RF model ZA(X ) are identified to them.
ZA(X ) such that
E{ZA(X)} = mA Var {ZA(X)} -- s~
(3)
F(z) = Prob {Z(x) ~<z} - FA(Z)
The identification process (3) allows considering the experimental statistics (1)
as estimates of the moments of the stationary model ZA(x).

Remarks
Existence of the moments OfZA(X ) is ensured by definitions (3).
Questions about the ergodicity of the RF model ZA(X ) are irrelevant, for
the moments of Za(x ) are defined from the spatial averages (2) not from their
"hypothetical" limits when (A}-+ oo. It is recalled, see, for example, Breiman
(1969), that a stationary RF Z(x) is said to be "ergodic" if the limits of spatial
averages of type (2), when {A}-+ 0% exist and identify the corresponding RF
moments, e.g.

lim ~ ~-~
{A)-~ 1 z(x). dx = ~(Z(x))

As with stationarity, definitions (3) cannot be debated or tested a priori,


but can be validated or rejected a posteriori by judging the efficiency of the
model ZA(X ) in solving the problem at hand.
In addition to the previous univariate statistics m, 0 2 , F(z), geostatisticians
consider bivariate statistics (joint properties of grades taken at two different
locations x and x + h) such as the experimental variogram and bivariate cumula-
tive histogram

1 nO)
2.~(h) = 2 ~ ~ [z(x~ + h) - z(x~)l ~
(4)
F(h;z,z')= 1 nO) i(xa +h;z)'i(xa;z')
The Deterministic Side of Geostatisfics 5

with n(h) being the number of pairs of data, within A, approximately separated
by the vector h.
The limits of these h-bivariate statistics depend not only on the area A but
also on the vector h. Indeed, when N ~ oo (exhaustive sampling of A)

2~(h) -+ 27A(h) - 1 ; [z(x + h) - z(x)] ~- dx


{A(h)} JA(h)
(5)
~(h;z,z,)~FA(h;z,z,)_ 1 ( [i(x +h;z)'i(x;z')] "dx
(A(h)) dA(h)
with: A(h)= A • A-h being the intersection of A with its translation A_h, and
(A(h)} = fA(h) dx: measure of A(h).
Again, the stationary RF model ZA(X ) is chosen such that its bivariate
characteristics identify the spatial averages defined in (5).
27(h) = E([ZA(x + h) - ZA(X)] z} ~ 27A(h)
F(h;z,z')=Prob(ZA(x+h)<~z and ZA(X)<.z'} (6)

=-FA(h;z,z' )

Remarks
As opposed to the spatial averages (2), the spatial averages (5) a that is the
bivariate characteristics (6) of the RF model Z A (x) are not representative of the
whole area A but only of the subarea A(h) C A.
The spatial averages over A(h) and A(-h) are usually different. For example:
FA(h; z , z') :~ FA(-h; z, z') = FA(h; z', z); however: "YA(h)= 'yA(-h).
Thus the variogram 27A(h) is representative of the area A(h) U A(-h); see
hatched area on Fig. 1.
If the area A is rectangular as in Fig. 1, and the vector h is parallel to one
of the sides of A with modulus Ih/less than one-half of this side denoted by
lA (h), then:
A(h) U A ( - h ) ~ A for ]h[~<½ (lA(h)). (7)

\ ", ', \'l ', \ \ \\[ [

Fig. 1. Areas of ~epresentativity of the h-bivariate


statistics.
6 Journel

For the spatial average 7A(h) to be representative of the whole area A, the inter-
distance vector h must not be too large. Indeed, geostatisticians never consider
experimental variograms beyond one-half of the maximum experimental inter-
distance available, cf., Journel and Huijbregts (1978, p. 194).

Regrouping Statistics
By restricting the area of stationarity, the experimenter can better focus on
the specific properties of that area, provided he has enough data to do so. By
extending the area of stationarity, the amount of data is increased and the esti-
mates of the spatial averages m A , FA(h), ~/A(h) are improved, provided that such
averaging makes sense.
A preliminary decision to split a deposit D, say into two stationary areas,
D = A t_)A', may be reconsidered in view of the two local statistics. For example,
the two areas may have been defined on rock type considerations, but it may
turn out that the corresponding two local statistics of Cu grades do not differ
sensibly; these local statistics can then be regrouped with due respect to their
respective zones of influence. If the two areas A and A' do not overlap, and D =
A U A', the spatial averages over D are written

1
m D = -~
fDz(x)" dx =
(A}'mA+(A'}'mA'
{A) + {A'}
(8)

1
~D [z(x + h) - z(x)] 2" d x = (A(h)}" 7A(h) + (A'(h)} • 7A'(h)
"YD(h) - {D(h)} (h) {A(h)} + {A'(h)}

Consequently, if the N + N ' data available over A UA' = D are regularly


located (on a grid) over D, the experimental statistics should be regrouped as

A N ' r n A + N " m" A, 1 N+N'


mD= N+N' N+N' y~ z(xa)
0~=1
(9)
n(h) . 2L4(h) + n'(h) " %4'(h)
3'D(h) = n(h) + n'(h)

with n(h) and n'(h) being the number of pairs of data used for ~'A (h) and 7A'(h).
If the N + N' data available are not regularly located over D, weighting by
areas of influence of the type (8) is warranted. Hence it appears that a split D ---
A t3 A' may be required, not for reasons due to the homogeneity of the under-
lying phenomenon but because the data set is clustered; if clusters happen to be
in area A, this area being more sampled, naive weighted statistics of the type
(9) would overrepresent area A.
The Deterministic Side of Geostatistics 7

Grouping Directions
In addition to grouping areas, one may want to group directional vario-
grams. Consider an area A and the following variogram in direction w

27A(h) = 27A([h I, w) - (A(I h1I, w) fa (lhl, w) [z(x + h) - z(x)] 2. dx

the vector h being of modulus ]hi and direction w. Regrouping over all direc-
tions of the three-dimensional space R 3 would provide the so-called "omni-
directional" variogram

1 fu nit sphere "rA(IhI'w)'dw


2~'A(ih [) = ~
(10)

In practice, if the )~.=1 n(Ih i, w) pairs of data available provide a regular


(uniform density) coverage of the unit sphere in R3, straightforward weighting
by number of pairs can be done, which amounts to ignoring the direction of the
data pair in the variogram calculation:

L
Z n(lhE,w)'*A({h[, w)
2@A([hl) = w=l 1 n(lhl)
L -n(Ihl) Iz(xo + h ) z(x)l 2
y" n([h[,w)=n(lh[)
W=I

(11)

If one direction is overrepresented in the data set (a classical example being


the down-hole direction in mining applications), the weighting formula (11)
which is used by most variogram software would give an overrepresentation of
that direction.
It thus appears that an hypothesis of isotropy made by the experimenter is
of the same type as that of stationarity. Again, rather than an hypothesis subject
to tests, it is an operational decision validated a posteriori by its efficiency. Cor-
respondingly, the variograms (10) and (11) should not be called isotropic but
rather "omnidirectional."

Grouping Distances
The last step in grouping is that of grouping distances. Consider the omni-
directional semi-variogram 7A(thi) defined by relation (10) and proceed to a
grouping of all distances [h I such that (x, x + [h l) E A.
8 Journel

~A-=~-~ 7A(lh[)'dlh [ - 2(A} 2 dx [z(x)- z(x')]2.dx '

-= s~ as defined by relation (2).


This total grouping is so strong that from the semivariogram 7A(h) only the
variance characteristic s~ remains: 7.4 = s~ =Var {ZA(X)} , cf. relations (2) and
(3). This extreme case of regrouping amounts to ignoring all spatial correlation,
i.e., to considering a pure nugget effect model. Again this averaging out of all
spatial correlation is not an intrinsic property of the real phenomenon but a
model choice of the experimenter. Such an extreme model would prove totally
inefficient for local estimation but may prove good enough for global estimation,
say the estimation of the spatial average m D over the whole deposit D.
As a general rule, the largest regrouping that can do the job would be adopted
by the geostatistician, because a large regrouping usually means better statistical
inference from a larger data set. Each split should be justified not only be dif-
ferent statistics (they may be so because of statistical fluctuations due to the
paucity of data) but also by physical considerations; e.g., strong anisotropic
features should have clear geological explanations, otherwise an omnidirectional
model would be safest.

The Composite Random Function Model


The stationary RF model ZA(x ) defined by the identification relations (3)
and (6) may be introduced-in a more "practical" w a y - b y two successive pro-
cesses of conditioning and compositing over the area A.
Classically, the spatial regionalization of true values {z(x), x E A ) is inter-
preted as an outcome of a (not necessarily stationary) random function model
Z(x). It may seem better to consider a RF model customized to the actual area
A under consideration, the extreme case being a RF Z(x) conditioned by its
actual outcome over A : {z(x), x C A }.
Denote by Z(xlA ) the conditioned RF model (Z(x)[Z(x)= z(x), x EA}.
The corresponding univariate cdf is a Dirac step function of parameter z(x), and
is nonstationary because the parameter value z(x) depends on the location x:
0, if z <. z(x)
Prob (Z(x[A)<~z} =Az(x)(Z)= 1, otherwise (13)

= i0c;z) for a l l x E A
The corresponding nonstationary mean and variance are

E{Z(x ]A)} =f z " dAz(x)(Z ) = z(x)

1,, (14)
Var (Z(x ]A)} = [ z 2 "dAz(x)(Z ) - z2(x) = 0 for all x E A
J
Tlle Deterministic Side of Geostatistics 9

Similarly, for any pair of locations x, x ' E A, the corresponding pair of con-
ditioned random variables Z(x]A) and Z(x']A) has a Dirac bivariate cdf with
parameters z(x), z(x')
Prob (Z(x [A) ~<z and Z(x'[A) <~ z'} : &z(x), z(x')( z, z')
1, if z(x)<.z and z(x')<.z' (15)
i(x; z) . i(x'; z')
0, otherwise
The corresponding covariance and nonstationary variogram are

Cov {Z(x ]A), Z(x' [A)} : ; f zz' . d2Az(x), z(x,)(z, z') - z(x) . z(x')

=0 (16)

E([Z(x]A)- Z(x'IA)12}= [z(x)- z(x')] 2 for allx,x' EA


The conditioned RF model Z(x IA) thus appears as a rather particular RF with
zero covariance and nonstationary expected value and variogram. Clearly this
model is of no practical use.

The Compositing Process


The conditioned RF Z(x ]A) is of no practical use because its characteristics
cannot be inferred; for example, its variogram defined by relation (16) depends
on both locations x and x'. The idea, quite standard in statistical inference, con-
sists in averaging (compositing) the previous conditional variogram values.
Consider the RF YA(X) with the following bivariate cdf
Prob(YA(x+h)<~z and YA(x)~z'}

_ 1 ( Az(x +h), z(x)(z, z') " dx


CA(h)) JA(h)
(17)
_ 1 ( [i(x+h;z)'i(x;z')l "dx
(A (h)} JA(h)
==-FA(h; z, z') as defined in relation (5), for all x E A(h)
The compositing (averaging) of the nonstationary bivariate cdf's &z(x +h), z(x)
(z, z') of all pairs of conditioned RVs Z(x + h [A) and Z(x [A) results in the spa-
tial average FA(h; z, z') and a stationary RF YA(X) isomorphic to the RF model
ZA(X ) defined by the relations (3) and (6). The two RF models are isomorphic
in the sense that they have the same bivariate cdf.

Remarks
If the two RF models YA(x) and ZA(X ) share a common bivariate cdf, they
necessarily share common univariate cdf and moments:
10 Journel

Prob {YA(X) ~ z} = FA(Z) for all x ~ A


(is)
~ ( r A ( x ) ) = mA Vat (rA(x)) = s~
with FA(z), mA and s~ defined by the spatial averages (2).
For all practical purposes and provided that only bivariate and univariate
characteristics are involved, the two isomorphic RF models YA(X) and ZA(X)
can be considered as identical. Thus the two notations YA(x) and ZA(x) are
interchangeable.
Whether the initial RF model Z(x) is stationary or not, the model ZA(X ) is
stationary, and up to the bivariate level all its moments are identified to the
exhaustive statistics (spatial averages) over A.
The decision to customize the RF model to the entire deposit D or to any
subarea A C D is a decision of the experimeter. Customizing the model to D
rather than A would amount to extending the compositing process (17) to the
entire deposit D, thus defining another RF model ZD(X ).
Inference of the variograms 27A (h) of the RF model Z A (x) amounts to esti-
mating the spatial (Riemann) integral of the squared differences [z(x + h) - z(x)] 2
over A ; see relation (5).
This last remark is of importance for guiding research in the field of robust
variography. The problem is to estimate a very real and well-defined spatial
average 27A(h), not the "elusive" characteristic 2"y(h) of a no less "elusive" RF
model Z(x) with a a priori strong multinormal flavor. There is no a priori RF
Z(x) a realization of which is the deposit under study, but rather a stationary
RF model ZD(X), or ZA(X), is built whose moments identify the corresponding
spatial averages over D, or A.
As is the case of any discrete estimation of an integral over a domain D, one
major problem is to ensure that all subdomains of D are properly represented.
This problem becomes crucial when the available data locations present clusters
over preferentially rich (or poor) z-valued areas within D; cf. Omre (1984).

Kriging Under the Model YA(x)


The definition of the composite conditional RF model YA(x) = ZA(x) with
variogram 27A(h) does not change the ordinary kriging process familiar to geo-
statisticians; it allows, though, a better interpretation of its minimization criterion.
Indeed, when using the variogram 27.4(h) or, in practice, its estimate 2"yA(h), the
criterion being minimized is not the nonconditional estimation variance E{ [Z(x) -
Z*(x)] 2} but rather the composite conditional estimation variance

{A} E{tZ(xIA) - Z*(xlA)l=} • dx = [z(x) - z*(x)l 2" dx (19)

Ordinary kriging is thus the process of minimizing the average squared error
[z(x) - z*(x)] 2 when the whole configuration of estimation at location x is
moved throughout the area A.
The Deterministic Side of Geostatistics 11

Remarks
The configuration of estimation is the relative geometry of the location x
and the locations x~ of the N data (a = 1, N) used for the estimate z*(x). Strictly
speaking x can take only those locations x' ~ A such that all corresponding data
t t
location xc~ = x + (x - x~), a = 1, N, still remain within A. Hence the area of
integration of formula (19) is not A b u t : A * ( x , N ) = {x' E A: x~ = x' + (x - x~)
A,a=I,N}.
Just like the compositing process defining the RF model YA(X), this moving
process over A can be considered whether the initial RF Z(x) is stationary or
not. More precisely, stationarity (and ergodicity) of the initial RF model Z(x) is
irrelevant to the definition of that moving process.
The preceding process of moving the configuration of estimation over the
area of interest was used by Matheron in his early work (1964, p. 83) to charac-
terize the error of estimation of a surface by a grid of holes, cf. Fig. 2. For each
location xo of the grid origin within the grid cell ( a l , a 2 ) of size ala2, an estimate
S*(xo) of the surface S is obtained by summing the positive cell areas. By vary-
ing the origin Xo within the basic cell (a~, a2) a variance, called "transitive esti-
mation variance of S " is defined

02(al, a 2 ) = 1( [S*(xo)-Sl2"dxo
ala2 J (a 1,a2)

The transitive theory of Matheron consists in formulating the limited expansion


of the function o2(al, a2), when the basic cell gets narrower (al ~ 0, a2 ~ 0),

02 02

1o, I i ! Io,

i I !

Fig. 2. Estimation of the same surface from two distinct origins of the data grid.
12 Journel

and in estimating the first terms of that expansion. In a more recent work "Es-
timer et Choisir" (1978), Matheron further developed his "transitive approach"
to provide a broad and philosophical discussion of the usage of probability in
the earth sciences.
An important question should be asked, "Does the previous minimization
criterion (19) make sense?" In particular, if A contains various mineralizations
and/or rock types, the configuration of estimation (x, xa, a = 1, n) may have to
be limited to homogeneous subareas of A. This is the deterministic wording of
nonstationarity within A.
In the presence of selective mining where only the rich grades z ( x ) > z c
(cutoff) will be recovered, it would make more sense to move the configuration
of estimation only over those rich subareas A*(zc) C A. The problem is that the
locations of these rich areas A*(zc) are not known beforehand.

The Deterministic Version of Kriging

The probabilistic model YA(X) appears as a convenient tool for expressing


what is actually a deterministic estimation criterion: minimization of the average
squared error (19) when the estimation configuration is moved throughout a
predefined area A. Hence there must be a deterministic way of expressing this
criterion which would forego the probabilistic approach altogether.
Consider the spatial regionalization (z(x), x E A} over an "homogeneous"
(an experimenter's decision) area A. The information consists of N data values
(z(xc~), xc~ E (N) C A } from which an estimate 7A(h) of the spatial average "YA(h)
is derived.
For estimating Z(Xo), Xo ~ (N), consider the linear combination

n(xo)
z*(xo) = ~ xa(x0) . z(x,~) (2o)
~=1

where [n(xo)] C (N) is the subset of data locations actually used in the estima-
tion neighborhood of Z(Xo).
The weights Xa(Xo) depend on each point x o being estimated and on the
particular configuration of the n(Xo) data used. For convenience, this expression
can be rewritten with the origin of the coordinate system centered at the point
Xo being estimated

n(xo)
z*(Xo) = ~ Xa(Xo) "Z(Xo + ho~) with
~=1 (21)

hc~ = xo~ - xo a = 1, n(xo)


The Deterministic Side of Geostatistics t3

Let A~ C A be the subset of A defined by

A* = {x: x E A andx + h a E A , a = 1,n(xo)}

The estimation configuration Ix, x + hc~, a = 1, n(xo)] can thus be moved within
A* without any datum spilling out of the area A. This moving process over A
generates the following average error characteristics

EA(Xo) = ~ 1 ~- A [ z ( x ) - z*(x)] -dx

(22)

VA(xo)=(-~o fA [z(x)-z*(x)]2"dx with {A~}= fA

In the following we will make the approximation that A~ ~ A , that is, that
the dimensions h a of the estimation neighborhood are much smaller than the di-
mensions of A. In other terms, border effects are ignored in the present analysis.
A first step consists of setting the average error EA(xo) to zero, the deter-
ministic equivalent of ensuring unbiasedness.

E4(xo) = ~ I fAA z(x)" dx - n(xo)


~_, Xa(xo) • ~ l j ~ z(x + hc~)"dx
Ot=l

~--rnA " I 1 n(xo)


~ X~(Xo
)1
~=1

Whatever the usually known value mA, the average error is zero if the sum
of the weights Xc~(x0) equals one

n(xo)
EA(Xo) = 0 if ~ ~(Xo) = 1 (23)
~=1

The second step consists of minimizing the average squared error VA(xo)
under the previous constraint. Under condition (23), the squared error can be
developed as

[z(x) - z*(x)] 2 = [z*(x) - m4 ] - 2~--~, Xc~(x) " [z(x). z(x + ha) - m~4]
O~

+ Z ~ x,~(~) - x~(x). [~(x + h~). z(x + h~) - m~ ]

Under the approximation A* ~ A o, all summations being done over c~ or/3 = !,


n(xo), the average squared error is written
14 Journel

VA(xo)= [~A} £ z2('Oax- mi 1

- 2 ~-"Xa(x°) [~A} £ z(x)'z(x +h~) ax - m~]

+ h~) dx - m£J
c~ t3

= s~ - 2 ~ Xu(Xo) "eA(he~)+~_~ ~ X~(Xo)" X~(Xo)" CA(hc~- h~)


a o~ e

= 2 F~ x,~(Xo) • %4(h.) - Z Z X~(Xo) • Xe(Xo) • %4(h~ - h e)


a a

with: CA(h) ~ z(x) " z(x + h) dx - m~4 = s~4 - TA(h)

under the approximation A (h) = A.


Note that the spatial average CA(h) is the deterministic equivalent of a
covariance, as 27A(h) is that of a variogram.
Minimization of the last expression for VA(xo) under the constraint (23) is
done using a standard Lagrange formalism yielding a system of n(Xo) + 1 equa-
tions with n(Xo) + 1 unknowns: the n(xo) optimal weights Xa(xo) and the La-
grange parameter ta(Xo).

n(xo)
Z X~(Xo). ~.4(h. - he) - ~(Xo) = yah.) ~ = 1, n(Xo)
(24)
n(xo)
E X~(xo)= 1
~3=1

The minimized average squared error is then:

.(xo)
Min (VA(Xo) } - ~ X.(Xo) " 7A(h.) - ~(xo) (25)

The system (24) is none other than the standard ordinary kriging system.
In practice, the spatial average 7A(h) is replaced by a model fitted from the dis-
crete estimate ~A(h) using whatever data are available over the "homogeneous"
area A.
This kriging algorithm having been established without any recourse to
probabilistic modeling or notation, it appears that the probabilistic approach is
The Deterministic Side of Geostatistics !5

but one language to express the actual estimation criterion. Such notions as sta-
tionarity or ergodicity are important for the consistency of the language, but are
irrelevant to the real problem addressed.

CONCLUSIONS
The probabilistic language used by geostatisticians may induce them into
nonrewarding debates over questions of language and mask some real problems
that deserve more of their attention.
It has been shown that repetitiveness of the error of estimation can be ob-
tained by moving the configuration of estimation throughout a predefined area,
with the variogram model being identified to the corresponding spatial average
of squared differences. In probabilistic language, this moving process corresponds
to a double process of conditioning then compositing over the area.
This analysis did not provide any new estimation algorithm, but may give a
more physical appreciation of

(i) The hypothesis of stationarity underlying the probabilistic approach, which


corresponds in fact to the experimenter's decision to average out errors over
a particular area.
(ii) The problem of "robust" estimation of the variogram 2%4(h), a spatial aver-
age over A rather than a moment of an elusive, normal-flavored random
function.
(iii) The criterion for estimation and its limitation; in some cases, there are more
appropriate criteria than minimizing the average squared error.

REFERENCES
Breiman, L., 1969, Probability and stochastic processes: Houghton Mifflin Co., Boston,
324 p.
Doob, J. L., 1953, Stochastic processes: John Wiley & Sons, New York, 654 p.
Journel, A. G. and Huijbreghts, Ch. J., 1978, Mining geostatistics: Academic Press, London,
600 p.
Matheron, G., 1964, La theorie des variables regionalisees et ses applications: Masson, Paris,
305 p.
Matheron, G., 1978, Estimer et choisir: Les Cahiers du CGMM, Fontainebleau, 175 p.
Omre, H., 1984. The variogram and its estimation: in "Geostatistics for Natural Resources
Characterization" ed. Verly et al. puN. Reidel, Dordrecht, Holland, Part I, pp. 107-
I25.

You might also like