Model-based approach
Consequently,
and
where
is the Euclidean distance between the coordinates
and
,
and
is an isotropic covariance function depending on the parameter
.
Assume we observe a random variable
at each region
and we are interested in predict/estimate this variable in each of the
regions
.
Now suppose the random variable
varies continuously over
and is defined as follows
where
where
and
are independent. For now, let’s make the unrealistic assumption that all
those parameters are known. Then, our assumption is that the observed
data is as follows
where
returns the area of a polygon. Furthermore, it can be shown that (using
Fubini’s Theorem and some algebraic manipulation)
where
is a positive definite correlation function. Now, let
be a correlation matrix such that
thus,
Then, if we assume
to be jointly normal, we
use can the conditional mean of
given
to estimate the observed random variable in the partition
.
Now, suppose the parameters
are unknown. The Likelihood of
can still be computed.
In particular, if we use the parametrization
,
we have closed form for the Maximum Likelihood estimators both for
and
.
Thus, we can optimize the profile likelihood for
and
numerically. Then, we resort on conditional Normal properties again to
compute the predictions in a new different set of regions.
Areal Interpolation (AI)
Areal interpolation is a nonparametric approach that interpolates
’s
to construct
’s.
Define an
matrix
,
where
is the weight associated with the polygon
in constructing
.
The weights are
(Goodchild and Lam 1980; Gotway and Young
2002). The interpolation for
is constructed as
The expectation and
variance of the predictor are, respectively,
and
In practice, the
covariance matrix
is unknown and, consequently needs to be estimated.
The variance each predictor
is needed as an uncertainty measure. It relies on both the variances of
’s
and their covariances:
The variances are often
observed in survey data, but the covariances are not. For practical
purpose, we propose an approximation for
based on Moran’s I, a global spatial autocorrelation. Specifically, let
be the Moran’s I calculated with a weight matrix constructed with
first-degree neighbors. That is,
is the average of the pairwise correlation for all neighboring pairs.
For regions
and
,
if they are neighbors of each other, our approximation is
The covariance between
non-neighboring
and
is discarded. The final uncertainty approximation for
will be an underestimate. Alternatively, we can derive, at least, an
upper bound for the variance of the estimates by using a simple
application from the Cauchy–Schwartz inequality, in which case,
is replaced with~1.