Inverse distance weighting (IDW) is a type of
deterministic method for
multivariate interpolation
In numerical analysis, multivariate interpolation is interpolation on functions of more than one variable; when the variates are spatial coordinates, it is also known as spatial interpolation.
The function to be interpolated is known at given poin ...
with a known scattered set of points. The assigned values to unknown points are calculated with a
weighted average
The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. The ...
of the values available at the known points. This method can also be used to create spatial weights matrices in
spatial autocorrelation
Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early dev ...
analyses (e.g.
Moran's ''I'').
The name given to this type of method was motivated by the
weighted average
The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. The ...
applied, since it resorts to the inverse of the distance to each known point ("amount of proximity") when assigning weights.
Definition of the problem
The expected result is a discrete assignment of the unknown function
in a study region:
:
where
is the study region.
The set of
known data points can be described as a list of
tuple
In mathematics, a tuple is a finite ordered list (sequence) of elements. An -tuple is a sequence (or ordered list) of elements, where is a non-negative integer. There is only one 0-tuple, referred to as ''the empty tuple''. An -tuple is defi ...
s:
:
The function is to be "smooth" (continuous and once differentiable), to be exact (
) and to meet the user's intuitive expectations about the phenomenon under investigation. Furthermore, the function should be suitable for a computer application at a reasonable cost (nowadays, a basic implementation will probably make use of
parallel resources).
Shepard's method
Historical reference
At the Harvard Laboratory for Computer Graphics and Spatial Analysis, beginning in 1965, a varied collection of scientists converged to rethink, among other things, what are now called
geographic information system
A geographic information system (GIS) is a type of database containing Geographic data and information, geographic data (that is, descriptions of phenomena for which location is relevant), combined with Geographic information system software, sof ...
s.
The motive force behind the Laboratory, Howard Fisher, conceived an improved computer mapping program that he called SYMAP, which, from the start, Fisher wanted to improve on the interpolation. He showed Harvard College freshmen his work on SYMAP, and many of them participated in Laboratory events. One freshman, Donald Shepard, decided to overhaul the interpolation in SYMAP, resulting in his famous article from 1968.
Shepard's algorithm was also influenced by the theoretical approach of
William Warntz and others at the Lab who worked with spatial analysis. He conducted a number of experiments with the exponent of distance, deciding on something closer to the gravity model (exponent of -2). Shepard implemented not just basic inverse distance weighting, but also allowed barriers (permeable and absolute) to interpolation.
Other research centers were working on interpolation at this time, particularly University of Kansas and their SURFACE II program. Still, the features of SYMAP were state-of-the-art, even though programmed by an undergraduate.
Basic form
Given a set of sample points
, the IDW interpolation function
is defined as:
:
where
:
is a simple IDW weighting function, as defined by Shepard,
[ x denotes an interpolated (arbitrary) point, x''i'' is an interpolating (known) point, is a given distance (]metric
Metric or metrical may refer to:
* Metric system, an internationally adopted decimal system of measurement
* An adjective indicating relation to measurement in general, or a noun describing a specific type of measurement
Mathematics
In mathema ...
operator) from the known point x''i'' to the unknown point x, ''N'' is the total number of known points used in interpolation and is a positive real number, called the power parameter.
Here weight decreases as distance increases from the interpolated points. Greater values of assign greater influence to values closest to the interpolated point, with the result turning into a mosaic of tiles (a Voronoi diagram
In mathematics, a Voronoi diagram is a partition of a plane into regions close to each of a given set of objects. In the simplest case, these objects are just finitely many points in the plane (called seeds, sites, or generators). For each seed th ...
) with nearly constant interpolated value for large values of ''p''. For two dimensions, power parameters cause the interpolated values to be dominated by points far away, since with a density of data points and neighboring points between distances to , the summed weight is approximately
:
which diverges for and . For ''M'' dimensions, the same argument holds for . For the choice of value for ''p'', one can consider the degree of smoothing desired in the interpolation, the density and distribution of samples being interpolated, and the maximum distance over which an individual sample is allowed to influence the surrounding ones.
''Shepard's method'' is a consequence of minimization of a functional related to a measure of deviations between tuple
In mathematics, a tuple is a finite ordered list (sequence) of elements. An -tuple is a sequence (or ordered list) of elements, where is a non-negative integer. There is only one 0-tuple, referred to as ''the empty tuple''. An -tuple is defi ...
s of interpolating points and ''i'' tuples of interpolated points , defined as:
:
derived from the minimizing condition:
:
The method can easily be extended to other dimensional spaces and it is in fact a generalization of Lagrange
approximation into a multidimensional spaces. A modified version of the algorithm designed for trivariate interpolation was developed by Robert J. Renka and is available in Netlib Netlib is a repository of software for scientific computing maintained by AT&T, Bell Laboratories, the University of Tennessee and Oak Ridge National Laboratory. Netlib comprises many separate programs and libraries. Most of the code is written in ...
as algorithm 661 in the toms library.
Example in 1 dimension
Modified Shepard's method
Another modification of Shepard's method calculates interpolated value using only nearest neighbors within ''R''-sphere (instead of full sample). Weights are slightly modified in this case:
:
When combined with fast spatial search structure (like kd-tree
In computer science, a ''k''-d tree (short for ''k-dimensional tree'') is a space-partitioning data structure for organizing points in a ''k''-dimensional space. ''k''-d trees are a useful data structure for several applications, such as search ...
), it becomes efficient ''N'' log ''N'' interpolation method suitable for large-scale problems.
See also
* Field (geography)
In the context of spatial analysis, geographic information systems, and geographic information science, a field is a property that fills space, and varies over space, such as temperature or density. This use of the term has been adopted from ph ...
* Gravity model Gravity models are used in various social sciences to predict and describe certain behaviors that mimic gravitational interaction as described in Isaac Newton's laws of gravity. Generally, the social science models contain some elements of mass and ...
* Kernel density estimation
In statistics, kernel density estimation (KDE) is the application of kernel smoothing for probability density estimation, i.e., a non-parametric method to estimate the probability density function of a random variable based on ''kernels'' as w ...
* Spatial analysis
Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early deve ...
* Tobler's first law of geography The First Law of Geography, according to Waldo Tobler, is "everything is related to everything else, but near things are more related than distant things." This first law is the foundation of the fundamental concepts of spatial dependence and spatia ...
* Tobler's second law of geography The second law of geography, according to Waldo Tobler, is "the phenomenon external to a geographic area of interest affects what goes on inside."
Background
Tobler's second law of geography, "the phenomenon external to a geographic area of inte ...
References
{{DEFAULTSORT:Inverse Distance Weighting
Geostatistics
Multivariate interpolation