Point pattern analysis (PPA) is the study of ''point patterns'', the spatial arrangements of points in space (usually
2-dimensional space). The simplest formulation is a set ''X'' = where ''D'', which can be called the 'study region,' is a subset of R
''n'', a
''n''-dimensional Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
.
Description
The easiest way to visualize a 2-D point pattern is a map of the locations, which is simply a scatterplot but with the provision that the axes are equally scaled. If D is not the boundary of the map then it should also be indicated. An empirical definition of D would be the convex hull of the points, or at least their bounding box, a matrix of the ranges of the coordinates. Another straightforward way to visualize the points is a 2D histogram (sometimes called a quadrats) that bins the points into rectangular regions. A benefit of quadrat analysis is that it forces the analysis to take into account possible scales within which statistically significant inhomogeneities may be occurring.
Modeling
The null model for point patterns is
complete spatial randomness
Complete spatial randomness (CSR) describes a point process whereby point events occur within a given study area in a completely random fashion. It is synonymous with a ''homogeneous spatial Poisson process''.O. Maimon, L. Rokach, ''Data Mining and ...
(CSR), modeled as a
Poisson process
In probability, statistics and related fields, a Poisson point process is a type of random mathematical object that consists of points randomly located on a mathematical space with the essential feature that the points occur independently of one ...
in R
''n'', which implies that the number of points in any arbitrary region ''A'' in ''D'' will be proportional to the area or volume of ''A''. Exploring models is generally iterative: if CSR is accepted not much more can be said, but if rejected, there are two avenues. First, one must decide which models are worth exploring, such as investigations of clustering, density, trends, etc. And for each of these models there are appropriate scale ranges, from the finest, which essentially mirrors the point pattern, to the coarsest, which aggregates ''D''. It is generally interesting to explore a range of scales within these limits.
A particularly robust model of clustered point patterns is
diffusion
Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical p ...
, which can also be thought of as the trajectory of a point doing a
random walk
In mathematics, a random walk is a random process that describes a path that consists of a succession of random steps on some mathematical space.
An elementary example of a random walk is the random walk on the integer number line \mathbb Z ...
.
Estimation
A fundamental problem of PPA is inferring whether a given arrangement is merely random or the result of some process. The picture illustrates patterns of 256 points using four point processes. The clustered process results in all points having the same location. Popular models are those based on simple
circle
A circle is a shape consisting of all points in a plane that are at a given distance from a given point, the centre. Equivalently, it is the curve traced out by a point that moves in a plane so that its distance from a given point is const ...
s and
ellipse
In mathematics, an ellipse is a plane curve surrounding two focus (geometry), focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special ty ...
s, inter-point (and especially nearest neighbor) distances, quadrats, and intensity functions. Each model yields estimates (that can increase insights into the underlying real-world processes) as well as associated
goodness-of-fit
The goodness of fit of a statistical model describes how well it fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measure ...
diagnostics.
Applications
PPA has applications in a wide range of areas, including astronomy, archaeology,
geography, ecology, biology, and epidemiology. A few topics in the last area are discussed here.
# A
case control study
Case or CASE may refer to:
Containers
* Case (goods), a package of related merchandise
* Cartridge case or casing, a firearm cartridge component
* Bookcase, a piece of furniture used to store books
* Briefcase or attaché case, a narrow box to c ...
compares the point patterns of organisms both with and without some condition to determine if there were significant differences in their arrangements.
# Environmental exposure examines the locations of cases and possible sources (e.g. of pollution or carcinogens).
#
Contagion explores the temporal unfolding of the pattern, asking about such phenomena as the location of the 'index case.'
# Examination of
infection
An infection is the invasion of tissues by pathogens, their multiplication, and the reaction of host tissues to the infectious agent and the toxins they produce. An infectious disease, also known as a transmissible disease or communicable dise ...
compares the arrangements of parasites and hosts (predators and prey, agents and organisms).
# Analysis of the regularity of
retinal mosaic
Retinal mosaic is the name given to the distribution of any particular type of neuron across any particular layer in the retina. Typically such distributions are somewhat regular; it is thought that this is so that each part of the retina is served ...
s, particularly as a quantitative tool to understand development of the
retina
The retina (from la, rete "net") is the innermost, light-sensitive layer of tissue of the eye of most vertebrates and some molluscs. The optics of the eye create a focused two-dimensional image of the visual world on the retina, which then ...
.
References
Further reading
* Cressie, N. A. C. and C. K. Wikle (2011) ''Statistics for spatio-temporal data''. Hoboken, N.J., Wiley. {{ISBN, 978-0-471-69274-4
Spatial analysis