image processing An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimension ...

and

computer vision Computer vision is an Interdisciplinarity, interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate t ...

, a

scale space Scale-space theory is a framework for multi-scale signal representation developed by the computer vision, image processing and signal processing communities with complementary motivations from physics and biological vision. It is a formal theor ...

framework can be used to represent an image as a family of gradually smoothed images. This framework is very general and a variety of scale space representations exist. A typical approach for choosing a particular type of scale space representation is to establish a set of scale-space axioms, describing basic properties of the desired scale-space representation and often chosen so as to make the representation useful in practical applications. Once established, the axioms narrow the possible scale-space representations to a smaller class, typically with only a few free parameters. A set of standard scale space axioms, discussed below, leads to the linear Gaussian scale-space, which is the most common type of scale space used in image processing and computer vision.

Scale space axioms for the linear scale-space representation

The linear

representation

L(x, y, t) = (T_t f)(x, y) = g(x, y, t)*f(x, y)

of signal

f(x, y)

obtained by smoothing with the Gaussian kernel

g(x, y, t)

satisfies a number of properties 'scale-space axioms' that make it a special form of multi-scale representation: ;linearity :

T_t(a f + b h) = a T_t f + b T_t h

:where

f

and

h

are signals while

a

and

b

are constants, ;shift invariance :

T_t S_ f = S_ T_t f

:where

S_

denotes the shift (translation) operator

(S_ f)(x, y) = f(x-\Delta x, y - \Delta y)

;semi-group structure :

g(x, y, t_1) * g(x, y, t_2) = g(x, y, t_1 + t_2)

:with the associated ''cascade smoothing property'' :

L(x, y, t_2) = g(x, y, t_2 - t_1) * L(x, y, t_1)

;existence of an ''infinitesimal generator''

A

\partial_t L(x, y, t) =  (A L)(x, y, t)

;''non-creation of local extrema'' (zero-crossings) in one dimension, ;''non-enhancement of local extrema'' in any number of dimensions :

\partial_t L(x, y, t) \leq 0

at spatial maxima and

\partial_t L(x, y, t) \geq 0

at spatial minima, ;rotational symmetry :

g(x, y, t) = h(x^2+y^2, t)

for some function

h

, ;scale invariance :

\hat(\omega_x, \omega_y, t) = \hat(\frac, \frac)

:for some functions

\varphi

and

\hat

where

\hat

denotes the Fourier transform of

g

, ;positivity :

g(x, y, t) \geq 0

, ;normalization :

\int_^ \int_^{\infty} g(x, y, t) \, dx \, dy = 1

. In fact, it can be shown that the Gaussian kernel is a ''unique choice'' given several different combinations of subsets of these scale-space axioms:Lindeberg, T., "Scale-space for discrete signals," PAMI(12), No. 3, March 1990, pp. 234–254.
/ref>Lindeberg, T.: On the axiomatic foundations of linear scale-space: Combining semi-group structure with causality vs. scale invariance. In: J. Sporring et al. (eds.) Gaussian Scale-Space Theory: Proc. PhD School on Scale-Space Theory, (Copenhagen, Denmark, May 1996), pages 75–98, Kluwer Academic Publishers, 1997.
/ref>Lindeberg, T. Generalized Gaussian scale-space axiomatics comprising linear scale-space, affine scale-space and spatio-temporal scale-space, Journal of Mathematical Imaging and Vision, Volume 40, Number 1, 36-81, 2011.
/ref>Lindeberg, T. Generalized axiomatic scale-space theory'', Advances in Imaging and Electron Physics, Elsevier, volume 178, pages 1-96, 2013.
/ref> most of the axioms (linearity, shift-invariance, semigroup) correspond to scaling being a semigroup of shift-invariant linear operator, which is satisfied by a number of families

integral transforms In mathematics, an integral transform maps a function from its original function space into another function space via integration, where some of the properties of the original function might be more easily characterized and manipulated than i ...

, while "non-creation of local extrema" for one-dimensional signals or "non-enhancement of local extrema" for higher-dimensional signals are the crucial axioms which relate scale-spaces to smoothing (formally,

parabolic partial differential equation A parabolic partial differential equation is a type of partial differential equation (PDE). Parabolic PDEs are used to describe a wide variety of time-dependent phenomena, including heat conduction, particle diffusion, and pricing of derivati ...

s), and hence select for the Gaussian. The Gaussian kernel is also separable in Cartesian coordinates, i.e.

g(x, y, t) = g(x, t) \, g(y, t)

. Separability is, however, not counted as a scale-space axiom, since it is a coordinate dependent property related to issues of implementation. In addition, the requirement of separability in combination with rotational symmetry per se fixates the smoothing kernel to be a Gaussian. There exists a generalization of the Gaussian scale-space theory to more general affine and spatio-temporal scale-spaces. In addition to variabilities over scale, which original scale-space theory was designed to handle, this ''generalized scale-space theory'' also comprises other types of variabilities, including image deformations caused by viewing variations, approximated by local

affine transformation In Euclidean geometry, an affine transformation or affinity (from the Latin, ''affinis'', "connected with") is a geometric transformation that preserves lines and parallelism, but not necessarily Euclidean distances and angles. More generall ...

s, and relative motions between objects in the world and the observer, approximated by local

Galilean transformation In physics, a Galilean transformation is used to transform between the coordinates of two reference frames which differ only by constant relative motion within the constructs of Newtonian physics. These transformations together with spatial rotati ...

s. In this theory, rotational symmetry is not imposed as a necessary scale-space axiom and is instead replaced by requirements of affine and/or Galilean covariance. The generalized scale-space theory leads to predictions about receptive field profiles in good qualitative agreement with receptive field profiles measured by cell recordings in biological vision.Lindeberg, T. A computational theory of visual receptive fields, Biological Cybernetics, 107(6): 589-635, 2013.
/ref>T. Lindeberg "Normative theory of visual receptive fields", Heliyon 7(1):e05897, 2021.
/ref> In the

and

signal processing Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing '' signals'', such as sound, images, and scientific measurements. Signal processing techniques are used to optimize transmissions, ...

literature there are many other multi-scale approaches, using

wavelets A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the num ...

and a variety of other kernels, that do not exploit or require the same requirements as

descriptions do; please see the article on related

multi-scale approaches The scale space representation of a signal obtained by Gaussian smoothing satisfies a number of special properties, scale-space axioms, which make it into a special form of multi-scale representation. There are, however, also other types of "mult ...

. There has also been work on discrete scale-space concepts that carry the scale-space properties over to the discrete domain; see the article on

scale space implementation In the areas of computer vision, image analysis and signal processing, the notion of scale-space representation is used for processing measurement data at multiple scales, and specifically enhance or suppress image features over different ranges o ...

for examples and references.

References

Image processing Computer vision

Scale space axioms for the linear scale-space representation

See also

References