HOME

TheInfoList



OR:

In
geometry Geometry (; ) is, with arithmetic, one of the oldest branches of mathematics. It is concerned with properties of space such as the distance, shape, size, and relative position of figures. A mathematician who works in the field of geometry is c ...
, the hyperplane separation theorem is a theorem about disjoint
convex set In geometry, a subset of a Euclidean space, or more generally an affine space over the reals, is convex if, given any two points in the subset, the subset contains the whole line segment that joins them. Equivalently, a convex set or a convex r ...
s in ''n''-dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
. There are several rather similar versions. In one version of the theorem, if both these sets are closed and at least one of them is
compact Compact as used in politics may refer broadly to a pact or treaty; in more specific cases it may refer to: * Interstate compact * Blood compact, an ancient ritual of the Philippines * Compact government, a type of colonial rule utilized in British ...
, then there is a
hyperplane In geometry, a hyperplane is a subspace whose dimension is one less than that of its ''ambient space''. For example, if a space is 3-dimensional then its hyperplanes are the 2-dimensional planes, while if the space is 2-dimensional, its hyper ...
in between them and even two parallel hyperplanes in between them separated by a gap. In another version, if both disjoint convex sets are open, then there is a hyperplane in between them, but not necessarily any gap. An axis which is orthogonal to a separating hyperplane is a separating axis, because the orthogonal projections of the convex bodies onto the axis are disjoint. The hyperplane separation theorem is due to
Hermann Minkowski Hermann Minkowski (; ; 22 June 1864 – 12 January 1909) was a German mathematician and professor at Königsberg, Zürich and Göttingen. He created and developed the geometry of numbers and used geometrical methods to solve problems in number t ...
. The Hahn–Banach separation theorem generalizes the result to
topological vector spaces In mathematics, a topological vector space (also called a linear topological space and commonly abbreviated TVS or t.v.s.) is one of the basic structures investigated in functional analysis. A topological vector space is a vector space that is als ...
. A related result is the supporting hyperplane theorem. In the context of
support-vector machine In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laboratorie ...
s, the ''optimally separating hyperplane'' or ''maximum-margin hyperplane'' is a
hyperplane In geometry, a hyperplane is a subspace whose dimension is one less than that of its ''ambient space''. For example, if a space is 3-dimensional then its hyperplanes are the 2-dimensional planes, while if the space is 2-dimensional, its hyper ...
which separates two
convex hull In geometry, the convex hull or convex envelope or convex closure of a shape is the smallest convex set that contains it. The convex hull may be defined either as the intersection of all convex sets containing a given subset of a Euclidean space ...
s of points and is
equidistant A point is said to be equidistant from a set of objects if the distances between that point and each object in the set are equal. In two-dimensional Euclidean geometry, the locus of points equidistant from two given (different) points is the ...
from the two.


Statements and proof

The proof is based on the following lemma: Proof of lemma: Let \delta = \inf \. Let x_j be a sequence in K such that , x_j, \to \delta. Note that (x_i + x_j)/2 is in K since K is convex and so , x_i + x_j, ^2 \ge 4 \delta^2. Since :, x_i - x_j, ^2 = 2, x_i, ^2 + 2, x_j, ^2 - , x_i + x_j, ^2 \le 2, x_i, ^2 + 2, x_j, ^2 - 4\delta^2 \to 0 as i, j \to \infty, x_i is a
Cauchy sequence In mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in m ...
and so has limit ''x'' in K. It is unique since if ''y'' is in K and has norm δ, then, x - y, ^2 \le 2, x, ^2 + 2, y, ^2 - 4\delta^2 = 0 and ''x'' = ''y''. \square Proof of theorem: Given disjoint nonempty convex sets ''A'', ''B'', let :K = A + (-B) = \. Since -B is convex and the sum of convex sets is convex, K is convex. By the lemma, the closure \overline of K, which is convex, contains a vector v of minimum norm. Since \overline is convex, for any n in K, the line segment :v + t(n - v), \, 0 \le t \le 1 lies in \overline and so :, v, ^2 \le , v + t(n - v), ^2 = , v, ^2 + 2 t \langle v, n - v \rangle + t^2, n-v, ^2. For 0 < t \le 1, we thus have: :0 \le 2 \langle v, n \rangle - 2 , v, ^2 + t, n-v, ^2 and letting t \to 0 gives: \langle n, v \rangle \ge , v, ^2. Hence, for any x in ''A'' and ''y'' in ''B'', we have: \langle x - y, v \rangle \ge , v, ^2. Thus, if ''v'' is nonzero, the proof is complete since :\inf_ \langle x, v \rangle \ge , v, ^2 + \sup_ \langle y, v \rangle. More generally (covering the case ''v'' = 0), let us first take the case when the interior of K is nonempty. The interior can be exhausted by a nested sequence of nonempty compact convex subsets K_1\subset K_2\subset K_3\subset\cdots (namely, put K_j \equiv j,jn \cap \ ). Since 0 is not in K, each K_n contains a nonzero vector v_n of minimum length and by the argument in the early part, we have: \langle x, v_n \rangle \ge 0 for any x \in K_n. We can normalize the v_n's to have length one. Then the sequence v_n contains a convergent subsequence (because the
n-sphere In mathematics, an -sphere or a hypersphere is a topological space that is homeomorphic to a ''standard'' -''sphere'', which is the set of points in -dimensional Euclidean space that are situated at a constant distance from a fixed point, cal ...
is compact) with limit ''v'', which is nonzero. We have \langle x, v \rangle \ge 0 for any ''x'' in the interior of K and by continuity the same holds for all ''x'' in K. We now finish the proof as before. Finally, if K has empty interior, the
affine set In mathematics, an affine space is a geometric structure that generalizes some of the properties of Euclidean spaces in such a way that these are independent of the concepts of distance and measure of angles, keeping only the properties related ...
that it spans has dimension less than that of the whole space. Consequently K is contained in some hyperplane \langle \cdot, v \rangle = c; thus, \langle x, v \rangle \ge c for all ''x'' in K and we finish the proof as before. \square The number of dimensions must be finite. In infinite-dimensional spaces there are examples of two closed, convex, disjoint sets which cannot be separated by a closed hyperplane (a hyperplane where a ''continuous'' linear functional equals some constant) even in the weak sense where the inequalities are not strict. The above proof also proves the first version of the theorem mentioned in the lede (to see it, note that K in the proof is closed under the hypothesis of the theorem below.) Here, the compactness in the hypothesis cannot be relaxed; see an example in the next section. This version of the separation theorem does generalize to infinite-dimension; the generalization is more commonly known as the Hahn–Banach separation theorem. We also have: This follows from the standard version since the separating hyperplane cannot intersect the interiors of the convex sets.


Converse of theorem

Note that the existence of a hyperplane that only "separates" two convex sets in the weak sense of both inequalities being non-strict obviously does not imply that the two sets are disjoint. Both sets could have points located on the hyperplane.


Counterexamples and uniqueness

If one of ''A'' or ''B'' is not convex, then there are many possible counterexamples. For example, ''A'' and ''B'' could be concentric circles. A more subtle counterexample is one in which ''A'' and ''B'' are both closed but neither one is compact. For example, if ''A'' is a closed half plane and B is bounded by one arm of a hyperbola, then there is no strictly separating hyperplane: :A = \ :B = \.\ (Although, by an instance of the second theorem, there is a hyperplane that separates their interiors.) Another type of counterexample has ''A'' compact and ''B'' open. For example, A can be a closed square and B can be an open square that touches ''A''. In the first version of the theorem, evidently the separating hyperplane is never unique. In the second version, it may or may not be unique. Technically a separating axis is never unique because it can be translated; in the second version of the theorem, a separating axis can be unique up to translation.


Use in collision detection

The separating axis theorem (SAT) says that: Two convex objects do not overlap if there exists a line (called axis) onto which the two objects' projections do not overlap. SAT suggests an algorithm for testing whether two convex solids intersect or not. Regardless of dimensionality, the separating axis is always a line. For example, in 3D, the space is separated by planes, but the separating axis is perpendicular to the separating plane. The separating axis theorem can be applied for fast
collision detection Collision detection is the computational problem of detecting the intersection (Euclidean geometry), intersection of two or more objects. Collision detection is a classic issue of computational geometry and has applications in various computing ...
between polygon meshes. Each
face The face is the front of an animal's head that features the eyes, nose and mouth, and through which animals express many of their emotions. The face is crucial for human identity, and damage such as scarring or developmental deformities may aff ...
's
normal Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...
or other feature direction is used as a separating axis. Note that this yields possible separating axes, not separating lines/planes. In 3D, using face normals alone will fail to separate some edge-on-edge non-colliding cases. Additional axes, consisting of the cross-products of pairs of edges, one taken from each object, are required. For increased efficiency, parallel axes may be calculated as a single axis.


See also

*
Dual cone Dual or Duals may refer to: Paired/two things * Dual (mathematics), a notion of paired concepts that mirror one another ** Dual (category theory), a formalization of mathematical duality *** see more cases in :Duality theories * Dual (grammatica ...
* Farkas's lemma *
Kirchberger's theorem Kirchberger's theorem is a theorem in discrete geometry, on linear separability. The two-dimensional version of the theorem states that, if a finite set of red and blue points in the Euclidean plane has the property that, for every four points, the ...
*
Optimal control Optimal control theory is a branch of mathematical optimization that deals with finding a control for a dynamical system over a period of time such that an objective function is optimized. It has numerous applications in science, engineering and ...


Notes


References

* * *


External links


Collision detection and response
{{DEFAULTSORT:Separating Axis Theorem Theorems in convex geometry Hermann Minkowski fr:Séparation des convexes