Legendre–Fenchel Transformation
   HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and
mathematical optimization Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...
, the convex conjugate of a function is a generalization of the
Legendre transformation In mathematics, the Legendre transformation (or Legendre transform), first introduced by Adrien-Marie Legendre in 1787 when studying the minimal surface problem, is an involutive transformation on real-valued functions that are convex on a rea ...
which applies to non-convex functions. It is also known as Legendre–Fenchel transformation, Fenchel transformation, or Fenchel conjugate (after
Adrien-Marie Legendre Adrien-Marie Legendre (; ; 18 September 1752 – 9 January 1833) was a French people, French mathematician who made numerous contributions to mathematics. Well-known and important concepts such as the Legendre polynomials and Legendre transforma ...
and
Werner Fenchel Moritz Werner Fenchel (; 3 May 1905 – 24 January 1988) was a German-Danish mathematician known for his contributions to geometry and to optimization theory. Fenchel established the basic results of convex analysis and nonlinear opti ...
). The convex conjugate is widely used for constructing the
dual problem In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then th ...
in
optimization theory Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...
, thus generalizing
Lagrangian duality In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then th ...
.


Definition

Let X be a real
topological vector space In mathematics, a topological vector space (also called a linear topological space and commonly abbreviated TVS or t.v.s.) is one of the basic structures investigated in functional analysis. A topological vector space is a vector space that is als ...
and let X^ be the
dual space In mathematics, any vector space ''V'' has a corresponding dual vector space (or just dual space for short) consisting of all linear forms on ''V,'' together with the vector space structure of pointwise addition and scalar multiplication by cons ...
to X. Denote by :\langle \cdot , \cdot \rangle : X^ \times X \to \mathbb the canonical
dual pair Dual or Duals may refer to: Paired/two things * Dual (mathematics), a notion of paired concepts that mirror one another ** Dual (category theory), a formalization of mathematical duality *** see more cases in :Duality theories * Dual number, a ...
ing, which is defined by \left\langle x^*, x \right\rangle \mapsto x^* (x). For a function f : X \to \mathbb \cup \ taking values on the
extended real number line In mathematics, the extended real number system is obtained from the real number system \R by adding two elements denoted +\infty and -\infty that are respectively greater and lower than every real number. This allows for treating the potential ...
, its is the function :f^ : X^ \to \mathbb \cup \ whose value at x^* \in X^ is defined to be the
supremum In mathematics, the infimum (abbreviated inf; : infima) of a subset S of a partially ordered set P is the greatest element in P that is less than or equal to each element of S, if such an element exists. If the infimum of S exists, it is unique, ...
: :f^ \left( x^ \right) := \sup \left\, or, equivalently, in terms of the
infimum In mathematics, the infimum (abbreviated inf; : infima) of a subset S of a partially ordered set P is the greatest element in P that is less than or equal to each element of S, if such an element exists. If the infimum of S exists, it is unique ...
: :f^ \left( x^ \right) := - \inf \left\. This definition can be interpreted as an encoding of the
convex hull In geometry, the convex hull, convex envelope or convex closure of a shape is the smallest convex set that contains it. The convex hull may be defined either as the intersection of all convex sets containing a given subset of a Euclidean space, ...
of the function's epigraph in terms of its
supporting hyperplane In geometry, a supporting hyperplane of a Set (mathematics), set S in Euclidean space \mathbb R^n is a hyperplane that has both of the following two properties: * S is entirely contained in one of the two closed set, closed Half-space (geometry), h ...
s.


Examples

For more examples, see . * The convex conjugate of an
affine function In Euclidean geometry, an affine transformation or affinity (from the Latin, ''wikt:affine, affinis'', "connected with") is a geometric transformation that preserves line (geometry), lines and parallel (geometry), parallelism, but not necessarily ...
f(x) = \left\langle a, x \right\rangle - b is f^\left(x^ \right) = \begin b, & x^ = a \\ +\infty, & x^ \ne a. \end * The convex conjugate of a
power function In mathematics, exponentiation, denoted , is an operation involving two numbers: the ''base'', , and the ''exponent'' or ''power'', . When is a positive integer, exponentiation corresponds to repeated multiplication of the base: that is, i ...
f(x) = \frac, x, ^p, 1 < p < \infty is f^\left(x^ \right) = \frac, x^, ^q, 1 * The convex conjugate of the
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...
function f(x) = \left, x \ is f^\left(x^ \right) = \begin 0, & \left, x^ \ \le 1 \\ \infty, & \left, x^ \ > 1. \end * The convex conjugate of the exponential function f(x)= e^x is f^\left(x^ \right) = \begin x^ \ln x^ - x^ , & x^ > 0 \\ 0 , & x^ = 0 \\ \infty , & x^ < 0. \end The convex conjugate and Legendre transform of the exponential function agree except that the
domain A domain is a geographic area controlled by a single person or organization. Domain may also refer to: Law and human geography * Demesne, in English common law and other Medieval European contexts, lands directly managed by their holder rather ...
of the convex conjugate is strictly larger as the Legendre transform is only defined for
positive real numbers In mathematics, the set of positive real numbers, \R_ = \left\, is the subset of those real numbers that are greater than zero. The non-negative real numbers, \R_ = \left\, also include zero. Although the symbols \R_ and \R^ are ambiguously used fo ...
.


Connection with expected shortfall (average value at risk)

Se
this article for example.
Let ''F'' denote a
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...
of a
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
 ''X''. Then ( integrating by parts), f(x):= \int_^x F(u) \, du = \operatorname\left max(0,x-X)\right= x-\operatorname \left min(x,X)\right/math> has the convex conjugate f^(p)= \int_0^p F^(q) \, dq = (p-1)F^(p)+\operatorname\left min(F^(p),X)\right = p F^(p)-\operatorname\left max(0,F^(p)-X)\right


Ordering

A particular interpretation has the transform f^\text(x):= \arg \sup_t t\cdot x-\int_0^1 \max\ \, du, as this is a nondecreasing rearrangement of the initial function ''f''; in particular, f^\text= f for ''f'' nondecreasing.


Properties

The convex conjugate of a closed convex function is again a closed convex function. The convex conjugate of a polyhedral convex function (a convex function with
polyhedral In geometry, a polyhedron (: polyhedra or polyhedrons; ) is a three-dimensional figure with flat polygonal faces, straight edges and sharp corners or vertices. The term "polyhedron" may refer either to a solid figure or to its boundary surfa ...
epigraph) is again a polyhedral convex function.


Order reversing

Declare that f \le g if and only if f(x) \le g(x) for all x. Then convex-conjugation is order-reversing, which by definition means that if f \le g then f^* \ge g^*. For a family of functions \left(f_\alpha\right)_\alpha it follows from the fact that supremums may be interchanged that :\left(\inf_\alpha f_\alpha\right)^*(x^*) = \sup_\alpha f_\alpha^*(x^*), and from the max–min inequality that :\left(\sup_\alpha f_\alpha\right)^*(x^*) \le \inf_\alpha f_\alpha^*(x^*).


Biconjugate

The convex conjugate of a function is always
lower semi-continuous In mathematical analysis, semicontinuity (or semi-continuity) is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function f is upper (respectively, lower) semicontinuous at a point x_0 if, r ...
. The biconjugate f^ (the convex conjugate of the convex conjugate) is also the closed convex hull, i.e. the largest
lower semi-continuous In mathematical analysis, semicontinuity (or semi-continuity) is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function f is upper (respectively, lower) semicontinuous at a point x_0 if, r ...
convex function with f^ \le f. For proper functions f, :f = f^
if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
f is convex and lower semi-continuous, by the Fenchel–Moreau theorem.


Fenchel's inequality

For any function and its convex conjugate , Fenchel's inequality (also known as the Fenchel–Young inequality) holds for every x \in X and :\left\langle p,x \right\rangle \le f(x) + f^*(p). Furthermore, the equality holds only when p \in \partial f(x). The proof follows from the definition of convex conjugate: f^*(p) = \sup_ \left\ \ge \langle p,x \rangle - f(x).


Convexity

For two functions f_0 and f_1 and a number 0 \le \lambda \le 1 the convexity relation :\left((1-\lambda) f_0 + \lambda f_1\right)^ \le (1-\lambda) f_0^ + \lambda f_1^ holds. The operation is a convex mapping itself.


Infimal convolution

The infimal convolution (or epi-sum) of two functions f and g is defined as :\left( f \operatorname g \right)(x) = \inf \left\. Let f_1, \ldots, f_ be proper, convex and
lower semicontinuous In mathematical analysis, semicontinuity (or semi-continuity) is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function f is upper (respectively, lower) semicontinuous at a point x_0 if, r ...
functions on \mathbb^. Then the infimal convolution is convex and lower semicontinuous (but not necessarily proper), and satisfies :\left( f_1 \operatorname \cdots \operatorname f_m \right)^ = f_1^ + \cdots + f_m^. The infimal convolution of two functions has a geometric interpretation: The (strict) epigraph of the infimal convolution of two functions is the
Minkowski sum In geometry, the Minkowski sum of two sets of position vectors ''A'' and ''B'' in Euclidean space is formed by adding each vector in ''A'' to each vector in ''B'': A + B = \ The Minkowski difference (also ''Minkowski subtraction'', ''Minkowsk ...
of the (strict) epigraphs of those functions.


Maximizing argument

If the function f is differentiable, then its derivative is the maximizing argument in the computation of the convex conjugate: :f^\prime(x) = x^*(x):= \arg\sup_ -f^\left( x^ \right) and :f^\left( x^ \right) = x\left( x^ \right):= \arg\sup_x - f(x); hence :x = \nabla f^\left( \nabla f(x) \right), :x^ = \nabla f\left( \nabla f^\left( x^ \right)\right), and moreover :f^(x) \cdot f^\left( x^(x) \right) = 1, :f^\left( x^ \right) \cdot f^\left( x(x^) \right) = 1.


Scaling properties

If for some \gamma>0, g(x) = \alpha + \beta x + \gamma \cdot f\left( \lambda x + \delta \right), then :g^\left( x^ \right)= - \alpha - \delta\frac \lambda + \gamma \cdot f^\left(\frac \right).


Behavior under linear transformations

Let A : X \to Y be a
bounded linear operator In functional analysis and operator theory, a bounded linear operator is a linear transformation L : X \to Y between topological vector spaces (TVSs) X and Y that maps bounded subsets of X to bounded subsets of Y. If X and Y are normed vector ...
. For any convex function f on X, :\left(A f\right)^ = f^ A^ where :(A f)(y) = \inf\ is the preimage of f with respect to A and A^ is the
adjoint operator In mathematics, specifically in operator theory, each linear operator A on an inner product space defines a Hermitian adjoint (or adjoint) operator A^* on that space according to the rule :\langle Ax,y \rangle = \langle x,A^*y \rangle, where \l ...
of A. A closed convex function f is symmetric with respect to a given set G of orthogonal linear transformations, :f(A x) = f(x) for all x and all A \in G if and only if its convex conjugate f^ is symmetric with respect to G.


Table of selected convex conjugates

The following table provides Legendre transforms for many common functions as well as a few useful properties.


See also

*
Dual problem In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then th ...
* Fenchel's duality theorem *
Legendre transformation In mathematics, the Legendre transformation (or Legendre transform), first introduced by Adrien-Marie Legendre in 1787 when studying the minimal surface problem, is an involutive transformation on real-valued functions that are convex on a rea ...
*
Young's inequality for products In mathematics, Young's inequality for products is a mathematical inequality about the product of two numbers. The inequality is named after William Henry Young and should not be confused with Young's convolution inequality. Young's inequality ...


References

* * *


Further reading

* * *

(271 pages) *

(24 pages) {{Convex analysis and variational analysis Convex analysis Duality theories Theorems involving convexity Transforms