Submodular Valuations
   HOME

TheInfoList



OR:

In mathematics, a submodular set function (also known as a submodular function) is a set function whose value, informally, has the property that the difference in the incremental value of the function that a single element makes when added to an input set decreases as the size of the input set increases. Submodular functions have a natural
diminishing returns In economics, diminishing returns are the decrease in marginal (incremental) output of a production process as the amount of a single factor of production is incrementally increased, holding all other factors of production equal ( ceteris paribu ...
property which makes them suitable for many applications, including
approximation algorithms In computer science and operations research, approximation algorithms are efficient algorithms that find approximate solutions to optimization problems (in particular NP-hard problems) with provable guarantees on the distance of the returned solut ...
,
game theory Game theory is the study of mathematical models of strategic interactions among rational agents. Myerson, Roger B. (1991). ''Game Theory: Analysis of Conflict,'' Harvard University Press, p.&nbs1 Chapter-preview links, ppvii–xi It has appli ...
(as functions modeling user preferences) and electrical networks. Recently, submodular functions have also found immense utility in several real world problems in
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
and
artificial intelligence Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech re ...
, including
automatic summarization Automatic summarization is the process of shortening a set of data computationally, to create a subset (a summary) that represents the most important or relevant information within the original content. Artificial intelligence algorithms are commo ...
,
multi-document summarization Multi-document summarization is an automatic procedure aimed at extraction of information from multiple texts written about the same topic. The resulting summary report allows individual users, such as professional information consumers, to quickl ...
, feature selection, active learning, sensor placement, image collection summarization and many other domains.


Definition

If \Omega is a finite
set Set, The Set, SET or SETS may refer to: Science, technology, and mathematics Mathematics *Set (mathematics), a collection of elements *Category of sets, the category whose objects and morphisms are sets and total functions, respectively Electro ...
, a submodular function is a set function f:2^\rightarrow \mathbb, where 2^\Omega denotes the power set of \Omega, which satisfies one of the following equivalent conditions. # For every X, Y \subseteq \Omega with X \subseteq Y and every x \in \Omega \setminus Y we have that f(X\cup \)-f(X)\geq f(Y\cup \)-f(Y). # For every S, T \subseteq \Omega we have that f(S)+f(T)\geq f(S\cup T)+f(S\cap T). # For every X\subseteq \Omega and x_1,x_2\in \Omega\backslash X such that x_1\neq x_2 we have that f(X\cup \)+f(X\cup \)\geq f(X\cup \)+f(X). A nonnegative submodular function is also a subadditive function, but a subadditive function need not be submodular. If \Omega is not assumed finite, then the above conditions are not equivalent. In particular a function f defined by f(S) = 1 if S is finite and f(S) = 0 if S is infinite satisfies the first condition above, but the second condition fails when S and T are infinite sets with finite intersection.


Types and examples of submodular functions


Monotone

A submodular function f is ''monotone'' if for every T\subseteq S we have that f(T)\leq f(S). Examples of monotone submodular functions include: ; Linear (Modular) functions : Any function of the form f(S)=\sum_w_i is called a linear function. Additionally if \forall i,w_i\geq 0 then f is monotone. ; Budget-additive functions : Any function of the form f(S)=\min\left\ for each w_i\geq 0 and B\geq 0 is called budget additive. ; Coverage functions : Let \Omega=\ be a collection of subsets of some ground set \Omega'. The function f(S)=\left, \bigcup_E_i\ for S\subseteq \Omega is called a coverage function. This can be generalized by adding non-negative weights to the elements. ;
Entropy Entropy is a scientific concept, as well as a measurable physical property, that is most commonly associated with a state of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynam ...
: Let \Omega=\ be a set of
random variables A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
. Then for any S\subseteq \Omega we have that H(S) is a submodular function, where H(S) is the entropy of the set of random variables S, a fact known as Shannon's inequality. Further inequalities for the entropy function are known to hold, see
entropic vector The entropic vector or entropic function is a concept arising in information theory. It represents the possible values of Shannon's information entropy that subsets of one set of random variables may take. Understanding which vectors are entropic ...
. ; Matroid rank functions : Let \Omega=\ be the ground set on which a matroid is defined. Then the rank function of the matroid is a submodular function.Fujishige (2005) p.22


Non-monotone

A submodular function that is not monotone is called ''non-monotone''.


Symmetric

A non-monotone submodular function f is called ''symmetric'' if for every S\subseteq \Omega we have that f(S)=f(\Omega-S). Examples of symmetric non-monotone submodular functions include: ; Graph cuts : Let \Omega=\ be the vertices of a graph. For any set of vertices S\subseteq \Omega let f(S) denote the number of edges e=(u,v) such that u\in S and v\in \Omega-S. This can be generalized by adding non-negative weights to the edges. ;
Mutual information In probability theory and information theory, the mutual information (MI) of two random variables is a measure of the mutual dependence between the two variables. More specifically, it quantifies the " amount of information" (in units such ...
: Let \Omega=\ be a set of
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
s. Then for any S\subseteq \Omega we have that f(S)=I(S;\Omega-S) is a submodular function, where I(S;\Omega-S) is the mutual information.


Asymmetric

A non-monotone submodular function which is not symmetric is called asymmetric. ; Directed cuts : Let \Omega=\ be the vertices of a directed graph. For any set of vertices S\subseteq \Omega let f(S) denote the number of edges e=(u,v) such that u\in S and v\in \Omega-S. This can be generalized by adding non-negative weights to the directed edges.


Continuous extensions


Definition

A set-valued function f:2^\rightarrow \mathbb with , \Omega, =n can also be represented as a function on \^, by associating each S\subseteq \Omega with a binary vector x^\in \^ such that x_^=1 when i\in S, and x_^=0 otherwise. The ''continuous
extension Extension, extend or extended may refer to: Mathematics Logic or set theory * Axiom of extensionality * Extensible cardinal * Extension (model theory) * Extension (predicate logic), the set of tuples of values that satisfy the predicate * E ...
'' of f is defined to be any continuous function F:
, 1 The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline (t ...
\rightarrow \mathbb such that it matches the value of f on x\in \^, i.e. F(x^)=f(S). In the context of submodular functions, there are a few examples of continuous extensions that are commonly used, which are described as follows.


Examples


Lovász extension

This extension is named after mathematician László Lovász. Consider any vector \mathbf=\ such that each 0\leq x_i\leq 1. Then the Lovász extension is defined as f^L(\mathbf)=\mathbb(f(\)) where the expectation is over \lambda chosen from the
uniform distribution Uniform distribution may refer to: * Continuous uniform distribution * Discrete uniform distribution * Uniform distribution (ecology) * Equidistributed sequence In mathematics, a sequence (''s''1, ''s''2, ''s''3, ...) of real numbers is said to be ...
on the interval ,1/math>. The Lovász extension is a convex function if and only if f is a submodular function.


Multilinear extension

Consider any vector \mathbf=\ such that each 0\leq x_i\leq 1. Then the multilinear extension is defined as F(\mathbf)=\sum_ f(S) \prod_ x_i \prod_ (1-x_i).


Convex closure

Consider any vector \mathbf=\ such that each 0\leq x_i\leq 1. Then the convex closure is defined as f^-(\mathbf)=\min\left(\sum_S \alpha_S f(S):\sum_S \alpha_S 1_S=\mathbf,\sum_S \alpha_S=1,\alpha_S\geq 0\right). The convex closure of any set function is convex over ,1n.


Concave closure

Consider any vector \mathbf=\ such that each 0\leq x_i\leq 1. Then the concave closure is defined as f^+(\mathbf)=\max\left(\sum_S \alpha_S f(S):\sum_S \alpha_S 1_S=\mathbf,\sum_S \alpha_S=1,\alpha_S\geq 0\right).


Connections between extensions

For the extensions discussed above, it can be shown that f^(\mathbf) \geq F(\mathbf) \geq f^(\mathbf)=f^L(\mathbf) when f is submodular.


Properties

# The class of submodular functions is
closed Closed may refer to: Mathematics * Closure (mathematics), a set, along with operations, for which applying those operations on members always results in a member of the set * Closed set, a set which contains all its limit points * Closed interval, ...
under non-negative linear combinations. Consider any submodular function f_1,f_2,\ldots,f_k and non-negative numbers \alpha_1,\alpha_2,\ldots,\alpha_k. Then the function g defined by g(S)=\sum_^k \alpha_i f_i(S) is submodular. #For any submodular function f, the function defined by g(S)=f(\Omega \setminus S) is submodular. #The function g(S)=\min(f(S),c), where c is a real number, is submodular whenever f is monotone submodular. More generally, g(S)=h(f(S)) is submodular, for any non decreasing concave function h. # Consider a random process where a set T is chosen with each element in \Omega being included in T independently with probability p. Then the following inequality is true \mathbb (T)geq p f(\Omega)+(1-p) f(\varnothing) where \varnothing is the empty set. More generally consider the following random process where a set S is constructed as follows. For each of 1\leq i\leq l, A_i\subseteq \Omega construct S_i by including each element in A_i independently into S_i with probability p_i. Furthermore let S=\cup_^l S_i. Then the following inequality is true \mathbb (S)geq \sum_ \Pi_p_i \Pi_(1-p_i)f(\cup_A_i).


Optimization problems

Submodular functions have properties which are very similar to convex and
concave function In mathematics, a concave function is the negative of a convex function. A concave function is also synonymously called concave downwards, concave down, convex upwards, convex cap, or upper convex. Definition A real-valued function f on an in ...
s. For this reason, an optimization problem which concerns optimizing a convex or concave function can also be described as the problem of maximizing or minimizing a submodular function subject to some constraints.


Submodular set function minimization

The hardness of minimizing a submodular set function depends on constraints imposed on the problem. # The unconstrained problem of minimizing a submodular function is computable in (strongly)
polynomial time In computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by ...
. Computing the
minimum cut In graph theory, a minimum cut or min-cut of a graph is a cut (a partition of the vertices of a graph into two disjoint subsets) that is minimal in some metric. Variations of the minimum cut problem consider weighted graphs, directed graphs, term ...
in a graph is a special case of this minimization problem. # The problem of minimizing a submodular function with a cardinality lower bound is
NP-hard In computational complexity theory, NP-hardness ( non-deterministic polynomial-time hardness) is the defining property of a class of problems that are informally "at least as hard as the hardest problems in NP". A simple example of an NP-hard pr ...
, with polynomial factor lower bounds on the approximation factor.


Submodular set function maximization

Unlike the case of minimization, maximizing a generic submodular function is
NP-hard In computational complexity theory, NP-hardness ( non-deterministic polynomial-time hardness) is the defining property of a class of problems that are informally "at least as hard as the hardest problems in NP". A simple example of an NP-hard pr ...
even in the unconstrained setting. Thus, most of the works in this field are concerned with polynomial-time approximation algorithms, including greedy algorithms or local search algorithms. # The problem of maximizing a non-negative submodular function admits a 1/2 approximation algorithm. Computing the
maximum cut For a graph, a maximum cut is a cut whose size is at least the size of any other cut. That is, it is a partition of the graph's vertices into two complementary sets and , such that the number of edges between and is as large as possible. Fin ...
of a graph is a special case of this problem. # The problem of maximizing a monotone submodular function subject to a cardinality constraint admits a 1 - 1/e approximation algorithm. The
maximum coverage problem The maximum coverage problem is a classical question in computer science, computational complexity theory, and operations research. It is a problem that is widely taught in approximation algorithms. As input you are given several sets and a number ...
is a special case of this problem. # The problem of maximizing a monotone submodular function subject to a matroid constraint (which subsumes the case above) also admits a 1 - 1/e approximation algorithm. Many of these algorithms can be unified within a semi-differential based framework of algorithms.


Related optimization problems

Apart from submodular minimization and maximization, there are several other natural optimization problems related to submodular functions. # The difference of submodular optimization problem is not only NP hard, but also inapproximable. # Minimization/maximization of a submodular function subject to a submodular level set constraint (also known as submodular optimization subject to submodular cover or submodular knapsack constraint) admits bounded approximation guarantees. # Partitioning data based on a submodular function to maximize the average welfare is known as the submodular welfare problem, which also admits bounded approximation guarantees.


Applications

Submodular functions naturally occur in several real world applications, in
economics Economics () is the social science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and intera ...
,
game theory Game theory is the study of mathematical models of strategic interactions among rational agents. Myerson, Roger B. (1991). ''Game Theory: Analysis of Conflict,'' Harvard University Press, p.&nbs1 Chapter-preview links, ppvii–xi It has appli ...
,
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
and
computer vision Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate tasks that the hum ...
. Owing to the diminishing returns property, submodular functions naturally model costs of items, since there is often a larger discount, with an increase in the items one buys. Submodular functions model notions of complexity, similarity and cooperation when they appear in minimization problems. In maximization problems, on the other hand, they model notions of diversity, information and coverage.


See also

*
Supermodular function In mathematics, a function :f\colon \mathbb^k \to \mathbb is supermodular if : f(x \uparrow y) + f(x \downarrow y) \geq f(x) + f(y) for all x, y \isin \mathbb^, where x \uparrow y denotes the componentwise maximum and x \downarrow y the componentw ...
* Matroid,
Polymatroid In mathematics, a polymatroid is a polytope associated with a submodular function. The notion was introduced by Jack Edmonds in 1970. It is also described as the multiset analogue of the matroid. Definition Let E be a finite set and f: 2^E\rig ...
*
Utility functions on indivisible goods Some branches of economics and game theory deal with indivisible goods, discrete items that can be traded only as a whole. For example, in combinatorial auctions there is a finite set of items, and every agent can buy a subset of the items, but an i ...


Citations


References

* * * * *{{citation , last=Oxley , first=James G. , title=Matroid theory , series=Oxford Science Publications , location=Oxford , publisher=
Oxford University Press Oxford University Press (OUP) is the university press of the University of Oxford. It is the largest university press in the world, and its printing history dates back to the 1480s. Having been officially granted the legal right to print books ...
, year=1992 , isbn=0-19-853563-5 , zbl=0784.05002


External links

* http://www.cs.berkeley.edu/~stefje/references.html has a longer bibliography * http://submodularity.org/ includes further material on the subject Matroid theory