
In theoretical computer science, smoothed analysis is a way of measuring the complexity of an algorithm. Since its introduction in 2001, smoothed analysis has been used as a basis for considerable research, for problems ranging from mathematical programming and numerical analysis to machine learning and data mining.
It can give a more realistic analysis of the practical performance (e.g., running time, success rate, approximation quality) of the algorithm compared to analysis that uses worst-case or average-case scenarios.
Smoothed analysis is a hybrid of worst-case and average-case analyses that inherits advantages of both. It measures the expected performance of algorithms under slight random perturbations of worst-case inputs. If the smoothed complexity of an algorithm is low, then it is unlikely that the algorithm will take a long time to solve practical instances whose data are subject to slight noises and imprecisions. Smoothed complexity results are strong probabilistic results, roughly stating that, in every large enough neighbourhood of the space of inputs, most inputs are easily solvable. Thus, a low smoothed complexity means that the hardness of inputs is a "brittle" property.
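Schematically, this idea can be written as a formula. Under the common convention of Gaussian perturbations of magnitude $\sigma$ applied to a norm-bounded input $x$ of size $n$ (the precise perturbation model varies from problem to problem), the smoothed complexity of an algorithm with running time $T$ is

$$C_{\text{smooth}}(n, \sigma) \;=\; \max_{x \in \mathbb{R}^n,\; \lVert x \rVert \le 1} \; \mathbb{E}_{g \sim \mathcal{N}(0, \sigma^2 I)}\!\left[ T(x + g) \right].$$

The maximization plays the role of the worst case, while the expectation averages over random perturbations of that worst-case input.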
Although worst-case complexity has been widely successful in explaining the practical performance of many algorithms, this style of analysis gives misleading results for a number of problems. Worst-case complexity measures the time it takes to solve any input, although hard-to-solve inputs might never come up in practice. In such cases, the worst-case running time can be much worse than the observed running time in practice. For example, the worst-case complexity of solving a linear program using the simplex algorithm is exponential, although the observed number of steps in practice is roughly linear. The simplex algorithm is in fact much faster than the ellipsoid method in practice, although the latter has polynomial-time worst-case complexity.
Average-case analysis was first introduced to overcome the limitations of worst-case analysis. However, the resulting average-case complexity depends heavily on the probability distribution that is chosen over the input. The actual inputs and distribution of inputs may be different in practice from the assumptions made during the analysis: a random input may be very unlike a typical input. Because of this choice of data model, a theoretical average-case result might say little about the practical performance of the algorithm.
Smoothed analysis generalizes both worst-case and average-case analysis and inherits strengths of both. It is intended to be much more general than average-case complexity, while still allowing low complexity bounds to be proven.
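In terms of the schematic definition above, the two classical measures arise as extreme cases: taking $\sigma = 0$ removes the perturbation and leaves only the maximization, i.e. the worst-case complexity, while dropping the maximization and treating the input as pure noise leaves an average-case complexity under that noise distribution:

$$C_{\text{smooth}}(n, 0) \;=\; \max_{\lVert x \rVert \le 1} T(x) \quad\text{(worst case)}, \qquad \mathbb{E}_{g \sim \mathcal{N}(0, \sigma^2 I)}\!\left[ T(g) \right] \quad\text{(average case under Gaussian inputs)}.$$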
History
ACM and the European Association for Theoretical Computer Science awarded the 2008 Gödel Prize to Daniel Spielman and Shang-Hua Teng for developing smoothed analysis. The name Smoothed Analysis was coined by Alan Edelman. In 2010 Spielman received the Nevanlinna Prize for developing smoothed analysis. Spielman and Teng's JACM paper "Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time" was also one of the three winners of the 2009 Fulkerson Prize sponsored jointly by the Mathematical Programming Society (MPS) and the American Mathematical Society (AMS).
Examples
Simplex algorithm for linear programming
The simplex algorithm is a very efficient algorithm in practice, and it is one of the dominant algorithms for linear programming in practice. On practical problems, the number of steps taken by the algorithm is linear in the number of variables and constraints. Yet in the theoretical worst case it takes exponentially many steps for most successfully analyzed pivot rules. This was one of the main motivations for developing smoothed analysis.
For the perturbation model, we assume that the input data is perturbed by noise from a Gaussian distribution. For normalization purposes, we assume the unperturbed data $\bar{A} \in \mathbb{R}^{n \times d}$, $\bar{b} \in \mathbb{R}^{n}$ satisfies $\lVert (\bar{a}_i, \bar{b}_i) \rVert_2 \le 1$ for all rows $(\bar{a}_i, \bar{b}_i)$ of the matrix $[\bar{A}, \bar{b}]$. The noise $(\hat{A}, \hat{b})$ has independent entries sampled from a Gaussian distribution with mean $0$ and standard deviation $\sigma$. We set $A = \bar{A} + \hat{A}$ and $b = \bar{b} + \hat{b}$. The smoothed input data consists of the linear program

maximize $c^\top x$
subject to $A x \le b$.

If the running time of our algorithm on data $(A, b, c)$ is given by $T(A, b, c)$, then the smoothed complexity of the simplex method is

$$C_s(n, d, \sigma) \;:=\; \max_{\bar{A}, \bar{b}, c} \; \mathbb{E}_{\hat{A}, \hat{b}}\!\left[ T(A, b, c) \right],$$

which Spielman and Teng showed to be bounded by a polynomial in $n$, $d$ and $1/\sigma$.
This bound holds for a specific pivot rule called the shadow vertex rule. The shadow vertex rule is slower than more commonly used pivot rules such as Dantzig's rule or the steepest edge rule, but it has properties that make it very well-suited to probabilistic analysis.
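The perturbation model above can also be explored numerically. The sketch below is an illustration of the model only, not of the shadow vertex analysis: SciPy's linprog uses the HiGHS simplex/dual-simplex code with its own pivoting, and the instance size, trial count, and box bounds on $x$ (included to keep every perturbed LP bounded) are arbitrary choices. It builds a base LP with the rows of $[\bar{A}, \bar{b}]$ normalized, adds Gaussian noise of standard deviation $\sigma$ to $A$ and $b$, and averages the solver's reported iteration counts.

```python
# Minimal sketch of the Gaussian perturbation model for linear programming.
# Assumptions: SciPy's HiGHS-based linprog (not the shadow vertex rule);
# arbitrary base instance, noise levels, and trial counts for illustration.
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
n, d = 60, 20                                   # number of constraints and variables

# Base data with each row of [A_bar, b_bar] normalized to unit norm.
M = np.c_[rng.standard_normal((n, d)), np.ones(n)]
M /= np.linalg.norm(M, axis=1, keepdims=True)
A_bar, b_bar = M[:, :d], M[:, d]
c = rng.standard_normal(d)                      # objective is left unperturbed here

def smoothed_trial(sigma):
    """Solve one Gaussian-perturbed copy of the base LP; return its iteration count."""
    A = A_bar + sigma * rng.standard_normal(A_bar.shape)
    b = b_bar + sigma * rng.standard_normal(b_bar.shape)
    # linprog minimizes, so negate c; the box bounds keep each perturbed LP bounded.
    res = linprog(-c, A_ub=A, b_ub=b, bounds=(0, 1), method="highs")
    return getattr(res, "nit", None) if res.status == 0 else None

for sigma in (0.01, 0.1, 0.5):
    iters = [it for it in (smoothed_trial(sigma) for _ in range(50)) if it is not None]
    if iters:
        print(f"sigma={sigma}: mean iterations over {len(iters)} solved trials "
              f"= {np.mean(iters):.1f}")
```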
Local search for combinatorial optimization
A number of local search algorithms have bad worst-case running times but perform well in practice. One example is the 2-opt heuristic for the traveling salesman problem. It can take exponentially many iterations until it finds a locally optimal solution, although in practice the running time is subquadratic in the number of vertices.
The approximation ratio, which is the ratio between the length of the output of the algorithm and the length of the optimal solution, tends to be good in practice but can also be bad in the theoretical worst case.
One class of problem instances can be given by $n$ points in the box $[0,1]^d$, where their pairwise distances come from a norm. Already in two dimensions, the 2-opt heuristic might take exponentially many iterations until finding a local optimum. In this setting, one can analyze the perturbation model where the vertices are independently sampled according to probability distributions with a given probability density function.
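In the uniform special case of this model (points drawn independently and uniformly from $[0,1]^2$, Euclidean distances), the 2-opt heuristic can be run directly. The short sketch below counts the improving exchanges performed before a local optimum is reached; the instance size and random seed are arbitrary choices for illustration.

```python
# Minimal sketch of the 2-opt heuristic on uniformly random points in [0, 1]^2
# (the uniform case of the perturbation model above); counts improving exchanges.
import numpy as np

rng = np.random.default_rng(1)
pts = rng.random((100, 2))                      # points drawn uniformly from [0, 1]^2

def tour_length(tour):
    return sum(np.linalg.norm(pts[tour[i]] - pts[tour[(i + 1) % len(tour)]])
               for i in range(len(tour)))

def two_opt(tour):
    """Apply improving 2-opt exchanges until a local optimum is reached."""
    tour = list(tour)
    m, steps, improved = len(tour), 0, True
    while improved:
        improved = False
        for i in range(m - 1):
            for j in range(i + 2, m):
                if i == 0 and j == m - 1:
                    continue                    # these two edges share a vertex
                a, b = pts[tour[i]], pts[tour[i + 1]]
                c, d = pts[tour[j]], pts[tour[(j + 1) % m]]
                old = np.linalg.norm(a - b) + np.linalg.norm(c - d)
                new = np.linalg.norm(a - c) + np.linalg.norm(b - d)
                if new < old - 1e-12:           # strictly improving exchange
                    tour[i + 1:j + 1] = reversed(tour[i + 1:j + 1])
                    steps += 1
                    improved = True
    return tour, steps

tour, steps = two_opt(range(len(pts)))
print(f"local optimum after {steps} improving 2-opt steps, "
      f"tour length {tour_length(tour):.3f}")
```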