Luus–Jaakola

In computational engineering, Luus–Jaakola (LJ) denotes a heuristic for global optimization of a real-valued function. In engineering use, LJ is not an algorithm that terminates with an optimal solution; nor is it an iterative method that generates a sequence of points that converges to an optimal solution (when one exists). However, when applied to a twice continuously differentiable function, the LJ heuristic is a proper iterative method that generates a sequence with a convergent subsequence. For this class of problems, Newton's method is recommended and enjoys a quadratic rate of convergence, while no convergence rate analysis has been given for the LJ heuristic.

In practice, the LJ heuristic has been recommended for functions that need be neither convex nor differentiable nor locally Lipschitz: it does not use a gradient or subgradient, even when one is available, which allows its application to non-differentiable and non-convex problems.

Proposed by Luus and Jaakola, LJ generates a sequence of iterates. The next iterate is selected from a sample drawn uniformly from a neighborhood of the current position. With each iteration, the neighborhood decreases, which forces a subsequence of iterates to converge to a cluster point.

Luus has applied LJ in optimal control, transformer design, metallurgical processes, and chemical engineering.


Motivation

At each step, the LJ heuristic maintains a box from which it samples points randomly, using a uniform distribution on the box. For a unimodal function, the probability of reducing the objective function decreases as the box approaches a minimum. The picture displays a one-dimensional example.
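
The effect can be made concrete with a small worked case. The setup is an assumption chosen for simplicity and is not taken from the source: take f(x) = |x| in one dimension, let the current position be x > 0, and let the trial point be y = x + a with a drawn uniformly from (−d, d). The trial improves on x exactly when −2x < a < 0, so:

```latex
\Pr\bigl[f(y) < f(x)\bigr]
  = \Pr\bigl[-2x < a < 0\bigr]
  = \frac{\min(2x,\, d)}{2d}
  = \begin{cases}
      \tfrac{1}{2},   & d \le 2x,\\[2pt]
      \dfrac{x}{d},   & d > 2x.
    \end{cases}
```

For a fixed range d, the success probability x/d falls toward zero as x approaches the minimizer at 0. Shrinking d until d ≤ 2x brings it back to 1/2, which is one way to see why the sampling range is reduced as the search proceeds.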


Heuristic

Let f: ℝ^n → ℝ be the fitness or cost function to be minimized. Let x ∈ ℝ^n designate a position or candidate solution in the search-space. The LJ heuristic iterates the following steps:
* Initialize x ~ U(b_lo, b_up) with a random uniform position in the search-space, where b_lo and b_up are the lower and upper boundaries, respectively.
* Set the initial sampling range to cover the entire search-space (or a part of it): d = b_up − b_lo
* Until a termination criterion is met (e.g. number of iterations performed, or adequate fitness reached), repeat the following:
** Pick a random vector a ~ U(−d, d)
** Add this to the current position x to create the new potential position y = x + a
** If f(y) < f(x), move to the new position by setting x = y; otherwise decrease the sampling range: d = 0.95 d
* Now x holds the best-found position.
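
The steps above can be sketched in Python roughly as follows. This is a minimal illustration, not a reference implementation: the function name `luus_jaakola`, the parameters `max_iter`, `shrink` and `seed`, and the choice of a fixed iteration count as the termination criterion are all assumptions made for the example. Trial points are not clipped to the bounds, since the description above does not say how out-of-range points should be handled.

```python
import random
from typing import Callable, List, Sequence, Tuple


def luus_jaakola(
    f: Callable[[Sequence[float]], float],
    b_lo: Sequence[float],
    b_up: Sequence[float],
    max_iter: int = 5000,   # assumed termination criterion: fixed budget
    shrink: float = 0.95,   # range reduction factor from the steps above
    seed: int = 0,          # hypothetical parameter, for reproducibility
) -> Tuple[List[float], float]:
    """Minimize f over the box [b_lo, b_up]; returns (x, f(x))."""
    rng = random.Random(seed)

    # Initialize x ~ U(b_lo, b_up).
    x = [rng.uniform(lo, up) for lo, up in zip(b_lo, b_up)]
    fx = f(x)

    # Initial sampling range d = b_up - b_lo.
    d = [up - lo for lo, up in zip(b_lo, b_up)]

    for _ in range(max_iter):
        # Pick a ~ U(-d, d) and form the candidate y = x + a.
        y = [xi + rng.uniform(-di, di) for xi, di in zip(x, d)]
        fy = f(y)
        if fy < fx:
            # Improvement: move to the new position.
            x, fx = y, fy
        else:
            # No improvement: decrease the sampling range.
            d = [shrink * di for di in d]

    return x, fx


if __name__ == "__main__":
    def sphere(v: Sequence[float]) -> float:
        return sum(t * t for t in v)

    best_x, best_f = luus_jaakola(sphere, [-5.0, -5.0], [5.0, 5.0])
    print(best_x, best_f)
```

Because a candidate is accepted only when it strictly improves the objective, the returned value is never worse than the value at the random starting point.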


Variations

Luus notes that the ARS (Adaptive Random Search) algorithms proposed to date differ in many respects:
* Procedure for generating random trial points.
* Number of internal loops (NIL, the number of random search points in each cycle).
* Number of cycles (NEL, the number of external loops).
* Contraction coefficient of the search region size. (Some example values are 0.95 to 0.60.)
** Whether the region reduction rate is the same for all variables or a different rate for each variable (called the M-LJ algorithm).
** Whether the region reduction rate is a constant or follows another distribution (e.g. Gaussian).
* Whether to incorporate a line search.
* Whether to consider constraints of the random points as acceptance criteria, or to incorporate a quadratic penalty.
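
To show how the NIL, NEL and contraction-coefficient choices fit together, here is one possible arrangement in Python. It is a sketch under stated assumptions rather than any published variant: the name `lj_cycles`, its parameters, and the decision to sample around the running best point and to contract the region once per cycle with a single constant rate are all choices made for this example.

```python
import random
from typing import Callable, List, Sequence, Tuple


def lj_cycles(
    f: Callable[[Sequence[float]], float],
    b_lo: Sequence[float],
    b_up: Sequence[float],
    nel: int = 100,             # number of external loops (cycles)
    nil: int = 50,              # random trial points per cycle
    contraction: float = 0.9,   # assumed constant rate, same for all variables
    seed: int = 0,              # hypothetical parameter, for reproducibility
) -> Tuple[List[float], float]:
    """Cycle-based random search with a shrinking region; returns (x, f(x))."""
    rng = random.Random(seed)
    x = [rng.uniform(lo, up) for lo, up in zip(b_lo, b_up)]
    fx = f(x)
    r = [up - lo for lo, up in zip(b_lo, b_up)]  # current region size

    for _ in range(nel):
        for _ in range(nil):
            # Trial point drawn uniformly around the best point so far.
            y = [xi + rng.uniform(-ri, ri) for xi, ri in zip(x, r)]
            fy = f(y)
            if fy < fx:
                x, fx = y, fy
        # Contract the search region once at the end of each cycle.
        r = [contraction * ri for ri in r]

    return x, fx
```

A per-variable reduction rate could be tried by replacing the single `contraction` value with one factor per coordinate; constraint handling and line searches are left out to keep the sketch short.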


Convergence

Nair gave a convergence analysis. For twice continuously differentiable functions, the LJ heuristic generates a sequence of iterates having a convergent subsequence. For this class of problems, Newton's method is the usual optimization method, and it has quadratic convergence (''regardless of the dimension'' of the space, which can be a Banach space, according to Kantorovich's analysis). However, according to the analysis of Yudin and Nemirovsky, the worst-case complexity of minimization on the class of unimodal functions grows exponentially in the dimension of the problem. The Yudin-Nemirovsky analysis implies that no method can be fast on high-dimensional problems that lack convexity:
"The catastrophic growth [in the number of iterations needed to reach an approximate solution of a given accuracy] as [the number of dimensions increases to infinity] shows that it is meaningless to pose the question of constructing universal methods of solving ... problems of any appreciable dimensionality 'generally'. It is interesting to note that the same [conclusion] holds for ... problems generated by uni-extremal [that is, unimodal] (but not convex) functions." Page 7 summarizes the later discussion of .
When applied to twice continuously differentiable problems, the LJ heuristic's rate of convergence decreases as the number of dimensions increases.


See also

* Random optimization is a related family of optimization methods that sample from general distributions, for example the uniform distribution.
* Random search is a related family of optimization methods that sample from general distributions, for example, a uniform distribution on the unit sphere.
* Pattern search methods are used on noisy observations, especially in response surface methodology in chemical engineering. They do not require users to program gradients or Hessians.

