Obnoxious Facility Location
   HOME

TheInfoList



OR:

The study of facility location problems (FLP), also known as location analysis, is a branch of
operations research Operations research ( en-GB, operational research) (U.S. Air Force Specialty Code: Operations Analysis), often shortened to the initialism OR, is a discipline that deals with the development and application of analytical methods to improve deci ...
and
computational geometry Computational geometry is a branch of computer science devoted to the study of algorithms which can be stated in terms of geometry. Some purely geometrical problems arise out of the study of computational geometric algorithms, and such problems ar ...
concerned with the optimal placement of facilities to minimize transportation costs while considering factors like avoiding placing hazardous materials near housing, and competitors' facilities. The techniques also apply to
cluster analysis Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of ...
.


Minimum facility location

A simple facility location problem is the
Weber problem In geometry, the Weber problem, named after Alfred Weber, is one of the most famous problems in location theory. It requires finding a point in the plane that minimizes the sum of the transportation costs from this point to ''n'' destination point ...
, in which a single facility is to be placed, with the only optimization criterion being the minimization of the weighted sum of distances from a given set of point sites. More complex problems considered in this discipline include the placement of multiple facilities, constraints on the locations of facilities, and more complex optimization criteria. In a basic formulation, the facility location problem consists of a set of potential facility sites ''L'' where a facility can be opened, and a set of demand points ''D'' that must be serviced. The goal is to pick a subset ''F'' of facilities to open, to minimize the sum of distances from each demand point to its nearest facility, plus the sum of opening costs of the facilities. The facility location problem on general graphs is
NP-hard In computational complexity theory, NP-hardness ( non-deterministic polynomial-time hardness) is the defining property of a class of problems that are informally "at least as hard as the hardest problems in NP". A simple example of an NP-hard pr ...
to solve optimally, by reduction from (for example) the
set cover problem The set cover problem is a classical question in combinatorics, computer science, operations research, and complexity theory. It is one of Karp's 21 NP-complete problems shown to be NP-complete in 1972. Given a set of elements (called the univ ...
. A number of approximation algorithms have been developed for the facility location problem and many of its variants. Without assumptions on the set of distances between clients and sites (in particular, without assuming that the distances satisfy the
triangle inequality In mathematics, the triangle inequality states that for any triangle, the sum of the lengths of any two sides must be greater than or equal to the length of the remaining side. This statement permits the inclusion of degenerate triangles, but ...
), the problem is known as non-metric facility location and can be approximated to within a factor O(log ''n''). This factor is tight, via an approximation-preserving reduction from the set cover problem. If we assume distances between clients and sites are undirected and satisfy the triangle inequality, we are talking about a metric facility location (MFL) problem. The MFL is still NP-hard and hard to approximate within factor better than 1.463. The currently best known approximation algorithm achieves approximation ratio of 1.488.


Minimax facility location

The minimax facility location problem seeks a location which minimizes the maximum distance to the sites, where the distance from one point to the sites is the distance from the point to its nearest site. A formal definition is as follows: Given a point set ''P'' ⊂ ℝ''d'', find a point set ''S'' ⊂ ℝ''d'', , ''S'',  = ''k'', so that maxp ∈ ''P''(minq ∈ ''S''(d(p, q)) ) is minimized. In the case of the Euclidean metric for ''k'' = 1, it is known as the
smallest enclosing sphere In mathematics, given a non-empty set of objects of finite extension in d-dimensional space, for example a set of points, a bounding sphere, enclosing sphere or enclosing ball for that set is an d-dimensional solid sphere containing all of thes ...
problem or
1-center problem The 1-center problem, also known as minimax problem or minmax location problem, is a classical combinatorial optimization problem in operations research of facilities location type. In its most general case the problem is stated as follows: given a ...
. Its study traced at least to the year of 1860. see
smallest enclosing circle The smallest-circle problem (also known as minimum covering circle problem, bounding circle problem, least bounding circle problem, smallest enclosing circle problem) is a mathematical problem of computing the smallest circle that contains all of ...
and
bounding sphere In mathematics, given a non-empty set of objects of finite extension in d-dimensional space, for example a set of points, a bounding sphere, enclosing sphere or enclosing ball for that set is an d-dimensional solid sphere containing all of thes ...
for more details.


NP hardness

It has been proven that exact solution of ''k''-center problem is NP hard. . Approximation to the problem was found to be also NP hard when the error is small. The error level in the
approximation algorithm In computer science and operations research, approximation algorithms are efficient algorithms that find approximate solutions to optimization problems (in particular NP-hard problems) with provable guarantees on the distance of the returned solu ...
is measured as an approximation factor, which is defined as the ratio between the approximation and the optimum. It's proved that the ''k''-center problem approximation is NP hard when approximation factor is less than 1.822 (dimension = 2) or 2 (dimension > 2).


Algorithms

Exact solver There exist algorithms to produce exact solutions to this problem. One exact solver runs in time n^. 1 + ''ε'' approximation 1 + ''ε'' approximation is to find a solution with approximation factor no greater than 1 + ''ε''. This approximation is NP hard as ''ε'' is arbitrary. One approach based on the
coreset In computational geometry, a coreset is a small set of points that approximates the shape of a larger point set, in the sense that applying some geometric measure to the two sets (such as their minimum bounding box volume) results in approximately e ...
concept is proposed with execution complexity of O(2^dn). As an alternative, another algorithm also based on core sets is available. It runs in O(k^n). The author claims that the running time is much less than the worst case and thus it's possible to solve some problems when ''k'' is small (say ''k'' < 5). Farthest-point clustering For the hardness of the problem, it's impractical to get an exact solution or precise approximation. Instead, an approximation with factor = 2 is widely used for large ''k'' cases. The approximation is referred to as the farthest-point clustering (FPC) algorithm, or
farthest-first traversal In computational geometry, the farthest-first traversal of a compact metric space is a sequence of points in the space, where the first point is selected arbitrarily and each successive point is as far as possible from the set of previously-sel ...
. The algorithm is quite simple: pick any point from the set as one center; search for the farthest point from remaining set as another center; repeat the process until ''k'' centers are found. It is easy to see that this algorithm runs in linear time. As approximation with factor less than 2 is proved to be NP hard, FPC was regarded as the best approximation one can find. As per the performance of execution, the time complexity is later improved to O(''n'' log ''k'') with box decomposition technique.


Maxmin facility location

The maxmin facility location or obnoxious facility location problem seeks a location which maximizes the minimum distance to the sites. In the case of the Euclidean metric, it is known as the
largest empty sphere In computational geometry, the largest empty sphere problem is the problem of finding a hypersphere of largest radius in ''d''-dimensional space whose interior does not overlap with any given obstacles. Two dimensions The largest empty circle ...
problem. The planar case (
largest empty circle In computational geometry, the largest empty sphere problem is the problem of finding a hypersphere of largest radius in ''d''-dimensional space whose interior does not overlap with any given obstacles. Two dimensions The largest empty circl ...
problem) may be solved in optimal time Θ(''n'' log n).
p. 256
/ref>


Integer programming formulations

Facility location problems are often solved as
integer programs Linear programming (LP), also called linear optimization, is a method to achieve the best outcome (such as maximum profit or lowest cost) in a mathematical model whose requirements are represented by linear relationships. Linear programming is ...
. In this context, facility location problems are often posed as follows: suppose there are n facilities and m customers. We wish to choose (1) which of the n facilities to open, and (2) which (open) facilities to use to supply the m customers, in order to satisfy some fixed demand at minimum cost. We introduce the following notation: let f_i denote the (fixed) cost of opening facility i, for i=1,\dots,n. Let c_denote the cost to ship a product from facility i to customer j for i=1,\dots,n and j=1,\dots,m. Let d_j denote the demand of customer j for j=1,\dots,m. Further suppose that each facility has a maximum output. Let u_i denote the maximum amount of product that can be produced by facility i, that is, let u_i denote the ''capacity'' of facility i. The remainder of this section follows


Capacitated facility location

In our initial formulation, introduce a binary variable x_i for i=1,\dots,n, where x_i=1 if facility i is open, and x_i=0 otherwise. Further introduce the variable y_ for i=1,\dots,n and j=1,\dots,m which represents the fraction of the demand d_j filled by facility i. The so-called capacitated facility location problem is then given by\begin \min & \displaystyle\sum_^n\sum_^mc_ d_j y_+\sum_^nf_ix_i \\ \text & \displaystyle\sum_^ny_=1 \textj=1,\dots,m \\ & \displaystyle \sum_^md_jy_\leqslant u_ix_i\texti=1\dots,n \\ &y_\geqslant0\texti=1,\dots,n \textj=1,\dots,m\\ &x_i\in\\text i=1,\dots,n \end Note that the second set of constraints ensure that if x_i=0, that is, facility i isn't open, then y_=0 for all j, that is, no demand for any customer can be filled from facility i.


Uncapacitated facility location

A common case of the capacitated facility location problem above is the case when u_i=+\infty for all i=1,\dots,n. In this case, it is always optimal to satisfy all of the demand from customer j from the nearest open facility. Because of this, we may replace the continuous variables y_ from above with the
binary variables Binary data is data whose unit can take on only two possible states. These are often labelled as 0 and 1 in accordance with the binary numeral system and Boolean algebra. Binary data occurs in many different technical and scientific fields, wher ...
z_, where z_=1 if customer j is supplied by facility i, and z_=0 otherwise. The uncapacitated facility location problem is then given by\begin \min & \displaystyle\sum_^n\sum_^mc_ d_j z_+\sum_^nf_ix_i \\ \text & \displaystyle\sum_^nz_=1 \textj=1,\dots,m \\ & \displaystyle \sum_^mz_\leqslant Mx_i\texti=1\dots,n \\ &z_\in\\texti=1,\dots,n \textj=1,\dots,m\\ &x_i\in\\text i=1,\dots,n \end where M is a constant chosen to be suitably large. The choice of M can affect computation results—the best choice in this instance is obvious: take M=m. Then, if x_i=1, any choice of the z_ will satisfy the second set of constraints. Another formulation possibility for the uncapacitated facility location problem is to ''disaggregate'' the capacity constraints (the big-M constraints). That is, replace the constraints\sum_^z_\leqslant Mx_i\texti=1,\dots,nwith the constraintsz_\leqslant x_i\texti=1,\dots,n \textj=1,\dots,mIn practice, this new formulation performs significantly better, in the sense that it has a tighter
Linear programming Linear programming (LP), also called linear optimization, is a method to achieve the best outcome (such as maximum profit or lowest cost) in a mathematical model whose requirements are represented by linear function#As a polynomial function, li ...
relaxation than the first formulation. Notice that summing the new constraints together yields the original big-M constraints. In the capacitated case, these formulations are not equivalent. More information about the uncapacitated facility location problem can be found in Chapter 3 of "Discrete location theory".


Applications


Healthcare

In healthcare, incorrect facility location decisions have a serious impact on the community beyond simple cost and service metrics; for instance, hard-to-access healthcare facilities are likely to be associated with increased morbidity and mortality. From this perspective, facility location modeling for healthcare is more critical than similar modeling for other areas.


Solid waste management

Municipal solid waste management still remains a challenge for developing countries because of increasing waste production and high costs associated with waste management. Through the formulation and exact resolution of a facility location problem it is possible to optimize the location of landfills for waste disposal.


Clustering

A particular subset of
cluster analysis Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of ...
problems can be viewed as facility location problems. In a centroid-based clustering problem, the objective is to partition n data points (elements of a common metric space) into equivalence classes—often called ''colors''—such that points of the same color are close to one another (equivalently, such that points of different colors are far from one another). To see how one might view (read "transform" or "reduce") a centroid-based clustering problem as a (metric) facility location problem, view each data point in the former as a demand point in the latter. Suppose that the data to be clustered are elements of a metric space M (e.g. let M be p -dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
for some fixed p ). In the facility location problem that we are constructing, we permit facilities to be placed at any point within this metric space M ; this defines the set of allowed facility locations L . We define the costs c_ to be the pairwise distances between location-demand point pairs (e.g., see
metric k-center In graph theory, the metric -center problem is a combinatorial optimization problem studied in theoretical computer science. Given cities with specified distances, one wants to build warehouses in different cities and minimize the maximum dist ...
). In a centroid-based clustering problem, one partitions the data into k equivalence classes (i.e. colors) each of which has a centroid. Let us see how a solution to our constructed facility location problem also achieves such a partition. A feasible solution is a non-empty subset L' \subseteq L of k locations. These locations in our facility location problem comprise a set of k centroids in our centroid-based clustering problem. Now, assign each demand point d to the location \ell^* that minimizes its servicing-cost; that is, assign the data point d to the centroid \ell^* := \mathrm_ \ (break ties arbitrarily). This achieves the partitioning provided that the facility location problem's costs c_ are defined such that they are the images of the centroid-based clustering problem's distance function. The popular algorithms textbook ''Algorithm Design'' provides a related problem-description and an approximation algorithm. The authors refer to the metric facility location problem (i.e. the centroid-based clustering problem or the metric k -center problem) as the ''center selection problem'', thereby growing the list of synonyms. Furthermore, see that in our above definition of the facility location problem that the objective function f is general. Specific choices of f yield different variants of the facility location problem, and hence different variants of the centroid-based clustering problem. For example, one might choose to minimize the sum of distances from each location to each of its assigned demand points (à la the
Weber problem In geometry, the Weber problem, named after Alfred Weber, is one of the most famous problems in location theory. It requires finding a point in the plane that minimizes the sum of the transportation costs from this point to ''n'' destination point ...
), or one might elect to minimize the maximum of all such distances (à la the
1-center problem The 1-center problem, also known as minimax problem or minmax location problem, is a classical combinatorial optimization problem in operations research of facilities location type. In its most general case the problem is stated as follows: given a ...
).


See also

*
Graph center The center (or Jordan center Wasserman, Stanley, and Faust, Katherine (1994), ''Social Network Analysis: Methods and Applications'', page 185. Cambridge: Cambridge University Press. ) of a graph is the set of all vertices of minimum eccentricit ...
*
Quadratic assignment problem The quadratic assignment problem (QAP) is one of the fundamental combinatorial optimization problems in the branch of optimization or operations research in mathematics, from the category of the facilities location problems first introduced by Ko ...
*
Location-allocation Location-allocation refers to algorithms used primarily in a geographic information system to determine an optimal location for one or more facilities that will service demand from a given set of points. Algorithms can assign those demand points t ...
* Dijkstra's algorithm *
List of spatial analysis software Spatial analysis software is software written to enable and facilitate spatial analysis. Currently, there are several packages, both free software and proprietary software, which cover most of the spatial data infrastructure stack. Packages See ...
*
Competitive facility location game The competitive facility location game is a kind of competitive game in which service-providers select locations to place their facilities in order to maximize their profits.Eva Tardos and Tom Wexler, "Network Formation Games". Chapter 19 in The ga ...
*
Vertex k-center problem The vertex ''k''-center problem is a classical NP-hard problem in computer science. It has application in facility location and clustering. Basically, the vertex ''k''-center problem models the following real problem: given a city with n facilit ...
*
geometric median In geometry, the geometric median of a discrete set of sample points in a Euclidean space is the point minimizing the sum of distances to the sample points. This generalizes the median, which has the property of minimizing the sum of distances ...


References


External links

* EWGLAbr>EURO Working Group on Locational Analysis
*
INFORMS The Institute for Operations Research and the Management Sciences (INFORMS) is an international society for practitioners in the fields of operations research (O.R.), management science, and analytics. It was established in 1995 with the merger of ...
br>section on location analysis
a professional society concerned with facility location.
Bibliography on facility location
collected by
Trevor Hale Trevor S. Hale (born c. 1965) is a professor of business analytics at Texas A&M University (Mays Business School). He received his Ph.D. in Operations research from Texas A&M University, an MS in Engineering management from Northeastern Universi ...
, containing over 3400 articles.
Library of location algorithmsWeb-based facility location utility (single facility)Facility Location Optimizer
a MATLAB-based tool for solving facility location problems. {{DEFAULTSORT:Facility Location Mathematical optimization in business Computational problems in graph theory Spatial analysis