Data Envelopment Analysis
   HOME

TheInfoList



OR:

Data envelopment analysis (DEA) is a
nonparametric Nonparametric statistics is the branch of statistics that is not based solely on parametrized families of probability distributions (common examples of parameters are the mean and variance). Nonparametric statistics is based on either being dist ...
method in
operations research Operations research ( en-GB, operational research) (U.S. Air Force Specialty Code: Operations Analysis), often shortened to the initialism OR, is a discipline that deals with the development and application of analytical methods to improve deci ...
and
economics Economics () is the social science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and intera ...
for the estimation of production frontiers.Charnes et al (1978) DEA has been applied in a large range of fields including international banking, economic sustainability, police department operations, and logistical applicationsCharnes et al (1995) Emrouznejad et al (2016)Thanassoulis (1995) Additionally, DEA has been used to assess the performance of natural language processing models, and it has found other applications within machine learning.Zhou et al (2022)Guerrero et al (2022)


Description

DEA is used to
empirically In philosophy, empiricism is an epistemological theory that holds that knowledge or justification comes only or primarily from sensory experience. It is one of several views within epistemology, along with rationalism and skepticism. Empiri ...
measure
productive efficiency In microeconomic theory, productive efficiency (or production efficiency) is a situation in which the economy or an economic system (e.g., bank, hospital, industry, country) operating within the constraints of current industrial technology canno ...
of decision-making units (DMUs). Although DEA has a strong link to
production theory Production is the process of combining various inputs, both material (such as metal, wood, glass, or plastics) and immaterial (such as plans, or knowledge) in order to create output. Ideally this output will be a good or service which has value an ...
in economics, the method is also used for
benchmarking Benchmarking is the practice of comparing business processes and performance metrics to industry bests and best practices from other companies. Dimensions typically measured are quality, time and cost. Benchmarking is used to measure performan ...
in operations management, whereby a set of measures is selected to benchmark the performance of manufacturing and service operations. In benchmarking, the efficient DMUs, as defined by DEA, may not necessarily form a “production frontier”, but rather lead to a “best-practice frontier.” In contrast to parametric methods that require the '' ex-ante'' specification of a production- or cost-function, non-parametric approaches compare feasible input and output combinations based on the available
data In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted ...
only.Cooper et al (2007) DEA, one of the most commonly used non-parametric methods, owes its name to its enveloping property of the dataset's efficient DMUs, where the empirically observed, most efficient DMUs constitute the production frontier against which all DMUs are compared. DEA's popularity stems from its relative lack of assumptions, the ability to benchmark multi-dimensional inputs and outputs as well as its computational ease owing to it being expressable as a
linear program Linear programming (LP), also called linear optimization, is a method to achieve the best outcome (such as maximum profit or lowest cost) in a mathematical model whose requirements are represented by linear relationships. Linear programming is ...
, despite its task to calculate
efficiency ratio The efficiency ratio indicates the expenses as a percentage of revenue (''expenses'' / ''revenue''), with a few variations – it is essentially how much a corporation or individual spends to make a dollar; entities are supposed to attempt minimizi ...
s.Cooper et al (2011)


History

Building on the ideas of Farrell,Farrell (1957) the 1978 work "Measuring the efficiency of decision-making units" by Charnes,
Cooper Cooper, Cooper's, Coopers and similar may refer to: * Cooper (profession), a maker of wooden casks and other staved vessels Arts and entertainment * Cooper (producers), alias of Dutch producers Klubbheads * Cooper (video game character), in ...
&
Rhodes Rhodes (; el, Ρόδος , translit=Ródos ) is the largest and the historical capital of the Dodecanese islands of Greece. Administratively, the island forms a separate municipality within the Rhodes regional unit, which is part of the So ...
applied linear programming to estimate, for the first time, an
empirical Empirical evidence for a proposition is evidence, i.e. what supports or counters this proposition, that is constituted by or accessible to sense experience or experimental procedure. Empirical evidence is of central importance to the sciences and ...
, production-technology frontier. In
Germany Germany,, officially the Federal Republic of Germany, is a country in Central Europe. It is the second most populous country in Europe after Russia, and the most populous member state of the European Union. Germany is situated betwe ...
, the procedure had earlier been used to estimate the
marginal productivity In economics and in particular neoclassical economics, the marginal product or marginal physical productivity of an input ( factor of production) is the change in output resulting from employing one more unit of a particular input (for instance, t ...
of R&D and other factors of production. Since then, there have been a large number of books and journal articles written on DEA or about applying DEA to various sets of problems. Starting with the CCR model, named after Charnes, Cooper, and Rhodes, many extensions to DEA have been proposed in the literature. They range from adapting implicit model assumptions such as input and output orientation, distinguishing technical and allocative efficiency,Fried et al (2008) adding limited disposabilityCooper et al (2000) of inputs/outputs or varying returns-to-scaleBanker et al (1984) to techniques that utilize DEA results and extend them for more sophisticated analyses, such as stochastic DEAOlesen (2016) or cross-efficiency analysis.


Techniques

In a one-input, one-output scenario,
efficiency Efficiency is the often measurable ability to avoid wasting materials, energy, efforts, money, and time in doing something or in producing a desired result. In a more general sense, it is the ability to do things well, successfully, and without ...
is merely the ratio of output over input that can be produced, while comparing several entities/DMUs based on it is trivial. However, when adding more inputs or outputs the efficiency computation becomes more complex. Charnes, Cooper, and Rhodes (1978) in their basic DEA model (the CCR) define the objective function to find DMU_j's efficiency (\theta_j) as: :\max \quad \theta_j = \frac, where the DMU_j's known M outputs y_1^j,...,y_m^j are multiplied by their respective weights u_1^j,...,u_m^j and divided by the N inputs x_1^j,...,x_n^j multiplied by their respective weights v_1^j,...,v_n^j. The efficiency score \theta_j is sought to be maximized, under the constraints that using those weights on each DMU_k \quad k=1,...,K, no efficiency score exceeds one: :\frac \leq 1 \qquad k = 1,...,K, and all inputs, outputs and weights have to be non-negative. To allow for linear optimization, one typically constrains either the sum of outputs or the sum of inputs to equal a fixed value (typically 1. See later for an example). Because this
optimization Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criterion, from some set of available alternatives. It is generally divided into two subfi ...
problem's dimensionality is equal to the sum of its inputs and outputs, selecting the smallest number of inputs/outputs that collectively, accurately capture the process one attempts to characterize is crucial. And because the production frontier envelopment is done empirically, several guidelines exist on the minimum required number of DMUs for good discriminatory power of the analysis, given homogeneity of the sample. This minimum number of DMUs varies between twice the sum of inputs and outputs (2 (M + N)) and twice the product of inputs and outputs (2 M N). Some advantages of the DEA approach are: * no need to explicitly specify a mathematical form for the production function * capable of handling multiple inputs and outputs * capable of being used with any input-output measurement, although ordinal variables remain tricky * the sources of inefficiency can be analysed and quantified for every evaluated unit * using the dual of the optimization problem identifies which DMU is evaluating itself against which other DMUs Some of the disadvantages of DEA are: * results are sensitive to the selection of inputs and outputs * high-efficiency values can be obtained by being truly efficient or having a niche combination of inputs/outputs * the number of efficient firms on the frontier increases with the number of inputs and output variables * a DMU's efficiency scores may be obtained by using non-unique combinations of weights on the input and/or output factors


Example

Assume that we have the following data: * Unit 1 produces 100 items per day, and the inputs per item are 10 dollars for materials and 2 labour-hours * Unit 2 produces 80 items per day, and the inputs are 8 dollars for materials and 4 labour-hours * Unit 3 produces 120 items per day, and the inputs are 12 dollars for materials and 1.5 labour-hours To calculate the efficiency of unit 1, we define the objective function (OF) as *Max Efficiency :(100u_1)/(10v_1+2v_2) which is subject to (ST) all efficiency of other units (efficiency cannot be larger than 1): *Efficiency of unit 1: (100u_1)/(10v_1+2v_2)\leq 1 *Efficiency of unit 2: (80u_1)/(8v_1+4v_2)\leq 1 *Efficiency of unit 3: (120u_1)/(12v_1+1.5v_2)\leq 1 and non-negativity: *u,v \geq 0 A fraction with decision variables in the numerator and denominator is nonlinear. Since we are using a linear programming technique, we need to linearize the formulation, such that the denominator of the objective function is constant (in this case 1), then maximize the numerator. The new formulation would be: * OF **Max Efficiency :100u_1 *ST ** Efficiency of unit 1: 100u_1-(10v_1+2v_2)\leq 0 ** Efficiency of unit 2: 80u_1-(8v_1+4v_2)\leq 0 ** Efficiency of unit 3: 120u_1-(12v_1+1.5v_2)\leq 0 **Denominator of nonlinear OF'':'' 10v_1+2v_2=1 ** Non-negativity: u,v \geq 0


Extensions

A desire to improve upon DEA by reducing its disadvantages or strengthening its advantages has been a major cause for discoveries in the recent literature. The currently most often DEA-based method to obtain unique efficiency rankings is called "cross-efficiency." Originally developed by Sexton et al. in 1986,Sexton (1986) it found widespread application ever since Doyle and Green's 1994 publication.Doyle (1994) Cross-efficiency is based on the original DEA results, but implements a secondary objective where each DMU peer-appraises all other DMU's with its own factor weights. The average of these peer-appraisal scores is then used to calculate a DMU's cross-efficiency score. This approach avoids DEA's disadvantages of having multiple efficient DMUs and potentially non-unique weights.Dyson (2001) Another approach to remedy some of DEA's drawbacks is Stochastic DEA, which synthesizes DEA and
Stochastic Frontier Analysis Stochastic frontier analysis (SFA) is a method of economic modeling. It has its starting point in the stochastic production frontier models simultaneously introduced by Aigner, Lovell and Schmidt (1977) and Meeusen and Van den Broeck (1977). The ...
(SFA).Olesen et al (2016)


Footnotes


References

* * * * * * * * * * * * * * * * * Lovell, C.A.L., & P. Schmidt (1988) "A Comparison of Alternative Approaches to the Measurement of Productive Efficiency, in Dogramaci, A., & R. Färe (eds.) ''Applications of Modern Production Theory: Efficiency and Productivity'', Kluwer: Boston. * * * * * *


Further reading

* * {{cite journal , last= Tofallis, first=Chris , year = 2001, url=https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1353122, accessdate=27 January 2022 , title = Combining two approaches to efficiency assessment , ssrn = 1353122 , journal =
Journal of the Operational Research Society The ''Journal of the Operational Research Society'' is a peer-reviewed academic journal covering operations research. It is an official journal of The Operational Research Society and has been in existence since 1950. It publishes full length ca ...
, volume = 52 , issue = 11, pages = 1225–1231 , doi=10.1057/palgrave.jors.2601231, hdl = 2299/917 , s2cid = 15258094 , hdl-access = free


External links


Data Envelopment Analysis
official website
''Journal of Productivity Analysis''
official website Linear programming Production economics Mathematical optimization in business