The secretary problem demonstrates a scenario involving

optimal stopping In mathematics, the theory of optimal stopping or early stopping : (For French translation, secover storyin the July issue of ''Pour la Science'' (2009).) is concerned with the problem of choosing a time to take a particular action, in order to ...

theory For French translation, se
cover story
in the July issue of ''Pour la Science'' (2009). that is studied extensively in the fields of

applied probability Applied probability is the application of probability theory to statistical problems and other scientific and engineering domains. Scope Much research involving probability is done under the auspices of applied probability. However, while such res ...

statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...

, and

decision theory Decision theory (or the theory of choice; not to be confused with choice theory) is a branch of applied probability theory concerned with the theory of making decisions based on assigning probabilities to various factors and assigning numerical ...

. It is also known as the marriage problem, the sultan's dowry problem, the fussy suitor problem, the googol game, and the best choice problem. The basic form of the problem is the following: imagine an administrator who wants to hire the best secretary out of

n

rankable applicants for a position. The applicants are interviewed one by one in random order. A decision about each particular applicant is to be made immediately after the interview. Once rejected, an applicant cannot be recalled. During the interview, the administrator gains information sufficient to rank the applicant among all applicants interviewed so far, but is unaware of the quality of yet unseen applicants. The question is about the optimal strategy (

stopping rule In probability theory, in particular in the study of stochastic processes, a stopping time (also Markov time, Markov moment, optional stopping time or optional time ) is a specific type of “random time”: a random variable whose value is inter ...

) to maximize the probability of selecting the best applicant. If the decision can be deferred to the end, this can be solved by the simple maximum

selection algorithm In computer science, a selection algorithm is an algorithm for finding the ''k''th smallest number in a list or array; such a number is called the ''k''th ''order statistic''. This includes the cases of finding the minimum, maximum, and median el ...

of tracking the running maximum (and who achieved it), and selecting the overall maximum at the end. The difficulty is that the decision must be made immediately. The shortest rigorous proof known so far is provided by the

odds algorithm The odds algorithm (or Bruss algorithm) is a mathematical method for computing optimal strategies for a class of problems that belong to the domain of optimal stopping problems. Their solution follows from the ''odds strategy'', and the importance ...

. It implies that the optimal win probability is always at least

1/e

(where '' e'' is the base of the

natural logarithm The natural logarithm of a number is its logarithm to the base of the mathematical constant , which is an irrational and transcendental number approximately equal to . The natural logarithm of is generally written as , , or sometimes, if ...

), and that the latter holds even in a much greater generality. The optimal stopping rule prescribes always rejecting the first

\sim n/e

applicants that are interviewed and then stopping at the first applicant who is better than every applicant interviewed so far (or continuing to the last applicant if this never occurs). Sometimes this strategy is called the

1/e

stopping rule, because the probability of stopping at the best applicant with this strategy is about

1/e

already for moderate values of

n

. One reason why the secretary problem has received so much attention is that the optimal policy for the problem (the stopping rule) is simple and selects the single best candidate about 37% of the time, irrespective of whether there are 100 or 100 million applicants.

Formulation

Although there are many variations, the basic problem can be stated as follows: * There is a single position to fill. * There are ''n'' applicants for the position, and the value of ''n'' is known. * The applicants, if seen altogether, can be ranked from best to worst unambiguously. * The applicants are interviewed sequentially in random order, with each order being equally likely. * Immediately after an interview, the interviewed applicant is either accepted or rejected, and the decision is irrevocable. * The decision to accept or reject an applicant can be based only on the relative ranks of the applicants interviewed so far. * The objective of the general solution is to have the highest probability of selecting the best applicant of the whole group. This is the same as maximizing the expected payoff, with payoff defined to be one for the best applicant and zero otherwise. A ''candidate'' is defined as an applicant who, when interviewed, is better than all the applicants interviewed previously. ''Skip'' is used to mean "reject immediately after the interview". Since the objective in the problem is to select the single best applicant, only candidates will be considered for acceptance. The "candidate" in this context corresponds to the concept of record in permutation.

Deriving the optimal policy

The optimal policy for the problem is a

. Under it, the interviewer rejects the first ''r'' − 1 applicants (let applicant ''M'' be the best applicant among these ''r'' − 1 applicants), and then selects the first subsequent applicant that is better than applicant ''M''. It can be shown that the optimal strategy lies in this class of strategies. (Note that we should never choose an applicant who is not the best we have seen so far, since they cannot be the best overall applicant.) For an arbitrary cutoff ''r'', the probability that the best applicant is selected is :

\cdot \frac \\ &=\frac \sum_^ \frac. \end

The sum is not defined for ''r'' = 1, but in this case the only feasible policy is to select the first applicant, and hence ''P''(1) = 1/''n''. This sum is obtained by noting that if applicant ''i'' is the best applicant, then it is selected if and only if the best applicant among the first ''i'' − 1 applicants is among the first ''r'' − 1 applicants that were rejected. Letting ''n'' tend to infinity, writing

x

as the limit of ''(r−1)''/''n'', using ''t'' for ''(i−1)''/''n'' and ''dt'' for 1/''n'', the sum can be approximated by the integral :

P(x)=x \int_^\frac\,dt = -x \ln(x)\;.

Taking the derivative of ''P''(''x'') with respect to

x

, setting it to 0, and solving for ''x'', we find that the optimal ''x'' is equal to 1/''e''. Thus, the optimal cutoff tends to ''n''/''e'' as ''n'' increases, and the best applicant is selected with probability 1/''e''. For small values of ''n'', the optimal ''r'' can also be obtained by standard

dynamic programming Dynamic programming is both a mathematical optimization method and a computer programming method. The method was developed by Richard Bellman in the 1950s and has found applications in numerous fields, from aerospace engineering to economics. I ...

methods. The optimal thresholds ''r'' and probability of selecting the best alternative ''P'' for several values of ''n'' are shown in the following table. The probability of selecting the best applicant in the classical secretary problem converges toward

1/e\approx 0.368

Alternative solution

This problem and several modifications can be solved (including the proof of optimality) in a straightforward manner by the

, which also has other applications. Modifications for the secretary problem that can be solved by this algorithm include random availabilities of applicants, more general hypotheses for applicants to be of interest to the decision maker, group interviews for applicants, as well as certain models for a random number of applicants.

Limitations

The solution of the secretary problem is only meaningful if it is justified to assume that the applicants have no knowledge of the decision strategy employed, because early applicants have no chance at all and may not show up otherwise. One important drawback for applications of the solution of the classical secretary problem is that the number of applicants

n

must be known in advance, which is rarely the case. One way to overcome this problem is to suppose that the number of applicants is a random variable

N

with a known distribution of

P(N=k)_

(Presman and Sonin, 1972). For this model, the optimal solution is in general much harder, however. Moreover, the optimal success probability is now no longer around 1/''e'' but typically lower. This can be understood in the context of having a "price" to pay for not knowing the number of applicants. However, in this model the price is high. Depending on the choice of the distribution of

N

, the optimal win probability can approach zero. Looking for ways to cope with this new problem led to a new model yielding the so-called 1/e-law of best choice.

1/e-law of best choice

The essence of the model is based on the idea that life is sequential and that real-world problems pose themselves in real time. Also, it is easier to estimate times in which specific events (arrivals of applicants) should occur more frequently (if they do) than to estimate the distribution of the number of specific events which will occur. This idea led to the following approach, the so-called ''unified approach'' (1984): The model is defined as follows: An applicant must be selected on some time interval

,T /math> from an unknown number N of rankable applicants. The goal is to maximize the probability of selecting only the best under the hypothesis that all arrival orders of different ranks are equally likely. Suppose that all applicants have the same, but independent to each other, arrival time density f on,T /math> and let F denote the corresponding arrival time distribution function, that is

: F(t) = \int_^ f(s)ds, \, 0\le t\le T .

Let \tau be such that F(\tau)=1/e. Consider the strategy to wait and observe all applicants up to time \tau and then to select, if possible, the first candidate after time \tau which is better than all preceding ones. Then this strategy, called ''1/e-strategy'', has the following properties:

The ''1/e-strategy''

:(i) yields for all N a success probability of at least 1/e,

:(ii) is a minimax-optimal strategy for the selector who does not know N,

:(iii) selects, if there is at least one applicant, none at all with probability exactly 1/e.

The 1/e-law, proved in 1984 by

F. Thomas Bruss Franz Thomas Bruss is Emeritus Professor of Mathematics at the Université Libre de Bruxelles, where he had been director of "Mathématiques Générales" and co-director of the probability chair, and where he continues his research as invited profe ...

, came as a surprise. The reason was that a value of about 1/e had been considered before as being out of reach in a model for unknown

N

, whereas this value 1/e was now achieved as a lower bound for the success probability, and this in a model with arguably much weaker hypotheses (see e.g. Math. Reviews 85:m). However, there are many other strategies that achieve (i) and (ii) and, moreover, perform strictly better than the 1/e-strategy simultaneously for all

N

>2. A simple example is the strategy which selects (if possible) the first relatively best candidate after time

\tau

provided that at least one applicant arrived before this time, and otherwise selects (if possible) the second relatively best candidate after time

\tau

. The 1/e-law is sometimes confused with the solution for the classical secretary problem described above because of the similar role of the number 1/e. However, in the 1/e-law, this role is more general. The result is also stronger, since it holds for an unknown number of applicants and since the model based on an arrival time distribution F is more tractable for applications.

The game of googol

According to

Thomas S. Ferguson Thomas Shelburne Ferguson (born December 14, 1929) is an American mathematician and statistician. He is a professor emeritus of mathematics and statistics at the University of California, Los Angeles. Education and career Ferguson was born in O ...

, the secretary problem appeared for the first time in print when it was featured by

Martin Gardner Martin Gardner (October 21, 1914May 22, 2010) was an American popular mathematics and popular science writer with interests also encompassing scientific skepticism, micromagic, philosophy, religion, and literatureespecially the writings of Lewis ...

in his February 1960

Mathematical Games column Over a period of 24 years (January 1957 – December 1980), Martin Gardner wrote 288 consecutive monthly "Mathematical Games" columns for ''Scientific American'' magazine. During the next years, through June 1986, Gardner wrote 9 more columns, ...

in ''

Scientific American ''Scientific American'', informally abbreviated ''SciAm'' or sometimes ''SA'', is an American popular science magazine. Many famous scientists, including Albert Einstein and Nikola Tesla, have contributed articles to it. In print since 1845, it i ...

''. Here is how Gardner formulated it: "Ask someone to take as many slips of paper as he pleases, and on each slip write a different positive number. The numbers may range from small fractions of 1 to a number the size of a ''googol'' (1 followed by a hundred zeroes) or even larger. These slips are turned face down and shuffled over the top of a table. One at a time you turn the slips face up. The aim is to stop turning when you come to the number that you guess to be the largest of the series. You cannot go back and pick a previously turned slip. If you turn over all the slips, then of course you must pick the last one turned." In the article "Who solved the Secretary problem?" Ferguson pointed out that the secretary problem remained unsolved as it was stated by M. Gardner, that is as a two-person

zero-sum game Zero-sum game is a mathematical representation in game theory and economic theory of a situation which involves two sides, where the result is an advantage for one side and an equivalent loss for the other. In other words, player one's gain is e ...

with two antagonistic players. In this game Alice, the informed player, writes secretly distinct numbers on

n

cards. Bob, the stopping player, observes the actual values and can stop turning cards whenever he wants, winning if the last card turned has the overall maximal number. The difference with the basic secretary problem is that Bob observes the actual values written on the cards, which he can use in his decision procedures. The numbers on cards are analogous to the numerical qualities of applicants in some versions of the secretary problem. The

joint probability distribution Given two random variables that are defined on the same probability space, the joint probability distribution is the corresponding probability distribution on all possible pairs of outputs. The joint distribution can just as well be considered ...

of the numbers is under the control of Alice. Bob wants to guess the maximal number with the highest possible probability, while Alice's goal is to keep this probability as low as possible. It is not optimal for Alice to sample the numbers independently from some fixed distribution, and she can play better by choosing random numbers in some dependent way. For

n=2

Alice has no

minimax Minimax (sometimes MinMax, MM or saddle point) is a decision rule used in artificial intelligence, decision theory, game theory, statistics, and philosophy for ''mini''mizing the possible loss for a worst case (''max''imum loss) scenario. When de ...

strategy, which is closely related to a paradox of T. Cover. But for

n>2

the game has a solution: Alice can choose random numbers (which are dependent random variables) in such a way that Bob cannot play better than using the classical stopping strategy based on the relative ranks.

Heuristic performance

The remainder of the article deals again with the secretary problem for a known number of applicants. derived the expected success probabilities for several psychologically plausible heuristics that might be employed in the secretary problem. The heuristics they examined were: * The cutoff rule (CR): Do not accept any of the first ''y'' applicants; thereafter, select the first encountered candidate (i.e., an applicant with relative rank 1). This rule has as a special case the optimal policy for the classical secretary problem for which ''y'' = ''r''. * Candidate count rule (CCR): Select the ''y''-th encountered candidate. Note, that this rule does not necessarily skip any applicants; it only considers how many candidates have been observed, not how deep the decision maker is in the applicant sequence. * Successive non-candidate rule (SNCR): Select the first encountered candidate after observing ''y'' non-candidates (i.e., applicants with relative rank > 1). Each heuristic has a single parameter ''y''. The figure (shown on right) displays the expected success probabilities for each heuristic as a function of ''y'' for problems with ''n'' = 80.

Cardinal payoff variant

Finding the single best applicant might seem like a rather strict objective. One can imagine that the interviewer would rather hire a higher-valued applicant than a lower-valued one, and not only be concerned with getting the best. That is, the interviewer will derive some value from selecting an applicant that is not necessarily the best, and the derived value increases with the value of the one selected. To model this problem, suppose that the

n

applicants have "true" values that are

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...

s ''X'' drawn

i.i.d. In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usual ...

from a

uniform distribution Uniform distribution may refer to: * Continuous uniform distribution * Discrete uniform distribution * Uniform distribution (ecology) * Equidistributed sequence In mathematics, a sequence (''s''1, ''s''2, ''s''3, ...) of real numbers is said to be ...

on , 1 Similar to the classical problem described above, the interviewer only observes whether each applicant is the best so far (a candidate), must accept or reject each on the spot, and ''must'' accept the last one if he/she is reached. (To be clear, the interviewer does not learn the actual relative rank of ''each'' applicant. He/she learns only whether the applicant has relative rank 1.) However, in this version the ''payoff'' is given by the true value of the selected applicant. For example, if he/she selects an applicant whose true value is 0.8, then he/she will earn 0.8. The interviewer's objective is to maximize the expected value of the selected applicant. Since the applicant's values are i.i.d. draws from a uniform distribution on , 1 the

expected value In probability theory, the expected value (also called expectation, expectancy, mathematical expectation, mean, average, or first moment) is a generalization of the weighted average. Informally, the expected value is the arithmetic mean of a l ...

of the ''t''th applicant given that

x_=\max\left\

is given by :

E_=E\left(X_, I_=1\right)=\frac.

As in the classical problem, the optimal policy is given by a threshold, which for this problem we will denote by

c

, at which the interviewer should begin accepting candidates. Bearden showed that ''c'' is either

\lfloor \sqrt n \rfloor

\lceil \sqrt n \rceil

. (In fact, whichever is closest to

\sqrt n

.) This follows from the fact that given a problem with

n

applicants, the expected payoff for some arbitrary threshold

1 \le c \le n

is :

frac=.

Differentiating

V_(c)

with respect to ''c'', one gets :

\frac = \frac.

Since

\partial^V / \partial c^<0

for all permissible values of

c

, we find that

V

is maximized at

c=\sqrt n

. Since ''V'' is convex in

c

, the optimal integer-valued threshold must be either

\lfloor \sqrt n \rfloor

\lceil \sqrt n \rceil

. Thus, for most values of

n

the interviewer will begin accepting applicants sooner in the cardinal payoff version than in the classical version where the objective is to select the single best applicant. Note that this is not an asymptotic result: It holds for all

n

. However, this is not the optimal policy to maximize expected value from a known distribution. In the case of a known distribution, optimal play can be calculated via dynamic programming. A more general form of this problem introduced by Palley and Kremer (2014) assumes that as each new applicant arrives, the interviewer observes their rank relative to all of the applicants that have been observed previously. This model is consistent with the notion of an interviewer ''learning'' as they continue the search process by accumulating a set of past data points that they can use to evaluate new candidates as they arrive. A benefit of this so-called partial-information model is that decisions and outcomes achieved given the relative rank information can be directly compared to the corresponding optimal decisions and outcomes if the interviewer had been given full information about the value of each applicant. This full-information problem, in which applicants are drawn independently from a known distribution and the interviewer seeks to maximize the expected value of the applicant selected, was originally solved by Moser (1956), Sakaguchi (1961), and Karlin (1962).

Other modifications

There are several variants of the secretary problem that also have simple and elegant solutions. One variant replaces the desire to pick the best with the desire to pick the second-best.

Robert J. Vanderbei Robert J. Vanderbei (born 1955) is an American mathematician and Professor in the Department of Operations Research and Financial Engineering at Princeton University. Biography Robert J. Vanderbei was born in Grand Rapids, MI, in 1955. He receiv ...

calls this the "postdoc" problem arguing that the "best" will go to Harvard. For this problem, the probability of success for an even number of applicants is exactly

\frac

. This probability tends to 1/4 as n tends to infinity illustrating the fact that it is easier to pick the best than the second-best. For a second variant, the number of selections is specified to be greater than one. In other words, the interviewer is not hiring just one secretary but rather is, say, admitting a class of students from an applicant pool. Under the assumption that success is achieved if and only if all the selected candidates are superior to all of the not-selected candidates, it is again a problem that can be solved. It was shown in that when n is even and the desire is to select exactly half the candidates, the optimal strategy yields a success probability of

\frac

. Another variant is that of selecting the best

k

secretaries out of a pool of

n

, again in an on-line algorithm. This leads to a strategy related to the classic one and cutoff threshold of

\frac

for which the classic problem is a special case.

Multiple choice problem

A player is allowed

r

choices, and he wins if any choice is the best. showed that an optimal strategy is given by a threshold strategy (cutoff strategy). An optimal strategy belongs to the class of strategies defined by a set of threshold numbers

(a_1, a_2, ... , a_r)

, where

a_1 . The first choice is to be used on the first candidates starting with a_1 th applicant, and once the first choice is used, second choice is to be used on the first candidate starting with a_2 th applicant, and so on.

Gilbert and Mosteller showed that \left( \frac, \frac, \frac, \frac \right)\rightarrow \left( e^, e^, e^, e^ \right)  (n \rightarrow \infty) .
For further cases that r=5,6,...,10, see  (for example \frac \rightarrow e^).

When r=2, the probability of win converges to e^+ e^  (n \rightarrow \infty) .
 showed that for any positive integer r, the probability of win (of r choice secretary problem) converges to p_1+p_2+\cdots +p_r where p_i=\lim_ \frac . Thus, the probability of win converges to e^+ e^+e^and e^+e^+e^+e^when r=3, 4 respectively.

Experimental studies

Experimental

psychologists A psychologist is a professional who practices psychology and studies mental states, perceptual, cognitive, emotional, and social processes and behavior. Their work often involves the experimentation, observation, and interpretation of how indi ...

and

economists An economist is a professional and practitioner in the social science discipline of economics. The individual may also study, develop, and apply theories and concepts from economics and write about economic policy. Within this field there are ...

have studied the decision behavior of actual people in secretary problem situations. In large part, this work has shown that people tend to stop searching too soon. This may be explained, at least in part, by the cost of evaluating candidates. In real world settings, this might suggest that people do not search enough whenever they are faced with problems where the decision alternatives are encountered sequentially. For example, when trying to decide at which gas station along a highway to stop for gas, people might not search enough before stopping. If true, then they would tend to pay more for gas than if they had searched longer. The same may be true when people search online for airline tickets. Experimental research on problems such as the secretary problem is sometimes referred to as

behavioral operations research Behavioral operations management (often called behavioral operations) examines and takes into consideration human behaviors and emotions when facing complex decision problems. It relates to the behavioral aspects of the use of operations research an ...

Neural correlates

While there is a substantial body of

neuroscience Neuroscience is the scientific study of the nervous system (the brain, spinal cord, and peripheral nervous system), its functions and disorders. It is a multidisciplinary science that combines physiology, anatomy, molecular biology, development ...

research on information integration, or the representation of belief, in perceptual decision-making tasks using both animal and human subjects, there is relatively little known about how the decision to stop gathering information is arrived at. Researchers have studied the neural bases of solving the secretary problem in healthy volunteers using

functional MRI Functional magnetic resonance imaging or functional MRI (fMRI) measures brain activity by detecting changes associated with blood flow. This technique relies on the fact that cerebral blood flow and neuronal activation are coupled. When an area o ...

. A Markov decision process (MDP) was used to quantify the value of continuing to search versus committing to the current option. Decisions to take versus decline an option engaged parietal and

dorsolateral prefrontal The dorsolateral prefrontal cortex (DLPFC or DL-PFC) is an area in the prefrontal cortex of the primate brain. It is one of the most recently derived parts of the human brain. It undergoes a prolonged period of maturation which lasts until adultho ...

cortices, as well

ventral striatum The striatum, or corpus striatum (also called the striate nucleus), is a nucleus (a cluster of neurons) in the subcortical basal ganglia of the forebrain. The striatum is a critical component of the motor and reward systems; receives glutamate ...

anterior insula The insular cortex (also insula and insular lobe) is a portion of the cerebral cortex folded deep within the lateral sulcus (the fissure separating the temporal lobe from the parietal and frontal lobes) within each hemisphere of the mammalian br ...

, and

anterior cingulate In the human brain, the anterior cingulate cortex (ACC) is the frontal part of the cingulate cortex that resembles a "collar" surrounding the frontal part of the corpus callosum. It consists of Brodmann areas 24, 32, and 33. It is involved i ...

. Therefore, brain regions previously implicated in evidence integration and

reward Reward may refer to: Places * Reward (Shelltown, Maryland), a historic home in Shelltown Maryland * Reward, California (disambiguation) * Reward-Tilden's Farm, a historic home in Chestertown Maryland Arts, entertainment, and media * "Rewa ...

representation encode threshold crossings that trigger decisions to commit to a choice.

History

The secretary problem was apparently introduced in 1949 by

Merrill M. Flood Merrill Meeks Flood (1908 – 1991) was an American mathematician, notable for developing, with Melvin Dresher, the basis of the game theoretical Prisoner's dilemma model of cooperation and conflict while being at RAND in 1950 ( Albert W. Tucker ...

, who called it the fiancée problem in a lecture he gave that year. He referred to it several times during the 1950s, for example, in a conference talk at

Purdue Purdue University is a public land-grant research university in West Lafayette, Indiana, and the flagship campus of the Purdue University system. The university was founded in 1869 after Lafayette businessman John Purdue donated land and money ...

on 9 May 1958, and it eventually became widely known in the folklore although nothing was published at the time. In 1958 he sent a letter to

Leonard Gillman Leonard E. Gillman (January 8, 1917 – April 7, 2009) was an American mathematician, emeritus professor at the University of Texas at Austin. He was also an accomplished classical pianist. Biography Early life and education Gillman was born i ...

, with copies to a dozen friends including

Samuel Karlin Samuel Karlin (June 8, 1924 – December 18, 2007) was an American mathematician at Stanford University in the late 20th century. Biography Karlin was born in Janów, Poland and immigrated to Chicago as a child. Raised in an Orthodox Jewish hous ...

and J. Robbins, outlining a proof of the optimum strategy, with an appendix by R. Palermo who proved that all strategies are dominated by a strategy of the form "reject the first ''p'' unconditionally, then accept the next candidate who is better". The first publication was apparently by

in Scientific American, February 1960. He had heard about it from John H. Fox Jr., and L. Gerald Marnie, who had independently come up with an equivalent problem in 1958; they called it the "game of googol". Fox and Marnie did not know the optimum solution; Gardner asked for advice from

Leo Moser Leo Moser (11 April 1921, Vienna – 9 February 1970, Edmonton) was an Austrian-Canadian mathematician, best known for his polygon notation. A native of Vienna, Leo Moser immigrated with his parents to Canada at the age of three. He received his ...

, who (together with J. R. Pounder) provided a correct analysis for publication in the magazine. Soon afterwards, several mathematicians wrote to Gardner to tell him about the equivalent problem they had heard via the grapevine, all of which can most likely be traced to Flood's original work. The 1/''e''-law of best choice is due to

. Ferguson has an extensive bibliography and points out that a similar (but different) problem had been considered by

Arthur Cayley Arthur Cayley (; 16 August 1821 – 26 January 1895) was a prolific United Kingdom of Great Britain and Ireland, British mathematician who worked mostly on algebra. He helped found the modern British school of pure mathematics. As a child, C ...

in 1875 and even by

Johannes Kepler Johannes Kepler (; ; 27 December 1571 – 15 November 1630) was a German astronomer, mathematician, astrologer, natural philosopher and writer on music. He is a key figure in the 17th-century Scientific Revolution, best known for his laws ...

long before that.

Combinatorial generalization

The secretary problem can be generalized to the case where there are multiple different jobs. Again, there are

n

applicants coming in random order. When a candidate arrives, she reveals a set of nonnegative numbers. Each value specifies her qualification for one of the jobs. The administrator not only has to decide whether or not to take the applicant but, if so, also has to assign her permanently to one of the jobs. The objective is to find an assignment where the sum of qualifications is as big as possible. This problem is identical to finding a maximum-weight matching in an edge-weighted

bipartite graph In the mathematical field of graph theory, a bipartite graph (or bigraph) is a graph whose vertices can be divided into two disjoint and independent sets U and V, that is every edge connects a vertex in U to one in V. Vertex sets U and V are ...

where the

n

nodes of one side arrive online in random order. Thus, it is a special case of the online bipartite matching problem. By a generalization of the classic algorithm for the secretary problem, it is possible to obtain an assignment where the expected sum of qualifications is only a factor of

e

less than an optimal (offline) assignment.

Notes

References

* * * * * * * * * eprints his original column published in February 1960 with additional comments* * * * * Hill, T.P.
Knowing When to Stop
. ''American Scientist'', Vol. 97, 126-133 (2009). (For French translation, se
cover story
in the July issue of ''Pour la Science'' (2009)) * * * * * * * *

External links

* * *

book by Thomas S. Ferguson {{DEFAULTSORT:Secretary Problem Decision theory Sequential methods Matching (graph theory) Optimal decisions Probability problems Mathematical optimization in business