Selection is a
genetic operator
A genetic operator is an Operator (programming), operator used in evolutionary algorithms (EA) to guide the algorithm towards a solution to a given problem. There are three main types of operators (Mutation (evolutionary algorithm) , mutation, Cro ...
in an
evolutionary algorithm
Evolutionary algorithms (EA) reproduce essential elements of the biological evolution in a computer algorithm in order to solve "difficult" problems, at least Approximation, approximately, for which no exact or satisfactory solution methods are k ...
(EA). An EA is a
metaheuristic inspired by
biological evolution
Evolution is the change in the heritable characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, resulting in certai ...
and aims to solve challenging problems at least
approximately. Selection has a dual purpose: on the one hand, it can choose individual genomes from a population for subsequent breeding (e.g., using the
crossover operator). In addition, selection mechanisms are also used to choose candidate solutions (individuals) for the next generation. The biological model is
natural selection
Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the Heredity, heritable traits characteristic of a population over generation ...
.
Retaining the best individual(s) of one generation unchanged in the next generation is called ''elitism'' or ''elitist selection''. It is a successful (slight) variant of the general process of constructing a new population.
The basis for selection is the quality of an individual, which is determined by the
fitness function
A fitness function is a particular type of objective or cost function that is used to summarize, as a single figure of merit, how close a given candidate solution is to achieving the set aims. It is an important component of evolutionary algorit ...
. In
memetic algorithms, an extension of EA, selection also takes place in the selection of those offspring that are to be improved with the help of a
meme
A meme (; ) is an idea, behavior, or style that Mimesis, spreads by means of imitation from person to person within a culture and often carries symbolic meaning representing a particular phenomenon or theme. A meme acts as a unit for carrying c ...
(e.g. a
heuristic
A heuristic or heuristic technique (''problem solving'', '' mental shortcut'', ''rule of thumb'') is any approach to problem solving that employs a pragmatic method that is not fully optimized, perfected, or rationalized, but is nevertheless ...
).
A selection procedure for breeding used early on may be implemented as follows:
#The fitness values that have been computed (
fitness function
A fitness function is a particular type of objective or cost function that is used to summarize, as a single figure of merit, how close a given candidate solution is to achieving the set aims. It is an important component of evolutionary algorit ...
) are normalized, such that the sum of all resulting fitness values equals 1.
#Accumulated normalized fitness values are computed: the accumulated fitness value of an individual is the sum of its own fitness value plus the fitness values of all the previous individuals; the accumulated fitness of the last individual should be 1, otherwise something went wrong in the normalization step.
#A random number ''R'' between 0 and 1 is chosen.
#The selected individual is the first one whose accumulated normalized value is greater than or equal to ''R''.
For many problems the above algorithm might be computationally demanding. A simpler and faster alternative uses the so-called stochastic acceptance.
If this procedure is repeated until there are enough selected individuals, this selection method is called
fitness proportionate selection or ''roulette-wheel selection''. If instead of a single pointer spun multiple times, there are multiple, equally spaced pointers on a wheel that is spun once, it is called
stochastic universal sampling.
Repeatedly selecting the best individual of a randomly chosen subset is
tournament selection. Taking the best half, third or another proportion of the individuals is
truncation selection.
There are other selection algorithms that do not consider all individuals for selection, but only those with a fitness value that is higher than a given (arbitrary) constant. Other algorithms select from a restricted pool where only a certain percentage of the individuals are allowed, based on fitness value.
Methods of selection
The listed methods differ mainly in the selection pressure, which can be set by a strategy parameter in the rank selection described below. The higher the selection pressure, the faster a population converges against a certain solution and the search space may not be explored sufficiently. This
premature convergence can be counteracted by
structuring the population appropriately.
There is a close correlation between the population model used and a suitable selection pressure.
If the pressure is too low, it must be expected that the population will not converge even after a long computing time. For more selection methods and further detail see.
Roulette wheel selection
In the
roulette wheel selection, the probability of choosing an individual for breeding of the next generation is proportional to its fitness, the better the fitness is, the higher chance for that individual to be chosen.
Choosing individuals can be depicted as spinning a roulette that has as many pockets as there are individuals in the current generation, with sizes depending on their probability.
Probability of choosing individual
is equal to
, where
is the fitness of
and
is the size of current generation (note that in this method one individual can be drawn multiple times).
Stochastic universal sampling
Stochastic universal sampling is a development of roulette wheel selection with minimal spread and no bias.
Rank selection
In rank selection, the probability for selection does not depend directly on the fitness, but on the fitness rank of an individual within the population.
[ The exact fitness values themselves do not have to be available, but only a sorting of the individuals according to quality.
In addition to the adjustable selection pressure, an advantage of rank-based selection can be seen in the fact that it also gives worse individuals a chance to reproduce and thus to improve.] This can be particularly helpful in applications with restrictions, since it facilitates the overcoming of a restriction in several intermediate steps, i.e. via a sequence of several individuals rated poorly due to restriction violations.
Linear rank selection
Linear ranking, which goes back to Baker, is often used. It allows the selection pressure to be set by the parameter , which can take values between 1.0 (no selection pressure) and 2.0 (high selection pressure). The probability for rank positions is obtained as follows:
:
Another definition for the probability for rank positions is:
:
Exponential rank selection
Exponential rank selection is defined as follows:[
]
Steady state selection
In every generation few chromosomes are selected (good - with high fitness) for creating a new offspring. Then some (bad - with low fitness) chromosomes are removed and the new offspring is placed in their place. The rest of population survives to new generation.
Tournament selection
Tournament selection is a method of choosing the individual from the set of individuals. The winner of each tournament is selected to perform crossover.
Truncation selection
For truncation selection, individuals are sorted according to their fitness and a portion (10% to 50%) of the top individuals is selected for next generation.[
]
Elitist selection
Often to get better results, strategies with partial reproduction are used. One of them is elitism, in which a small portion of the best individuals from the last generation is carried over (without any changes) to the next one.
Boltzmann selection
In Boltzmann selection, a continuously varying temperature controls the rate of selection according to a preset schedule. The temperature starts out high, which means that the selection pressure is low. The temperature is gradually lowered, which gradually increases the selection pressure, thereby allowing the GA to narrow in more closely to the best part of the search space while maintaining the appropriate degree of diversity.
See also
* Natural selection
Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the Heredity, heritable traits characteristic of a population over generation ...
* Sexual selection
Sexual selection is a mechanism of evolution in which members of one sex mate choice, choose mates of the other sex to mating, mate with (intersexual selection), and compete with members of the same sex for access to members of the opposite sex ...
References
External links
Introduction to Genetic Algorithms
{{Evolutionary computation
nl:Genetisch algoritme#Selectie