A Moran process or Moran model is a simple

stochastic process In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the family often has the interpretation of time. Sto ...

used in

biology Biology is the scientific study of life and living organisms. It is a broad natural science that encompasses a wide range of fields and unifying principles that explain the structure, function, growth, History of life, origin, evolution, and ...

to describe finite populations. The process is named after Patrick Moran, who first proposed the model in 1958. It can be used to model variety-increasing processes such as

mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...

as well as variety-reducing effects such as

genetic drift Genetic drift, also known as random genetic drift, allelic drift or the Wright effect, is the change in the Allele frequency, frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene va ...

and

natural selection Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the Heredity, heritable traits characteristic of a population over generation ...

. The process can describe the probabilistic dynamics in a finite population of constant size ''N'' in which two

allele An allele is a variant of the sequence of nucleotides at a particular location, or Locus (genetics), locus, on a DNA molecule. Alleles can differ at a single position through Single-nucleotide polymorphism, single nucleotide polymorphisms (SNP), ...

s A and B are competing for dominance. The two alleles are considered to be true replicators (i.e. entities that make copies of themselves). In each time step a random individual (which is of either type A or B) is chosen for reproduction and a random individual is chosen for death; thus ensuring that the population size remains constant. To model selection, one type has to have a higher fitness and is thus more likely to be chosen for reproduction. The same individual can be chosen for death and for reproduction in the same step.

Neutral drift

Neutral drift is the idea that a neutral mutation can spread throughout a population, so that eventually the original

is lost. A neutral mutation does not bring any fitness advantage or disadvantage to its bearer. The simple case of the Moran process can describe this phenomenon. The Moran process is defined on the state space which count the number of A individuals. Since the number of A individuals can change at most by one at each time step, a transition exists only between state ''i'' and state and . Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are :

\begin

P_ &= \frac \frac\\
P_ &= 1- P_ - P_\\
P_ &= \frac \frac\\

\end

The entry

P_

denotes the probability to go from state ''i'' to state ''j''. To understand the formulas for the transition probabilities one has to look at the definition of the process which states that always one individual will be chosen for reproduction and one is chosen for death. Once the A individuals have died out, they will never be reintroduced into the population since the process does not model

s (A cannot be reintroduced into the population once it has died out and ''vice versa'') and thus

P_=1

. For the same reason the population of A individuals will always stay ''N'' once they have reached that number and taken over the population and thus

P_=1

. The states 0 and ''N'' are called ''absorbing'' while the states are called ''transient''. The intermediate transition probabilities can be explained by considering the first term to be the probability to choose the individual whose abundance will increase by one and the second term the probability to choose the other type for death. Obviously, if the same type is chosen for reproduction and for death, then the abundance of one type does not change. Eventually the population will reach one of the absorbing states and then stay there forever. In the transient states, random fluctuations will occur but eventually the population of A will either go extinct or reach fixation. This is one of the most important differences to deterministic processes which cannot model random events. The

expected value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...

and the

variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...

of the number of A individuals at timepoint ''t'' can be computed when an initial state is given: :

&= i \\ \operatorname(X(t)\mid X(0) = i) &= \tfrac \left(1-\tfrac \right ) \frac \end

The probability that A reaches fixation is called ''fixation probability''. For the simple Moran process this probability is Since all individuals have the same fitness, they also have the same chance of becoming the ancestor of the whole population; this probability is and thus the sum of all ''i'' probabilities (for all A individuals) is just The mean time to absorption starting in state ''i'' is given by :

k_i = N \left \sum_^ \frac + \sum_^ \frac \right /math>



For large ''N'' the approximation

: \lim_ k_i \approx -N^2 \left (1-x_i) \ln(1-x_i) + x_i \ln(x_i) \right /math>
holds.

Selection

If one allele has a fitness advantage over the other allele, it will be more likely to be chosen for reproduction. This can be incorporated into the model if individuals with

A have fitness

f_i>0

and individuals with

B have fitness

g_i>0

where

i

is the number of individuals of type A; thus describing a general birth-death process. The transition matrix of the stochastic process is tri-diagonal in shape. Let

r_i:=f_i/g_i

, then the transition probabilities are :

\begin

P_ &= \frac \cdot \frac=\frac\cdot\frac\\
P_ &= 1- P_ - P_\\
P_ &= \frac \cdot \frac=\frac\cdot\frac\\

\end

The entry

P_

denotes the probability to go from state ''i'' to state ''j''. The difference to neutral selection above is now that a mutant, i.e. an individual with allele A, is selected for procreation with probability :

\frac,

and an individual with allele B is chosen with probability :

\frac,

when the number of individuals with allele A is exactly i. Also in this case, fixation probabilities when starting in state ''i'' is defined by the recurrence :

x_i = \begin 0 & i=0\\ \beta_i x_+(1-\alpha_i-\beta_i)x_i+\alpha_ix_ & 1 \leq i \leq N-1\\ 1 & i =N
\end

And the closed form is given by :

x_i = \frac  \qquad \text

where

\gamma_i = P_ / P_

per definition and will just be

g_i / f_i

for the general case. This general case where the fitness of A and B depends on the abundance of each type is studied in

evolutionary game theory Evolutionary game theory (EGT) is the application of game theory to evolving populations in biology. It defines a framework of contests, strategies, and analytics into which Darwinism, Darwinian competition can be modelled. It originated in 1973 wi ...

. Less complex results are obtained if a constant fitness ratio

r=1/\gamma_i

, for all i, is assumed. Individuals of type A reproduce with a constant rate ''r'' and individuals with allele B reproduce with rate 1. Thus if A has a fitness advantage over B, ''r'' will be larger than one, otherwise it will be smaller than one. Thus the transition matrix of the stochastic process is tri-diagonal in shape and the transition probabilities are :

\begin
P_&=1\\
P_ &= \frac \cdot \frac=\frac\cdot\frac
\\
P_ &= 1- P_ - P_\\
P_ &= \frac \cdot \frac=\frac\cdot\frac\\
P_&=1.
\end

In this case

\gamma_i = 1/r

is a constant factor for each composition of the population and thus the fixation probability from equation (1) simplifies to :

x_i = \frac  \quad \Rightarrow \quad x_1 = \rho = \frac  \qquad \text

where the fixation probability of a single mutant ''A'' in a population of otherwise all ''B'' is often of interest and is denoted by . Also in the case of selection, the expected value and the variance of the number of ''A'' individuals may be computed :

&= p s \dfrac + i \\ \operatorname( X(t+1) \mid X(t)=i) &=p(1-p)\dfrac \end

where and .

Rate of evolution

In a population of all ''B'' individuals, a single mutant ''A'' will take over the whole population with the probability :

\rho = \frac. \qquad \text

If the

mutation rate In genetics, the mutation rate is the frequency of new mutations in a single gene, nucleotide sequence, or organism over time. Mutation rates are not constant and are not limited to a single type of mutation; there are many different types of mu ...

(to go from the ''B'' to the ''A'' allele) in the population is ''u'' then the rate with which one member of the population will mutate to ''A'' is given by and the rate with which the whole population goes from all ''B'' to all ''A'' is the rate that a single mutant ''A'' arises times the probability that it will take over the population (''fixation probability''): :

R = N \cdot u \cdot \rho = u  \quad \text \quad  \rho = \frac.

Thus if the mutation is neutral (i.e. the ''fixation probability'' is just 1/''N'') then the rate with which an allele arises and takes over a population is independent of the population size and is equal to the mutation rate. This important result is the basis of the neutral theory of evolution and suggests that the number of observed point mutations in the

genome A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...

s of two different

species A species () is often defined as the largest group of organisms in which any two individuals of the appropriate sexes or mating types can produce fertile offspring, typically by sexual reproduction. It is the basic unit of Taxonomy (biology), ...

would simply be given by the mutation rate multiplied by two times the time since

divergence In vector calculus, divergence is a vector operator that operates on a vector field, producing a scalar field giving the rate that the vector field alters the volume in an infinitesimal neighborhood of each point. (In 2D this "volume" refers to ...

. Thus the neutral theory of evolution provides a

molecular clock The molecular clock is a figurative term for a technique that uses the mutation rate of biomolecules to deduce the time in prehistory when two or more life forms diverged. The biomolecular data used for such calculations are usually nucleot ...

, given that the assumptions are fulfilled which may not be the case in reality.

References

External links

* {{Stochastic processes Evolutionary dynamics Stochastic processes Population genetics

Neutral drift

Selection

Rate of evolution

See also

References

Further reading

External links