Genetic distance (individuals)
   HOME

TheInfoList



OR:

Genetic distance is a measure of the genetic divergence between
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
or between
populations Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using a ...
within a species, whether the distance measures time from
common ancestor Common descent is a concept in evolutionary biology applicable when one species is the ancestor of two or more species later in time. All living beings are in fact descendants of a unique ancestor commonly referred to as the last universal comm ...
or degree of differentiation. Populations with many similar
allele An allele (, ; ; modern formation from Greek ἄλλος ''állos'', "other") is a variation of the same sequence of nucleotides at the same place on a long DNA molecule, as described in leading textbooks on genetics and evolution. ::"The chro ...
s have small genetic distances. This indicates that they are closely related and have a recent common ancestor. Genetic distance is useful for reconstructing the history of populations, such as the multiple human expansions
out of Africa ''Out of Africa'' is a memoir by the Danish author Karen Blixen. The book, first published in 1937, recounts events of the seventeen years when Blixen made her home in Kenya, then called British East Africa. The book is a lyrical meditation on ...
. It is also used for understanding the origin of
biodiversity Biodiversity or biological diversity is the variety and variability of life on Earth. Biodiversity is a measure of variation at the genetic (''genetic variability''), species (''species diversity''), and ecosystem (''ecosystem diversity'') l ...
. For example, the genetic distances between different breeds of domesticated animals are often investigated in order to determine which breeds should be protected to maintain genetic diversity.


Biological foundation

In the genome of an
organism In biology, an organism () is any living system that functions as an individual entity. All organisms are composed of cells (cell theory). Organisms are classified by taxonomy into groups such as multicellular animals, plants, and ...
, each
gene In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
is located at a specific place called the
locus Locus (plural loci) is Latin for "place". It may refer to: Entertainment * Locus (comics), a Marvel Comics mutant villainess, a member of the Mutant Liberation Front * ''Locus'' (magazine), science fiction and fantasy magazine ** ''Locus Award' ...
for that gene. Allelic variations at these loci cause phenotypic variation within species (e.g. hair colour, eye colour). However, most alleles do not have an observable impact on the phenotype. Within a population new alleles generated by mutation either die out or spread throughout the population. When a population is split into different isolated populations (by either geographical or ecological factors), mutations that occur after the split will be present only in the isolated population. Random fluctuation of allele frequencies also produces genetic differentiation between populations. This process is known as
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
. By examining the differences between
allele frequencies Allele frequency, or gene frequency, is the relative frequency of an allele (variant of a gene) at a particular locus in a population, expressed as a fraction or percentage. Specifically, it is the fraction of all chromosomes in the population that ...
between the populations and computing genetic distance, we can estimate how long ago the two populations were separated.


Measures

Although it is simple to define genetic distance as a measure of genetic divergence, there are several different statistical measures that have been proposed. This has happened because different authors considered different evolutionary models. The most commonly used are Nei's genetic distance, Cavalli-Sforza and Edwards measure, and Reynolds, Weir and Cockerham's genetic distance, listed below. In all the formulae in this section, X and Y represent two different populations for which L loci have been studied. Let X_ represent the uth allele frequency at the lth locus.


Nei's standard genetic distance

In 1972,
Masatoshi Nei (born January 2, 1931) is a Japanese-born American evolutionary biologist currently affiliated with the Department of Biology at Temple University as a Carnell Professor. He was, until recently, Evan Pugh Professor of Biology at Pennsylvania S ...
published what came to be known as Nei's standard genetic distance. This distance has the nice property that if the rate of genetic change (amino acid substitution) is constant per year or generation then Nei's standard genetic distance (''D'') increases in proportion to divergence time. This measure assumes that genetic differences are caused by
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral repl ...
and
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
. : D=-\ln\frac This distance can also be expressed in terms of the arithmetic mean of gene identity. Let j_X be the probability for the two members of population X having the same allele at a particular locus and j_Y be the corresponding probability in population Y. Also, let j_ be the probability for a member of X and a member of Y having the same allele. Now let J_X, J_Y and J_ represent the arithmetic mean of j_X, j_Y and j_ over all loci, respectively. In other words, : J_X=\sum_u \frac : J_Y=\sum_u \frac : J_=\sum_\ell \sum_u \frac where L is the total number of loci examined. Nei's standard distance can then be written as : D = -\ln \frac


Cavalli-Sforza chord distance

In 1967
Luigi Luca Cavalli-Sforza Luigi Luca Cavalli-Sforza (; 25 January 1922 – 31 August 2018) was an Italian geneticist. He was a population geneticist who taught at the University of Parma, the University of Pavia and then at Stanford University. Works Schooling and po ...
and
A. W. F. Edwards Anthony William Fairbank Edwards, FRS (born 1935) is a British statistician, geneticist and evolutionary biologist. He is the son of the surgeon Harold C. Edwards, and brother of medical geneticist John H. Edwards. He has sometimes been called ...
published this measure. It assumes that genetic differences arise due to
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
only. One major advantage of this measure is that the populations are represented in a hypersphere, the scale of which is one unit per gene substitution. The
chord distance Chord may refer to: * Chord (music), an aggregate of musical pitches sounded simultaneously ** Guitar chord a chord played on a guitar, which has a particular tuning * Chord (geometry), a line segment joining two points on a curve * Chord (a ...
in the hyperdimensional sphere is given by : D_\text = \frac \sqrt Some authors drop the factor \frac to simplify the formula at the cost of losing the property that the scale is one unit per gene substitution.


Reynolds, Weir, and Cockerham's genetic distance

In 1983, this measure was published by John Reynolds, Bruce Weir and
C. Clark Cockerham Columbus Clark Cockerham (December 21, 1921 – November 4, 1996) was an American statistical geneticist known for his work in quantitative genetics. Early life and education Cockerham was born on December 21, 1921 in Mountain Park, North Carol ...
. This measure assumes that genetic differentiation occurs only by
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
without mutations. It estimates the coancestry coefficient \Theta which provides a measure of the genetic divergence by: : \Theta_w=\sqrt


Other measures

Many other measures of genetic distance have been proposed with varying success.


Nei's ''D''''A'' distance 1983

This distance assumes that genetic differences arise due to
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral repl ...
and
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
, but this distance measure is known to give more reliable population trees than other distances particularly for microsatellite DNA data. : D_A = 1-\sum_\ell \sum_u \sqrt/


Euclidean distance

: D_=\sqrt


Goldstein distance 1995

It was specifically developed for microsatellite markers and is based on the stepwise-mutation model (SMM). \mu_X and \mu_Y are the means of the allele sizes in population X and Y. : (\delta\mu)^2=\sum_\ell \fracL


Nei's minimum genetic distance 1973

This measure assumes that genetic differences arise due to
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral repl ...
and
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
. : D_m=\frac-J_


Roger's distance 1972

: D_R = \frac\sqrt\frac Rogers, J. S. (1972). Measures of similarity and genetic distance. In ''Studies in Genetics'' VII. pp. 145−153. University of Texas Publication 7213. Austin, Texas.


Fixation index

A commonly used measure of genetic distance is the
fixation index The fixation index (FST) is a measure of population differentiation due to genetic structure. It is frequently estimated from genetic polymorphism data, such as single-nucleotide polymorphisms (SNP) or microsatellites. Developed as a special case o ...
(FST) which varies between 0 and 1. A value of 0 indicates that two populations are genetically identical (minimal or no genetic diversity between the two populations) whereas a value of 1 indicates that two populations are genetically different (maximum genetic diversity between the two populations). No mutation is assumed. Large populations between which there is much migration, for example, tend to be little differentiated whereas small populations between which there is little migration tend to be greatly differentiated. FST is a convenient measure of this differentiation, and as a result FST and related statistics are among the most widely used descriptive statistics in population and evolutionary genetics. But FST is more than a descriptive statistic and measure of genetic differentiation. FST is directly related to the ''Variance'' in allele frequency among populations and conversely to the degree of resemblance among individuals within populations. If FST is small, it means that allele frequencies within each population are very similar; if it is large, it means that allele frequencies are very different.


Software

*
PHYLIP PHYLogeny Inference Package (PHYLIP) is a free computational phylogenetics package of programs for inferring evolutionary trees (Phylogenetics, phylogenies). It consists of 65 Porting, portable programs, i.e., the source code is written in the prog ...
use
GENDIST
** Nei's standard genetic distance 1972 ** Cavalli-Sforza and Edwards 1967 ** Reynolds, Weir, and Cockerham's 1983
TFPGA
** Nei's standard genetic distance (original and unbiased) ** Nei's minimum genetic distance (original and unbiased) ** Wright's (1978) modification of Roger's (1972) distance ** Reynolds, Weir, and Cockerham's 1983
GDA

POPGENE


** Commonly used genetic distances and gene diversity analysis

** Nei's standard genetic distance 1972 ** Nei's DA distance between populations 1983


See also

*
Coefficient of relationship The coefficient of relationship is a measure of the degree of consanguinity (or biological relationship) between two individuals. The term coefficient of relationship was defined by Sewall Wright in 1922, and was derived from his definition of th ...
*
Degree of consanguinity Consanguinity ("blood relation", from Latin '' consanguinitas'') is the characteristic of having a kinship with another person (being descended from a common ancestor). Many jurisdictions have laws prohibiting people who are related by blood fro ...
*
Human genetic variation Human genetic variation is the genetic differences in and among populations. There may be multiple variants of any given gene in the human population (alleles), a situation called polymorphism. No two humans are genetically identical. Even ...
*
Phylogenetics In biology, phylogenetics (; from Greek φυλή/ φῦλον [] "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary history and relationships among or within groups ...
*
Allele frequency Allele frequency, or gene frequency, is the relative frequency of an allele (variant of a gene) at a particular locus in a population, expressed as a fraction or percentage. Specifically, it is the fraction of all chromosomes in the population tha ...


References


External links


''The Estimation of Genetic Distance and Population Substructure from Microsatellite allele frequency data.'', Brent W. Murray (May 1996), McMaster University website on genetic distance


{{DEFAULTSORT:Genetic Distance Genetics concepts Phylogenetics