Complex Traits
   HOME

TheInfoList



OR:

Complex traits are phenotypes that are controlled by two or more genes and do not follow Mendel's Law of Dominance. They may have a range of expression which is typically continuous. Both environmental and genetic factors often impact the variation in expression.
Human height Human height or stature is the distance from the bottom of the feet to the top of the head in a human body, standing erect. It is measured using a stadiometer, in centimetres when using the metric system or SI system, or feet and inches when ...
is a continuous trait meaning that there is a wide range of heights. There are an estimated 50 genes that affect the height of a human. Environmental factors, like nutrition, also play a role in a human's height. Other examples of complex traits include: crop yield, plant color, and many diseases including diabetes and
Parkinson's disease Parkinson's disease (PD), or simply Parkinson's, is a neurodegenerative disease primarily of the central nervous system, affecting both motor system, motor and non-motor systems. Symptoms typically develop gradually and non-motor issues become ...
. One major goal of genetic research today is to better understand the molecular mechanisms through which genetic variants act to influence complex traits. Complex traits are also known as polygenic traits and multigenic traits. The existence of complex traits, which are far more common than Mendelian traits, represented a significant challenge to the acceptance of Mendel's work. Modern understanding has 3 categories of complex traits: quantitative, meristic, and threshold. These traits have been studied on a small scale with observational techniques like twin studies. They are also studied with statistical techniques like
quantitative trait loci A quantitative trait locus (QTL) is a Locus (genetics), locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a Population genetics, population of organisms. QTLs are mapped by identifying which molecula ...
(QTL) mapping, and
genome-wide association studies In genomics, a genome-wide association study (GWA study, or GWAS), is an observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWA studies typically focus on assoc ...
(GWAS) on a large scale. The overall goal of figuring out how genes interact with each other and the environment and how those interactions can lead to variation in a trait is called genetic architecture.


History

When Mendel's work on inheritance was rediscovered in 1900, scientists debated whether Mendel's laws could account for the continuous variation observed for many traits. One group known as the biometricians argued that continuous traits such as height were largely
heritable Heredity, also called inheritance or biological inheritance, is the passing on of Phenotypic trait, traits from parents to their offspring; either through asexual reproduction or sexual reproduction, the offspring cell (biology), cells or orga ...
, but could not be explained by the inheritance of single Mendelian genetic factors. Work published by
Ronald Fisher Sir Ronald Aylmer Fisher (17 February 1890 – 29 July 1962) was a British polymath who was active as a mathematician, statistician, biologist, geneticist, and academic. For his work in statistics, he has been described as "a genius who a ...
in 1919 mostly resolved debate by demonstrating that the variation in continuous traits could be accounted for if multiple such factors contributed additively to each trait. However, the number of genes involved in such traits remained undetermined; until recently,
genetic loci In genetics, a locus (: loci) is a specific, fixed position on a chromosome where a particular gene or genetic marker is located. Each chromosome carries many genes, with each gene occupying a different position or locus; in humans, the total numb ...
were expected to have moderate effect sizes and each explain several percent of heritability. After the conclusion of the
Human Genome Project The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a ...
in 2001, it seemed that the
sequencing In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
and mapping of many individuals would soon allow for a complete understanding of traits'
genetic architecture Genetic architecture is the underlying genetic basis of a phenotypic trait and its variational properties. Phenotypic variation for quantitative traits is, at the most basic level, the result of the segregation of alleles at quantitative trait ...
s. However,
variants Variant may refer to: Arts and entertainment * ''Variant'' (magazine), a former British cultural magazine * Variant cover, an issue of comic books with varying cover art * ''Variant'' (novel), a novel by Robison Wells * " The Variant", 2021 epis ...
discovered through
genome-wide association studies In genomics, a genome-wide association study (GWA study, or GWAS), is an observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWA studies typically focus on assoc ...
(GWASs) accounted for only a small percentage of predicted heritability; for example, while height is estimated to be 80–90% heritable, early studies only identified variants accounting for 5% of this heritability. Later research showed that most missing heritability could be accounted for by common variants missed by GWASs because their effect sizes fell below significance thresholds; a smaller percentage is accounted for by rare variants with larger effect sizes, although in certain traits such as
autism Autism, also known as autism spectrum disorder (ASD), is a neurodevelopmental disorder characterized by differences or difficulties in social communication and interaction, a preference for predictability and routine, sensory processing d ...
, rare variants play a more dominant role. While many genetic factors involved in complex traits have been identified, determining their specific contributions to phenotypes—specifically, the molecular mechanisms through which they act—remains a major challenge.


Types of complex traits


Quantitative traits

Quantitative traits have phenotypes that are expressed on continuous ranges. They have many different genes that impact the phenotype, with differing effect sizes. Many of these traits are somewhat heritable. For example, height is estimated to be 60–80% heritable; however, other quantitative traits have varying heritability.


Meristic traits

Meristic traits have phenotypes that are described by whole numbers. An example is the rate chickens lay eggs. A chicken can lay one, two, or five eggs a week, but never half an egg. The environment can also impact expression, as chickens will not lay as many eggs depending on the time of year.


Threshold traits

Threshold traits have phenotypes that have limited expressions (usually two). It is a complex trait because multiple genetic and environmental factors impact the phenotype. The phenotype before the threshold is referred to as normal or absent, and after the threshold as lethal or present. These traits are often examined in a medical context, because many diseases exhibit this pattern or similar. An example of this is
type 2 diabetes Type 2 diabetes (T2D), formerly known as adult-onset diabetes, is a form of diabetes mellitus that is characterized by high blood sugar, insulin resistance, and relative lack of insulin. Common symptoms include increased thirst, frequent ...
, the phenotype is either normal/healthy or lethal/diseased.


Methods for finding complex traits


Twin studies

Twin studies Twin studies are studies conducted on Identical twin, identical or Fraternal twin, fraternal twins. They aim to reveal the importance of environmental and genetics, genetic influences for traits, phenotypes, and disorders. Twin research is consid ...
is an observational test using
monozygotic twins Twins are two offspring produced by the same pregnancy.MedicineNet > Definition of Twin Last Editorial Review: 19 June 2000 Twins can be either ''monozygotic'' ('identical'), meaning that they develop from one zygote, which splits and forms two e ...
and
dizygotic twins Twins are two offspring produced by the same pregnancy.MedicineNet > Definition of Twin Last Editorial Review: 19 June 2000 Twins can be either ''monozygotic'' ('identical'), meaning that they develop from one zygote, which splits and forms two e ...
, preferably same sex. They are used to figure out the environmental influence on complex traits. Monozygotic twins in particular are estimated to share 100% of their DNA with each other so any phenotypic differences should be caused by environmental influences.


QTL mapping

Many complex traits are genetically determined by quantitative trait loci (QTL). A Quantitative Trait Loci analysis can be used to find regions on the
genome sequence A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
that are associated with a complex trait. To find these regions, researchers will select a trait of interest and take a group of individuals of a species with varying expressions of this trait. They will label the individuals as founding parents and attempt to measure the trait. This can be difficult as most traits do not have a direct cut off point. Researchers will then
genotype The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a ...
the parents using molecular markers such as
SNPs In genetics and bioinformatics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in ...
or RFLPs. These act as signposts pointing to an area of where the genes associated with a trait are. From there, the parents are crossed to produce offspring. These offspring are then made to produce new offspring, but who they breed with can vary. They can either reproduce with their siblings, with themselves (different from asexual reproduction), or
backcross Backcrossing is a crossing of a hybrid with one of its parents or an individual genetically similar to its parent, to achieve offspring with a genetic identity closer to that of the parent. It is used in horticulture, animal breeding, and produc ...
. After this, a new generation is produced that are more genetically diverse. This is due to recombination. The genotype and phenotype of this new generation are measured and compared with the molecular markers to identify which
alleles An allele is a variant of the sequence of nucleotides at a particular location, or locus, on a DNA molecule. Alleles can differ at a single position through single nucleotide polymorphisms (SNP), but they can also have insertions and deletions ...
are associated with the trait. This does not mean there is a direct causal relationship between these regions and the trait, but it does give insight that there are genes that do have some relationship with the trait and reveals where to look in future research.


GWAS

A
Genome-Wide Association Study In genomics, a genome-wide association study (GWA study, or GWAS), is an observational study of a genome-wide set of Single-nucleotide polymorphism, genetic variants in different individuals to see if any variant is associated with a trait. GWA s ...
(GWAS) is a technique used to find gene variants linked to complex traits. A GWAS is done with populations that mate randomly because all the genetic variants are tested at once. Then researchers can compare the different alleles at a locus. It is similar to
QTL mapping A quantitative trait locus (QTL) is a locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers (such as SNPs or AFLPs) co ...
. The most common set-up for a GWAS is a case study which creates two populations one with the trait we are looking at and one without the trait. With the two populations researchers will map every subject's genome and compare them to find different variance in the
SNPs In genetics and bioinformatics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in ...
between the two populations. Both populations should have similar environmental backgrounds. GWAS is only looking at the DNA and does not include differences that would be caused by environmental factors.Statistical test, such as a chi squared is used to find if there is association with the trait and each of the SNPs tested. The statistical test produces a
p-value In null-hypothesis significance testing, the ''p''-value is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct. A very small ''p''-value means ...
which the researcher will use to conclude if the SNP is significant. This p-value cut off can range from being a higher number or a lower number at the researcher's discretion. The data can then be visualized in a
Manhattan plot A Manhattan plot is a type of plot, usually used to display data with a large number of data-points, many of non-zero amplitude, and with a distribution of higher-magnitude values. The plot is commonly used in genome-wide association studies (GWA ...
which takes the -log (p-value) so all the significant SNPs are at the top of the graph.


Genetic architecture

Genetic architecture Genetic architecture is the underlying genetic basis of a phenotypic trait and its variational properties. Phenotypic variation for quantitative traits is, at the most basic level, the result of the segregation of alleles at quantitative trait ...
is an overall explanation of all the genetic factors that play a role in a complex trait and exists as the core foundation of
quantitative genetics Quantitative genetics is the study of quantitative traits, which are phenotypes that vary continuously—such as height or mass—as opposed to phenotypes and gene-products that are Categorical variable, discretely identifiable—such as eye-col ...
. With the use of mathematical models and statistical analysis, like GWAS, researchers can determine the number of genes affecting a trait as well as the level of influence each gene has on the trait. This is not always easy as the architecture of one trait can be different between two separate populations of the same species. This can be due to the fact that both populations live in different environments. Differing environments can lead to different interactions between genes and the environment, changing the architecture of both populations. Recently, with rapid increases in available genetic data, researchers have begun to characterize the genetic architecture of complex traits better. One surprise has been the observation that most loci identified in GWASs are found in
noncoding Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules (e.g. transfer RNA, microRNA, piRNA, ribosomal RNA, and regula ...
regions of the genome; therefore, instead of directly altering
protein sequences Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthes ...
, such variants likely affect
gene regulation Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wide ...
. To understand the precise effects of these variants, QTL mapping has been employed to examine data from each step of gene regulation; for example, mapping
RNA-sequencing RNA-Seq (named as an abbreviation of RNA sequencing) is a technique that uses next-generation sequencing to reveal the presence and quantity of RNA molecules in a biological sample, providing a snapshot of gene expression in the sample, also kn ...
data can help determine the effects of variants on
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of Protein biosynthesis, synthesizing a protein. mRNA is ...
expression levels, which then presumably affect the numbers of proteins translated. A comprehensive analysis of QTLs involved in various regulatory steps— promotor activity, transcription rates, mRNA expression levels,
translation Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
levels, and protein expression levels—showed that high proportions of QTLs are shared, indicating that regulation behaves as a "sequential ordered cascade" with variants affecting all levels of regulation. Many of these variants act by affecting
transcription factor In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription (genetics), transcription of genetics, genetic information from DNA to messenger RNA, by binding t ...
binding and other processes that alter
chromatin Chromatin is a complex of DNA and protein found in eukaryote, eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important r ...
function—steps which occur before and during RNA transcription. To determine the functional consequences of these variants, researchers have largely focused on identifying key genes, pathways, and processes that drive complex trait behavior; an inherent assumption has been that the most statistically significant variants have the greatest impact on traits because they act by affecting these key drivers. For example, one study hypothesizes that there exist rate-limiting genes pivotal to the function of
gene regulatory network A gene (or genetic) regulatory network (GRN) is a collection of molecular regulators that interact with each other and with other substances in the cell to govern the gene expression levels of mRNA and proteins which, in turn, determine the fu ...
s. Others studies have identified the functional impacts of key genes and mutations on disorders, including autism and
schizophrenia Schizophrenia () is a mental disorder characterized variously by hallucinations (typically, Auditory hallucination#Schizophrenia, hearing voices), delusions, thought disorder, disorganized thinking and behavior, and Reduced affect display, f ...
. However, a 2017 analysis by Boyle et al. argues that while genes which directly impact complex traits do exist, regulatory networks are so interconnected that any expressed gene affects the functions of these "core" genes; this idea is called the " omnigenic" hypothesis. While these "peripheral" genes each have small effects, their combined impact far exceeds the contributions of core genes themselves. To support the hypothesis that core genes play a smaller than expected role, the authors describe three main observations: the heritability for complex traits is spread broadly, often uniformly, across the genome; genetic effects do not appear to be mediated by cell-type specific function; and genes in the relevant functional categories only modestly contribute more to heritability than other genes. One alternative to the omnigenic hypothesis is the idea that peripheral genes act not by altering core genes but by altering cellular states, such as the speed of cell division or hormone response.


References

{{Reflist, 32em, refs = {{cite journal , vauthors = Marouli E, Graff M, Medina-Gomez C, Lo KS, Wood AR, Kjaer TR, Fine RS, Lu Y, Schurmann C, Highland HM, Rüeger S, Thorleifsson G, Justice AE, Lamparter D, Stirrups KE, Turcot V, Young KL, Winkler TW, Esko T, Karaderi T, Locke AE, Masca NG, Ng MC, Mudgal P, Rivas MA, Vedantam S, Mahajan A, Guo X, Abecasis G, Aben KK, Adair LS, Alam DS, Albrecht E, Allin KH, Allison M, Amouyel P, Appel EV, Arveiler D, Asselbergs FW, Auer PL, Balkau B, Banas B, Bang LE, Benn M, Bergmann S, Bielak LF, Blüher M, Boeing H, Boerwinkle E, Böger CA, Bonnycastle LL, Bork-Jensen J, Bots ML, Bottinger EP, Bowden DW, Brandslund I, Breen G, Brilliant MH, Broer L, Burt AA, Butterworth AS, Carey DJ, Caulfield MJ, Chambers JC, Chasman DI, Chen YI, Chowdhury R, Christensen C, Chu AY, Cocca M, Collins FS, Cook JP, Corley J, Galbany JC, Cox AJ, Cuellar-Partida G, Danesh J, Davies G, de Bakker PI, de Borst GJ, de Denus S, de Groot MC, de Mutsert R, Deary IJ, Dedoussis G, Demerath EW, den Hollander AI, Dennis JG, Di Angelantonio E, Drenos F, Du M, Dunning AM, Easton DF, Ebeling T, Edwards TL, Ellinor PT, Elliott P, Evangelou E, Farmaki AE, Faul JD, Feitosa MF, Feng S, Ferrannini E, Ferrario MM, Ferrieres J, Florez JC, Ford I, Fornage M, Franks PW, Frikke-Schmidt R, Galesloot TE, Gan W, Gandin I, Gasparini P, Giedraitis V, Giri A, Girotto G, Gordon SD, Gordon-Larsen P, Gorski M, Grarup N, Grove ML, Gudnason V, Gustafsson S, Hansen T, Harris KM, Harris TB, Hattersley AT, Hayward C, He L, Heid IM, Heikkilä K, Helgeland Ø, Hernesniemi J, Hewitt AW, Hocking LJ, Hollensted M, Holmen OL, Hovingh GK, Howson JM, Hoyng CB, Huang PL, Hveem K, Ikram MA, Ingelsson E, Jackson AU, Jansson JH, Jarvik GP, Jensen GB, Jhun MA, Jia Y, Jiang X, Johansson S, Jørgensen ME, Jørgensen T, Jousilahti P, Jukema JW, Kahali B, Kahn RS, Kähönen M, Kamstrup PR, Kanoni S, Kaprio J, Karaleftheri M, Kardia SL, Karpe F, Kee F, Keeman R, Kiemeney LA, Kitajima H, Kluivers KB, Kocher T, Komulainen P, Kontto J, Kooner JS, Kooperberg C, Kovacs P, Kriebel J, Kuivaniemi H, Küry S, Kuusisto J, La Bianca M, Laakso M, Lakka TA, Lange EM, Lange LA, Langefeld CD, Langenberg C, Larson EB, Lee IT, Lehtimäki T, Lewis CE, Li H, Li J, Li-Gao R, Lin H, Lin LA, Lin X, Lind L, Lindström J, Linneberg A, Liu Y, Liu Y, Lophatananon A, Luan J, Lubitz SA, Lyytikäinen LP, Mackey DA, Madden PA, Manning AK, Männistö S, Marenne G, Marten J, Martin NG, Mazul AL, Meidtner K, Metspalu A, Mitchell P, Mohlke KL, Mook-Kanamori DO, Morgan A, Morris AD, Morris AP, Müller-Nurasyid M, Munroe PB, Nalls MA, Nauck M, Nelson CP, Neville M, Nielsen SF, Nikus K, Njølstad PR, Nordestgaard BG, Ntalla I, O'Connel JR, Oksa H, Loohuis LM, Ophoff RA, Owen KR, Packard CJ, Padmanabhan S, Palmer CN, Pasterkamp G, Patel AP, Pattie A, Pedersen O, Peissig PL, Peloso GM, Pennell CE, Perola M, Perry JA, Perry JR, Person TN, Pirie A, Polasek O, Posthuma D, Raitakari OT, Rasheed A, Rauramaa R, Reilly DF, Reiner AP, Renström F, Ridker PM, Rioux JD, Robertson N, Robino A, Rolandsson O, Rudan I, Ruth KS, Saleheen D, Salomaa V, Samani NJ, Sandow K, Sapkota Y, Sattar N, Schmidt MK, Schreiner PJ, Schulze MB, Scott RA, Segura-Lepe MP, Shah S, Sim X, Sivapalaratnam S, Small KS, Smith AV, Smith JA, Southam L, Spector TD, Speliotes EK, Starr JM, Steinthorsdottir V, Stringham HM, Stumvoll M, Surendran P, 't Hart LM, Tansey KE, Tardif JC, Taylor KD, Teumer A, Thompson DJ, Thorsteinsdottir U, Thuesen BH, Tönjes A, Tromp G, Trompet S, Tsafantakis E, Tuomilehto J, Tybjaerg-Hansen A, Tyrer JP, Uher R, Uitterlinden AG, Ulivi S, van der Laan SW, Van Der Leij AR, van Duijn CM, van Schoor NM, van Setten J, Varbo A, Varga TV, Varma R, Edwards DR, Vermeulen SH, Vestergaard H, Vitart V, Vogt TF, Vozzi D, Walker M, Wang F, Wang CA, Wang S, Wang Y, Wareham NJ, Warren HR, Wessel J, Willems SM, Wilson JG, Witte DR, Woods MO, Wu Y, Yaghootkar H, Yao J, Yao P, Yerges-Armstrong LM, Young R, Zeggini E, Zhan X, Zhang W, Zhao JH, Zhao W, Zhao W, Zheng H, Zhou W, Rotter JI, Boehnke M, Kathiresan S, McCarthy MI, Willer CJ, Stefansson K, Borecki IB, Liu DJ, North KE, Heard-Costa NL, Pers TH, Lindgren CM, Oxvig C, Kutalik Z, Rivadeneira F, Loos RJ, Frayling TM, Hirschhorn JN, Deloukas P, Lettre G , display-authors = 6 , title = Rare and low-frequency coding variants alter human adult height , journal = Nature , volume = 542 , issue = 7640 , pages = 186–190 , date = February 2017 , pmid = 28146470 , pmc = 5302847 , doi = 10.1038/nature21039 , bibcode = 2017Natur.542..186M Quantitative genetics