Nested association mapping (NAM) is a technique designed by the labs of
Edward BucklerJames Holland an
for identifying and dissecting the genetic architecture of
complex traits
Complex traits are phenotypes that are controlled by two or more genes and do not follow Mendel's Law of Dominance. They may have a range of expression which is typically continuous. Both environmental and genetic factors often impact the variat ...
in corn (''
Zea mays
Maize (; ''Zea mays''), also known as corn in North American English, is a tall stout Poaceae, grass that produces cereal grain. It was domesticated by indigenous peoples of Mexico, indigenous peoples in southern Mexico about 9,000 years ago ...
''). It is important to note that nested association mapping (unlike
association mapping) is a specific technique that cannot be performed outside of a specifically designed population such as the Maize NAM population,
the details of which are described below.
Theory behind NAM
NAM was created as a means of combining the advantages and eliminating the disadvantages of two traditional methods for identifying
quantitative trait loci
A quantitative trait locus (QTL) is a Locus (genetics), locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a Population genetics, population of organisms. QTLs are mapped by identifying which molecula ...
:
linkage analysis and association mapping. Linkage analysis depends upon recent
genetic recombination
Genetic recombination (also known as genetic reshuffling) is the exchange of genetic material between different organisms which leads to production of offspring with combinations of traits that differ from those found in either parent. In eukaryot ...
between two different plant lines (as the result of a genetic cross) to identify general regions of interest, with the advantage of requiring few
genetic marker
A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can ...
s to ensure genome wide coverage and high statistical power per allele. Linkage analysis, however, has the disadvantages of low mapping resolution and low allele richness.
Association mapping, by contrast, takes advantage of historic recombination, and is performed by scanning a genome for
SNPs
In genetics and bioinformatics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in ...
in
linkage disequilibrium Linkage disequilibrium, often abbreviated to LD, is a term in population genetics referring to the association of genes, usually linked genes, in a population. It has become an important tool in medical genetics and other fields
In defining LD, it ...
with a trait of interest. Association mapping has advantages over linkage analysis in that it can map with high resolution and has high allelic richness, however, it also requires extensive knowledge of SNPs within the genome and is thus only now becoming possible in diverse species such as maize.
NAM takes advantage of both historic and recent recombination events in order to have the advantages of low marker density requirements, high allele richness, high mapping resolution, and high statistical power, with none of the disadvantages of either linkage analysis or association mapping.
In these regards, the NAM approach is similar in principle to the MAGIC lines and AMPRILs in Arabidopsis and the Collaborative Cross in mouse.
Creation of the maize NAM population
Twenty-five diverse corn lines were chosen as the parental lines for the NAM population in order to encompass the remarkable diversity of maize and preserve historic linkage disequilibrium. Each parental line was crossed to the B73 maize inbred (chosen as a reference line due to its use in th
public maize sequencing project
and wide deployment as one of the most successful commercial inbred lines) to create the F1 population. The F1 plants were then self-fertilized for six generations in order to create a total of 200 homozygous recombinant inbred lines (RILs) per family, for a total of 5000 RILs within the NAM population. The lines are publicly available through th
USDA-ARS Maize Stock Center
Each RIL was then genotyped with the same 1106 molecular markers (for this to be possible, the researchers selected markers for which B73 had a rare allele), in order to identify recombination blocks. After genotyping with the 1106 markers, each of the parental lines was either sequenced or high-density genotyped, and the results of that sequencing/genotyping overlaid on the recombination blocks identified for each RIL. The result was 5000 RILs that were either fully sequenced or high density genotyped that, due to genotyping with the common 1106 markers, could all be compared to each other and analyzed together (Figure 1).[
The second aspect of the NAM population characterization is the sequencing of the parental lines. This captures information on the natural variation that went into the population and a record of the extensive recombination captured in the history of maize variation. The first phase of this sequencing was by reduced representation sequencing using next generation sequencing technology, as report in Gore, Chia et al. in 2009. This initial sequencing discovered 1.6 million variable regions in maize, which is now facilitating analysis of a wide range of traits.
]
Process
As with traditional QTL mapping
A quantitative trait locus (QTL) is a locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers (such as SNPs or AFLPs) co ...
strategies, the general goal in Nested Association Mapping is to correlate a phenotype of interest with specific genotypes. One of the creators' stated goals for the NAM population was to be able to perform genome-wide association studies in maize by looking for associations between SNPs within the NAM population and quantitative traits of interest (e.g. flowering time, plant height, carotene content).[ As of 2009, however, the sequencing of the original parental lines was not yet completed to the degree necessary to perform these analyses. The NAM population has, however, been successfully used for linkage analysis. In the linkage study that has been released, the unique structure of the NAM population, described in the previous section, allowed for joint ]stepwise regression
In statistics, stepwise regression is a method of fitting regression models in which the choice of predictive variables is carried out by an automatic procedure. In each step, a variable is considered for addition to or subtraction from the set of ...
and joint inclusive composite interval mapping of the combined NAM families to identify QTLs for flowering time.
Current use
The first publication in which NAM was used to identify QTLs was authored by the Buckler lab on the genetic architecture of maize flowering time, and published in the summer of 2009.[ In this groundbreaking study, the authors scored days to silking, days to anthesis, and the silking-anthesis interval for nearly one million plants, then performed single and joint stepwise regression and inclusive composite interval mapping (ICIM) to identify 39 QTLs explaining 89% of the variance in days to silking and days to anthesis and 29 QTLs explaining 64% of the variance in the silking-anthesis interval.][
Ninety-eight percent of the flowering time QTLs identified in this paper were found to affect flowering time by less than one day (as compared to the B73 reference). These relatively small QTL effects, however, were also shown to sum for each family to equal large differences and changes in days to silking. Furthermore, it was observed that while most QTLs were shared between families, each family appears to have functionally distinct alleles for most QTLs. These observations led the authors to propose a model of "Common genes with uncommon variants"][ to explain flowering time diversity in maize. They tested their model by documenting an allelic series in the previously studied maize flowering time QTL Vgt1 (vegetation-to-transition1)] by controlling for genetic background and estimating the effects of vgt1 in each family. They then went on to identify specific sequence variants that corresponded to the allelic series, including one allele containing a miniature transposon
A transposable element (TE), also transposon, or jumping gene, is a type of mobile genetic element, a nucleic acid sequence in DNA that can change its position within a genome.
The discovery of mobile genetic elements earned Barbara McClinto ...
strongly associated with early flowering, and other alleles containing SNPs associated with later flowering.[
Maize NAMs have helped to map otherwise difficult traits conveying resistance to ]fungi
A fungus (: fungi , , , or ; or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and mold (fungus), molds, as well as the more familiar mushrooms. These organisms are classified as one ...
including Kump et al 2011, for southern leaf blight resistance, and Poland et al 2011, for northern leaf blight resistance.[{{cite journal , last1=Huang , first1=Xuehui , last2=Han , first2=Bin , title=Natural Variations and Genome-Wide Association Studies in Crop Plants , journal=]Annual Review of Plant Biology
''Annual Review of Plant Biology'' is a peer-reviewed scientific journal published by Annual Reviews. It was first published in 1950 as the ''Annual Review of Plant Physiology''. Sabeeha Merchant has been the editor since 2005, making her the lo ...
, publisher= Annual Reviews , volume=65 , issue=1 , date=2014-04-29 , issn=1543-5008 , doi=10.1146/annurev-arplant-050213-035715 , pages=531–551, pmid=24274033
Implications
Nested association mapping has tremendous potential for the investigation of agronomic traits in maize and other species. As the initial flowering time study demonstrates, NAM has the power to identify QTLs for agriculturally relevant traits and to relate those QTLs to homologs and candidate gene
The candidate gene approach to conducting genetic association studies focuses on associations between genetic variation within pre-specified genes of interest, and Phenotype (clinical medicine), phenotypes or disease states. This is in contrast to ...
s in non-maize species. Furthermore, the NAM lines become a powerful public resource for the maize community, and an opportunity for the sharing of maize germplasm as well as the results of maize studies via common databases (see external links), further facilitating future research into maize agricultural traits. Given that maize is one of the most important agricultural crops worldwide, such research has powerful implications for the genetic improvement of crops, and subsequently, worldwide food security
Food security is the state of having reliable access to a sufficient quantity of affordable, healthy Human food, food. The availability of food for people of any class, gender, ethnicity, or religion is another element of food protection. Simila ...
.[
Similar designs are also being created for ]wheat
Wheat is a group of wild and crop domestication, domesticated Poaceae, grasses of the genus ''Triticum'' (). They are Agriculture, cultivated for their cereal grains, which are staple foods around the world. Well-known Taxonomy of wheat, whe ...
, barley
Barley (), a member of the grass family, is a major cereal grain grown in temperate climates globally. It was one of the first cultivated grains; it was domesticated in the Fertile Crescent around 9000 BC, giving it nonshattering spikele ...
, sorghum
''Sorghum bicolor'', commonly called sorghum () and also known as great millet, broomcorn, guinea corn, durra, imphee, jowar, or milo, is a species in the Poaceae, grass genus ''Sorghum (genus), Sorghum'' cultivated for its grain. The grain i ...
, and ''Arabidopsis thaliana
''Arabidopsis thaliana'', the thale cress, mouse-ear cress or arabidopsis, is a small plant from the mustard family (Brassicaceae), native to Eurasia and Africa. Commonly found along the shoulders of roads and in disturbed land, it is generally ...
''.
See also
* Association mapping
* Family based QTL mapping
* Marker assisted selection
* Molecular breeding
* QTL mapping
A quantitative trait locus (QTL) is a locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers (such as SNPs or AFLPs) co ...
References
External links
Maize Databases:
Panzea.org – the official Nested Association Mapping database
MaizeGDB – community database for biological information about maize
Maize genome sequence browser
Gramene – assembled genomes for many plant genetics systems, including maize, rice, and sorghum
Statistical genetics