Hybridization-based methods
Several applications have been developed that interrogate SNPs by hybridizing complementary DNA probes to the SNP site. The challenge of this approach is reducing cross-hybridization between the allele-specific probes. This challenge is generally overcome by manipulating the hybridization stringency conditions.Dynamic allele-specific hybridization
Dynamic allele-specific hybridization (DASH) genotyping takes advantage of the differences in the melting temperature in DNA that results from the instability of mismatched base pairs. The process can be vastly automated and encompasses a few simple principles. In the first step, a genomic segment is amplified and attached to a bead through a PCR reaction with a biotinylated primer. In the second step, the amplified product is attached to a streptavidin column and washed with NaOH to remove the unbiotinylated strand. AnMolecular beacons
SNP detection throughSNP microarrays
In high-density oligonucleotide SNP arrays, hundreds of thousands of probes are arrayed on a small chip, allowing for many SNPs to be interrogated simultaneously. Because SNP alleles only differ in one nucleotide and because it is difficult to achieve optimal hybridization conditions for all probes on the array, the target DNA has the potential to hybridize to mismatched probes. This is addressed somewhat by using several redundant probes to interrogate each SNP. Probes are designed to have the SNP site in several different locations as well as containing mismatches to the SNP allele. By comparing the differential amount of hybridization of the target DNA to each of these redundant probes, it is possible to determine specific homozygous and heterozygous alleles. Although oligonucleotide microarrays have a comparatively lower specificity and sensitivity, the scale of SNPs that can be interrogated is a major benefit. The Affymetrix Human SNP 5.0 GeneChip performs a genome-wide assay that can genotype over 500,000 human SNPs (Affymetrix 2007)..Enzyme-based methods
A broad range of enzymes includingRestriction fragment length polymorphism
Restriction fragment length polymorphism (RFLP) is considered to be the simplest and earliest method to detect SNPs. SNP-RFLP makes use of the many different restriction endonucleases and their high affinity to unique and specific restriction sites. By performing a digestion on a genomic sample and determining fragment lengths through a gel assay it is possible to ascertain whether or not the enzymes cut the expected restriction sites. A failure to cut the genomic sample results in an identifiably larger than expected fragment implying that there is a mutation at the point of the restriction site which is rendering it protection from nuclease activity. Unfortunately, the combined factors of the high complexity of most eukaryotic genomes, the requirement for specific endonucleases, the fact that the exact mutation cannot necessarily be resolved in a single experiment, and the slow nature of gel assays make RFLP a poor choice for high throughput analysis.PCR-based methods
Tetra-primer amplification refractory mutation system PCR, or ARMS-PCR, employs two pairs of primers to amplify two alleles in one PCR reaction. The primers are designed such that the two primer pairs overlap at a SNP location but each match perfectly to only one of the possible SNPs. The basis of the invention is that unexpectedly, oligonucleotides with a mismatched 3'-residue will not function as primers in the PCR under appropriate conditions. As a result, if a given allele is present in the PCR reaction, the primer pair specific to that allele will produce product but not to the alternative allele with a different SNP. The two primer pairs are also designed such that their PCR products are of a significantly different length allowing for easily distinguishable bands by gel electrophoresis or melt temperature analysis. In examining the results, if a genomic sample is homozygous, then the PCR products that result will be from the primer that matches the SNP location and the outer opposite-strand primer, as well from the two outer primers. If the genomic sample is heterozygous, then products will result from the primer of each allele and their respective outer primer counterparts as well as the outer primers. An alternative strategy is to run multiple qPCR reactions with different primer sets that target each allele separately. Well-designed primers will amplify their target SNP at a much earlier cycle than the other SNPs. This allows more than two alleles to be distinguished, although an individual qPCR reaction is required for each SNP. To achieve high enough specificity, the primer sequence may require placement of an artificial mismatch near its 3'-end, which is an approach generally known as Taq-MAMA.Flap endonuclease
Flap endonuclease (FEN) is an endonuclease that catalyzes structure-specific cleavage. This cleavage is highly sensitive to mismatches and can be used to interrogate SNPs with a high degree of specificity In the basic Invader assay, a FEN called cleavase is combined with two specific oligonucleotide probes, that together with the target DNA, can form a tripartite structure recognized by cleavase. The first probe, called the Invader oligonucleotide is complementary to the 3’ end of the target DNA. The last base of the Invader oligonucleotide is a non-matching base that overlaps the SNP nucleotide in the target DNA. The second probe is an allele-specific probe which is complementary to the 5’ end of the target DNA, but also extends past the 3’ side of the SNP nucleotide. The allele-specific probe will contain a base complementary to the SNP nucleotide. If the target DNA contains the desired allele, the Invader and allele-specific probes will bind to the target DNA forming the tripartite structure. This structure is recognized by cleavase, which will cleave and release the 3’ end of the allele-specific probe. If the SNP nucleotide in the target DNA is not complementary to the allele-specific probe, the correct tripartite structure is not formed and no cleavage occurs. The Invader assay is usually coupled with fluorescence resonance energy transfer (FRET) system to detect the cleavage event. In this setup, a quencher molecule is attached to the 3’ end and a fluorophore is attached to the 5’ end of the allele-specific probe. If cleavage occurs, the fluorophore will be separated from the quencher molecule generating a detectable signal. Only minimal cleavage occurs with mismatched probes making the Invader assay highly specific. However, in its original format, only one SNP allele could be interrogated per reaction sample and it required a large amount of target DNA to generate a detectable signal in a reasonable time frame. Several developments have extended the original Invader assay. By carrying out secondary FEN cleavage reactions, the Serial Invasive Signal Amplification Reaction (SISAR) allows both SNP alleles to be interrogated in a single reaction. SISAR Invader assay also requires less target DNA, improving the sensitivity of the original Invader assay. The assay has also been adapted in several ways for use in a high-throughput format. In one platform, the allele-specific probes are anchored to microspheres. When cleavage by FEN generates a detectable fluorescent signal, the signal is measured using flow-cytometry. The sensitivity of flow-cytometry, eliminates the need for PCR amplification of the target DNA (Rao et al. 2003). These high-throughput platforms have not progressed beyond the proof-of-principle stage and so far the Invader system has not been used in any large scale SNP genotyping projects.Primer extension
Primer extension is a two step process that first involves the hybridization of a probe to the bases immediately upstream of the SNP nucleotide followed by a ‘mini-sequencing’ reaction, in which DNA polymerase extends the hybridized primer by adding a base that is complementary to the SNP nucleotide. This incorporated base is detected and determines the SNP allele (Goelet et al. 1999; Syvanen 2001). Because primer extension is based on the highly accurate DNA polymerase enzyme, the method is generally very reliable. Primer extension is able to genotype most SNPs under very similar reaction conditions making it also highly flexible. The primer extension method is used in a number of assay formats. These formats use a wide range of detection techniques that include MALDI-TOF Mass spectrometry (see Sequenom) and5’- nuclease
Taq DNA polymerase's 5’-nuclease activity is used in theOligonucleotide Ligation Assay
DNA ligase catalyzes the ligation of the 3' end of a DNA fragment to the 5' end of a directly adjacent DNA fragment. This mechanism can be used to interrogate a SNP by hybridizing two probes directly over the SNP polymorphic site, whereby ligation can occur if the probes are identical to the target DNA. In the oligonucleotide ligase assay, two probes are designed; an allele-specific probe which hybridizes to the target DNA so that its 3' base is situated directly over the SNP nucleotide and a second probe that hybridizes the template upstream (downstream in the complementary strand) of the SNP polymorphic site providing a 5' end for the ligation reaction. If the allele-specific probe matches the target DNA, it will fully hybridize to the target DNA and ligation can occur. Ligation does not generally occur in the presence of a mismatched 3' base. Ligated or unligated products can be detected by gel electrophoresis, MALDI-TOF mass spectrometry or by capillary electrophoresis for large-scale applications. With appropriate sequences and tags on the oligonucleotides, high-throughput sequence data can be generated from the ligated products and genotypes determined (Curry et al., 2012). The use of large numbers of sample indexes allows high-throughput sequence data on hundreds of SNPs in thousands of samples to be generated in a small portion of a high-throughput sequencing run. This is a massive genotyping by sequencing technology (MGST).Other post-amplification methods based on physical properties of DNA
The characteristic DNA properties of melting temperature and single stranded conformation have been used in several applications to distinguish SNP alleles. These methods very often achieve high specificity but require highly optimized conditions to obtain the best possible results.Single strand conformation polymorphism
Single-stranded DNA (ssDNA) folds into a tertiary structure. The conformation is sequence dependent and most single base pair mutations will alter the shape of the structure. When applied to a gel, the tertiary shape will determine the mobility of the ssDNA, providing a mechanism to differentiate between SNP alleles. This method first involves PCR amplification of the target DNA. The double-stranded PCR products are denatured using heat and formaldehyde to produce ssDNA. The ssDNA is applied to a non-denaturing electrophoresis gel and allowed to fold into a tertiary structure. Differences in DNA sequence will alter the tertiary conformation and be detected as a difference in the ssDNA strand mobility (Costabile et al. 2006). This method is widely used because it is technically simple, relatively inexpensive and uses commonly available equipment. However compared to other SNP genotyping methods, the sensitivity of this assay is lower. It has been found that the ssDNA conformation is highly dependent on temperature and it is not generally apparent what the ideal temperature is. Very often the assay will be carried out using several different temperatures. There is also a restriction on the length of fragment because the sensitivity drops when sequences longer than 400 bp are used (Costabile et al. 2006).Temperature gradient gel electrophoresis
The temperature gradient gel electrophoresis (TGGE) or temperature gradient capillary electrophoresis (TGCE) method is based on the principle that partially denatured DNA is more restricted and travels slower in a porous material such as a gel. This property allows for the separation of DNA by melting temperature. To adapt these methods for SNP detection, two fragments are used; the target DNA which contain the SNP polymorphic site being interrogated and an allele-specific DNA sequence, referred to as the normal DNA fragment. The normal fragment is identical to the target DNA except potentially at the SNP polymorphic site, which is unknown in the target DNA. The fragments are denatured and then reannealed. If the target DNA has the same allele as the normal fragment, homoduplexes will form that will have the same melting temperature. When run on the gel with a temperature gradient, only one band will appear. If the target DNA has a distinct allele, four products will form following the reannealing step; homoduplexes consisting of target DNA, homoduplexes consisting of normal DNA and two heterduplexes of each strand of target DNA hybridized with the normal DNA strand. These four products will have distinct melting temperatures and will appear as four bands in the denaturing gel.Denaturing high performance liquid chromatography
Denaturing high performance liquid chromatography (DHPLC) uses reversed-phase HPLC to interrogate SNPs. The key to DHPLC is the solid phase which has differential affinity for single and double-stranded DNA. In DHPLC, DNA fragments are denatured by heating and then allowed to reanneal. The melting temperature of the reannealed DNA fragments determines the length of time they are retained in the column. Using PCR, two fragments are generated; target DNA containing the SNP polymorphic site and an allele-specific DNA sequence, referred to as the normal DNA fragment. This normal fragment is identical to the target DNA except potentially at the SNP polymorphic site, which is unknown in the target DNA. The fragments are denatured and then allowed to gradually reanneal. The reannaled products are added to the DHPLC column. If the SNP allele in the target DNA matches the normal DNA fragment, only identical homoduplexes will form during the reannealing step. If the target DNA contains a different SNP allele than the normal DNA fragment, heteroduplexes of the target DNA and normal DNA containing a mismatched polymorphic site will form in addition to homoduplexes. The mismatched heteroduplexes will have a different melting temperature than the homoduplexes and will not be retained in the column as long. This generates a chromatograph pattern that is distinctive from the pattern that would be generated if the target DNA fragment and normal DNA fragments were identical. The eluted DNA is detected by UV absorption. DHPLC is easily automated as no labeling or purification of the DNA fragments is needed. The method is also relatively fast and has a high specificity. One major drawback of DHPLC is that the column temperature must be optimized for each target in order to achieve the right degree of denaturation.High-resolution melting of the entire amplicon
High Resolution Melting analysis is the simplest PCR-based method to understand. Basically, the same thermodynamic properties that allowed for the gel techniques to work apply here, and in real-time. A fluorimeter monitors the post-PCR denaturation of the entire dsDNA amplicon. You make primers specific to the site you want to amplify. You "paint" the amplicon with a double-strand specific dye, included in the PCR mix. The ds-specific dye integrates itself into the PCR product. In essence, the entire amplicon becomes a probe. This opens up new possibilities for discovery. Either you position the primers very close to either side of the SNP in question (small amplicon genotyping, Liew, 2004) or amplify a larger region (100-400bp in length) for scanning purposes. For simple genotyping of an SNP, it is easier to just make the amplicon small to minimize the chances you mistake one SNP for another. The melting temperature (Tm) of the entire amplicon is determined and most homozygotes are sufficiently different (in the better instruments) in Tm to genotype. Heterozygotes are even easier to differentiate because they have heteroduplexes generated (refer to the gel-based explanations) which broadens the melt transition and usually gives two discernible peaks. Amplicon melting using a fluorescently-labeled primer has been described (Gundry et al., 2003) but is less practical than using ds-specific dyes due to the cost of the fluorogenic primer. Scanning of larger amplicons is based on the same principles as outlined above. However, melting temperature and the overall shape of the melting curve become informative. For amplicons >c.150bp there are often >2 melting peaks, each of which can vary, depending on the DNA template composition. Numerous investigators have been able to successfully eliminate the majority of their sequencing through melt-based scanning, allowing accurate locus-based genotyping of large numbers of individuals. Many investigators have found scanning for mutations using high resolution melting as a viable and practical way to study entire genes.Use of DNA mismatch-binding proteins
DNA mismatch-binding proteins can distinguish single nucleotide mismatches and thus facilitate differential analysis of SNPs. For example, MutS protein from ''Thermus aquaticus'' binds different single nucleotide mismatches with different affinities and can be used in capillary electrophoresis to differentiate all six sets of mismatches (Drabovich & Krylov 2006).SNPlex
SNPlex is a proprietary genotyping platform sold bySurveyor nuclease assay
Surveyor nuclease is a mismatch endonuclease enzyme that recognizes all base substitutions and small insertions/deletions (indels), and cleaves the 3′ side of mismatched sites in both DNA strands.Sequencing
References
Further reading
* * * * * * * * {{refendExternal links