Transposon Insertion Sequencing
   HOME

TheInfoList



OR:

Transposon insertion sequencing (Tn-seq) combines transposon insertional mutagenesis with
massively parallel sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
(MPS) of the transposon insertion sites to identify genes contributing to a function of interest in
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
. The method was originally established by concurrent work in four laboratories under the acronyms HITS, INSeq, TraDIS, and Tn-Seq.van Opijnen T, Bodi KL, Camilli A. Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat Methods. 2009;6:767–72. doi: 10.1038/nmeth.1377. Numerous variations have been subsequently developed and applied to diverse biological systems. Collectively, the methods are often termed Tn-Seq as they all involve monitoring the fitness of transposon insertion mutants via DNA sequencing approaches. Transposons are highly regulated, discrete DNA segments that can relocate within the genome. They are universal and are found in
Eubacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of prokaryotic microorganisms. Typically a few micrometr ...
,
Archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaebac ...
, and
Eukarya Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
, including humans. Transposons have a large influence on
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. The ...
and can be used to determine gene function. In fact, when a transposon inserts itself in a gene, the gene's function will be disrupted. Because of that property, transposons have been manipulated for use in insertional mutagenesis. The development of microbial genome sequencing was a major advance for the use of transposon mutagenesis. The function affected by a transposon insertion could be linked to the disrupted gene by sequencing the genome to locate the transposon insertion site. Massively parallel sequencing allows simultaneous sequencing of transposon insertion sites in large mixtures of different mutants. Therefore, genome-wide analysis is feasible if transposons are positioned throughout the genome in a mutant collection. Transposon sequencing requires the creation of a transposon insertion library, which will contain a group of mutants that collectively have transposon insertions in all non-essential genes. The library is grown under an experimental condition of interest. Mutants with transposons inserted in genes required for growth under the test condition will diminish in frequency from the population. To identify mutants being lost, genomic sequences adjacent to the transposon ends are amplified by
PCR PCR or pcr may refer to: Science * Phosphocreatine, a phosphorylated creatine molecule * Principal component regression, a statistical technique Medicine * Polymerase chain reaction ** COVID-19 testing, often performed using the polymerase chain r ...
and sequenced by MPS to determine the location and abundance of each insertion mutation. The importance of each gene for growth under the test condition is determined by comparing the abundance of each mutant before and after growth under the condition being examined. Tn-seq is useful for both the study of a single gene's fitness as well as gene interactions Signature–tagged mutagenesis (STM) is an older technique that also involves pooling transposon insertion mutants to determine the importance of the disrupted genes under selective growth conditions. High-throughput versions of STM use genomic microarrays, which are less accurate and have a lower dynamic range than massively-parallel sequencing. With the invention of next generation sequencing, genomic data became increasingly available. However, despite the increase in genomic data, our knowledge of gene function remains the limiting factor in our understanding of the role genes play. Therefore, a need for a high throughput approach to study genotype–phenotype relationships like Tn-seq was necessary.


Methodology

Transposon sequencing begins by transducing bacterial populations with transposable elements using
bacteriophages A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacterio ...
. Tn-seq uses the Himar I Mariner transposon, a common and stable transposon. After transduction, the DNA is cleaved and the inserted sequence amplified through
PCR PCR or pcr may refer to: Science * Phosphocreatine, a phosphorylated creatine molecule * Principal component regression, a statistical technique Medicine * Polymerase chain reaction ** COVID-19 testing, often performed using the polymerase chain r ...
. The recognition sites for MmeI, a type IIS restriction endonuclease, can be introduced by a single nucleotide change in the terminal repeats of Mariner. It is located 4 base pairs before the end of the terminal repeat. MmeI makes a 2 base pair staggered cut 20 bases downstream of the recognition site. When MmeI digests DNA from a library of transposon insertion mutants, fragmented DNA including the left and right transposon and 16 base pair of surrounding genomic DNA is produced. The 16 base pair fragment is enough to determine the location of the transposon insertion in the bacterial genome. The ligation of the adaptor is facilitated by the 2 base overhang. A primer specific to the adaptor and a primer specific to the transposon are used to amplify the sequence via PCR. The 120 base pair product is then isolated using agarose gel or PAGE purification.
Massively parallel sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
is then used to determine the sequences of the flanking 16 base pairs. Gene function is inferred after looking at the effects of the insertion on gene function under certain conditions.


Advantages and disadvantages

Unlike high-throughput insertion track by deep sequencing (HITS) and transposon-directed insertion site sequencing (TraDIS), Tn-seq is specific to the Himar I Mariner transposon, and cannot be applied to other transposons or insertional elements. However, the protocol for Tn-seq is less time intensive. HITS and TraDIS use a DNA shearing technique that produce a range of
PCR PCR or pcr may refer to: Science * Phosphocreatine, a phosphorylated creatine molecule * Principal component regression, a statistical technique Medicine * Polymerase chain reaction ** COVID-19 testing, often performed using the polymerase chain r ...
product sizes that could cause shorter DNA templates being preferentially amplified over longer templates. Tn-seq produces a product that is uniform in size, therefore reducing the possibility of PCR bias. Tn-seq can be used to identify both the fitness of single genes and to map gene interactions in microorganisms. Existing methods for these types of study are dependent on preexisting genomic microarrays or gene knockout arrays, whereas Tn-seq is not. Tn-seq's utilization of
massively parallel sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
makes this technique easily reproducible, sensitive, and robust.


Applications

Tn-seq has proven to be a useful technique for identifying new gene functions. The highly sensitive nature of Tn-seq can be used to determine phenotype-genotype relationships that may have been deemed insignificant by less sensitive methods. Tn-seq identified essential genes and pathways that are important for the utilization of cholesterol in ''
Mycobacterium tuberculosis ''Mycobacterium tuberculosis'' (M. tb) is a species of pathogenic bacteria in the family Mycobacteriaceae and the causative agent of tuberculosis. First discovered in 1882 by Robert Koch, ''M. tuberculosis'' has an unusual, waxy coating on its c ...
''. Tn-seq has been used to study higher order genome organization using gene interactions. Genes function as a highly linked network. Therefore, in order to study a gene's impact on
phenotype In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology or physical form and structure, its developmental processes, its biochemical and physiological proper ...
, gene interactions must also be considered. These gene networks can be studied by screening for synthetic lethality and gene interactions where a double mutant shows an unexpected fitness value compared to each individual mutant. Tn-seq was used to determine genetic interactions between five query genes and the rest of the genome in Streptococcus pneumoniae, which revealed both aggravating and alleviating genetic interactions. Tn-seq used in combination with
RNA-seq RNA-Seq (named as an abbreviation of RNA sequencing) is a sequencing technique which uses next-generation sequencing (NGS) to reveal the presence and quantity of RNA in a biological sample at a given moment, analyzing the continuously changing c ...
can be utilized to examine the role of non-coding DNA regions.


References

{{reflist Mobile genetic elements DNA sequencing