TAL (transcription activator-like) effectors (often referred to as TALEs, but not to be confused with the three amino acid loop extension homeobox class of proteins) are proteins secreted by some β- and γ-proteobacteria. Most of these are Xanthomonads.
Plant pathogenic ''Xanthomonas'' bacteria are especially known for TALEs, which they secrete via their type III secretion system. These proteins can bind promoter sequences in the host plant and activate the expression of plant genes that aid bacterial infection. The TALE domain responsible for binding DNA contains between 1.5 and 33.5 tandem repeats of a short sequence. Each of these repeats was found to be specific for a certain base pair of the DNA, and this specificity is set by the repeat variable diresidue (RVD) within the repeat.
TALEs recognize plant DNA sequences through a central repeat domain consisting of a variable number of ~34 amino acid repeats. There appears to be a one-to-one correspondence between the identity of two critical amino acids in each repeat and each DNA base in the target sequence. These proteins are interesting to researchers both for their role in disease of important crop species and for the relative ease of retargeting them to bind new DNA sequences. Similar proteins can be found in the pathogenic bacterium ''Ralstonia solanacearum'' and in ''Burkholderia rhizoxinica'', as well as in as yet unidentified marine microorganisms. The term TALE-likes is used to refer to the putative protein family encompassing the TALEs and these related proteins.
Function in plant pathogenesis
''Xanthomonas''
''Xanthomonas'' are Gram-negative bacteria that can infect a wide variety of plant species including pepper/capsicum, rice, citrus, cotton, tomato, and soybeans.
Some types of ''Xanthomonas'' cause localized leaf spot or leaf streak, while others spread systemically and cause black rot or leaf blight disease. They inject a number of effector proteins, including TAL effectors, into the plant via their type III secretion system. TAL effectors have several motifs normally associated with eukaryotes, including multiple nuclear localization signals and an acidic activation domain. When injected into plants, these proteins can enter the nucleus of the plant cell, bind plant promoter sequences, and activate transcription of plant genes that aid in bacterial infection.
Plants have developed a defense mechanism against type III effectors that includes R (resistance) genes triggered by these effectors. Some of these R genes appear to have evolved to contain TAL effector binding sites similar to the site in the intended target gene. This competition between pathogenic bacteria and the host plant has been hypothesized to account for the apparently malleable nature of the TAL effector DNA binding domain.
Non-''Xanthomonas''
TALE-like proteins have also been found in ''R. solanacearum'', ''B. rhizoxinica'', and the causal agent of banana blood disease (a bacterium not yet definitively identified, in the ''R. solanacearum'' species group).
DNA recognition
The most distinctive characteristic of TAL effectors is a central repeat domain containing between 1.5 and 33.5 repeats that are usually 34 residues in length (the C-terminal repeat is generally shorter and referred to as a “half repeat”).
The repeat sequence is largely conserved, but the residues at the 12th and 13th positions are hypervariable (these two amino acids are also known as the repeat variable diresidue or RVD). There is a simple relationship between the identity of these two residues in sequential repeats and sequential DNA bases in the TAL effector's target site.
The crystal structure of a TAL effector bound to DNA indicates that each repeat comprises two alpha helices and a short RVD-containing loop where the second residue of the RVD makes sequence-specific DNA contacts while the first residue of the RVD stabilizes the RVD-containing loop.
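Reading the RVDs out of a repeat domain amounts to taking residues 12 and 13 of each repeat. The following Python sketch illustrates this under stated assumptions: the repeat strings are illustrative examples rather than sequences taken from the text, the last one stands in for a shortened "half repeat", and the function name is hypothetical.

```python
def extract_rvds(repeats):
    """Return the two residues at positions 12 and 13 (1-based) of each repeat."""
    rvds = []
    for index, repeat in enumerate(repeats, start=1):
        if len(repeat) < 13:
            raise ValueError(f"repeat {index} is too short to contain an RVD")
        # Positions 12 and 13 (1-based) are indices 11 and 12 (0-based).
        rvds.append(repeat[11:13].upper())
    return rvds


# Illustrative repeats (assumed for this example, not taken from the text).
example_repeats = [
    "LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG",
    "LTPEQVVAIASNGGGKQALETVQRLLPVLCQAHG",
    "LTPEQVVAIASNIGGKQALETVQRLLPVLCQAHG",
    "LTPEQVVAIASNGGGRPALE",  # shortened C-terminal "half repeat"
]

if __name__ == "__main__":
    print(extract_rvds(example_repeats))
```

The half repeat is handled like any other repeat here because it is still long enough to include positions 12 and 13.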
Target sites of TAL effectors also tend to include a thymine immediately 5′ of the base targeted by the first repeat; this appears to be due to a contact between this T and a conserved tryptophan in the region N-terminal of the central repeat domain. However, this "zero" position does not always contain a thymine, as some scaffolds are more permissive.
The TAL-DNA code was broken by two separate groups in 2010. The first group, headed by Adam Bogdanove, broke this code computationally by searching for patterns in protein sequence alignments and DNA sequences of target promoters derived from a database of genes upregulated by TALEs. The second group (Boch) deduced the code through molecular analysis of the TAL effector AvrBs3 and its target DNA sequence in the promoter of a pepper gene activated by AvrBs3. The experimentally validated code between RVD sequence and target DNA base can be expressed as follows:
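In code form, such a lookup might be sketched as below. This Python example assumes the commonly reported associations NI→A, HD→C, NG→T and NN→G or A, which are not taken from the text above. The function names, the use of the IUPAC symbol R for "G or A", and the use of N for unlisted RVDs are illustrative choices.

```python
# Assumed RVD-to-base associations (commonly reported; an assumption here).
RVD_TO_BASES = {
    "NI": "A",
    "HD": "C",
    "NG": "T",
    "NN": "GA",  # reported to recognise G or A
}

# IUPAC symbols for the base sets used above.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "AG": "R"}


def predict_target(rvds, unknown="N"):
    """Translate a list of RVDs into a predicted target sequence."""
    bases = []
    for rvd in rvds:
        allowed = RVD_TO_BASES.get(rvd.upper())
        if allowed is None:
            bases.append(unknown)  # RVD not covered by the assumed table
        else:
            bases.append(IUPAC["".join(sorted(allowed))])
    return "".join(bases)


def predict_site_with_t0(rvds):
    """Prefix the prediction with the thymine often found at position zero."""
    return "T" + predict_target(rvds)


if __name__ == "__main__":
    print(predict_site_with_t0(["HD", "NG", "NI", "NN"]))
```

Since the "zero" position is not always a thymine, `predict_site_with_t0` should be read as a common default rather than a rule.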
Target genes
TAL effectors can induce susceptibility genes that are members of the NODULIN3 (N3) gene family. These genes are essential for the development of the disease. In rice two genes, Os-8N3 and Os-11N3, are induced by TAL effectors. Os-8N3 is induced by PthXo1 and Os-11N3 is induced by PthXo3 and AvrXa7.
Two hypotheses exist about possible functions for N3 proteins:
*They are involved in copper transport, resulting in detoxification of the environment for bacteria. The reduction in copper level facilitates bacterial growth.
*They are involved in glucose transport, facilitating glucose flow. This mechanism provides nutrients to bacteria and stimulates pathogen growth and virulence.
Engineering TAL effectors
This simple correspondence between amino acids in TAL effectors and DNA bases in their target sites makes them useful for protein engineering applications. Numerous groups have designed artificial TAL effectors capable of recognizing new DNA sequences in a variety of experimental systems. Such engineered TAL effectors have been used to create artificial transcription factors that can target and activate or repress endogenous genes in tomato, ''Arabidopsis thaliana'', and human cells.
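Designing an artificial TAL effector for a new target can be thought of as reading the code in reverse. The following Python sketch assumes one RVD per base (NI for A, HD for C, NN for G, NG for T), which is a common choice but not the only possible one, and it looks for candidate sites preceded by a thymine. All names are hypothetical, and the sketch ignores practical constraints such as repeat number limits.

```python
# Assumed one-RVD-per-base design table (an assumption for this sketch).
BASE_TO_RVD = {"A": "NI", "C": "HD", "G": "NN", "T": "NG"}


def design_rvds(site):
    """Return one RVD per base for the given target site."""
    site = site.upper()
    if not site or set(site) - set(BASE_TO_RVD):
        raise ValueError("site must be a non-empty string of A, C, G and T")
    return [BASE_TO_RVD[base] for base in site]


def find_sites(sequence, length):
    """Yield (start, site) for each window of `length` bases preceded by a T.

    `start` is the 0-based index of the first base bound by repeat 1.
    """
    sequence = sequence.upper()
    for start in range(1, len(sequence) - length + 1):
        if sequence[start - 1] == "T":
            yield start, sequence[start:start + length]


if __name__ == "__main__":
    for start, site in find_sites("ATGCATCC", 3):
        print(start, site, design_rvds(site))
```

Because NN is reported to bind both G and A, a design built this way may be less specific at G positions than at the others.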
Genetic constructs to encode TAL effector-based proteins can be made using either conventional gene synthesis or modular assembly.
A plasmid kit for assembling custom TALEN and other TAL effector constructs is available through the public, not-for-profit repository Addgene. Webpages providing access to public software, protocols, and other resources for TAL effector-DNA targeting applications include the TAL Effector-Nucleotide Targeter and taleffectors.com.
Applications
Engineered TAL effectors can also be fused to the cleavage domain of FokI to create TAL effector nucleases (TALENs) or to meganucleases (nucleases with longer recognition sites) to create "megaTALs." Such fusions share some properties with zinc finger nucleases and may be useful for genetic engineering and gene therapy applications.
TALEN-based approaches are used in the emerging fields of gene editing and genome engineering. TALEN fusions show activity in a yeast-based assay, at endogenous yeast genes, in a plant reporter assay, at an endogenous plant gene, at endogenous zebrafish genes, at an endogenous rat gene, and at endogenous human genes. The human HPRT1 gene has been targeted at detectable but unquantified levels. In addition, TALEN constructs containing the FokI cleavage domain fused to a smaller portion of the TAL effector still containing the DNA binding domain have been used to target the endogenous NTF3 and CCR5 genes in human cells with efficiencies of up to 25%. TAL effector nucleases have also been used to engineer human embryonic stem cells and induced pluripotent stem cells (iPSCs) and to knock out the endogenous ''ben-1'' gene in ''C. elegans''.
TALE-induced