HOME

TheInfoList



OR:

TAR DNA-binding protein 43 (TDP-43, transactive response DNA binding protein 43 
kDa The dalton or unified atomic mass unit (symbols: Da or u) is a non-SI unit of mass widely used in physics and chemistry. It is defined as of the mass of an unbound neutral atom of carbon-12 in its nuclear and electronic ground state and at re ...
) is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
that in humans is encoded by the ''TARDBP''
gene In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
.


Structure

TDP-43 is 414
amino acid residues Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer may ...
long. It consists of 4 domains: an N-terminal domain spanning residues 1-76 (NTD) with a well-defined fold that has been shown to form a
dimer Dimer may refer to: * Dimer (chemistry), a chemical structure formed from two similar sub-units ** Protein dimer, a protein quaternary structure ** d-dimer * Dimer model, an item in statistical mechanics, based on ''domino tiling'' * Julius Dimer ...
or oligomer; 2 highly conserved folded
RNA recognition motif RNA recognition motif, RNP-1 is a putative RNA-binding domain of about 90 amino acids that are known to bind single-stranded RNAs. It was found in many eukaryotic proteins. The largest group of single strand RNA-binding protein is the eukaryot ...
s spanning residues 106-176 (RRM1) and 191-259 (RRM2), respectively, required to bind target
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
and DNA; an unstructured C-terminal domain encompassing residues 274-414 (CTD), which contains a
glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinog ...
-rich region, is involved in protein-protein interactions, and harbors most of the
mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral repl ...
s associated with familial amyotrophic lateral sclerosis. The entire protein devoid of large solubilising tags has been purified. The full-length protein is a dimer. The dimer is formed due to a self-interaction between two NTD domains, where the dimerisation can be propagated to form higher-order oligomers. The protein sequence also has a
nuclear localization signal A nuclear localization signal ''or'' sequence (NLS) is an amino acid sequence that 'tags' a protein for import into the cell nucleus by nuclear transport. Typically, this signal consists of one or more short sequences of positively charged lysines o ...
(NLS, residues 82–98), a former
nuclear export signal A nuclear export signal (NES) is a short target peptide containing 4 hydrophobic residues in a protein that targets it for export from the cell nucleus to the cytoplasm through the nuclear pore complex using nuclear transport. It has the opposite ...
(NES residues 239–250) and 3 putative caspase-3 cleavage sites (residues 13, 89, 219). In December 2021 the structure of TDP-43 was resolved with
cryo-EM Cryogenic electron microscopy (cryo-EM) is a cryomicroscopy technique applied on samples cooled to cryogenic temperatures. For biological specimens, the structure is preserved by embedding in an environment of vitreous ice. An aqueous sample so ...
but shortly after it was argued that in the context of FTLD-TDP the protein involved could be
TMEM106B Transmembrane protein 106B is a protein that is encoded by the ''TMEM106B'' gene. It is found primarily within neurons and oligodendrocytes in the central nervous system with its subcellular location being in lysosomal membranes. TMEM106B helps fa ...
(which has been also resolved with cryo-EM), rather than of TDP-43.


N-Terminal domain (NTD)

The NTD located between residues 1 and 76 is involved in TDP-43
polymerization In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many fo ...
. Indeed, dimers are formed by head-to-head interactions between NTDs, and the polymer thus obtained allows for
pre-mRNA A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
splicing. However, further oligomerization brings to more toxic accumulates. This process of polymerization into dimers, larger forms or just stabilizing monomers is dependent on TDP-43 conformational equilibrium between monomers, homodimers and oligomers. Hence, in TDP-43 diseased cells, TDP-43's over-expression leads to the NTD showing high propensity to aggregate. Contrary to this, in normal cells, normal levels of TDP-43 allow for folded NTD, preventing aggregates and polymers formation. More recently, this domain was found to have a
ubiquitin Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Fo ...
-like structure. It bears 27,6% of homology with Ubiquitin-1 and a β1-β2- α1-β3-β4-β5-β6 + 2* SO42- form. Ubiquitin-like domain are usually associated with a greater affinity for
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
/ DNA. However, in the unique case of TDP-43, the Ubiquitin-like NTD binds directly to ssDNA. This interaction permits the conformational equilibrium cited higher to shift towards non-aggregated forms. The domain spanning from ,80has a solenoid-like structure which sterically impedes interactions between aggregation prone C-term regions. All of this raises the possibility that NTD and the RNA Recognition Motifs (later on defined) could cooperatively interact with nucleic acids to accomplish TDP-43's physiological functions.


Mitochondrial localization signal

There are six mitochondrial localization signals to be accounted on TDP-43's
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
sequence, although only M1, M3, and M5 were shown to be essential for mitochondial localization. Indeed, their ablation leads to a lessened mitochondrial localization. These localizing sequences are found on the following amino acids: M1: 5, 41 M2: 05, 112 M3: 46-150 M4: 28, 235 M5: 94, 300 M6: 28, 236


Nuclear localization signal (NLS)

The
nuclear Nuclear may refer to: Physics Relating to the nucleus of the atom: * Nuclear engineering *Nuclear physics *Nuclear power *Nuclear reactor *Nuclear weapon *Nuclear medicine *Radiation therapy *Nuclear warfare Mathematics *Nuclear space *Nuclear ...
localization signal (NLS) domain is located between residues 82 and 98 is of critical importance in ALS, and such is witnessed by the depletion or the mutations (notably A90V) of this domain, which cause loss-of-function from nucleus and promote aggregating, two processes very likely to conduct to TDP-43's toxic gain of function. It is thereby of the utmost importance to note that TDP-43's nuclear localization is absolutely critical for it to fulfill its physiological functions.


RNA recognition motif

The
RNA recognition motif RNA recognition motif, RNP-1 is a putative RNA-binding domain of about 90 amino acids that are known to bind single-stranded RNAs. It was found in many eukaryotic proteins. The largest group of single strand RNA-binding protein is the eukaryot ...
ranges between residues 105 and 181, much like many hnRNPs, TDP-43's RRMs encompass highly conserved motifs of primary importance for fulfilling their function. Both RRMs follow this pattern: β1-α1-β2-β3-α2-β4-β5, which allows them to bind to both
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
and DNA onto U G/ T G-repeats of
3'UTR In molecular genetics, the three prime untranslated region (3′-UTR) is the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3′-UTR often contains regulatory regions that post-transcriptionally ...
(Untranslated Terminal Regions) end of
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein. mRNA is created during the ...
/DNA. These sequences mainly ensure mRNA processing, RNA export and RNA stabilizing. It is notably thanks to these sequences that TDP-43 importantly binds to its own mRNA regulates its very own
solubility In chemistry, solubility is the ability of a substance, the solute, to form a solution with another substance, the solvent. Insolubility is the opposite property, the inability of the solute to form such a solution. The extent of the solub ...
and
polymerization In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many fo ...
.


RRM2

RRM2 spans between residues 181 and 261. In pathological conditions, it notably binds to p65/NF-kB, an apoptosis implicated factor, and is thus a potential therapeutic target. Moreover it can be burdened with a mutation, D169G, altering a key cleaving site for regulating formation of toxic inclusions.


Nuclear export signal (NES)

The nuclear export signal is located between residues 239 and 251 sequence probably bears a role in TDP-43's shuttling function, and was recently found using a prediction algorithm.


Disordered glycin rich C-terminal domain (CTD)

The Disordered
Glycin Glycin, or N-(4-hydroxyphenyl)glycine, is N-substituted p-aminophenol. It is a photographic developing agent used in classic black-and-white developer solutions. It is unrelated to the amino acid glycine. It is typically characterized as thin pl ...
Rich C-terminal domain is located between residues 277 and 414. Much like 70 other RNA binding proteins, TDP-43 bears a Q/ N rich domain 44, 366which resembles
yeast Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are estimated to constit ...
prion sequence. This sequence is called a Prion-Like Domain (PLD). PLDs are low complexity sequences that have been reported to mediate gene regulation via Liquid-Liquid Phase Transition (LLP) thus driving RNP granule assembly. Forming these microscopically visible RNP granules is thought to induce more effective gene regulatory process. It is here noted that LLP are reversible phenomenons of de-mixing a solution into two distinct liquid phases, hereby forming granules. Mutations within the TDP-43 proteins Glycine Rich Region (GRR) have recently been identified as associates that can contribute to various neurodegenerative diseases, with the most notable and common NDD being ALS, about 10% of the mutations causing familial ALS are accredited with the TDP-43 protein This CTD is often reported to play important role in pathogenic behavior of TDP-43: RNPs granules could have a role in stress response, and thus, aging, or persistence stress could lead the LLPs to turn into irreversible Liquid Solid Phase separation, pathological aggregates notably found in ALS neurons. CTD's disorganized structure can turn into a full fledged
Amyloid Amyloids are aggregates of proteins characterised by a fibrillar morphology of 7–13 nm in diameter, a beta sheet (β-sheet) secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the huma ...
-like
Beta-sheet The beta sheet, (β-sheet) (also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a ge ...
rich structure causing it to adopt prion-like properties. Moreover, CTFs are a common maker in diseased
neuron A neuron, neurone, or nerve cell is an electrically excitable cell that communicates with other cells via specialized connections called synapses. The neuron is the main component of nervous tissue in all animals except sponges and placozoa. ...
s and are argued to be of high toxicity. However, notice is to be taken that some points are not always consensual. Indeed, due to its
hydrophobic In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water. Hydrophobic molecules tend to be nonpolar and, t ...
structure, TDP-43 can be hard to analyze, and parts of it remain somewhat vague. Precise sites of phosphorylation, methylation, or even binding are still a bit elusive.


Function

TDP-43 is a transcriptional repressor that binds to chromosomally integrated TAR DNA and represses
HIV-1 The subtypes of HIV include two major types, HIV type 1 (HIV-1) and HIV type 2 (HIV-2). HIV-1 is related to viruses found in chimpanzees and gorillas living in western Africa, while HIV-2 viruses are related to viruses found in the sooty mangabey ...
transcription. In addition, this protein regulates alternate splicing of the
CFTR Cystic fibrosis transmembrane conductance regulator (CFTR) is a membrane protein and anion channel in vertebrates that is encoded by the ''CFTR'' gene. Geneticist Lap-Chee Tsui and his team identified the CFTR gene in 1989 as the gene linked wi ...
gene. In particular, TDP-43 is a splicing factor binding to the intron8/exon9 junction of the CFTR gene and to the intron2/exon3 region of the apoA-II gene. A similar pseudogene is present on chromosome 20. TDP-43 has been shown to bind both DNA and RNA and have multiple functions in transcriptional repression, pre-mRNA splicing and translational regulation. Recent work has characterized the transcriptome-wide binding sites revealing that thousands of RNAs are bound by TDP-43 in neurons. TDP-43 was originally identified as a transcriptional repressor that binds to chromosomally integrated
trans-activation response element The HIV trans-activation response (TAR) element is an RNA element which is known to be required for the trans-activation of the viral promoter and for virus replication. The TAR hairpin is a dynamic structure that acts as a binding site for the ...
(TAR) DNA and represses
HIV-1 The subtypes of HIV include two major types, HIV type 1 (HIV-1) and HIV type 2 (HIV-2). HIV-1 is related to viruses found in chimpanzees and gorillas living in western Africa, while HIV-2 viruses are related to viruses found in the sooty mangabey ...
transcription. It was also reported to regulate alternate splicing of the
CFTR Cystic fibrosis transmembrane conductance regulator (CFTR) is a membrane protein and anion channel in vertebrates that is encoded by the ''CFTR'' gene. Geneticist Lap-Chee Tsui and his team identified the CFTR gene in 1989 as the gene linked wi ...
gene and the apoA-II gene. In spinal motor neurons TDP-43 has also been shown in humans to be a low molecular weight neurofilament (hNFL) mRNA-binding protein. It has also shown to be a neuronal activity response factor in the dendrites of hippocampal neurons suggesting possible roles in regulating mRNA stability, transport and local translation in neurons. It has been demonstrated that zinc ions are able to induce aggregation of endogenous TDP-43 in cells. Moreover, zinc could bind to RNA binding domain of TDP-43 and induce the formation of amyloid-like aggregates ''in vitro.''


DNA repair

TDP-43 protein is a key element of the
non-homologous end joining Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. NHEJ is referred to as "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology direc ...
(NHEJ) enzymatic pathway that repairs DNA
double-strand breaks DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA dama ...
(DSBs) in pluripotent stem cell-derived motor neurons. TDP-43 is rapidly recruited to DSBs where it acts as a scaffold for the further recruitment of the
XRCC4 DNA repair protein XRCC4 also known as X-ray repair cross-complementing protein 4 or XRCC4 is a protein that in humans is encoded by the XRCC4 gene. In addition to humans, the XRCC4 protein is also expressed in many other metazoans, fungi and in ...
-
DNA ligase DNA ligase is a specific type of enzyme, a ligase, () that facilitates the joining of DNA strands together by catalyzing the formation of a phosphodiester bond. It plays a role in repairing single-strand breaks in duplex DNA in living orga ...
protein complex that then acts to seal the DNA breaks. In TDP-43 depleted human neural stem cell-derived motor neurons, as well as in sporadic ALS patients' spinal cord specimens there is significant DSB accumulation and reduced levels of NHEJ.


Clinical significance

A hyper-
phosphorylated In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, wh ...
,
ubiquitin Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Fo ...
ated and cleaved form of TDP-43—known as pathologic TDP43—is the major disease protein in
ubiquitin Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Fo ...
-positive, tau-, and
alpha-synuclein Alpha-synuclein is a protein that, in humans, is encoded by the ''SNCA'' gene. Alpha-synuclein is a neuronal protein that regulates synaptic vesicle trafficking and subsequent neurotransmitter release. It is abundant in the brain, while smaller a ...
-negative frontotemporal dementia (FTLD-TDP, previously referred to as FTLD-U) and in amyotrophic lateral sclerosis (ALS). Elevated levels of the TDP-43 protein have also been identified in individuals diagnosed with
chronic traumatic encephalopathy Chronic traumatic encephalopathy (CTE) is a neurodegenerative disease linked to repeated trauma to the head. The encephalopathy symptoms can include behavioral problems, mood problems, and problems with thinking. The disease often gets worse ...
, and has also been associated with ALS leading to the inference that athletes who have experienced multiple concussions and other types of
head injury A head injury is any injury that results in trauma to the skull or brain. The terms ''traumatic brain injury'' and ''head injury'' are often used interchangeably in the medical literature. Because head injuries cover such a broad scope of inju ...
are at an increased risk for both encephalopathy and motor neuron disease (ALS). Abnormalities of TDP-43 also occur in an important subset of Alzheimer's disease patients, correlating with clinical and neuropathologic features indexes. Misfolded TDP-43 is found in the brains of
older adults Old age refers to ages nearing or surpassing the life expectancy of human beings, and is thus the end of the human life cycle. Terms and euphemisms for people at this age include old people, the elderly (worldwide usage), OAPs (British usage ...
over age 85 with
limbic-predominant age-related TDP-43 encephalopathy LATE is a term that describes a prevalent condition with impaired memory and thinking in advanced age, often culminating in the dementia clinical syndrome. In other words, the symptoms of LATE are similar to those of Alzheimer's disease.   The ...
, (LATE), a form of dementia. New monoclonal antibodies, 2G11 and 2H1, have been developed to specify different TDP-43 inclusion types that occur across neurodegenerative diseases, without relying on hyper-phosphorylated epitopes. These antibodies were raised against an epitope within the RRM2 domain (amino acid residues 198–216).
HIV The human immunodeficiency viruses (HIV) are two species of ''Lentivirus'' (a subgroup of retrovirus) that infect humans. Over time, they cause acquired immunodeficiency syndrome (AIDS), a condition in which progressive failure of the immune ...
-1, the causative agent of
acquired immunodeficiency syndrome Human immunodeficiency virus infection and acquired immunodeficiency syndrome (HIV/AIDS) is a spectrum of conditions caused by infection with the human immunodeficiency virus (HIV), a retrovirus. Following initial infection an individual ma ...
(AIDS), contains an
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding g ...
that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator "Tat" is dependent on an RNA regulatory element (TAR) located "downstream" (i.e. to-be transcribed at a later point in time) of the transcription initiation site. Mutations in the ''TARDBP'' gene are associated with neurodegenerative disorders including
frontotemporal lobar degeneration Frontotemporal lobar degeneration (FTLD) is a pathological process that occurs in frontotemporal dementia. It is characterized by atrophy in the frontal lobe and temporal lobe of the brain, with sparing of the parietal and occipital lobes. Commo ...
and amyotrophic lateral sclerosis (ALS). In particular, the TDP-43 mutants M337V and Q331K are being studied for their roles in ALS. Cytoplasmic TDP-43 pathology is the dominant histopathological feature of multisystem proteinopathy. The N-terminal domain, which contributes importantly to the aggregation of the C-terminal region, has a novel structure with two negatively charged loops.. A recent study has demonstrated that cellular stress can trigger the abnormal cytoplasmic mislocalisation of TDP-43 in spinal motor neurons in vivo, providing insight into how TDP-43 pathology may develop in sporadic ALS patients.


Figures


References


Further reading

* * * * * * * * * * * * * * * *


External links


GeneReviews/NCBI/NIH/UW entry on TARDBP-Related Amyotrophic Lateral Sclerosis
* {{PDB Gallery, geneid=23435 DNA-binding proteins