Titin
   HOME

TheInfoList



OR:

Titin (contraction for Titan protein) (also called connectin) is a
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
that in humans is encoded by the ''TTN''
gene In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
. Titin is a giant protein, greater than 1
µm The micrometre ( international spelling as used by the International Bureau of Weights and Measures; SI symbol: μm) or micrometer ( American spelling), also commonly known as a micron, is a unit of length in the International System of Uni ...
in length, that functions as a molecular
spring Spring(s) may refer to: Common uses * Spring (season), a season of the year * Spring (device), a mechanical device that stores energy * Spring (hydrology), a natural source of water * Spring (mathematics), a geometric surface in the shape of a h ...
that is responsible for the passive elasticity of
muscle Skeletal muscles (commonly referred to as muscles) are organs of the vertebrate muscular system and typically are attached by tendons to bones of a skeleton. The muscle cells of skeletal muscles are much longer than in the other types of mus ...
. It comprises 244 individually folded protein domains connected by unstructured
peptide Peptides (, ) are short chains of amino acids linked by peptide bonds. Long chains of amino acids are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides. ...
sequences. These domains unfold when the protein is stretched and refold when the tension is removed. Titin is important in the contraction of striated muscle tissues. It connects the Z line to the M line in the sarcomere. The protein contributes to force transmission at the Z line and resting tension in the I band region. It limits the range of motion of the sarcomere in tension, thus contributing to the passive stiffness of muscle. Variations in the sequence of titin between different types of striated muscle ( cardiac or
skeletal A skeleton is the structural frame that supports the body of an animal. There are several types of skeletons, including the exoskeleton, which is the stable outer shell of an organism, the endoskeleton, which forms the support structure inside ...
) have been correlated with differences in the mechanical properties of these muscles. Titin is the third most abundant protein in muscle (after
myosin Myosins () are a superfamily of motor proteins best known for their roles in muscle contraction and in a wide range of other motility processes in eukaryotes. They are ATP-dependent and responsible for actin-based motility. The first myosin (M ...
and
actin Actin is a family of globular multi-functional proteins that form microfilaments in the cytoskeleton, and the thin filaments in muscle fibrils. It is found in essentially all eukaryotic cells, where it may be present at a concentration of ov ...
), and an adult human contains approximately 0.5 kg of titin. With its length of ~27,000 to ~35,000
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
s (depending on the splice isoform), titin is the largest known
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
. Furthermore, the gene for titin contains the largest number of exons (363) discovered in any single gene, as well as the longest single exon (17,106 bp).


Discovery

Reiji Natori in 1954 was the first to propose an elastic structure in muscle fiber to account for the return to the resting state when muscles are stretched and then released. In 1977, Koscak Maruyama and coworkers isolated an elastic protein from muscle fiber that they called connectin. Two years later, Kuan Wang and coworkers identified a doublet band on electrophoresis gel corresponding to a high molecular weight, elastic protein that they named titin. Siegfried Labeit in 1990 isolated a partial
cDNA In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA (e.g., messenger RNA (mRNA) or microRNA (miRNA)) template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to express a sp ...
clone of titin. In 1995, Labeit and
Bernhard Kolmerer Bernhard is both a given name and a surname. Notable people with the name include: Given name *Bernhard of Saxe-Weimar (1604–1639), Duke of Saxe-Weimar * Bernhard, Prince of Saxe-Meiningen (1901–1984), head of the House of Saxe-Meiningen 194 ...
determined the cDNA sequence of human cardiac titin. Labeit and colleagues in 2001 determined the complete sequence of the human titin gene.


Genetics

The human gene encoding for titin is located on the long arm of chromosome 2 and contains 363 exons, which together code for 38,138
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
residues (4200 kDa). Within the gene are found a large number of PEVK (proline-glutamate-valine-lysine -abundant structural motifs) exons 84 to 99 nucleotides in length, which code for conserved 28- to 33-residue motifs that may represent structural units of the titin PEVK spring. The number of PEVK motifs in the titin gene appears to have increased during evolution, apparently modifying the genomic region responsible for titin's spring properties.


Isoforms

A number of titin
isoform A protein isoform, or "protein variant", is a member of a set of highly similar proteins that originate from a single gene or gene family and are the result of genetic differences. While many perform the same or similar biological roles, some is ...
s are produced in different striated muscle tissues as a result of alternative splicing. All but one of these isoforms are in the range of ~27,000 to ~36,000 amino acid residues in length. The exception is the small cardiac novex-3 isoform, which is only 5,604 amino acid residues in length. The following table lists the known titin isoforms:


Structure

Titin is the largest known protein; its human variant consists of 34,350
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha ...
s, with the
molecular weight A molecule is a group of two or more atoms held together by attractive forces known as chemical bonds; depending on context, the term may or may not include ions which satisfy this criterion. In quantum physics, organic chemistry, and bio ...
of the mature "canonical" isoform of the protein being approximately 3,816,030.05 Da. Its mouse homologue is even larger, comprising 35,213 amino acids with a molecular weight of 3,906,487.6 Da. It has a theoretical
isoelectric point The isoelectric point (pI, pH(I), IEP), is the pH at which a molecule carries no net electrical charge or is electrically neutral in the statistical mean. The standard nomenclature to represent the isoelectric point is pH(I). However, pI is also ...
of 6.02. The protein's
empirical Empirical evidence for a proposition is evidence, i.e. what supports or counters this proposition, that is constituted by or accessible to sense experience or experimental procedure. Empirical evidence is of central importance to the sciences and ...
chemical formula In chemistry, a chemical formula is a way of presenting information about the chemical proportions of atoms that constitute a particular chemical compound or molecule, using chemical element symbols, numbers, and sometimes also other symbols, ...
is C169,719H270,466N45,688O52,238S911. It has a theoretical instability index (II) of 42.38, classifying the protein as unstable. The protein's
in vivo Studies that are ''in vivo'' (Latin for "within the living"; often not italicized in English) are those in which the effects of various biological entities are tested on whole, living organisms or cells, usually animals, including humans, and p ...
half-life Half-life (symbol ) is the time required for a quantity (of substance) to reduce to half of its initial value. The term is commonly used in nuclear physics to describe how quickly unstable atoms undergo radioactive decay or how long stable ...
, the time it takes for half of the amount of protein in a cell to break down after its synthesis in the cell, is predicted to be approximately 30 hours (in
mammalian Mammals () are a group of vertebrate animals constituting the class Mammalia (), characterized by the presence of mammary glands which in females produce milk for feeding (nursing) their young, a neocortex (a region of the brain), fu ...
reticulocytes). The titin protein is located between the
myosin Myosins () are a superfamily of motor proteins best known for their roles in muscle contraction and in a wide range of other motility processes in eukaryotes. They are ATP-dependent and responsible for actin-based motility. The first myosin (M ...
thick filament and the Z disk. Titin consists primarily of a linear array of two types of modules, also referred to as protein domains (244 copies in total): type I fibronectin type III domain (132 copies) and type II
immunoglobulin domain The immunoglobulin domain, also known as the immunoglobulin fold, is a type of protein domain that consists of a 2-layer sandwich of 7-9 antiparallel β-strands arranged in two β-sheets with a Greek key topology, consisting of about 125 amino ac ...
(112 copies). However, the exact number of these domains is different in different species. This linear array is further organized into two regions: *
N-terminal The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the ami ...
I-band: acts as the elastic part of the molecule and is composed mainly of type II modules. More specifically the I-band contains two regions of tandem type II immunoglobulin domains on either side of a PEVK region that is rich in
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the p ...
(P), glutamate (E),
valine Valine (symbol Val or V) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated −NH3+ form under biological conditions), an α- carboxylic acid group (which is in the deprotona ...
(V) and lysine (K). * C-terminal A-band: is thought to act as a protein-ruler and is composed of alternating type I (Fn3) and II (Ig) modules with super-repeat segments. These have been shown to align to the 43 nm axial repeats of myosin thick filaments with immunoglobulin domains correlating to myosin crowns. The C-terminal region also contains a serine
kinase In biochemistry, a kinase () is an enzyme that catalyzes the transfer of phosphate groups from high-energy, phosphate-donating molecules to specific substrates. This process is known as phosphorylation, where the high-energy ATP molecule don ...
domain that is primarily known for adapting the muscle to mechanical strain. It is “stretch-sensitive” and helps repair overstretching of the sarcomere. The N-terminal (the Z-disc end) contains a "Z repeat" that recognizes
Actinin alpha 2 Alpha-actinin-2 is a protein which in humans is encoded by the ''ACTN2'' gene. This gene encodes an alpha-actinin isoform that is expressed in both skeletal and cardiac muscles and functions to anchor myofibrillar actin thin filaments and titin ...
. The elasticity of the PEVK region has both entropic and
enthalpic Enthalpy , a property of a thermodynamic system, is the sum of the system's internal energy and the product of its pressure and volume. It is a state function used in many measurements in chemical, biological, and physical systems at a constant p ...
contributions and is characterized by a polymer persistence length and a stretch modulus. At low to moderate extensions PEVK elasticity can be modeled with a standard worm-like chain (WLC) model of entropic elasticity. At high extensions PEVK stretching can be modeled with a modified WLC model that incorporates enthalpic elasticity. The difference between low-and high- stretch elasticity is due to electrostatic stiffening and hydrophobic effects. Embedded between the PEVK and Ig residues are N2A domains.


Evolution

The titin domains have evolved from a common ancestor through many gene duplication events. Domain duplication was facilitated by the fact that most domains are encoded by single exons. Other giant sarcomeric proteins made out of Fn3/Ig repeats include obscurin and myomesin. Throughout evolution, titin mechanical strength appears to decrease through the loss of disulfide bonds as the organism becomes heavier. Titin A-band has homologs in invertebrates, such as twitchin (unc-22) and projectin, which also contain Ig and FNIII repeats and a protein kinase domain. The gene duplication events took place independently but were from the same ancestral Ig and FNIII domains. It is said that the protein titin was the first to diverge out of the family. ''Drosophila'' projectin, officially known as bent (''bt''), is associated with lethality by failing to escape the egg in some mutations as well as dominant changes in wing angles. ''Drosophila'' Titin, also known as Kettin or sallimus (''sls''), is kinase-free. It has roles in the elasticity of both muscle and chromosomes. It is homologous to vertebrate titin I-band and contains Ig PEVK domains, the many repeats being a hot target for splicing. There also exists a titin homologue, ''ttn-1'', in ''
C. elegans ''Caenorhabditis elegans'' () is a free-living transparent nematode about 1 mm in length that lives in temperate soil environments. It is the type species of its genus. The name is a blend of the Greek ''caeno-'' (recent), ''rhabditis'' (r ...
''. It has a kinase domain, some Ig/Fn3 repeats, and PEVT repeats that are similarly elastic.


Function

Titin is a large abundant protein of striated muscle. Titin's primary functions are to stabilize the thick filament, center it between the thin filaments, prevent overstretching of the sarcomere, and to recoil the sarcomere like a spring after it is stretched. An N-terminal Z-disc region and a C-terminal M-line region bind to the Z-line and M-line of the sarcomere, respectively, so that a single titin molecule spans half the length of a sarcomere. Titin also contains binding sites for muscle-associated proteins so it serves as an adhesion template for the assembly of contractile machinery in muscle cells. It has also been identified as a structural protein for
chromosome A chromosome is a long DNA molecule with part or all of the genetic material of an organism. In most chromosomes the very long thin DNA fibers are coated with packaging proteins; in eukaryotic cells the most important of these proteins ar ...
s. Considerable variability exists in the I-band, the M-line and the Z-disc regions of titin. Variability in the I-band region contributes to the differences in elasticity of different titin isoforms and, therefore, to the differences in elasticity of different muscle types. Of the many titin variants identified, five are described with complete transcript information available. Dominant mutation in TTN causes predisposition to
hernia A hernia is the abnormal exit of tissue or an organ, such as the bowel, through the wall of the cavity in which it normally resides. Various types of hernias can occur, most commonly involving the abdomen, and specifically the groin. Groin her ...
s. Titin interacts with many sarcomeric proteins including: * Z line region: telethonin and alpha-actinin * I band region: calpain-3 and obscurin * M line region: myosin-binding protein C, calmodulin 1, CAPN3, and MURF1


Clinical relevance

Mutation In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, m ...
s anywhere within the unusually long sequence of this gene can cause
premature stop codon In genetics, a nonsense mutation is a point mutation in a sequence of DNA that results in a premature stop codon, or a ''nonsense codon'' in the transcribed mRNA, and in leading to a truncated, incomplete, and usually nonfunctional protein produ ...
s or other defects. Titin mutations are associated with hereditary
myopathy In medicine, myopathy is a disease of the muscle in which the muscle fibers do not function properly. This results in muscular weakness. ''Myopathy'' means muscle disease ( Greek : myo- ''muscle'' + patheia '' -pathy'' : ''suffering''). This me ...
with early respiratory failure, early-onset myopathy with fatal
cardiomyopathy Cardiomyopathy is a group of diseases that affect the heart muscle. Early on there may be few or no symptoms. As the disease worsens, shortness of breath, feeling tired, and swelling of the legs may occur, due to the onset of heart failure. ...
, core myopathy with heart disease, centronuclear myopathy, limb-girdle muscular dystrophy type 2J, familial dilated cardiomyopathy 9,
hypertrophic cardiomyopathy Hypertrophic cardiomyopathy (HCM, or HOCM when obstructive) is a condition in which the heart becomes thickened without an obvious cause. The parts of the heart most commonly affected are the interventricular septum and the ventricles. This r ...
and tibial muscular dystrophy. Further research also suggests that no genetically linked form of any
dystrophy Dystrophy is the degeneration of tissue, due to disease or malnutrition, most likely due to heredity. Types * Muscular dystrophy ** Duchenne muscular dystrophy ** Becker's muscular dystrophy ** Myotonic dystrophy * Reflex neurovascular d ...
or myopathy can be safely excluded from being caused by a mutation on the TTN gene. Truncating mutations in dilated cardiomyopathy patients are most commonly found in the A region; although truncations in the upstream I region might be expected to prevent translation of the A region entirely, alternative splicing creates some transcripts that do not encounter the premature stop codon, ameliorating its effect. mRNA splicing factors such as RBM20 and SLM2 ( KHDRBS3) were shown to mediated alternative mRNA splicing of titin mRNA contributing to the development of
heart failure Heart failure (HF), also known as congestive heart failure (CHF), is a syndrome, a group of signs and symptoms caused by an impairment of the heart's blood pumping function. Symptoms typically include shortness of breath, excessive fatigue, ...
due to cardiomyopathies. Autoantibodies to titin are produced in patients with the autoimmune disease Myasthenia gravis.


Interactions

Titin has been shown to
interact Advocates for Informed Choice, doing business as, dba interACT or interACT Advocates for Intersex Youth, is a 501(c)(3) nonprofit organization using innovative strategies to advocate for the legal and human rights of children with intersex trai ...
with: * ANK1, * ANKRD1, *
ANKRD23 Ankyrin repeat domain-containing protein 23 is a protein that in humans is encoded by the ''ANKRD23'' gene. This gene is a member of the muscle ankyrin repeat protein (MARP) family and encodes a protein with four tandem ankyrin-like repeats. The ...
* CAPN3, * FHL2, *
OBSCN Obscurin is a protein that in humans is encoded by the ''OBSCN'' gene. Obscurin belongs to the family of giant sarcomeric signaling proteins that includes titin and nebulin. Obscurin is expressed in cardiac and skeletal muscle, and plays a role i ...
, * TCAP, and *
TRIM63 E3 ubiquitin-protein ligase TRIM63, also known as "MuRF1" (Muscle Ring-Finger Protein-1), is an enzyme that in humans is encoded by the ''TRIM63'' gene. This gene encodes a member of the RING zinc finger protein family found in striated muscle a ...
.


Linguistic significance

The name titin is derived from the Greek
Titan Titan most often refers to: * Titan (moon), the largest moon of Saturn * Titans, a race of deities in Greek mythology Titan or Titans may also refer to: Arts and entertainment Fictional entities Fictional locations * Titan in fiction, fictiona ...
(a giant deity, anything of great size). As the largest known protein, titin also has the longest IUPAC name of a protein. The full chemical name of the human canonical form of titin, which starts '' methionyl...'' and ends ''...
isoleucine Isoleucine (symbol Ile or I) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depr ...
'', contains 189,819 letters and is sometimes stated to be the longest word in the English language, or of any language. However, lexicographers regard generic names of
chemical compound A chemical compound is a chemical substance composed of many identical molecules (or molecular entities) containing atoms from more than one chemical element held together by chemical bonds. A molecule consisting of atoms of only one element ...
s as ''verbal
formulae In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a '' chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betw ...
'' rather than English words.


References


Further reading

* * * * * * * * *


External links


GeneReviews/NIH/NCBI/UW entry on Familial Hypertrophic Cardiomyopathy Overview

GeneReviews/NCBI/NIH/UW entry on Udd Distal Myopathy, Tibial Muscular Dystrophy, Udd Myopathy

GeneReviews/NIH/NCBI/UW entry on Salih Myopathy or Early-Onset Myopathy with Fatal Cardiomyopathy

InterPro domain organization of titin
{{NLM content Structural proteins Genes on human chromosome 2 EC 2.7.11 Muscular system