
A signal peptide (sometimes referred to as signal sequence, targeting signal, localization signal, localization sequence, transit peptide, leader sequence or leader peptide) is a short peptide (usually 16–30 amino acids long) present at the N-terminus (or occasionally, nonclassically, at the C-terminus or internally) of most newly synthesized proteins that are destined for the secretory pathway. These proteins include those that reside inside certain organelles (the endoplasmic reticulum, Golgi or endosomes), those that are secreted from the cell, and those that are inserted into most cellular membranes. Although most type I membrane-bound proteins have signal peptides, most type II and multi-spanning membrane-bound proteins are targeted to the secretory pathway by their first transmembrane domain, which biochemically resembles a signal sequence except that it is not cleaved. Signal peptides are a kind of target peptide.


Function (translocation)

Signal peptides function to prompt a cell to translocate the protein, usually to the cellular membrane. In prokaryotes, signal peptides direct the newly synthesized protein to the SecYEG protein-conducting channel, which is present in the plasma membrane. A homologous system exists in eukaryotes, where the signal peptide directs the newly synthesized protein to the Sec61 channel, which shares structural and sequence homology with SecYEG but is present in the endoplasmic reticulum. Both the SecYEG and Sec61 channels are commonly referred to as the translocon, and transit through this channel is known as translocation. While secreted proteins are threaded through the channel, transmembrane domains may diffuse across a lateral gate in the translocon to partition into the surrounding membrane.


Structure

The core of the signal peptide contains a long stretch of hydrophobic amino acids (about 5–16 residues long) that has a tendency to form a single alpha-helix and is also referred to as the "h-region". In addition, many signal peptides begin with a short positively charged stretch of amino acids, which may help to enforce proper topology of the polypeptide during translocation by what is known as the positive-inside rule. Because of its proximity to the N-terminus, this stretch is called the "n-region". At the end of the signal peptide there is typically a stretch of amino acids that is recognized and cleaved by signal peptidase and is therefore named the cleavage site. This cleavage site is absent from transmembrane domains that serve as signal peptides, which are sometimes referred to as signal anchor sequences. Signal peptidase may cleave either during or after completion of translocation to generate a free signal peptide and a mature protein. The free signal peptides are then digested by specific proteases.

Moreover, different types of signal peptides direct proteins to different target locations. For example, a target peptide directed to the mitochondrial environment differs in length and shows an alternating pattern of small positively charged and hydrophobic stretches. Nuclear targeting signals can be found at both the N-terminus and the C-terminus of a protein and are in most cases retained in the mature protein.
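
The n-region and h-region described above can be illustrated with a rough sketch. The code below is only a toy heuristic, not a prediction method such as SignalP: it assumes a simple set of hydrophobic residues (`HYDROPHOBIC`), a 30-residue search window, and a made-up example sequence, all of which are illustrative choices rather than anything taken from the literature.

```python
# Toy heuristic: locate the longest hydrophobic run near the N-terminus
# (a stand-in for the h-region) and count positive charges before it
# (a stand-in for the n-region). The residue sets are assumptions.
HYDROPHOBIC = set("AILMFVWC")
POSITIVE = set("KR")


def longest_hydrophobic_run(seq, window=30):
    """Return (start, end) of the longest hydrophobic run in seq[:window]."""
    best = (0, 0)
    start = None
    prefix = seq[:window].upper()
    # The trailing "-" is a sentinel so a run ending at the window edge is closed.
    for i, aa in enumerate(prefix + "-"):
        if aa in HYDROPHOBIC:
            if start is None:
                start = i
        else:
            if start is not None and i - start > best[1] - best[0]:
                best = (start, i)
            start = None
    return best


def describe_regions(seq):
    """Split a sequence into a putative n-region and h-region."""
    h_start, h_end = longest_hydrophobic_run(seq)
    n_region = seq[:h_start]
    return {
        "n_region": n_region,
        "h_region": seq[h_start:h_end],
        "n_positive_charges": sum(aa in POSITIVE for aa in n_region.upper()),
    }


# Hypothetical example sequence, used only to exercise the functions.
regions = describe_regions("MKKTAIAIAVALAGFATVAQA")
print(regions)
```

For this example the heuristic reports a four-residue n-region carrying two positive charges, followed by a nine-residue hydrophobic core, which falls within the 5–16 residue range quoted above. Real signal peptides need a trained predictor, since hydrophobic runs alone do not distinguish them from transmembrane helices.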


Co-translational versus post-translational translocation

In both prokaryotes and eukaryotes, signal sequences may act co-translationally or post-translationally. The co-translational pathway is initiated when the signal peptide emerges from the ribosome and is recognized by the signal-recognition particle (SRP). SRP then halts further translation (translational arrest occurs only in eukaryotes) and directs the signal sequence-ribosome-mRNA complex to the SRP receptor, which is present on the surface of either the plasma membrane (in prokaryotes) or the ER (in eukaryotes). Once membrane targeting is completed, the signal sequence is inserted into the translocon. Ribosomes are then physically docked onto the cytoplasmic face of the translocon and protein synthesis resumes. The post-translational pathway is initiated after protein synthesis is completed. In prokaryotes, the signal sequence of post-translational substrates is recognized by the SecB chaperone protein, which transfers the protein to the SecA ATPase, which in turn pumps the protein through the translocon. Although post-translational translocation is known to occur in eukaryotes, it is poorly understood. It is known that in yeast, post-translational translocation requires the translocon and two additional membrane-bound proteins, Sec62 and Sec63.
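
The four combinations described above can be summarized as a small lookup table. This is only a sketch that restates the paragraph: the `Route` type, the `ROUTES` dictionary and the key strings are hypothetical names chosen for illustration, and the eukaryotic post-translational entry reflects only what the text says about yeast.

```python
from typing import NamedTuple, Optional, Tuple


class Route(NamedTuple):
    recognized_by: Optional[str]   # factor that first binds the signal sequence
    channel: str                   # translocon used
    membrane: str                  # membrane that holds the translocon
    other_factors: Tuple[str, ...]


# Keys are (cell type, timing); values restate the text above.
ROUTES = {
    ("prokaryote", "co-translational"): Route(
        "SRP", "SecYEG", "plasma membrane", ("SRP receptor",)),
    ("prokaryote", "post-translational"): Route(
        "SecB", "SecYEG", "plasma membrane", ("SecA ATPase",)),
    ("eukaryote", "co-translational"): Route(
        "SRP", "Sec61", "endoplasmic reticulum", ("SRP receptor",)),
    # Poorly understood; the text only names the yeast requirements.
    ("eukaryote", "post-translational"): Route(
        None, "Sec61", "endoplasmic reticulum", ("Sec62", "Sec63")),
}


def has_translational_arrest(cell_type, timing):
    """Per the text, SRP-mediated arrest occurs only in eukaryotes."""
    return (cell_type, timing) == ("eukaryote", "co-translational")


for key, route in ROUTES.items():
    print(key, "->", route.channel, "in the", route.membrane)
```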


Secretion efficiency determination

Signal peptides are extremely heterogeneous. Many prokaryotic and eukaryotic signal peptides are functionally interchangeable within or between species, and all determine protein secretion efficiency.


Nucleotide level features

In vertebrates, the region of the mRNA that codes for the signal peptide (i.e. the signal sequence coding region, or SSCR) can function as an RNA element with specific activities. SSCRs promote nuclear mRNA export and the proper localization to the surface of the endoplasmic reticulum. In addition, SSCRs have specific sequence features: they have low adenine content, are enriched in certain motifs, and tend to be present in the first exon at a frequency that is higher than expected.
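
The low adenine content mentioned above is straightforward to measure for a given coding sequence. The following is a minimal sketch, assuming the signal peptide length in residues is already known; the function names and the example sequence are invented for illustration and do not come from any real gene.

```python
def adenine_fraction(seq):
    """Fraction of positions in a nucleotide sequence that are adenine."""
    seq = seq.upper()
    return seq.count("A") / len(seq) if seq else 0.0


def compare_sscr(cds, signal_peptide_length):
    """Return the adenine fraction of the SSCR and of the remaining coding sequence.

    `cds` is a coding sequence starting at the start codon (DNA or RNA
    alphabet; only A is counted). `signal_peptide_length` is in amino acids,
    so the SSCR spans the first 3 * signal_peptide_length nucleotides.
    """
    sscr_len = 3 * signal_peptide_length
    return adenine_fraction(cds[:sscr_len]), adenine_fraction(cds[sscr_len:])


# Made-up sequence: a 4-codon "SSCR" followed by 4 adenine-rich codons.
sscr_a, rest_a = compare_sscr("ATGCTGCTGGCC" + "AAAGAAGATAAA", 4)
print(f"SSCR adenine fraction: {sscr_a:.2f}, rest: {rest_a:.2f}")
```

A single comparison like this says nothing on its own; the claim in the text is statistical and concerns SSCRs across many vertebrate genes.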


Alternate secretion mechanisms

Proteins without signal peptides, such as interleukin and galectin, can also be secreted by unconventional mechanisms. The process by which such secretory proteins gain access to the cell exterior is termed unconventional protein secretion (UPS). In plants, as much as 50% of secreted proteins can be UPS-dependent.


Nonclassical sequences

Signal peptides are usually located at the N-terminus of proteins. Some proteins have C-terminal or internal signal peptides (examples: peroxisomal targeting signal and nuclear localisation signal). The structure of these nonclassical signal peptides differs vastly from that of the N-terminal signal peptides.


Nomenclature

Signal peptides are not to be confused with the leader peptides sometimes encoded by leader mRNA, although both are sometimes ambiguously referred to as "leader peptides". These other leader peptides are short polypeptides that do not function in protein localization, but instead may regulate transcription or translation of the main protein, and are not part of the final protein sequence. This type of leader peptide primarily refers to a form of gene regulation found in bacteria, although a similar mechanism, based on upstream open reading frames (uORFs), is used to regulate eukaryotic genes.


Signal peptide as a therapeutic target

The signal peptide is a potential antiviral therapeutic target. Signal peptides with a penultimate N-terminal glycine are a target for NMT inhibitors, which inhibit the myristoylation of signal peptides and target the signal peptide for degradation, which in turn affects virus-cell fusion.


See also

* Protein targeting
* Target peptide
* Topogenic sequence


External links

* SPdb (Signal Peptide DataBase)
* SignalP: predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms.