HOME

TheInfoList



OR:

Protein production is the biotechnological process of generating a specific
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
. It is typically achieved by the manipulation of
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. T ...
in an organism such that it expresses large amounts of a
recombinant gene Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be fo ...
. This includes the transcription of the
recombinant DNA Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) that bring together genetic material from multiple sources, creating sequences that would not otherwise be f ...
to messenger RNA (
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein. mRNA is created during the ...
), the
translation Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''transla ...
of mRNA into polypeptide chains, which are ultimately folded into functional
proteins Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
and may be targeted to specific subcellular or extracellular locations. Protein production systems (also known as
expression system Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. The ...
s) are used in the
life sciences This list of life sciences comprises the branches of science that involve the scientific study of life – such as microorganisms, plants, and animals including human beings. This science is one of the two major branches of natural science, th ...
,
biotechnology Biotechnology is the integration of natural sciences and engineering sciences in order to achieve the application of organisms, cells, parts thereof and molecular analogues for products and services. The term ''biotechnology'' was first used ...
, and
medicine Medicine is the science and practice of caring for a patient, managing the diagnosis, prognosis, prevention, treatment, palliation of their injury or disease, and promoting their health. Medicine encompasses a variety of health care pr ...
.
Molecular biology Molecular biology is the branch of biology that seeks to understand the molecular basis of biological activity in and between cells, including biomolecular synthesis, modification, mechanisms, and interactions. The study of chemical and phys ...
research uses numerous proteins and enzymes, many of which are from expression systems; particularly
DNA polymerase A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to crea ...
for PCR,
reverse transcriptase A reverse transcriptase (RT) is an enzyme used to generate complementary DNA (cDNA) from an RNA template, a process termed reverse transcription. Reverse transcriptases are used by viruses such as HIV and hepatitis B to replicate their genom ...
for RNA analysis, restriction endonucleases for cloning, and to make proteins that are screened in
drug discovery In the fields of medicine, biotechnology and pharmacology, drug discovery is the process by which new candidate medications are discovered. Historically, drugs were discovered by identifying the active ingredient from traditional remedies or b ...
as biological targets or as potential drugs themselves. There are also significant applications for expression systems in industrial fermentation, notably the production of biopharmaceuticals such as human
insulin Insulin (, from Latin ''insula'', 'island') is a peptide hormone produced by beta cells of the pancreatic islets encoded in humans by the ''INS'' gene. It is considered to be the main anabolic hormone of the body. It regulates the metabolism ...
to treat
diabetes Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by a high blood sugar level ( hyperglycemia) over a prolonged period of time. Symptoms often include frequent urination, increased thirst and increased ...
, and to manufacture enzymes.


Protein production systems

Commonly used protein production systems include those derived from
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
,
yeast Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are estimated to constit ...
, baculovirus/
insect Insects (from Latin ') are pancrustacean hexapod invertebrates of the class Insecta. They are the largest group within the arthropod phylum. Insects have a chitinous exoskeleton, a three-part body ( head, thorax and abdomen), three pa ...
,
mammalian Mammals () are a group of vertebrate animals constituting the class Mammalia (), characterized by the presence of mammary glands which in females produce milk for feeding (nursing) their young, a neocortex (a region of the brain), fu ...
cells, and more recently filamentous fungi such as '' Myceliophthora thermophila''. When biopharmaceuticals are produced with one of these systems, process-related impurities termed
host cell protein Host cell proteins (HCPs) are process-related protein impurities that are produced by the host organism during biotherapeutic manufacturing and production. During the purification process, a majority of produced HCPs are removed from the final pro ...
s also arrive in the final product in trace amounts.


Cell-based systems

The oldest and most widely used expression systems are cell-based and may be defined as the "''combination of an
expression vector An expression vector, otherwise known as an expression construct, is usually a plasmid or virus designed for gene expression in cells. The vector is used to introduce a specific gene into a target cell, and can commandeer the cell's mechanism for ...
, its cloned DNA, and the host for the vector that provide a context to allow foreign gene function in a host cell, that is, produce proteins at a high level''". Overexpression is an abnormally and excessively high level of
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. T ...
which produces a pronounced gene-related
phenotype In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (biology), morphology or physical form and structure, its Developmental biology, developmental proc ...
. There are many ways to introduce foreign DNA to a cell for expression, and many different host cells may be used for expression — each expression system has distinct advantages and liabilities. Expression systems are normally referred to by the host and the DNA source or the delivery mechanism for the genetic material. For example, common hosts are
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were am ...
(such as ''
E.coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Esche ...
'', ''
B. subtilis ''Bacillus subtilis'', known also as the hay bacillus or grass bacillus, is a Gram-positive, catalase-positive bacterium, found in soil and the gastrointestinal tract of ruminants, humans and marine sponges. As a member of the genus ''Bacillus' ...
''),
yeast Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are estimated to constit ...
(such as '' S.cerevisiae'') or eukaryotic cell lines. Common DNA sources and delivery mechanisms are
virus A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsk ...
es (such as baculovirus,
retrovirus A retrovirus is a type of virus that inserts a DNA copy of its RNA genome into the DNA of a host cell that it invades, thus changing the genome of that cell. Once inside the host cell's cytoplasm, the virus uses its own reverse transcriptas ...
, adenovirus),
plasmid A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria; how ...
s,
artificial chromosome Artificial chromosome may refer to: * Yeast artificial chromosome * Bacterial artificial chromosome * Human artificial chromosome * P1-derived artificial chromosome A P1-derived artificial chromosome, or PAC, is a DNA construct derived from the DN ...
s and
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bac ...
(such as
lambda Lambda (}, ''lám(b)da'') is the 11th letter of the Greek alphabet, representing the voiced alveolar lateral approximant . In the system of Greek numerals, lambda has a value of 30. Lambda is derived from the Phoenician Lamed . Lambda gave ri ...
). The best expression system depends on the
gene In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
involved, for example the ''
Saccharomyces cerevisiae ''Saccharomyces cerevisiae'' () (brewer's yeast or baker's yeast) is a species of yeast (single-celled fungus microorganisms). The species has been instrumental in winemaking, baking, and brewing since ancient times. It is believed to have b ...
'' is often preferred for proteins that require significant
posttranslational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribos ...
.
Insect Insects (from Latin ') are pancrustacean hexapod invertebrates of the class Insecta. They are the largest group within the arthropod phylum. Insects have a chitinous exoskeleton, a three-part body ( head, thorax and abdomen), three pa ...
or
mammal Mammals () are a group of vertebrate animals constituting the class Mammalia (), characterized by the presence of mammary glands which in females produce milk for feeding (nursing) their young, a neocortex (a region of the brain), fur ...
cell lines are used when human-like splicing of mRNA is required. Nonetheless, bacterial expression has the advantage of easily producing large amounts of protein, which is required for
X-ray crystallography X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
or nuclear magnetic resonance experiments for structure determination. Because bacteria are
prokaryote A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Con ...
s, they are not equipped with the full enzymatic machinery to accomplish the required post-translational modifications or molecular folding. Hence, multi-domain eukaryotic proteins expressed in bacteria often are non-functional. Also, many proteins become insoluble as inclusion bodies that are difficult to recover without harsh denaturants and subsequent cumbersome protein-refolding. To address these concerns, expressions systems using multiple eukaryotic cells were developed for applications requiring the proteins be conformed as in, or closer to eukaryotic organisms: cells of plants (i.e. tobacco), of insects or mammalians (i.e. bovines) are transfected with genes and cultured in suspension and even as tissues or whole organisms, to produce fully folded proteins. Mammalian ''
in vivo Studies that are ''in vivo'' (Latin for "within the living"; often not italicized in English) are those in which the effects of various biological entities are tested on whole, living organisms or cells, usually animals, including humans, and p ...
'' expression systems have however low yield and other limitations (time-consuming, toxicity to host cells,..). To combine the high yield/productivity and scalable protein features of bacteria and yeast, and advanced epigenetic features of plants, insects and mammalians systems, other protein production systems are developed using unicellular eukaryotes (i.e. non-pathogenic '''
Leishmania ''Leishmania'' is a parasitic protozoan, a single-celled organism of the genus '' Leishmania'' that are responsible for the disease leishmaniasis. They are spread by sandflies of the genus ''Phlebotomus'' in the Old World, and of the genus ' ...
''' cells).


Bacterial systems


= ''Escherichia coli''

= '' E. coli'' is one of the most widely used expression hosts, and DNA is normally introduced in a
plasmid A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria; how ...
expression vector. The techniques for overexpression in ''E. coli'' are well developed and work by increasing the number of copies of the gene or increasing the binding strength of the promoter region so assisting transcription. For example, a DNA sequence for a protein of interest could be
cloned Cloning is the process of producing individual organisms with identical or virtually identical DNA, either by natural or artificial means. In nature, some organisms produce clones through asexual reproduction. In the field of biotechnology, ...
or
subcloned In molecular biology, subcloning is a technique used to move a particular DNA sequence from a ''parent vector'' to a ''destination vector''. Subcloning is not to be confused with molecular cloning, a related technique. Procedure Restriction e ...
into a high copy-number plasmid containing the '' lac'' (often LacUV5) promoter, which is then transformed into the bacterium ''E. coli''. Addition of IPTG (a
lactose Lactose is a disaccharide sugar synthesized by galactose and glucose subunits and has the molecular formula C12H22O11. Lactose makes up around 2–8% of milk (by mass). The name comes from ' (gen. '), the Latin word for milk, plus the suffix ' ...
analog) activates the lac promoter and causes the bacteria to express the protein of interest. ''E. coli'' strain BL21 and BL21(DE3) are two strains commonly used for protein production. As members of the B lineage, they lack ''
lon Lon or LON may refer to: People * Lon (photographer), pseudonym of Alonzo Hanagan, also known as "Lon of New York" * Lon (name), a list of people with the given name, nickname or surname Fictional characters * Lon Cohen, a character in the Ne ...
'' and '' OmpT'' proteases, protecting the produced proteins from degradation. The DE3 prophage found in BL21(DE3) provides T7 RNA polymerase (driven by the LacUV5 promoter), allowing for vectors with the T7 promoter to be used instead.


= ''Corynebacterium''

= Non-pathogenic species of the gram-positive ''
Corynebacterium ''Corynebacterium'' () is a genus of Gram-positive bacteria and most are aerobic. They are bacilli (rod-shaped), and in some phases of life they are, more specifically, club-shaped, which inspired the genus name ('' coryneform'' means "club- ...
'' are used for the commercial production of various amino acids. The '' C. glutamicum'' species is widely used for producing glutamate and lysine, components of human food, animal feed and pharmaceutical products. Expression of functionally active human
epidermal growth factor Epidermal growth factor (EGF) is a protein that stimulates cell growth and differentiation by binding to its receptor, EGFR. Human EGF is 6-k Da and has 53 amino acid residues and three intramolecular disulfide bonds. EGF was originally de ...
has been done in ''C. glutamicum'', thus demonstrating a potential for industrial-scale production of human proteins. Expressed proteins can be targeted for secretion through either the general, secretory pathway (Sec) or the
twin-arginine translocation pathway The twin-arginine translocation pathway (Tat pathway) is a protein export, or secretion pathway found in plants, bacteria, and archaea. In contrast to the Sec pathway which transports proteins in an unfolded manner, the Tat pathway serves to ac ...
(Tat). Unlike gram-negative bacteria, the gram-positive ''Corynebacterium'' lack lipopolysaccharides that function as antigenic endotoxins in humans.


= ''Pseudomonas fluorescens''

= The non-pathogenic and gram-negative bacteria, '' Pseudomonas fluorescens'', is used for high level production of recombinant proteins; commonly for the development bio-therapeutics and vaccines. '' P. fluorescens'' is a metabolically versatile organism, allowing for high throughput screening and rapid development of complex proteins. ''P. fluorescens'' is most well known for its ability to rapid and successfully produce high titers of active, soluble protein.


Eukaryotic systems


= Yeasts

= Expression systems using either ''
S. cerevisiae ''Saccharomyces cerevisiae'' () (brewer's yeast or baker's yeast) is a species of yeast (single-celled fungus microorganisms). The species has been instrumental in winemaking, baking, and brewing since ancient times. It is believed to have been o ...
'' or '' Pichia pastoris'' allow stable and lasting production of proteins that are processed similarly to mammalian cells, at high yield, in chemically defined media of proteins.


= Filamentous fungi

= Filamentous fungi, especially '' Aspergillus'' and ''
Trichoderma ''Trichoderma'' is a genus of fungi in the family Hypocreaceae that is present in all soils, where they are the most prevalent culturable fungi. Many species in this genus can be characterized as opportunistic avirulent plant symbionts. This ref ...
'', but also more recently '' Myceliophthora thermophila'' C1 have been developed into expression platforms for screening and production of diverse
industrial enzymes Industrial enzymes are enzymes that are commercially used in a variety of industries such as pharmaceuticals, chemical production, biofuels, food & beverage, and consumer products. Due to advancements in recent years, biocatalysis through isolated ...
. The expression system C1 shows a low viscosity morphology in submerged culture, enabling the use of complex growth and production media.


= ''Baculovirus''-infected cells

= Baculovirus-infected insect cells (
Sf9 SF9 ( ko, 에스에프나인 ; shortened from Sensational Feeling 9) is a South Korean boy band formed by FNC Entertainment and the company's first dance boy group. Consisting of nine members, the group debuted on October 5, 2016 with the releas ...
, Sf21, High Five strains) or mammalian cells ( HeLa, HEK 293) allow production of glycosylated or membrane proteins that cannot be produced using fungal or bacterial systems. It is useful for production of proteins in high quantity. Genes are not expressed continuously because infected host cells eventually lyse and die during each infection cycle.


= Non-lytic insect cell expression

= Non-lytic insect cell expression is an alternative to the lytic baculovirus expression system. In non-lytic expression, vectors are transiently or stably transfected into the chromosomal DNA of insect cells for subsequent gene expression. This is followed by selection and screening of recombinant clones. The non-lytic system has been used to give higher protein yield and quicker expression of recombinant genes compared to baculovirus-infected cell expression. Cell lines used for this system include:
Sf9 SF9 ( ko, 에스에프나인 ; shortened from Sensational Feeling 9) is a South Korean boy band formed by FNC Entertainment and the company's first dance boy group. Consisting of nine members, the group debuted on October 5, 2016 with the releas ...
, Sf21 from '' Spodoptera frugiperda'' cells, Hi-5 from ''
Trichoplusia ni The cabbage looper (''Trichoplusia ni'') is a medium-sized moth in the family Noctuidae, a family commonly referred to as owlet moths. Its common name comes from its preferred host plants and distinctive crawling behavior. Cruciferous vegetabl ...
'' cells, and Schneider 2 cells and Schneider 3 cells from ''
Drosophila melanogaster ''Drosophila melanogaster'' is a species of fly (the taxonomic order Diptera) in the family Drosophilidae. The species is often referred to as the fruit fly or lesser fruit fly, or less commonly the " vinegar fly" or "pomace fly". Starting with ...
'' cells. With this system, cells do not lyse and several cultivation modes can be used. Additionally, protein production runs are reproducible. This system gives a homogeneous product. A drawback of this system is the requirement of an additional screening step for selecting viable clones.


= '' Excavata''

= ''
Leishmania ''Leishmania'' is a parasitic protozoan, a single-celled organism of the genus '' Leishmania'' that are responsible for the disease leishmaniasis. They are spread by sandflies of the genus ''Phlebotomus'' in the Old World, and of the genus ' ...
tarentolae'' (cannot infect mammals) expression systems allow stable and lasting production of proteins at high yield, in chemically defined media. Produced proteins exhibit fully eukaryotic post-translational modifications, including
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not al ...
and disulfide bond formation.


= Mammalian systems

= The most common mammalian expression systems are Chinese Hamster
ovary The ovary is an organ in the female reproductive system that produces an ovum. When released, this travels down the fallopian tube into the uterus, where it may become fertilized by a sperm. There is an ovary () found on each side of the body. ...
(CHO) and Human embryonic kidney (HEK) cells. * Chinese hamster ovary cell *
Mouse A mouse ( : mice) is a small rodent. Characteristically, mice are known to have a pointed snout, small rounded ears, a body-length scaly tail, and a high breeding rate. The best known mouse species is the common house mouse (''Mus musculus' ...
myeloma lymphoblstoid (e.g. NS0 cell) * Fully Human ** Human embryonic kidney cells ( HEK-293) ** Human embryonic retinal cells (Crucell's Per.C6) ** Human amniocyte cells (Glycotope and CEVEC)


Cell-free systems

Cell-free production of proteins is performed ''in vitro'' using purified RNA polymerase, ribosomes, tRNA and ribonucleotides. These reagents may be produced by extraction from cells or from a cell-based expression system. Due to the low expression levels and high cost of cell-free systems, cell-based systems are more widely used.


See also

*
Cellosaurus Cellosaurus is an online knowledge base on cell lines, which attempts to document all cell lines used in biomedical research. It is provided by the Swiss Institute of Bioinformatics (SIB). It is an ELIXIR Core Data Resource as well as an IR ...
, a database of cell lines *
Gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. T ...
*
Single-cell protein Single-cell proteins (SCP) or microbial proteins refer to edible unicellular microorganisms. The biomass or protein extract from pure or mixed cultures of algae, yeasts, fungi or bacteria may be used as an ingredient or a substitute for protein-ric ...
*
Protein purification Protein purification is a series of processes intended to isolate one or a few proteins from a complex mixture, usually cells, tissues or whole organisms. Protein purification is vital for the specification of the function, structure and interact ...
* Precision fermentation *
Host cell protein Host cell proteins (HCPs) are process-related protein impurities that are produced by the host organism during biotherapeutic manufacturing and production. During the purification process, a majority of produced HCPs are removed from the final pro ...
* List of recombinant proteins


References


Further reading

* *


External links

{{Microorganisms Gene expression Biotechnology