Coli Bacilli
   HOME

TheInfoList



OR:

''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative,
facultative anaerobic A facultative anaerobic organism is an organism that makes ATP by aerobic respiration if oxygen is present, but is capable of switching to fermentation if oxygen is absent. Some examples of facultatively anaerobic bacteria are '' Staphylococc ...
,
rod-shaped A bacillus (), also called a bacilliform bacterium or often just a rod (when the context makes the sense clear), is a rod-shaped bacterium or archaeon. Bacilli are found in many different taxonomic groups of bacteria. However, the name ''Bacillu ...
, coliform bacterium of the genus '' Escherichia'' that is commonly found in the lower
intestine The gastrointestinal tract (GI tract, digestive tract, alimentary canal) is the tract or passageway of the digestive system that leads from the mouth to the anus. The GI tract contains all the major organs of the digestive system, in humans ...
of warm-blooded organisms. Most ''E. coli'' strains are harmless, but some
serotype A serotype or serovar is a distinct variation within a species of bacteria or virus or among immune cells of different individuals. These microorganisms, viruses, or cells are classified together based on their surface antigens, allowing the epi ...
s ( EPEC, ETEC etc.) can cause serious
food poisoning Foodborne illness (also foodborne disease and food poisoning) is any illness resulting from the spoilage of contaminated food by pathogenic bacteria, viruses, or parasites that contaminate food, as well as prions (the agents of mad cow disease) ...
in their hosts, and are occasionally responsible for
food contamination Food contamination refers to the presence of harmful chemicals and microorganisms in food, which can cause consumer illness. This article addresses the chemical contamination of foods, as opposed to microbiological contamination, which can be found ...
incidents that prompt product recalls. Most strains do not cause disease in humans and are part of the normal microbiota of the gut; such strains are harmless or even beneficial to humans (although these strains tend to be less studied than the pathogenic ones). For example, some strains of ''E. coli'' benefit their hosts by producing vitamin K2 or by preventing the colonization of the intestine by
pathogenic bacteria Pathogenic bacteria are bacteria that can cause disease. This article focuses on the bacteria that are pathogenic to humans. Most species of bacteria are harmless and are often Probiotic, beneficial but others can cause infectious diseases. The n ...
. These mutually beneficial relationships between ''E. coli'' and humans are a type of mutualistic biological relationship — where both the humans and the ''E. coli'' are benefitting each other. ''E. coli'' is expelled into the environment within fecal matter. The bacterium grows massively in fresh faecal matter under aerobic conditions for three days, but its numbers decline slowly afterwards. ''E. coli'' and other facultative anaerobes constitute about 0.1% of gut microbiota, and fecal–oral transmission is the major route through which pathogenic strains of the bacterium cause disease. Cells are able to survive outside the body for a limited amount of time, which makes them potential indicator organisms to test environmental samples for
fecal contamination Feces ( or faeces), known colloquially and in slang as poo and poop, are the solid or semi-solid remains of food that was not digested in the small intestine, and has been broken down by bacteria in the large intestine. Feces contain a relat ...
. A growing body of research, though, has examined environmentally persistent ''E. coli'' which can survive for many days and grow outside a host. The bacterium can be grown and cultured easily and inexpensively in a laboratory setting, and has been intensively investigated for over 60 years. ''E. coli'' is a chemoheterotroph whose chemically defined medium must include a source of carbon and energy. ''E. coli'' is the most widely studied prokaryotic
model organism A model organism (often shortened to model) is a non-human species that is extensively studied to understand particular biological phenomena, with the expectation that discoveries made in the model organism will provide insight into the workin ...
, and an important species in the fields of biotechnology and
microbiology Microbiology () is the scientific study of microorganisms, those being unicellular (single cell), multicellular (cell colony), or acellular (lacking cells). Microbiology encompasses numerous sub-disciplines including virology, bacteriology, prot ...
, where it has served as the host organism for the majority of work with recombinant DNA. Under favourable conditions, it takes as little as 20 minutes to reproduce.


Biology and biochemistry


Type and morphology

''E. coli'' is a Gram-negative, facultative anaerobe, nonsporulating coliform bacterium. Cells are typically rod-shaped, and are about 2.0
μm The micrometre ( international spelling as used by the International Bureau of Weights and Measures; SI symbol: μm) or micrometer (American spelling), also commonly known as a micron, is a unit of length in the International System of Unit ...
long and 0.25–1.0 μm in diameter, with a cell volume of 0.6–0.7 μm3. Antibiotics can effectively treat ''E. coli'' infections outside the digestive tract and most intestinal infections but are not used to treat intestinal infections by one strain of these bacteria. The
flagella A flagellum (; ) is a hairlike appendage that protrudes from certain plant and animal sperm cells, and from a wide range of microorganisms to provide motility. Many protists with flagella are termed as flagellates. A microorganism may have f ...
which allow the bacteria to swim have a
peritrichous A flagellum (; ) is a hairlike appendage that protrudes from certain plant and animal sperm cells, and from a wide range of microorganisms to provide motility. Many protists with flagella are termed as flagellates. A microorganism may have f ...
arrangement. It also attaches and effaces to the microvilli of the intestines via an adhesion molecule known as intimin.


Metabolism

''E. coli'' can live on a wide variety of substrates and uses mixed acid fermentation in anaerobic conditions, producing
lactate Lactate may refer to: * Lactation, the secretion of milk from the mammary glands * Lactate, the conjugate base of lactic acid Lactic acid is an organic acid. It has a molecular formula . It is white in the solid state and it is miscible with ...
, succinate, ethanol,
acetate An acetate is a salt (chemistry), salt formed by the combination of acetic acid with a base (e.g. Alkali metal, alkaline, Alkaline earth metal, earthy, Transition metal, metallic, nonmetallic or radical Radical (chemistry), base). "Acetate" als ...
, and carbon dioxide. Since many pathways in mixed-acid fermentation produce hydrogen gas, these pathways require the levels of hydrogen to be low, as is the case when ''E. coli'' lives together with hydrogen-consuming organisms, such as
methanogen Methanogens are microorganisms that produce methane as a metabolic byproduct in hypoxic conditions. They are prokaryotic and belong to the domain Archaea. All known methanogens are members of the archaeal phylum Euryarchaeota. Methanogens are com ...
s or
sulphate-reducing bacteria Sulfate-reducing microorganisms (SRM) or sulfate-reducing prokaryotes (SRP) are a group composed of sulfate-reducing bacteria (SRB) and sulfate-reducing archaea (SRA), both of which can perform anaerobic respiration utilizing sulfate () as termina ...
. In addition, ''E. coli''s metabolism can be rewired to solely use CO2 as the source of carbon for biomass production. In other words, this obligate heterotroph's metabolism can be altered to display autotrophic capabilities by heterologously expressing
carbon fixation Biological carbon fixation or сarbon assimilation is the process by which inorganic carbon (particularly in the form of carbon dioxide) is converted to organic compounds by living organisms. The compounds are then used to store energy and as ...
genes as well as
formate dehydrogenase Formate dehydrogenases are a set of enzymes that catalyse the oxidation of formate to carbon dioxide, donating the electrons to a second substrate, such as NAD+ in formate:NAD+ oxidoreductase () or to a cytochrome in formate:ferricytochrome-b1 o ...
and conducting laboratory evolution experiments. This may be done by using
formate Formate (IUPAC name: methanoate) is the conjugate base of formic acid. Formate is an anion () or its derivatives such as ester of formic acid. The salts and esters are generally colorless.Werner Reutemann and Heinz Kieczka "Formic Acid" in ''Ull ...
to reduce electron carriers and supply the ATP required in anabolic pathways inside of these synthetic autotrophs. ''E. coli'' have three native glycolytic pathways: EMPP, EDP, and OPPP. The EMPP employs ten enzymatic steps to yield two
pyruvates Pyruvic acid (CH3COCOOH) is the simplest of the alpha-keto acids, with a carboxylic acid and a ketone functional group. Pyruvate, the conjugate base, CH3COCOO−, is an intermediate in several metabolic pathways throughout the cell. Pyruvic a ...
, two
ATP ATP may refer to: Companies and organizations * Association of Tennis Professionals, men's professional tennis governing body * American Technical Publishers, employee-owned publishing company * ', a Danish pension * Armenia Tree Project, non ...
, and two NADH per glucose molecule while OPPP serves as an oxidation route for
NADPH Nicotinamide adenine dinucleotide phosphate, abbreviated NADP or, in older notation, TPN (triphosphopyridine nucleotide), is a cofactor used in anabolic reactions, such as the Calvin cycle and lipid and nucleic acid syntheses, which require NAD ...
synthesis. Although the EDP is the more thermodynamically favourable of the three pathways, ''E. coli'' do not use the EDP for glucose metabolism, relying mainly on the EMPP and the OPPP. The EDP mainly remains inactive except for during growth with gluconate.


Catabolite repression

When growing in the presence of a mixture of sugars, bacteria will often consume the sugars sequentially through a process known as
catabolite Catabolism () is the set of metabolic pathways that breaks down molecules into smaller units that are either oxidized to release energy or used in other anabolic reactions. Catabolism breaks down large molecules (such as polysaccharides, lipid ...
repression. By repressing the expression of the genes involved in metabolizing the less preferred sugars, cells will usually first consume the sugar yielding the highest growth rate, followed by the sugar yielding the next highest growth rate, and so on. In doing so the cells ensure that their limited metabolic resources are being used to maximize the rate of growth. The well-used example of this with ''E. coli'' involves the growth of the bacterium on glucose and lactose, where ''E. coli'' will consume glucose before lactose. Catabolite repression has also been observed in ''E. coli'' in the presence of other non-glucose sugars, such as
arabinose Arabinose is an aldopentose – a monosaccharide containing five carbon atoms, and including an aldehyde (CHO) functional group. For biosynthetic reasons, most saccharides are almost always more abundant in nature as the "D"-form, or structurally ...
and
xylose Xylose ( grc, ξύλον, , "wood") is a sugar first isolated from wood, and named for it. Xylose is classified as a monosaccharide of the aldopentose type, which means that it contains five carbon atoms and includes an aldehyde functional gro ...
,
sorbitol Sorbitol (), less commonly known as glucitol (), is a sugar alcohol with a sweet taste which the human body metabolizes slowly. It can be obtained by reduction of glucose, which changes the converted aldehyde group (−CHO) to a primary alcohol g ...
,
rhamnose Rhamnose (Rha, Rham) is a naturally occurring deoxy sugar. It can be classified as either a methyl-pentose or a 6-deoxy-hexose. Rhamnose predominantly occurs in nature in its L-form as L-rhamnose (6-deoxy-L-mannose). This is unusual, since most o ...
, and ribose. In ''E. coli'', glucose catabolite repression is regulated by the phosphotransferase system, a multi-protein
phosphorylation In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, wh ...
cascade that couples glucose uptake and metabolism.


Culture growth

Optimum growth of ''E. coli'' occurs at , but some laboratory strains can multiply at temperatures up to . ''E. coli'' grows in a variety of defined laboratory media, such as lysogeny broth, or any medium that contains glucose, ammonium phosphate monobasic, sodium chloride, magnesium sulfate, potassium phosphate dibasic, and water. Growth can be driven by aerobic or anaerobic respiration, using a large variety of redox pairs, including the oxidation of
pyruvic acid Pyruvic acid (CH3COCOOH) is the simplest of the alpha-keto acids, with a carboxylic acid and a ketone functional group. Pyruvate, the conjugate base, CH3COCOO−, is an intermediate in several metabolic pathways throughout the cell. Pyruvic aci ...
,
formic acid Formic acid (), systematically named methanoic acid, is the simplest carboxylic acid, and has the chemical formula HCOOH and structure . It is an important intermediate in chemical synthesis and occurs naturally, most notably in some ants. Es ...
, hydrogen, and amino acids, and the reduction of substrates such as oxygen,
nitrate Nitrate is a polyatomic ion A polyatomic ion, also known as a molecular ion, is a covalent bonded set of two or more atoms, or of a metal complex, that can be considered to behave as a single unit and that has a net charge that is not zer ...
,
fumarate Fumaric acid is an organic compound with the formula HO2CCH=CHCO2H. A white solid, fumaric acid occurs widely in nature. It has a fruit-like taste and has been used as a food additive. Its E number is E297. The salts and esters are known as f ...
,
dimethyl sulfoxide Dimethyl sulfoxide (DMSO) is an organosulfur compound with the formula ( CH3)2. This colorless liquid is the sulfoxide most widely used commercially. It is an important polar aprotic solvent that dissolves both polar and nonpolar compounds a ...
, and trimethylamine N-oxide. ''E. coli'' is classified as a facultative anaerobe. It uses oxygen when it is present and available. It can, however, continue to grow in the absence of oxygen using fermentation or anaerobic respiration. Respiraton type is managed in part by the
arc system The Arc system is a two-component system found in some bacteria that regulates gene expression in faculatative anaerobes such as ''Escheria coli''. Two-component system means that it has a sensor molecule and a response regulator. Arc is an abb ...
. The ability to continue growing in the absence of oxygen is an advantage to bacteria because their survival is increased in environments where water predominates.


Cell cycle

The bacterial cell cycle is divided into three stages. The B period occurs between the completion of cell division and the beginning of DNA replication. The C period encompasses the time it takes to replicate the chromosomal DNA. The D period refers to the stage between the conclusion of DNA replication and the end of cell division. The doubling rate of ''E. coli'' is higher when more nutrients are available. However, the length of the C and D periods do not change, even when the doubling time becomes less than the sum of the C and D periods. At the fastest growth rates, replication begins before the previous round of replication has completed, resulting in multiple replication forks along the DNA and overlapping cell cycles. The number of replication forks in fast growing ''E. coli'' typically follows 2n (n = 1, 2 or 3). This only happens if
replication Replication may refer to: Science * Replication (scientific method), one of the main principles of the scientific method, a.k.a. reproducibility ** Replication (statistics), the repetition of a test or complete experiment ** Replication crisi ...
is initiated simultaneously from all origins of replications, and is referred to as synchronous
replication Replication may refer to: Science * Replication (scientific method), one of the main principles of the scientific method, a.k.a. reproducibility ** Replication (statistics), the repetition of a test or complete experiment ** Replication crisi ...
. However, not all cells in a culture replicate synchronously. In this case cells do not have multiples of two replication forks. Replication initiation is then referred to being asynchronous. However, asynchrony can be caused by mutations to for instance DnaA or DnaA initiator-associating protein DiaA.


Genetic adaptation

''E. coli'' and related bacteria possess the ability to transfer DNA via
bacterial conjugation Bacterial conjugation is the transfer of genetic material between bacterial cells by direct cell-to-cell contact or by a bridge-like connection between two cells. This takes place through a pilus. It is a parasexual mode of reproduction in bacteri ...
or transduction, which allows genetic material to spread horizontally through an existing population. The process of transduction, which uses the bacterial virus called a
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
, is where the spread of the gene encoding for the Shiga toxin from the '' Shigella'' bacteria to ''E. coli'' helped produce ''E. coli'' O157:H7, the Shiga toxin-producing strain of ''E. coli.''


Diversity

''E. coli'' encompasses an enormous population of bacteria that exhibit a very high degree of both genetic and phenotypic diversity. Genome sequencing of many isolates of ''E. coli'' and related bacteria shows that a taxonomic reclassification would be desirable. However, this has not been done, largely due to its medical importance, and ''E. coli'' remains one of the most diverse bacterial species: only 20% of the genes in a typical ''E. coli'' genome is shared among all strains. In fact, from the more constructive point of view, the members of genus ''Shigella'' (''S. dysenteriae'', ''S. flexneri'', ''S. boydii'', and ''S. sonnei'') should be classified as ''E. coli'' strains, a phenomenon termed
taxa in disguise In bacteriology, a taxon in disguise is a species, genus or higher unit of biological classification whose evolutionary history reveals has evolved from another unit of a similar or lower rank, making the parent unit paraphyletic. That happens whe ...
. Similarly, other strains of ''E. coli'' (e.g. the
K-12 K-1 is a professional kickboxing promotion established in 1993, well known worldwide mainly for its heavyweight division fights and Grand Prix tournaments. In January 2012, K-1 Global Holdings Limited, a company registered in Hong Kong, acquired ...
strain commonly used in recombinant DNA work) are sufficiently different that they would merit reclassification. A strain is a subgroup within the species that has unique characteristics that distinguish it from other strains. These differences are often detectable only at the molecular level; however, they may result in changes to the physiology or lifecycle of the bacterium. For example, a strain may gain pathogenic capacity, the ability to use a unique carbon source, the ability to take upon a particular ecological niche, or the ability to resist antimicrobial agents. Different strains of ''E. coli'' are often host-specific, making it possible to determine the source of fecal contamination in environmental samples. For example, knowing which ''E. coli'' strains are present in a water sample allows researchers to make assumptions about whether the contamination originated from a human, another
mammal Mammals () are a group of vertebrate animals constituting the class Mammalia (), characterized by the presence of mammary glands which in females produce milk for feeding (nursing) their young, a neocortex (a region of the brain), fur or ...
, or a bird.


Serotypes

A common subdivision system of ''E. coli'', but not based on evolutionary relatedness, is by serotype, which is based on major surface antigens (O antigen: part of lipopolysaccharide layer; H:
flagellin Flagellin is a globular protein that arranges itself in a hollow cylinder to form the filament in a bacterial flagellum. It has a mass of about 30,000 to 60,000 daltons. Flagellin is the principal component of bacterial flagella, and is present ...
; K antigen: capsule), e.g. O157:H7). It is, however, common to cite only the serogroup, i.e. the O-antigen. At present, about 190 serogroups are known. The common laboratory strain has a mutation that prevents the formation of an O-antigen and is thus not typeable.


Genome plasticity and evolution

Like all lifeforms, new strains of ''E. coli'' evolve through the natural biological processes of mutation,
gene duplication Gene duplication (or chromosomal duplication or gene amplification) is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. ...
, and horizontal gene transfer; in particular, 18% of the genome of the laboratory strain MG1655 was horizontally acquired since the divergence from ''
Salmonella ''Salmonella'' is a genus of rod-shaped (bacillus) Gram-negative bacteria of the family Enterobacteriaceae. The two species of ''Salmonella'' are ''Salmonella enterica'' and ''Salmonella bongori''. ''S. enterica'' is the type species and is fur ...
''. ''E. coli'' K-12 and ''E. coli'' B strains are the most frequently used varieties for laboratory purposes. Some strains develop traits that can be harmful to a host animal. These virulent strains typically cause a bout of diarrhea that is often self-limiting in healthy adults but is frequently lethal to children in the developing world. More virulent strains, such as O157:H7, cause serious illness or death in the elderly, the very young, or the immunocompromised. The genera '' Escherichia'' and ''
Salmonella ''Salmonella'' is a genus of rod-shaped (bacillus) Gram-negative bacteria of the family Enterobacteriaceae. The two species of ''Salmonella'' are ''Salmonella enterica'' and ''Salmonella bongori''. ''S. enterica'' is the type species and is fur ...
'' diverged around 102 million years ago (credibility interval: 57–176 mya), an event unrelated to the much earlier (see '' Synapsid'') divergence of their hosts: the former being found in mammals and the latter in birds and reptiles. This was followed by a split of an ''Escherichia'' ancestor into five species (''E. albertii'', ''E. coli'', ''E. fergusonii'', ''E. hermannii'', and ''E. vulneris''). The last ''E. coli'' ancestor split between 20 and 30 million years ago. The long-term evolution experiments using ''E. coli'', begun by Richard Lenski in 1988, have allowed direct observation of genome evolution over more than 65,000 generations in the laboratory. For instance, ''E. coli'' typically do not have the ability to grow aerobically with citrate as a carbon source, which is used as a diagnostic criterion with which to differentiate ''E. coli'' from other, closely, related bacteria such as ''
Salmonella ''Salmonella'' is a genus of rod-shaped (bacillus) Gram-negative bacteria of the family Enterobacteriaceae. The two species of ''Salmonella'' are ''Salmonella enterica'' and ''Salmonella bongori''. ''S. enterica'' is the type species and is fur ...
''. In this experiment, one population of ''E. coli'' unexpectedly evolved the ability to aerobically metabolize citrate, a major evolutionary shift with some hallmarks of microbial
speciation Speciation is the evolutionary process by which populations evolve to become distinct species. The biologist Orator F. Cook coined the term in 1906 for cladogenesis, the splitting of lineages, as opposed to anagenesis, phyletic evolution within ...
. In the microbial world, a relationship of predation can be established similar to that observed in the animal world. Considered, it has been seen that ''E. coli'' is the prey of multiple generalist predators, such as '' Myxococcus xanthus''. In this predator-prey relationship, a parallel evolution of both species is observed through genomic and phenotypic modifications, in the case of ''E. coli'' the modifications are modified in two aspects involved in their virulence such as mucoid production (excessive production of exoplasmic acid alginate ) and the suppression of the OmpT gene, producing in future generations a better adaptation of one of the species that is counteracted by the evolution of the other, following a co-evolutionary model demonstrated by the Red Queen hypothesis.


Neotype strain

''E. coli'' is the type species of the genus (''Escherichia'') and in turn ''Escherichia'' is the type genus of the family Enterobacteriaceae, where the family name does not stem from the genus ''Enterobacter'' + "i" (sic.) + " aceae", but from "enterobacterium" + "aceae" (enterobacterium being not a genus, but an alternative trivial name to enteric bacterium). The original strain described by Escherich is believed to be lost, consequently a new type strain (neotype) was chosen as a representative: the neotype strain is U5/41T, also known under the deposit names DSM 30083, ATCC 11775, and NCTC 9001, which is pathogenic to chickens and has an O1:K1:H7 serotype. However, in most studies, either O157:H7, K-12 MG1655, or K-12 W3110 were used as a representative ''E. coli''. The genome of the type strain has only lately been sequenced.


Phylogeny of ''E. coli'' strains

Many strains belonging to this species have been isolated and characterised. In addition to serotype (''vide supra''), they can be classified according to their phylogeny, i.e. the inferred evolutionary history, as shown below where the species is divided into six groups. Particularly the use of whole genome sequences yields highly supported phylogenies. Based on such data, five subspecies of ''E. coli'' were distinguished. The link between phylogenetic distance ("relatedness") and pathology is small, ''e.g.'' the O157:H7 serotype strains, which form a
clade A clade (), also known as a monophyletic group or natural group, is a group of organisms that are monophyletic – that is, composed of a common ancestor and all its lineal descendants – on a phylogenetic tree. Rather than the English term, ...
("an exclusive group")—group E below—are all enterohaemorragic strains (EHEC), but not all EHEC strains are closely related. In fact, four different species of ''Shigella'' are nested among ''E. coli'' strains (''vide supra''), while ''E. albertii'' and ''E. fergusonii'' are outside this group. Indeed, all ''Shigella'' species were placed within a single subspecies of ''E. coli'' in a phylogenomic study that included the type strain, and for this reason an according reclassification is difficult. All commonly used research strains of ''E. coli'' belong to group A and are derived mainly from Clifton's K-12 strain (λ+ F+; O16) and to a lesser degree from d'Herelle's ''
Bacillus coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Esche ...
'' strain (B strain) (O7).


Genomics

The first complete
DNA sequence DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. Th ...
of an ''E. coli'' genome (laboratory strain K-12 derivative MG1655) was published in 1997. It is a circular DNA molecule 4.6 million
base pair A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s in length, containing 4288 annotated protein-coding genes (organized into 2584 operons), seven ribosomal RNA (rRNA) operons, and 86
transfer RNA Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ac ...
(tRNA) genes. Despite having been the subject of intensive genetic analysis for about 40 years, many of these genes were previously unknown. The coding density was found to be very high, with a mean distance between genes of only 118 base pairs. The genome was observed to contain a significant number of transposable genetic elements, repeat elements, cryptic prophages, and
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
remnants. More than three hundred complete genomic sequences of ''Escherichia'' and ''Shigella'' species are known. The genome sequence of the type strain of ''E. coli'' was added to this collection before 2014. Comparison of these sequences shows a remarkable amount of diversity; only about 20% of each genome represents sequences present in every one of the isolates, while around 80% of each genome can vary among isolates. Each individual genome contains between 4,000 and 5,500 genes, but the total number of different genes among all of the sequenced ''E. coli'' strains (the pangenome) exceeds 16,000. This very large variety of component genes has been interpreted to mean that two-thirds of the ''E. coli''
pangenome In the fields of molecular biology and genetics, a pan-genome (pangenome or supragenome) is the entire set of genes from all strains within a clade. More generally, it is the union of all the genomes of a clade. The pan-genome can be broken down ...
originated in other species and arrived through the process of horizontal gene transfer.


Gene nomenclature

Genes in ''E. coli'' are usually named in accordance with the uniform nomenclature proposed by Demerec et al. Gene names are 3-letter acronyms that derive from their function (when known) or mutant phenotype and are italicized. When multiple genes have the same acronym, the different genes are designated by a capital later that follows the acronym and is also italicized. For instance, ''recA'' is named after its role in homologous recombination plus the letter A. Functionally related genes are named ''recB'', ''recC'', ''recD'' etc. The proteins are named by uppercase acronyms, e.g. RecA, RecB, etc. When the genome of ''E. coli'' strain K-12 substr. MG1655 was sequenced, all known or predicted protein-coding genes were numbered (more or less) in their order on the genome and abbreviated by b numbers, such as b2819 (= ''recD''). The "b" names were created after Fred Blattner, who led the genome sequence effort. Another numbering system was introduced with the sequence of another ''E. coli'' K-12 substrain, W3110, which was sequenced in Japan and hence uses numbers starting by JW... (Japanese W3110), e.g. JW2787 (= ''recD''). Hence, ''recD'' = b2819 = JW2787. Note, however, that most databases have their own numbering system, e.g. the EcoGene database uses EG10826 for ''recD''. Finally, ECK numbers are specifically used for alleles in the MG1655 strain of ''E. coli'' K-12. Complete lists of genes and their synonyms can be obtained from databases such as EcoGene or Uniprot.


Proteomics


Proteome

The genome sequence of ''E. coli'' predicts 4288 protein-coding genes, of which 38 percent initially had no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within ''E. coli'' are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer. Several studies have experimentally investigated the proteome of ''E. coli''. By 2006, 1,627 (38%) of the predicted proteins ( open reading frames, ORFs) had been identified experimentally. Mateus et al. 2020 detected 2,586 proteins with at least 2 peptides (60% of all proteins).


Post-translational modifications (PTMs)

Although much fewer bacterial proteins seem to have post-translational modifications (PTMs) compared to eukaryotic proteins, a substantial number of proteins are modified in ''E. coli''. For instance, Potel et al. (2018) found 227 phosphoproteins of which 173 were phosphorylated on histidine. Interestingly, the majority of phosphorylated amino acids were
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
(1,220 sites) with only 246 sites on histidine and 501 phosphorylated theronines and 162 tyrosines.


Interactome

The interactome of ''E. coli'' has been studied by
affinity purification Affinity chromatography is a method of separating a biomolecule from a mixture, based on a highly specific molecular binding, macromolecular binding interaction between the biomolecule and another substance. The specific type of binding interacti ...
and
mass spectrometry Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a ''mass spectrum'', a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is use ...
(AP/MS) and by analyzing the binary interactions among its proteins. Protein complexes. A 2006 study purified 4,339 proteins from cultures of strain K-12 and found interacting partners for 2,667 proteins, many of which had unknown functions at the time. A 2009 study found 5,993 interactions between proteins of the same ''E. coli'' strain, though these data showed little overlap with those of the 2006 publication. Binary interactions. Rajagopala ''et al.'' (2014) have carried out systematic yeast two-hybrid screens with most ''E. coli'' proteins, and found a total of 2,234 protein-protein interactions. This study also integrated genetic interactions and protein structures and mapped 458 interactions within 227 protein complexes.


Normal microbiota

''E. coli'' belongs to a group of bacteria informally known as
coliforms Coliform bacteria are defined as either motile or non-motile Gram-negative non-Endospore, spore forming Bacillus (shape), Bacilli that possess Beta-galactosidase, β-galactosidase to produce acids and gases under their optimal growth temperature o ...
that are found in the gastrointestinal tract of
warm-blooded animals Warm-blooded is an informal term referring to animal species which can maintain a body temperature higher than their environment. In particular, homeothermic species maintain a stable body temperature by regulating metabolic processes. The onl ...
. ''E. coli'' normally colonizes an infant's
gastrointestinal tract The gastrointestinal tract (GI tract, digestive tract, alimentary canal) is the tract or passageway of the digestive system that leads from the mouth to the anus. The GI tract contains all the major organ (biology), organs of the digestive syste ...
within 40 hours of birth, arriving with food or water or from the individuals handling the child. In the bowel, ''E. coli'' adheres to the mucus of the large intestine. It is the primary facultative anaerobe of the human gastrointestinal tract. ( Facultative anaerobes are organisms that can grow in either the presence or absence of oxygen.) As long as these bacteria do not acquire genetic elements encoding for virulence factors, they remain benign
commensals Commensalism is a long-term biological interaction (symbiosis) in which members of one species gain benefits while those of the other species neither benefit nor are harmed. This is in contrast with mutualism, in which both organisms benefit fro ...
.


Therapeutic use

Due to the low cost and speed with which it can be grown and modified in laboratory settings, ''E. coli'' is a popular expression platform for the production of recombinant proteins used in therapeutics. One advantage to using ''E. coli'' over another expression platform is that ''E. coli'' naturally does not export many proteins into the periplasm, making it easier to recover a protein of interest without cross-contamination. The ''E. coli'' K-12 strains and their derivatives (DH1, DH5α, MG1655, RV308 and W3110) are the strains most widely used by the biotechnology industry. Nonpathogenic ''E. coli'' strain Nissle 1917 (EcN), (Mutaflor) and ''E. coli'' O83:K24:H31 (Colinfant)) are used as probiotic agents in medicine, mainly for the treatment of various gastrointestinal diseases, including
inflammatory bowel disease Inflammatory bowel disease (IBD) is a group of inflammation, inflammatory conditions of the colon (anatomy), colon and small intestine, Crohn's disease and ulcerative colitis being the principal types. Crohn's disease affects the small intestine a ...
. It is thought that the EcN strain might impede the growth of opportunistic pathogens, including ''
Salmonella ''Salmonella'' is a genus of rod-shaped (bacillus) Gram-negative bacteria of the family Enterobacteriaceae. The two species of ''Salmonella'' are ''Salmonella enterica'' and ''Salmonella bongori''. ''S. enterica'' is the type species and is fur ...
'' and other coliform enteropathogens, through the production of microcin proteins the production of siderophores.


Role in disease

Most ''E. coli'' strains do not cause disease, naturally living in the gut, but virulent strains can cause
gastroenteritis Gastroenteritis, also known as infectious diarrhea and gastro, is an inflammation of the gastrointestinal tract including the stomach and intestine. Symptoms may include diarrhea, vomiting, and abdominal pain. Fever, lack of energy, and dehydra ...
, urinary tract infections,
neonatal meningitis Neonatal meningitis is a serious medical condition in infants that is rapidly fatal if untreated. Meningitis is an inflammation of the meninges, the protective membranes of the central nervous system, is more common in the neonatal period (infants ...
, hemorrhagic colitis, and
Crohn's disease Crohn's disease is a type of inflammatory bowel disease (IBD) that may affect any segment of the gastrointestinal tract. Symptoms often include abdominal pain, diarrhea (which may be bloody if inflammation is severe), fever, abdominal distension ...
. Common signs and symptoms include severe abdominal cramps, diarrhea, hemorrhagic colitis, vomiting, and sometimes fever. In rarer cases, virulent strains are also responsible for bowel necrosis (tissue death) and perforation without progressing to hemolytic-uremic syndrome, peritonitis, mastitis, sepsis, and Gram-negative pneumonia. Very young children are more susceptible to develop severe illness, such as hemolytic uremic syndrome; however, healthy individuals of all ages are at risk to the severe consequences that may arise as a result of being infected with ''E. coli''. Some strains of ''E. coli'', for example O157:H7, can produce Shiga toxin (classified as a bioterrorism agent). The Shiga toxin causes inflammatory responses in target cells of the gut, leaving behind lesions which result in the bloody diarrhea that is a symptom of a Shiga toxin-producing ''E. coli'' (STEC) infection. This toxin further causes premature destruction of the red blood cells, which then clog the body's filtering system, the kidneys, in some rare cases (usually in children and the elderly) causing hemolytic-uremic syndrome (HUS), which may lead to kidney failure and even death. Signs of hemolytic uremic syndrome include decreased frequency of urination, lethargy, and paleness of cheeks and inside the lower eyelids. In 25% of HUS patients, complications of nervous system occur, which in turn causes
stroke A stroke is a medical condition in which poor blood flow to the brain causes cell death. There are two main types of stroke: ischemic, due to lack of blood flow, and hemorrhagic, due to bleeding. Both cause parts of the brain to stop functionin ...
s. In addition, this strain causes the buildup of fluid (since the kidneys do not work), leading to
edema Edema, also spelled oedema, and also known as fluid retention, dropsy, hydropsy and swelling, is the build-up of fluid in the body's Tissue (biology), tissue. Most commonly, the legs or arms are affected. Symptoms may include skin which feels t ...
around the lungs, legs, and arms. This increase in fluid buildup especially around the lungs impedes the functioning of the heart, causing an increase in blood pressure. Uropathogenic ''E. coli'' (UPEC) is one of the main causes of urinary tract infections. It is part of the normal microbiota in the gut and can be introduced in many ways. In particular for females, the direction of wiping after defecation (wiping back to front) can lead to fecal contamination of the urogenital orifices. Anal intercourse can also introduce this bacterium into the male urethra, and in switching from anal to vaginal intercourse, the male can also introduce UPEC to the female urogenital system. Enterotoxigenic ''E. coli'' (ETEC) is the most common cause of traveler's diarrhea, with as many as 840 million cases worldwide in developing countries each year. The bacteria, typically transmitted through contaminated food or drinking water, adheres to the
intestinal lining The intestinal epithelium is the single cell layer that form the luminal surface (lining) of both the small and large intestine (colon) of the gastrointestinal tract. Composed of simple columnar epithelial cells, it serves two main functio ...
, where it secretes either of two types of
enterotoxins An enterotoxin is a protein exotoxin released by a microorganism that targets the intestines. Enterotoxins are chromosomally encoded or plasmid encoded exotoxins that are produced and secreted from several bacterial organisms. They are heat labi ...
, leading to watery diarrhea. The rate and severity of infections are higher among children under the age of five, including as many as 380,000 deaths annually. In May 2011, one ''E. coli'' strain, O104:H4, was the subject of a bacterial outbreak that began in Germany. Certain strains of ''E. coli'' are a major cause of foodborne illness. The outbreak started when several people in Germany were infected with enterohemorrhagic ''E. coli'' (EHEC) bacteria, leading to hemolytic-uremic syndrome (HUS), a medical emergency that requires urgent treatment. The outbreak did not only concern Germany, but also 15 other countries, including regions in North America. On 30 June 2011, the German ''Bundesinstitut für Risikobewertung (BfR)'' (Federal Institute for Risk Assessment, a federal institute within the German
Federal Ministry of Food, Agriculture and Consumer Protection The Federal Ministry of Food and Agriculture (german: Bundesministerium für Ernährung und Landwirtschaft, ), abbreviated BMEL, is a cabinet-level ministry of the Federal Republic of Germany. Its primary headquarters are located in Bonn with ...
) announced that seeds of
fenugreek Fenugreek (; ''Trigonella foenum-graecum'') is an annual plant in the family Fabaceae, with leaves consisting of three small obovate to oblong leaflets. It is cultivated worldwide as a semiarid crop. Its seeds and leaves are common ingredients ...
from Egypt were likely the cause of the EHEC outbreak. Some studies have demonstrated an absence of E. ''coli'' in the gut flora of subjects with the metabolic disorder Phenylketonuria. It is hypothesized that the absence of these normal bacterium impairs the production of the key vitamins B2 (riboflavin) and K2 (menaquinone) - vitamins which are implicated in many physiological roles in humans such as cellular and bone metabolism - and so contributes to the disorder. Carbapenem-resistant ''E. coli'' (carbapenemase-producing ''E. coli'') that are resistant to the carbapenem class of antibiotics, considered the drugs of last resort for such infections. They are resistant because they produce an enzyme called a carbapenemase that disables the drug molecule.


Incubation period

The time between ingesting the STEC bacteria and feeling sick is called the "incubation period". The incubation period is usually 3–4 days after the exposure, but may be as short as 1 day or as long as 10 days. The symptoms often begin slowly with mild belly pain or non-bloody diarrhea that worsens over several days. HUS, if it occurs, develops an average 7 days after the first symptoms, when the diarrhea is improving.


Diagnosis

Diagnosis of infectious diarrhea and identification of antimicrobial resistance is performed using a stool culture with subsequent antibiotic sensitivity testing. It requires a minimum of 2 days and maximum of several weeks to culture gastrointestinal pathogens. The sensitivity (true positive) and specificity (true negative) rates for stool culture vary by pathogen, although a number of human pathogens can not be
cultured Culture () is an umbrella term which encompasses the social behavior, institutions, and norms found in human societies, as well as the knowledge, beliefs, arts, laws, customs, capabilities, and habits of the individuals in these groups.Tylor ...
. For culture-positive samples, antimicrobial resistance testing takes an additional 12–24 hours to perform. Current point of care
molecular diagnostic Molecular diagnostics is a collection of techniques used to analyze biological markers in the genome and proteome, and how their cells express their genes as proteins, applying molecular biology to medical testing. In medicine the technique i ...
tests can identify ''E. coli'' and antimicrobial resistance in the identified strains much faster than culture and sensitivity testing. Microarray-based platforms can identify specific pathogenic strains of ''E. coli'' and ''E. coli''-specific AMR genes in two hours or less with high sensitivity and specificity, but the size of the test panel (i.e., total pathogens and antimicrobial resistance genes) is limited. Newer metagenomics-based infectious disease diagnostic platforms are currently being developed to overcome the various limitations of culture and all currently available molecular diagnostic technologies.


Treatment

The mainstay of treatment is the assessment of dehydration and replacement of fluid and electrolytes. Administration of antibiotics has been shown to shorten the course of illness and duration of excretion of enterotoxigenic ''E. coli'' (ETEC) in adults in endemic areas and in traveller's diarrhea, though the rate of resistance to commonly used antibiotics is increasing and they are generally not recommended. The antibiotic used depends upon susceptibility patterns in the particular geographical region. Currently, the antibiotics of choice are fluoroquinolones or azithromycin, with an emerging role for rifaximin.
Oral rifaximin The word oral may refer to: Relating to the mouth * Relating to the mouth, the first portion of the alimentary canal that primarily receives food and liquid **Oral administration of medicines ** Oral examination (also known as an oral exam or ora ...
, a semisynthetic rifamycin derivative, is an effective and well-tolerated antibacterial for the management of adults with non-invasive traveller's diarrhea. Rifaximin was significantly more effective than placebo and no less effective than ciprofloxacin in reducing the duration of diarrhea. While rifaximin is effective in patients with ''E. coli''-predominant traveller's diarrhea, it appears ineffective in patients infected with inflammatory or invasive
enteropathogen Gastroenteritis, also known as infectious diarrhea and gastro, is an inflammation of the gastrointestinal tract including the stomach and intestine. Symptoms may include diarrhea, vomiting, and abdominal pain. Fever, lack of energy, and dehydr ...
s.


Prevention

ETEC is the type of ''E. coli'' that most vaccine development efforts are focused on.
Antibodies An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
against the LT and major CFs of ETEC provide protection against LT-producing, ETEC-expressing
homologous Homology may refer to: Sciences Biology *Homology (biology), any characteristic of biological organisms that is derived from a common ancestor *Sequence homology, biological homology between DNA, RNA, or protein sequences * Homologous chrom ...
CFs. Oral inactivated vaccines consisting of toxin antigen and whole cells, i.e. the licensed recombinant cholera B subunit (rCTB)-WC cholera vaccine Dukoral, have been developed. There are currently no licensed vaccines for ETEC, though several are in various stages of development. In different trials, the rCTB-WC cholera vaccine provided high (85–100%) short-term protection. An oral ETEC vaccine candidate consisting of rCTB and formalin inactivated ''E. coli'' bacteria expressing major CFs has been shown in clinical trials to be safe, immunogenic, and effective against severe diarrhoea in American travelers but not against ETEC diarrhoea in young children in Egypt. A modified ETEC vaccine consisting of recombinant ''E. coli'' strains over-expressing the major CFs and a more LT-like hybrid toxoid called LCTBA, are undergoing clinical testing. Other proven prevention methods for ''E. coli'' transmission include handwashing and improved sanitation and drinking water, as transmission occurs through fecal contamination of food and water supplies. Additionally, thoroughly cooking meat and avoiding consumption of raw, unpasteurized beverages, such as juices and milk are other proven methods for preventing ''E. coli''. Lastly, cross-contamination of utensils and work spaces should be avoided when preparing food.


Model organism in life science research

Because of its long history of laboratory culture and ease of manipulation, ''E. coli'' plays an important role in modern
biological engineering Biological engineering or bioengineering is the application of principles of biology and the tools of engineering to create usable, tangible, economically-viable products. Biological engineering employs knowledge and expertise from a number o ...
and industrial microbiology. The work of Stanley Norman Cohen and
Herbert Boyer Herbert Wayne "Herb" Boyer (born July 10, 1936) is an American biotechnologist, researcher and entrepreneur in biotechnology. Along with Stanley N. Cohen and Paul Berg he discovered a method to coax bacteria into producing foreign proteins, ther ...
in ''E. coli'', using
plasmid A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria; how ...
s and restriction enzymes to create recombinant DNA, became a foundation of biotechnology. ''E. coli'' is a very versatile host for the production of heterologous proteins, and various protein expression systems have been developed which allow the production of recombinant proteins in ''E. coli''. Researchers can introduce genes into the microbes using plasmids which permit high level expression of protein, and such protein may be mass-produced in industrial fermentation processes. One of the first useful applications of recombinant DNA technology was the manipulation of ''E. coli'' to produce human
insulin Insulin (, from Latin ''insula'', 'island') is a peptide hormone produced by beta cells of the pancreatic islets encoded in humans by the ''INS'' gene. It is considered to be the main anabolic hormone of the body. It regulates the metabolism o ...
. Many proteins previously thought difficult or impossible to be expressed in ''E. coli'' in folded form have been successfully expressed in ''E. coli''. For example, proteins with multiple disulphide bonds may be produced in the periplasmic space or in the cytoplasm of mutants rendered sufficiently oxidizing to allow disulphide-bonds to form, while proteins requiring post-translational modification such as
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not al ...
for stability or function have been expressed using the N-linked glycosylation system of '' Campylobacter jejuni'' engineered into ''E. coli''. Modified ''E. coli'' cells have been used in vaccine development, bioremediation, production of
biofuels Biofuel is a fuel that is produced over a short time span from biomass, rather than by the very slow natural processes involved in the formation of fossil fuels, such as oil. According to the United States Energy Information Administration (E ...
, lighting, and production of immobilised enzymes. Strain K-12 is a mutant form of ''E. coli'' that over-expresses the enzyme Alkaline Phosphatase (ALP). The mutation arises due to a defect in the gene that constantly codes for the enzyme. A gene that is producing a product without any inhibition is said to have constitutive activity. This particular mutant form is used to isolate and purify the aforementioned enzyme. Strain OP50 of ''Escherichia coli'' is used for maintenance of ''
Caenorhabditis elegans ''Caenorhabditis elegans'' () is a free-living transparent nematode about 1 mm in length that lives in temperate soil environments. It is the type species of its genus. The name is a blend of the Greek ''caeno-'' (recent), ''rhabditis'' (ro ...
'' cultures. Strain JM109 is a mutant form of ''E. coli'' that is recA and endA deficient. The strain can be utilized for blue/white screening when the cells carry the fertility factor episome. Lack of recA decreases the possibility of unwanted restriction of the DNA of interest and lack of endA inhibit plasmid DNA decomposition. Thus, JM109 is useful for cloning and expression systems.


Model organism

''E. coli'' is frequently used as a model organism in
microbiology Microbiology () is the scientific study of microorganisms, those being unicellular (single cell), multicellular (cell colony), or acellular (lacking cells). Microbiology encompasses numerous sub-disciplines including virology, bacteriology, prot ...
studies. Cultivated strains (e.g. ''E. coli'' K12) are well-adapted to the laboratory environment, and, unlike wild-type strains, have lost their ability to thrive in the intestine. Many laboratory strains lose their ability to form biofilms. These features protect wild-type strains from
antibodies An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
and other chemical attacks, but require a large expenditure of energy and material resources. ''E. coli'' is often used as a representative microorganism in the research of novel water treatment and sterilisation methods, including photocatalysis. By standard plate count methods, following sequential dilutions, and growth on agar gel plates, the concentration of viable organisms or CFUs (Colony Forming Units), in a known volume of treated water can be evaluated, allowing the comparative assessment of materials performance. In 1946, Joshua Lederberg and
Edward Tatum Edward Lawrie Tatum (December 14, 1909 – November 5, 1975) was an American geneticist. He shared half of the Nobel Prize in Physiology or Medicine in 1958 with George Beadle for showing that genes control individual steps in metabolism. The o ...
first described the phenomenon known as
bacterial conjugation Bacterial conjugation is the transfer of genetic material between bacterial cells by direct cell-to-cell contact or by a bridge-like connection between two cells. This takes place through a pilus. It is a parasexual mode of reproduction in bacteri ...
using ''E. coli'' as a model bacterium, and it remains the primary model to study conjugation. ''E. coli'' was an integral part of the first experiments to understand phage genetics, and early researchers, such as
Seymour Benzer Seymour Benzer (October 15, 1921 – November 30, 2007) was an American physicist, molecular biologist and behavioral geneticist. His career began during the molecular biology revolution of the 1950s, and he eventually rose to prominence in the ...
, used ''E. coli'' and phage T4 to understand the topography of gene structure. Prior to Benzer's research, it was not known whether the gene was a linear structure, or if it had a branching pattern. ''E. coli'' was one of the first organisms to have its genome sequenced; the complete genome of ''E. coli'' K12 was published by ''Science'' in 1997. From 2002 to 2010, a team at the Hungarian Academy of Science created a strain of ''Escherichia coli'' called MDS42, which is now sold by Scarab Genomics of Madison, WI under the name of "Clean Genome ''E. coli''", where 15% of the genome of the parental strain (''E. coli'' K-12 MG1655) were removed to aid in molecular biology efficiency, removing
IS elements Insertion element (also known as an IS, an insertion sequence element, or an IS element) is a short DNA sequence that acts as a simple transposon, transposable element. Insertion sequences have two major characteristics: they are small relative to o ...
,
pseudogene Pseudogenes are nonfunctional segments of DNA that resemble functional genes. Most arise as superfluous copies of functional genes, either directly by DNA duplication or indirectly by Reverse transcriptase, reverse transcription of an mRNA trans ...
s and
phages A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacterio ...
, resulting in better maintenance of plasmid-encoded toxic genes, which are often inactivated by transposons. Biochemistry and replication machinery were not altered. By evaluating the possible combination of nanotechnologies with landscape ecology, complex habitat landscapes can be generated with details at the nanoscale. On such synthetic ecosystems, evolutionary experiments with ''E. coli'' have been performed to study the spatial biophysics of adaptation in an island biogeography on-chip. In other studies, non-pathogenic ''E. coli'' has been used as a model microorganism towards understanding the effects of simulated microgravity (on Earth) on the same.


Uses in biological computing

Since 1961, scientists proposed the idea of genetic circuits used for computational tasks. Collaboration between biologists and computing scientists has allowed designing digital logic gates on the metabolism of ''E. coli''. As Lac operon is a two-stage process, genetic regulation in the bacteria is used to realize computing functions. The process is controlled at the transcription stage of DNA into messenger RNA. Studies are being performed attempting to program ''E. coli'' to solve complicated mathematics problems, such as the
Hamiltonian path problem In the mathematical field of graph theory the Hamiltonian path problem and the Hamiltonian cycle problem are problems of determining whether a Hamiltonian path (a path in an undirected or directed graph that visits each vertex exactly once) or a ...
. A computer to control protein production of ''E. coli'' within
yeast cell Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are estimated to constitu ...
s has been developed. A method has also been developed to use bacteria to behave as an LCD screen. In July 2017, separate experiments with ''E. coli'' published on Nature showed the potential of using living cells for computing tasks and storing information. A team formed with collaborators of the
Biodesign Institute The Biodesign Institute is a major research center known for nature-inspired solutions to global health, sustainability, and security challenges located on the Tempe campus of Arizona State University. The institute is organized into a growing ...
at
Arizona State University Arizona State University (Arizona State or ASU) is a public research university in the Phoenix metropolitan area. Founded in 1885 by the 13th Arizona Territorial Legislature, ASU is one of the largest public universities by enrollment in the ...
and Harvard’s
Wyss Institute for Biologically Inspired Engineering The Wyss Institute for Biologically Inspired Engineering (pronounced "veese") is a cross-disciplinary research institute at Harvard University focused on bridging the gap between academia and industry (translational medicine) by drawing inspirat ...
developed a biological computer inside ''E. coli'' that responded to a dozen inputs. The team called the computer "ribocomputer", as it was composed of
ribonucleic acid Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
. Meanwhile, Harvard researchers probed that is possible to store information in bacteria after successfully archiving images and movies in the DNA of living ''E. coli'' cells. In 2021, a team led by biophysicist Sangram Bagh realized a study with ''E. coli'' to solve 2 × 2 maze problems to probe the principle for distributed computing among cells.


History

In 1885, the German-Austrian pediatrician
Theodor Escherich Theodor Escherich (; 29 November 1857 – 15 February 1911) was a Germans, German-Austrians, Austrian pediatrics, pediatrician and a professor at universities in University of Graz, Graz and University of Vienna, Vienna. He discovered and de ...
discovered this organism in the feces of healthy individuals. He called it ''Bacterium coli commune'' because it is found in the colon. Early classifications of prokaryotes placed these in a handful of genera based on their shape and motility (at that time
Ernst Haeckel Ernst Heinrich Philipp August Haeckel (; 16 February 1834 – 9 August 1919) was a German zoologist, naturalist, eugenicist, philosopher, physician, professor, marine biologist and artist. He discovered, described and named thousands of new sp ...
's classification of bacteria in the kingdom
Monera Monera (/məˈnɪərə/) (Greek - μονήρης (monḗrēs), "single", "solitary") is a biological kingdom that is made up of prokaryotes. As such, it is composed of single-celled organisms that lack a nucleus. The taxon Monera was first p ...
was in place). ''Bacterium coli'' was the type species of the now invalid genus ''Bacterium'' when it was revealed that the former type species ("''Bacterium triloculare''") was missing. Following a revision of ''Bacterium'', it was reclassified as ''Bacillus coli'' by Migula in 1895 and later reclassified in the newly created genus '' Escherichia'',
named after A namesake is a person, geographic location, or other entity bearing the name of another. History The word is first attested around 1635, and probably comes from the phrase "for one's name's sake", which originates in English Bible translations ...
its original discoverer, by Aldo Castellani and
Albert John Chalmers Albert John Chalmers (28 March 1870 - 5 April 1920) was a British colonial physician who was a research pioneer in tropical medicine. Biography Albert John Chalmers was born in Manchester on 28 March 1870. He received his qualifications in 189 ...
. In 1996, the world's worst to date outbreak of ''E. coli'' food poisoning occurred in Wishaw, Scotland, killing 21 people. This death toll was exceeded in 2011, when the 2011 Germany ''E. coli'' O104:H4 outbreak, linked to organic fenugreek sprouts, killed 53 people.


Uses

''E. coli'' has several practical uses besides its use as a vector for genetic experiments and processes. For example, ''E. coli'' can be used to generate synthetic propane and recombinant human growth hormone.


See also

* Bacteriological water analysis * BolA-like protein family *
Carbon monoxide-releasing molecules Carbon monoxide-releasing molecules (CORMs) are chemical compounds designed to release controlled amounts of carbon monoxide (CO). CORMs are being developed as potential therapeutic agents to locally deliver CO to cells and tissues, thus overcomin ...
*
Contamination control Contamination control is the generic term for all activities aiming to control the existence, growth and proliferation of contamination in certain areas. Contamination control may refer to the atmosphere as well as to surfaces, to particulate matte ...
*
Dam dcm strain DNA methylation is a biological process by which methyl groups are added to the DNA molecule. Methylation can change the activity of a DNA segment without changing the sequence. When located in a gene promoter, DNA methylation typically acts t ...
*
Eijkman test Eijkman test, or differential coliform test, or confirmed Escherichia coli count, is a test used for the identification of coliform bacteria from warm-blooded animals based on the bacteria's ability to produce gas when grown in glucose media at 46 ...
*
Fecal coliform A fecal coliform (British: faecal coliform) is a facultatively anaerobic, rod-shaped, gram-negative, non-sporulating bacterium. Coliform bacteria generally originate in the intestines of warm-blooded animals. Fecal coliforms are capable of growth ...
* International Code of Nomenclature of Bacteria * List of strains of ''Escherichia coli'' * Mannan oligosaccharide-based nutritional supplements *
Overflow metabolism Overflow metabolism refers to the seemingly wasteful strategy in which cells incompletely oxidize their growth substrate (e.g. glucose) instead of using the respiratory pathway, even in the presence of oxygen. As a result of employing this metabol ...
* T4 rII system


References


Databases and external links

*
EcoCyc In bioinformatics EcoCyc is a biological database for the bacterium ''Escherichia coli'' K-12. The EcoCyc project performs literature-based curation of the ''E. coli'' genome, and of ''E. coli'' transcriptional regulation, transporters, and metabo ...
– literature-based curation of the entire genome, and of transcriptional regulation, transporters, and metabolic pathways *
Membranome database Membranome database provides structural and functional information about more than 6000 single-pass (bitopic) transmembrane proteins from ''Homo sapiens'', ''Arabidopsis thaliana'', ''Dictyostelium discoideum'', ''Saccharomyces cerevisiae'', '' ...
provides information about
single-pass transmembrane protein A single-pass membrane protein also known as single-spanning protein or bitopic protein is a transmembrane protein that spans the lipid bilayer only once. These proteins may constitute up to 50% of all transmembrane proteins, depending on the orga ...
s from ''E. coli'' and several other organisms
''E. coli'' statistics



Bacteriome
''E. coli'' interaction database
EcoGene
(genome database and website dedicated to ''Escherichia coli'' K-12 substrain MG1655)
EcoSal
Continually updated Web resource based on the classic ASM Press publication ''Escherichia coli and Salmonella: Cellular and Molecular Biology''
ECODAB
The structure of the O-antigens that form the basis of the serological classification of ''E. coli''
Coli Genetic Stock Center
Strains and genetic information on ''E. coli'' K-12
PortEco (formerly EcoliHub)
– NIH-funded comprehensive data resource for ''E. coli'' K-12 and its phage, plasmids, and mobile genetic elements
EcoliWiki
is the community annotation component o
PortEco

RegulonDB
RegulonDB is a model of the complex regulation of transcription initiation or regulatory network of the cell ''E. coli'' K-12.


AlignACE
Matrices that search for additional binding sites in the ''E. coli'' genomic sequence
''E. coli'' on Protein Data Bank
{{Authority control Gut flora bacteria Tropical diseases Model organisms Bacteria described in 1919