PhiX174
   HOME

TheInfoList



OR:

The phi X 174 (or ΦX174)
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
is a single-stranded DNA ( ssDNA) virus that infects ''
Escherichia coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Escher ...
'', and the first DNA-based
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
to be sequenced. This work was completed by
Fred Sanger Frederick Sanger (; 13 August 1918 – 19 November 2013) was an English biochemist who received the Nobel Prize in Chemistry twice. He won the 1958 Chemistry Prize for determining the amino acid sequence of insulin and numerous other ...
and his team in 1977. In 1962,
Walter Fiers Walter Fiers (31 January 1931 in Ypres, West Flanders – 28 July 2019 in Destelbergen) was a Belgian molecular biologist. He obtained a degree of Engineer for Chemistry and Agricultural Industries at the University of Ghent in 1954, and started ...
and Robert Sinsheimer had already demonstrated the physical, covalently closed circularity of ΦX174 DNA. Nobel prize winner
Arthur Kornberg Arthur Kornberg (March 3, 1918 – October 26, 2007) was an American biochemist who won the Nobel Prize in Physiology or Medicine 1959 for the discovery of "the mechanisms in the biological synthesis of ribonucleic acid and deoxyribonucleic aci ...
used ΦX174 as a model to first prove that DNA synthesized in a test tube by purified enzymes could produce all the features of a natural virus, ushering in the age of
synthetic biology Synthetic biology (SynBio) is a multidisciplinary area of research that seeks to create new biological parts, devices, and systems, or to redesign systems that are already found in nature. It is a branch of science that encompasses a broad ran ...
. In 1972–1974, Jerard Hurwitz,
Sue Wickner Sue Hengren Wickner is an American biochemist and geneticist who is a distinguished investigator and the head of the DNA Molecular Biology section of the National Institutes of Health.   Her laboratory is under the National Cancer Institute and i ...
, and
Reed Wickner Reed B. Wickner (born c. 1942) is an American yeast geneticist. In 1994 he proposed that the 'PSI''+and RE3phenotypes in ''Saccharomyces cerevisiae'', a form of budding yeast, were caused by prion forms of native proteins - specifically, the Su ...
with collaborators identified the genes required to produce the enzymes to catalyze conversion of the single stranded form of the virus to the double stranded replicative form. In 2003, it was reported by Craig Venter's group that the genome of ΦX174 was the first to be completely assembled ''in vitro'' from synthesized oligonucleotides. The ΦX174 virus particle has also been successfully assembled ''in vitro''. In 2012, it was shown how its highly overlapping genome can be fully decompressed and still remain functional.


Genome

This
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
has a sense circular single-stranded DNA genome of 5,386
nucleotide Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules wi ...
s. The genome
GC-content In molecular biology and genetics, GC-content (or guanine-cytosine content) is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G) or cytosine (C). This measure indicates the proportion of G and C bases out o ...
is 44% and 95% of nucleotides belong to coding genes. Because of the balance base pattern of the genome, it is used as the control DNA for Illumina sequencers.


Genes

ΦX174 encodes 11 genes, named as consecutive letters of the alphabet in the order they were discovered, with the exception of A* which is an alternative start codon within the large A genes. Only genes A* and K are thought to be non-essential, although there is some doubt about A* because its start codon could be changed to ATT but not any other sequence. It is now known that the ATT is still likely capable of producing protein within ''E. coli'' and therefore this gene may in fact be essential. The first half of the ΦX174 genome features high levels of
gene overlap An overlapping gene (or OLG) is a gene whose expressible nucleotide sequence partially overlaps with the expressible nucleotide sequence of another gene. In this way, a nucleotide sequence may make a contribution to the function of one or more gen ...
with eight out of 11 genes overlapping by at least one nucleotide. These overlaps have been shown to be non-essential although the refactored phage with all gene overlaps removed had decreased fitness from wild-type. Phage ΦX174 has been used to try to establish the absence of undiscovered genetic information through a "proof by synthesis" approach.


Transcriptome

In 2020, the
transcriptome The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The t ...
of ΦX174 was generated. Notable features of the ΦX174 transcriptome is a series of up to four relatively weak promoters in series with up to four Rho-independent (intrinsic) terminators and one Rho-dependent terminator.


Proteins

ΦX174 encodes 11
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
s.


Proteome

Identification of all ΦX174 proteins using mass spectrometry has recently been reported.


Infection Cycle

Infection begins when G protein binds to
lipopolysaccharide Lipopolysaccharides (LPS) are large molecules consisting of a lipid and a polysaccharide that are bacterial toxins. They are composed of an O-antigen, an outer core, and an inner core all joined by a covalent bond, and are found in the outer m ...
s on the bacterial host cell surface. H protein (or the DNA Pilot Protein) pilots the viral genome through the bacterial membrane of ''
E.coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Esche ...
'' bacteria most likely via a predicted N-terminal
transmembrane domain A transmembrane domain (TMD) is a membrane-spanning protein domain. TMDs generally adopt an alpha helix topological conformation, although some TMDs such as those in porins can adopt a different conformation. Because the interior of the lipid bil ...
helix. However, it has become apparent that H protein is a multifunctional protein. This is the only viral
capsid A capsid is the protein shell of a virus, enclosing its genetic material. It consists of several oligomeric (repeating) structural subunits made of protein called protomers. The observable 3-dimensional morphological subunits, which may or may ...
protein of ΦX174 to lack a crystal structure for a couple of reasons. It has low aromatic content and high
glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid (carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogeni ...
content, making the protein structure very flexible and in addition, individual hydrogen atoms (the R group for glycines) are difficult to detect in protein crystallography. Additionally, H protein induces
lysis Lysis ( ) is the breaking down of the membrane of a cell, often by viral, enzymic, or osmotic (that is, "lytic" ) mechanisms that compromise its integrity. A fluid containing the contents of lysed cells is called a ''lysate''. In molecular bio ...
of the bacterial host at high concentrations as the predicted N-terminal transmembrane helix easily pokes holes through the bacterial wall. By
bioinformatics Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combi ...
, this protein contains four predicted
coiled-coil A coiled coil is a structural motif in proteins in which 2–7 alpha helix, alpha-helices are coiled together like the strands of a rope. (Protein dimer, Dimers and Protein trimer, trimers are the most common types.) Many coiled coil-type protei ...
domains which has a significant homology to known transcription factors. Additionally, it was determined that ''de novo'' H protein was required for optimal synthesis of other viral proteins. Mutations in H protein that prevent viral incorporation, can be overcome when excess amounts of protein B, the internal scaffolding protein, are supplied. The DNA is ejected through a hydrophilic channel at the 5-fold vertex. It is understood that H protein resides in this area but experimental evidence has not verified its exact location. Once inside the host bacterium, replication of the ssDNA genome proceeds via
negative sense DNA Negative may refer to: Science and mathematics * Negative number * Negative mass * Negative energy * Negative charge, one of the two types of electric charge * Negative (electrical polarity), in electric circuits * Negative result (disambig ...
intermediate. This is done as the phage genome supercoils and the secondary structure formed by such supercoiling attracts a
primosome In molecular biology, a primosome is a protein complex responsible for creating RNA primers on single stranded DNA during DNA replication. The primosome consists of seven proteins: DnaG primase, DnaB helicase, DnaC helicase assistant, DnaT, ...
protein complex. This translocates once around the genome and synthesizes a sDNA from the positive original genome. sDNA genomes to package into viruses are created from this by a rolling circle mechanism. This is the mechanism by which the double stranded supercoiled genome is nicked on the positive strand by a virus-encoded A protein, also attracting a bacterial
DNA polymerase A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create ...
(DNAP) to the site of cleavage. DNAP uses the negative strand as a template to make positive sense DNA. As it translocates around the genome it displaces the outer strand of already-synthesised DNA, which is immediately coated by
SSBP Single-stranded binding proteins (SSBs) are a class of proteins that have been identified in both viruses and organisms from bacteria to humans. Viral SSB Although the overall picture of ''human cytomegalovirus'' (HHV-5) DNA synthesis appears ...
proteins. The A protein cleaves the complete genome every time it recognises the origin sequence. As D protein is the most abundant gene transcript, it is the most protein in the viral procapsid. Similarly, gene transcripts for F, J, and G are more abundant than for H as the
stoichiometry Stoichiometry refers to the relationship between the quantities of reactants and products before, during, and following chemical reactions. Stoichiometry is founded on the law of conservation of mass where the total mass of the reactants equal ...
for these structural proteins is 5:5:5:1. The primosomes are protein complexes which attach/bind the enzyme
helicase Helicases are a class of enzymes thought to be vital to all organisms. Their main function is to unpack an organism's genetic material. Helicases are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separatin ...
on the template. Primosomes gives RNA primers for DNA synthesis to strands.


Phylogenetics and diversity

ΦX174 is closely related to other
microviridae ''Microviridae'' is a family of bacteriophages with a single-stranded DNA genome. The name of this family is derived from the ancient Greek word (), meaning "small". This refers to the size of their genomes, which are among the smallest of the ...
, especially the NC phage (e.g. NC1, NC7, NC11, NC16, NC37, NC5, NC41, NC56, NC51, etc.) and more distantly related to the G4-like phages and even more distantly related to the α3-like phage. Rokyta et al. 2006 presented a phylogenetic tree of their relationships.


Uses


Experimental evolution

ΦX174 has been used as a
model organism A model organism (often shortened to model) is a non-human species that is extensively studied to understand particular biological phenomena, with the expectation that discoveries made in the model organism will provide insight into the workin ...
in many evolution experiments.


Biotechnology

ΦX174 is regularly used as a positive control in
DNA sequencing DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. Th ...
due to its relatively small genome size in comparison to other organisms, its relatively balanced nucleotide content — about 23% G, 22% C, 24% A, and 31% T, i.e., 45% G+C and 55% A+T, see the accession NC_001422.1 for its 5,386 nucleotide long sequence. Illumina's sequencing instruments use ΦX174 as a positive control, and a single Illumina sequencing run can cover the ΦX174 genome several million times over, making this very likely the most heavily sequenced genome in history. ΦX174 is also used to test the resistance of
personal protective equipment Personal protective equipment (PPE) is protective clothing, helmets, goggles, or other garments or equipment designed to protect the wearer's body from injury or infection. The hazards addressed by protective equipment include physical, elec ...
to bloodborne viruses. ΦX174 has also been modified to enable peptide display (phage display) from the viral capsid G protein.


Synthetic Biology

The ΦX174 genome was the first phage to be cloned in yeast, which provides a convenient drydock for genome modifications. ΦX174 was also the first genome to be fully decompressed, having all gene overlaps removed. The effect of these changes resulted in significantly reduced host attachment, protein expression dysregulation, and heat sensitivity.


See also

*
Bacteriophage MS2 Bacteriophage MS2 (''Emesvirus zinderi''), commonly called MS2, is an icosahedral, positive-sense single-stranded RNA virus that infects the bacterium ''Escherichia coli'' and other members of the Enterobacteriaceae. MS2 is a member of a family ...


References


External links

* {{Use dmy dates, date=March 2019 Microviridae