Phi X 174
   HOME

TheInfoList



OR:

The phi X 174 (or ΦX174) bacteriophage is a single-stranded DNA ( ssDNA) virus that infects ''
Escherichia coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Esc ...
'', and the first DNA-based
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding g ...
to be sequenced. This work was completed by
Fred Sanger Frederick Sanger (; 13 August 1918 – 19 November 2013) was an English biochemist who received the Nobel Prize in Chemistry twice. He won the 1958 Chemistry Prize for determining the amino acid sequence of insulin and numerous othe ...
and his team in 1977. In 1962,
Walter Fiers Walter Fiers (31 January 1931 in Ypres, West Flanders – 28 July 2019 in Destelbergen) was a Belgian molecular biologist. He obtained a degree of Engineer for Chemistry and Agricultural Industries at the University of Ghent in 1954, and started ...
and Robert Sinsheimer had already demonstrated the physical, covalently closed circularity of ΦX174 DNA. Nobel prize winner
Arthur Kornberg Arthur Kornberg (March 3, 1918 – October 26, 2007) was an American biochemist who won the Nobel Prize in Physiology or Medicine 1959 for the discovery of "the mechanisms in the biological synthesis of ribonucleic acid and deoxyribonucleic ac ...
used ΦX174 as a model to first prove that DNA synthesized in a test tube by purified enzymes could produce all the features of a natural virus, ushering in the age of
synthetic biology Synthetic biology (SynBio) is a multidisciplinary area of research that seeks to create new biological parts, devices, and systems, or to redesign systems that are already found in nature. It is a branch of science that encompasses a broad ran ...
. In 1972–1974, Jerard Hurwitz,
Sue Wickner Sue Hengren Wickner is an American biochemist and geneticist who is a distinguished investigator and the head of the DNA Molecular Biology section of the National Institutes of Health.   Her laboratory is under the National Cancer Institute and ...
, and Reed Wickner with collaborators identified the genes required to produce the enzymes to catalyze conversion of the single stranded form of the virus to the double stranded replicative form. In 2003, it was reported by Craig Venter's group that the genome of ΦX174 was the first to be completely assembled ''in vitro'' from synthesized oligonucleotides. The ΦX174 virus particle has also been successfully assembled ''in vitro''. In 2012, it was shown how its highly overlapping genome can be fully decompressed and still remain functional.


Genome

This bacteriophage has a sense circular single-stranded DNA genome of 5,386
nucleotide Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecule ...
s. The genome GC-content is 44% and 95% of nucleotides belong to coding genes. Because of the balance base pattern of the genome, it is used as the control DNA for Illumina sequencers.


Genes

ΦX174 encodes 11 genes, named as consecutive letters of the alphabet in the order they were discovered, with the exception of A* which is an alternative start codon within the large A genes. Only genes A* and K are thought to be non-essential, although there is some doubt about A* because its start codon could be changed to ATT but not any other sequence. It is now known that the ATT is still likely capable of producing protein within ''E. coli'' and therefore this gene may in fact be essential. The first half of the ΦX174 genome features high levels of gene overlap with eight out of 11 genes overlapping by at least one nucleotide. These overlaps have been shown to be non-essential although the refactored phage with all gene overlaps removed had decreased fitness from wild-type. Phage ΦX174 has been used to try to establish the absence of undiscovered genetic information through a "proof by synthesis" approach.


Transcriptome

In 2020, the
transcriptome The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The t ...
of ΦX174 was generated. Notable features of the ΦX174 transcriptome is a series of up to four relatively weak promoters in series with up to four Rho-independent (intrinsic) terminators and one Rho-dependent terminator.


Proteins

ΦX174 encodes 11
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
s.


Proteome

Identification of all ΦX174 proteins using mass spectrometry has recently been reported.


Infection Cycle

Infection begins when G protein binds to
lipopolysaccharide Lipopolysaccharides (LPS) are large molecules consisting of a lipid and a polysaccharide that are bacterial toxins. They are composed of an O-antigen, an outer core, and an inner core all joined by a covalent bond, and are found in the outer ...
s on the bacterial host cell surface. H protein (or the DNA Pilot Protein) pilots the viral genome through the bacterial membrane of ''
E.coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Esche ...
'' bacteria most likely via a predicted N-terminal
transmembrane domain A transmembrane domain (TMD) is a membrane-spanning protein domain. TMDs generally adopt an alpha helix topological conformation, although some TMDs such as those in porins can adopt a different conformation. Because the interior of the lipid bi ...
helix. However, it has become apparent that H protein is a multifunctional protein. This is the only viral
capsid A capsid is the protein shell of a virus, enclosing its genetic material. It consists of several oligomeric (repeating) structural subunits made of protein called protomers. The observable 3-dimensional morphological subunits, which may or ma ...
protein of ΦX174 to lack a crystal structure for a couple of reasons. It has low aromatic content and high
glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinog ...
content, making the protein structure very flexible and in addition, individual hydrogen atoms (the R group for glycines) are difficult to detect in protein crystallography. Additionally, H protein induces lysis of the bacterial host at high concentrations as the predicted N-terminal transmembrane helix easily pokes holes through the bacterial wall. By bioinformatics, this protein contains four predicted coiled-coil domains which has a significant homology to known transcription factors. Additionally, it was determined that ''de novo'' H protein was required for optimal synthesis of other viral proteins. Mutations in H protein that prevent viral incorporation, can be overcome when excess amounts of protein B, the internal scaffolding protein, are supplied. The DNA is ejected through a hydrophilic channel at the 5-fold vertex. It is understood that H protein resides in this area but experimental evidence has not verified its exact location. Once inside the host bacterium, replication of the ssDNA genome proceeds via negative sense DNA intermediate. This is done as the phage genome supercoils and the secondary structure formed by such supercoiling attracts a
primosome In molecular biology, a primosome is a protein complex responsible for creating RNA primers on single stranded DNA during DNA replication. The primosome consists of seven proteins: DnaG primase, DnaB helicase, DnaC helicase assistant, DnaT, ...
protein complex. This translocates once around the genome and synthesizes a sDNA from the positive original genome. sDNA genomes to package into viruses are created from this by a rolling circle mechanism. This is the mechanism by which the double stranded supercoiled genome is nicked on the positive strand by a virus-encoded A protein, also attracting a bacterial
DNA polymerase A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create ...
(DNAP) to the site of cleavage. DNAP uses the negative strand as a template to make positive sense DNA. As it translocates around the genome it displaces the outer strand of already-synthesised DNA, which is immediately coated by SSBP proteins. The A protein cleaves the complete genome every time it recognises the origin sequence. As D protein is the most abundant gene transcript, it is the most protein in the viral procapsid. Similarly, gene transcripts for F, J, and G are more abundant than for H as the stoichiometry for these structural proteins is 5:5:5:1. The primosomes are protein complexes which attach/bind the enzyme
helicase Helicases are a class of enzymes thought to be vital to all organisms. Their main function is to unpack an organism's genetic material. Helicases are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separatin ...
on the template. Primosomes gives RNA primers for DNA synthesis to strands.


Phylogenetics and diversity

ΦX174 is closely related to other
microviridae ''Microviridae'' is a family of bacteriophages with a single-stranded DNA genome. The name of this family is derived from the ancient Greek word (), meaning "small". This refers to the size of their genomes, which are among the smallest of the ...
, especially the NC phage (e.g. NC1, NC7, NC11, NC16, NC37, NC5, NC41, NC56, NC51, etc.) and more distantly related to the G4-like phages and even more distantly related to the α3-like phage. Rokyta et al. 2006 presented a phylogenetic tree of their relationships.


Uses


Experimental evolution

ΦX174 has been used as a model organism in many evolution experiments.


Biotechnology

ΦX174 is regularly used as a positive control in DNA sequencing due to its relatively small genome size in comparison to other organisms, its relatively balanced nucleotide content — about 23% G, 22% C, 24% A, and 31% T, i.e., 45% G+C and 55% A+T, see the accession NC_001422.1 for its 5,386 nucleotide long sequence. Illumina's sequencing instruments use ΦX174 as a positive control, and a single Illumina sequencing run can cover the ΦX174 genome several million times over, making this very likely the most heavily sequenced genome in history. ΦX174 is also used to test the resistance of
personal protective equipment Personal protective equipment (PPE) is protective clothing, helmets, goggles, or other garments or equipment designed to protect the wearer's body from injury or infection. The hazards addressed by protective equipment include physical, e ...
to bloodborne viruses. ΦX174 has also been modified to enable peptide display (phage display) from the viral capsid G protein.


Synthetic Biology

The ΦX174 genome was the first phage to be cloned in yeast, which provides a convenient drydock for genome modifications. ΦX174 was also the first genome to be fully decompressed, having all gene overlaps removed. The effect of these changes resulted in significantly reduced host attachment, protein expression dysregulation, and heat sensitivity.


See also

* Bacteriophage MS2


References


External links

* {{Use dmy dates, date=March 2019 Microviridae