Genomic streamlining is a theory in
evolutionary biology
Evolutionary biology is the subfield of biology that studies the evolutionary processes (natural selection, common descent, speciation) that produced the diversity of life on Earth. It is also defined as the study of the history of life fo ...
and
microbial ecology
Microbial ecology (or environmental microbiology) is the ecology of microorganisms: their relationship with one another and with their environment. It concerns the three major domains of life—Eukaryota, Archaea, and Bacteria—as well as viru ...
that suggests that there is a reproductive benefit to
prokaryote
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
s having a smaller genome size with less
non-coding DNA
Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules (e.g. transfer RNA, microRNA, piRNA, ribosomal RNA, and regul ...
and fewer non-essential genes.
There is a lot of variation in prokaryotic genome size, with the smallest free-living cell's genome being roughly ten times smaller than the largest prokaryote. Two of the bacterial taxa with the smallest genomes are ''
Prochlorococcus
''Prochlorococcus'' is a genus of very small (0.6 μm) marine cyanobacteria with an unusual pigmentation ( chlorophyll ''a2'' and ''b2''). These bacteria belong to the photosynthetic picoplankton and are probably the most abundant photosynth ...
'' and ''
Pelagibacter ubique
"''Candidatus'' Pelagibacter", with the single species "''Ca.'' P. communis", was isolated in 2002 and given a specific name, although it has not yet been described as required by the bacteriological code. It is an abundant member of the SAR11 c ...
,''
both highly abundant
marine bacteria
Marine prokaryotes are marine bacteria and marine archaea. They are defined by their habitat as prokaryotes that live in marine environments, that is, in the saltwater of seas or oceans or the brackish water of coastal estuaries. All cellular ...
commonly found in
oligotroph
An oligotroph is an organism that can live in an environment that offers very low levels of nutrients. They may be contrasted with copiotrophs, which prefer nutritionally rich environments. Oligotrophs are characterized by slow growth, low rates of ...
ic regions. Similar reduced genomes have been found in uncultured marine bacteria, suggesting that genomic streamlining is a common feature of
bacterioplankton.
This theory is typically used with reference to free-living organisms in oligotrophic environments.
Overview
Genome streamlining theory states that certain prokaryotic
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
s tend to be small in size in comparison to other prokaryotes, and all
eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
s, due to selection against the retention of
non-coding DNA
Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules (e.g. transfer RNA, microRNA, piRNA, ribosomal RNA, and regul ...
.
The known advantages of small genome size include faster
genome replication for cell division, fewer nutrient requirements, and easier co-regulation of multiple related genes, because gene density typically increases with decreased genome size.
This means that an organism with a smaller genome is likely to be more successful, or have higher
fitness, than one hindered by excessive amounts of unnecessary DNA, leading to selection for smaller genome sizes.
Some mechanisms that are thought to underlie genome streamlining include
deletion bias and
purifying selection
In natural selection, negative selection or purifying selection is the selective removal of alleles that are deleterious. This can result in stabilising selection through the purging of deleterious genetic polymorphisms that arise through random ...
.
Deletion bias is the phenomenon in bacterial genomes where the rate of DNA loss is naturally higher than the rate of DNA acquisition.
This is a passive process that simply results from the difference in these two rates.
Purifying selection is the process by which extraneous genes are selected against, making organisms lacking this genetic material more successful by effectively reducing their genome size.
Genes and non-coding DNA segments that are less crucial for an organism survival will be more likely to be lost over time.
This selective pressure is stronger in large marine prokaryotic
populations
Population typically refers to the number of people in a single area, whether it be a city or town, region, country, continent, or the world. Governments typically quantify the size of the resident population within their jurisdiction using a ...
, because
intra-species competition favours fast, efficient and inexpensive
replication.
This is because large population sizes increase competition among members of the same species, and thus increases
selective pressure
Any cause that reduces or increases reproductive success in a portion of a population potentially exerts evolutionary pressure, selective pressure or selection pressure, driving natural selection. It is a quantitative description of the amount of ...
and causes the reduction in genome size to occur more readily among organisms of large population sizes, like bacteria.
This may explain why genome streamlining seems to be particularly prevalent in prokaryotic organisms, as they tend to have larger population sizes than eukaryotes.
It has also been proposed that having a smaller genome can help minimize overall cell size, which increases a prokaryotes
surface-area to volume ratio. A higher surface-area to volume ratio allows for more nutrient uptake proportional to their size, which allows them to outcompete other larger organisms for nutrients.
This phenomenon has been noted particularly in nutrient depleted waters.
Genomic signatures
Genomic analysis of streamlined organisms have shown that low
GC content
In molecular biology and genetics, GC-content (or guanine-cytosine content) is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G) or cytosine (C). This measure indicates the proportion of G and C bases out o ...
, low percentage of non-coding DNA, and a low fraction of genes encoding for
cytoplasmic membrane
The cell membrane (also known as the plasma membrane (PM) or cytoplasmic membrane, and historically referred to as the plasmalemma) is a biological membrane that separates and protects the interior of all cells from the outside environment ( ...
proteins,
periplasmic proteins
The periplasm is a concentrated gel-like matrix in the space between the inner cytoplasmic membrane and the bacterial outer membrane called the ''periplasmic space'' in gram-negative bacteria. Using cryo-electron microscopy it has been found that ...
,
transcriptionally related proteins, and
signal transduction
Signal transduction is the process by which a chemical or physical signal is transmitted through a cell as a series of molecular events, most commonly protein phosphorylation catalyzed by protein kinases, which ultimately results in a cellula ...
pathways are all characteristic of free-living streamlined prokaryotic organisms.
Oftentimes, highly streamlined organisms are difficult to isolate by culturing in a laboratory (SAR11 being a central example).
Model organisms
''Pelagibacter ubique'' (SAR11)
''
Pelagibacter ubique
"''Candidatus'' Pelagibacter", with the single species "''Ca.'' P. communis", was isolated in 2002 and given a specific name, although it has not yet been described as required by the bacteriological code. It is an abundant member of the SAR11 c ...
'' are members of the
SAR11 clade
The Pelagibacterales are an order in the Alphaproteobacteria composed of free-living marine bacteria that make up roughly one in three cells at the ocean's surface. Overall, members of the ''Pelagibacterales'' are estimated to make up between a q ...
, a
heterotroph
A heterotroph (; ) is an organism that cannot produce its own food, instead taking nutrition from other sources of organic carbon, mainly plant or animal matter. In the food chain, heterotrophs are primary, secondary and tertiary consumers, but ...
ic marine group which are found throughout the oceans and are rather common.
These microbes have the smallest genome and encode the smallest number of
Open Reading Frames
In molecular biology, open reading frames (ORFs) are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible readin ...
of any known non-sessile microorganism.
''P. ubique'' has complete biosynthetic pathways and all necessary enzymes for the synthesis of 20
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
s and only lack a few
cofactors
Cofactor may also refer to:
* Cofactor (biochemistry), a substance that needs to be present in addition to an enzyme for a certain reaction to be catalysed
* A domain parameter in elliptic curve cryptography, defined as the ratio between the order ...
despite the genome's small size. The genome size for this microorganism is achieved by lack of, "pseudogenes, introns, transposons, extrachromosomal elements, or inteins". The genome also contains fewer
paralogs
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a spec ...
compared to other members of the same clade and the shortest
intergenic spacers for any living cell.
In these organisms, unusual nutrient requirements were found due to the streamlining selection and gene loss when selection occurred for more efficient resource utilization in oceans with limited nutrients for uptake.
These observations indicate that some microbes may be difficult to grow in a laboratory setting because of unusual nutrient requirements.
''Prochlorococcus''
''
Prochlorococcus
''Prochlorococcus'' is a genus of very small (0.6 μm) marine cyanobacteria with an unusual pigmentation ( chlorophyll ''a2'' and ''b2''). These bacteria belong to the photosynthetic picoplankton and are probably the most abundant photosynth ...
'' is one of the dominant
cyanobacteria
Cyanobacteria (), also known as Cyanophyta, are a phylum of gram-negative bacteria that obtain energy via photosynthesis. The name ''cyanobacteria'' refers to their color (), which similarly forms the basis of cyanobacteria's common name, blu ...
and is a main participant in primary production in oligotrophic waters.
It is the smallest and most abundant photosynthetic organism recorded on Earth.
As a cyanobacteria, they have an incredible ability to adapt to environments with very poor nutrient availability, as they maintain their energy from light.
The
nitrogen assimilation pathway in this organism has been significantly modified to adapt to the nutritional limitations of the organisms’ habitats.
These adaptations led to the removal of key enzymes from the genome, such as
nitrate reductase
Nitrate reductases are molybdoenzymes that reduce nitrate (NO) to nitrite (NO). This reaction is critical for the production of protein in most crop plants, as nitrate is the predominant source of nitrogen in fertilized soils.
Types
Euka ...
,
nitrite reductase
Nitrite reductase refers to any of several classes of enzymes that catalyze the reduction of nitrite. There are two classes of NIR's. A multi haem enzyme reduces NO2− to a variety of products. Copper containing enzymes carry out a single elect ...
, and often
urease
Ureases (), functionally, belong to the superfamily of amidohydrolases and phosphotriesterases. Ureases are found in numerous bacteria, fungi, algae, plants, and some invertebrates, as well as in soils, as a soil enzyme. They are nickel-containin ...
.
Unlike some cyanobacterial counterparts, ''Prochlorococcus'' is not able to fix atmospheric nitrogen (N
2).
The only nitrogen sources found to be used by this species are ammonia, which is incorporated into glutamate via the enzyme
glutamine synthetase
Glutamine synthetase (GS) () is an enzyme that plays an essential role in the metabolism of nitrogen by catalyzing the condensation of glutamate and ammonia to form glutamine:
Glutamate + Adenosine triphosphate, ATP + NH3 → Glutamine + Ad ...
and uses less energy compared to nitrate usage, and in certain species, urea.
Moreover, metabolic regulation systems of ''Prochlorococcus'' were found to be greatly simplified.
Nitrogen-fixing marine cyanobacteria (UCYN-A)
Nitrogen-fixing marine cyanobacteria are known to support oxygen production in oceans by fixing inorganic nitrogen using the enzyme
nitrogenase
Nitrogenases are enzymes () that are produced by certain bacteria, such as cyanobacteria (blue-green bacteria) and rhizobacteria. These enzymes are responsible for the Organic redox reaction, reduction of nitrogen (N2) to ammonia (NH3). Nitrog ...
.
A special subset of these bacteria, UCYN-A, was found to lack the
photosystem II
Photosystem II (or water-plastoquinone oxidoreductase) is the first protein complex in the light-dependent reactions of oxygenic photosynthesis. It is located in the thylakoid membrane of plants, algae, and cyanobacteria. Within the photosystem ...
complex usually used in
photosynthesis
Photosynthesis is a process used by plants and other organisms to convert light energy into chemical energy that, through cellular respiration, can later be released to fuel the organism's activities. Some of this chemical energy is stored i ...
and that it lacks a number of major metabolic pathways but is still capable of using the electron transport chain to generate energy from a light source.
Furthermore,
anabolic
Anabolism () is the set of metabolic pathways that construct molecules from smaller units. These reactions require energy, known also as an endergonic process. Anabolism is the building-up aspect of metabolism, whereas catabolism is the breaking-do ...
enzymes needed for creating amino acids such as valine, leucine and isoleucine are missing, as well as some which lead to phenylalanine, tyrosine and tryptophan biosynthesis.
This organism seems to be an obligate
photoheterotroph
Photoheterotrophs ('' Gk'': ''photo'' = light, ''hetero'' = (an)other, ''troph'' = nourishment) are heterotrophic phototrophs – that is, they are organisms that use light for energy, but cannot use carbon dioxide as their sole carbon source. Cons ...
that uses carbon substrates for energy production and some biosynthetic materials for biosynthesis. It was discovered that UCYN-A developed a reduced genome of only 1.44 Megabases that is smaller but similar in structure to that of chloroplasts.
In comparison with related species such as ''
Crocosphaera watsonii'' and ''
Cyanothece
''Cyanothece'' is a genus of unicellular, diazotrophic, oxygenic photosynthesizing cyanobacteria.
Modern organisms and cellular organization
In 1976, Jiří Komárek defined the prokaryotic cyanobacteria genus ''Cyanothece'' as distinct from ' ...
'' sp., which employ genomes which range in length from 5.46 to 6.24 megabases, the UCYN-A genome is much smaller. The compacted genome is a single, circular chromosome with “1,214 identified protein-coding regions”.
The genome of UCYN-A is also highly conserved ( >97% nucleotide identity) across ocean waters, which is atypical of ocean microbes. The lack of UCYN-A genome diversity, presence of nitrogenase and
hydrogenase A hydrogenase is an enzyme that catalyses the reversible oxidation of molecular hydrogen (H2), as shown below:
Hydrogen uptake () is coupled to the reduction of electron acceptors such as oxygen, nitrate, sulfate, carbon dioxide (), and fumara ...
enzymes for the
TCA cycle
The citric acid cycle (CAC)—also known as the Krebs cycle or the TCA cycle (tricarboxylic acid cycle)—is a series of chemical reactions to release stored energy through the oxidation of acetyl-CoA derived from carbohydrates, fats, and proteins ...
, reduced genome size and coding efficiency of the DNA suggest that this microorganism may have symbiotic lifestyle and live in close association with a host. However, the true lifestyle of this microbe remains unknown.
Alternative cases of small genomes
Bacterial symbionts, commensals, parasites, and pathogens
Bacterial symbionts,
commensals,
parasites
Parasitism is a Symbiosis, close relationship between species, where one organism, the parasite, lives on or inside another organism, the Host (biology), host, causing it some harm, and is Adaptation, adapted structurally to this way of lif ...
, and
pathogens
In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
often have even smaller genomes and fewer genes than free-living organisms, and non-pathogenic bacteria.
They reduce their "core" metabolic repertoire, making them more dependent on their host and environment.
Their genome reduction occurs by different evolutionary mechanisms than those of streamlined free-living organisms.
Pathogenic organisms are thought to undergo genome reduction due to
genetic drift
Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance.
Genetic drift may cause gene variants to disappear completely and there ...
, rather than
purifying selection
In natural selection, negative selection or purifying selection is the selective removal of alleles that are deleterious. This can result in stabilising selection through the purging of deleterious genetic polymorphisms that arise through random ...
.
Genetic drift is caused by small and effective populations within a microbial community, rather than large and dominating populations.
In this case, DNA mutations happen by chance, and thus often lead to maladaptive genome degradation and lower overall fitness.
Rather than losing non-coding DNA regions or extraneous genes to increase fitness during replication, they lose certain "core" metabolic genes that may now be supplemented by their host, symbiont, or environment.
Since their genome reduction is less dependent on fitness,
pseudogene
Pseudogenes are nonfunctional segments of DNA that resemble functional genes. Most arise as superfluous copies of functional genes, either directly by DNA duplication or indirectly by Reverse transcriptase, reverse transcription of an mRNA trans ...
s are frequent in these organisms.
They also typically undergo low rates of
horizontal gene transfer
Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between Unicellular organism, unicellular and/or multicellular organisms other than by the ("vertical") transmission of DNA from parent to offsprin ...
(HGT).
Viruses
Viral genomes resemble prokaryotic genomes in that they have very few non-coding regions. They are, however, significantly smaller than prokaryotic genomes. While viruses are
obligate intracellular parasites, viral genomes are considered streamlined due to the strong purifying selection that occurs when the virus has successfully infected a
host
A host is a person responsible for guests at an event or for providing hospitality during it.
Host may also refer to:
Places
* Host, Pennsylvania, a village in Berks County
People
*Jim Host (born 1937), American businessman
* Michel Host ...
.
During the initial phase of an
infection
An infection is the invasion of tissues by pathogens, their multiplication, and the reaction of host tissues to the infectious agent and the toxins they produce. An infectious disease, also known as a transmissible disease or communicable dise ...
, there is a large
bottleneck
Bottleneck literally refers to the narrowed portion (neck) of a bottle
A bottle is a narrow-necked container made of an impermeable material (such as glass, plastic or aluminium) in various shapes and sizes that stores and transports liquids ...
for the virus population which allows for more genetic diversity, but due to the rapid replication of these viruses, the population size is restored quickly and the diversity within the population is reduced.
RNA virus
An RNA virus is a virusother than a retrovirusthat has ribonucleic acid (RNA) as its genetic material. The nucleic acid is usually single-stranded RNA ( ssRNA) but it may be double-stranded (dsRNA). Notable human diseases caused by RNA viruses ...
es in particular are known to have exceptionally small genomes.
This is at least in part due to the fact that they have
overlapping gene
An overlapping gene (or OLG) is a gene whose expressible Nucleic acid sequence, nucleotide sequence partially overlaps with the expressible nucleotide sequence of another gene. In this way, a nucleotide sequence may make a contribution to the func ...
s.
By reducing their genome size, they increase their fitness due to faster replication.
The virus will then be able to increase population size more rapidly with faster replication rates.
Eukaryotes - birds
Genomic streamlining has been used to explain certain eukaryotic genome sizes as well, particularly bird genomes. Larger genomes require a larger nucleus, which typically translates to a larger cell size.
For this reason, many bird genomes have also been under selective pressure to decrease in size.
Flying with a larger mass due to larger cells is more energetically expensive than with a smaller mass.
References
{{Reflist, 32em
Evolutionary biology