Bacterial Phylodynamics
   HOME

TheInfoList



OR:

Bacterial phylodynamics is the study of
immunology Immunology is a branch of medicineImmunology for Medical Students, Roderick Nairn, Matthew Helbert, Mosby, 2007 and biology that covers the medical study of immune systems in humans, animals, plants and sapient species. In such we can see there ...
,
epidemiology Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and determinants of health and disease conditions in a defined population. It is a cornerstone of public health, and shapes policy decisions and evidenc ...
, and
phylogenetics In biology, phylogenetics (; from Greek language, Greek wikt:φυλή, φυλή/wikt:φῦλον, φῦλον [] "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary his ...
of bacterial pathogens to better understand the
evolutionary Evolution is change in the heredity, heritable Phenotypic trait, characteristics of biological populations over successive generations. These characteristics are the Gene expression, expressions of genes, which are passed on from parent to ...
role of these pathogens. Phylodynamic analysis includes analyzing
genetic diversity Genetic diversity is the total number of genetic characteristics in the genetic makeup of a species, it ranges widely from the number of species to differences within species and can be attributed to the span of survival for a species. It is dis ...
,
natural selection Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the heritable traits characteristic of a population over generations. Charle ...
, and population dynamics of infectious disease pathogen phylogenies during pandemics and studying intra-host evolution of viruses. Phylodynamics combines the study of phylogenetic analysis, ecological, and evolutionary processes to better understand of the mechanisms that drive spatiotemporal incidence and phylogenetic patterns of bacterial pathogens. Bacterial phylodynamics uses
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
-wide
single-nucleotide polymorphism In genetics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently lar ...
s (SNP) in order to better understand the evolutionary mechanism of bacterial pathogens. Many phylodynamic studies have been performed on viruses, specifically
RNA viruses ''Orthornavirae'' is a kingdom of viruses that have genomes made of ribonucleic acid (RNA), those genomes encoding an RNA-dependent RNA polymerase (RdRp). The RdRp is used to transcribe the viral RNA genome into messenger RNA (mRNA) and to repli ...
(see
Viral phylodynamics Viral phylodynamics is defined as the study of how epidemiology, epidemiological, immune system, immunological, and evolutionary processes act and potentially interact to shape virus, viral Phylogenetic tree, phylogenies. Since the coining of the te ...
) which have high mutation rates. The field of bacterial phylodynamics has increased substantially due to the advancement of
next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
and the amount of data available.


Methods


Novel hypothesis (study design)

Studies can be designed to observe intra-host or inter-host interactions. Bacterial phylodynamic studies usually focus on inter-host interactions with samples from many different hosts in a specific geographical location or several different geographical locations. The most important part of a study design is how to organize the sampling strategy. For example, the number of sampled time points, the sampling interval, and the number of sequences per time point are crucial to phylodynamic analysis.
Sampling bias In statistics, sampling bias is a bias in which a sample is collected in such a way that some members of the intended population have a lower or higher sampling probability than others. It results in a biased sample of a population (or non-human fa ...
causes problems when looking at a diverse taxological samples. For example, sampling from a limited geographical location may impact effective population size.


Generating data


Experimental settings

Sequencing of the genome or genomic regions and what sequencing technique to use is an important experimental setting to phylodynamic analysis. Whole genome sequencing is often performed on bacterial genomes, although depending on the design of the study, many different methods can be utilized for phylodynamic analysis. Bacterial genomes are much larger and have a slower evolutionary rate than RNA viruses, limiting studies on the bacterial phylodynamics. The advancement of sequencing technology has made bacterial phylodynamics possible but proper preparation of the whole bacterial genomes is mandatory.


= Alignment

= When a new dataset with samples for phylodynamic analysis are obtained, the sequences in the new data set are aligned. A BLAST search is frequently executed to find similar strains of the pathogen of interest. Sequences collected from BLAST for an alignment will need the proper information to be added to a data set, such as sample collection date and geographical location of the sample. Multiple sequence alignment algorithms (e.g., MUSCLE, MAFFT, and CLUSAL W) will align the data set with all selected sequences. After the running a multiple sequence alignment algorithm, manual editing the alignment is highly recommended. Multiple sequence alignment algorithms can leave a large amount of indels in the sequence alignment when the indels do not exist. Manually editing the indels in the data set will allow a more accurate phylogenetic tree.


= Quality control

= In order to have an accurate phylodynamic analysis, quality control methods must be performed. This includes checking the samples in the data set for possible contamination, measuring
phylogenetic signal Phylogenetic signal is an evolutionary and ecological term, that describes the tendency or the pattern of related biological species to resemble each other more than any other species that is randomly picked from the same phylogenetic tree. Chara ...
of the sequences, and checking the sequences for possible signs of recombinant strains. Contamination of samples in the data set can be excluded with by various laboratory methods and by proper DNA/RNA extraction methods. There are several way to check for phylogenetic signal in an alignment, such as likelihood mapping, transition/transversions versus divergence plots, and the Xia test for saturation. If phylogenetic signal of an alignment is too low then a longer alignment or an alignment of another gene in the organism may be necessary to perform phylogenetic analysis. Typically substitution saturation is only in issue in data sets with viral sequences. Most algorithms used for phylogenetic analysis do not take into recombination into account, which can alter the molecular clock and coalescent estimates of a multiple sequence alignment. Strains that show signs of recombination should either be excluded from the data set or analyzed on their own.


Data analysis


Evolutionary model

The best fitting nucleotide or amino acid substitution model for a multiple sequence alignment is the first step in phylodynamic analysis. This can be accomplished with several different algorithms (e.g., IQTREE, MEGA).


Phylogeny inference

There are several different methods to infer phylogenies. These include methods include tree building algorithms such as
UPGMA UPGMA (unweighted pair group method with arithmetic mean) is a simple agglomerative (bottom-up) hierarchical clustering method. The method is generally attributed to Sokal and Michener. The UPGMA method is similar to its ''weighted'' variant, the ...
,
neighbor joining In bioinformatics, neighbor joining is a bottom-up (agglomerative) clustering method for the creation of phylogenetic trees, created by Naruya Saitou and Masatoshi Nei in 1987. Usually based on DNA or protein sequence data, the algorithm require ...
,
maximum parsimony In phylogenetics, maximum parsimony is an optimality criterion under which the phylogenetic tree that minimizes the total number of character-state changes (or miminizes the cost of differentially weighted character-state changes) is preferred. ...
,
maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimation theory, estimating the Statistical parameter, parameters of an assumed probability distribution, given some observed data. This is achieved by Mathematical optimization, ...
, and Bayesian analysis.


Hypothesis testing


Assessing phylogenetic support

Testing the reliability of the tree after inferring its phylogeny, is a crucial step in the phylodynamic pipeline. Methods to test the reliability of a tree include bootstrapping, maximum likelihood estimation, and
posterior probabilities The posterior probability is a type of conditional probability that results from updating the prior probability with information summarized by the likelihood via an application of Bayes' rule. From an epistemological perspective, the posterior p ...
in Bayesian analysis.


Phylodynamics inference

Several methods are used to assess phylodynamic reliability of a data set. These methods include estimating the data set's
molecular clock The molecular clock is a figurative term for a technique that uses the mutation rate of biomolecules to deduce the time in prehistory when two or more life forms diverged. The biomolecular data used for such calculations are usually nucleoti ...
, demographic history, population structure,
gene flow In population genetics, gene flow (also known as gene migration or geneflow and allele flow) is the transfer of genetic material from one population to another. If the rate of gene flow is high enough, then two populations will have equivalent a ...
, and selection analysis. Phylodynamic results of a data set can also influence better study designs in future experiments.


Examples


Phylodynamics of cholera

Cholera Cholera is an infection of the small intestine by some strains of the bacterium ''Vibrio cholerae''. Symptoms may range from none, to mild, to severe. The classic symptom is large amounts of watery diarrhea that lasts a few days. Vomiting and ...
is a diarrheal disease that is caused by the bacterium ''
Vibrio cholerae ''Vibrio cholerae'' is a species of Gram-negative, facultative anaerobe and comma-shaped bacteria. The bacteria naturally live in brackish or saltwater where they attach themselves easily to the chitin-containing shells of crabs, shrimps, and oth ...
.'' '' V. cholerae'' has been a popular bacterium for phylodynamic analysis after the 2010 cholera outbreak in Haiti. The cholera outbreak happened right after the
2010 earthquake in Haiti A catastrophic magnitude 7.0 Mw earthquake struck Haiti at 16:53 local time (21:53 UTC) on Tuesday, 12 January 2010. The epicenter was near the town of Léogâne, Ouest department, approximately west of Port-au-Prince, Haiti's capi ...
, which caused critical infrastructure damage, leading to the conclusion that the outbreak was most likely due to the '' V. cholerae'' bacterium being introduced naturally to the waters in Haiti from the earthquake. Soon after the earthquake, the UN sent
MINUSTAH ) , leader_title = Head , leader_name = Sandra Honoré (Special Representative of the Secretary-General) , status = Replaced by MINUJUSTH , formation = 1 June 2004 , websiteUN Peacekeeping: MINUSTAH
troops from Nepal to Haiti. Rumors started circulating about terrible conditions of the
MINUSTAH ) , leader_title = Head , leader_name = Sandra Honoré (Special Representative of the Secretary-General) , status = Replaced by MINUJUSTH , formation = 1 June 2004 , websiteUN Peacekeeping: MINUSTAH
camp, as well as people claiming that the
MINUSTAH ) , leader_title = Head , leader_name = Sandra Honoré (Special Representative of the Secretary-General) , status = Replaced by MINUJUSTH , formation = 1 June 2004 , websiteUN Peacekeeping: MINUSTAH
troops were deposing of their waste in the Artibonite River, which is the major water source in the surrounding area. Soon after the MINUSTAH troops arrival, the first cholera case was reported near the location of the
MINUSTAH ) , leader_title = Head , leader_name = Sandra Honoré (Special Representative of the Secretary-General) , status = Replaced by MINUJUSTH , formation = 1 June 2004 , websiteUN Peacekeeping: MINUSTAH
camp. Phylodynamic analysis was used to look into the source of the Haiti cholera outbreak. Whole genome sequencing of '' V. cholerae'' revealed that there was one single point source of the cholera outbreak in Haiti and it was similar to O1 strains circulating in South Asia. Before the MINUSTAH troops from Nepal were sent to Haiti, a cholera outbreak had just occurred in Nepal. In the original research to trace the origin of the outbreak, the Nepal strains were not available. Phylodynamic analyses were performed on the Haitian strain and the Nepalese strain when it became available and affirmed that the Haitian cholera strain was the most similar to the Nepalese cholera strain. This outbreak strain of cholera in Haiti showed signs of an altered or hybrid strain of '' V. cholerae'' associated with high virulence. Typically high quality single-nucleotide polymorphisms (hqSNP) from whole genome '' V. cholerae'' sequences are used for phylodynamic analysis. Using phylodynamic analysis to study cholera helps prediction and understanding of ''V. cholerae'' evolution during bacterial epidemics.


See also

*
2010 Haiti cholera outbreak The 2010s Haiti cholera outbreak is the first modern large-scale outbreak of cholera—a disease once considered beaten back largely due to the invention of modern sanitation. The disease was reintroduced to Haiti in October 2010, not long after ...
* Computational phylogenetics * MAFFT * United Nations Stabilisation Mission in Haiti *
Viral phylodynamics Viral phylodynamics is defined as the study of how epidemiology, epidemiological, immune system, immunological, and evolutionary processes act and potentially interact to shape virus, viral Phylogenetic tree, phylogenies. Since the coining of the te ...


References

{{Reflist Bacteriology Evolutionary biology Phylogenetics