HOME

TheInfoList



OR:

Clinical metagenomic next-generation sequencing (mNGS) is the comprehensive analysis of microbial and host genetic material ( DNA or
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
) in clinical samples from patients by
next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
. It uses the techniques of metagenomics to identify and characterize the genome of
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
,
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately from ...
, parasites, and
virus A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsky's 1 ...
es without the need for a prior knowledge of a specific pathogen directly from clinical specimens. The capacity to detect all the potential
pathogen In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
s in a sample makes metagenomic next generation sequencing a potent tool in the diagnosis of infectious disease especially when other more directed assays, such as
PCR PCR or pcr may refer to: Science * Phosphocreatine, a phosphorylated creatine molecule * Principal component regression, a statistical technique Medicine * Polymerase chain reaction ** COVID-19 testing, often performed using the polymerase chain r ...
, fail. Its limitations include clinical utility, laboratory validity, sense and sensitivity, cost and regulatory considerations. Outside of
clinical medicine Medicine is the science and practice of caring for a patient, managing the diagnosis, prognosis, prevention, treatment, palliation of their injury or disease, and promoting their health. Medicine encompasses a variety of health care practice ...
, similar work is done to identify genetic material in environmental samples, such as ponds or soil.


Definition

Next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
uses the techniques of metagenomics to identify and characterize the genome of
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
,
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately from ...
, parasites, and
virus A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsky's 1 ...
es without the need for a prior knowledge of a specific pathogen directly from clinical specimens. The capacity to detect all the potential
pathogen In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
s in a sample makes metagenomic next generation sequencing a potent tool in the diagnosis of infectious disease especially when other more directed assays, such as
PCR PCR or pcr may refer to: Science * Phosphocreatine, a phosphorylated creatine molecule * Principal component regression, a statistical technique Medicine * Polymerase chain reaction ** COVID-19 testing, often performed using the polymerase chain r ...
, fail.


Laboratory workflow

A typical mNGS workflow consists of the following steps: * Sample acquisition: the most commonly used samples for metagenomic sequencing are
blood Blood is a body fluid in the circulatory system of humans and other vertebrates that delivers necessary substances such as nutrients and oxygen to the cells, and transports metabolic waste products away from those same cells. Blood in the c ...
, stool,
cerebrospinal fluid Cerebrospinal fluid (CSF) is a clear, colorless body fluid found within the tissue that surrounds the brain and spinal cord of all vertebrates. CSF is produced by specialised ependymal cells in the choroid plexus of the ventricles of the bra ...
(CSF),
urine Urine is a liquid by-product of metabolism in humans and in many other animals. Urine flows from the kidneys through the ureters to the urinary bladder. Urination results in urine being excretion, excreted from the body through the urethra. Cel ...
, or nasopharyngeal swabs. Among these, blood and CSF are the cleanest, having less background noise, while the others are expected to have a great amount of commensals and/or
opportunistic infection An opportunistic infection is an infection caused by pathogens (bacteria, fungi, parasites or viruses) that take advantage of an opportunity not normally available. These opportunities can stem from a variety of sources, such as a weakened immune ...
s and thus have more background noise. Samples should be collected with much caution as surgical specimens could be contaminated during handling of the
biopsy A biopsy is a medical test commonly performed by a surgeon, interventional radiologist, or an interventional cardiologist. The process involves extraction of sample cells or tissues for examination to determine the presence or extent of a diseas ...
; for example, lumbar punctures to obtain CSF specimens may be contaminated during the procedure. * RNA/DNA extraction: the DNA and the RNA of the sample is extracted by using an extraction kit. If there is a strong previous suspicion of the pathogen genome composition and since the amount of pathogen nucleic acid in more noise samples is overwhelmed by the RNA/DNA of other organisms, selecting an extraction kit of only RNA or DNA would be a more specific and convenient approach. Some commerciable available kits are for example RNeasy PowerSoil Total RNA kit (
Qiagen QIAGEN N.V., the global corporate headquarter of the QIAGEN group, is located in Venlo, The Netherlands. Furthermore, European, American, and Asia regional headquarters are located in respectively Hilden, Germany, Maryland United States, and Sh ...
), RNeasy Minikit (Qiagen), MagMAX Viral Isolation kit (ABI), Viral RNA Minikit (Qiagen). * Optimization strategies for library preparation: because of high levels of background noise in metagenomic sequencing, several target enrichment procedures have been developed that aim to increase the probability of capturing pathogen-derived transcripts and/or genomes. Generally there are two main approaches that can be used to increase the amount of pathogen signal in a sample: negative selection and positive enrichment. # Negative selection (background depletion or subtraction) targets and eliminates the host and microbiome genomic background, while aiming to preserve the nucleic acid derived from the pathogens of interest. Degradation of genomic background can be performed through broad-spectrum digestion with
nuclease A nuclease (also archaically known as nucleodepolymerase or polynucleotidase) is an enzyme capable of cleaving the phosphodiester bonds between nucleotides of nucleic acids. Nucleases variously effect single and double stranded breaks in their ta ...
s, such as
DNase I Deoxyribonuclease I (usually called DNase I), is an endonuclease of the DNase family coded by the human gene DNASE1. DNase I is a nuclease that cleaves DNA preferentially at phosphodiester linkages adjacent to a pyrimidine nucleotide, yielding ...
for DNA background, or by removing abundant RNA species (rRNA, mtRNA, globin mRNA) using sequence-specific RNA depletion kits. Also
CRISPR-Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
-based approaches can be performed to target and deplete human mitochrondrial RNA for example. Generally, however, subtraction approaches lead to a certain degree of loss of the targeted pathogen genome, as poor recovery may occur during the cleanup. # Positive enrichment is used to increase pathogen signal rather than reducing background noise. This is commonly done through
hybridization Hybridization (or hybridisation) may refer to: *Hybridization (biology), the process of combining different varieties of organisms to create a hybrid *Orbital hybridization, in chemistry, the mixing of atomic orbitals into new hybrid orbitals *Nu ...
-based target capture by probes, which are used to pull out nucleic acid of interest for downstream amplification and sequencing. Panviral probes have been shown to successfully identify diverse types of pathogens in different clinical fluid and respiratory samples, and have been used for sequencing and characterization of novel viruses. However, the probe approach includes extra hybridization and cleanup steps, requiring higher sample input, increasing the risk of losing the target, and increasing the cost and hands-on time. * High-throughput sequencing: all the nucleic acids fragments of the library are sequenced. The sequencing platform to be used is chosen depending on different factors such as laboratory's research objectives, personal experience and skill levels. So far, the Illumina MiSeq system has proven to be the most commonly used platform for infectious disease research, pathogen surveillance, and pathogen discovery in research and public health. The instrument is compact enough to fit on a laboratory bench, has a fast runtime as compared to other similar platforms, and has a strong user support community. However, with further improvements of this technology and with additional error reduction and software stabilization, the
MinION Places *Minions, Cornwall, a village in the United Kingdom People * Frank Minion (born 1929), American jazz and bop singer *Fred Minion, English professional footballer *Joseph Minion (born 1957), American film director and screenwriter *Marcus F ...
may be an excellent addition to the arsenal of current sequencing technologies for routine surveillance, especially in smaller laboratories with limited resources. For instance, the MinION was successfully used in the ZiBRA project for real-time
Zika virus ''Zika virus'' (ZIKV; pronounced or ) is a member of the virus family ''Flaviviridae''. It is spread by daytime-active ''Aedes'' mosquitoes, such as '' A. aegypti'' and '' A. albopictus''. Its name comes from the Ziika Forest of Uganda, whe ...
surveillance of mosquitoes and humans in
Brazil Brazil ( pt, Brasil; ), officially the Federative Republic of Brazil (Portuguese: ), is the largest country in both South America and Latin America. At and with over 217 million people, Brazil is the world's fifth-largest country by area ...
, and in
Guinea Guinea ( ),, fuf, 𞤘𞤭𞤲𞤫, italic=no, Gine, wo, Gine, nqo, ߖߌ߬ߣߍ߫, bm, Gine officially the Republic of Guinea (french: République de Guinée), is a coastal country in West Africa. It borders the Atlantic Ocean to the we ...
to perform real-time surveillance during the ongoing
Ebola Ebola, also known as Ebola virus disease (EVD) and Ebola hemorrhagic fever (EHF), is a viral hemorrhagic fever in humans and other primates, caused by ebolaviruses. Symptoms typically start anywhere between two days and three weeks after becom ...
outbreak. In general, for limited resources IlluminaMiSeq, iSeq, Ion Torrent PGM, Oxford Nanopore, MinION are used. While for substantial resources Illumina NextSeq, NovaSeq, PacBio Sequel, Oxford Nanopore and PromethION are preferred. Moreover, for pathogen sequencing the use of controls is of fundamental importance ensuring mNGS assay quality and stability over time; PhiX is used as sequencing control, then the other controls include the positive control, an additional internal control (e.g., spiked DNA or other known pathogen) and a negative control (usually water sample). * Bioinformatic analysis: Whereas the sequencing itself has been made widely accessible and more user friendly, the data analysis and interpretation that follows still requires specialized
bioinformatics Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combi ...
expertise and appropriate
computation Computation is any type of arithmetic or non-arithmetic calculation that follows a well-defined model (e.g., an algorithm). Mechanical or electronic devices (or, historically, people) that perform computations are known as ''computers''. An es ...
al resources. The raw data from a sequencing platform is usually cleaned, trimmed, and filtered to remove low-quality and duplicate reads. Removal of the host
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
/
transcriptome The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The t ...
reads is performed to decrease background noise (e.g., host and environmental reads) and increase the frequency of pathogen reads. This step will also decrease downstream analysis time. Further background noise removal is achieved by mapping of sample reads to the reads from the negative control to ensure elimination of any contaminating reads, such as those associated with the reagents or sampling storage medium. The remaining reads are usually assembled de novo to produce long stretches of sequences called contigs.
Taxonomic Taxonomy is the practice and science of categorization or classification. A taxonomy (or taxonomical classification) is a scheme of classification, especially a hierarchical classification, in which things are organized into groups or types. ...
identification of the resulting contigs is performed by matching them to the genomes and sequences in nucleotide or protein databases; for this, various versions of BLAST are most commonly used. Advanced characterization of bacterial organisms can be also performed, allowing to obtain the necessary depth and breadth of coverage for genetic characterization results. Gene calling can be performed in a variety of ways, including RAST or using
NCBI The National Center for Biotechnology Information (NCBI) is part of the United States National Library of Medicine (NLM), a branch of the National Institutes of Health (NIH). It is approved and funded by the government of the United States. The ...
services at the time of full genome submission. Results of multiple annotation tools can be compared for accuracy and completeness and, if necessary, merged using BEACON. For characterization of antibiotic resistance genes, the Resistance Gene Identifier from the Comprehensive Antibiotic Resistance Database (CARD) is commonly used. To characterize virulence factor genes, ShortBRED offers analyses with a customized database from the Virulence Factor Database.


Applications in Infectious diseases

One way to detect these pathogens is detect part of their genome by metagenomics sequencing ( Next Generation Sequencing-mNGS), which can be targeted or untargeted.


Targeted

Because of that, the sensitivity to detect microorganisms that are being targeted usually increases, but this comes with a limitation of the amount of identifiable pathogens.


Untargeted

The untargeted analysis is a metagenomic
Shotgun sequencing In genetics, shotgun sequencing is a method used for sequencing random DNA strands. It is named by analogy with the rapidly expanding, quasi-random shot grouping of a shotgun. The Sanger sequencing#Method, chain-termination method of DNA sequencin ...
approach. The whole DNA and/or RNA is sequenced with this approach using universal primers. The resultant mNGS reads can be assembled into partial or complete genomes. These genome sequences allow to monitor hospital outbreaks to facilitate infection control and public health surveillance. Also, they can be used for subtyping (identificate a specific genetic variant of a microorganism). Untargeted mNGS is the most promising approach to analyse clinical samples and provide a comprehensive diagnosis of infections. Various groups have validated mNGS in Clinical Laboratory Improvement Amendments
CLIA
, such as meningitis or encephalitis, sepsis and pneumonia. This method can be very helpful in the settings where no exact infectious etiology is suspected. For example, in patients with suspected pneumonia, identification of the underlying infectious etiology as in
COVID-19 Coronavirus disease 2019 (COVID-19) is a contagious disease caused by a virus, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The first known case was COVID-19 pandemic in Hubei, identified in Wuhan, China, in December ...
has important clinical and public health implications. The traditional method consists on formulating a differential diagnosis on the basis of the patient's history, a clinical presentation, imaging findings and laboratory testing. But here it is suggested a different way of diagnosis;
metagenomic Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microb ...
next-generation sequencing (NGS) is a promising method because a comprehensive spectrum of a potential causes (viral, bacterial; fungus and parasitic) can be identify by a single assay. Below are some examples of the metagenomic sequencing application in infectious diseases diagnosis.


Examples


= Diagnosis of meningitis and encephalitis

= The traditional method used to the diagnosis of infectious diseases has been challenged in some cases: neuroinflammatory diseases, lack of diagnostic tests for rare pathogens and the limited availability and volume of the
Central Nervous System The central nervous system (CNS) is the part of the nervous system consisting primarily of the brain and spinal cord. The CNS is so named because the brain integrates the received information and coordinates and influences the activity of all par ...
(CNS) samples, because of the requirement for invasive procedures. Owing to these problems, some assays suggest a different way of diagnosis, which is the
metagenomic Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microb ...
next-generation sequencing (NGS). Summarising, NGS can identify a broad range of pathogens in a single test. Some studies evaluate the clinical usefulness of metagenomic NGS for diagnosis neurologic infections, in parallel with conventional microbiologic testing. It has been seen that the highest diagnostic yield resulted from a combination of metagenomics NGS of CSF and conventional testing, including serologic testing and testing of sample types other than CSF.Sometimes neurologic infections remain undiagnosed in a proportion of patients despite conventional testing. The results of metagenomic NGS can also be valuable even when concordant with results of conventional testing, not only providing reassurance that the conventionally obtained diagnosis is correct but also potentially detecting or ruling out coinfections, specially in immunocompromised patients.


= Study of antimicrobial resistance

= Nowadays to detect resistances of different microbes is used a technique called Antibiotic Sensitivity (AST), but several studies have discovered that bacterial resistance is in the genoma and it is transferred by horizontal way (HGT), so sequencing methods are being developed to ease the identification and characterization of those genomes and metagenomes. For the moment exist the following methods to detect antimicrobial resistances: * Antibiotic sensitivity: an advantage about this method is that it gives information for patients treatment. There are also some disadvantages, one of them is that this technique is only useful in cultivable bacteria for which competent personal is required. * Sequencing methods: some advantages of those methods over the AST technique are that it is rapid and sensible, it is useful on both bacteria that grow on artificial media and those that do not, and it permits compare studies in several organisms. One type of sequencing method can be used in preference to another depending on the type of the sample, for a genomic sampl
assembly-based methods
is used; for a metagenomic sample it is preferable to use read-based methods. Metagenomic sequencing methods have provided better results than genomics, due to these present less number of false negatives. Within metagenomics sequencing, functional metagenomic is a powerful approach for characterizing
resistome The resistome has been used to describe to two similar yet separate concepts: * All the antibiotic resistance genes in communities of both pathogenic and non-pathogenic bacteria. * All of the resistance genes in an organism, how they are inherited, ...
s; a metagenomic library is generated by cloning the total community DNA extracted from a sample into an
expression vector An expression vector, otherwise known as an expression construct, is usually a plasmid or virus designed for gene expression in cells. The vector is used to introduce a specific gene into a target cell, and can commandeer the cell's mechanism for ...
, this library is assayed for antimicrobial resistance by plating on selective media that are lethal to the wild-type host. The selected inserts from the surviving recombinant, antimicrobial-resistant host cells are then sequenced, and resulting sequences are subsequently assembled and annotated (PARFuMS). Functional metagenomics has enabled the discovery of several new antimicrobial resistance mechanisms and their related genes, one such example is the recently discovered
tetracycline Tetracycline, sold under various brand names, is an oral antibiotic in the tetracyclines family of medications, used to treat a number of infections, including Acne vulgaris, acne, cholera, brucellosis, plague (disease), plague, malaria, and sy ...
resistance mechanism by tetracycline destructases. It is important to incorporate not only the antimicrobial resistance gene sequence and mechanism but also the genomic context, host bacterial species and geographic location (
metagenome Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microb ...
).


= Pandemic preparedness

= Potentially dangerous pathogens such as
ebolavirus The genus ''Ebolavirus'' (- or ; - or ) is a virological taxon included in the family ''Filoviridae'' (filament-shaped viruses), order ''Mononegavirales''. The members of this genus are called ebolaviruses, and encode their genome in the form ...
es,
coronavirus Coronaviruses are a group of related RNA viruses that cause diseases in mammals and birds. In humans and birds, they cause respiratory tract infections that can range from mild to lethal. Mild illnesses in humans include some cases of the com ...
es etc., and the closest genetic relative for unknown pathogens, could be identified immediately, prompting further follow-up. Its role in the future of pandemic preparedness is anticipated and could exist as the earliest surveillance system we may have to detect outbreaks of unknown etiology and to respond in an opportune manner.


= Clinical microbiome analyses

= The use of mNGS to characterize the microbiome has made possible the development of bacterial probiotics to be administrated as pills, for example, as a treatment of ''
Clostridium difficile ''Clostridioides difficile'' (syn. ''Clostridium difficile'') is a bacterium that is well known for causing serious diarrheal infections, and may also cause colon cancer. Also known as ''C. difficile'', or ''C. diff'' (), is Gram-positive spec ...
''-associated diseases.


= Human host response analyses

= The studying of genes expression allows to characterize a lot of infections, for example infections due to ''
Staphylococcus aureus ''Staphylococcus aureus'' is a Gram-positive spherically shaped bacterium, a member of the Bacillota, and is a usual member of the microbiota of the body, frequently found in the upper respiratory tract and on the skin. It is often positive ...
'',
Lyme disease Lyme disease, also known as Lyme borreliosis, is a vector-borne disease caused by the ''Borrelia'' bacterium, which is spread by ticks in the genus ''Ixodes''. The most common sign of infection is an expanding red rash, known as erythema migran ...
, candidiasis,
tuberculosis Tuberculosis (TB) is an infectious disease usually caused by '' Mycobacterium tuberculosis'' (MTB) bacteria. Tuberculosis generally affects the lungs, but it can also affect other parts of the body. Most infections show no symptoms, in ...
and
influenza Influenza, commonly known as "the flu", is an infectious disease caused by influenza viruses. Symptoms range from mild to severe and often include fever, runny nose, sore throat, muscle pain, headache, coughing, and fatigue. These symptoms ...
. Also, this approach can be used for
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
classification. RNAseq analysis have a lot of other purposes and applications such as to identify novel or under appreciated host–microbial interactions directly from clinical samples, to make indirect diagnosis on the basis of a pathogen specific human host response and to discriminate infectious versus noninfectious causes of acute illness.


Challenges


Clinical utility

Even though, most of the metagenomics outcomes data generated consist of
case report In medicine, a case report is a detailed report of the symptoms, signs, diagnosis, treatment, and follow-up of an individual patient. Case reports may contain a demographic profile of the patient, but usually describe an unusual or novel occurrence ...
s which belie the increasing interest on diagnostic metagenomics. Accordingly, there is an overall lack of penetration of this approach into the clinical microbiology laboratory, as making a
diagnosis Diagnosis is the identification of the nature and cause of a certain phenomenon. Diagnosis is used in many different disciplines, with variations in the use of logic, analytics, and experience, to determine " cause and effect". In systems engin ...
with metagenomics is still basically only useful in the context of case report but not for a true daily diagnostic purpose. As of 2018, cost-effectiveness modelling of metagenomics in the diagnosis of
fever of unknown origin Fever of unknown origin (FUO) refers to a condition in which the patient has an elevated temperature (fever) but, despite investigations by a physician, no explanation has been found.
concluded that, even after limiting the cost of diagnostic metagenomics to $100 – 1000 per test, it would require 2.5-4 times the diagnostic yield of
computed tomography of the abdomen and pelvis Computed tomography of the abdomen and pelvis is an application of computed tomography (CT) and is a sensitive method for diagnosis of abdominal diseases. It is used frequently to determine stage of cancer and to follow progress. It is also a ...
in order to be cost neutral and cautioned against ‘widespread rush’ to deploy metagenomic testing. Furthermore, in the case of the discovery of potential novel infectious agents, usually only the positive results are published even though the vast majority of sequenced cases are negative, thus resulting in very biased information. Besides, most of the discovery work based in metagenomic that precedes the diagnostic-based work even mentioned the known agents detected while screening unsolved cases for completely novel causes.


Laboratory validity

To date, most published testing has been run in an unvalidated, unreportable manner. The ‘standard microbiological testing’ that samples are subjected to prior to metagenomics is variable and has not included reverse transcription-polymerase chain reaction (RT-PCR) testing for common respiratory viruses or, routinely 16S/ITS PCR testing. Given the relative costs of validating and performing metagenomic versus 16S/ITS PCR testing, the second one is considered an easier and more efficient option. A potential exception to the 16S/ITS testing is blood, given the huge amount of 16S sequence available, making clean cutoffs for diagnostic purposes problematic. Furthermore, almost all of the organisms detected by metagenomics for which there is an associated treatment and thus would be truly actionable are also detectable by 16S/ITS testing (or 16S/ITS-NGS). This makes questionable the utility of metagenomics in many diagnostic cases. One of the main points to accomplish laboratory validity is the presence of
reference standard A drug reference standard or pharmaceutical reference standard is a highly characterized material suitable to test the identity, strength, quality and purity of substances for pharmaceutical use and medicinal products. Pharmacopoeial reference s ...
s and controls when performing mNGS assays. They are needed to ensure the quality and stability of this technique over time.


Sense and sensitivity

In
clinical microbiology Medical microbiology, the large subset of microbiology that is applied to medicine, is a branch of medical science concerned with the prevention, diagnosis and treatment of infectious diseases. In addition, this field of science studies various ...
labs, the quantitation of microbial burden is considered a routine function as it is associated with the severity and progression of the disease. To achieve a good
quantitation In mathematics and empirical science, quantification (or quantitation) is the act of counting and measuring that maps human sense observations and experiences into quantities. Quantification in this sense is fundamental to the scientific method. ...
a high sensitivity of the technique is needed. Whereas interfering substances represent a common problem to
clinical chemistry Clinical chemistry (also known as chemical pathology, clinical biochemistry or medical biochemistry) is the area of chemistry that is generally concerned with analysis of bodily fluids for diagnostic and therapeutic purposes. It is an applied ...
or to PCR diagnostics, the degree of interference from host (for example, in
tissue biopsies A biopsy is a medical test commonly performed by a surgeon, interventional radiologist, or an interventional cardiologist. The process involves extraction of sample cells or tissues for examination to determine the presence or extent of a disease ...
) or nonpathogen nucleic acids (for example, in stool) in metagenomics is a new twist. In addition, due to the relative size of the human genome in comparison with microbial genomes the interference can occur at low levels of contaminating material. Another challenge for clinical metagenomics in regards to sensitivity is the diagnosis of
coinfection Coinfection is the simultaneous infection of a host by multiple pathogen species. In virology, coinfection includes simultaneous infection of a single cell by two or more virus particles. An example is the coinfection of liver cells with hepatiti ...
s where there are present high-titer pathogens that can generate biased results as they may disproportionately soak up reads and make difficult to distinguish the less predominant pathogens. In addition to issues with interfering substances, specially in the diagnosis area, accurate quantitation and sensitivities are essential as a confusion in the results can affect to a third person, the patient. For these reason, practitioners currently have to be keenly aware of the index-swapping issues associated with Illumina sequencing which can lead to trace incorrectly barcoded samples. Since metagenomics has typically been used on patients for whom every other test to date has been negative, questions surrounding analytical sensitivity haven been less germane. But, for ruling out infections causes being one of the more important roles for clinical metagenomics it is essential to be capable to perform a deep enough sequencing to achieve adequate sensitivities. One way could be developing novel library preparation techniques.


Cost considerations

As of 2018, the Illumina monopoly on high-quality next-generation sequencing reagents has meant that the sequencing reagents alone cost more than
FDA The United States Food and Drug Administration (FDA or US FDA) is a federal agency of the Department of Health and Human Services. The FDA is responsible for protecting and promoting public health through the control and supervision of food s ...
-approved syndromic testing panels. Also additional direct costs of metagenomics such as extraction, library preparation, and computational analysis have to be considered. In general, metagenomic sequencing is most useful and cost efficient for
pathogen In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
discovery when at least one of the following criteria are met: # the identification of the organism is not sufficient (one desires to go beyond discovery to produce data for genomic characterization), # a
coinfection Coinfection is the simultaneous infection of a host by multiple pathogen species. In virology, coinfection includes simultaneous infection of a single cell by two or more virus particles. An example is the coinfection of liver cells with hepatiti ...
is suspected, # other simpler assays are ineffective or will take an inordinate amount of time, # screening of environmental samples for previously undescribed or divergent pathogens.


See also

*
Sequencing In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
*
Clinical data management Clinical data management (CDM) is a critical process in clinical research, which leads to generation of high-quality, reliable, and statistically sound data from clinical trials. Clinical data management ensures collection, integration and availabi ...
*
Nanopore sequencing Nanopore sequencing is a third generation approach used in the sequencing of biopolymers — specifically, polynucleotides in the form of DNA or RNA. Using nanopore sequencing, a single molecule of DNA or RNA can be sequenced without the ne ...


References

{{reflist


External links


Clinical Laboratory Improvement Amendments
CDC
Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology
Article on functional metagenomic selections for resistance Genomics Metagenomics Pathogen genomics