Fungal DNA Barcoding
   HOME

TheInfoList



OR:

Fungal DNA barcoding is the process of identifying species of the biological kingdom
Fungi A fungus (: fungi , , , or ; or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and mold (fungus), molds, as well as the more familiar mushrooms. These organisms are classified as one ...
through the amplification and
sequencing In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
of specific DNA sequences and their comparison with sequences deposited in a DNA barcode database such as the ISHAM reference database, or the
Barcode of Life Data System The Barcode of Life Data System (commonly known as BOLD or BOLDSystems) is a web platform specifically devoted to DNA barcoding. It is a cloud-based data storage and analysis platform developed at the Centre for Biodiversity Genomics in Canada. It ...
(BOLD). In this attempt, DNA barcoding relies on universal genes that are ideally present in all fungi with the same degree of sequence variation. The interspecific variation, i.e., the variation between species, in the chosen DNA barcode gene should exceed the intraspecific (within-species) variation. A fundamental problem in fungal systematics is the existence of teleomorphic and anamorphic stages in their life cycles. These morphs usually differ drastically in their
phenotypic In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological propert ...
appearance, preventing a straightforward association of the asexual anamorph with the sexual teleomorph. Moreover, fungal species can comprise multiple strains that can vary in their morphology or in traits such as carbon- and nitrogen utilisation, which has often led to their description as different species, eventually producing long lists of synonyms. Fungal DNA barcoding can help to identify and associate anamorphic and teleomorphic stages of fungi, and through that to reduce the confusing multitude of fungus names. For this reason, mycologists were among the first to spearhead the investigation of species discrimination by means of DNA sequences, at least 10 years earlier than the DNA barcoding proposal for animals by Paul D. N. Hebert and colleagues in 2003, who popularised the term "DNA barcoding". The success of identification of fungi by means of DNA barcode sequences stands and falls with the quantitative (completeness) and qualitative (level of identification) aspect of the reference database. Without a database covering a broad taxonomic range of fungi, many identification queries will not result in a satisfyingly close match. Likewise, without a substantial curatorial effort to maintain the records at a high taxonomic level of identification, queries – even when they might have a close or exact match in the reference database – will not be informative if the closest match is only identified to
phylum In biology, a phylum (; : phyla) is a level of classification, or taxonomic rank, that is below Kingdom (biology), kingdom and above Class (biology), class. Traditionally, in botany the term division (taxonomy), division has been used instead ...
or
class Class, Classes, or The Class may refer to: Common uses not otherwise categorized * Class (biology), a taxonomic rank * Class (knowledge representation), a collection of individuals or objects * Class (philosophy), an analytical concept used d ...
level. Another crucial prerequisite for DNA barcoding is the ability to unambiguously trace the provenance of DNA barcode data back to the originally sampled specimen, the so-called voucher specimen. This is common practice in biology along with the description of new
taxa In biology, a taxon (back-formation from ''taxonomy''; : taxa) is a group of one or more populations of an organism or organisms seen by taxonomists to form a unit. Although neither is required, a taxon is usually known by a particular name and ...
, where the voucher specimens, on which the taxonomic description is based, become the type specimens. When the identity of a certain taxon (or a genetic sequence in the case of DNA barcoding) is in doubt, the original specimen can be re-examined to review and ideally solve the issue. Voucher specimens should be clearly labelled as such, including a permanent voucher identifier that unambiguously connects the specimen with the DNA barcode data derived from it. Furthermore, these voucher specimens should be deposited in publicly accessible repositories like scientific collections or
herbaria A herbarium (plural: herbaria) is a collection of preserved plant specimens and associated data used for scientific study. The specimens may be whole plants or plant parts; these will usually be in dried form mounted on a sheet of paper (called ...
to preserve them for future reference and to facilitate research involving the deposited specimens.


Barcode DNA markers


Internal Transcribed Spacer (ITS) – the primary fungal barcode

In fungi, the
Internal transcribed spacer Internal transcribed spacer (ITS) is the spacer DNA situated between the small-subunit ribosomal RNA (rRNA) and large-subunit rRNA genes in the chromosome or the corresponding transcribed region in the polycistronic rRNA precursor transcript. ...
(''ITS'') is a roughly 600
base pairs A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
long region in the
ribosomal Ribosomes () are macromolecular machines, found within all cells, that perform biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA molecules to fo ...
tandem repeat In genetics, tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other, e.g. ATTCG ATTCG ATTCG, in which the sequence ATTCG is repeated three times. Several protein ...
gene cluster A gene cluster is a group of two or more genes found within an organism's DNA that encode similar peptide, polypeptides or proteins which collectively share a generalized function and are often located within a few thousand base pairs of each othe ...
of the
nuclear genome Nuclear DNA (nDNA), or nuclear deoxyribonucleic acid, is the DNA contained within each cell nucleus of a eukaryotic organism. It encodes for the majority of the genome in eukaryotes, with mitochondrial DNA and plastid DNA coding for the rest. It ...
. The region is flanked by the DNA sequences for the ribosomal small subunit (SSU) or 18S subunit at the 5' end, and by the large subunit (LSU) or 28S subunit at the 3' end. The Internal Transcribed Spacer itself consists of two parts, ''ITS1'' and ''ITS2'', which are separated from each other by the 5.8S subunit nested between them. Like the flanking 18S and 28S subunits, the 5.8S subunit contains a highly conserved DNA sequence, as they code for structural parts of the
ribosome Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
, which is a key component in intracellular
protein synthesis Protein biosynthesis, or protein synthesis, is a core biological process, occurring inside cells, balancing the loss of cellular proteins (via degradation or export) through the production of new proteins. Proteins perform a number of critica ...
. Due to several advantages of ''ITS'' (see below) and a comprehensive amount of sequence data accumulated in the 1990s and early 2000s, Begerow et al. (2010) and Schoch et al. (2012) proposed the ''ITS'' region as primary DNA barcode region for the genetic identification of
fungi A fungus (: fungi , , , or ; or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and mold (fungus), molds, as well as the more familiar mushrooms. These organisms are classified as one ...
. UNITE is an open ''ITS'' barcoding database for fungi and all other eukaryotes.


Primers

The conserved flanking regions of 18S and 28S serve as anchor points for the primers used for PCR amplification of the ''ITS'' region. Moreover, the conserved nested 5.8S region allows for the construction of "internal" primers, i.e., primers attaching to complementary sequences within the ITS region. White et al. (1990) proposed such internal primers, named ITS2 and ITS3, along with the flanking primers ITS1 and ITS4 in the 18S and the 28S subunit, respectively. Due to their almost universal applicability to ITS sequencing in fungi, these primers are still in wide use today. Optimised primers specifically for ITS sequencing in
Dikarya Dikarya is a subkingdom of Fungi that includes the divisions Ascomycota and Basidiomycota, both of which in general produce dikaryons, may be hypha, filamentous or unicellular, but are always without flagella. The Dikarya are most of the so-calle ...
(comprising
Basidiomycota Basidiomycota () is one of two large divisions that, together with the Ascomycota, constitute the subkingdom Dikarya (often referred to as the "higher fungi") within the kingdom Fungi. Members are known as basidiomycetes. More specifically, Basi ...
and
Ascomycota Ascomycota is a phylum of the kingdom Fungi that, together with the Basidiomycota, forms the subkingdom Dikarya. Its members are commonly known as the sac fungi or ascomycetes. It is the largest phylum of Fungi, with over 64,000 species. The def ...
) have been proposed by Toju et al. (2012). For the majority of fungi, the ITS primers proposed by White et al. (1990) have become the standard primers used for PCR amplification. These primers are: Forward primers: * ''ITS1'': * ''ITS2'': * ''ITS5'': Reverse primers: * ''ITS3'': * ''ITS4'':


Advantages and shortcomings

A major advantage of using the ITS region as molecular marker and fungal DNA barcode is that the entire ribosomal gene cluster is arranged in tandem repeats, i.e., in multiple copies. This allows for its PCR amplification and
Sanger sequencing Sanger sequencing is a method of DNA sequencing that involves electrophoresis and is based on the random incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication. After first being developed by Fred ...
even from small material samples (given the DNA is not fragmented due to age or other degenerative influences). Hence, a high PCR success rate is usually observed when amplifying ''ITS''. However, this success rate varies greatly among fungal groups, from 65% in non-Dikarya (including the now
paraphyletic Paraphyly is a taxonomic term describing a grouping that consists of the grouping's last common ancestor and some but not all of its descendant lineages. The grouping is said to be paraphyletic ''with respect to'' the excluded subgroups. In co ...
Mucoromycotina, the
Chytridiomycota Chytridiomycota are a division of zoosporic organisms in the kingdom Fungi, informally known as chytrids. The name is derived from the Ancient Greek ('), meaning "little pot", describing the structure containing unreleased zoospores. Chytrid ...
and the
Blastocladiomycota Blastocladiomycota is one of the currently recognized phyla within the kingdom Fungi.Hibbett DS et al. 2007. A higher-level phylogenetic classification of the fungi. Mycological Research 111:509–47. Blastocladiomycota was originally the order ...
) to 100% in
Saccharomycotina Saccharomycotina is a subdivision (subphylum) of the division (phylum) Ascomycota in the kingdom Fungi. It comprises most of the ascomycete yeasts. The members of Saccharomycotina reproduce by budding and they do not produce ascocarps (fruiting ...
and
Basidiomycota Basidiomycota () is one of two large divisions that, together with the Ascomycota, constitute the subkingdom Dikarya (often referred to as the "higher fungi") within the kingdom Fungi. Members are known as basidiomycetes. More specifically, Basi ...
(with the exception of very low success in
Pucciniomycotina Pucciniomycotina is a subdivision of fungus within the division Basidiomycota. The subdivision contains 10 classes, 21 orders, and 38 families. Over 8400 species of Pucciniomycotina have been described - more than 8% of all described fungi. T ...
). Furthermore, the choice of primers for ''ITS'' amplification can introduce biases towards certain
taxonomic 280px, Generalized scheme of taxonomy Taxonomy is a practice and science concerned with classification or categorization. Typically, there are two parts to it: the development of an underlying scheme of classes (a taxonomy) and the allocation ...
fungus groups. For example, the "universal" ''ITS'' primers fail to amplify about 10% of the tested fungal specimens. The tandem repeats of the ribosomal gene cluster cause the problem of significant intragenomic sequence heterogeneity observed among ''ITS'' copies of several fungal groups. In Sanger sequencing, this will cause ''ITS'' sequence reads of different lengths to superpose each other, potentially rendering the resulting chromatograph unreadable. Furthermore, because of the non-coding nature of the ''ITS'' region that can lead to a substantial amount of
indel Indel (insertion-deletion) is a molecular biology term for an insertion or deletion of bases in the genome of an organism. Indels ≥ 50 bases in length are classified as structural variants. In coding regions of the genome, unless the lengt ...
s, it is impossible to consistently align ''ITS'' sequences from highly divergent species for further bigger-scale phylogenetic analyses. The degree of intragenomic sequence heterogeneity can be investigated in more detail through
molecular cloning Molecular cloning is a set of experimental methods in molecular biology that are used to assemble recombinant DNA molecules and to direct their DNA replication, replication within Host (biology), host organisms. The use of the word ''cloning'' re ...
of the initially PCR-amplified ITS sequences, followed by sequencing of the clones. This procedure of initial PCR amplification, followed by cloning of the
amplicon In molecular biology, an amplicon is a piece of DNA or RNA that is the source and/or product of amplification or replication events. It can be formed artificially, using various methods including polymerase chain reactions (PCR) or ligase chain ...
s and finally sequencing of the cloned PCR products is the most common approach of obtaining ''ITS'' sequences for
DNA metabarcoding Metabarcoding is the DNA barcoding, barcoding of DNA/RNA (or Environmental DNA, eDNA/Environmental DNA, eRNA) in a manner that allows for the simultaneous identification of many taxa within the same sample. The main difference between barcodin ...
of environmental samples, in which a multitude of different fungal species can be present simultaneously. However, this approach of sequencing after cloning was rarely done for the ''ITS'' sequences that make up the reference libraries used for DNA barcode-aided identification, thus potentially giving an underestimate of the existing ''ITS'' sequence variation in many samples. The
weighted arithmetic mean The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each of the data points contributing equally to the final average, some data points contribute more than others. Th ...
of the intraspecific (within-species) ''ITS'' variability among fungi is 2.51%. This variability, however, can range from 0% for example in ''
Serpula lacrymans ''Serpula lacrymans'' is a species of fungi known for causing dry rot. It is a basidiomycete in the order Boletales. It has the ability to rapidly colonise sites through unique and highly specialised mycelium which also leads to greater degrada ...
'' (n=93 samples) over 0.19% in ''
Tuber melanosporum ''Tuber melanosporum'', called the black truffle, Périgord truffle or French black truffle, is a species of truffle native to Southern Europe. It is one of the most expensive edible fungi in the world. In 2013, the truffle cost between 1,000 a ...
'' (n=179) up to 15.72% in ''
Rhizoctonia solani ''Rhizoctonia solani'' is a species of fungus in the order Cantharellales. Basidiocarps (fruit bodies) are thin, effused, and web-like, but the fungus is more typically encountered in its anamorphic state, as hyphae and sclerotia. The name ' ...
'' (n=608), or even 24.75% in '' Pisolithus tinctorius'' (n=113). In cases of high intraspecific ''ITS'' variability, the application of a threshold of 3% sequence variability – a canonical upper value for intraspecific variation – will therefore lead to a higher estimate of
operational taxonomic unit An operational taxonomic unit (OTU) is an operational definition used to classify groups of closely related individuals. The term was originally introduced in 1963 by Robert R. Sokal and Peter H. A. Sneath in the context of numerical taxonomy, wh ...
s (OTUs), i.e., putative species, than there actually are in a sample. In the case of medically relevant fungal species, a more strict threshold of 2.5% ''ITS'' variability allows only around 75% of all species to be accurately identified to the species level. On the other hand, morphologically well-defined, but evolutionarily young
species complex In biology, a species complex is a group of closely related organisms that are so similar in appearance and other features that the boundaries between them are often unclear. The taxa in the complex may be able to hybridize readily with each oth ...
es or
sibling species In biology, a species complex is a group of closely related organisms that are so similar in appearance and other features that the boundaries between them are often unclear. The taxa in the complex may be able to hybridize readily with each oth ...
may only differ (if at all) in a few nucleotides of the ''ITS'' sequences. Solely relying on ''ITS'' barcode data for the identification of such species pairs or complexes may thus obscure the actual diversity and might lead to misidentification if not accompanied by the investigation of morphological and ecological features and/or comparison of additional diagnostic
genetic marker A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can ...
s. For some taxa, ''ITS'' (or its ''ITS2'' part) is not variable enough as fungal DNA barcode, as for example has been shown in ''
Aspergillus ' () is a genus consisting of several hundred mold species found in various climates worldwide. ''Aspergillus'' was first catalogued in 1729 by the Italian priest and biologist Pier Antonio Micheli. Viewing the fungi under a microscope, Miche ...
'', ''
Cladosporium ''Cladosporium'' is a genus of fungi including some of the most common indoor and outdoor molds. Some species are endophytes or plant pathogens, while others parasitize fungi. Description Species produce olive-green to brown or black colonie ...
'', ''
Fusarium ''Fusarium'' (; ) is a large genus of filamentous fungi, part of a group often referred to as hyphomycetes, widely distributed in soil and associated with plants. Most species are harmless saprobes, and are relatively abundant members of the s ...
'' and ''
Penicillium ''Penicillium'' () is a genus of Ascomycota, ascomycetous fungus, fungi that is part of the mycobiome of many species and is of major importance in the natural environment, in food spoilage, and in food and drug production. Some members of th ...
''. Efforts to define a universally applicable threshold value of ''ITS'' variability that demarcates intraspecific from interspecific (between-species) variability thus remain futile. Nonetheless, the probability of correct species identification with the ''ITS'' region is high in the
Dikarya Dikarya is a subkingdom of Fungi that includes the divisions Ascomycota and Basidiomycota, both of which in general produce dikaryons, may be hypha, filamentous or unicellular, but are always without flagella. The Dikarya are most of the so-calle ...
, and especially so in
Basidiomycota Basidiomycota () is one of two large divisions that, together with the Ascomycota, constitute the subkingdom Dikarya (often referred to as the "higher fungi") within the kingdom Fungi. Members are known as basidiomycetes. More specifically, Basi ...
, where even the ''ITS1'' part is often sufficient to identify the species. However, its discrimination power is partly superseded by that of the DNA-directed RNA polymerase II subunit ''RPB1'' (see also below). Due to the shortcomings of ''ITS as primary fungal DNA barcode, the necessity of establishing a second DNA barcode marker was expressed. Several attempts were made to establish other genetic markers that could serve as additional DNA barcodes, similar to the situation in
plant Plants are the eukaryotes that form the Kingdom (biology), kingdom Plantae; they are predominantly Photosynthesis, photosynthetic. This means that they obtain their energy from sunlight, using chloroplasts derived from endosymbiosis with c ...
s, where the plastidial genes ''rbcL'', ''matK'' and ''trnH-psbA'', as well as the nuclear ''ITS'' are often used in combination for DNA barcoding.


Translational elongation factor 1α (TEF1α) – the secondary fungal barcode

The translational elongation factor 1α is part of the eucaryotic elongation factor 1 complex, whose main function is to facilitate the elongation of the
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
chain of a
polypeptide Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty ...
during the
translation Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
process of
gene expression Gene expression is the process (including its Regulation of gene expression, regulation) by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, proteins or non-coding RNA, ...
. Stielow et al. (2015) investigated the ''TEF1α'' gene, among a number of others, as potential
genetic marker A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation (which may arise due to mutation or alteration in the genomic loci) that can ...
for fungal DNA barcoding. The ''TEF1α'' gene coding for the translational elongation factor 1α is generally considered to have a slow
mutation rate In genetics, the mutation rate is the frequency of new mutations in a single gene, nucleotide sequence, or organism over time. Mutation rates are not constant and are not limited to a single type of mutation; there are many different types of mu ...
, and it is therefore generally better suited for investigating older splits deeper in the phylogenetic history of an organism group. Despite this, the authors conclude that ''TEF1α'' is the most promising candidate for an additional DNA barcode marker in fungi as it also features sequence regions of higher mutation rates. Following this, a quality-controlled reference database was established and merged with the previously existing ISHAM-ITS database for fungal ITS DNA barcodes to form the ISHAM database. ''TEF1α'' has been successfully used to identify a new species of ''
Cantharellus ''Cantharellus'' is a genus of mushrooms, commonly known as chanterelles (), a name which can also refer to the type species, '' Cantharellus cibarius''. They are mycorrhizal fungi, meaning they form symbiotic associations with plants. Ch ...
'' from
Texas Texas ( , ; or ) is the most populous U.S. state, state in the South Central United States, South Central region of the United States. It borders Louisiana to the east, Arkansas to the northeast, Oklahoma to the north, New Mexico to the we ...
and distinguish it from a morphologically similar species. In the genera '' Ochroconis'' and '' Verruconis'' (Sympoventuriaceae, Venturiales), however, the marker does not allow distinction of all species. ''TEF1α'' has also been used in phylogenetic analyses at the genus level, e.g. in the case of ''
Cantharellus ''Cantharellus'' is a genus of mushrooms, commonly known as chanterelles (), a name which can also refer to the type species, '' Cantharellus cibarius''. They are mycorrhizal fungi, meaning they form symbiotic associations with plants. Ch ...
'' and the entomopathogenic '' Beauveria'', and for the phylogenetics of early-diverging fungal lineages.


Primers

''TEF1α'' primers used in the broad-scale screening of the performance of DNA barcode gene candidates of Stielow et al. (2015) were the forward primer ''EF1-983F'' with the sequence , and the reverse primer ''EF1-1567R'' with the sequence . In addition, a number of new primers was developed, with the primer pair in bold resulting in a high average amplification success of 88%: Forward primers: * ''EF1-1018F'': * ''EF1-1002F'': * ''Al33_alternative_f'': * ''EF1_alternative_3f'': Reverse primers: * ''EF1-1620R'': * ''EF1-1688R'': * ''EF1_alternative_3r'': Primers used for the investigation of Rhizophydiales and especially '' Batrachochytrium dendrobatidis'', a pathogen of amphibia, are the forward primer ''tef1F'' with the nucleotide sequence , and the reverse primer ''tef1R'' with the sequence . These primers also successfully amplified the majority of ''
Cantharellus ''Cantharellus'' is a genus of mushrooms, commonly known as chanterelles (), a name which can also refer to the type species, '' Cantharellus cibarius''. They are mycorrhizal fungi, meaning they form symbiotic associations with plants. Ch ...
'' species investigated by Buyck et al. (2014), with the exception of a few species for which more specific primers were developed: the forward primer ''tef-1Fcanth'' with the sequence , and the reverse primer ''tef-1Rcanth'' with the sequence .


D1/D2 domain of the LSU ribosomal RNA

The D1/D2 domain is part of the nuclear large subunit ( 28S)
ribosomal Ribosomes () are macromolecular machines, found within all cells, that perform biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA molecules to fo ...
RNA, and it is therefore located in the same ribosomal tandem repeat gene cluster as the Internal Transcribed Spacer (''ITS''). But unlike the non-coding ITS sequences, the D1/D2 domain contains coding sequence. With about 600 base pairs it is about the same nucleotide sequence length as ''ITS'', which makes amplification and sequencing rather straightforward, an advantage that has led to the accumulation of an extensive amount of ''D1/D2'' sequence data especially for
yeast Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom (biology), kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are est ...
s. Regarding the molecular identification of basidiomycetous yeasts, ''D1/D2'' (or ''ITS'') can be used alone. However, Fell et al. (2000) and Scorzetti et al. (2002) recommend the combined analysis of the ''D1/D2'' and ''ITS'' regions, a practice that later became the standard required information for describing new taxa of asco- and basidiomycetous yeasts. When attempting to identify early diverging fungal lineages, the study of Schoch et al. (2012), comparing the identification performance of different genetic markers, showed that the large subunit (as well as the small subunit) of the ribosomal RNA performs better than ''ITS'' or ''RPB1''.


Primers

For basidiomycetous yeasts, the forward primer ''F63'' with the sequence , and the reverse primer ''LR3'' with the sequence have been successfully used for PCR amplification of the D1/D23 domain. The D1/D2 domain of ascomycetous yeasts like ''
Candida Candida, or Cándida (Spanish), may refer to: Biology and medicine * ''Candida'' (fungus), a genus of yeasts ** Candidiasis, an infection by ''Candida'' organisms * Malvasia Candida, a variety of grape Places * Candida, Campania, a ''comu ...
'' can be amplified with the forward primer ''NL-1'' (same as ''F63'') and the reverse primer ''NL-4'' (same as ''LR3'').


RNA polymerase II subunit RPB1

The RNA polymerase II subunit ''RPB1'' is the largest subunit of the
RNA polymerase II RNA polymerase II (RNAP II and Pol II) is a Protein complex, multiprotein complex that Transcription (biology), transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNA pol ...
. In ''
Saccharomyces cerevisiae ''Saccharomyces cerevisiae'' () (brewer's yeast or baker's yeast) is a species of yeast (single-celled fungal microorganisms). The species has been instrumental in winemaking, baking, and brewing since ancient times. It is believed to have be ...
'', it is encoded by the ''RPO21'' gene. PCR amplification success of ''RPB1'' is very taxon-dependent, ranging from 70 to 80% in
Ascomycota Ascomycota is a phylum of the kingdom Fungi that, together with the Basidiomycota, forms the subkingdom Dikarya. Its members are commonly known as the sac fungi or ascomycetes. It is the largest phylum of Fungi, with over 64,000 species. The def ...
to 14% in early diverging fungal lineages. Apart from the early diverging lineages, ''RPB1'' has a high rate of species identification in all fungal groups. In the species-rich
Pezizomycotina Pezizomycotina is the largest subdivision of Ascomycota, containing the filamentous ascomycetes and most lichenized fungi. It is more or less synonymous with the older taxon Euascomycota. These fungi reproduce by fission rather than budding. Thi ...
it even outperforms ITS. In a study comparing the identification performance of four genes, ''RPB1'' was among the most effective genes when combining two genes in the analysis: combined analysis with either ''ITS'' or with the large subunit ribosomal RNA yielded the highest identification success. Other studies also used RPB2, the second-largest subunit of the RNA polymerase II, e.g. for studying the phylogenetic relationships among species of the genus ''
Cantharellus ''Cantharellus'' is a genus of mushrooms, commonly known as chanterelles (), a name which can also refer to the type species, '' Cantharellus cibarius''. They are mycorrhizal fungi, meaning they form symbiotic associations with plants. Ch ...
'' or for a phylogenetic study shedding light on the relationships among early-diverging lineages in the fungal kingdom.


Primers

Primers successfully amplifying RPB1 especially in Ascomycota are the forward primer ''RPB1-Af'' with the sequence , and the reverse primer ''RPB1-Ac-RPB1-Cr'' with the sequence .


Intergenic Spacer (IGS) of ribosomal RNA genes

The Intergenic Spacer (''IGS'') is the region of
non-coding DNA Non-coding DNA (ncDNA) sequences are components of an organism's DNA that do not encode protein sequences. Some non-coding DNA is transcribed into functional non-coding RNA molecules (e.g. transfer RNA, microRNA, piRNA, ribosomal RNA, and reg ...
between individual
tandem repeat In genetics, tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other, e.g. ATTCG ATTCG ATTCG, in which the sequence ATTCG is repeated three times. Several protein ...
s of the
ribosomal Ribosomes () are macromolecular machines, found within all cells, that perform biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA molecules to fo ...
gene cluster A gene cluster is a group of two or more genes found within an organism's DNA that encode similar peptide, polypeptides or proteins which collectively share a generalized function and are often located within a few thousand base pairs of each othe ...
in the
nuclear genome Nuclear DNA (nDNA), or nuclear deoxyribonucleic acid, is the DNA contained within each cell nucleus of a eukaryotic organism. It encodes for the majority of the genome in eukaryotes, with mitochondrial DNA and plastid DNA coding for the rest. It ...
, as opposed to the Internal Transcribed Spacer (ITS) that is situated within these tandem repeats. ''IGS'' has been successfully used for the differentiation of strains of ''Xanthophyllomyces dendrorhous'' as well as for species distinction in the
psychrophilic Psychrophiles or cryophiles (adj. ''psychrophilic'' or ''cryophilic'') are extremophile, extremophilic organisms that are capable of cell growth, growth and reproduction in low temperatures, ranging from to . They are found in places that are pe ...
genus ''Mrakia'' (
Cystofilobasidiales The Cystofilobasidiales are an order of fungi in the class Tremellomycetes of the Basidiomycota. They usually exhibit a life phase of free-living yeasts. The order contains two families with seven genera and some 25 species A species () is ...
). Due to these results, ''IGS'' has been recommended as a genetic marker for additional differentiation (along with D1/D2 and ''ITS'') of closely related species and even strains within one species in basidiomycete yeasts. The recent discovery of additional non-coding RNA genes in the IGS region of some basidiomycetes cautions against uncritical use of ''IGS'' sequences for DNA barcoding and phylogenetic purposes.


Other genetic markers

The
cytochrome c oxidase subunit I Cytochrome c oxidase I (COX1) also known as mitochondrially encoded cytochrome c oxidase I (MT-CO1) is a protein that is encoded by the ''MT-CO1'' gene in eukaryotes. The gene is also called ''COX1'', ''CO1'', or ''COI''. Cytochrome c oxidase ...
(''COI'') gene outperforms ''ITS'' in DNA barcoding of ''
Penicillium ''Penicillium'' () is a genus of Ascomycota, ascomycetous fungus, fungi that is part of the mycobiome of many species and is of major importance in the natural environment, in food spoilage, and in food and drug production. Some members of th ...
'' (Ascomycota) species, with species-specific barcodes for 66% of the investigated species versus 25% in the case of ''ITS''. Furthermore, a part of the
β-Tubulin Tubulin in molecular biology can refer either to the tubulin protein superfamily of globular proteins, or one of the member proteins of that superfamily. α- and β-tubulins polymerize into microtubules, a major component of the eukaryotic cytos ...
A (''BenA'') gene exhibits a higher taxonomic resolution in distinguishing ''Penicillium'' species as compared to ''COI'' and ''ITS''. In the closely related ''
Aspergillus niger ''Aspergillus niger'' is a mold classified within the ''Nigri'' section of the ''Aspergillus'' genus. The ''Aspergillus'' genus consists of common molds found throughout the environment within soil and water, on vegetation, in fecal matter, on de ...
'' complex, however, ''COI'' is not variable enough for species discrimination. In ''
Fusarium ''Fusarium'' (; ) is a large genus of filamentous fungi, part of a group often referred to as hyphomycetes, widely distributed in soil and associated with plants. Most species are harmless saprobes, and are relatively abundant members of the s ...
'', ''COI'' exhibits paralogues in many cases, and homologous copies are not variable enough to distinguish species. ''COI'' also performs poorly in the identification of basidiomycote rusts of the
order Order, ORDER or Orders may refer to: * A socio-political or established or existing order, e.g. World order, Ancien Regime, Pax Britannica * Categorization, the process in which ideas and objects are recognized, differentiated, and understood ...
Pucciniales Rusts are fungal plant pathogens of the order Pucciniales (previously known as Uredinales) causing plant fungal diseases. An estimated 168 rust genera and approximately 7,000 species, more than half of which belong to the genus '' Puccinia'', a ...
due to the presence of
intron An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e., a region inside a gene."The notion of the cistron .e., gen ...
s. Even when the obstacle of introns is overcome, ''ITS'' and the
LSU rRNA Large subunit ribosomal ribonucleic acid (LSU rRNA) is the largest of the two major RNA components of the ribosome Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (b ...
( 28S) outperform ''COI'' as DNA barcode marker. In the subdivision Agaricomycotina, PCR amplification success was poor for ''COI'', even with multiple primer combinations. Successfully sequenced ''COI'' samples also included introns and possible paralogous copies, as reported for ''Fusarium''. ''
Agaricus bisporus ''Agaricus bisporus'', commonly known as the cultivated mushroom, is a basidiomycete mushroom native to grasslands in Eurasia and North America. It is cultivated in more than 70 countries and is one of the most commonly and widely consumed ...
'' was found to contain up to 19 introns, making the ''COI'' gene of this species the longest recorded, with 29,902 nucleotides. Apart from the substantial troubles of sequencing ''COI'', ''COI'' and ''ITS'' generally perform equally well in distinguishing basidiomycote mushrooms.
Topoisomerase DNA topoisomerases (or topoisomerases) are enzymes that catalyze changes in the topological state of DNA, interconverting relaxed and supercoiled forms, linked (catenated) and unlinked species, and knotted and unknotted DNA. Topological issues in ...
I (''
TOP1 DNA topoisomerase 1 is an enzyme that in humans is encoded by the ''TOP1'' gene. It is a DNA topoisomerase, an enzyme that catalyzes the transient breaking and rejoining of a single strand of DNA. Function This gene encodes a DNA topoisomeras ...
'') was investigated as additional DNA barcode candidate by Lewis et al. (2011) based on
proteome A proteome is the entire set of proteins that is, or can be, expressed by a genome, cell, tissue, or organism at a certain time. It is the set of expressed proteins in a given type of cell or organism, at a given time, under defined conditions. P ...
data, with the developed universal primer pair being subsequently tested on actual samples by Stielow et al. (2015). The forward primer ''TOP1_501-F'' with the sequence (where the first section marks the universal M13 forward primer tail, the second part consisting of ACGAT a spacer, and the third part the actual primer) and reverse the primer ''TOP1_501-R'' with (the first section marking the universal M13 reverse primer tail, the second part the actual TOP1 reverse primer) amplify a fragment of approximately 800 base pairs. ''TOP1'' was found to be a promising DNA barcode candidate marker for ascomycetes, where it can distinguish species in ''Fusarium'' and ''Penicillium'' – genera, in which the primary ''ITS'' barcode performs poorly. However, poor amplification success with the ''TOP1'' universal primers is observed in early-diverging fungal lineages and basidiomycetes except
Pucciniomycotina Pucciniomycotina is a subdivision of fungus within the division Basidiomycota. The subdivision contains 10 classes, 21 orders, and 38 families. Over 8400 species of Pucciniomycotina have been described - more than 8% of all described fungi. T ...
(where ''ITS'' PCR success is poor). Like ''TOP1'', the Phosphoglycerate kinase (''PGK'') was among the genetic markers investigated by Lewis et al. (2011) and Stielow et al. (2015) as potential additional fungal DNA barcodes. A number of universal primers was developed, with the PGK533 primer pair, amplifying a circa 1,000 base pair fragment, being the most successful in most fungi except Basidiomycetes. Like ''TOP1'', ''PGK'' is superior to ''ITS'' in species differentiation in ascomycete genera like ''Penicillium'' and ''Fusarium'', and both ''PGK'' and ''TOP1'' perform as good as ''TEF1α'' in distinguishing closely related species in these genera.


Applications


Food safety

A
citizen science The term citizen science (synonymous to terms like community science, crowd science, crowd-sourced science, civic science, participatory monitoring, or volunteer monitoring) is research conducted with participation from the general public, or am ...
project investigated the consensus between the labelling of dried, commercially sold mushrooms and the DNA barcoding results from these mushrooms. All samples were found to be correctly labelled. However, an obstacle was the unreliability of ITS reference databases in terms of the level of identification, as the two databases (GenBank and UNITE) used for ITS sequence comparison gave different identification results in some of the samples. Correct labelling of mushrooms intended for consumption was also investigated by Raja et al. (2016), who used the ''ITS'' region for DNA barcoding from dried mushrooms,
mycelium Mycelium (: mycelia) is a root-like structure of a fungus consisting of a mass of branching, thread-like hyphae. Its normal form is that of branched, slender, entangled, anastomosing, hyaline threads. Fungal colonies composed of mycelium are fo ...
powders, and
dietary supplement A dietary supplement is a manufactured product intended to supplement a person's diet by taking a pill (pharmacy), pill, capsule (pharmacy), capsule, tablet (pharmacy), tablet, powder, or liquid. A supplement can provide nutrients eithe ...
capsules. In only 30% of the 33 samples did the product label correctly state the
binomial Binomial may refer to: In mathematics *Binomial (polynomial), a polynomial with two terms *Binomial coefficient, numbers appearing in the expansions of powers of binomials *Binomial QMF, a perfect-reconstruction orthogonal wavelet decomposition * ...
fungus name. In another 30%, the genus name was correct, but the species epithet did not match, and in 15% of the cases not even the
genus Genus (; : genera ) is a taxonomic rank above species and below family (taxonomy), family as used in the biological classification of extant taxon, living and fossil organisms as well as Virus classification#ICTV classification, viruses. In bino ...
name of the binomial name given on the product label matched the result of the obtained ''ITS'' barcode. For the remaining 25% of the samples, no ''ITS'' sequence could be obtained. Xiang et al. (2013) showed that using ''ITS'' sequences, the commercially highly valuable the caterpillar fungus ''
Ophiocordyceps sinensis ''Ophiocordyceps sinensis'' (synonym (taxonomy), synonym ''Cordyceps sinensis''), known colloquially as caterpillar fungus, is an entomopathogenic fungus (a fungus that grows on insects) in the family Ophiocordycipitaceae. It is mainly found ...
'' and its counterfeit versions ('' O. nutans'', '' O. robertsii'', ''
Cordyceps ''Cordyceps'' is a genus of ascomycete fungi (sac fungi) that includes over 260 species worldwide, many of which are parasitic. Diverse variants of cordyceps have had more than 1,500 years of use in Chinese medicine. Most ''Cordyceps'' specie ...
cicadae'', ''C. gunnii'', '' C. militaris'', and the plant ''
Ligularia ''Ligularia'' (leopard plant) is a genus of Old World herbaceous perennial plants in the groundsel tribe within the sunflower family. They have yellow or orange composite flower heads with brown or yellow central disc florets, and are native ...
hodgsonii'') can be reliably identified to the species level.


Pathogenic fungi

A study by Vi Hoang et al. (2019) focused on the identification accuracy of pathogenic fungi using both the primary (''ITS'') and secondary (''TEF1α'') barcode markers. Their results show that in ''Diutina'' (a segregate of ''
Candida Candida, or Cándida (Spanish), may refer to: Biology and medicine * ''Candida'' (fungus), a genus of yeasts ** Candidiasis, an infection by ''Candida'' organisms * Malvasia Candida, a variety of grape Places * Candida, Campania, a ''comu ...
'') and ''
Pichia ''Pichia'' (''Hansenula'' and '' Hyphopichia'' are obsolete synonyms) is a genus of yeasts in the family Pichiaceae with spherical, elliptical, or oblong acuminate cells. ''Pichia'' is a teleomorph, and forms hat-shaped, hemispherical, or roun ...
'', species identification is straightforward with either the ''ITS'' or the ''TEF1α'' as well as with a combination of both. In the ''Lodderomyces'' assemblage, which contains three of the five most common pathogenic ''Candida'' species ('' C. albicans'', '' C. dubliniensis'', and '' C. parapsilosis''), ''ITS'' failed to distinguish '' Candida orthopsilosis'' and ''C. parapsilosis'', which are part of the ''Candida parapsilosis'' complex of closely related species. ''TEF1α'', on the other hand, allowed identification of all investigated species of the ''Lodderomyces'' clade. Similar results were obtained for '' Scedosporium'' species, which are attributed to a wide range of localised to invasive diseases: ''ITS'' could not distinguish between '' S. apiospermum'' and ''S. boydii'', whereas with ''TEF1α'' all investigated species of this genus could be accurately identified. This study therefore underlines the usefulness of applying more than one DNA barcoding marker for fungal species identification.


Conservation of cultural heritage

Fungal DNA barcoding has been successfully applied to the investigation of
foxing Foxing is an age-related process of deterioration that causes spots and browning on paper documents such as books, postage stamps, old paper money and certificates, and on textiles such as clothing and artists' canvasses. The name may be a var ...
phenomena, a major concern in the conservation of paper documents. Sequeira et al. (2019) sequenced ''ITS'' from foxing stains and found '' Chaetomium globosum'', '' Ch. murorum'', '' Ch. nigricolor'', ''
Chaetomium ''Chaetomium'' is a genus of fungi in the Chaetomiaceae family. It is a dematiaceous (dark-walled) Mold (fungus), mold normally found in soil, air, cellulose and plant debris. According to the ''Dictionary of the Fungi'' (10th edition, 2008), th ...
'' sp., '' Eurotium rubrum'', '' Myxotrichum deflexum'', ''
Penicillium chrysogenum ''Penicillium chrysogenum'' (formerly known as ''Penicillium notatum'') is a species of fungus in the genus ''Penicillium''. It is common in temperate and subtropical regions and can be found on salted food products, but it is mostly found in in ...
'', '' P. citrinum'', '' P. commune'', ''
Penicillium ''Penicillium'' () is a genus of Ascomycota, ascomycetous fungus, fungi that is part of the mycobiome of many species and is of major importance in the natural environment, in food spoilage, and in food and drug production. Some members of th ...
'' sp. and '' Stachybotrys chartarum'' to inhabit the investigated paper stains. Another study investigated fungi that act as biodeteriorating agents in the
Old Cathedral of Coimbra The Old Cathedral of Coimbra () is a Romanesque Roman Catholic building in Portugal. Construction of the Sé Velha began some time after the Battle of Ourique (1139), when Prince Afonso Henriques declared himself King of Portugal and chose Coi ...
, part of the
University of Coimbra The University of Coimbra (UC; , ) is a Public university, public research university in Coimbra, Portugal. First established in Lisbon in 1290, it went through a number of relocations until moving permanently to Coimbra in 1537. The university ...
, a
UNESCO The United Nations Educational, Scientific and Cultural Organization (UNESCO ) is a List of specialized agencies of the United Nations, specialized agency of the United Nations (UN) with the aim of promoting world peace and International secur ...
World Heritage Site World Heritage Sites are landmarks and areas with legal protection under an treaty, international treaty administered by UNESCO for having cultural, historical, or scientific significance. The sites are judged to contain "cultural and natural ...
. Sequencing the ''ITS'' barcode of ten samples with classical Sanger as well as with Illumina
next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation ...
techniques, they identified 49 fungal species. '' Aspergillus versicolor'', ''
Cladosporium cladosporioides ''Cladosporium cladosporioides'' is a darkly pigmented mold that occurs world-wide on a wide range of materials both outdoors and indoors. It is known for its role in the decomposition of organic matter and its presence in indoor and outdoor en ...
'', '' C. sphaerospermum'', '' C. tenuissimum'', '' Epicoccum nigrum'', '' Parengyodontium album'', '' Penicillium brevicompactum'', '' P. crustosum'', '' P. glabrum'', '' Talaromyces amestolkiae'' and '' T. stollii'' were the most common species isolated from the samples. Another study concerning objects of
cultural heritage Cultural heritage is the heritage of tangible and intangible heritage assets of a group or society that is inherited from past generations. Not all heritages of past generations are "heritage"; rather, heritage is a product of selection by socie ...
investigated the fungal diversity on a canvas painting by
Paula Rego Dame Maria Paula Figueiroa Rego (: 26 January 1935 – 8 June 2022) was a Portuguese visual artist, widely considered the pre-eminent woman artist of the late 20th and early 21st century, known particularly for her paintings and prints based o ...
using the ''ITS2'' subregion of the ''ITS'' marker. Altogether, 387 OTUs (putative species) in 117 genera of 13 different classes of fungi were observed.


See also

*
DNA barcoding DNA barcoding is a method of species identification using a short section of DNA from a specific gene or genes. The premise of DNA barcoding is that by comparison with a reference library of such DNA sections (also called " sequences"), an indiv ...
*
Microbial DNA barcoding Microbial DNA barcoding is the use of DNA metabarcoding to characterize a mixture of microorganisms. DNA metabarcoding is a method of DNA barcoding that uses universal genetic markers to identify DNA of a mixture of organisms. History Using m ...
*
Pollen DNA barcoding Pollen DNA barcoding is the process of identifying pollen donor plant species through the amplification and sequencing of specific, conserved regions of plant DNA. Being able to accurately identify pollen has a wide range of applications though i ...
* DNA barcoding in diet assessment *
Consortium for the Barcode of Life The Consortium for the Barcode of Life (CBOL) was an international initiative dedicated to supporting the development of DNA barcoding as a global standard for species identification. CBOL's Secretariat Office is hosted by the National Museum of ...


References


Further reading

* {{refend


External links


Aftol primer listing
(as used in James et al. 2006's six-gene phylogeny) Fungal morphology and anatomy DNA barcoding Fungi Molecular genetics Bioinformatics