CRISPR
   HOME

TheInfoList



OR:

CRISPR () (an acronym for clustered regularly interspaced short palindromic repeats) is a family of DNA sequences found in the
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
s of
prokaryotic A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
organisms such as
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
and
archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaebac ...
. These sequences are derived from DNA fragments of
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
s that had previously infected the prokaryote. They are used to detect and destroy DNA from similar bacteriophages during subsequent infections. Hence these sequences play a key role in the antiviral (i.e. anti-phage) defense system of prokaryotes and provide a form of
acquired immunity The adaptive immune system, also known as the acquired immune system, is a subsystem of the immune system that is composed of specialized, systemic cells and processes that eliminate pathogens or prevent their growth. The acquired immune system ...
. CRISPR is found in approximately 50% of sequenced
bacterial genome Bacterial genomes are generally smaller and less variant in size among species when compared with genomes of eukaryotes. Bacterial genomes can range in size anywhere from about 130 kbp to over 14 Mbp. A study that included, but was not limited to ...
s and nearly 90% of sequenced archaea.
Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
(or "CRISPR-associated protein 9") is an
enzyme Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as products. A ...
that uses CRISPR sequences as a guide to recognize and cleave specific strands of DNA that are complementary to the CRISPR sequence. Cas9 enzymes together with CRISPR sequences form the basis of a technology known as
CRISPR-Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
that can be used to edit genes within organisms. This editing process has a wide variety of applications including basic biological research, development of
biotechnological Biotechnology is the integration of natural sciences and engineering sciences in order to achieve the application of organisms, cells, parts thereof and molecular analogues for products and services. The term ''biotechnology'' was first used ...
products, and treatment of diseases. The development of the CRISPR-Cas9 genome editing technique was recognized by the
Nobel Prize in Chemistry ) , image = Nobel Prize.png , alt = A golden medallion with an embossed image of a bearded man facing left in profile. To the left of the man is the text "ALFR•" then "NOBEL", and on the right, the text (smaller) "NAT•" then "M ...
in 2020 which was awarded to
Emmanuelle Charpentier Emmanuelle Marie Charpentier (; born 11 December 1968) is a French professor and researcher in microbiology, genetics, and biochemistry. As of 2015, she has been a director at the Max Planck Institute for Infection Biology in Berlin. In 2018, sh ...
and
Jennifer Doudna Jennifer Anne Doudna (; born February 19, 1964) is an American biochemist who has done pioneering work in CRISPR gene editing, and made other fundamental contributions in biochemistry and genetics. Doudna was one of the first women to share a ...
. In 2022, based on evidence presented in Interference 106, 115, the PTAB tipped the scale for invention covering application of CRISPR-Cas9 in eukaryotic cells in favour of Feng Zhang, a professor of the
Broad Institute The Eli and Edythe L. Broad Institute of MIT and Harvard (IPA: , pronunciation respelling: ), often referred to as the Broad Institute, is a biomedical and genomic research center located in Cambridge, Massachusetts, United States. The institu ...
.


History


Repeated sequences

The discovery of clustered DNA repeats took place independently in three parts of the world. The first description of what would later be called CRISPR is from
Osaka University , abbreviated as , is a public research university located in Osaka Prefecture, Japan. It is one of Japan's former Imperial Universities and a Designated National University listed as a "Top Type" university in the Top Global University Project. ...
researcher
Yoshizumi Ishino is a Japanese molecular biologist, known for his discovering the DNA sequence of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR). Biography Ishino was born in Kyoto Prefecture, Japan. He received his BS, MS and PhD in 198 ...
and his colleagues in 1987. They accidentally cloned part of a CRISPR sequence together with the "''iap" gene'' ''(isozyme conversion of alkaline phosphatase)'' from the genome of ''
Escherichia coli ''Escherichia coli'' (),Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. also known as ''E. coli'' (), is a Gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Escher ...
'' which was their target. The organization of the repeats was unusual. Repeated sequences are typically arranged consecutively, without interspersing different sequences. They did not know the function of the interrupted clustered repeats. In 1993, researchers of ''
Mycobacterium tuberculosis ''Mycobacterium tuberculosis'' (M. tb) is a species of pathogenic bacteria in the family Mycobacteriaceae and the causative agent of tuberculosis. First discovered in 1882 by Robert Koch, ''M. tuberculosis'' has an unusual, waxy coating on its c ...
'' in the Netherlands published two articles about a cluster of interrupted
direct repeat Direct repeats are a type of genetic sequence that consists of two or more repeats of a specific sequence. In other words, the direct repeats are nucleotide sequences present in multiple copies in the genome. Generally, a direct repeat occurs when a ...
s (DR) in that bacterium. They recognized the diversity of the sequences that intervened in the direct repeats among different strains of ''M. tuberculosis'' and used this property to design a typing method that was named '' spoligotyping'', which is still in use, today.
Francisco Mojica Francisco Juan Martínez Mojica (born 5 October 1963) is a Spanish molecular biologist and microbiologist at the University of Alicante in Spain. He is known for his discovery of repetitive, functional DNA sequences in bacteria which he named CR ...
at the
University of Alicante The University of Alicante ( ca-valencia, Universitat d'Alacant, italic=no, ; es, Universidad de Alicante, italic=no, ; also known by the acronym ''UA'') was established in 1979 on the basis of the Center for University Studies (CEU), which was fo ...
in Spain studied repeats observed in the archaeal organisms of ''
Haloferax In taxonomy, ''Haloferax'' (common abbreviation: ''Hfx.'') is a genus of the Haloferacaceae. Genetic exchange Cells of ''H. mediterranei'' and cells of the related species '' H. volcanii'' can undergo a process of genetic exchange between two ...
'' and ''
Haloarcula ''Haloarcula'' (common abbreviation ''Har.'') is a genus of extreme halophilic Archaea in the class of Halobactaria. Cell Structure ''Haloarcula'' species can be distinguished from other genera in the family Halobacteriaceae by the presence of ...
'' species and their function. Mojica's supervisor surmised at the time that the clustered repeats had a role in correctly segregating replicated DNA into daughter cells during cell division because plasmids and chromosomes with identical repeat arrays could not coexist in ''
Haloferax volcanii ''Haloferax volcanii'' is a species of organism in the genus ''Haloferax'' in the Archaea. Description and significance Microbiologist Benjamin Elazari Volcani first discovered ''Haloferax volcanii'', a self-named extremophile, in the 1930s. '' ...
''. Transcription of the interrupted repeats was also noted for the first time; this was the first full characterization of CRISPR. By 2000, Mojica performed a survey of scientific literature and one of his students performed a search in published genomes with a program devised by himself. They identified interrupted repeats in 20 species of microbes as belonging to the same family. Because those sequences were interspaced, Mojica initially called these sequences "short regularly spaced repeats" (SRSR). In 2001, Mojica and
Ruud Jansen Ruud and Rud are surnames of Norwegian origin. Both are also Norwegian place names of numerous farmsteads named Rud or Ruud from Old Norse ''ruð'' meaning clearing. Ruud is also a Dutch masculine given name meaning "famous wolf" although it is also ...
, who were searching for additional interrupted repeats, proposed the acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) to alleviate the confusion stemming from the numerous acronyms used to describe the sequences in the scientific literature. In 2002, Tang, et al. showed evidence that CRISPR repeat regions from the genome of ''
Archaeoglobus fulgidus ''Archaeoglobus'' is a genus of the phylum Euryarchaeota. ''Archaeoglobus'' can be found in high-temperature oil fields where they may contribute to oil field souring. Metabolism ''Archaeoglobus'' grow anaerobically at extremely high temperat ...
'' were transcribed into long RNA molecules that were subsequently processed into unit-length small RNAs, plus some longer forms of 2, 3, or more spacer-repeat units. In 2005,
yogurt Yogurt (; , from tr, yoğurt, also spelled yoghurt, yogourt or yoghourt) is a food produced by bacterial Fermentation (food), fermentation of milk. The bacteria used to make yogurt are known as ''yogurt cultures''. Fermentation of sugars in t ...
researcher
Rodolphe Barrangou Rodolphe Barrangou is the Todd R. Klaenhammer Distinguished Professor in Probiotics Research in the Department of Food, Bioprocessing and Nutrition Sciences at North Carolina State University; Co-Founder and Chief Executive Officer of CRISPR Bio ...
discovered that ''
Streptococcus thermophilus ''Streptococcus thermophilus'' also known as ''Streptococcus salivarius ''subsp.'' thermophilus'' is a gram-positive bacterium, and a fermentative facultative anaerobe, of the '' viridans'' group. It tests negative for cytochrome, oxidase, an ...
'', after iterative phage challenges, develops increased phage resistance, and this enhanced resistance is due to the incorporation of additional CRISPR spacer sequences. The Danish food company Danisco, which at that time Barrangou worked for, then developed phage-resistant ''S. thermophilus'' strains for use in yogurt production. Danisco was later bought out by
DuPont DuPont de Nemours, Inc., commonly shortened to DuPont, is an American multinational chemical company first formed in 1802 by French-American chemist and industrialist Éleuthère Irénée du Pont de Nemours. The company played a major role in ...
, which "owns about 50 percent of the global dairy culture market" and the technology went mainstream.


CRISPR-associated systems

A major addition to the understanding of CRISPR came with Jansen's observation that the prokaryote repeat cluster was accompanied by a set of homologous genes that make up CRISPR-associated systems or ''cas'' genes. Four ''cas'' genes (''cas'' 1–4) were initially recognized. The Cas proteins showed
helicase Helicases are a class of enzymes thought to be vital to all organisms. Their main function is to unpack an organism's genetic material. Helicases are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separatin ...
and nuclease motifs, suggesting a role in the dynamic structure of the CRISPR loci. In this publication, the acronym CRISPR was used as the universal name of this pattern. However, the CRISPR function remained enigmatic. In 2005, three independent research groups showed that some CRISPR spacers are derived from
phage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacter ...
DNA and extrachromosomal DNA such as
plasmids A plasmid is a small, extrachromosomal DNA molecule within a cell that is physically separated from chromosomal DNA and can replicate independently. They are most commonly found as small circular, double-stranded DNA molecules in bacteria; how ...
. In effect, the spacers are fragments of DNA gathered from viruses that previously tried to attack the cell. The source of the spacers was a sign that the CRISPR/''cas'' system could have a role in adaptive immunity in
bacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
. All three studies proposing this idea were initially rejected by high-profile journals, but eventually appeared in other journals. The first publication proposing a role of CRISPR-Cas in microbial immunity, by Mojica and collaborators at the
University of Alicante The University of Alicante ( ca-valencia, Universitat d'Alacant, italic=no, ; es, Universidad de Alicante, italic=no, ; also known by the acronym ''UA'') was established in 1979 on the basis of the Center for University Studies (CEU), which was fo ...
, predicted a role for the RNA transcript of spacers on target recognition in a mechanism that could be analogous to the
RNA interference RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
system used by eukaryotic cells. Koonin and colleagues extended this RNA interference hypothesis by proposing mechanisms of action for the different CRISPR-Cas subtypes according to the predicted function of their proteins. Experimental work by several groups revealed the basic mechanisms of CRISPR-Cas immunity. In 2007, the first experimental evidence that CRISPR was an adaptive immune system was published. A CRISPR region in ''
Streptococcus thermophilus ''Streptococcus thermophilus'' also known as ''Streptococcus salivarius ''subsp.'' thermophilus'' is a gram-positive bacterium, and a fermentative facultative anaerobe, of the '' viridans'' group. It tests negative for cytochrome, oxidase, an ...
'' acquired spacers from the DNA of an infecting
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
. The researchers manipulated the resistance of ''S. thermophilus'' to different types of phages by adding and deleting spacers whose sequence matched those found in the tested phages. In 2008, Brouns and Van der Oost identified a complex of Cas proteins (called Cascade) that in ''E. coli'' cut the CRISPR RNA precursor within the repeats into mature spacer-containing RNA molecules called CRISPR RNA (crRNA), which remained bound to the protein complex. Moreover, it was found that Cascade, crRNA and a helicase/nuclease (
Cas3 Cas3 is an ATP-dependent single-strand DNA (ssDNA) translocase/helicase enzyme that degrades DNA as part of CRISPR based immunity. Cas3 is a "signature" protein of class 1 CRISPR systems and functions in a complex known as CASCADE, with other c ...
) were required to provide a bacterial host with immunity against infection by a
DNA virus A DNA virus is a virus that has a genome made of deoxyribonucleic acid (DNA) that is replicated by a DNA polymerase. They can be divided between those that have two strands of DNA in their genome, called double-stranded DNA (dsDNA) viruses, and ...
. By designing an anti-virus CRISPR, they demonstrated that two orientations of the crRNA (sense/antisense) provided immunity, indicating that the crRNA guides were targeting dsDNA. That year Marraffini and Sontheimer confirmed that a CRISPR sequence of ''
S. epidermidis ''Staphylococcus epidermidis'' is a Gram-positive bacterium, and one of over 40 species belonging to the genus ''Staphylococcus''. It is part of the normal human microbiota, typically the skin microbiota, and less commonly the mucosal microbiot ...
'' targeted DNA and not RNA to prevent
conjugation Conjugation or conjugate may refer to: Linguistics * Grammatical conjugation, the modification of a verb from its basic form * Emotive conjugation or Russell's conjugation, the use of loaded language Mathematics * Complex conjugation, the chang ...
. This finding was at odds with the proposed RNA-interference-like mechanism of CRISPR-Cas immunity, although a CRISPR-Cas system that targets foreign RNA was later found in ''
Pyrococcus furiosus ''Pyrococcus furiosus'' is a heterotrophic, strictly anaerobic, extremophilic, model species of archaea. It is classified as a hyperthermophile because it thrives best under extremely high temperatures, and is notable for having an optimum growt ...
''. A 2010 study showed that CRISPR-Cas cuts both strands of phage and plasmid DNA in ''S. thermophilus''.


Cas9

A simpler CRISPR system from ''
Streptococcus pyogenes ''Streptococcus pyogenes'' is a species of Gram-positive, aerotolerant bacteria in the genus ''Streptococcus''. These bacteria are extracellular, and made up of non-motile and non-sporing cocci (round cells) that tend to link in chains. They are ...
'' relies on the protein
Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
. The Cas9
endonuclease Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Some, such as deoxyribonuclease I, cut DNA relatively nonspecifically (without regard to sequence), while many, typically called restriction endonucleases ...
is a four-component system that includes two small molecules: crRNA and trans-activating CRISPR RNA (tracrRNA).
Jennifer Doudna Jennifer Anne Doudna (; born February 19, 1964) is an American biochemist who has done pioneering work in CRISPR gene editing, and made other fundamental contributions in biochemistry and genetics. Doudna was one of the first women to share a ...
and
Emmanuelle Charpentier Emmanuelle Marie Charpentier (; born 11 December 1968) is a French professor and researcher in microbiology, genetics, and biochemistry. As of 2015, she has been a director at the Max Planck Institute for Infection Biology in Berlin. In 2018, sh ...
re-engineered the Cas9 endonuclease into a more manageable two-component system by fusing the two RNA molecules into a "single-guide RNA" that, when combined with Cas9, could find and cut the DNA target specified by the guide RNA. This contribution was so significant that it was recognized by the
Nobel Prize in Chemistry ) , image = Nobel Prize.png , alt = A golden medallion with an embossed image of a bearded man facing left in profile. To the left of the man is the text "ALFR•" then "NOBEL", and on the right, the text (smaller) "NAT•" then "M ...
in 2020. By manipulating the nucleotide sequence of the guide RNA, the artificial Cas9 system could be programmed to target any DNA sequence for cleavage. Another group of collaborators comprising
Virginijus Šikšnys Virginijus Šikšnys (born 26 January 1956) is a Lithuanian biochemist and a professor at Vilnius University. He is a chief scientist at the Vilnius University Institute of Biotechnology. Biography Šikšnys studied organic chemistry at Vilnius ...
together with Gasiūnas, Barrangou, and Horvath showed that Cas9 from the ''S. thermophilus'' CRISPR system can also be reprogrammed to target a site of their choosing by changing the sequence of its crRNA. These advances fueled efforts to edit genomes with the modified CRISPR-Cas9 system. Groups led by
Feng Zhang Feng Zhang (; born October 22, 1981) is a Chinese-American biochemist. Zhang currently holds the James and Patricia Poitras Professorship in Neuroscience at the McGovern Institute for Brain Research and in the departments of Brain and Cognitive ...
and George Church simultaneously published descriptions of genome editing in human cell cultures using CRISPR-Cas9 for the first time. It has since been used in a wide range of organisms, including baker's yeast (''
Saccharomyces cerevisiae ''Saccharomyces cerevisiae'' () (brewer's yeast or baker's yeast) is a species of yeast (single-celled fungus microorganisms). The species has been instrumental in winemaking, baking, and brewing since ancient times. It is believed to have been o ...
''), the
opportunistic pathogen An opportunistic infection is an infection caused by pathogens (bacteria, fungi, parasites or viruses) that take advantage of an opportunity not normally available. These opportunities can stem from a variety of sources, such as a weakened immune ...
''
Candida albicans ''Candida albicans'' is an opportunistic pathogenic yeast that is a common member of the human gut flora. It can also survive outside the human body. It is detected in the gastrointestinal tract and mouth in 40–60% of healthy adults. It is us ...
'', zebrafish (''
Danio rerio The zebrafish (''Danio rerio'') is a freshwater fish belonging to the minnow family (Cyprinidae) of the order Cypriniformes. Native to South Asia, it is a popular aquarium fish, frequently sold under the trade name zebra danio (and thus often ...
''), fruit flies (''
Drosophila melanogaster ''Drosophila melanogaster'' is a species of fly (the taxonomic order Diptera) in the family Drosophilidae. The species is often referred to as the fruit fly or lesser fruit fly, or less commonly the "vinegar fly" or "pomace fly". Starting with Ch ...
''), ants (''
Harpegnathos saltator ''Harpegnathos saltator'', sometimes called the Indian jumping ant or Jerdon's jumping ant, is a species of ant found in India. They have long mandibles and have the ability to leap a few inches. They are large-eyed and active predators that hu ...
'' and ''
Ooceraea biroi ''Ooceraea biroi'', the clonal raider ant, is a queenless clonal ant in the genus ''Ooceraea'' (recently transferred from the genus '' Cerapachys''). Native to the Asian mainland, this species has become invasive on tropical and subtropical islan ...
''), mosquitoes (''
Aedes aegypti ''Aedes aegypti'', the yellow fever mosquito, is a mosquito that can spread dengue fever, chikungunya, Zika fever, Mayaro and yellow fever viruses, and other disease agents. The mosquito can be recognized by black and white markings on its legs ...
''), nematodes ('' Caenorhabditis elegans''), plants, mice (''
Mus musculus domesticus ''Mus musculus domesticus'', the Western European house mouse, is a subspecies of the house mouse (''Mus musculus''). Some laboratory mouse strains, such as C57BL/6, are domesticated from ''M. m. domesticus''. Distribution In Europe, ''M. m. dom ...
)'', monkeys and human embryos. CRISPR has been modified to make programmable
transcription factors In molecular biology, a transcription factor (TF) (or sequence-specific DNA-binding factor) is a protein that controls the rate of transcription of genetic information from DNA to messenger RNA, by binding to a specific DNA sequence. The func ...
that allows targeting and activation or silencing specific genes. The CRISPR-Cas9 system has shown to make effective gene edits in Human tripronuclear zygotes first described in a 2015 paper by Chinese scientists P. Liang and Y. Xu. The system made a successful cleavage of mutant Beta-Hemoglobin (HBB) in 28 out of 54 embryos. 4 out of the 28 embryos were successfully recombined using a donor template given by the scientists. The scientists showed that during DNA recombination of the cleaved strand, the homologous endogenous sequence HBD competes with the exogenous donor template. DNA repair in human embryos is much more complicated and particular than in derived stem cells.


Cas12a

In 2015, the nuclease Cas12a (formerly known as ) was characterized in the
CRISPR/Cpf1 Cas12a (CRISPR associated protein 12a, previously known as Cpf1) is an RNA-guided endonuclease of that forms part of the CRISPR system in some bacteria and is used by scientists to modify DNA. It originates as part of a bacterial immune mechanis ...
system of the bacterium ''
Francisella novicida ''Francisella novicida'' is a bacterium of the Francisellaceae family, which consist of Gram-negative pathogenic bacteria. These bacteria vary from small cocci to rod-shaped, and are most known for their intracellular parasitic capabilities. In ...
''. Its original name, from a
TIGRFAMs TIGRFAMs is a database of protein families designed to support manual and automated genome annotation. Each entry includes a multiple sequence alignment and hidden Markov model (HMM) built from the alignment. Sequences that score above the defined ...
protein family A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be c ...
definition built in 2012, reflects the prevalence of its CRISPR-Cas subtype in the ''Prevotella'' and ''Francisella'' lineages. Cas12a showed several key differences from Cas9 including: causing a 'staggered' cut in double stranded DNA as opposed to the 'blunt' cut produced by Cas9, relying on a 'T rich' PAM (providing alternative targeting sites to Cas9), and requiring only a CRISPR RNA (crRNA) for successful targeting. By contrast, Cas9 requires both crRNA and a transactivating crRNA (tracrRNA). These differences may give Cas12a some advantages over Cas9. For example, Cas12a's small crRNAs are ideal for multiplexed genome editing, as more of them can be packaged in one vector than can Cas9's sgRNAs. As well, the sticky 5′ overhangs left by Cas12a can be used for DNA assembly that is much more target-specific than traditional Restriction Enzyme cloning. Finally, Cas12a cleaves DNA 18–23 base pairs downstream from the PAM site. This means there is no disruption to the recognition sequence after repair, and so Cas12a enables multiple rounds of DNA cleavage. By contrast, since Cas9 cuts only 3 base pairs upstream of the PAM site, the NHEJ pathway results in
indel Indel is a molecular biology term for an insertion or deletion of bases in the genome of an organism. It is classified among small genetic variations, measuring from 1 to 10 000 base pairs in length, including insertion and deletion events that ...
mutations that destroy the recognition sequence, thereby preventing further rounds of cutting. In theory, repeated rounds of DNA cleavage should cause an increased opportunity for the desired genomic editing to occur. A distinctive feature of Cas12a, as compared to Cas9, is that after cutting its target, Cas12a remains bound to the target and then cleaves other ssDNA molecules non-discriminately. This property is called "collateral cleavage" or "trans-cleavage" activity and has been exploited for the development of various diagnostic technologies.


Cas13

In 2016, the nuclease (formerly known as ) from the bacterium ''Leptotrichia shahii'' was characterized. Cas13 is an RNA-guided RNA endonuclease, which means that it does not cleave DNA, but only single-stranded RNA. Cas13 is guided by its crRNA to a ssRNA target and binds and cleaves the target. Similar to Cas12a, the Cas13 remains bound to the target and then cleaves other ssRNA molecules non-discriminately. This collateral cleavage property has been exploited for the development of various diagnostic technologies. In 2021, Dr. Hui Yang characterized novel miniature Cas13 protein (mCas13) variants, Cas13X & Cas13Y. Using a small portion of N gene sequence from SARS-CoV-2 as a target in characterization of mCas13, revealed the sensitivity and specificity of mCas13 coupled with RT-LAMP for detection of SARS-CoV-2 in both synthetic and clinical samples over other available standard tests like RT-qPCR (1 copy/μL).


Locus structure


Repeats and spacers

The CRISPR array is made up of an AT-rich leader sequence followed by short repeats that are separated by unique spacers. CRISPR repeats typically range in size from 28 to 37
base pairs A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
(bps), though there can be as few as 23 bp and as many as 55 bp. Some show
dyad symmetry In genetics, dyad symmetry refers to two areas of a DNA strand whose base pair sequences are inverted repeats of each other. They are often described as palindromes. For example, the following shows dyad symmetry between sequences GAATAC and GTATT ...
, implying the formation of a
secondary structure Protein secondary structure is the three dimensional conformational isomerism, form of ''local segments'' of proteins. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
such as a
stem-loop Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same strand, usually complementary in nucleotide sequence when ...
('hairpin') in the RNA, while others are designed to be unstructured. The size of spacers in different CRISPR arrays is typically 32 to 38 bp (range 21 to 72 bp). New spacers can appear rapidly as part of the immune response to phage infection. There are usually fewer than 50 units of the repeat-spacer sequence in a CRISPR array.


CRISPR RNA structures

Image:RF01315.png, ;CRISPR-DR2: Secondary structure taken from th
Rfam
database. Famil
RF01315
Image:RF01318.png, ;CRISPR-DR5: Secondary structure taken from th
Rfam
database. Famil
RF011318
Image:RF01319.png, ;CRISPR-DR6: Secondary structure taken from th
Rfam
database. Famil
RF01319
Image:RF01321.png, ;CRISPR-DR8: Secondary structure taken from th
Rfam
database. Famil
RF01321
Image:RF01322.png, ;CRISPR-DR9: Secondary structure taken from th
Rfam
database. Famil
RF01322
Image:RF01332.png, ;CRISPR-DR19: Secondary structure taken from th
Rfam
database. Famil
RF01332
Image:RF01350.png, ;CRISPR-DR41: Secondary structure taken from th
Rfam
database. Famil
RF01350
Image:RF01365.png, ;CRISPR-DR52: Secondary structure taken from th
Rfam
database. Famil
RF01365
Image:RF01370.png, ;CRISPR-DR57: Secondary structure taken from th
Rfam
database. Famil
RF01370
Image:RF01378.png, ;CRISPR-DR65: Secondary structure taken from th
Rfam
database. Famil
RF01378


Cas genes and CRISPR subtypes

Small clusters of ''cas'' genes are often located next to CRISPR repeat-spacer arrays. Collectively the 93 ''cas'' genes are grouped into 35 families based on sequence similarity of the encoded proteins. 11 of the 35 families form the ''cas'' core, which includes the protein families Cas1 through Cas9. A complete CRISPR-Cas locus has at least one gene belonging to the ''cas'' core. CRISPR-Cas systems fall into two classes. Class 1 systems use a complex of multiple Cas proteins to degrade foreign nucleic acids. Class 2 systems use a single large Cas protein for the same purpose. Class 1 is divided into types I, III, and IV; class 2 is divided into types II, V, and VI. The 6 system types are divided into 19 subtypes. Each type and most subtypes are characterized by a "signature gene" found almost exclusively in the category. Classification is also based on the complement of ''cas'' genes that are present. Most CRISPR-Cas systems have a Cas1 protein. The
phylogeny A phylogenetic tree (also phylogeny or evolutionary tree Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA.) is a branching diagram or a tree showing the evolutionary relationships among various biological spec ...
of Cas1 proteins generally agrees with the classification system, but exceptions exist due to module shuffling. Many organisms contain multiple CRISPR-Cas systems suggesting that they are compatible and may share components. The sporadic distribution of the CRISPR/Cas subtypes suggests that the CRISPR/Cas system is subject to
horizontal gene transfer Horizontal gene transfer (HGT) or lateral gene transfer (LGT) is the movement of genetic material between Unicellular organism, unicellular and/or multicellular organisms other than by the ("vertical") transmission of DNA from parent to offsprin ...
during microbial
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
.


Mechanism

CRISPR-Cas immunity is a natural process of bacteria and archaea. CRISPR-Cas prevents bacteriophage infection,
conjugation Conjugation or conjugate may refer to: Linguistics * Grammatical conjugation, the modification of a verb from its basic form * Emotive conjugation or Russell's conjugation, the use of loaded language Mathematics * Complex conjugation, the chang ...
and
natural transformation In category theory, a branch of mathematics, a natural transformation provides a way of transforming one functor into another while respecting the internal structure (i.e., the composition of morphisms) of the categories involved. Hence, a natur ...
by degrading foreign nucleic acids that enter the cell.


Spacer acquisition

When a
microbe A microorganism, or microbe,, ''mikros'', "small") and ''organism'' from the el, ὀργανισμός, ''organismós'', "organism"). It is usually written as a single word but is sometimes hyphenated (''micro-organism''), especially in olde ...
is invaded by a
bacteriophage A bacteriophage (), also known informally as a ''phage'' (), is a duplodnaviria virus that infects and replicates within bacteria and archaea. The term was derived from "bacteria" and the Greek φαγεῖν ('), meaning "to devour". Bacteri ...
, the first stage of the immune response is to capture phage DNA and insert it into a CRISPR locus in the form of a spacer.
Cas1 CRISPR-associated protein 1 (cas1) is one of the two universally conserved proteins found in the CRISPR prokaryotic immune defense system. Cas1 is a metal-dependent DNA-specific endonuclease that produces double-stranded DNA fragments. Cas1 forms ...
and
Cas2 Cas2 is a protein associated with CRISPR that is involved with spacer acquisition. Representative cas2 proteins have been characterized as endonucleases that cleave single-stranded RNAs preferentially within U-rich regions, or as metal-dependen ...
are found in both types of CRISPR-Cas immune systems, which indicates that they are involved in spacer acquisition. Mutation studies confirmed this hypothesis, showing that removal of
cas1 CRISPR-associated protein 1 (cas1) is one of the two universally conserved proteins found in the CRISPR prokaryotic immune defense system. Cas1 is a metal-dependent DNA-specific endonuclease that produces double-stranded DNA fragments. Cas1 forms ...
or
cas2 Cas2 is a protein associated with CRISPR that is involved with spacer acquisition. Representative cas2 proteins have been characterized as endonucleases that cleave single-stranded RNAs preferentially within U-rich regions, or as metal-dependen ...
stopped spacer acquisition, without affecting CRISPR immune response. Multiple Cas1 proteins have been characterised and their structures resolved. Cas1 proteins have diverse
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
sequences. However, their crystal structures are similar and all purified Cas1 proteins are metal-dependent nucleases/ integrases that bind to DNA in a sequence-independent manner. Representative Cas2 proteins have been characterised and possess either (single strand) ssRNA- or (double strand) dsDNA- specific
endoribonuclease An endoribonuclease is a ribonuclease endonuclease. It cleaves either single-stranded or double-stranded RNA, depending on the enzyme. Example includes both single proteins such as RNase III, RNase A, RNase T1, RNase T2 and RNase H and also com ...
activity. In the I-E system of ''E. coli'' Cas1 and Cas2 form a complex where a Cas2 dimer bridges two Cas1 dimers. In this complex Cas2 performs a non-enzymatic scaffolding role, binding double-stranded fragments of invading DNA, while Cas1 binds the single-stranded flanks of the DNA and catalyses their integration into CRISPR arrays. New spacers are usually added at the beginning of the CRISPR next to the leader sequence creating a chronological record of viral infections. In ''E. coli'' a histone like protein called integration host factor ( IHF), which binds to the leader sequence, is responsible for the accuracy of this integration. IHF also enhances integration efficiency in the type I-F system of '' Pectobacterium atrosepticum''. but in other systems, different host factors may be required


Protospacer adjacent motifs (PAM)

Bioinformatic analysis of regions of phage genomes that were excised as spacers (termed protospacers) revealed that they were not randomly selected but instead were found adjacent to short (3–5 bp) DNA sequences termed
protospacer adjacent motif A protospacer adjacent motif (PAM) is a 2–6-base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The PAM is a component of the invading virus or plasmid, but ...
s (PAM). Analysis of CRISPR-Cas systems showed PAMs to be important for type I and type II, but not type III systems during acquisition. In type I and type II systems, protospacers are excised at positions adjacent to a PAM sequence, with the other end of the spacer cut using a ruler mechanism, thus maintaining the regularity of the spacer size in the CRISPR array. The conservation of the PAM sequence differs between CRISPR-Cas systems and appears to be evolutionarily linked to Cas1 and the leader sequence. New spacers are added to a CRISPR array in a directional manner, occurring preferentially, but not exclusively, adjacent to the leader sequence. Analysis of the type I-E system from ''E. coli'' demonstrated that the first direct repeat adjacent to the leader sequence is copied, with the newly acquired spacer inserted between the first and second direct repeats. The PAM sequence appears to be important during spacer insertion in type I-E systems. That sequence contains a strongly conserved final nucleotide (nt) adjacent to the first nt of the protospacer. This nt becomes the final base in the first direct repeat. This suggests that the spacer acquisition machinery generates single-stranded overhangs in the second-to-last position of the direct repeat and in the PAM during spacer insertion. However, not all CRISPR-Cas systems appear to share this mechanism as PAMs in other organisms do not show the same level of conservation in the final position. It is likely that in those systems, a blunt end is generated at the very end of the direct repeat and the protospacer during acquisition.


Insertion variants

Analysis of ''
Sulfolobus solfataricus ''Saccharolobus solfataricus'' is a species of thermophilic archaeon. It was transferred from the genus ''Sulfolobus'' to the new genus ''Saccharolobus'' with the description of Saccharolobus caldissimus in 2018. It was first isolated and disco ...
'' CRISPRs revealed further complexities to the canonical model of spacer insertion, as one of its six CRISPR loci inserted new spacers randomly throughout its CRISPR array, as opposed to inserting closest to the leader sequence. Multiple CRISPRs contain many spacers to the same phage. The mechanism that causes this phenomenon was discovered in the type I-E system of ''E. coli''. A significant enhancement in spacer acquisition was detected where spacers already target the phage, even mismatches to the protospacer. This ‘priming’ requires the Cas proteins involved in both acquisition and interference to interact with each other. Newly acquired spacers that result from the priming mechanism are always found on the same strand as the priming spacer. This observation led to the hypothesis that the acquisition machinery slides along the foreign DNA after priming to find a new protospacer.


Biogenesis

CRISPR-RNA (crRNA), which later guides the Cas nuclease to the target during the interference step, must be generated from the CRISPR sequence. The crRNA is initially transcribed as part of a single long transcript encompassing much of the CRISPR array. This transcript is then cleaved by Cas proteins to form crRNAs. The mechanism to produce crRNAs differs among CRISPR/Cas systems. In type I-E and type I-F systems, the proteins Cas6e and Cas6f respectively, recognise stem-loops created by the pairing of identical repeats that flank the crRNA. These Cas proteins cleave the longer transcript at the edge of the paired region, leaving a single crRNA along with a small remnant of the paired repeat region. Type III systems also use Cas6, however, their repeats do not produce stem-loops. Cleavage instead occurs by the longer transcript wrapping around the Cas6 to allow cleavage just upstream of the repeat sequence. Type II systems lack the Cas6 gene and instead utilize RNaseIII for cleavage. Functional type II systems encode an extra small RNA that is complementary to the repeat sequence, known as a
trans-activating crRNA In molecular biology, trans-activating crispr RNA (tracrRNA) is a small ''trans''-encoded RNA. It was first discovered by Emmanuelle Charpentier in her study of human pathogen ''Streptococcus pyogenes'', a type of bacteria that causes harm to human ...
(tracrRNA). Transcription of the tracrRNA and the primary CRISPR transcript results in base pairing and the formation of dsRNA at the repeat sequence, which is subsequently targeted by RNaseIII to produce crRNAs. Unlike the other two systems, the crRNA does not contain the full spacer, which is instead truncated at one end. CrRNAs associate with Cas proteins to form ribonucleotide complexes that recognize foreign nucleic acids. CrRNAs show no preference between the coding and non-coding strands, which is indicative of an RNA-guided DNA-targeting system. The type I-E complex (commonly referred to as Cascade) requires five Cas proteins bound to a single crRNA.


Interference

During the interference stage in type I systems, the PAM sequence is recognized on the crRNA-complementary strand and is required along with crRNA annealing. In type I systems correct base pairing between the crRNA and the protospacer signals a conformational change in Cascade that recruits
Cas3 Cas3 is an ATP-dependent single-strand DNA (ssDNA) translocase/helicase enzyme that degrades DNA as part of CRISPR based immunity. Cas3 is a "signature" protein of class 1 CRISPR systems and functions in a complex known as CASCADE, with other c ...
for DNA degradation. Type II systems rely on a single multifunctional protein,
Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
, for the interference step.
Cas9 Cas9 (CRISPR associated protein 9, formerly called Cas5, Csn1, or Csx12) is a 160 kilodalton protein which plays a vital role in the immunological defense of certain bacteria against DNA viruses and plasmids, and is heavily utilized in genetic ...
requires both the crRNA and the tracrRNA to function and cleave DNA using its dual HNH and RuvC/RNaseH-like endonuclease domains. Basepairing between the PAM and the phage genome is required in type II systems. However, the PAM is recognized on the same strand as the crRNA (the opposite strand to type I systems). Type III systems, like type I require six or seven Cas proteins binding to crRNAs. The type III systems analysed from ''S. solfataricus'' and ''P. furiosus'' both target the mRNA of phages rather than phage DNA genome, which may make these systems uniquely capable of targeting RNA-based phage genomes. Type III systems were also found to target DNA in addition to RNA using a different Cas protein in the complex, Cas10. The DNA cleavage was shown to be transcription dependent. The mechanism for distinguishing self from foreign DNA during interference is built into the crRNAs and is therefore likely common to all three systems. Throughout the distinctive maturation process of each major type, all crRNAs contain a spacer sequence and some portion of the repeat at one or both ends. It is the partial repeat sequence that prevents the CRISPR-Cas system from targeting the chromosome as base pairing beyond the spacer sequence signals self and prevents DNA cleavage. RNA-guided CRISPR enzymes are classified as type V restriction enzymes.


Evolution

The cas genes in the adaptor and effector modules of the CRISPR-Cas system are believed to have evolved from two different ancestral modules. A
transposon A transposable element (TE, transposon, or jumping gene) is a nucleic acid sequence in DNA that can change its position within a genome, sometimes creating or reversing mutations and altering the cell's genetic identity and genome size. Transpo ...
-like element called casposon encoding the Cas1-like integrase and potentially other components of the adaptation module was inserted next to the ancestral effector module, which likely functioned as an independent innate immune system. The highly conserved cas1 and cas2 genes of the adaptor module evolved from the ancestral module while a variety of class 1 effector cas genes evolved from the ancestral effector module. The evolution of these various class 1 effector module cas genes was guided by various mechanisms, such as duplication events. On the other hand, each type of class 2 effector module arose from subsequent independent insertions of mobile genetic elements. These mobile genetic elements took the place of the multiple gene effector modules to create single gene effector modules that produce large proteins which perform all the necessary tasks of the effector module. The spacer regions of CRISPR-Cas systems are taken directly from foreign mobile genetic elements and thus their long term evolution is hard to trace. The non-random evolution of these spacer regions has been found to be highly dependent on the environment and the particular foreign mobile genetic elements it contains. CRISPR/Cas can immunize bacteria against certain phages and thus halt transmission. For this reason, Koonin described CRISPR/Cas as a
Lamarckian Lamarckism, also known as Lamarckian inheritance or neo-Lamarckism, is the notion that an organism can pass on to its offspring physical characteristics that the parent organism acquired through use or disuse during its lifetime. It is also calle ...
inheritance mechanism. However, this was disputed by a critic who noted, "We should remember amarckfor the good he contributed to science, not for things that resemble his theory only superficially. Indeed, thinking of CRISPR and other phenomena as Lamarckian only obscures the simple and elegant way evolution really works". But as more recent studies have been conducted, it has become apparent that the acquired spacer regions of CRISPR-Cas systems are indeed a form of Lamarckian evolution because they are genetic mutations that are acquired and then passed on. On the other hand, the evolution of the Cas gene machinery that facilitates the system evolves through classic Darwinian evolution.


Coevolution

Analysis of CRISPR sequences revealed
coevolution In biology, coevolution occurs when two or more species reciprocally affect each other's evolution through the process of natural selection. The term sometimes is used for two traits in the same species affecting each other's evolution, as well ...
of host and viral genomes. Cas9 proteins are highly enriched in
pathogen In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
ic and
commensal Commensalism is a long-term biological interaction (symbiosis) in which members of one species gain benefits while those of the other species neither benefit nor are harmed. This is in contrast with mutualism, in which both organisms benefit fro ...
bacteria. CRISPR/Cas-mediated gene regulation may contribute to the regulation of endogenous bacterial genes, particularly during interaction with eukaryotic hosts. For example, ''
Francisella novicida ''Francisella novicida'' is a bacterium of the Francisellaceae family, which consist of Gram-negative pathogenic bacteria. These bacteria vary from small cocci to rod-shaped, and are most known for their intracellular parasitic capabilities. In ...
'' uses a unique, small, CRISPR/Cas-associated RNA (scaRNA) to repress an endogenous transcript encoding a bacterial
lipoprotein A lipoprotein is a biochemical assembly whose primary function is to transport hydrophobic lipid (also known as fat) molecules in water, as in blood plasma or other extracellular fluids. They consist of a triglyceride and cholesterol center, su ...
that is critical for ''F. novicida'' to dampen host response and promote virulence. The basic model of CRISPR evolution is newly incorporated spacers driving phages to mutate their genomes to avoid the bacterial immune response, creating diversity in both the phage and host populations. To resist a phage infection, the sequence of the CRISPR spacer must correspond perfectly to the sequence of the target phage gene. Phages can continue to infect their hosts' given point mutations in the spacer. Similar stringency is required in PAM or the bacterial strain remains phage sensitive.


Rates

A study of 124 ''S. thermophilus'' strains showed that 26% of all spacers were unique and that different CRISPR loci showed different rates of spacer acquisition. Some CRISPR loci evolve more rapidly than others, which allowed the strains' phylogenetic relationships to be determined. A comparative genomic analysis showed that ''E. coli'' and '' S. enterica'' evolve much more slowly than ''S. thermophilus''. The latter's strains that diverged 250 thousand years ago still contained the same spacer complement.
Metagenomic Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microb ...
analysis of two acid-mine-drainage
biofilm A biofilm comprises any syntrophic consortium of microorganisms in which cells stick to each other and often also to a surface. These adherent cells become embedded within a slimy extracellular matrix that is composed of extracellular ...
s showed that one of the analyzed CRISPRs contained extensive deletions and spacer additions versus the other biofilm, suggesting a higher phage activity/prevalence in one community than the other. In the oral cavity, a temporal study determined that 7–22% of spacers were shared over 17 months within an individual while less than 2% were shared across individuals. From the same environment, a single strain was tracked using PCR primers specific to its CRISPR system. Broad-level results of spacer presence/absence showed significant diversity. However, this CRISPR added 3 spacers over 17 months, suggesting that even in an environment with significant CRISPR diversity some loci evolve slowly. CRISPRs were analysed from the metagenomes produced for the
human microbiome project The Human Microbiome Project (HMP) was a United States National Institutes of Health (NIH) research initiative to improve understanding of the microbiota involved in human health and disease. Launched in 2007, the first phase (HMP1) focused on i ...
. Although most were body-site specific, some within a body site are widely shared among individuals. One of these loci originated from
streptococcal ''Streptococcus'' is a genus of gram-positive ' (plural ) or spherical bacteria that belongs to the family Streptococcaceae, within the order Lactobacillales (lactic acid bacteria), in the phylum Bacillota. Cell division in streptococci occur ...
species and contained ≈15,000 spacers, 50% of which were unique. Similar to the targeted studies of the oral cavity, some showed little evolution over time. CRISPR evolution was studied in
chemostat A chemostat (from ''chem''ical environment is ''stat''ic) is a bioreactor to which fresh medium is continuously added, while culture liquid containing left over nutrients, metabolic end products and microorganisms is continuously removed at the sa ...
s using ''S. thermophilus'' to directly examine spacer acquisition rates. In one week, ''S. thermophilus'' strains acquired up to three spacers when challenged with a single phage. During the same interval, the phage developed
single nucleotide polymorphism In genetics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently larg ...
s that became fixed in the population, suggesting that targeting had prevented phage replication absent these mutations. Another ''S. thermophilus'' experiment showed that phages can infect and replicate in hosts that have only one targeting spacer. Yet another showed that sensitive hosts can exist in environments with high phage titres. The chemostat and observational studies suggest many nuances to CRISPR and phage (co)evolution.


Identification

CRISPRs are widely distributed among bacteria and archaea and show some sequence similarities. Their most notable characteristic is their repeating spacers and direct repeats. This characteristic makes CRISPRs easily identifiable in long sequences of DNA, since the number of repeats decreases the likelihood of a false positive match. Analysis of CRISPRs in metagenomic data is more challenging, as CRISPR loci do not typically assemble, due to their repetitive nature or through strain variation, which confuses assembly algorithms. Where many reference genomes are available,
polymerase chain reaction The polymerase chain reaction (PCR) is a method widely used to rapidly make millions to billions of copies (complete or partial) of a specific DNA sample, allowing scientists to take a very small sample of DNA and amplify it (or a part of it) t ...
(PCR) can be used to amplify CRISPR arrays and analyse spacer content. However, this approach yields information only for specifically targeted CRISPRs and for organisms with sufficient representation in public databases to design reliable
polymerase chain reaction The polymerase chain reaction (PCR) is a method widely used to rapidly make millions to billions of copies (complete or partial) of a specific DNA sample, allowing scientists to take a very small sample of DNA and amplify it (or a part of it) t ...
(PCR) primers. Degenerate repeat-specific primers can be used to amplify CRISPR spacers directly from environmental samples; amplicons containing two or three spacers can be then computationally assembled to reconstruct long CRISPR arrays. The alternative is to extract and reconstruct CRISPR arrays from shotgun metagenomic data. This is computationally more difficult, particularly with second generation sequencing technologies (e.g. 454, Illumina), as the short read lengths prevent more than two or three repeat units appearing in a single read. CRISPR identification in raw reads has been achieved using purely ''de novo'' identification or by using direct repeat sequences in partially assembled CRISPR arrays from
contig A contig (from ''contiguous'') is a set of overlapping DNA segments that together represent a consensus region of DNA.Gregory, S. ''Contig Assembly''. Encyclopedia of Life Sciences, 2005. In bottom-up sequencing projects, a contig refers to ov ...
s (overlapping DNA segments that together represent a consensus region of DNA) and direct repeat sequences from published genomes as a hook for identifying direct repeats in individual reads.


Use by phages

Another way for bacteria to defend against phage infection is by having chromosomal islands. A subtype of chromosomal islands called phage-inducible chromosomal island (PICI) is excised from a bacterial chromosome upon phage infection and can inhibit phage replication. PICIs are induced, excised, replicated, and finally packaged into small capsids by certain staphylococcal temperate phages. PICIs use several mechanisms to block phage reproduction. In the first mechanism, PICI-encoded Ppi differentially blocks phage maturation by binding or interacting specifically with phage TerS, hence blocking phage TerS/TerL complex formation responsible for phage DNA packaging. In the second mechanism PICI CpmAB redirects the phage capsid morphogenetic protein to make 95% of SaPI-sized capsid and phage DNA can package only 1/3rd of their genome in these small capsids and hence become nonviable phage. The third mechanism involves two proteins, PtiA and PtiB, that target the LtrC, which is responsible for the production of virion and lysis proteins. This interference mechanism is modulated by a modulatory protein, PtiM, binds to one of the interference-mediating proteins, PtiA, and hence achieves the required level of interference. One study showed that lytic ICP1 phage, which specifically targets ''
Vibrio cholerae ''Vibrio cholerae'' is a species of Gram-negative, facultative anaerobe and comma-shaped bacteria. The bacteria naturally live in brackish or saltwater where they attach themselves easily to the chitin-containing shells of crabs, shrimps, and oth ...
'' serogroup O1, has acquired a CRISPR/Cas system that targets a ''V. cholera'' PICI-like element. The system has 2 CRISPR loci and 9 Cas genes. It seems to be homologous to the I-F system found in ''
Yersinia pestis ''Yersinia pestis'' (''Y. pestis''; formerly '' Pasteurella pestis'') is a gram-negative, non-motile, coccobacillus bacterium without spores that is related to both ''Yersinia pseudotuberculosis'' and ''Yersinia enterocolitica''. It is a facult ...
''. Moreover, like the bacterial CRISPR/Cas system, ICP1 CRISPR/Cas can acquire new sequences, which allows phage and host to co-evolve. Certain archaeal viruses were shown to carry mini-CRISPR arrays containing one or two spacers. It has been shown that spacers within the virus-borne CRISPR arrays target other viruses and plasmids, suggesting that mini-CRISPR arrays represent a mechanism of heterotypic superinfection exclusion and participate in interviral conflicts.


Applications


CRISPR gene editing

CRISPR technology has been applied in the food and farming industries to engineer probiotic cultures and to immunize industrial cultures (for yogurt, for instance) against infections. It is also being used in crops to enhance yield, drought tolerance and nutritional value. By the end of 2014, some 1000 research papers had been published that mentioned CRISPR. The technology had been used to functionally inactivate genes in human cell lines and cells, to study ''
Candida albicans ''Candida albicans'' is an opportunistic pathogenic yeast that is a common member of the human gut flora. It can also survive outside the human body. It is detected in the gastrointestinal tract and mouth in 40–60% of healthy adults. It is us ...
'', to modify
yeasts Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are estimated to constitut ...
used to make
biofuels Biofuel is a fuel that is produced over a short time span from biomass, rather than by the very slow natural processes involved in the formation of fossil fuels, such as oil. According to the United States Energy Information Administration (E ...
, and genetically modify crop strains. Hsu and his colleagues state that the ability to manipulate the genetic sequences allows for reverse engineering that can positively affect biofuel production CRISPR can also be used to change mosquitos so they cannot transmit diseases such as malaria. CRISPR-based approaches utilizing Cas12a have recently been utilized in the successful modification of a broad number of plant species. In July 2019, CRISPR was used to experimentally treat a patient with a genetic disorder. The patient was a 34-year-old woman with
sickle cell disease Sickle cell disease (SCD) is a group of blood disorders typically inherited from a person's parents. The most common type is known as sickle cell anaemia. It results in an abnormality in the oxygen-carrying protein haemoglobin found in red b ...
. In February 2020, progress was made on
HIV The human immunodeficiency viruses (HIV) are two species of ''Lentivirus'' (a subgroup of retrovirus) that infect humans. Over time, they cause acquired immunodeficiency syndrome (AIDS), a condition in which progressive failure of the immune ...
treatments with 60-80% of the integrated viral DNA removed in mice and some being completely free from the virus after edits involving both LASER ART, a new anti-retroviral therapy, and CRISPR. In March 2020, CRISPR-modified virus was injected into a patient's eye in an attempt to treat
Leber congenital amaurosis Leber congenital amaurosis (LCA) is a rare inherited eye disease that appears at birth or in the first few months of life. It affects about 1 in 40,000 newborns. LCA was first described by Theodor Leber in the 19th century. It should not be co ...
. In the future, CRISPR gene editing could potentially be used to create new species or revive extinct species from closely related ones. CRISPR-based re-evaluations of claims for gene-disease relationships have led to the discovery of potentially important anomalies. In July 2021, CRISPR gene editing of hiPSC's was used to study the role of MBNL proteins associated with DM1.


CRISPR as diagnostic tool

CRISPR associated nucleases have shown to be useful as a tool for molecular testing due to their ability to specifically target nucleic acid sequences in a high background of non-target sequences. In 2016, the Cas9 nuclease was used to deplete unwanted nucleotide sequences in next-generation sequencing libraries while requiring only 250 picograms of initial RNA input. Beginning in 2017, CRISPR associated nucleases were also used for direct diagnostic testing of nucleic acids, down to single molecule sensitivity. CRISPR diversity is used as an analysis target to discern
phylogeny A phylogenetic tree (also phylogeny or evolutionary tree Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA.) is a branching diagram or a tree showing the evolutionary relationships among various biological spec ...
and diversity in bacteria, such as in xanthomonads by Martins ''et al.'', 2019. Early detections of
plant pathogens Plant pathology (also phytopathology) is the scientific study of diseases in plants caused by pathogens (infectious organisms) and environmental conditions (physiological factors). Organisms that cause infectious disease include fungi, oomyc ...
by molecular typing of the pathogen's CRISPRs can be used in agriculture as demonstrated by Shen ''et al.'', 2020. By coupling CRISPR-based diagnostics to additional enzymatic processes, the detection of molecules beyond nucleic acids is possible. One example of a coupled technology is SHERLOCK-based Profiling of IN vitro Transcription (SPRINT). SPRINT can be used to detect a variety of substances, such as metabolites in patient samples or contaminants in environmental samples, with high throughput or with portable point-of-care devices. CRISPR/Cas platforms are also being explored for detection and inactivation of
SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) is a strain of coronavirus that causes COVID-19 (coronavirus disease 2019), the respiratory illness responsible for the ongoing COVID-19 pandemic. The virus previously had a ...
, the virus that causes
COVID-19 Coronavirus disease 2019 (COVID-19) is a contagious disease caused by a virus, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The first known case was COVID-19 pandemic in Hubei, identified in Wuhan, China, in December ...
. Two different comprehensive diagnostic tests, AIOD-CRISPR and SHERLOCK test have been identified for SARS-CoV-2. The SHERLOCK test is based on a fluorescently labelled press reporter RNA which has the ability to identify 10 copies per microliter. The AIOD-CRISPR helps with robust and highly sensitive visual detection of the viral nucleic acid.


See also

* Anti-CRISPR * CRISPR/Cas Tools *
CRISPR gene editing CRISPR gene editing (pronounced "crisper") is a genetic engineering technique in molecular biology by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense sys ...
*
The CRISPR Journal ''The CRISPR Journal'' is a peer-reviewed scientific journal published every two months by Mary Ann Liebert. It covers research on all aspects of CRISPR research, including its uses in synthetic biology and genome editing. Its editor-in-chief i ...
*
DRACO Draco is the Latin word for serpent or dragon. Draco or Drako may also refer to: People * Draco (lawgiver) (from Greek: Δράκων; 7th century BC), the first lawgiver of ancient Athens, Greece, from whom the term ''draconian'' is derived * ...
* Gene knockout *
Genetics Genetics is the study of genes, genetic variation, and heredity in organisms.Hartl D, Jones E (2005) It is an important branch in biology because heredity is vital to organisms' evolution. Gregor Mendel, a Moravian Augustinian friar wor ...
* Genome-wide CRISPR-Cas9 knockout screens *
Glossary of genetics A glossary (from grc, γλῶσσα, ''glossa''; language, speech, wording) also known as a vocabulary or clavis, is an alphabetical list of terms in a particular domain of knowledge with the definitions for those terms. Traditionally, a glo ...
* ''Human Nature'' (2019 documentary film) * Prime editing *
RNAi RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
*
SiRNA Small interfering RNA (siRNA), sometimes known as short interfering RNA or silencing RNA, is a class of double-stranded RNA at first non-coding RNA molecules, typically 20-24 (normally 21) base pairs in length, similar to miRNA, and operating ...
* Surveyor nuclease assay *
Synthetic biology Synthetic biology (SynBio) is a multidisciplinary area of research that seeks to create new biological parts, devices, and systems, or to redesign systems that are already found in nature. It is a branch of science that encompasses a broad ran ...
*
Zinc finger A zinc finger is a small protein structural motif that is characterized by the coordination of one or more zinc ions (Zn2+) in order to stabilize the fold. It was originally coined to describe the finger-like appearance of a hypothesized struct ...


Notes


References


Further reading

* * * * * * * * * * * * * * * * *


External links

* *


Protein Data Bank

* * * * * {{DEFAULTSORT:Crispr 1987 in biotechnology 2015 in biotechnology Biological engineering Biotechnology Emerging technologies Genetic engineering Genome editing Jennifer Doudna Molecular biology Non-coding RNA Repetitive DNA sequences Immune system Prokaryote genes