A computational gene is a molecular
automaton
An automaton (; plural: automata or automatons) is a relatively self-operating machine, or control mechanism designed to automatically follow a sequence of operations, or respond to predetermined instructions.Automaton – Definition and More ...
consisting of a structural part and a functional part; and its design is such that it might work in a cellular environment.
The structural part is a naturally occurring
gene
In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
, which is used as a skeleton to encode the input and the transitions of the automaton (Fig. 1A). The conserved features of a structural gene (e.g.,
DNA polymerase
A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create ...
binding site, start and stop
codons
The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
, and splicing sites) serve as constants of the computational gene, while the coding regions, the number of
exons
An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequence ...
and
introns
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene. ...
, the position of start and stop codon, and the automata theoretical variables (symbols, states, and transitions) are the design parameters of the computational gene. The constants and the design parameters are linked by several logical and biochemical constraints (e.g., encoded automata theoretic variables must not be recognized as splicing junctions). The input of the automaton are molecular markers given by single stranded
DNA (ssDNA) molecules. These markers are signalling aberrant (e.g., carcinogenic) molecular
phenotype
In genetics, the phenotype () is the set of observable characteristics or traits of an organism. The term covers the organism's morphology or physical form and structure, its developmental processes, its biochemical and physiological proper ...
and turn on the self-assembly of the functional gene. If the input is accepted, the output encodes a double stranded DNA (dsDNA) molecule, a functional gene which should be successfully integrated into the cellular
transcription
Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including:
Genetics
* Transcription (biology), the copying of DNA into RNA, the fir ...
and
translation
Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
machinery producing a wild type
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
or an anti-drug (Fig. 1B). Otherwise, a rejected input will assemble into a partially dsDNA molecule which cannot be translated.
A potential application: ''in situ'' diagnostics and therapy of cancer
Computational genes might be used in the future to correct aberrant
mutations
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mi ...
in a gene or group of genes that can trigger disease phenotypes. One of the most prominent examples is the tumor suppressor
p53
p53, also known as Tumor protein P53, cellular tumor antigen p53 (UniProt name), or transformation-related protein 53 (TRP53) is a regulatory protein that is often mutated in human cancers. The p53 proteins (originally thought to be, and often s ...
gene, which is present in every cell, and acts as a guard to control growth. Mutations in this gene can abolish its function, allowing uncontrolled growth that can lead to
cancer
Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
. For instance, a mutation at codon 249 in the p53 protein is characteristic for
hepatocellular cancer
Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer in adults and is currently the most common cause of death in people with cirrhosis. HCC is the third leading cause of cancer-related deaths worldwide.
It occurs in t ...
. This disease could be treated by the CDB3 peptide which binds to the p53 core domain and stabilises its fold.
A single disease-related mutation can be then diagnosed and treated by the following diagnostic rule:
Such a rule might be implemented by a molecular automaton consisting of two partially dsDNA molecules and one ssDNA molecule, which corresponds to the disease-related mutation and provides a molecular switch for the linear self-assembly of the functional gene (Fig. 2). The gene structure is completed by a cellular
ligase
In biochemistry, a ligase is an enzyme that can catalyze the joining (ligation) of two large molecules by forming a new chemical bond. This is typically via hydrolysis of a small pendant chemical group on one of the larger molecules or the enzym ...
present in both
eukaryotic
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
and
prokaryotic cells
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
. The transcription and translation machinery of the cell is then in charge of therapy and administers either a wild-type protein or an anti-drug (Fig. 3). The rule (1) may even be generalised to involve mutations from different proteins allowing a combined diagnosis and therapy.
In this way, computational genes might allow implementation ''in situ'' of a therapy as soon as the cell starts developing defective material. Computational genes combine the techniques of
gene therapy
Gene therapy is a medical field which focuses on the genetic modification of cells to produce a therapeutic effect or the treatment of disease by repairing or reconstructing defective genetic material. The first attempt at modifying human DN ...
which allows to replace in the genome an aberrant gene by its healthy counterpart, as well as to silence the gene expression (similar to
antisense technology).
Challenges
Although mechanistically simple and quite robust on molecular level, several issues need to be addressed before an ''in vivo'' implementation of computational genes can be considered.
First, the DNA material must be internalised into the cell, specifically into the
nucleus
Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to:
*Atomic nucleus, the very dense central region of an atom
*Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA
Nucle ...
. In fact, the transfer of DNA or
RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
through
biological membrane
A biological membrane, biomembrane or cell membrane is a selectively permeable membrane that separates the interior of a cell from the external environment or creates intracellular compartments by serving as a boundary between one part of the ce ...
s is a key step in the
drug delivery
Drug delivery refers to approaches, formulations, manufacturing techniques, storage systems, and technologies involved in transporting a pharmaceutical compound to its target site to achieve a desired therapeutic effect. Principles related to d ...
.
Some results show that nuclear localisation signals can be irreversibly linked to
one end of the
oligonucleotides
Oligonucleotides are short DNA or RNA molecules, oligomers, that have a wide range of applications in genetic testing, research, and forensics. Commonly made in the laboratory by solid-phase chemical synthesis, these small bits of nucleic acids c ...
, forming an oligonucleotide-peptide conjugate that
allows effective internalisation of DNA into the nucleus.
In addition, the DNA complexes should have low
immunogenicity
Immunogenicity is the ability of a foreign substance, such as an antigen, to provoke an immune response in the body of a human or other animal. It may be wanted or unwanted:
* Wanted immunogenicity typically relates to vaccines, where the injectio ...
to guarantee their integrity in the cell and their resistance to cellular
nucleases
A nuclease (also archaically known as nucleodepolymerase or polynucleotidase) is an enzyme capable of cleaving the phosphodiester bonds between nucleotides of nucleic acids. Nucleases variously effect single and double stranded breaks in their ta ...
.
Current strategies to eliminate nuclease sensitivity include modifications of the oligonucleotide backbone such as methylphosphonate and phosphorothioate (S-ODN) oligodeoxynucleotides, but along with their increased stability, modified oligonucleotides often have altered pharmacologic properties.
Finally, similar to any other drug, DNA complexes could cause nonspecific and toxic side effects.
''In vivo'' applications of antisense oligonucleotides showed that toxicity is largely due to impurities in the oligonucleotide preparation and lack of specificity of the particular sequence used.
Undoubtedly, progress on anti
sense
A sense is a biological system used by an organism for sensation, the process of gathering information about the world through the detection of Stimulus (physiology), stimuli. (For example, in the human body, the brain which is part of the cen ...
biotechnology will also result
in a direct benefit to the model of computational genes.
See also
*
Biocomputers
Biological computers use biologically derived molecules — such as DNA and/or proteins — to perform digital or real computations.
The development of biocomputers has been made possible by the expanding new science of nanobiotechnology. The term ...
*
Cyclic enzyme system A cyclic enzyme system is a theoretical system of two enzymes sharing a single substrate or cofactor, also referred to as a biochemical switching device. It has been used as a biochemical implementation of a simple computational device, acting as ...
*
DNA computing
DNA computing is an emerging branch of unconventional computing which uses DNA, biochemistry, and molecular biology hardware, instead of the traditional electronic computing. Research and development in this area concerns theory, experiments, a ...
*
Finite-state machine
A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...
*
Molecular electronics
Molecular electronics is the study and application of molecular building blocks for the fabrication of electronic components. It is an interdisciplinary area that spans physics, chemistry, and materials science. The unifying feature is use of mo ...
*
Nanobiotechnology
Nanobiotechnology, bionanotechnology, and nanobiology are terms that refer to the intersection of nanotechnology and biology. Given that the subject is one that has only emerged very recently, bionanotechnology and nanobiotechnology serve as blan ...
*
Nanomedicine
Nanomedicine is the medical application of nanotechnology. Nanomedicine ranges from the medical applications of nanomaterials and BioBrick, biological devices, to Nanoelectronics, nanoelectronic biosensors, and even possible future applicatio ...
References
{{Reflist, 2
Nanotechnology