Two-hybrid screening (originally known as yeast two-hybrid system or Y2H) is a molecular biology technique used to discover protein–protein interactions (PPIs) and protein–DNA interactions by testing for physical interactions (such as binding) between two proteins or a single protein and a DNA molecule, respectively.
The premise behind the test is the activation of downstream reporter gene(s) by the binding of a transcription factor onto an upstream activating sequence (UAS). For two-hybrid screening, the transcription factor is split into two separate fragments, called the DNA-binding domain (DBD or often also abbreviated as BD) and activating domain (AD). The BD is the domain responsible for binding to the UAS and the AD is the domain responsible for the activation of transcription. The Y2H is thus a protein-fragment complementation assay.
History
Pioneered by Stanley Fields and Ok-Kyu Song in 1989, the technique was originally designed to detect protein–protein interactions using the Gal4 transcriptional activator of the yeast ''Saccharomyces cerevisiae''. The Gal4 protein activated transcription of a gene involved in galactose utilization, which formed the basis of selection. Since then, the same principle has been adapted into many alternative methods, including some that detect protein–DNA interactions or DNA–DNA interactions, as well as methods that use different host organisms such as ''Escherichia coli'' or mammalian cells instead of yeast.
Basic premise
The key to the two-hybrid screen is that in most eukaryotic transcription factors, the activating and binding domains are modular and can function in proximity to each other without direct binding. This means that even though the transcription factor is split into two fragments, it can still activate transcription when the two fragments are indirectly connected.
The most common screening approach is the yeast two-hybrid assay. In this approach (the matrix approach) the researcher knows where each prey is located on the medium used (agar plates). Millions of potential interactions in several organisms have been screened in the last decade using high-throughput screening systems (often using robots), and thousands of interactions have been detected and categorized in databases such as BioGRID. This system often utilizes a genetically engineered strain of yeast in which the biosynthesis of certain nutrients (usually amino acids or nucleic acids) is lacking. When grown on media that lack these nutrients, the yeast fail to survive. This mutant yeast strain can be made to incorporate foreign DNA in the form of plasmids. In yeast two-hybrid screening, separate bait and prey plasmids are simultaneously introduced into the mutant yeast strain, or a mating strategy is used to get both plasmids into one host cell.
The second high-throughput approach is the library screening approach. In this setup the bait- and prey-harboring cells are mated in a random order. After mating and selecting surviving cells on selective medium, the scientist sequences the isolated plasmids to see which prey (DNA sequence) is interacting with the bait used. This approach has a lower rate of reproducibility and tends to yield more false positives than the matrix approach.
Plasmids are engineered to produce a protein product in which the DNA-binding domain (BD) fragment is fused onto a protein while another plasmid is engineered to produce a protein product in which the activation domain (AD) fragment is fused onto another protein. The protein fused to the BD may be referred to as the bait protein, and is typically a known protein the investigator is using to identify new binding partners. The protein fused to the AD may be referred to as the prey protein and can be either a single known protein or a
library
of known or unknown proteins. In this context, a library may consist of a collection of protein-encoding sequences that represent all the proteins expressed in a particular organism or tissue, or may be generated by synthesising random DNA sequences.
Regardless of the source, they are subsequently incorporated into the protein-encoding sequence of a plasmid, which is then transfected into the cells chosen for the screening method.
This technique, when using a library, assumes that each cell is transfected with no more than a single plasmid and that, therefore, each cell ultimately expresses no more than a single member from the protein library.
If the bait and prey proteins interact (i.e., bind), then the AD and BD of the transcription factor are indirectly connected, bringing the AD into proximity with the transcription start site, and transcription of the reporter gene(s) can occur. If the two proteins do not interact, there is no transcription of the reporter gene. In this way, a successful interaction between the fused proteins is linked to a change in the cell phenotype.
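The readout logic described above can be sketched as a small toy model. This is only an illustration under stated assumptions, not a laboratory protocol: the protein names, the `KNOWN_INTERACTIONS` set and the plate positions are hypothetical placeholders. Each prey sits at a known plate position, so a position where the reporter is active identifies its prey without sequencing.

```python
# Toy model of a two-hybrid readout. All names below are hypothetical.

# Pairs of proteins assumed (for illustration only) to bind each other.
KNOWN_INTERACTIONS = {
    frozenset({"BaitX", "PreyA"}),
    frozenset({"BaitX", "PreyC"}),
}


def interacts(bait: str, prey: str) -> bool:
    """Return True if the two proteins are assumed to bind."""
    return frozenset({bait, prey}) in KNOWN_INTERACTIONS


def reporter_active(bait: str, prey: str) -> bool:
    """The BD-bait fusion sits on the UAS; the reporter is transcribed only
    when the AD-prey fusion is recruited through a bait-prey interaction."""
    return interacts(bait, prey)


def screen_array(bait: str, plate: dict[str, str]) -> dict[str, str]:
    """Test one bait against preys at known plate positions and return
    the positions (and preys) where the reporter is switched on."""
    return {pos: prey for pos, prey in plate.items() if reporter_active(bait, prey)}


plate = {"A1": "PreyA", "A2": "PreyB", "A3": "PreyC"}
hits = screen_array("BaitX", plate)
print(hits)
```

In a real screen the interaction is of course unknown in advance; the lookup set here merely stands in for whatever happens inside the cell.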
The challenge of separating cells that express proteins that happen to interact with their counterpart fusion proteins from those that do not, is addressed in the following section.
Fixed domains
In any study, some of the protein domains, those under investigation, will be varied according to the goals of the study whereas other domains, those that are not themselves being investigated, will be kept constant. For example, in a two-hybrid study to select DNA-binding domains, the DNA-binding domain, BD, will be varied while the two interacting proteins, the bait and prey, must be kept constant to maintain a strong binding between the BD and AD. There are a number of domains from which to choose the BD, bait and prey and AD, if these are to remain constant. In protein–protein interaction investigations, the BD may be chosen from any of many strong DNA-binding domains such as
Zif268.
A frequent choice of bait and prey domains is residues 263–352 of yeast Gal11P with an N342V mutation and residues 58–97 of yeast Gal4, respectively. These domains can be used in both yeast- and bacterial-based selection techniques and are known to bind together strongly.
The AD chosen must be able to activate transcription of the reporter gene, using the cell's own transcription machinery. Thus, the variety of ADs available for use in yeast-based techniques may not be suited to use in their bacterial-based analogues. The herpes simplex virus-derived AD, VP16 and yeast Gal4 AD have been used with success in yeast
whilst a portion of the α-subunit of ''E. coli'' RNA polymerase has been utilised in ''E. coli''-based methods.
Whilst powerfully activating domains may allow greater sensitivity towards weaker interactions, conversely, a weaker AD may provide greater stringency.
Construction of expression plasmids
A number of engineered genetic sequences must be incorporated into the host cell to perform two-hybrid analysis or one of its derivative techniques. The considerations and methods used in the construction and delivery of these sequences differ according to the needs of the assay and the organism chosen as the experimental background.
There are two broad categories of hybrid library: random libraries and cDNA-based libraries. A cDNA library is constituted by the cDNA produced through reverse transcription of mRNA collected from specific cells or types of cell. This library can be ligated into a construct so that it is attached to the BD or AD being used in the assay.
A random library uses lengths of DNA of random sequence in place of these cDNA sections. A number of methods exist for the production of these random sequences, including
cassette mutagenesis.
Regardless of the source of the DNA library, it is ligated into the appropriate place in the relevant plasmid/phagemid using the appropriate restriction endonucleases.
''E. coli''-specific considerations
By placing the hybrid proteins under the control of
IPTG-inducible
''lac'' promoters, they are expressed only on media supplemented with IPTG. Further, by including different antibiotic resistance genes in each genetic construct, the growth of non-transformed cells is easily prevented through culture on media containing the corresponding antibiotics. This is particularly important for counter selection methods in which a ''lack'' of interaction is needed for cell survival.
The reporter gene may be inserted into the ''E. coli'' genome by first inserting it into an episome, a type of plasmid with the ability to incorporate itself into the bacterial cell genome with a copy number of approximately one per cell.
The hybrid expression phagemids can be electroporated into ''E. coli'' XL-1 Blue cells which after amplification and infection with VCS-M13
helper phage, will yield a stock of library phage. These phage will each contain one single-stranded member of the phagemid library.
Recovery of protein information
Once the selection has been performed, the
primary structure of the proteins which display the appropriate characteristics must be determined. This is achieved by retrieval of the protein-encoding sequences (as originally inserted) from the cells showing the appropriate phenotype.
''E. coli''
The phagemid used to transform ''E. coli'' cells may be "rescued" from the selected cells by infecting them with VCS-M13 helper phage. The resulting phage particles that are produced contain the single-stranded phagemids and are used to infect XL-1 Blue cells.
The double-stranded phagemids are subsequently collected from these XL-1 Blue cells, essentially reversing the process used to produce the original library phage. Finally, the DNA sequences are determined through
dideoxy sequencing.
Controlling sensitivity
The ''Escherichia coli''-derived Tet-R repressor can be used in line with a conventional reporter gene and can be controlled by tetracycline or doxycycline (Tet-R inhibitors). Thus the expression of Tet-R is controlled by the standard two-hybrid system, but the Tet-R in turn controls (represses) the expression of a previously mentioned reporter such as ''HIS3'', through its Tet-R promoter. Tetracycline or its derivatives can then be used to regulate the sensitivity of a system utilising Tet-R.
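The inverted logic of this arrangement can be sketched as follows. This is a deliberately simplified Boolean model with hypothetical function and parameter names; real repression is graded and depends on the tetracycline concentration.

```python
def his3_expressed(bait_prey_interact: bool, tetracycline_present: bool) -> bool:
    """Simplified model: the two-hybrid interaction drives Tet-R expression,
    Tet-R represses HIS3, and tetracycline inactivates Tet-R."""
    tetr_made = bait_prey_interact
    tetr_functional = tetr_made and not tetracycline_present
    return not tetr_functional


# Print the full truth table: interaction, tetracycline, HIS3 expressed.
for interact in (False, True):
    for tet in (False, True):
        print(interact, tet, his3_expressed(interact, tet))
```

In this model the reporter is switched off only when the proteins interact and no tetracycline is present, which is the behaviour a counter-selection for loss of interaction would rely on.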
Sensitivity may also be controlled by varying the dependency of the cells on their reporter genes. For example, this may be affected by altering the concentration of histidine in the growth medium for ''his3''-dependent cells and altering the concentration of streptomycin for ''aadA'' dependent cells.
Selection-gene-dependency may also be controlled by applying an inhibitor of the selection gene at a suitable concentration.
3-Amino-1,2,4-triazole (3-AT), for example, is a competitive inhibitor of the ''HIS3''-gene product and may be used to titrate the minimum level of ''HIS3'' expression required for growth on histidine-deficient media.
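The effect of a competitive inhibitor such as 3-AT can be illustrated with the standard Michaelis–Menten rate law for competitive inhibition. This is a textbook idealisation rather than a measured model of the ''HIS3'' product; the symbols $V_{\max}$, $K_m$ and $K_i$ are generic kinetic parameters and no values for them are given here.

```latex
v = \frac{V_{\max}\,[S]}{K_m\left(1 + \dfrac{[I]}{K_i}\right) + [S]}
```

Here $[S]$ is the substrate concentration and $[I]$ the inhibitor concentration. Raising $[I]$ increases the apparent $K_m$, so under this idealisation more enzyme (a larger $V_{\max}$, that is, more ''HIS3'' expression) is needed to sustain the same rate $v$. This is why the inhibitor concentration can act as a threshold on reporter expression.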
Sensitivity may also be modulated by varying the number of operator sequences in the reporter DNA.
Non-fusion proteins
A third, non-fusion protein may be co-expressed with two fusion proteins. Depending on the investigation, the third protein may modify one of the fusion proteins or mediate or interfere with their interaction.
Co-expression of the third protein may be necessary for modification or activation of one or both of the fusion proteins. For example, ''S. cerevisiae'' possesses no endogenous tyrosine kinase. If an investigation involves a protein that requires tyrosine phosphorylation, the kinase must be supplied in the form of a tyrosine kinase gene.
The non-fusion protein may mediate the interaction by binding both fusion proteins simultaneously, as in the case of ligand-dependent receptor dimerization.
For a protein with an interacting partner, its functional homology to other proteins may be assessed by supplying the third protein in non-fusion form, which then may or may not compete with the fusion-protein for its binding partner. Binding between the third protein and the other fusion protein will interrupt the formation of the reporter expression activation complex and thus reduce reporter expression, leading to the distinguishing change in phenotype.
Split-ubiquitin yeast two-hybrid
One limitation of classic yeast two-hybrid screens is that they are limited to soluble proteins. It is therefore impossible to use them to study the protein–protein interactions between insoluble integral membrane proteins. The split-ubiquitin system provides a method for overcoming this limitation. In the split-ubiquitin system, two integral membrane proteins to be studied are fused to two different ubiquitin moieties: a C-terminal ubiquitin moiety ("Cub", residues 35–76) and an N-terminal ubiquitin moiety ("Nub", residues 1–34). These fused proteins are called the bait and prey, respectively. In addition to being fused to an integral membrane protein, the Cub moiety is also fused to a transcription factor (TF) that can be cleaved off by ubiquitin-specific proteases. Upon bait–prey interaction, Nub and Cub moieties assemble, reconstituting the split-ubiquitin. The reconstituted split-ubiquitin molecule is recognized by ubiquitin-specific proteases, which cleave off the transcription factor, allowing it to induce the transcription of reporter genes.
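The residue ranges quoted for Nub (1–34) and Cub (35–76) are 1-based and inclusive, which is easy to get wrong when slicing a sequence in code. The sketch below shows the conversion on a placeholder string; the 76-character string is an assumption made purely for illustration and is not the real ubiquitin sequence, and the helper name `residues` is hypothetical.

```python
# Placeholder 76-residue string standing in for ubiquitin; NOT the real sequence.
UBIQUITIN_LENGTH = 76
placeholder_ubiquitin = "".join(
    chr(ord("A") + i % 26) for i in range(UBIQUITIN_LENGTH)
)


def residues(seq: str, first: int, last: int) -> str:
    """Return residues first..last (1-based, inclusive), the way residue
    ranges are conventionally written."""
    if not 1 <= first <= last <= len(seq):
        raise ValueError("residue range out of bounds")
    return seq[first - 1:last]


nub = residues(placeholder_ubiquitin, 1, 34)   # N-terminal moiety
cub = residues(placeholder_ubiquitin, 35, 76)  # C-terminal moiety

print(len(nub), len(cub))
print(nub + cub == placeholder_ubiquitin)
```

The two moieties together cover the whole molecule with no overlap, which the final comparison checks.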
Fluorescent two-hybrid assay
Zolghadr and co-workers presented a fluorescent two-hybrid system that uses two hybrid proteins that are fused to different fluorescent proteins as well as LacI, the lac repressor. The structure of the fusion proteins looks like this: FP2-LacI-bait and FP1-prey where the bait and prey proteins interact and bring the fluorescent proteins (FP1 = GFP, FP2 = mCherry) in close proximity at the binding site of the LacI protein in the host cell genome.
The system can also be used to screen for inhibitors of protein–protein interactions.
Enzymatic two-hybrid systems: KISS
While the original Y2H system used a reconstituted transcription factor, other systems create enzymatic activities to detect PPIs. For instance, the KInase Substrate Sensor ("KISS") is a mammalian two-hybrid approach that has been designed to map intracellular PPIs. Here, a bait protein is fused to a kinase-containing portion of TYK2 and a prey is coupled to a gp130 cytokine receptor fragment. When bait and prey interact, TYK2 phosphorylates STAT3 docking sites on the prey chimera, which ultimately leads to activation of a reporter gene.
One-, three- and one-two-hybrid variants
One-hybrid
The one-hybrid variation of this technique is designed to investigate protein–DNA interactions and uses a single fusion protein in which the AD is linked directly to the binding domain. The binding domain in this case however is not necessarily of fixed sequence as in two-hybrid protein–protein analysis but may be constituted by a library. This library can be selected against the desired target sequence, which is inserted in the promoter region of the reporter gene construct. In a positive-selection system, a binding domain that successfully binds the UAS and allows transcription is thus selected.
Note that selection of DNA-binding domains is not necessarily performed using a one-hybrid system, but may also be performed using a two-hybrid system in which the binding domain is varied and the bait and prey proteins are kept constant.
Three-hybrid
RNA-protein interactions have been investigated through a three-hybrid variation of the two-hybrid technique. In this case, a hybrid RNA molecule serves to adjoin together the two protein fusion domains—which are not intended to interact with each other but rather the intermediary RNA molecule (through their RNA-binding domains).
Techniques involving non-fusion proteins that perform a similar function, as described in the 'non-fusion proteins' section above, may also be referred to as three-hybrid methods.
One-two-hybrid
Simultaneous use of the one- and two-hybrid methods (that is, simultaneous protein–protein and protein–DNA interaction) is known as a one-two-hybrid approach and is expected to increase the stringency of the screen.
Host organism
Although theoretically, any living cell might be used as the background to a two-hybrid analysis, there are practical considerations that dictate which is chosen. The chosen cell line should be relatively cheap and easy to culture and sufficiently robust to withstand application of the investigative methods and reagents.
The latter is especially important for doing high-throughput studies. Therefore the yeast ''S. cerevisiae'' has been the main host organism for two-hybrid studies. However, it is not always the ideal system to study interacting proteins from other organisms. Yeast cells often do not have the same post-translational modifications, have a different codon usage or lack certain proteins that are important for the correct expression of the proteins. To cope with these problems several novel two-hybrid systems have been developed. Depending on the system used, agar plates or a specific growth medium is used to grow the cells and allow selection for interaction. The most commonly used method is agar plating, where cells are plated on selective medium to see if interaction takes place. Cells that have no interacting proteins should not survive on this selective medium.
''S. cerevisiae'' (yeast)
The yeast ''S. cerevisiae'' was the model organism used during the two-hybrid technique's inception. It is commonly known as the Y2H system. It has several characteristics that make it a robust organism to host the interaction, including the ability to form tertiary protein structures, neutral internal pH, enhanced ability to form disulfide bonds and reduced-state glutathione among other cytosolic buffer factors, to maintain a hospitable internal environment.
The yeast model can be manipulated through non-molecular techniques and its complete genome sequence is known.
Yeast systems are tolerant of diverse culture conditions and harsh chemicals that could not be applied to mammalian tissue cultures.
A number of yeast strains have been created specifically for Y2H screens, e.g. Y187 and AH109, both produced by Clontech. Yeast strains R2HMet and BK100 have also been used.
''Candida albicans''
''C. albicans'' is a yeast with a particular feature: it translates the CUG codon into serine rather than leucine. Due to this different codon usage it is difficult to use the model system ''S. cerevisiae'' as a Y2H to check for protein-protein interactions using ''C. albicans'' genes. To provide a more native environment a ''C. albicans'' two-hybrid (C2H) system was developed. With this system protein-protein interactions can be studied in ''C. albicans'' itself. A recent addition was the creation of a high-throughput system.
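The consequence of the CUG reassignment can be shown with a short translation sketch. It builds the standard codon table from its usual compact TCAG-ordered form and derives a second table in which only CTG (CUG in mRNA) is read as serine. The open reading frame `orf` is a hypothetical example, not a real ''C. albicans'' gene, and the function name `translate` is chosen only for illustration.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD_TABLE = {
    a + b + c: STANDARD_AA[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Same table, except CTG is read as serine instead of leucine.
CUG_SER_TABLE = {**STANDARD_TABLE, "CTG": "S"}


def translate(dna: str, table: dict[str, str]) -> str:
    """Translate a DNA (or RNA) coding sequence codon by codon,
    ignoring any incomplete trailing codon."""
    dna = dna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


orf = "ATGCTGGAACTGTAA"  # hypothetical open reading frame
standard_protein = translate(orf, STANDARD_TABLE)
cug_ser_protein = translate(orf, CUG_SER_TABLE)
print(standard_protein)
print(cug_ser_protein)
```

The same DNA therefore yields two different proteins depending on the host, which is why expressing ''C. albicans'' genes unchanged in ''S. cerevisiae'' can alter the protein being tested.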
''E. coli''
Bacterial two-hybrid methods (B2H or BTH) are usually carried out in ''E. coli'' and have some advantages over yeast-based systems. For instance, the higher transformation efficiency and faster rate of growth lend ''E. coli'' to the use of larger libraries (in excess of 10^8).
The absence of requirements for a
nuclear localisation signal to be included in the protein sequence and the ability to study proteins that would be toxic to yeast may also be major factors to consider when choosing an experimental background organism.
The methylation activity of certain ''E. coli'' DNA methyltransferase proteins may interfere with some DNA-binding protein selections. If this is anticipated, the use of an ''E. coli'' strain that is defective for a particular methyltransferase may be an obvious solution.
The B2H may not be ideal when studying eukaryotic protein-protein interactions (e.g. human proteins) as proteins may not fold as in eukaryotic cells or may lack other processing.
Mammalian cells
In recent years a mammalian two hybrid (M2H) system has been designed to study mammalian protein-protein interactions in a cellular environment that closely mimics the native protein environment. Transiently transfected mammalian cells are used in this system to find protein-protein interactions.
Using a mammalian cell line to study mammalian protein-protein interactions gives the advantage of working in a more native context.
Post-translational modifications such as phosphorylation, acylation and glycosylation are similar. The intracellular localization of the proteins is also more accurate than in a yeast two-hybrid system.
It is also possible with the mammalian two-hybrid system to study signal inputs.
Another big advantage is that results can be obtained within 48 hours after transfection.
''Arabidopsis thaliana''
In 2005 a two hybrid system in plants was developed. Using protoplasts of ''A. thaliana'' protein-protein interactions can be studied in plants. This way the interactions can be studied in their native context. In this system the GAL4 AD and BD are under the control of the strong 35S promoter. Interaction is measured using a GUS reporter. In order to enable a high-throughput screening the vectors were made gateway compatible.
The system is known as the protoplast two hybrid (P2H) system.
''Aplysia californica''
The sea hare ''A. californica'' is a model organism in neurobiology, used to study, among other things, the molecular mechanisms of long-term memory. To study interactions that are important in neurology in a more native environment, a two-hybrid system has been developed in ''A. californica'' neurons. A GAL4 AD and BD are used in this system.
''Bombyx mori''
An insect two-hybrid (I2H) system was developed in a silkworm cell line from the larva or caterpillar of the domesticated silk moth, ''Bombyx mori'' (BmN4 cells). This system uses the GAL4 BD and the activation domain of mouse NF-κB P65. Both are under the control of the OpIE2 promoter.
Applications
Determination of sequences crucial for interaction
By changing specific amino acids by mutating the corresponding DNA base-pairs in the plasmids used, the importance of those amino acid residues in maintaining the interaction can be determined.
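Substitutions of this kind are usually written in the form used for the N342V mutation of Gal11P mentioned earlier: wild-type residue, position, new residue. The following sketch applies such a substitution to a protein sequence and checks that the stated wild-type residue is really present. The helper name `apply_point_mutation`, the `offset` parameter and the five-residue fragment are hypothetical and serve only to illustrate the notation.

```python
import re


def apply_point_mutation(protein: str, mutation: str, offset: int = 1) -> str:
    """Apply a substitution written like 'N342V' (wild-type residue, 1-based
    position, new residue). `offset` is the residue number of protein[0],
    so fragments that do not start at residue 1 can be handled."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognised mutation: {mutation!r}")
    wild_type, position, mutant = match.group(1), int(match.group(2)), match.group(3)
    index = position - offset
    if not 0 <= index < len(protein):
        raise ValueError("position lies outside the sequence")
    if protein[index] != wild_type:
        raise ValueError(f"expected {wild_type} at {position}, found {protein[index]}")
    return protein[:index] + mutant + protein[index + 1:]


toy_fragment = "MKNLE"  # hypothetical fragment, numbered from residue 340
mutated = apply_point_mutation(toy_fragment, "N342V", offset=340)
print(mutated)
```

Testing the mutated and unmutated versions side by side in the two-hybrid assay then indicates whether the changed residue matters for the interaction.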
After using a bacterial cell-based method to select DNA-binding proteins, it is necessary to check the specificity of these domains, as there is a limit to the extent to which the bacterial cell genome can act as a sink for domains with an affinity for other sequences (or indeed, a general affinity for DNA).
Drug and poison discovery
Protein–protein signalling interactions pose suitable therapeutic targets due to their specificity and pervasiveness. The random drug discovery approach uses compound banks that comprise random chemical structures, and requires a high-throughput method to test these structures in their intended target.
The cell chosen for the investigation can be specifically engineered to mirror the molecular aspect that the investigator intends to study and then used to identify new human or animal therapeutics or anti-pest agents.
Determination of protein function
By determining the interaction partners of unknown proteins, the possible functions of these new proteins may be inferred.
This can be done by screening a single known protein against a library of unknown proteins or, conversely, by screening a single protein of unknown function against a library of known proteins.
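The inference step can be illustrated with a simple "guilt by association" tally: functions shared by several interaction partners become candidate functions for the uncharacterised protein. The sketch below is hypothetical throughout; the protein names, the annotation labels, and the `min_support` threshold are assumptions chosen for illustration.

```python
from collections import Counter


def infer_function(partners, annotations, min_support=2):
    """Rank function labels by how many annotated partners carry them.

    `partners` is an iterable of partner names found in a screen;
    `annotations` maps a protein name to a set of function labels.
    Labels supported by fewer than `min_support` partners are dropped.
    """
    counts = Counter()
    for partner in set(partners):
        counts.update(annotations.get(partner, set()))
    ranked = [(label, n) for label, n in counts.items() if n >= min_support]
    # Sort by descending support, then alphabetically for a stable order.
    return sorted(ranked, key=lambda item: (-item[1], item[0]))


# Hypothetical screen result for one protein of unknown function.
annotations = {
    "preyA": {"DNA repair", "cell cycle"},
    "preyB": {"DNA repair"},
    "preyC": {"transport"},
}
print(infer_function(["preyA", "preyB", "preyC"], annotations))
```

A tally of this kind only suggests hypotheses; given the error rates of two-hybrid screens, any inferred function would still need independent confirmation.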
Zinc finger protein selection
To select zinc finger proteins (ZFPs) for protein engineering, methods adapted from the two-hybrid screening technique have been used with success.
A ZFP is itself a DNA-binding protein used in the construction of custom DNA-binding domains that bind to a desired DNA sequence.
By using a selection gene with the desired target sequence included in the UAS, and randomising the relevant amino acid sequences to produce a ZFP library, cells that host a DNA-ZFP interaction with the required characteristics can be selected. Each ZFP typically recognises only 3–4 base pairs, so to prevent recognition of sites outside the UAS, the randomised ZFP is engineered into a 'scaffold' consisting of another two ZFPs of constant sequence. The UAS is thus designed to include the target sequence of the constant scaffold in addition to the sequence for which a ZFP is selected.
A number of other DNA-binding domains may also be investigated using this system.
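The scaffold design described above can be sketched as follows. The example assumes exactly 3 base pairs per finger, uses made-up triplets, and treats the order of sub-sites as a free parameter; the function names and the figure of six fully randomised amino acid positions are likewise illustrative assumptions, not values taken from a particular study.

```python
def composite_site(scaffold_sites, target_site, position):
    """Place `target_site` among the constant scaffold sub-sites.

    `position` is the 0-based index at which the target triplet is inserted
    (an illustrative convention; the real finger-to-triplet order depends on
    the construct).
    """
    for site in [*scaffold_sites, target_site]:
        if set(site) - set("ACGT"):
            raise ValueError(f"not a DNA sequence: {site!r}")
    sites = list(scaffold_sites)
    sites.insert(position, target_site)
    return "".join(sites)


def library_size(randomised_positions, alphabet_size=20):
    """Number of protein variants if each position is fully randomised."""
    return alphabet_size ** randomised_positions


# Hypothetical triplets: two constant scaffold sub-sites and one target.
print(composite_site(["GCG", "GTG"], "TGG", position=1))
print(library_size(6))
```

The second function shows why selection rather than one-by-one testing is used: the number of possible variants grows exponentially with the number of randomised positions.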
Strengths
*Two-hybrid screens are low-tech; they can be carried out in any lab without sophisticated equipment.
*Two-hybrid screens can provide an important first hint for the identification of interaction partners.
*The assay is scalable, which makes it possible to screen for interactions among many proteins. Furthermore, it can be automated, and by using robots many proteins can be screened against thousands of potentially interacting proteins in a relatively short time. Two types of large screens are used: the library approach and the matrix approach.
*Yeast two-hybrid data can be of similar quality to data generated by the alternative approach of coaffinity purification followed by mass spectrometry (AP/MS).
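To give a sense of the scale involved in the matrix approach, the sketch below enumerates the pairwise tests for a small set of proteins. It assumes that a matrix screen means testing every defined bait against every defined prey; the identifiers and the option to skip self-pairs are illustrative assumptions.

```python
from itertools import product


def matrix_screen_plan(baits, preys, include_self=True):
    """List every (bait, prey) pair tested in an all-against-all matrix screen."""
    return [
        (bait, prey)
        for bait, prey in product(baits, preys)
        if include_self or bait != prey
    ]


baits = ["P1", "P2", "P3"]  # hypothetical protein identifiers
preys = ["P1", "P2", "P3", "P4"]
plan = matrix_screen_plan(baits, preys, include_self=False)
print(len(plan), "pairwise tests")
```

Because the number of tests is the product of the bait and prey counts, screens of thousands of proteins are practical only with automation.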
Weaknesses
*The main criticism applied to the yeast two-hybrid screen of protein–protein interactions is the possibility of a high number of false positive (and false negative) identifications. The exact rate of false positive results is not known, but earlier estimates were as high as 70%. This partly explains the very small overlap often found between the results of different (high-throughput) two-hybrid screens, especially when different experimental systems are used.
The reason for this high error rate lies in the characteristics of the screen:
*Certain assay variants overexpress the fusion proteins, which may cause unnatural protein concentrations that lead to nonspecific (false) positives.
*The hybrid proteins are fusion proteins; that is, the fused parts may inhibit certain interactions, especially if an interaction takes place at the N-terminus of a test protein (where the DNA-binding or activation domain is typically attached).
*An interaction may not happen in yeast, the typical host organism for Y2H. For instance, if a bacterial protein is tested in yeast, it may lack a chaperone for proper folding that is only present in its bacterial host. Moreover, a mammalian protein is sometimes not correctly modified in yeast (e.g., missing phosphorylation), which can also lead to false results.
*The Y2H assay takes place in the nucleus. If test proteins are not localized to the nucleus (because they have other localization signals), two interacting proteins may be found to be non-interacting.
*Some proteins might specifically interact when they are co-expressed in the yeast, although in reality they are never present in the same cell at the same time. However, in most cases it cannot be ruled out that such proteins are indeed expressed in certain cells or under certain circumstances.
Each of these points alone can give rise to false results. Because of the combined effects of all error sources, yeast two-hybrid results have to be interpreted with caution. The probability of generating false positives means that all interactions should be confirmed by a high-confidence assay, for example co-immunoprecipitation of the endogenous proteins, which is difficult for large-scale protein–protein interaction data. Alternatively, Y2H data can be verified using multiple Y2H variants or bioinformatics techniques. The latter test whether interacting proteins are expressed at the same time, share some common features (such as gene ontology annotations or certain network topologies), or have homologous interactions in other species.
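A bioinformatic check of this kind can be sketched as a simple evidence count. The example below is an assumption-laden illustration, not an established scoring scheme: the function name, the data layout, the equal weighting of the three criteria, and all identifiers are hypothetical, and the network-topology criterion is left out for brevity.

```python
def support_score(pair, go_terms, expressed_in, interologs):
    """Count independent lines of evidence (0 to 3) for a candidate pair.

    go_terms:     protein -> set of annotation identifiers
    expressed_in: protein -> set of conditions or tissues where it is expressed
    interologs:   set of frozensets naming pairs with a known homologous
                  interaction in another species
    """
    a, b = pair
    shared_annotation = bool(go_terms.get(a, set()) & go_terms.get(b, set()))
    co_expressed = bool(expressed_in.get(a, set()) & expressed_in.get(b, set()))
    has_interolog = frozenset(pair) in interologs
    return sum([shared_annotation, co_expressed, has_interolog])


# Hypothetical annotations for one bait and two candidate preys.
go_terms = {"baitX": {"termA", "termB"}, "preyY": {"termB"}, "preyZ": {"termC"}}
expressed_in = {"baitX": {"root"}, "preyY": {"leaf"}, "preyZ": {"root"}}
interologs = {frozenset(("baitX", "preyY"))}

for prey in ("preyY", "preyZ"):
    print(prey, support_score(("baitX", prey), go_terms, expressed_in, interologs))
```

Pairs with little or no supporting evidence are not necessarily false, but they are natural candidates for confirmation by an independent assay first.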
See also
*Phage display, an alternative method for detecting protein–protein and protein–DNA interactions
*Protein array, a chip-based method for detecting protein–protein interactions
*Synthetic genetic array analysis, a yeast-based method for studying gene interactions
References
External links
*Detail on sister technique two-hybrid system
*Science Creative Quarterly's overview of the yeast two hybrid system
*Gateway-Compatible Yeast One-Hybrid Screens
*BioGrid Database with protein-protein interactions
Cell biology
Molecular biology
Protein–protein interaction assays
Systems biology