ChIA-PET
   HOME

TheInfoList



OR:

Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET or ChIA-PETS) is a technique that incorporates chromatin immunoprecipitation (ChIP)-based enrichment, chromatin proximity ligation,
Paired-End Tags Paired-end tags (PET) (sometimes "Paired-End diTags", or simply "ditags") are the short sequences at the 5’ and 3' ends of a DNA fragment which are unique enough that they (theoretically) exist together only once in a genome, therefore making th ...
, and
High-throughput sequencing DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. Th ...
to determine ''de novo'' long-range
chromatin Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in r ...
interactions genome-wide.
Genes In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
can be
regulated Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
by regions far from the promoter such as regulatory elements, insulators and boundary elements, and transcription-factor binding sites (TFBS). Uncovering the interplay between
regulatory Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
regions and gene coding regions is essential for understanding the mechanisms governing
gene regulation Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are wi ...
in
health Health, according to the World Health Organization, is "a state of complete physical, mental and social well-being and not merely the absence of disease and infirmity".World Health Organization. (2006)''Constitution of the World Health Organiza ...
and
disease A disease is a particular abnormal condition that negatively affects the structure or function of all or part of an organism, and that is not immediately due to any external injury. Diseases are often known to be medical conditions that a ...
(Maston et al., 2006). ChIA-PET can be used to identify unique, functional
chromatin Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in r ...
interactions between distal and proximal
regulatory Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
transcription-factor binding sites and the promoters of the genes they interact with. ChIA-PET can also be used to unravel the mechanisms of
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
control during processes such as cell differentiation, proliferation, and
development Development or developing may refer to: Arts *Development hell, when a project is stuck in development *Filmmaking, development phase, including finance and budgeting *Development (music), the process thematic material is reshaped * Photograph ...
. By creating ChIA-PET
interactome In molecular biology, an interactome is the whole set of molecular interactions in a particular cell. The term specifically refers to physical interactions among molecules (such as those among proteins, also known as protein–protein interactions, ...
maps for DNA-binding regulatory proteins and promoter regions, we can better identify unique targets for therapeutic intervention (Fullwood & Yijun, 2009).


Methodology

The ChIA-PET method combines ChIP-based methods, and
Chromosome conformation capture Chromosome conformation capture techniques (often abbreviated to 3C technologies or 3C-based methods) are a set of molecular biology methods used to analyze the spatial organization of chromatin in a cell. These methods quantify the number of int ...
(3C) based methods, to extend the capabilities of both approaches. ChIP-Sequencing (ChIP-Seq) is a popular method used to identify TFBS while 3C has been used to identify long-range chromatin interactions. Independently, both suffer from limitations in identifying de-novo long-range interactions genome wide. While ChIP-Seq is able to identify TFBS genome-wide, it provides only linear information of protein binding sites along the chromosomes (but not interactions between them), and can suffer from high genomic background noise (false positives). While 3C is capable of analyzing non-linear, long-range chromatin interactions, it cannot be used genome wide and, like ChIP-Seq, also suffers from high levels of background noise. Since the noise increases in relation to the distance between interacting regions (max 100kb), laborious and tedious controls are required for accurate characterization of chromatin interactions. Unlike 3C which is a locus-specific interaction profiling method, alternative methods such as
Hi-C Hi-C is a fruit juice–flavored drink made by the Minute Maid division of The Coca-Cola Company. It was created by Niles Foster in 1946 and released in 1947. The sole original flavor was orange. History Niles Foster, a former bakery and ...
have been established to profile interactions genome wide. Despite whole genome profiling methods for both TFBS and long range interactions, combining approaches with the ChIA-PET method allows for identification of genomic areas in which the protein of interest is bound as well as the genomic region which it interacts with. The ChIA-PET method successfully resolves the issues of non-specific interaction noise found in ChIP-Seq by sonicating the ChIP fragments in order to separate random attachments from specific interaction complexes. The next step, which is referred to as enrichment, reduces complexity for genome-wide analysis and adds specificity to chromatin interactions bound by pre-determined TFs (transcription factors). The ability of 3C approaches to identify long-range interactions is based on the theory of proximity ligation. In regards to DNA inter-ligation, fragments that are tethered by common protein complexes have greater kinetic advantages under dilute conditions, than those freely diffusing in solution or anchored in different complexes. ChIA-PET takes advantage of this concept by incorporating linker sequences onto the free ends of the DNA fragments tethered to the protein complexes. In order to build connectivity of the fragments tethered by regulatory complexes, the linker sequences are ligated during nuclear proximity ligation. Therefore, the products of linker-connected ligation can be analyzed by ultra-high-throughput PET sequencing and mapped to the
reference genome A reference genome (also known as a reference assembly) is a digital nucleic acid sequence database, assembled by scientists as a representative example of the set of genes in one idealized individual organism of a species. As they are assemble ...
. Since ChIA-PET is not dependent on specific sites for detection as 3C and 4C are, it allows unbiased, genome-wide de-novo detection of chromatin interactions. Compared to Hi-C, the use of an antibody pulldown limits the number of sequenced fragments to chromatin interactions bound by the protein of interest which also can ease the data analysis.


Workflow

Wet-lab portion of the workflow: * Formaldehyde is used to cross-link the DNA-protein complexes. Sonication is used to break-up the chromatin and also to reduce non-specific interactions. * A specific antibody of choice is used to enrich protein-of-interest–bound chromatin fragments. ChIP material bound by the antibody are used to construct the ChIA-PET. * Figure 1. Biotinylated oligonucleotide half-linkers containing flanking MmeI sites are used to connect proximity ligated DNA fragments. Two different linkers are designed (A and B) with specific nucleotide barcodes (CG or AT) for each of the two linker sequences (this will allow the identification of the chimeric ligation product as described in Figure 5.). * Figure 2. The linkers are ligated to the tethered DNA fragments. * Figure 3. The linker fragments are ligated on the ChIP beads under dilute conditions. The purified DNA is then digested by MmeI, which cuts at a distance from its recognition site to release the tag-linker-tag structure. * Figure 4. The biotinylated PETs are then immobilized on streptavidin-conjugated magnetic beads. * Figure 5. PET sequences with AA (CG/CG) and BB (AT/AT) linker barcode composition are considered to be possible intra-complex ligation products, while the PET sequences with AB (CG/AT) linker composition are considered to be derived from chimeric ligation products between DNA fragments bounded in different chromatin complexes. Dry-lab portion of the workflow: PET extraction, mapping, and statistical analyses The PET tags are extracted and mapped to the reference human genome ''in silico''. Identification of ChIP enriched peaks (binding sites) Self-ligated PET are used for identifying ChIP enriched sites because they provide the most reliable mapping (20 + 20 bit/s) to the reference genome. ChIP enrichment peak-finding algorithm A called peak is considered a binding site if there are multiple overlapping self-ligated PETs. The false discovery rate (FDR) is determined using statistical simulations to estimate the random background of PET-derived virtual DNA overlaps, and the estimated background noise. Filtering of repetitive DNA (affects non-specific binding) Satellite regions and binding sites present in regions with severe structural variations are removed. ChIP enrichment count The numbers of self-ligation and inter-ligation PETs (within + 250 bp window) are reported at each site. The total number of self-ligated and inter-ligated PETs at a specific site is called the ChIP enrichment count. Figure 6. PET Classification: Uniquely aligned PET sequences can be classified by whether they are derived from one DNA fragment or two DNA fragments. * Self-ligation PETs If the two tags of a PET are mapped on the same chromosome with the genomic span in the range of ChIP DNA fragments (less than 3 Kb), with expected self-ligation orientation and on the same strand, they are considered to be derived from a self-ligation of a single ChIP DNA fragment, and considered a self-ligation PET. * Inter-ligation PETs If a PET does not fit into these criteria, then the PET most likely resulted from a ligation product between two DNA fragments and referred to as an inter-ligation PET. The two tags of an inter-ligation PETs do not have fixed tag orientations, might not be found on the same strands, might have any genomic span, and might not map to the same chromosome. * Intrachromosomal inter-ligation PETs If the two tags of an inter-ligation PET are mapped in the same chromosome but with a span > 3 Kb in any orientation, then these PETs are called intrachromosomal inter-ligation PETs. * Interchromosomal inter-ligation PETs PETs which are mapped to different chromosomes are called interchromosomal inter-ligation PETs. Figure 7. Proposed mechanism showing how distal regulatory elements can initiate long-range chromatin interactions involving promoter regions of target genes. The interactions form DNA loop structures with multiple TFBS at the anchoring center. Small loops might package genes near the anchoring center in a tight sub-compartment, which could increase the local concentration of regulatory proteins for enhanced transcriptional activation. This mechanism might also enhance transcription efficiency, allowing RNA pol II to cycle the tight circular gene templates. The large interaction loops are more likely to link together distant genes at either end of the loop residing near anchor sites for coordinated regulation, or could separate genes in long loops to prevent their activation. Adapted from Fullwood et al. (2009).


Strengths and weaknesses

Advantages of the ChIA-PET method * ChIA-PET has a potential to be an unbiased, whole-genome and de-novo approach for long-range chromatin interaction analysis (Fullwood & Yijun, 2009). * A ChIA-PET experiment is capable of providing two global datasets: The protein factor binding sites (self-ligated PETs); and the interactions between the binding sites (inter-ligated PETs). * ChIA-PET involves ChIP to reduce the complexity for genome-wide analysis and adds specificity to chromatin interactions bound by specific factors of interest. * ChIA-PET is compatible with tag-based next-generation sequencing approaches such as Roche 454 pyrosequencing, Illumina GA, ABI SOLiD, and Helicos. * ChIA-PET is applicable to many different protein factors involved in transcriptional regulation or chromatin structural conformation. * ChIA-PET analysis can be applied to chromatin interactions involved in a particular nuclear process. By using general TFs such as RNA Polymerase II, it may be possible to identify all chromatin interactions involved in transcription regulation. Further, the use of protein factors involved in DNA replication or chromatin structure would allow identification of all interactions due to DNA replication and chromatin structural modification (Fullwood et al., 2009). Weaknesses * It is well established that cis and trans-regulatory complexes contain unique combinations of proteins based on cell and tissue specific conditions (Dekker et al., 2006). While identification of single, functional TFBS is a significant advancement, the use of ChIA-PET to identify individual proteins in a complex would require guess work and multiple experiments to identify each interacting protein. This would be a costly and time-consuming endeavour. * ChIA-PET is limited by the quality, purity, and specificity of the antibodies used (Fullwood et al., 2009). * ChIA-PET is dependent on identification of sequences that can be mapped to the reference sequence (ref). * ChIA-PET requires the use of peak-calling computer algorithms to organize and map PET reads to the reference genome. Because of variations between software platforms, results can vary depending on which program is used. * Although repetitive DNA regions can be associated with gene regulation (Polak & Domany, 2006), they need to be removed as they can affect the data (Fullwood et al., 2009). *Enrichment from two sites simultaneously introduces bias against surrounding regions.


History

Fullwood et al. (2009), used ChIA-PET to detect and map the chromatin interaction network mediated by
estrogen receptor alpha Estrogen receptor alpha (ERα), also known as NR3A1 (nuclear receptor subfamily 3, group A, member 1), is one of two main types of estrogen receptor, a nuclear receptor (mainly found as a chromatin-binding protein) that is activated by the sex ...
(ER-alpha) in human cancer cells. The resulting global chromatin interactome map revealed that remote ER-alpha-binding sites were also anchored to gene promoters through long-range chromatin interactions suggesting that ER-alpha functions by extensive chromatin looping in order to bring genes together for coordinated transcriptional regulation.


Analysis and software


Alternatives


Chromatin immunoprecipitation (ChIP):

The original ChIP method is an antibody-based technology that identify and bind proteins selectively in order to offer information regarding chromatin states and gene
transcription Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including: Genetics * Transcription (biology), the copying of DNA into RNA, the fir ...
br>


Genome architecture mapping, Genome Architecture Mapping (GAM):

This technique eliminates a number of drawbacks associated with 3C-based techniques by collecting three-dimensional proximities between any number of genomic loci.


Split-Pool Recognition of Interactions by Tag Extension (SPRITE)

SPRITE is a technique for mapping higher-order interactions in the nucleus across the genome. This approach detects interactions that occur over greater spatial distances and it allows for genome-wide detection of numerous
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydra ...
and DNA interactions that occur at the same time.


ChIA-Drop

ChIA-Drop is a straightforward method for analyzing multiplex chromatin interactions using droplet-based and barcode-linked sequencing at single-molecule accuracy. Previous pairwise population-level approaches such as Hi-C and ChIA-PET are distinct from this technology.


References

{{Reflist * Barski et al., (2007). High-resolution profiling of histone methylations in the human genome. Cell. (129); 823–37. * Dekker, (2002). Capturing chromosome conformation. Science. (295); 1306–1311. * Dekker, (2006). The three ‘C’ s of chromosome conformation capture: controls, controls, controls. Nat. Methods. (3); 17–21. * Fullwood et al., (2009). An oestrogen-receptor-α bound human chromatin interactome. Nature. (462); 58–64. * Fullwood & Yijun, (2009). ChIP-based methods for the identification of long-range chromatin interactions. J Cell Biochem. 107(1); 30–39. * Johnson et al., (2007). Genome-wide mapping of in vivo protein-DNA interactions. Science. (316); 1497–502. * Kuo & Allis, (1999). In-vivo cross-linking and immunoprecipitation for studying dynamic Protein: DNA associations in a chromatin environment. Methods. (19); 425–33. * Li, G., Fullwood, M.J., Xu, H., Mulawadi, F.H., Velkov, S., Vega, V., Ariyaratne, P.N., Mohamed, Y.B., Ooi, H.S., Tennakoon, C., Wei, C.L., Ruan, Y. and Sung, W.K. ChIA-PET tool for comprehensive chromatin interaction analysis with paired-end tag sequencing. Genome Biol, 11 (2). R22. * Maston et al., (2006). Transcriptional Regulatory Elements in the Human Genome. Annu. Rev: Genomics. Hum Genet. (7); 29–59. * Polak & Domany, (2006). Alu elements contain many binding sites for transcription factors and may play a role in regulation of developmental processes. BMC Genomics. (7); 133. * Wei et al., (2006). A global map of p53 transcription-factor binding sites in the human genome. Cell. (124); 207–19.


External links


ChIA-PET Genome Browser
- This browser is for viewing the data from Fullwood et al. (2009), and includes a custom Whole Genome Interaction Viewer which provides a macroscopic picture of binding sites and interactions along with a whole genome landscape. Molecular biology techniques Nuclear organization