HOME

TheInfoList



OR:

CAncer Personalized Profiling by deep Sequencing (CAPP-Seq) is a next-generation sequencing based method used to quantify circulating DNA in
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
(
ctDNA Circulating tumor DNA (ctDNA) is tumor-derived fragmented DNA in the bloodstream that is not associated with cells. ctDNA should not be confused with cell-free DNA (cfDNA), a broader term which describes DNA that is freely circulating in the blood ...
). The method was introduced in 2014 by Ash Alizadeh and Maximilian Diehn’s laboratories at Stanford, as a tool for measuring Cell-free tumor DNA which is released from dead tumor cells into the blood and thus may reflect the entire tumor genome. This method can be generalized for any cancer type that is known to have recurrent mutations. CAPP-Seq can detect one molecule of mutant DNA in 10,000 molecules of healthy DNA. The original method was further refined in 2016 for ultra sensitive detection through integration of multiple error suppression strategies, termed integrated Digital Error Suppression (iDES). The use of ctDNA in this technique should not be confused with circulating tumor cells (CTCs); these are two different entities. Originally described as a method to detect and monitor lung cancers, CAPP-Seq has been successfully adapted for a broad range of cancers by multiple independent groups. These include
diffuse large B-cell lymphoma Diffuse large B-cell lymphoma (DLBCL) is a cancer of B cells, a type of lymphocyte that is responsible for producing antibody, antibodies. It is the most common form of non-Hodgkin lymphoma among adults, with an annual Incidence (epidemiology), in ...
(DLBCL),
follicular lymphoma Follicular lymphoma (FL) is a cancer that involves certain types of white blood cells known as lymphocytes. The cancer originates from the uncontrolled division of specific types of B-cells known as centrocytes and centroblasts. These cells normal ...
(FL),
post-transplant lymphoproliferative disorder Post-transplant lymphoproliferative disorder (PTLD) is the name given to a B cell proliferation due to therapeutic immunosuppression after organ transplantation. These patients may develop infectious mononucleosis-like lesions or polyclonal polymor ...
(PTLD), metastatic
colorectal cancer Colorectal cancer (CRC), also known as bowel cancer, colon cancer, or rectal cancer, is the development of cancer from the colon or rectum (parts of the large intestine). Signs and symptoms may include blood in the stool, a change in bowel m ...
to ovary,
esophageal cancer Esophageal cancer is cancer arising from the esophagus—the food pipe that runs between the throat and the stomach. Symptoms often include difficulty in swallowing and weight loss. Other symptoms may include pain when swallowing, a hoarse voice ...
,
pancreatic cancer Pancreatic cancer arises when cell (biology), cells in the pancreas, a glandular organ behind the stomach, begin to multiply out of control and form a Neoplasm, mass. These cancerous cells have the malignant, ability to invade other parts of t ...
,
bladder cancer Bladder cancer is any of several types of cancer arising from the tissues of the urinary bladder. Symptoms include blood in the urine, pain with urination, and low back pain. It is caused when epithelial cells that line the bladder become mali ...
,
leiomyosarcoma Leiomyosarcoma is a malignant (cancerous) smooth muscle tumor. A benign tumor originating from the same tissue is termed leiomyoma. While leiomyosarcomas are not thought to arise from leiomyomas, some leiomyoma variants' classification is evolvi ...
, diverse adult and pediatric
sarcoma A sarcoma is a malignant tumor, a type of cancer that arises from transformed cells of mesenchymal (connective tissue) origin. Connective tissue is a broad term that includes bone, cartilage, fat, vascular, or hematopoietic tissues, and sarcom ...
s, among others.


Method

Population analysis is performed to identify recurrent mutations in a given cancer type. This is done by analyzing public data sets such as the
COSMIC cancer database COSMIC is an online database of somatically acquired mutations found in human cancer. Somatic mutations are those that occur in non-germline cells that are not inherited by children. COSMIC, an acronym of ''Catalogue Of Somatic Mutations In Cancer' ...
and TCGA. A ‘selector’ is designed which consists of
biotin Biotin (or vitamin B7) is one of the B vitamins. It is involved in a wide range of metabolic processes, both in humans and in other organisms, primarily related to the utilization of fats, carbohydrates, and amino acids. The name ''biotin'', bor ...
ylated DNA oligonucleotide probes targeting the recurrently mutated regions chosen for the specific cancer type. The selector is chosen using a multiphase bioinformatics approach. Using the selector, a probe-based hybridization capture is performed on tumor and normal DNA to discover mutations specific to the patient. The hybridization capture is then also applied to ctDNA to quantify the mutations that were previously discovered.


ctDNA Extraction and Library Preparation

Peripheral blood is collected from patients and ctDNA is isolated from ≥1 mL of
plasma Plasma or plasm may refer to: Science * Plasma (physics), one of the four fundamental states of matter * Plasma (mineral), a green translucent silica mineral * Quark–gluon plasma, a state of matter in quantum chromodynamics Biology * Blood pla ...
. Input DNA can be as low as 4 ng. There were four main goals in adapting this protocol for ctDNA work: *1) to optimize the adapter ligation efficiency *2) to reduce the number of PCR cycles needed after ligation *3) to preserve the naturally occurring size distribution of ctDNA (median 170 base pairs) *4) to minimize the variability in depth of sequence coverage across all the captured regions These were achieved by allowing adaptor ligation to be carried out at 16℃ for 16 hours to increase adaptor ligation efficiency and recovery. The most important adaptation is during enzymatic and clean-up steps; they are performed with-bead, in order to minimize tube transfer steps which increases recovery.


Selector design

In CAPP-Seq, design of selector is a crucial step that identifies recurrent mutations in a particular cancer type using publicly available next generation sequencing data. For inclusion in CAPP-seq selector, the recurrent mutations that are enriched in a population is described by an index- Recurrence Index (RI). RI is the number of mutations per kilobase of a given genomic locus of a patient carrying particular mutations. RI represents a patient level recurrence frequency estimated for somatic mutations and all mutations. Known and driver recurrent mutations in a population can be ranked based on the RI and therefore RI is used to design a selector. A six phase design strategy is employed to design selector. *Phase-1: Identifying frequently mutated known driver mutations using the publicly available data. *Phase-2: Maximum coverage of SNVs among the patients was identified by ranking their exonic RI. *Phase-3 and 4: Exons with higher RI were selected. *Phase-5: Addition of previously predicted driver mutations. *Phase-6: Addition of recurrent gene fusions rearrangement that are specific for particular cancer. Human cancer is heterogeneous and recurrent cancer mutations are present only a minority of patient. Therefore, a careful and non-redundant design of selector is the vital part in CAPP-Seq and also the size of the selector is related to its downstream costs.


Hybridization capture and sequencing

Hybridization capture with the selector probe set is performed on tumor DNA from a biopsy and sequenced to a depth of ~10,000× coverage. The biotinylated selector probes bind selectively to the regions of the DNA library that were chosen to be where the recurrently mutations occur in the given cancer type. In this way you are left with a smaller library that is enriched for only the regions you want, which can then be sequenced. This allows the determination of patient specific mutations. Hybridization capture with the same selector is then performed on ctDNA from the blood to quantify the previously identified mutations in the patient. CAPP-Seq can be applied to ctDNA from multiple blood samples at different time points in order to follow tumor evolution.


Computational pipeline for CAPP-seq

A series of steps are involved in analysis of CAPP-Seq data from mutation detection to validation and open source software can do most of the analysis. After the first step of variant calling, germline and
loss of heterozygosity Loss of heterozygosity (LOH) is a type of genetic abnormality in diploid organisms in which one copy of an entire gene and its surrounding chromosomal region are lost. Since diploid cells have two copies of their genes, one from each parent, a sing ...
(LOH) mutations are removed in CAPP-seq to reduce the background biases. Several statistical significance tests can be performed against background to all type of variant calling. For example, statistical significance of tumor-derived SNVs can be estimated by random sampling of background alleles using
Monte Carlo method Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be determi ...
. For the indel calls, statistical significance is calculated applying a separate method that used a strand specific analysis by
Z-test A ''Z''-test is any statistical test for which the distribution of the test statistic under the null hypothesis can be approximated by a normal distribution. Z-tests test the mean of a distribution. For each significance level in the confidence ...
shown in previous work. Finally, a computational validation steps reduces the false positive calls. However, a robust computational framework specific for CAPP-seq data analysis is a high demand in this field.


Sensitivity

Sensitivity of this technology depends on the effective design of selector and highly biased with the size of the cohort and type of cancer under study. The lack of background to find the statistically significant recurrent variants has limited its performance due to stochastic noise and biological variability.
Receiver operating characteristic A receiver operating characteristic curve, or ROC curve, is a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied. The method was originally developed for operators of ...
(ROC) analysis on several cancer patient and cancer cured patient (sample collected at different tumor stages, circulating DNA time point, treatment, etc.) showed that CAPP-seq has higher sensitivity and specificity compared to previous methods in non–small-cell lung cancer.


Limitations

The detection limit of CAPP-Seq is affected by three main areas: the input amount of ctDNA molecules, sample cross-contamination, potential allelic bias in the capture reagent, and PCR or sequencing errors. ctDNA is able to be detected at a lower limit of 0.025% fractional abundance in the blood. Sample cross-contamination was found to be a very small contribution and reports have shown minimal allelic bias towards capture of reference alleles in PBLs ( peripheral blood lymphocytes). PCR and sequencing errors are also minimal. The technique becomes questionable when ctDNA is present at low levels of 0.01%. Also, when there is less discharge of ctDNA due to stability of tumor growth by therapy, the detection is compromised. Whether ctDNA is released at equal or unequal rate from primary tumors and
metastatic disease Metastasis is a pathogenic agent's spread from an initial or primary site to a different or secondary site within the host's body; the term is typically used when referring to metastasis by a cancerous tumor. The newly pathological sites, then, ...
s is still unknown. This fact should be taken into consideration while performing CAPP-Seq as it can cause problems in determining tumor burden and clonal evolution if different tumors or clones are dying off and releasing their DNA at different rates. It is also unknown how tumor histology affects ctDNA release. Another major limitation with using only ctDNA levels to detect tumor burden is that ctDNA can only predict residual tumor, it can tell nothing about the location of the tumor. This means that CAPP-Seq can be best used in complementary with other sequencing approaches for imaging disease burden at different times. Thus, technical sensitivity, reproducibility, specificity and requirement of expertise for analysis of large amount of data are some of the concerned issues with the technique.


Advantages

CAPP-Seq has many advantages over other methods such as
digital polymerase chain reaction Digital polymerase chain reaction (digital PCR, DigitalPCR, dPCR, or dePCR) is a biotechnological refinement of conventional polymerase chain reaction methods that can be used to directly quantify and clonally amplify nucleic acids strands includi ...
(dPCR) and
amplicon In molecular biology, an amplicon is a piece of DNA or RNA that is the source and/or product of amplification (molecular biology), amplification or DNA replication, replication events. It can be formed artificially, using various methods including ...
sequencing. CAPP-Seq can survey many loci in the same experiment compared to dPCR and amplicon sequencing which use multiple different experiments and therefore use up much more sample. Another advantage is that CAPP-Seq can not only detect point mutations but it can also detect
indels Indel is a molecular biology term for an insertion or deletion of bases in the genome of an organism. It is classified among small genetic variations, measuring from 1 to 10 000 base pairs in length, including insertion and deletion events that ...
,
structural variations Genomic structural variation is the variation in structure of an organism's chromosome. It consists of many kinds of variation in the genome of one species, and usually includes microscopic and submicroscopic types, such as deletions, duplications, ...
, and
copy number variations Copy number variation (CNV) is a phenomenon in which sections of the genome are repeated and the number of repeats in the genome varies between individuals. Copy number variation is a type of structural variation: specifically, it is a type of G ...
and also aids in monitoring minimal residual disease. Another advantage of CAPP-Seq is that because it only targets specific areas of interest in the genome it is more cost effective than whole
exome sequencing Exome sequencing, also known as whole exome sequencing (WES), is a genomic technique for sequencing all of the protein-coding regions of genes in a genome (known as the exome). It consists of two steps: the first step is to select only the subse ...
and
whole genome sequencing Whole genome sequencing (WGS), also known as full genome sequencing, complete genome sequencing, or entire genome sequencing, is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a s ...
which are 171X and 44X more expensive respectively. Also, there is no need of discrete streamlining for individual patients. Using circulating tumor DNA as opposed to solid tumor biopsies allows analysis of the full repertoire of tumor cells dispersed throughout the tumor and distant metastasis. Therefore, there is a better chance of finding all mutations associated with this cancer. Having a full overview of the cancer and what is driving it will allow for better treatment plans and management of disease.


Applications


Monitoring tumor burden

When treating cancer it is useful to have precise measurements of the total body disease burden. It helps with determining prognostic significance and treatment response. This is normally done using computed tomography (
CT scans A computed tomography scan (CT scan; formerly called computed axial tomography scan or CAT scan) is a medical imaging technique used to obtain detailed internal images of the body. The personnel that perform CT scans are called radiographers ...
), positron emission tomography (
PET scans Positron emission tomography (PET) is a functional imaging technique that uses radioactive substances known as radiotracers to visualize and measure changes in metabolic processes, and in other physiological activities including blood flow, r ...
) or magnetic resonance imaging (
MRI Magnetic resonance imaging (MRI) is a medical imaging technique used in radiology to form pictures of the anatomy and the physiological processes of the body. MRI scanners use strong magnetic fields, magnetic field gradients, and radio waves ...
). These medical imaging procedures are expensive and are not without their own problems. These imaging techniques are not able to accurately resolve small tumors (≤1 cm in diameter). Imaging can also be affected by radiation-induced inflammation and fibrotic changes, making it hard to determine if there is residual tumor or just effects of treatment. It has been found that levels of ctDNA in plasma significantly correlate with tumor volume as compared with medical imaging (CT, PET and MRI)., Detection of ctDNA can predict residual tumor or imminent relapse, in some cases even better than medical imaging and current methods.


Prognostic indicator

Detection of ctDNA has been found to be a predictor of relapse in multiple studies thus far. In a study in late-stage
NSCLC Non-small-cell lung cancer (NSCLC) is any type of epithelial lung cancer other than small-cell lung carcinoma (SCLC). NSCLC accounts for about 85% of all lung cancers. As a class, NSCLCs are relatively insensitive to chemotherapy, compared to sm ...
(non-small cell lung cancer) they found two cases where ctDNA correctly determined the outcome of a patient when medical imaging was wrong. In one case, the imaging predicted relapse based on a suspected residual tumor which turned out to only be radiation-induced inflammation, but ctDNA was not detected and the patient did not relapse. In another case, the imaging showed no tumor but ctDNA was detected and the patient relapsed shortly afterward. In another study on
DLBCL Diffuse large B-cell lymphoma (DLBCL) is a cancer of B cells, a type of lymphocyte that is responsible for producing antibody, antibodies. It is the most common form of non-Hodgkin lymphoma among adults, with an annual Incidence (epidemiology), in ...
(diffuse large B cell lymphoma), ctDNA was also found to be predictive of relapse.


Biopsy-free tumor genotyping

Biopsies are invasive and associated with risks to the patient. Therefore, multiple biopsies to monitor disease progression are rare and diagnostic biopsies are relied on for genetic information. This can be problematic because of tumor heterogeneity and tumor evolution. Firstly, biopsies only sample one portion of the tumor, and because tumors are heterogeneous, this will not cover the full genetic landscape of the tumor. Secondly, after treatment tumors evolve and there may be new mutations not represented in the diagnostic sample. Biopsy-free tumor genotyping, by way of CAPP-Seq and ctDNA, addresses many of these issues. A simple blood test is non-invasive and much safer and easier to subject cancer patients to multiple times through the course of treatment. Using ctDNA gives a better sample of tumor DNA compared to a single area of a tumor collected in a biopsy, allowing a better estimate of tumor heterogeneity. Taking multiple samples of ctDNA at different time points following the course of treatment allows tumor evolution to be uncovered. This can help detect the emergence of mutations that confer resistance to a targeted therapy and allow the course of treatment to be adjusted accordingly. CAPP-Seq specifically allows for the screening of multiple genomic locations which will become important as the list of cancer mutations important for treatment continues to grow. In a study for late stage NSCLC, they performed a version of CAPP-Seq where the tumor biopsy was not sequenced first, and they were able to correctly classify 100% of patient plasma samples with a 0% false positive rate. This shows that even without previous knowledge of tumor mutations, they can be accurately discovered by ctDNA alone.


References

{{reflist Cancer DNA