Detection of Rare Mutations in CtDNA Using Next Generation Sequencing

Xiaoxing Lv; Meiru Zhao; Yuting Yi; Lucheng Zhang; Yanfang Guan; Tao Liu; Ling Yang; Rongrong Chen; Jianhui Ma; Xin Yi

doi:10.3791/56342

JoVE Journal > Cancer Research

Cancer Research

Detection of Rare Mutations in CtDNA Using Next Generation Sequencing

Published: August 24, 2017

doi:

10.3791/56342

Xiaoxing Lv, Meiru Zhao, Yuting Yi, Lucheng Zhang, Yanfang Guan, Tao Liu, Ling Yang, Rongrong Chen, Jianhui Ma, Xin Yi

¹Geneplus-Beijing Institute

Summary

This manuscript describes a technique for detecting mutations of low frequency in ctDNA, ER-Seq. This method is differentiated by its unique use of two-directional error correction, a special background filter, and efficient molecular acquirement.

Abstract

The analysis of circulating tumor DNA (ctDNA) using next-generation sequencing (NGS) has become a valuable tool for the development of clinical oncology. However, the application of this method is challenging due to its low sensitivity in analyzing the trace amount of ctDNA in the blood. Furthermore, the method may generate false positive and negative results from this sequencing and subsequent analysis. To improve the feasibility and reliability of ctDNA detection in the clinic, here we present a technique which enriches rare mutations for sequencing, Enrich Rare Mutation Sequencing (ER-Seq). ER-Seq can distinguish a single mutation out of 1 x 10⁷ wild-type nucleotides, which makes it a promising tool to detect extremely low frequency genetic alterations and thus will be very useful in studying disease heterogenicity. By virtue of the unique sequencing adapter's ligation, this method enables an efficient recovery of ctDNA molecules, while at the same time correcting for errors bidirectionally (sense and antisense). Our selection of 1021 kb probes enriches the measurement of target regions that cover over 95% of the tumor-related driver mutations in 12 tumors. This cost-effective and universal method enables a uniquely successful accumulation of genetic data. After efficiently filtering out background error, ER-seq can precisely detect rare mutations. Using a case study, we present a detailed protocol demonstrating probe design, library construction, and target DNA capture methodologies, while also including the data analysis workflow. The process to carry out this method typically takes 1-2 days.

Introduction

Next-generation sequencing (NGS), a powerful tool to investigate the mysteries of the genome, can provide a large quantity of information, which may reveal genetic alterations. The application of NGS analysis in the clinic has become more common, especially for personalized medicine. One of the greatest limitations of NGS, however, is a high error rate. Although it is deemed suitable for studying inherited mutations, the analysis of rare mutations is greatly limited¹^,², especially when analyzing DNA obtained from a "liquid biopsy".

Circulating tumor DNA (ctDNA) is cell-free DNA (cfDNA) in the blood that is shed from tumor cells. In most cases, the quantity of ctDNA is extremely low, which make its detection and analysis very challenging. However, ctDNA has many attractive features: its isolation is minimally invasive, it can be detected in the early stages of tumor growth, the ctDNA level reflects therapeutic efficiency, and ctDNA contains DNA mutations found in both primary and metastatic lesions³^,⁴^,⁵. Therefore, given the rapid development of the NGS technique and analysis, the application of ctDNA detection has become more attractive.

Different massively parallel sequencing approaches have been utilized for ctDNA detection but none of these approaches have been accepted for routine use in clinics due to their limitations: low sensitivity, lack of versatility, and a relatively high cost⁶^,⁷^,⁸. For example, duplex sequencing, based on a unique identifier tag (UID), repeatedly corrects errors in the consensus bidirectionally, rectifying most sequencing errors. However, the feasibility of this method is lost due to its high cost and low data utilization⁹^,¹⁰. Similarly, CAPP-Seq and its improved iteration, CAPP-IDES¹¹^,¹², have greater practicality in cfDNA detection, though the accuracy and universality of these methods need improvement.

To meet the current need for accurate ctDNA detection and analysis, we developed a new strategy, Enrich Rare Mutation Sequencing (ER-Seq). This approach combines the following: unique sequencing adapters to efficiently recover ctDNA molecules, with bidirectional error correction and the ability to distinguish a single mutation out of > 1 × 10⁷ wild-type nucleotides; 1021 kb probes which enrich measurement of target regions that cover over 95% of the tumor-related mutations from 12 tumors, including lung cancer, colorectal cancer, gastric cancer, breast cancer, kidney cancer, pancreatic cancer, liver cancer, thyroid cancer, cervical cancer, esophageal cancer, and endometrial carcinoma (Table 1); and baseline database screening making it efficient and easy to precisely detect rare mutations in ctDNA.

To build a baseline database, find all the gene mutations by ER-Seq from a number of the same type of samples (~1000 at the beginning). These real mutations must be verified by several other reliable detection methods and analysis. Next, summarize the pattern of false mutations and cluster all the false mutations to build the initial baseline database. Continue adding false mutations found from subsequent sequencing experiments to this database. Therefore, this baseline database becomes a dynamic expanded database, which significantly improves sequencing accuracy.

To promote progress in tumor diagnosis and monitoring, we present ER-Seq, a low cost and feasible method for the acquisition of universal data. We present a case study which underwent ER-Seq analysis, demonstrating its accuracy for detecting rare mutations and feasibility for use in the clinic.

Protocol

Tumor specimens and blood samples were obtained according to a protocol approved by the Ethics Committee of Peking University People's Hospital. Written informed consent was obtained from the patients to use their samples. Participants were screened according to the following criteria: female, advanced Non-Small Cell Lung Cancer, EGFR p.L858R mutation indicated by previous Sanger Sequencing, disease progression following two session of EGFR targeted therapy with Erlotinib, and ER-Seq which was applied to ctDNA to analyze the cause of resistance and find new target drugs.

1. DNA Extraction from Peripheral Blood for cfDNA and genomic DNA (gDNA)

Collect 10 mL of peripheral blood in a collection tube, invert gently up and down 6-8 times to mix. Avoid cell shearing. Blood samples can be stored in the collection tube at 6-37 °C for up to 72 hours.
Centrifuge the collection tube at room temperature for 10 min at 1600 (±150) x g.
Transfer the supernatant (plasma) from the collection tube to four clean 2 mL appropriately labelled centrifuge tubes using a disposable pipet without disturbing the pellet layer (white blood cells).
Transfer1 mL of the cells using a clean disposable pipet to a 2 mL appropriately labelled centrifuge tube. The cells can be stored at -20 °C or colder before step 1.8.
Centrifuge the plasma (from step 1.3) for 10 min at 4 °C, and 16000 (±150) x g.
Transfer the plasma to clean centrifuge tubes properly labelled as "plasma", leave ~0.1 mL of residual volume at the bottom to avoid contamination. Store the plasma at -80 °C until step 1.7.
Use 3 mL of plasma from step 1.6 to isolate cfDNA (contains ctDNA) using a commercially available kit, following the manufacturer's instructions.
Use 200 µL of white blood cells from step 1.3 to isolate gDNA using a commercially available kit, following the manufacturer's instructions.
NOTE: Both cfDNA and gDNA can be stored at -20 °C before use.
Apply different quality controls for cfDNA and gDNA (Table 2). Perform quality control by a standardized method for the bioanalyzer to determine levels of sample degradation and/or gDNA contamination. Peak analysis of the quality of cfDNA extraction should appear similar to Figure 1.
NOTE: The peak size of cfDNA is around 170 bp. Note that increasing the amount of DNA will result in a higher quality library for the effective detection of rare mutations in cfDNA. We recommend using at least 30 ng cfDNA (30,000 copies).

2. Library Preparation

NOTE: Regarding base library construction on fragmented DNA, cfDNA exists in fragments with a peak size ~170 bp and thus does not need to be fragmented.

Fragment the control sample (gDNA) by sonication to obtain 200-250 bp fragments.
1. Prepare 1 µg of gDNA sample in 100 µL of Tris-EDTA buffer in a clean sonication tube.
2. Set the sonication program as 30 s on and 30 s off for 12 cycles for a total of 12 min.
3. After sonication, confirm the product (5 µL) contains 200-250 bp fragments by running an analysis with 2% agarose gel.
4. Transfer all fragmentation products to a new 1.5 mL tube and incubate with 150 µL of magnetic beads for 5 min to select the correct fragments.
5. Remove the supernatant by putting the tube on a magnetic rack for 30 s.
6. Wash the beads with 200 µL of 80% ethanol (freshly prepared) with the tube staying on the magnetic rack. Incubate for 30 s before removing the supernatant. Repeat once.
7. Air dry the beads with the tube lid open while staying on the magnetic rack. Avoid overdried beads to make sure that the DNA target recovers well. Elute DNA fragments from the beads by adding 32 µL of 10 mM Tris-HCl (pH 8) to the beads to resolve the DNA fragments followed by quantification by UV/Vis spectroscopy.
  NOTE: At least 200 ng is needed for the following experiments.
End Prep Reaction
NOTE: Follow manufacturer's instruction and modify as needed. Use 30 ng of cfDNA; and 1 µg of gDNA as the control.
1. Add 7 µL of reaction buffer, 3 µL of the enzyme mix, 30 ng of cfDNA in a sterile nuclease-free tube, and add ddH₂O to total volume of 60 µL. In a separate tube, add the above components but with 1 µg of gDNA instead of cfDNA.
2. Mix and incubate the mixture at 20 °C for 30 min followed by 65 °C for 30 min in a thermocycler without the lid heated. Proceed immediately to the next step.
Adapter Ligation
1. Add 30 µL of the ligation master mix, 1 µL of the ligation enhancer, and 4 µL of the unique sequencing adapters to the mixture from step 2.2.
2. Incubate at 20 °C for 15 min in a thermocycler without the lid heated.
3. Vortex to resuspend the magnetic beads and leave the beads at room temperature for at least 30 min before the next step.
4. Add 87 µL of the resuspended magnetic beads to the previously mentioned mixture; mix well by pipetting up and down and then incubate at room temperature for 5 min.
5. Wash and air dry the beads as described in Step 2.1.
6. Elute the DNA target from the beads by adding 20 µL of 10 mM Tris-HCl (pH 8).
Enrichment of DNA fragments ligated with adapter
1. Add 20 µL of adapter ligated DNA fragments, 5 µL of Index Primer/i7 (2.5 µL Primer-P7 (10 pM) and 2.5 µL of Primer-P5 (10 pM), for sequencing purposes), 25µL of library amplification master mix, and ddH₂O to a total volume of 50 µL.
2. PCR amplify the adapter ligated DNA and thermal cycle as follows: initial denaturation at 98 °C for 45 s, followed by 8 cycles (cfDNA) or 4 cycles (gDNA) of annealing °C for 15 s, 65 °C for 30 s, 72 °C for 30 s), and a final extension at 72 °C for 60 s, then holding the temperature at 4 °C.
3. Add 45 µL of suspended magnetic beads to the PCR-enriched DNA to get acquired fragments.
4. Elute DNA fragments with adapters in 30 µL of 10 mM Tris-HCl (pH 8).
Library Quality Control
1. Follow the manufacturer's protocol for the commercial kit to measure adapter ligated DNA concentration. Use the bioanalyzer to determine the DNA quality.

3. Targeted DNA capture

NOTE: Target enrichment was performed using a custom sequence capture-probe which is specifically designed for a 1021 kb target enrichment region covering known tumor-associated driver mutations from 12 different types of tumors. Modifications to the manufacturer's protocol are detailed in the following steps.

Blocking
1. Add 1.5 µg of pooling libraries (maximum number of libraries to pool for cfDNA is 6, gDNA is 20) from step 2, 8 µL of P5 Block(100P), 8 µL of P7 Block(100P), and 5 µL of Cot-1 DNA (1 µg/µL) in a sterile tube. Dry the contents of the tube using a vacuum concentrator set at 60 °C.
Hybridize the DNA capture probes with the library.
1. Add the following components to the tube from step 3.1: 8.5 µL of 2X Hybridization buffer, 2.7 µL of Hybridization enhancer, and 1.8 µL of nuclease-free ddH₂O.
2. Mix by pipetting up and down and incubate in a thermomixer at 95 °C for 10 min.
3. Following incubation, immediately add 4 µL of the Custom Probe. Incubate samples in a thermal cycler at 65 °C with lid heated to 75 °C for 4 h.
DNA fragments capture
1. Incubate the hybridized target DNA with streptavidin beads followed by washing the beads to remove unbound DNA. Use the commercial kit according to manufacturer's instructions to get a final 20 µL of resuspended beads with captured DNA fragments.
Amplification of the captured DNA fragments
1. Perform PCR using the commercial kit with 2 µM backbone oligonucleotides, according to manufacturer's instructions.
2. When the PCR reaction is completed, add 45 µL of suspended beads directly to the PCR product and enrich the amplified target DNA fragments bound to the beads by elution with 30 µL of 10 mM Tris-HCl (pH 8).
Quantify the DNA fragments with the library quantification kit and determine the average fragment length by the Bioanalyzer (Figure 2).

4. Sequencing

Sequence multiplexed libraries using 75 bp paired-end runs on a benchtop sequencer for 18 h. Total data produced is up to 60 Gigabytes, 15G is required for patient sample cfDNA and 1G for control sample gDNA.

5. Data analysis workflow

NOTE: Figure 3 displays the general work flow and data analysis process. Data analysis parameters and commands are shown below.

Filter low-quality reads and correct for errors and masking of UID using NCfilter.
NCfilter(own)
NCfilter filter -l 5 -q 0.5 -n 0.1 -T 3 -G -1 fastq1 -2 fastq2
realSeq(own)
python bin/realRealSeq MaskUID -1 fq1 -2 fq2 -o output dir -p prefix -u 4 -s 7 -m 3 -i uidFile -f.
Referencing genome hg19 (human genome 19), locate the sequencing results using BWA (version 0.7.12-r1039) mem and realign with Genome Analysis Toolkit (GATK) (version 3.4-46-gbc02625).
bwa mem -t 3 hg19 fastq1 fastq2
java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g –
Djava.io.tmpdir=/tmp/ -jar
GenomeAnalysisTK.jar -T RealignerTargetCreator -R hg19 -L targetRegion -known dbsnp —
allowPotentiallyMisencodedQuals -nt 10 -I cancerBam -I normalBam
java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g —
Djava.io.tmpdir=/tmp/ -jar GenomeAnalysisTK.jar -T IndelRealigner -R hg19 -known dbsnp
-allowPotentiallyMisencodedQuals — targetIntervals intervals -I cancerBam -I normalBam
Cluster the duplicates according to the UID and position of the template fragments, which will correct the error bases introduced by PCR/sequencing and modify the base quality.
realSeq(own)
python realSeq/bin/realSeq CLUSTER -b unmarkdup_sort_merge.bam -r chipRegion -o output_dir -p
prefix -m 6
Re-align the clustered duplicates by BWA mem.
bwa mem -t 3 hg19 fastq1 fastq2
Perform a local realignment followed by a recalibration of base quality score using GATK (version 3.4-46-gbc02625).
Loacal realignment: java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g -Djava.io.tmpdir=/tmp/ -jar GenomeAnalysisTK.jar -T RealignerTargetCreator -R hg19 -L targetRegion -known dbsnp -allowPotentiallyMisencodedQuals -nt 10 -I cancerBam -I normalBam
java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g -Djava.io.tmpdir=/tmp/ -jar GenomeAnalysisTK.jar -T IndelRealigner -R hg19 -known dbsnp -allowPotentiallyMisencodedQuals -targetIntervals intervals -I cancerBam -I normalBam
base quality-score recalibration: java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g -Djava.io.tmpdir=/tmp/ -jar GenomeAnalysisTK.jar -T BaseRecalibrator -R hg19 -L targetRegion -knownSites dbsnp -knownSites cosmic -allowPotentiallyMisencodedQuals -nct 10 -I bam -o grp java -d64 -server -XX:+UseParallelGC -XX:ParallelGCThreads=8 -Xms2g -Xmx10g -Djava.io.tmpdir=/tmp/ -jar GenomeAnalysisTK.jar -T PrintReads -I bam -BQSR grp -o outbam -R hg19 -nct 8 -allowPotentiallyMisencodedQuals
For SNV (single-nucleotide variant) and INDEL (small insertions or deletions) calling, the accuracy of the low frequency mutations is further confirmed by GSR (Good Support Reads) with both forward and reverse strand read pairs and filtration through control group (germline mutations) and the baseline database.
realDcaller(own)
python realDcaller -f hg19 -r chipRegion -C baselineDB -O -L 2 cancerBam normalBam -p 30
Use baseline samples as a control for CNV (copy number variation) calling.
NCcnv(own)
python NCcnv -o cnvdir -t tmpdir -r chipRegion cnv –normgt normalBam –tumgt cancerBam –repdb repeatMarsk
Realigned unmapped, clips and discordant pair reads to call SV (Structural Variations).
NCsv(own)
python NCsv -t tmpdir -r hg19 -o outputdir -x transcript -w bwapath -n 4 normalBam cancerBam

Representative Results

The 1021 kb probes enriched target regions used in ER-Seq are shown in Table 1, which covers over 95% of the gene mutations in 12 common tumors. The wide range of these probes makes this process applicable to a majority of cancer patients. Additionally, our unique sequencing adapters and baseline database screening make it possible to detect rare mutations precisely.

Due to the different properties of gDNA and cfDNA extracted from peripheral blood, there is a range of data quality, which is determined by the different instruments and quality controls shown in Table 2. Successfully extracted cfDNA should display a peak size of ~170 bp, as analyzed by the bioanalyzer QC and shown in Figure 1. Large fragments indicate contamination from genomic DNA, which should not appear in the final product. DNA extracted from this patient sample was of sufficient quality and quantity for ER-Seq (cfDNA – 42.6 ng; gDNA – 3.426 µg).

Target DNA capture using a custom probe was then amplified by PCR. The average fragment size was evaluated by the bioanalyzer and representative data for a patient's cfDNA in Figure 2A showed a peak around ~320 bp and a unique band on agarose gel. gDNA should appear as a smear on an agarose gel, as shown in Figure 2B. The qualities of both libraries were acceptable for the subsequent sequencing.

After sequencing, the data was analyzed in order according to the work flow in Figure 3B. To illustrate the advantages of our ER-Seq versus the traditional method, we performed both analyses using the same sample. Results from both analyses are shown in Table 3 and Table 4. We showed that ER-Seq improves coverage depth by 23% compared with the traditional method (Table 3, 3214.3 reads in ER-Seq vs 2475 reads in traditional analysis), which was due to its efficient recovery of cfDNA molecules, thus greatly enhancing the analysis of rare mutations. It is also clear that the unique sequencing adapters used in ER-Seq enable easy differentiation of natural and PCR-induced duplications (Table 3, 0 PCR induced duplicated read in ER-Seq). Additionally, we found in the calling results that analysis based on ER-Seq was 100% consistent for EGFR p.L858R detection and that the frequency of detection was a bit higher (2.7% vs 1.2%) when compared with traditional analysis (Table 4). Importantly, ER-Seq analysis enabled the detection of other relatively low-frequency mutations, including EGFR p.T790M (variation frequency 0.53%) (Table 4), and this was not recognized by traditional analysis due to high background noise.

Figure 1. Representative QC Result of Successfully Isolated cfDNA
For the left graphs, the X axis shows fragment size (bp) and the Y axis indicates relative fluorescence units (FU). The primary size of cfDNA is ~170 bp and there are no large fragments of contamination. A simulated electrophoresis gel is shown on the right. Please click here to view a larger version of this figure.

Figure 2. Example of Libraries QC Analyzed.
The X axis shows the fragment size (bp) and the Y axis indicates the relative fluorescence units (FU). A simulated electrophoresis gel was shown on the right of each graph. (A) Patient cfDNA sample showed a sharp peak after it was amplified with adapters. (B) Control gDNA sample showed evenly distributed fragments with no sharp peak and no bias toward one side. Both samples were acceptable for subsequent sequencing. Please click here to view a larger version of this figure.

Figure 3. ER-Seq Work Flow.
(A) General work flow. (B) Data analysis work flow. Please click here to view a larger version of this figure.

2735 exons from all the exon region of 70 genes
ABL1	ABL2	AKT1	AKT2	AKT3	ALK	APC	AR	ARAF	ATM
ATR	AURKA	AURKB	AXL	BAP1	BCL2	BRAF	BRCA1	BRCA2	BRD2
BRD3	BRD4	BTK	C11orf30	C1QA	C1S	CBL	CCND1	CCND2	CCND3
CCNE1	CD274	CDH1	CDK13	CDK4	CDK6	CDK8	CDKN1A	CDKN1B	CDKN2A
CDKN2B	CHEK1	CHEK2	CRKL	CSF1R	CTNNB1	DDR1	DDR2	DNMT3A	EGFR
EPHA2	EPHA3	EPHA5	ERBB2	ERBB3	ERBB4	ERCC1	ERG	ESR1	EZH2
FAT1	FBXW7	FCGR2A	FCGR2B	FCGR3A	FGFR1	FGFR2	FGFR3	FGFR4	FLCN
FLT1	FLT3	FLT4	FOXA1	FOXL2	GAB2	GATA3	GNA11	GNAQ	GNAS
HDAC1	HDAC4	HGF	HRAS	IDH1	IDH2	IGF1R	IL7R	INPP4B	IRS2
JAK1	JAK2	JAK3	KDR	KIT	KRAS	MAP2K1	MAP2K2	MAPK1	MAPK3
MCL1	MDM2	MDM4	MED12	MET	MITF	MLH1	MLH3	MPL	MS4A1
MSH2	MSH3	MSH6	MTOR	MYC	MYD88	NF1	NF2	NOTCH1	NOTCH2
NOTCH3	NOTCH4	NRAS	NTRK1	NTRK3	PALB2	PDGFRA	PDGFRB	PDK1	PIK3CA
PIK3CB	PIK3R1	PIK3R2	PMS1	PMS2	PRKAA1	PSMB1	PSMB5	PTCH1	PTCH2
PTEN	PTPN11	RAF1	RARA	RB1	RET	RHEB	RHOA	RICTOR	RNF43
ROCK1	ROS1	RPS6KB1	SMARCA4	SMARCB1	SMO	SRC	STAT1	STAT3	STK11
SYK	TMPRSS2	TOP1	TP53	TSC1	TSC2	VEGFA	VHL	XPO1	XRCC1
introns, promoters and breakpoints or fusion region from the following 24 genes
ALK	FGFR1	FGFR2	FGFR3	NTRK1	NTRK3	PDGFRA	PDGFRB	ROS1	RET
MET	BRAF	ABL1	BRD3	BRD4	EGFR	RAF1	BCR	ERG	TMPRSS2
RARA	KIF5B	BCL2L11	TERT
other relative genes: 1122 exons from 847 genes

Table 1. 1021 kb Probes Enriched Target Regions.
These regions included: 2735 exons from 170 genes; introns, promoter regions and breakpoint regions from 24 genes; 1122 exons from 847 related genes. These cover over 95% of the tumor related driver mutations and target mutation sites from the 12 most common tumors (lung cancer, colorectal cancer, gastric cancer, breast cancer, kidney cancer, pancreatic cancer, liver cancer, thyroid cancer, cervical cancer, esophageal cancer, and endometrial carcinoma).

DNA	Result	Indication	Ideal Range
cfDNA	Fragment Size	Identification of DNA Fragment distribution	168bp±20(a)
cfDNA	Concentration	More accurate DNA quantification	Total cfDNA ≥30ng(b)
gDNA	A260/A230	Identification of chemical contaminants (e.g., ethanol)	1.50 – 2.2
	A260/A280	Identification of protein contaminants	1.60 – 2.2
	Concentration	DNA quantification	Total gDNA >3ug

Table 2. Quality Analysis of cfDNA and gDNA Extracted from Peripheral Blood.
The peak size of cfDNA is around 170Bp. Quality control should be performed by a standardized method for the 2100 Bioanalyzer to determine levels of sample degradation and/or gDNA contamination. Peak analysis of the quality of the cfDNA extraction should appear similar to Figure 1. Note that increased DNA amount will result in a library with a higher quality for the effective detection of rare mutations in cfDNA. We recommend using at least 30 ng cfDNA (30,000 copies).

Table 3. The QC of ER-Seq Information Analysis and Traditional Information Analysis.
The dark gray highlight indicates a coverage depth increase occurred in ER-Seq compared with traditional methods; a light gray highlight indicates no PCR induced duplicated read in ER-Seq. Please click here to view a larger version of this figure.

Gene	Chr	Start	End	cHGVS	pHGVS	Function	caseAltIsHot	ER-Seq var_freq	Traditional var_freq
TP53	17	7577534	7577535	c.746G>T	p.R249M	Missense	HighFreq	0.0400	0.0378
EGFR	7	55259514	55259515	c.2573T>G	p.L858R	Missense	Actionable	0.0270	0.0120
ATM	11	108203618	108203619	c.7919delC	p.T2640Ifs*6	frameshift	ND	0.0169	0.0180
PTCH1	9	98221917	98221918	c.2851G>T	p.D951Y	Missense	ND	0.0154	0.0260
EGFR	7	55249070	55249071	c.2369C>T	p.T790M	Missense	Actionable	0.0053	——(0.0013)
RB1	13	49039151	49039152	c.2230A>T	p.I744F	Missense	ND	0. 0036	——(0.0014)

Table 4. The Calling Result of ER-Seq Information Analysis and Traditional Information Analysis.

Discussion

The existence of circulating tumor DNA (ctDNA) was discovered more than 30 years ago, however the application of ctDNA analyses is still not routine in clinical practice. Interest in the practical application of ctDNA methods has increased with the development of technologies for ctDNA detection and analysis. Tumor monitoring with ctDNA offers a minimally-invasive approach for the assessment of microscopic residual disease, response to therapy, and tumor molecular profiles under the background of tumor evolution and intratumoral heterogeneity¹³^,¹⁴. Improvements in the sensitivity and specificity of analysis will facilitate all these applications¹⁵^,¹⁶.

In this manuscript, we introduced ER-Seq, a promising method aimed at improving the sensitivity of ctDNA detection, using a NSCLC case as an example. Compared with traditional analysis, the application of our unique sequencing adapters in ER-Seq significantly improved its sensitivity and specificity, indicated by a coverage depth increase and an ability to detect low-frequency mutations (such as EGFR p.T790M (0.53%)). The existence of T790M in this patient sample may be a contributing factor to Erlotinib resistance, and gave insight into how to better treat this patient, suggesting the use of a third-generation EGFR inhibitor specific for T790M, such as AZD929117-20. Another critical point in ER-Seq that contributes to its sensitivity and specificity is the baseline database screening, which facilitates the precise detection of low-frequency mutations in ctDNA by filtering out background noise.

Although ER-Seq has many advantages that make it superior to traditional methods, there are still some challenges that need to be overcome. One of the major challenges for ER-Seq is the extremely low level of ctDNA present in the blood. For those low-frequency mutations, the acquisition of sufficient template ctDNA is critical. Our unique adapters significantly increase the recycle rate of ctDNA, though they still need to be improved. Another challenge for ER-Seq, and for all the other sequencing techniques, is the limitation of coverage of target regions. Our selected 1021 kb probe enriched regions, which can cover about 95% of tumor-related mutations, are more comprehensive compared with other small-plane methods²¹^,²²^,²³. However, as tumor heterogeneity has been well recognized and more and more tumor biomarkers have been discovered, the relative coverage rate of our probes enriched regions decreases. To better serve as a tool for early diagnosis and disease monitoring of tumor patients, more comprehensive sequencing and analysis techniques are needed. Currently, ER-Seq still relies on a high volume of data for detecting low-frequency mutations. Further improvement of data utilization could be favorable for the application of ER-Seq.

As discussed above, many limitations associated with ER-Seq exist. However, our unique analysis technique still has many advantages compared with other top-ranking methods in this field. The pros and cons of this technique merit further research. The potential of ER-Seq for routine clinical use will be of great importance for clinical physicians, providing them with a powerful tool to diagnose tumors and monitor tumor dynamics and response to therapy. Novel mutations associated with resistance to conventional and targeted therapy found by our analysis might offer new avenues of treatment for cancer patients with advanced disease.

Disclosures

The authors have nothing to disclose.

Acknowledgements

This work is supported by Geneplus–Beijing Institute.

Materials

QIAamp Circulating Nucleic Acid Kit	Qiagen	55114	DNA Extraction from Peripheral Blood for cfDNA
QIAamp DNA Blood Mini Kit	Qiagen	51105	DNA Extraction from Peripheral Blood for gDNA
Quant-iT dsDNA HS Assay Kit	Invitrogen	Q32854	Measure cfDNA concentration
Quant-iT dsDNA BR Assay Kit	Invitrogen	Q32853	Measure library concentration
Agilent DNA 1000 Reagents	Agilent	5067-1504	Measure cfDNA and library fragments
The NEBNext UltraII DNA Library Prep Kit for Illumina	NEB	E7645L	Library Preparation
Agencourt SPRIselect Reagent	Beckman	B23317	DNA fragment screening and purification
Tris-HCl (10 mM, pH 8.0)-100ML	Sigma	93283	Dissolution
xGen Lockdown Probes	IDT	——	xGen Custom Probe
Human Cot-1 DNA	Life	15279-011	Targeted DNA capture
Dynabeads M-270 Streptavidin	Life	65305	Targeted DNA capture
xGen Lockdown Reagents	IDT	1072281	Targeted DNA capture
KAPA HiFi HotStart ReadyMix	KAPA	KK2602	post-capture PCR enrichment libraries
KAPA Library Quantification Kit	KAPA	KK4602	Measure library concentration
NextSeq 500 High Output Kit v2((150 cycles)	illumina	FC-404-2002	Sequence
Centrifuge5810	eppendorf	5810
Nanodrop8000	Thermo Scientific	8000	Measure gDNA concentration
Qubit 2.0	Invitrogen		Quantify
Agilent 2100 Bioanalyzer	Agilent
ThermoMixer C	eppendorf		Incubation
16-tube DynaMagTM-2 Magnet	Life	12321D
Concentrator plus	eppendorf
PCR	AB	simplyamp
QPCR	AB	7500Dx
NextSeq 500	illumina

References

Kennedy, S. R., et al. Detecting ultralow-frequency mutations by Duplex Sequencing. Nat. Protoc. 9, 2586-2606 (2014).
Shendure, J., Ji, H. Next-generation DNA sequencing. Nat Biotechnol. 26, 1135-1145 (2008).
Bettegowda, C., et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci. Transl. Med. 6, (2014).
Bratman, S. V., Newman, A. M., Alizadeh, A. A., Diehn, M. Potential clinical utility of ultrasensitive circulating tumor DNA detection with CAPP-Seq. Expert Rev. Mol. Diagn. 15, 715-719 (2015).
Diaz, L. A., Bardelli, A. Liquid biopsies: genotyping circulating tumor DNA. J. Clin. Oncol. 32, 579-586 (2014).
Crowley, E., Di Nicolantonio, F., Loupakis, F., Bardelli, A. Liquid biopsy: monitoring cancer-genetics in the blood. Nat. Rev. Clin. Oncol. 10, 472-484 (2013).
Su, Z., et al. A platform for rapid detection of multiple oncogenic mutations with relevance to targeted therapy in non-small-cell lung cancer. J. Mol. Diagn. 13, 74-84 (2011).
Kukita, Y., et al. High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients. DNA Res. 22, 269-277 (2015).
Schmitt, M. W., et al. Sequencing small genomic targets with high efficiency and extreme accuracy. Nat. Methods. 12, 423-425 (2015).
Schmitt, M. W., et al. Detection of ultra-rare mutations by next-generation sequencing. Proc. Natl. Acad. Sci. USA. 109, 14508-14513 (2012).
Newman, A. M., et al. An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage. Nat. Med. 20, 548-554 (2014).
Newman, A. M., et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nat Biotechnol. 34 (5), 547-555 (2016).
Alix-Panabières, C., Schwarzenbach, H., Pantel, K. Circulating tumor cells and circulating tumor DNA. Annu. Rev. Med. 63, 199-215 (2012).
Dawson, S. J., et al. Analysis of circulating tumor DNA to monitor metastatic breast cancer. N. Engl. J. Med. 368, 1199-1209 (2013).
Li, H., Durbin, R. Fast and accurate short-read alignment with Burrows- Wheeler transform. Bioinformatics. 25, 1754-1760 (2009).
Youn, A., Simon, R. Identifying cancer driver genes in tumor genome sequencing studies. Bioinformatics. 27, 175-181 (2011).
Forshew, T., et al. Noninvasive identification and monitoring of cancer mutations by targeted deep sequencing of plasma DNA. Sci. Transl. Med. 4, (2012).
Kobayashi, S., et al. EGFR mutation and resistance of non-small-cell lung cancer to gefitinib. N. Engl. J. Med. 352, 786-792 (2005).
Kuang, Y., Rogers, A., Yeap, Y., Wang, L., Makrigiorgos, M., Vetrand, K., Thiede, S., Distel, R. J., Jänne, P. A. Noninvasive detection of EGFR T790M in gefitinib or erlotinib resistant non-small cell lung cancer. Clin. Cancer Res. 15, 2630-2636 (2009).
Taniguchi, K., Uchida, J., Nishino, K., Kumagai, T., Okuyama, T., Okami, J., Higashiyama, M., Kodama, K., Imamura, F., Kato, K. Quantitative detection of EGFR mutations in circulating tumor DNA derived from lung adenocarcinomas. Clin. Cancer Res. 17, 7808-7815 (2011).
Leary, R. J., et al. Development of personalized tumor biomarkers using massively parallel sequencing. Sci. Transl. Med. 2, (2010).
Forshew, T., et al. Noninvasive identification and monitoring of cancer mutations by targeted deep sequencing of plasma DNA. Sci. Transl. Med. 4, (2012).
Bratman, S. V., Newman, A. M., Alizadeh, A. A., Diehn, M. Potential clinical utility of ultrasensitive circulating tumor DNA detection with CAPP-Seq. Expert Rev. Mol. Diagn. 15, 715-719 (2015).

Play Video

PDF

DOI

DOWNLOAD MATERIALS LIST

Cite This Article

Lv, X., Zhao, M., Yi, Y., Zhang, L., Guan, Y., Liu, T., Yang, L., Chen, R., Ma, J., Yi, X. Detection of Rare Mutations in CtDNA Using Next Generation Sequencing. J. Vis. Exp. (126), e56342, doi:10.3791/56342 (2017).