ABSTRACT
A method is described to simultaneously amplify and sequence DNA using a new class of nucleotides containing boron. During the polymerase chain reaction, boron-modified nucleotides, i.e. 2'-deoxynucleoside 5'-[alpha]-[P-borano]-triphosphates, are incorporated into the product DNA. The boranophosphate linkages are resistant to nucleases and thus the positions of the boranophosphates can be revealed by exonuclease digestion, thereby generating a set of fragments that defines the DNA sequence. The boranophosphate method offers an alternative to current PCR sequencing methods. Single-sided primer extension with dideoxynucleotide chain terminators is avoided, with the consequence that the sequencing fragments are derived directly from the original PCR products. Boranophosphate sequencing is demonstrated with the Pharmacia and the Applied Biosystems 373A automatic sequencers, producing data that is comparable with cycle sequencing.
Current PCR sequencing techniques rely on the incorporation of dideoxynucleotide chain truncators and thus are inherently incompatible with polymerase chain reaction (PCR) amplification (1 -8 ). Once a dideoxynucleotide is incorporated, the chain is terminated and no further extension or amplification is possible. By necessity, amplification must be performed prior to cycle sequencing. An alternative, but relatively unexplored, method for sequencing is to substitute delimiters for truncators directly into the PCR amplification and to reveal the presence of the delimiters either chemically or enzymatically.
Such a sequencing method, based on the incorporation of thiophosphate-modified nucleotides (2'-deoxynucleoside 5'-[alpha]- [P-thio]-triphosphates) into PCR products, was proposed by Nakamaye et al. (9 ). The thiophosphates were incorporated into DNA during PCR amplification and their positions were revealed by chemical degradation. An analogous enzymatic approach by Labeit et al. (10 ,11 ) also used thiophosphates as delimiters to sequence DNA; after incorporation into primer extension products, the positions of the thiophosphates were revealed by exonuclease III digestion. While, in principle, this method could be applied to PCR sequencing, the thiophosphates proved unsatisfactory because of uneven band intensity (9 ,12 ).
We have employed boranophosphates (2'-deoxynucleoside 5'-[alpha]-[P-borano]-triphosphates) a novel class of nucleic acids (13 ) synthesized in our laboratory, in order to develop a method that permits both amplification and direct sequencing of PCR products. The boranophosphates are heat stable and can be incorporated into DNA by PCR, yet, once incorporated, they block the action of exonuclease (14 ,15 ; K.Porter and B.R.Shaw, unpublished observations). Utilizing the complete set of thymidine-, 2'-deoxyadenosine-, 2'-deoxycytidine- and 2'-deoxyguanosine-5'- [alpha]-[P-borano]-triphosphates (dNTbPs; Fig. 1 ) in base-specific PCR reactions accomplishes both amplification and base-specific demarcation. The PCR products are then digested with exonuclease to generate the sequencing fragments. As such, PCR sequencing with boranophosphates provides a means for truly direct PCR sequencing.
The 5'-[alpha]-[P-borano]-triphosphates of dA, dT, dC and dG were synthesized and the diastereomers were HPLC separated by modification of methods described previously (14 ). Oligonucleotide primers were purchased from Genset and purified by TLC. [[gamma]-33P]ATP and [[alpha]-33P]dATP (>1000 Ci/mmol) were purchased from Amersham.
Phage T7 DNA template (5 ng) was mixed with 20 pmol primers (T7for 5'-GGAGCGTAGGAAATAATAC-3', positions 34534- 34552 of phage T7; T7rev, 5'-CGGTTTTAATTACGTTAGCC-3, complementary to positions 35042-35025), 100 [mu]M each of dATP, dTTP, dCTP and dGTP and 2.5 [mu]M of one base-specific boronated dNTP (i.e. dATbP, dTTbP, dCTbP or dGTbP) in 50 [mu]l Vent buffer [NEB; 10 mM KCl, 20 mM Tris-HCl, pH 8.8, 10 mM (NH4)2SO4, 2 mM MgSO4, 0.1% Triton X-100]. An aliquot of 0.5 [mu]l (1 U) Vent DNA polymerase (NEB) was added and PCR was performed for 25 cycles of 95oC for 1 min, 53oC for 1 min and 76oC for 1 min. A 10 [mu]l aliquot of each reaction was digested with 0.5 [mu]l (32.5 U) exonuclease III (Life Technologies) for 30 min at 37oC. Samples were separated on a 1% agarose gel.
Primer (5'-CAGGAACAGCTATGGCCTC-3'; 10 pmol) was 5'-end-labeled with 20 [mu]Ci [[gamma]-33P]ATP by polynucleotide kinase. Labeled primer was annealed to unlabeled template (5'-GTGTAGCTGAGGCCATAGCTGTTCCTG-3'; 10 pmol). Extension was accomplished with 1 U T4 DNA polymerase in the presence of (i) all-normal dNTPs or (ii) successive replacement of one dNTP with the corresponding dNTbP (50 [mu]M each in 25 [mu]l 50 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 1 mM DTT) for 10 min at 37oC. An aliquot of each extended primer-template duplex (10 [mu]l) was digested with 0.5 [mu]l (32.5 U) exonuclease III for 30 min at 37oC. Samples were separated on a 16% polyacrylamide-8 M urea gel.
Pharmacia automated sequencer. A sample of 22 ng DNA template (p53 inserted into pUC18) was mixed with 10 pmol modified M13for (5'-CGACGTTGTAAACGACGGCCAGT-3'; 5' Cy 5, internal biotin) and 10 pmol unmodified M13rev (5'-ACAGGAAACAGCTATGACCATG-3'; 800 bp product) primers, 25 [mu]M each dATP, dTTP, dCTP and dGTP and either dATbP (2.5 [mu]M), dTTbP (10 [mu]M), dCTbP (10 [mu]M) or dGTbP (2.5 [mu]M) in 50 [mu]l Taq PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2). The reaction mixture was heated to 95oC for 1 min and returned to ice. After addition of 2.5 U AmpliTaq DNA polymerase (Perkin-Elmer) PCR was performed for 25 cycles of 95oC for 15 s, 62oC for 1 min and 76oC for 1 min.
A 25 [mu]l aliquot of the PCR reaction was mixed with streptavidin-linked magnetic beads (Dynal; 10 mg/ml in 25 [mu]l 5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl) and incubated with mild agitation for 30 min. The PCR products bound to the magnetic beads were immobilized by a magnet and washed twice with exonuclease III buffer (10 mM Tris-HCl, pH 7.6, 5 mM MgCl2, 5 mM DTT). The samples were resuspended in 10 [mu]l exonuclease III buffer and digested with 60 U exonuclease III for 30 min at 37oC.
After digestion, the samples were heated for 1 min at 95oC to denature the PCR products. The magnetic beads, along with the biotinylated primer strand fragments, were immobilized with a magnet and the unmodified primer strand fragments were removed. The biotinylated primer strand fragments were resuspended in 10 [mu]l formamide loading buffer, heated for 1 min at 95oC to detach them from the streptavidin-linked magnetic beads and then 6 [mu]l loaded onto a sequencing gel.
As a control, cycle sequencing was performed on p53-pUC18 DNA as directed in the Pharmacia AutoCycle sequencing kit. All samples were analyzed on a Pharmacia prototype single color, four lane automatic fluorescent sequencer.Applied Biosystems 373A sequencer. Dye Primers were provided by PE Applied Biosystems and a human leukemia cDNA library was provided by Stratagene. Base-specific four color PCR reactions were performed with Dye Primers: FAM-labeled primer for dC reactions, JOE for dA, TAMRA for dG and ROX for dT. As suggested for the 373A, two PCR reactions were performed for ROX- and TAMRA-labeled samples. A sample of 20 ng DNA template (human cDNA inserted into [lambda]ZAP II) was mixed with 20 pmol labeled -21M13 (5'-TGTAAACGACGGCCAGT-3') and 20 pmol unlabeled M13rev (5'-ACAGGAAACAGCTATGACCATG-3'; 1000 bp product) primers, dATP, dTTP, dCTP and dGTP (25 [mu]M each), one each of dATbP (2.5 [mu]M), dTTbP (10 [mu]M), dCTbP (10 [mu]M) or dGTbP (2.5 [mu]M) and 2.5 U Taq DNA polymerase in 50 [mu]l PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2). PCR was performed for 30 cycles of 95oC for 15 s, 55oC for 1 min and 70oC for 1 min.
Current methods for PCR sequencing cannot, in fact, directly sequence PCR products. DNA templates must first be amplified by PCR and only subsequently sequenced by a secondary primer extension reaction, which incorporates the dideoxynucleotide chain terminators. In contrast, for the boranophosphate sequencing method (Fig. 2 ), the amplified PCR products, after digestion, are loaded directly onto the sequencing gel.
To accomplish direct PCR sequencing, boranophosphates are incorporated in base-specific PCR reactions and their presence is revealed by exonuclease digestion, thereby defining the sequence of the DNA. In order to evaluate the use of boranophosphates for sequencing, several questions had to be answered. Could the boranophosphates be incorporated into DNA during PCR without decreasing amplification efficiency? Once incorporated, were the boranophosphates sufficiently resistant to exonuclease digestion that the normal nucleotides would be digested leaving a series of borano-terminated DNA fragments? Finally, could the boranophosphate method produce high quality sequence data?
To address the compatibility and efficiency of boranophosphates with PCR, a 509 bp duplex was amplified by 25 cycles of PCR in the presence of a small quantity of each of the dNTbPs. As seen in Figure 3 , comparable amounts of full-length DNA were produced with all-normal (lane 1) and partially boronated (lanes 3, 5, 7 and 9) nucleotides, demonstrating that the dNTbPs are readily incorporated into PCR reactions.
To examine the nuclease resistance conferred by the boranophosphates, the PCR products were treated with exonuclease III, which is a double-strand-specific 3'-exonuclease. The all-normal PCR products were digested extensively (lane 2), while the boron- containing DNA was noticeably resistant to digestion (lanes 4, 6, 8 and 10). Therefore, these studies demonstrated that boronated nucleotides can be incorporated into DNA by PCR and the resultant PCR products are resistant to exonuclease digestion.
Figure
To determine whether a boranophosphate could substitute, base specifically, for the corresponding normal dNTP and act as a block to exonuclease, a primer was extended in the presence of 100% of one boronated dNTP (dATbP, dTTbP, dGTbP or dCTbP) plus the three remaining normal dNTPs (Fig. 4 ). All of the primers were extended to the full length of the template (lane 1, all-normal; lanes 3, 5, 7 and 9, boronated). The presence of a significant quantity of boronated nucleotide (4-11% of the total nucleotide concentration of the extended primer) did not alter the yield or the electrophoretic mobility. Following extension, an aliquot of each sample was digested with exonuclease III. The all-normal product was digested completely (lane 2), whereas for each boranophosphate-containing sample the exonuclease digestion was halted significantly at the position of the boronated substitution (lanes 4, 6, 8 and 10). It should be noted that the preference of exonuclease III for cytidine residues (16 ) is also observed for boronated cytidine, thus producing faint extraneous bands in lane 10. Nevertheless, under conditions in which all-normal extension products were digested completely, the boronated extension products were base specifically resistant to exonuclease.
Figure
Figure
Taken together, Figures 3 and 4 demonstrate that the boronated nucleotides: (i) do not obstruct PCR or primer extension; (ii) are more resistant to exonuclease III than are normal nucleotides; (iii) are resistant to exonuclease III base specifically. These properties make the boranophosphates suitable for use as the sequence delimiters described in Figure 2 .
To assess the quality of the sequence data generated by the boranophosphate method, PCR products amplified from double-stranded clones were sequenced (Figs 5 and 6 ). For comparison, the analogous regions were sequenced by cycle sequencing from purified plasmid templates. Boranophosphate sequencing is compatible with both radioactive (data not shown) and fluorescent labeling. Automated sequencing was accomplished by the single color, four lane format (Pharmacia) and by the four color, single lane format (PE Applied Biosystems).
Boranophosphate sequencing (Fig. 5 A) compares favorably with Taq cycle sequencing (Fig. 5 B) methods; for both methods, the sequence could be read over ~450 bases. No secondary structure-induced artifacts were noted for either method, however, each method had at least one position where the correct base could not be determined because of an extremely weak band. At each position where a particular base could not be determined by one method, the base could be called correctly by the other method. The desirability of determining a sequence by two independent methods has been noted previously (17 ).Applied Biosystems 373A sequencer. An insert from a human cDNA library was sequenced by direct boranophosphate PCR sequencing and by AmpliTaq FS cycle sequencing on the Applied Biosystems 373A sequencer (Fig. 6 ). For the boranophosphate method, we performed base-specific PCR reactions with fluorescent Dye Primers, combined the reactions, digested with exonuclease and loaded the fragments onto the sequencer. Digestion conditions were modified to include both exonuclease III and phosphodiesterase I, which improved the signal quality on the Applied Biosystems instrument.
Boranophosphate sequencing (Fig. 6 A) approached the quality of AmpliTaq FS cycle sequencing (Fig. 6 B) over a considerable range. High quality sequence data were produced for >500 bases by boranophosphate sequencing and for >600 bases by AmpliTaq FS cycle sequencing. Visual inspection of the data showed that the boranophosphate method produced excellent sequence data from positions 100 to 450. Standard ABI base calling software indicated accuracy over an even wider range (Fig. 7 ); the aligned sequences maintained close agreement from positions 70 to 520.
Figure
We have demonstrated that boranophosphates are suitable for use as delimiters for direct PCR sequencing. The boranophosphates do not inhibit PCR or primer extension and, yet, once incorporated into DNA, they block the action of exonucleases. Boranophosphates, as sequence delimiters, do not interfere with the exponential amplification property of PCR. After incorporation, the positions of the boranophosphates can be revealed by exonuclease digestion, thereby generating a series of borano-terminated fragments that defines the DNA sequence. Since the sequencing fragments are generated from full-length PCR products, the distribution of fragments is advantageously skewed towards the longer fragments.
Direct boranophosphate PCR sequencing has been shown to be compatible with a variety of formats. Manual sequencing has been performed with end-labeled primers, as well as with unlabeled primers and radioactive dNTPs. Automatic sequencing has been performed with two types of sequencers. For boranophosphate sequencing with the Pharmacia instrument, primers were doubly modified with a Cy 5 fluorescent label and with biotin, which allowed solid phase purification of the PCR products. Although not demonstrated here, the boronated PCR products are amenable to bidirectional sequencing either by solid phase strand separation or by uniquely labeled fluorescent primers. For the Applied Biosystems 373A sequencer, a combination of exonuclease III and phosphodiesterase I nucleases was shown to perform optimally. The quality of the resultant sequencing data was equivalent to Taq cycle sequencing and approached that of AmpliTaq FS cycle sequencing.
Presently, boranophosphate sequencing has been shown to generate accurate and reproducible base calls over extended sequences (450 bases in Figs 5 -7 ). Yet in order for boranophosphate sequencing to be more generally useful, the method requires improvement, i.e. accurate base calls over longer read lengths. We expect to improve our method by introducing modifications to the nucleotides, the exonuclease and the base calling software. Chemically, the structure of the boronated nucleotides can be altered so as to reduce their differential susceptibility to nucleases while maintaining their excellent incorporation properties. Enzymatically, exonuclease III may be modified to perform better in our system, i.e. with reduced sequence specificity, in a manner analogous to that used to design ThermoSequenase (18 ) and AmpliTaq FS (19 ,20 ) polymerases. Finally, the base calling software, including neural net systems (21 ), can be modified to accommodate any nuances in the data caused by residual differential susceptibility of the boranophosphates to nuclease.
In conclusion, boranophosphate sequencing provides a means for truly direct PCR sequencing. The DNA produced during PCR amplification is transformed by digestion into sequencing fragments and is loaded directly onto the sequencing gel. By virtue of utilizing sequence delimiters, rather than chain terminators, boranophosphate sequencing provides a method that is unique from, and possibly complementary to, cycle sequencing. As such, it should be useful in situations which prove difficult for dideoxy cycle sequencing. Scenarios such as gap filling and closure of large scale sequencing projects, as well as genetic testing and diagnosis, may be appropriate applications for the boranophosphate method.
This work was supported by NIH grant HG00782 and DOE grant DE-FG05-94ER61882. The authors thank Drs Tim Hunkapiller and Stan Tabor for helpful comments and encouragement. The authors are also grateful to Dr Charles Brush and Mr Scott Duthie for assistance with the Pharmacia sequencer and Drs William Efcavitch and Roger O'Neill for assistance with the Applied Biosystems sequencer. Fluorescent sequencing reagents and the use of an automatic sequencer were generously provided by Pharmacia Biotech and by PE Applied Biosystems. A human cDNA library was provided by Stratagene.
*To whom correspondence should be addressed. Tel: +1 919 660 1551; Fax: +1 919 660 1618; Email: brs@chem.duke.edu
Pharmacia sequencer. Figure 5 compares boranophosphate and Taq cycle sequencing of the p53 gene on the Pharmacia instrument. For boranophosphate sequencing, the sequencing primer was modified with both a Cy 5 fluorescent label and biotin, the ligand for streptavidin-linked magnetic beads. Immediately after amplification, the biotinylated PCR duplexes were bound to streptavidin-linked magnetic beads, immobilized with a magnet and the PCR buffer was exchanged for the optimal exonuclease III buffer. Exonuclease III digestion was performed while the PCR products were attached to the magnetic beads, after which the base-specific borano-terminated DNA fragments were removed from the beads and analyzed on the automated sequencer.
REFERENCES
Return

