©
1996 Oxford University Press
2460-2462
Footnote
`Long distance sequencer' method; a novel strategy for large DNA sequencing
projects
`Long distance sequencer' method; a novel strategy for large DNA sequencing projects
Koichi
Hagiwara
and
Curtis C.
Harris*
Laboratory of Human Carcinogenesis, National Cancer Institute, 37 Convent Drive,
Bethesda
, MD 20892,
USA
Received January 29, 1996;
Revised and Accepted April 23, 1996
Every DNA sequencing project involves two steps: (i) making suitable templates for all the regions to be sequenced; and (ii) running sequencing reactions and electrophoresis. The latter step can be
automated by use of workstations and autosequencers. The former step requires
careful experimental design and laborious DNA manipulations such as the
construction of nested deletion mutants (
1
). This is often the limiting step in large sequencing projects. The `shot-gun' method eliminates this complicated DNA manipulations (
1
), but many recombinant clones must be sequenced because of the random nature of
this procedure. Here we describe a novel sequencing technique that utilizes
recent advances in amplification of long DNA fragments by PCR (
2
). This systematic method requires minimal amount of starting DNA and eliminates complicated steps for template preparation. We have successfully
used this method to determine the sequence of a cosmid insert and the genomic
structure of the transforming growth factor [beta] type II receptor gene (
5
).
In our protocol, nested DNA fragments around region of interest are amplified by
anchored PCR using a vectorette unit and the fragments are directly sequenced.
The procedures are schematically presented in Figure
1
.
Figure 1
.
Schematic representation of the method.
Amplification primers were designed in the sequenced regions of the cosmid using
the MacVector program (Kodak). The following oligonucleotides, V-top 5'-GAAGGAGAGGACGCTGTCTGTCGAAGGTAAGGAACGGACGAGAGAAGGGAGAG-3' and V-bottom 5'-CTCTCCCTTCTCGAATCGTAACCGTTCGTACGAGAATCGCTGTCCTCTCCTTC-3' were
synthesized, purified and dissolved in distilled water to a final concentration of 4 [mu]M of each. The solution was heated at 68o C for 10 min and was slowly cooled over 30 min to room temperature to
make an annealed vectorette unit (4 [mu]M; ref.
3
). V-top and V-bottom are complementary to each other except for the middle one-third, giving the vectorette unit a bubble-like structure (Fig.
1
). M13 sequence-tagged 224 primer (224M13: 5'-TGTAAAACGACGGCCAGTCGAATCGTAACCGTTCGTACGAGAATCGCT-3') (
3
) was phosphorylated in a 50 [mu]l volume containing 1* kinase buffer [70 mM Tris-HCl (pH 7.6), 10 mM MgCl
2
, 5 mM dithiothreitol, 1 mM ATP], 30 [mu]M 224M13 and 50 U T4 polynucleotide kinase. The reaction was incubated at 37o C for 30 min, heated to 68o C for 10 min then stored at -20o C.
Cosmid DNA was extracted from a 1.5 ml overnight culture by an alkaline mini-prep method (
4
) into 50 [mu]l distilled water. Two microliters of this cosmid DNA solution were
enzymatically digested using
Alu
I,
Bsa
AI,
Bst
UI,
Pal
I,
Rsa
I,
Acc
I,
Afl
III,
Bst
YI,
Hin
cII,
Msl
I,
Tsp
45I,
Eco
RV,
Hpa
I,
Pvu
II,
Sca
I,
Sma
I,
Ssp
I or
Stu
I (Stratagene and New England Biolabs) in a 20 [mu]l volume including 10 U enzyme and 1* buffer as recommended by the manufacturers. After a 1 h incubation at 37o C, individual digestions were extracted with phenol-chloroform and precipitated with ethanol. Because
Acc
I,
Afl
III,
Bst
YI or
Tsp
45I do not give blunt-end DNA fragments, samples digested with them were blunt-ended by treatment with T4 DNA polymerase in a 50 [mu]l reaction containing l* buffer [50 mM NaCl, 10 mM Tris-HCl (pH 7.9), 10 mM MgCl
2
, 1 mM dithiothreitol, 100 [mu]M dNTPs] and 6 U T4 DNA polymerase at 37o C for 30 min, followed by extraction with phenol-chloroform and precipitation with ethanol. The other enzymes
give blunt-end DNA and this step is unnecessary.
The vectorette unit was ligated to each restriction fragment in a 40 [mu]l reaction volume containing 1* T4 ligase buffer [50 mM Tris-HCl (pH 7.8), 10 mM MgCl
2
, 10 mM dithiothreitol, 1 mM ATP, 25 [mu]g/ml bovine serum albumin], 1 [mu]l 4 [mu]M vectorette unit solution and 3 U T4 DNA ligase. After 2 h at room
temperature the reaction was diluted with distilled water to 250 [mu]l and then stored at -20o C.
The PCR reactions were performed in 100 [mu]l volumes containing l* XL buffer II (Perkin-Elmer), 1.1 mM Mg(OAc)
2
, 200 [mu]M dNTPs, 2.5 [mu]l from the vectorette unit-ligated restriction fragment solution, 1 [mu]l 30 [mu]M phosphorylated 224M13, 1 [mu]l 30 [mu]M amplification primer and 2 U rTth XL DNA
polymerase. Reactions were heated to 94o C for 1 min, then PCRed for a total of 40 cycles at 94o C for 30 s, 55o C for 30 s, 68o C for 4 min; the last 24 cycles require a 15 s extension per
cycle using a thermal cycler (Perkin-Elmer).
Amplified PCR fragments were purified in 50 [mu]l distilled water using the Wizard PCR prep kit (Promega). Purified PCR
fragments, 25 [mu]l, were subjected to [lambda] exonuclease digestion using the `PCR template prep for ssDNA
sequencing' kit (Pharmacia) to get single-stranded DNA in 25 [mu]l of TE (10 mM Tris-HCl pH 7.6, 1 mM EDTA). Seven microliters of single-stranded DNA solutions were subjected to direct
fluorescent sequencing using the
Taq
dye-primer cycle sequencing kit and the 373A DNA sequencer (Perkin-Elmer).
We used restriction enzymes with various lengths of recognition sequences (4-6 bases) to obtain various lengths of the restricted fragment around the
amplification primer. This, in turn, enables us to generate various lengths of
amplified fragments. As expected, the sizes of the amplified fragments were
distributed up to 6 kb in length, giving a set of nested fragments suitable for
long range sequence determination (Fig.
2
a). The M13 sequence is at the end of the 224M13 primer, enabling sequencing
from that end using a commercially available dye-primer sequencing kit. The sequences were usually quite readable (Fig.
2
b), allowing long sequences to be read in a single run.
Figure 2
.
(
A
) Typical set of the amplified fragments. One-twentieth of each PCR amplification reaction was electrophoresed on a 1%
agarose gel. Reactions are aligned according to the fragment size. Enzymes not
shown gave no significant bands. (
B
) Sequencing result of the amplified fragment from
Sca
I digestion shown in (A). Sequence is clearly readable for >450 bp. After these sequences were determined and assembled new amplification primers
were designed, synthesized and the steps beginning from PCR amplification were
repeated using the same vectorette-ligated restriction fragment solutions.
Using this method, we could readily determine the sequence of 14 out of the 16
kb region of interest from a single small scale cosmid DNA preparation. We
found, in total, a region of 2 kb with few restriction sites for the initial
set of enzymes which required the use of additional restriction enzymes.
In addition to sequencing long continuous stretches of DNA, this strategy is
suitable for nucleotide sequence determination around a region of known
sequence. For example, we have successfully applied this method to determine
the genomic structure of the transforming growth factor [beta] type II receptor gene using YAC and cosmid clones (
5
).
The vectorette unit was originally used to isolate the end fragments from yeast artificial chromosome (YAC) clones because of its high specificity and low background. Using computer-designed amplification primers, we rarely encountered false-priming products.
DNA fragments amplified by the anchored PCR have been used to obtain nucleotide
sequences adjacent to known sequences in the genome (
6
). Our method is characterized by the extensive use of vectorette-mediated anchored PCR by which a series of nested DNA fragments suitable
for sequencing can be obtained in a single step. The ease of this technique,
the use of minimal amounts of DNA and the ability to systematically sequence
large nucleotide segments make this method advantageous and preferable to existing methods in various projects. This method alone or in conjunction
with other methods should accelerate a wide variety of sequencing projects.
ACKNOWLEDGEMENTS
We thank M. McMenamin and E. Spillare for critically reviewing the manuscript.
REFERENCES
1 Griffin ,H.G. and Griffin,A.M. (l993) DNA sequencing protocols . Humana Press, NJ.
2 Cheng ,S. , Chang,S.C., Gravitt,P. and Respess,R. (1994 ) Nature , 369 , 684 -685. MEDLINE Abstract
3 Riley ,J. , Butler,R., Ogilivie,D., Finniear,R., Jenner,D., Powell,S., Anand,R., Smith,J.C. and Markham,A.F. (1990 ) Nucleic Acids Res. 18 , 2887 -2890. MEDLINE Abstract
4 Sambrook , J. , Fritsch,E.F. and Maniatis,T. (1989 ) Molecular Cloning : A Laboratory Manual , 2nd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
5 Takenoshita ,S. , Hagiwara,K., Nagashima,M., Gemma,A., Bennett,W.P. and Harris,C.C. Genomics , submitted.
6 Rosenthal ,A. and Jones,D.S. (1990 ) Nucleic Acids Res ., 18 , 3095 -3096. MEDLINE Abstract
Return
*
To whom correspondence should be addressed