ABSTRACT
Treatment of T7 DNA ligase with a range of proteases generates two major
fragments which are resistant to further digestion. These fragments, of
molecular weight 16 and 26 kDa, are derived from the N- and C-termini of the protein, respectively. The presence of ATP or a non-hydrolysable analogue, ADPNP, during limited proteolysis greatly reduces the level of digestion. The N-terminal 16 kDa region of the intact T7 ligase is
labelled selectively in the presence of [
[alpha]
-
32
P]ATP, confirming that it contains the active site lysine residue. In common
with the intact enzyme, the C-terminal portion of the protein retains the ability to band shift DNA fragments of various lengths, implicating it in DNA binding. It can also inhibit ligation by the intact
protein, apparently by competing for target sites on DNA. We conclude that the
N-terminal region, which contains the putative active site lysine, plays a
role in the transfer of AMP from the enzyme-adenylate complex to the 5'
phosphate at the nick site, while the C-terminal 26 kDa fragment appears to position the enzyme at the target site
on DNA.
DNA ligases catalyse the formation of phosphodiester bonds at single-strand breaks between adjacent 3' hydroxyl and 5' phosphate termini in double-stranded DNA (for reviews see
1
,
2
,
3
). DNA ligase is an essential enzyme for cells. It is required for a number of
important cellular processes, including replication of DNA and the repair of
damaged DNA, as evidenced by the number of viruses which have genes encoding
their own DNA ligases. The enzymes appear to fall into two groups; those
requiring NAD
+
for activity and those requiring ATP. The NAD
+
-requiring DNA ligases have only been found in prokaryotic organisms (
1
,
2
), but the DNA ligase from an archeon bacteria
Desulfurolobus ambivalens
requires ATP (
4
). The eukaryotic enzymes all utilize ATP, as do the virally encoded enzymes.
They show a wide variety of molecular weights, from 103 kDa for the human type
I enzyme to 41 kDa for the enzyme from bacteriophage T7. The discrepancy
between these figures is due to the presence, in the mammalian enzymes, of a
domain in the protein which, when phosphorylated, is the signal for
translocation of the protein from the cytoplasm of the cell to the nucleus (
3
). Even in the proteins that are extended at the N-terminus, the level of sequence homology is much greater in the C-terminal region (Fig.
1
). The active site lysine has been identified and is in a region that is
homologous across all known sequences of both DNA and RNA ligases (
5
,
6
). The T bacteriophages also encode their own ATP-dependent DNA ligases. The T4 enzyme has found widespread use in
recombinant DNA techniques. Probably because of this and the ready availability
of the enzyme, it has been the subject of considerable study, though it has
never been crystallized. The enzyme from bacteriophage T7 is smaller than its
T4 counterpart, but shows very similar properties. The T7 enzyme appears to
represent the smallest functional unit for a DNA ligase and sequence homology
with other ligases shows that it corresponds to the highly conserved C-terminal region of other ATP-dependent ligases.
All restriction endonucleases and modification enzymes were obtained from
Boehringer Mannheim UK Ltd (Lewes, UK) or Gibco BRL. Protein purification
resins and columns were obtained from Pharmacia (Uppsala, Sweden). All other
chemicals were obtained from Sigma (Poole, Dorset, UK) unless stated otherwise.
[[alpha]-
35
S]dATP and [[alpha]-
32
P]ATP were obtained from Amersham International Ltd. Taq polymerase (AmpliTaq)
was obtained from Perkin Elmer Cetus (Emoryville, CA). Sequenase II was
obtained from United States Biochemicals (Cleveland, OH).
All restriction endonucleases and modification enzymes were used according to
the manufacturers' protocols. All the transformations, DNA isolations and manipulations were performed essentially as described
previously unless otherwise stated (
16
). Oligonucleotides were synthesized using an Applied Biosystems 381A DNA
synthesizer. The oligonucleotides were desalted on NAP G200 columns (Pharmacia) according to the manufacturer's instructions
The following strains of
E.coli
were used as hosts for pET21 (Novagen) constructs: XL1-Blue (
sup
E44,
hsd
R17,
rec
A1,
end
A1,
gyr
A96,
th
i,
rel
A, [F'
pro
AB,
lac
I
q
Z[Delta]M15, Tn
10
(
tet
R
)]) (
17
) for the propagation of clones and the preparation of single-stranded DNA for sequencing; GR501 (
Hfr
,
thi-
1,
pts
I
+
,
lig
-251) (
18
), a conditional lethal DNA ligase mutant, to test if the isolated genomic
fragments had complementary ligase activity; B834(DE3) (
hsd
S
gal
(
lcIts857 ind1 met
-
Sam
7
nin
5
lac
UV5-T7 gene 1) (
19
,
20
) for the over-expression of T7 DNA ligase.
Escherichia coli
strains were grown aerobically in Luria broth or on Luria broth agar plates
containing the appropriate antibiotics.
Proteolytic cleavage of purified T7 ligase (2 mg/ml) was carried out at 30oC for 10 min in the appropriate buffer containing 5 mM magnesium chloride.
A number of proteases and conditions were tried but the best results were obtained with 5 [mu]l clostripain (2.5 U/ml) with a buffer containing 50 mM Tris-HCl, pH 7.5, 5 mM CaCl
2
, 5 mM DTT, 10% glycerol in a total volume of 20 [mu]l. Reactions were terminated either by the addition of protease inhibitors
or EDTA followed by the addition of SDS-PAGE buffer, and heated at 95oC for 5 min. Protein samples were analysed by SDS-PAGE in 10, 12 and 15% gels with 4% stacking gels (
21
). Gels were stained with Coomassie Brilliant Blue and destained in 10% acetic
acid and 25% methanol. N-Terminal sequencing of the ligase fragments was performed using an Applied
Biosystems model 470A protein sequenator connected to an on-line 120A high pressure liquid chromatograph.
T7 DNA ligase (1 mg/ml) was incubated in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl
2
, 5 mM DTT and 25 [mu]Ci [[alpha]-
32
P]ATP in a total volume of 20 [mu]l for 15 min at 25oC. The reactions were stopped by boiling in SDS buffer for 5 min and analysed by electrophoresis on a 12% SDS gel (
21
). The gels were dried before being autoradiographed on Fuji RX X-ray film.
In order to clone and over-express the 16 and 26 kDa fragments, we amplified the respective gene
fragments from T7 genomic DNA using PCR. DNA primers were designed based on the
amino acid sequencing of the proteolytic fragments and the known gene sequence
(GenBank accession no. G15572). Two oligonucleotides were designed for PCR with
each fragment based on the N-terminal amino acid sequence of the proteolytic fragments: a 5' primer (5'-GATATACCATGGTGAACATTAAGACTAAC-3') and a 3' primer (5'-GGGTTTGAATTCCTAACGGATTGGTTCAACGAATAA-3') for
PCR of the 16 kDa fragment and a 5' primer (5'-GATATACCATGGATAAAGTTCCCTTTAAGCTGCAC-3') and 3' primer (5'-GGGTTTTAAGCTTACATTTTCTCTTGAGGG-3') for PCR of
the 26 kDa fragment. The 5' primers contained an
Nco
I site and the 3' primers had a
Hin
dIII site after the stop codon, allowing the PCR products to be cloned into the
T7 promoter-based expression vector pET21d. PCR was performed with these primers and
bacteriophage T7 genomic DNA (Sigma) using Amplitaq in 100 [mu]l of PCR buffer and cycled as described (
22
). The PCR reaction was electrophoresed in 1.0% agarose (Gibco BRL) and the
product excised from the gel and purified using a Qiaex DNA extraction kit
(Qiagen). The DNA was digested with
Nco
I and
Hin
dIII and further gel purified. The fragments were ligated to dephosphorylated,
Nco
I/
Hin
dIII-cleaved pET21d, transformed into
E.coli
XL1-Blue and plated onto LB agar plates containing 100 [mu]g/ml ampicillin.
Clones containing the fragments pT716 and pT726 were identified by restriction
digestion. Clones were then screened for expression by transforming into
B834(DE3)[pLysS], growing selected colonies at 37oC in 5 ml Luria broth containing ampicillin and chloramphenicol to an A
600
of 0.6, induced with 1 mM IPTG and grown for a further 3 h. The level of
induction was monitored by electrophoresis of cell extracts on 15% SDS-polyacrylamide gels.
The dideoxy method (
23
) was used to confirm the nucleotide sequence of the T7
lig
fragments. Single-stranded template DNA was produced in
E.coli
XL1-Blue using helper phage M13KO7 (Pharmacia) according to the manufacturer's
instructions.
Two litre cultures of Luria broth containing 100 [mu]g/ml ampicillin and 50 [mu]g/ml chloramphenicol were inoculated with a 5 ml culture of
B834(DE3)[pLysS][pT726] and grown at 37oC until the A
600
reached 0.6-0.7. The cultures were induced by the addition of 0.5 mM IPTG. Growth was continued for a further 3 h before harvesting the cells by centrifugation at 5000
g
. The cell pellets were stored at -20oC until required. The pellets were lysed by sonication of a 10%
(w/v) cell suspension in buffer A (50 mM Tris-HCl, pH 7.5, 2 mM EDTA, 5 mM DTT and 100 [mu]M PMSF). The cells were pelleted by centrifugation at 20 000
g
. The pellet was washed with buffer A containing 1% Triton X-100 and recentrifuged at 20 000
g
. This procedure was repeated several times with buffer A alone. The pellet was
allowed to dissolve slowly in buffer C (5 M guanidine hydrochloride, 50 mM Tris-HCl, pH 7.5 and 25 mM NaCl). This solution was dialysed for 16 h against
buffer A containing 10% glycerol. The dialysed solution was applied to a
heparin-Sepharose column (20 ml) equilibrated with 90% buffer A and 10% buffer B
(buffer A containing 2 M sodium chloride). After washing the column with a
further two column volumes of this mixture, the protein was step eluted by
washing with 40% buffer B. SDS-PAGE confirmed that this peak contained the semi-purified enzyme at high concentrations. The conductivity of the
peak fractions containing the 26 kDa fragment was reduced to that of 50 mM NaCl
by dilution with buffer A. This was loaded onto a Q Sepharose column (40 ml)
pre-equilibrated in buffer A. The column was then washed with 80 ml of buffer
A before eluting the protein with 13% buffer B. Purity was again monitored by
electrophoresising samples on a 12% SDS-polyacrylamide gel.
The 16 kDa fragment was purified from SDS-polyacrylamide gels essentially as described previously (
24
). Clostripain-treated T7 ligase (20 [mu]g/track) was applied to a 15% SDS-polyacrylamide gel. After electrophoresis, the gel was cut in two and one
half was stained with Coomassie brilliant blue and the position of the 16 kDa fragment identified. This allowed identification of the position of the fragment on the unstained gel, which was then excised, 200 [mu]l of elution buffer (50 mM Tris-HCl, pH 7.5, 0.1 mM EDTA, 5 mM DTT, 150 mM NaCl) was added and the
gel was crushed thoroughly and left for 2 h. The eluted protein was purifed
away from the gel with a Micropure microporous separator (Amicon) and the
solution was buffer exchanged (50 mM Tris-HCl, pH 7.5, 1 mM EDTA, 5 mM DTT) and concentrated using a Microcon
concentrator (Amicon).
ATPase assays were carried out by detection of inorganic phosphate produced using acidic ammonium molybdate and malachite green as described previously (
25
).
DNA ligase assay substrate (22mer) was radiolabelled by incubating 20 [mu]g of the oligonucleotide with 100 [mu]Ci of [[gamma]-
32
P]ATP (3000 Ci/mmol; Amersham) and 50 U T4 polynucleotide kinase for 45 min at
37oC followed by 10 min at 70oC. The unincorporated label was removed by centrifugation through a S-200 microspin column (Pharmacia). The DNA ligase assay was
performed essentially as described previously (
26
). The complementary 18mer and 22mer oligonucleotides were annealed to single-stranded M13mp19 by incubation at 70oC for 2 min and allowed to cool for 1 h. The annealed DNA was
incubated with ligase buffer (50 mM Tris-HCl, pH 7.5, 10 mM MgCl
2
, 5 mM DTT), unless otherwise stated, in the presence of enzyme and nucleotide cofactors as indicated, in a total volume of 10 [mu]l for 15 min at 25oC. The reactions were terminated by the addition of sequencing stopping
buffer (Sequenase kit; US Biochemicals) followed by heating at 95oC for 5 min. The ligation products were subjected to electrophoresis on a
15% polyacrylamide-urea gel and to autoradiography with Fugi RX X-ray film.
These were performed essentially as described previously (
27
). An aliquot of 20 [mu]g of oligonucleotide was labelled by incubation with 100 [mu]Ci [[gamma]-
32
P]ATP (3000 Ci/mmol) and 50 U T4 polynucleotide kinase for 45 min at 37oC followed by 10 min at 70oC. The unincorporated label was removed by centrifugation through a S-200 microspin column (Pharmacia). Binding reactions were carried
out in a total volume of 10 [mu]l, containing 50 mM Tris-HCl, pH 7.5, 5 mM DTT, 5 mM MgCl
2
. Reaction mixtures were incubated at room temperature for 20 min and run on 6-8% polyacrylamide gels which contained 5% glycerol.
Cloning of the 16 and 26 kDa proteolysis fragments was performed as described in
Materials and Methods. A large number of positives clones containing the
correct gene inserts (pT716 and pT726) were identified by digestion of the mini-preparations of DNA with
Nco
I and
Hin
dIII. DNA sequencing of both constructs confirmed that no mutations had been introduced during PCR.
In order to determine whether these clones had DNA ligase activity
in vivo,
the two constructs (pT726 and pT716) were transformed into
E.coli
GR501, a conditional lethal DNA ligase mutant which is not viable at 42oC. Transformants were grown overnight at either 30oC (permissive temperature) or 42oC (non-permissive temperature) to assay for functional complementation of the replication-defective phenotype. Previously we have observed that expression of T7 ligase, despite being under
the control of a T7 promoter in this plasmid, was sufficient to complement
E.coli
GR501 (
18
). The fragments were unable to complement this ligase-deficent strain, indicating either that they have no ligation activity or
that it is too low to rescue growth.
The expression constructs (pT716 and pT726) were transformed into B834(DE3)[pLysS] and these transformants screened for over-expression of the T7 ligase fragments by inoculating single colonies into
small cultures of LB medium and growing for several hours. The cells were
induced at mid log phase, by the addition of IPTG, and grown for several more
hours. Samples were taken and analysed by SDS-PAGE. In all cases there was large production of proteins which had
apparent molecular weights of ~26 and 16 kDa respectively, but unfortunately both of these proteins were
totally insoluble. Various methods were employed to try and resolubilize the
proteins, with mixed success. It was possible to resolubilize >80% of the 26
kDa fragment using the method described in Materials and Methods. This protein
was purified to homogeneity using the three column procedure outlined above,
giving a final yield of >60 mg/l. Despite trying a wide range of expression
conditions and solubilization methods, attempts to re-fold the 16 kDa fragment proved unsuccessful.
It has been shown previously that it is possible to recover proteolytic fragments from polyacrylamide gels following electrophoresis in the presence of SDS (
24
,
29
). We resorted to purifying the proteolytically produced 16 kDa fragment using
this method and reconstituting the protein activity in essentially the same
manner as described previously (
24
). In order to test the feasibility of this approach for T7 ligase, we used
intact enzyme as a control. The proteins were separated by SDS-PAGE, eluted from the gel as described in Materials and Methods and
tested for activity. We found that it was possible to re-nature the intact T7 ligase and restore ATPase and ligase activities (data
not shown).
T7 DNA ligase appears to represent a minimal size `core' DNA ligase and is
therefore an ideal model to study the essential features, both structural and
catalytic, of this class of enzyme. Alignment of the amino acid sequences of
eukaryotic ligases using a pairwise alignment method for the determination of
sequence identities (
4
) reveals that these enzymes contain sequence homology with other ATP-dependent DNA ligases, but little similarity to the NAD
+
-dependent DNA ligases. Despite the different sizes of the eukaryotic
ligases it is possible to align their sequences because of a high degree of
conservation in the C-terminal region (Fig.
1
). It has been noted that on alignment of homologus sequences of T4 and T7 DNA
ligases, the C-termini coincided almost exactly, with the majority of the differences
being located at the N-termini (
32
). It has also been shown that the distance between the presumed active site
lysine and the C-terminus is very similar (~330 residues) across a diverse range of ATP-dependent ligases (
4
,
5
; Fig.
1
) and the sequence analysis suggests that there is probably a high degree of
structural homology between the ATP-dependent DNA ligases.
Based on the data we present here and complementary data from other studies on
DNA ligases, we can propose a model for DNA binding which is consistent with
the biological and catalytic roles of these enzymes. ATP-dependent DNA ligases consist of two functional regions, an N-terminal catalytic region and a C-terminal DNA binding region. The variable N-terminal region contains the active site motif (KYDGxR)
(
5
) and has been proposed to be responsible for AMP attachment and transfer to the
target nick site on DNA. In the larger eukaryotic enzymes, such as mammalian I
and II,
Drosophila
II and
S.cerevisiae
Cdc9, it has been demonstrated that the extreme N-terminal region of the enzyme, which is the least conserved, is not
required for catalytic activity (Fig.
1
;
3
,
33
). It has been suggested that this region may play a role in nuclear
translocalization or be employed in specific protein-protein interactions during DNA replication and repair/recombination (
3
). Phosphorylation of the N-terminus of human DNA ligase I is required for activity of the catalytic
domain, suggesting that this end region of the larger eukaryotic enzyme has
important regulatory functions which are not required in the smaller enzymes.
In T7 ligase, the N-terminus is much shorter, presumably because it does not require a modulation domain to regulate activity
in vivo
. Limited proteolysis of T7 ligase has provided data about the possible roles of
different regions of the enzyme. Although ATPase activity could not be
demonstrated for the 16 kDa N-terminal fragment, we have confirmed that this region is the site of
covalent attachment of AMP, as shown previously for human ligase I (
5
). Mutagenesis studies of human ligase I showed that deletion of 23 amino acids
at the C-terminus caused complete loss of enzyme activity and also prevented enzyme-AMP formation. Pre-incubation of T7 ligase with ATP prior to proteolysis greatly
reduces cleavage of the protein into the two major cleavage fragments (16 and 26 kDa), suggesting that binding of the nucleotide causes a conformational change making the protein resistant to proteolysis. One explanation is that there is an association of the N- and C-terminal domains which could play a role in ATP binding and
hydrolysis.
After completion of this work, we determined the crystal structure of T7 DNA
ligase at 2.6 Å resolution (
34
), allowing us to compare the proteolysis results directly with the structure of
the enzyme. The protein comprises two distinct domains (Fig.
6
), a larger N-terminal Domain 1 (residues 2-240) which contains the ATP binding site and a smaller Domain 2
(residues 241-349). The two domains have a distinct groove running between them, which
is proposed to be the DNA binding site. Although we expected the
proteolytically sensitive site to be situated between the two domains, it was actually located in an exposed surface loop (Fig.
6
). The accessibility of this loop region explains the hypersensitivity of this region to protease digestion. The 26 kDa fragment has lost 130 amino
acids (16 kDa) at the N-terminus, which accounts for over half of Domain 1. This N-terminal region is involved in extensive contacts with the remainder
of the protein, which may explain why the 16 kDa fragment was apparently unable
to fold correctly on its own. The truncated protein (26 kDa) retains half of
Domain 1, the cleft between the two domains and the whole of Domain 2. The
crystal structure strongly suggests that DNA binds in the cleft between the two
domains of the protein. Data from the limited proteolysis experiments described
in this work revealed that removal of 130 amino acids at the N-terminus does not affect the ability of the truncated protein to bind to
DNA, but destroys its catalytic activity. The loss of catalytic activity of the
resulting 26 kDa fragment is presumably due to removal of the active site
lysine residue (Lys34). However, the fragment retains the ability to bind to
double-stranded DNA, with a binding affinity that is approximately one order of
magnitude higher than the intact protein. It is also a very effective
competitive inhibitor of intact T4 and T7 DNA ligases and again ~10-fold less of the fragment is sufficient to abolish the ligation
activity. The apparent correlation of these data suggests that inhibition is
probably due to a tighter binding of the fragment to ligation sites. The 26 kDa
fragment retains the conserved cleft between the domains and hence these data
provide further evidence that this cleft is the DNA binding site. This would also explain the ability of the 26 kDa fragment to act as a competitive inhibitor of the intact enzyme. We are over-expressing Domains 1 and 2 separately in order to characterize the
biochemical properties of these regions of the enzyme.
Figure
Recently, Lindahl and co-workers have isolated a fourth human DNA ligase isoenzyme (
35
), which, in contrast with other ligases, contains an extended C-terminal region of 323 amino acids with no homology to any other protein
(Fig.
1
). Despite this difference, it retains the central core region, supporting the
view that certain essential features are present in this portion of all ATP-dependent DNA ligases. Bacteriophage T7 and T3 have evolved the smallest
DNA ligases and by discovering how these enzymes work we will begin to gain a
better understanding of the larger eukaryotic enzymes.
This work was supported by the Wellcome Trust. SRA is an MRC post-graduate student. We would like to thank Tomas Lindahl for kindly
providing the
E.coli
GR501 strain, Val Cooper for oligonucleotide synthesis, Tony Willis for protein
sequencing and Mark Odell for useful discussions on this work.

REFERENCES
Return


