ABSTRACT
R.
C
viJI is unique among site-specific restriction endonucleases in that its activity can be modulated
to recognize either a two or three base sequence. Normally R.
C
viJI cleaves RGCY sites between the G and C to leave blunt ends. In the presence
of ATP R.
C
viJI* cleaves RGCN and YGCY sites, but not YGCR sites. The gene encoding R.
C
viJI was cloned from the eukaryotic
Chlorella
virus IL-3A and expressed in
Escherichia coli
. The primary
E.coli
cviJIR
gene product is a 278 amino acid protein initiated from a GTG codon, rather than
the expected 358 amino acid protein initiated from an in-frame upstream ATG codon. Interestingly, the 278 amino acid protein
displays the normal restriction activity but not the R
.C
viJI* activity of the native enzyme. Nine restriction and modification proteins
which recognize a central GC or CG sequence share short regions of identity
with R.
C
viJI amino acids 144-235, suggesting that this region is the recognition and/or catalytic
domain.
Most Type II restriction endonucleases function as homodimers and recognize a
symmetric sequence of 4-8 bases (
1
,
2
). They precisely cleave both DNA strands within or adjacent to their
recognition sites in the presence of magnesium. Restriction enzymes are
extremely specific, catalyzing incorrect cleavage fewer than once in 10
7
binding events in the case of
Eco
RI (
3
) and
Taq
I (
4
). The specificity of some endonucleases can be `relaxed' [also known as `star'
(*) activity] by modified assay conditions such as low ionic strength, high pH,
high glycerol concentration, the presence of organic solvents, high ratio of
enzyme to DNA or substituting other divalent cations for magnesium (
5
). These star recognition sequences usually differ from the canonical sequence
by a single base pair, although strand preferences can be detected with the
appropriate assay (
4
). A unifying element of star-inducing reaction conditions is a net decrease in bound water activity,
which alters protein-DNA contacts and relaxes recognition and cleavage (
6
).
The phycodnavirus IL-3A (
7
-
9
), which infects a unicellular eukaryotic
Chlorella
-like green alga, encodes at least one site-specific restriction endonuclease, R.
C
viJI (
10
), as well as a cognate cytosine 5-methyltransferase, M.
C
viJI (
11
). The normal specificity of R.
C
viJI, RGCY (R, purine; Y, pyrimidine) resembles a bacterial Type II restriction
endonuclease in requiring only magnesium for activity. However, in the presence
of magnesium and ATP, cleavage specificity relaxes to include RGCN (N, any
nucleotide) and YGCY sites (
10
,
12
). This relaxed specificity is similar to the star activity reported for
bacterial Type II restriction endonucleases. R.
C
viJI* activity is unique, though, in requiring an adenine nucleotide cofactor,
which suggests a functional domain is involved in this ATP-dependent modulation of the restriction specificity. This feature of R.
C
viJI, as well as its short recognition sequence, distinguishes it from other
Type II endonucleases.
Because R.
C
viJI and R.
C
viJI* cleave DNA frequently they can be used for a variety of molecular biology
applications. For example, R.
C
viJI* generates numerous sequence-specific oligonucleotides from anonymous DNA samples (
12
) or produces quasi-random distributions of DNA fragments (
13
). Although R.
C
viJI can be isolated from virus IL-3A-infected
Chlorella
, this process is inefficient because the host
Chlorella
grows slowly and the virus-infected alga contains many nucleases. This report describes the cloning,
sequencing and expression of the
cviJIR
gene in
Escherichia coli
.
The growth of the host alga,
Chlorella
strain NC64A, on MBBM medium, the production and purification of virus IL-3A (
7
) and the isolation of IL-3A DNA (
14
) have been described. The molecular cloning of
cviJIM
in plasmid pIL-3A.22.8 (
11
) has been described.
Escherichia coli
strain DH5[alpha]MCR (Life Technologies, Gaithersburg, MD) served as a host for
recombinant plasmids and bacteriophage M13mp18 (
15
).
Restriction and modification enzymes were obtained from Molecular Biology
Resources (Milwaukee, WI). The Minute Miniprep DNA purification kit and Sequal
DNA sequencing kit were products of CHIMERx (Madison, WI). Oligonucleotides
were obtained from Synthetic Genetics (San Diego, CA), M13mp18 and the 100 bp
DNA ladder were from Life Technologies and the Muta-Gene
in vitro
mutagenesis kit and protein molecular weight markers were purchased from BioRad
(Hercules, CA).
Thermus aquaticus
DNA polymerase for the PCR was obtained from Applied Biosystems (Foster City,
CA). Ampicillin, X-Gal, IPTG and XOMAT AR X-ray film were from Sigma (St Louis, MO). [[gamma]-
32
P]ATP was purchased from Du Pont (Boston, MA).
One microgram of DNA was digested under R.
C
viJI conditions (10 mM MgCl
2
, 50 mM NaCl, 50 mM Tris-HCl, pH 8.0) or R.
C
viJI* conditions (1 mM ATP, 20 mM DTT, 10 mM MgCl
2
, 50 mM NaCl, 50 mM Tris-HCl, pH 8.0) at 37oC for 1 h. Restriction fragments were electrophoresed on 1.5%
agarose gels in Tris-acetate/EDTA buffer (
16
).
The
c
vi
JIR
gene was located on plasmid pCJH1.4 and both DNA strands were sequenced by the
dideoxy chain termination method (
17
) using three approaches: (i) primer walking; (ii) cloning restriction
endonuclease digests into M13mp18; (iii) sequencing restriction endonuclease
deletion derivatives of pCJH1.4. To ensure the accuracy of the
cviJIR
gene sequence, PCR primers were synthesized to flank both sides of the gene and
the amplified region from IL-3A virus was cloned into a [lambda] promoter-containing vector and maintained in the presence of a
compatible plasmid expressing the cloned
cviJIM
gene. The gene was sequenced again from two such cloned samples. The N-terminal 15 amino acids of R.
C
viJI purified from
E.coli
was sequenced at the Protein and Nucleic Acid Shared Facility of the Medical
College of Wisconsin (Milwaukee, WI) by the Edman degradation method using an
Applied Biosystems 477A Liquid Phase Protein Sequencer with an on-line 120A PTH Analyzer (
18
). The computer-translated sequence of R.
C
viJI was compared with the public sequence databases using MacDNASIS Pro
software (Hitachi Software, San Bruno, CA).
The
lacZ
' Shine-Dalgarno sequence and the
cviJIM
ATG start codon were separated by 215 nt on plasmid p710. These 215 nt were
deleted by site-specific oligonucleotide mutagenesis using the oligomer 5'-CAATTTCACACAGGAAACAGCTATGTCTTTTCGCACGTTAGAAC-3' as indicated in Figure
1
A. The mutagenesis was facilitated by converting the double-stranded plasmid DNA to single-stranded DNA by co-infecting p710-containing
E.coli
with the helper phage R408 (
19
). The oligonucleotide was annealed to the single-stranded plasmid, extended in the presence of T4 DNA polymerase, ligated
using T4 DNA ligase and transformed into
E.coli
DH5[alpha]MCR. The transformed cells were grown overnight in liquid culture, the
DNA was purified,
Xho
I digested, dephosphorylated with calf intestinal alkaline phosphatase (CIAP)
and transformed into
E.coli
DH5[alpha]MCR cells.
Xho
I digestion served to enrich for the desired mutation because a single
Xho
I site was located in the sequence being deleted (Fig.
1
A).
The successful cloning of the endonuclease portion of a restriction-modification (R-M) gene system usually requires methyltransferase expression (
1
). The
cviJIM
gene from the virus IL-3A genome was previously cloned, expressed and sequenced from plasmid pIL-3A.22.8 (
11
). Unpublished results indicated that the DNA following the 3'-end of the
cviJIM
gene was not co-linear with the viral genomic DNA. Therefore, to remove extraneous DNA and
to improve M.
C
viJI expression, a 1.6 kb
Dra
I fragment from plasmid pIL-3A.22.8, which includes the
cviJIM
gene (367 codons), was subcloned into the
Eco
RV site of a
lacZ
' vector. The resultant plasmid, p710 (Fig.
1
A), had the
cviJIM
gene in the same translational orientation as the
lacZ
' gene and expressed M.
C
viJI activity (RG
m
CY). The ability of this construct to methylate plasmid DNA was tested by
incubating p710 with either
Hae
III (GG\CC),
Sac
I (GAGCT\C) or
Hin
dIII (A\AGCTT). Efficient M.
C
viJI expression would make these sites sufficiently resistant to these enzymes.
However, the plasmid was only 90% resistant to
Hae
III and
Sac
I and 25% resistant to
Hin
dIII digestion. Therefore, it was necessary to increase M.
C
viJI expression before attempting to clone the
cviJIR
gene.
Improved translation of the
cviJIM
gene was obtained by deleting 215 nt between the
lacZ
' and
cviJIM
ATG start codons, as detailed in Materials and Methods. This deletion created a
perfect fusion of the
lacZ
' Shine-Dalgarno sequence with the
cviJIM
initiation codon (Fig.
1
A). Plasmid DNA from five clones was >95% resistant to
Hae
III,
Hin
dIII,
Pvu
II (CAG\CTG) or
C
viJI restriction. Four of the plasmids were sequenced and the results confirmed
the expected deletion. One plasmid, pBMC5, was chosen for further modification.
Two observations suggested that the
cviJIR
gene on the IL-3A virus genome was 3' of the
cviJIM
gene: (i)
E.coli
containing cloned DNA 5' of the
cviJIM
gene lacked R.
C
viJI activity; (ii) attempts to clone the DNA 3' of the
cviJIM
gene resulted in deletions/rearrangements of this region. The
cviJIR
gene was cloned using the following strategy. First, an
Eco
RI deletion derivative of pBMC5, which lacks the 3'-half of the
cviJIM
gene, was created (pBMC5RI; Fig.
1
A). Second, both pBMC5RI and IL-3A genomic DNA were cleaved with
Eco
RI, the plasmid was dephosphorylated and the DNAs were mixed with T4 DNA ligase
and used to transform DH5[alpha]MCR. Third, the pooled pBMC5RI/IL-3A transformants were grown overnight in liquid culture. Plasmid
DNA was isolated from the cell mixture and digested with
Hae
III, dephosphorylated with CIAP and transformed into DH5[alpha]MCR cells. Six colonies grew; the DNA from one, pCJH1.4, was resistant to
Hae
III.
Escherichia coli
containing pCJH1.4 also expressed R.
C
viJI activity.
The
Eco
RI fragment cloned into pCJH1.4 is 4901 bp in length and the entire fragment was
sequenced. Seven discrepancies were noted in the 273 bases following the
previously reported
cviJIM
gene (
11
) and that reported here. The sequence data shown here was obtained from three
independently derived clones from the viral genome (one plasmid genomic library
clone, pCJH1.4, and two PCR-derived clones) and carefully checked for inaccuracies. The insert in
pCJH1.4 contains six open reading frames (ORFs) of 1074 (ORF1), 468 (ORF2), 555
(ORF3), 1086 (ORF4), 396 (ORF5) and 579 bp (ORF6), which could code for
polypeptides containing 358 (41.4 kDa), 156 (19.4 kDa), 185 (20.3 kDa), 362
(38.9 kDa), 132 (14.5 kDa) and 193 (21.9 kDa) amino acids respectively,
assuming the most commonly used start codon ATG is the translational initiation
site (Fig.
1
B). ORF4-ORF6 do not encode R.
C
viJI, because
E.coli
containing pCdA12, which lacks DNA between the
Ava
I and
Bam
HI sites (Fig.
1
B), produces R.
C
viJI activity.
Escherichia coli
containing pCdEB7, which lacks DNA between the
Eco
RV and
Bam
HI sites, does not produce R.
C
viJI activity. Thus, ORF1 or ORF3 are the most likely candidates to encode R.
C
viJI.
R.
C
viJI purified from
E.coli
containing pCJH1.4 produced a single band on 10% SDS-PAGE with an apparent size of 32.5 kDa (Fig.
2
). The purified recombinant R.
C
viJI protein was sequenced and the first 15 N-terminal amino acids were (T, M, R or K)EEKKRLALIEKQRI. The ambiguous N-terminal amino acid sequence (T, M, R or K) reported here could be
the result of partial post-translational processing by methionine aminopeptidase (
21
). However, the penultimate residue, glutamic acid, does not favor removal of
methionine by this peptidase. Comparing the N-terminal sequence with that predicted from ORF1 revealed 14 out of 15
amino acids match if translation of the recombinant protein began at the GTG
codon located at nt 299-301 (Fig.
3
). Furthermore, the observed 32.5 kDa molecular weight of the recombinant enzyme
agrees with this predicted molecular weight (31.6 kDa). These results indicate
that the primary product of
cviJIR
expressed in
E.coli
is a GTG codon-initiated protein of 278 amino acids. Because of this unexpected result,
the
cviJIR
gene and flanking sequence from
Chlorella
virus IL-3A DNA were PCR amplified using appropriate primers (
22
). The PCR product was cloned for redundant sequencing purposes. The same
nucleotide sequence as that presented in Figure
3
was observed, eliminating the possibility of cloning artifacts (
23
).
In spite of the common recognition sequence shared by cognate R-M enzymes, only two,
Eco
RII (
35
) and
Eco
57I (
36
), share significant amino acid similarities between the endonuclease and
methyltransferase. The predicted amino acid sequences of R.
C
viJI and M.
C
viJI were compared for intermolecular homology and two small regions of identity
were found (Fig.
5
). One region (TGFS-_G_F_K_F) partially overlaps with a motif (motif IV) present in all
cytosine 5-methyltransferases (
37
). Interestingly, the TGFS sequence is immediately adjacent to the invariant
dipeptide PC in M.
C
viJI (
11
), which is the catalytic site of M.
Hha
I and other bacterial cytosine 5-methyltransferases (
37
). The other small region of identity is SF_RK_V_AP. Although these homology
regions are limited (SF_RK_V_AP shares 54% identity over 13 residues and TGFS-_G_F_K_F shares 38% identity over 21 residues) and not colinear, the
similarity between them may be significant and suggests a role in DNA
recognition or cleavage.
Figure
Amino acids 144-235 of R.
C
viJI show limited homologies to several restriction and modification enzymes
(Fig.
5
), all of which contain a central GC or CG as part of their recognition
sequence. R.
C
viJI has the highest homology with R.
Bsu
RI (GG\CC) from
Bacillus subtilis
(
38
), sharing 35% amino acid identity over 132 residues, a portion of which is
shown in Figure
5
. This homology is not colinear, but is segmented or epitopic. Seven other
proteins which recognize a central GC or CG sequence also share limited
identities with R.
C
viJI, including M.
Hgi
CII (GG[A/T]C
m
C;
39
) (63% identity/16 amino acids), R.
Hae
III (GG\CC; G.G.Wilson, unpublished results) (39%/61 amino acids), R.
Ngo
PII (GG\CC;
4
0) (28%/40 amino acids), R.
Pvu
II (CAG\CTG;
4
1
) (29%/31 amino acids), M.
Pvu
II (CAG
m
CTG;
4
2
) (35%/78 amino acids), R.
Pst
I (CTGCA\G;
4
3
) (36%/50 amino acids) and R.
Sma
I (CCC\GGG;
4
4
) (35%/48 amino acids). A particularly interesting similarity is found between
R.
Pvu
II and R.
C
viJI (Fig.
5
). The crystal structure of R.
Pvu
II complexed with its cognate DNA sequence (
45
) indicates that E55 and D58 may represent the Mg
2+
binding ligands and that E68 and K70 are the candidate active site residues
(underlined residues in Fig.
5
). R.
C
viJI appears to have the appropriate charged group equivalents in D191, D194,
K205 and R206. The sequence adjacent to the active site of R.
Pvu
II, GFSTHHH, contains three residues involved in DNA sequence recognition, S81,
H83 and H84. A partially similar sequence in R.
C
viJI is also present in the same location (212LTf-GFS). These similarities, as well as the coincidence of the nine central
GC recognition R-M proteins having limited regions of identity to R.
C
viJI, suggest that amino acids 144-235 encompass the target recognition and/or cleavage domain.
The rationale for utilizing short epitopic and in some cases non-colinear, homology comparisons is based on several considerations.
Endonucleases that share similar recognition sequences are statistically
closely related, despite their short amino acid similarities (
46
). For example, five endonucleases that recognize hexanucleotide palindromic
sequences and leave tetranucleotide 5'-overhangs were found to share two regions of similarity with the
catalytic and recognition domain of
Eco
RI (
47
). The crystal structure of four restriction enzymes reveal common structural
and functional elements between
Bam
HI (
48
),
Eco
RI (
49
),
Eco
RV (
50
) and
Pvu
II (
45
), despite the lack of significant sequence similarity. The catalytic and DNA
recognition regions of these molecules are not contiguous, but interspersed.
For those restriction enzymes related by similar recognition or cleavage sites,
this type of analysis may reveal additional correlations.
The Phycodnaviridae family of double-stranded DNA viruses encode numerous restriction and modification enzymes.
Based on the sensitivity/resistance of viral DNA to some 50 methylation-sensitive restriction endonucleases, a minimum of 12 modification
phenotypes has been deduced (
5
1
). Some of these R-M enzymes have the same modification and cleavage specificity as bacterial
Type II enzymes, whereas others have unique specificities. The R.
C
viJI restriction endonuclease from
Chlorella
virus IL-3A is unusual with respect to its short recognition sequence (RGCY) and
the ATP-dependent relaxation of sequence specificity (RGCN, YGCY). The three base
restriction activity of R.
C
viJI has been cloned in
E.coli
apparently separate from its two base cleavage specificity. As the recombinant
R.
C
viJI displays the normal restriction activity, but not the ATP-dependent star activity, the putative 80 amino acid N-terminal domain may be responsible for the ATP-dependent modulation of R.
C
viJI restriction specificity. Alternatively, the basic amino acid region could
function as a nucleic acid binding domain for viral packaging. It is also
reasonable that a domain separate from the catalytic portion of the enzyme
exists to carry out these functions. Uncovering the role of this putative
region, if any, will contribute to a detailed understanding of the structure-function relationship of this unique enzyme.
We thank Dwight Burbank for growing and processing virus-infected cells, Geoffrey Wilson for generously providing us with amino
acid sequences for R.
Hae
III and R.
Mth
TI prior to publication, Yanping Zhang for the p710 plasmid and Lily Marr and
Jim Szablewski for technical assistance. This investigation was supported, in
part, by Public Health Service grant GM-32441 to JVE from the National Institute of General Medical Sciences.
+
Present address: BioRad Laboratories, Hercules, CA 94547, USA
Lysate preparation.
Escherichia coli
DH5[alpha]MCR[pCJH1.4] was grown at 30oC with vigorous aeration in TB broth medium (
20
) supplemented with 0.5 mM MgCl
2
and 100 [mu]g/ml ampicillin. When the OD
595
reached 0.5, IPTG to 0.5 mM and maltose to 0.4% were added and the culture was
grown for an additional 2 h. Cells (60 g) were concentrated by centrifugation
and resuspended in three volumes of cold lysis buffer A [30 mM Tris-HCl, pH 7.9 (4oC), 200 mM NaCl, 2 mM EDTA, 10 mM [beta]-mercaptoethanol, 50 [mu]g/ml PMSF, 20 [mu]g/ml benzamidine, 2 [mu]g/ml
o
-phenantroline, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A]. The suspension was passed through a Manton-Gaulin cell disrupter and the cells were removed by
centrifugation. Polyethyleneimine solution (10%, pH 7.5) was added to the
supernatant to a final concentration of 1% and stirred for 30 min. The
resulting precipitate was removed by centrifugation and the nucleic acid-free solution was precipitated overnight with ammonium sulfate (0.5 g/ml).
Precipitated proteins were pelleted by centrifugation, dissolved in 100 ml of
buffer A, desalted on a Sephadex G-25 column (5 * 30 cm) and equilibrated in buffer B (10 mM K/PO
4
, pH 7.2, 30 mM NaCl, 1 mM EDTA, 10 mM [beta]-mercaptoethanol, 10% glycerol, 0.05% Triton X-100, 50 [mu]g/ml PMSF, 20 [mu]g/ml benzamidine, 2 [mu]g/ml
o
-phenantroline, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A).
Phosphocellulose chromatography.
The desalted R.
C
viJI extract was applied to a 5 * 12 cm phosphocellulose P11 (Whatman) column equilibrated in buffer B.
After washing with 4 l of buffer B, the protein was eluted with 4 l of a 0-1.2 M NaCl linear gradient in buffer B. Fractions were assayed by
incubating 1 [mu]g pBR322 DNA with 2 [mu]l aliquots of each fraction. The endonuclease eluted at ~0.7 M NaCl.
Heparin-Sepharose chromatography.
Active fractions from the phosphocellulose column were diluted 7-fold in buffer C [20 mM Tris-acetate, pH 7.2 (22oC), 60 mM potassium acetate, 2 mM EDTA, 10 mM [beta]-mercaptoethanol, 10% glycerol, 0.01% Triton X-100, 50 [mu]g/ml PMSF, 20 [mu]g/ml benzamidine, 2 [mu]g/ml
o
-phenantroline, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A] and applied to a heparin-Sepharose (Pharmacia) column (5 * 10 cm) that had been equilibrated with buffer C.
The column was washed with 1.5 l of buffer C and R.
C
viJI was eluted with 3 l of a 0-2.5 M potassium acetate gradient in buffer C. The endonuclease activity
eluted at 0.9 M potassium acetate.
Hydroxyapatite HTP chromatography.
The pooled R.
C
viJI-containing fractions were diluted 5-fold in buffer D (10 mM K/PO
4
, pH 7.0, 50 mM NaCl, 1 mM EDTA, 10 mM [beta]-mercaptoethanol, 10% glycerol, 0.05% Triton X-100, 50 [mu]g/ml PMSF, 20 [mu]g/ml benzamidine, 2 [mu]g/ml
o
-phenantroline, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A) and applied to a hydroxyapatite HTP (BioRad)
column (1.5 * 3 cm) equilibrated in buffer D. The column was washed with 200 ml of
buffer D and the enzyme was eluted with a 0-0.7 M K/PO
4
, pH 7.0, linear gradient in buffer D. R.
C
viJI activity was recovered at 0.2 M K/PO
4
.
Molecular sieving.
Active fractions from the hydroxyapatite HTP column were concentrated to 3 ml
using ultrafiltration membrane YM10 (Amicon) and subjected to molecular sieving on a Sephadex G-100 (Pharmacia) column (2.5 * 100 cm) that had been equilibrated in buffer E [20 mM Tris-acetate, pH 8.0 (4oC), 60 mM potassium acetate, 2 mM EDTA, 10 mM [beta]-mercaptoethanol, 10% glycerol, 0.01% Triton
X-100, 50 [mu]g/ml PMSF, 20 [mu]g/ml benzamidine, 2 [mu]g/ml
o
-phenantroline, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A]. Peak fractions containing homogeneous R.
C
viJI were pooled, dialyzed against storage buffer S [20 mM Tris-acetate, pH 8.0 (4oC), 200 mM potassium acetate, 0.5 mM EDTA, 0.1 mM DTT, 50% glycerol,
0.02% Triton X-100, 50 [mu]g/ml PMSF, 0.7 [mu]g/ml pepstatin, 0.7 [mu]g/ml leupeptin A] and stored at -70oC.

REFERENCES
Return

