ABSTRACT
Probing of the HNF1 (hepatocyte nuclear factor I) DNA-binding region using a set of DNA duplexes containing pyrophosphate or
O
-methyl-substituted pyrophosphate internucleotide groups at different
positions of the HNF1 recognition sequence was performed. The histidine-tagged HNF1/1-281 DNA binding domain and nuclear extract from rat liver were
used. We showed that HNF1 from these species specifically binds to modified DNA
duplexes. A correlation in binding affinity of both types of duplexes was
detected. Crosslinking of the HNF1 DNA-binding domain and HNF1 in nuclear liver extract to DNA duplexes carrying
O
-methyl-substituted pyrophosphate groups was observed. The crosslinking efficiency of HNF1 in liver
extract to substituted pyrophosphate-modified DNA duplex, containing a reactive internucleotide group between nucleotides G and T of the GT dinucleotide immediately 5
'
to the TAAT recognition sequence, amounts to 40% of the efficiency of non- covalent association. Nonspecific crosslinking of the reactive DNA
duplexes to other components of nuclear extract was not observed. These results
indicate that DNA duplexes carrying substituted pyrophosphate internucleotide
groups can specifically bind and crosslink with DNA-binding proteins, especially transcription factors in crude preparations
and could constitute a potential tool to control the expression of disease-causing genes.
The development and differentiation of eukaryotic organisms are complex
phenomena that require specific regulation of the expression of particular genes. This control operates predominantly at the transcriptional
level and to a great extent depends on cellular sequence-specific transcription factors which interact with DNA sequences usually located in the 5'-flanking region of the gene (
1
). Transcription factors are attractive targets for regulating gene expression (
2
,
3
). Their inhibition by short double-stranded oligodeoxynucleotides containing recognition sequences can interfere with the
gene expression.
We have previously demonstrated that double-stranded oligonucleotides carrying recognition sites for DNA transcription factors can specifically inhibit gene expression at nM concentrations (
4
). To improve the efficiency of oligonucleotide competitors, we suggest using a
crosslinking procedure. Specific crosslinking of reactive double-stranded oligonucleotides to targeted factors results in their irreversible inhibition and may be considered a potential therapeutic tool to regulate the expression of harmful genes involved in viral diseases or cancer.
Crosslinking procedures are extensively used to study nucleic acid-protein interactions. Photochemical crosslinking of proteins to nucleic
acids containing 5-bromouridine (
5
,
6
), 4-thiouridine (
7
-
9
), 6-thiodeoxyguanosine (
9
) or azidonucleotides (
10
-
12
) have been utilized. This procedure has been successfully applied to study transcription factors. Photochemical crosslinking has been performed to identify four subunits of
Saccharomyces cerevisae
transcription factor IIIC and to explore the proximity of each of these
subunits to different parts of a yeast tRNA (
11
). This procedure involved the use of DNA probes containing arylazide derivatives of deoxyuridine. Photochemical crosslinking has been developed to identify contacts between 5S DNA and
Xenopus
TF IIIA. 5-Azido-2'-deoxyuridine substituted DNA has been used in this
investigation (
13
). However, photochemical crosslinking requires additional activators such as
light or heat. Additionally, aside from the propensity of arylazides to yield
reactive nitrenes upon exposure to activators, the reaction pathways utilized
by the generated nitrene vary according to the composition and environment of the aryl group (
10
).
Lately, more efficient crosslinking procedures using platinum complexes as
crosslinking reagents have been introduced (
14
). These reagents function within the physiological pH range and require no additional activators (
15
). Crosslinking of transcription factors CREB and JUN to their recognition sequences with PtII complexes
has been performed (
14
). However, to obtain high crosslinking efficiency it is necessary to use
phosphorothioate-containing oligonucleotides because crosslinking is much less efficient in
the absence of phosphorothioate groups (
14
).
Recently, we have suggested a novel method for inserting active groups in double-stranded DNAs to permit crosslinking with proteins (
16
,
17
). Our studies revealed that DNAs carrying substituted pyrophosphate
internucleotide groups react in near-physiological conditions with different nucleophiles, including
nucleophilic amino acids, according to the following scheme (
16
,
18
):
Oligonucleotides and DNA duplexes used in this study are depicted in Figure
1
. Oligonucleotides [1-32] forming DNA duplexes I-XVI and I'-XVI' and oligonucleotides [33-38] forming DNA duplexes double-stranded (ds) PE56, ds HF30 and
mut ds HF30 were synthesized by GENSET (Paris, France) using an automated DNA
synthesizer (Applied Biosystems 394/8). 5'-Phosphate oligonucleotides (5'-p) were synthesized using 5' Phosphate-On cyanoethyl phosphoramidite (Clontech)
as the phosphorylating reagent. Oligonucleotides carrying a 3'-terminal phosphate group (3'-p) were obtained by the phosphoramidite method, using the 5' phosphorylation reagent as previously
described (
19
). 5'-End labeling of oligonucleotides was carried out with T4
polynucleotide kinase and [[gamma]-
32
P]ATP, following standard procedures (
26
).
Synthesis of 3'-pOCH
3
oligonucleotides [1-7] was performed as described in (
27
). Synthesis of more extended 3'-pOCH
3
oligonucleotides [8-16] was performed as follows: 0.01-0.1 [mu]mol of 3'-p oligonucleotides [8-16] were incubated for 12 h at 4oC in 110 [mu]l of reaction mixture containing
0.25 M MES pH 4.5, 0.5 M MgCl
2
(Buffer A), 35% CH
3
OH and 11 mg (57.4 [mu]mol)
N-
(3-dimethylaminopropyl)-
N
'-ethylcarbodiimide (EDC). After incubation, oligonucleotides were recovered by precipitation with 10 volumes of 2% LiClO
4
in acetone, and were further reprecipitated three times by resuspension in 2 M
LiClO
4
and addition of 10 volumes of acetone. 3'-pOCH
3
oligonucleotides were isolated by reverse-phase HPLC on a Delta Pak 300 Å column (3.9 * 150 mm, 7 [mu] particle size), using a linear gradient of acetonitrile
in 0.1 M triethylammonium acetate buffer pH 7.0. At the next step 3'-pOCH
3
oligonucleotides were desalted by multiple evaporations to dryness from 50%
ethanol at 50oC. Methylation efficiency was 75-95% and the final yield after HPLC purification was 60-75%.
Oligonucleotides containing internucleotide pyrophosphate groups (p-p) at the positions indicated in Figure
1
were synthesized by template-directed chemical ligation (
28
) of the corresponding 3'-phosphate upstream oligonucleotide and 5'-[
32
P]-phosphate downstream oligonucleotide. Briefly, equimolar amounts (total nucleotide concentration = 10
-3
M) of the 3'-phosphate and the radioactive 5'-phosphate oligonucleotides to be ligated, and the
complementary 30mer template (oligonucleotides [36], HF 30 lower strand or
[35], HF 30 upper strand) in 0.05 M MES pH 6.0, 0.02 M MgCl
2
(Buffer B) were treated with 0.2 M EDC for 16 h at 4oC in the dark. Ligation products corresponding to the HF30 upper or lower
strands carrying p-(
32
P)p pyrophosphate groups at the ligation sites (see Fig.
1
), were isolated by electrophoresis on a 20% denaturing polyacrylamide gel, followed by elution with 2 M LiClO
4
and precipitation with 5 volumes of acetone.
Synthesis of oligonucleotides containing internucleotide
O
-methyl-substituted pyrophosphate groups (pOCH
3
-p) at the positions indicated in Figure
1
was performed as described for the pyrophosphate-modified oligonucleotides (see above), except that the 3'-phosphate upstream oligonucleotide was replaced by the corresponding
O
-methyl-substituted 3'-phosphate oligonucleotide (3'-pOCH
3
).
The plasmid pET-HNF1/1-281 contains the cDNA sequence coding for a complete DNA binding
domain of HNF1 (
29
-
31
), with no additional C-terminal amino acids. This plasmid was constructed as follows. The DNA
fragment encoding the region between Met
1
and Leu
281
of rat HNF1 was generated by PCR on pRSV-HNF1 template (
29
) using a 5' primer containing an
Nde
I site overlapping the Met
1
codon, and a 3' primer in which the Leu
281
codon was followed by a stop codon and a
Bam
HI restriction site. The PCR fragment cut with
Nde
I and
Bam
HI was inserted between the corresponding sites in pET-14b (Novagen, Inc.) and sequenced by the dideoxy method.
Nuclear extracts from rat liver were prepared as described previously (
23
). Histidine-tagged HNF1 DNA binding domain was expressed in
E.coli
strain BL21(DE3)pLysS (
31
) transformed with pET-HNF1/1-281, and purified as follows. Cells were grown to mid-log phase and HNF1/1-281 expression was induced by adding 0.4 mM IPTG during 3 h at 37oC. After induction, cells were harvested and lysed by sonication in a
buffer containing 5 mM imidazole, 20 mM Tris-HCl pH 7.9, 500 mM NaCl, 1 mM PMSF (Buffer C). Insoluble material was
removed by centrifugation, and the supernatant was filtered through a 0.45 [mu]m membrane. The soluble protein extract was then applied to a Ni
2+
affinity column (His-Bind Resin, Novagen Inc.). After extensive washing with buffer C containing 60 mM imidazole, His-tagged HNF1/1-281 was eluted with buffer C containing 1 M imidazole. Protein was
>95% pure at this stage, as determined by SDS-PAGE. The purified protein was
supplemented with 20% glycerol and was stored at -80oC.
Binding of the DNA duplexes I-XVI carrying
O
-methyl-substituted pyrophosphate groups (pOCH
3
-[
32
P]p) was carried out as previously described (
4
). Three fmoles of
32
P-labeled duplex was added to 1.4 ng of His-tagged HNF1 DNA binding domain or 0.5-2 [mu]g nuclear extract in 14 [mu]l buffer containing 10 mM Hepes pH 7.9, 50 mM KCl, 0.1 mM EDTA, 0.5 mM DTT, 10% (vol/vol) glycerol, 0.25 mM PMSF, 2.5 [mu]g/ml aprotinin, 2.5 [mu]g/ml leupeptin, 6 mM MgCl
2
, 6 mM spermidine, 1.5 [mu]g poly(dI-dC)
.
poly(dI-dC) and 250 ng sonicated salmon sperm DNA. After incubation at 4oC for 10 min, binding was detected by gel shift assay on 6% non-denaturing PAGE gels containing 0.25 * TBE.
Binding of the non-modified duplexes dsPE56, dsHF30, mut ds HF30 and modified DNA duplexes I'-XVI' containing pyrophosphate groups was carried out in the same way. Both 5'-ends of dsPE56, dsHF30, mut ds HF30, or
on pyrophosphate group (p-[
32
P]p) of duplexes I'-XVI' were
32
P-labelled.
DNA duplexes I-XVI were incubated with purified HNF1 or liver nuclear extract as
described above. After incubation at 4oC for 10 min, at 4oC for 12 h or at 20oC for 10 min or 12 h the mixtures were directly loaded on 0.1%
SDS-10% PAGE and the crosslinked product separated from non-crosslinked duplex as described earlier (
23
). The contribution of a 30mer DNA duplex in the protein mobility is equivalent
to 16 kDa.
Crosslinking of ds oligonucleotides carrying substituted pyrophosphate
internucleotide groups to DNA-binding proteins can be achieved if nucleophilic amino acid residues in protein are implicated in direct
interactions with reactive internal groups. Therefore to design crosslinking
reagents on the base of substituted pyrophosphate-modified ds oligonucleotides it is necessary to determine phosphate groups
that are in close proximity with such amino acids.
It has been previously demonstrated that ds oligonucleotides containing the HNF1
binding site can specifically interact with HNF1 (
4
,
23
). In order to determine if HNF1 is able to specifically bind DNA duplexes
carrying pyrophosphate or substituted pyrophosphate groups instead of natural
phosphate ones both types of modified DNA duplexes I-XVI and I'-XVI' were tested for binding to the His-tagged DNA binding domain which contains all
of the determinants necessary for DNA recognition (
29
). Binding was detected by gel retardation shift assay.
Figure Figure 4 shows an autoradiogram of 6% non-denaturing PAGE gel after binding of DNA duplexes I-XIII and I'-XIII' to the DNA binding domain of HNF1. As shown in Figure 4
, when we incubated radioactive duplexes with protein, DNA-protein complexes corresponding to the HNF1 DNA binding domain retarded
band were observed in most cases. For DNA duplexes I, II, VIII, X, XI and I', II', VIII', XI' the binding efficiency of HNF1 to modified duplexes
was similar to the binding efficiency of ds HF30, which does not contain any
modifications. For duplexes III, V, XII, XIII and III', V', X', XII', XIII' the binding efficiency was lower and for
duplexes IV, VI, VII, IX, XIV-XVI and IV', VI', VII', IX', XIV'-XVI' binding was abolished
(for XIV-XVI, XIV'-XVI' data not shown). These data indicate that HNF1 can
recognize and effectively bind DNA duplexes containing pyrophosphate or
O
-methyl-substituted pyrophosphate groups within the recognition site. A correlation between binding
affinity of both types of modified duplexes was observed. This means that the presence of methyl substituent at the pyrophosphate group did not influence binding. The results of binding are summarized in Figure
4
C. As shown in Figure
4
C, specific binding efficiency depended on the place of modification and was
higher when the ends of the HNF1 binding site were modified. On the contrary,
when modified groups were placed near the center of dyad symmetry of the
pseudopalindromic binding site, binding was much less effective or was abolished. This can be explained by lengthening
of the internucleotide bond in the place of modification and consequently by local change of double helix geometry near the central nucleotide pair of palindromic sequence. This can interfere with the conformational change (presumably bending) of DNA, which
requires binding of HNF1 to a B-DNA palindromic sequence (
32
). We suggest that this conformational change involves a part of palindromic sequence adjacent to the center of dyad symmetry.
Figure
Additionally, it should be noted that after binding of some pyrophosphate-modified DNA duplexes (II', IV'-VIII' and X') to the HNF1 binding domain additional
bands were observed in the gel (Fig.
4
B). These bands were formed when the binding procedure is carried out in the
absence of HNF1. After addition of the protein in some cases these nonspecific
bands disappeared (see, for example, duplex II'). This can be explained by nonspecific binding of these modified duplexes to the components present in the
buffer for binding procedure such as aprotinin or leupeptine. The following
addition of HNF1 could compete for the binding to their target sequences. However, binding of
O
-methyl-substituted pyrophosphate DNA duplexes was highly specific.
In order to demonstrate the ability of substituted pyrophosphate-modified DNA duplexes to bind specific transcription factors in crude
preparations we studied the binding of duplexes I-XVI and I'-XVI' to the HNF1 in rat liver nuclear extract as
described above. It was particularly important because cell extracts contain many components that could interfere with the binding procedure. As shown in Figure
5
, after incubation of labeled DNA duplexes with the liver nucleic protein, DNA-protein complexes corresponding to the HNF1 retarded band were detected in all cases mentioned above.
However, in general the efficiency of binding was lower. It should be noted
that the data obtained for liver nuclear extract correlated with those obtained
for the HNF1 DNA binding domain. The binding efficiency decreased from the ends
of the HNF1 binding site to their center. Near the center of dyad symmetry
binding was abolished.
Figure After incubation of DNA-duplexes containing
O
-methyl- substituted pyrophosphate groups with nuclear cell extract we did
not detect any additional bands corresponding to nonspecific complexes of these
DNA duplexes with other nuclear proteins. On the contrary, after incubation of
pyrophosphate-modified DNA duplexes I'-XVI' with liver extract intensive additional bands were observed almost in all cases (Fig.
5
B). Thus, introduction of methyl substitute into the pyrophosphate
internucleotide group not only leads to the formation of reactive compounds
that are able to crosslink to DNA-binding proteins, but provides the high specificity of binding.
The interaction between protein and modified DNA duplexes in the cases when it
was observed was shown to be sequence specific by two independent criteria.
First, an excess of unlabeled ds PE56 or duplex X acted as a competitors in the
binding experiments (Fig.
6
). A 50-fold excess of competitors almost completely blocked the attachment of
labeled modified DNA duplex. At the same time the presence of mut ds HF30
containing an HNF1 mutated site that totally abolishes HNF1 binding (
23
) under the same conditions did not compete with labeled modified duplexes for
HNF1 binding (data not shown).
We have previously demonstrated that ds oligonucleotides carrying substituted pyrophosphate internal groups have been successfully applied to the
crosslinking to some DNA-binding proteins (
18
-
21
). In order to choose the effective crosslinking reagent for transcription factor HNF1 we used a similar crosslinking procedure.
Histidine-tagged HNF1 DNA binding domain or nuclear extract from rat liver were
incubated with body-labeled DNA duplexes I, II, VIII, X, XI, XII and XIII, which are shown to
bind specifically to the HNF1 under standard conditions described above. Crosslinking products were separated from non-crosslinked duplexes in SDS-polyacrilamide gels. As shown in Figure
7
, the incubation of
O
-methyl-substituted DNA duplex X with liver extract or with HNF1 DNA binding domain resulted in crosslinking. In the other cases the formation of the crosslinking products was
not observed. Because the covalent attachment of short oligonucleotides has a
minor measurable effect on the mobility of this protein in SDS-polyacrylamide gel electrophoresis (see Materials and Methods), these
experiments indicate that: (i) in the case of liver extract the crosslinking
complex gave rise to a ~100 kDa band corresponding to the 87-93 kDa protein linked to the 16 kDa oligonucleotide, which is in
agreement with the data observed earlier (
23
,
29
); (ii) the crosslinking complex with the HNF1 DNA binding domain gave rise to a
~60 kDa band corresponding to the ~45 kDa His-tagged HNF1/1-281 (
29
) to the 16 kDa DNA duplex.
Crosslinking should to be specific because binding of HNF1 with
O
-methyl-substituted pyrophosphate-modified DNA duplexes resulted in only one specific DNA-HNF1 complex (see above).
Crosslinking efficiency depended on the reaction time and temperature (Fig.
7
) and was higher when the reaction mixture was incubated overnight at room
temperature. The crosslinking effficiency was 40% of the efficiency of non-covalent association. In order to improve the crosslinking yield we added to the reaction
mixture 1 [mu]l of 0.4 M
N-
methylimidazole (N-MeIm) pH 8.0. It has been established previously that the addition of N-MeIm can lead to the increase of crosslinking efficiency (
18
,
19
). However it did not influence the extent of covalent attachment (data not
shown).
Thus, DNA duplex X specifically crosslinks with the HNF1 transcription factor in
cell extract. As shown in Figure
7
, additional bands corresponding to nonspecific crosslinking to the other
nuclear proteins could not be detected. Consequently, our crosslinking
procedure is generally applicable to the nuclear cell extract and could
constitute a potential therapeutic tool for inhibition of the expression of
harmful genes.
Additionally, our results indicate that DNA duplex X contained a substituted
pyrophosphate group between nucleosides G and T adjacent to the TAAT
recognition sequence. Consequently, nucleophilic amino acid (or amino acids)
from the DNA binding region of HNF1 have a specific contact with the phosphate
group located between G and T bases in the binding site. This could be lysine
or arginine since; as we have described earlier, DNA duplexes containing
substituted pyrophosphate groups cleaved efficiently under the action of such
amino acids (
18
). Hystidine seems to be less probable because in this case phosphoroimidazolide of oligonucleotide should be formed as a result of crosslinking. However, such
compounds are unstable in aqueous solution and cleave immediately giving
starting materials (
34
).
In the natural cognate sequences the GT dinucleotide 5'-adjacent to the TAAT recognition sequence is strictly conserved (
25
). In agreement with this it has been suggested that amino acid residues from
DNA binding region have specific contacts with GT bases in DNA (
33
). Our experimental data confirm that the phosphate group between G and T has a
specific contact with residue(s) from DNA binding region of rat liver HNF1.
*To whom correspondence should be addressed. Tel: +7 095 939 31 53; Fax: +7 095
939 3181; Email: kuznetsova@biorg.chem.msu.su



REFERENCES
Return


