| Nucleic Acids Research | Pages |
Stability of G,A triple helices
Introduction
Materials And Methods
Oligonucleotide preparation
Aptamer approach
DMS footprinting
Estimation of triplex stability
Influence of magnesium and KCl concentration on triplex stability
Modeling studies
Results
Selection of the stable triplexes by the aptamer approach
Verification of triplex stability
Influence of `mismatches' in TFOs on triplex stability
Influence of different substitutions in the TFOs on triplex stability
Influence of Mg2+ and K+ concentration on triplex stability
Modeling studies
Discussion
Acknowledgements
References
Stability of G,A triple helices
Received March 16, 1999; Revised and Accepted May 17, 1999
ABSTRACT In this work we selected double-stranded DNA sequences capable of forming stable triplexes at 20 or 50°C with corresponding 13mer purine oligonucleotides. This selection was obtained by a double aptamer approach where both the starting sequences of the oligonucleotides and the target DNA duplex were random. The results of selection were confirmed by a cold exchange method and the influence of the position of a `mismatch' on the stability of the triplex was documented in several cases. The selected sequences obey two rules: (i) they have a high G content; (ii) for a given G content the stability of the resulting triplex is higher if the G residues lie in stretches. The computer simulation of the Mg2+, Na+ and Cl- environment around three triplexes by a density scaled Monte Carlo method provides an interpretation of the experimental observations. The Mg2+ cations are statistically close to the G N7 and relatively far from the A N7. The presence of an A repels the Mg2+ from adjacent G residues. Therefore, the triplexes are stabilized when the Mg2+ can form a continuous spine on G N7.
INTRODUCTION
Short homopurine-homopyrimidine regions in DNA have attracted a great deal of attention in connection with their possible role in gene regulation in eukaryotes (1,2). These regions are potential targets for manipulating gene expression, gene-targeted mutagenesis/knockout and inhibition of virus propagation through artificial triple helix formation (3-6). Recognition of DNA by triplex-forming oligonucleotides (TFO) occurs by hydrogen bonding between oligonucleotide bases and purine bases in the major groove of the target DNA duplex. Triple helix formation arises in either of two patterns, termed the pyrimidine motif and the purine motif (7-9). To promote triplex formation with cytosine-containing oligonucleotides, a slightly acidic environment is required. In contrast, the purine motif is pH-independent and has been used far more often for successful in vivo inhibition of transcription. However, the stability of G,A triplexes is highly unpredictable (10,11). The present work was therefore undertaken to identify sequences that form stable triplexes. This was achieved through a novel aptamer approach involving the simultaneous use of a random bank of target duplexes and a random bank of third strand oligonucleotides. This approach allowed us to select about 200 different sequences of 13mer oligonucleotides for their stability in a co-migration assay at 20 or 50°C. These sequences are characterized by a high guanine content. To verify the importance of guanine residues for the stability of the triplexes, some selected sequences were further analyzed by a cold exchange method at a different temperature. Finally, molecular modeling studies were undertaken to understand the role of guanine residues and divalent cations in the stability of the GA triplexes.
MATERIALS AND METHODS
Oligonucleotide preparation
Oligonucleotides were synthesized on an Applied Biosystems 391A DNA synthesizer using the solid phase phosphoramidite procedure. They were precipitated with 10 vol of a 3% solution of LiClO4 in acetone, then the pellets were washed with acetone, dried and dissolved in water. Concentrations were determined spectrophotometrically. Extinction coefficients were calculated from the extinction coefficients for the nucleotides and dinucleotide phosphates according to the equation given in Puglisi and Tinoco (12). For cold exchange experiments, oligonucleotides were purified by electrophoresis in a 20% denaturing polyacrylamide gel. After electrophoresis the oligonucleotides were eluted from the gel in 1 ml of 0.2 M LiClO4 solution for 12 h at 37°C, then precipitated with 10 vol of acetone.
Aptamer approach
The following oligonucleotides were used to select stable triplexes. Matrix 1, a random 13mer oligopurine was placed between two amplifying sequences, 5[prime]-CGAACGCCGGTACCA(R13)-CAAG-CTTAGGGGCGC-3[prime]. The flanking sequences contained restriction sites for KpnI and HindIII, respectively, to clone the selected triple helix forming duplex. An oligonucleotide 5[prime]-(R13)-C-biotin was used as a third strand for triplex formation. In another set of experiments we used a modified construct, matrix 2, 5[prime]-CGAACGCCGGTACCAtc(R13)caCAAGCTTA-GGGGCGC-3[prime]. Selection was performed as follows (Fig. 1). The duplex-containing random part was amplified with [[gamma]-32]ATP-labeled primers 5[prime]-CGAACGCCGGTACCA-3[prime] and 5[prime]-GCGCCC-CTAAGCTTG-3[prime] (specific activity 100 Ci/mmol) on either matrix 1 or 2 and purified on a 15% non-denaturing gel. After elution (overnight at 37°C in water), the purified duplex (~200 pmol) was incubated with 2000 pmol of the third strand oligonucleotide in a buffer containing 50 mM NaAc, 20 mM Tris-Ac, pH 7.5, and 10 mM MgAc2 for 24 h at 37°C. An aliquot of 1000 pmol of streptavidin was then added for an additional 30 min and the mixture loaded on a 10% polyacrylamide gel containing 50 mM NaAc, 20 mM Tris-Ac, pH 7.5, and 10 mM MgAc2. Migration was performed in a thermostated Hoeffer minigel apparatus at the temperature of selection (20 or 50°C for different experiments). After visualization of the bands by autoradiography, the band with low mobility was cut out and the DNA eluted in 50 µl of water at 37°C and re-amplified. The procedure was repeated three or four times. After the last round of selection the DNA was cut with KpnI and HindIII restriction enzymes and cloned in the corresponding sites of pBluescript II SK (Stratagene). Plasmids were purified on Qiagen columns, following the manufacturer's recommendations. Sequencing was performed with an Amersham ThermoSequenase cycle sequencing kit with M13-20 and T3 primers.
Figure 1. Design of the experiments to select stable GA triple helices.
DMS footprinting
Preparation of a DNA fragment. The plasmid used in this experiment was obtained by the aptamer approach and was found to contain the sequence 5[prime]-AGGAGGGGGAGGGGG-3[prime], which harbors the 13mer polypurine sequence that we have previously studied (13). To prepare a DNA fragment for modification by DMS, 30 µg of a plasmid containing a polypurine sequence from the c-pim proto-oncogene promoter (14) were cut with ClaI restriction enzyme, 3[prime]-labeled with the Klenow fragment of DNA polymerase I (Eurogentec) in the presence of 50 pmol of [32P]dCTP (3000 Ci/mmol) and digested with XhoI restriction enzyme. The large DNA fragment was then purified on Genclean beads (US Biochemical).
Probing with DMS. The prepared fragment (~0.5 pmol) was incubated with TFOs (100 pmol) in 20 µl of buffer containing 50 mM MOPS, pH 7.2, 50 mM NaAc and 10 mM MgAc2 at 25°C overnight. An aliquot of 2 µl of 5% DMS was then added and the reaction performed for 2 min at 25°C. The reaction was stopped by the addition of 5 µl of a solution containing 50% mercaptoethanol and 0.1 M NaAc. After double precipitation in ethanol, the samples were treated with 50 µl of 10% piperidine at 95°C for 20 min and the cleavage products were separated in a 6% polyacrylamide denaturing gel.
Estimation of triplex stability
Triplex formation. To form the triplex, 0.015 pmol of 5[prime]-32P-labeled purine TFO (specific activity 1000 Ci/mmol) was incubated overnight at 25°C with 1 µg (~0.5 pmol) of the target plasmid DNA in 10 µl of buffer containing 10 mM MgCl2, 50 mM NaCl and 20 mM Tris-Ac, pH 7.5.
Estimation of triplex stability by a cold exchange method. The triplex obtained in this manner was placed on ice and 250 pmol of corresponding unlabeled competitor oligonucleotide were added. The mixture was covered with mineral oil to avoid evaporation and incubated for 30 min at different temperatures, as specified in the figure legends. After incubation, the samples were immediately cooled on ice and loaded on a 1% agarose gel. Electrophoresis was performed in the cold room at 3.5 V/cm for 5 h in the presence of 10 mM MgCl2, 50 mM NaCl and 20 mM Tris-Ac, pH 7.5. During the migration, the buffer temperature was kept below 10°C. The gel was prepared for autoradiography by drying on the glass surface in a heated air flow (60°C). To check that the mobility of the retarded band corresponds to the TFO co-migrating with the target plasmid, the DNA was visualized by brief staining of the dried gel in a 10 µg/ml ethidium bromide solution. The quantity of radioactivity bound to the targeted DNA was determined using a STORM phosphorimager (Molecular Dynamics) with the flow-dried gel and plotted against the incubation temperature. Td is defined as the temperature at which half of the initially bound radioactivity dissociates from the plasmid (and, correspondingly, half of the triplex dissociates).
Influence of magnesium and KCl concentration on triplex stability
To form the triplex, 0.15 pmol of 5[prime]-32P-labeled purine TFO was incubated overnight at 25°C with 10 µg (~5 pmol) of the target plasmid DNA in 20 µl of buffer solution containing either 50 mM NaCl and 20 mM Tris-Ac, pH 7.5, (NaCl buffer) plus 1 mM MgCl2 or 150 mM KCl and 20 mM Tris-Ac, pH 7.5, (KCl buffer) plus 1 mM MgCl2. The stability of the triplex was estimated by a cold exchange method in the presence of varying amounts of magnesium at 37°C. After 30 min incubation, the samples were cooled on ice and 2 µl of 100 mM MgCl2 were added immediately to prevent further triplex dissociation. Electrophoresis and autoradiography were carried out as described above.
Modeling studies
The experimental system was modeled by a canonical ensemble consisting of ions and a single triplex set at the center of a sphere of 50 Å. The A6G7, G4AG5AG2 and (GA)6G triple helices were built up by stacking the GCoGbqt and the AToAbqt triplets from our database (15) using our programs MORCAD (16) and OCL (17). In these triplexes, the sugar of the third strand is the [beta] anomer, both purine strands are mutually anti-parallel and the glycosidic angle is in the anti conformation. Moreover, in these triplexes, the third strand divides the major groove of the duplex into a narrow groove between the two purine strands and a wider groove between the purines of the third strand and the pyrimidines of the duplex. The naked triplexes were minimized using our quasi-Newtonian minimizer MORMIN (16) and the AMBER 4.1 force field parameters (18,19). Details on the force field, algorithm, protocol and ionic concentration determination can be found in the Supplementary Material.
RESULTS
Selection of the stable triplexes by the aptamer approach
In order to find target sequences that form stable triplexes with purine TFOs, we developed a new aptamer approach using a random duplex matrix obtained by PCR amplification and a random third strand biotinylated oligonucleotide. In the presence of streptavidin the duplexes hybridizing with the TFO form complexes that were purified by gel retardation at a well-defined temperature. The selected duplexes were re-amplified and subjected to a new round of selection. After two or three rounds of selection the selected duplexes were cloned and sequenced. The experimental design is shown in Figure 1 and the details given in Materials and Methods. To verify that stable triplexes are selected by this approach, preliminary experiments were performed with the sequence 5[prime]-GGGGAGGGGGAGG-3[prime], which is known to form a very stable triplex with its corresponding TFO (13). For this purpose the matrix 5[prime]-CG-AACGCCGGTACCAGGGGAGGGGGAGGCAAGCTTA-GGGGCGC-3[prime] was subjected to one round of selection (Materials and Methods) in the presence of the third strand oligonucleotide 5[prime]-GGAGGGGGAGGGGc-biotin. The formation of a complex between streptavidin and the biotinylated TFO was used to achieve a better separation between the target duplex and the triplex in the co-migration assay. In the absence of streptavidin the mobilities of the duplexes and triplexes were too close to make the selection efficient (Fig. 2A). Oligonucleotide 5[prime]-GG-AGGGGGAGGGGc-biotin formed a low mobility complex with the duplex and streptavidin which could be easily separated from the duplex DNA by a co-migration assay (Fig. 2A). The low mobility band was not observed when the oligonucleotide was incubated either with the duplex DNA alone or with streptavidin alone (Fig. 2A). Because biotinylated TFOs gave the same pattern of guanine protection in DMS footprint experiments (data not shown), we suggest that the low mobility band is composed of a complex of the triple helix DNA with streptavidin. The other minor low mobility bands seen in Figure 2A are interpreted as complexes between streptavidin and more than one triplex. This experiment shows that stable triplexes can be selected by our technique.
Figure 2. Gel-retardation assay at 50°C. (A) With amplified matrix 5[prime]-CGAACGCCGGTACCAGGGGAGGGGGAGGCAAGCTTAGGGGCGC-3[prime] (duplex 1). Lane 1, duplex 1 plus unrelated oligonucleotide 5[prime]-GGAAGGAAAGGGGc-biotin; lane 2, the same plus streptavidin; lane 3, duplex 1 plus oligonucleotide 5[prime]-GGAGGGGGAGGGGc-3[prime]; lane 4, duplex plus TFO 5[prime]-GGAGGGGGAGGGGc-biotin plus streptavidin. (B) With amplified matrix 1 (duplex 2), second round of selection. Lane 1, duplex 2; lane 2, duplex 2 plus streptavidin; lane 3, duplex 2 plus oligonucleotide 5[prime]-(R13)-C-biotin plus streptavidin. S, T and D correspond to the start of migration, positions of the complex (duplex plus TFO-biotin plus streptavidin) and duplex, respectively.
Verification of matrix composition. To make sure that the composition of the initial matrix is random, we sequenced DNA from 40 clones obtained in the same way as for selected sequences but without selection for triplex formation. For this purpose, matrix 1 was amplified, purified on a non-denaturing gel, cut with restriction enzymes HindIII and KpnI, then cloned and sequenced as described in Materials and Methods. The results of this experiment show that matrix 1 is initially enriched in G residues (68.7%) although it was supposed to contain 50% guanines and 50% adenines (see Table S1 in Supplementary Material). We then constructed matrix 2 with an initial composition of 40% G and 60% A. Verification of this matrix actually gave a value of 60.8% for A residues (Table 1). We decided to make the selection with the two matrices independently. Another interesting finding concerns the variations in the length of the triple helix-forming region (Tables 1 and S1). These variations most likely arise during the amplification process and might reflect the difficulties of Taq polymerase in the synthesis of G-rich stretches.
Table 1. Results of selection with matrix 2
Column 1, sequences of the purine strand of the duplex before selection; columns 2 and 3, the same after three and four rounds of selection, respectively, at 20°C; column 4, selected sequences after three rounds at 20°C followed by one round at 50°C. At the end of each column percents of G residues are calculated for all sequences in the column.
Selection with matrix 1. Selection with matrix 1 was performed at the temperatures 20 and 50°C (Fig. 2B). For each temperature, three rounds of selection were performed and about 40 clones per temperature were sequenced (see Table S1 in Supplementary Material). The first conclusion to emerge from this experiment is that the G content of the selected sequences is greater than that of the matrix (Table S1). Moreover, this proportion increased with temperature (76.0% at 20°C and 82.0% at 50°C). Finally, in the selected sequences, no stretch of A residues longer than three was found.
Selection with matrix 2. In this experiment, we first performed three rounds of selection at 20°C, after which 40 clones were sequenced (Table 1). Here again, the proportion of G increased in comparison to non-selected sequences. However, the increase in G content after three rounds of selection was lower in matrix 1 (increase from 68.7 to 80%) than in matrix 2 (increase from 39.2. to 90%). To evaluate the impact of the number of selection cycles on the selected sequences, sequencing was performed after an additional fourth round of selection at 20 or 50°C. This round increased the proportion of G residues to 93.4% at 20°C and to 94.6% at 50°C. This clearly shows that the G content plays a major role in the stability of the G,A triplexes. Surprisingly, one of the sequences selected at 50°C (5[prime]-AGGAAGAGGAAGGA) contains only 50% G residues. The stability at 50°C of this sequence was not further confirmed by the cold exchange method (Table 2). This can be explained either by contamination of the duplex during PCR amplification or by interaction of these oligonucleotides with streptavidin. The exact position of the few adenines remaining in the selected sequences appears to be unimportant for stability so long as they do not lie in a row. Further support for this conclusion was obtained by independent estimation using a cold exchange method for several selected duplexes of different G content and various TFOs (see below).
Table 2. Determination of triplex stability (Td) of some selected sequences
Verification of triplex stability
Triplex stability was estimated by a modified cold exchange method (see Fig. 3 and Materials and Methods for details) using duplexes containing 13, 11, 9 and 7 guanines in one strand (Table 2). Corresponding TFOs were chosen according to canonical rules (9). The data obtained clearly show that stability decreases when the number of guanines decreases (Table 2). Moreover, stability depends not only on the content but also on the sequence. Td varied from 28°C in the case of seven G residues in a row to <10°C for the sequences with dispersed G residues (Table 2), in which case triplex formation is undetectable by our approach (Materials and Methods). However, if the sequences are longer the stability for sequences with dispersed G residues can be quantified. For the longer sequences of identical G content: (G)10(A)10 and (GA)10 the respective Td values at 50 and 22°C drastically differ.
Figure 3. Determination of triplex stability for the TFO 5[prime]-GAGGGAGAGGAGG-3[prime]. Target DNA was incubated with the TFO at different temperatures under the conditions described in Materials and Methods and subjected to a co-migration assay. (A) Gel after staining with ethidium bromide. (B) Autoradiograph of the same gel. S, D, D+TFO and TFO correspond to the positions of the start of the migration, target plasmid and target plasmid plus TFO and TFO, respectively. Lanes 1 and 3-8, incubation with the cold competitor TFO at 100 (negative control), 35, 40, 45, 50, 55 and 60°C, respectively; lane 2, without competitor (positive control). (C) Graph built on the basis of the data obtained in the co-migration assay. Td is defined as the temperature at which half of the initially bound radioactivity (A, line 2) dissociates from the plasmid. For this TFO it is equal to 45°C.
Influence of `mismatches' in TFOs on triplex stability
Since the aptamer approach involves selection of the duplex part and we were unable to see selected TFOs it was interesting to further investigate the rules for triplex formation in G-rich sequences. To do so, we studied the stability of triplexes containing one `mismatch' in the TFO. For this purpose we chose the target sequence 5[prime]-GGGGAGGGGGAGG-3[prime] and a set of TFOs containing one C instead of the canonical base (Table 3). Each `mutated[prime] TFO gave a less stable triplex than the canonical TFO. Some additional conclusions can be drawn here. First, guanines once again play the major role in triplex stability. For example, substituting A5 by C decreases Td to 52°C, but substituting any G flanking this A5 decreases Td to 40 (for G4) or to 21°C (for G6) (Table 3). For A11 the corresponding values are 55 (C11), 37 (C10) and 43°C (C12). Secondly, the bases in the middle of the sequence have more impact on triplex stability than the flanking bases (Table 3). To estimate how the mismatched TFOs interact with the target sequence, DMS footprint experiments were performed (Fig. 4). Here it was seen that substituting any guanine by cytosine in the TFOs decreased the guanine protection not only of the target base, but also of the neighboring guanines. This effect was more pronounced for the five central guanines, with the one exception of G7 (Fig. 4). Substitution of A by C had no visible effect on the footprint, suggesting that guanines play the most important role in triplex stability.
Figure 4. Autoradiogram of a 6% polyacrylamide sequencing gel showing the results of DMS footprint experiments with TFOs containing `mismatches' in different positions. Lane T-, without TFO; lane C0, TFO 5[prime]-GGGGAGGGGGAGG-3 (without `mismatch'); lanes C1-C13, TFOs containing corresponding substitution of G or A bases for C.
Table 3. Influence of `mismatches' in the TFO on triplex stability
Triplex stability (the value Td) was determined for the target sequence 5[prime]-GGGGAGGGGGAGG-3[prime] and a set of TFOs containing one C (marked in italic bold) instead of the canonical base. C0, TFO without `mismatch'.
Influence of different substitutions in the TFOs on triplex stability
To check that destabilization of the triplex is not specific for G->C or A->C substitutions in the TFO, we next performed experiments with all four base substitutions in the TFO (Table 4). In accordance with published data, A can be replaced by T without visible effects on triplex stability. Substituting G or C for A destabilized the triplex to the same extent, but even when both A residues were substituted the resulting triplex could be visualized in the co-migration assay. In contrast, replacing G by any base destabilized the triplex to an almost equivalent degree (Table 4).
Table 4. Influence of different substitutions in the TFO on triplex stability
Triplex stability (the value Td) was determined for the target sequence 5[prime]-GGGGAGGGGGAGG-3[prime] and a set of TFOs containing all four base substitutions (marked in italic bold) in the TFO instead of the canonical base. C0, TFO without `mismatch'.
Influence of Mg2+ and K+ concentration on triplex stability
To determine the minimal Mg2+ concentration at which the triplexes are still stable, we assayed triplex stability with the TFO 5[prime]-GGGGAGGGGGAGG-3[prime] in NaCl and KCl buffers at different Mg2+ concentrations (Materials and Methods). In NaCl buffer there was no dissociation at Mg2+ concentrations as low as 0.5 mM. The monovalent ion concentration markedly influenced triplex stability since no retarded bands could be visualized at concentrations below 1 mM Mg2+ in the presence of 150 mM K+.
Modeling studies
Since TFO composition and monovalent and divalent ion concentrations play a crucial role in triplex stability, we performed modeling studies to seek a correlation between ion distribution, TFO sequence and triplex stability. For this purpose, three triplex structures were chosen: one containing G and A bases and having the highest stability in our studies (G4AG5AG2) and two others with the same GA content but different stability [A6G7 and (GA)6G] (Table 3). The aim was to show that triplex ionic environments may play an important role in their stabilization. As we wanted to simulate a system made of a triplex immersed in a solution of given ion concentrations that would be as close as possible to the experimental conditions, we used a Monte Carlo method. In such a method, the ionic environment is built up by a succession of independent moves (20). An ion is randomly chosen and then randomly moved to a trial position inside a cube of size 2[delta], where [delta] is the maximum step size. The variation of the energy in this trial move is computed. The trial move is accepted or refused according to the Metropolis rule (21). The acceptance rate is the ratio of accepted over total trial moves and should be as close as possible to 0.5 for a good survey of the conformational space. In this work, preliminary calculations showed that the cations stuck to the triplex, so that a density scaled method had to be implemented (22; see also Supplementary Material). The distribution of ions should not and does not depend on the law chosen for [delta] if a very large number of trial moves are generated. However, the choice of the law has a large effect on the reproducibility of the results for a given computer time. Table 5 compares some values obtained after simulation of the distribution of counterions around three triple helices. In all systems, the number of ions shown in columns 2 and 3 was chosen so that the outer bulk cation concentrations fit the experimental values to within 10%. Each system was heated to get four independent starting ion distributions. The system was then cooled down as described in Supplementary Material. The averages and fluctuations of the average energies of each production run are shown in column 4.
Table 5. Monte Carlo results
![]() |
E. M. McGuffie and C. V. Catapano Design of a novel triple helix-forming oligodeoxyribonucleotide directed to the major promoter of the c-myc gene Nucleic Acids Res., June 15, 2002; 30(12): 2701 - 2709. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










