ABSTRACT
An analogue of the DNA binding compound Hoechst 33258, which has the para hydroxyl group altered to be at the meta position, together with the replacement of one benzimidazole group by pyridylimidazole, has been cocrystallized with the dodecanucleotide sequence d(CGCGAATTCGCG)2. The X-ray structure has been determined at 2.2 Å resolution and refined to an R factor of 20.1%. The ligand binds in the minor groove at the sequence 5'-AATTC with the bulky piperazine group extending over the C[middot]G base pair. This binding is stabilised by hydrogen bonding and numerous close van der Waals contacts to the surface of the groove walls. The meta-hydroxyl group was found in two distinct orientations, neither of which participates in direct hydrogen bonds to the exocyclic amino group of a guanine base. The conformation of the drug differs from that found previously in other X-ray structures of Hoechst 33258-DNA complexes. There is significant variation between the minor groove widths in the complexes of Hoechst 33258 and the meta-hydroxyl derivative as a result of these conformational differences. Reasons are discussed for the inability of this derivative to actively recognise guanine.
It is well established that a variety of small molecules such as netropsin, berenil, pentamidine, distamycin and Hoechst 33258 bind in the minor groove of duplex DNA, with a marked preference for AT-rich regions (1 -3 ). These groove binding species and their derivatives have shown a wide range of biological activity from antitumour to antiviral, antifungal and antimicrobial (for example, refs 4 -9 ). Their modes of action all involve initial groove binding to the DNA possibly followed by interference in the function of DNA topoisomerases (10 ,11 ) or with transcription factor binding (12 -15 ). In order to bind effectively in the minor groove these compounds should be able to follow the contours of the minor groove without unfavourable contacts and their charge distribution must be appropriate for the electrostatic potential encountered within the binding site of the groove. The minor groove binders therefore tend to have similar structural properties, such as an inherently concave shape along one long edge and cationic charge, often at the ends of the molecule. They often contain a series of linked aromatic ring systems which are close to coplanar when viewed from the side.
NMR (16 ,17 ) and X-ray (18 -24 ) studies of minor groove binding drugs complexed to DNA oligonucleotides have characterised the non-covalent interactions involved in drug-DNA recognition. These studies have shown the drug bound in the narrow minor groove of AT-rich sequences stabilised by an array of van der Waals interactions (25 ) involving the hydrogen atoms of the walls and floor of the minor groove. Direct hydrogen bonding between the drug and various sites on the DNA is frequently (though not invariably) observed (23 ). The sequence selectivity of groove binders is therefore likely to be dominated by their ability to recognise variations in DNA groove width, with DNA-drug hydrogen bonding interactions being a secondary factor (26 ).
Much of the current focus of minor-groove drug design is aimed towards improving the selectivity of the drugs to specific base pair sequences of duplex DNA. A number of minor groove binders have been designed that can bind to both A[middot]T and G[middot]C base pairs in stretches of mixed sequence by taking advantage of the sequence dependence of minor groove width in B-form DNA (27 -30 ). The bis-benzimidazole compound Hoechst 33258 (1; Fig. 1 ) is a primitive example of a GC-tolerating molecule (at one end of its binding site) as a result of the bulk of the saturated piperazine ring on one end of the molecule requiring a wide GC minor groove region.
The DNA dodecamer d(CGCGAATTCGCG)2 was purchased from the Oswel DNA Service (University of Edinburgh) and annealed before use. The meta-OH(N) Hoechst compound 2 was prepared as the hydrochloride salt. Its synthesis and biological properties will be reported elsewhere; its DNA-binding properties have been described (42 ). The complex was grown from hanging drops at 18oC as colourless plates. The crystal used for data collection was grown from a drop containing 4 [mu]l of 35% w/v 2-methylpentane-2,4-diol, 2 [mu]l of 5 mM drug, 2 [mu]l of 200 mM MgCl2 and 3 [mu]l of 2 mM dodecamer, equilibrated against a reservoir containing 1 ml 35% w/v 2-methyl-2,4-pentanediol. All solutions were prepared in 30 mM sodium cacodylate buffer at pH 7.0. An X-ray quality crystal was obtained after ~3 months.
The single crystal used for data collection was of approximate dimensions 0.30 * 0.24 * 0.08 mm and was mounted in a 0.5 mm Lindemann glass capillary with a small amount of mother liquor. Diffraction data were collected at 20oC to a maximum resolution of 2.2 Å using a Siemens-Xentronics multiwire area detector and a rotating-anode X-ray generator operated at 3 kW. A crystal-to-detector distance of 10 cm and a swing angle of 15o was used with [chi] set at 45o to obtain data sets at [Phi] values of 0 and 60o. The crystal was rotated through 100o in [omega], collecting 180 s frames every 0.2o. The crystal showed no significant decay during data collection. Data processing was performed using XENGEN version 1.3. After merging, the data comprised 3292 of the possible 3514 unique reflections (i.e. 93.0%) to 2.2 Å, with a merging R value of 4.5%.
The crystal was found to be in the space group P212121 with unit cell dimensions of a = 24.62, b = 40.35 and c = 66.23 Å. This cell is isomorphous with that for the native sequence and other drug-dodecamer structures and so the coordinates for the native structure [PDB entry 9BNA (47 )] were used for the initial model. The refinement process was carried out using the program X-PLOR version 3.1 (48 ). Initial rigid-body refinement of the whole duplex model was carried out for data in the range 8.0-4.0 Å, the resolution range of the data being gradually increased to 8.0-3.0 Å. The DNA duplex was then divided into 24 rigid groups comprising the 24 nucleotide units and the upper resolution limit was gradually increased to the maximum of 2.2 Å during further rigid-body refinement. At this stage the R factor was 31.5%. Positional and temperature factor refinement reduced the R factor to 28.3%. Electron density maps were calculated and displayed using the graphics package FRODO/TOM (version 3.1 Alberta/Caltech). The DNA molecule fitted the electron density well in all regions of the model and a continuous volume of electron density was observed in the minor groove of the DNA. The molecular structure of the meta-OH(N) Hoechst molecule was built from one of the Hoechst 33258 crystal structures (35 ) using the computer modelling package Insight II (49 ). The lobe of continuous electron density in the minor groove showed sufficient detail to allow the drug molecule to be unambiguously fitted to it with a unique orientation and position and with the piperazine group in the stereochemically-acceptable conformation (Fig. 2 ). The location of the meta-hydroxyl group itself was ambiguous and two possible structures were refined from this point, one with the hydroxyl pointed into the groove and the other with it pointing out. Partial charges were calculated for the drug molecule by an empirical calculation using the cvff (default) force-field of Insight II. Planar restraints were applied to each individual aromatic group. Positional and temperature refinement was continued with the drug molecule included. During refinement a total of 84 water molecules and a magnesium ion were located in Fo-Fc electron density maps and included in the structure. The final R factor for the `OH in' conformation was 20.2% for all observed data in the range 8.0-2.2 Å, with rms deviations from the ideal of 0.019 Å and 1.37o for bond lengths and angles, respectively. The X-PLOR refinement and R factor calculations used 3097 reflections with F > 2[sigma](F). For the `hydroxyl out' structure the final R factor was 20.4%, with 84 water molecules and the magnesium ion. Final refined coordinates for both `hydroxyl in' and `hydroxyl out' structures, together with the observed and calculated structure factors have been deposited in the Nucleic Acid database with the identity codes X96024 and X96025.
The DNA duplex adopts a B-DNA conformation with the ligand lying in the narrow central region of the minor groove (Fig. 3 ). The asymmetric unit contains one anti-parallel double helix, the two strands being crystallographically distinct. Packing of the helices in the unit cell is analogous to that observed for almost all other dodecanucleotide crystal structures in this space group, with the two terminal G[middot]C base pairs of each helix interacting with a neighbouring duplex via minor groove-minor groove base pair hydrogen bonding and crystal packing interactions. This leaves the central 8 bp free of any direct crystal packing forces and allows a drug molecule to reside in this region of the groove, free of direct crystal packing constraints. There is no evidence of the drug molecule being disordered along the groove, where it adopts a unique position and orientation, similar to that seen for a number of other Hoechst 33258 complexes, even though the conformation of the phenol ring was ambiguous due to possible ring flipping. The drug is located in the sequence at the 5'-AATTC site, with the phenol ring lying towards the 5' end of the sequence and the bulky piperazine group close to the C9[middot]G16 base pair (the `pip-down' orientation). The two possible conformations of the phenol ring have the hydroxyl group pointing directly either into or out of the groove. Both resulting structures show good geometry but only the `hydroxyl in' structure shows any evidence of (weak) direct hydrogen bonding of the hydroxyl group to the DNA and also has a slightly lower R factor (20.2% compared with 20.4%). The former orientation has been predicted by NMR studies. The two possible conformations and the comparatively larger temperature factors for the phenol ring when compared with the rest of the molecule (Table 1 ) could be evidence of ring flipping occurring although this dynamic effect was not observed in the NMR studies of meta-Hoechst (40 ,41 ,43 ). Both the `hydroxyl in' and `hydroxyl out' structures, while not showing significant direct hydrogen bonding between hydroxyl group and DNA, contain a number of indirect hydrogen bonds to the DNA via extended water networks. The existence of the `hydroxyl out' conformation is itself indicative of a weak propensity for the hydroxyl group to interact with DNA. We shall discuss the `hydroxyl in' conformation in greater detail since this one has been predicted to be capable of directly recognising a guanine base edge.
There are two sets of bifurcated hydrogen bonds between the edges of the bases and the two inner facing nitrogen atoms N1 and N3 of the benzimidazole and pyridylimidazole rings. These are from N1 of the drug to O2 of T20 and N3 of A6, and from N3 of the drug to O2 of T19 and O2 of T7 (Fig. 4 ). These bifurcated hydrogen bonds result in the ligand being held in a binding site that is overall similar to that found in the various Hoechst 33258 X-ray structures where the same pattern of bifurcated hydrogen bonds is seen although the distances vary considerably (Table 2 ). The pyridyl nitrogen atom N7 does not take part in any hydrogen bonding with the DNA and so the present structure is highly likely to be similar to that of meta-Hoechst itself. N7 is in van der Waals contact (3.5 Å) with atom N3 on the edge of A18, on the floor of the groove. The meta-hydroxyl oxygen atom O3 makes two direct, albeit very weak, hydrogen bonds to N3 of A5 and to O2 of C21 (Table 4 ). O3 does hydrogen bond to two water molecules, W30 and W43, which themselves bridge between base edge atoms and O4' atoms of a sugar ring (Fig. 4 ). There is no direct interaction between O3 and the 2-amino group of G4, in contrast with predictions from earlier studies (40 -42 ). The OH ... N2G4 distance is 4.2 Å. The oxygen atom of this hydroxyl group is in a similar position to, and shows the same pattern of hydrogen bonding as the N7 hydrogen bond donor of distamycin in its complex with d(CGCAAATTTGCG)2 (50 ). The water molecules W30 and W43 are part of an extensive hydrogen-bonded water network in and around the minor groove involving around 30 water molecules. This network, which extends into the major groove and also contains a hydrated magnesium ion, makes numerous contacts with the DNA including contacts to symmetry-related helices. An analogous network also stabilises the `hydroxyl out' structure and may contribute to the stability of this alternative conformation, as observed in the crystal structure. A well-defined octahedral mass of electron density was assigned as the Mg(H2O)62+ ion with each of the co-ordinated waters hydrogen bonded to various sites on two symmetry-related helices. These magnesium co-ordinated waters form hydrogen bonds to N7 or O6 atoms of G2 or C22 on one helix, or to phosphate oxygen atoms of A6 or T7 on the other helix. A magnesium ion was also located at this same site in the low-temperature structures of Hoechst 33258 bound to d(CGCGAATTCGCG)2 (33 ). At the other end of the minor groove, N6 of the piperazine group forms a bifurcated hydrogen bond to water molecules W62 and W65. These water molecules are part of a group of eight hydrogen-bonded waters which form a total of six hydrogen bonds to various DNA atoms including sugar, base and phosphate oxygen sites. One of the waters twice removed from N6 hydrogen bonds to O2PG10 of a symmetry-related helix. Another group of seven waters are found hydrogen-bonded via W84 to the benzimidazole atom N2. Again, one water twice removed from the ligand is hydrogen-bonded to the DNA at O3'C21 and another water hydrogen bonded to this one, itself hydrogen bonds to a symmetry-related phosphate oxygen. Many of the waters in these networks also fulfil a space-filling role.
The degree of drug twisting between each of its component groups is quite different from that observed for other Hoechst 33258/d(CGCGAATTCGCG)2 structures (31 -33 ) (Table 3 ), although the torsion angles between phenyl and benzimidazole rings are similar to those found in the -100oC low temperature (LT) structure (33 ). They are also close to those found in the NMR study of Hoechst 33258 complexed with the sequence d(GTGGAATTCCAC)2 (52 ,53 ). The LT studies found that lowering the temperature resulted in the two benzimidazole groups becoming progressively more coplanar. It was suggested (33 ) that at low temperatures the molecule would have insufficient energy to overcome the barrier to rotation between the benzimidazole ring systems, so that they could become more coplanar as a result of the favourable delocalisation of electron density. This coplanarity results in a poor fit to the walls of the minor groove and so at higher temperatures groove contacts favour a twist at each linker. The cause of the large phenol-benzimidazole torsion angle for the -100oC structure was unclear. It is possible that as the benzimidazole and pyridylimidazole rings become coplanar, groove wall contacts with the phenyl ring become unfavourable and the conjugation between phenyl and benzimidazole is lost. The torsion angles for the present ambient temperature structure can be explained by a different mechanism. Here, in order for a number of interactions to take place simultaneously, i.e. the phenyl hydroxyl to form hydrogen bonds with water molecules and weak contacts made with the DNA, for the benzimidazole and pyridylimidazole hydrogen bond donors to remain in phase with the base acceptor sites, and for the ligand to follow the groove topology, the molecule is forced to adopt the observed conformation. Whereas the delocalisation between phenyl and benzimidazole groups is weakened, that between the benzimidazole and pyridylimidazole moieties is favourable. This type of conformational change between RT Hoechst and meta-OH species was predicted previously by the NMR studies of meta-Hoechst (40 ,41 ).
Hoechst 33258 and meta-OH(N) Hoechst have very similar structures. Both contain a protonated, non-planar saturated heterocyclic ring system at one end of the molecule and a phenolic ring at the other. The benzimidazoles of the former and the benzimidazole/pyridylimidazole rings of the latter bind to the 5'-AATTC site in the minor groove by a combination of non-bonded interactions to the walls and floor of the groove and bifurcated hydrogen bonding to adenine N3 and thymine O2 atoms. The meta-OH substitution has little effect on the overall binding site of the drug, with the bulky N-heterocyclic ring requiring a widened groove for binding and the remainder of the drug favouring the non-bonded interactions associated with the narrow AATT region. This similarity in binding site and in the binding interactions overall is in accord with the closely similar DNA binding (and antitumour) properties of the two drugs (41 ,42 ). Thus, measurements of the extent of thermal stabilisation of the double helix gave essentially identical Tm values for them (41 ). The principal difference seen with meta-OH(N) Hoechst compared with the other Hoechst 33258 complexes is the (weak) bifurcated hydrogen bonding to N3 of A5 and to O2 of C21. This is distinct from the bifurcated bonding to the 2-amino group of G4 and O2 of C21 predicted by NMR (40 ,41 ). The phenol ring flipping indicated by the X-ray structure is not evident in the NMR studies of meta-Hoechst. This poses the question as to whether or not ring flipping is occurring in solution and is not being detected, or whether the ring flipping seen in the X-ray structure is solely an artefact of the crystallisation process. The weakness of the phenol OH hydrogen bonding to the DNA as seen in the present structure may well be an indication that phenol ring flipping can occur in solution although this cannot be automatically assumed from the present solid state crystal structure.
The position of the hydroxyl group being meta forces the drug away from the floor of the minor groove at the phenol binding site and rotates the rest of the molecule towards the groove for effective binding. The pyridylimidazole moiety is in closer contact with the groove floor than the second benzimidazole in the parent Hoechst 33258 because of this rotation and the presence of the non-protonated N7 atom. This closer approach allows the formation of the N3 ... O2T7 hydrogen bond, which was not seen in the 15oC Hoechst structure. The replacement of a carbon atom in Hoechst with N7 was designed to allow a greater tolerance of G[middot]C base pairs (42 ,46 ). The unprotonated N7 atom could tolerate the increased steric hindrance of the exocyclic amino group at G2, compared with the C2 hydrogen atom on adenine bases, as well as the possibility of N7 ... NH2G hydrogen bond formation. These interactions do not occur in the present structure.
The drug-DNA hydrogen bonding, while probably being a secondary factor in the binding, has an influence on the drug conformation. The formation of the phenol-DNA hydrogen bonds and the requirement that the molecule follow the groove topology and also keep its hydrogen bond donors in phase with the DNA acceptor sites at the edges of the base pairs, forces the meta-OH(N) Hoechst molecule into a conformation quite different from that of other Hoechst structures in 1:1 ligand-DNA complexes. This results in groove widening over the binding sites of the phenol and benzimidazole rings greater than that seen in the 15oC Hoechst structure. The most significant increase in minor groove width is found at the phenol binding site because of the high ring twist and shallow binding depth. The present structure also highlights the importance of water networks, not only in stabilising drug binding but also in stabilising the DNA double helical structure with very extensive hydrogen bonding networks linking symmetry-related helices as well as forming intramolecular bridges.
The observation of meta-OH(N) Hoechst binding in the same site as Hoechst 33258 itself, i.e. mostly in the narrow groove AT region, provides further support for the hypothesis that binding of such molecules to DNA is largely dominated by groove width factors, with complementarity of drug narrow cross-section and surface of groove walls (24 ,25 ,27 -30 ). The piperazine ring is the sole moiety of the Hoechst 33258 and meta-OH(N) Hoechst molecules which has a pronounced requirement to bind in the wider minor groove of the GC region in these complexes. The meta hydroxyl group cannot achieve GC recognition in the minor groove for two reasons: (i) the phenyl ring has too narrow a cross-section for effective binding to a wide GC-rich minor groove, and most importantly, (ii) the hydroxyl group is positioned insufficiently deeply into the groove for interaction with a guanine exocyclic amino group. The present structure demonstrates that it is almost equivalent in position to the inner-facing nitrogen atom of a benzimidazole or pyridylimidazole ring, and so its weak bifurcated hydrogen-bonding to N3A5 and to O2C21 is unsurprising. Effective GC recognition would require a substituent at this end of the bis-benzimidazole core that addresses both of the problems posed by meta-OH(N) Hoechst, i.e. that has sufficient size to contact the walls of the widened groove, as well as having substituents that reach the groove floor.
This work was supported by a CRC programme grant to S.N. GRC is grateful to the University of Auckland for sabbatical leave at The Institute of Cancer Researchwhen this work was performed. We are grateful to Christine Nunn, John Trent and Alexis Wood (ICR) for discussions and assistance on crystallographic aspects of this work.
*To whom correspondence should be addressed. Tel/Fax: +44 181 643 1675; Email: steve@iris5.icr.ac.uk
+Present address: Department of Physical Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
REFERENCES
Return




