Sensitivity of NMR internucleotide distances to B-DNA conformation: underlying mechanics
Sensitivity of NMR internucleotide distances to B-DNA conformation: underlying mechanicsAnne Lefebvre, Serge Fermandjian and Brigitte Hartmann1,*
Département de Biologie Structurale, URA 147 C.N.R.S., Institut Gustave Roussy, P.R.2, 39 rue C. Desmoulins, F-94805 Villejuif Cedex, France and 1Laboratoire de Biochimie Théorique, UPR 9080 C.N.R.S., Institut de Biologie Physico-Chimique, 13 rue Pierre et Marie Curie, F-75005 Paris, France
Received June 27, 1997;Revised and Accepted August 12, 1997
ABSTRACT
Nuclear magnetic resonance (NMR) spectroscopy, combining correlated spectroscopy (COSY) coupling constant measurements with nuclear Overhauser effect spectroscopy (NOESY) interatomic distances, should make it possible to determine an averaged solution structure for DNA oligomers. However, even if such data could be obtained with high accuracy, it is not clear which structural parameters of DNA would be determined. Here, the relationships between measurable internucleotide distances and helical parameters are systematically studied through molecular modelling. Investigations are carried out using four representative sequences, (ACGT)n, (TCGA)n, (AGCT)n and (TGCA)n, composed of repeated tetranucleotides belonging to oligomers previously studied by NMR. Correlations between interatomic distances become evident and strong connections between distances and inter-base helical parameters are observed. Results imply that twist, roll, shift and slide values can be accurately determined from NMR data. Sequence independent mechanical coupling which link backbone and sugar conformations to helical twist are also described.
INTRODUCTION
NMR spectroscopy, associated with molecular modelling, is the only technique capable of providing a detailed molecular structure of DNA in solution. NMR allows the evaluation of certain sugar and backbone dihedral angles through coupling constant measurements and sugar puckering can generally be accurately determined (1 -5 ). Unfortunately, helicoidal parameters cannot be measured directly and must be deduced from interproton distances within or between neighbouring nucleotides. Although much effort has been put into improving the quality of interproton distance measurements (6 -8 ), it remains unclear how precisely these data determine the overall DNA structure.
Following early work onvarious aspects of this problem (9 -15 ), we attempted to carry out a systematic study using the tools offered by the JUMNA modelling program (16 ,17 ). Four repeating sequences, (ACGT)n, (TCGA)n, (AGCT)n and (TGCA)n, formed from tetranucleotides found in oligonucleotides previously studied by combined NMR and molecular modelling (4 ,18 ,19 ) were selected. These sequences contain eight out of the ten unique dinucleotide steps with examples belonging to the families YpR, RpY, YpY (where R implies purine and Y implies pyrimidine). Modelling studies have been made on helically symmetric, polymeric DNA to simplify the conformational space to be searched and to avoid end-effects. Both distances and other structural constraints were investigated through systematic scans of their values, including sugar puckering, which has already been shown to play an important role in fixing overall DNA conformation (20 ,21 ). These data were then used to search for linear correlations between different interproton distances, as well as for relationships between distances and helicoidal parameters. Obtained correlations were subsequently verified using crystallographic data.
By linking interproton distances and helicoidal parameters, this study enabled us to determine which helicoidal parameters are best defined by NMR measurements and also to identify elements of the conformational mechanics which, independently of sequence, link base and backbone movements within double helical DNA.
MATERIALS AND METHODS
All calculations were performed using the JUMNA program (16 ,17 ) which represents nucleic acid flexibility using a combination of helicoidal and internal variables. Single bond torsions and valence angles are used to model the flexibility of each nucleotide, while the nucleotides are positioned in space using helical rotations and translations with respect to a reference axis system. All bond lengths are kept fixed and the junctions between successive 3'-monophosphate nucleotides, as well as the closure of the sugar rings, are ensured using quadratic restraints on O5'-C5' and C4'-O4' distances. This representation leads to roughly ten times fewer variables than Cartesian coordinate molecular mechanics.
Helical symmetry can be imposed within JUMNA by simply making symmetry-related sets of variables equivalent to one another. This option was very useful in the present study in order to avoid the end effects associated with oligomers and to further reduce the dimension of the conformational space to be searched. Conformations were scanned using automatic adiabatic mapping options associated with distance or other structural constraints (sugar pucker, helicoidal parameters, etc.). NMR measurable interproton distances were imposed either as simple quadratic restraints with respect to a given value or as quadratic lower and upper bounds bracketing a determined range of distances.
Model and crystallographic structures were analyzed with the CURVES program (22 ,23 ), which provides a rigorous way to obtain the overall helical axis locus for irregular nucleic acids. All helical parameters were calculated with respect to this helical axis and were thus termed global parameters. All parameters obey the Cambridge convention for DNA conformation (24 ). Here, we considered intra-strand parameters, to be coherent with distances measured by NMR.
Calculations were carried out for four regular repeating polymers in the B conformation (containing phosphates in the BI conformation and sugars with `southern' puckers). The sequences studied were (ACGT)n, (TCGA)n, (AGCT)n and (TGCA)n. Systematic studies of all measurable interproton distances between nucleotides were made involving: Hx-H6/H8, Hx-H5(C) (for NpC steps, where N implies any base) and H2(A)-H1' (for ApN steps) where Hx refers to H1', H2', H2'', H3' and H6/H8 protons. Reported distances between protons are always between two sequential nucleotides and denoted by A-B, where A is a proton of the 5'-nucleotide (i) and B a proton of the 3'-nucleotide (i + 1).
Results from our modelling were compared with crystallographic data from nine B-DNA structures which were each resolved to at least 1.8 Å [below this resolution certain structural features, notably sugar puckers, become unclear (25 ,26 )]. The oligomers involved were: CCAAG*A*TTGG (27 ), CCAACITTGG (28 ), CCAACGTTGG (26 ), CGATCG[sect]ATCG (29 ), CGATTAATCG (30 ), CCAGGCCTGG (31 ), CCAG#CGCTGG (32 ), CCAGGCmeCTGG (33 ) and C[sect]GATATATCG (34 ). Junctions perturbed by mismatches (G*A*), unusual bases (G#: oxoguanine) or unusual backbone conformations ([sect]: [alpha]/[beta]/[gamma] crankshafts) were not considered, but all others were taken into account, including sugars having phase angles between 80 and 200o or positive [epsilon]-[zeta] angle differences, since such conformations are known to exist in solution. Since decamers within crystal stack end-to-end to form continuous double helices, terminal base pairs were also considered. Given these restrictions, the nine oligomers contain a total of 116 non-redundant single strand dinucleotide steps.
RESULTS
Energy mapping and parameter variability
For each of the four repeating sequences studied, the conformational space was explored by scanning the sugar phase angles from 130 to 200o. Because of inversion symmetry each of the tetranucleotide repeats studied has four unique sugars, but, because the puckers of successive sugars are coupled to one another, the search can actually be simplified to mapping the phase angle differences between two successive pairs of nucleotides belonging to one strand of the duplex. We have previously demonstrated that this approach enables us to locate all the stable B-family conformations of such sequences (4 ,19 ,21 ,35 ). The energy map for (ACGT)n presents three distinct stable conformations, whereas (AGCT)n and (TGCA)n have two minima and (TCGA)n only one (see Table 1 ).
Starting from each of these minima, we carried out mapping of the interproton distances relevant to NMR spectroscopy for the internucleotide steps of each sequence. Distances >4 Å were varied over a 3 Å range and distances <4 Å over a 2 Å range. An example of the results obtained is given in Figure 1 B for the H6-H8 distance of the CpG step and the H3'-H8 distance of the TpC step within the (TCGA)n sequence. The variation of interproton distances over quite large ranges gave rise to only small changes of energy (of the order of a few kcal/mol), as was the case for individual sugar phase angle variations (see Fig. 1 A). However, it was noted that longer internucleotide distances, such as H6/H8-H6/H8, were generally more easily variable than shorter distances, such as H2''-H6/H8 (Table 2 ). The variability of longer distances was also more susceptible to sequence effects. This observation is illustrated by H6/H8-H6/H8 distances, for which RpY steps were the most rigid (0.8-1.0 Å variation for an energy change of 1 kcal/mol), followed by RpR and YpY steps (1.0-1.2 Å variation) and finally YpR steps (1.2-1.6 Å variation). It is interesting to remark that within the latter family, CpA and CpG were the most deformable steps, illustrating once again their conformational malleability (4 ,19 ,28 -30 ,36 -38 ). Shorter internucleotide distances showed much less sequence effects, although for H3'-H6/H8 it was possible to distinguish YpR and RpR steps (0.7-1.0 Å variation) from more rigid YpY and RpY steps (0.5-0.7 Å variation).
Scans of purine phase angles were also carried out. The conformation of these sugars have the largest impact on overall DNA conformation and, in the case of the sequences studied, allow all the minimal energy conformations to be reached along a single phase angle pathway (see examples in Fig. 1 A). The interproton distance and phase angle scans carried out generated considerable variations in most of the helicoidal parameters (Table 3 ). However, direct scans of helicoidal parameters were also performed for the (ACGT)n sequence, in order to verify the relationships determined from the distance and phase angle mapping, as well as to generate structures with more extreme values of the helicoidal parameters. This was necessary, for example, for tilt angles, which otherwise varied only weakly (see Table 3 ), as also observed for crystal structures (39 ). The energy cost of helicoidal deformations depended strongly on the type of parameter involved, X displacement, inclination and rise being much softer variables than twist, roll, shift or tip parameters (see examples in Fig. 1 C and D).
. dav (Å), average values of Hx-H6/H8 internucleotide distances; [Delta](dav) (Å), average variations for an energy cost of 1 kcal/mol; [Delta]([Delta]d)seq (Å), sequence variability of these variations
Distances
H1'-H6/H8
H2'-H6/H8
H2''-H6/H8
H3'-H6/H8
H6/H8-H6/H8
dav
3.4
3.5
2.3
4.8
5.0
[Delta](dav)
0.8
1.0
0.6
0.8
1.2
[Delta]([Delta]d)seq
0.4
0.4
0.
0.4
0.8
Sequential distances are noted by A-B, where A is a proton of the 5'-nucleotide (i) and B a proton of the 3'-nucleotide (i + 1).
The aim of the present study was to systematically investigate the relationships between all measurable internucleotide distances and helicoidal parameters in order to determine which sort of parameters were dominant and, consequently the role of distances in fixing overall helicoidal conformation. The results obtained for four repeating tetranucleotide sequences by restrained energy mapping, show that amongst the three rotation (twist, roll and tilt) and the three translation (shift, slide and rise) parameters which define the structure of a dinucleotide step, twist is the most easily accessible, since it is virtually connected to all the internucleotide distances. The roll and shift parameters appear to be relatively well constrained for similar reasons. The slide is also well determined, not because it is directly related to any given distances, but rather for its strong correlation with helical twist. Rise and tilt are not easily determined, but also do not vary very strongly.
Distances involving base protons located far from the glycosidic bond such as H5(C) or H2(A) are the most sensitive to the rotational base-axis parameters. Thus, parameters which fix base position with respect to the helical axis are only clearly given by distances involving adenine or cytosine protons and consequently, are sequence dependent.
This study also reveals the general mechanics of the B-DNA double helix. A number of clear correlations exist and reflect helical parameters coupling. The correlations between interatomic distances and the fact that certain distances can be expressed by two different sets of helical parameters lead to mechanical coupling between helicoidal parameters. The particular coupling between twist and sugar amplitude described here stresses the link which exists between base movements and backbone conformation via the puckering amplitude of the 5" sugar of each dinucleotide step and the related correlation between the twist and the backbone [epsilon]-[zeta] angle difference. Such mechanical coupling appears to be largely sequence independent and remains valid even for non-canonical sugar puckers and backbone conformations.
ACKNOWLEDGEMENTS
A. Lefebvre is the fellowship of the Institut de Formation Supérieure Biomédicale. B. Hartmann wishes to thank the Association for International Cancer Research (St Andrews, UK) for their support. This work was also partially supported by the Association pour la Recherche contre le Cancer and the Ligue Nationale Française contre le Cancer. The authors also wish to thank Muriel Delepierre and Richard Lavery for helpful discussions.
REFERENCES
1 Schmitz, U., Sethson, I., Egan, W.M. and James, T.L. (1992) J. Mol. Biol.227, 510-531.MEDLINE Abstract
2 Ulyanov, N.B., Schmitz, U., Kumar A. and James, T.L. (1995) Biophysical J. 68, 13-24.
3 Gaudin, F., Paquet F., Chanteloup, L., Beau, J.M., Nguyen, T.T. and Lancelot G. (1995). J. Biomol. NMR5, 49-58.MEDLINE Abstract
4 Lefebvre, A., Mauffret, O., Hartmann, B., Lescot, E. and Fermandjian, S. (1995) Biochemistry34, 12019-12028.MEDLINE Abstract
5 Conte, M.R., Bauer, C.J. and Lane, A.N. (1996) J. Biomol. NMR 7, 190-206.MEDLINE Abstract
6 Wijmenja, S.S., Mooren, M.M.W. and Hilbers, C.W. (1993) In Roberts, G.C.K. (ed.) NMR of Macromolecules. IRL Press, Oxford, pp. 217-283.
14 Leijon, M., Zdunek, J., Fritzsche, H., Sklenar, H. and Graslund, A. (1995) Eur. J. Biochem. 234, 832-842.MEDLINE Abstract
15 Schmitz, U. and James T.L. (1995) Methods Enzymol. 261, 3-45.MEDLINE Abstract
16 Lavery, R., Zakrzewska, K. and Sklenar H. (1995) Comp. Phys. Comm. 91, 135-158.
17 Lavery, R. (1988) In Olson, W.K., Sarma, R.H., Sarma, M.H. and Sundaralingam, M. (eds) Structure and expression. Vol.3, DNA bending and curvature. New York, Adenine Press. pp. 191-211.
18 Mauffret, O., Hartmann, B., Convert, O., Lavery, R. and Fermandjian, S. (1992) J. Mol. Biol.227, 852-862.MEDLINE Abstract
19 Lefebvre, A., Mauffret, O., Lescot, E., Hartmann, B. and Fermandjian, S. (1996) Biochemistry 35, 12560-12569.MEDLINE Abstract
20 Poncin, M., Hartmann, B. and Lavery R. (1992) J. Mol. Biol.226, 775-794.MEDLINE Abstract
21 Lavery, R. and Hartmann, B. (1994) Biophys. Chem.50, 33-45.MEDLINE Abstract
22 Lavery, R. and Sklenar, H. (1988) J. Biomol. Struct. Dynam.6, 655.
23 Lavery, R. and Sklenar, H. (1989) J. Biomol. Struct. Dynam.6, 655-661.
24 Dickerson, R.E., Bansal, M., Calladine, C.R., Diekmann, S., Hunter, W.N., Kennard, O., Lavery, R., Nelson, H.C.M., Olson, W.K., Saenger, W., et al. (1989) J. Mol. Biol. 205, 787-791.
25 Hahn, M. and Heinemann, U. (1993) Acta Cryst.D49, 468-477.
26 Privé, G.G., Yanagi, K. and Dickerson, R.E. (1991) J. Mol. Biol.217, 177-199.MEDLINE Abstract