Nucleic Acids Research, 2000, Vol. 28, No. 21 4376-4381
© 2000 Oxford University Press
Synthesis and crystal structure of an octamer RNA r(guguuuac)/r(guaggcac) with G·G/U·U tandem wobble base pairs: comparison with other tandem G·U pairs
Department of Chemistry and Department of Biochemistry, Biological Macromolecular Structure Center, The Ohio State University, 012 Rightmire Hall, 1060 Carmack Road, Columbus, OH 43210-1002, USA
Received May 18, 2000; Revised and Accepted August 15, 2000.
| ABSTRACT |
|---|
|
|
|---|
We have determined the crystal structure of the RNA octamer duplex r(guguuuac)/r(guaggcac) with a tandem wobble pair, G·G/U·U (motif III), to compare it with U·G/G·U (motif I) and G·U/U·G (motif II) and to better understand their relative stabilities. The crystal belongs to the rhombohedral space group R3. The hexagonal unit cell dimensions are a = b = 41.92 Å, c = 56.41 Å, and
= 120°, with one duplex in the asymmetric unit. The structure was solved by the molecular replacement method at 1.9 Å resolution and refined to a final R factor of 19.9% and Rfree of 23.3% for 2862 reflections in the resolution range 10.01.9 Å with F
2
(F). The final model contains 335 atoms for the RNA duplex and 30 water molecules. The A-RNA stacks in the familiar head-to-tail fashion forming a pseudo-continuous helix. The uridine bases of the tandem U·G pairs have slipped towards the minor groove relative to the guanine bases and the uridine O2 atoms form bifurcated hydrogen bonds with the N1 and N2 of guanines. The N2 of guanine and O2 of uridine do not bridge the locked water molecule in the minor groove, as in motifs I and II, but are bridged by water molecules in the major groove. A comparison of base stacking stabilities of motif III with motifs I and II confirms the result of thermodynamic studies, motif I > motif III > motif II. | INTRODUCTION |
|---|
|
|
|---|
Base pairs in DNA are important due to the crucial role of complementary DNA WatsonCrick base pairs in genetic information transfer from one generation to the next. Mispairs in DNA are usually corrected by repair enzymes in order to maintain the fidelity of the genetic code (1,2). RNAs have very substantial differences in biological functions than DNA (3). Mispaired base pairings are frequently observed in natural RNA molecules (4,5) and seem to be essential in biological functions such as RNA processing and RNAprotein interactions. Among mispairs, the wobble base pair (G·U) is the most frequent non-canonical structure conserved at certain sites in rRNA (6,7). It is often involved in functional interactions, particularly in RNAprotein recognition (812). Indeed, G·U pairs in tRNA stems, e.g. alanyl-tRNA, are recognized by its cognate synthetase (810) and are also crucial for splice site selection in group I introns (1315). Adjacent or tandem G·U pairs have three motifs (Fig. 1a; 16). Studies of the conservation of tandem G·U pairs in 16S and 23S rRNAs showed that motif I (5'-UG-3'/3'-GU-5') occurs most frequently. The next most frequent occurrence is motif III (5'-UU-3'/3'-GG-5'), and motif II (5'-GU-3'/3'-UG-5') is the least frequently observed. The order of occurrence of motif I > motif III > motif II is in agreement with thermodynamic studies (1618). The crystal structures of motifs I and II were reported by this laboratory earlier (19,20). While the present motif III sequence was being crystallized, the crystal structure of the catalytic domain of the group I intron was determined (21). While this paper was being written another 14 bp RNA oligonucleotide containing motif III was reported (22). Here we report a simple octamer double helix of motif III as in motifs I and II and compare their base stacking stabilities.
|
| MATERIALS AND METHODS |
|---|
|
|
|---|
Synthesis and purification
The non-self-complementary RNA octamer r(guguuuac)/r(guaggcac) was synthesized by the phosphoramidite method using an Applied Biosystem DNA synthesizer 391 and cleaved from the solid support using 3:1 (v/v) ammonia/ethanol and incubated overnight in the same solution at 55°C for deprotection. The 2'-hydroxyl groups were deprotected by overnight incubation with 0.8 ml of 1 M tetrabutylammonium fluoride. An equal amount (0.8 ml) of 0.1 M triethylamine acetate was added and the mixture was evaporated in a lyophilizer. The RNA fragment was subsequently precipitated with ammonium acetate/ethanol. The sample was further purified by ion exchange FPLC, using lithium chloride for the eluting gradient.
Crystallization and data collection
The crystallization experiments were performed by the hanging drop vapor diffusion method at room temperature (293 K). The best crystals were obtained with 40 mM sodium cacodylate buffer pH 7.0, 50 mM magnesium chloride, 1.0 mM cobalt hexamine chloride and 5% (v/v) methyl 2,4-pentanediol (MPD) against 55% MPD in the reservoir. Crystals of dimensions 0.3 x 0.2 x 0.4 mm were obtained in 2 weeks. Data were collected from a single crystal at room temperature on a R-AXIS IIc imaging plate and a 50 kV/100 mA graphite monochromated CuK
(1.5418 Å) X-ray beam. A total of 2862 independent reflections corresponding to 98.5% of the theoretically possible data were collected to 1.9 Å resolution with an Rmerge of 5.5%. The data completeness for the last resolution bin, 1.91.97 Å, was 97.4%. The data were processed using the programs DENZO and SCALEPACK (23). The crystals belonged to the rhombohedral space group R3 with hexagonal settings a = b = 41.92 Å, c = 56.41 Å,
= ß = 90.0° and
= 120°. The asymmetric unit contains one duplex with a volume/bp of 1192 Å3. Crystal data and refinement statistics are summarized in Table 1.
|
Structure solution and refinement
The structure was solved by the molecular replacement method, using the atomic coordinates of the self-complementary duplex r(guauaua)dC (24) as the starting model in the hexagonal unit cell: a = b = 43.07 Å and c = 59.36 Å. Refinement was performed using the program CNS (25): 10% of the reflections were selected at random for the Rfree calculation. The starting model gave a Rwork/Rfree of 0.363/0.425. The structure was then refined as a rigid body using 2396 reflections between 10.0 and 2.0 Å resolution followed by a positional refinement. The Rwork/Rfree dropped to 0.319/0.357. The model was then annealed by heating the system to 4000 K and slow cooling to room temperature at 0.5 fs sampling intervals. This dropped the Rwork/Rfree to 0.265/0.314. The bases were then corrected by omit
Fo
Fc
maps calculated by removing one base pair at a time. The model was subsequently refined by positional and individual B factor refinement with reflections from 10.0 to 1.9 Å. Thirty water molecules located in the difference density maps were added. The final Rwork/Rfree was 0.199/0.233. The model contains 335 nucleic acid atoms and 30 water molecules. The r.m.s. deviation of the final model from ideal geometry was 0.013 Å in bond length and 1.68° in bond angle, respectively. The atomic coordinates and structure factors have been deposited with the Nucleic Acid Database (26), accession code AR0022. | RESULTS AND DISCUSSION |
|---|
|
|
|---|
Overall molecular structure
Figure 1 gives a schematic diagram of the sequences. The non-self-complementary octamer r(guguuuac)/r(guaggcac) forms a right-handed A-RNA duplex with 10.6 residues/turn, exhibits a narrow major groove (deep groove, 3.54 Å) and a wide minor groove (shallow groove, 11.27 Å). The duplexes stack in columns forming a pseudo-continuous infinite RNA helix. The base pairs at the junction of the two duplexes are at a very low twist angle of 5°. All ribose sugars are in the characteristic C3'-endo conformation. The average base pair displacement is 3.45 Å. The bases are anti and the backbone bond conformations C4'C5' (
) and PO5' (
) are g and g+, respectively. The average intra-strand phosphatephosphate distance is 5.71 Å, typical of chains containing C3'-endo sugars. The uridine bases of the tandem wobble U·G/U·G pairs move into the minor groove such that the O2 atoms of the U residues are close enough to form bifurcated hydrogen bonds with N1 and N2 of the guanines (Fig. 2). In motifs I and II the tandem wobble base pairs form normal hydrogen bonds and the N2 atoms of G residues are at non-hydrogen bonding distances (>3.7 Å) from the O2 atoms of the U residues and lock a water molecule in the minor groove, which is not observed in motif III. However, in the crystal structure of a 14mer RNA containing motif III the tandem wobble base pairs lock a water molecule in the minor groove (22).
|
Hydration and crystal packing
A total of 30 ordered solvent molecules were found in the current structure. The majority of the water molecules are located in the major groove and are especially concentrated in the central part of the helix where the G·U wobbles occur. The N7 atoms of most of the purines in the major groove are hydrogen bonded with water molecules. In fact, the N7 atoms of residues A7 and A15 are linked to the respective O1P groups via water bridges. The N7 atom of G3 and the O4 atom on U4 of the adjacent base are connected by a water molecule. A similar pattern (N7N4) is found between the adjacent bases G13 and C14. Unlike motifs I and II, the helix backbone of motif III is not highly hydrated. Some O2'H groups on the sugars are linked to water molecules. Water molecules are found to be hydrogen bonded to O2' atoms of G1, U4 and G9 of strand I and G12, G13 and G15 of strand II. Most of the O2' atoms form intra-molecular hydrogen bonds to the adjacent O4' atoms of the sugar on the 3'-side, thus stabilizing the double helix (Fig. 3). The tandem wobble pairs are extensively hydrated in the major groove while no water molecules are found in the minor groove (Fig. 2), maybe because of slippage of the uridine bases towards the minor groove. The acceptor O4 (U4) and N7 (G13) atoms in the major groove are linked by three water-mediated hydrogen bonds (W25, W31 and W36), while O4 (U5) and O6 (G12) are linked by two water molecules, W18 and W25. The acceptors O4 (U5), O6 (G12), N7 (G12) and O1P (G12) in the major groove are linked to each other via a network of five water molecules, W18, W25, W29, W31 and W43 (Fig. 2). These water bridges in the major groove side could play an important role in stabilizing the G·U wobble pairs in the RNA helix and also in RNAprotein interactions. In all three motifs the major grooves in the center of the double helices are highly hydrated. The high negative potential, particularly the presence of O4 atoms in the major groove, makes the G·U wobbles potential metal binding sites.
|
The stacked duplexes are closely packed around the 31-screw axis in a pseudo-continuous column parallel to the crystallographic c-axis (Fig. 4). The inter-molecular contact mainly occurs by interaction between the 31-screw axis related molecules through the donor 2'-OH groups and the N4 atoms in the major groove. Several CHO contacts also occur in the minor groove (Table 2), for instance C5'(G3)HO4'(U4) = 3.19 Å. The 3-fold rotation axis related molecules are further apart and do not have any direct inter-molecular interactions.
|
|
Base stacking of the wobble and flanking WatsonCrick base pairs in motifs I, II and III
Base stacking is crucial for stabilizing adjacent base pairs. The structures of motifs I, II and III do not tell us about thermodynamic stabilities, but they can provide a rationale. Thermodynamic studies have shown that the stabilities of the tandem wobble pairs in these three motifs (U·G/G·U, G·U/U·G and G·G/U·U) are in the order motif I > motif III > motif II (17,18), which is the same order as their occurrence in nature. The helical parameters of motifs I and II have been published in earlier papers (19,20). The helical parameters of motif III have been calculated with the program NEWHEL92 (27) and the twist angles between adjacent G·U wobble pairs and the flanking WatsonCrick pairs have been tabulated (Table 3). Adjacent G·U wobble pairs have a twist angle of 32.2°, which is less than the average of the duplex (34.2°). The 3'-side flanking WatsonCrick base pair closest to a wobble pair shows a high twist angle (37.2°) compared to the average twist. In motif I the twist angle between the tandem G·U wobble pairs is high (38.13°), while twist angles are low at both flanking WatsonCrick ends (
26.5°, Table 3). In motif II the twist angle between the tandem wobble pairs is very low (23.8°), while average values are seen at flanking WatsonCrick ends (Table 3). The twist angles mainly affect base pair stacking (Fig. 5). In motif I strong intra-strand stacking occurs between G·U wobble base pairs and their flanking WatsonCrick pairs on both the 5'- and the 3'-sides (Fig. 5a and c); guanine bases exhibit cross-strand stacking (Fig. 5b) and uridine bases are apparently not stacked. Motif II has a reversed base pair stacking pattern compared to motif I. The adjacent G·U wobble pairs are stacked with slight slippage (Fig. 5e), but the flanking WatsonCrick pairs exhibit cross-strand stacking between the purines (Fig. 5d and f), while uridines have no stacking interactions. In motif III the uridine bases in the same strand have only partial stacking (Fig. 5h). The six-membered purine and pyrimidine rings of the 5'-side wobble pair with its flanking WatsonCrick base pair exhibit a slightly slipped stacking (Fig. 5g), whereas the stacking of the wobble pair with the 3'-side flanking WatsonCrick pair is considerably less (Fig. 5i).
|
|
The three stacking steps taken together are the main determinants of the stability of the various motifs of the tandem wobbles. In motif I intra-strand slippage occurs between the adjacent G·U wobbles (Fig. 5b) while good intra-strand base stacking is retained at both flanking WatsonCrick ends (Fig. 5a and c) without much distortion of the A-type helix. It seems that base stacking in motif I is comparatively the strongest. In motif II the tandem G·U/U·G wobbles stack with slightly slipped bases and on both the 5'- and 3'-sides only cross-strand stacking occurs with the flanking WatsonCrick pairs (Fig. 5df). Thus, motif II has the least stacked tandem base pairs because it has two weak cross-strand stackings in which the uridines are unstacked. The base pair stacking in motif III is intermediate between those of motifs I and II (Fig. 5gi).
When motif III is compared to the crystal structures of the group I intron at 2.8 Å resolution (21) and the 14 bp RNA at 2.1 Å resolution (22), where the flanking WatsonCrick base sequence is different, the base stacking pattern is nevertheless similar (base step twist angles are given in Table 4). In the 14 bp RNA crystal structure the 3'-side U·G wobble and its WatsonCrick pair (G·C) show less intra-strand stacking and increased cross-strand stacking (22). Although we do not have the structure of the motif III sequence with a pyrimidine on the 5'-side and a purine on the 3'-side, the stacking pattern is probably the same as above. Thermodynamic studies have shown that motif I is the most stable, independent of the flanking WatsonCrick sequence (17). In general, motif II is destabilizing and motif III is slightly more stable than motif II (1618). Considering all the possibilities for the four flanking WatsonCrick base pairs, there are 16 different possible combinations. We have compared the base stacking in motif III of three crystal structures (the current structure, the group I intron and 14 bp RNA), however, we do not have crystal structures for motifs I and II with combinations of flanking WatsonCrick base pairs. Nevertheless, based on the results of the available crystal structures for motif III, we have shown that the stacking stabilities can be rationalized as motif I > motif III > motif II, which is consistent with their thermodynamic stabilities.
|
| ACKNOWLEDGEMENTS |
|---|
We gratefully acknowledge support of this work by grant GM-17378 from the National Institutes of Health. We thank the Board of Regents of Ohio for an Eminent Scholar Chair and Endowment to M.S. We acknowledge the Hayes Investment Fund of the Ohio Regents for partial support towards the purchase of an R-axis II C imaging plate system.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +1 614 292 2925; Fax: +1 614 292 2524; Email: sundaral@chemistry.ohio-state.edu
| REFERENCES |
|---|
|
|
|---|
-
1 Modrich,P. (1987) Annu. Rev. Biochem., 56, 435.[Web of Science][Medline]
2 Pearl,L.H. and Savva,R. (1995) Trends Biochem. Sci., 20, 421.[Web of Science][Medline]
3 Varani,G. and Pardi,A. (1994) In Nagai,K. and Mattaj,I.W. (eds), RNAProtein Interactions. Oxford University Press, Oxford, UK, pp. 124.
4 Varani,G., Wimberly,B.and Tinoco,I.,Jr (1989) Biochemistry, 28, 7760.[Medline]
5 Wimberly,B., Varani,G. and Tinoco,I.,Jr (1993) Biochemistry, 32, 1078.[Medline]
6 Woese,C.R., Gutell,R.R., Gupta,R. and Noller,H.F. (1983) Microbiol. Rev., 47, 621669.
7 Gutell,R.R., Larsen,N. and Woese,C.R. (1994) Microbiol. Rev., 58, 1026.
8 McClain,W.H. and Floss,K. (1988) Science, 240, 793796.
9 McClain,W.H. and Floss,K. (1988) J. Mol. Biol., 202, 697709.[Web of Science][Medline]
10 Hou,Y.M. and Schimmel,P. (1988) Nature, 333, 140145.[Medline]
11 McClain,W.H., Gabriel,K. and Scheneider,J. (1996) RNA, 2, 105109.[Abstract]
12 Mizuno,H. and Sundaralingam,M. (1978) Nucleic Acids Res., 5, 44514461.
13 Cech,T.R. (1987) Sciences, 236, 15321539.
14 Barford,E.T. and Cech,T.R. (1989) Mol. Cell. Biol., 9, 36573666.
15 Strobel,A.A. and Cech,T.R. (1995) Science, 267, 675679.
16 Gautheret,D., Konings,D. and Gutell,R. (1995) RNA, 1, 807814.[Abstract]
17 He,L., Kierzek,R., SantaLucia,J.J., Walter,A.E. and Turner,D.H. (1991) Biochemistry, 30, 1112411132.[Medline]
18 Wu,M., McDowell,J.A. and Turner,D.H. (1995) Biochemistry, 34, 32043211.[Medline]
19 Biswas,R., Wahl,M.C., Ban,C. and Sundaralingam,M. (1997) J. Mol. Biol., 267, 11491156.[Web of Science][Medline]
20 Biswas,R. and Sundaralingam,M. (1997) J. Mol. Biol., 270, 511519.[Web of Science][Medline]
21 Cate,J.H. and Doudna,J.A. (1996) Structure, 4, 12211229.[Medline]
22 Trikha,J., Filman,D.J. and Hogle,J.M. (1999) Nucleic Acids Res., 27, 17281739.
23 Otwinowski,Z. and Minor,W. (1997) Methods Enzymol., 276, 307326.[Web of Science]
24 Wahl,M.C., Ban,C., Sekharudu,C., Ramakrishnan,B. and Sundaralingam,M. (1996) Acta Crystallogr., 52D, 655667.
25 Brunger,A.T., Adams,P.D., Clore,G.M., DeLano,W.L., Gros,P., Grosse-Kunstleve,R.W., Jiang,J., Kuszewski,J., Niges,M., Pannu,N.S., Read,R.J., Rice,L.M., Simonson,T. and Warren,G.L. (1998) Acta Crystallogr., 54D, 905921.
26 Berman,H.M., Olson,W.K., Berveridge,D.L., Westbrook,J., Gelbin,A., Demeny,T., Hsieh,S.H., Srinivasan,A.R. and Schneider,B. (1992) Biophys. J., 63, 751759.[Web of Science][Medline]
27 Fratini,A.V., Kopka,M.L., Drew,H.R. and Dickerson,R.E. (1982) J. Biol. Chem., 257, 1468614707.
This article has been cited by other articles:
![]() |
D. Xu, T. Landon, N. L. Greenbaum, and M. O. Fenley The electrostatic characteristics of G{middle dot}U wobble base pairs Nucleic Acids Res., June 28, 2007; 35(11): 3836 - 3847. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-M. LANCHY, J. D. IVANOVITCH, and J. S. LODMELL A structural linkage between the dimerization and encapsidation signals in HIV-2 leader RNA RNA, August 1, 2003; 9(8): 1007 - 1018. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






