| Nucleic Acids Research | Pages |
A[prime]-form RNA double helix in the single crystal structure of r(UGAGCUUCGGCUC)
Introduction
Materials And Methods
Crystallization
Data collection
Structure determination
Calculation of structure parameters
Cluster analysis of RNA oligomers
Results
Description of the structure
Evidence for the A[prime]-form
Cluster analysis
Comparison of torsion angles in A- and A[prime]-RNAs
Discussion
Acknowledgements
References
A[prime]-form RNA double helix in the single crystal structure of r(UGAGCUUCGGCUC)
ABSTRACT
INTRODUCTION
The sequence CUUCGG is known to form a thermally extra-ordinarily stable hairpin loop structure in solution (1-4). However, in a crystal, the tridecamer r(UGAGCUUCGGCUC) forms a double helix with an internal loop of four successive non-Watson-Crick base pairs instead of a monomeric hairpin loop structure, as previously reported (5,6). We have already reported the crystal-lization and crystal structure of the tridecamer using data at low resolution (7,8) and found that the local geometry of the non-Watson-Crick base pairs was similar to that in previous structures. In this study, we report a newly refined crystal structure based on new data obtained at higher resolution and examined it from the viewpoint of the conformational polymorphism of double helical RNA.
The major groove of nucleic acids has the potential to interact with proteins. As well as many transcriptional regulatory proteins, several proteins or peptides, such as Tat and Rev, bind to the major groove of the double helical regions of their target RNAs (TAR and RRE) (9,10). In such complexes, the conformation of the RNA duplex, especially the major groove width, is an important factor. The major groove of the A-RNA conformation is too narrow to accommodate a protein. In fact, it was observed that Tat and Rev increased the major groove widths of target RNAs upon binding. As in TAR and RRE, in the crystal structure of the 62 nt domain of Escherichia coli 5S rRNA, helix IV of a putative ribosomal protein (L25) binding site has a significantly wider major groove (11). Accordingly, it is crucial to determine the conformations of RNA duplexes themselves.
Numerous attempts have been made to clarify the conformational polymorphism of double helical polynucleotides by means of the X-ray fiber diffraction technique (12-16) and solid state 31P NMR of RNA fibers (17). With respect to RNA, there are two major right-handed conformers; one is A-RNA, which has 11 nt in one helical pitch, and the other is A[prime]-RNA, which has 12 nt in one helical pitch (16). However, the A[prime]-RNA conformation is so rare in a single crystal that the conformational difference between A- and A[prime]-RNAs has not been well studied. Furthermore, their conformations are so similar that they cannot be confidently discriminated from each other by means of root mean square deviation (r.m.s.d.). Accordingly, it is necessary to study their conformations in detail and to establish a novel method for classifying closely related but different conformations.
MATERIALS AND METHODS
Crystallization
The sample for crystallization was synthesized by means of the solid-phase phosphoramidite method and purified as described (2). The numbering of the bases and the base pair scheme in a crystal are presented in Figure
Figure 1. (a) The numbering of the bases and the base pair scheme are shown. A crystallographically independent molecule is presented in bold and symmetrically related molecules are presented in normal type with asterisks. The helices piled up along the c-axis are also presented in italics. Because of the disordered structure, U1 has two conformers, U1a and U1b, which are denoted as U1a,b for short. The bold and thin broken lines represent Watson-Crick base pairs and non-Watson-Crick base pairs, respectively. The black ovals represent the crystallographic two-fold axis. (b) Stereo view of the double-stranded form of the tridecamer. A single well-formed crystal was mounted and sealed in a glass capillary and then intensity data up to 1.8 Å resolution were collected. The numbers of observed reflections and reflections above 0.4[sigma](F) were 3556 and 3288, respectively. The completeness was 90.1 and 82.6%, respectively. The diffraction intensities were recorded in the [thetas]/[omega] scan mode at a scan speed of 6°/min and a scan width of 1.2° with a Rigaku AFC5R diffractometer using graphite monochromated CuKa radiation ([lambda] = 1.5418 Å). The unit cell dimensions were refined with 2[thetas] values of 20 reflections in the range of 16° < [thetas] < 22°. The crystal structure was solved by the molecular replacement method (18-21), using a single-stranded A-form dodecamer of r(GAGCUUCGGCUC) as the initial model. This gave a unique solution with reasonable packing at the R value of 40.0%. The resulting coordinates were subjected to rigid body refinements, refinements of individual atomic coordinates by the conjugate gradient method (22) and simulated annealing (23,24). Finally, 37 water molecules were introduced. The details are described in the supplementary material (Fig. S Helical parameters, groove widths and torsion angles were calculated using the program CURVES (28,29). In Table 1, the mean x-displacement, inclination, rise, twist, minor groove width and major groove width for each duplex are listed, but other structure parameters are not. This is because the standard deviations of the omitted parameters for each duplex were much larger than the differences in the structure parameters between conformers, which means that the omitted parameters did not reflect differences in the conformation. The omitted parameters were insufficient for classification of the completely different conformations of the A- and B-form duplexes. They were insufficient for the classification of RNA duplexes as well. Caution must be taken regarding the parameter twist, because the standard deviation of twist in each duplex was larger than the difference between conformers. It is, however, included in Table 1 because twist is said to be an important parameter for the classification of nucleic acid conformations. The sequences, and their PDB and NDB ID codes, used for the calculation of the structure parameters are listed in the legend to Table 1. To calculate the major groove width precisely, oligomers of >10 bp were selected. However, oligomers including G-A pairs were eliminated because of their irregular backbone geometries. In the calculation of the structure parameters for the tridecamer, the 5[prime]-terminal uridine at the dangling end was eliminated. Finally, the mean structure parameters of all the oligomers were averaged in terms of the structure parameters (AVERAGE) and then the standard deviation of each structure parameter was calculated (SD). These two parameters are structurally meaningless, but are required to calculate the normal distribution of each mean structure parameter for the following cluster analysis. The torsion angles of the oligomers are listed in Table 2. The normal distribution of each mean structure parameter was calculated in terms of the structure parameters using AVERAGE and SD in Table 1 (supplementary material, Table S1). Then the pair-wise distances between the oligomers were calculated using the normal distributions of the structure parameters (supplementary material, Table S2). They were clustered by means of the nearest neighbor method and transformed into a dendrogram (Fig. Figure 2. The mean inclination value of each oligomer duplex is plotted against the value of the mean major groove width of each oligomer duplex. Each point represents the corresponding oligomer duplex. The coordinates and names of the oligomers were taken from PDB, using its code. 1SDR and 280D include two duplexes per asymmetric unit, which are named 1SDR1 and 1SDR2 for 1SDR and 280D1 and 280D2 for 280D. The legend to Table 1 gives the names and sequences of the oligomers. A-RNA (fiber) and A[prime]-RNA (fiber) represent the structures obtained from fiber diffraction data. The crystal structure of the tridecamer r(UGAGCUUCGGCUC) contains one strand of the tridecamer and 37 water molecules per asymmetric unit. The 5[prime]-terminal uridine residue adopts two conformations, with an occupancy of 0.5, because of the statistical disorder due to the asymmetric base pair around the crystallographic two-fold axis. The sequence, with the numbers of bases, and the interactions between the symmetrically related molecules are presented in Figure Two strands of the tridecamer related by the crystallographic two-fold axis form a double-stranded structure (Fig. Figure 3. A dendrogram of the cluster analysis results. Oligomers are linked by the nearest neighbor method at distances which are listed in the supplementary material (Table S2) in bold. The legend to Table 1 gives the names and sequences of the oligomers. The arrangements of the non-Watson-Crick base pairs are similar to the previously reported structures, 255D (5) and 165D (6), which have the same core sequence, CUUCGG. The r.m.s.d. values of the core sequences in a double-stranded form between the tridecamer and the previous structures (5,6) were calculated. The r.m.s.d. between the tridecamer and 255D and that between the tridecamer and 165D were 1.12 and 1.40 Å, respectively, which indicates that the structure of the tridecamer is close to the previous structures and was refined correctly. The overall structure of the double helix was compared with those of the canonical A-RNA and A[prime]-RNA conformers (11 residues/turn for the A-form and 12 residues/turn for the A[prime]-form) (16). The r.m.s.d. of the coordinates from the tridecamer duplex to A-RNA and A[prime]-RNA are 1.60 and 1.13 Å, respectively. This showed that the conformation of the tridecamer is rather similar to that of A[prime]-RNA. However, the r.m.s.d. values were not very different from each other, so we could not determine the conformation of the tridecamer from these values. To clarify the conformational properties in detail, we have listed in Table 1 the mean helical parameters, the major and minor groove widths of the tridecamer, together with those of canonical A-RNA, A[prime]-RNA and B-DNA, and the single crystal structures of the oligonucleotides determined in previous works (5,31-37). All the coordinates were taken from PDB and their names represent the PDB codes (the legend to Table 1 gives the sequences and their NDB codes). 1SDR and 280D contain two double helices per asymmetric unit, which are named 1SDR1 and 1SDR2 for 1SDR and 280D1 and 280D2 for 280D. To calculate the major groove widths precisely, oligomers of >10 bp were selected. The parameters which are efficient for the classification of the conformations are listed in Table 1. Table 1. As far as the conformation of the RNA duplex is concerned, there are two remarkable features. (i) The major groove widths of the tridecamer and 255D are significantly greater than those of the others. (ii) The inclination angles of the tridecamer and 255D are much smaller than those of the others. When the major groove widths were plotted against the inclinations, two clearly separated clusters appeared (Fig. We performed cluster analysis using all the structure parameters to reconfirm that the conformations of these oligomers can be divided into two groups. In cluster analysis in general, samples are classified based on the similarity of variables which represent the features of the samples. As mentioned previously, the mean structure parameters in Table 1 are variables which define the global conformational features. Therefore, cluster analysis can be performed using these parameters directly. However, they include distances and angles and also their values are dispersed. If the values in Table 1 are used for cluster analysis, the results might be affected only by a certain parameter which has the largest value. Accordingly, we calculated the normal distribution of each structure parameter before the cluster analysis and then calculated the distances, based on their normal distributions, between all combinations of the oligomers, A-RNA, A[prime]-RNA and B-DNA (supplementary material, Tables S1 and S2). The oligomers, A-RNA, A[prime]-RNA and B-DNA, were clustered by means of the nearest neighbor method and the resulting dendrogram is presented in Figure The B-DNA group and all the RNA groups are separated by quite a great distance (4.13). As expected, the A-RNA and A[prime]-RNA groups in Figure The mean torsion angles with their standard deviations for duplexes are listed in Table 2. In spite of the apparently different conformations of A- and A[prime]-RNAs, the backbone torsion angles ([alpha], [beta], [gamma], [delta], [epsis] and [zeta]) exhibited basically the same mean values for A- and A[prime]-RNAs. The differences in the mean torsion angles were much smaller than the standard deviations. Only the [chi] angles of A- and A[prime]-RNAs showed a difference which is comparable with their standard deviations. In conclusion, only the [chi] angle was slightly different between A- and A[prime]-RNAs among all the torsion angles. The reason why backbone torsion angles are not affected by the difference in the conformation will be discussed later. Table 2. We showed that the A- and A[prime]-RNA conformations could be classified as to the major groove widths and inclination angles (Fig. Figure 4. (a) Explanation of the correlation between the inclination angle and major groove width. A view from the major groove of the base pair (black bar). Green curved lines, black bars and open circles represent the backbones of nucleic acids, base pair planes and the rotation axis for inclination, respectively. Black and blue arrows indicate the rotational direction of inclination and vertical shifts of the strands, respectively. The vertical major groove width is presented as ]. (b) The RNA duplex is presented as a cylinder and we assumed the radius of the cylinder to be 10 Å. Circles with the character P indicate the nearest phosphorus atoms across the major groove. The blue arrow and characters are the vertical shift and its value, respectively. The red arrow and characters are the horizontal shift and its value, respectively. The resulting major groove widening is indicated in black characters. We also demonstrated the presence of the A[prime]-RNA conformation using the structure of a tridecamer in a single crystal. Then we further retrieved the occurrence of the A[prime]-RNA conformation in the crystal structures of biologically active RNA molecules such as hammerhead ribozymes (39,40), the P4-P6 domains of group I intron (41,42) and the 62 nt domain of 5S rRNA (11). Interestingly, we found that helix IV of 5S rRNA adopted the A[prime]-RNA conformation. It has characteristic features of the A[prime]-RNA conformation, i.e. a wide major groove (8.7 Å) and a low inclination angle (9.5°). More interestingly, helix IV was suggested to be the binding site for the ribosomal protein L25, by the results of enzymatic probing (43-45). It could be thought that helix IV becomes ready for protein binding by taking on the A[prime]-RNA conformation. Thus, the wide major groove of the A[prime]-RNA conformation could be utilized for interaction with proteins, since the major groove of the A-RNA conformation is too narrow to accommodate a protein or peptide. Next we present space-filling models of the tridecamer, helix IV of 5S rRNA and canonical A- and A[prime]-RNAs to determine their major groove widths (Fig. Figure 5. Space filling models of A-RNA deduced from X-ray fiber diffraction data (a), A[prime]-RNA deduced from X-ray fiber diffraction data (b), the structure of the tridecamer (c) and the structure of helix IV with loop E in the 62 nt domain of E.coli 5S rRNA (URL065 from the Nucleic Acid DataBase, NDB) (11) (d). The helical axis for each duplex lies vertically. For (d), the helical axis was only calculated for helix IV because of the irregular structure of loop E. Here we conclude that the crystal structure of the tridecamer belongs to the A[prime]-RNA conformation and have revealed detailed structural features of the A[prime]-RNA conformation at nearly atomic resolution. We also discussed the possible function of the A[prime]-RNA conformation in a biological system. So far, it is reasonably safe to conclude that the A[prime]-RNA conformation, which has a wide major groove, must be taken into consideration for RNA-protein interaction. The authors wish to thank Prof. Kazunari Taira, Dr De-Min Zhou, Dr Takeo Kohda, Dr Naruhisa Ota, Dr Masaki Warashina, Dr Tomoko Kuwabara, Mr Satoshi Fujita, Mr Ryuji Utsunomiya, Mr Masayuki Sano and Mr Satoru Sekiya (Tsukuba University) and Drs Toshio Yamazaki, Takashi S. Kodama, Chojiro Kojima, Koichi Uegaki, Hiroshi Matsuo, Eugene H. Morita, Mr Junichi Furui and Mr Mitsuaki Sugahara (Osaka University) for critical discussions and comments on the manuscript and also wish to thank Mr Tomokazu Hasegawa (Osaka University) for programing the drawing software. This work was supported by a Grant-in-Aid for Basic Scientific Research, Category B (no. 09480176), from the Ministry of Education, Science and Culture, Japan, and the Human Frontier Science Program.See supplementary material available in NAR Online.
Data collection
Structure determination
Calculation of structure parameters
Cluster analysis of RNA oligomers
RESULTS
Description of the structure
Evidence for the A[prime]-form
Cluster analysis
Comparison of torsion angles in A- and A[prime]-RNAs
DISCUSSION
ACKNOWLEDGEMENTS
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 29 Jan 1999
Copyright©Oxford University Press, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
J. Zoll, M. Tessari, F. J.M. Van Kuppeveld, W. J.G. Melchers, and H. A. Heus
Breaking pseudo-twofold symmetry in the poliovirus 3'-UTR Y-stem by restoring Watson-Crick base pairs
RNA,
May 1, 2007;
13(5):
781 - 792.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
F. Joli, N. Bouchemal, A. Laigle, B. Hartmann, and E. Hantz
Solution structure of a purine rich hexaloop hairpin belonging to PGY/MDR1 mRNA and targeted by antisense oligonucleotides
Nucleic Acids Res.,
November 6, 2006;
34(20):
5740 - 5751.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
K. Sobczak and W. J. Krzyzosiak
Imperfect CAG Repeats Form Diverse Structures in SCA1 Transcripts
J. Biol. Chem.,
October 1, 2004;
279(40):
41563 - 41572.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
N. B. Leontis, J. Stombaugh, and E. Westhof
The non-Watson-Crick base pairs and their associated isostericity matrices
Nucleic Acids Res.,
August 15, 2002;
30(16):
3497 - 3531.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (292K)
![]()
Supplementary Data
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (19)
![]()
Request Permissions ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Tanaka, Y.
![]()
Articles by Kyogoku, Y.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Tanaka, Y.
![]()
Articles by Kyogoku, Y.
![]()
Social Bookmarking ![]()
![]()
What's this?