The NMR structure of 31mer RNA domain of
Escherichia coli
RNase P RNA using its non-uniformly deuterium labelled counterpart [the `NMR-window' concept]
The NMR structure of 31mer RNA domain of Escherichia coli RNase P RNA using its non-uniformly deuterium labelled counterpart [the `NMR-window' concept]
C.
Glemarec
,
J.
Kufel
1
,
A.
Földesi
,
T.
Maltseva
,
A.
Sandström
,
L. A.
Kirsebom
1
and
J.
Chattopadhyaya*
Departments of Bioorganic Chemistry and
1
Microbiology, Box 581, Biomedical Centre, University of Uppsala, S-751 23 Uppsala, Sweden
Received March 7, 1996;
Revised and Accepted April 12, 1996
ABSTRACT
The NMR structure of a 31mer RNA constituting a functionally important domain of
the catalytic RNase P RNA from
Escherichia coli
is reported. Severe spectral overlaps of the proton resonances in the
natural 31mer RNA (1) were successfully tackled by unique spectral
simplifications found in the partially-deuterated 31mer RNA analogue (2) incorporating deuterated cytidines [C5
(>95 atom %
2
H), C2
'
(>97 atom %
2
H), C3
'
(>97 atom %
2
H), C4
'
(>65 atom %
2
H) and C5
'
(>97 atom %
2
H)] [for the `NMR-window' concept see: Földesi,A.
et al.
(1992)
Tetrahedron
, 48, 9033; Földesi,A.
et al
. (1993)
J. Biochem. Biophys. Methods
,
26, 1; Yamakage,S.-I.
et al
. (1993)
Nucleic Acids Res.
, 21, 5005; Agback,P.
et al.
(1994)
Nucleic Acids Res.
, 22, 1404; Földesi,A.
et al.
(1995)
Tetrahedron
, 51, 10065; Földesi,A.
et al.
(1996)
Nucleic Acids Res
., 24, 1187-1194]. 175 resonances have been assigned out of total of 235 non-exchangeable proton resonances in (1) in an unprecedented manner in
the absence of
13
C and
15
N labelling. 41 out of 175 assigned resonances could be accomplished with the
help of the deuterated analogue (2). The two stems in 31mer RNA adopt an A-type RNA conformation and the base-stacking continues from stem I into the beginning of the loop I.
Long distance cross-strand NOEs showed a structured conformation at the junction between stem
I and loop I. The loop I-stem II junction is less ordered and shows structural perturbation at and
around the G11[middot]C22 base pair.
INTRODUCTION
RNase P is responsible for the maturation of the 5'-termini of almost all tRNAs in the cell. This ubiquitous
ribonucleoprotein complex consists of an ~400 nucleotide (nt) RNA, which is the catalytic subunit of RNase P RNA, and
a protein subunit (
1
,
2
). Bacterial RNase P RNA cleaves various tRNA precursors
in vitro
at the correct positions in the absence of any protein (
2
). Thus, RNase P RNA is a true ribozyme. In the enzyme-substrate complex the RNA interacts with the two cytosine residues in the
conserved 3'-terminal `RCCA' sequence of a tRNA precursor. The `GGU-motif' in RNase P RNA that is involved in this interaction is
part of an internal loop. This loop structure is also part of a divalent metal
ion(s) binding site as revealed by the fact that both Pb
2+
and Mg
2+
induce cleavage within this structure. Binding of Mg
2+
to this region has been suggested to be of functional importance (
3
,
4
-
6
). Hence, knowledge of the three dimensional structure of this domain of RNase P
RNA is important to understand the function of this ribozyme. Herein, we have
investigated the solution conformation of a 31mer RNA
1
(Fig.
1
) representing this domain of
Escherichia coli
RNase P RNA by NMR spectroscopy. In order to stabilize the secondary structure
of this domain of RNase P RNA, i.e. nucleotides 250-299, we have excised the nucleotide residues 263-287 and have transplanted a stable C[middot]G base pair with a tetraloop (UUCG) (
7
) as a link between the residues 262 and 288 to give finally our 31mer RNA
1
using T7 RNA polymerase. This study is a part of our goal to identify the
conformational features of the internal loop in 31mer RNA (i.e. Loop I, see
Fig.
1
). Owing to the relatively large size of the natural 31mer RNA
1
, severe spectral overlap precluded an unambiguous assignment of the proton
resonances from the NOESY and DQF-COSY spectra, thereby restricting the number of structural information. We
have tackled this problem by using a partially-deuterated 31mer RNA analogue
2
(Fig.
1
) in which deuterated cytidine residues [C5 (>95 atom %
2
H), C2' (>97 atom %
2
H), C3' (>97 atom %
2
H), C4' (>65 atom %
2
H) and C5' (>97 atom %
2
H)] (
39
) were incorporated by T7 RNA polymerase reaction. The choice of cytidine was
dictated by two factors: (i) The H5-H6 NOE crosspeaks are very strong and often obscure the anomeric-aromatic region of the NOESY spectra, and (ii) the C residues are
found in both stems I and II of the 31mer RNA, yet they are the only nucleotide
units which are not part of the loop I. Since divalent metal ions cleave the
RNase P RNA, we have also studied the effect of Mg
2+
on this 31mer RNA by NMR spectroscopy.
MATERIALS AND METHODS
Synthesis of 31mer RNA and its partially-deuterated analog by T7 RNA polymerase
C5-Deuterated (
20
) cytidine-d
6
block was prepared from 2',3'-
O
-isopropylidine uridine-d
5
using our procedure (
19
), which was subsequently converted to its 5'-triphosphate (
21
). The natural 31mer RNA
1
and its deuterated counterpart
2
were synthesized using T7 DNA dependent RNA polymerase as described elsewhere (
22
,
23
) and the concentration of DNA template was optimized to 0.15 [mu]M. In the preparation of
2
, cytidine 5'-triphosphate was substituted with cytidine-d
6
-5'-triphosphate (
39
).
NMR spectroscopy
Sample preparation
. The natural 31mer RNA
1
and the partially-deuterated 31mer RNA analogue
2
were dissolved in 0.4 ml of a phosphate buffer consisting of 50 mM NaCl, 10 mM
sodium phosphate, 0.1 mM EDTA at pH 6.2. For measurements in
2
H
2
O, the sample was lyophilized three times from 99.9%
2
H
2
O and dissolved in 0.5 ml of 99.98%
2
H
2
O. The final RNA concentration was 1 mM. All NMR spectra were recorded on a
BRUKER AMX 500 MHz instrument.
Exchangeable proton spectra.
The 1D spectra in 1:9, v/v,
2
H
2
O/H
2
O were recorded using a 1331 binomial solvent suppression pulse sequence (
24
).
Rate of exchange
. A combination of NOESY and ROESY experiments (
9
) was used to calculate the rates of exchange of the imino protons. The mixing
times used were 8, 10, 20, 25, 30 40 and 55 ms for NOESY and 8, 12, 16, 20 and
24 ms for ROESY. The experiments were performed at 5, 8, 11, 14, 17, 20 and 23oC (128 FIDs with 2 K data points, 16 scans per FID with a sweep width of
10200 Hz). A recycle delay of 5 s was used, which gave an optimal intensity of
the NOE crosspeaks. The volumes of NOE crosspeaks and diagonal were measured
using the program AURELIA (
25
). The activation energy (E
a
) of the exchange process was obtained from Arrhenius plot of the temperature
dependent exchange rate.
Spectra in
2
H
2
O
. 2D NMR spectra of the RNA in 99.98%
2
H
2
O were recorded using the TPPI method (
29
) and pre-irradiation at low decoupler power of the residual HDO peak. NOESY spectra
(
26
) were recorded at 100, 300 and 600 ms (512 FIDs with 4K data points, spectral
width of 5050 Hz, 32-64 scans for each FID). The data were zero-filled to 2 K * 2 K or 2 K * 1 K before applying a 2 Hz line-broadening factor in the F1 and F2 dimensions.
Relaxation delays of 3 s were used. The DQF-COSY spectra (
27
) were acquired with 4K data points in t2 and 512 points in t1 (32-64 scans for each FID with a sweep width of 5050 Hz). The data were zero
filled to give a 4K * 2K matrix, and a [pi]/4 shifted sine-square bell window was applied in both dimensions before
Fourier transformation. The clean-TOCSY spectra (
28
) were acquired with 512 spectra of 4K data points. The data were zero filled to
give a 4K * 2K matrix, and a [pi]/2 shifted sine-square window was applied in both dimensions before Fourier
transformation with the MLEV-17 sequence applied for mixing time using an extra delay of 65 [mu]s for compensation of NOE. Total mixing times of 25, 50 and 100 ms
were used.
XPLOR structure refinement
In the distance-geometry [XPLOR v.3.1 (
15
)], 655 NMR constraints were used: we performed usual substructure embedding
followed by regularization [(i) two energy minimizations, (ii) 625 MD steps at
2000 K, (iii) cooling to 100 K over 1000 MD steps] and seven cycles of
simulated annealing [(i) from 1000 K to 100 K cooling over 2000 MD steps, (ii)
200 steps of energy minimization. Force constants for the distance constraints
were set to 50 kcal Å
-2
in the first four cycles, followed by three cycles with force constants of 100,
200 and 400 kcal Å
-2
]. The MD step length was set to 3 fs. Extra planarity constraint of 2.0 kcal
was used in the first cycle, which was raised gradually to 20 kcal in the
fourth cycle and then kept constant. For all dihedral restraints a force
constant of 200 kcal rad
-2
was used.
AMBER structure refinement
The SANDER module of the AMBER 4.0 program package (
16
) was used both for the MD simulations and the energy minimizations with
infinite non-bonded cut-offs and flat bottom potential wells for constraining both distances
[the RANDMARDI (
30
-
33
,
38
) assigned widths of the wells were at least +-5% or +-0.1 Å] and dihedral angles (+-45o to +-90o for the backbone torsions and +-6.1o to +-21.5o for
the endocyclic sugar torsions). Force constants = 40 kcal Å
-2
for the distances and 40 kcal rad
-2
for the backbone torsions, and 100 kcal rad
-2
for endocyclic sugar torsions. In the MD simulations, the following settings
were used: (i) time step of 1 fs, (ii) bonds involving hydrogen atoms
constrained using shake (
17
), (iii) constant temperature mode [Berendsen algorithm (
18
), [tau] = 0.2 ps], (iv) the force constants were gradually increased from 5 kcal Å
-2
(5 or 12.5 kcal rad
-2
for torsions) to 40 kcal Å
-2
(40 or 100 kcal rad
-2
for torsions) over the first 2 ps. The AMBER refinement involved (1) Energy
minimization for 500 steps. (2a) MD simulation of 7 ps at 400 K. (2b) Cooling
to 300 K over a 1 ps period. (2c) MD simulation of 2 ps at 300 K. During the
last 2 ps the atomic coordinates were saved at 250 fs intervals. (3) The eight
collected conformers were averaged and the average structure was energy
minimized.
RESULTS AND DISCUSSION
Assignment of exchangeable protons
Assignment of non-exchangeable protons
Aromatic and anomeric protons.
All 13 H5-H6 crosspeaks were assigned to each of the 13 pyrimidine residues in the
DQF-COSY spectra of
1
, showing that a single conformation predominates under our NMR measurement
conditions. The U residues could be easily distinguished (
10
) from the C residues in the DQF-COSY spectra of
2
in which all C residues are C5 deuterated, resulting (
10
) in a disappearance of the H5-H6 crosspeaks
.
The assignment of the non-exchangeable protons was made using the connectivities H8/H6(i)-H1'(i)-H8/H6(i+1) in the 2D NOESY spectra of
1
at 26oC (Fig.
3
A). The weak H8/H6(i) to H5(i+1) NOEs were also used together with the sequence
information to determine as well as to confirm the assignment. Most of the
ambiguities encountered in the resonance assignment procedure owing to the
spectral overlap could be nicely resolved from the NOESY spectrum recorded at
40oC.
.
Rate constants k
ex
(s
-1
) and E
a
(kcal.mol
-1
) determined for the exchange process of the imino protons of
1
by NOESY
#
experiments without Mg
2+
(a) and with Mg
2+
(b)
#
The k
ex
calculated from NOESY and ROESY were very similar, hence only the k
ex
from NOESY are reported.*Cannot be calculated due to slow exchange.**Cannot be calculated due to fast exchange.
[curren]
Only two points could be measured at 20 and 23oC.
Effect of Mg
2+
Exchangeable imino protons.
Upon addition of Mg
2+
ions (up to 10 equiv.), no new imino resonance was observed, and the imino
resonances at 11.9, 12.0 and 12.4 p.p.m. were not stabilized by addition of Mg
2+
ions, suggesting that Mg
2+
ion does not induce a significant conformational change in the base pair
scheme.
Figure 6
.
Schematic representation of sequential and long-range interesidual NOEs observed in the 31mer RNA. The solid lines
represent the NOEs obtained from the natural 31mer RNA
1
and the dotted lines represent the NOEs obtained from the C-d
6
partially deuterated 31mer RNA
2
.
Figure 7
.
Stereoview of the best of the 109 structures after NMR constrained AMBER MD and
energy minimization, as judged by R-factor analysis by MARDIGRAS. Only heavy atoms are shown (see Fig. 1 for
the numbering scheme and Fig. 8 for the zooms of various parts of this
structure).
.
Number of distance and dihedral constraints used in XPLOR. Distance constraints
generated by RANDMARDI approach, shown in parenthesis, were used in AMBER
a
The number of intraresidue distance constraints calculated directly from the
NOESY volumes by the `strong-medium-weak' approach or by RANDMARDI (in parenthesis).
b
The number of interresidual constraints calculated directly from the NOESY
volumes by the `strong-medium-weak' approach or by RANDMARDI (in parenthesis). The number of
interresidual distance constraints is shown twice because of involvement of two
residues in a NOE crosspeak.
c
Intrastrand distance constraints.
d
Interstrand distance constraints.
e
Number of distance constraints derived indirectly from NOESY data such as the
hydrogen bond constraints in a Watson-Crick base pair or distances approximated from volumes with a high degree
of spin diffusion as well as all data from NOESY experiments in H
2
O. The number of interresidue distance constraints is shown twice because of
involvement of two residues in a NOE crosspeak.
Figure 8
.
Stereoviews of the zooms of the various parts of the best structure shown in
Figure 7 (see Fig. 1 for the numbering scheme as well as for the nomenclature).
(
A
) Watson-Crick base paired A-RNA type helix for the
30
C to
28
G residues (left strand) and
2
G to
4
C residues (right strand) in stem I. (
B
) A partial continuity of the base-stacked A-RNA type helix for the
27
A to
23
G residues (left strand) and
5
A to
10
G residues (right strand) in the flexible loop I. (
C
) Watson-Crick base paired A-RNA type helix for the
22
C to
19
G residues (left strand) and
11
G to
14
C residues (right strand) in stem II. (
D
) The UUCG tetra loop for
15
U (lower right) to
18
G (lower left) residues in loop II.
Rate of exchange of the imino protons.
Upon addition of MgCl
2
, only the rate of exchange of the imino proton of G18 increases (Table
2
), and its E
a
also changes significantly, suggesting that the first Mg
2+
binding site is in the vicinity of G18. The increase of exchange rate observed
upon Mg
2+
binding might be due to its hydrated nature, which promotes exchange rate with
the imino proton of G18 at the G18[middot]U15 base pair. The fact that E
a
increases despite the fact that k
ex
increases also indicates that the entropy of activation has a large
contribution to the observed rate. It is noteworthy that a G[middot]U base pair also defines a divalent ion binding site in a tRNA acceptor
stem (
13
) and also at the 5'-splice site of Group I introns (
14
).
Non-exchangeable aromatic and anomeric protons.
Upon addition of Mg
2+
ion, H8(A27) is shielded by 0.06 p.p.m. and H1'(G28) by 0.15 p.p.m., while the protons of the other nucleotides have
chemical shifts changes between 0 and 0.04 p.p.m., suggesting that the second
Mg
2+
binds in the vicinity of the stem I-loop I junction which is in spatially proximity of U26-G25-G24 sequence [corresponding to residues U294, G293 and G292
of RNase P RNA (
3
,
4
-
6
)].
Structure determination
Distance constraints.
NOESY volumes were obtained from four different NOESY spectra, 100 and 300 ms
mixing times for both the natural 31mer and its partially-deuterated counterpart. Schematic representation of sequential and long-range interresidual NOEs observed in the 31mer RNA is given in
Figure
6
. The distance constraints in XPLOR were obtained by the classification of NOESY
crosspeak intensities as strong (1.8-3.0 Å), medium (1.8-4.0 Å), weak (1.8-5.0 Å) or very weak (1.8-7.0 Å). The distance
constraints for AMBER were derived from the experimental NOESY volumes using
the iterative relaxation rate matrix method of MARDIGRAS and the RANDMARDI
modification of MARDIGRAS by James
et al.
(
30
-
33
,
38
) starting from many different distance-geometry derived conformers. In addition, data from a 600 ms NOESY
experiment in H
2
O was used to determine the spatial proximities between bases. Due to the
decrease in the problem of signal overlap, the partially-deuterated 31mer allowed us to extract an additional 28 distances (8 intra- and 20 interresidue). From the four NOESY experiments run in D
2
O a total of 375 distance constraints were generated, 207 intraresidue distances
and 168 interresidue distances. In addition, we used an extra 52 distance constraints arising from both (i) the
NOESY spectra in D
2
O (which were not used directly due to spin diffusion) as well as imino-imino, imino of residue (i) to H1'(i+1) and imino(i) to H1' of the residue base pairing with residue (i-1) (i.e. typical A-RNA features) crosspeaks in H
2
O were also included, where precise distances could not be estimated. In this
context, the distances found in models of the UUCG hairpin RNA (
7
a) were also used for constraints. Other extra constraints included the
distances between heteroatoms in the H-bonded base pairs.
In the final MD simulations and energy minimizations, distance constraints
calculated by averaging the results from a series of RANDMARDI (
30
-
33
,
38
) distance calculations on 109 conformers using four different NOESY spectra
(100 and 300 ms mixing times for both the natural and the partially-deuterated 31mer RNA, yielding 436 individual RANDMARDI results) were used
in conjunction with the same 52 extra distance constraints described above. The
RANDMARDI program calculated distance constraints from NOESY volumes by
averaging the results from 30 individual MARDIGRAS calculations in which
randomized noise with a noise level corresponding to the smallest volume in the
entire spectrum was added (
34
) to the experimental NOESY volumes. The MARDIGRAS program (
30
-
33
) is based on the analysis of the complete relaxation rate matrix and, given a
particular starting conformer, tries to find the inter-proton distances which best fit the experimental NOESY volumes in an
iterative fashion. There were a total of 362 different distance constraints
generated by the 436 RANDMARDI calculations, 13 distance constraints were
rejected by the RANDMARDI program.
Dihedral constraints.
A total of 228 dihedral constraints were used in the following manner. The stem
II and loop II residues (
11
G
12
G
13
G
14
C
15
U
16
U
17
C
18
G
19
G
20
C
21
C
22
C) in our 31mer RNA have the corresponding residues (except for the base paired
13
G and
20
C) in the 12mer hairpin loop RNA, GG
A
CUUCGG
U
CC, studied by Varani
et al
. (
7
). The chemical shifts of the residues within the loop
14
C
15
U
16
U
17
C
18
G
19
G in 31mer RNA and in Varani's tetraloop (
7
) are within +-0.15 p.p.m., suggesting their structural similarity. Hence, 79 of the
backbone torsions of stem II and loop II residues were constrained to the
values (+-45o) found in the 12mer RNA: [alpha], [beta] and [gamma] for residues G12-C22, [delta] and [chi] for residues G11-C22 and [epsilon] and [zeta] for
residues G11-C21. The sugar moieties of residues G11-C22 were similarly constrained, as found in the 12mer RNA loop, by
applying constraints on [nu]
0
, [nu]
1
, [nu]
2
and [nu]
4
. The center value and the allowed variation of each torsion was calculated in
such a way that the phaseangle of pseudorotation of a particular sugar would be
within 20o and the puckering amplitude would be within 5o of the value for the corresponding sugar in the 12mer RNA. Typically,
torsions were allowed to differ between +-9 and +-15o from the center value. 48 dihedral constraints of this
type were used. 26 dihedral constraints, typical of A-RNA (as evident from NOESY spectra), were applied to backbone torsions of
the double-stranded part of the 31mer (stem I): the [alpha], [beta], [gamma], [epsilon], [zeta] and [chi] torsions were constrained to -60o +- 60o, 180o +-
60o, 60o +- 60o, 180o +- 60o, -60o +- 60o and 180o +-
90o, respectively. All residues for which the J
1',2'
indicated a clear preference for either the S-type or the N-type sugar were constrained (P = 10o +- 30o for N-type sugars and 165o +- 30o for S-type sugars with a
puckering amplitude of 38o +- 5o). In total, we used 655 experimental distance and torsional
constraints for distance-geometry and simulated annealing of 31mer RNA, which makes an average of
21.1 constraints/nucleotide residue. However, it should be noted that for
constrained MD and energy minimization by AMBER, only 642 constraints were used
(average of 20.7 constraints/nucleotide residue).
Distance-geometry and simulated annealing using the XPLOR program.
Using the constraints described above, 200 structures were generated using
distance-geometry, regularization and simulated annealing calculations (see
Materials and Methods). After the seven cycles of simulated annealing, the 200
structures were evaluated in terms of number of violations of the distance
constraints. 91 Structures had more than two 0.2 Å distance violations and they were rejected. The remaining 109 structures
were analyzed by the MARDIGRAS program and the agreement between their
theoretically calculated NOE volumes with the experimental ones were measured.
The R-, R
2
-, R
x
- and R
x
2
-factors were found to be in the range 0.543-0.871, 0.658-1.106, 0.099-0.159 and 0.126-0.194 respectively. The conformational
similarity between the 109 structures was assessed by calculating an average
structure and comparing all 109 structures to the average (rmsd): 5.97 Å ([sigma] = 1.04 Å) for all heavy atoms of the structure, 1.87 Å ([sigma] = 0.84 Å) for stem I, 1.93 Å ([sigma] = 0.79 Å) for stem II,
4.4 Å ([sigma] = 0.62 Å) for loop I and 2.26 Å ([sigma] = 0.5 Å) for loop II.
Molecular dynamics simulations and energy minimizations using the AMBER program.
The 109 selected structures from the distance-geometry simulated annealing structure refinement procedure were used as
starting structures for 109 separate runs using the NMR constrained AMBER
procedure (see Materials and Methods).
Precision of the structure.
Nine conformers out of the 109 structures were selected because they showed the
lowest combination of R-factors after the constrained MD and minimization by AMBER in the previous
step, the structure showing the best agreement with the NMR data, as evident by
R-factor analysis, is shown in Figures
7
and
8
. The ranges of the R-, R
2
-, R
x
- and R
x
2
-factors for these nine best conformers were 0.330-0.429, 0.369-0.522, 0.066-0.083 and 0.082-0.110 respectively. It should be noted here
that such low R-factors in the MARDIGRAS were earlier obtained only for a much smaller
oligo-DNA/RNA (
35
-
37
), comprising approximately two-thirds of the number of nucleotides of the present 31mer RNA. The nine
individual structures were then compared with the average of these nine
structures with respect to all heavy atom, showing an average rmsd of 6.02 Å. When only a specific region of the 31mer was considered, the average
rmsd was 0.86, 0.77, 4.08 and 1.02 Å for stem I and II, loop I and II, respectively. A simple comparison of
these rmsds reveals that most of the structural differences in all heavy atom
comparison come from the large loop I region, but in contrast the conformation
of the two double-stranded stems I and II regions and loop II region show significant
conformational homogeneity. One of the main reasons for the observed rmsd
differences between stem I, stem II, loop II, in one hand, and loop I on the
other, is owing to the fact that relatively larger number of experimental NMR
constraints per nucleotide residue is available in the former group (22.8
distance constraint/nucleotide) compared with the latter (17.5 distance
constraint/nucleotide) (Table
4
). A comparison of the types of constraints for various nucleotides in various
parts of the 31mer RNA in Table
4
also shows an absence of imino to H1'(i+1) and imino to H1'(i-1) of the opposite strand, Watson-Crick hydrogen-bonds, as well as of imino to imino cross- and intra-strand distance constraints for
loop I nucleotides, which makes it very difficult to define its conformation,
thereby suggesting either a highly flexible nature or simply that the residues
are more than ~5 Å away from each other.
Conclusion
We have shown that the deuteration of nucleotides helps to simplify the NMR
spectra of RNA. By deuteration at the C5, C2', C3', C4' and C5' positions of the cytosine nucleotides which
constitute the stems, we were able to assign without ambiguity the anomeric and
aromatic protons. This was not possible from the NOESY spectra of the natural
31mer RNA where the strong H5-H6 crosspeaks were overlapping with some aromatic-anomeric NOEs. This possibility of assignment without ambiguity was
particularly important for the loop I region where no assumption on the
conformation can be made. The use of the partially-deuterated 31mer RNA has enormously facilitated the resonance assignment
of the non-anomeric sugar protons and the extraction of the NOE volumes. Out of 235
non-exchangeable proton resonances in the natural 31mer RNA
1
, 175 resonances have been unambiguously assigned. The 60 resonances that we
could not assign belong to the category of H4' and H5'/5''. Of the 175 assigned resonances, 41 resonances (15
belonging to the C residues) in the natural 31mer could only be assigned on the
basis of the comparative analysis between the natural 31mer RNA and its
partially-deuterated analogue, thereby showing the usefulness of our non-uniform deuterium labelling techniques (
39
-
44
). This has led us to obtain additional 8% constraints from the deuterated 31mer
RNA compared to the natural counterpart. It is clear that a successive
incorporation of the deuterated A, G and U residues in 31mer RNA would give us
additional structural information, giving larger number of NMR constraints for
further structure refinement. Alternatively, we are also trying to improve our
RNA solid-phase synthesis protocol to create an `NMR-window' (
44
) on the loop I, enabling only a selected protonated part to be visible by NMR
spectroscopy.
The NMR data supporting Watson-Crick base pairing, base-base stacking interactions and the North conformation of the
sugars indicates that both stems I and II of the 31mer RNA have an A-type geometry. The base-base stacking found in stem I is conserved into the loop I. The
base pairs C4[middot]G28 and G11[middot]C22, which are at two ends of the loop II, have very different
behaviour. The access of water to the G28[middot]C4 base paired imino proton is reduced probably through stacking
interactions. Thus, NOESY data have shown that G28 strongly stacks on A27 and
that C4 strongly stacks on A5. The observation of interstrand NOEs in the
internal loop in close proximity of the G28[middot]C4 base pair together with the observation of NOEs between the H2 of A5,
A6 and A27 to the H1' of their 3'-neighbouring residue also indicate that the A-type conformation continues until the beginning of the
stem I to loop I junction. The loop I conformation is more flexible in the loop
I-stem II region. The stacking interactions between A9, G10, G11 and
between C22, G23 and G24 are weaker and NOE data suggest that G10 is bulged
out. These data together with the fast rate of exchange of G11 indicate a less
ordered region of the loop I with structural perturbation at and around G11.
Magnesium ion binding occurs most probably around the G18[middot]U15 base pair in the loop II and around the A27 and G28 nucleotides at
the stem I-loop I junction. This second site of binding is of particular interest
since magnesium ion has been shown to bind in the vicinity of loop I region in
the
E.coli
RNase P RNA. Work is now in progress in our lab to study the solution
conformation of large functional RNA using the deuterated analogues in which
deuterated residues are incorporated in the NMR-invisible part (
39
-
44
) and appropriate
13
C and
15
N labelled sugar residues are incorporated in the NMR-visible part (
39
-
44
) to extract the sugar-phosphate backbone constraint for more accurate structure determination.
ACKNOWLEDGEMENTS
We thank Swedish Natural Science Research Council (NFR), Swedish Council of
Engineering and Research (TFR) and Swedish Board for Technical Development
(NUTEK) for generous financial support (to JC and LK). We would also like to
thank Prof. T. James for kindly supplying us with the latest versions of
MARDIGRAS and RANDMARDI. Thanks are also due to the Wallenbergstiftelsen,
Forskningsrådsnämnden, and University of Uppsala for funds for the purchase of a
500 MHz Bruker AMX NMR spectrometer and for the generous funding of our new NMR
facility.
REFERENCES
1 (a) Deutscher,M.P. (1990) Prog. Nucleic Acids Res. Mol. Biol., 39, 209-240; (b) Altman,S., Kirsebom,L.A. and Talbot,S. (1995) In Söll,D. and RajBhandary,U.R. (eds) tRNA: Structure, Biosynthesis and Function, American Society for Microbiology, Washington, DC 20005, pp. 67-78; (c) Kirsebom,L.A.(1995) Mol. Microbiol., 17, 411-420.
7 (a) Varani,G., Cheong,C. and Tinoco,I. Jr. (1991) Biochemistry, 30, 3280-3289. (b) For a modified structure for UUCG loop, see Allain,F.H.-T and Varani,G. (1995) J. Mol. Biol., 250, 333-353.
8 Heus,H.A. and Pardi,A. (1991) J. Am. Chem. Soc., 113, 4360-4361.