RNA aptamers that bind L-arginine with sub-micromolar dissociation constants and high enantioselectivity
RNA aptamers that bind L-arginine with sub-micromolar dissociation constants and high enantioselectivity
Albert
Geiger
,
Petra
Burgstaller
,
Herbert
von der Eltz
1
,
Albert
Roeder
1
and
Michael
Famulok
*
Institut für Biochemie, Genzentrum der Ludwig-Maximilians-Universität München, Würmtalstraße 221,
81375
München
,
Germany
and
1
Boehringer Mannheim GmbH, Research and Development,
Werk Penzberg, Nonnenwald 2, 82372
Penzberg
,
Germany
Received December 18, 1995;
Revised and Accepted February 6, 1996
ABSTRACT
A completely randomized RNA pool as well as a degenerate pool comprised of an RNA
sequence which binds citrulline with a dissociation constant of 60
[mu]
M were used to select for tight binding arginine specific RNA aptamers. A
modified
in vitro
selection scheme, based on affinity chromatography was applied to allow the enrichment of high affinity solution binders. The selection scheme included a negative selection with the non-cognate ligand citrulline, and a heat denaturation step prior to affinity
elution with an excess of the cognate ligand arginine. After 20 cycles the
majority of the pools bound specifically to the arginine matrix even after denaturation/renaturation in the presence of 20 mM of a non-cognate amino acid. When denatured and eluted in the presence of 20 mM
arginine, the selected RNAs quantitatively washed off the column. These RNA aptamers were cloned and sequenced. Equilibrium dialysis performed with the most abundant clone among the selected sequences revealed
K
d values of 330 nM for the RNA/arginine affinity, which is nearly a 200-fold improvement over the tightest binding arginine binding RNAs known to date. Arginine recognition by this RNA is highly
enantioselectice: L-arginine is bound 12 000-fold better than D-arginine. Chemical modification analysis revealed that the
secondary structure of the aptamer might contain a pseudoknot motif. Our tight
binding arginine aptamers join a number of natural and
in vitro
selected RNAs which recognize arginine. The RNAs described here compare in
their binding affinity with the tightest binding RNA aptamers for low molecular
weight molecules isolated in other
in vitr
o selection experiments.
INTRODUCTION
Interactions between amino acids and RNA play substantial roles in a number of
biological systems (
1
). For example, arginine inhibits the self-splicing reaction of the group I intron of
Tetrahymena
by substituting for two H-donor sites of the guanosine cofactor which contact the G
264
-C
311
base pair in the ribozyme's guanosine binding site (
2
,
3
). Recently, the editing reactions of aminoacyl tRNA synthetases have been
viewed as an example of RNA dependent amino acid recognition (
4
). These editing reactions involve RNA dependent steps which eliminate errors of
amino acid activation and aminoacylation (
5
). A third example is the interaction of the HIV-1 TAT protein with a stem-loop structure of TAR RNA, located at the 5'-end of HIV-1 mRNA. Critical for the recognition of TAT and
TAR is a single arginine within a basic region of TAT (
6
). Short oligopeptides resembling the basic region as well as free arginine bind
specifically to TAR, although weaker than within the context of the whole
protein (
7
). The TAT-TAR interaction provided the first example to show that in protein-RNA recognition RNA structures are involved which interact with individual amino
acid side chains in the protein. It seems likely that other, yet undiscovered
RNA-protein interactions exist in which single amino acid side chains within
a protein or peptide form specific contacts to structural elements provided by
RNA to largely determine specificity, functionality and strength of binding (
1
).
The isolation and characterization of RNA sequences which specifically recognize
individual amino acids might facilitate a better understanding of biologically
relevant protein-RNA or RNA-amino acid interactions. A powerful tool to obtain amino acid
binding RNAs is
in vitro
selection (
8
-
11
). RNA aptamers which specifically recognize amino acids, such as immobilized
tryptophan (
12
), arginine (
13
,
14
), citrulline (
14
) and valine (
15
) have been extracted from pools of up to 10
15
different RNA sequences. The reported affinities ranged from 60 [mu]M (
14
) to 12 mM (
15
) with a high level of discrimination against other amino acids being obtained
in each case. Among these amino acid binding RNAs, the arginine specific
aptamers might be especially relevant to protein-RNA recognition because arginine side chains carrying a positive charge
at neutral pH seem to be particularly suited to form specific contacts with a
negatively charged nucleic acid (
16
). For example, the HIV-1 Rev protein contains a basic region in which 10 out of 17 amino acids
between positions 34 and 50 are arginines. A corresponding 17mer peptide binds
to the Rev responsive element (RRE) RNA IIB hairpin in the same way as within the context of the full-length protein (
17
) with four arginines being important for specificity (
18
). In the BIV-TAR/Tat complex an arginine and an isoleucine residue were found to be
critical for binding and specificity (
19
). Furthermore, the Rex-protein of HTLV-I might interact with its natural RNA binding element XBE through
arginine residues (
20
).
To explore the range of affinities which can be achieved in RNA-arginine recognition we set out to obtain RNA sequence motifs which
tightly bind to arginine. We started with a completely randomized pool and a
partially randomized pool of RNAs comprised of a citrulline binding sequence
which was previously isolated by
in vitro
selection (
14
). Previously, arginine binding sequences which differed from the citrulline
motif in three base positions were identified from the latter pool (
14
). In the present study, a modified
in vitro
selection scheme based on affinity chromatography was applied to extract tight
binding arginine specific RNA aptamers from the two pools. A number of
sequences were identified which bound tightly to arginine and did not share
significant sequence homologies to each other or any of the other previously
identified arginine aptamers (
13
,
14
). One of these sequences was shown to bind L-arginine with high enantioselectivity and with affinities in the order of
magnitude of the tightest binding RNA aptamers for low molecular weight
molecules obtained so far (
21
-
24
). Our study shows that the problem of arginine recognition can be achieved by
many different RNA sequences over a broad range of binding affinities.
MATERIALS AND METHODS
Materials
L-Arginine, D-arginine, L-citrulline, agmatine and 1-cyclohexyl- 3-(2-(morpholino-ethyl)-carbodiimide metho-p-toluenesulphonate (CMCT) were purchased from Sigma. L-Homoarginine and dimethyl sulfate (DMS) were from Aldrich, kethoxal (KE) from ICN. L-[2,3,4,5-
3
H]arginine-HCl, [[alpha]-
32
P]ATP and [[gamma]-
32
P]ATP (3 Ci/[mu]mol) from Amersham. dNTPs, NTPs, glycogen and all enzymes and cloning
plasmids used were from Boehringer Mannheim unless otherwise noted. Sequencing
was performed with a digoxigenin labelled primer (DIG
Taq
DNA sequencing; Boehringer Mannheim). PCR primers were synthesized on an
Millipore Expedite oligonucleotide synthesizer using standard phosphoramidite
chemistry, and purified by HPLC, unless otherwise noted. Epoxy activated
agarose was purchased from Pharmacia.
Agaroses
Arginine-agaroses were obtained by derivatization of epoxy activated Sepharose 6B
according to the manufacturer's protocol. The agarose used in the selection was
prepared with 3.0 [mu]M arginine per millilitre of swollen agarose (3.0 mM), the agarose used in
the binding assays were derivatized to 4.0 mM arginine. The coupling efficiency was determined by addition of L-[2,3,4,5-
3
H]arginine-HCl to a coupling reaction containing 100 [mu]l of activated agarose, followed by quantification of immobilized radioactivity by Cerenkov
counting. Preselection agarose was derivatized with glycine in the same way.
RNA pool
The synthesis of pool 1 is described previously (
14
). For pool 2 we synthesized a 111mer DNA with a 30% doped insert of 74
nucleotides between a 18mer and a 19mer primer binding site on an Applied Biosystems DNA
synthesizer. The sequences of the primer binding sites and the primers are
shown in Figure
1
A.
Selections
Ligand-derivatized agarose (0.5 ml) was equilibrated with several column volumes of binding buffer (250 mM NaCl, 50 mM Tris-HCl pH 7.6, 5 mM MgCl
2
). Aqueous RNA solution (10-15 [mu]g; 0.4 nmol) was diluted with water to a final volume of 80 [mu]l, heated to 90oC for 10 min, cooled to 23oC for 15 min and 20 [mu]l 5* binding buffer was added. After incubation
for 5 min at 23oC this RNA was loaded onto the column, rinsed with 5 column volumes (5.0
ml) of binding buffer. Then the column was washed with 6 column volumes of a
solution of 20 mM citrulline in binding buffer. The agarose was heated to 90oC for 10 min, cooled to 23oC for 15 min and washed with 5 column volumes of 20 mM citrulline in
binding buffer. Elution of arginine specific aptamers was done with a 20 mM
solution of arginine in binding buffer for 15 column volumes, followed by a
heat denaturation step (90oC, 10 min, 23oC, 10 min) before elution with 5 column volumes of 20 mM arginine in
binding buffer (Fig.
1
C). Only the fraction of RNA which eluted after the heat
denaturation/renaturation step in the presence of arginine was collected,
phenol extracted to remove the amino acid, precipitated and amplified as
described previously (
14
).
K
d determinations
Measurements of dissociation constants were performed by the methods of
equilibrium dialysis (
22
,
25
,
26
) using dialysis chambers with 500 [mu]l volumes. The membrane (cellulose triacetate filters; Sartorius) had a
molecular weight cutoff of 5000 Da. Typically, 1.0 [mu]M RNA concentrations were used. Measurements were taken at three different
concentrations of L-[2,3,4,5-
3
H] arginine-HCl (250, 25 and 2.5 [mu]M). Sample volumes were 200 [mu]l on each side of the membrane. Samples were equilibrated at 23oC overnight, after which the volume on both sides of the membranes
was determined to ensure equal volumes on each side. Aliquots were withdrawn
and subjected to scintillation counting. From the specific activity of
arginine, the concentrations of bound ligand [ES]
eq
, free amino acid [S]
eq
and free RNA [E]
eq
, at equilibrium, were calculated. The
K
d
was calculated from the equation:
K
d
= ([E]
eq
* [S]
eq
) * [ES]
eq
-1
This analysis assumes the formation of a 1:1 complex between the RNA and the
amino acid. The
K
d
for the binding of the clone ag.06 to D-arginine was determined by analytical affinity chromato- graphy (
27
). When applied to the 4.0 mM D-arginine agarose, this RNA elutes in the void volume of the column (170 [mu]l agarose, see Fig.
5
B). We calculate the
K
d
to be >= 4 mM.
Chemical modification
RNA ag.06 (1-5 pmol) was denatured for 3 min at 95oC and renatured in the absence or presence of 50 [mu]M of amino acid in 250 mM NaCl, 50 mM sodium cacodylate pH 7.4 and 5 mM MgCl
2
for 10 min at room temperature. For probing with CMCT 50 mM potassium borate pH
8.0 instead of sodium cacodylate was used. Chemical modification was performed
by addition of 1 [mu]l DMS (1:5 dilution in 96% ethanol), 1 [mu]l kethoxal (1:5 dilution in H
2
O, stock solution: 37 mg/ml) or 12.5 [mu]l CMCT (32 mg/ml in H
2
O) to a final reaction volume of 50 [mu]l, followed by incubation at 25oC for 20 min. After precipitation in the presence of 10 [mu]g tRNA, samples were dissolved in H
2
O (DMS and CMCT modified samples) or 25 mM potassium borate pH 7.0 (kethoxal modified samples). Detection of modified
positions by primer extension and polyacrylamide gel electrophoresis was
performed as described previously (
28
,
29
)
using 5'-
32
P end-labeled primer M20.106.
RESULTS AND DISCUSSION
The stringent selection scheme led to tight binding sequences
Two RNA-pools of ~10
15
different sequences (Fig.
1
) were subjected to cycles of
in vitro
selection for arginine binding. One pool (pool 2; 117mer) was comprised of a
74mer oligonucleotide sequence which was previously shown to bind citrulline
with a dissociation constant of 65 [mu]M (
14
). Previously, in a standard
in vitro
selection experiment, a mutant RNA aptamer specific for arginine was isolated
from this pool which differed from the citrulline binding RNA by three
mutations (
14
). We have now re-selected the same pool as well as a completely randomized pool (pool 1;
113mer) for tight binding to arginine, by increasing the stringency of the
selection so that weak binders are removed and only the tight binding arginine
specific aptamers are amplified. This was achieved by a modified selection
scheme which included a counter selection (
23
) with 20 mM of the non-cognate amino acid citrulline. To increase the stringency of competition
between free citrulline and immobilized arginine, the column bound RNA was heat
denatured and renatured in the presence of 20 mM citrulline. Elution was
continued with citrulline solution for several column volumes to remove non-bound RNA. We then equilibrated the column with 20 mM arginine in binding
buffer for several column volumes, and carried out a heat
denaturation/renaturation step in 20 mM arginine. Our rationale for this
treatment was that molecules with a slow dissociation rate (
k
off
) which affinity-elute slowly can be assumed to be tight binders. When heat denatured,
these RNAs separate from the immobilized ligand and renature in the presence of
an excess of ligand in solution. This method should avoid the undesired
enrichment of matrix binders or molecules which require functional groups
provided by the matrix in addition to the ligand itself for binding, examples
of which have been reported in previous selection experiments in which the
bound nucleic acid was eluted by denaturation only (
12
).
In this way, we were able to increase the amount of RNAs which elute with
arginine after denaturation/renaturation from undetectable amounts in the first
8 cycles to 5% after 20 cycles. The elution profile obtained at cycle 20 is shown in Figure
2
. Elution profiles were essentially identical for pools 1 and 2.
Secondary structure elucidation
The computer generated folding of clone ag.06 is shown in Figure
3
B. The folding proposes the formation of three relatively long consecutive
stems; stem 1 consists of 7 base pairs (bp) which are separated by a bulged C
from the 10 bp stem 2, and stem 3 containing 8 bp. The formation of stem 3 is
supported by the point mutations found in clones ag.04, and ag.16,
respectively, which result in the substitution of two G[middot]U wobble pairs for G[middot]C pairs (Fig.
3
B).
To examine the secondary structure of clone ag.06, a chemical modification
analysis with DMS, kethoxal and CMCT was carried out in the presence and
absence of L-arginine (Fig.
4
).
Figure 4
.
Refined secondary structure of clone ag.06. (
A
) Chemical probing and footprinting of the secondary structure of ag.06 RNA with
DMS, kethoxal and CMCT. A and G indicate the dideoxy sequencing lanes, K the
primer extension of unmodified RNA. Chemical protection analysis was performed
in the free RNA (-) and in the presence (+) of L-arginine (Arg). The regions corresponding to residues C28C29G30 and
U35G36G37, respectively, which are not accessible to chemical probing and
presumably base pair to form a pseudoknot structure, are indicated by a green
bar. (
B
) Refined secondary structure of ag.06 RNA as proposed from the chemical
modification analysis. The constant regions which belong to the primer binding
sites are represented schematically. The reactivity of nucleotides in the free
RNA is summarized by the coloured labels (blue triangle, DMS-modified; red circle, kethoxal modified; yellow square, CMCT modified).
The size of the label reflects the degree of modification. Base positions which
are protected from modification in the complex are shown in light blue.
Residues C28C29G30 and U35G36G37 are shown in green.
Figure 5
.
Specificity of L-arginine recognition. (
A
) Chemical structures of ligands used for the specificity determination. (
B
) Elution of ag.06 from a 4 mM L-arginine agarose (red) and from a 4 mM D-arginine agarose (blue). Cumulative output c.p.m. are expressed as
the percentage of the input c.p.m. (
C
) Affinity elution of ag.06 RNA from an L-arginine column by L-arginine and competition by arginine analogues. Competitive affinity
elution was performed with 12.5 mM solutions of L-arginine (red), D-arginine (blue), agmatine (green) and L-homoarginine (yellow). The column volume was 0.17 ml of
swollen agarose.In the presence of the ligand the chemical probing data confirm the formation of
the Watson-Crick paired regions. In its absence, only the base pairs formed by A25-U65 and A26-U64 which close stem 2 are accessible to chemical modification. Stem
3 forms regardless whether the amino acid is present since protections from
chemical modification at Watson-Crick positions of stem 3 are identical in presence and absence of L-arginine. Only the positions G49 and U53 which are likely to be
paired, because of the U to C mutation in clone ag.04, are weakly modified.
Among the unpaired base positions in the multibranched loop region only A27,
A31, A41, G61, A62 and G63 are accessible to modification in the absence of
arginine. In the presence of the amino acid, all adenosine residues (A27, A31,
A41 and A62) in the multibranched region are protected. G61 shows identical
degrees of modification in the presence and absence of the ligand. G63 is
weakly protected in the presence of 50 [mu]M arginine. Table
1
summarizes the differences in the modification pattern observed for the free
RNA and the complex.
An interesting result is the lack of modification at positions C28, C29 and G30
both in the presence and absence of arginine. Likewise, we only observed very
weak instead of the expected strong modification in the UGG loop which closes
the 3 bp stem formed by U32-G40, G33-C39 and C34-G38. This suggests the formation of a pseudoknot structure
(
39
,
40
) due to Watson-Crick pairing at positions C28-G37, C29-G36 and possibly G30-U35. A pseudoknot structure was also
the critical recognition element in the cyanocobalamine aptamer isolated by
Lorsch and Szostak (
22
). This pseudoknot bound to cyanocobalamine with values of
K
d
<100 nM.
The relatively high percentage of Watson-Crick pairing found in clone ag.06 and other clones (data not shown)
might be the result of our applied selection scheme involving heat
denaturation/renaturation; those sequences which fold back into a defined
secondary structure within the relatively short renaturation period of 15 min
were preferentially enriched compared to other potentially tight binding
molecules which have slower folding kinetics. This is supported by the finding
that the folding proposed for all RNAs suggest a significantly higher content
of G[middot]C base pairs than A[middot]U or G[middot]U base pairs. For example, clone ag.01 contains 22 G[middot]C pairs, 8 A[middot]U pairs and 7 G[middot]U pairs (data not shown), which
means that 75% of all bases in this clone are base paired.
Binding specificity of ag.06
As expected, binding of clone ag.06 to the non-cognate amino acid L-citrulline was not detectable, consistent with the negative
selection against L-citrulline. To test the specificity of clone ag.06 for L-arginine, we examined the binding affinity of this clone to several
arginine analogues, D-arginine, agmatine and L-homoarginine (Fig.
5
A). Figure
5
B shows the result of a binding assay performed with 4 mM D-arginine agarose and 4 mM L-arginine agarose, respectively. After a 7 ml buffer wash, corresponding to 35 column volumes, 65% of the total input clone
ag.06 RNA still bound to the L-arginine column. From the D-arginine column this RNA elutes in the void volume which means that
the
K
d
of the ag.06 RNA for D-arginine is at best 4.0 mM. With the solution
K
d
of 330 nM for L-arginine recognition, ag.06 discriminates at least 12 000-fold between the D- and L-enantiomer of arginine. This is the highest factor of
discrimination between similar ligands or enantiomers by an RNA molecule known
to date. Other RNA aptamers with similar discrimination factors are known (
41
). For example, an aptamer isolated for theophylline recognition discriminates
theophylline from caffeine with a 10 000-fold difference in the binding affinity (
23
). The enantioselectivity of the ag.06 RNA corresponds to a difference in the
relative binding energies ([Delta][Delta]G) of 30 kJ[middot]mol
-1
(7.2 kcal[middot]mol
-1
). Several other RNA aptamers have been isolated that can discriminate between L- and D-amino acids (
12
-
15
) although with relatively poor discrimination factors.
Figure
5
C shows elution profiles of clone ag.06 bound to L-arginine agarose. Affinity elutions with 12.5 mM solutions of different
arginine analogues were performed after an extensive buffer wash. Only the L-arginine affinity elution was capable of eluting most of the RNA off the
column within 12 column volumes, whereas D-arginine, agmatine or L-homoarginine were inactive. The results of the affinity elutions are
quantified in Table
2
.
Summary of the modified positions in clone ag.06 in the presence and absence of
L-arginine
Base
Modified
Modified
position
in free RNA
in complex
A25
+
-
A26
++
-
A27
++
-
C28
-
-
C29
-
-
G30
+-
+-
A31
++
-
G33
-
-
G36
+-
+-
G37
+-
+-
G38
+-
+-
G40
-
-
A41
++
-
G49
+
+
A50
++
++
A51
++
++
G52
++
++
U53
+
+
G61
++
++
A62
++
-
G63
++
++
++, Indicates a high degree modification; +, indicates a medium degree of
modification, +-, indicates a low degree of modification; -, indicates protection from modification.
The elution data performed with the arginine analogues are consistent with the
clone ag.06 RNA being highly specific for L-arginine recognition. In a recent publication it has been argued that
selection for high-affinity binding automatically results in highly specific binding (
42
). The results presented here confirm these suggestions; although we did not
select for enantioselective binding, the tight binding RNA aptamer ag.06 binds L-arginine with higher enantio- and ligand selectivity than other RNA aptamers with lower binding
affinity to their cognate ligands (
12
-
15
).
Affinity elution of L-arginine-bound ag.06 RNA with different arginine analogues
Ligand
Percent of total input RNA
eluted from L-arginine column
L-arginine
64.1
D-arginine
6.3
agmatine
7.3
L-homoarginine
1.6
The concentration of L-arginine on the column was 4.0 [mu]mol/ml of swollen agarose (4.0 mM) as determined by derivatization in the presence of
3
H-labelled arginine. Affinity elution was performed with 12.5 mM solutions of ligands in
binding buffer over 12 column volumes (2.0 ml) at ambient temperature within 15 min of
incubation.
CONCLUSION
An RNA molecule (`aptamer') which binds tightly and highly specifically to L-arginine in solution has been isolated by applying a modified
in vitro
selection scheme. A series of arginine binding sequences was selected, which do
not share primary sequence relationships to each other. This demonstrates that
the tight and specific molecular recognition of arginine can be achieved by
many different RNA sequences. One of the selected RNA aptamers was shown to
exhibit the highest factor of discrimination between two low molecular weight
enantiomers, L- and D-arginine, reported to date. The relative binding energies for L- and D-arginine binding differ by 30 kJ.mol
-1
(7.2 kcal.mol
-1
) and correspond to a 12 000-fold difference in the
K
d
values. Binding to other arginine analogues, such as agmatine and L-homoarginine cannot be detected. The secondary structure of the tight
binding L-arginine aptamer was refined by chemical modification analysis for the
complex and the free RNA. This analysis revealed the formation of a compact
pseudoknot structure. In the free RNA, all unpaired adenosine residues within
the core region which forms the pseudoknot are accessible to modification with
DMS, whereas these positions are protected in the complex. This suggests that
these residues might be involved in a direct contact to the bound amino acid,
or in the formation of tertiary interactions induced by the ligand. The
specificity of the isolated RNA aptamer for L-arginine might indicate that every substituent carried by the amino acid
is directly contacting the RNA.
The high specificity obtained for the molecular recognition of L-arginine by the ag.06 RNA aptamer raises the question whether such highly
specialized motifs can be expected to play a role in complexes between arginine-rich peptides or proteins and RNA. As recently pointed out by Sundquist (
34
), the interactions in the full length HIV-1 Tat-protein-TAR complex cannot be mimicked by arginine alone, but high
resolution NMR analyses (
16
,
31
) have allowed to relate structural differences observed in the free TAR-RNA and the Tat-TAR complex to individual arginine residues. Analogously, the
identification and characterization of DNA (
43
) or RNA motifs such as the one described here will help to give new insights
into the principles of RNA-protein and RNA-ligand interactions.
ACKNOWLEDGEMENTS
We thank A. Hüttenhofer for discussions and critical reading of the manuscript. M.F.
expresses his gratitude to E.-L. Winnacker and W. Wolf for their support. This work was supported by
grants from Boehringer Mannheim to A.G. and M.F., the Deutsche
Forschungsgemeinschaft (grant no: Fa275/1-2) and the European Union (grant no: Biot-CT93-0345) to M.F.
35 Tao,J. and Frankel,A. D. (1992) Proc. Natl. Acad. Sci. USA, 89, 2723-2726.
36 Wang,Y. and Rando,R.R. (1995) Chem. Biol.,2, 281-290.
37 Lato,S.M., Boles,A.R. and Ellington,A.D. (1995) Chem. Biol., 2, 291-303.
38 Wallis,M.G., Von Ahsen,U., Schroeder,R. and Famulok,M. (1995) Chem. Biol., 2, 543-552.
39 Mans,R.M.W. and Pleij,C.W.A.(1993) In Eckstein,F. and Lilley,D.M.J. (eds) Nucleic Acids and Molecular Biology. Vol. 7, Springer Verlag, Berlin, pp. 250-270.
40 Puglisi,J.D., Wyatt,J.R. and Tinoco,I., Jr (1990) J. Mol. Biol., 214, 437-453.