Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (108K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Soltaninassab, S. R.
Right arrow Articles by LeStourgeon, W. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Soltaninassab, S. R.
Right arrow Articles by LeStourgeon, W. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research Pages 3410-3417  


Oligonucleotide binding specificities of the hnRNP C protein tetramer
Introduction
Materials And Methods
   C-protein purification and quantification
   Fluorescence titrations
   Data analysis
   Competitor RNAs
   Binding isotherms using competitor RNAs
Results
   C protein displays no significant affinity for U-rich polypyrimidine tracts
   The affinity of C protein for a SELEX-identified winner sequence is not solely due to contiguous uridines
Discussion
Acknowledgement
References


Oligonucleotide binding specificities of the hnRNP C protein tetramer

Oligonucleotide binding specificities of the hnRNP C protein tetramer

Syrus R. Soltaninassab, James G. McAfee, Lillian Shahied-Milam, Wallace M. LeStourgeon*

Department of Molecular Biology, Vanderbilt University, Nashville, TN 37235, USA

Received February 20, 1998; Revised and Accepted May 26, 1998

ABSTRACT

Through the use of various non-equilibrium RNA binding techniques, the C protein tetramer of mammalian40S hnRNP particles has been characterized previously as a poly(U) binding protein with specificity for the pyrimidine-rich sequences that often precede 3[prime] intron-exon junctions. C protein has also been characterized as a sequence-independent RNA chaperonin that is distributed along nascent transcripts through cooperative binding and as a protein ruler that defines the length of RNA packaged in 40S monoparticles. In this study fluorescence spectroscopy was used to monitor C protein-oligonucleotide binding in a competition binding assay under equilibrium conditions. Twenty nucleotide substrates corresponding to polypyrimidine tracts from IVS1 of the adenovirus-2 major late transcript, the adenovirus-2 oncoprotein E1A 3[prime] splice site, IVS2 of human [alpha]-tropomyosin, the consensus polypyrimidine tract for U2AF65, AUUUA repeats and r(U)20 were used as competitors. A 20 nt [beta]-globin intronic sequence and a randomly generated oligo were used as competitor controls. These studies reveal that native C protein possesses no enhanced affinity for uridine-rich oligonucleotides, but they confirm the enhanced affinity of C protein for an oligonucleotide identified as a high affinity substrate through selection and amplification. Evidence that the affinity of C protein for the winner sequence is due primarily to its unique structure or to a unique context is seen in its retained substrate affinity when contiguous uridines are replaced with contiguous guanosines.

INTRODUCTION

In the presence of nuclease inhibitors the majority of pre-mRNA molecules released from nuclei by sonic disruption or by low salt extraction exist as oligomeric arrays of 20 nm ribonucleoprotein particles, termed heterogeneous nuclear ribonucleoprotein (hnRNP) complexes (for reviews see 1-3). In the presence of mild nuclease activity these complexes are converted to 20 nm monoparticles that sediment near 40S in density gradients and are termed 30-40S hnRNP core particles. The major proteins recovered from core particles (the A, B and C proteins) apparently exist as three different heterotetramers (4,5). The (A2)3B1 and (C1)3C2 tetramers have been isolated and partially characterized (4,6). Although protein A1 has been shown by crosslinking experiments to exist as homotrimers in monoparticles (7-9), in the absence of RNA stable (A1)3B2 tetramers have not been isolated. The core particle proteins are among the most abundant nuclear proteins (10,11) and they possess the intrinsic activity in vitro to reconstitute hnRNP particles every 700 nt along the length of long RNA substrates (6,12,13).

The nucleation of monoparticle assembly and the contiguous placement of hnRNP particles every 700 nt on long substrates appears to result from the unique properties of the C protein tetramer. For example, the C protein tetramer binds 230 nt of RNA in a highly cooperative salt-resistant manner and each contiguous grouping of three tetramers fold 700 nt lengths of RNA into a unique 19S triangular complex in vitro (6,14). The triangular C protein-RNA complex is a physiologically relevant structure as it exists as a remnant structure following dissociation of hnRNP particles in 300 mM salt. Additionally, purified triangular C protein-RNA complexes nucleate stoichiometric monoparticle assembly in vitro, while, in the absence of C protein, the basic A and B proteins bind RNA to form aggregate complexes that sediment in a heterodisperse manner. These findings indicate that C protein possesses three interrelated functions in pre-mRNA maturation. More specifically, it orders and constrains 230 nt lengths of the elongating transcript, it functions as a protein ruler (through the combinatorial folding of three tetramers) to define the length of RNA to be packaged in hnRNP particles and the 19S triangular complex functions to direct the stoichiometric assembly of monoparticles. The ATP-dependent displacement of C protein and other hnRNPs from splicing-competent pre-RNAs during spliceosome assembly (15) is generally consistant with a sequence-independent RNA chaperonin-like role for C protein in pre-mRNA maturation (3).

Other experimental findings, however, have been interpreted as evidence that C protein functions directly in pre-mRNA splicing through sequence-specific RNA binding events. For example, in oligonucleotide binding experiments it was observed that C protein does not dissociate from Sepharose-linked oligo(U) at 2 M NaCl (16). In experiments where defined RNAs were added to splicing-competent nuclear extracts and exposed to ribonuclease T1, U-rich polypyrimidine tracts were recovered from immunoprecipitates of C protein (17). Additionally, through UV irradiation experiments it has been reported that C protein specifically crosslinks to a U-rich sequence downstream of the cleavage site for polyadenylation (18) and to repeats of AUUUA in the 3[prime]-untranslated region of several mRNAs (19). Through more recent UV crosslinking experiments, it has also been reported that C protein specifically crosslinks to a region of four uridylates in the 5[prime] stem-loop structure of U2 snRNA (20). Finally, using a randomized sequence pool of 20 nt oligomers as in vitro substrates for C protein binding, it was observed that following eight rounds of binding and amplification (the SELEX procedure) the selected oligonucleotides possessed regions of five or more contiguous uridines (21). These non-equilibrium binding studies have led to the suggestion that C protein specifically binds in vivo to the polypyrimidine tract of introns (and perhaps to snRNAs) and functions to recruit snRNPs and other splicing factors to the 3[prime] consensus sequence for spliceosome assembly (17,21-23).

In contrast to the binding studies mentioned above, it has been observed more recently that when cDNAs for human C1 and C2 are expressed in bacterial cells, stable homo C14 and C24 tetramers assemble spontaneously. Under equilibrium conditions these proteins bind single-stranded RNA through a highly cooperative binding mode such that the overall binding affinity for poly(A) and poly(C), of sufficient length to saturate the tetramer (>230 nt), is significantly greater than for poly(U) (14,24). It thus became important to determine if, under equilibrium binding conditions, native C protein displays the same binding properties as recombinant protein and if short U-rich sequences are bound by C protein in preference to other substrates. In the studies described here a competition binding assay conducted under equilibrium conditions was used to investigate the relative affinity of native C protein for several short RNA sequences that have been postulated to function as target sequences for specific binding (17-19,21,25). Included in these studies were RNA oligos corresponding to 3[prime] splice sites possessing authentic polypyrimidine tracts, a uridylate-rich 3[prime]-untranslated region common to several mRNAs, several other uridylate-rich sequences, randomly generated sequences, an intronic sequence with no known splicing function, a SELEX-identified `winner' sequence previously shown to bind with high affinity to bacterially expressed C protein and to variants of the winner sequence (Table 1). As reported here, in comparison with random oligonucleotide sequences of the same length and to intronic sequences, native C protein did not show significantly higher affinity for any of the previously reported C protein target sequences or for uridine-rich oligonucleotides. Also, while the experiments described here confirm the enhanced affinity of C protein for a SELEX-identified winner sequence, they demonstrate that the interaction is not solely due to the presence of contiguous uridines.

MATERIALS AND METHODS

C-protein purification and quantification

C protein was purified from HeLa S3 cells under native conditions from density gradient-purified 40S hnRNP core particles as described previously (26,27). In these experiments an additional elution from the strong anion exchange column, Mono-Q (Pharmacia HR 5/5), was utilized to achieve electrophoretic homogeneity. C protein at concentrations of 1-2 mg/ml was equilibrated in binding buffer A (10 mM Tris, pH 8.0, 1 mM EDTA) and various concentrations of NaCl depending on the titration. Protein concentrations were determined using the BCA assay (Pierce). The BCA assay was accurate to within ±5% as determined through quantitative amino acid analysis. Because C protein contains no tryptophan, quantitative amino acid analyses were also utilized to demonstrate the absence of tryptophan in the purified native C protein preparations.

Fluorescence titrations

Equilibrium binding isotherms were obtained by measuring the change in the fluorescence signal (enhancement) that occurs when C protein binds the fluorescent probe RNA poly[r([epsis]A)]. Fluorescence measurements were performed with an SLM Aminco Bowman Series 2 Luminescence Spectrometer. Excitation and emission slits were fixed at 4 and 8 nm respectively. Titrations were conducted by exciting a fixed amount (0.5-2 µM) of the probe RNA at 310 nm and measuring the change in its emission at 410 nm as a function of increasing protein concentration. The probe RNA was prepared by treating poly(A) with chloroacetaldehyde (28). Fluorescence measurements were corrected for dilution due to the addition of protein sample and for background fluorescence. In all cases, the dilution of probe RNA did not exceed 5% of the total sample volume. Inner filter corrections were not necessary, since the quantities of RNA used in these titrations have negligible absorbance at 310 nm.

Data analysis

Binding parameters for the interaction of native C protein with the probe RNA under equilibrium conditions were derived by simulating theoretical curves to fit the experimental data using the non-cooperative model of McGhee and von Hippel for the interaction of a ligand (protein) with a nucleic acid (29).

[nu](i)/L = K[1 - n[nu]][1 - n[nu]][1 - (n - 1)[nu]]-1

In the above equation, [nu] is the binding density (mol protein bound/mol nucleotide), L is the free ligand (protein) concentration and K is the association constant. The lack of sigmoidicity in the isotherms for the interaction of C protein with probe RNA suggests that the cooperativity parameter [omega], if it exists, is <10n, where n is the ligand binding site size (30). Since cooperativity can only be determined when [omega] is substantially greater than n, we used the simplest model (non-cooperative) to fit the experimental data.

To reduce the number of fitted parameters we independently determined n from stoichiometric titrations of the probe RNA. In addition, the site size has previously been determined through ultrastructural and hydrodynamic studies using various synthetic pre-mRNAs and binding substrates (6,31). Likewise, maximum fluorescent enhancement (Emax) was obtained from experimental binding isotherms. To generate a theoretical set of binding parameters we varied [nu] from 0 to 1/n (the maximum binding density at 100% saturation is 1/n). Theoretical enhancement values (Ecalc) were calculated for any given Emax, K, n and [omega] using the relationship

Ecalc = n[nu]Emax

Total protein concentrations corresponding to calculated E values were determined from the equation

Lt calc = Lb + Lf = (n[nu]Rt)/n + Lf

where Lt and Lb are the total and bound ligand respectively and Rt is the total RNA concentration at any one point during the titration. L was calculated by solving the McGhee-von Hippel equation for free ligand concentration. Lb was calculated in terms of (n[nu]Rt/n). Sets of theoretical data were iteratively generated until the sum of the squared deviation between the experimental and calculated data were minimized. In addition, we found that visual inspection of the binding isotherms are consistant with this analysis.

To confirm binding parameters obtained through fitting routines, non-linear regression analyses were also performed on the experimental data using a modification of methods described elsewhere (28). Root mean square errors in the optimization were evaluated using an iterative algorithm which numerically searches for the value of the binding density, [nu](i), which corresponds to the ith experimental concentration of C protein, Lt exp(i). An initial value for the C protein bound to RNA, Lb(i), is obtained from

Lb(i) = [E(i)/Emax × n[nu](i)] × Rt

where Rt is the concentration of RNA, Emax is the maximum enhancement and E(i) is the experimentally determined enhancement. Thus, the parameters K, Emax and n were optimized in a non-linear least squares algorithm (32). The binding site sizes n determined from these fits were within 10% of the value determined from stoichiometric titrations.

[nu](i) = [Lb(i)/Rt]

The binding density [nu](i) is calculated from the above expression. The McGhee-von Hippel non-cooperative equation is solved for free C protein concentration. A new value for Lb(i) is calculated knowing the binding density at each stage of the titration. The corresponding value for Lt is calculated from

Lt - 1 = Lb + L

Lt - 1 calc(i) is compared with the experimental value Lt exp(i) and [nu](i) is iteratively incremented until the difference between the experimental and calculated Lt values is reduced (<0.01% error). The value of the enhancement observed Eobs(i), which corresponds to the final Lt - 1 calc(i) [i.e. Lt exp(i)] is calculated from the following expression

Eobs(i) = [Lb(i)] × Emax × n/Rt

Therefore Eobs(i) is calculated for a value of Lt exp(i).

Competitor RNAs

Except for r(X)20, r(U)20, r(G)20 and Ad2E1a, the oligonucleotides used in these studies were prepared commercially by Midland Certified Reagent and by Cruachem. The excepted oligonucleotides were prepared at the W.M.Keck Foundation at Yale University. Each oligonucleotide was purified via HPLC prior to use. The sequence of each oligonucleotide is shown in Table 1. The oligonucleotide denoted Ad2 IVS1 is the polypyrimidine tract from IVS1 of the adenovirus-2 major late transcript. The polypyrimidine tract termed Ad2 E1A is from the pre-mRNA encoding the E1A oncoprotein of adenovirus-2 (33). The oligonucleotide denoted U2AF65 consensus was deduced from a series of SELEX-identified winner sequences for the splicing factor U2AF65, known to specifically bind polypyrimidine tracts as an early event is spliceosome assembly (34). The polypyrimidine tract termed B3P3 is present in the second intron of human [alpha]-tropomyosin. It has been found to be a high affinity substrate for PTB (polypyrimidine tract binding protein) (35). The oligonucleotide termed AUUUA repeats is a 20 nt portion of the [Delta]2R1 sequence from the 3[prime]-untranslated region of the mRNA for granulocyte-macrophage colony stimulating factor (GM-CSF) (19). The 20 nt [beta]-globin intronic sequence termed [beta]-intron is present in IVS1 of human [beta]-globin pre-mRNA. This oligonucleotide is not a polypyrimidine tract and it has no known function in splicing (23). The oligonucleotide termed r(U)20 is a 20 nt homoribopolymer of uridine and r(G)20 is a 20 nt homoribopolymer of guanosine. The oligonucleotide r(X)20 is a 20 nt sequence produced under conditions where the nucleotide at each position was randomly chosen (36,37). The oligonucleotide termed winner was identified through the SELEX procedure using bacterially produced C protein. Additionally, this sequence was shown by a filter binding assay to be a high affinity substrate for C protein (21). The oligonucleotides termed winner-G and WinRan are identical to the winner sequence except that the five contiguous uridines were replaced with five guanosines or with UAACC.

Table 1. Apparent affinity of native C protein for various oligonucleotides
Description Sequence Apparent affinity (M-1)
Ad2 IVS1 rGUCCCUUUUUUUUCCACAG/C 6.4 ×105 (± 5.3 × 104)
Ad2 E1a 3[prime] rUGAUUUUUUUAAAAG/GUCCU 4.3 × 105 (± 3.0 × 104)
U2AF65 consensus rUUUUUUCCCCUUUUUUUUCC 7.0 × 105 (± 4.7 × 104)
B3P3 rCUUUCUCUUUCUCUCUCCCU 1.4 × 105 (± 1.6 × 104)
AUUUA repeats rCAUUUAUUUAUUUAUUUAAG 6.3 × 105 (± 2.3 × 104)
[beta]-intron rGAUCACUUGUGUCAACACAG 2.4 × 106 (± 1.1 × 106)
r(X)20   2.4 × 105 (± 1.9 × 104)
r(U)20   6.5 × 105 (± 1.6 × 104)
r(G)20   4.8 × 107 (± 2.4 × 107)
Winner rAGUAUUUUUGUGGA 3.1 × 107 (± 1.3 × 107)
Winner-G rAGUAGGGGGGUGGA 1.1 × 106 (± 4.5 × 104)
WinRan rAGUAUAACCGUGGA 2.1 × 105 (± 3.7 × 104)
Slash marks denote 3[prime] splice sites.

Binding isotherms using competitor RNAs

The concentration of each oligo was calculated using the nearest neighbor approximation method described in Borer et al. (38). For competition assays a control titration of the probe RNA with purified native C protein was carried out in the absence of competitor. To determine if the added oligonucleotides affect probe fluorescence, titrations were also performed with increasing amounts of each oligonucleotide. Titrations were then performed with C protein in the presence of the different oligonucleotide competitors (described in Table 1). The competitor RNAs were used in 11- to 50-fold excess (total nucleotides) depending on the strength of the competitor as a substrate for C protein binding (determined through test titrations).

An expression for the apparent equilibrium constant (Kapp) for the interaction of C protein with each oligonucleotide was derived considering the following equilibria (36,39)

P + [epsis]A [harr] P[epsis]A

and

P + comp [harr] [Pcomp]

where P, [epsis]A and comp are the steady-state concentrations of protein, poly[r([epsis]A)] and competitor RNA respectively. P[epsis]A and Pcomp are the equilibrium concentrations of protein-([epsis]A) or protein-competitor RNA complex. The mass action expression defining the Kapp for the interaction of protein with poly[r([epsis]A)] (KeA) or competitor RNA (Kcomp) are

KeA = [P[epsis]A]/[P]free[[epsis]A]free 1

and

Kcomp = [Pcomp]/[P]free[comp]free 2

In a competition titration, the free protein concentration for both mass action expressions is the same (equations 1 and 2). As a result, both expressions can be equated through [P]free. The solution for Kcomp is shown in equation 3 below.

Kcomp = [Pcomp] × KeA × [[epsis]A]free/[P[epsis]A][comp]free 3

In the above equation KeA was derived from a fit of data from a control titration (no competitor present) using the McGhee-von Hippel non-cooperative binding equation (29).

RESULTS

In the experiments described here the relative affinity of purified native C protein for various RNA substrates was determined through a competition binding assay conducted under equilibrium binding conditions. The fluorescent RNA poly[r([epsis]A)] was used as a `reporter' or probe substrate. In typical experiments the probe RNA (0.7-2.8 µM) was irradiated at 310 nm and the emission spectrum was monitored at 410 nm. In competition binding experiments, the magnitude of fluorescence attenuation in the presence of competitor RNA correlates with C protein binding to, and its relative affinity for, the competitor RNA (see Materials and Methods).

The stoichiometry of the native C protein-poly[r([epsis]A)] interaction at 100 mM salt was determined by titration with highly purified native C protein obtained from exponentially growing HeLa cells. Non-linear regression analyses of the binding isotherm yielded a binding stoichiometry of 260 nt/tetramer (Fig. 1). This binding stoichiometry is similar to that previously estimated through ultrastructural and hydrodynamic studies using various synthetic pre-mRNA substrates (230 ± 30 nt) (6,31). For this reason, the equilibrium binding parameters for the interaction of C protein with the probe substrate were determined by simulation using the McGhee-von Hippel non-cooperative model assuming an RNA binding site size of 230 nt (29). The binding parameters determined through this method are virtually identical to those determined from non-linear regression analysis of the binding isotherms (Fig. 1). This analysis yields an association constant of 5.8 × 106/M for binding of C protein to the probe RNA.


Figure 1. Equilibrium binding parameters for the interaction of native C protein with the fluorescent probe RNA poly[r([epsis]A)] at a concentration of 0.45 µM. This control titration was conducted in buffer A (10 mM Tris, 1 mM EDTA, pH 8.0) at 25°C containing 100 mM NaCl. The equilibrium binding parameters were obtained by simulating theoretical curves (smooth line) to fit the experimental data (open circles) using the McGhee-von Hippel non-cooperative model assuming a site size (n) of 230 nt. Through this method, the intrinsic association for the interaction of native C protein (C1)3C2 with poly[r([epsis]A)] is 5.8 × 106/M. The binding parameters were further defined through a non-linear regression analysis of the experimental data. In close agreement with the theoretical curve, this analysis yielded an intrinsic affinity of 5.6 × 106/M and a binding site size of 260 nt.

C protein displays no significant affinity for U-rich polypyrimidine tracts

To determine the relative binding affinity of native C protein for short target sequences under equilibrium conditions, a competition binding assay was utilized. The RNAs used as competitor substrates in these experiments are shown in Table 1 and described in Materials and Methods. To more definitively compare the relative strengths of these oligos as substrates for C protein binding, an 11- to 50-fold excess of competitor to probe (in nucleotides) was used in the binding assays. Through this approach, binding isotherms generated in the presence of competitor RNAs are less likely to superimpose on the probe curve and subtle differences in weak binding affinities are amplified. The control titration (with no competitor) is apparent as the upper solid tracing in Figures 2 and 4-7. In addition, a 20 nt [beta]-globin intronic sequence that lacks any known splicing signal (40) and a randomly generated 20 nt sequence r(X)20 were used as competitor controls (Table 1). In these experiments binding to competitor is seen as an attenuation of fluorescence enhancement from the probe RNA. In other words, as C protein binds the competitor RNA in preference to the probe, the amplitude of the binding isotherm is attenuated. As described in Materials and Methods, binding isotherms obtained in the competition binding assay were used to extrapolate relative binding affinities for each competitor sequence. Binding affinities are denoted Kcomp, as they refer to the relative affinity of C protein for the respective competitor RNAs.


Figure 2. Determination of the relative binding affinity of C protein for the [beta]-globin intronic sequence (open circles), for two adenoviral polypyrimidine tracts (Ad2 IVS1, open triangles; Ad2 E1a, open diamonds) and for the randomly generated oligonucleotide r(X)20 (open squares). The solid tracing shows the binding isotherm for the control titration (probe RNA only). The tracings of the competitor titrations here and in Figures 4-7 are interpolations through the data points. In these competition binding experiments the probe RNA was 0.48 µM. Concentrations of the competitor RNAs were as follows: [beta]-globin, 25.0 µM; Ad2 IVS1, 23.1 µM; Ad2 E1A, 23.5 µM; r(X)20, 26.7 µM.

To determine if C protein displays an enhanced affinity for bona fide polypyrimidine tracts, competition titrations were conducted with polypyrimidine tracts from IVS1 of the adenovirus-2 major late transcript, from the adenovirus-2 oncoprotein E1A 3[prime] splice site (33) and from IVS2 of human [alpha]-tropomyosin (the B3P3 element) (35). Also tested was the consensus polypyrimidine tract deduced from the sequence of several SELEX-identified winner sequences for U2AF65 binding (21). In some experiments a 20 nt oligo(U) sequence [r(U)20] was also used as a competitor. In all experiments the 20 nt [beta]-globin intronic sequence (described above) and randomly generated r(X)20 were used as competitor controls.

Figure 2 shows the binding isotherms generated with the two adenoviral polypyrimidine tracts. The histogram shown in Figure 3 graphically presents the relative affinities of C protein for the B3P3 sequence of [alpha]-tropomyosin and the winner sequence for U2AF65 binding and other substrates. The binding isotherms generated with the B3P3 sequence of [alpha]-tropomyosin and the winner sequence for U2AF65 binding are shown in Figure 4. As revealed in Figures 2 and 4 (tabulated in Table 1), in comparison with the four polypyrimidine tracts tested, C protein binds with highest apparent affinity (Kcomp = 2.4 × 106/M) to the [beta]-globin intronic sequence that does not possess a polypyrimidine tract and that has no postulated role in C protein binding or spliceosome assembly. C protein binds all of the 20 nt polypyrimidine tracts tested with affinities between 1.4 × 105 and 6.9 × 105/M.


Figure 3. Histogram of the relative affinity of native C protein for the various competitor RNAs used in these studies. Note that the histograms denoting the relative affinities for r(G)20 and for the C protein winner sequence have been truncated (see scale at left), thus the apparent magnitude of their difference from other competitor RNAs is greatly under-represented.


Figure 4. Determination of the relative binding affinity of C protein for the [beta]-globin intronic sequence (open circles), for the U2AF65 `winner' sequence (open triangles), for the randomly generated oligonucleotide r(X)20 (open squares) and for the B3P3 sequence (open diamonds). The solid tracing represents the binding isotherm for the control titration (probe RNA only). In these competition binding experiments the probe RNA was 0.48 µM. Concentrations of the competitor RNAs were as follows: [beta]-Globin, 25.0 µM; U2AF65 winner, 23.4 µM; r(X)20, 26.7 µM; B3P3, 23.2 µM.

It has previously been suggested from UV crosslinking and immunoprecipitation experiments that C protein binds a 3[prime]-untranslated region in the mRNA for GM-CSF (19) and for the [beta]A4 amyloid protein (25). Presumably this specificity is due to interactions between C protein and repeats of an AUUUA sequence element. To determine if C protein displays enhanced affinity for this sequence under equilibrium conditions, a 20 nt region of the [Delta]2R1 oligonucleotide (possessing the AUUUA repeats) described in Hamilton et al. (19; Table 1) was used as a competitor in the equilibrium binding assay. The binding isotherm for this oligonucleotide reveals an affinity of 6.3 × 105/M, which is less than the affinity of C protein for the [beta]-globin intronic sequence (2.4 × 106/M) and is not significantly greater than (~2-fold) the affinity of C protein for the randomly generated sequence r(X)20 (2.4 × 105/M) (Figs 3 and 5 and Table 1).


Figure 5. Determination of the relative affinity of C protein for the [beta]-globin intronic sequence (open circles), the randomly generated sequence r(X)20 (open squares) and the oligonucleotide possessing AUUUA repeats (open diamonds). The solid tracing represents the binding isotherm for the control titration (probe RNA alone). In these competition binding experiments the probe RNA was 0.48 µM. Concentrations of the competitor RNAs were as follows: [beta]-globin, 25.0 µM; r(X)20, 26.7 µM; AUUUA repeats, 23.3 µM.

The affinity of C protein for a SELEX-identified winner sequence is not solely due to contiguous uridines

Using the selection and amplification procedure of Tuerk and Gold (41) recombinant C protein was found to select a group of oligonucleotides from an infinite pool of randomly generated sequences that all possess at least one group of five or more contiguous uridines (21). From these findings it was assumed that this feature is the determinant of the affinity of C protein for the selected oligonucleotides. To determine if native C protein binds a SELEX-identified `winner' sequence under equilibrium conditions with especially high affinity, the winner shown in Table 1 was utilized in the competition binding assay. In the filter binding studies of Gorlach et al. (21) it was found that C protein binds this oligonucleotide with high affinity. To gain further insight into the role of contiguous uridines in C protein binding, variants of the winner sequence lacking contiguous uridines were also tested, as well as 20 nt homoribopolymers of uridine and guanosine. In Figures 3, 6 and 7 it can be seen that C protein binds the winner sequence under equilibrium conditions with an affinity of3.1 × 107/M, which is 60- to 200-fold higher than its affinity for bona fide polypyrimidine tracts possessing five or more contiguous uridines. In fact, C protein binds the winner sequence with a 47-fold higher affinity than it binds r(U)20. Furthermore, the affinity of C protein for r(U)20 (pure polypyrimidine) is 3-fold less than its affinity for the 20 nt intronic sequence of [beta]-globin and not significantly different from those for the U-rich U2AF65 winner sequence, the randomly generated 20mer and the various polypyrimidine sequences tested (Table 1 and Figs 2 and 4). It can also be seen that in a variant of the winner oligonucleotide (contiguous uridines replaced with GGGGG) the binding affinity is significantly attenuated, but remains 2- to 7-fold higher than for bona fide polypyrimidine tracts and ~1.5-fold higher than for r(U)20 (Fig. 7).


Figure 6. Determination of the relative affinity of C protein for the [beta]-globin intronic sequence (open circles), for r(G)20 (open diamonds), for r(U)20 (open triangles) and for r(X)20 (open squares). The solid tracing represents the binding isotherm for the control titration (probe RNA alone). In these competition binding experiments the probe RNA was 0.48 µM. Concentrations of the competitor RNAs were as follows: [beta]-globin, 25.0 µM; r(G)20, 25.5 µM; r(U)20, 24.0 µM; r(X)20, 26.7 µM.


Figure 7. Determination of the relative affinity of C protein for a SELEX-identified winner sequence (open circles), for the winner sequence containing five contiguous G residues (open diamonds) and for the winner sequence possessing the sequence UAACC in place of the five contiguous uridines (open squares). The solid tracing represents the binding isotherm for the control titration (probe RNA only). In these competition binding experiments the probe RNA was 0.48 µM. Concentrations of the competitor RNAs were as follows: winner sequence, 5.2 µM; winner-G, (24.4 µM); winner-UAACC, 23.2 µM.

DISCUSSION

The findings described above demonstrate that under equilibrium conditions native C protein does not bind with significantly enhanced affinity to short oligonucleotide sequences previously suggested to function as target sequences for C protein binding in vivo. Namely, to the uridine-rich polypyrimidine tracts of introns (17,21,23,42) or to AUUUA repeats present in the 3[prime]-untranslated regions of some mRNAs (19,25). As summarized in Figure 3 and Table 1, it can be seen that C protein binds with similar affinities to a randomly generated oligonucleotide sequence, a short homoribopolymer of uridine [poly(U)20], an oligonucleotide possessing four contiguous AUUUA repeats and to four uridine-rich polypyrimidine tracts. Of the naturally occurring oligonucleotides tested, C protein displayed its highest affinity for a 20 nt intronic sequence with no known function in RNA splicing. Native C protein does, however, bind a SELEX-identified winner sequence with a 60- to 200-fold higher affinity than for polypyrimidine sequences. This observation is consistant with the ability of selection and amplification methodologies to identify high affinity substrates in pools typically containing 1014 different oligonucleotides and it demonstrates the ability of the binding assay used in these studies to accurately detect significant differences in binding affinities under equilibrium conditions. The observation that native C protein binds poly(G)20 with an affinity similar to or greater than the winner sequence suggests that the selection/amplification methodology may, for several reasons, under-represent especially G-rich substrates. It also suggests that C protein binds poly(G)20 and the winner sequence through a mechanism that may not be physiologically relevant. On the latter point, it has recently been shown that a SELEX-identified winner sequence for hnRNP A1 (40) exists in solution as an oligomeric complex with a unique secondary structure and that the ability of A1 to select the winner sequence from randomized controls is primarily dependent on ionic interactions in the C-terminus and not with the canonical RRMs of A1 (36). In more recent studies it has also been observed that, except for the SELEX winner, hnRNP A1 displays no enhanced affinity for previously suggested target sequences (37).

In the SELEX studies on C protein it was reported that each of 28 SELEX-identified oligonucleotides possessed at least one group of five or more contiguous uridines and it was suggested that this element was the determinant of C protein binding (21). However, several observations described here indicate that contiguous uridines do not alone define a high affinity substrate for C protein nor do they alone define the high affinity interaction of C protein with the winner sequence. First, C protein has no enhanced affinity under equilibrium conditions for bona fide polypyrimidine tracts possessing runs of five or more uridines nor does it display enhanced affinity for poly(U)20. Secondly, C protein binds poly(G)20 with essentially the same affinity as it binds the winner sequence. Finally, when the five U residues are replaced with G residues there is a several fold drop in affinity yet C protein continues to bind the variant with affinities similar to or slightly higher than its affinities for the uridine-rich substrates tested. Taken together, these findings suggest that the winner oligos exist as unique structures with steric nucleotide positionings that optimize C protein interactions with short RNAs in solution. The physiological relevance of this interaction remains unclear because bona fide polypyrimidine sequences (possessing five or more contiguous uridylates) and other uridine-rich oligos are not preferentially bound by C protein under equilibrium conditions. It is also possible that unique RNA secondary structures in pools of short oligonucleotides may be absent in longer pre-mRNAs containing the same sequence elements. In this context, the winner sequence does not correspond to known repeated sequence elements in pre-mRNAs.

A finding perhaps relevant to the above discussion is the recent report that C protein binds the 5[prime] stem-loop of U2 snRNA (20). In this study it was found that elements within the stem structure and the four contiguous uridines in the loop region both function in binding. Also, in this context, it has been shown that the high affinity and salt-resistant RNA binding properties of C protein are not due to its N-terminal RRM but to a highly basic region preceding a leucine zipper motif (a bZIP-like motif, termed the bZLM) (24). It is therefore possible that C protein may bind to pre-mRNAs in a sequence-independent manner through its bZIP-like motif and to short RNAs possessing unique structure (i.e. snRNA) through its canonical RRM. The previous report that C protein binds poly(U)14 with a significantly higher affinity than the SELEX winner (21) is in sharp contrast to the results described here. It should be pointed out that the competition binding assay used here evaluates the relative ability of suggested target oligonucleotides to compete as substrates with a pre-mRNA length probe RNA. As shown previously, the SELEX winner is bound with high affinity by the bZLMs of C protein but not by its N-terminal RRM (14). It is therefore possible that poly(U)14 is bound with high affinity by the RRM of C protein (43) but that it competes poorly with long RNA substrates bound mostly by the tetramer bZLMs.

Previous conclusions that C protein specifically binds short uridine-rich sequences in pre-mRNA may in part be due to methodological phenomena. For example, it has been observed by several groups that, in comparison with other hnRNPs, C protein readily crosslinks to RNA upon brief UV irradiation at sites possessing one or more uridines (5,19,20,44-47). This, however, would be expected if C protein is distributed along the length of nascent transcripts (through cooperative binding) and if uridine is especially photoreactive. The latter is well documented, as uridine is ~100-fold more labile to UV activation than are the other ribonucleotides (48-50). Estimates of binding preference based on salt-resistant dissociation from immobilized homoribopolymers are problematical because these approaches primarily evaluate differences in intrinsic affinity and in the electrostatic component of particular interactions. Additionally, cooperative binding cannot be evaluated through non-equilibrium approaches and studies conducted with substrates substantially shorter than n (the ligand binding site) are not likely to reflect the physiologically relevant binding equilibrium. In this context, we have utilized the r(U)20 and `winner' oligos described above as control substrates in a separate study to characterize the interaction of native C protein with various snRNAs (51). C protein was observed to bind snRNA substrates with affinities proportional to substrate length and at molar RNA concentrations 500-fold lower than were required in the present studies to detect binding to short oligonucleotides. More specifically, in conventional band shift assays binding of C protein to the oligonucleotides used here could not be detected (51).

In addition to the findings described above that question the ability of C protein to identify uridine-rich sequences in pre-mRNAs, other findings point to the likely function of C protein as a sequence-independent RNA chaperonin that orders pre-mRNA and nucleates hnRNP assembly. Previous ultrastructural and hydrodynamic studies on native C protein and previous equilibrium binding studies on recombinant homo C14 and C24 demonstrate that a single tetramer binds ~230 nt of RNA and that three tetramers fold 700 nt increments of RNA into a unique triangular structure that nucleates correct 40S core particle assembly (6,14,31). It has been estimated that a typical HeLa cell contains ~22 × 106 molecules of C protein (11) and that there are ~0.4 × 106 nascent transcripts with average lengths between 6 and 10 kb. Because almost all C protein is nuclear (52), the molar concentration of C protein in the nucleus (12 µm spherical diameter) should be >10 µM. At these histone-like concentrations there is far more C protein in the nucleus than necessary to saturate the pool of pre-mRNAs with an occluded site size of 230 nt.

C protein is a highly stable tetramer possessing at least eight RNA binding motifs (24). While this occurrence may be well suited for binding and ordering long lengths of RNA, it complicates arguments for specificity for short sequences. More specifically, an enhanced affinity for a short target sequence must be sufficiently high to override sequence-independent interactions at other sites in the oligomer and to negate cooperative binding. It has been estimated that, for a lattice binding ligand, its sequence-specific binding mode must minimally be an order of magnitude higher than its non-specific interactions, if the latter are to possess physiological relevance (29). This is especially significant in the case of an oligomer binding a lattice that, unlike DNA, lacks spatially repeating sequence elements.

ACKNOWLEDGEMENT

Supported by NIH grant GM 48567 to W.M.L.

REFERENCES

1. Samarina,O.P. (1996) BioEssays, 18, 595-601.

2. Beyer,A.L. and Osheim,Y.N. (1990) In Strauss,P.R. and Wilson,S.H. (eds), The Eukaryotic Nucleus: Molecular Biochemistry and Macromolecular Assemblies. Telford Press, Caldwell, NJ, Vol. 2.

3. McAfee,J.G., Huang,M., Soltaninassab,S., Rech,J.E., Iyengar,S. and LeStourgeon,W.M. (1997) In Krainer,A. (ed.), Frontiers in Molecular Biology: Eukaryotic mRNA Processing. Oxford University Press, New York, NY.

4. Barnett,S.F., Theiry,T.A. and LeStourgeon,W.M. (1991) Mol. Cell. Biol., 11, 864-871. MEDLINE Abstract

5. LeStourgeon,W.M., Barnett,S.F. and Northington,S.J. (1990) In Strauss,P.R. and Wilson,S.H. (eds), The Eukaryotic Nucleus: Molecular Biochemistry and Macromolecular Assemblies. Telford Press, Caldwell, NJ, Vol. 2.

6. Huang,M., Rech,J.E., Northington,S.J., Flicker,P.F., Mayeda,A., Krainer,A.R. and LeStourgeon,W.M. (1994) Mol. Cell. Biol., 14, 518-533. MEDLINE Abstract

7. Harris,S.G., Martin,T.E. and Smith,H.C. (1988) Mol. Cell. Biochem., 84, 17-28. MEDLINE Abstract

8. Harris,S.G., Hoch,S.O. and Smith,H.C. (1988) Biochemistry, 27, 4595-4600. MEDLINE Abstract

9. Lothstein,L., Arenstorf,H.P., Chung,S., Walker,B.W., Wooley,J.C. and LeStourgeon,W.M. (1985) J. Cell Biol., 100, 1570-1581. MEDLINE Abstract

10. LeStourgeon,W.M., Lothstein,L., Walker,B.W. and Beyer,A.L. (1978) In Busch,H. (ed.), The Cell Nucleus: Chromatin, Part C. Academic Press, New York, NY.

11. Kiledjian,M., Burd,C.G., Gorlach,M., Portman,D.S. and Dreyfuss,G. (1994) In Mattaj,I. and Nagai,K. (eds), Frontiers in Molecular Biology. Oxford University Press, Oxford, UK.

12. Conway,G., Wooley,J., Bibring,T. and LeStourgeon,W.M. (1988)Mol. Cell. Biol., 8, 2884-2895. MEDLINE Abstract

13. Wilk,H.E., Angeli,G. and Schafer,K.P. (1983) Biochemistry, 22, 4592-4600. MEDLINE Abstract

14. McAfee,J.G., Soltaninassab,S.R., Lindsay,M.E. and LeStourgeon,W.M. (1996) Biochemistry, 35, 1212-1222. MEDLINE Abstract

15. Bennett,M., Michaud,S., Kingston,J. and Reed,R. (1992) Genes Dev., 6, 1986-2000. MEDLINE Abstract

16. Swanson,M.S. and Dreyfuss,G. (1988) Mol. Cell. Biol., 8, 2237-2241. MEDLINE Abstract

17. Swanson,M.S. and Dreyfuss,G. (1988) EMBO J., 7, 3519-3529. MEDLINE Abstract

18. Wilusz,J., Feig,D.I. and Shenk,T. (1988) Mol. Cell. Biol., 8, 4477-4483. MEDLINE Abstract

19. Hamilton,B.J., Nagy,E., Malter,J.S., Arrick,B.A. and Rigby,W.F. (1993) J. Biol. Chem., 268, 8881-8887. MEDLINE Abstract

20. Temsamani,J. and Pederson,T. (1996) J. Biol. Chem., 271, 24922-24926. MEDLINE Abstract

21. Gorlach,M., Burd,C.G. and Dreyfuss,G. (1994) J. Biol. Chem., 269, 23074-23078. MEDLINE Abstract

22. Choi,Y.D., Grabowski,P.J., Sharp,P.A. and Dreyfuss,G. (1986) Science, 231, 1534-1539. MEDLINE Abstract

23. Burd,C.G. and Dreyfuss,G. (1994) Science, 265, 615-621. MEDLINE Abstract

24. McAfee,J.G., Shahied-Milam,L., Soltaninassab,S.R. andLeStourgeon,W.M. (1996) RNA, 2, 1139-1152. MEDLINE Abstract

25. Zaidi,S.H. and Malter,J.S. (1995) J. Biol. Chem., 270, 17292-17298. MEDLINE Abstract

26. Barnett,S.F., LeStourgeon,W.M. and Friedman,D.L. (1988) J. Biochem. Biophys. Methods, 16, 87-97. MEDLINE Abstract

27. Huang,M. and LeStourgeon,W.M. (1994) In Celis,J.E. (ed.), Handbook of Cell Biology I. Academic Press, New York, NY.

28. Steiner,R.F., Kinner,W., Lunasin,A. and Delac,J. (1973)Biochim. Biophys. Acta, 294, 24-37.

29. McGhee,J.D. and von Hippel,P.H. (1974) J. Mol. Biol., 86, 469-489. MEDLINE Abstract

30. Kowalczykowski,S.C., Paul,L.S., Longerg,N., Newport,J.W., McSwiggen,J.A. and von Hippel,P.H. (1986) Biochemistry, 25, 1226-1240. MEDLINE Abstract

31. Rech,J.E., Huang,M.H., LeStourgeon,W.M. and Flicker,P.F. (1995)J. Struct. Biol., 114, 84-92. MEDLINE Abstract

32. Bevington,P.R. and Robinson,D.K. (1992) Data Reduction and Error Analysis for the Physical Sciences. McGraw-Hill, New York, NY.

33. Mount,S.M. (1982) Nucleic Acids Res., 10, 459-472. MEDLINE Abstract

34. Singh,R., Valcarcel,J. and Green,M.R. (1995) Science, 268, 1173-1176. MEDLINE Abstract

35. Mullen,M.P., Smith,C.W.J., Patton,J.G. and Nadal-Ginard,B. (1991)Genes Dev., 5, 642-655. MEDLINE Abstract

36. Abdul-Manan,N., OMalley,S.M. and Williams,K.R. (1996) Biochemistry, 35, 3545-3554. MEDLINE Abstract

37. Abdul-Manan,N. and Williams,K.R. (1996) Nucleic Acids Res., 24, 4063-4070. MEDLINE Abstract

38. Borer,P.N. (1975) In Fasmann,G.D. (ed.), Handbook of Biochemistry and Molecular Biology. CRC Press, Cleveland, OH.

39. Amrute,S.B., Abdul-Manan,Z., Pandey,V., Williams,K.R. and Modak,M.J. (1994) Biochemistry, 33, 8282-8291. MEDLINE Abstract

40. Burd,C.G. and Dreyfuss,G. (1994) EMBO J., 13, 1197-1204. MEDLINE Abstract

41. Tuerk,C. and Gold,L. (1990) Science, 249, 505-510. MEDLINE Abstract

42. Gorlach,M., Burd,C.G., Portman,D.S. and Dreyfuss,G. (1993)Mol. Biol. Rep., 18, 73-78. MEDLINE Abstract

43. Gorlach,M., Wittekind,M., Beckman,R.A., Mueller,L. and Dreyfuss,G. (1992) EMBO J., 11, 3289-3295. MEDLINE Abstract

44. Sebillion,P., Beldjord,C., Kaplan,J.C., Brody,E. and Marie,J. (1995) Nucleic Acids Res., 23, 3419-3425.

45. Moore,C.L., Chen,J. and Whoriskey,J. (1988) EMBO J., 7, 3159-3167. MEDLINE Abstract

46. Wilusz,J. and Shenk,T. (1990) Mol. Cell. Biol., 10, 6397-6407. MEDLINE Abstract

47. Santoro,B., DeGregorio,E., Caffarelli,E. and Bozzoni,I. (1994)Mol. Cell. Biol., 14, 6975-6982. MEDLINE Abstract

48. Wang,K. and Richard,F. (1974) J. Biol. Chem., 249, 375-389.

49. Smith,K.C. (1969) Biochem. Biophys. Res. Commun., 34, 354-357. MEDLINE Abstract

50. Smith,K.C. and Aplin,R.T. (1966) Biochemistry, 5, 2125-2130. MEDLINE Abstract

51. Shahied-Milam,L., Soltaninassab,S.R., Iyer,G.V. and LeStourgeon,W.M. (1998) J. Biol. Chem., in press.

52. Pinol-Roma,S. and Dreyfuss,G. (1992) Nature, 355, 730-732. MEDLINE Abstract


*To whom correspondence should be addressed. Tel: +1 615 322 2588; Fax: +1 615 343 6707; Email: lestouwm@ctrvax.vanderbilt.edu


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 2 Jul 1998
Copyright©Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
J. E. Brunner, J. H. C. Nguyen, H. H. Roehl, T. V. Ho, K. M. Swiderek, and B. L. Semler
Functional Interaction of Heterogeneous Nuclear Ribonucleoprotein C with Poliovirus RNA Synthesis Initiation Complexes
J. Virol., March 15, 2005; 79(6): 3254 - 3266.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
J. H. Kim, K. Y. Paek, K. Choi, T.-D. Kim, B. Hahm, K.-T. Kim, and S. K. Jang
Heterogeneous Nuclear Ribonucleoprotein C Modulates Translation of c-myc mRNA in a Cell Cycle Phase-Dependent Manner
Mol. Cell. Biol., January 15, 2003; 23(2): 708 - 720.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
M. Holcik, B. W. Gordon, and R. G. Korneluk
The Internal Ribosome Entry Site-Mediated Translation of Antiapoptotic Protein XIAP Is Modulated by the Heterogeneous Nuclear Ribonucleoproteins C1 and C2
Mol. Cell. Biol., January 1, 2003; 23(1): 280 - 288.
[Abstract] [Full Text]


Home page
Mol. Cell. Biol.Home page
V. Markovtsov, J. M. Nikolic, J. A. Goldman, C. W. Turck, M.-Y. Chou, and D. L. Black
Cooperative Assembly of an hnRNP Complex Induced by a Tissue-Specific Homolog of Polypyrimidine Tract Binding Protein
Mol. Cell. Biol., October 15, 2000; 20(20): 7463 - 7479.
[Abstract] [Full Text]


Home page
Mol. Cell. Biol.Home page
D. J. Williamson, S. Banik-Maiti, J. DeGregori, and H. E. Ruley
hnRNP C Is Required for Postimplantation Mouse Development but Is Dispensable for Cell Viability
Mol. Cell. Biol., June 1, 2000; 20(11): 4094 - 4105.
[Abstract] [Full Text]


Home page
Mol. Cell. Biol.Home page
O. Sella, G. Gerlitz, S.-Y. Le, and O. Elroy-Stein
Differentiation-Induced Internal Translation of c-sis mRNA: Analysis of the cis Elements and Their Differentiation-Linked Binding to the hnRNP C Protein
Mol. Cell. Biol., August 1, 1999; 19(8): 5429 - 5440.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (108K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Soltaninassab, S. R.
Right arrow Articles by LeStourgeon, W. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Soltaninassab, S. R.
Right arrow Articles by LeStourgeon, W. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?