| Nucleic Acids Research | Pages |
Evolutionarily conserved structural features in the ITS2 of mammalian pre-rRNAs and potential interactions with the snoRNA U8 detected by comparative analysis of new mouse sequences Introduction
Materials And Methods
Polymerase chain reaction (PCR) and DNA sequencing
Sequence alignment and secondary structure prediction
Results
Discussion
Preserved structural features in the ITS2 folding of mammals
Potential functional implications
U8: a chaperone for 32S pre-rRNA folding in vertebrates?
Acknowledgements
References
Evolutionarily conserved structural features in the ITS2 of mammalian pre-rRNAs and potential interactions with the snoRNA U8 detected by comparative analysis of new mouse sequences
ABSTRACT
INTRODUCTION
In most eukaryotic organisms, the mature rRNA sequences (SSU rRNA, 5.8S and LSU rRNA) are transcribed in the nucleolus in the form of a single large precursor in which the different mature rRNA sequences are separated by spacer regions, 5[prime]ETS, ITS1, ITS2 and 3[prime]ETS. Nucleolar processing of the large pre-rRNA transcript involves an elaborate pathway of endo- and exonucleolytic cleavages taking place on a precursor RNA assembled with ribosomal proteins into an RNP particle of very complex structure. Maturation of pre-rRNA also includes the formation of a complex pattern of nucleoside modifications on the mature sequences of the precursor, most of them corresponding to 2[prime]-O ribose methylations and pseudouridylations. Throughout ribosome biogenesis, the pre-ribosomal RNP particle transiently associates with scores of snoRNAs (small nucleolar RNAs), only a minority of which are required for definite pre-rRNA cleavages (1). Most snoRNAs guide the two major types of rRNA nucleoside modifications through transient base-pairing with pre-rRNA at the modification site (2-4). While pre-rRNA processing has been extensively studied in various organisms, including Tetrahymena, Xenopus, mouse, rat and human, the major advances on the question have been achieved in yeast Saccharomyces cerevisiae. Through the development of elegant in vivo systems involving a powerful combination of biochemical and genetic approaches, a wealth of detailed information is now available in yeast, both on the cis-acting elements and the trans-acting factors required at definite stages of pre-rRNA processing (5). Although the biological role of the ITS spacers is not well understood, the utilization of the yeast model has definitely shown their importance for a faithful production of mature rRNAs. It has also revealed that processings of the different spacers are tightly coupled. Thus, deletions in the 3[prime]ETS affect the processing of both the ITS2 and 5.8S rRNAs (6,7), while the integrity of a stem-loop within this spacer is required for internal cleavages taking place within ITS1 (8). Moreover, a deletion within ITS2 not only prevents biogenesis of the large subunit but also affects the maturation of small-subunit rRNA (9). Finally, efficient production of 25S rRNA requires ITS1 sequences (10,11). The processing steps which have been best studied so far are those leading to the production of SSU-rRNA, which requires four essential snoRNAs, U3, U14, snR30 and MRP (12). In contrast, the molecular mechanisms by which ITS2, a eukaryote-specific insertion, is removed and the mature 3[prime] end of 5.8S and 5[prime] end of 28S rRNA are generated remain largely elusive. ITS2 sequences exhibit a high rate of size and sequence variation during evolution of eukaryotes, which has generally hampered the derivation of reliable secondary structure models by comparative analysis. Yeast S.cerevisiae ITS2 is the only specimen for which the secondary structure has been unambiguously resolved, through a combination of chemical and enzymatic probings, minimum energy modeling, genetic experiments and phylogenetic analyses (13,14).
Among distant yeast species, while its size may vary substantially (66-240 nt) and its conserved sequences are restricted to a few short tracts, ITS2 can fold into a largely similar secondary structure (15). In vivo mutational analysis guided by phylogenetic comparisons has pointed to the crucial importance of some of the most conserved structural elements for proper ITS2 excision and biogenesis of active ribosomal subunits (16). Vertebrate ITS2 do not exhibit any sequence homology with yeast ITS2. They are generally much longer, exhibiting dramatic variations in size, from 262 nt in Xenopus laevis (17) to >1000 nt in mammals. The rate of mammalian ITS2 sequence variation is so high that even among species as closely related as rat and mouse, only a few short tracts of significant sequence similarities can be detected (18,19).
Our present knowledge about the excision of ITS2 suggests major differences between yeast and vertebrate mechanisms. In both yeast and vertebrates, the mature 3[prime]-terminus of 5.8S rRNA and 5[prime]-terminus of LSU rRNA are formed by endonucleolytic cleavage(s) in ITS2 followed by action of exonucleases. However, the two groups of organisms differ in the number of endonucleolytic cleavages involved. In mammals, two precursor intermediates to 5.8S rRNA have been identified, the 8S and 12S pre-rRNAs (20-22) corresponding to a 5.8S rRNA still linked to an ITS2 extension, of 118 and 581 nt, respectively. In contrast, only one internal cleavage has been detected in yeast, corresponding to the accumulation of a pre-5.8S rRNA processing intermediate, termed 7S, containing an ITS2 extension of ~140 nt in S.cerevisiae (23) and ~100 nt in Schizosaccharomyces pombe (24). Moreover, while in X.laevis snoRNA U8 is required for cleavages at the 5[prime] end of 5.8S and 3[prime] end of 28S, through mechanisms which remain unclear (25), yeast cells do not seem to contain a U8 snoRNA homolog.
A detailed knowledge of the ITS2 folding should provide the basis for further analyzing ITS2 function in ribosome biogenesis and the molecular mechanisms involved in its excision from pre-rRNA. Since a first attempt based on the sole mouse-rat comparison (18), no ITS2 folding model based on the comparative approach has been proposed for mammals, due to the lack of other related sequences. With this objective in mind, we have sequenced the ITS2 of four new species belonging to the Mus genus, in order to derive a set of sequences sufficiently related to each other, i.e. appropriate for a comparative approach of their secondary structure. By a combination of minimum energy models and phylogenetic comparisons, we have identified a set of structural features, conserved not only among rodents but also in humans, which are likely to contain important functional elements of the ITS2, particularly for its excision. Finally, we have used this novel information on ITS2 folding to test, on a comparative basis among distant vertebrates, the potential for its base-pairing interactions with the snoRNA U8, and have identified a phylogenetically supported base-pairing that could play a key role in the excision of ITS2.
MATERIALS AND METHODS
Polymerase chain reaction (PCR) and DNA sequencing
Tissues from the following four mouse species belonging to the Mus genus, Mus cervicolor, Mus caroli, Mus (Coelomys) pahari and Mus (Pyromis) platythrix, were kindly provided by Dr F. Bonhomme (Université Montpellier 2). ITS2 sequences were determined on cloned PCR fragments following amplification of genomic DNA. In the first step, primers spanning conserved sequences at the 5[prime] end of 28S rRNA (LH3_reverse: 3[prime]-TCCTTTTCTTTGATTGGTCCTAAGCCTAGGCCG-5[prime]) and 3[prime] end of 5.8S rRNA (LH1_forward: 3[prime]-TTCGCTGCGAGTCTGTCCGCATCGCTTAAGATA-5[prime]) were used. PCR conditions were set up according to the recommendations of the Vent DNA polymerase supplier (Biolabs) using 0.2-0.5 ng genomic DNA (typical thermocycling profile: 30 cycles with 1 min at 98°C, 1 min at 55°C and 2 min at 80°C). PCR products were digested by EcoRI and BamHI before cloning in pTZ18R. Automated sequencing (ALF) was performed on double-strand plasmid DNA, using the Pharmacia PL AutoRead Kit and universal sequencing primers. To complete the sequencing of the central region of the ITS2, three additional primers, selected within portions of the ITS2 sequence conserved between the different Mus species: (8S_forward: 3[prime]-CAGACTTGAAGCCCTCTGCC-5[prime]; 5.8S+340_forward: 3[prime]-CCAAGGGTGCCCCGTGCCCT-5[prime] and 12S_reverse: 3[prime]-AGGCCGCGGCCGGGCGCCCT-5[prime]), were used in sequencing reactions performed in the presence of fluorescein-15-dATP or dUTP (Boehringer Mannheim) according to the supplier's protocols. Likewise, sequence ambiguities at the 3[prime] end of ITS2 were solved by utilization of an additional sequencing primer at the 5[prime] end of 28S (28S-b_reverse: 3[prime]-CGCTGGAGTCTAGTCTGCAC-5[prime]). Both strands were sequenced at least twice for each species. The sequences have been deposited at the EMBL data library under the accession numbers AJ132552 (M.cervicolor), AJ132553 (M.caroli), AJ132554 [M.(C)pahari], AJ132555 [M.(P)platythrix].
Sequence alignment and secondary structure prediction
In order to achieve the best phylogenetically supported folding models, we used an iterative method in which an alignment initially performed using the program Multalin (26) was progressively optimized according to secondary structure homologies. Optimal and suboptimal foldings were predicted for each sequence on a thermodynamic basis (27) using the software package of the Genetic Computer Group, University of Wisconsin (28). Proposed foldings were compared and solutions maximizing structural homologies retained and checked for the presence of compensatory changes. The alignments were then refined and new foldings derived from the improved alignments. Finally, we created a databank incorporating secondary structure informations in the alignment file (29), to search for potential pseudoknots and derive consensus secondary structures. Secondary structure drawings were produced using ESSA (30). Complementarities between U8 snoRNA and 32S pre-rRNA were searched with both the GCG package and the ESSA program. Alignments and foldings not presented here are available upon request.
RESULTS
The length of the ITS2 sequence exhibits significant variation among the four additional species of the Mus genus analyzed in this study, ranging from 1080 to 1219 nt [M.cervicolor, 1080 nt; M.(C)pahari, 1094 nt; M.caroli, 1102 nt; M.(P)platythrix, 1219 nt]. However, the values remain collectively relatively close to the size of the Mus musculus ITS2, 1089 nt (18), which is substantially larger than the 765 nt rat homolog (31). All the novel Mus ITS2 share the same highly biased base content noticed for the M.musculus sequence (18), i.e. very high in GC (~75%) and very low in A (~6.5%). As shown in Figure
![]() |
![]() |
Figure 1. Alignment of the ITS2 sequences for the five species of the Mus genus. Nucleotides identical with M.musculus, which serves as a reference, are shown by hyphens (stars refer to gaps). Conserved segments (longer than 13 nt and exhibiting at least 80% homology as compared to the M.musculus sequence) are delineated by a green underline and termed a-n. Nucleotides involved in conserved base pairings are boxed by a thin line with the two strands of each stem delineated by a pair of numbered arrows in opposite orientation. The four stems identified by yellow boxes, and numbered I-IV, form the conserved basis, as depicted in Figure 2, of the four independent domains of ITS2 secondary structure which extend into more variable regions D.I-D.IV (denoted by red dotted boxes). Conserved stems numbered 1-7 are located within the more variable regions, as depicted in Figures 3 and 4.
At the secondary structure level, we have identified 11 stems, termed I-IV and 1-7, that are conserved among the five Mus species (Fig.
Figure 2. A conserved core of secondary structure delineates four regions. Filled circles denote nucleotide positions that vary among the five mouse sequences. Pairings supported by compensatory changes are depicted in green. Domains D.I-D.IV, schematized by a red box (broken line) with indication of the range of size variation, and stems I-IV are as defined in Figure 1. The four regions delimited by stems I-IV are termed A-D, respectively. The 3[prime] end of the 8S precursor to 5.8S rRNA identified in rat (19) is shown by a blue arrow. Other symbols as in Figure 1. Despite the extensive variation of its sequence, domain C can adopt a very similar folding in the five Mus species (Fig. Figure 3. Folding of domain D.III. The portion of the folded ITS2 shown here for the different Mus species corresponds precisely to the part of the aligned sequences in a red dotted box termed D.III in Figure 1. Conserved segments (as defined in Fig. 1 legend) are delineated by a thick overline. Pairings supported by compensatory changes (when comparing the corresponding sequence to the M.musculus reference) are depicted in green. Conserved stems are numbered 1-4, as in Figure 1. Domain D exhibits a more complex secondary structure (Fig. Figure 4. Folding of domain D.IV. The portion of the folded ITS2 shown here for the different Mus species corresponds precisely to the part of the aligned sequences in a red dotted box termed D.IV in Figure 1. The blue arrow points on a cleavage site initially identified in M.musculus (20,21). Other symbols are as in Figure 3. The dot overlines denote sequence tracts shorter than 14 nt which are conserved among the five Mus sequences. Only a subset of the conserved tracts identified among the different Mus sequences can be identified within the ITS2 of the two other, more distant mammalian sequences, rat and human, which are 764 and 1155 nt, respectively (Fig. Figure 5. Secondary structure model for the ITS2 of rat (a) and human (b). The rat and human sequences are respectively from Subrahmanyam et al. (31) and Gonzalez et al. (34). Counterparts of sequence tracts conserved among the five Mus species are termed as in the previous figures and denoted by thick overlines. Although its sequence is not conserved between human and rodents, the human sequence termed e (dotted line) and its positional equivalent in rodents, tract e, are both pyrimidine-rich. Independent domains of secondary structure A-D correspond to regions A-D of the Mus ITS2 delineated in Figure 2. The 3[prime] ends of the 8S and 12S intermediates in the formation of 5.8S rRNA identified in rat and mouse, respectively, and a site of preferential cleavage of the rat ITS2 by protein B23 endonuclease (49) are also indicated (blue arrows). Figure 6. A potential pseudoknot adjacent to the 3[prime] end of 12S pre-rRNA. (a) Representation of the base pairings involved in the formation of stem 6 and the pseudoknot for mouse, rat and human. The red box delineates a variable portion of stem 6 and green boxes point to compensatory changes. A blue arrow indicates the position of the 3[prime] end of the 12S pre-rRNA intermediate and the sequence of conserved tract f in bold-face letters (this representation is not intended to imply triple-strand interaction). (b) Schematic representation of the folding of domains C and D in mouse, showing the localization of the potential pseudoknot, with conserved segments depicted in green, and the site of the 109 nt additional sequence in M.(P)platythrix shown by a filled triangle. Since snoRNA U8 is required for the production of 5.8S and 28S from vertebrate 32S pre-rRNA, we have systematically looked for evolutionarily conserved complementarities between the snoRNA and pre-rRNA sequences. In addition to the short hallmark motif boxes C and D, the conservation among the four available U8 sequences (Fig.
DISCUSSION
Preserved structural features in the ITS2 folding of mammals
The rates of divergence of the ITS2 sequence studied in this work appear well-suited for the derivation of secondary structure models by the comparative approach. Although most point mutations correspond to transitions on a single side of the base-pair (semi-conservative changes G:U versus G:C or A:U), a few true compensatory changes are observed, providing strong support for several elementary features of the model, stems III, IV, 1, 5 and 7. Moreover, the overall shape of the ITS2 folding is not altered by relatively frequent insertions-deletions preferentially located at the apex of long stems (for stems II, 5 and 7) or at opposite positions within a duplex, reminiscent of what was previously observed for the rapidly evolving domains of mature eukaryotic rRNAs (32,33), thus providing further indirect evidence in favor of the ITS2 model. Although in the early rodent model (18) the most conserved regions of the ITS2, i.e., domain A, domain B, the upper half of domain C (except for the terminal Y-shaped structure) and stem 6 in domain D, were similarly folded except for a few details, extensive differences are observed over the regions devoid of any sequence homology between rat and mouse, except for stem IV.
The new mouse model also provides a framework for a re-evaluation of ITS2 foldings for rat and human based on a sole thermodynamical basis (34). In rat and human too, the ITS2 can be folded in four regions emerging from a central loop containing three of the five most conserved sequence tracts of the spacer, pointing to the functional importance of this structural core, possibly at some stage of pre-rRNA processing. The high content of conserved adenines at unpaired positions of this structural core is also in line with this notion, since bulged and looped adenines may represent key elements in RNA-RNA tertiary contacts (35,36) and RNA-ribosomal protein interactions (reviewed in 37-39).
Figure 7. Two potential complementarities between snoRNA U8 and 32S pre-rRNA. (a) Alignment of the four vertebrates U8 sequences. Hyphens denote identities with the human sequence. The segments showing complementarities with the 5[prime] end of ITS2, the 3[prime] end of 5.8S rRNA and a central segment of 28S rRNA are respectively colored in green, yellow and blue. P1 and P2 locate two segments of complementarity with the 5[prime] end of 28S previously proposed to be functionally important (52). (b) Details of the potential interaction between U8 and 32S pre-rRNA around the 5.8S/ITS2 junction (arrow). The red dotted box delineates a portion of the binary complex which is slightly longer in Xenopus as compared to mammals. (c) Interaction between U8 snoRNA and mouse 28S rRNA. ITS2 exhibit large size differences and extensive sequence variation among distant eukaryotes. However, among groups of more closely related organisms, such as trematodes (40), green algae and flowering plants (41,42) the comparative analysis of different sets of related sequences has resulted in secondary structure models which, although substantially different from each other, nevertheless exhibit several common features with the present rodent model. Thus: (i) ITS2 can be organized around a preserved central core of secondary structure from which emerge four helices and containing a high level of sequence conservation within the single-stranded segments separating regions A, B and C. (ii) Within each of these groups, the 5[prime] end of the ITS2 is highly conserved over ~120 nt and it can fold into two short and strongly constrained stems. (iii) Domain C can fold into a long and generally unbranched stem. (iv) The ITS2 3[prime] half is the most variable region and it can frequently fold into complex branched structures. However, these common structural features are not found in all eukaryotes. Thus, the phylogenetically supported, experimentally tested yeast ITS2 folds into a cross-like structure delineated by a 5[prime]-3[prime] terminal stem (13,43) without any obvious similarity to the organization described above, suggesting that the function of ITS2 might substantially differ between fungi and other eukaryotic kingdoms. Analysis of other sets of related ITS2 sequences, such as X.laevis and Xenopus borealis (17), a group of seven fish ITS2 sequences, together with a critical revisiting of models derived for insects (44-46), and green algae (47) might provide further insight into the significance of this discrepancy.
Potential functional implications
Except for the 3[prime] terminal region, which merely retains a similarly biased base composition, sequences spanning pre-rRNA process-ing sites appear preferentially conserved among mammalian ITS2, pointing to the biological importance of 8S and 12S pre-rRNAs. Moreover, while conserved segment f does not span a processing site, it can form with the nucleotides immediately downstream from the 3[prime] end of 12S pre-rRNA a phylogenetically supported pseudoknot which could play a key role in the recognition by the processing endonuclease, given that several RNA-binding proteins, such as rproteins S12, S15 and L23 (36,48), recognize a pseudoknotted structure in their RNA target. Recently, a new endonucleolytic cleavage catalyzed in vitro by nucleolar protein B23 (49) has been identified in rat, located within a conserved segment of the rodent ITS2 (Fig.
Replacement experiments showed that functional conservation of ITS2 extends only to closely related yeast species, pointing to a high degree of concerted evolution between ITS2 and its processing machinery (16). Particularly, the presence of only one internal cleavage site within yeast ITS2 could be directly related to the above-mentioned peculiarities of yeast ITS2 folding. At the secondary structure level, the environment of the 3[prime] end of the 7S sequence within S.cerevisiae ITS2 appears more reminiscent of that of the mammalian 12S sequence than of the 8S intermediate (Fig.
U8: a chaperone for 32S pre-rRNA folding in vertebrates?
Figure 8. The multiple interactions proposed between U8 and 32S pre-rRNA schematized for mouse. U8 is colored in red, ITS2 in green, 28S rRNA in blue and 5.8S rRNA in yellow. The two arrows point to the 5[prime] and 3[prime] ends of ITS2. V13 identifies a divergent domain in the central part of 28S rRNA sequence. Accurate processing of the 3[prime] end of 5.8S rRNA and 5[prime] end of 28S in mammals has been proposed to be mediated by two 5 nt complementarities between the 5[prime] end of U8 and the 5[prime] end of 28S (24). However, no comparative support is available for the two proposed interactions (Fig.
ACKNOWLEDGEMENTS
We thank Yves Henry for helpful discussions and critical reading of the manuscript. This work was financially supported by general fundings from the C.N.R.S. and Université Paul Sabatier, Toulouse, and by specific grants from the Ministère de l'Education Nationale, de l'Enseignement Supérieur et de la Recherche (ACC SV 07 and 13 to B.M. and ACC SV 01 to J.P.B.), from the C.N.R.S. (Action PCV to J.P.B.) and from Région Midi-Pyrénées.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: jnl.info{at}oup.co.uk
Last modification: 14 May 1999
Copyright©Oxford University Press, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
A. W. Coleman
Pan-eukaryote ITS2 homologies revealed by RNA secondary structure
Nucleic Acids Res.,
May 11, 2007;
35(10):
3322 - 3329.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
X.-H. LIANG, S. ULIEL, A. HURY, S. BARTH, T. DONIGER, R. UNGER, and S. MICHAELI
A genome-wide analysis of C/D and H/ACA-like small nucleolar RNAs in Trypanosoma brucei reveals a trypanosome-specific pattern of rRNA modification
RNA,
May 1, 2005;
11(5):
619 - 645.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Gottschling and J. Plotner
Secondary structure models of the nuclear internal transcribed spacer regions and 5.8S rRNA in Calciodinelloideae (Peridiniaceae) and other dinoflagellates
Nucleic Acids Res.,
January 13, 2004;
32(1):
307 - 315.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
C. L. Sansam, K. S. Wells, and R. B. Emeson
Modulation of RNA editing by functional nucleolar sequestration of ADAR2
PNAS,
November 25, 2003;
100(24):
14018 - 14023.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
N. Tomasevic and B. A. Peculis
Xenopus LSm Proteins Bind U8 snoRNA via an Internal Evolutionarily Conserved Octamer Sequence
Mol. Cell. Biol.,
June 15, 2002;
22(12):
4101 - 4112.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
S.A. GERBI, A.V. BOROVJAGIN, M. EZROKHI, and T.S. LANGE
Ribosome Biogenesis: Role of Small Nucleolar RNA in Maturation of Eukaryotic rRNA
Cold Spring Harb Symp Quant Biol,
January 1, 2001;
66(0):
575 - 590.
[Abstract]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. D. Omer, T. M. Lowe, A. G. Russell, H. Ebhardt, S. R. Eddy, and P. P. Dennis
Homologs of Small Nucleolar RNAs in Archaea
Science,
April 21, 2000;
288(5465):
517 - 522.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
C. Verheggen, G. Almouzni, and D. Hernandez-Verdun
The Ribosomal RNA Processing Machinery Is Recruited to the Nucleolar Domain before RNA Polymerase I during Xenopus laevis Development
J. Cell Biol.,
April 17, 2000;
149(2):
293 - 306.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (1286K)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (37)
![]()
Request Permissions ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Michot, B.
![]()
Articles by Bachellerie, J. P.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Michot, B.
![]()
Articles by Bachellerie, J. P.
![]()
Social Bookmarking ![]()
![]()
What's this?

