Intracellular localization and unique conserved sequences of three small nucleolar RNAs
Intracellular localization and unique conserved sequences of three small nucleolar RNAsN. Selvamurugan, Oscar H. Joost, Elizabeth S. Haas1, James W. Brown1, Nancy J. Galvin and George L. Eliceiri*
Department of Pathology, St Louis University School of Medicine, St Louis, MO 63104-1028, USA and 1Department of Microbiology, North Carolina State University, Raleigh, NC 27695-7615, USA
Received November 18, 1996;Revised and Accepted February 26, 1997
DDBJ/EMBL/GenBank accession nos U64695-U64709
ABSTRACT
Three human small nucleolar RNAs (snoRNAs), E1, E2 and E3, were reported earlier that have unique sequences, interact directly with unique segments of pre-rRNA in vivo and are encoded in introns of protein genes. In the present report, human and frog E1, E2 and E3 RNAs injected into the cytoplasm of frog oocytes migrated to the nucleus and specifically to the nucleolus. This indicates that the nucleolar and nuclear localization signals of these snoRNAs reside within their evolutionarily conserved segments. Homologs of these snoRNAs from several vertebrates were sequenced and this information was used to develop RNA secondary structure models. These snoRNAs have unique phylogenetically conserved sequences.
INTRODUCTION
Processing of rRNA precursors requires several small nucleolar RNA (snoRNA) species (reviewed in 1 ,2 ). E1/U17 (3 -8 ), E2 and E3 (3 -6 ) snoRNAs do not belong to the main class of snoRNAs, since they lack the C and D sequence boxes that are present in most snoRNAs, do not show substantial sequence homology with any other snoRNA and do not associate with the nucleolar protein fibrillarin. They are housekeeping RNAs, since they are present in all tissues tested (3 ). These three snoRNAs may play as yet undetermined roles in ribosome biogenesis, since they interact directly (psoralen crosslink) with unique segments of pre-rRNA in vivo (4 ). E1, E2 and E3 RNAs do not have any of the sequences that are known to be nuclear or nucleolar localization signals for other small nuclear RNAs (9 -15 ). E1 (5 ,7 ,8 ), E2 (16 ) and E3 (17 ,5 ) RNAs, among other snoRNAs (1 ), are encoded in introns of protein genes.
The intracellular localization and transport of a given RNA species are essential to its function in the cell and are important to the understanding of its various functions and interactions in vivo (reviewed in 18 -20 ). Nothing is known about how these three snoRNAs localize in the nucleus and nucleolus. Toward that long-term goal, we injected human and frog versions of these snoRNAs into the cytoplasm of frog oocytes and then monitored their intracellular localization. To study how the various functional domains of an RNA species function, it is important to identify its evolutionarily conserved nucleotides. We have determined the sequences of these snoRNAs from several vertebrates.
MATERIALS AND METHODS
General methods
The following procedures were as described before: cDNA synthesis with avian myeloblastosis virus reverse transcriptase (4 ); polymerase chain reaction (PCR) amplification of cDNA and genomic DNA (21 ); thermal cycle DNA sequencing of PCR products (22 ). Unless indicated otherwise, human E1, E2 and E3 RNA end primers were used for cDNA synthesis and for PCR amplification of cDNA and liver genomic DNA.
Microinjection into frog oocytes
PCR amplification was used to synthesize DNA fragments that had a bacteriophage T7 RNA polymerase promoter and full-length frog E1, E2 or E3 RNA sequences. The frog E1 DNA template was made from a genomic clone (8 ); the frog E2 and E3 DNA templates were made from cDNA. Xenopus laevis has six potential genes for E1 RNA; we used the f sequence, since it has been shown to be expressed (8 ). RNA synthesis in vitro was in the presence of [[alpha]-32P]UTP or [3H]UTP, as indicated, and the cap analog m7G(5')ppp(5')G, to cap the 5'-end (23 ).
32P-Labeled snoRNAs were injected into the cytoplasm of X.laevis oocytes, which were then incubated at 19oC for 20 h. 32P-Labeled frog oocytes were fractionated under oil into nucleus and cytoplasm (24 ). Their RNA was extracted and then fractionated by 10% polyacrylamide gel electrophoresis. 3H-Labeled snoRNAs were injected into the cytoplasm of frog oocytes. After 20 h incubation, the oocytes were fixed in formaldehyde, embedded in glycol methacrylate and 2 [mu]m sections were used for autoradiography. Exposures were at 4oC, using autoradiography emulsions Kodak NTB-2 and Ilford K.5D.
Sequencing of RNA ends
The sequences of the 5'-terminus of X.laevis E2 and E3 RNAs were determined by 5'-RACE (rapid amplification of cDNA ends) (25 ). Complementary DNA was synthesized using a primer corresponding to a conserved internal E2 or E3 RNA sequence. The product was digested with RNase H and purified by gel electrophoresis. A tail was added to cDNA with terminal deoxynucleotidyl transferase and dATP. PCR amplification of tailed cDNA was with an anchored dT primer (GCGGAATTCTTTTTTTTTTTTTTTTTT) and a nested primer corresponding to another conserved E2 or E3 RNA sequence. The PCR product was purified by gel electrophoresis and its sequence determined by thermal cycle sequencing. The sequences of the 3'-end of frog E2 and E3 RNAs were determined by the following procedure. Cellular RNA was ligated with T4 RNA ligase to an oligodeoxynucleotide that was phosphorylated at the 5'-terminus and was blocked at the 3'-end with cordycepin [SP6Reco, ATAGTGTCACCTAAATGAATTCC(3'-dA)] (26 ). The product was precipitated with isopropanol in the presence of ammonium acetate. A second 5'-end phosphorylated oligodeoxynucleotide (SP6eco, GGAATTCATTTAGGTGACACTAT), complementary to SP6Reco, was used for cDNA synthesis of the ligation product using reverse transcriptase (26 ). The product was digested with RNase H and precipitated with isopropanol in the presence of ammonium acetate. The cDNA was amplified by PCR using the SP6eco primer and a primer corresponding to an internal E2 or E3 RNA conserved sequence. The PCR product was gel purified and sequenced by thermal cycle sequencing.
RNA secondary structure
Models for the secondary structures of E1, E2 and E3 RNAs were constructed by a combination of the phylogenetic comparative method (27 ) and thermodynamic prediction (28 ). Sequences were aligned manually and searched by computer for conserved potential pairings and co-variation of sequence changes using COVARIATION (29 ). Potential secondary structure elements were predicted thermodynamically using MULFOLD (28 ) and sorted for consistency between sequences and with the comparative data.
RESULTS
E1, E2 and E3 RNAs need mechanisms to localize in the nucleolus and nucleus. First, the mature forms of these snoRNAs remain continuously in the nucleolus in interphase cells, instead of being scattered throughout the cell. Second, both the nucleolus and the nuclear membrane break down during mitosis. Then, whether the association of these mature snoRNA molecules with nucleolar structures is interrupted or not in mitosis, there has to be a mechanism to restore or maintain this association. Finally, the pre-mRNA transcription and processing steps that generate these snoRNAs occur at nuclear sites outside the nucleolus. A mechanism is needed to transport these newly made snoRNAs to the nucleolus. We asked first if these snoRNAs, when placed in the cytoplasm, can migrate to the nucleus and, if so, whether the nuclear localization signals of these three snoRNAs are conserved between human and frog. Injections were into the cytoplasm because (i) the mature forms of these snoRNAs are in the cytoplasm during mitosis, (ii) we are interested in both the nuclear and nucleolar localization signals of these snoRNAs and (iii) cytoplasmic injections are less damaging to the oocyte. In vitro transcribed human E1, E2 and E3 RNAs localized in the nucleus after they were injected into the cytoplasm of frog oocytes (Fig. 1 A). This localization is RNA sequence specific, since antisense transcripts of frog E1, E2 and E3 RNAs did not migrate to the nucleus (Fig. 1 B). To minimize degradation, both the snoRNAs and antisense transcripts were capped at the 5'-end with 7-monomethylguanosine. This cap is not a nuclear localization signal (9 ,10 ) and had no effect on this transport, since capped antisense transcripts remained in the cytoplasm (Fig. 1 B).
DISCUSSION
E1, E2 and E3 RNAs are expected to have novel nucleolar localization signals, since they lack the known nuclear or nucleolar localization elements of other nuclear RNAs (9 -15 ). The intracellular distribution of human E1, E2 and E3 RNAs that were injected into the cytoplasm of frog cells indicates that the nuclear and nucleolar localization signals of these snoRNAs reside within their evolutionarily conserved segments. An AGA triplet is found just downstream of the 5'-terminus folded domain of many snoRNAs that have the ACA box (35 ), but is absent from the conserved sequences of E1, E2 and E3 RNAs. The ACA sequence is the only conserved sequence shared by these three snoRNAs. Much longer sequence elements are needed for intracellular localization of the snoRNAs whose localization signals are known. For example, the nucleolar localization of MRP snoRNA requires a 40 base snoRNA sequence element (15 ) and the nuclear localization of U3 snoRNA requires both a 13 base snoRNA sequence element and a 5 bp stem (12 ). These observations suggest that E1, E2 and E3 RNAs each may have different nucleolar localization signals. Extensive conserved nucleotide sequences, which are absent in E1, E2 and E3 RNAs, are required by other snoRNA species to function in pre-rRNA processing or for snoRNA processing from pre-mRNA introns (36 ,37 ). For example, several U8 snoRNA conserved sequences are needed for its function in pre-rRNA processing, including five sequences consisting of 4-8 nt each (36 ). These observations suggest that the E1, E2 and E3 RNA cis-acting elements required for these functions may be novel and possibly also different among these three snoRNAs. It is anticipated that proteins that interact with these elements participate in the mechanism of intracellular localization of these snoRNAs.
A 26 base intron sequence is identical in rat and mouse and in both lies immediately upstream of the E3 RNA gene (Fig. 3 ). As expected for species whose ancestors split ~15 million years ago, there are many mismatches in the rest of the intron, except near the splice sites. This sequence is not part of the 5'-splice site or the branchpoint site, because it is not sufficiently near them. It is unlikely that a non-functional 26 base sequence would be fully conserved after so many million years. This sequence is not conserved in other vertebrates, but this is true for functional domains that have co-evolved with their functional partners (38 ,39 ). The genes for two intronic snoRNAs, human E2 (5 ) and frog U16 (40 ), both show two identical sequences in the same positions: CTACCTA, 123 nt upstream of the snoRNA coding region, and GAGAAATG, 27 bases downstream of the snoRNA coding sequence. One or more of these three flanking sequences might have a role.
ACKNOWLEDGEMENTS
We thank Noel Daly for PCR amplification of zebrafish E1 DNA, Thomas E.Dahms and Patricia L.Farrar for fresh tissues, Francesco Amaldi for the X.laevis E1 RNA genomic DNA clone, Elsebet Lund and Philip L.Paine for advice on the isolation of frog oocyte germinal vesicles and cytoplasm, Charles A.O'Brien for advice on 5'- and 3'-RACE and Paola Pierandrei-Amaldi for instructions on fixation and sectioning of frog oocytes. We also thank Gloriosa Go and Lynne Mann for technical assistance, Chiyen Miller for assistance in microautoradiography, Andrew Grainger for computer searching and Clifford Pollack and Jesse Urhahn for photography. This work was supported by a grant from NIH.
REFERENCES
1 Maxwell,E.S. and Fournier,M.J. (1995) Annu. Rev. Biochem., 35, 897-934.
23 Krieg,P.A. and Melton,D.A. (1987) Methods Enzymol., 155, 397-415.
24 Lund,E. and Paine,P.L. (1990) Methods Enzymol., 181, 36-43.MEDLINE Abstract
25 Frohman,M.A., Dush,M.K. and Martin,G.R. (1988) Proc. Natl. Acad. Sci. USA, 85, 8998-9002.
26 O'Brien,C.A. and Wolin,S.L. (1994) Genes Dev., 8, 2891-2903.
27 Gutell,R.R. (1993) In Nierhaus,K.H., Franceschi,F., Subramanian,A.R., Erdmann,V.A. and Wittmann-Liebold,B. (eds), The Translational Apparatus. Plenum Press, New York, NY, pp. 477-488.