Nucleic Acids Research Advance Access originally published online on September 13, 2007
Nucleic Acids Research 2007 35(18):6249-6258; doi:10.1093/nar/gkm668
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nucleic Acids Research, 2007, Vol. 35, No. 18 6249-6258
© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
RNA |
Novel rapidly evolving hominid RNAs bind nuclear factor 90 and display tissue-restricted distribution
Department of Biochemistry and Molecular Biology, New Jersey Medical School, UMDNJ, 185 South Orange Ave., P. O. Box 1709, Newark, NJ 07101-1709, USA
*To whom correspondence should be addressed. Tel: 973 972 4411; Fax: 973 972 5594; Email: mathews{at}umdnj.edu
Received June 27, 2007. Revised August 13, 2007. Accepted August 13, 2007.
| ABSTRACT |
|---|
|
|
|---|
Nuclear factor 90 (NF90) is a double-stranded RNA-binding protein implicated in multiple cellular functions, but with few identified RNA partners. Using in vivo cross-linking followed by immunoprecipitation, we discovered a family of small NF90-associated RNAs (snaR). These highly structured non-coding RNAs of
117 nucleotides are expressed in immortalized human cell lines of diverse lineages. In human tissues, they are abundant in testis, with minor distribution in brain, placenta and some other organs. Two snaR subsets were isolated from human 293 cells, and additional species were found by bioinformatic analysis. Their genes often occur in multiple copies arranged in two inverted regions of tandem repeats on chromosome 19. snaR-A is transcribed by RNA polymerase III from an intragenic promoter, turns over rapidly, and shares sequence identity with Alu RNA and two potential piRNAs. It interacts with NF90's double-stranded RNA-binding motifs. snaR orthologs are present in chimpanzee but not other mammals, and include genes located in the promoter of two chorionic gonadotropin hormone genes. snaRs appear to have undergone accelerated evolution and differential expansion in the great apes. | INTRODUCTION |
|---|
|
|
|---|
Proteins in the nuclear factor 90 (NF90) family of double-stranded RNA-binding proteins participate in many aspects of vertebrate RNA metabolism [reviewed in (1)] and have been implicated in development (2,3), the cell cycle (4) and virus infection (5,6). The two most prominent protein isoforms are NF90 and NF110 (7,8). NF90 is also known as DRBP76 (9) and NFAR1 (10), and NF110 is synonymous with ILF3 (11), NFAR2 (10) and TCP110 (12). Both exist in complexes with a distinct protein, NF45 (D. Guan et al., manuscript in preparation; 13).
Proteins in the NF90 family possess two double-stranded RNA binding motifs (dsRBMs) (14) as well as RGG (9) and GQSY (1) nucleic acid binding motifs. Through in vitro reconstitution experiments, the proteins have been found to interact with coding and non-coding RNAs of cellular (2,3,15–18) and viral origin (5,6,19). Most of these RNA species are not abundant in cells, yet our earlier work demonstrated that the dsRBMs of NF90 and NF110 are almost completely occupied by cellular RNA throughout the cell cycle (20). We, therefore, sought to identify the predominant in vivo binding partners of NF90 by cloning the RNA species that were cross-linked to NF90 in live 293 cells using a rigorous and highly specific protocol (21). By this means we detected a hitherto unreported RNA family that forms complexes with NF90 protein and is designated snaR (small NF90-associated RNA).
This novel RNA family contains three distinct but related subsets that are encoded by tandemly repeated genes in two segments of human chromosome 19, as well as additional gene copies two of which are adjacent to genes encoding the ß chain of chorionic gonadotropin. The snaRs are modern, rapidly evolving, non-coding RNAs of
117 nucleotides (nt). Their genes are apparently restricted to humans and chimpanzees. snaR-A is highly structured, relatively unstable and synthesized by RNA polymerase III (Pol III) from an intragenic promoter. The snaRs were detected predominantly in testis among
20 human tissues tested but are abundant in many immortalized cell lines. Their distribution, genetic organization and evolutionary relationships suggest a biological role relevant to the speciation of great apes.
| MATERIALS AND METHODS |
|---|
|
|
|---|
GenBank accession numbers
snaR-A #EU035783, snaR-B #EU035784, human and chimpanzee snaR genes #EU071051-087.
Cell lines
Human 293 cell sublines expressing NF90a and NF90b (21) were maintained in DMEM (Sigma) supplemented with 8% fetal bovine serum (Sigma) and 100 µg/ml Geneticin (Invitrogen).
Cross-linking, immunoprecipitation, 3'-end-labeling and RT-PCR
Procedures and recipes are detailed in (21). 293 cell sublines were blocked with 0.1 mg/ml cycloheximide then washed in phosphate-buffered saline (PBS, Sigma) with 0.1 mg/ml cycloheximide and 7 mM MgCl2. Cells were incubated in PBS with 7 mM MgCl2 and 0.5% formaldehyde for 10 min at 25°C before the addition of glycine (pH 7, 0.25 M final) and a further incubation of 5 min. Cells were harvested then lysed on ice for 5 min in RIPA buffer. Lysate was pre-cleared with Protein A-Sepharose (Amersham Biosciences) for 1 h at 4°C, centrifuged then incubated with anti-Omni-probe antibody (Santa Cruz Biotech.)-Protein A-Sepharose complex for 3 h at 4°C. Immunoprecipitates were washed five times, 10 min each at 25°C, in Harsh RIPA buffer. Immunoprecipitates were incubated in Cross-Link Reversal buffer at 70°C for 45 min and RNA isolated by Trizol extraction (Invitrogen). RNA was precipitated with 1 volume isopropanol in the presence of 1 M ammonium acetate and glycogen and the pellet washed in 75% ethanol, air-dried and re-suspended in water.
Isolated RNA was 3'-radiolabeled by ligation of [5'-
-32P]-cytidine-3',5'-bisphosphate, catalyzed by T4 RNA ligase (New England Biolabs) for 2 h on ice.
Immunoprecipitated RNA was incubated with DNase I (Amplification Grade, Invitrogen). A lock-dock oligo(dT) primer (0.1 µM final) was annealed to RNA by heating at 70°C for 10 min. Reverse transcription was performed at 42°C for 50 min using SuperScript II Reverse Transcriptase (Invitrogen). The reaction was terminated at 70°C for 15 min and RNA was digested with RNase (15 U/µl RNase T1, 4 U/µl RNase H) for 30 min at 37°C. cDNA was twice purified through a QIAquick desalting column (Qiagen), heated at 94°C for 2 min in Tailing buffer then incubated with terminal deoxynucleotidyl transferase (New England BioLabs) for 10 min at 37°C. The reaction was terminated at 65°C for 10 min. cDNA was amplified over 30 cycles with an annealing temperature of 63°C, using a reverse primer, Abridged Anchor primer (Invitrogen) and SuperTaq polymerase (Ambion).
Plasmid constructs
GST fusion plasmids pGEX-6P-3-NF90a and -NF90b were subcloned from pcDNA3.1-NF90a and -NF90b (8) into pGEX-6P-3 vector (Amersham Biosciences) after EcoRI digestion. GST-NF90c was expressed from pGST-NF90 (22). pGEX-6P-3-A458P,A588P, was generated by digesting pcDNA3.1-NF110b(A458P,A588P) (23) with AvrII and HindIII, then inserting the gel purified fragment into similarly digested pGEX-6P-3-NF90b. To create pGEX4T-3-NF45, NF45 was PCR amplified from pcDNA3-NF45 (24) using primers F45 (5'-CGACAGAATTCCATGAGGGGTGACAGAGG-3') and R45 (5'-CCTGTGCGGCCGCTTCACTCCTGAGTTTCCATGC-3'). The PCR product was digested with EcoRI and NotI and ligated into similarly digested pGEX4T-3 vector (Amersham Biosciences).
To remove extraneous 3' and 5' RT-PCR priming sites from snaR-A sequence, pCR2.1-Clone 3 was PCR amplified using primers F-A (5'-GAGGTCTAGATTGGAGCCATTGTGGCTC-3') and R-A (5'-GAGGAAGCTTCCGACCCATGTGGACCA-3'). The PCR product was digested with XbaI and HindIII and inserted into similarly digested pCRII vector (Invitrogen), to create pCRII-snaR-A. T7 run-off transcription from linearized pCRII-snaR-A produces snaR-A with 21 extraneous 5'-flanking nucleotides. To utilize transcribed snaR-A as an accurate size marker, snaR-A was PCR amplified from the pCRII-snaR-A construct using primers F-A2 (5'-GGCAAAGCTTTAAAATCCTTTTTTCCGACCCATGTGGACCAGG-3') and R-A2 (5'-GGCAGGATCCTAATACGACTCACTATAGCCGGAGCCATTGTGGCTCAGG-3'). These primers place the snaR-A gene at position +2 and introduce a DraI restriction site after four terminal uridines. The PCR product was digested with BamHI and HindIII and inserted into similarly digested pUC18 vector (Invitrogen).
To construct pR1-R3, fragments of chromosome 19 encompassing a snaR-A gene (55299144, Table 1) were PCR amplified from genomic DNA using primers F1 (5'-GGCGTCCTCCCTTTATGTTTTGTCG-3') and either R1 (5'-GGAGCCATTGTGGCTCAGGC-3'), R2 (5'-CTGAACTCTCAGATACAGTTCCCC-3') or R3 (5'-CACTGCGCCAGCCTTGTTTCTCG-3'), then inserted via TOPO TA Cloning technology (Invitrogen) into pCR2.1 vector (Invitrogen).
|
RNase H digestion
Immunoprecipitated RNA (
25% isolated) was 3'-end labeled as described above. Labeled RNA (13.4 µl final) was incubated at 70°C for 10 min in the presence of 100 µM oligonucleotide specific for snaR (sense, 5'-GGGCACGAGTTCGAGGCC-3' or antisense, 5'-GGCCTCGAACTCGTGCCC-3') or 5S rRNA (sense, 5'-AACGCGCCCGATCTCGTCTGA-3' or antisense, 5'-TCAGACGAGATCGGGCGCGTT-3'), then cooled. RNA was incubated with RNase H (5 U, New England BioLabs) in the supplied reaction buffer at 30°C for 3 h. Digestion was stopped with 16 µl 2x formamide loading buffer (95% formamide, 10 mM Tris-HCl, pH 8, 20 mM EDTA) and heating at 70°C for 5 min.
GST pull-down assay
GST protein extract was prepared as previously described (22). Extract was rocked at 4°C for 1 h with 6 µl GSH beads in a final volume of 250 µl EBCD buffer (50 mM Tris, pH 8, 120 mM NaCl, 0.5% NP-40, 1 mM DTT). Beads were washed thrice for 2 min in 1 ml EBCD buffer and 0.075% SDS and twice for 2 min in 1 ml low salt buffer (50 mM Tris, pH 7.6, 50 mM NaCl) at 4°C with rocking. A third of the beads were set aside for protein analysis in a 7.5% SDS/PAGE gel. Beads were washed twice on ice in 1 ml Binding buffer (25 mM Hepes KOH, pH 7.4, 0.1 M KCl, 10 mM MgCl2, 0.1 mM EDTA, 1 mM DTT, 10 µM PMSF, 0.1 mg/ml BSA, 0.1 mg/ml tRNA) + 0.01% NP-40, then once in 1 ml Binding buffer. Radiolabeled snaR-A (
200 fmol) was incubated with the beads (
12 pMol of immobilized GST protein) in 10 µl Binding buffer with 1 U/µl RNasin at 30°C for 5 min, then on ice for 25 min. Beads were washed thrice in 1 ml modified Binding buffer (with 175 mM KCl, 10 µg/ml tRNA, 0.01% NP-40) and bound snaR-A was eluted in 250 µl Elution buffer (50 mM Tris pH 7.4, 150 mM NaCl, 0.05% NP-40, 50 µg/ml tRNA, 0.5% SDS). RNA was phenol extracted (200 µl) and precipitated in EtOH in the presence of 20 µg/ml glycogen and 1 M ammonium acetate.
Northern blots
Apart from fetal kidney RNA (Clontech), all human tissue RNA was purchased from Ambion. RNA was resolved in polyacrylamide/7 M urea gels and transferred in 0.5x TBE to GeneScreen Plus nylon membrane (PerkinElmer Life Sciences) by electroblotting. Membranes were blocked in ULTRAhyb-Oligo hybridization buffer (Ambion) for 1 h at 37°C, before hybridization at 37°C for 16 h with 5'-end labeled oligonucleotide. Blocking and hybridization of labeled RNA complementary to clone 3 was carried out at 42°C. Membranes were washed twice in 2x SSC with 0.1% SDS for 10 min at 25°C. All oligonucleotide probes were 5'-end labeled with 32P-
-ATP using T4 polynucleotide kinase (New England BioLabs) and purified over Micro Bio-Spin P-30 chromatography columns (Bio-Rad). Oligonucleotides used in this study: Probe A 5'-GACCCATGTGGACCAGGCTGGCCTCGAACT-3', 5.8S rRNA 5'-CGCAAGTGCGTTCGAAGTGTC-3'.
In vitro transcription
Genomic DNA from 293 cells was prepared as previously described (25). In vitro transcription of pVAII (26), linearized CMV DNA, and pR1-R3 was performed using the HeLa Cell Extract Transcription System (Promega). T7 and SP6 RNA polymerase run-off transcription from linearized pCRII-snaR-A yielded radiolabeled sense and antisense snaR-A, respectively.
| RESULTS |
|---|
|
|
|---|
A novel family of structured RNAs
To characterize RNAs bound to NF90 in vivo, stable 293 cell lines expressing epitope-tagged isoforms of NF90, NF90a and NF90b, were subjected to formaldehyde cross-linking (21). The a and b isoforms differ only in the absence or presence of a four amino acid sequence whose function is unknown (1). The most prominent RNA species immunoprecipitated with NF90 were
80–120 nt long (Figure 1A), and
12 RNAs were identified by cloning and sequencing (21). Several related clones of
104 nt were found, which fell into two subsets: a major subset, snaR-A, and a minor subset, snaR-B (Figure 2A). The subsets are
90% identical in sequence. Human genomic sequences corresponding to snaR-A and snaR-B had occasional mismatches that are possibly due to C-to-U editing of the RNA (Figure 2B). In the genome, the cloned snaR sequences are followed by a run of adenines that accounts for the cloning of these RNAs by RT-PCR with an oligo(dT) primer, and then by a run of thymidines that constitutes a potential PolIII termination site. Placing the 3' terminus within the oligo-(U) run gives a length of
117 nt, consistent with the RNAs gel mobility.
|
|
The snaRs lack an open reading frame and are predicted to fold into thermodynamically stable hairpin structures (Figure 1B). The structures illustrated have calculated
G of folding of –56 kcal/mol and >60% G:C base-pairing, but several alternative structures have similar stability (27). The dynamic secondary structure prediction program paRNAss (28) predicts that snaR-A can readily undergo a conformational switch which might account for mobility shifts seen in higher percentage acrylamide gels (see subsequently).
snaR is a major NF90 binding partner
A single band was detected when 293 cell RNA was examined by northern blotting with a DNA oligonucleotide complementary to snaR-A. As expected, this band was enriched in the NF90 immunoprecipitate (Figure 1C, lanes 5–7) and co-migrated with the major end-labeled RNA band that was immunoprecipitated with NF90 (lane 1). To evaluate the amount of snaR-A in this gel band, NF90-associated RNA was 3'-end labeled and incubated with RNase H in the presence of an oligonucleotide complementary to snaR-A and -B. The labeled snaR band was completely digested and replaced by the predicted end-labeled fragment of
35 nt (Figure 1D, lane 4). No digestion resulted from incubation in the presence of an oligonucleotide with a sequence identical to that of snaR-A (lane 3). A faint band corresponding to 5S rRNA was identified in a similar fashion (lanes 1 and 2). These results identify snaR-A as one of the principal RNAs cross-linked to NF90 in vivo. We estimate that there are
70 000 molecules of snaR-A in a 293 cell (data not shown), compared to
200 000 copies of 7SK RNA (29).
Structured RNAs such as VA RNAII interact with NF90 via its dsRBMs (5). To determine whether functional dsRBMs are required for the NF90/snaR interaction, we examined the ability of NF90 to bind snaR-A in vitro. In a pulldown assay, radiolabeled snaR-A bound to GST fusions with all three NF90 isoforms, NF90a–c, but not to GST itself or GST-NF45 (Figure 1E). Binding was not detected to the NF90b [A458P,A588P] mutant which contains inactivating mutations in both of its dsRBMs. Hence, NF90 binds directly to snaR-A via its dsRBMs, consistent with the high degree of secondary structure predicted for the RNA.
PolIII transcription and rapid turnover of snaR-A
Nearly all of the snaR genes contain a 3'-terminal oligo (dT) run that could serve as a PolIII termination signal (Table 1). They also exhibit homology with the PolIII B box motif (Figure 2C, see subsequently). To determine which RNA polymerase transcribes the snaR genes, we transcribed a chromosomal fragment encompassing a snaR-A gene in HeLa nuclear extract. snaR-A synthesis persisted in the presence of 20 µg/ml
-amanitin, a concentration that reduces PolIII activity by
50% but completely inhibits PolII (30), but was abrogated in the presence of 200 µg/ml of the toxin (Figure 3A, lanes 7–9). The adenovirus-2 VA RNAII gene, which is transcribed by PolIII, exhibited a similar sensitivity (lanes 4–6), whereas PolII transcription driven by the CMV immediate-early promoter was abrogated at 20 µg/ml
-amanitin (lanes 2 and 3). snaR-A was also generated by constructs containing less or no chromosomal sequence upstream of the snaR-A gene (Figure 3B, lanes 3 and 4). The in vitro product had the expected RNase H sensitivity (data not shown) and the same gel mobility as a T7 RNA polymerase run-off transcript of full-length snaR-A (lane 1). We conclude that the snaR-A gene is transcribed by PolIII from an intragenic promoter.
|
To determine the stability of snaR-A, we monitored its disappearance in HeLa cells treated with actinomycin D to inhibit transcription by all polymerases. Northern blotting showed that snaR-A decays rapidly, with a half-life of
15 min (Figure 3C). Inhibitors of PolII transcription and translation had little effect. This short half-life implies a rapid turnover rate and the active synthesis of snaR-A in cells.
Restricted snaR expression in human tissues and cell lines
To evaluate the distribution of snaR-A, we probed a northern blot of total RNA from 19 adult human tissues. High snaR-A expression was seen exclusively in testis (Figure 4A, upper panel). Long exposures revealed weak expression in brain and placenta. Remarkably, no expression was detected in adult kidney or in human fetal kidney tissue (not shown) even though snaR was initially discovered in 293 cells which are adenovirus-transformed cells derived from human embryonic kidney (31) and express the RNA at high levels (see below). RNA integrity and loading were assured by probing for 5.8S rRNA (lower panel). Furthermore, snaR-A was not detected in RNA from progenitor or adipocyte-differentiated mesenchymal stem cells, or from unstimulated or stimulated peripheral blood mononuclear cells (data not shown).
|
Considering the possibility that snaR expression is elevated in tumor cells, we compared RNA from a testis tumor with that from normal adjacent testis tissue. On the contrary, reduced expression was seen in the tumor tissue compared to normal adjacent tissue (Figure 4A, right panel). We next examined total RNA from 16 permanent cell lines to determine whether snaR-A is generally expressed in immortalized cells. Two major bands, possibly representing conformers or other snaR species (see subsequently), were detected in all cell lines tested except Colo205 where only the faster band was seen (Figure 4B, upper panel). snaR-A was expressed highly in 293 cells, at intermediate levels in HeLa, HepG2 and some other lines, and at lower levels in lines such as Jurkat and Colo205. Thus, snaR-A is tightly controlled in human tissues, where it is largely restricted to testis, but it is dysregulated in many virally transformed and tumor-derived cell lines.
With very long exposures, short RNAs of
26–34 nt were detected in testis and 293 cells (Figure 4C, asterisk). Database search disclosed the existence in a piRNA library (32) of two sequences that correspond to snaR-A (Figure 2C). piRNAs are recently discovered germline-specific microRNAs defined as binding to PIWI protein (33). The RNAs piR-36011 and piR-36189 are 27 and 30 nt in length, respectively, and their sequences overlap (32). Interestingly, piR-36189, and hence snaR-A, contain homology to a region of Alu RNA which possesses the consensus sequence of the PolIII B box motif (Figure 2C, (34)). Although it is not known whether these two particular RNAs bind to PIWI, we attempted to detect them in 293 cell, testis and brain RNA. Oligonucleotide probes complementary to the two piRNAs recognized intact snaR-A in 293 and testis RNA. The piR-36011 probe and snaR-A probe (Probe A), but not the piR-36189 probe, gave faint hybridization in the 26–34 nt region (Figure 4C). Another PolIII transcript, adenovirus VA RNAI, was recently found to be processed by Dicer, albeit inefficiently (35). However, the faint bands detected here are more likely to result from snaR-A degradation, since they are diffuse and lack a 5' terminal uracil believed to be a hallmark of piRNA processing (36).
Multiple divergent human snaR genes
A bioinformatic search of the human genome identified 28 snaR genes, including 14 genes for snaR-A and 2 genes for snaR-B (Table 1), all located on the q-arm of chromosome 19. In addition, the search revealed a further subset, snaR-C (5 genes), and 7 snaR genes that defy classification into subsets. Although the snaR-C genes display slight sequence variations (Figure S2A), this subset is closely related to snaR-B (95–97% identity; Figure 1B). The outliers, snaR-D to -G, include 5 unique genes on chromosome 19 as well as single copies (snaR-2 and -3) on chromosomes 2 and 3 (Table 1).
Genes in the three snaR subsets are located in two clusters within chromosomal bands 19q13.32 and 19q13.33 (Figure 5A). The clusters are transcribed in opposite directions and appear to have arisen from an inversion of a region of segmental duplication. The cluster on band 19q13.32 has 5 snaR-A genes interspersed with the 5 snaR-C genes, while the remaining 9 snaR-A genes and 2 tandem snaR-B genes are on band 19q13.33. Most of these snaR genes lie in a 5.3 kb tandem repeat, surrounded by multiple repetitive elements such as SINEs and LINEs (A.F.A. Smit, R. Hubley and P. Green RepeatMasker at http://repeatmasker.org). Each repeat contains a large LINE L1 element of
1.5 kb which is deleted in three repeat units on band 19q13.33 that appear to have been truncated by insertion of a SINE Alu-Sx element.
|
Most of the outliers share flanking sequence homology with the snaR genes in the two clusters (Table 1). The snaR-D, E and F genes are located at varying distances outside the two clusters on chromosome 19, while snaR-G1 and -G2 are located in the 2.1 Mb genomic fragment between the clusters. This region contains many protein-coding genes. Remarkably, these snaR genes lie within the promoter region of two genes encoding the ß polypeptide of chorionic gonadotropin (CGß; Figure 5A).
snaR is specific to hominids
snaR genes are present in chimpanzees (Figure 5B), but were not found in searches of rhesus macaque or other mammalian genomes. Despite a comparable genome size, the chimpanzee has fewer snaR genes (10 instead of 28). As in humans, 7 of the chimpanzee snaR genes have expanded in a tandem repeat. However, the repeat differs from those in humans in size and location (
1.5 kb on chromosome 3; Figure 5D). The repeated chimpanzee genes are
94% identical with the solitary human snaR gene on chromosome 3 (Figure S2B), compared to
88% identity with human snaR-A and –B (Figure 5B). This suggests the expansion of an orthologous gene on chromosome 3 has taken place in chimpanzee but not in humans.
The relationships of the human and chimpanzee genes were further illuminated by phylogenetic analysis (Figure 5C). This confirms that the human and chimpanzee genes on chromosome 3 are closely related. Similarly, the single snaR gene on chimpanzee chromosome 2a is 94% identical with its ortholog on human chromosome 2 (Figure S2B). Furthermore, two genes on chimpanzee chromosome 19 are orthologous with snaR-G1, a unique gene on human chromosome 19 (Figure 5C, top branch). Interestingly, their common ancestry is strengthened by the conserved position of these snaR genes relative to the primate-specific CGß genes (37). snaR-G1 and its closest chimpanzee ortholog are both located 86 nt upstream of the 5' end of the CGß1 gene, which is transcribed in the opposite direction (Figure 5A, gray box). snaR-G2 mirrors this localization, being 85 nt from the 5' end of human CGß2, although it has diverged considerably from snaR-G1 and the chimpanzee genes (Figure 5C).
These observations suggest that the snaR genes originated after the divergence of the Great Apes from other primates, and have expanded differentially, sometimes in tandem repeats, since the divergence between humans and chimpanzees 5–7 MYA (38).
| DISCUSSION |
|---|
|
|
|---|
The snaRs are a hitherto unrecognized family of small structured RNAs that are bound in vivo to NF90. These RNAs turn over rapidly, display a restricted distribution in human tissues, and appear to have evolved recently in primates. Accelerated evolution of the snaR genes is implicit in their restriction to great apes and their different gene copy numbers, location and sequences between human and chimpanzee. Apart from intra-subset comparisons, snaR genes display a lower degree of homology within the gene than in their flanking sequence (Figure S2C). This suggests rapid selection and then conservation of mutations within the loci, presumably due to divergent function among paralogs. Another rapidly evolving gene, HAR1F, which specifies a structured non-coding RNA similar in size to snaR, was recently associated with the emergence of essentially human characteristics including the development of the brain neocortex (39). Strikingly, like the snaRs, this RNA is expressed in testis as well as in brain.
NF90 is highly expressed in the testis (10), and at least three other double-stranded RNA-binding proteins—spermatid perinuclear RNA-binding protein, protamine 1 mRNA-binding protein, and testis nuclear RNA-binding protein—play critical roles in mammalian spermatogenesis [reviewed in (40,41)]. Further work is required to elucidate why members of this family of RNAs bind to NF90 protein. The apical stem-loop appears to be a common feature of snaRs. The stem, which is structurally stable due to extensive canonical Watson–Crick base-pairing, is likely to interact with the dsRBMs of NF90. On the other hand, the apical loop is variable in size and sequence and could play a part in determining the specific functions or targets of these RNAs.
With the exception of snaR-D and -E, all human and chimpanzee snaRs share sequence identity upstream (
150 bp) and downstream (
100 bp) of their genes (Figure S2C). This implies that the majority of snaRs have piggy-backed on a larger segment of DNA which has generated segmental duplications in both hominids. However, snaRs also display features characteristic of retrotransposons: they have an internal PolIII promoter, a stable hairpin loop structure, and a 3'-oligo(A) tract followed by an oligo(T) tract (42). Three observations are consistent with their limited retrotransposition. First, snaR-E resembles snaR-B (Table 1) but lacks the extended 3' oligo(T) tract (transcription termination site) and surrounding sequence identity with other snaR genes. Second, snaR-D, which is most closely related to snaR-A, shares no 5'-flanking sequence with other snaRs. Third, a MUC18 cDNA clone (drop4.7) has been reported that contains a non-genic 92 bp 3'-terminus (43): this sequence is identical to a slightly truncated version of snaR-C.
Transposons and segmental duplications are thought to be a rich source of rapid molecular evolution in primates, giving rise to new gene families often involved in reproduction and immunity (44). While the functions of the snaRs remain to be established, there is a strong association with the reproductive system. In addition to their predominant expression in testis, the locations of the snaR genes are suggestive. The genomic region between the two snaR clusters on 19q13 has the highest gene density on the chromosome [with
110 known or predicted genes in 2.1 megabases (Table S1), compared with an average of
60], and many of these genes are integral to reproduction. Most pertinently, the snaR-G genes are closely linked to the most recent members of the beta subunit of chorionic gonadotropin glycoprotein hormone, where they overlap predicted binding sites for CGß transcription factors that govern early placental development and implantation (45). The CGß polypeptides evolved in primates and have expanded in the great apes (37,45). The most recent additions to the family, CGß1 and CGß2, arose in the common ancestor of African great apes (45). In addition to the critical role of CGß in the establishment of pregnancy, its overexpression is diagnostic for certain testicular cancers (46). Indeed, overexpression of human CGß in transgenic male mice leads to defective reproductive organs (47) and fetal Leydig cell adenomas (48). We speculate that snaRs modulate the expression of CGß polypeptides and possibly other genes in between the snaR clusters through an epigenetic mechanism.
| SUPPLEMENTARY DATA |
|---|
|
|
|---|
Supplementary Data are available at NAR Online.
| ACKNOWLEDGEMENTS |
|---|
We thank Anita Antes, Dr Sergei Smirnov and Dr Stephan Schwander for cells and Michael Tsai for bioinformatics advice. We are indebted to Dr Tsafi Peery, Dr Melissa Rogers, Dr Chee-Gun Lee and Dr Bin Tian for suggestions and comments on the manuscript. Funding was provided by National Institutes of Health (AI034552). Funding to pay the Open Access publication charges for the article was provided by National Institutes of Health (AI034552).
Conflict of interest statement. None declared.
| REFERENCES |
|---|
|
|
|---|
- Reichman TW, Mathews MB. NF90 family of double-stranded RNA-binding proteins: regulators of viral and cellular function. In: Cell Signaling Handbook.—Bradshaw R, Dennis E, eds. (2003) Elsevier Science, USA: Academic Press. 335–342.
- Larcher JC, Gasmi L, Viranaicken W, Edde B, Bernard R, Ginzburg I, Denoulet P. Ilf3 and NF90 associate with the axonal targeting element of Tau mRNA. FASEB J (2004) 18:1761–1763.
[Abstract/Free Full Text] - Shi L, Zhao G, Qiu D, Godfrey WR, Vogel H, Rando TA, Hu H, Kao PN. NF90 regulates cell cycle exit and terminal myogenic differentiation by direct binding to the 3'-untranslated region of MyoD and p21WAF1/CIP1 mRNAs. J. Biol. Chem (2005) 280:18981–18989.
[Abstract/Free Full Text] - Matsumoto-Taniura N, Pirollet F, Monroe R, Gerace L, Westendorf JM. Identification of novel M phase phosphoproteins by expression cloning. Mol. Biol. Cell (1996) 7:1455–1469.[Abstract]
- Liao HJ, Kobayashi R, Mathews MB. Activities of adenovirus virus-associated RNAs: purification and characterization of RNA binding proteins. Proc. Natl Acad. Sci. USA (1998) 95:8514–8519.
[Abstract/Free Full Text] - Isken O, Grassmann CW, Sarisky RT, Kann M, Zhang S, Grosse F, Kao PN, Behrens SE. Members of the NF90/NFAR protein group are involved in the life cycle of a positive-strand RNA virus. EMBO J (2003) 22:5655–5665.[CrossRef][Web of Science][Medline]
- Kao PN, Chen L, Brock G, Ng J, Kenny J, Smith AJ, Corthesy B. Cloning and expression of cyclosporin A- and FK506-sensitive nuclear factor of activated T-cells: NF45 and NF90. J. Biol. Chem (1994) 269:20691–20699.
[Abstract/Free Full Text] - Reichman TW, Parrott AM, Fierro-Monti I, Caron DJ, Kao PN, Lee CG, Li H, Mathews MB. Selective regulation of gene expression by nuclear factor 110, a member of the NF90 family of double-stranded RNA-binding proteins. J. Mol. Biol (2003) 332:85–98.[CrossRef][Web of Science][Medline]
- Patel RC, Vestal DJ, Xu Z, Bandyopadhyay S, Guo W, Erme SM, Williams BR, Sen GC. DRBP76, a double-stranded RNA-binding nuclear protein, is phosphorylated by the interferon-induced protein kinase, PKR. J. Biol. Chem (1999) 274:20432–20437.
[Abstract/Free Full Text] - Saunders LR, Perkins DJ, Balachandran S, Michaels R, Ford R, Mayeda A, Barber GN. Characterization of two evolutionarily conserved, alternatively spliced nuclear phosphoproteins, NFAR-1 and -2, that function in mRNA processing and interact with the double-stranded RNA-dependent protein kinase, PKR. J. Biol. Chem (2001) 276:32300–32312.
[Abstract/Free Full Text] - Buaas FW, Lee K, Edelhoff S, Disteche C, Braun RE. Cloning and characterization of the mouse interleukin enhancer binding factor 3 (Ilf3) homolog in a screen for RNA binding proteins. Mamm. Genome (1999) 10:451–456.[CrossRef][Web of Science][Medline]
- Xu YH, Leonova T, Grabowski GA. Cell cycle dependent intracellular distribution of two spliced isoforms of TCP/ILF3 proteins. Mol. Gen. Metab (2003) 80:426–436.[CrossRef][Web of Science][Medline]
- Corthesy B, Kao PN. Purification by DNA affinity chromatography of two polypeptides that contact the NF-AT DNA binding site in the interleukin 2 promoter. J. Biol. Chem (1994) 269:20682–20690.
[Abstract/Free Full Text] - Langland JO, Kao PN, Jacobs BL. Nuclear factor-90 of activated T-cells: a double-stranded RNA-binding protein and substrate for the double-stranded RNA-dependent protein kinase, PKR. Biochemistry (1999) 38:6361–6368.[CrossRef][Medline]
- Tran H, Schilling M, Wirbelauer C, Hess D, Nagamine Y. Facilitation of mRNA deadenylation and decay by the exosome-bound, DExH protein RHAU. Mol. Cell (2004) 13:101–111.[CrossRef][Web of Science][Medline]
- Xu YH, Grabowski GA. Translation modulation of acid beta-glucosidase in HepG2 cells: participation of the PKC pathway. Mol. Gen. Met (2005) 84:252–264.[CrossRef]
- Bose SK, Sengupta TK, Bandyopadhyay S, Spicer EK. Identification of Ebp1 as a component of cytoplasmic bcl-2 mRNP (messenger ribonucleoprotein particle) complexes. Biochem. J (2006) 396:99–107.[CrossRef][Web of Science][Medline]
- Shim J, Lim H, Yates RJ, Karin M. Nuclear export of NF90 is required for interleukin-2 mRNA stabilization. Mol. Cell (2002) 10:1331–1344.[CrossRef][Web of Science][Medline]
- Shin HJ, Kim SS, Cho YH, Lee SG, Rho HM. Host cell proteins binding to the encapsidation signal epsilon in hepatitis B virus RNA. Arch. Virol (2002) 147:471–491.[CrossRef][Web of Science][Medline]
- Parrott AM, Walsh MR, Reichman TW, Mathews MB. RNA binding and phosphorylation determine the intracellular distribution of nuclear factors 90 and 110. J. Mol. Biol (2005) 348:281–293.[CrossRef][Web of Science][Medline]
- Parrott AM, Walsh MR, Mathews MB. Analysis of RNA:protein interactions in vivo: identification of RNA binding partners of Nuclear Factor 90. In: Methods in Enzymology: Translation Initiation: Extract Systems and Molecular Genetics.—Lorsch JR, ed. (2007) 429. San Diego, CA: Elsevier Inc. Inpress.
- Parker LM, Fierro-Monti I, Mathews MB. Nuclear factor 90 is a substrate and regulator of the eukaryotic initiation factor 2 kinase double-stranded RNA-activated protein kinase. J. Biol. Chem (2001) 276:32522–32530.
[Abstract/Free Full Text] - Reichman TW, Mathews MB. RNA binding and intramolecular interactions modulate the regulation of gene expression by nuclear factor 110. RNA (2003) 9:543–554.
[Abstract/Free Full Text] - Reichman TW, Muniz LC, Mathews MB. The RNA binding protein nuclear factor 90 functions as both a positive and negative regulator of gene expression in mammalian cells. Mol. Cell. Biol (2002) 22:343–356.
[Abstract/Free Full Text] - Strauss WM. Preparation of genomic DNA from mammalian tissue. In: Current Protocols in Molecular Biology.—Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA, Struhl K, eds. (1998) I. New York: John Wiley & Sons, Inc. 2.2.1–2.2.3.
- Ma Y, Mathews MB. Comparative analysis of the structure and function of adenovirus virus-associated RNAs. J. Virol (1993) 67:6605–6617.
[Abstract/Free Full Text] - Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res (2003) 31:3406–3415.
[Abstract/Free Full Text] - Giegerich R, Haase D, Rehmsmeier M. Prediction and visualization of structural switches in RNA. Pac. Symp. Biocomput (1999) 126–137.
- Wassarman DA, Steitz JA. Structural analyses of the 7SK ribonucleoprotein (RNP), the most abundant human small RNP of unknown function. Mol. Cell. Biol (1991) 11:3432–3445.
[Abstract/Free Full Text] - Weinmann R, Roeder RG. Role of DNA-dependent RNA polymerase 3 in the transcription of the tRNA and 5S RNA genes. Proc. Natl Acad. Sci. USA (1974) 71:1790–1794.
[Abstract/Free Full Text] - Graham FL, Smiley J, Russell WC, Nairn R. Characteristics of a human cell line transformed by DNA from human adenovirus type 5. J. Gen. Virol (1977) 36:59–74.
[Abstract/Free Full Text] - Girard A, Sachidanandam R, Hannon GJ, Carmell MA. A germline-specific class of small RNAs binds mammalian piwi proteins. Nature (2006) 442:199–202.[Medline]
- Kim VN. Small RNAs just got bigger: piwi-interacting RNAs (piRNAs) in mammalian testes. Genes Dev (2006) 20:1993–1997.
[Abstract/Free Full Text] - Shankar R, Grover D, Brahmachari SK, Mukerji M. Evolution and distribution of RNA polymerase II regulatory sites from RNA polymerase III dependant mobile Alu elements. BMC Evol. Biol (2004) 4:37.[CrossRef][Medline]
- Sano M, Kato Y, Taira K. Sequence-specific interference by small RNAs derived from adenovirus VAI RNA. FEBS Letts (2006) 580:1553–1564.[CrossRef][Web of Science][Medline]
- Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R, Hannon GJ. Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila. Cell (2007) 128:1089–1103.[CrossRef][Medline]
- Maston GA, Ruvolo M. Chorionic gonadotropin has a recent origin within primates and an evolutionary history of selection. Mol. Biol. Evol (2002) 19:320–335.
[Abstract/Free Full Text] - Kumar S, Filipski A, Swarna V, Walker A, Hedges SB. Placing confidence limits on the molecular age of the human-chimpanzee divergence. Proc. Natl Acad. Sci. USA (2005) 102:18842–18847.
[Abstract/Free Full Text] - Pollard KS, Salama SR, Lambert N, Lambot MA, Coppens S, Pedersen JS, Katzman S, King B, Onodera C, et al. An RNA gene expressed during cortical development evolved rapidly in humans. Nature (2006) 443:167–172.[CrossRef][Medline]
- Saunders LR, Barber GN. The dsRNA binding protein family: critical roles, diverse cellular functions. FASEB J (2003) 17:961–983.
[Abstract/Free Full Text] - Tian B, Bevilacqua PC, Diegelman-Parente A, Mathews MB. The double-stranded-RNA-binding motif: interference and much more. Nat. Rev. Mol. Cell Biol (2004) 5:1013–1023.[CrossRef][Web of Science][Medline]
- Maraia RJ, Chang DY, Wolffe AP, Vorce RL, Hsu K. The RNA polymerase III terminator used by a B1-Alu element can modulate 3' processing of the intermediate RNA product. Mol. Cell. Biol (1992) 12:1500–1506.
[Abstract/Free Full Text] - Lehmann JM, Riethmuller G, Johnson JP. MUC18, a marker of tumor progression in human melanoma, shows sequence similarity to the neural cell adhesion molecules of the immunoglobulin superfamily. Proc. Natl Acad. Sci. USA (1989) 86:9891–9895.
[Abstract/Free Full Text] - Bailey JA, Eichler EE. Primate segmental duplications: crucibles of evolution, diversity and disease. Nature Rev (2006) 7:552–564.
- Hallast P, Rull K, Laan M. The evolution and genomic landscape of CGB1 and CGB2 genes. Mol. Cell. Endo (2007) 260–262:2–11.
- Skotheim RI, Lothe RA. The testicular germ cell tumour genome. APMIS (2003) 111:136–150.[CrossRef][Web of Science][Medline]
- Rulli SB, Ahtiainen P, Makela S, Toppari J, Poutanen M, Huhtaniemi I. Elevated steroidogenesis, defective reproductive organs, and infertility in transgenic male mice overexpressing human chorionic gonadotropin. Endocrinology (2003) 144:4980–4990.
[Abstract/Free Full Text] - Ahtiainen P, Rulli SB, Shariatmadari R, Pelliniemi LJ, Toppari J, Poutanen M, Huhtaniemi IT. Fetal but not adult Leydig cells are susceptible to adenoma formation in response to persistently high hCG level: a study on hCG overexpressing transgenic mice. Oncogene (2005) 24:7301–7309.[CrossRef][Web of Science][Medline]
- Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res (2002) 12:996–1006.
[Abstract/Free Full Text] - Kent WJ. BLAT-the BLAST-like alignment tool. Genome Res (2002) 12:656–664.
[Abstract/Free Full Text]
This article has been cited by other articles:
![]() |
M.-L. Bortolin-Cavaille, M. Dance, M. Weber, and J. Cavaille C19MC microRNAs are processed from introns of large Pol-II, non-protein-coding transcripts Nucleic Acids Res., June 1, 2009; 37(10): 3464 - 3473. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Guan, N. Altan-Bonnet, A. M. Parrott, C. J. Arrigo, Q. Li, M. Khaleduzzaman, H. Li, C.-G. Lee, T. Pe'ery, and M. B. Mathews Nuclear Factor 45 (NF45) Is a Regulatory Subunit of Complexes with NF90/110 Involved in Mitotic Control Mol. Cell. Biol., July 15, 2008; 28(14): 4629 - 4641. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






