Nucleic Acids Research, 2001, Vol. 29, No. 1 194-195
© 2001 Oxford University Press
PseudoBase: structural information on RNA pseudoknots
1Group Theoretical Biology and Phylogenetics, Institute of Evolutionary and Ecological Sciences, Leiden University, Kaiserstraat 63, 2311GP Leiden, The Netherlands and 2Leiden Institute of Chemistry, Leiden University, PO Box 9502, 2300RA Leiden, The Netherlands
Received September 8, 2000; Accepted September 12, 2000.
| ABSTRACT |
|---|
|
|
|---|
PseudoBase is a database containing structural, functional and sequence data related to RNA pseudoknots. It can be reached at http://wwwbio.LeidenUniv.nl/
Batenburg/PKB.html. For each pseudoknot, thirteen items are stored, for example the relevant sequence, the stem positions of the pseudoknot, the EMBL accession number of the sequence and the support that can be given regarding the reliability of the pseudoknot. Since the last publication, information on sizes of the stems and the loops in the pseudoknots has been added. Also added are alternative entries that produce surveys of where the pseudoknots are, sorted according to stem size or loop size. | INTRODUCTION |
|---|
|
|
|---|
Pseudoknots are widely occurring structural motifs in RNA. First described in the early eighties as part of tRNA-like structures in plant viral RNAs, pseudoknots were recognized as a general principle of RNA folding (1,2). Pseudoknots play an important role in a number of RNA functions (e.g. in ribosomal frameshifting, regulation of translation and splicing) and they are also essential elements of the topology of many structural RNAs such as, for example, ribosomal RNAs or ribozymes. Therefore, a database with a collection of pseudoknot structures has been created (3). This database can be useful for many studies dealing with structurefunction relationships in RNA molecules.
| RECENT DEVELOPMENTS |
|---|
|
|
|---|
The sizes of stems and loops are very important structural characteristics of pseudoknots. Therefore, we have decided to include this information in all database entries; see labels Stem sizes and Loop sizes in Figure 1.
|
Determination of stem- and loop-size in classical or so-called H-(hairpin) pseudoknots is straightforward. However for more complicated structures several decisions had to be made for a consistent nomenclature.
Zero-size regions
Unfortunately, there is not yet a standard nomenclature for pseudoknot structural elements, even for relatively simple structures. In principle, the simplest pseudoknot, the classical or so-called H-pseudoknot, may contain two stems (regions A and C in Fig. 2) and three loops (regions B, D and E in Fig. 2).
|
Such stems and loops can be numbered in the 5'
3' direction as S1, S2 and L1, L2, L3 (for example see 4). However, the most studied type of pseudoknot is with coaxial stacking of stems so that L2 (region D in Fig. 2) is absent. In that case, region E is mostly called L2 instead of L3 (for examples see 2,58). Such a shift in nomenclature can even occur within one publication where this loop is called L3 on one page (for pseudoknots with nucleotides present in the D region) and called L2 on another page when there are no nucleotides in region D (for example see 4,9). We have decided always to assign label L2 to region D for reasons of consistent and unambiguous nomenclature in our database. If both stems are adjacent this would be L2 with size = 0; this ensures that structurally similar regions E are always called L3. A similar consistency is applied to more complex pseudoknots with coaxial stacking where we recognized such areas as Li with size 0. See for example Figure 3 where L1, L2 and L3 are regions with proper nucleotides but L4 has size 0.
|
Pseudoknots with internal structures
Classical pseudoknots have simple loops where all nucleotides are unpaired. Consequently the word loop is appropriate for this part of the pseudoknot, so there is no argument over defining loop size as the total number of nucleotides in that region. However, in more complicated tertiary structures a loop can contain substructures. Such a region between two pseudoknot stems can have several stems with their own internal loops, hairpin loops and multibranch loops. Here one can even argue whether loop is a suitable name or not. Nevertheless, for the sake of simplicity we have decided to consider this whole region a pseudoknot loop and abbreviate this in the context of PseudoBase as Li.
Consequently, stems that are not pseudoknotted with other stems and their corresponding loops are not included in our PseudoBase enumeration of stems and loops. For example, L5 in Figure 3 contains a hairpin with an internal loop that is not registered in PseudoBase as part of the stem and loop enumeration.
Furthermore, the nucleotides in such complicated loops are all counted as contributions to the size of the particular pseudoknot loop, whether they are unpaired or not. For example, L5 in Figure 3 is equal to 33 nt although only 17 of them are single stranded.
Stems with internal loops
Stems raised another nomenclature problem. Classical pseudoknot nomenclature is based on two simple, regular stems, the first one beginning at the 5'-end is called S1 and the second one ending nearest to the 3'-end is called S2 (for examples see 1,2,5). For more complex structures the question arises: what are the pseudoknot stems, what are their stem numbers and what are their sizes? Should we count a stem with a bulge-loop halfway as one stem or two? And what about a stem interrupted by an interior loop, or even by a multibranch loop?
Again we decided to opt for a simple nomenclature and discard bulges, interior loops and multibranch loops. For the size of such stems we did not count the nucleotides as this would raise dilemmas when one stem half contains a bulge, but counted the number of interactions instead, that is to say the number of nucleotide pairs. In other words, if a pseudoknot stem could be considered as interrupted with two or more duplexes forming a pseudoknot with some other stem, its stem size is defined as the total number of base pairs in the duplexes. By this decision we do not have to consider whether to include the bulge and loop nucleotides or not, and we think that this number is a more suitable statistic than the total number of nucleotides in interrupted stems. For illustration see Figure 3; the first stem S1 is simple and has size 4, but the next stem S2 has size 5 because the unpaired nucleotides 8 and 34 in the internal loop are ignored.
Sorting of pseudoknots
Using the adopted definitions we have developed a program that computes the Si and Li sizes for the submitted pseudoknots and added this information into each pseudoknot data page.
Furthermore, this program also generated html pages with surveys of the available pseudoknot items sorted on stem sizes or loop sizes.
| ACCESS |
|---|
|
|
|---|
The database is freely accessible at http://wwwbio.leidenuniv.nl/~Batenburg/PKB.html. This presents the introduction page, which leads to the pages for retrieval or submission or an About page for more information. Users of PseudoBase are requested to cite this publication and are encouraged to provide corrections and suggestions for improvement to the first author by email (batenburg{at}rulsfb.leidenuniv.nl) and to submit new pseudoknot data using the automatic submission form that is supplied on the Submission-page (http://wwwbio.leidenuniv.nl/~Batenburg/PKBPut.html).
| ACKNOWLEDGEMENT |
|---|
A.P.G. was supported by the European Commission (project BIO4-98-0189).
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +31 71 527 4972; Fax: +31 71 527 4900; Email: batenburg{at}rulsfb.leidenuniv.nl
| REFERENCES |
|---|
|
|
|---|
-
1 Rietveld,K., van Poelgeest,R., Pleij,C.W.A., van Boom,J.H. and Bosch,L. (1982) The tRNA-like structure at the 3' terminus of turnip yellow mosaic virus RNA. Differences and similarities with canonical tRNA. Nucleic Acids Res., 10, 19291946.
2 Pleij,C.W.A., Rietveld,K. and Bosch,L. (1985) A new principle of RNA folding based on pseudoknotting. Nucleic Acids Res., 13, 17171731.
3 van Batenburg,F.H.D., Gultyaev,A.P., Pleij,C.W.A., Ng,J. and Oliehoek,J. (2000) PseudoBase: a database with RNA pseudoknots. Nucleic Acids Res., 28, 201204.
4 Hilbers,C.W., Michiels,P.J.A. and Heus,H.A. (1998) New developments in structure determination of pseudoknots. Biopolymers, 48, 137153.[Web of Science][Medline]
5 Abrahams,J.P., van den Berg,M., van Batenburg,E. and Pleij,C. (1990) Prediction of RNA secondary structure, including pseudoknotting, by computer simulation. Nucleic Acids Res., 18, 30353044.
6 Deiman,B.A.L.M. and Pleij,C.W.A. (1997) Pseudoknots: a vital feature in viral RNA. Semin. Virol., 8, 166175.
7 Mans,R.M.W. and Pleij,C.W.A. (1993) RNA pseudoknots. In Eckstein,F. and Lilley,D.M.J. (eds) Nucleic Acids and Molecular Biology. Springer Verlag, Berlin, Vol.7, pp. 250270.
8 Pleij,C.W.A. (1994) RNA pseudoknots. Curr. Opin. Struct. Biol., 4, 337344.
9 ten Dam,E., Pleij,K and Draper,D. (1992) Structural and functional aspects of RNA pseudoknots. Biochemistry, 31, 1166511676.[Medline]
This article has been cited by other articles:
![]() |
Y. Byun and K. Han PseudoViewer3: generating planar drawings of large-scale RNA structures with pseudoknots Bioinformatics, June 1, 2009; 25(11): 1435 - 1437. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. P. Gardner, J. Daub, J. G. Tate, E. P. Nawrocki, D. L. Kolbe, S. Lindgreen, A. C. Wilkinson, R. D. Finn, S. Griffiths-Jones, S. R. Eddy, et al. Rfam: updates to the RNA families database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D136 - D140. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Theis, J. Reeder, and R. Giegerich KnotInFrame: prediction of -1 ribosomal frameshift events Nucleic Acids Res., October 1, 2008; 36(18): 6013 - 6020. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pennell, E. Manktelow, A. Flatt, G. Kelly, S. J. Smerdon, and I. Brierley The stimulatory RNA of the Visna-Maedi retrovirus ribosomal frameshifting signal is an unusual pseudoknot with an interstem element RNA, July 1, 2008; 14(7): 1366 - 1377. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Byun and K. Han PseudoViewer: web application and web service for visualizing RNA pseudoknots and secondary structures. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W416 - W422. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. REN, B. RASTEGARI, A. CONDON, and H. H. HOOS HotKnots: Heuristic prediction of RNA secondary structures including pseudoknots RNA, October 1, 2005; 11(10): 1494 - 1504. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Huang, C. L. Lu, and H.-T. Chiu A heuristic approach for detecting RNA H-type pseudoknots Bioinformatics, September 1, 2005; 21(17): 3501 - 3508. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Han and Y. Byun PSEUDOVIEWER2: visualization of RNA pseudoknots of any type Nucleic Acids Res., July 1, 2003; 31(13): 3432 - 3440. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Griffiths-Jones, A. Bateman, M. Marshall, A. Khanna, and S. R. Eddy Rfam: an RNA family database Nucleic Acids Res., January 1, 2003; 31(1): 439 - 441. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Schattner Searching for RNA genes using base-composition statistics Nucleic Acids Res., May 1, 2002; 30(9): 2076 - 2082. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




