Nucleic Acids Research, 2001, Vol. 29, No. 1 175-177
© 2001 Oxford University Press
The European Large Subunit Ribosomal RNA Database
1Departement Biochemie, Universiteit Antwerpen (UIA), Universiteitsplein 1, B-2610 Antwerpen, Belgium and 2Fakultät Biologie, Universität Konstanz, D-78457 Konstanz, Germany
Received October 5, 2000; Accepted October 10, 2000.
| ABSTRACT |
|---|
|
|
|---|
The European Large Subunit Ribosomal RNA Database compiles all complete or nearly complete large subunit ribosomal RNA sequences available from public sequence databases. These are provided in aligned format and the secondary structure, as derived by comparative sequence analysis, is included. Additional information about the sequences such as literature references and taxonomic information is also included. The database is available from our WWW server at http://rrna.uia.ac.be/lsu/.
| LARGE SUBUNIT RIBOSOMAL RNA |
|---|
|
|
|---|
The secondary structure model of the large subunit ribosomal RNA (LSU rRNA) consists of a conserved core that is interspersed with a number of variable areas. Most of the secondary structure of the molecule is known, especially for archaeal, bacterial and plastid sequences. In eukaryotes some areas show considerable variability, both in base composition and sequence length, even when closely related species are compared. This makes alignment more difficult, and as a result the secondary structure annotation, which is based on the primary structure alignment, is less reliable or even absent for these areas. The same problem arises with sequences from animal mitochondria and is even more pronounced with those from kinetoplast mitochondria.
Very often the LSU rRNA is not formed by one continuous molecule, but is fragmented in two or more parts. The most widely known fragments are 5.8S rRNA in eukaryotes and 4.5S rRNA in plastids but some LSU rRNA sequences show more extensive fragmentation. One extreme example is the sequence of Euglena gracilis, which consists of 14 pieces (1). In our alignment these fragments are joined in one continuous sequence but information about the individual fragments is also provided.
Most of our knowledge about the secondary and tertiary structure of LSU rRNA is based on comparative sequence analysis. Recently, the crystal structure of the large ribosomal subunit of the archaebacterium Haloarcula marismortui has been resolved at 2.4 Å resolution (2). This model confirmed the secondary structure model derived by comparative sequence analysis and added a vast amount of information about tertiary interactions. Only a very small number of inaccuracies in the comparative model were noted and have been removed from the alignment. These data again confirm the validity of comparative sequence analysis of RNA molecules and serves as a good indication that secondary structure models of other sequences in our database indeed reflect the true secondary structure of these molecules in the ribosome.
Figure 1 shows the complete secondary structure model of LSU rRNA of the Apicomplexan Toxoplasma gondii. The drawing was made using the software package RnaViz (3). In the drawing, nine substructures branching from the central loop and labeled from A to I are clearly visible. Individual helices are numbered clockwise from the 5'- to the 3'-end in each substructure and are given an individual number if a potential branching point separates them. Substructure A, which is formed by helix A1, is absent in eukaryotes but is present in bacteria, plastids and most archaea.
|
| CONTENTS OF THE DATABASE |
|---|
|
|
|---|
The database contains approximately 1400 sequences. Mitochondria and Bacteria have the largest number of representatives with about 750 and 400 sequences, respectively. There are approximately 150 eukaryote, 50 archaebacterial and 70 plastid sequences. The Supplementary Material includes a pie chart showing a graphical overview of the relative amount of representative sequences for all taxa.
The alignment is regularly updated from the EMBL sequence database (4). The LSU rRNA features are extracted from new and updated entries. Using the alignment editor DCSE (5), each new sequence is automatically aligned to the complete sequence of its closest relative and retained only if it shares at least 70% of the homologous positions. Next, the secondary structure pattern is applied to the new sequence. Where necessary, manual corrections are made.
The secondary structure of a sequence is indicated by the insertion of special characters in the alignment. Square brackets are used to indicate helix segments. The number of this helix can be found by looking at the corresponding position in a special sequence called Helix numbering. Internal loops and bulge loops are annotated by braces. Non-standard base pairs (base pairs other than A·U, G·C or G·U) are indicated by round brackets.
In addition to primary and secondary structure, other information such as literature references, accession numbers and taxonomy are also available. The taxonomic descriptions are the same as those adopted by the NCBI (6).
| AVAILABILITY |
|---|
|
|
|---|
The European LSU rRNA Database is accessible at http://rrna.uia.ac.be/lsu/. Sequence alignments can be downloaded in different formats. The DCSE alignment, Distribution format and Printable alignment formats incorporate both primary and secondary structure information. TREECON, NBRF/PIR and EMBL formats only include the primary structure in aligned format.
Three interfaces are available to select the desired sequences. Using the list interface one can select individual sequences that can be downloaded in the distribution format. The forms interface allows the selection of groups of sequences. The query interface allows one to search for sequences by species name, accession number and literature data. It is possible to do searches on the entire database or to limit the search to certain taxa.
Additional information available on the RNA server includes a database with information on PCR and sequencing primers used in our laboratory, detailed variability maps based on substitution rate calibration (7) and links to other interesting sites. Various software packages for molecular biology are also available. These include DCSE (5), an alignment editor that supports secondary structure annotation, RnaViz (3), a program to make RNA secondary structure drawings, and Forcon (8), a program for the conversion between various alignment formats.
From our anonymous ftp server at ftp://rrna.uia.ac.be/pub/lsu/ individual sequences in the distribution format are available. A compressed version of the complete DCSE alignment file and the DCSE reference file is also available.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
The following additional information can be found at NAR Online:
pie chart showing the distribution of different taxa in the database;
detailed secondary structure models of several LSU rRNA species;
nucleotide variability maps of eukaryotic and bacterial LSU rRNA molecules.
| ACKNOWLEDGEMENTS |
|---|
Our research is supported by the Special Research Fund of the University of Antwerp (Belgium). P.D.R. and Y.V.d.P. are Research Fellows of the Fund for Scientific Research (Flanders).
| FOOTNOTES |
|---|
* To whom correspondence should be addressed. Tel: +32 3 820 23 19; Fax: +32 3 820 22 48; Email: dwachter{at}uia.ua.ac.be
| REFERENCES |
|---|
|
|
|---|
-
1 Schnare,M.N., Cook,J.R. and Gray,M.W. (1994) Fourteen internal transcribed spacers in the circular ribosomal DNA of Euglena gracilis. J. Mol. Biol., 215, 8591.
2 Ban,N., Nissen,P., Hansen,J., Moore,P.B. and Steitz,T.A. (2000) The complete atomic structure of the large ribosomal subunit at 2.4Å resolution. Science, 289, 905920.
3 De Rijk,P. and De Wachter,R. (1997) RnaViz, a program for the visualisation of RNA secondary structure. Nucleic Acids Res., 25, 46794684.
4 Baker,W., van den Broek,A., Camon,E., Hingamp,P., Sterk,P., Stoesser,G. and Tuli,M.A. (2000) The EMBL Nucleotide Sequence Database. Nucleic Acids Res., 28, 1923. Updated article in this issue: Nucleic Acids Res. (2001), 29, 1721.
5 De Rijk,P. and De Wachter,R. (1993) DCSE, an interactive tool for sequence alignment and secondary structure research. Comput. Appl. Biosci., 9, 735740.
6 Wheeler,D.L., Chappey,C., Lash,A.E., Leipe,D.D., Madden,T.L., Schuler,G.D., Tatusova,T.A. and Rapp,B.A. (2000) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 28, 1014. Updated article in this issue: Nucleic Acids Res. (2001), 29, 1116.
7 Van de Peer,Y., Van der Auwera,G. and De Wachter,R. (1996) The evolution of stramenopiles and alveolates as derived by "substitution rate calibration" of small ribosomal subunit RNA. J. Mol. Evol., 42, 201210.[Web of Science][Medline]
8 Raes,J. and Van de Peer,Y. (1999) ForCon: a software tool for the conversion of sequence alignments. EMBnet.news, 6.
This article has been cited by other articles:
![]() |
E. Pruesse, C. Quast, K. Knittel, B. M. Fuchs, W. Ludwig, J. Peplies, and F. O. Glockner SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB Nucleic Acids Res., December 18, 2007; 35(21): 7188 - 7196. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L Rabosky, S. C Donnellan, A. L Talaba, and I. J Lovette Exceptional among-lineage variation in diversification rates during the radiation of Australia's most diverse vertebrate clade Proc R Soc B, December 7, 2007; 274(1628): 2915 - 2923. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Razga, J. Koca, A. Mokdad, and J. Sponer Elastic properties of ribosomal RNA building blocks: molecular dynamics of the GTPase-associated center rRNA Nucleic Acids Res., June 9, 2007; 35(12): 4007 - 4017. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Mokdad, M. V. Krasovska, J. Sponer, and N. B. Leontis Structural and evolutionary classification of G/U wobble basepairs in the ribosome Nucleic Acids Res., March 6, 2006; 34(5): 1326 - 1341. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. SMIT, M. YARUS, and R. KNIGHT Natural selection is not required to explain universal compositional patterns in rRNA secondary structure categories RNA, January 1, 2006; 12(1): 1 - 14. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. SCHATTNER, S. BARBERAN-SOLER, and T. M. LOWE A computational screen for mammalian pseudouridylation guide H/ACA RNAs RNA, January 1, 2006; 12(1): 15 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Arisue, M. Hasegawa, and T. Hashimoto Root of the Eukaryota Tree as Inferred from Combined Maximum Likelihood Analyses of Multiple Molecular Sequence Data Mol. Biol. Evol., March 1, 2005; 22(3): 409 - 420. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Smith, T. W. H. Lui, and E. R. M. Tillier Empirical Models for Substitution in Ribosomal RNA Mol. Biol. Evol., March 1, 2004; 21(3): 419 - 427. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Ludwig, O. Strunk, R. Westram, L. Richter, H. Meier, Yadhukumar, A. Buchner, T. Lai, S. Steppi, G. Jobb, et al. ARB: a software environment for sequence data Nucleic Acids Res., February 25, 2004; 32(4): 1363 - 1371. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Brosnan, W. Shin, K. M. Kjer, and R. E. Triemer Phylogeny of the photosynthetic euglenophytes inferred from the nuclear SSU and partial LSU rDNA Int J Syst Evol Microbiol, July 1, 2003; 53(4): 1175 - 1186. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Knudsen and J. Hein Pfold: RNA secondary structure prediction using stochastic context-free grammars Nucleic Acids Res., July 1, 2003; 31(13): 3423 - 3428. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Marquez, D. J. Miller, J. B. MacKenzie, and M. J. H. van Oppen Pseudogenes Contribute to the Extreme Diversity of Nuclear Ribosomal DNA in the Hard Coral Acropora Mol. Biol. Evol., July 1, 2003; 20(7): 1077 - 1086. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. KRASILNIKOV and A. MONDRAGON On the occurrence of the T-loop RNA folding motif in large RNA molecules RNA, June 1, 2003; 9(6): 640 - 643. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Griffiths-Jones, A. Bateman, M. Marshall, A. Khanna, and S. R. Eddy Rfam: an RNA family database Nucleic Acids Res., January 1, 2003; 31(1): 439 - 441. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Lecompte, R. Ripp, J.-C. Thierry, D. Moras, and O. Poch Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale Nucleic Acids Res., December 15, 2002; 30(24): 5382 - 5390. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Schattner Searching for RNA genes using base-composition statistics Nucleic Acids Res., May 1, 2002; 30(9): 2076 - 2082. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wuyts, Y. Van de Peer, and R. De Wachter Distribution of substitution rates and location of insertion sites in the tertiary structure of ribosomal RNA Nucleic Acids Res., December 15, 2001; 29(24): 5017 - 5028. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





