Nucleic Acids Research, 2003, Vol. 31, No. 1 502-504
© 2003 Oxford University Press
RNABase: an annotated database of RNA structures
Department of Biophysics, Johns Hopkins University, Jenkins Hall, 3400 N. Charles Street, Baltimore, MD 21218, USA
*To whom correspondence should be addressed. Tel: +410 5167244; Fax: +410 5164118; Email: grose@jhu.edu
Received July 16, 2002; Accepted August 13, 2002
ABSTRACT
RNABase is a unified database of all three-dimensional structures containing RNA deposited in either the Protein Data Bank (PDB) or the Nucleic Acid Database (NDB). For each structure, RNABase contains a brief summary as well as annotation of conformational parameters, identification of possible model errors, Ramachandran-style conformational maps and classification of ribonucleotides into conformers. These same analyses can also be performed on structures submitted by users. To facilitate access, structures are automatically placed into a variety of functional and structural categories, such as ribozymes and pseudoknots. RNABase can be freely accessed on the web at http://www.rnabase.org. We are committed to maintaining this database indefinitely.
INTRODUCTION
The pace of RNA structural determination has been accelerating rapidly (Fig. 1). Currently, there are over 500 publicly available structures containing one or more ribonucleotides, including 45 structures of ribozymes and ribozyme fragments as well as 85 structures of partial or complete tRNAs (alone and in complex with other molecules). Inconveniently, neither the Protein Data Bank (PDB) (1) nor the Nucleic Acid Database (NDB) (2) alone contains a complete set of these structures. Furthermore, few structural validations and analyses on the structures are provided by either database.
To address these issues, we have constructed RNABase, a web-driven relational database. For each structure, RNABase contains a brief summary, annotation of conformational parameters, identification of possible model errors, Ramachandran-style conformational maps and classification of ribonucleotides into conformers.
RNABase ENTRIES AND ANALYSES
Lists of structures in RNABase can be sorted by experimental technique, by structural and functional category (e.g. ribozymes, aptamers, tetraloops) or by conformational outlier rate. Classification by technique and into structural and functional categories is based on keywords in author-supplied data such as the title and keywords. In addition to these general lists, RNABase provides both simple and advanced search tools.
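As an illustration of keyword-based categorization, the following minimal Python sketch assigns categories by case-insensitive substring matching against header text. The keyword table and the function name `classify` are hypothetical; the keyword lists actually used by RNABase are not given here.

```python
# Hypothetical category -> keyword table (an assumption for illustration;
# the real RNABase keyword lists are not described in the text).
CATEGORY_KEYWORDS = {
    "ribozyme": ("ribozyme", "hammerhead"),
    "aptamer": ("aptamer",),
    "tetraloop": ("tetraloop",),
    "pseudoknot": ("pseudoknot",),
    "tRNA": ("trna", "transfer rna"),
}


def classify(header_text):
    """Return the set of categories whose keywords occur in the header text."""
    text = header_text.lower()
    return {
        category
        for category, keywords in CATEGORY_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    }


if __name__ == "__main__":
    print(classify("CRYSTAL STRUCTURE OF A HAMMERHEAD RIBOZYME"))
```

Plain substring matching is the simplest possible choice and can over-match (a keyword embedded in a longer word still counts), so a production classifier would probably need word-boundary checks or a curated exclusion list.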
For each structure, the RNABase summary page contains information extracted from the structure's header records, such as title, authors, experimental technique and keywords. This page also provides links to the corresponding entries in the PDB (1), NDB (2) and MMDB (3), to related Medline records (4) and to the Image Library of Biological Molecules (5).
In addition to providing powerful search and browsing facilities, it is critical to perform structure analyses and validations. With the development of complete Ramachandran-style maps for RNA (6), it is increasingly possible to classify RNA conformers and to identify likely conformational errors. For each RNA residue in every structure, a complete set of conformational parameters is calculated, including the backbone dihedral angles (α, β, γ, δ, ε and ζ), the sidechain dihedral angle (χ), the ribose dihedral angles (ν0, ν1, ν2, ν3 and ν4), and the ribose puckering phase and amplitude. Using these parameters, complete sets of conformational plots are generated for every structure in RNABase. Furthermore, residues whose dihedral or puckering parameters fall outside of allowed areas are specifically flagged as probable model errors.
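The two kinds of parameter named above can be computed from atomic coordinates with standard formulas. The sketch below is an illustration under stated assumptions, not the RNABase code: `dihedral` uses the usual atan2 form for four atom positions, and `pseudorotation` uses the Altona-Sundaralingam relation between the five ribose ring torsions and the pucker phase and amplitude. Both function names are hypothetical.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral(p1, p2, p3, p4):
    """Signed dihedral angle in degrees about the p2-p3 bond."""
    b1, b2, b3 = _sub(p2, p1), _sub(p3, p2), _sub(p4, p3)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def pseudorotation(nu0, nu1, nu2, nu3, nu4):
    """Return (phase, amplitude) in degrees from the five ring torsions.

    Assumes the Altona-Sundaralingam convention:
    tan P = ((nu4 + nu1) - (nu3 + nu0)) / (2 * nu2 * (sin 36 + sin 72)).
    """
    s = math.sin(math.radians(36.0)) + math.sin(math.radians(72.0))
    y = (nu4 + nu1) - (nu3 + nu0)
    x = 2.0 * nu2 * s
    phase = math.degrees(math.atan2(y, x)) % 360.0
    # Equivalent to nu2 / cos(P), but stable when P is near 90 or 270 degrees.
    amplitude = math.hypot(x, y) / (2.0 * s)
    return phase, amplitude
```

Using `atan2` rather than a plain arctangent places the phase in the correct quadrant when ν2 is negative, so no separate 180° correction is needed. Ring torsions generated from an ideal pseudorotation with a phase of 18° (within the C3'-endo range) and an amplitude of 38° are recovered by `pseudorotation` to within rounding error.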
These conformational parameters are also used to generate discrete conformational codes (www.rnabase.org/confsum). Each conformational code describes a subset of the multi-dimensional conformational space available to nucleotides. Correspondingly, residues with the same conformational code have similar conformations. Using these codes, frequently or rarely occurring conformations can easily be identified.
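To make the idea of a discrete code concrete, the sketch below bins each dihedral angle into one of six 60° ranges and concatenates one letter per angle, then counts how often each code occurs. The bin width, the letter alphabet and the function names are all hypothetical; the coding scheme actually used by RNABase is documented at the URL above and may differ substantially.

```python
from collections import Counter

BIN_WIDTH = 60.0        # hypothetical bin width in degrees
BIN_LETTERS = "abcdef"  # one letter per 60-degree bin over 0-360


def conformational_code(angles):
    """Map a sequence of dihedral angles (degrees) to a string code."""
    code = []
    for angle in angles:
        index = int((angle % 360.0) // BIN_WIDTH)
        # Guard against floating-point wrap-around to exactly 360.0.
        code.append(BIN_LETTERS[min(index, len(BIN_LETTERS) - 1)])
    return "".join(code)


def code_frequencies(residue_angles):
    """Count how many residues share each conformational code."""
    return Counter(conformational_code(angles) for angles in residue_angles)
```

With such a table, `code_frequencies(...).most_common()` lists the frequently occurring conformations first, and codes with a count of one are candidates for rare conformations. Fixed bin edges can separate two nearly identical conformations that straddle a boundary, which is one reason a real scheme might use data-derived regions instead.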
RNABase META-ANALYSIS
RNABase also contains a number of meta-analyses of cumulative properties across the entire database (http://www.rnabase.org/metaanalysis/). Firstly, the total cumulative number of available RNA-containing structures is plotted (Fig. 1), showing that the rate of structure determination has been accelerating. It is interesting to note that the first several dozen structures were all solved by X-ray crystallography, but by the late 1990s the number of structures determined by nuclear magnetic resonance spectroscopy nearly equaled the number determined crystallographically. However, the last two years have seen the balance again shift toward the use of crystallography for RNA structure determination.
We note that the passage of time has not resulted in any noticeable improvement in the rate of conformational outliers on Ramachandran-type plots (Fig. 2). Neither structures determined by NMR nor those determined by crystallography have shown any consistent trend over the last two decades. Lastly, the number of conformational outliers varies with resolution for structures determined by X-ray diffraction, as expected (Fig. 3). Structures determined by NMR have outlier rates comparable to structures determined to 4 Å resolution by X-ray diffraction.
RNABase ARCHITECTURE
RNABase is built on a free-software platform consisting of the Red Hat Linux operating system, the Apache web server and the PostgreSQL relational database system. In addition, a number of custom scripts written in the Python scripting language parse and analyze the structures. Lastly, PHP scripts dynamically generate the interface seen at the RNABase site. The entire system was designed to be largely self-maintaining and to require minimal ongoing work from its administrators.
The data in RNABase are assembled by a Python script that parses all entries from the PDB and NDB. Because the NDB and PDB use the same nomenclature for both ribonucleotides and deoxyribonucleotides, residue names alone cannot distinguish RNA from DNA. RNABase therefore classifies a structure as containing RNA if it has one or more A, C, T, G or U residues with an O2' atom (O2* in PDB notation).
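The RNA-detection rule described above can be sketched in a few lines of Python. This is a minimal illustration, not the RNABase script itself: the function name `contains_rna` is hypothetical, the column slices follow the fixed-width PDB coordinate record layout (atom name in columns 13-16, residue name in columns 18-20), and checking HETATM records as well as ATOM records is an assumption.

```python
RNA_CANDIDATE_RESIDUES = {"A", "C", "G", "T", "U"}
O2_PRIME_NAMES = {"O2'", "O2*"}  # newer and older PDB spellings of the 2'-oxygen


def contains_rna(pdb_lines):
    """Return True if any A/C/G/T/U residue carries an O2' (or O2*) atom."""
    for line in pdb_lines:
        # Assumption: inspect both ATOM and HETATM coordinate records.
        if not line.startswith(("ATOM", "HETATM")):
            continue
        atom_name = line[12:16].strip()
        residue_name = line[17:20].strip()
        if residue_name in RNA_CANDIDATE_RESIDUES and atom_name in O2_PRIME_NAMES:
            return True
    return False


if __name__ == "__main__":
    import sys

    with open(sys.argv[1]) as handle:
        print(contains_rna(handle))
```

Returning on the first match keeps the scan cheap for large structures, since a single ribonucleotide is enough to place the entry in the database.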
DISCUSSION
RNABase is an integrated, annotated database of three-dimensional RNA structures. It has been designed to facilitate access to structures of RNA molecules by structural, functional or experimental features. Furthermore, by providing interactive structure analysis, RNABase may help the quality of RNA structures to improve more consistently in the future. We anticipate that RNABase will continue to expand and evolve. These developments will be reported on the RNABase News page (http://www.rnabase.org/news/).
ACKNOWLEDGEMENTS
Supported by grants from the NIH (GM29458) and the Mathers foundation.
REFERENCES
1. Berman,H.M., Westbrook,J., Feng,Z., Gilliland,G., Bhat,T.N., Weissig,H., Shindyalov,I.N. and Bourne,P.E. (2000) The Protein Data Bank. Nucleic Acids Res., 28, 235–242.
2. Berman,H.M., Olson,W.K., Beveridge,D.L., Westbrook,J., Gelbin,A., Demeny,T., Shieh,S.H., Srinivasan,A.R. and Schneider,B. (1992) The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acids. Biophys. J., 63, 751–759.
3. Wang,Y., Addess,K.J., Geer,L., Madej,T., Marchler-Bauer,A., Zimmerman,D. and Bryant,S.H. (2000) MMDB: 3D structure data in Entrez. Nucleic Acids Res., 28, 243–245.
4. Wheeler,D.L., Church,D.M., Lash,A.E., Leipe,D.D., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Tatusova,T.A., Wagner,L. and Rapp,B.A. (2001) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 29, 11–16.
5. Reichert,J., Jabs,A., Slickers,P. and Sühnel,J. (2000) The IMB Jena Image Library of Biological Macromolecules. Nucleic Acids Res., 28, 246–249.
6. Murthy,V.L., Srinivasan,R., Draper,D.E. and Rose,G.D. (1999) A complete conformational map for RNA. J. Mol. Biol., 291, 313–327.