Nucleic Acids Research Advance Access first published online on February 21, 2007
This version published online on March 27, 2007
Nucleic Acids Research, doi:10.1093/nar/gkm069
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Structural Biology |
Modeling RNA tertiary structure motifs by graph-grammars
1Institute for Research in Immunology and Cancer and 2Department of Computer Science and Operations Research, Université de Montréal, PO Box 6128, Downtown station, Montreal, Quebec H3C 3J7, Canada
*To whom correspondence should be addressed. Tel: 514 343 6752; Fax: 514 343 5839; Email: francois.major{at}umontreal.ca
Received September 1, 2006. Revised January 19, 2007. Accepted January 23, 2007.
A new approach, graph-grammars, to encode RNA tertiary structure patterns is introduced and exemplified with the classical sarcinricin motif. The sarcinricin motif is found in the stem of the crucial ribosomal loop E (also referred to as the sarcinricin loop), which is sensitive to the
-sarcin and ricin toxins. Here, we generate a graph-grammar for the sarcin-ricin motif and apply it to derive putative sequences that would fold in this motif. The biological relevance of the derived sequences is confirmed by a comparison with those found in known sarcinricin sites in an alignment of over 800 bacterial 23S ribosomal RNAs. The comparison raised alternative alignments in few sarcinricin sites, which were assessed using tertiary structure predictions and 3D modeling. The sarcinricin motif graph-grammar was built with indivisible nucleotide interaction cycles that were recently observed in structured RNAs. A comparison of the sequences and 3D structures of each cycle that constitute the sarcinricin motif gave us additional insights about RNA sequencestructure relationships. In particular, this analysis revealed the sequence space of an RNA motif depends on a structural context that goes beyond the single base pairing and base-stacking interactions.