Nucleic Acids Research, 2003, Vol. 31, No. 13 3423-3428
© 2003 Oxford University Press
Pfold: RNA secondary structure prediction using stochastic context-free grammars
BiRC (Bioinformatics Research Center), University of Aarhus, Høegh Guldbergsgade 10, Building 090, 8000 Århus C, Denmark 1 Department of Statistics, Oxford University, The Peter Medawar Building for Pathogen Research, South Parks Road, Oxford OX1 3SY, UK
*To whom correspondence should be addressed at Department of Zoology, Box 118525, University of Florida, Gainesville, FL 32611-8525, USA. Tel: +1 3523927552; Fax: +1 3523923704; Email: bk{at}birc.dk
Received February 15, 2003; Revised and Accepted April 5, 2003
| ABSTRACT |
|---|
|
|
|---|
RNA secondary structures are important in many biological processes and efficient structure prediction can give vital directions for experimental investigations. Many available programs for RNA secondary structure prediction only use a single sequence at a time. This may be sufficient in some applications, but often it is possible to obtain related RNA sequences with conserved secondary structure. These should be included in structural analyses to give improved results. This work presents a practical way of predicting RNA secondary structure that is especially useful when related sequences can be obtained. The method improves a previous algorithm based on an explicit evolutionary model and a probabilistic model of structures. Predictions can be done on a web server at http://www.daimi.au.dk/~compbio/pfold.
| INTRODUCTION |
|---|
|
|
|---|
RNA structures are essential in many biological processes and are often conserved in evolution. Examples of such conserved structures are found in tRNA (1), rRNA (2,3), tmRNA (4), RNase P RNA (5) and SRP RNA (6). Many computational methods have been developed for predicting RNA structures. Early algorithms were made by Nussinov et al. (7) and Zuker and Stiegler (8). Zuker's energy calculations have been further improved (911) and are probably the most widely used RNA secondary structure prediction method today (with the MFOLD program).
The work of Knudsen and Hein (12) (here denoted as the KH-99 algorithm) combines an explicit evolutionary model of RNA sequences with a probabilistic model for secondary structures. It assumes an alignment and gives one common structural prediction for all the sequences.
This work improves the KH-99 algorithm primarily by making it faster and more robust toward alignment errors. A thorough evaluation of performance under different circumstances is also included. This new method called Pfold is available through a web-based server www.daimi.au.dk/~compbio/pfold.
| METHODS |
|---|
|
|
|---|
Pfold is based on the KH-99 algorithm, which was only useful for a limited number of sequences due to its large computation time. This work makes the algorithm practically useful for larger numbers of sequences. The main concerns are treatment of gaps, computational speed and robustness.
The KH-99 algorithm
The KH-99 algorithm uses a stochastic context-free grammar (SCFG) to produce a prior probability distribution of RNA structures. Given an alignment and a phylogenetic tree relating the sequences, posterior probabilities of the structures can be calculated using the insideoutside algorithm (13). The posterior probability is based on individual probabilities for alignment columns or pairs of columns in the case of a base-pair.
Column probabilities are calculated using the likelihood approach by Felsenstein (14). The evolution of column pairs is modelled using a rate matrix for base-pairs (i.e. a 16 by 16 matrix). The most likely structure is found using the CYK algorithm (15). The tree is estimated using a maximum likelihood approach in the SCFG model described.
Both the evolutionary and structural parameters of the KH-99 model are based on extensive tRNA and large subunit rRNA databases (12).
Gaps
Treating gaps in an appropriate way is a returning problem in biological sequence analysis. The best way to deal with gaps would probably be to make an explicit evolutionary model for insertions and deletions, and use that in the sequence analysis. Unfortunately, such calculations are often complicated, as in statistical alignment (16,17). Two simpler ideas are to treat gaps as a fifth nucleotide or to treat them as unknown nucleotides.
When using an evolutionary model, a number of problems arise from treating a gap as a fifth nucleotide. First, the frequency of gaps will depend on how many sequences are being analysed and on their evolutionary distance. Furthermore, gaps cannot be viewed as evolving in the same way as a nucleotide, thus the rates of evolution are difficult to specify. This approach does, however, have the advantage that it includes the potentially useful information from the insertions and deletions. This method was successfully used in the RNA part of the non-coding RNA gene finding algorithm by Rivas and Eddy (18). When treating gaps as unknown nucleotides, a gapped sequence position should have probability one for any nucleotide. This has the advantage that the probability of a column with gaps is equal to the probability of the same column in an alignment without the gapped sequences, and the tree correspondingly pruned (Fig. 1).
|
Pfold uses the latter approach, which is often done in situations where different alignment columns are looked at individually (19). In RNA structure prediction, pairs of columns are analysed together, which can give rise to some difficulties: when treating gaps as unknowns, gaps can form pairs with nucleotides (the top of Fig. 2). This problem was handled by removing columns where less than 75% of the sequences have nucleotides (the bottom of Fig. 2).
|
Unknown nucleotides
In biological sequences, some nucleotides may be unknown or only partial information may be available. These situations can be treated by letting the unknown nucleotide have a probability of one for each of the possible nucleotides. This means that if a given position is known to be a pyrimidine, its probability of being a U is set to one, and its probability of being a C is also set to one. Using this method, any symbols of the extended nucleotide alphabet can be treated correctly by Pfold. This is in accordance with Felsenstein (14).
Tree estimation
In the KH-99 algorithm, the tree was estimated through a maximum likelihood method using the SCFG model. While this gave good results and was interesting with respect to phylogenetic analysis, it was slow. A much faster method is to estimate the tree first. This can be done using standard methods.
In Pfold, pairwise distances between sequences are calculated using maximum likelihood. The rate matrix used should correspond as closely as possible to what is used in the KH-99 algorithm. Since the tree is calculated before the structure has been estimated, a single rate matrix has to be used for this purpose. It was made from the KH-99 algorithm by summing the loop rate matrix and a reduction of the base-pair rate matrix to single positions. The rate matrices were weighted with the probabilities that a given position is in a loop region or a stem region, respectively. The tree is calculated from pairwise distances using the neighbour joining algorithm (20) and adjusting branch lengths to maximum likelihood estimates. This gives a large increase in speed, since the insideoutside calculations only need to be performed once, as opposed to the multiple iterations used in the KH-99 method for estimating the tree.
Robustness
Pfold assumes that all sequences have exactly the same structure. This means that a single sequence with a slightly different structure might ruin a prediction. The same situation applies for alignment errors and sequencing errors. When a single error might change a prediction significantly, the method is not robust (the top of Fig. 3).
|
A way to make the algorithm more robust is to let any nucleotide have a small probability of being any other nucleotide. The interpretation of this is most obvious in terms of sequencing errors, but the method works for alignment errors and structure differences, too. Figure 3 shows how introduction of this probability changes the results. In Pfold this probability was set to 1%.
Partially known structure
Often, something is known about the structure being predicted and including this knowledge in the analysis can give improved results. Different kinds of knowledge can be used:
- That two given columns form a pair together.
- That a given column is involved in a pair.
- That a given column is unpaired.
This can all be included in the calculations by letting the structures that do not satisfy the previous knowledge have a probability of zero. This approach does not change relative prior distributions of allowed structures.
As a side note, no loops of length two are allowed in this implementation, as opposed to the KH-99 algorithm. This was implemented by disallowing pairs between positions of distance less than four.
What structure should be chosen?
A prediction program should of course report a single prediction as the best. The CYK algorithm for finding the most likely parsing from the grammar is often used (15). An alternative to this has been chosen here: Pfold reports the nested structure with the highest expected number of correct predictions. Appendix 1 (available as Supplementary Material) describes how this structure can be found. Notice that this removes some of the problems associated with using the CYK algorithm on ambiguous grammars.
Once the best nested structure has been chosen, the reliability of the prediction is evaluated for each position. This is done by finding the probability that each prediction (specific pair or unpaired) was correct, given the model and the data. The variables from the insideoutside algorithm can easily be used to give this information (Appendix 1 available as Supplementary Material). Knowing which parts of the prediction are reliable is very important when using the prediction in further work (Fig. 4).
|
Finding a single best structure may not always give all the information that would be useful. To give an overview of the prediction, a dot plot is produced as well. It is a square plot of pairing probabilities for all different pairs. Each probability is represented by the size of a dot in the appropriate position. Probabilities of not pairing are shown on the sides of the plot (Fig. 5). These calculations resemble the work by McCaskill (21).
|
Making the obvious individual structure changes
Sometimes, a structure in a single sequence will have a slightly longer stem than its homologues. This can be incorporated in the prediction by extending a stem if immediate neighbours can form base pairs. Another obvious change is to remove non-standard base pairs from individual sequences. This is done after the structure predictions given by Pfold.
| INPUT AND OUTPUT |
|---|
|
|
|---|
The input to the web server is an alignment of up to 40 sequences and 500 positions. The alignment should be given in the FASTA format with gaps represented by hyphens (-). Previously known structural elements can be incorporated in the prediction by adding a specific sequence with the relevant information (see web site for details). When a prediction is done, the web server returns an email to the user. This email contains a link to a web page with the results. On the web page, the following is available in multiple formats:
- A summary of the input.
- The calculated tree.
- The predicted structure given as a bracket notation.
- Reliabilities of individual predictions.
- A dot plot.
| RESULTS |
|---|
|
|
|---|
For all predictions in this section, two versions were made. The first version was made using the alignment from the database of the sequences being analysed. This alignment was assumed to be correct. In the second version, sequences from databases were aligned using the ClustalW alignment program by Thompson et al. (22) to imitate a realistic scenario of RNA structure prediction.
Test sets
A number of test sets were made (Table 1). The sets A, B and C are used in evaluating the prediction accuracy as a function of the number of sequences used in the analysis. Test set D is used to show how prediction accuracy varies as a function of evolutionary distance.
|
Prediction accuracy
An evaluation of the prediction accuracy is shown in Figure 6. The accuracy was calculated as the percentage of positions for which the secondary structure was correctly predicted. For a pairing position to be counted as correct, the position of the predicted pair had to be correct. Sequences in the test sets used here have a maximum pairwise distance of 0.50 units in the Jukes--Cantor model (23). This means that the sequences are quite diverse, but still possible to align without too many errors.
|
As expected, prediction accuracy rises with the number of sequences used, as more covariance information becomes available. This shows that when related sequences are available, they should generally be used in the structure prediction (exceptions to this are discussed below). An accuracy of
75% is obtainable with six sequences and Pfold can cope with many more sequences, so even higher accuracies are possible. Pfold was also used on the original test set of Knudsen and Hein (12). There were a few small differences in the results compared to the KH-99 algorithm and the overall performance was slightly better in this new version of the algorithm.
Evolutionary distance effect
There are two effects of evolutionary distance on prediction accuracy: large distances imply much covariation information, but it also means that sequences are difficult to align. Another issue with large evolutionary distances is that secondary structures may not be conserved between the sequences. Since Pfold assumes a single common structure for all sequences, this can pose a problem.
These effects are illustrated in Figure 7. When using correct alignments, the accuracy rises with distance, as the evolutionary information increases. The accuracy levels off at
80%, which seems to be the maximum obtainable average for two sequences of this type using Pfold. The graph made from the ClustalW alignments show how accuracy increases until an evolutionary distance of
0.60. After this, the accuracy drops due to alignment errors. At a distance of
0.90 the quality of alignments become so low that the structures might as well be predicted individually.
|
Speed
The algorithmical improvements made in this work reduces the computation time substantially. The largest improvement is from estimating a single evolutionary tree and fixing it, rather than estimating all the branch lengths under the RNA model. For details on the computation times, see Appendix 2 (available as Supplementary Material).
| DISCUSSION |
|---|
|
|
|---|
Some aspects of this method remain to be explored, as described by Knudsen and Hein (12). These include: base stacking interactions, a grammar more closely describing real RNA structures and other models for base-pair evolution.
If base-stacking interactions and a better grammar is incorporated in Pfold, the prediction accuracies should become close to the MFOLD results for single sequences (911), since the methods resemble each other closely in that situation. For multiple sequences, Pfold should still be able to perform even better.
When structures are conserved in evolution, inclusion of information from multiple related sequences improves predictions. A number of methods have been developed that combine energy calculations with evolutionary information through a given alignment (2427). These methods have proven quite reliable but lack an explicit evolutionary model.
As emphasised by this work, an important aspect of RNA structure prediction is the alignment problem. In methods that depend on a sequence alignment, the success of the method is closely linked to the quality of the alignment. Some work has been done in the field of RNA structural sequence alignment (28). Rivas and Eddy (18) developed an RNA model that takes the alignment of two sequences into consideration through a pair-SCFG. They did, however, assume the alignment to be given since computational time would be prohibitive in using their method to align sequences. The RNAGA method by Chen et al. (29) predicts consensus structures without trying to align the sequences which might be a useful approach to avoiding the alignment problem.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at NAR Online.
| ACKNOWLEDGEMENTS |
|---|
B.K. would like to thank the University of Florida, the Carlsberg Foundation (grant no. ANS-0604/20) and the Danish National Science Research Council (grant no. 21-00-0283) for their support. B.K. also thanks Christian N. S. Pedersen for his great help in answering questions about Unix and programming. We acknowledge the help of Ebbe S. Andersen in making this method practically useful and in illustrating the web server. Finally, we thank Jakob S. Pedersen for helpful comments on the manuscript.
| REFERENCES |
|---|
|
|
|---|
- Sprinzl,M., Horn,C., Brown,M., Ioudovitch,A. and Steinberg,S. (1998) Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res., 26, 148153.
[Abstract/Free Full Text] - Wuyts,J., De Rijk,P., Van de Peer,Y., Winkelmans,T. and De Wachter,R. (2001) The European large subunit ribosomal RNA database. Nucleic Acids Res., 29, 175177.
[Abstract/Free Full Text] - Wuyts,J., Van de Peer,Y., Winkelmans,T. and De Wachter,R. (2002) The European database on small subunit ribosomal RNA. Nucleic Acids Res., 30, 183185.
[Abstract/Free Full Text] - Zwieb,C., Gorodkin,J., Knudsen,B., Burks,J. and Wower,J. (2003) tmRDB (tmRNA database). Nucleic Acids Res., 31, 446447.
[Abstract/Free Full Text] - Brown,J.W. (1999) The ribonuclease P database. Nucleic Acids Res., 27, 314.
[Abstract/Free Full Text] - Rosenblad,M.A., Gorodkin,J., Knudsen,B., Zwieb,C. and Samuelsson,T. (2003) Nucleic Acids Res., 31, 363364.
[Abstract/Free Full Text] - Nussinov,R., Pieczenik,G., Griggs,J.R. and Kleitman,D.J. (1978) Algorithms for loop matchings. SIAM J. Appl. Math., 35, 6882.[CrossRef]
- Zuker,M. and Stiegler,P. (1981) Optimal computer folding of large RNA sequences using thermodynamic and auxiliary information. Nucleic Acids Res., 9, 133148.
[Abstract/Free Full Text] - Zuker,M. (1989) Computer prediction of RNA structure. Methods Enzymol., 180, 262288.[Web of Science][Medline]
- Walter,A.E., Turner,D.H., Kim,J., Lyttle,M.H., Mueller,P., Mathews,D.H. and Zuker,M. (1994) Coaxial stacking of helixes enhances binding of oligoribonucleotides and improves predictions of RNA folding. Proc. Natl Acad. Sci. USA, 91, 92189222.
[Abstract/Free Full Text] - Mathews,D.H., Sabina,J., Zuker,M. and Turner,D.H. (1999) Expanded sequence dependence of thermodynamic parameters improve prediction of RNA secondary structure. J. Mol. Biol., 288, 911940.[CrossRef][Web of Science][Medline]
- Knudsen,B. and Hein,J. (1999) RNA secondary structure prediction using stochastic context-free grammars and evolutionary history. Bioinformatics, 15, 446454.
[Abstract/Free Full Text] - Lari,K. and Young,S.J. (1990) The estimation of stochastic context-free grammars using the insideoutside algorithm. Comput. Speech Lang., 4, 3556.
- Felsenstein,J. (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol., 17, 368376.[CrossRef][Web of Science][Medline]
- Durbin,R., Eddy,S., Krogh,A. and Mitchison,G. (1998) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, Cambridge, UK.
- Thorne,J.L., Kishino,H. and Felsenstein,J. (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. J. Mol. Evol., 33, 114124.[CrossRef][Web of Science][Medline]
- Hein,J., Wiuf,C., Knudsen,B., Møller,M. and Wibling,G. (2000) Statistical alignment: computational properties, homology testing and goodness-of-fit. J. Mol. Biol., 302, 265279.[CrossRef][Web of Science][Medline]
- Rivas,E. and Eddy,S.R. (2001) Noncoding RNA gene detection using comparative sequence analysis. BMC Bioinformatics, 2, 8.[CrossRef][Medline]
- Thorne,J.L., Goldman,N. and Jones,D.T. (1996) Combining protein evolution and secondary structure. Mol. Biol. Evol., 13, 666673.[Abstract]
- Saitou,N. and Nei,M. (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol., 4, 406425.[Abstract]
- McCaskill,J.S. (1990) The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers, 29, 11051119.[CrossRef][Web of Science][Medline]
- Thompson,J.D., Higgins,D.G. and Gibson,T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 46734680.
[Abstract/Free Full Text] - Jukes,T.H. and Cantor,C.R. (1969) Evolution of protein molecules. In Munro,H.N. (ed.), Mammalian Protein Metabolism. Academic Press, New York, pp. 21132.
- Hofacker,I.L. and Stadler,P.F. (1999) Automatic detection of conserved base pairing patterns in RNA virus genomes. Comput. Chem., 23, 401414.[CrossRef][Web of Science][Medline]
- Juan,V. and Wilson,C. (1999) RNA secondary structure prediction based on free energy and phylogenetic analysis. J. Mol. Biol., 289, 935947.[CrossRef][Web of Science][Medline]
- Lück,R., Gräf,S. and Steger,G. (1999) ConStruct: a tool for thermodynamic controlled prediction of conserved secondary structure. Nucleic Acids Res., 27, 42084217.
[Abstract/Free Full Text] - Hofacker,I.L., Fekete,M. and Stadler,P.F. (2002) Secondary structure prediction for aligned RNA sequences. J. Mol. Biol., 319, 10591066.[CrossRef][Web of Science][Medline]
- Gorodkin,J., Heyer,L.J. and Stormo,G.D. (1997) Finding the most significant common sequence and structure motifs in a set of RNA sequences. Nucleic Acids Res., 25, 37243732.
[Abstract/Free Full Text] - Chen,J.-H., Len,S.-Y. and Maizel,J.V. (2000) Prediction of common secondary structures of RNAs: a genetic algorithm approach. Nucleic Acids Res., 28, 991999.
[Abstract/Free Full Text]
This article has been cited by other articles:
![]() |
W. Li, E. Manktelow, J. C. von Kirchbach, J. R. Gog, U. Desselberger, and A. M. Lever Genomic analysis of codon, sequence and structural conservation with selective biochemical-structure mapping reveals highly conserved and dynamic structures in rotavirus RNAs with potential cis-acting functions Nucleic Acids Res., July 29, 2010; (2010) gkq663v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Lou and P. Clote Thermodynamics of RNA structures by Wang-Landau sampling Bioinformatics, June 15, 2010; 26(12): i278 - i286. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Engelen and F. Tahi Tfold: efficient in silico prediction of non-coding RNA secondary structures Nucleic Acids Res., April 1, 2010; 38(7): 2453 - 2466. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Menzel, J. Gorodkin, and P. F. Stadler The tedious task of finding homologous noncoding RNA genes RNA, December 1, 2009; 15(12): 2075 - 2082. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Bernhart and I. L. Hofacker From consensus structure prediction to RNA gene finding Briefings in Functional Genomics, November 1, 2009; 8(6): 461 - 471. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. J. Schroeder Advances in RNA Structure Prediction from Sequence: New Tools for Generating Hypotheses about Viral RNA Structure-Function Relationships J. Virol., July 1, 2009; 83(13): 6326 - 6334. [Full Text] [PDF] |
||||
![]() |
K. Sato, M. Hamada, K. Asai, and T. Mituyama CENTROIDFOLD: a web server for RNA secondary structure prediction Nucleic Acids Res., July 1, 2009; 37(suppl_2): W277 - W280. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. K. Mimouni, R. B. Lyngso, S. Griffiths-Jones, and J. Hein An Analysis of Structural Influences on Selection in RNA Genes Mol. Biol. Evol., January 1, 2009; 26(1): 209 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kapoor, J. Victoria, P. Simmonds, E. Slikas, T. Chieochansin, A. Naeem, S. Shaukat, S. Sharif, M. M. Alam, M. Angez, et al. A highly prevalent and genetically diversified Picornaviridae genus in South Asian children PNAS, December 23, 2008; 105(51): 20482 - 20487. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Galli, A. Lai, S. Corvasce, F. Saladini, C. Riva, L. Deho, I. Caramma, M. Franzetti, L. Romano, M. Galli, et al. Recombination analysis and structure prediction show correlation between breakpoint clusters and RNA hairpins in the pol gene of human immunodeficiency virus type 1 unique recombinant forms J. Gen. Virol., December 1, 2008; 89(12): 3119 - 3125. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Wang, I. Barr, F. Guo, and C. Lee Evidence of a novel RNA secondary structurein the coding region of HIV-1 pol gene RNA, December 1, 2008; 14(12): 2478 - 2488. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. K. Bradley, L. Pachter, and I. Holmes Specific alignment of structured RNA: stochastic grammars and sequence annealing Bioinformatics, December 1, 2008; 24(23): 2677 - 2683. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Seemann, J. Gorodkin, and R. Backofen Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments Nucleic Acids Res., November 1, 2008; 36(20): 6355 - 6362. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Thompson, E. Buratti, M. de Wispelaere, and M. Tepfer Structural and functional characterization of the 5' region of subgenomic RNA5 of cucumber mosaic virus J. Gen. Virol., July 1, 2008; 89(7): 1729 - 1738. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Katoh and H. Toh Recent developments in the MAFFT multiple sequence alignment program Brief Bioinform, July 1, 2008; 9(4): 286 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. B. Do, C.-S. Foo, and S. Batzoglou A max-margin model for efficient simultaneous alignment and folding of RNA sequences Bioinformatics, July 1, 2008; 24(13): i68 - i76. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Barquist and I. Holmes xREI: a phylo-grammar visualization webserver Nucleic Acids Res., July 1, 2008; 36(suppl_2): W65 - W69. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kiryu, T. Kin, and K. Asai Rfold: an exact algorithm for computing local base pairing probabilities Bioinformatics, February 1, 2008; 24(3): 367 - 373. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lindgreen, P. P. Gardner, and A. Krogh MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing Bioinformatics, December 15, 2007; 23(24): 3304 - 3311. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer A practical guide to the art of RNA gene prediction Brief Bioinform, November 1, 2007; 8(6): 396 - 414. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Andersen, A. Lind-Thomsen, B. Knudsen, S. E. Kristensen, J. H. Havgaard, E. Torarinsson, N. Larsen, C. Zwieb, P. Sestoft, J. Kjems, et al. Semiautomated improvement of RNA alignments RNA, November 1, 2007; 13(11): 1850 - 1859. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Cai, B. Hartnett, C. Gustafsson, and J. Peccoud A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts Bioinformatics, October 15, 2007; 23(20): 2760 - 2767. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xu, Y. Ji, and G. D. Stormo RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment Bioinformatics, August 1, 2007; 23(15): 1883 - 1891. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Khaladkar, V. Bellofatto, J. T. L. Wang, B. Tian, and B. A. Shapiro RADAR: a web server for RNA data analysis and research Nucleic Acids Res., July 13, 2007; 35(suppl_2): W300 - W304. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kiryu, Y. Tabei, T. Kin, and K. Asai Murlet: a practical multiple alignment tool for structural RNA sequences Bioinformatics, July 1, 2007; 23(13): 1588 - 1598. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Washietl, J. S. Pedersen, J. O. Korbel, C. Stocsits, A. R. Gruber, J. Hackermuller, J. Hertel, M. Lindemeyer, K. Reiche, A. Tanzer, et al. Structured RNAs in the ENCODE selected regions of the human genome Genome Res., June 1, 2007; 17(6): 852 - 864. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. U. T. Hellen and S. de Breyne A Distinct Group of Hepacivirus/Pestivirus-Like Internal Ribosomal Entry Sites in Members of Diverse Picornavirus Genera: Evidence for Modular Exchange of Functional Noncoding RNA Elements by Recombination J. Virol., June 1, 2007; 81(11): 5850 - 5863. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kiryu, T. Kin, and K. Asai Robust prediction of consensus secondary structures using averaged base pairing probability matrices Bioinformatics, February 15, 2007; 23(4): 434 - 441. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lindgreen, P. P. Gardner, and A. Krogh Measuring covariation in RNA alignments: physical realism improves information measures Bioinformatics, December 15, 2006; 22(24): 2988 - 2995. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Voss Structural analysis of aligned RNAs Nucleic Acids Res., November 14, 2006; 34(19): 5471 - 5481. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. D. Baird, M. Turcotte, R. G. Korneluk, and M. Holcik Searching for IRES RNA, October 1, 2006; 12(10): 1755 - 1785. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dalli, A. Wilm, I. Mainz, and G. Steger STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time Bioinformatics, July 1, 2006; 22(13): 1593 - 1599. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Gesell and A. von Haeseler In silico sequence evolution with site-specific interactions along phylogenetic trees Bioinformatics, March 15, 2006; 22(6): 716 - 722. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. BINDEWALD and B. A. SHAPIRO RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers. RNA, March 1, 2006; 12(3): 342 - 352. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Andersen, M. A. Rosenblad, N. Larsen, J. C. Westergaard, J. Burks, I. K. Wower, J. Wower, J. Gorodkin, T. Samuelsson, and C. Zwieb The tmRDB and SRPDB resources Nucleic Acids Res., January 1, 2006; 34(suppl_1): D163 - D168. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Kasprzak, E. Bindewald, and B. A. Shapiro Structural polymorphism of the HIV-1 leader region explored by computational methods Nucleic Acids Res., December 20, 2005; 33(22): 7151 - 7163. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer and I. Miklos Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs Nucleic Acids Res., November 7, 2005; 33(19): 6338 - 6348. [Abstract] [Full Text] [PDF] |
||||
![]() |
D Baralle and M Baralle Splicing in action: assessing disease causing sequence changes J. Med. Genet., October 1, 2005; 42(10): 737 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Siebert and R. Backofen MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons Bioinformatics, August 15, 2005; 21(16): 3352 - 3359. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Jenkins, M. Bitner-Glindzicz, S. Malcolm, C.-C. A. Hu, J. Allison, P. J.D. Winyard, A. M. Gullett, D. F.M. Thomas, R. A. Belk, S. A. Feather, et al. De Novo Uroplakin IIIa Heterozygous Mutations Cause Human Renal Adysplasia Leading to Severe Kidney Failure J. Am. Soc. Nephrol., July 1, 2005; 16(7): 2141 - 2149. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. P. Gardner, A. Wilm, and S. Washietl A benchmark of multiple sequence alignment programs upon structural RNAs Nucleic Acids Res., April 28, 2005; 33(8): 2433 - 2439. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Pedersen, I. M. Meyer, R. Forsberg, P. Simmonds, and J. Hein A comparative method for finding and folding RNA secondary structures within protein-coding regions Nucleic Acids Res., September 24, 2004; 32(16): 4925 - 4936. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. KNIGHT, A. BIRMINGHAM, and M. YARUS BayesFold: Rational 2{degrees} folds that combine thermodynamic, covariation, and chemical data for aligned RNA sequences RNA, September 1, 2004; 10(9): 1323 - 1336. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Touzet and O. Perriquet CARNAC: folding families of related RNAs Nucleic Acids Res., July 1, 2004; 32(suppl_2): W142 - W145. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ruan, G. D. Stormo, and W. Zhang ILM: a web server for predicting RNA secondary structures with pseudoknots Nucleic Acids Res., July 1, 2004; 32(suppl_2): W146 - W149. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Ying, H. Luo, J. Luo, and W. Li RDfolder: a web server for prediction of RNA secondary structure Nucleic Acids Res., July 1, 2004; 32(suppl_2): W150 - W153. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. S. Andersen, S. A. Contera, B. Knudsen, C. K. Damgaard, F. Besenbacher, and J. Kjems Role of the Trans-activation Response Element in Dimerization of HIV-1 RNA J. Biol. Chem., May 21, 2004; 279(21): 22243 - 22249. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




















