Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (314K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (73)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Workman, C.
Right arrow Articles by Krogh, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Workman, C.
Right arrow Articles by Krogh, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

No evidence that mRNAs have lower folding free energies than random sequences with the same dinucleotide distribution
Nucleic Acids Research Pages 4816-4822


No evidence that mRNAs have lower folding free energies than random sequences with the same dinucleotide distribution
Introduction
Materials And Methods
   Data
   Comparison between wild-type and random sequences
   Random sequences
   Statistical significance
Results And Discussion
Conclusions
Acknowledgements
References


No evidence that mRNAs have lower folding free energies than random sequences with the same dinucleotide distribution

Christopher Workman, Anders Krogh*

Center for Biological Sequence Analysis, Technical University of Denmark, Building 208, 2800 Lyngby, Denmark

Received July 12, 1999; Revised and Accepted October 22, 1999

ABSTRACT

This work investigates whether mRNA has a lower estimated folding free energy than random sequences. The free energy estimates are calculated by the mfold program for prediction of RNA secondary structures. For a set of 46 mRNAs it is shown that the predicted free energy is not significantly different from random sequences with the same dinucleotide distribution. For random sequences with the same mononucleotide distribution it has previously been shown that the native mRNA sequences have a lower predicted free energy, which indicates a more stable structure than random sequences. However, dinucleotide content is important when assessing the significance of predicted free energy as the physical stability of RNA secondary structure is known to depend on dinucleotide base stacking energies. Even known RNA secondary structures, like tRNAs, can be shown to have predicted free energies indistinguishable from randomized sequences. This suggests that the predicted free energy is not always a good determinant for RNA folding.

INTRODUCTION

The secondary structure of single-stranded RNA is known to implicate tertiary structure and function. Localized structures in mRNA have been shown to play important functional roles in translational regulation of some genes (1,2). However, it is unclear whether more global structures are formed by mRNA (3). In a recent paper (4) the folding free energy of mRNA from various organisms was predicted by the mfold program (5,6) and compared to that of random sequences with the same nucleotide distribution. The paper concludes that the native sequences on average have a significantly lower predicted free energy than the random sequences, and thus suggests that mRNA is likely to form secondary structure and that this biases the selection of codons.

The methods typically used for predicting RNA structure attempt to minimize the free energy of the molecule by maximizing the number of favorable base pairing interactions (7,8). The main contribution to the stability of RNA secondary structure is the free energy associated with base pairing. This free energy can be well approximated by stacking energies that depend not only on a single base pair, but on two neighboring base pairs (9,10). For instance, a C-G base pair is more favorable than a G-C base pair when stacked on top of an A-U base pair. It is well known that the dinucleotide distribution of DNA sequences is quite different from what would be expected from the nucleotide distribution alone (see for example 11). For this reason the dinucleotide frequency bias in the transcribed RNA should be important for the predicted free energy.

In Seffens and Digby (4) the predicted free energy of folding a native sequence was compared to several types of random sequences, such as random shuffling of the native sequence and randomly shuffled coding regions, but none of them preserved the dinucleotide distribution. The codon-shuffled sequences also tested in the paper are the most conservative in terms of dinucleotide statistics, and they turned out to have a lower average predicted free energy than the randomly shuffled sequences. In this paper we perform a similar analysis, but using random sequences with the same dinucleotide distribution as the native sequence. We find no evidence that (on average) mRNAs have lower predicted free energies than the random sequences.

The method is also tested on two well-known RNA structures: tRNA and the 18S rRNA from the ribosome small subunit. This analysis suggests that the method is not always sensitive enough to discriminate between random sequences and RNA with a known secondary structure. This may also indicate that the fold prediction method is not sensitive enough to detect small localized structures in long mRNA sequences (300-1200 nt).

MATERIALS AND METHODS

Data

An attempt was made to extract all the 51 mRNA sequences used in Seffens and Digby (4) from GenBank release 109.0. However, some were not found (HUMHPBS, HUMIFNAF and PHVLBA) and some had significantly different lengths (ECOALKA and ECODAPA) than that reported in Seffens and Digby (4) and were excluded. This set is listed in Tables 1 and 2. A set of five tRNA sequences was selected from the tRNA Sequence Database (12) and five ribosomal RNA sequences from Van de Peer et al. (13). These sets are listed in Table 3.


Table 1. Comparison of the Z-scores and P values from each of the four random sequence models
The shuffled mono-nuc. and zero order Markov columns correspond to the zero order random models. The shuffled di-nuc. and first order Markov columns show the first order randomization statistics.


Table 2. Comparison of the native folding energies with the averages from each of the four random sequence models
The shuffled mono-nuc. and zero order Markov columns correspond to the zero order random model folding energy averages. The shuffled di-nuc. and first order Markov columns correspond to the folding energy averages of the first order randomizations.


Table 3. Comparison of the native folding energies with the averages from the shuffled mononucleotide and dinucleotide sequence models for selected tRNA and rRNA sequences

Comparison between wild-type and random sequences

The newest version of the mfold program was obtained from Michael Zuker (mfold v.3.0). The mfold program minimizes a free energy function, which sums contributions from stacking, loop lengths, etc. It actually estimates the difference between the free energy of the unfolded state and the folded state. For any given RNA sequence length, the lower the energy estimate the more stable the predicted fold. The minimization is done by a dynamic programming method that always finds the secondary structure with the minimum free energy under a simplified secondary structure model (7,14). Although the various contributions to the free energy are obtained from experiments (9,10), a simplified model of RNA structure that disregards pseudoknots and other tertiary structures usually does not give a 100% correctly predicted secondary structure and occasionally the prediction is completely wrong.

For each native mRNA the minimum free energy prediction was found using mfold for energy calculations at 37°C. Then 10 random sequences of the same length as the native were generated and the minimum free energy prediction was found for each. The average and standard deviation were calculated and from these values the `segment score' was found for each of the native sequences (4). The segment score is the number of standard deviations by which the predicted free energy of the native sequence is lower than the average of the random sequences. This is also called the Z-score, which will be the term used from here on. If the Z-score is positive then the native sequence has a higher minimum predicted free energy than the average of the random sequences and therefore is thought to have less secondary structure than random sequence. Z-scores were calculated for all the native sequences with the four different types of random sequences described below.

Random sequences

Making random sequences with exactly the same number of each nucleotide as the native sequence is trivial, one simply makes a random permutation of the nucleotides. It is less trivial to make a random sequence with exactly the same number of each dinucleotide as the native. We have made two different types of random sequences based on the dinucleotide distribution of the native sequence. Similarly, we made two types of random sequences based on the mononucleotide distribution to be used for comparison. The four types are detailed below.

Zero order Markov. The mononucleotide frequencies, P(b), for the native RNA sequence were calculated and used to generate a random sequence in which bases were simply chosen at random from P(b) until the length of the native sequence was reached (zero order Markov process).

Mononucleotide shuffled. The mononucleotide counts for the native RNA sequence were calculated and the rand-seq program, written by Gerald Hertz (unpublished), was used to generate a shuffled version. Rand-seq is given the sequence length and nucleotide counts and draws at random, weighted by the nucleotide proportions, until all counts are depleted.

First order Markov. From the native sequence the conditional probability P(a|b) of nucleotide a given b is found from the frequencies of the 16 possible pairs a,b. A random sequence is generated by first choosing a random nucleotide x1, and then generate a sequence by choosing each nucleotide xi + 1 from the probability P(xi + 1|xi) (first order Markov process). The process is stopped when the sequence has exactly the same length as the native.

Dinucleotide shuffled. Dinucleotide shuffling is performed in the following way. At each iteration a random trinucleotide is chosen (e.g. ATT). Then all the non-overlapping trinucleotides that begin and end with the same bases (e.g. AAT, ACT, AGT and ATT) are shuffled at random. This is done N times, where N was chosen to be 10 times the length of the native sequence.

The random sequences generated by one of the Markov processes will be `truly' random, meaning that the only relation to the native sequence is the mononucleotide (zero order) or dinucleotide (first order) distribution. However, the exact number of each nucleotide or dinucleotide will fluctuate around the numbers in the native sequence.

The shuffling methods preserve the exact nucleotide or dinucleotide composition. The mononucleotide shuffling produces a truly random sequence (by the definition above). A dinucleotide shuffled sequence will have exactly the same number of each dinucleotide, but may be `less random' due to fewer possible dinucleotide-preserving permutations. In fact, one can think of extreme examples where the sequence is not changed at all (e.g. of the form AAAATTTT), but in the real examples we have looked at, the shuffled sequence has no resemblance to the native, and we believe them to be randomized very effectively. These sequences always start and end with the same nucleotides as the native sequence, but for sequences that are all longer than 300 bases, we consider that to be of no importance. A careful and detailed treatment of dinucleotide shuffling is given in Altschul and Erickson (15), though any dinucleotide-preserving randomization method will be limited by sequences with few dinucleotide permutations, as discussed above. The two first types of randomization (zero order Markov and mononucleotide shuffled) will be referred to as zero order randomizations and the last two as first order randomizations.

Statistical significance

For a single mRNA one would like to know whether the Z-score is significantly different from that of a random sequence. The distribution of Z-scores for random sequences turns out to be fairly well approximated by a normal distribution with mean 0 and standard deviation 1, although this has no theoretical justification (one would rather expect an extreme value distribution because of the minimization over all possible folds). Under this approximation a Z-score of less than -2.33 is significant at 1%.

To account for the deviations from normal we have directly estimated significance levels by order statistics. To determine significance of the Z-score, we must make comparison with the distribution of Z-scores for random sequences with the same lengths and nucleotide statistics as the native sequences. For each of the 46 native sequences a set of 101 random sequences are generated (by one of the four methods) and their free energies estimated from the predicted fold. The following bootstrap type procedure (16) was repeated 2000 times. For each of the 46 groups, a random sequence is selected (the test sequence) and a random subset of 10 sequences is selected from the remaining 100 sequences [there are or [ap]1013 ways to calculate a mean and a variance in this manner]. The Z-score is calculated for the random test sequence from the mean and variance of the other 10 random sequences. The average Z-score over the 46 test sequences is also found. For a given native sequence, the fraction of random sequences with a Z-score lower than that of the native gives a very good approximation to the probability that a score lower than this value occurred at random. We call this the P value for the sequence.

In the limit of large sample size the Z-scores for the random sequences would have mean 0 and a standard deviation of 1. Therefore the average Z-score over 46 sequences would be normally distributed with average 0 and a standard deviation of 1/ to a very good approximation. For a sample of just 10 sequences the standard deviation would be slightly larger than 1, but if we disregard that effect, an average Z-score of less than -2.33/ = -0.344 would be significant at the 1% level. Although this is a good guideline, we have again used the order statistics described above. The fraction of the 2000 average random Z-score values that are lower than the average Z-score for the native sequences is the P value for the average.

RESULTS AND DISCUSSION

The Z-scores and P values for the mRNAs are shown in Table 1 for all four types of random sequences. The predicted free energies used for these calculations are shown in Table 2. The energies reported in Seffens and Digby (4) are all significantly larger than the ones we obtain, which could be due to different parameter settings or differences between the mfold versions. The Z-scores for the two zero order randomizations agree reasonably well with the findings of Seffens and Digby (4), who obtained an average Z-score of -1.23 for randomly shuffled sequences, for which we obtain -1.59 (and -0.83 for the Markov random). This difference may be because of the mRNAs missing in our set, the different version of the mfold program or because of fluctuations due to differences in the random sequences. The important point is that the trend is the same: on average the energies are lower for the native sequences than for the zero order random sequences. The distributions of Z-scores can be seen in Figure 1. The average Z-scores are clearly significant for the zero order shufflings with P values much less than 0.01 (which corresponds to a Z-score of approximately -0.35 as discussed in Materials and Methods).


Figure 1. Histograms of the Z-scores for the four types of random sequences. The top row are for the zero order randomizations, the zero order Markov (left) and shuffled sequences (right), while the bottom row are for the first order randomizations, first order Markov (left) and dinucleotide shuffled (right).

The large difference between the zero order Markov sequences and the shuffled sequences is due to larger fluctuations in the energies for the Markov sequences than the others. This is primarily due to fluctuations in GC content that occur in Markov random sequences whereas sequence shufflings maintain constant GC:AT ratios. When the variance is larger the Z-scores attenuate. The average folding energies for the two types of random sequences are essentially the same. Table 1 also shows the Z-scores for the first order randomizations. Here the average is much smaller: -0.22 for the dinucleotide shuffled sequences and -0.20 for the first order Markov random sequences. Although there is still a negative trend in the score, it is quite clear from the score histograms in Figure 1 that it is insignificant. The histograms look almost symmetrical around 0, whereas a clear skew to the negative side is observed for the zero order randomizations. The order statistics for random sequences described in Materials and Methods gives a probability of 0.11 for a mean Z-score value of less than -0.22 for the dinucleotide shuffled sequences and a probability of 0.13 for a mean Z-score value of less than -0.20 for the first order Markov random sequences.

If we consider the sequences individually, we see that many individual Z-scores calculated with zero order randomizations appear significant while the large majority calculated with first order randomizations do not. If we use 0.01 as a significance threshold, we find 10 significant sequences from either the mononucleotide shuffled or zero order Markov P values. Only three sequences appear significant for either shuffled dinucleotide or first order Markov random and this supports the poor significance values we find for the average Z-scores. We also find three sequences with significantly higher predicted free energies than expected from random (P values >0.99) in either of the first order randomization methods and only one sequence P value above this for the zero order models. The ratios of significantly low to significantly high sequence P values are 10:1 for the zero order methods and 1:1 for the first order methods. In Figure 2 the P values are plotted against the Z-scores for all types of randomizations. Notice that the distribution is close to normal, but the standard deviation is larger than 1, as expected (see Materials and Methods).


Figure 2. Correlation between Z-score and P value for all mRNAs and all four types of randomization. The different types of randomization are indistinguishable. The curve for a normal distribution of mean 0 and a standard deviation of 1 is also displayed.

The process was repeated for five tRNAs and five 18S rRNAs for the monomer shuffled and dimer shuffled random models. The results are shown in Table 3. Surprisingly, the tRNAs do not show a very clear difference between the native sequence and dinucleotide shuffled, and one of the native sequences even has a higher energy than the average of the shuffled ones. Estimating P values from Figure 2 suggests that two of the zero order Z-scores are potentially significant while none of the dinucleotide shuffled Z-scores appear to be significant. For the rRNA there is quite a significant difference for all rRNA sequences. On average the predicted free energy of the native sequence is >8 SD from the random sequences, and they all have predicted free energies lower than the average of the random sequences. For these molecules there is only little difference in the results between the zero and first order randomizations.

The tRNAs are very short sequences (~70 bases) while the rRNAs are long (1500-2000 bases). Both are known to have global secondary structures. We suggest that extended structures of long RNA sequences need to be significantly more energetically favorable than those of short sequences for kinetic reasons. If there exist many suboptimal foldings with free energies close to the native, then they will compete with the native fold and the folding process will be extremely slow (17,18). Another explanation for the difference between tRNA and rRNA behavior is that tertiary structure, which is not taken into account in the predicted free energy calculation, contributes more significantly to the predicted free energy of tRNAs. Furthermore, since it is known that rRNAs have extended global secondary structures and both zero and first order randomization methods are capable of disrupting this global structure, one would expect the same trend in mRNA sequences should they have global secondary structures.

It is interesting that apparently `well-folded' structures can be obtained from short random sequences. In Figure 3 a few structures predicted from shuffled tRNA sequences are shown.


Figure 3. The fold predicted by mfold for an Arabidopsis tRNA (A) compared to secondary structure predictions for randomized versions of the same molecule. Structures (B) and (C) are examples from mononucleotide shuffled randomization while (D) and (E) show dinucleotide shuffled randomizations. Drawings were created with plt22ps by D. Stewart and M. Zuker.

CONCLUSIONS

The comparison of the predicted free energy of mRNA and random sequences with the same dinucleotide distribution shows no significant difference between the two for the 46 mRNAs studied here. This suggests that mRNA, in general, does not form more stable extended structures than random sequences. Considering the inability to distinguish short tRNAs (with well-known secondary structures) from randomized tRNAs, the method is probably not sensitive enough to determine whether mRNAs form localized structures. For example, stable hairpin loops are known to be important in translational control of some genes but are not likely to be detected in the predicted folding energy of an otherwise unfavorable global secondary structure. It was shown that the dinucleotide distribution is important for determining the significance of secondary structure, which is not surprising since stacking energies are crucial for the stability of RNA structure. It is unlikely that the dinucleotide distribution of mRNA is influenced by a need to form secondary structure, because the dinucleotide distribution is generally very similar in other types of DNA from the same organism (i.e. non-coding DNA) and varies greatly between coding regions of different organisms.

ACKNOWLEDGEMENTS

We would like to thank the referees for valuable suggestions and for bringing Altschul and Erickson (15) to our attention. This work was supported by a grant from the Danish National Research Foundation.

REFERENCES

1. de Smit, M.H. (1998) In RNA Structure and Function.Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, pp. 495-540.

2. Pantopoulos, K., Johansson,H.E. and Hentze,M.W. (1994) Prog. Nucleic Acid Res. Mol. Biol., 48, 181-238. MEDLINE Abstract

3. Doktycz, M.J., Larimer,F.W., Pastrnak,M. and Stevens,A. (1998) Proc. Natl Acad. Sci. USA, 95, 14614-14621. MEDLINE Abstract

4. Seffens, W. and Digby,D. (1999) Nucleic Acids Res., 27, 1578-1584. MEDLINE Abstract

5. Zuker, M. (1989) Methods Enzymol., 180, 262-288. MEDLINE Abstract

6. Mathews, D.H., Sabina,J., Zuker,M. and Turner,D.H. (1999) J. Mol. Biol., 288, 911-940. MEDLINE Abstract

7. Zuker, M. and Stiegler,P. (1981) Nucleic Acids Res., 9, 133-148. MEDLINE Abstract

8. Schuster, P., Fontana,W., Stadler,P.F. and Hofacker,I.L. (1994) Proc. R. Soc. Lond. Ser. B Biol. Sci., 255, 279-284.

9. Freier, S.M., Kierzek,R., Jaeger,J.A., Sugimoto,N., Caruthers,M.H., Neilson,T. and Turner,D.H. (1986) Proc. Natl Acad. Sci. USA, 83, 9373-9377. MEDLINE Abstract

10. Turner, D.H., Sugimoto,N., Jaeger,J.A., Longfellow,C.E., Freier,S.M. and Kierzek,R. (1987) Cold Spring Harbor Symp. Quant. Biol., 52, 123-133. MEDLINE Abstract

11. Karlin, S. and Mrázek,J. (1997) Proc. Natl Acad. Sci. USA, 94, 10227-10232. MEDLINE Abstract

12. Sprinzl, M., Horn,C., Brown,M., Ioudovitch,A. and Steinberg,S. (1998) Nucleic Acids Res., 26, 148-153. MEDLINE Abstract

13. Van de Peer, Y., Robbrecht,E., de Hoog,S., Caers,A., De Rijk,P. and De Wachter,R. (1999) Nucleic Acids Res., 27, 179-183. MEDLINE Abstract

14. Nussinov, R., Pieczenik,G., Griggs,J.R. and Kleitman,D.J. (1978) SIAM J. Appl. Math., 35, 68-82.

15. Altschul, S.F. and Erickson,B.W. (1985) Mol. Biol. Evol., 2, 526-538. MEDLINE Abstract

16. Efron, B. and Tibshirani,R.J. (1993) An Introduction to the Bootstrap.Chapman and Hall, London, UK.

17. Uhlenbeck, O.C. (1995) RNA, 1, 4-6. MEDLINE Abstract

18. Treiber, D.K and Williamson,J.R. (1999) Curr. Opin. Struct. Biol., 9, 339-345. MEDLINE Abstract


*To whom correspondence should be addressed. Tel: +45 4525 2477; Fax: +45 4593 1585; Email: krogh{at}cbs.dtu.dk


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: jnl.info{at}oup.co.uk
Last modification:
Copyright© Oxford University Press, 1999.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome Res.Home page
P. Larsson, A. Hinas, D. H. Ardell, L. A. Kirsebom, A. Virtanen, and F. Soderbom
De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: Performance of Markov-dependent genome feature scoring
Genome Res., June 1, 2008; 18(6): 888 - 899.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Simmonds, I. Karakasiliotis, D. Bailey, Y. Chaudhry, D. J. Evans, and I. G. Goodfellow
Bioinformatic and functional analysis of RNA secondary structure elements among different genera of human and animal caliciviruses
Nucleic Acids Res., May 1, 2008; 36(8): 2530 - 2546.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
A. Kapoor, J. Victoria, P. Simmonds, C. Wang, R. W. Shafer, R. Nims, O. Nielsen, and E. Delwart
A Highly Divergent Picornavirus in a Marine Mammal
J. Virol., January 1, 2008; 82(1): 311 - 320.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu
MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W339 - W344.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. Shu, X. Bo, Z. Zheng, and S. Wang
RSRE: RNA structural robustness evaluator
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W314 - W319.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. Washietl, J. S. Pedersen, J. O. Korbel, C. Stocsits, A. R. Gruber, J. Hackermuller, J. Hertel, M. Lindemeyer, K. Reiche, A. Tanzer, et al.
Structured RNAs in the ENCODE selected regions of the human genome
Genome Res., June 1, 2007; 17(6): 852 - 864.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Torarinsson, J. H. Havgaard, and J. Gorodkin
Multiple structural alignment and clustering of RNA sequences
Bioinformatics, April 15, 2007; 23(8): 926 - 932.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
S. NG Kwang Loong and S. K. Mishra
Unique folding of precursor microRNAs: Quantitative evidence and implications for de novo identification
RNA, February 1, 2007; 13(2): 170 - 187.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. L. Jacobs, A. T. Belew, R. Rakauskaite, and J. D. Dinman
Identification of functional, endogenous programmed -1 ribosomal frameshift signals in the genome of Saccharomyces cerevisiae
Nucleic Acids Res., January 12, 2007; 35(1): 165 - 174.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. K. Freyhult, J. P. Bollback, and P. P. Gardner
Exploring genomic dark matter: A critical assessment of the performance of homology search methods on noncoding RNA
Genome Res., January 1, 2007; 17(1): 117 - 125.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
S. D. Baird, M. Turcotte, R. G. Korneluk, and M. Holcik
Searching for IRES
RNA, October 1, 2006; 12(10): 1755 - 1785.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Yu and J. L. Thorne
Dependence among Sites in RNA Evolution
Mol. Biol. Evol., August 1, 2006; 23(8): 1525 - 1537.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. Torarinsson, M. Sawera, J. H. Havgaard, M. Fredholm, and J. Gorodkin
Thousands of corresponding human and mouse genomic regions unalignable in primary sequence contain common RNA structure
Genome Res., July 1, 2006; 16(7): 885 - 889.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. Borenstein and E. Ruppin
Direct evolution of genetic robustness in microRNA
PNAS, April 25, 2006; 103(17): 6593 - 6598.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. A. Shabalina, A. Y. Ogurtsov, and N. A. Spiridonov
A periodic pattern of mRNA secondary structure created by the genetic code.
Nucleic Acids Res., January 1, 2006; 34(8): 2428 - 2437.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. M. Meyer and I. Miklos
Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs
Nucleic Acids Res., November 7, 2005; 33(19): 6338 - 6348.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer
Large-scale structural analysis of the core promoter in mammalian and plant genomes
Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
D. B. Carlini
Context-Dependent Codon Bias and Messenger RNA Longevity in the Yeast Transcriptome
Mol. Biol. Evol., June 1, 2005; 22(6): 1403 - 1411.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
P. CLOTE, F. FERRE, E. KRANAKIS, and D. KRIZANC
Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency
RNA, May 1, 2005; 11(5): 578 - 591.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. H. Havgaard, R. B. Lyngso, G. D. Stormo, and J. Gorodkin
Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%
Bioinformatics, May 1, 2005; 21(9): 1815 - 1824.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Washietl, I. L. Hofacker, and P. F. Stadler
From The Cover: Fast and reliable prediction of noncoding RNAs
PNAS, February 15, 2005; 102(7): 2454 - 2459.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
E. Buratti and F. E. Baralle
Influence of RNA Secondary Structure on the Pre-mRNA Splicing Process
Mol. Cell. Biol., December 15, 2004; 24(24): 10505 - 10514.
[Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. S. Pedersen, I. M. Meyer, R. Forsberg, P. Simmonds, and J. Hein
A comparative method for finding and folding RNA secondary structures within protein-coding regions
Nucleic Acids Res., September 24, 2004; 32(16): 4925 - 4936.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Giegerich, B. Voss, and M. Rehmsmeier
Abstract shapes of RNA
Nucleic Acids Res., September 15, 2004; 32(16): 4843 - 4851.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
P. SIMMONDS, A. TUPLIN, and D. J. EVANS
Detection of genome-scale ordered RNA structure (GORS) in genomes of positive-stranded RNA viruses: Implications for virus evolution and host persistence
RNA, September 1, 2004; 10(9): 1337 - 1351.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J.-V. Chamary and L. D. Hurst
Similar Rates but Different Modes of Sequence Evolution in Introns and at Exonic Silent Sites in Rodents: Evidence for Selectively Driven Codon Usage
Mol. Biol. Evol., June 1, 2004; 21(6): 1014 - 1023.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
E. Buratti, A. F. Muro, M. Giombi, D. Gherbassi, A. Iaconcig, and F. E. Baralle
RNA Folding Affects the Recruitment of SR Proteins by Mouse and Human Polypurinic Enhancer Elements in the Fibronectin EDA Exon
Mol. Cell. Biol., February 1, 2004; 24(3): 1387 - 1400.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
L. Katz and C. B. Burge
Widespread Selection for Local RNA Secondary Structure in Coding Regions of Bacterial Genes
Genome Res., September 1, 2003; 13(9): 2042 - 2051.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Kitagawa, Y. Futamura, and K. Yamamoto
Analysis of the conformational energy landscape of human snRNA with a metric based on tree representation of RNA structures
Nucleic Acids Res., April 1, 2003; 31(7): 2006 - 2013.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Schattner
Searching for RNA genes using base-composition statistics
Nucleic Acids Res., May 1, 2002; 30(9): 2076 - 2082.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. B. Carlini, Y. Chen, and W. Stephan
The Relationship Between Third-Codon Position Nucleotide Content, Codon Bias, mRNA Secondary Structure and Gene Expression in the Drosophilid Alcohol Dehydrogenase Genes Adh and Adhr
Genetics, October 1, 2001; 159(2): 623 - 633.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (314K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (73)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Workman, C.
Right arrow Articles by Krogh, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Workman, C.
Right arrow Articles by Krogh, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?