Skip Navigation


Nucleic Acids Research Advance Access originally published online on October 25, 2008
Nucleic Acids Research 2009 37(Database issue):D136-D140; doi:10.1093/nar/gkn766
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1019K) Freely available
Right arrow Screen PDF (222K) Freely available
Right arrowOA All Versions of this Article:
37/suppl_1/D136    most recent
gkn766v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Gardner, P. P.
Right arrow Articles by Bateman, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gardner, P. P.
Right arrow Articles by Bateman, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2009, Vol. 37, Database issue D136-D140
© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article appears in the following Nucleic Acids Research issue: Database issue [View the issue table of contents]

Articles

Rfam: updates to the RNA families database

Paul P. Gardner1,*, Jennifer Daub1, John G. Tate1, Eric P. Nawrocki2, Diana L. Kolbe2, Stinus Lindgreen3, Adam C. Wilkinson1, Robert D. Finn1, Sam Griffiths-Jones4, Sean R. Eddy2 and Alex Bateman1

1Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, UK, 2Howard Hughes Medical Institute, Janelia Farm Research Campus, Ashburn, Virginia, USA, 3Center for Bioinformatics, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark and 4Faculty of Life Sciences, The University of Manchester, Manchester M13 9PL, UK

*To whom correspondence should be addressed. Tel: +44 1223 494 983; Fax: +44 1223 494 919; Email: pg5{at}sanger.ac.uk

Received October 1, 2008. Revised October 5, 2008. Accepted October 6, 2008.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
Rfam is a collection of RNA sequence families, represented by multiple sequence alignments and covariance models (CMs). The primary aim of Rfam is to annotate new members of known RNA families on nucleotide sequences, particularly complete genomes, using sensitive BLAST filters in combination with CMs. A minority of families with a very broad taxonomic range (e.g. tRNA and rRNA) provide the majority of the sequence annotations, whilst the majority of Rfam families (e.g. snoRNAs and miRNAs) have a limited taxonomic range and provide a limited number of annotations. Recent improvements to the website, methodologies and data used by Rfam are discussed. Rfam is freely available on the Web at http://rfam.sanger.ac.uk/and http://rfam.janelia.org/.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
Rfam is a database of sequence families of structural RNAs, including non-coding RNA genes as well as cis-regulatory RNA elements. Rfam release 9.0 contains 603 families, each represented by a multiple sequence alignment of known and predicted representative members of the family, annotated with a consensus base-paired secondary structure. This so-called SEED alignment is used to build a covariance model (CM) with the Infernal software (1). Each Rfam covariance model is searched against a nucleotide sequence database, producing a list of putative hits. Matches that score above a curated threshold are then aligned to the CM to produce a so-called FULL alignment. This process is outlined diagrammatically in Figure 1. The Rfam database was developed as a generic approach to the annotation of structured RNA families on genomic sequences (2,3), but it has been widely used as a source of reliable alignments and structures for the purposes of training and benchmarking RNA sequence and secondary structure analysis software.


Figure 1
View larger version (26K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. An outline of the Rfam 9.0 databases and methods. RFAMSEQ is drawn from EMBL excluding only the EST, synthetic and patented divisions. There are 603 Rfam families in release 9.0, which are used to scan RFAMSEQ for homologues using first WU-BLAST filters followed by the more accurate CM-based methods cmsearch and cmalign. This results in 603 FULL alignments annotating 636 138 regions.

 

    DATA AND METHODOLOGICAL IMPROVEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
RFAMSEQ
All Rfam models are searched against an underlying nucleotide sequence database, known as RFAMSEQ, which is derived from the EMBL nucleotide sequence database (4). Prior to release 9.0, RFAMSEQ represented only the various species sections of EMBL. These sections contained only sequences that were considered to be of finished quality and excluded sequences from many important genomes. With release 9.0, RFAMSEQ has been expanded to include the whole genome shotgun (WGS) and environmental sequence (ENV) divisions. These changes have increased the number of sequences in RFAMSEQ by more than an order of magnitude (2 225 030 sequences in Rfam 8.0 versus 29 574 458 sequences in Rfam 9.0).

Sequence filters
In order to make it feasible to search more than 120 gigabases of sequence with hundreds of covariance models in a reasonable time, we use sequence-based filters to prune the search space prior to applying the more accurate and more computationally expensive CMs. One of the primary limitations of the Rfam annotation pipe-line has been the use of BLAST-based sequence filters, which are likely to compromise search sensitivity. In order to address this issue at least partially, NCBI-BLAST has been replaced with a WU-BLAST search, which has been tuned for high sensitivity and low sequence similarity. A benchmark of several homology search tools has shown WU-BLAST to be the more accurate of the two methods on nucleotide data (5). Additionally, in order to make the BLAST filters more similar to profile HMMs, a sequence mask has been applied to each sequence in the alignment. Any nucleotide in an alignment column that has either a low frequency or is an insert relative to the majority of the rest of the sequences is ‘soft masked’ and not used for the BLAST word matches. These masked nucleotides do, however, still contribute to alignments that were seeded in the flanking regions. This approach has resulted in many fewer spurious hits with no detectable cost to sensitivity (data not shown), thus allowing E-value thresholds to be further relaxed. These observations together mean that the BLAST filters have been improved in terms of specificity and sensitivity.

Iteration of families
In order to improve the species and sequence depth of individual Rfam families, more than 370 families have been expanded by an ‘iteration’ process, in which some sequences in the FULL alignment are chosen for promotion to the SEED alignment. The sequences selected from FULL alignments for inclusion in the SEED must pass a series of stringent quality control requirements and be manually approved by a curator. The quality control steps include: ensuring that the sequence and secondary structure are consistent with the existing SEED sequences; the sequence identity with existing SEED sequences falls within 60–95%; the sequence is not truncated with respect to the SEED alignment. The curator also ensures that the new sequences make phylogenetic sense before allowing them to be incorporated into an updated SEED. An example of the utility of iteration is the snoRNA U103 SEED from Rfam 8.1 (accession: RF00188), which contained just three sequences and spanned two eutherian mammals (human and mouse). The SEED in Rfam 9.0 after iteration contains 42 sequences and spans Eutheria, Teleost (ray-finned fishes), Iguanidae, Monotremes, Marsupials, Placentals, Aves and Chondrichthyes (cartilaginous fishes).

Phylogenetic trees have been estimated for both the SEED and FULL alignments. For the majority of the alignments we produced the trees using an accurate maximum-likelihood approach, which included models of indels (6). However, the computational complexity of tree estimation meant that maximum-likelihood was not always possible and hence, where the number of sequences in the alignment was greater than 64, a neighbour-joining method was used instead (7). Large alignments and trees are problematic, both in terms of the computational cost of generation and the challenges of displaying them. Therefore, where the number of sequences in the alignment was greater than 1024, the highly similar sequences were filtered by sequence similarity, resulting in relatively sparse and easily presented trees that required comparatively little computing power to generate.


    PRESENTATION IMPROVEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
Website redesign
We are currently developing a new Rfam website, with the aim of improving the presentation of Rfam data and providing more and better tools for searching the data. The new site is now available from http://rfam.sanger.ac.uk/and can be used to access Rfam 9.0 data. The new site lacks some features of the old site, but we aim to add all existing features and add many new ones over the coming months. Note that, at time of writing, the new website was available only at the Wellcome Trust Sanger Institute (http://rfam.sanger.ac.uk/). The two mirror sites will be updated to run the same website to coincide with the release of Rfam 9.1. The new site provides detailed overviews of Rfam families, including: a snapshot of the latest community-contributed annotation from Wikipedia (see below); tools for viewing and downloading the sequence alignments in various formats; representations of predicted secondary structure (see below); the taxonomic tree for the family; and phylogenetic trees for the SEED and FULL alignments.

Additionally, we provide several search tools in the new site. We currently support interactive searches, allowing a single RNA sequence to be searched against the whole Rfam database, and a batch search tool for searching multiple sequences against Rfam, the results of which are returned to the user via email. A new taxonomy search tool allows the user to find Rfam families that are specific to a given taxonomic level, or those found in a set of taxonomic levels that are specified by a complex, boolean query. For example, the query ‘Drosophila AND Caenorhabditis NOT Mammalia’ returns the two Rfam families (RF00047 and RF00533) that contain sequences from both drosophila and caenorhabditis but no sequences from any mammalian species.

Structure graphics
New graphical representations of secondary structures have been added to the Rfam website, based on software from the Vienna RNA package (8). We now annotate several statistics directly on secondary structure diagrams, including sequence conservation, covariation, base-pair conservation and the maximum CM scores (Figure 2). The sequence conservation metric uses a metric computed for each column in the alignment; this is the frequency of the most common nucleotide in each column (Figure 2A). The covariation metric is based upon that used by RNAalifold (9). For each base pair in the consensus structure and for each pair of sequences in the alignment, the difference in structurally consistent and inconsistent mutations is taken. Each mutation is weighted using a tree-weighting scheme (10) and this value is then normalized by the number of possible mutations (Figure 2B). The base-pair conservation metric is the fraction of canonical base pairs (Watson–Crick and G:U) in any two columns that correspond to a base pair in the consensus structure (Figure 2C). The maximum covariance model score and corresponding nucleotide/base pair is computed for each node in the CM. The resulting sequence, structure and bit-scores are used to produce a marked up secondary structure (Figure 2D).


Figure 2
View larger version (24K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. An example of the new secondary markups used by Rfam. The coronavirus 3'-UTR pseudoknot is shown (Rfam Accession RF00165). We display coloured markups of sequence conservation (A), covariation (B), base-pair conservation also known as the fraction of canonical base pairs (C) and CM scores (D).

 
Wikipedia
The Rfam website now draws textual annotation of RNA families directly from the scientific community, through the online encyclopedia Wikipedia. Any updates to relevant Wikipedia articles are downloaded on a nightly basis using the MediaWiki API, verified by members of the consortium and presented on the Rfam site (11). We consider the resulting articles to be a great improvement on the original static text because they are frequently updated, provide cross links to related articles and are generally considerably more comprehensive and informative than the original Rfam annotations that they replace.


    FUTURE CHALLENGES
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
The rate of discovery of new RNA families is accelerating rapidly, facilitated by advancements in new sequencing technologies (12,13) and targeted computational screens (14–17). Keeping abreast of these updates whilst still ensuring the quality of alignments and secondary structures is an ongoing challenge for Rfam. We continue to evaluate new technologies and techniques as they emerge and will adopt new procedures for building and checking Rfam families as necessary.

We have been actively updating Rfam families and database crosslinks using more specialized RNA databases such as miRBase (18), IRESite (19), Pseudobase (20), snoRNABase (21), the plant snoRNA database (22), TransTerm (23) and the Yeast snoRNA database (24). As a result of these efforts, the next release of Rfam (version 9.1) will contain more than 700 entirely new families, bringing the total number of Rfam families to over 1300.

A new version of Infernal (v1.0) is now available (http://infernal.janelia.org) and we plan to use this latest version to prepare the next major release of Rfam. Testing suggests that, compared with the version used for Rfam 9 (v0.72), v1.0 is faster and slightly more sensitive, whilst introducing for the first time E-values for hits returned from database searches. Although the speed increase will not be sufficient to obviate the need for BLAST filters in the Rfam production pipeline, this remains a major goal for Infernal development. Importantly, Infernal v1.0 is not compatible with the Rfam 9 CM files. Rfam/Infernal users may wish to generate new CMs from Rfam 9 SEED or FULL alignment files.

We have mapped a subset of three-dimensional RNA structures found in the Protein DataBank (PDB) (25) (primarily SRP and ribosomal RNAs) to corresponding sequences in Rfam. In an initial feasibility study, we have demonstrated that RNA sequences can be retrieved from PDB files and mapped to Rfam sequences reliably. The mapping is currently performed using BLAT (26) to detect local regions of high similarity with high specificity. The positions of matches to Rfam entries are transferred to the PDB sequences, allowing us to colour three-dimensional structures as in Figure 3. We intend to roll-out this mapping across all Rfam families and PDB entries using both local similarities and global matches to Rfam models. This sequence-to-structure mapping will allow us to use determined tertiary structures to calculate secondary structure as a quality control for existing families, and catalogue interactions between RNA–RNA and RNA–protein families.

A further area of active research at Rfam is how best to distribute genome annotations. We plan to make annotations available in a variety of formats including the distributed annotation service (DAS) (27), General Feature Format (GFF) (http://song.sourceforge.net/gff3.shtml) and EMBL format, together with links to relevant genome browsers, e.g. ENSEMBL, UCSC and Genome Reviews.


Figure 3
View larger version (50K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3. An example of how PDB structures are displayed in Rfam. In this case, the structure 1l ng, containing the SRP19-7S.S RNA Complex from M. jannaschii, is rendered as cartoons using Jmol. Protein regions are coloured using the following scheme: beta-sheets (yellow), helices (magenta) and unstructured regions (white). RNA bases that match the Rfam model are coloured according to the key given in the web page (not shown here). In this structure, green represents a match to the eukaryotic SRP model, whereas those unmatched bases are coloured orange.

 

    FUNDING
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 
Wellcome Trust (to P.P.G., J.D., J.T., R.D.F. and A.B.) Howard Hughes Medical Institute (to E.P.N., D.L.K. and S.R.E); University of Manchester (to S.G.J.); University of Copenhagen (to S.L.). Funding for open access charge: The Wellcome Trust.

Conflict of interest statement. None declared.


    ACKNOWLEDGEMENTS
 
We would like to thank Ivo Hofacker and Andreas Gruber for secondary structure graphics code, and Zasha Weinberg, Jeffrey Barrick, Chris Brown, Tom Jones for contributions to the database.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 DATA AND METHODOLOGICAL...
 PRESENTATION IMPROVEMENTS
 FUTURE CHALLENGES
 FUNDING
 REFERENCES
 

  1. Nawrocki EP, Eddy SR. Query-dependent banding (QDB) for faster RNA similarity searches. PLoS Comput. Biol. (2007) 3:e56.[CrossRef][Medline]

  2. Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. Rfam: an RNA family database. Nucleic Acids Res. (2003) 31:439–441.[Abstract/Free Full Text]

  3. Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. (2005) 33:D121–D124.[Abstract/Free Full Text]

  4. Cochrane G, Akhtar R, Aldebert P, Althorpe N, Baldwin A, Bates K, Bhattacharyya S, Bonfield J, Bower L, Browne P, et al. Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database. Nucleic Acids Res. (2008) 36:D5–D12.[Abstract/Free Full Text]

  5. Freyhult EK, Bollback JP, Gardner PP. Exploring genomic dark matter: a critical assessment of the performance of homology search methods on noncoding RNA. Genome Res. (2007) 17:117–125.[Abstract/Free Full Text]

  6. Rivas E, Eddy SR. Probabilistic phylogenetic inference with insertions and deletions. PLoS Comput. Biol. (2008) 4:e1000172.[CrossRef][Medline]

  7. Felsenstein J. PHYLIP (Phylogeny Inference Package) version 3.6. (2005) Seattle: University of Washington. Distributed by the author. Department of Genome Sciences.

  8. Gruber AR, Neuböck R, Hofacker IL, Washietl S. The RNAz web server: prediction of thermodynamically stable and evolutionarily conserved RNA structures. Nucleic Acids Res. (2007) 35:W335–W338.[Abstract/Free Full Text]

  9. Hofacker IL, Fekete M, Stadler PF. Secondary structure prediction for aligned RNA sequences. J. Mol. Biol. (2002) 319:1059–1066.[CrossRef][Web of Science][Medline]

  10. Gerstein M, Sonnhammer EL, Chothia C. Volume changes in protein evolution. J. Mol. Biol. (1994) 236:1067–1078.[CrossRef][Web of Science][Medline]

  11. Daub J, Gardner PP, Tate J, Ramskld D, Manske M, Scott WG, Weinberg Z, Griffiths-Jones S, Bateman A. The RNA WikiProject: community annotation of RNA families. RNA (2008) 14:12.

  12. Sittka A, Lucchini S, Papenfort K, Sharma CM, Rolle K, Binnewies TT, Hinton JC, Vogel J. Deep sequencing analysis of small noncoding RNA and mRNA targets of the global post-transcriptional regulator, Hfq. PLoS Genet. (2008) 4:e1000163.[CrossRef][Medline]

  13. Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al. The transcriptional landscape of the mammalian genome. Science (2005) 309:1559–1563.[Abstract/Free Full Text]

  14. Rivas E, Eddy SR. Noncoding RNA gene detection using comparative sequence analysis. BMC Bioinform. (2001) 2:8.[CrossRef][Medline]

  15. Washietl S, Hofacker IL. Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics. J. Mol. Biol. (2004) 342:19–30.[CrossRef][Web of Science][Medline]

  16. Washietl S, Hofacker IL, Stadler PF. Fast and reliable prediction of noncoding RNAs. Proc. Natl Acad. Sci. USA (2005) 102:2454–2459.[Abstract/Free Full Text]

  17. Pedersen JS, Bejerano G, Siepel A, Rosenbloom K, Lindblad-Toh K, Lander ES, Kent J, Miller W, Haussler D. Identification and classification of conserved RNA secondary structures in the human genome. PLoS Comput. Biol. (2006) 2:e33.[CrossRef][Medline]

  18. Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ. miRBase: tools for microRNA genomics. Nucleic Acids Res. (2008) 36:D154–D158.[Abstract/Free Full Text]

  19. Mokrejs M, Vopalensky V, Kolenaty O, Masek T, Feketova Z, Sekyrova P, Skaloudova B, Kriz V, Pospisek M. IRESite: the database of experimentally verified IRES structures (www.iresite.org). Nucleic Acids Res. (2006) 34:D125–D130.[Abstract/Free Full Text]

  20. van Batenburg FH, Gultyaev AP, Pleij CW. PseudoBase: structural information on RNA pseudoknots. Nucleic Acids Res. (2001) 29:194–195.[Abstract/Free Full Text]

  21. Lestrade L, Weber MJ. snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. (2006) 34:D158–D162.[Abstract/Free Full Text]

  22. Brown JW, Echeverria M, Qu LH, Lowe TM, Bachellerie JP, Hüttenhofer A, Kastenmayer JP, Green PJ, Shaw P, Marshall DF. Plant snoRNA database. Nucleic Acids Res. (2003) 31:432–435.[Abstract/Free Full Text]

  23. Jacobs GH, Stockwell PA, Tate WP, Brown CM. Transterm–extended search facilities and improved integration with other databases. Nucleic Acids Res. (2006) 34:D37–D40.[Abstract/Free Full Text]

  24. Piekna-Przybylska D, Decatur WA, Fournier MJ. New bioinformatic tools for analysis of nucleotide modifications in eukaryotic rRNA. RNA (2007) 13:305–312.[Abstract/Free Full Text]

  25. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res. (2000) 28:235–242.[Abstract/Free Full Text]

  26. Kent WJ. BLAT–the BLAST-like alignment tool. Genome Res. (2002) 12:656–664.[Abstract/Free Full Text]

  27. Jenkinson AM, Albrecht M, Birney E, Blankenburg H, Down T, Finn RD, Hermjakob H, Hubbard TJ, Jimenez RC, Jones P, et al. Integrating biological data – the Distributed Annotation System. BMC Bioinform. (2008) 9(Suppl 8):S3.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief Funct Genomic ProteomicHome page
A. Mosig, L. Zhu, and P. F. Stadler
Customized strategies for discovering distant ncRNA homologs
Brief Funct Genomic Proteomic, November 1, 2009; 8(6): 451 - 460.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
P. P. Gardner
The use of covariance models to annotate RNAs in whole genomes
Brief Funct Genomic Proteomic, November 1, 2009; 8(6): 444 - 450.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Grillo, A. Turi, F. Licciulli, F. Mignone, S. Liuni, S. Banfi, V. A. Gennarino, D. S. Horner, G. Pavesi, E. Picardi, et al.
UTRdb and UTRsite (RELEASE 2010): a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs
Nucleic Acids Res., October 30, 2009; (2009) gkp902v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Z. Zhang, J. Yu, D. Li, Z. Zhang, F. Liu, X. Zhou, T. Wang, Y. Ling, and Z. Su
PMRD: plant microRNA database
Nucleic Acids Res., October 6, 2009; (2009) gkp818v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Zhang, J. Wang, S. Huang, X. Zhu, J. Liu, N. Yang, D. Song, R. Wu, W. Deng, G. Skogerbo, et al.
Systematic identification and characterization of chicken (Gallus gallus) ncRNAs
Nucleic Acids Res., October 1, 2009; 37(19): 6562 - 6574.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. Armisen, M. J. Gilchrist, A. Wilczynska, N. Standart, and E. A. Miska
Abundant and dynamically expressed miRNAs, piRNAs, and other small RNAs in the vertebrate Xenopus tropicalis
Genome Res., October 1, 2009; 19(10): 1766 - 1775.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
G. Solda, I. V. Makunin, O. U. Sezerman, A. Corradin, G. Corti, and A. Guffanti
An Ariadne's thread to the identification and annotation of noncoding RNAs in eukaryotes
Brief Bioinform, September 1, 2009; 10(5): 475 - 489.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
C. Lelandais-Briere, L. Naya, E. Sallet, F. Calenge, F. Frugier, C. Hartmann, J. Gouzy, and M. Crespi
Genome-Wide Medicago truncatula Small RNA Analysis Revealed Novel MicroRNAs and Isoforms Differentially Regulated in Roots and Nodules
PLANT CELL, September 1, 2009; 21(9): 2780 - 2796.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Hackenberg, M. Sturm, D. Langenberger, J. M. Falcon-Perez, and A. M. Aransay
miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W68 - W76.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. L. Kolbe and S. R. Eddy
Local RNA structure alignment with incomplete sequence
Bioinformatics, May 15, 2009; 25(10): 1236 - 1243.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. P. Nawrocki, D. L. Kolbe, and S. R. Eddy
Infernal 1.0: inference of RNA alignments
Bioinformatics, May 15, 2009; 25(10): 1335 - 1337.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. A. Fox, A. Ramesh, J. E. Stearns, A. Bourgogne, A. Reyes-Jara, W. C. Winkler, and D. A. Garsin
Multiple posttranscriptional regulatory mechanisms partner to control ethanolamine utilization in Enterococcus faecalis
PNAS, March 17, 2009; 106(11): 4435 - 4440.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Y. Galperin and G. R. Cochrane
Nucleic Acids Research annual Database Issue and the NAR online Molecular Biology Database Collection in 2009
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D1 - D4.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (1019K) Freely available
Right arrow Screen PDF (222K) Freely available
Right arrowOA All Versions of this Article:
37/suppl_1/D136    most recent
gkn766v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Gardner, P. P.
Right arrow Articles by Bateman, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gardner, P. P.
Right arrow Articles by Bateman, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?