Nucleic Acids Research, 2002, Vol. 30, No. 9 2076-2082
© 2002 Oxford University Press
Searching for RNA genes using base-composition statistics
Center for Biomolecular Science and Engineering, 227 Sinsheimer Laboratories, University of California, 1156 High Street, Santa Cruz, CA 95064, USA
The hypothesis that genomic regions rich in non-protein-coding RNAs (ncRNAs) can be identified using local variations in single-base and dinucleotide statistics has been investigated. (G+C)%, (GC)% difference, (AT)% difference and dinucleotide-frequency statistics were compared among seven classes of ncRNAs and three genomes. Significant variations were observed in (G+C)% and, in Methanococcus jannaschii, in the frequency of the dinucleotide CG. Screening programs based on these two base-composition statistics were developed. With (G+C)% screening alone, a 1% fraction of the M.jannaschii genome containing all 44 known transfer RNAs, ribosomal RNAs and signal recognition particle RNAs could be identified. When (G+C)% combined with CG dinucleotide-frequency screening was used, 43 of the 44 known M.jannaschii structural ncRNAs were again identified, while the number of presumably false hits overlapping a known or putative protein-coding gene was reduced from 15 to 6. In addition, 19 candidate ncRNAs were identified including one with significant homology to several known archaeal RNaseP RNAs.
* Email: schattner{at}cse.ucsc.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. T. Tran, F. Zhou, S. Marshburn, M. Stead, S. R. Kushner, and Y. Xu De novo computational prediction of non-coding RNA genes in prokaryotic genomes Bioinformatics, November 15, 2009; 25(22): 2897 - 2905. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. P. Gardner The use of covariance models to annotate RNAs in whole genomes Brief Funct Genomic Proteomic, November 1, 2009; 8(6): 444 - 450. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang and G. J. Olsen Messenger RNA processing in Methanocaldococcus (Methanococcus) jannaschii RNA, October 1, 2009; 15(10): 1909 - 1916. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Geissmann, C. Chevalier, M.-J. Cros, S. Boisset, P. Fechter, C. Noirot, J. Schrenzel, P. Francois, F. Vandenesch, C. Gaspin, et al. A search for small noncoding RNAs in Staphylococcus aureus reveals a conserved sequence motif for regulation Nucleic Acids Res., September 28, 2009; (2009) gkp668v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang, E. Li, and G. J. Olsen Protein-coding gene promoters in Methanocaldococcus (Methanococcus) jannaschii Nucleic Acids Res., June 1, 2009; 37(11): 3588 - 3601. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Pichon and B. Felden Small RNA gene identification and mRNA target predictions in bacteria Bioinformatics, December 15, 2008; 24(24): 2807 - 2813. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Li, C. I. Reich, and G. J. Olsen A whole-genome approach to identifying protein binding sites: promoters in Methanocaldococcus (Methanococcus) jannaschii Nucleic Acids Res., December 1, 2008; 36(22): 6948 - 6958. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Larsson, A. Hinas, D. H. Ardell, L. A. Kirsebom, A. Virtanen, and F. Soderbom De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: Performance of Markov-dependent genome feature scoring Genome Res., June 1, 2008; 18(6): 888 - 899. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Muller, F. Leclerc, I. Behm-Ansmant, J.-B. Fourmann, B. Charpentier, and C. Branlant Combined in silico and experimental identification of the Pyrococcus abyssi H/ACA sRNAs and their target sites in ribosomal RNAs Nucleic Acids Res., May 1, 2008; 36(8): 2459 - 2475. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer A practical guide to the art of RNA gene prediction Brief Bioinform, November 1, 2007; 8(6): 396 - 414. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Thebault, S. de Givry, T. Schiex, and C. Gaspin Searching RNA motifs and their intermolecular contacts with constraint networks Bioinformatics, September 1, 2006; 22(17): 2074 - 2080. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Samuels, R. J. Boys, D. A. Henderson, and P. F. Chinnery A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate Nucleic Acids Res., October 15, 2003; 31(20): 6043 - 6052. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. McCutcheon and S. R. Eddy Computational identification of non-coding RNAs in Saccharomyces cerevisiae by comparative genomics Nucleic Acids Res., July 15, 2003; 31(14): 4119 - 4128. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Hershberg, S. Altuvia, and H. Margalit A survey of small RNA-encoding genes in Escherichia coli Nucleic Acids Res., April 1, 2003; 31(7): 1813 - 1820. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gottesman Stealth regulation: biological circuits with small RNA switches Genes & Dev., November 15, 2002; 16(22): 2829 - 2842. [Full Text] [PDF] |
||||
![]() |
R. J. Klein, Z. Misulovin, and S. R. Eddy Noncoding RNA genes identified in AT-rich hyperthermophiles PNAS, May 28, 2002; 99(11): 7542 - 7547. [Abstract] [Full Text] [PDF] |
||||







