Published online 24 June 2004
Nucleic Acids Research, Vol. 32 No. 11 © Oxford University Press 2004; all rights reserved
A probabilistic model of 3' end formation in Caenorhabditis elegans
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
* To whom correspondence should be addressed. Tel: +44 1223 834244; Fax: +44 1223 494919; Email: rd{at}sanger.ac.uk
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
Received April 25, 2004; Revised and Accepted May 31, 2004
The 3' ends of mRNAs terminate with a poly(A) tail. This post-transcriptional modification is directed by sequence features present in the 3'-untranslated region (3'-UTR). We have undertaken a computational analysis of 3' end formation in Caenorhabditis elegans. By aligning cDNAs that diverge from genomic sequence at the poly(A) tract, we accurately identified a large set of true cleavage sites. When there are many transcripts aligned to a particular locus, local variation of the cleavage site over a span of a few bases is frequently observed. We find that in addition to the well-known AAUAAA motif there are several regions with distinct nucleotide compositional biases. We propose a generalized hidden Markov model that describes sequence features in C.elegans 3'-UTRs. We find that a computer program employing this model accurately predicts experimentally observed 3' ends even when there are multiple AAUAAA motifs and multiple cleavage sites. We have made available a complete set of polyadenylation site predictions for the C.elegans genome, including a subset of 6570 supported by aligned transcripts.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Broitman-Maduro, M. Owraghi, W. W. K. Hung, S. Kuntz, P. W. Sternberg, and M. F. Maduro The NK-2 class homeodomain factor CEH-51 and the T-box factor TBX-35 have overlapping function in C. elegans mesoderm development Development, August 15, 2009; 136(16): 2735 - 2746. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. R. Stumpf, J. Kimble, and M. Wickens A Caenorhabditis elegans PUF protein family with distinct RNA binding specificity RNA, August 1, 2008; 14(8): 1550 - 1557. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Mangone, P. MacMenamin, C. Zegar, F. Piano, and K. C. Gunsalus UTRome.org: a platform for 3'UTR biology in C. elegans Nucleic Acids Res., January 11, 2008; 36(suppl_1): D57 - D62. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Graber, J. Salisbury, L. N. Hutchins, and T. Blumenthal C. elegans sequences that control trans-splicing and operon pre-mRNA processing RNA, September 1, 2007; 13(9): 1409 - 1426. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Moucadel, F. Lopez, T. Ara, P. Benech, and D. Gautheret Beyond the 3' end: experimental validation of extended transcript isoforms Nucleic Acids Res., March 19, 2007; 35(6): 1947 - 1957. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Hayes, A. R. Frand, and G. Ruvkun The mir-84 and let-7 paralogous microRNA genes of Caenorhabditis elegans direct the cessation of molting via the conserved nuclear hormone receptors NHR-23 and NHR-25 Development, December 1, 2006; 133(23): 4631 - 4641. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hajarnavis and R. Durbin A conserved sequence motif in 3' untranslated regions of ribosomal protein mRNAs in nematodes RNA, October 1, 2006; 12(10): 1786 - 1789. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Cheng, R. M. Miura, and B. Tian Prediction of mRNA polyadenylation sites by support vector machine Bioinformatics, October 1, 2006; 22(19): 2320 - 2325. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Schwarz, I. Antoshechkin, C. Bastiani, T. Bieri, D. Blasiar, P. Canaran, J. Chan, N. Chen, W. J. Chen, P. Davis, et al. WormBase: better software, richer content Nucleic Acids Res., January 1, 2006; 34(suppl_1): D475 - D478. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Loke, E. A. Stahlberg, D. G. Strenski, B. J. Haas, P. C. Wood, and Q. Q. Li Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures Plant Physiology, July 1, 2005; 138(3): 1457 - 1468. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. H. Brown, S. S. Gross, and M. R. Brent Begin at the beginning: Predicting genes with 5' UTRs Genome Res., May 1, 2005; 15(5): 742 - 747. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Qiu, C. M. Adema, and T. Lane A computational study of off-target effects of RNA interference Nucleic Acids Res., March 30, 2005; 33(6): 1834 - 1847. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhang, J. Hu, M. Recce, and B. Tian PolyA_DB: a database for mammalian mRNA polyadenylation Nucleic Acids Res., January 1, 2005; 33(suppl_1): D116 - D120. [Abstract] [Full Text] [PDF] |
||||





