Nucleic Acids Research, 2003, Vol. 31, No. 15 4639-4645
© 2003 Oxford University Press
Computational identification of protein coding potential of conserved sequence tags through cross-species evolutionary analysis
Dipartimento di Fisiologia e Biochimica Generali, Università di Milano, Via Celoria 26, 20133 Milano, Italy and 1 Sezione di Bioinformatica e Genomica, ITB-CNR, Via Amendola 168/5, 70125 Bari, Italy
*To whom correspondence should be addressed. Tel: +39 02 5031 4915; Fax: +39 02 5031 4912; Email: graziano.pesole{at}unimi.it
The identification of conserved sequence tags (CSTs) through comparative genome analysis may reveal important regulatory elements involved in shaping the spatio-temporal expression of genetic information. It is well known that the most significant fraction of CSTs observed in humanmouse comparisons correspond to protein coding exons, due to their strong evolutionary constraints. As we still do not know the complete gene inventory of the human and mouse genomes it is of the utmost importance to establish if detected conserved sequences are genes or not. We propose here a simple algorithm that, based on the observation of the specific evolutionary dynamics of coding sequences, efficiently discriminates between coding and non-coding CSTs. The application of this method may help the validation of predicted genes, the prediction of alternative splicing patterns in known and unknown genes and the definition of a dictionary of non-coding regulatory elements.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Solda, I. V. Makunin, O. U. Sezerman, A. Corradin, G. Corti, and A. Guffanti An Ariadne's thread to the identification and annotation of noncoding RNAs in eukaryotes Brief Bioinform, September 1, 2009; 10(5): 475 - 489. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Castrignano, P. D. De Meo, G. Grillo, S. Liuni, F. Mignone, I. G. Talamo, and G. Pesole GenoMiner: a tool for genome-wide search of coding and non-coding conserved sequence tags Bioinformatics, February 15, 2006; 22(4): 497 - 499. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Boccia, M. Petrillo, D. di Bernardo, A. Guffanti, F. Mignone, S. Confalonieri, L. Luzi, G. Pesole, G. Paolella, A. Ballabio, et al. DG-CST (Disease Gene Conserved Sequence Tags), a database of human-mouse conserved elements associated to disease genes Nucleic Acids Res., January 1, 2005; 33(suppl_1): D505 - D510. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Castrignano, A. Canali, G. Grillo, S. Liuni, F. Mignone, and G. Pesole CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison Nucleic Acids Res., July 1, 2004; 32(suppl_2): W624 - W627. [Abstract] [Full Text] [PDF] |
||||


