Nucleic Acids Research, Vol 27, Issue 17 3503-3509, Copyright © 1999 by Oxford University Press
P Mackiewicz, M Kowalczuk, A Gierlik, MR Dudek and S Cebrat
In a recent paper we estimated the total number of protein-coding open
reading frames (ORFs) in the Saccharomyces cerevisiae genome, based on
their properties, at about 4800. This number is much smaller than the
widely accepted figure of 5800-6000. In this paper we analyse differences
between the set of ORFs with known phenotypes annotated in the Munich
Information Centre for Protein Sequences (MIPS) database and ORFs for which
the coding probability, as we calculated it, is very low. We have found that
many of the latter ORFs have properties of antisense sequences of coding
ORFs, which suggests that they could have been generated by duplication of
coding sequences. Since coding sequences generate ORFs inside themselves,
with especially high frequency in the antisense sequences, we have looked
for homology between known proteins and the hypothetical polypeptides encoded
by the ORFs under consideration in all six phases. For many ORFs we have
found paralogues and orthologues in phases different from the phase
assumed to be coding in the MIPS database.
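
The idea of examining a sequence "in all six phases" can be illustrated with a minimal Python sketch. It is not the authors' method: the function names, the ATG-to-stop definition of an ORF, the phase labels (+1 to +3 for the given strand, -1 to -3 for its reverse complement) and the default threshold of 100 codons are all assumptions made for illustration, and the homology search itself is not shown.

```python
# Illustrative sketch only: enumerate ATG...stop ORFs in all six phases.
# All names and the min_codons threshold are assumptions, not from the paper.

STOP_CODONS = {"TAA", "TAG", "TGA"}          # standard genetic code stops
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the antisense strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def orfs_in_frame(seq, offset, min_codons):
    """Yield (start, end) for each ATG...stop ORF in one phase of seq.

    `end` is exclusive and includes the stop codon; the length check
    counts sense codons only (the stop codon is not counted).
    """
    start = None
    for i in range(offset, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if start is None and codon == "ATG":
            start = i                        # first ATG opens the ORF
        elif start is not None and codon in STOP_CODONS:
            if (i - start) // 3 >= min_codons:
                yield start, i + 3
            start = None                     # look for the next ORF


def six_phase_orfs(seq, min_codons=100):
    """Return (phase, start, end, orf_sequence) for all six phases.

    Coordinates for the '-' phases refer to the reverse complement,
    not to the original strand.
    """
    seq = seq.upper()
    found = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            for start, end in orfs_in_frame(s, offset, min_codons):
                found.append((f"{strand}{offset + 1}", start, end, s[start:end]))
    return found
```

For example, `six_phase_orfs("TTATTTCAT", min_codons=2)` reports a single ORF in phase `-1`, because the only ATG-to-stop stretch lies on the antisense strand of that toy sequence.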
Origin and properties of non-coding ORFs in the yeast genome
Institute of Microbiology, Wroclaw University, ul. Przybyszewskiego 63/77, 54-148 Wroclaw, Poland.