Nucleic Acids Research, 1994, Vol. 22, No. 24 5156-5163
© 1994
Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames
Department of Cell Biology, Baylor College of Medicine One Baylor Plaza, Houston, TX 77030, USA
*To whom correspondence should be addressed
Received November 4, 1994. Accepted November 11, 1994.
A new method that predicts internal exon sequences in human DNA has been developed. The method is based on a splice site prediction algorithm that uses a linear discriminant function to combine information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotides in protein coding and intron regions. The accuracy of our splice site recognition function is 97% for donor splice sites and 96% for acceptor splice sites. For exon prediction, we combine in a discriminant function the characteristics describing the 5'-intron region, donor splice site, coding region, acceptor splice site and 3'-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77%, with a specificity of 79%. The recognition quality computed at the level of individual nucleotides is 89% for exon sequences and 98% for intron sequences. This corresponds to a correlation coefficient for exon prediction of 0.87. The precision of this approach is better than that of other methods, and it has been tested on a larger data set. We have also developed a means for predicting exon-exon junctions in cDNA sequences, which can be useful for selecting optimal PCR primers.
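The candidate-generation step described above (open reading frames flanked by splice-site dinucleotides, each then scored by a linear discriminant function) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the `min_len` cutoff, and the convention that a candidate is preceded by AG (acceptor) and followed by GT (donor) are assumptions made for illustration; the feature values and weights passed to `linear_score` are hypothetical placeholders, since the abstract does not give the trained coefficients.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_frames(segment):
    """Return the frame offsets (0, 1, 2) in which segment has no in-frame stop codon."""
    frames = []
    for offset in range(3):
        codons = (segment[i:i + 3] for i in range(offset, len(segment) - 2, 3))
        if not any(codon in STOP_CODONS for codon in codons):
            frames.append(offset)
    return frames


def candidate_internal_exons(dna, min_len=30):
    """Yield (start, end, frames) for every segment dna[start:end] that is
    immediately preceded by AG, immediately followed by GT, at least
    min_len bases long, and open in at least one reading frame.
    min_len is an illustrative cutoff, not a value from the paper."""
    dna = dna.upper()
    # A candidate exon starts right after an AG and ends right before a GT.
    starts = [i + 2 for i in range(len(dna) - 1) if dna[i:i + 2] == "AG"]
    ends = [i for i in range(len(dna) - 1) if dna[i:i + 2] == "GT"]
    for start in starts:
        for end in ends:
            if end - start < min_len:
                continue
            frames = open_frames(dna[start:end])
            if frames:
                yield start, end, frames


def linear_score(features, weights, bias=0.0):
    """Linear discriminant score: weighted sum of feature values plus a bias.
    A candidate would be accepted when the score exceeds a chosen threshold."""
    return sum(w * x for w, x in zip(weights, features)) + bias
```

In use, each candidate returned by `candidate_internal_exons` would be described by one feature per region named in the abstract (5'-intron region, donor splice site, coding region, acceptor splice site, 3'-intron region), and `linear_score` would combine those features into a single value for classification as exon or pseudoexon. The double loop is quadratic in the number of splice-site dinucleotides, which is acceptable for a sketch but would need pruning for long sequences.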