Skip Navigation

This Article
Right arrow Print PDF (1946K)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (216)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Solovyev, V. V.
Right arrow Articles by Lawrence, C. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Solovyev, V. V.
Right arrow Articles by Lawrence, C. B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 1994, Vol. 22, No. 24 5156-5163
© 1994


Articles

Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

Victor V. Solovyev*, Asaf A. Salamov and Charles B. Lawrence

Department of Cell Biology, Baylor College of Medicine One Baylor Plaza, Houston, TX 77030, USA

*To whom correspondence should be addressed

Received November 4, 1994. Accepted November 11, 1994.

A new method which predicts internal exon sequences In human DNA has been developed. The method Is based on a splice site prediction algorithm that uses the linear discriminant function to combine Information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotides In protein coding and intron regions. The accuracy of our splice site recognition function Is 97% for donor splice sites and 96% for acceptor splice sites. For exon prediction, we combine in a discriminant function the characteristics describing the 5numero -lntron region, donor splice site, coding region, acceptor splice site and 3'-lntron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise Internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences Is 77% with a specificity of 79%. The recognition quality computed at the level of Individual nucleotides Is 89% for exon sequences and 98% for intron sequences. This corresponds to a correlation coefficient for exon prediction of 0.87. The precision of this approach is better than other methods and has been tested on a larger data set. We have also developed a means for predicting exon - exon Junctions In cDNA sequences, which can be useful for selecting optimal PCR primers.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
Y. Wang, A. Diehl, F. Wu, J. Vrebalov, J. Giovannoni, A. Siepel, and S. D. Tanksley
Sequencing and Comparative Analysis of a Conserved Syntenic Segment in the Solanaceae
Genetics, September 1, 2008; 180(1): 391 - 408.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Agrawal and G. D. Stormo
Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans
Bioinformatics, May 15, 2006; 22(10): 1239 - 1244.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. R. Brent
Genome annotation past, present, and future: How to define an ORF at each locus
Genome Res., December 1, 2005; 15(12): 1777 - 1786.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Bekaert, H. Richard, B. Prum, and J.-P. Rousset
Identification of programmed translational -1 frameshifting sites in the genome of Saccharomyces cerevisiae
Genome Res., October 1, 2005; 15(10): 1411 - 1420.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Matsuyama, T. Shiraishi, F. Trapasso, T. Kuroki, H. Alder, M. Mori, K. Huebner, and C. M. Croce
Fragile site orthologs FHIT/FRA3B and Fhit/Fra14A2: Evolutionarily conserved but highly recombinogenic
PNAS, December 9, 2003; 100(25): 14988 - 14993.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Freund, C. Asang, S. Kammler, C. Konermann, J. Krummheuer, M. Hipp, I. Meyer, W. Gierling, S. Theiss, T. Preuss, et al.
A novel approach to describe a U1 snRNA binding site
Nucleic Acids Res., December 1, 2003; 31(23): 6963 - 6975.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Zhang and L. Luo
Splice site prediction with quadratic discriminant analysis using diversity measure
Nucleic Acids Res., November 1, 2003; 31(21): 6214 - 6220.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. A. Cragg, G. R. Christie, S. R. Phillips, R. M. Russi, S. Kury, J. C. Mathers, P. M. Taylor, and D. Ford
A Novel Zinc-regulated Human Zinc Transporter, hZTL1, Is Localized to the Enterocyte Apical Membrane
J. Biol. Chem., June 14, 2002; 277(25): 22789 - 22797.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
P. B. Talbert, R. Masuelli, A. P. Tyagi, L. Comai, and S. Henikoff
Centromeric Localization and Adaptive Evolution of an Arabidopsis Histone H3 Variant
PLANT CELL, May 1, 2002; 14(5): 1053 - 1066.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. L. Himmel, F. Bi, H. Shen, N. A. Jenkins, N. G. Copeland, Y. Zheng, and D. A. Largaespada
Activation of Clg, a Novel Dbl Family Guanine Nucleotide Exchange Factor Gene, by Proviral Insertion at Evi24, a Common Integration Site in B Cell and Myeloid Leukemias
J. Biol. Chem., April 12, 2002; 277(16): 13463 - 13472.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
J. M. Gonzalez, Z. Penzes, F. Almazan, E. Calvo, and L. Enjuanes
Stabilization of a Full-Length Infectious cDNA Clone of Transmissible Gastroenteritis Coronavirus by Insertion of an Intron
J. Virol., March 27, 2002; 76(9): 4655 - 4661.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
F. Bullrich, H. Fujii, G. Calin, H. Mabuchi, M. Negrini, Y. Pekarsky, L. Rassenti, H. Alder, J. C. Reed, M. J. Keating, et al.
Characterization of the 13q14 Tumor Suppressor Locus in CLL: Identification of ALT1, an Alternative Splice Variant of the LEU2 Gene
Cancer Res., September 1, 2001; 61(18): 6640 - 6648.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Brudno, M. S. Gelfand, S. Spengler, M. Zorn, I. Dubchak, and J. G. Conboy
Computational analysis of candidate intron regulatory elements for tissue-specific alternative pre-mRNA splicing
Nucleic Acids Res., June 1, 2001; 29(11): 2338 - 2348.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Tsuchimoto, Y. Sakai, K. Sakumi, K. Nishioka, M. Sasaki, T. Fujiwara, and Y. Nakabeppu
Human APE2 protein is mostly localized in the nuclei and to some extent in the mitochondria, while nuclear APE2 is partly associated with proliferating cell nuclear antigen
Nucleic Acids Res., June 1, 2001; 29(11): 2349 - 2360.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. Shiraishi, T. Druck, K. Mimori, J. Flomenberg, L. Berk, H. Alder, W. Miller, K. Huebner, and C. M. Croce
Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit
PNAS, April 18, 2001; (2001) 91095898.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
M. Pertea, X. Lin, and S. L. Salzberg
GeneSplicer: a new computational method for splice site prediction
Nucleic Acids Res., March 1, 2001; 29(5): 1185 - 1190.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
A. J. McCullough and S. M. Berget
An Intronic Splicing Enhancer Binds U1 snRNPs To Enhance Splicing and Select 5' Splice Sites
Mol. Cell. Biol., December 15, 2000; 20(24): 9225 - 9235.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
J. E. Kim, K.-H. Kim, S. W. Lee, W. Seol, K. Shiba, and S. Kim
An elongation factor-associating domain is inserted into human cysteinyl-tRNA synthetase by alternative splicing
Nucleic Acids Res., August 1, 2000; 28(15): 2866 - 2872.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. G. Endrizzi, V. Hadinoto, J. D. Growney, W. Miller, and W. F. Dietrich
Genomic Sequence Analysis of the Mouse Naip Gene Array
Genome Res., August 1, 2000; 10(8): 1095 - 1102.
[Abstract] [Full Text]


Home page
Genome ResHome page
A.-M. Mallon, M. Platzer, R. Bate, G. Gloeckner, M.R.M. Botcherby, G. Nordsiek, M.A. Strivens, P. Kioschis, A. Dangel, D. Cunningham, et al.
Comparative Genome Sequence Analysis of the Bpa/Str Region in Mouse and Man
Genome Res., June 1, 2000; 10(6): 758 - 775.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
F. Almazan, J. M. Gonzalez, Z. Penzes, A. Izeta, E. Calvo, J. Plana-Duran, and L. Enjuanes
From the Cover: Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome
PNAS, May 9, 2000; 97(10): 5516 - 5521.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. A. Salamov and V. V. Solovyev
Ab initio Gene Finding in Drosophila Genomic DNA
Genome Res., April 1, 2000; 10(4): 516 - 522.
[Abstract] [Full Text]


Home page
J. Virol.Home page
Y. Shishido-Hara, Y. Hara, T. Larson, K. Yasui, K. Nagashima, and G. L. Stoner
Analysis of Capsid Formation of Human Polyomavirus JC (Tokyo-1 Strain) by a Eukaryotic Expression System: Splicing of Late RNAs, Translation and Nuclear Transport of Major Capsid Protein VP1, and Capsid Assembly
J. Virol., February 15, 2000; 74(4): 1840 - 1853.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
T. A. Thanaraj
Positional characterisation of false positives from computational prediction of human splice sites
Nucleic Acids Res., February 1, 2000; 28(3): 744 - 754.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Upadhya, E. H. Birkenmeier, C. S. Birkenmeier, and J. E. Barker
Mutations in a NIMA-related kinase gene, Nek1, cause pleiotropic effects including a progressive polycystic kidney disease in mice
PNAS, January 4, 2000; 97(1): 217 - 221.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
B. C. Schutte, B. C. Bjork, K. B. Coppage, M. I. Malik, S. G. Gregory, D. J. Scott, L. M. Brentzell, Y. Watanabe, M. J. Dixon, and J. C. Murray
A Preliminary Gene Map for the Van der Woude Syndrome Critical Region Derived from 900 kb of Genomic Sequence at 1q32-q41
Genome Res., January 1, 2000; 10(1): 81 - 94.
[Abstract] [Full Text]


Home page
Genome ResHome page
N. J. Bowen and J. F. McDonald
Genomic Analysis of Caenorhabditis elegans Reveals Ancient Families of Retroviral-like Elements
Genome Res., October 1, 1999; 9(10): 924 - 935.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. Mimori, T. Druck, H. Inoue, H. Alder, L. Berk, M. Mori, K. Huebner, and C. M. Croce
Cancer-specific chromosome alterations in the constitutive fragile region FRA3B
PNAS, June 22, 1999; 96(13): 7456 - 7461.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. Grossberger, C. Gieffers, W. Zachariae, A. V. Podtelejnikov, A. Schleiffer, K. Nasmyth, M. Mann, and J.-M. Peters
Characterization of the DOC1/APC10 Subunit of the Yeast and the Human Anaphase-promoting Complex
J. Biol. Chem., May 14, 1999; 274(20): 14500 - 14507.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. Normark
The Acid-Inducible asr Gene in Escherichia coli: Transcriptional Control by the phoBR Operon
J. Bacteriol., April 1, 1999; 181(7): 2084 - 2093.
[Abstract] [Full Text]


Home page
Genome ResHome page
M. Centola, X. Chen, R. Sood, Z. Deng, I. Aksentijevich, T. Blake, D. O. Ricke, X. Chen, G. Wood, N. Zaks, et al.
Construction of an ~700-kb Transcript Map Around the Familial Mediterranean Fever Locus on Human Chromosome 16p13.3
Genome Res., November 1, 1998; 8(11): 1172 - 1191.
[Abstract] [Full Text]


Home page
Am. J. Physiol. Cell Physiol.Home page
G. Roman, V. Meller, K. H. Wu, and R. L. Davis
The opt1 gene of Drosophila melanogaster encodes a proton-dependent dipeptide transporter
Am J Physiol Cell Physiol, September 1, 1998; 275(3): C857 - C869.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J.-y. Chen, Q. Wang, Z. Fu, S. Zhou, and W. A. Fonzi
Tca1, the Retrotransposon-Like Element of Candida albicans, Is a Degenerate and Inactive Element
J. Bacteriol., July 15, 1998; 180(14): 3657 - 3662.
[Abstract] [Full Text]


Home page
Genome ResHome page
M. S. Halleck, D. Pradhan, C. Blackman, C. Berkes, P. Williamson, and R. A. Schlegel
Multiple Members of a Third Subfamily of P-Type ATPases Identified by Genomic Sequences and ESTs
Genome Res., April 1, 1998; 8(4): 354 - 361.
[Abstract] [Full Text]


Home page
Genome ResHome page
J. Jiang and H. J. Jacob
EbEST: An Automated Tool Using Expressed Sequence Tags to Delineate Gene Structure
Genome Res., March 1, 1998; 8(3): 268 - 275.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Inoue, H. Ishii, H. Alder, E. Snyder, T. Druck, K. Huebner, and C. M. Croce
Sequence of the FRA3B common fragile region: Implications for the mechanism of FHIT deletion
PNAS, December 23, 1997; 94(26): 14584 - 14589.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
F. Francis, T. M. Strom, S. Hennig, A. Boddrich, B. Lorenz, O. Brandau, K. L. Mohnike, M. Cagnoli, C. Steffens, S. Klages, et al.
Genomic Organization of the Human PEX Gene Mutated in X-Linked Dominant Hypophosphatemic Rickets
Genome Res., June 1, 1997; 7(6): 573 - 585.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M A Ansari-Lari, Y Shen, D M Muzny, W Lee, and R A Gibbs
Large-scale sequencing in human chromosome 12p13: experimental and computational gene structure determination.
Genome Res., March 1, 1997; 7(3): 268 - 280.
[Abstract] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Q. Zhang
Identification of protein coding regions in the human genome by quadratic discriminant analysis
PNAS, January 21, 1997; 94(2): 565 - 568.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
L. E. Jensen and A. S. Whitehead
IRAK1b, a Novel Alternative Splice Variant of Interleukin-1 Receptor-associated Kinase (IRAK), Mediates Interleukin-1 Signaling and Has Prolonged Stability
J. Biol. Chem., July 27, 2001; 276(31): 29037 - 29044.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. Shiraishi, T. Druck, K. Mimori, J. Flomenberg, L. Berk, H. Alder, W. Miller, K. Huebner, and C. M. Croce
Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit
PNAS, May 8, 2001; 98(10): 5722 - 5727.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. M. Thomson, J. J. Lozano, N. Loukili, R. Carrió, F. Serras, B. Cormand, M. Valeri, V. M. Díaz, J. Abril, M. Burset, et al.
Fusion of the Human Gene for the Polyubiquitination Coeffector UEV1 with Kua, a Newly Identified Gene
Genome Res., November 1, 2000; 10(11): 1743 - 1756.
[Abstract] [Full Text]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.