Nucleic Acids Research, Vol 27, Issue 13 2627-2637, Copyright © 1999 by Oxford University Press
TA Thanaraj
A clean data set of verified splice sites from Homo sapiens are reported as
well as the standards used for the clean-up procedure. The sites were
validated by: (i) standard cleaning procedures such as requiring
consistency in the annotation of the gene structural elements, completeness
of the coding regions and elimination of redundant sequences; (ii)
clustering by decision trees coupled with analysis of ClustalW alignments
of the translated protein sequence with homologous proteins from
SWISS-PROT; (iii) matching against human EST sequences. The sites are
categorised as: (i) donor sites, a set of 619 EST-confirmed donor sites,
for which 138 are either the sites or the regions around the sites involved
in alternative splice events; (ii) acceptor sites, a set of 623
EST-confirmed acceptor sites, for which 144 are either the sites or the
regions around the sites are involved in alternative splice events; (iii)
genuine splice sites, a set of 392 splice sites wherein both the donor and
acceptor sites had EST confirmation and were not involved in any
alternative splicing; (iv) alternative splice sites, a set of 209 splice
sites wherein both the donor and acceptor sites had EST confirmation and
the sites or the regions around them were involved in alternative splicing.
A set of nucleotide regions that can be used to generate a control set of
false splice sites that have a high confidence of being non-functional are
also reported.
ARTICLES
A clean data set of EST-confirmed splice sites from Homo sapiens and standards for clean-up procedures
European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK. thanaraj@ebi.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Bhasi, R. V. Pandey, S. P. Utharasamy, and P. Senapathy EuSplice: a unified resource for the analysis of splice signals and alternative splicing in eukaryotic genes Bioinformatics, July 15, 2007; 23(14): 1815 - 1823. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Pospisil, A. Herrmann, R. H. Bortfeldt, and J. G. Reich EASED: Extended Alternatively Spliced EST Database Nucleic Acids Res., January 1, 2004; 32(90001): D70 - 74. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. P. Lewis, R. E. Green, and S. E. Brenner Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans PNAS, January 7, 2003; 100(1): 189 - 192. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Farrer, A. B. Roller, W. J. Kent, and A. M. Zahler Analysis of the role of Caenorhabditis elegans GC-AG introns in regulated splicing Nucleic Acids Res., August 1, 2002; 30(15): 3360 - 3367. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Clark and T. A. Thanaraj Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human Hum. Mol. Genet., February 1, 2002; 11(4): 451 - 464. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Thanaraj and F. Clark Human GC-AG alternative intron isoforms with weak donor sites show enhanced consensus at acceptor exon positions Nucleic Acids Res., June 15, 2001; 29(12): 2581 - 2593. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lou and R. F. Gagel Alternative Ribonucleic Acid Processing in Endocrine Systems Endocr. Rev., April 1, 2001; 22(2): 205 - 225. [Abstract] [Full Text] |
||||
![]() |
M. Burset, I. A. Seledtsov, and V. V. Solovyev SpliceDB: database of canonical and non-canonical mammalian splice sites Nucleic Acids Res., January 1, 2001; 29(1): 255 - 259. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Muilu, P. Rodriguez-Tomé, and A. Robinson GBuilder---An Application for the Visualization and Integration of EST Cluster Data Genome Res., January 1, 2001; 11(1): 179 - 184. [Abstract] [Full Text] |
||||
![]() |
M. Burset, I. A. Seledtsov, and V. V. Solovyev Analysis of canonical and non-canonical splice sites in mammalian genomes Nucleic Acids Res., November 1, 2000; 28(21): 4364 - 4375. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Thanaraj Positional characterisation of false positives from computational prediction of human splice sites Nucleic Acids Res., February 1, 2000; 28(3): 744 - 754. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Hide, V. N. Babenko, P. A. van Heusden, C. Seoighe, and J. F. Kelso The Contribution of Exon-Skipping Events on Chromosome 22 to Protein Coding Diversity Genome Res., November 1, 2001; 11(11): 1848 - 1853. [Abstract] [Full Text] [PDF] |
||||





