Nucleic Acids Research, 2001, Vol. 29, No. 1 255-259
© 2001 Oxford University Press
SpliceDB: database of canonical and non-canonical mammalian splice sites
The Sanger Centre, Hinxton, Cambridge CB10 1SA, UK and 1Softberry Inc., 108 Corporate Park Drive, Suite 120, White Plains, NY 10604, USA
Received September 7, 2000; Revised and Accepted October 31, 2000.
| ABSTRACT |
|---|
|
|
|---|
A database (SpliceDB) of known mammalian splice site sequences has been developed. We extracted 43 337 splice pairs from mammalian divisions of the gene-centered Infogene database, including sites from incomplete or alternatively spliced genes. Known EST sequences supported 22 815 of them. After discarding sequences with putative errors and ambiguous location of splice junctions the verified dataset includes 22 489 entries. Of these, 98.71% contain canonical GTAG junctions (22 199 entries) and 0.56% have non-canonical GCAG splice site pairs. The remainder (0.73%) occurs in a lot of small groups (with a maximum size of 0.05%). We especially studied non-canonical splice sites, which comprise 3.73% of GenBank annotated splice pairs. EST alignments allowed us to verify only the exonic part of splice sites. To check the conservative dinucleotides we compared sequences of human non-canonical splice sites with sequences from the high throughput genome sequencing project (HTG). Out of 171 human non-canonical and EST-supported splice pairs, 156 (91.23%) had a clear match in the human HTG. They can be classified after sequence analysis as: 79 GCAG pairs (of which one was an error that corrected to GCAG), 61 errors corrected to GTAG canonical pairs, six ATAC pairs (of which two were errors corrected to ATAC), one case was produced from a non-existent intron, seven cases were found in HTG that were deposited to GenBank and finally there were only two other cases left of supported non-canonical splice pairs. The information about verified splice site sequences for canonical and non-canonical sites is presented in SpliceDB with the supporting evidence. We also built weight matrices for the major splice groups, which can be incorporated into gene prediction programs. SpliceDB is available at the computational genomic Web server of the Sanger Centre: http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.
| INTRODUCTION |
|---|
|
|
|---|
The database has been generated as a result of our interest in characterization of observed types of splice sites. New sequences coming every day from genome sequencing projects are mostly annotated by computationally generated information. There is no straightforward procedure to retrieve experimentally supported splice site sequences to study their properties. Currently our knowledge about how the cell is specifying splice sites is not sufficient for accurate and comprehensive computational identification of splice junctions in genomic sequences. Characterization of all known splice sites can help us to increase the quality of gene structure prediction programs. Moreover, many annotated non-canonical splice site sequences may appear in databases as a result of sequencing or annotation errors (1,2). These errors should be found and corrected or discarded in the investigation of splice site characteristics.
EST sequences as an independent source of information were used to verify the annotated splice pairs. This approach has been suggested and exploited by Thanaraj (3), who selected genes without alternative splicing and generated a complex splice site classification system depending on the found EST matches. We extended this approach in using high throughput genome sequencing project (HTG) genomic sequences to verify splice site exonic and intronic composition and applied it to analysis of mammalian genes (4).
Our analysis comprises constitutively as well as alternatively spliced genes. Therefore all kinds of spliced introns of the same gene are included in the database. This is the first public database describing alternative introns supported by ESTs and non-canonical splice junctions.
| EST BASED CLASSIFICATION |
|---|
|
|
|---|
Every EST similar to any splice construct can be classified depending on quality and type of observed alignment with the annotated gene sequence as (for more details see figure 1b in 4): D-end: EST only covers the left exon; A-end: EST only covers the right exon; B-ends: EST overlaps with a splice junction covering left and right exons with no more than one substitution; Error: EST overlaps with a splice junction covering left and right exons with several mismatches or/and gaps.
When all EST alignments for the same spliced construct have been obtained, every splice site can be classified using the following rules:
(i) if there is some B-end EST, then classify as Supported junction (B20) splice pair, otherwise;
(ii) if there is some Error EST, then classify as Error in junction (Err) splice pair, otherwise;
(iii) if there is some D-end AND some A-end EST, then classify as Unsupported junction but supported exons (5+3) splice pair, otherwise;
(iv) if there is some D-end EST, then classify as Only supported 5' exon (5pr) splice pair, otherwise;
(v) if there is some A-end EST, then classify as Only supported 3' exon (3pr) splice pair, otherwise;
(vi) classify as Completely unsupported (Uns) splice pair.
Finally, all splice pairs classified as supported junction but with low conservation (identity <95%) within 20 bp. at every side of splice junction were reclassified as Error in junction.
| DATABASE STATUS |
|---|
|
|
|---|
This first version of SpliceDB was built using mammalian divisions of the InfoGene database (5), which united information from many GenBank (Release 112) (6) entries describing a particular gene. We obtained 43 337 splice site pairs, of which 22 815 were supported by ESTs. Applying corrections explained in Burset et al. (4) this number was reduced to 22 489 supported and corrected entries. Subdivisions of data in SpliceDB and their content are listed in Table 1.
|
More than half (65.69%) of database entries come from human sequences, so we decided to keep separate sets of human splice sites. It may be interesting for scientists working with humans to go directly to these sequences as well as we were able to compare human non-canonical splice sites with HTGs. So, originally we obtained 28 468 human splice site pairs, of which 15 645 (68.57%) were supported by ESTs. After correction procedures 15 434 (68.63% of human entries) of verified splice pairs were presented in the corresponding subdivision.
All subdivisions are subsets of all mammalian annotated splice pair sets and the users can retrieve sequences of any combination of interesting groups.
Mammals or human files are divided at every filter stage into canonical and non-canonical introns. We create three filter stages for every group. The first group is formed by all splice site pairs using original GenBank annotations; the second group comprises pairs supported by ESTs and the third includes pairs supported by ESTs and is automatically corrected, meaning that all ambiguous junctions have been discarded, see Burset et al. (4) for details (Table 1).
In human non-canonical subdivision there is a special file with a subset from non-canonical, EST supported and corrected splice pairs, which are supported by HTG. Analysis of alignment information and possible corrections using HTG have been done manually, studying case-by-case sequence alignments.
Several examples of EST-supported entries are presented in Figure 1a. As was indicated in Burset et al. (4), we often observe EST-supported sequences with putative errors or, at least, with ambiguity in splice junction position. The same EST may support two splice junctions or several ESTs may support different junctions, as can be seen in Figure 1b.
|
Using HTG sequences allows us to identify a large variety of sequencing and annotation errors. Some entries have only small sequence errors, such as the first example in Figure 1c, which only have a deletion in donor and a substitution in acceptor sites. We recovered here a canonical GTAG site shifted three positions downstream. Several other cases present completely unsupported introns (Fig. 1c). One very interesting example of errors is annotation of pseudogenes. The functional copy sometimes can be identified in HTG sequences by comparing them with ESTs (assuming that only the functional gene copy will generate EST sequences). In an example of such situation we found a substitution A to G upstream donor site, which helped differentiate the gene functional copy with canonical splice site. The last two examples in Figure 1c cannot be considered as sequence errors. They are examples of wrongly annotated non-canonical sites, which we corrected to typically observed non-canonical pairs.
| DATABASE FORMAT |
|---|
|
|
|---|
All entries in the database are presented in a tabular format, so every line in any file describes a completely specified splice site pair. We use two kinds of field separators: the different major parts in every entry are separated by the double symbol @@, and inside the major part the field separator is a typical blank space or tabulator. It allows us to write large sentences inside every major part maintaining clear separation between them. The typical structure of an entry in SpliceDB is:
ID @@ ACCES @@ INTRON @@ DON @@ ACC @@ SEQ_DON @@ SEQ_ACC @@ EST @@ EST_ACCES @@ CORR
ID (database identifier)
This field has always only one word, that is a unique and specific identifier provided to every pair. ID is formed by Infogene (5) entry name, assigned intron number and donor and acceptor positions in the original sequence. All these data are joined using a ## symbol (i.e. HG_0000731##114##122615##122965).
ACCES (accession number)
This field has always only one word, which refers to one of the original GenBank accession numbers (i.e. AB011399).
INTRON (assigned intron number)
This field has always only one word, that is the intron number assigned to every intron pair in the Infogene database (i.e. 114).
DON (donor number)
This field has always only one word, that is the donor position in the original Infogene entry (i.e. 122615).
ACC (acceptor number)
This field has always only one word, that is the acceptor position in original Infogene entry (i.e. 122965).
SEQ_DON (nucleotide sequence around donor)
The field has always only one word, that is the nucleotide sequence centered in donor conserved dinucleotides, with 40 bp in every side, forming a total sequence of 82 bp (i.e. aacatctgtctctactggaaacctctgcactgaggagcagattgattgataagcaaaaggcttctactgcatttccatcctt).
SEQ_ACC (nucleotide sequence around acceptor)
The field has always only one word, that is the nucleotide sequence centered in acceptor conserved dinucleotides, with 40 bp in every side, forming a total sequence of 82 bp (i.e. aaaaagctcactttttttgttcttcacattttacaggagcagacgcctccgcctagacctgaagcctaccccatccccactc).
EST (EST classification)
This field has always only one word, that is the obtained EST classification [see materials and methods in Burset et al. (4) for details] (i.e. B20).
EST_ACCES (EST accession number supporting classification)
This field has always only one word, that is the accession number of the EST used to support our classification (i.e. gb|N35650|N35650).
CORR (possible corrections)
This field is optional and is specified in free text. All possible corrections are annotated in this field, based on ESTs or in HTGs:
Automatic EST correction in positions [pos1 pos2] using [ESTaccession]. We annotate which positions present ambiguities in addition to the annotated and supported junctions (pos1 and pos2), and the EST accession number that supports alternative junction (ESTaccession).
HTGs [free text]. We provide information about HTGs corresponding to splice junction sequences [for more details see results in Burset et al. (4)].
| CONSENSUS AND WEIGHT MATRICES |
|---|
|
|
|---|
After analysis of the information presented, we conclude that practically all splice site pairs are limited to three types: GTAG, GCAG and ATAC, and the other kind of introns (if they exist) have a very small frequency (
0.02% or less). Alignment of conserved dinucleotides in every type of splice site allows us to observe a certain degree of conservation in surrounding nucleotides, which in practice means a deviation in observed frequencies with respect to expected random distribution. Often this information has been presented in the form of consensus sequences (for every column in aligned sequences we write the most representative nucleotide, or group of nucleotides, indicating this frequency or percent) and as frequency matrices (for every column in aligned sequences we represent the frequency or percentage for every nucleotide, creating a matrix of four rows and as many columns as significant positions). Frequency matrices are more informative and used in gene prediction programs, but a relatively high number of aligned sequences is needed to obtain discriminative matrices.
We present frequence matrices built on verified datasets for GTAG and GCAG pair sequences. Because we have a small number of ATAC cases only consensus sequences are provided for these splice sites (Fig. 2).
|
SpliceDB is available on the Sanger Centre computational genomic Web server at http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.
| DISCUSSION |
|---|
|
|
|---|
We have applied ESTs and HTG sequences to verify mammalian splice junctions, but there are other model organisms with a lot of genomic data which are interesting to analyze, such as Drosophila melanogaster, Caenorhabditis elegans or Arabidopsis thaliana. We plan to extend our database to nearly all model eukaryotic organisms. Another problem is to install HTG analysis as automatically as possible, because manual intervention is very time consuming (if more accurate).
Observation of practically only three types of splice sites simplifies the problem of their computational identification by gene prediction programs. Consideration of a GCAG splice pair in the Fgenesh program has been done by Salamov and Solovyev (7), and maintained the accuracy level despite including many more potential splice variants. Addition of ATAC splice sites (occurring with very low frequency) will probably wait until we accumulate more examples, allowing us to better describe the site characteristics.
Another point to consider is whether to include information about gene structures supported by ESTs. It might be useful for training gene prediction software because it is very sensitive to sequence errors, especially in conserved positions of splice site sequences. Including information about alternative intron positions is very important for developing gene prediction programs that will generate alternative splicing variants.
| FOOTNOTES |
|---|
* To whom correspondence should be addressed at present address: EOS Biotechnology, 225A Gateway Boulevard, South San Francisco, CA 94080, USA. Tel: +1 650 246 2331; Fax: +1 650 583 3881; Email: solovyev{at}eosbiotech.com Present address: M. Burset, Institut Municipal dInvestigació Mèdica (IMIM), C/Dr Aiguader 80, 08003 Barcelona, Spain
| REFERENCES |
|---|
|
|
|---|
-
1 Penotti,F.E. (1991) Human Pre-mRNA splicing signals. J. Theor. Biol., 150, 385420.[ISI][Medline]
2 Jackson,I.J. (1991) A reappraisal of non-consensus mRNA splice sites. Nucleic Acids Res., 19, 37953798.
3 Thanaraj,T.A. (1999) A clean data set of EST-confirmed splice sites from Homo sapiens and canonicals for clean-up procedures. Nucleic Acids Res., 27, 26272637.
4 Burset,M., Seledtsov,I.A. and Solovyev,V.V. (2000) Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res., 28, 43644375.
5 Solovyev,V.V. and Salamov,A.A. (1999) INFOGENE: a database of known gene structures and predicted genes and proteins in sequences of genome sequencing projects. Nucleic Acids Res., 27, 248250.
6 Benson,D.A., Boguski,M.S., Lipman,D.J., Ostell,J., Ouellette,B.F., Rapp,B.A. and Wheeler,D.L. (1999) GenBank. Nucleic Acids Res., 27, 1217.
7 Salamov,A. and Solovyev,V. (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Res., 10, 516522.
This article has been cited by other articles:
![]() |
M.-R. Ho, W.-J. Jang, C.-h. Chen, L.-Y. Ch'ang, and W.-c. Lin Designating eukaryotic orthology via processed transcription units Nucleic Acids Res., June 1, 2008; 36(10): 3436 - 3442. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Buratti, M. Chivers, J. Kralovicova, M. Romano, M. Baralle, A. R. Krainer, and I. Vorechovsky Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization Nucleic Acids Res., July 26, 2007; 35(13): 4250 - 4263. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Buratti, A. Dhir, M. A. Lewandowska, and F. E. Baralle RNA structure is a key regulatory element in pathological ATM and CFTR pseudoexon inclusion events Nucleic Acids Res., July 26, 2007; 35(13): 4369 - 4383. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Bhasi, R. V. Pandey, S. P. Utharasamy, and P. Senapathy EuSplice: a unified resource for the analysis of splice signals and alternative splicing in eukaryotic genes Bioinformatics, July 15, 2007; 23(14): 1815 - 1823. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ali, P. T. Christie, I. V. Grigorieva, B. Harding, H. Van Esch, S. F. Ahmed, M. Bitner-Glindzicz, E. Blind, C. Bloch, P. Christin, et al. Functional characterization of GATA3 mutations causing the hypoparathyroidism-deafness-renal (HDR) dysplasia syndrome: insight into mechanisms of DNA binding by the GATA3 transcription factor Hum. Mol. Genet., February 1, 2007; 16(3): 265 - 275. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kyriakopoulou, P. Larsson, L. Liu, J. Schuster, F. Soderbom, L. A. Kirsebom, and A. Virtanen U1-like snRNAs lacking complementarity to canonical 5' splice sites RNA, September 1, 2006; 12(9): 1603 - 1611. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Sheth, X. Roca, M. L. Hastings, T. Roeder, A. R. Krainer, and R. Sachidanandam Comprehensive splice-site analysis using comparative genomics Nucleic Acids Res., September 1, 2006; 34(14): 3955 - 3967. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Vigetti, M. Ori, M. Viola, A. Genasetti, E. Karousou, M. Rizzi, F. Pallotti, I. Nardi, V. C. Hascall, G. De Luca, et al. Molecular Cloning and Characterization of UDP-glucose Dehydrogenase from the Amphibian Xenopus laevis and Its Involvement in Hyaluronan Synthesis J. Biol. Chem., March 24, 2006; 281(12): 8254 - 8263. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Weintrob, J. Drouin, S. Vallette-Kasic, E. Taub, D. Marom, Y. Lebenthal, G. Klinger, E. Bron-Harlev, and M. Shohat Low Estriol Levels in the Maternal Triple-Marker Screen as a Predictor of Isolated Adrenocorticotropic Hormone Deficiency Caused by a New Mutation in the TPIT Gene Pediatrics, February 1, 2006; 117(2): e322 - e327. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Gaffoor, D. W. Brown, R. Plattner, R. H. Proctor, W. Qi, and F. Trail Functional Analysis of the Polyketide Synthase Genes in the Filamentous Fungus Gibberella zeae (Anamorph Fusarium graminearum) Eukaryot. Cell, November 1, 2005; 4(11): 1926 - 1933. [Abstract] [Full Text] [PDF] |
||||
![]() |
K J Bradley, B M Cavaco, M R Bowl, B Harding, A Young, and R V Thakker Utilisation of a cryptic non-canonical donor splice site of the gene encoding PARAFIBROMIN is associated with familial isolated primary hyperparathyroidism J. Med. Genet., August 1, 2005; 42(8): e51 - e51. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. N. Pouchkina-Stantcheva and A. Tunnacliffe Spliced Leader RNA-Mediated trans-Splicing in Phylum Rotifera Mol. Biol. Evol., June 1, 2005; 22(6): 1482 - 1489. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. D. Wu and C. K. Watanabe GMAP: a genomic mapping and alignment program for mRNA and EST sequences Bioinformatics, May 1, 2005; 21(9): 1859 - 1875. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Tian, J. Hu, H. Zhang, and C. S. Lutz A large-scale analysis of mRNA polyadenylation of human and mouse genes Nucleic Acids Res., January 12, 2005; 33(1): 201 - 212. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Abril, R. Castelo, and R. Guigo Comparison of splice sites in mammals and chicken Genome Res., January 1, 2005; 15(1): 111 - 119. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Geiszt, K. Lekstrom, and T. L. Leto Analysis of mRNA Transcripts from the NAD(P)H Oxidase 1 (Nox1) Gene: EVIDENCE AGAINST PRODUCTION OF THE NADPH OXIDASE HOMOLOG-1 SHORT (NOH-1S) TRANSCRIPT VARIANT J. Biol. Chem., December 3, 2004; 279(49): 51661 - 51668. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Kupfer, S. D. Drabenstot, K. L. Buchanan, H. Lai, H. Zhu, D. W. Dyer, B. A. Roe, and J. W. Murphy Introns and Splicing Elements of Five Diverse Fungi Eukaryot. Cell, October 1, 2004; 3(5): 1088 - 1100. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Uchimura, K. Kadomatsu, F. M. El-Fasakhany, M. S. Singer, M. Izawa, R. Kannagi, N. Takeda, S. D. Rosen, and T. Muramatsu N-Acetylglucosamine 6-O-Sulfotransferase-1 Regulates Expression of L-Selectin Ligands and Lymphocyte Homing J. Biol. Chem., August 13, 2004; 279(33): 35001 - 35008. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. L. Orlov and V. N. Potapov Complexity: an internet resource for analysis of DNA sequence complexity Nucleic Acids Res., July 1, 2004; 32(suppl_2): W628 - W633. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Nesbit, M. R. Bowl, B. Harding, A. Ali, A. Ayala, C. Crowe, A. Dobbie, G. Hampson, I. Holdaway, M. A. Levine, et al. Characterization of GATA3 Mutations in the Hypoparathyroidism, Deafness, and Renal Dysplasia (HDR) Syndrome J. Biol. Chem., May 21, 2004; 279(21): 22624 - 22634. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Borgono, I. P. Michael, and E. P. Diamandis Human Tissue Kallikreins: Physiologic Roles and Applications in Cancer Mol. Cancer Res., May 1, 2004; 2(5): 257 - 280. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhu, S. D. Schlueter, and V. Brendel Refined Annotation of the Arabidopsis Genome by Complete Expressed Sequence Tag Mapping Plant Physiology, June 1, 2003; 132(2): 469 - 484. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Morimoto-Tomita, K. Uchimura, Z. Werb, S. Hemmerich, and S. D. Rosen Cloning and Characterization of Two Extracellular Heparin-degrading Endosulfatases in Mice and Humans J. Biol. Chem., December 13, 2002; 277(51): 49175 - 49185. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Vivian, Y. Chen, D. Yee, E. Schneider, and T. Magnuson An allelic series of mutations in Smad2 and Smad4 identified in a genotype-based screen of N-ethyl-N- nitrosourea-mutagenized mouse embryonic stem cells PNAS, November 26, 2002; 99(24): 15542 - 15547. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kominato, Y. Hata, H. Takizawa, K. Matsumoto, K. Yasui, J.-i. Tsukada, and F.-i. Yamamoto Alternative Promoter Identified between a Hypermethylated Upstream Region of Repetitive Elements and a CpG Island in Human ABO Histo-blood Group Genes J. Biol. Chem., September 27, 2002; 277(40): 37936 - 37948. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zavolan, E. van Nimwegen, and T. Gaasterland Splice Variation in Mouse Full-Length cDNAs Identified by Mapping to the Mouse Genome Genome Res., September 1, 2002; 12(9): 1377 - 1385. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Farrer, A. B. Roller, W. J. Kent, and A. M. Zahler Analysis of the role of Caenorhabditis elegans GC-AG introns in regulated splicing Nucleic Acids Res., August 1, 2002; 30(15): 3360 - 3367. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Pollard, A. R. Krainer, S. C. Robson, and G. N. Europe-Finner Alternative Splicing of the Adenylyl Cyclase Stimulatory G-protein Galpha s Is Regulated by SF2/ASF and Heterogeneous Nuclear Ribonucleoprotein A1 (hnRNPA1) and Involves the Use of an Unusual TG 3'-Splice Site J. Biol. Chem., May 3, 2002; 277(18): 15241 - 15251. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Altschmied, J. Delfgaauw, B. Wilde, J. Duschl, L. Bouneau, J.-N. Volff, and M. Schartl Subfunctionalization of Duplicate mitf Genes Associated With Differential Degeneration of Alternative Exons in Fish Genetics, May 1, 2002; 161(1): 259 - 267. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-H. Huang, Y.-T. Chen, J.-J. Lai, S.-T. Yang, and U.-C. Yang PALS db: Putative Alternative Splicing database Nucleic Acids Res., January 1, 2002; 30(1): 186 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Levine and R. Durbin A computational scan for U12-dependent introns in the human genome sequence Nucleic Acids Res., October 1, 2001; 29(19): 4006 - 4013. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Thanaraj and F. Clark Human GC-AG alternative intron isoforms with weak donor sites show enhanced consensus at acceptor exon positions Nucleic Acids Res., June 15, 2001; 29(12): 2581 - 2593. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||















