Nucleic Acids Research, 2001, Vol. 29, No. 1 255-259
© 2001 Oxford University Press

SpliceDB: database of canonical and non-canonical mammalian splice sites

M. Burset, I. A. Seledtsov1 and V. V. Solovyev*

The Sanger Centre, Hinxton, Cambridge CB10 1SA, UK and 1Softberry Inc., 108 Corporate Park Drive, Suite 120, White Plains, NY 10604, USA

Received September 7, 2000; Revised and Accepted October 31, 2000.


    ABSTRACT
 
A database (SpliceDB) of known mammalian splice site sequences has been developed. We extracted 43 337 splice pairs from mammalian divisions of the gene-centered Infogene database, including sites from incomplete or alternatively spliced genes. Known EST sequences supported 22 815 of them. After discarding sequences with putative errors and ambiguous location of splice junctions, the verified dataset includes 22 489 entries. Of these, 98.71% contain canonical GT–AG junctions (22 199 entries) and 0.56% have non-canonical GC–AG splice site pairs. The remainder (0.73%) falls into many small groups, each accounting for at most 0.05% of entries. We paid particular attention to non-canonical splice sites, which comprise 3.73% of GenBank annotated splice pairs. EST alignments allowed us to verify only the exonic part of splice sites. To check the conserved dinucleotides we compared sequences of human non-canonical splice sites with sequences from the high throughput genome sequencing project (HTG). Out of 171 human non-canonical and EST-supported splice pairs, 156 (91.23%) had a clear match in the human HTG. After sequence analysis they can be classified as: 79 GC–AG pairs (of which one was an error that was corrected to GC–AG), 61 errors corrected to canonical GT–AG pairs, six AT–AC pairs (of which two were errors corrected to AT–AC), one case produced from a non-existent intron, seven cases found in HTG sequences that had been deposited in GenBank, and only two other cases of supported non-canonical splice pairs. The information about verified splice site sequences for canonical and non-canonical sites is presented in SpliceDB with the supporting evidence. We also built weight matrices for the major splice groups, which can be incorporated into gene prediction programs. SpliceDB is available at the computational genomic Web server of the Sanger Centre: http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.


    INTRODUCTION
 
The database was generated as a result of our interest in characterizing the observed types of splice sites. New sequences arriving every day from genome sequencing projects are mostly annotated with computationally generated information, and there is no straightforward procedure for retrieving experimentally supported splice site sequences in order to study their properties. Our current knowledge of how the cell specifies splice sites is not sufficient for accurate and comprehensive computational identification of splice junctions in genomic sequences. Characterization of all known splice sites can help us to improve the quality of gene structure prediction programs. Moreover, many annotated non-canonical splice site sequences may appear in databases as a result of sequencing or annotation errors (1,2). These errors should be found and then corrected or discarded when investigating splice site characteristics.

EST sequences were used as an independent source of information to verify the annotated splice pairs. This approach was suggested and exploited by Thanaraj (3), who selected genes without alternative splicing and generated a complex splice site classification system based on the EST matches found. We extended this approach by using high throughput genome sequencing project (HTG) genomic sequences to verify the exonic and intronic composition of splice sites, and applied it to the analysis of mammalian genes (4).

Our analysis comprises constitutively as well as alternatively spliced genes. Therefore all kinds of spliced introns of the same gene are included in the database. This is the first public database describing alternative introns supported by ESTs and non-canonical splice junctions.


    EST BASED CLASSIFICATION
 
Every EST similar to any splice construct can be classified, depending on the quality and type of its alignment with the annotated gene sequence, as one of the following (for more details see figure 1b in 4):

D-end: the EST covers only the left exon;

A-end: the EST covers only the right exon;

B-ends: the EST overlaps a splice junction, covering the left and right exons with no more than one substitution;

Error: the EST overlaps a splice junction, covering the left and right exons with several mismatches and/or gaps.
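The four alignment categories can be sketched as a small function. This is only an illustrative reading of the definitions above: the function name, its parameters, and the exact threshold separating "no more than one substitution" from "several mismatches" are assumptions, not part of the published procedure.

```python
from typing import Optional


def classify_est(covers_left: bool, covers_right: bool,
                 substitutions: int = 0, gaps: int = 0) -> Optional[str]:
    """Label one EST alignment against an annotated splice construct.

    Hypothetical helper: the inputs are assumed to come from an alignment
    step that is not shown here.
    """
    if covers_left and covers_right:
        # The EST spans the junction. Assumed threshold: at most one
        # substitution and no gaps counts as a clean match.
        if gaps == 0 and substitutions <= 1:
            return "B-ends"
        return "Error"
    if covers_left:
        return "D-end"   # only the left (donor-side) exon is covered
    if covers_right:
        return "A-end"   # only the right (acceptor-side) exon is covered
    return None          # the EST does not overlap either exon


print(classify_est(True, True, substitutions=1))
print(classify_est(True, True, gaps=2))
```

Returning `None` for an EST that overlaps neither exon is a design choice of this sketch; the source does not name such a category.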

When all EST alignments for the same spliced construct have been obtained, every splice site can be classified using the following rules:

(i) if there is some B-end EST, then classify as ‘Supported junction’ (B20) splice pair, otherwise;

(ii) if there is some Error EST, then classify as ‘Error in junction’ (Err) splice pair, otherwise;

(iii) if there is some D-end AND some A-end EST, then classify as ‘Unsupported junction but supported exons’ (5+3) splice pair, otherwise;

(iv) if there is some D-end EST, then classify as ‘Only supported 5' exon’ (5pr) splice pair, otherwise;

(v) if there is some A-end EST, then classify as ‘Only supported 3' exon’ (3pr) splice pair, otherwise;

(vi) classify as ‘Completely unsupported’ (Uns) splice pair.

Finally, all splice pairs classified as ‘Supported junction’ but showing low conservation (identity <95%) within 20 bp on each side of the splice junction were reclassified as ‘Error in junction’.
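Rules (i) to (vi) and the final reclassification step form a simple priority cascade, which can be sketched as follows. The function and parameter names are hypothetical, and how the identity within 20 bp of the junction is measured is not specified here, so it is passed in as an assumed, precomputed fraction.

```python
from typing import Iterable, Optional


def classify_splice_pair(est_labels: Iterable[str],
                         junction_identity: Optional[float] = None) -> str:
    """Apply rules (i)-(vi) to the EST labels collected for one splice pair.

    est_labels: labels such as "B-ends", "Error", "D-end", "A-end".
    junction_identity: assumed fraction (0..1) of identical bases within
        20 bp on each side of the junction; None means "not evaluated".
    """
    labels = set(est_labels)
    if "B-ends" in labels:                        # rule (i)
        # Final step: low conservation near the junction demotes the pair.
        if junction_identity is not None and junction_identity < 0.95:
            return "Err"
        return "B20"
    if "Error" in labels:                         # rule (ii)
        return "Err"
    if "D-end" in labels and "A-end" in labels:   # rule (iii)
        return "5+3"
    if "D-end" in labels:                         # rule (iv)
        return "5pr"
    if "A-end" in labels:                         # rule (v)
        return "3pr"
    return "Uns"                                  # rule (vi)


print(classify_splice_pair(["D-end", "B-ends", "Error"]))
print(classify_splice_pair(["B-ends"], junction_identity=0.90))
```

Because the rules are checked in order, a single B-end EST outweighs any number of Error ESTs, exactly as the "otherwise" chain in the list implies.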


    DATABASE STATUS
 
This first version of SpliceDB was built using mammalian divisions of the InfoGene database (5), which united information from many GenBank (Release 112) (6) entries describing a particular gene. We obtained 43 337 splice site pairs, of which 22 815 were supported by ESTs. Applying corrections explained in Burset et al. (4) this number was reduced to 22 489 supported and corrected entries. Subdivisions of data in SpliceDB and their content are listed in Table 1.


 
Table 1. Characteristics of the SpliceDB divisions
 
More than half (65.69%) of the database entries come from human sequences, so we decided to keep separate sets of human splice sites. Scientists working on human genes can go directly to these sequences, and the separate sets also allowed us to compare human non-canonical splice sites with HTGs. We originally obtained 28 468 human splice site pairs, of which 15 645 were supported by ESTs (68.57% of all EST-supported pairs). After the correction procedures, 15 434 verified human splice pairs (68.63% of all verified entries) were placed in the corresponding subdivision.

All subdivisions are subsets of the complete set of annotated mammalian splice pairs, and users can retrieve sequences for any combination of the groups of interest.

The mammalian and human files are divided at every filter stage into canonical and non-canonical introns. There are three filter stages for every group: the first comprises all splice site pairs as originally annotated in GenBank; the second comprises pairs supported by ESTs; and the third comprises pairs that are supported by ESTs and automatically corrected, meaning that all ambiguous junctions have been discarded [see Burset et al. (4) for details] (Table 1).

The human non-canonical subdivision contains a special file with the subset of non-canonical, EST-supported and corrected splice pairs that are also supported by HTG. The analysis of alignment information and of possible corrections using HTG was done manually, studying the sequence alignments case by case.

Several examples of EST-supported entries are presented in Figure 1a. As was indicated in Burset et al. (4), we often observe EST-supported sequences with putative errors or, at least, with ambiguity in splice junction position. The same EST may support two splice junctions or several ESTs may support different junctions, as can be seen in Figure 1b.



 
Figure 1. Examples of different situations in analysis of annotated splice junctions.

 
Using HTG sequences allows us to identify a large variety of sequencing and annotation errors. Some entries have only small sequence errors, such as the first example in Figure 1c, which has only a deletion in the donor site and a substitution in the acceptor site. Here we recovered a canonical GT–AG site shifted three positions downstream. Several other cases present completely unsupported introns (Fig. 1c). One very interesting class of errors is the annotation of pseudogenes. The functional copy can sometimes be identified in HTG sequences by comparing them with ESTs (assuming that only the functional gene copy will generate EST sequences). In one such example we found an A to G substitution upstream of the donor site, which helped to distinguish the functional copy of the gene, with its canonical splice site. The last two examples in Figure 1c cannot be considered sequence errors. They are examples of wrongly annotated non-canonical sites, which we corrected to typically observed non-canonical pairs.


    DATABASE FORMAT
 
All entries in the database are presented in a tabular format, so every line in any file describes a completely specified splice site pair. We use two kinds of field separators: the major parts of every entry are separated by the double symbol ‘@@’, and inside a major part the field separator is an ordinary blank space or tab. This allows us to write long sentences inside a major part while maintaining a clear separation between parts. The typical structure of an entry in SpliceDB is:

ID @@ ACCES @@ INTRON @@ DON @@ ACC @@ SEQ_DON @@ SEQ_ACC @@ EST @@ EST_ACCES @@ CORR

ID (database identifier)
This field always contains a single word: a unique and specific identifier assigned to every pair. The ID is formed from the Infogene (5) entry name, the assigned intron number and the donor and acceptor positions in the original sequence. All these data are joined using the ‘##’ symbol (e.g. HG_0000731##114##122615##122965).

ACCES (accession number)
This field always contains a single word, which refers to one of the original GenBank accession numbers (e.g. AB011399).

INTRON (assigned intron number)
This field always contains a single word: the intron number assigned to every intron pair in the Infogene database (e.g. 114).

DON (donor number)
This field always contains a single word: the donor position in the original Infogene entry (e.g. 122615).

ACC (acceptor number)
This field always contains a single word: the acceptor position in the original Infogene entry (e.g. 122965).

SEQ_DON (nucleotide sequence around donor)
This field always contains a single word: the nucleotide sequence centered on the conserved donor dinucleotide, with 40 bp on each side, forming a total sequence of 82 bp (e.g. aacatctgtctctactggaaacctctgcactgaggagcagattgattgataagcaaaaggcttctactgcatttccatcctt).

SEQ_ACC (nucleotide sequence around acceptor)
This field always contains a single word: the nucleotide sequence centered on the conserved acceptor dinucleotide, with 40 bp on each side, forming a total sequence of 82 bp (e.g. aaaaagctcactttttttgttcttcacattttacaggagcagacgcctccgcctagacctgaagcctaccccatccccactc).

EST (EST classification)
This field always contains a single word: the EST classification obtained [see materials and methods in Burset et al. (4) for details] (e.g. B20).

EST_ACCES (EST accession number supporting classification)
This field always contains a single word: the accession number of the EST used to support our classification (e.g. gb|N35650|N35650).

CORR (possible corrections)
This field is optional and is specified in free text. All possible corrections, based on ESTs or on HTGs, are annotated in this field:

Automatic EST correction in positions [pos1 pos2] using [ESTaccession]. We annotate which positions present ambiguities in addition to the annotated and supported junctions (pos1 and pos2), and the EST accession number that supports the alternative junction (ESTaccession).

HTGs [free text]. We provide information about HTGs corresponding to splice junction sequences [for more details see results in Burset et al. (4)].
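The field layout described above lends itself to a small parser. The following is a minimal sketch, not the authors' software: the class, function and attribute names are hypothetical, it assumes that whitespace around ‘@@’ may or may not be present, and the two sequence windows in the usage lines are made-up placeholders rather than real SpliceDB data (the other values are the examples quoted in the field descriptions).

```python
from dataclasses import dataclass
from typing import Optional

FLANK = 40  # bases on each side of the conserved dinucleotide (82 bp total)


@dataclass
class SpliceEntry:
    entry_id: str
    infogene_name: str
    accession: str
    intron: int
    donor_pos: int
    acceptor_pos: int
    seq_don: str
    seq_acc: str
    est_class: str
    est_accession: str
    corr: Optional[str] = None


def parse_entry(line: str) -> SpliceEntry:
    """Split one SpliceDB line into its '@@'-separated major parts."""
    # maxsplit=9 keeps any later '@@' inside the free-text CORR part.
    parts = [p.strip() for p in line.rstrip("\n").split("@@", 9)]
    if len(parts) < 9:
        raise ValueError(f"expected at least 9 major parts, got {len(parts)}")
    name = parts[0].split("##")[0]  # Infogene entry name from the ID
    corr = parts[9] if len(parts) == 10 and parts[9] else None  # optional
    return SpliceEntry(parts[0], name, parts[1], int(parts[2]),
                       int(parts[3]), int(parts[4]), parts[5], parts[6],
                       parts[7], parts[8], corr)


def conserved_dinucleotide(window: str) -> str:
    """Return the two central bases of an 82-bp SEQ_DON or SEQ_ACC window."""
    if len(window) != 2 * FLANK + 2:
        raise ValueError("expected an 82-bp window")
    return window[FLANK:FLANK + 2].upper()


don = "a" * 40 + "gt" + "c" * 40   # placeholder window, not real data
acc = "c" * 40 + "ag" + "a" * 40   # placeholder window, not real data
line = " @@ ".join(["HG_0000731##114##122615##122965", "AB011399", "114",
                    "122615", "122965", don, acc, "B20", "gb|N35650|N35650"])
entry = parse_entry(line)
print(entry.infogene_name, entry.intron,
      conserved_dinucleotide(entry.seq_don),
      conserved_dinucleotide(entry.seq_acc))
```

Splitting with a maximum of nine splits is a deliberate choice: CORR is free text, so anything after the ninth separator is kept verbatim.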


    CONSENSUS AND WEIGHT MATRICES
 
After analysis of the information presented, we conclude that practically all splice site pairs are limited to three types: GT–AG, GC–AG and AT–AC. Other kinds of introns, if they exist, occur at a very low frequency (~0.02% or less).
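Assigning a pair to one of these three types only requires the donor and acceptor dinucleotides. A minimal sketch follows; the function names and the "other" label are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# The three major types named in the text, keyed by (donor, acceptor).
MAJOR_TYPES = {
    ("GT", "AG"): "GT-AG",
    ("GC", "AG"): "GC-AG",
    ("AT", "AC"): "AT-AC",
}


def splice_pair_type(donor: str, acceptor: str) -> str:
    """Name the splice pair type, or "other" for anything else."""
    return MAJOR_TYPES.get((donor.upper(), acceptor.upper()), "other")


def type_shares(pairs: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Percentage of pairs falling into each type."""
    counts = Counter(splice_pair_type(d, a) for d, a in pairs)
    total = sum(counts.values())
    return {t: 100.0 * n / total for t, n in counts.items()} if total else {}


print(splice_pair_type("gt", "ag"))
print(type_shares([("GT", "AG")] * 3 + [("GC", "AG")]))
```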

Alignment of conserved dinucleotides in every type of splice site allows us to observe a certain degree of conservation in surrounding nucleotides, which in practice means a deviation in observed frequencies with respect to expected random distribution. Often this information has been presented in the form of consensus sequences (for every column in aligned sequences we write the most representative nucleotide, or group of nucleotides, indicating this frequency or percent) and as frequency matrices (for every column in aligned sequences we represent the frequency or percentage for every nucleotide, creating a matrix of four rows and as many columns as significant positions). Frequency matrices are more informative and used in gene prediction programs, but a relatively high number of aligned sequences is needed to obtain discriminative matrices.
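To make the two representations concrete, the sketch below builds a frequency matrix (four rows, one column per aligned position) from equal-length sequences and derives a consensus from it. The names are hypothetical, the toy sequences are invented, and the consensus here simply takes the most frequent nucleotide per column, which is a simplification of consensus notations that also use groups of nucleotides.

```python
from collections import Counter
from typing import Dict, List, Sequence

NUCLEOTIDES = "ACGT"


def frequency_matrix(seqs: Sequence[str]) -> Dict[str, List[float]]:
    """Per-column nucleotide frequencies for aligned, equal-length sequences."""
    if not seqs:
        raise ValueError("need at least one sequence")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same length")
    matrix = {nt: [0.0] * length for nt in NUCLEOTIDES}
    for col in range(length):
        counts = Counter(s[col].upper() for s in seqs)
        # Ignore symbols other than A, C, G, T when normalising.
        total = sum(counts[nt] for nt in NUCLEOTIDES)
        for nt in NUCLEOTIDES:
            matrix[nt][col] = counts[nt] / total if total else 0.0
    return matrix


def consensus(matrix: Dict[str, List[float]]) -> str:
    """Most frequent nucleotide in every column (ties resolved in ACGT order)."""
    length = len(matrix["A"])
    return "".join(
        max(NUCLEOTIDES, key=lambda nt: matrix[nt][col]) for col in range(length)
    )


toy = ["AGGT", "AGGT", "CGGT", "AAGT"]  # invented sequences, not SpliceDB data
m = frequency_matrix(toy)
print(m["A"][0], consensus(m))
```

With only a handful of sequences the column frequencies are coarse, which illustrates why a relatively large aligned set is needed before such a matrix becomes discriminative.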

We present frequency matrices built on verified datasets for GT–AG and GC–AG pair sequences. Because we have only a small number of AT–AC cases, only consensus sequences are provided for these splice sites (Fig. 2).




 
Figure 2. Consensus sequences and weight matrices for major groups of splice site pairs. Frequency matrices have only been calculated for major splice site groups (GT–AG and GC–AG). In the first row we indicated positions with respect to the splice cut point, which is always between –1 and 1. It should be taken into account that negative numbers in donor matrices correspond to exonic regions, but in acceptor matrices positive numbers correspond to exonic regions. In consensus sequences | means cut position (M: A or C, R: A or G, Y: C or T, S: C or G).
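The coordinate convention in the caption of Figure 2 (no position 0, with the cut between –1 and 1) can be generated as follows. The function name is hypothetical, and the window sizes in the usage lines are assumptions: they combine the caption with the 40 bp flanks described under DATABASE FORMAT and assume each window is written 5' to 3', so that a donor window starts in the exon and an acceptor window starts in the intron. The matrices in Figure 2 may cover fewer positions.

```python
from typing import List


def position_labels(n_before_cut: int, n_after_cut: int) -> List[int]:
    """Signed positions relative to the cut point, skipping zero."""
    return list(range(-n_before_cut, 0)) + list(range(1, n_after_cut + 1))


# Assumed donor window: 40 exonic bases, cut, then dinucleotide + 40 bases.
donor_labels = position_labels(40, 42)
# Assumed acceptor window: 40 bases + dinucleotide, cut, then 40 exonic bases.
acceptor_labels = position_labels(42, 40)

print(position_labels(2, 3))
print(donor_labels[39], donor_labels[40])
```

Under these assumptions the negative labels of the donor window and the positive labels of the acceptor window are the exonic ones, matching the caption.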

 
SpliceDB is available on the Sanger Centre computational genomic Web server at http://genomic.sanger.ac.uk/spldb/SpliceDB.html and at http://www.softberry.com/spldb/SpliceDB.html.


    DISCUSSION
 
We have applied ESTs and HTG sequences to verify mammalian splice junctions, but there are other model organisms with large amounts of genomic data that would be interesting to analyze, such as Drosophila melanogaster, Caenorhabditis elegans or Arabidopsis thaliana. We plan to extend our database to nearly all model eukaryotic organisms. Another challenge is to automate the HTG analysis as far as possible, because manual intervention is very time consuming (although more accurate).

Observation of practically only three types of splice sites simplifies the problem of their computational identification by gene prediction programs. GC–AG splice pairs have been incorporated into the Fgenesh program by Salamov and Solovyev (7), which maintained its accuracy level despite considering many more potential splice variants. Addition of AT–AC splice sites (which occur with very low frequency) will probably wait until more examples have accumulated, allowing us to better describe the site characteristics.

Another point to consider is whether to include information about gene structures supported by ESTs. It might be useful for training gene prediction software, because such software is very sensitive to sequence errors, especially in conserved positions of splice site sequences. Including information about alternative intron positions is very important for developing gene prediction programs that will generate alternative splicing variants.


    FOOTNOTES
 
* To whom correspondence should be addressed at present address: EOS Biotechnology, 225A Gateway Boulevard, South San Francisco, CA 94080, USA. Tel: +1 650 246 2331; Fax: +1 650 583 3881; Email: solovyev@eosbiotech.com

Present address: M. Burset, Institut Municipal d’Investigació Mèdica (IMIM), C/Dr Aiguader 80, 08003 Barcelona, Spain


    REFERENCES
 

    1 Penotti,F.E. (1991) Human Pre-mRNA splicing signals. J. Theor. Biol., 150, 385–420.

    2 Jackson,I.J. (1991) A reappraisal of non-consensus mRNA splice sites. Nucleic Acids Res., 19, 3795–3798.

    3 Thanaraj,T.A. (1999) A clean data set of EST-confirmed splice sites from Homo sapiens and canonicals for clean-up procedures. Nucleic Acids Res., 27, 2627–2637.

    4 Burset,M., Seledtsov,I.A. and Solovyev,V.V. (2000) Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res., 28, 4364–4375.

    5 Solovyev,V.V. and Salamov,A.A. (1999) INFOGENE: a database of known gene structures and predicted genes and proteins in sequences of genome sequencing projects. Nucleic Acids Res., 27, 248–250.

    6 Benson,D.A., Boguski,M.S., Lipman,D.J., Ostell,J., Ouellette,B.F., Rapp,B.A. and Wheeler,D.L. (1999) GenBank. Nucleic Acids Res., 27, 12–17.

    7 Salamov,A. and Solovyev,V. (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Res., 10, 516–522.

