Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (119K) Freely available
Right arrow Supplementary Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (28)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Dralyuk, I.
Right arrow Articles by Dubchak, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dralyuk, I.
Right arrow Articles by Dubchak, I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2000, Vol. 28, No. 1 296-297
© 2000 Oxford University Press

ASDB: database of alternatively spliced genes

I. Dralyuk, M. Brudno, M. S. Gelfand1, M. Zorn and I. Dubchak*

National Energy Research Scientific Computing Center, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA and 1State Scientific Center for Biotechnology NIIGenetika, Moscow 113545, Russia

Received October 4, 1999; Accepted October 6, 1999.


    ABSTRACT
 TOP
 ABSTRACT
 DATABASE DESCRIPTION
 SUPPLEMENTARY MATERIAL
 AVAILABILITY
 REFERENCES
 
Version 2.1 of ASDB (Alternative Splicing Data Base) contains 1922 protein and 2486 DNA sequences. The protein entries from SWISS-PROT are joined into clusters corresponding to alternatively spliced variants of one gene. The DNA division consists of complete genes with alternative splicing mentioned or annotated in GenBank. The search engine allows one to search over SWISS-PROT and GenBank fields and then follow the links to all variants. The database can be assessed at the URL http://cbcg.nersc.gov/asdb


    DATABASE DESCRIPTION
 TOP
 ABSTRACT
 DATABASE DESCRIPTION
 SUPPLEMENTARY MATERIAL
 AVAILABILITY
 REFERENCES
 
Version 2.1 of ASDB consists of two divisions, ASDB(proteins), which contains amino acid sequences, and ASDB(nucleotides) with genomic sequences. ASDB(nucleotides) was developed in 1999, while ASDB(proteins) was updated with the latest data from SWISS-PROT and improved clustering procedures described below.

SWISS-PROT uses two formats for description of alternative splicing. Thus the protein sequences were selected from SWISS-PROT (1) using full text search for the words ‘alternative splicing’ (usually in the CC lines) and ‘varsplic’ (in the FT lines). This search generated 1922 initial entries. Some entries describe just one alternatively spliced variant, some (those that include the ‘varsplic’ field) indicate several.

In order to group proteins that could arise by alternative splicing of the same gene, we developed the clustering procedure. Two proteins were linked if they had a common fragment of at least 20 amino acids, and clusters were initially defined as maximum connected groups of linked proteins (2). Each cluster was represented by multiple alignment of its members constructed using CLUSTALW (3).

It turned out that some clusters were chimeric, in the sense that they contained members of multigene families, but not alternatively spliced variants of one gene. Therefore the multiple alignments were subject to additional analysis aimed at detection of chimeric clusters. This post-processing was based on the following assumption: bona fide alternative variants have few relatively long mismatching regions, as opposed to homologous proteins whose alignment contains multiple short mismatches. However, random matching residues within alternative fragments complicate implementation of this criterion. Thus, as the first step, short runs of matches between mismatches (up to three matching amino acids) were marked as mismatches. During the second step the mismatched fragments were counted. If there were less than five such fragments, the proteins were assumed to be alternatively spliced variants. If there were more than 15, the proteins were assumed to be homologues. The intermediate cases were subject to case by case manual analysis. Through the use of this procedure all chimeric clusters were removed. The distribution of cluster size, representation of species and other relevant statistics can be retrieved from the ASDB Web site.

This processing covers the cases when alternatively spliced variants are described in separate SWISS-PROT entries. The other kind of ASDB records, originating from the SWISS-PROT entries with the ‘varsplic’ field in the feature table, usually describe the proteins that are not part of any cluster. In these cases, information on the variable fragments of the several proteins which result from the alternative splicing of a single gene is contained in the entry itself. ASDB(proteins) entries are marked with different symbols to allow for easy differentiation among the three types: those proteins which are part of the ASDB clusters and the corresponding multi-alignments, those which have information on different variants in the associated SWISS-PROT entries, and those for which information on the variants is not available at the present time. ASDB contains internal links between entries and/or clusters, as well as external links to MEDLINE, GenBank and SWISS-PROT entries.

The ASDB(nucleotides) division was generated by collecting all GenBank (4) entries containing the words ‘alternative splicing’ and further selection of those entries that contain complete gene sequences (all CDS fields are complete, i.e., they do not have continuation signs). The distribution of the number of entries over the most represented species is given in Figure 1.



View larger version (116K):
[in this window]
[in a new window]
 
Figure 1. The number of ASDB(nucleotide) entries for the most represented species

 
The database can be searched using MEDLINE, SWISS-PROT (1) and GenBank (4) identifiers and accession numbers. Standard context search can be performed over SWISS-PROT and GenBank keywords, description, taxonomy, comment fields and feature tables. Standard boolean logic is supported to allow for a wider range of queries.


    SUPPLEMENTARY MATERIAL
 TOP
 ABSTRACT
 DATABASE DESCRIPTION
 SUPPLEMENTARY MATERIAL
 AVAILABILITY
 REFERENCES
 
Instructions on using ASDB and explanation of its various features are available via NAR Online.


    AVAILABILITY
 TOP
 ABSTRACT
 DATABASE DESCRIPTION
 SUPPLEMENTARY MATERIAL
 AVAILABILITY
 REFERENCES
 
ASDB is available at the URL http://cbcg.nersc.gov/asdb . The administrator of the database can be contacted by Email at asdb@lbl.gov


    ACKNOWLEDGEMENTS
 
We are grateful to Sylvia Spengler, Andrey Mironov and Pavel Pevzner for useful discussions. This work was supported by the Director, Office of Energy Research, Office of Biological and Environmental Research, of the US Department of Energy under Contract No. DE-ACO3-76SF00098. M.G. was partially supported by grants from the Russian Fund of Basic Research (99-04-48347), the Russian State Scientific Program ‘Human Genome’ (65/99), and the Merck Genome Research Institute (244).


    FOOTNOTES
 
* To whom correspondence should be addressed. Tel: +1 510 495 2419; Fax: +1 510 486 5717; Email: ildubchak@lbl.gov Back


    REFERENCES
 TOP
 ABSTRACT
 DATABASE DESCRIPTION
 SUPPLEMENTARY MATERIAL
 AVAILABILITY
 REFERENCES
 

    1 Bairoch,A. and Apweiler,R. (1999) Nucleic Acids Res., 27, 49–54. Updated article in this issue: Nucleic Acids Res. (2000), 28, 45–48.[Abstract/Free Full Text]

    2 Gelfand,M.S., Dubchak,I., Dralyuk,I. and Zorn,M. (1999) Nucleic Acids Res., 27, 301–302.[Abstract/Free Full Text]

    3 Thompson,J.D., Gibson,T.J., Plewniak,F., Jeanmougin,F. and Higgins,D.G. (1997) Nucleic Acids Res., 25, 4876–4882.[Abstract/Free Full Text]

    4 Benson,D.A., Boguski,M.S., Lipman,D.J., Ostell,J., Ouellette,B.F.F., Rapp,B.A. and Wheeler,D.L. (1999) Nucleic Acids Res., 27, 12–17. Updated article in this issue: Nucleic Acids Res. (2000), 28, 15–18.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
RNAHome page
C. L. ZHENG, Y.-S. KWON, H.-R. LI, K. ZHANG, G. COUTINHO-MANSFIELD, C. YANG, T. M. NAIR, M. GRIBSKOV, and X.-D. FU
MAASE: An alternative splicing database designed for supporting splicing microarray applications
RNA, December 1, 2005; 11(12): 1767 - 1776.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Kim, N. Kim, Y. Lee, B. Kim, Y. Shin, and S. Lee
ECgene: genome annotation for alternative splicing
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D75 - D79.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H.-D. Huang, J.-T. Horng, F.-M. Lin, Y.-C. Chang, and C.-C. Huang
SpliceInfo: an information repository for mRNA alternative splicing in human genome
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D80 - D85.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. A. Thanaraj, S. Stamm, F. Clark, J.-J. Riethoven, V. Le Texier, and J. Muilu
ASD: the Alternative Splicing Database
Nucleic Acids Res., January 1, 2004; 32(90001): D64 - 69.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Q. Xu, B. Modrek, and C. Lee
Genome-wide detection of tissue-specific alternative splicing in the human transcriptome
Nucleic Acids Res., September 1, 2002; 30(17): 3754 - 3766.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
H. Kochiwa, R. Suzuki, T. Washio, R. Saito, T. R. G. E. R. G. Phase II Team, H. Bono, P. Carninci, Y. Okazaki, R. Miki, Y. Hayashizaki, et al.
Inferring Alternative Splicing Patterns in Mouse from a Full-Length cDNA Library and Microarray Data
Genome Res., August 1, 2002; 12(8): 1286 - 1293.
[Abstract] [Full Text] [PDF]


Home page
Endocr. Rev.Home page
C. P. Leo, S. Y. Hsu, and A. J. W. Hsueh
Hormonal Genomics
Endocr. Rev., June 1, 2002; 23(3): 369 - 381.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y.-H. Huang, Y.-T. Chen, J.-J. Lai, S.-T. Yang, and U.-C. Yang
PALS db: Putative Alternative Splicing database
Nucleic Acids Res., January 1, 2002; 30(1): 186 - 190.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Ji, Q. Zhou, F. Wen, H. Xia, X. Lu, and Y. Li
AsMamDB: an alternative splice database of mammals
Nucleic Acids Res., January 1, 2001; 29(1): 260 - 263.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (119K) Freely available
Right arrow Supplementary Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (28)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Dralyuk, I.
Right arrow Articles by Dubchak, I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Dralyuk, I.
Right arrow Articles by Dubchak, I.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?