Skip Navigation

Nucleic Acids Research 2004 32(Web Server Issue):W166-W169; doi:10.1093/nar/gkh428
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (169K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Casillas, S.
Right arrow Articles by Barbadilla, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Casillas, S.
Right arrow Articles by Barbadilla, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2004, the authors
Nucleic Acids Research, Vol. 32, Web Server issue © Oxford University Press 2004; all rights reserved

PDA: a pipeline to explore and estimate polymorphism in large DNA databases

Sònia Casillas and Antonio Barbadilla*

Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain

* To whom correspondence should be addressed. Tel: +34 935 812 730; Fax: +34 935 812 387; Email: Antonio.Barbadilla{at}uab.es

Received February 15, 2004; Revised and Accepted April 13, 2004

Polymorphism studies are one of the main research areas of this genomic era. To date, however, no available web server or software package has been designed to automate the process of exploring and estimating nucleotide polymorphism in large DNA databases. Here, we introduce a novel software, PDA, Pipeline Diversity Analysis, that automatically can (i) search for polymorphic sequences in large databases, and (ii) estimate their genetic diversity. PDA is a collection of modules, mainly written in Perl, which works sequentially as follows: unaligned sequence retrieved from a DNA database are automatically classified by organism and gene, and aligned using the ClustalW algorithm. Sequence sets are regrouped depending on their similarity scores. Main diversity parameters, including polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium and codon bias are estimated both for the full length of the sequences and for specific functional regions. Program output includes a database with all sequences and estimations, and HTML pages with summary statistics, the performed alignments and a histogram maker tool. PDA is an essential tool to explore polymorphism in large DNA databases for sequences from different genes, populations or species. It has already been successfully applied to create a secondary database. PDA is available on the web at http://pda.uab.es/.


The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
J. K. Biedler, H. Shao, and Z. Tu
Evolution and Horizontal Transfer of a DD37E DNA Transposon in Mosquitoes
Genetics, December 1, 2007; 177(4): 2553 - 2558.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Egea, S. Casillas, E. Fernandez, M. A. Senar, and A. Barbadilla
MamPol: a database of nucleotide polymorphism in the Mammalia class
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D624 - D629.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Casillas and A. Barbadilla
PDA v.2: improving the exploration and estimation of nucleotide polymorphism in large datasets of heterogeneous DNA.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W632 - W634.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Cavallo and A. C. R. Martin
Mapping SNPs to protein sequence and structure data
Bioinformatics, April 15, 2005; 21(8): 1443 - 1450.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.