PatMatch: a program for finding patterns in peptide and nucleotide sequences
1 Department of Plant Biology, Carnegie Institution of Washington, 260 Panama Street, Stanford, CA 94305, USA
2 Department of Computer Engineering, Santa Clara University, 500 El Camino Real, Santa Clara, CA 95053, USA
3 National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM 87505, USA
4 Department of Genetics, Stanford University, Stanford, CA 94305, USA
*To whom correspondence should be addressed. Tel: +1 650 325 1521 ext 251; Fax: +1 650 325 6857; Email: rhee@acoma.stanford.edu
Received February 11, 2005. Revised February 28, 2005. Accepted February 28, 2005.
Here, we present PatMatch, an efficient, web-based pattern-matching program that enables searches for short nucleotide or peptide sequences such as cis-elements in nucleotide sequences or small domains and motifs in protein sequences. The program can be used to find matches to a user-specified sequence pattern that can be described using ambiguous sequence codes and a powerful and flexible pattern syntax based on regular expressions. A recent upgrade has improved performance and now supports both mismatches and wildcards in a single pattern. This enhancement has been achieved by replacing the previous searching algorithm, scan_for_matches [D'Souza et al. (1997), Trends in Genetics, 13, 497–498], with nondeterministic-reverse grep (NR-grep), a general pattern matching tool that allows for approximate string matching [Navarro (2001), Software Practice and Experience, 31, 1265–1312]. We have tailored NR-grep to be used for DNA and protein searches with PatMatch. The stand-alone version of the software can be adapted for use with any sequence dataset and is available for download at The Arabidopsis Information Resource (TAIR) at ftp://ftp.arabidopsis.org/home/tair/Software/Patmatch/. The PatMatch server is available on the web at http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl for searching Arabidopsis thaliana sequences.
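To make the idea of searching with ambiguous sequence codes and mismatches concrete, the following is a minimal Python sketch. It is not the PatMatch or NR-grep implementation: it assumes the standard IUPAC nucleotide ambiguity codes, uses Python's `re` module for exact matching, and substitutes a naive sliding-window scan (substitutions only, no insertions or deletions) for NR-grep's approximate matching. The function names `iupac_to_regex`, `find_exact` and `find_with_mismatches` are hypothetical and chosen for illustration only.

```python
import re

# Standard IUPAC nucleotide ambiguity codes and the bases each one stands for.
IUPAC_DNA = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def iupac_to_regex(pattern):
    """Translate an IUPAC nucleotide pattern into a regular expression string."""
    parts = []
    for code in pattern.upper():
        bases = IUPAC_DNA[code]
        # Unambiguous codes stay literal; ambiguous codes become character classes.
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def find_exact(pattern, sequence):
    """Return 0-based start positions of all matches, including overlapping ones."""
    # The lookahead lets the regex engine report overlapping occurrences.
    regex = re.compile("(?=(" + iupac_to_regex(pattern) + "))")
    return [m.start() for m in regex.finditer(sequence.upper())]


def find_with_mismatches(pattern, sequence, max_mismatches):
    """Naive scan that tolerates up to max_mismatches substitutions per window.

    Illustrative stand-in only; NR-grep uses a far more efficient algorithm.
    """
    pattern = pattern.upper()
    sequence = sequence.upper()
    hits = []
    for start in range(len(sequence) - len(pattern) + 1):
        window = sequence[start:start + len(pattern)]
        # A position mismatches when the base is not allowed by the pattern code.
        mismatches = sum(
            base not in IUPAC_DNA[code] for code, base in zip(pattern, window)
        )
        if mismatches <= max_mismatches:
            hits.append((start, window, mismatches))
    return hits


if __name__ == "__main__":
    seq = "TTCACGTGAACATATGTT"
    print(iupac_to_regex("CANNTG"))
    print(find_exact("CANNTG", seq))
    print(find_with_mismatches("CACGTG", seq, 0))
```

In this sketch the pattern `CANNTG` uses `N` as a wildcard position, while `find_with_mismatches` shows how a mismatch budget can be combined with ambiguity codes in the same pattern. The sliding-window scan costs time proportional to sequence length times pattern length, which is adequate for illustration but not for genome-scale searches.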
Present address: Lukas A. Mueller, Cornell University, Emerson Hall Room 251, Ithaca, NY 14853, USA


