Nucleic Acids Research, 1991, Vol. 19, No. 23, 6565-6572
© 1991
MOLECULAR BIOLOGY
Automated assembly of protein blocks for database searching
Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, WA 98104, USA
Received July 26, 1991. Revised November 7, 1991. Accepted November 7, 1991.
A system is described for finding and assembling the most highly conserved regions of related proteins for database searching. First, an automated version of Smith's algorithm for finding motifs is used for sensitive detection of multiple local alignments. Next, the local alignments are converted to blocks and the best set of non-overlapping blocks is determined. When the automated system was applied successively to all 437 groups of related proteins in the PROSITE catalog, 1764 blocks resulted; these could be used for very sensitive searches of sequence databases. Each block was calibrated by searching the SWISS-PROT database to obtain a measure of the chance distribution of matches, and the calibrated blocks were concatenated into a database that could itself be searched. Examples are provided in which distant relationships are detected either using a set of blocks to search a sequence database or using sequences to search the database of blocks. The practical use of the blocks database is demonstrated by detecting previously unknown relationships between oxidoreductases and by evaluating a proposed relationship between HIV Vif protein and thiol proteases.
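The abstract does not say how the "best set of non-overlapping blocks" is chosen, so the following is only an illustrative sketch of that step, not the authors' method. It assumes each candidate block covers an integer position range and carries a single numeric score, and it substitutes a standard weighted interval scheduling dynamic program for whatever procedure the paper actually uses. The names `Block`, `start`, `end`, `score`, and `best_nonoverlapping` are hypothetical.

```python
from bisect import bisect_right
from typing import List, NamedTuple


class Block(NamedTuple):
    # All fields are illustrative assumptions, not taken from the paper.
    start: int    # first position covered by the block (inclusive)
    end: int      # last position covered by the block (inclusive)
    score: float  # hypothetical conservation score; higher is better


def best_nonoverlapping(blocks: List[Block]) -> List[Block]:
    """Return a maximum-total-score subset of mutually non-overlapping blocks.

    Uses weighted interval scheduling (a stand-in technique).
    """
    ordered = sorted(blocks, key=lambda b: b.end)
    ends = [b.end for b in ordered]

    # best[i] is the best total score achievable with the first i blocks.
    best = [0.0] * (len(ordered) + 1)
    # prev[i] is how many earlier blocks end strictly before block i starts.
    prev: List[int] = []
    for i, b in enumerate(ordered):
        p = bisect_right(ends, b.start - 1, 0, i)
        prev.append(p)
        best[i + 1] = max(best[i], best[p] + b.score)

    # Walk back through the table to recover which blocks were chosen.
    chosen: List[Block] = []
    i = len(ordered)
    while i > 0:
        b = ordered[i - 1]
        if best[prev[i - 1]] + b.score > best[i - 1]:
            chosen.append(b)
            i = prev[i - 1]
        else:
            i -= 1
    chosen.reverse()
    return chosen


if __name__ == "__main__":
    candidates = [
        Block(0, 9, 5.0),
        Block(5, 14, 6.0),   # overlaps both of its neighbours
        Block(10, 19, 5.0),
        Block(20, 29, 3.0),
    ]
    for block in best_nonoverlapping(candidates):
        print(block)
```

In this toy input the middle candidate has the highest individual score but is dropped, because the two blocks it overlaps together score more. A real implementation would also need to respect constraints the abstract only implies, such as the blocks occurring consistently across all sequences in a group.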