Nucleic Acids Research, 2004, Vol. 32, Database issue D134-D137
© 2004 Oxford University Press
Recent improvements to the PROSITE database
Swiss Institute of Bioinformatics (SIB), CMU, University of Geneva, 1 rue Michel Servet, 1211 Geneva 4, Switzerland, 1 Swiss Institute of Bioinformatics (SIB), Biozentrum, University of Basel, Klingelbergstrasse 50–70, CH-4056 Basel, Switzerland and 2 Swiss Institute of Bioinformatics (SIB), Swiss Institute for Experimental Cancer Research (ISREC), CH-1066 Epalinges/Lausanne, Switzerland
*To whom correspondence should be addressed. Tel: +41 21 379 58 72; Fax +41 21 379 58 58; Email: Nicolas.Hulo{at}isb-sib.ch
Received September 15, 2003; Revised and Accepted September 22, 2003
| ABSTRACT |
|---|
The PROSITE database consists of a large collection of biologically meaningful signatures that are described as patterns or profiles. Each signature is linked to documentation that provides useful biological information on the protein family, domain or functional site identified by the signature. The PROSITE web page has been redesigned and several tools have been implemented to help the user discover new conserved regions in their own proteins and to visualize domain arrangements. We have also introduced the facility to search PDB with a PROSITE entry or a user's pattern and to visualize matched positions on 3D structures. The latest version of PROSITE (release 18.17 of November 30, 2003) contains 1676 entries. The database is accessible at http://www.expasy.org/prosite/.
| INTRODUCTION |
|---|
A popular way to identify similarity between proteins is to perform a pairwise alignment. When the identity is >40% this method gives good results. However, the weakness of the pairwise alignment is that no distinction is made between an amino acid at a crucial position (such as an active site) and an amino acid with no critical role. A multiple sequence alignment (MSA) gives a more general view of a conserved region by providing a better picture of the most conserved residues, which are usually essential for the protein function. The various amino acids can then be weighted according to their degree of conservation. Several databases have developed their own methods (descriptors) based on MSA in order to identify conserved regions. A search performed on these databases is generally more sensitive than a pairwise alignment and can help identify very remote similarity (<20%).
The PROSITE database uses two kinds of descriptor to identify conserved regions, patterns and generalized profiles, which each have their own strengths and weaknesses defining their area of optimum application (1).
(i) A pattern or regular expression is a qualitative descriptor: it either matches or does not. Therefore a good pattern is usually located in a short well-conserved region. Such regions are typically enzyme catalytic sites, prosthetic group attachment sites (haem, pyridoxal phosphate, biotin, etc.), metal ion binding amino acids, cysteines involved in disulfide bonds or regions involved in binding a molecule. Even though the scope of a regular expression is limited to these particular biological regions, patterns are still very popular because of their intelligibility for users.
(ii) A profile is a table of position-specific amino acid weights and gap costs. Various methods can be used to fill a profile table from a multiple alignment. Most frequently, a substitution matrix is used to convert a residue frequency distribution into weights, but alternative methods can be applied including structure-based approaches and methods involving hidden Markov modelling (2–4). These weights (also referred to as scores) are used to calculate a similarity score for any alignment between a profile and a sequence, or part of a profile and a sequence. An alignment with a similarity score higher than or equal to a given threshold value constitutes a motif occurrence. This threshold is estimated by calibrating the profile against a randomized protein database. The normalization procedure used for PROSITE profiles makes the normalized scores independent of the database size, allowing the comparison of scores from different searches (5). The quantitative behaviour of a profile allows the acceptance of a mismatch at a highly conserved position if the rest of the sequence displays a sufficiently high level of similarity and therefore allows the detection of poorly conserved domains such as immunoglobulin, SH2 or SH3. Another advantage of profiles over patterns is that they are not confined to small regions with high sequence similarity. Rather, they attempt to characterize a protein family or domain over its entire length.
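The match-or-no-match behaviour of a pattern and the score-based behaviour of a profile can be contrasted with a minimal sketch. It assumes the commonly documented PROSITE pattern conventions (`x` for any residue, `[..]` for allowed residues, `{..}` for excluded residues, `(n)` or `(n,m)` for repeats, `<` and `>` as terminal anchors). The motif, the weights and the cut-off are hypothetical, and gap costs are left out, so this is a strong simplification of a generalized profile rather than the scoring used by PROSITE tools.

```python
import re


def prosite_to_regex(pattern):
    """Translate a PROSITE-style pattern into a Python regular expression."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        prefix = suffix = ""
        if element.startswith("<"):      # '<' anchors the match at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):        # '>' anchors the match at the C-terminus
            suffix, element = "$", element[:-1]
        # Split off an optional repeat count such as x(3) or x(2,4).
        core, low, high = re.fullmatch(
            r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element).groups()
        if core == "x":                  # 'x' accepts any residue
            core = "."
        elif core.startswith("{"):       # {AB} excludes the listed residues
            core = "[^" + core[1:-1] + "]"
        # [AB] and single letters are already valid regex syntax.
        if low:
            core += "{" + low + ("," + high if high else "") + "}"
        parts.append(prefix + core + suffix)
    return "".join(parts)


def scan_profile(profile, sequence, cutoff):
    """Return (start, score) for every ungapped window scoring >= cutoff."""
    hits = []
    for start in range(len(sequence) - len(profile) + 1):
        window = sequence[start:start + len(profile)]
        score = sum(weights.get(residue, weights["default"])
                    for weights, residue in zip(profile, window))
        if score >= cutoff:
            hits.append((start, score))
    return hits


# Hypothetical four-residue motif, written once as a pattern and once as a profile.
PATTERN = "G-[KR]-S-[ST]"
PROFILE = [
    {"G": 5, "default": -2},
    {"K": 4, "R": 4, "default": -1},
    {"S": 5, "default": -3},
    {"S": 3, "T": 3, "default": -1},
]
CUTOFF = 8  # arbitrary value; real cut-offs come from calibration

regex = re.compile(prosite_to_regex(PATTERN))
for seq in ("MAGKSTLL", "MAGKATLL"):  # the second carries one mismatch (S -> A)
    print(seq, bool(regex.search(seq)), scan_profile(PROFILE, seq, CUTOFF))
```

With these toy values the pattern matches only the first sequence, whereas the profile reports a hit at position 2 in both (scores 17 and 9): the mismatch lowers the score but does not abolish the match.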
| PROSITE ANNOTATION AND QUALITY CONTROL |
|---|
Each PROSITE signature is linked to an annotation document where the user can find information on the protein family or domain detected by the signature: origin of its name, taxonomic occurrence, domain architecture, function, 3D structure, main characteristics of the sequence and some references. Recently, for families or domains whose structure is known, a direct link to a representative PDB entry is provided in the documentation, in order to make the description of the 3D structure more comprehensible. All the biological information about a protein family or domain should also be used to evaluate the pertinence of matches with patterns and profiles. If the user has some information about their sequence that is inconsistent with the description of the motif detected, the match should be considered with caution.
The annotation document also contains direct information about the motif descriptors: for patterns, amino acid residues involved in the catalytic mechanism, metal ion or substrate binding, or conserved post-translational modifications are indicated. For profiles, it is stated whether they cover the entire domain or protein or only part of it. Finally, the sensitivity and specificity of the motif are also indicated, as well as an expert to contact, if any.
Biologically meaningful information on specific amino acids can also be found at the CC /SITE line in signature entries. This qualifier is used to indicate the position of an interesting site in a pattern or a profile. For example, if a pattern includes an active site residue, the /SITE qualifier is used to indicate the position of that residue in the pattern. Binding sites and disulfide bridges are also indicated. The ps_scan program, the reference tool to scan PROSITE (6), is able to highlight these positions in a matched region.
A match list of Swiss-Prot entries identified by the signature is also provided. Each protein entering Swiss-Prot is checked for the occurrence of PROSITE patterns or profiles and a match status is assigned (true or false positive or unknown). Proteins that are known to contain the domain but not identified by the signature are also added to the list with the status false negative. Because this match list has been verified manually, it can be used to evaluate the specificity of a given signature. This tight connection with Swiss-Prot also benefits the Swiss-Prot annotation. Some particular Swiss-Prot lines, which refer to the domain organization in the protein, are automatically annotated with PROSITE profiles.
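As an illustration of how a manually verified match list can be used to assess a signature, the sketch below derives two simple quality measures from match statuses. The status labels, the accession numbers and the function name are hypothetical and do not reflect the actual PROSITE file format. Precision is taken here as the fraction of hits with known status that are true positives, and recall as the fraction of known family members that are detected.

```python
from collections import Counter


def signature_quality(match_list):
    """Return (precision, recall) from a list of (accession, status) pairs.

    Entries with status 'unknown' are ignored in both measures.
    """
    counts = Counter(status for _, status in match_list)
    tp = counts["true_positive"]
    fp = counts["false_positive"]
    fn = counts["false_negative"]
    precision = tp / (tp + fp) if tp + fp else float("nan")
    recall = tp / (tp + fn) if tp + fn else float("nan")
    return precision, recall


# Hypothetical match list: 6 true positives, 2 false positives,
# 4 false negatives and 1 match of unknown status.
matches = (
    [(f"P0000{i}", "true_positive") for i in range(6)]
    + [("Q00001", "false_positive"), ("Q00002", "false_positive")]
    + [(f"O0000{i}", "false_negative") for i in range(4)]
    + [("A00001", "unknown")]
)

precision, recall = signature_quality(matches)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

For this made-up list the sketch reports a precision of 0.75 and a recall of 0.60.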
The PROSITE descriptors and documentation can also be accessed through InterPro, which largely exploits the detailed family annotation provided by PRINTS (7) and PROSITE. InterPro (8) provides an integrated view of several domain databases and offers a large choice of methods to identify conserved regions.
| IMPROVEMENT OF THE PROFILE METHOD |
|---|
Repeat
Proteins can contain a single copy of a particular domain, but in many cases two or more copies are present. The identification of some of these repetitive elements presents additional difficulties compared with the detection of autonomous domains, because they are generally short in size and highly divergent.
We have developed a new approach to increase the sensitivity of PROSITE profiles for repeats. Our method is based on the determination of a lower acceptance threshold to detect highly divergent repeats. The computed lower acceptance threshold is used to increase the sensitivity of repeat detection within proteins as well as for the characterization of new family members. Applied to 12 different families, the method allowed the detection of more than 5000 repeat units and 200 proteins in Swiss-Prot that were previously not recognized by PROSITE.
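One way such a lower acceptance threshold could be applied is sketched below. The promotion rule used here (sub-threshold matches are accepted only in proteins that already contain at least one match at or above the regular cut-off) is an assumption made for illustration, as are the function name and the scores. The text above does not spell out the exact rule.

```python
def select_repeat_matches(matches, cutoff, low_cutoff):
    """Keep repeat units of one protein using a two-level acceptance rule.

    matches: list of (start, score) tuples for a single protein.
    Assumed rule: matches scoring >= low_cutoff are accepted only when the
    protein has at least one match at or above the regular cutoff.
    """
    if not any(score >= cutoff for _, score in matches):
        return []
    return [(start, score) for start, score in matches if score >= low_cutoff]


# Hypothetical normalized scores for two proteins.
protein_a = [(10, 9.2), (55, 7.1), (101, 6.8), (150, 4.0)]
protein_b = [(12, 7.0), (60, 6.9)]

print(select_repeat_matches(protein_a, cutoff=8.5, low_cutoff=6.5))
print(select_repeat_matches(protein_b, cutoff=8.5, low_cutoff=6.5))
```

In this toy case the first protein yields three repeat units, two of which lie below the regular cut-off, while the second protein yields none because no match reaches the regular cut-off.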
Structural alignment
The sensitivity of a profile is strongly dependent on the quality of the starting sequence alignment. Usually ClustalW (9) or T-Coffee (10) is used to construct the MSA. However, when sequences are too divergent, it can be useful to integrate structural information in the MSA. Several of our profiles have been built by a mixture of classical alignment and structural alignment with the help of T-Coffee or by pure structural alignment provided by the DALI algorithm (11). These methods have been used for the construction of several profiles, e.g. the ABC transporter, the Ig-fold and the aminoacyl-transfer RNA synthetase class-II profiles. We have observed that structural information is often useful for very divergent domains or families, but that it is of little benefit for strongly conserved sequences.
Profile construction
To fill in our profile table from an MSA we generally use a symbol comparison table to convert a residue frequency distribution into weights, but in some particular cases a probabilistic model associated with a Dirichlet mixture can be more sensitive (12). For such an approach we use the HMMER package (13) to build the profile and convert it into a PROSITE-format profile with pftools (3). About 3% of our profiles have been built with this method.
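The conversion of a residue frequency distribution into weights by means of a symbol comparison table can be illustrated with a minimal sketch. For each alignment column, the weight of a residue is computed as the frequency-weighted average of its comparison scores against the residues observed in that column. The five-letter alphabet, the toy comparison table and the function names are hypothetical. Gaps, sequence weighting and gap costs are ignored, so this is not the procedure implemented in pftools.

```python
from collections import Counter

ALPHABET = "ILVDE"            # reduced alphabet, for illustration only
GROUPS = ("ILV", "DE")        # residues regarded as similar in the toy table


def toy_score(a, b):
    """Hypothetical comparison table: identity 4, same group 2, otherwise -2."""
    if a == b:
        return 4
    if any(a in group and b in group for group in GROUPS):
        return 2
    return -2


def build_profile(alignment):
    """Return one {residue: weight} dictionary per alignment column."""
    profile = []
    for column in zip(*alignment):
        counts = Counter(column)
        total = len(column)
        weights = {
            a: sum(counts[b] / total * toy_score(a, b) for b in counts)
            for a in ALPHABET
        }
        profile.append(weights)
    return profile


# Hypothetical ungapped alignment of four two-residue sequences.
alignment = ["ID", "LD", "VE", "ID"]
for position, weights in enumerate(build_profile(alignment)):
    print(position, weights)
```

In the first column, I receives the highest weight (3.0), L and V receive intermediate weights (2.5) and the acidic residues are penalized (-2.0), which reflects both the observed frequencies and the similarity encoded in the table.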
| NEW IMPLEMENTATION ON THE WEB PAGE |
|---|
Our website was redesigned to help users identify conserved regions in their own proteins. The user can now build their own pattern from an unaligned set of sequences using the PRATT algorithm (14). The pattern can then be scanned against the non-redundant database UniProt (Swiss-Prot + TrEMBL) (15). The search space can be reduced to a specific taxon. The matched sequences can be visualized as a shaded MSA, as a taxonomic tree or as a graphical view of the domain arrangement of the matched proteins. The user can also retrieve the full-length sequences in FASTA or Swiss-Prot format. The pattern can also be visualized on 3D structures if the selected database is PDB: the region matched by the pattern is highlighted and can thus easily be located on the structure (see Fig. 1). As patterns, unlike HMMs or profiles, do not produce scores, it is difficult to evaluate the significance of a match. To circumvent this problem we allow the user to scan randomized versions of the non-redundant databases. A scan against any of these databases gives a rough estimate of the number of matches produced by chance. We provide two methods to randomize databases. The first method, which simply reverses each sequence, is fast and efficient provided the pattern is not palindromic. For palindromic patterns the user must use a shuffled randomization mode in which windows of 20 amino acids are shuffled within the sequence (5).
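The two randomization modes can be sketched as follows. This is a minimal illustration under stated assumptions: the "shuffled" mode is interpreted as shuffling residues within consecutive, non-overlapping windows of 20 amino acids, the pattern is supplied as an ordinary regular expression, and the sequences and function names are hypothetical. It is not the code running on the PROSITE server.

```python
import random
import re


def reverse_db(sequences):
    """Randomize a database by reversing every sequence."""
    return [seq[::-1] for seq in sequences]


def window_shuffle_db(sequences, window=20, seed=0):
    """Randomize a database by shuffling residues within consecutive windows."""
    rng = random.Random(seed)
    shuffled = []
    for seq in sequences:
        chunks = []
        for start in range(0, len(seq), window):
            chunk = list(seq[start:start + window])
            rng.shuffle(chunk)           # local composition is preserved
            chunks.append("".join(chunk))
        shuffled.append("".join(chunks))
    return shuffled


def count_hits(regex, sequences):
    """Count non-overlapping matches of a regular expression in a database."""
    compiled = re.compile(regex)
    return sum(1 for seq in sequences for _ in compiled.finditer(seq))


database = ["MKCAACGT", "CHGCAL"]        # hypothetical toy database

# A non-palindromic pattern loses its matches on the reversed database ...
print(count_hits("CH", database), count_hits("CH", reverse_db(database)))
# ... whereas a palindromic pattern matches exactly as often as before.
print(count_hits("C.{2}C", database), count_hits("C.{2}C", reverse_db(database)))
# Window shuffling is the alternative for palindromic patterns.
print(count_hits("C.{2}C", window_shuffle_db(database)))
```

On this toy database the non-palindromic expression gives 1 hit against the original sequences and 0 against the reversed ones, while the palindromic expression gives 2 hits in both cases, which is why reversal says nothing about chance matches for such patterns.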
The web view of the PROSITE documentation also contains new information. When a 3D structure is described in the text, a direct link to a 3D image of the domain is provided. The Swiss-Prot match list of each signature can be visualized as a multiple alignment, or as a taxonomic distribution graph. For PROSITE profiles, a domain arrangement view is also provided where active sites and disulfide bridges annotated in Swiss-Prot entries are superimposed on PROSITE domains (see Fig. 2).
| HOW TO OBTAIN PROSITE |
|---|
PROSITE is freely available to academic users. As of release 16, the documentation entries are copyrighted. To obtain a licence, commercial users should contact the Swiss Institute of Bioinformatics by email (license{at}isb-sib.ch) or its commercial representative: Geneva Bioinformatics (GeneBio) SA, Case Postale 210, CH-1211 Geneva 12, Switzerland, phone: +41 22 702.99.00; fax: +41 22 702.99.99; email: info{at}genebio.com. Weekly updates of PROSITE are available on our FTP server: ftp://ftp.expasy.org/databases/prosite/release_with_updates/. PROSITE is also accessible from the Hits page (17): http://hits.isb-sib.ch/. Frame-tolerant scans can be performed at the following address (18): http://www.isrec.isb-sib.ch/software/PFRAMESCAN_form.html.
| ACKNOWLEDGEMENTS |
|---|
We wish to thank Tania Lima for the correction of the manuscript. PROSITE is supported by grant no. 3100-63879.00 from the Swiss National Science Foundation.
| REFERENCES |
|---|
1. Sigrist,C.J.A., Cerutti,L., Hulo,N., Gattiker,A., Falquet,L., Pagni,M., Bairoch,A. and Bucher,P. (2002) PROSITE: a documented database using patterns and profiles as motif descriptors. Brief. Bioinform., 3, 265–274.
2. Gribskov,M., Luthy,R. and Eisenberg,D. (1990) Profile analysis. Methods Enzymol., 183, 146–159.
3. Bucher,P., Karplus,K., Moeri,N. and Hofmann,K. (1996) A flexible motif search technique based on generalized profiles. Comput. Chem., 20, 3–23.
4. Hofmann,K. (2000) Sensitive protein comparisons with profiles and hidden Markov models. Brief. Bioinform., 1, 167–178.
5. Pagni,M. and Jongeneel,C.V. (2001) Making sense of score statistics for sequence alignments. Brief. Bioinform., 2, 51–67.
6. Gattiker,A., Gasteiger,E. and Bairoch,A. (2002) ScanProsite: a reference implementation of a PROSITE scanning tool. Appl. Bioinform., 1, 107–108.
7. Attwood,T.K., Bradley,P., Flower,D.R., Gaulton,A., Maudling,N., Mitchell,A.L., Moulton,G., Nordle,A., Paine,K., Taylor,P. et al. (2003) PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Res., 31, 400–402.
8. Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. (2003) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., 31, 315–318.
9. Thompson,J.D., Higgins,D.G. and Gibson,T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 4673–4680.
10. Notredame,C., Higgins,D.G. and Heringa,J. (2000) T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol., 302, 205–217.
11. Holm,L. and Sander,C. (1993) Protein structure comparison by alignment of distance matrices. J. Mol. Biol., 233, 123–138.
12. Sjolander,K., Karplus,K., Brown,M., Hughey,R., Krogh,A., Mian,I.S. and Haussler,D. (1996) Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput. Appl. Biosci., 12, 327–345.
13. Eddy,S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755–763.
14. Jonassen,I. (1997) Efficient discovery of conserved patterns using a pattern graph. Comput. Appl. Biosci., 13, 509–522.
15. Apweiler,R., Bairoch,A., Wu,C.H., Barker,W.C., Boeckmann,B., Ferro,S., Gasteiger,E., Huang,H., Lopez,R., Magrane,M. et al. (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res., 32, D115–D119.
16. Sayle,R.A. and Milner-White,E.J. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374.
17. Pagni,M., Iseli,C., Junier,T., Falquet,L., Jongeneel,V. and Bucher,P. (2001) trEST, trGEN and Hits: access to databases of predicted protein sequences. Nucleic Acids Res., 29, 148–151.
18. Falquet,L., Pagni,M., Bucher,P., Hulo,N., Sigrist,C.J.A., Hofmann,K. and Bairoch,A. (2002) The PROSITE database, its status in 2002. Nucleic Acids Res., 30, 235–238.
||||
![]() |
T. Meinel, A. Krause, H. Luz, M. Vingron, and E. Staub The SYSTERS Protein Family Database in 2005 Nucleic Acids Res., January 1, 2005; 33(suppl_1): D226 - D229. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Laskowski, V. V. Chistyakov, and J. M. Thornton PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids Nucleic Acids Res., January 1, 2005; 33(suppl_1): D266 - D268. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Stothard, G. Van Domselaar, S. Shrivastava, A. Guo, B. O'Neill, J. Cruz, M. Ellison, and D. S. Wishart BacMap: an interactive picture atlas of annotated bacterial genomes Nucleic Acids Res., January 1, 2005; 33(suppl_1): D317 - D320. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-C. Kao, Y.-R. Chen, E. C. Yi, H. Lee, Q. Tian, K.-M. Wu, S.-F. Tsai, S. S.-F. Yu, Y.-J. Chen, R. Aebersold, et al. Quantitative Proteomic Analysis of Metabolic Regulation by Copper Ions in Methylococcus capsulatus (Bath) J. Biol. Chem., December 3, 2004; 279(49): 51554 - 51560. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Rhodes, J. Parkhill, C. Bird, K. Ambrose, M. C. Jones, G. Huys, J. Swings, and R. W. Pickup Complete Nucleotide Sequence of the Conjugative Tetracycline Resistance Plasmid pFBAOT6, a Member of a Group of IncU Plasmids with Global Ubiquity Appl. Envir. Microbiol., December 1, 2004; 70(12): 7497 - 7510. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Galperin and E. V. Koonin 'Conserved hypothetical' proteins: prioritization of targets for experimental study Nucleic Acids Res., October 12, 2004; 32(18): 5452 - 5463. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Harte, V. Silventoinen, E. Quevillon, S. Robinson, K. Kallio, X. Fustero, P. Patel, P. Jokinen, and R. Lopez Public web-based services from the European Bioinformatics Institute Nucleic Acids Res., July 1, 2004; 32(suppl_2): W3 - W9. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rigali, M. Schlicht, P. Hoskisson, H. Nothaft, M. Merzbacher, B. Joris, and F. Titgemeyer Extending the classification of bacterial transcription factors beyond the helix-turn-helix motif as an alternative approach to discover new cis/trans relationships Nucleic Acids Res., June 24, 2004; 32(11): 3418 - 3426. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Apweiler, A. Bairoch, C. H. Wu, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, et al. UniProt: the Universal Protein knowledgebase Nucleic Acids Res., January 1, 2004; 32(90001): D115 - 119. [Abstract] [Full Text] [PDF] |
||||