Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (122K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hulo, N.
Right arrow Articles by Bairoch, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hulo, N.
Right arrow Articles by Bairoch, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2004, Vol. 32, Database issue D134-D137
© 2004 Oxford University Press

Recent improvements to the PROSITE database

Nicolas Hulo*, Christian J. A. Sigrist, Virginie Le Saux, Petra S. Langendijk-Genevaux, Lorenza Bordoli1, Alexandre Gattiker, Edouard De Castro, Philipp Bucher2 and Amos Bairoch

Swiss Institute of Bioinformatics (SIB), CMU, University of Geneva, 1 rue Michel Servet, 1211 Geneva 4, Switzerland, 1 Swiss Institute of Bioinformatics (SIB), Biozentrum, University of Basel, Klingelbergstrasse 50–70, CH-4056 Basel, Switzerland and 2 Swiss Institute of Bioinformatics (SIB), Swiss Institute for Experimental Cancer Research (ISREC), CH-1066 Epalinges/Lausanne, Switzerland

*To whom correspondence should be addressed. Tel: +41 21 379 58 72; Fax +41 21 379 58 58; Email: Nicolas.Hulo{at}isb-sib.ch

Received September 15, 2003; Revised and Accepted September 22, 2003


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
The PROSITE database consists of a large collection of biologically meaningful signatures that are described as patterns or profiles. Each signature is linked to documentation that provides useful biological information on the protein family, domain or functional site identified by the signature. The PROSITE web page has been redesigned and several tools have been implemented to help the user discover new conserved regions in their own proteins and to visualize domain arrangements. We also introduced the facility to search PDB with a PROSITE entry or a user’s pattern and visualize matched positions on 3D structures. The latest version of PROSITE (release 18.17 of November 30, 2003) contains 1676 entries. The database is accessible at http://www.expasy.org/prosite/.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
A popular way to identify similarity between proteins is to perform a pairwise alignment. When the identity is >40% this method gives good results. However, the weakness of the pairwise alignment is that no distinction is made between an amino acid at a crucial position (like an active site) and an amino acid with no critical role. A multiple sequence alignment (MSA) gives a more general view of a conserved region by providing a better picture of the most conserved residues, which are usually essential for the protein function. The various amino acids can then be weighed according to their degree of conservation. Several databases have developed their own methods (descriptors) based on MSA in order to identify conserved regions. A search performed on these databases is generally more sensitive than a pairwise alignment and can help identify very remote similarity (<20%).

The PROSITE database uses two kinds of descriptor to identify conserved regions, patterns and generalized profiles, which each have their own strengths and weaknesses defining their area of optimum application (1).

(i) A pattern or regular expression is a quantitative descriptor: it either matches or does not. Therefore a good pattern is usually located in a short well-conserved region. Such regions are typically enzyme catalytic sites, prosthetic group attachment sites (haem, pyridoxal phosphate, biotin, etc.), metal ion binding amino acids, cysteines involved in disulfide bonds or regions involved in binding a molecule. Even though the scope of a regular expression is limited to these particular biological regions, patterns are still very popular because of their intelligibility for users.

(ii) A profile is a table of position-specific amino acid weights and gap costs. Various methods can be used to fill a profile table from a multiple alignment. Most frequently, a substitution matrix is used to convert a residue frequency distribution into weights, but alternative methods can be applied including structure-based approaches and methods involving hidden Markov modelling (24). These weights (also referred to as scores) are used to calculate a similarity score for any alignment between a profile and a sequence, or part of a profile and a sequence. An alignment with a similarity score higher than or equal to a given threshold value constitutes a motif occurrence. This threshold is estimated by calibrating the profile against a randomized protein database. The normalization procedure used for PROSITE profiles makes the normalized scores independent of the database size, allowing the comparison of scores from different searches (5). The quantitative behaviour of a profile allows the acceptance of a mismatch at a highly conserved position if the rest of the sequence displays a sufficiently high level of similarity and therefore allows the detection of poorly conserved domains such as immunoglobulin, SH2 or SH3. Another advantage of profiles over patterns is that they are not confined to small regions with high sequence similarity. Rather, they attempt to characterize a protein family or domain over its entire length.


    PROSITE ANNOTATION AND QUALITY CONTROL
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
Each PROSITE signature is linked to an annotation document where the user can find information on the protein family or domain detected by the signature: origin of its name, taxonomic occurrence, domain architecture, function, 3D structure, main characteristics of the sequence and some references. Recently, for families or domains whose structure is known, a direct link to a representative PDB entry is provided in the documentation, in order to make the description of the 3D structure more comprehensible. All the biological information about a protein family or domain should also be used to evaluate the pertinence of matches with patterns and profiles. If the user has some information about their sequence that is inconsistent with the description of the motif detected, the match should be considered with caution.

The annotation document also contains direct information about the motif descriptors: for patterns, amino acid residues involved in the catalytic mechanism, metal ion or substrate binding, or conserved post-translational modifications are indicated. For profiles, it is stated whether they cover the entire domain or protein or only part of it. Finally, the sensitivity and specificity of the motif is also indicated, as well as an expert to contact, if any.

Biologically meaningful information on specific amino acids can also be found at the CC /SITE line in signature entries. This qualifier is used to indicate the position of an ‘interesting’ site in a pattern or a profile. For example, if a pattern includes an active site residue, the /SITE qualifier is used to indicate the position of that residue in the pattern. Binding sites and disulfide bridges are also indicated. The ps_scan program, the reference tool to scan PROSITE (6), is able to highlight these positions in a matched region.

A match list of Swiss-Prot entries identified by the signature is also provided. Each protein entering Swiss-Prot is checked for the occurrence of PROSITE patterns or profiles and a match status is assigned (‘true’ or ‘false positive’ or ‘unknown’). Proteins that are known to contain the domain but not identified by the signature are also added to the list with the status ‘false negative’. Because this match list has been verified manually, it can be used to evaluate the specificity of a given signature. This tight connection with Swiss-Prot also benefits the Swiss-Prot annotation. Some particular Swiss-Prot lines, which refer to the domain organization in the protein, are automatically annotated with PROSITE profiles.

The PROSITE descriptors and documentation can also be accessed through InterPro, which largely exploits the detailed family annotation provided by PRINTS (7) and PROSITE. InterPro (8) provides an integrated view of several domain databases and offers a large choice of methods to identify conserved regions.


    IMPROVEMENT OF THE PROFILE METHOD
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
Repeat
Proteins can contain a single copy of a particular domain, but in many cases two or more copies are present. The identification of some of these repetitive elements presents additional difficulties compared with the detection of autonomous domains, because they are generally short in size and highly divergent.

We have developed a new approach to increase the sensitivity of PROSITE profiles for repeats. Our method is based on the determination of a lower acceptance threshold to detect highly divergent repeats. The computed lower acceptance threshold is used to increase the sensitivity of repeat detection within proteins as well as for the characterization of new family members. The method applied to 12 different families allowed the detection of more than 5000 repeat units and 200 proteins in Swiss-Prot previously not recognized by PROSITE.

Structural alignment
The sensitivity of a profile is strongly dependent on the quality of the starting sequence alignment. Usually ClustalW (9) or T-Coffee (10) are used to construct the MSA. But when sequences are too divergent it can be useful to integrate structural information in the MSA. Several of our profiles have been built by a mixture of classical alignment and structural alignment with the help of T-Coffee or by pure structural alignment provided by the DALI algorithm (11). These methods have been used for the construction of several profiles, e.g. the ABC transporter, the Ig-fold and the aminoacyl-transfer RNA synthetase class-II profiles. We have observed that structural information is often useful for very divergent domains or families, but that it is of small benefit for strongly conserved sequences.

Profile construction
To fill in our profile table from a MSA we generally use a symbol comparison table to convert a residue frequency distribution into weights, but in some particular cases a probabilistic model associated with a Dirichlet mixture can be more sensitive (12). For such an approach we use the HMMER package (13) to build the profile and convert it into PROSITE format profile with pftools (3). About 3% of our profiles have been built with this method.


    NEW IMPLEMENTATION ON THE WEB PAGE
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
Our website was redesigned to help the user identify conserved regions in their own protein. The user can now build their own pattern from an unaligned set of sequences using the PRATT algorithm (14). The pattern can then be scanned on the non-redundant database UniProt (Swiss-Prot + TrEMBL) (15). The search space can be reduced to a specific taxon. The matched sequences can be visualized as a shaded MSA, as a taxonomic tree or as a graphical view of the domain arrangement of the matched proteins. The user can also retrieve the full-length sequences in FASTA or Swiss-Prot format. The pattern can also be visualized on 3D structures if the selected database is PDB: the region matched by the pattern is highlighted and can thus easily be located on the structure (see Fig. 1). As patterns do not produce scores, as do HMMs or profiles, it is difficult to evaluate the significance of a match. To circumvent this problem we allow the user to randomize non-redundant databases. A scan against any of these databases will give a raw estimate of the amount of matches produced by chance. We provide two methods to randomize databases. The first method, which simply reverses the order of sequences, is fast and efficient if the pattern is not palindromic. For this type of regular expression the user must use a shuffled randomization mode where windows of 20 amino acids are shuffled in the sequence (5).



View larger version (45K):
[in this window]
[in a new window]
 
Figure 1. A search on the PDB database with the PROSITE pattern PS00107 (directed against the ATP binding region of the kinase domain) was performed on the ScanProsite page. The pattern identified 221 matches. The 1CTP [PDB] entry was selected to visualize the position of the pattern on the 3D structure. The ATP binding region is highlighted in red. The ScanProsite page uses RasMol (16) scripts to produce images. An interactive view with the Chime program is also provided.

 
The webview of the PROSITE documentation also contains new information. When a 3D structure is described in the text, a direct link to a 3D image of the domain is provided. The Swiss-Prot match list of each signature can be visualized as a multiple alignment, or as a taxonomic distribution graph. For PROSITE profiles, a domain arrangement view is also provided where active sites and disulfide bridges annotated in Swiss-Prot entries are superimposed on PROSITE domains (see Fig. 2).



View larger version (13K):
[in this window]
[in a new window]
 
Figure 2. Five proteins have been extracted from the domain view of the trypsin profile (PS50240) match list. Disulfide bridges are represented as red inverted hooks and active sites as red diamonds. The labeling of the active site residues allows the rapid detection of domains that may have lost their enzymatic activity. The fifth example is the human haptoglobin, a clearly related serine protease but with no enzymatic activity.

 

    HOW TO OBTAIN PROSITE
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
PROSITE is freely available to academic users. As of release 16, the documentation entries are copyright. To obtain a licence, commercial users should contact The Swiss Institute of Bioinformatics by email: license{at}isb-sib.ch or its commercial representative: Geneva Bioinformatics (GeneBio) SA, Case Postale 210, CH-1211 Geneva 12, Switzerland, phone: +41 22 702.99.00; fax: +41 22 702.99.99; email: info{at}genebio.com Weekly updates of PROSITE are available on our FTP server: ftp://ftp.expasy.org/databases/prosite/release_ with_updates/. PROSITE is also accessible from the Hits page (17): http://hits.isb-sib.ch/. Frame-tolerant scans can be performed at the following address (18): http://www.isrec. isb-sib.ch/software/PFRAMESCAN_form.html.


    ACKNOWLEDGEMENTS
 
We wish to thank Tania Lima for the correction of the manuscript. PROSITE is supported by grant no. 3100-63879.00 from the Swiss National Science Foundation.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 PROSITE ANNOTATION AND QUALITY...
 IMPROVEMENT OF THE PROFILE...
 NEW IMPLEMENTATION ON THE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 

  1. Sigrist,C.J.A., Cerutti,L., Hulo,N., Gattiker,A., Falquet,L., Pagni,M., Bairoch,A. and Bucher,P. (2002) PROSITE: a documented database using patterns and profiles as motif descriptors. Brief. Bioinform., 3, 265–274.[Abstract/Free Full Text]

  2. Gribskov,M., Luthy,R. and Eisenberg,D. (1990) Profile analysis. Methods Enzymol., 183, 146–159.[Web of Science][Medline]

  3. Bucher,P., Karplus,K., Moeri,N. and Hofmann,K. (1996) A flexible motif search technique based on generalized profiles. Comput. Chem., 20, 3–23.[CrossRef][Web of Science][Medline]

  4. Hofmann,K. (2000) Sensitive protein comparisons with profiles and hidden Markov models. Brief. Bioinform., 1, 167–178.[Abstract/Free Full Text]

  5. Pagni,M. and Jongeneel,C.V. (2001) Making sense of score statistics for sequence alignments. Brief. Bioinform., 2, 51–67.[Abstract/Free Full Text]

  6. Gattiker,A., Gasteiger,E. and Bairoch,A. (2002) ScanProsite: a reference implementation of a PROSITE scanning tool. Appl. Bioinform., 1, 107–108.

  7. Attwood,T.K., Bradley,P., Flower,D.R., Gaulton,A., Maudling,N., Mitchell,A.L., Moulton,G., Nordle,A., Paine,K., Taylor,P. et al. (2003) PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Res., 31, 400–402.[Abstract/Free Full Text]

  8. Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. (2003) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., 31, 315–318.[Abstract/Free Full Text]

  9. Thompson,J.D., Higgins,D.G. and Gibson,T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 4673–4680.[Abstract/Free Full Text]

  10. Notredame,C., Higgins,D.G. and Heringa,J. (2000) T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol., 302, 205–217.[CrossRef][Web of Science][Medline]

  11. Holm,L. and Sander,C. (1993) Protein structure comparison by alignment of distance matrices. J. Mol. Biol., 233, 123–138.[CrossRef][Web of Science][Medline]

  12. Sjolander,K., Karplus,K., Brown,M., Hughey,R., Krogh,A., Mian,I.S. and Haussler,D. (1996) Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput. Appl. Biosci., 12, 327–345.[Abstract/Free Full Text]

  13. Eddy,S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755–763.[Abstract/Free Full Text]

  14. Jonassen,I. (1997) Efficient discovery of conserved patterns using a pattern graph. Comput. Appl. Biosci., 13, 509–522.[Abstract/Free Full Text]

  15. Apweiler,R. Bairoch,A., Wu,C.H., Barker,W.C., Boeckmann,B., Ferro,S., Gasteiger,E., Huang,H., Lopez,R., Magrane,M. et al. (2004) UniProt: the Universal Protein knowledgebase. Nucleic Acids Res., 32, D115–D119.[Abstract/Free Full Text]

  16. Sayle,R.A. and Milner-White,E.J. (1995) RASMOL: biomolecular graphics for all. Trends Biochem. Sci., 20, 374.[CrossRef][Web of Science][Medline]

  17. Pagni,M., Iseli,C., Junier,T., Falquet,L., Jongeneel,V. and Bucher,P. (2001) trEST, trGEN and Hits: access to databases of predicted protein sequences. Nucleic Acids Res., 29, 148–151.[Abstract/Free Full Text]

  18. Falquet,L., Pagni,M., Bucher,P., Hulo,N., Sigrist,C.J.A., Hofmann,K. and Bairoch,A. (2002) The PROSITE database, its status in 2002. Nucleic Acids Res., 30, 235–238.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Immunol.Home page
T. Wang, S. Bird, A. Koussounadis, J. W. Holland, A. Carrington, J. Zou, and C. J. Secombes
Identification of a Novel IL-1 Cytokine Family Member in Teleost Fish
J. Immunol., July 15, 2009; 183(2): 962 - 974.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. Letek, A. A. Ocampo-Sosa, M. Sanders, U. Fogarty, T. Buckley, D. P. Leadon, P. Gonzalez, M. Scortti, W. G. Meijer, J. Parkhill, et al.
Evolution of the Rhodococcus equi vap Pathogenicity Island Seen through Comparison of Host-Associated vapA and vapB Virulence Plasmids
J. Bacteriol., September 1, 2008; 190(17): 5797 - 5805.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Spatuzza, M. Schiavone, E. Di Salle, E. Janda, M. Sardiello, G. Fiume, O. Fierro, M. Simonetta, N. Argiriou, R. Faraonio, et al.
Physical and functional characterization of the genetic locus of IBtk, an inhibitor of Bruton's tyrosine kinase: evidence for three protein isoforms of IBtk
Nucleic Acids Res., August 1, 2008; 36(13): 4402 - 4416.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
C. Ansong, S. O. Purvine, J. N. Adkins, M. S. Lipton, and R. D. Smith
Proteogenomics: needs and roles to be filled by proteomics in genome annotation
Brief Funct Genomic Proteomic, March 10, 2008; (2008) eln010v1.
[Abstract] [Full Text] [PDF]


Home page
Plant Cell PhysiolHome page
E. Mishima, A. Hosokawa, H. Imaizumi-Anraku, K. Saito, M. Kawaguchi, and K. Saeki
Requirement for Mesorhizobium loti Ornithine Transcarbamoylase for Successful Symbiosis with Lotus japonicus as Revealed by an Unexpected Long-Range Genome Deletion
Plant Cell Physiol., March 1, 2008; 49(3): 301 - 313.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. Brilli, R. Fani, and P. Lio
Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes
Brief Bioinform, January 1, 2008; 9(1): 34 - 45.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
G.A. Loset, E. Lunde, B. Bogen, O.H. Brekke, and I. Sandlie
Functional phage display of two murine {alpha}/{beta} T-cell receptors is strongly dependent on fusion format, mode and periplasmic folding assistance
Protein Eng. Des. Sel., October 9, 2007; (2007) gzm044v1.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
S. V. Balasingham, R. F. Collins, R. Assalkhou, H. Homberset, S. A. Frye, J. P. Derrick, and T. Tonjum
Interactions between the Lipoprotein PilP and the Secretin PilQ in Neisseria meningitidis
J. Bacteriol., August 1, 2007; 189(15): 5716 - 5727.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
L. Neff, S. Daher, P. Muzzin, U. Spenato, F. Gulacar, C. Gabay, and S. Bas
Molecular Characterization and Subcellular Localization of Macrophage Infectivity Potentiator, a Chlamydia trachomatis Lipoprotein
J. Bacteriol., July 1, 2007; 189(13): 4739 - 4748.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Gewehr, V. Hintermair, and R. Zimmer
AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings
Bioinformatics, May 15, 2007; 23(10): 1203 - 1210.
[Abstract] [Full Text] [PDF]


Home page
J. Exp. Biol.Home page
K. A. Suprenant, N. Bloom, J. Fang, and G. Lushington
The major vault protein is related to the toxic anion resistance protein (TelA) family
J. Exp. Biol., March 15, 2007; 210(6): 946 - 955.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
H. Sletvold, P. J. Johnsen, G. S. Simonsen, B. Aasnaes, A. Sundsfjord, and K. M. Nielsen
Comparative DNA Analysis of Two vanA Plasmids from Enterococcus faecium Strains Isolated from Poultry and a Poultry Farmer in Norway
Antimicrob. Agents Chemother., February 1, 2007; 51(2): 736 - 739.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
E. R. Goncalves, H. Hara, D. Miyazawa, J. E. Davies, L. D. Eltis, and W. W. Mohn
Transcriptomic Assessment of Isozymes in the Biphenyl Pathway of Rhodococcus sp. Strain RHA1
Appl. Envir. Microbiol., September 1, 2006; 72(9): 6183 - 6193.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. Friedberg
Automated protein function prediction--the genomic challenge
Brief Bioinform, September 1, 2006; 7(3): 225 - 242.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
M. Terribilini, J.-H. Lee, C. Yan, R. L. Jernigan, V. Honavar, and D. Dobbs
Prediction of RNA binding sites in proteins from amino acid sequence
RNA, August 1, 2006; 12(8): 1450 - 1462.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al.
AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system
Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
S. R. Escasa, H. A. M. Lauzon, A. C. Mathur, P. J. Krell, and B. M. Arif
Sequence analysis of the Choristoneura occidentalis granulovirus genome
J. Gen. Virol., July 1, 2006; 87(7): 1917 - 1933.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J.-H. Hung, H.-D. Huang, and T.-Y. Lee
ProKware: integrated software for presenting protein structural properties in protein tertiary structures.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W89 - W94.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Neduva and R. B. Russell
DILIMOT: discovery of linear motifs in proteins.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W350 - W355.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. de Castro, C. J. A. Sigrist, A. Gattiker, V. Bulliard, P. S. Langendijk-Genevaux, E. Gasteiger, A. Bairoch, and N. Hulo
ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W362 - W365.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
X. Sun, T. Morozova, and M. Sonnenfeld
Glial and Neuronal Functions of the Drosophila Homolog of the Human SWI/SNF Gene ATR-X (DATR-X) and the jing Zinc-Finger Gene Specify the Lateral Positioning of Longitudinal Glia and Axons
Genetics, July 1, 2006; 173(3): 1397 - 1415.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
S. Murray, C. L. Nilsson, J. T. Hare, M. R. Emmett, A. Korostelev, H. Ongley, A. G. Marshall, and M. S. Chapman
Characterization of the Capsid Protein Glycosylation of Adeno-Associated Virus Type 2 by High-Resolution Mass Spectrometry.
J. Virol., June 1, 2006; 80(12): 6171 - 6176.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Moreau, C. Granier, S. Villard, D. Laune, and F. Molina
Discontinuous epitope prediction based on mimotope analysis
Bioinformatics, May 1, 2006; 22(9): 1088 - 1095.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
A. Raichaudhuri, R. Bhattacharyya, S. Chaudhuri, P. Chakrabarti, and M. DasGupta
Domain Analysis of a Groundnut Calcium-dependent Protein Kinase: NUCLEAR LOCALIZATION SEQUENCE IN THE JUNCTION DOMAIN IS COUPLED WITH NONCONSENSUS CALCIUM BINDING DOMAINS
J. Biol. Chem., April 14, 2006; 281(15): 10399 - 10409.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. M. Babu, M. L. Priya, A. T. Selvan, M. Madera, J. Gough, L. Aravind, and K. Sankaran
A Database of Bacterial Lipoproteins (DOLOP) with Functional Assignments to Predicted Lipoproteins.
J. Bacteriol., April 1, 2006; 188(8): 2761 - 2773.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. M. Short and T. C. Cox
Subclassification of the RBCC/TRIM Superfamily Reveals a Novel Motif Necessary for Microtubule Binding
J. Biol. Chem., March 31, 2006; 281(13): 8970 - 8980.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
A. Wietzorrek, H. Schwarz, C. Herrmann, and V. Braun
The Genome of the Novel Phage Rtp, with a Rosette-Like Tail Tip, Is Homologous to the Genome of Phage T1
J. Bacteriol., February 15, 2006; 188(4): 1419 - 1436.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Titz, S. Thomas, S. V. Rajagopala, T. Chiba, T. Ito, and P. Uetz
Transcriptional activators in yeast
Nucleic Acids Res., February 7, 2006; 34(3): 955 - 967.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. V. V. Deevi and A. C. R. Martin
An extensible automated protein annotation tool: standardizing input and output using validated XML
Bioinformatics, February 1, 2006; 22(3): 291 - 296.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Fernandez-Fuentes, B. Oliva, and A. Fiser
A supersecondary structure library and search algorithm for modeling loops in protein structures.
Nucleic Acids Res., January 1, 2006; 34(7): 2085 - 2097.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
C. D'Ambrosio, S. Arena, G. Fulcoli, M. H. Scheinfeld, D. Zhou, L. D'Adamio, and A. Scaloni
Hyperphosphorylation of JNK-interacting Protein 1, a Protein Associated with Alzheimer Disease
Mol. Cell. Proteomics, January 1, 2006; 5(1): 97 - 113.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Matys, O. V. Kel-Margoulis, E. Fricke, I. Liebich, S. Land, A. Barre-Dirrie, I. Reuter, D. Chekmenev, M. Krull, K. Hornischer, et al.
TRANSFAC(R) and its module TRANSCompel(R): transcriptional gene regulation in eukaryotes
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D108 - D110.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. D. Finn, J. Mistry, B. Schuster-Bockler, S. Griffiths-Jones, V. Hollich, T. Lassmann, S. Moxon, M. Marshall, A. Khanna, R. Durbin, et al.
Pfam: clans, web tools and services
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D247 - D251.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Tripathy, V. N. Pandey, B. Fang, F. Salas, and B. M. Tyler
VMD: a community annotation database for oomycetes and microbial genomes
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D379 - D381.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Hartmann, D. Lu, J. Phillips, and T. J. Vision
Phytome: a platform for plant comparative genomics
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D724 - D730.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
W. Zhao, P. Canaran, R. Jurkuta, T. Fulton, J. Glaubitz, E. Buckler, J. Doebley, B. Gaut, M. Goodman, J. Holland, et al.
Panzea: a database and resource for molecular and functional diversity in the maize genome
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D752 - D757.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Maiorino, A. Roveri, L. Benazzi, V. Bosello, P. Mauri, S. Toppo, S. C. E. Tosatto, and F. Ursini
Functional Interaction of Phospholipid Hydroperoxide Glutathione Peroxidase with Sperm Mitochondrion-associated Cysteine-rich Protein Discloses the Adjacent Cysteine Motif as a New Substrate of the Selenoperoxidase
J. Biol. Chem., November 18, 2005; 280(46): 38395 - 38402.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. J. A. Sigrist, E. De Castro, P. S. Langendijk-Genevaux, V. Le Saux, A. Bairoch, and N. Hulo
ProRule: a new database containing functional and structural information on PROSITE profiles
Bioinformatics, November 1, 2005; 21(21): 4060 - 4066.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
C.-Y. Hung, K. R. Seshan, J.-J. Yu, R. Schaller, J. Xue, V. Basrur, M. J. Gardner, and G. T. Cole
A Metalloproteinase of Coccidioides posadasii Contributes to Evasion of Host Detection
Infect. Immun., October 1, 2005; 73(10): 6689 - 6703.
[Abstract] [Full Text] [PDF]


Home page
Infect. Immun.Home page
D. F. Schuijffel, P. C. M. van Empel, A. M. M. A. Pennings, J. P. M. van Putten, and P. J. M. Nuijten
Successful Selection of Cross-Protective Vaccine Candidates for Ornithobacterium rhinotracheale Infection
Infect. Immun., October 1, 2005; 73(10): 6812 - 6821.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
C.-W. Chang, H.-C. Chuang, C. Yu, T.-P. Yao, and H. Chen
Stimulation of GCMa Transcriptional Activity by Cyclic AMP/Protein Kinase A Signaling Is Attributed to CBP-Mediated Acetylation of GCMa
Mol. Cell. Biol., October 1, 2005; 25(19): 8401 - 8414.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Falb, F. Pfeiffer, P. Palm, K. Rodewald, V. Hickmann, J. Tittor, and D. Oesterhelt
Living with two extremes: Conclusions from the genome sequence of Natronomonas pharaonis
Genome Res., October 1, 2005; 15(10): 1336 - 1343.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Romualdi, R. Siddiqui, G. Glockner, R. Lehmann, and J. Suhnel
GenColors: accelerated comparative analysis and annotation of prokaryotic genomes at various stages of completeness
Bioinformatics, September 15, 2005; 21(18): 3669 - 3671.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. E. Donald and E. I. Shakhnovich
Predicting specificity-determining residues in two large eukaryotic transcription factor families
Nucleic Acids Res., August 5, 2005; 33(14): 4455 - 4465.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
T. Hoang, W.-T. Peng, E. Vanrobays, N. Krogan, S. Hiley, A. L. Beyer, Y. N. Osheim, J. Greenblatt, T. R. Hughes, and D. L. J. Lafontaine
Esf2p, a U3-Associated Factor Required for Small-Subunit Processome Assembly and Compaction
Mol. Cell. Biol., July 1, 2005; 25(13): 5523 - 5534.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. J. Alm, K. H. Huang, M. N. Price, R. P. Koche, K. Keller, I. L. Dubchak, and A. P. Arkin
The MicrobesOnline Web site for comparative genomics
Genome Res., July 1, 2005; 15(7): 1015 - 1022.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Quevillon, V. Silventoinen, S. Pillai, N. Harte, N. Mulder, R. Apweiler, and R. Lopez
InterProScan: protein domains identifier
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W116 - W120.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Ausiello, A. Zanzoni, D. Peluso, A. Via, and M. Helmer-Citterich
pdbFun: mass selection and fast comparison of annotated PDB residues
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W133 - W137.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Gutman, C. Berezin, R. Wollman, Y. Rosenberg, and N. Ben-Tal
QuasiMotiFinder: protein annotation by searching for evolutionarily conserved motif-like patterns
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W255 - W261.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. H. Van Domselaar, P. Stothard, S. Shrivastava, J. A. Cruz, A. Guo, X. Dong, P. Lu, D. Szafron, R. Greiner, and D. S. Wishart
BASys: a web server for automated bacterial genome annotation
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W455 - W459.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Wang and R. Samudrala
FSSA: a novel method for identifying functional signatures from structural alignments
Bioinformatics, July 1, 2005; 21(13): 2969 - 2977.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Donald and E. I. Shakhnovich
Determining functional specificity from protein sequences
Bioinformatics, June 1, 2005; 21(11): 2629 - 2635.
[Abstract] [Full Text] [PDF]


Home page
Mol. Biol. CellHome page
C. P. Chia, S. Gomathinayagam, R. J. Schmaltz, and L. K. Smoyer
Glycoprotein gp130 of Dictyostelium discoideum Influences Macropinocytosis and Adhesion
Mol. Biol. Cell, June 1, 2005; 16(6): 2681 - 2693.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
D. S. Tuckwell, M. J. Nicholson, C. S. McSweeney, M. K. Theodorou, and J. L. Brookman
The rapid assignment of ruminal fungi to presumptive genera using ITS1 and ITS2 RNA secondary structures to produce group-specific fingerprints
Microbiology, May 1, 2005; 151(5): 1557 - 1567.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Coeytaux and A. Poupon
Prediction of unfolded segments in a protein sequence based on amino acid composition
Bioinformatics, May 1, 2005; 21(9): 1891 - 1900.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
M. Schneider, A. Bairoch, C. H. Wu, and R. Apweiler
Plant Protein Annotation in the UniProt Knowledgebase
Plant Physiology, May 1, 2005; 138(1): 59 - 66.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Weiner 3rd, G. Thomas, and E. Bornberg-Bauer
Rapid motif-based prediction of circular permutations in multi-domain proteins
Bioinformatics, April 1, 2005; 21(7): 932 - 937.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. A. Rahman, P. Advani, R. Schunk, R. Schrader, and D. Schomburg
Metabolic pathway analysis web service (Pathway Hunter Tool at CUBIC)
Bioinformatics, April 1, 2005; 21(7): 1189 - 1193.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Hou, S.-R. Jun, C. Zhang, and S.-H. Kim
From The Cover: Global mapping of the protein structure space and application in structure-based inference of protein function
PNAS, March 8, 2005; 102(10): 3651 - 3656.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
T. P. Stinear, M. J. Pryor, J. L. Porter, and S. T. Cole
Functional analysis and annotation of the virulence plasmid pMUM001 from Mycobacterium ulcerans
Microbiology, March 1, 2005; 151(3): 683 - 692.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. L. Gardy, M. R. Laird, F. Chen, S. Rey, C. J. Walsh, M. Ester, and F. S. L. Brinkman
PSORTb v.2.0: Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis
Bioinformatics, March 1, 2005; 21(5): 617 - 623.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
C. C. Carles, D. Choffnes-Inada, K. Reville, K. Lertpiriyapong, and J. C. Fletcher
ULTRAPETALA1 encodes a SAND domain putative transcriptional regulator that controls shoot and floral meristem activity in Arabidopsis
Development, March 1, 2005; 132(5): 897 - 911.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. H. Haft, J. D. Selengut, L. M. Brinkac, N. Zafar, and O. White
Genome Properties: a system for the investigation of prokaryotic genetic content for microbiology, genome annotation and comparative genomics
Bioinformatics, February 1, 2005; 21(3): 293 - 306.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Bairoch, R. Apweiler, C. H. Wu, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, et al.
The Universal Protein Resource (UniProt)
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D154 - D159.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Heger, C. A. Wilton, A. Sivakumar, and L. Holm
ADDA: a domain database with global coverage of the protein universe
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D188 - D191.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Abhiman and E. L. L. Sonnhammer
FunShift: a database of function shift analysis on protein subfamilies
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D197 - D200.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Bru, E. Courcelle, S. Carrere, Y. Beausse, S. Dalmar, and D. Kahn
The ProDom database of protein domain families: more emphasis on 3D
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D212 - D215.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Meinel, A. Krause, H. Luz, M. Vingron, and E. Staub
The SYSTERS Protein Family Database in 2005
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D226 - D229.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. A. Laskowski, V. V. Chistyakov, and J. M. Thornton
PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D266 - D268.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Stothard, G. Van Domselaar, S. Shrivastava, A. Guo, B. O'Neill, J. Cruz, M. Ellison, and D. S. Wishart
BacMap: an interactive picture atlas of annotated bacterial genomes
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D317 - D320.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
W.-C. Kao, Y.-R. Chen, E. C. Yi, H. Lee, Q. Tian, K.-M. Wu, S.-F. Tsai, S. S.-F. Yu, Y.-J. Chen, R. Aebersold, et al.
Quantitative Proteomic Analysis of Metabolic Regulation by Copper Ions in Methylococcus capsulatus (Bath)
J. Biol. Chem., December 3, 2004; 279(49): 51554 - 51560.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
G. Rhodes, J. Parkhill, C. Bird, K. Ambrose, M. C. Jones, G. Huys, J. Swings, and R. W. Pickup
Complete Nucleotide Sequence of the Conjugative Tetracycline Resistance Plasmid pFBAOT6, a Member of a Group of IncU Plasmids with Global Ubiquity
Appl. Envir. Microbiol., December 1, 2004; 70(12): 7497 - 7510.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Y. Galperin and E. V. Koonin
'Conserved hypothetical' proteins: prioritization of targets for experimental study
Nucleic Acids Res., October 12, 2004; 32(18): 5452 - 5463.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Harte, V. Silventoinen, E. Quevillon, S. Robinson, K. Kallio, X. Fustero, P. Patel, P. Jokinen, and R. Lopez
Public web-based services from the European Bioinformatics Institute
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W3 - W9.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Rigali, M. Schlicht, P. Hoskisson, H. Nothaft, M. Merzbacher, B. Joris, and F. Titgemeyer
Extending the classification of bacterial transcription factors beyond the helix-turn-helix motif as an alternative approach to discover new cis/trans relationships
Nucleic Acids Res., June 24, 2004; 32(11): 3418 - 3426.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Apweiler, A. Bairoch, C. H. Wu, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, et al.
UniProt: the Universal Protein knowledgebase
Nucleic Acids Res., January 1, 2004; 32(90001): D115 - 119.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (122K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hulo, N.
Right arrow Articles by Bairoch, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hulo, N.
Right arrow Articles by Bairoch, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?