Skip Navigation

Nucleic Acids Research 2006 34(Database Issue):D227-D230; doi:10.1093/nar/gkj063
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (570K) Freely available
Right arrow Screen PDF (260K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hulo, N.
Right arrow Articles by Sigrist, C. J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hulo, N.
Right arrow Articles by Sigrist, C. J. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, Database issue D227-D230
© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

The PROSITE database

Nicolas Hulo*, Amos Bairoch, Virginie Bulliard, Lorenzo Cerutti1, Edouard De Castro, Petra S. Langendijk-Genevaux, Marco Pagni1 and Christian J. A. Sigrist

Swiss Institute of Bioinformatics (SIB), Centre Medical Universitaire 1 rue Michel Servet, 1211 Geneva 4, Switzerland 1Swiss Institute of Bioinformatics (SIB), BEP-UNIL 1066 Lausanne, Switzerland

*To whom correspondence should be addressed. Tel: +41 22 379 58 72; Fax +41 22 379 58 58; Email: Nicolas.Hulo{at}isb-sib.ch

Received September 13, 2005. Revised October 7, 2005. Accepted October 7, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
The PROSITE database consists of a large collection of biologically meaningful signatures that are described as patterns or profiles. Each signature is linked to a documentation that provides useful biological information on the protein family, domain or functional site identified by the signature. The PROSITE database is now complemented by a series of rules that can give more precise information about specific residues. During the last 2 years, the documentation and the ScanProsite web pages were redesigned to add more functionalities. The latest version of PROSITE (release 19.11 of September 27, 2005) contains 1329 patterns and 552 profile entries. Over the past 2 years more than 200 domains have been added, and now 52% of UniProtKB/Swiss-Prot entries (release 48.1 of September 27, 2005) have a cross-reference to a PROSITE entry. The database is accessible at http://www.expasy.org/prosite/.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
The PROSITE database uses two kinds of signatures or descriptors to identify conserved regions, i.e. patterns and generalized profiles, which both have their own strengths and weaknesses defining their area of optimum application. Each PROSITE signature is linked to an annotation document where the user can find information on the protein family or domain detected by the signature: origin of its name, taxonomic occurrence, domain architecture, function, 3D structure, main characteristics of the sequence, domain size and some references. As a more detailed description of the PROSITE database has already been provided in previous publications (1,2), this paper will only focus on recent developments that have taken place during the last 2 years.


    AUTOMATED UPDATE OF PATTERNS
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
Patterns or regular expressions are useful tools to identify short and well-conserved regions, such as catalytic sites, binding sites, post-transcriptional modifications (PTMs) or zinc fingers. They are also easy to construct and to use by biologists that have no knowledge in bioinformatics. But patterns do not stand the test of time very well. If a new sequence has an amino acid at a conserved position that was not present in the seed alignment used to construct the pattern, it will not be recognized. Thus, patterns need to be updated regularly to introduce this new variability in the regular expression.

We have developed a tool to identify weak patterns and automatically update them. This tool uses the PROSITE match list, which stores true positives, false positives (FP), false negatives (FN), partial and unknown matches, to generate a new pattern that minimizes FP and FN.

FP and FN updates are treated independently. We first take care of FN in a three-step procedure:

  1. The patterns that can potentially be updated are selected. Updating a pattern to recover FN amounts to introduce more variability in the pattern, but it increases the risk of creating new FP. Hence, only patterns that are stringent enough can be updated. The selection procedure consists of running all PROSITE patterns on a random database to keep only the ones that do not produce too many matches.
  2. Mismatches produced by each FN are detected and the pattern is modified accordingly to accept the observed residues.
  3. The new pattern is tested on a random database to see whether it is still stringent enough. If it produces too many matches in a random database, the pattern is refined and some mismatch positions are removed.

To remove false positives we check ‘wildcard’ positions (‘x’ with the PROSITE syntax) in the pattern. We look at these positions for amino acids that are only found in FP sequences. These amino acids are then ‘forbidden’ ({} with the PROSITE syntax) at these positions in the new pattern.

The new pattern is then used to scan Swiss-Prot and all new matches are checked manually. Only patterns that produce no new false positives are kept.

This strategy has allowed the automatic update of 943 patterns (out of a total of 1322 patterns in PROSITE). 2661 FN (out of a total of 14 412) and 1927 FP (out of a total of 7446) were removed. We have also removed the less specific patterns that could not be updated and have replaced them by profiles. The application of these two strategies allowed a decrease of the number of FP and FN in the Swiss-Prot part of UniProt by ~25%. The procedure to automatically update patterns is run at each major release.


    NEW FUNCTIONAL PREDICTION TOOL
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
When a signature identifies a conserved region in a given protein, it is important to know what functional information can be transferred to this new protein according to what is known about the function of the conserved region. If the information that is transferred is very general (name and position of a given domain in a sequence) only the occurrence of a match with a descriptor at a reliable score is enough. But descriptors can supply much more precise information. If one looks at the residue level, functional sites such as active sites, disulfide bridges or PTM sites can be identified. One can also look at the domain arrangement to discriminate between particular families or sub-families. The predicted function of the protein can thus be much more accurate.

PROSITE has a long experience in documentation and detailed annotation of domains, families and functional sites. This information is mainly stored in free text and used by biologists who read the various documents and make their own decision on the function of their protein according to the PROSITE matches. But with the rapid growth of sequence databases, there is an increasing need for a reliable tool that can generate automatically precise and accurate functional annotation in standard format. We thus decided to group some functional information stored in PROSITE in a database of rules that can easily be read by a program and applied on proteins that are recognized by PROSITE profiles. We named this complementary database ProRule, for PROSITE rules. ProRule generates annotation in Swiss-Prot format for DE, CC, KW or FT lines.

Two types of information are stored in ProRule:

  1. General information: the occurrence of a match with a profile is enough to trigger this annotation. Usually, it is restricted to the name of the domain and the position of its boundaries.
  2. Conditional information: this is dependent on the presence of given amino acids at precise positions, on the occurrence of other domains or on taxonomic specificity. This information is only transferred if the conditions are fulfilled. For example, an enzymatic active site is annotated only if the correct amino acid is found at the required position.

ProRule is extensively used by Swiss-Prot curators to facilitate the annotation work and to check the consistency of Swiss-Prot entries. But it can also be accessible for external users through the ScanProsite web page (see below) or downloaded from the PROSITE ftp site under PROSITE license conditions. For more details on ProRule and its range of application see Sigrist et al. (3).


    WEB PAGE
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
The PROSITE website was redesigned and new predictive tools were implemented to assign more detailed functional information to the scanned proteins. Users who want to scan their own proteins against all PROSITE entries or to scan a PROSITE entry against a protein database will find a new version of the ScanProsite web page.

PROSITE matches on UniProt knowledgebase (UniProtKB) or PDB entries are now pre-calculated and stored in a relational database (PostgreSQL) that is maintained in collaboration with Swiss-Prot (4). This greatly speeds up the scanning of UniProtKB proteins. A domain visualizer was integrated in the result page (see Figure 1a). Each ProRule associated with a PROSITE profile is also scanned, which allows the localization of interesting functional residues such as active sites, PTMs and disulfide bridges. These features are only shown if the expected amino acid is found at the right position. But we also indicate missing features when we expect another amino acid at a given position. This tool can be used to identify divergent subfamilies of proteins like inactive enzymes. In Figure 1a, we show the ScanProsite output for the human ephrin B4 receptor, which is a functional kinase receptor (5), and its paralogue the ephrin B6 receptor, which is known to have an inactive kinase domain (6). The ScanProsite output indicates that the expected Asp residue was not found at the position of the active site in ephrin B6 receptor. To test the efficiency of the method, we looked at mammalian homologues of ephrin B6 receptor. We used ScanProsite to identify all mammalian homologues of the ephrin B6 receptor and to construct a multiple sequence alignment (MSA) of this subfamily (Figure 1b). The MSA also shows that the conserved Asp residue of the active site is found in none of the ephrin B6 receptor orthologues. ScanProsite can also be used to identify new uncharacterized subfamilies of putatively inactive enzymes (Figure 1c). From the ScanProsite web page, we have searched with the kinase profile (PS50011) for plant proteins that have no detected active site and a common domain arrangement. We have identified an uncharacterized family of putatively inactive kinases, which is conserved in various plant genomes as it is shown in the MSA.



View larger version (66K):
[in this window]
[in a new window]
 
Figure 1 Output of the ScanProsite web page. (a) The left protein is a classical ephrin receptor protein (ephrin B4 receptor protein) which is known to transduce a signal through its kinase domain (5). The right protein is also an ephrin receptor protein (human ephrin B6 receptor protein) but with an inactive kinase domain (6). The ProRule associated with the kinase domain identifies an active site in ephrin B4 receptor but not in ephrin B6 receptor (absent feature: active site). The canonical Asp residue at the active site position is replaced by a serine. (b) We used ScanProsite to identify orthologues of the ephrin B6 receptor in mammals, searching for proteins that have the same domain arrangement and have a putative inactive kinase domain. (c) We also identified with ScanProsite an uncharacterized plant subfamily of kinase receptors with a putative inactive kinase domain. The canonical aspartate residue at the active site position is replaced by a histidine. This kinase subfamily is conserved in various plant genomes. Both multiple sequence alignments were generated on the ScanProsite web page using the alignment with the kinase profile (PS50011).

 
The documentation page has also been reorganized. It now contains three main sections:
  1. The description part that exposes the main characteristics of the domain or the family and a representative list of proteins that contain the domain or belong to the family.
  2. A technical section that refers to the descriptors used to identify the domain or family. For each descriptor, there is a link to a domain architecture view of UniProtKB proteins matched by the descriptor, an MSA in different formats, a link to retrieve the list of proteins matched by the descriptor in various formats and a link to a taxonomy tree view of all entries containing the domain. There is also an external link to MSDsite (7) to view ligand binding statistics of the domain and a link to 3D structures.
  3. The third section is the reference block where, for each reference, we added the PubMed ID and a direct link to the article.

The architecture view of PROSITE profiles is now visible, from each UniProtKB entry on the ExPASy server, from the PROSITE documentation page and from the ScanProsite web page. In each view, some interesting residues are tagged according to ProRule predictions (see Figure 1a).


    IMPROVEMENT OF THE PROFILE METHOD CONSTRUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
There are currently several tools to construct efficient profiles based on MSA (8). All these tools were designed to recover very divergent proteins (<20% of similarity). They were developed 10 years ago when protein databases were quite small and very few representative genomes were sequenced. There was thus a strong sample bias when constructing seed alignments and profile tools that used these seeds needed to be strongly predictive. Currently, proteins databases are 10 times bigger and thousands of genomes have been sequenced spanning the whole tree of life. It is now easier to have a seed alignment with representatives of all possible variability and descriptors can be more conservative. There is rather an increasing need for more specific descriptors in order to have more precise functional information. As we described previously, specific annotation can be assigned to a match with a profile when looking in the matched region at the residue level for the presence of particular amino acids at particular sites, such as enzymatic active sites, disulphide bridges, etc. We thus have developed a new strategy to annotate the MSA at these particular sites and to transfer this information to the profile builder program. Profile builder parameters can then be adjusted according to the annotation. We have used this strategy to adjust specific parameters in a column-dependant manner. We have tested the weight of the matrix, gap and insertion penalties. The tool aim is to be more stringent on specific columns and to produce a better local alignment, which then helps to re-localize the functional residues in sequences matched by the profile.


    HOW TO OBTAIN PROSITE
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 
PROSITE and ProRule are freely available to academic users. As of release 16, the documentation entries of PROSITE are copyright. The ProRule database is also copyright. To obtain a license, commercial users should contact:

The Swiss Institute of Bioinformatics by email: license{at}isb-sib.ch or its commercial representative: Geneva Bioinformatics (GeneBio) S.A, Case Postale 210, CH-1211 Geneva 12, Switzerland, Tel: +41 22 702 99 00, Fax: +41 22 702 99 99, Email: info{at}genebio.com.

  1. Weekly updates of PROSITE are available on our FTP server: ftp://ftp.expasy.org/databases/prosite/release_with_updates/.
  2. PROSITE is also accessible from the Hits page: http://hits.isb-sib.ch/.
  3. Frame-tolerant scans can be performed at the following address: http://www.isrec.isb-sib.ch/software/PFRAMESCAN_form.html.


    ACKNOWLEDGEMENTS
 
The authors wish to thank Tania Lima for critical reading of the manuscript. PROSITE is supported by grant no. 3152A0-103922/1 from the Swiss National Science Foundation. Funding to pay the Open Access publication charges for this article was provided by the State Secretariat for Education and Research (SER) of the Swiss Confederation.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 AUTOMATED UPDATE OF PATTERNS
 NEW FUNCTIONAL PREDICTION TOOL
 WEB PAGE
 IMPROVEMENT OF THE PROFILE...
 HOW TO OBTAIN PROSITE
 REFERENCES
 

  1. Sigrist, C.J.A., Cerutti, L., Hulo, N., Gattiker, A., Falquet, L., Pagni, M., Bairoch, A., Bucher, P. (2002) PROSITE: a documented database using patterns and profiles as motif descriptors Brief. Bioinform, . 3, 265–274[Abstract/Free Full Text] .

  2. Hulo, N., Sigrist, C.J.A., Le Saux, V., Langendijk-Genevaux, P.S., Bordoli, L., Gattiker, A., De Castro, E., Bucher, P., Bairoch, A. (2004) Recent improvements to the PROSITE database Nucleic Acids Res, . 32, 134–137 .

  3. Sigrist, C.J.A., De Castro, E., Langendijk-Genevaux, P.S., Le Saux, V., Bairoch, A., Hulo, N. (2005) ProRule: a new database containing functional and structural information on PROSITE profiles Bioinformatics, 21, 4060–4066[Abstract/Free Full Text] .

  4. Gattiker, A., Michoud, K., Rivoire, C., Auchincloss, A.H., Coudert, E., Lima, T., Kersey, P., Pagni, M., Sigrist, C.J.A., Lachaize, C., et al. (2003) Automated annotation of microbial proteomes in SWISS-PROT Comput. Biol. Chem, . 27, 49–58[CrossRef][Web of Science][Medline] .

  5. Kullander, K. and Klein, R. (2002) Mechanisms and functions of Eph and ephrin signalling Nature Rev. Mol. Cell. Biol, . 7, 475–486 .

  6. Matsuoka, H., Iwata, N., Ito, M., Shimoyama, M., Nagata, A., Chihara, K., Takai, S., Matsui, T. (1997) Expression of a kinase-defective Eph-like receptor in the normal human brain Biochem. Biophys. Res. Commun, . 235, 487–492[CrossRef][Web of Science][Medline] .

  7. Golovin, A., Dimitropoulos, D., Oldfield, T., Rachedi, A., Henrick, K. (2005) MSDsite: a database search and retrieval system for the analysis and viewing of bound ligands and active sites Proteins, 58, 190–199[CrossRef][Web of Science][Medline] .

  8. Hofmann, K. (2000) Sensitive protein comparisons with profiles and hidden Markov models Brief. Bioinform, . 2, 167–178 .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Bacteriol.Home page
J. C. Micklinghoff, K. J. Breitinger, M. Schmidt, R. Geffers, B. J. Eikmanns, and F.-C. Bange
Role of the Transcriptional Regulator RamB (Rv0465c) in the Control of the Glyoxylate Cycle in Mycobacterium tuberculosis
J. Bacteriol., December 1, 2009; 191(23): 7260 - 7269.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
R. M. Presti, G. Zhao, W. L. Beatty, K. A. Mihindukulasuriya, A. P. A. Travassos da Rosa, V. L. Popov, R. B. Tesh, H. W. Virgin, and D. Wang
Quaranfil, Johnston Atoll, and Lake Chad Viruses Are Novel Members of the Family Orthomyxoviridae
J. Virol., November 15, 2009; 83(22): 11599 - 11606.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Davidsen, E. Beck, A. Ganapathy, R. Montgomery, N. Zafar, Q. Yang, R. Madupu, P. Goetz, K. Galinsky, O. White, et al.
The comprehensive microbial resource
Nucleic Acids Res., November 5, 2009; (2009) gkp912v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Vanhee, J. Reumers, F. Stricher, L. Baeten, L. Serrano, J. Schymkowitz, and F. Rousseau
PepX: a structural database of non-redundant protein-peptide complexes
Nucleic Acids Res., October 30, 2009; (2009) gkp893v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. M. Brinkac, T. Davidsen, E. Beck, A. Ganapathy, E. Caler, R. J. Dodson, A. S. Durkin, D. M. Harkins, H. Lorenzi, R. Madupu, et al.
Pathema: a clade-specific bioinformatics resource center for pathogen research
Nucleic Acids Res., October 20, 2009; (2009) gkp850v1.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
M. Izumi, A. M. Sweeney, D. DeMartini, J. C. Weaver, M. L. Powers, A. Tao, T. V. Silvas, R. M. Kramer, W. J. Crookes-Goodson, L. M. Mathger, et al.
Changes in reflectin protein phosphorylation are associated with dynamic iridescence in squid
J R Soc Interface, September 23, 2009; (2009) rsif.2009.0299v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. A. Rowe-Magnus
Integrase-directed recovery of functional genes from genomic libraries
Nucleic Acids Res., September 1, 2009; 37(17): e118 - e118.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Kumar and L. Cowen
Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
Bioinformatics, July 1, 2009; 25(13): 1602 - 1608.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. T.-H. Chang, T.-Y. Chien, and C.-Y. Chen
seeMotif: exploring and visualizing sequence motifs in 3D structures
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W552 - W558.
[Abstract] [Full Text] [PDF]


Home page
J BiochemHome page
T. Murakami, T. Maeda, A. Yokota, and M. Wada
Gene Cloning and Expression of Pyridoxal 5'-Phosphate-Dependent L-threo-3-Hydroxyaspartate Dehydratase from Pseudomonas sp. T62, and Characterization of the Recombinant Enzyme
J. Biochem., May 1, 2009; 145(5): 661 - 668.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
S. Schuette, A. J. Wood, M. Geisler, J. Geisler-Lee, R. Ligrone, and K. S. Renzaglia
Novel localization of callose in the spores of Physcomitrella patens and phylogenomics of the callose synthase gene family
Ann. Bot., March 1, 2009; 103(5): 749 - 756.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
G. A Reeves, D. Talavera, and J. M Thornton
Genome and proteome annotation: organization, interpretation and integration
J R Soc Interface, February 6, 2009; 6(31): 129 - 147.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
A. Kaufmann and P. Philippsen
Of Bars and Rings: Hof1-Dependent Cytokinesis in Multiseptated Hyphae of Ashbya gossypii
Mol. Cell. Biol., February 1, 2009; 29(3): 771 - 783.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
H. Tang, L. Wang, X. Meng, L. Ma, S. Wang, X. He, G. Wu, and P. Xu
Novel Nicotine Oxidoreductase-Encoding Gene Involved in Nicotine Degradation by Pseudomonas putida Strain S16
Appl. Envir. Microbiol., February 1, 2009; 75(3): 772 - 778.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
E. L. Denham, P. N. Ward, and J. A. Leigh
In the absence of Lgt, lipoproteins are shed from Streptococcus uberis independently of Lsp
Microbiology, January 1, 2009; 155(1): 134 - 141.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Hunter, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, U. Das, L. Daugherty, L. Duquenne, et al.
InterPro: the integrative protein signature database
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D211 - D215.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Wilson, R. Pethica, Y. Zhou, C. Talbot, C. Vogel, M. Madera, C. Chothia, and J. Gough
SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D380 - D386.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. Lu, R. Z. Liu, V. Izumi, D. Fenstermacher, E. B. Haura, J. Koomen, and S. A. Eschrich
IPEP: an in silico tool to examine proteolytic peptides for mass spectrometry
Bioinformatics, December 1, 2008; 24(23): 2801 - 2802.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
M. Mackiewicz, B. Paigen, N. Naidoo, and A. I. Pack
Analysis of the QTL for sleep homeostasis in mice: Homer1a is a likely candidate
Physiol Genomics, October 8, 2008; 33(1): 91 - 99.
[Abstract] [Full Text] [PDF]


Home page
Antimicrob. Agents Chemother.Home page
V. Dhote, S. Gupta, and K. A. Reynolds
An O-Phosphotransferase Catalyzes Phosphorylation of Hygromycin A in the Antibiotic-Producing Organism Streptomyces hygroscopicus
Antimicrob. Agents Chemother., October 1, 2008; 52(10): 3580 - 3588.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. N. Davies, A. Secker, A. A. Freitas, E. Clark, J. Timmis, and D. R. Flower
Optimizing amino acid groupings for GPCR classification
Bioinformatics, September 15, 2008; 24(18): 1980 - 1986.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. D'Aniello, M. Irimia, I. Maeso, J. Pascual-Anaya, S. Jimenez-Delgado, S. Bertrand, and J. Garcia-Fernandez
Gene Expansion and Retention Leads to a Diverse Tyrosine Kinase Superfamily in Amphioxus
Mol. Biol. Evol., September 1, 2008; 25(9): 1841 - 1854.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. J. Dutton, D. Boyd, M. Berkmen, and J. Beckwith
Bacterial species exhibit diversity in their mechanisms and capacity for protein disulfide bond formation
PNAS, August 19, 2008; 105(33): 11933 - 11938.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
A. Nakhamchik, C. Wilde, and D. A. Rowe-Magnus
Cyclic-di-GMP Regulates Extracellular Polysaccharide Production, Biofilm Formation, and Rugose Colony Development by Vibrio vulnificus
Appl. Envir. Microbiol., July 1, 2008; 74(13): 4199 - 4209.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. F. W. Saunders and B. Kobe
The Predikin webserver: improved prediction of protein kinase peptide specificity using structural information
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W286 - W290.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T.-Y. Chien, D. T.-H. Chang, C.-Y. Chen, Y.-Z. Weng, and C.-M. Hsu
E1DS: catalytic site prediction based on 1D signatures of concurrent conservation
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W291 - W296.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. Cun, H. Li, R. Ge, M. C. M. Lin, and H. Sun
A Histidine-rich and Cysteine-rich Metal-binding Domain at the C Terminus of Heat Shock Protein A from Helicobacter pylori: IMPLICATION FOR NICKEL HOMEOSTASIS AND BISMUTH SUSCEPTIBILITY
J. Biol. Chem., May 30, 2008; 283(22): 15142 - 15151.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
T. Saito, M. Nishi, M. I. Lim, B. Wu, T. Maeda, H. Hashimoto, T. Takeuchi, D. S. Roos, and T. Asai
A Novel GDP-dependent Pyruvate Kinase Isozyme from Toxoplasma gondii Localizes to Both the Apicoplast and the Mitochondrion
J. Biol. Chem., May 16, 2008; 283(20): 14041 - 14052.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
P. Boulanger, P. Jacquot, L. Plancon, M. Chami, A. Engel, C. Parquet, C. Herbeuval, and L. Letellier
Phage T5 Straight Tail Fiber Is a Multifunctional Protein Acting as a Tape Measure and Carrying Fusogenic and Muralytic Activities
J. Biol. Chem., May 16, 2008; 283(20): 13556 - 13564.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
K. Brinkrolf, S. Ploger, S. Solle, I. Brune, S. S. Nentwich, A. T. Huser, J. Kalinowski, A. Puhler, and A. Tauch
The LacI/GalR family transcriptional regulator UriR negatively controls uridine utilization of Corynebacterium glutamicum by binding to catabolite-responsive element (cre)-like sequences
Microbiology, April 1, 2008; 154(4): 1068 - 1081.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Ye, G. Vriend, and A. P. IJzerman
Tracing evolutionary pressure
Bioinformatics, April 1, 2008; 24(7): 908 - 915.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
I. Ruiz-Trillo, A. J. Roger, G. Burger, M. W. Gray, and B. F. Lang
A Phylogenomic Investigation into the Origin of Metazoa
Mol. Biol. Evol., April 1, 2008; 25(4): 664 - 672.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
V. Naponelli, A. Noiriel, M. J. Ziemak, S. M. Beverley, L.-F. Lye, A. M. Plume, J. R. Botella, K. Loizeau, S. Ravanel, F. Rebeille, et al.
Phylogenomic and Functional Analysis of Pterin-4a-Carbinolamine Dehydratase Family (COG2154) Proteins in Plants and Microorganisms
Plant Physiology, April 1, 2008; 146(4): 1515 - 1527.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-M. Hsu, C.-Y. Chen, and B.-J. Liu
Corrigendum
Nucleic Acids Res., March 27, 2008; 36(4): 1400 - 1406.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
T. L. Tootle and A. C. Spradling
Drosophila Pxt: a cyclooxygenase-like facilitator of follicle maturation
Development, March 1, 2008; 135(5): 839 - 847.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. J. Perez-Garcia, M. Gou-Fabregas, Y. de Pablo, M. Llovera, J. X. Comella, and R. M. Soler
Neuroprotection by Neurotrophic Factors and Membrane Depolarization Is Regulated by Calmodulin Kinase IV
J. Biol. Chem., February 15, 2008; 283(7): 4133 - 4144.
[Abstract] [Full Text] [PDF]


Home page
JEMHome page
C. Giefing, A. L. Meinke, M. Hanner, T. Henics, D. B. Minh, D. Gelbmann, U. Lundberg, B. M. Senn, M. Schunn, A. Habel, et al.
Discovery of a novel class of highly conserved vaccine antigens using genomic scale antigenic fingerprinting of pneumococcus with human antibodies
J. Exp. Med., January 21, 2008; 205(1): 117 - 131.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Brylinski and J. Skolnick
A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation
PNAS, January 8, 2008; 105(1): 129 - 134.
[Abstract] [Full Text] [PDF]


Home page
JCBHome page
I. Raykhel, H. Alanen, K. Salo, J. Jurvansuu, V. D. Nguyen, M. Latva-Ranta, and L. Ruddock
A molecular specificity code for the three mammalian KDEL receptors
J. Cell Biol., December 17, 2007; 179(6): 1193 - 1204.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Dinkel and H. Sticht
A computational strategy for the prediction of functional linear peptide motifs in proteins
Bioinformatics, December 15, 2007; 23(24): 3297 - 3303.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
H. L. Wong, R. Pinontoan, K. Hayashi, R. Tabata, T. Yaeno, K. Hasegawa, C. Kojima, H. Yoshioka, K. Iba, T. Kawasaki, et al.
Regulation of Rice NADPH Oxidase by Binding of Rac GTPase to Its N-Terminal Extension
PLANT CELL, December 1, 2007; 19(12): 4022 - 4034.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S.-H. Chon, Y. X. Zhou, J. L. Dixon, and J. Storch
Intestinal Monoacylglycerol Metabolism: DEVELOPMENTAL AND NUTRITIONAL REGULATION OF MONOACYLGLYCEROL LIPASE AND MONOACYLGLYCEROL ACYLTRANSFERASE
J. Biol. Chem., November 16, 2007; 282(46): 33346 - 33357.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Bhardwaj, R. V. Stahelin, G. Zhao, W. Cho, and H. Lu
MeTaDoR: a comprehensive resource for membrane targeting domains and their host proteins
Bioinformatics, November 15, 2007; 23(22): 3110 - 3112.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. A. Rodriguez, T. Bompada, M. Syed, P. K. Shah, and N. Maltsev
Evolutionary analysis of enzymes using Chisel
Bioinformatics, November 15, 2007; 23(22): 2961 - 2968.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
R. Huhne, F.-T. Koch, and J. Suhnel
A comparative view at comprehensive information resources on three-dimensional structures of biological macro-molecules
Brief Funct Genomic Proteomic, October 23, 2007; (2007) elm020v1.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
M. Thys, I. Schrauwen, K. Vanderstraeten, K. Janssens, N. Dieltjens, K. Van Den Bogaert, E. Fransen, W. Chen, M. Ealy, M. Claustres, et al.
The coding polymorphism T263I in TGF-{beta}1 is associated with otosclerosis in two independent populations
Hum. Mol. Genet., September 1, 2007; 16(17): 2021 - 2030.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Pagni, V. Ioannidis, L. Cerutti, M. Zahn-Zabal, C. V. Jongeneel, J. Hau, O. Martin, D. Kuznetsov, and L. Falquet
MyHits: improvements to an interactive resource for analyzing protein sequences
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W433 - W437.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
E. F. Donaldson, A. C. Sims, R. L. Graham, M. R. Denison, and R. S. Baric
Murine Hepatitis Virus Replicase Protein nsp10 Is a Critical Regulator of Viral RNA Synthesis
J. Virol., June 15, 2007; 81(12): 6356 - 6368.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. M. Higa, S. L. Alam, W. I. Sundquist, and K. S. Ullman
Molecular Characterization of the Ran-binding Zinc Finger Domain of Nup153
J. Biol. Chem., June 8, 2007; 282(23): 17090 - 17100.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. S. Papadopoulos and R. Agarwala
COBALT: constraint-based alignment tool for multiple protein sequences
Bioinformatics, May 1, 2007; 23(9): 1073 - 1079.
[Abstract] [Full Text] [PDF]


Home page
FASEB J.Home page
A.-J. Rodrigues, G. Coppola, C. Santos, M. d. C. Costa, M. Ailion, J. Sequeiros, D. H. Geschwind, and P. Maciel
Functional genomics and biochemical characterization of the C. elegans orthologue of the Machado-Joseph disease protein ataxin-3
FASEB J, April 1, 2007; 21(4): 1126 - 1136.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Richardt, D. Lang, R. Reski, W. Frank, and S. A. Rensing
PlanTAPDB, a Phylogeny-Based Resource of Plant Transcription-Associated Proteins
Plant Physiology, April 1, 2007; 143(4): 1452 - 1466.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Feng, W. Wang, J. Cheng, Y. Ren, G. Zhao, C. Gao, Y. Tang, X. Liu, W. Han, X. Peng, et al.
Genome and proteome of long-chain alkane degrading Geobacillus thermodenitrificans NG80-2 isolated from a deep-subsurface oil reservoir
PNAS, March 27, 2007; 104(13): 5602 - 5607.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Ye, W. A. Kosters, and A. P. IJzerman
An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences
Bioinformatics, March 15, 2007; 23(6): 687 - 693.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bork, V. Buillard, L. Cerutti, R. Copley, et al.
New developments in the InterPro database
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D224 - D228.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. B. Rubin, Y. Shemesh, M. Cohen, S. Elgavish, H. M. Robertson, and G. Bloch
Molecular and phylogenetic analyses reveal mammalian-like clockwork in the honey bee (Apis mellifera) and shed new light on the molecular evolution of the circadian clock
Genome Res., November 1, 2006; 16(11): 1352 - 1365.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
S. Bottcher, B. G. Klupp, H. Granzow, W. Fuchs, K. Michael, and T. C. Mettenleiter
Identification of a 709-Amino-Acid Internal Nonessential Region within the Essential Conserved Tegument Protein (p)UL36 of Pseudorabies Virus
J. Virol., October 1, 2006; 80(19): 9910 - 9915.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Lingner and P. Meinicke
Remote homology detection based on oligomer distances
Bioinformatics, September 15, 2006; 22(18): 2224 - 2231.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Espadaler, E. Querol, F. X. Aviles, and B. Oliva
Identification of function-associated loop motifs and application to protein function prediction
Bioinformatics, September 15, 2006; 22(18): 2237 - 2243.
[Abstract] [Full Text] [PDF]


Home page
IOVSHome page
W. Adachi, H. Ulanovsky, Y. Li, B. Norman, J. Davis, and J. Piatigorsky
Serial Analysis of Gene Expression (SAGE) in the Rat Limbal and Central Corneal Epithelium.
Invest. Ophthalmol. Vis. Sci., September 1, 2006; 47(9): 3801 - 3810.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. R. Gabdoulline, S. Ulbrich, S. Richter, and R. C. Wade
ProSAT2--Protein Structure Annotation Server.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W79 - W83.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C.-M. Hsu, C.-Y. Chen, and B.-J. Liu
MAGIIC-PRO: detecting functional signatures by efficient discovery of long patterns in protein sequences.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W356 - W361.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. de Castro, C. J. A. Sigrist, A. Gattiker, V. Bulliard, P. S. Langendijk-Genevaux, E. Gasteiger, A. Bairoch, and N. Hulo
ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W362 - W365.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
G. A. Reeves, J. M. Thornton, and the BioSapiens Network of Excellence
Integrating biological data through the genome
Hum. Mol. Genet., April 15, 2006; 15(suppl_1): R81 - R87.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (570K) Freely available
Right arrow Screen PDF (260K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Hulo, N.
Right arrow Articles by Sigrist, C. J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hulo, N.
Right arrow Articles by Sigrist, C. J. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?