Skip Navigation

Nucleic Acids Research 2005 33(Database Issue):D433-D437; doi:10.1093/nar/gki005
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (494K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by von Mering, C.
Right arrow Articles by Bork, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by von Mering, C.
Right arrow Articles by Bork, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2005, Vol. 33, Database issue D433-D437
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved

STRING: known and predicted protein–protein associations, integrated and transferred across organisms

Christian von Mering, Lars J. Jensen, Berend Snel1, Sean D. Hooper, Markus Krupp, Mathilde Foglierini, Nelly Jouffre, Martijn A. Huynen1 and Peer Bork*

European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany and 1 Nijmegen Centre for Molecular Life Sciences p/a Centre of Molecular and Biomolecular Informatics, University Medical Center St Radboud, Toernooiveld 1, 6525 ED Nijmegen, The Netherlands

* To whom correspondence should be addressed. Tel: +49 6221 387 8526; Fax: +49 6221 387 517; Email: mering{at}embl-heidelberg.de

Received September 11, 2004; Accepted September 13, 2004


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 
A full description of a protein's function requires knowledge of all partner proteins with which it specifically associates. From a functional perspective, ‘association’ can mean direct physical binding, but can also mean indirect interaction such as participation in the same metabolic pathway or cellular process. Currently, information about protein association is scattered over a wide variety of resources and model organisms. STRING aims to simplify access to this information by providing a comprehensive, yet quality-controlled collection of protein–protein associations for a large number of organisms. The associations are derived from high-throughput experimental data, from the mining of databases and literature, and from predictions based on genomic context analysis. STRING integrates and ranks these associations by benchmarking them against a common reference set, and presents evidence in a consistent and intuitive web interface. Importantly, the associations are extended beyond the organism in which they were originally described, by automatic transfer to orthologous protein pairs in other organisms, where applicable. STRING currently holds 730 000 proteins in 180 fully sequenced organisms, and is available at http://string.embl.de/.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 
Several databases exist, whose main purpose is to collect and curate direct experimental evidence about protein–protein interactions (14). Other databases take a more generalized perspective on proteins and their associations, by functionally grouping proteins into metabolic, signaling or transcriptional pathways (58). Finally, a third class of resources attempts to fill gaps in both datasets, by predicting protein–protein associations de novo, using a variety of computational techniques (913).

The database STRING (‘Search Tool for the Retrieval of Interacting Genes/Proteins’) represents an ongoing effort to provide these three types of protein–protein association evidence under one common framework. Such an integrated approach offers several unique advantages: (i) various types of evidence are mapped onto a single, stable set of proteins, thereby facilitating comparative analysis; (ii) known and predicted interactions often partially complement each other, leading to increased coverage; (iii) an integrated scoring scheme can provide higher confidence when independent evidence types agree; and (iv) mapping and transferring interactions onto a large number of organisms facilitates evolutionary studies.

Because STRING is fully pre-computed, all information can be quickly accessed—both at the high-level network view and at the level of the individual interaction record. The various evidence types can be enabled or disabled separately, which allows the searches to be customized at run-time, and dedicated viewers allow the inspection of all the evidence underlying an association (Figure 1). The database is an exploratory resource: it contains a much larger number of associations than primary interaction databases—albeit with varying confidence scores. It is thus best used for getting a quick initial overview of the functional partners of a query protein, especially for proteins that are still poorly characterized.



View larger version (57K):
[in this window]
[in a new window]
 
Figure 1. Results from a STRING search. Inserts show partial screen shots from evidence pages, which are accessible from the main result page. Two proteins were used as inputs to the query—one is a subunit from the yeast ATP synthase complex, the other a subunit from the ubiquinol–cytochrome C reductase complex. The number of requested partners was limited to 10 (default settings). STRING reports both proteins to be members of functional modules, which are in turn connected as part of a larger unit. The diversity of evidence types supporting the modules is noted.

 

    DATA SOURCES AND SCORING
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 
Many of the protein–protein associations in STRING are imported from other databases (see below), but STRING also contains a large body of predicted associations that are produced de novo. These predictions are based on systematic genome comparisons [‘genomic context’, (14,15)]. We periodically import completely sequenced genomes [metazoan genomes from Ensembl, all others from SwissProt, (16)], and search them for three types of genomic context associations: conserved genomic neighborhood, gene fusion events, and co-occurrence of genes across genomes. All three searches aim to identify pairs of genes which appear to be under common selective pressures during evolution (more so than expected by chance), and which are therefore thought to be functionally associated.

As for all other types of associations in STRING, we assign a confidence score to each predicted association. The scores are derived by benchmarking the performance of the predictions against a common reference set of trusted, true associations. We chose the functional grouping of proteins maintained at KEGG [Kyoto Encyclopedia of Genes and Genomes, (5)] as the reference. Any predicted association for which both proteins are assigned to the same ‘KEGG pathway’ is counted as a true positive. KEGG pathways are particularly suitable as a reference because they are based on manual curation, are available for a number of organisms, and cover several functional areas. The benchmarked confidence scores in STRING generally correspond to the probability of finding the linked proteins within the same KEGG pathway. STRING performs a similar benchmark for high-throughput experimental interaction data, separately for each dataset. Scores vary within one dataset because they include additional, intrinsic information from the data itself, such as the frequency or reciprocality of the detection (see Figure 2 for a typical benchmark). In contrast to high-throughput data, validated small-scale interactions, protein complexes, and annotated pathways are directly imported from databases (2,5,17), and given a uniform confidence score per dataset.



View larger version (24K):
[in this window]
[in a new window]
 
Figure 2. Deriving confidence scores for high-throughput interaction data [exemplified here for a dataset of protein complex purifications (22)]. In this case, the relative confidence depends on how often two proteins are pulled down together (a and b), versus how often they are pulled down alone (c and d). A purification is counted twice when one of the partners is the bait (a and d). Raw quality is: Q = log{(Ntogether · Ntotal)/[(Nalone1 + 1) · (Nalone2 + 1)]}.

 
Another important source of protein association information is the published literature (18,19). We systematically extract associations from PubMed, by searching for recurrent co-mentioning of gene names in abstracts. This search relies on gene names and synonyms parsed from SwissProt as well as from organism-specific databases, and we utilize a benchmarked scoring system based on the frequencies and distributions of gene names in abstracts (not shown).

Finally, we also derive protein–protein associations from functional genomics data: co-regulation of genes across diverse experimental conditions, as measured by using microarray analysis, can be a predictor of functional associations (20). We import these associations from the ArrayProspector server (12), which is based on the same benchmarks and genomes as STRING itself.


    TRANSFER OF ASSOCIATIONS ACROSS ORGANISMS
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 
STRING employs two different strategies for transferring known and predicted associations between organisms (Figure 3): the first (‘COG-mode’) relies on externally provided orthology assignments and transfers interactions in an all-or-none fashion, whereas the second (‘protein-mode’) uses quantitative sequence similarity searches and often distributes a given interaction fractionally among several protein pairs of the target organism. Both approaches have strengths and weaknesses, and users can choose either one of them before starting their query (a color change helps them to distinguish the modus throughout the user interface).



View larger version (34K):
[in this window]
[in a new window]
 
Figure 3. Transferring association scores between organisms. Initial situation (top): a scored association between two proteins in a source organism—how confidently can it be transferred to a target organism by a postulated association among homologous proteins? Bottom left: in ‘COG-mode’, all proteins in an orthologous group (COG) are considered equivalent. The highest association score between any two proteins in the two COGs is assumed to be valid for all pairs. Bottom right: in ‘protein-mode’, all sequence similarity relations between the two organisms are considered. Associations are transferred fractionally, such that the pair with the highest similarity receives the bulk of the score. The relation is not linear: empiric analysis (not shown) suggests that competing similarity links should be down weighted, relative to the best link, as follows: (i) express similarities as values between zero and one, i.e. normalize by self-hit; (ii) transform similarities using s' = exp(–k1/s), thereby amplifying their ‘spread’; (iii) re-normalize so that, between the two species, all similarities for a protein family add up to one; (iv) each pair of proteins, A and B in the target species now receives a share of the association score: Starget = Ssource · k2 · s'A · s'B. (optimal values for k1 and k2 were empirically found to be 0.7 for both).

 
The COG mode requires an assignment of proteins into orthologous groups; all proteins within such a group are assumed to be functionally equivalent across genomes. This orthology information is imported from the COGs database [(21), we extend the groups to cover all organisms in STRING]. Any association score observed between a pair of proteins from two different COGs is assumed to be valid for all protein pairs spanning these two COGs. Repeated observations of links, e.g. occurrence of genes in the same operon, increase the association score—but only when they are observed in phylogenetically distant organisms.

In the newly developed protein mode, there is no preassigned orthology information. Instead, the transfer relies on a precomputed all-against-all similarity search of the 730 000 proteins in STRING (using the sensitive Smith-Waterman algorithm). For each association to be transferred, the algorithm searches for potential orthologs of the interacting partners in other genomes. Orthology is assumed if proteins form reciprocal best matches in the searches, in the absence of any close, second-best hits (paralogs) in either species. In such an ideal situation, the interactions can be transferred in toto. However, in reality there will often be additional paralogs in one or both of the genomes, which complicates the transfer. We have devised and benchmarked an empirical scheme that is based on the relative sequence similarity of competing paralogous proteins (Figure 3). Essentially, the pair of proteins exhibiting the highest sequence similarity to the source pair receives the highest ‘share’ of the transferred interaction.


    INTEGRATION
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 
After assignment of association scores and transfer between species, we compute a final ‘combined score’ between any pair of proteins (or pair of COGs). This score is often higher than the individual sub-scores, expressing increased confidence when an association is supported by several types of evidence (Table 1). It is computed under the assumption of independence for the various sources, in a naïve Bayesian fashion. It is thus a simple expression of the individual scores:


View this table:
[in this window]
[in a new window]
 
Table 1. The number of associations stored in STRING, shown separately for each data source and confidence range (low confidence: scores <0.4; medium: 0.4 to 0.7; high: >0.7)

 
The assumption of independence is valid here because datasets that are based on similar technologies (e.g. different yeast two-hybrid datasets) have been joined previously and are benchmarked as a single information source. Along with the combined score, the individual sub-scores are always displayed as well, because they provide valuable information about the nature of a particular association.


    ACKNOWLEDGEMENTS
 
This work was supported in part by grants from the Bundesministerium für Forschung und Bildung, Germany, from the Netherlands Organization of Scientific Research (NOW), and from The Knut and Alice Wallenberg Foundation (to S.D.H.).


    Notes
 
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 DATA SOURCES AND SCORING
 TRANSFER OF ASSOCIATIONS ACROSS...
 INTEGRATION
 REFERENCES
 

  1. Salwinski,L., Miller,C.S., Smith,A.J., Pettit,F.K., Bowie,J.U. and Eisenberg,D. ( (2004) ) The database of interacting proteins: 2004 update. Nucleic Acids Res., , 32, , D449–D451.[Abstract/Free Full Text] .

  2. Bader,G.D., Betel,D. and Hogue,C.W. ( (2003) ) BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res., , 31, , 248–250.[Abstract/Free Full Text] .

  3. Hermjakob,H., Montecchi-Palazzi,L., Lewington,C., Mudali,S., Kerrien,S., Orchard,S., Vingron,M., Roechert,B., Roepstorff,P., Valencia,A. et al. ( (2004) ) IntAct: an open source molecular interaction database. Nucleic Acids Res., , 32, , D452–D455.[Abstract/Free Full Text] .

  4. Zanzoni,A., Montecchi-Palazzi,L., Quondam,M., Ausiello,G., Helmer-Citterich,M. and Cesareni,G. ( (2002) ) MINT: a Molecular INTeraction database. FEBS Lett., , 513, , 135–140.[CrossRef][Web of Science][Medline] .

  5. Kanehisa,M., Goto,S., Kawashima,S., Okuno,Y. and Hattori,M. ( (2004) ) The KEGG resource for deciphering the genome. Nucleic Acids Res., , 32, , D277–D280.[Abstract/Free Full Text] .

  6. Krieger,C.J., Zhang,P., Mueller,L.A., Wang,A., Paley,S., Arnaud,M., Pick,J., Rhee,S.Y. and Karp,P.D. ( (2004) ) MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res., , 32, , D438–D442.[Abstract/Free Full Text] .

  7. Joshi-Tope,G., Vastrik,I., Gopinath,G.R., Matthews,L., Schmidt,E., Gillespie,M., D'Eustachio,P., Jassal,B., Lewis,S., Wu,G. et al. ( (2003) ) The Genome Knowledgebase: a resource for biologists and bioinformaticists. Cold Spring Harb. Symp. Quant. Biol., , 68, , 237–243.[CrossRef][Web of Science][Medline] .

  8. Salgado,H., Gama-Castro,S., Martinez-Antonio,A., Diaz-Peredo,E., Sanchez-Solano,F., Peralta-Gil,M., Garcia-Alonso,D., Jimenez-Jacinto,V., Santos-Zavaleta,A., Bonavides-Martinez,C. et al. ( (2004) ) RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12. Nucleic Acids Res., , 32, , D303–D306.[Abstract/Free Full Text] .

  9. Enault,F., Suhre,K., Poirot,O., Abergel,C. and Claverie,J.M. ( (2004) ) Phydbac2: improved inference of gene function using interactive phylogenomic profiling and chromosomal location analysis. Nucleic Acids Res., , 32, , W336–W339.[Abstract/Free Full Text] .

  10. Mellor,J.C., Yanai,I., Clodfelter,K.H., Mintseris,J. and DeLisi,C. ( (2002) ) Predictome: a database of putative functional links between proteins. Nucleic Acids Res., , 30, , 306–309.[Abstract/Free Full Text] .

  11. von Mering,C., Huynen,M., Jaeggi,D., Schmidt,S., Bork,P. and Snel,B. ( (2003) ) STRING: a database of predicted functional associations between proteins. Nucleic Acids Res., , 31, , 258–261.[Abstract/Free Full Text] .

  12. Jensen,L.J., Lagarde,J., von Mering,C. and Bork,P. ( (2004) ) ArrayProspector: a web resource of functional associations inferred from microarray expression data. Nucleic Acids Res., , 32, , W445–W448.[Abstract/Free Full Text] .

  13. Bowers,P.M., Pellegrini,M., Thompson,M.J., Fierro,J., Yeates,T.O. and Eisenberg,D. ( (2004) ) Prolinks: a database of protein functional linkages derived from coevolution. Genome Biol., , 5, , R35.[CrossRef][Medline] .

  14. Valencia,A. and Pazos,F. ( (2003) ) Prediction of protein-protein interactions from evolutionary information. Methods Biochem Anal., , 44, , 411–426.[Medline] .

  15. Huynen,M.A., Snel,B., von Mering,C. and Bork,P. ( (2003) ) Function prediction and protein networks. Curr. Opin. Cell Biol., , 15, , 191–198.[CrossRef][Web of Science][Medline] .

  16. Brooksbank,C., Camon,E., Harris,M.A., Magrane,M., Martin,M.J., Mulder,N., O'Donovan,C., Parkinson,H., Tuli,M.A., Apweiler,R. et al. ( (2003) ) The European Bioinformatics Institute's data resources. Nucleic Acids Res., , 31, , 43–50.[Abstract/Free Full Text] .

  17. Mewes,H.W., Amid,C., Arnold,R., Frishman,D., Guldener,U., Mannhaupt,G., Munsterkotter,M., Pagel,P., Strack,N., Stumpflen,V. et al. ( (2004) ) MIPS: analysis and annotation of proteins from whole genomes. Nucleic Acids Res., , 32, , D41–D44.[Abstract/Free Full Text] .

  18. Donaldson,I., Martin,J., de Bruijn,B., Wolting,C., Lay,V., Tuekam,B., Zhang,S., Baskin,B., Bader,G.D., Michalickova,K. et al. ( (2003) ) PreBIND and Textomy—mining the biomedical literature for protein–protein interactions using a support vector machine. BMC Bioinformatics, , 4, , 11.[CrossRef][Medline] .

  19. Marcotte,E.M., Xenarios,I. and Eisenberg,D. ( (2001) ) Mining literature for protein-protein interactions. Bioinformatics, , 17, , 359–363.[Abstract/Free Full Text] .

  20. Stuart,J.M., Segal,E., Koller,D. and Kim,S.K. ( (2003) ) A gene-coexpression network for global discovery of conserved genetic modules. Science, , 302, , 249–255.[Abstract/Free Full Text] .

  21. Tatusov,R.L., Fedorova,N.D., Jackson,J.D., Jacobs,A.R., Kiryutin,B., Koonin,E.V., Krylov,D.M., Mazumder,R., Mekhedov,S.L., Nikolskaya,A.N. et al. ( (2003) ) The COG database: an updated version includes eukaryotes. BMC Bioinformatics, , 4, , 41.[CrossRef][Medline] .

  22. Gavin,A.C., Bosche,M., Krause,R., Grandi,P., Marzioch,M., Bauer,A., Schultz,J., Rick,J.M., Michon,A.M., Cruciat,C.M. et al. ( (2002) ) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature, , 415, , 141–147.[CrossRef][Medline] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
A. Csikasz-Nagy
Computational systems biology of the cell cycle
Brief Bioinform, July 1, 2009; 10(4): 424 - 434.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. Alexeyenko and E. L.L. Sonnhammer
Global networks of functional coupling in eukaryotes from comprehensive data integration
Genome Res., June 1, 2009; 19(6): 1107 - 1116.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. A. Elias, A. Mukhopadhyay, M. P. Joachimiak, E. C. Drury, A. M. Redding, H.-C. B. Yen, M. W. Fields, T. C. Hazen, A. P. Arkin, J. D. Keasling, et al.
Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation
Nucleic Acids Res., May 1, 2009; 37(9): 2926 - 2939.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. J. Wodak, S. Pu, J. Vlasblom, and B. Seraphin
Challenges and Rewards of Interaction Proteomics
Mol. Cell. Proteomics, January 1, 2009; 8(1): 3 - 18.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Yu and R. L. Finley Jr
Combining multiple positive training sets to generate confidence scores for protein-protein interactions
Bioinformatics, January 1, 2009; 25(1): 105 - 111.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. J. Jensen, M. Kuhn, M. Stark, S. Chaffron, C. Creevey, J. Muller, T. Doerks, P. Julien, A. Roth, M. Simonovic, et al.
STRING 8--a global view on proteins and their functional interactions in 630 organisms
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D412 - D416.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J. R. Tavares, R. F. de Souza, G. L. S. Meira, and F. J. Gueiros-Filho
Cytological Characterization of YpsB, a Novel Component of the Bacillus subtilis Divisome
J. Bacteriol., November 1, 2008; 190(21): 7096 - 7107.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. A. A. Castro, R. J. S. Dalmolin, J. C. F. Moreira, J. C. M. Mombach, and R. M. C. de Almeida
Evolutionary origins of human apoptosis and genome-stability gene networks
Nucleic Acids Res., November 1, 2008; 36(19): 6269 - 6283.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
Y. Xue, J. Ren, X. Gao, C. Jin, L. Wen, and X. Yao
GPS 2.0, a Tool to Predict Kinase-specific Phosphorylation Sites in Hierarchy
Mol. Cell. Proteomics, September 1, 2008; 7(9): 1598 - 1608.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
K. Pawlowski
Uncharacterized/hypothetical proteins in biomedical 'omics' experiments: is novelty being swept under the carpet?
Brief Funct Genomic Proteomic, July 19, 2008; (2008) eln033v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Ha, Y.-J. Seo, M.-S. Kwon, B.-H. Chang, C.-K. Han, and J.-H. Yoon
IDMap: facilitating the detection of potential leads with therapeutic targets
Bioinformatics, June 1, 2008; 24(11): 1413 - 1415.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Gunther, M. Kuhn, M. Dunkel, M. Campillos, C. Senger, E. Petsalaki, J. Ahmed, E. G. Urdiales, A. Gewiess, L. J. Jensen, et al.
SuperTarget and Matador: resources for exploring drug-target relationships
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D919 - D922.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Kuhn, C. von Mering, M. Campillos, L. J. Jensen, and P. Bork
STITCH: interaction networks of chemicals and proteins
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D684 - D688.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
T. Fuhrer, L. Chen, U. Sauer, and D. Vitkup
Computational Prediction and Experimental Verification of the Gene Encoding the NAD+/NADP+-Dependent Succinate Semialdehyde Dehydrogenase in Escherichia coli
J. Bacteriol., November 15, 2007; 189(22): 8073 - 8078.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
J. Xiong, C. E. Bauer, and A. Pancholy
Insight into the haem d1 biosynthesis pathway in heliobacteria through bioinformatics analysis
Microbiology, October 1, 2007; 153(10): 3548 - 3562.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. D. Harrington, A. H. Singh, T. Doerks, I. Letunic, C. von Mering, L. J. Jensen, J. Raes, and P. Bork
Quantitative assessment of protein function prediction from metagenomics shotgun sequences
PNAS, August 28, 2007; 104(35): 13913 - 13918.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. L. Green and P. D. Karp
Using genome-context data to identify specific types of functional associations in pathway/genome databases
Bioinformatics, July 1, 2007; 23(13): i205 - i211.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. Iwasaki and T. Takagi
Reconstruction of highly heterogeneous gene-content evolution across the three domains of life
Bioinformatics, July 1, 2007; 23(13): i230 - i239.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
J.-H. Shin and C. W. Price
The SsrA-SmpB Ribosome Rescue System Is Important for Growth of Bacillus subtilis at Low and High Temperatures
J. Bacteriol., May 15, 2007; 189(10): 3729 - 3737.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. A. A. Castro, J. C. M. Mombach, R. M. C. de Almeida, and J. C. F. Moreira
Impaired expression of NER gene network in sporadic solid tumors
Nucleic Acids Res., March 19, 2007; 35(6): 1859 - 1867.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Gu, I. Anderson, V. Kunin, M. Cipriano, S. Minovitsky, G. Weber, N. Amenta, B. Hamann, and I. Dubchak
TreeQ-VISTA: an interactive tree visualization tool with functional annotation query capabilities
Bioinformatics, March 15, 2007; 23(6): 764 - 766.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. G. Thakur, A. M. Joshi, and B. Gopal
Structural and Biophysical Studies on Two Promoter Recognition Domains of the Extra-cytoplasmic Function {sigma} Factor {sigma}C from Mycobacterium tuberculosis
J. Biol. Chem., February 16, 2007; 282(7): 4711 - 4718.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. von Mering, L. J. Jensen, M. Kuhn, S. Chaffron, T. Doerks, B. Kruger, B. Snel, and P. Bork
STRING 7--recent developments in the integration and prediction of protein interactions
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D358 - D362.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
A. Ng, B. Bursteinas, Q. Gao, E. Mollison, and M. Zvelebil
Resources for integrative systems biology: from data through databases to networks and dynamic system models
Brief Bioinform, December 1, 2006; 7(4): 318 - 330.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. E. Vinogradov
'Genome design' model and multicellular complexity: golden middle
Nucleic Acids Res., November 6, 2006; 34(20): 5906 - 5914.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
J. Boekhorst, M. Wels, M. Kleerebezem, and R. J. Siezen
The predicted secretome of Lactobacillus plantarum WCFS1 sheds light on interactions with its environment.
Microbiology, November 1, 2006; 152(Pt 11): 3175 - 3183.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
I. Friedberg
Automated protein function prediction--the genomic challenge
Brief Bioinform, September 1, 2006; 7(3): 225 - 242.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Li, X. Li, H. Su, H. Chen, and D. W. Galbraith
A framework of integrating gene relations from heterogeneous data sources: an experiment on Arabidopsis thaliana
Bioinformatics, August 15, 2006; 22(16): 2037 - 2043.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. L. Green and P. D. Karp
The outcomes of pathway database computations depend on pathway ontology
Nucleic Acids Res., August 7, 2006; 34(13): 3687 - 3697.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Massjouni, C. G. Rivera, and T. M. Murali
VIRGO: computational prediction of gene functions.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W340 - W344.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Toft and M. A. Fares
GRAST: a new way of genome reduction analysis using comparative genomics
Bioinformatics, July 1, 2006; 22(13): 1551 - 1561.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Strong, M. R. Sawaya, S. Wang, M. Phillips, D. Cascio, and D. Eisenberg
Toward the structural genomics of complexes: Crystal structure of a PE/PPE protein complex from Mycobacterium tuberculosis
PNAS, May 23, 2006; 103(21): 8060 - 8065.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
C. B. Abulencia, D. L. Wyborski, J. A. Garcia, M. Podar, W. Chen, S. H. Chang, H. W. Chang, D. Watson, E. L. Brodie, T. C. Hazen, et al.
Environmental whole-genome amplification to access microbial populations in contaminated sediments.
Appl. Envir. Microbiol., May 1, 2006; 72(5): 3291 - 3301.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
B. Adamcsek, G. Palla, I. J. Farkas, I. Derenyi, and T. Vicsek
CFinder: locating cliques and overlapping modules in biological networks
Bioinformatics, April 15, 2006; 22(8): 1021 - 1023.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Saric, L. J. Jensen, R. Ouzounova, I. Rojas, and P. Bork
Extraction of regulatory gene/protein networks from Medline
Bioinformatics, March 15, 2006; 22(6): 645 - 650.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Campillos, C. von Mering, L. J. Jensen, and P. Bork
Identification and analysis of evolutionarily cohesive functional modules in protein networks
Genome Res., March 1, 2006; 16(3): 374 - 382.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Vallenet, L. Labarre, Z. Rouy, V. Barbe, S. Bocs, S. Cruveiller, A. Lajus, G. Pascal, C. Scarpelli, and C. Medigue
MaGe: a microbial genome annotation system supported by synteny results
Nucleic Acids Res., January 10, 2006; 34(1): 53 - 65.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Anand, S. K. Verma, and B. Prakash
Structural stabilization of GTP-binding domains in circularly permuted GTPases: implications for RNA binding.
Nucleic Acids Res., January 1, 2006; 34(8): 2196 - 2205.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. Letunic, R. R. Copley, B. Pils, S. Pinkert, J. Schultz, and P. Bork
SMART 5: domains in the context of genomes and networks
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D257 - D260.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Winter, A. Henschel, W. K. Kim, and M. Schroeder
SCOPPI: a structural classification of protein-protein interfaces
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D310 - D314.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Okuda, T. Katayama, S. Kawashima, S. Goto, and M. Kanehisa
ODB: a database of operons accumulating known operons across multiple genomes
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D358 - D362.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Ng, B. Bursteinas, Q. Gao, E. Mollison, and M. Zvelebil
pSTIING: a 'systems' approach towards integrating signalling pathways, interaction and transcriptional regulatory networks in inflammation and cancer
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D527 - D534.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. D. Hooper and P. Bork
Medusa: a simple tool for interaction graph analysis
Bioinformatics, December 15, 2005; 21(24): 4432 - 4433.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
J. REHWINKEL, I. LETUNIC, J. RAES, P. BORK, and E. IZAURRALDE
Nonsense-mediated mRNA decay factors act in concert to regulate common mRNA targets
RNA, October 1, 2005; 11(10): 1530 - 1544.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Goldovsky, P. Janssen, D. Ahren, B. Audit, I. Cases, N. Darzentas, A. J. Enright, N. Lopez-Bigas, J. M. Peregrin-Alvarez, M. Smith, et al.
CoGenT++: an extensive and extensible data environment for computational genomics
Bioinformatics, October 1, 2005; 21(19): 3806 - 3810.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. Dieterich, U. Karst, J. Wehland, and L. Jansch
MineBlast: a literature presentation service supporting protein annotation by data mining of BLAST results
Bioinformatics, August 15, 2005; 21(16): 3450 - 3451.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Sun, J. Xu, Z. Liu, Q. Liu, A. Zhao, T. Shi, and Y. Li
Refined phylogenetic profiles method for predicting protein-protein interactions
Bioinformatics, August 15, 2005; 21(16): 3409 - 3415.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. L. Green and P. D. Karp
Genome annotation errors in pathway databases due to semantic ambiguity in partial EC numbers
Nucleic Acids Res., July 20, 2005; 33(13): 4035 - 4039.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. J. Alm, K. H. Huang, M. N. Price, R. P. Koche, K. Keller, I. L. Dubchak, and A. P. Arkin
The MicrobesOnline Web site for comparative genomics
Genome Res., July 1, 2005; 15(7): 1015 - 1022.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
S. G. Tringe, C. von Mering, A. Kobayashi, A. A. Salamov, K. Chen, H. W. Chang, M. Podar, J. M. Short, E. J. Mathur, J. C. Detter, et al.
Comparative Metagenomics of Microbial Communities
Science, April 22, 2005; 308(5721): 554 - 557.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (494K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by von Mering, C.
Right arrow Articles by Bork, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by von Mering, C.
Right arrow Articles by Bork, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?