Skip Navigation


Nucleic Acids Research Advance Access originally published online on June 5, 2009
Nucleic Acids Research 2009 37(Web Server issue):W122-W128; doi:10.1093/nar/gkp438
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2739K) Freely available
Right arrow Screen PDF (347K) Freely available
Right arrowOA All Versions of this Article:
37/suppl_2/W122    most recent
gkp438v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Blankenburg, H.
Right arrow Articles by Albrecht, M.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blankenburg, H.
Right arrow Articles by Albrecht, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2009, Vol. 37, No. suppl_2 W122-W128
© 2009 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Articles

DASMIweb: online integration, analysis and assessment of distributed protein interaction data

Hagen Blankenburg*, Fidel Ramírez, Joachim Büch and Mario Albrecht

Max Planck Institute for Informatics, Campus E1.4, 66123 Saarbrücken, Germany

*To whom correspondence should be addressed. Tel: +49 681 9325 328; Fax: +49 681 9325 399; Email: hagen.blankenburg{at}mpi-inf.mpg.de

Received February 28, 2009. Revised April 25, 2009. Accepted May 11, 2009.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
In recent years, we have witnessed a substantial increase of the amount of available protein interaction data. However, most data are currently not readily accessible to the biologist at a single site, but scattered over multiple online repositories. Therefore, we have developed the DASMIweb server that affords the integration, analysis and qualitative assessment of distributed sources of interaction data in a dynamic fashion. Since DASMIweb allows for querying many different resources of protein and domain interactions simultaneously, it serves as an important starting point for interactome studies and assists the user in finding publicly accessible interaction data with minimal effort. The pool of queried resources is fully configurable and supports the inclusion of own interaction data or confidence scores. In particular, DASMIweb integrates confidence measures like functional similarity scores to assess individual interactions. The retrieved results can be exported in different file formats like MITAB or SIF. DASMIweb is freely available at http://www.dasmiweb.de.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
Protein interactions play an important role in many cellular processes (1). Different small- and large-scale experimental techniques together with the manual curation of the scientific literature as well as numerous computational prediction methods generate ever increasing amounts of publicly accessible protein interaction data (2). However, this rapid accumulation of data renders it difficult for researchers to keep track of all available information because they are scattered over multiple online repositories. As of April 2009, the pathway resource list Pathguide (3) gives the impressive number of 118 databases providing protein interaction data. Some of these projects are highly specialized and focus, for example, on interactions of molecular subcomponents or specific classes of proteins, on specific diseases or organisms, or on experimentally observed or computationally predicted interactions. Moreover, doubts have been raised about the quality and reliability of protein interaction data and particular detection methods (2,4,5).

Databases that collect and curate experimentally observed protein–protein interactions reported in the literature (6–13) are essential pillars of interactomics, but they cover only a small fraction of the complete set of interactions, and thus proteome-wide predictions are also required (2,4). All these efforts have resulted in a multitude of resources that the user has to query individually. Initiatives like IMEx (14) that promote data exchange between some of the databases are very important, but are still in an early implementation phase. One of the possible solutions to integrate protein interaction data is the creation of data warehouses as composite databases that centrally store and merge the available data from multiple sources (10,11,15–23). However, the static data unification procedure underlying data warehouses has the considerable drawback of providing only a snapshot of a fixed number of data sources at a certain point of time. Once the data have been included into the central repository, curation efforts are required to keep it up to date and in sync with the original data sources. Furthermore, data warehouses are rather inflexible as the inclusion of additional datasets, for example, new experimental or predicted data or improved confidence scores, can normally be accomplished solely by the central authority and not by the user.

In the context of the European BioSapiens network (24), we have developed DASMIweb as a gateway to interactome data from multiple resources. In contrast to composite databases, data are not stored in a local repository, but queries are distributed to the original data sources and the unified results are displayed (25). Due to this novel realization as a distributed and dynamic system, DASMIweb bypasses the inherent rigidity of static databases and addresses their problem of data update cycles. In addition, DASMIweb allows access to distributed servers with confidence scores, which can be used to evaluate the quality of individual interactions with different scoring methods.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
Distributed architecture
The fundamental concept of DASMIweb is decentralization (Figure 1). Here, the interaction data remain distributed with their original providers instead of being periodically aggregated into central data repositories (10,11,15–23). Subsequent to a user request, DASMIweb independently queries each original data provider for interactions, additional annotations, and interaction confidence scores. Then it unifies the retrieved results and presents them to the user.


Figure 1
View larger version (40K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. Decentralized architecture of DASMIweb. Data sources for protein and domain interactions as well as for interaction confidence scores are distributed over the Internet and are contacted by DASMIweb upon user request.

 
The technical architecture of DASMIweb, based on an extension of the Distributed Annotation System (DAS) (25,26) and different types of web services (27,28), has the great advantage of being easily extendable with new data sources. In addition, data update cycles every few weeks or months are not necessary because all data is left in its source database and is only retrieved on request. This use of a distributed architecture greatly empowers the end-user who can instantly add data sources, for instance, own private interactions or the results of an improved confidence scoring method. DASMIweb also gives data providers the possibility to easily share their results without the time-consuming development of own web interfaces. Since the distributed architecture supported by DASMIweb is driven by the community that provides the contents, there is no need for central authorities to decide on the available resources. This is exemplarily evident in the case of confidence scoring methods, which can be based on various criteria, for instance, co-expression, co-localization, functional co-annotation, network topology and evolutionary conservation. Apparently, it would be impractical to implement and maintain all these different methods at a single site. Instead, each method developed by some independent institution can be queried through DASMIweb. It is noteworthy that our decentralization approach has also found the interest of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO), which is currently defining standards for distributed interaction data retrieval and interaction confidence scoring (28). We are actively contributing to these projects and all servers developed in this context are accessible via DASMIweb (Table 1).


View this table:
[in this window]
[in a new window]

 
Table 1. Interaction datasets currently available as data sources through DASMIweb and corresponding references

 
Data sources
DASMIweb has been developed to support different levels of molecular interactions, for example, interactions of proteins as well as of protein domains. In the following, we will refer to the distributed data servers that provide interactions, additional annotations or interaction confidence scores, as data sources (in contrast to our server DASMIweb). As of April 2009, DASMIweb provides access to 35 data sources containing experimentally determined and computationally derived protein and domain interaction datasets (Table 1). In addition, there are two data sources for scoring the confidence of protein–protein interactions.

In our current setup, the majority of data sources are temporarily cached and maintained at our institute, even if this appears to be a contradiction to the actual DASMIweb aim of leaving the interaction data with their original providers. At present, this setup is unavoidable for demonstrating the capabilities of our system; otherwise, many more external data source providers would already be needed from the beginning. Nevertheless, several major protein interaction databases like BioGrid (10), IntAct (12) and MINT (6) are not cached as they already support external web service access to their data. Moreover, since most of the cached datasets are the results of single studies, they are not updated by their authors and do not require any maintenance efforts. The other data sources are updated by us on a monthly basis. Of course, we will replace a temporary cached source as soon as the respective original provider supports web service access to its data.

The current selection of data sources listed in Table 1 is only a snapshot, additional resources for interactions and confidence scores are currently being prepared at other institutions for public access in the near future. At the moment, there are two alternative ways for providing new data sources. The first option is the download of a server library from our website http://www.dasmi.de. This software, available as Java and Perl implementations, parses interaction data from several standard file formats and serves them in a format supported by DASMIweb (25). An online tutorial on setting up own data sources is available on our website. Data sources that provide confidence scores are handled like sources that contain interaction data and can be set up using the same software library. The second option is the implementation of a web service that follows the standard currently defined by HUPO-PSI (28). However, as this standard is not yet published, it might still evolve.

Identifier mapping
Proteomics research uses a substantial diversity of object identifiers for describing genes, proteins or protein domains. Accordingly, interaction datasets use a variety of identifier systems for their data (2). In order to unify them, DASMIweb maintains internal mapping tables derived from iProClass (29) and Pfam (30) to convert identifiers between the different systems. For protein interactions, DASMIweb currently supports the identifier systems Ensembl (31), Entrez Gene (32), Entrez Geneinfo (32), RefSeq (32) and UniProtKB (33). In the following, we will refer to these identifier systems as compatible systems because mappings exist between them. The mappings enable DASMIweb to merge results from data sources that employ different but compatible identifier system. For example, if the user requests all huntingtin interactions known for the UniProtKB protein ‘HD_HUMAN’, DASMIweb will automatically convert this identifier to the Entrez Gene identifier ‘3064’, the RefSeq identifier ‘NP_002102.4’, and all other compatible identifier systems described above. Subsequently, data sources providing interactions in the Entrez Gene or RefSeq identifier system will be queried in addition to data sources that use the requested UniProtKB identifier system. The final unification of interactions from different data sources is performed by converting all identifiers to Entrez Gene identifiers. Therefore, the DASMIweb results are independent of the particular identifier system used for querying; in the prior example, a query for ‘HD_HUMAN’ will return the same results as the one for ‘NP_002102.4’ or ‘3064’.

It should be noted that the identifier mapping procedure can result in considerable, but unavoidable, computational overhead. While there is usually a one-to-one mapping from UniProtKB to Entrez Gene identifiers, mapping in the opposite direction may produce multiple results as one gene can be responsible for several protein variants or fragments. Therefore, in the exemplary case when the user queries DASMIweb with the Entrez Gene identifier ‘3064’, it will be converted to two UniProtKB entries ‘Q59FF4’ and ‘P42858’. Consequently, all interactions reported for the two protein variants or fragments will be included. Fortunately, the identifier diversity for domain interaction datasets is less problematic as stable Pfam identifiers (30) are predominantly used.


    USER INTERFACE
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
Our primary goal while designing DASMIweb was user-friendliness, which we tried to achieve by an intuitive user interface and a clear representation of the results. In addition, several tutorials on our website guide the user through potential query and analysis tasks. The most important parts of the user interface are the Query, Information, and Interaction Panels (Figure 2). DASMIweb requires a JavaScript-enabled browser for a technology known as Asynchronous JavaScript and XML (AJAX). The AJAX functionality, provided by the Direct Web Remoting framework (http://getahead.org/dwr/), compensates for different data source response times and allows for presenting interaction results to the user as soon as DASMIweb receives them. DASMIweb stores all data associated with a user request in sessions. This means that all interactions, interaction details, confidence scores and all DASMIweb configurations are maintained for half an hour even if the user temporarily leaves our website.


Figure 2
View larger version (60K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. DASMIweb user interface. The screen is separated into the top left Query Panel, the top right Information Panel and the central Interaction Panel. Interactions are presented in tabular form: each column represents a data source, each row contains interaction partner(s), and each square at the intersection of a row and a column indicates a particular interaction. The Gene Ontology-based confidence measure FunSimMat-BPscore is selected, and the interaction squares are colored with a white-to-blue gradient: white for no functional similarity and dark blue for complete similarity.

 
Querying
The Query Panel in the top left corner of the screen only contains a single search field to allow the user straightforward querying. The user does not need to specify the input type; DASMIweb tries to determine it automatically. If the identifier type cannot be resolved unambiguously, the user is asked to refine the query. As detailed above, DASMIweb will not only include all data sources with the same identifier system of the query, but also attempt mapping the query identifier to all compatible identifier systems to include additional sources. DASMIweb converts only between protein (e.g. UniProtKB or RefSeq) and gene (e.g. Entrez Gene) identifier systems, which does not include domain identifier systems like Pfam. All data sources for which a suitable identifier is found will subsequently be queried for interactions.

Result presentation
Information on the query interactor, such as names, synonyms, or external database references, is provided in the Information Panel located in the top right corner of the screen. Interaction results are presented to the user in a table within the central Interaction Panel: table columns represent data sources that have been queried for interactions, rows contain interaction partners (single partners for binary interactions and all partners for protein complexes), and squares in the intersections of rows and columns indicate particular interactions (Figure 2). Different background colors for each data source in the table header highlight the corresponding interaction determination methods (Table 1): green represents sources with data derived from experimental studies or curation of the scientific literature, yellow represents computational predictions. The interaction table is built gradually, and new rows and interaction squares are inserted as soon as results have been retrieved from the data sources. In addition, the interactions can be sorted according to individual table columns or by their frequency of occurrence in all data sources. For the sake of clarity, a tabbed display only shows a user-definable number of interactions per page (initially set to 50); arrows allow for browsing through additional results. Display options like sorting and tabbed browsing can be configured in the myDASMI Panel, which can be opened by clicking on the correspondent box in the middle of the right screen border.

Our tabular representation supports a quick visual assessment of the results, based on the assumption that interactions that are reported in several datasets are more likely to be accurate. To further investigate particular interactions, the user can click on an interaction square and request the display of a new table row with additional information on an interaction and the interaction partner(s). For example, the additional information may include links to the original publication that reported the interaction, information on the experimental settings or conditions, a web link to the full entry in the source database or external database identifiers for the interaction partner(s). The amount of details given in the additional information is primarily defined by the data source providers that reports the respective interaction and not by DASMIweb.

Data source configuration
DASMIweb maintains a list of all publicly available interaction sources (Table 1) in the Source Configuration Panel, which can be opened by pressing the corresponding button in the Query Panel. The sources are grouped according to their identifier system, and basic information like the name, source type, and a description are provided for each entry. As described above, green and yellow background colors indicate different interaction determination methods. A blue background represents data sources useful for interaction confidence scoring. Initially, all data sources are active and will be used for answering queries. The user can deactivate data sources by removing the leading checkmark of the respective entry.

We currently support three approaches for including new data sources in DASMIweb. First, all data sources registered at the central DAS registry (http://www.dasregistry.org) will be available automatically to the DASMIweb users. The second option is the local registration of a data source by providing information like its name, URL, and identifier system. The third option is uploading a PSI-MI XML2.5 file (34) to DASMIweb and temporarily creating a data source from its content. The first two options require that the data source to be added is already set up and accessible over the Internet. In contrast, the third option allows for comparing own interactions with existing datasets or for assessing them by different confidence scoring servers. Another distinction can be made with respect to data privacy: data sources added with the first approach are accessible to all DASMIweb users, while the second and third approaches affect only the respective user session.

Interaction confidence scoring
Despite improved methods for generating protein interaction data (35), current interactomes are still incomplete to a large extent and doubts about the reliability of detection methods remain (2,4,5). Quality assessment is crucial not only for interactions determined by large-scale experimental assays, but also for those curated from scientific literature or obtained by computational prediction methods. Therefore, DASMIweb provides access to specialized data sources (Table 1) and, at the same time, supports the convenient evaluation of the quality of individual protein interactions.

As the distributed retrieval of confidence scores for a large number of interactions can be computationally demanding, it has to be explicitly requested by pressing a button in the header of the interaction table. After retrieval, the different scoring methods can be selected in a drop-down menu next to the same button. This menu also lists all original confidence scores provided by the authors of a source dataset. A brief description of the selected scoring method can be found in the bottom right corner of the screen. Confidence scores are printed atop the interaction squares and are also available as new interaction details. If the scores of a method can be normalized to a range between zero and one, they will additionally be color-coded by a white-to-blue gradient into the respective interaction squares, white for the value zero, dark blue for the value one (Figure 2).

Exporting results
Interaction results can be exported in different file formats, enabling the user to analyze the retrieved data further in other applications. Currently, we support the Simple Interaction Format (SIF), defined by the network analysis and visualization program Cytoscape (36), and the tabular MITAB2.5 format as specified by HUPO-PSI (34).


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
We presented our new web server DASMIweb that supports the online integration, analysis and assessment of distributed sets of molecular interaction data in a dynamic and user-configurable fashion. DASMIweb provides access to over thirty different interaction and confidence scoring resources, which constitutes one of the largest amounts of protein and domain interaction data available through one web interface. In particular, DASMIweb can be used to assess the quality of arbitrary user-defined sets of protein interactions with different confidence scoring methods. Due to the decentralized architecture, users can easily extend DASMIweb by adding further data sources. Additional data sources for providing protein interactions and confidence scoring methods are already expected to be made available within DASMIweb by different external sites in the near future. Furthermore, additional DASMIweb features are currently under development, ranging from support for full-text searches and batch queries for multiple interactors to additional data import and export formats.


    FUNDING
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 
German National Genome Research Network (NGFN) and the German Research Foundation (DFG), contract number KFO 129/1-2. The work was conducted in the context of the DFG-funded Cluster of Excellence for Multimodal Computing and Interaction as well as of the BioSapiens Network of Excellence funded by the European Commission under grant number LSHG-CT-2003-503265. Funding for open access charge: Max Planck Society.

Conflict of interest statement. None declared.


    ACKNOWLEDGEMENTS
 
We are grateful to Dorothea Emig and Sven-Eric Schelhorn for providing several domain interaction datasets and to Andreas Schlicker for his help with the functional similarity scoring servers. We also thank Robert Finn, Andrew Jenkinson, Andreas Prlic and Jonathan Warren for maintaining parts of the DAS infrastructure. In particular, we greatly appreciate the time and work of external providers of interaction data sources because a distributed system can only be successful with broad community support.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 USER INTERFACE
 CONCLUSIONS
 FUNDING
 REFERENCES
 

  1. Frishman D, Albrecht M, Blankenburg H, Bork P, Harrington ED, Hermjakob H, Jensen LJ, Juan DA, Lengauer T, Pagel P, et al. Protein–protein interactions: analysis and prediction. In: Modern Genome Annotation: The Biosapiens Network.—Frishman D, Valencia A, eds. (2009) Wien, Austria: Springer. 353–410.

  2. Ramírez F, Schlicker A, Assenov Y, Lengauer T, Albrecht M. Computational analysis of human protein interaction networks. Proteomics (2007) 7:2541–2552.[CrossRef][Web of Science][Medline]

  3. Bader GD, Cary MP, Sander C. Pathguide: a pathway resource list. Nucleic Acids Res. (2006) 34:D504–D506.[Abstract/Free Full Text]

  4. Venkatesan K, Rual JF, Vazquez A, Stelzl U, Lemmens I, Hirozane-Kishikawa T, Hao T, Zenkner M, Xin X, Goh KI, et al. An empirical framework for binary interactome mapping. Nat. Methods (2009) 6:83–90.[CrossRef][Web of Science][Medline]

  5. Anonymous. Maturing interactions. Nat Methods (2009) 6:2.[CrossRef][Web of Science]

  6. Chatr-aryamontri A, Ceol A, Montecchi-Palazzi L, Nardelli G, Schneider MV, Castagnoli L, Cesareni G. MINT: the Molecular INTeraction database. Nucleic Acids Res. (2007) 35:D572–D574.[Abstract/Free Full Text]

  7. Güldener U, Martin M, Oesterheld M, Pagel P, Ruepp A, Mewes H.-W, Volker S. MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. (2006) 34:D436–D441.[Abstract/Free Full Text]

  8. Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D. The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. (2004) 32:D449–D451.[Abstract/Free Full Text]

  9. Bader GD, Betel D, Hogue CWV. BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res. (2003) 31:248–250.[Abstract/Free Full Text]

  10. Breitkreutz BJ, Stark C, Reguly T, Boucher L, Breitkreutz A, Livstone M, Oughtred R, Lackner DH, Bahler J, Wood V, et al. The BioGRID interaction database: 2008 update. Nucleic Acids Res. (2008) 36:D637–D640.[Abstract/Free Full Text]

  11. Goll J, Rajagopala SV, Shiau SC, Wu H, Lamb BT, Uetz P. MPIDB: the microbial protein interaction database. Bioinformatics (2008) 24:1743–1744.[Abstract/Free Full Text]

  12. Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, et al. IntAct—open source resource for molecular interaction data. Nucleic Acids Res. (2007) 35:D561–D565.[Abstract/Free Full Text]

  13. Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, et al. Human Protein Reference Database—2009 update. Nucleic Acids Res. (2009) 37:D767–D772.[Abstract/Free Full Text]

  14. Orchard S, Kerrien S, Jones P, Ceol A, Chatr-aryamontri A, Salwinski L, Nerothin J, Hermjakob H. Submit your interaction data the IMEx way. Proteomics (2007) 7:28–34.[CrossRef][Medline]

  15. Wu J, Vallenius T, Ovaska K, Westermarck J, Makela TP, Hautaniemi S. Integrated network analysis platform for protein–protein interactions. Nat. Methods (2009) 6:75–77.[CrossRef][Web of Science][Medline]

  16. Chaurasia G, Malhotra S, Russ J, Schnoegl S, Hanig C, Wanker EE, Futschik ME. UniHI 4: new tools for query, analysis and visualization of the human protein-protein interactome. Nucleic Acids Res. (2009) 37:D657–D660.[Abstract/Free Full Text]

  17. Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, et al. STRING 8—a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. (2009) 37:D412–D416.[Abstract/Free Full Text]

  18. Pagel P, Oesterheld M, Tovstukhina O, Strack N, Stümpflen V, Frishman D. DIMA 2.0-predicted and known domain interactions. Nucleic Acids Res. (2008) 36:D651–D655.[Abstract/Free Full Text]

  19. Prieto C, Rivas JDL. APID: Agile Protein Interaction DataAnalyzer. Nucleic Acids Res. (2006) 34:W298–W302.[Abstract/Free Full Text]

  20. Raghavachari B, Tasneem A, Przytycka TM, Jothi R. DOMINE: a database of protein domain interactions. Nucleic Acids Res. (2008) 36:D656–D661.[Abstract/Free Full Text]

  21. Tarcea VG, Weymouth T, Ade A, Bookvich A, Gao J, Mahavisno V, Wright Z, Chapman A, Jayapandian M, Ozgur A, et al. Michigan molecular interactions r2: from interacting proteins to pathways. Nucleic Acids Res. (2009) 37:D642–D646.[Abstract/Free Full Text]

  22. Razick S, Magklaras G, Donaldson IM. iRefIndex: a consolidated protein interaction database with provenance. BMC Bioinformatics (2008) 9:405.[CrossRef][Medline]

  23. Li D, Liu W, Liu Z, Wang J, Liu Q, Zhu Y, He F. PRINCESS, a protein interaction confidence evaluation system with multiple data sources. Mol. Cell Proteomics (2008) 7:1043–1052.[Abstract/Free Full Text]

  24. Thornton J. Annotations for all by all—the BioSapiens network. Genome Biol. (2009) 10:401.[Medline]

  25. Jenkinson AM, Albrecht M, Birney E, Blankenburg H, Down T, Finn RD, Hermjakob H, Hubbard TJP, Jiménez RC, Jones P, et al. Integrating biological data – the Distributed Annotation System. BMC Bioinformatics (2008) 9(Suppl. 8):S3.

  26. Dowell RD, Jokerst RM, Day A, Eddy SR, Stein L. The Distributed Annotation System. BMC Bioinformatics (2001) 2:7.[CrossRef][Medline]

  27. Stein LD. Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges. Nat. Rev. Genet. (2008) 9:678–688.[CrossRef][Web of Science][Medline]

  28. Orchard S, Albar JP, Deutsch EW, Binz PA, Jones AR, Creasy D, Hermjakob H. Annual Spring Meeting of the Proteomics Standards Initiative, 23–25 April 2008, Toledo, Spain. Proteomics (2008) 8:4168–4172.[CrossRef][Web of Science][Medline]

  29. Huang H, Barker WC, Chen Y, Wu CH. iProClass: an integrated database of protein family, function and structure information. Nucleic Acids Res. (2003) 31:390–392.[Abstract/Free Full Text]

  30. Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz H.-R, Ceric G, Forslund K, Eddy SR, Sonnhammer ELL, et al. The Pfam protein families database. Nucleic Acids Res. (2008) 36:D281–D288.[Abstract/Free Full Text]

  31. Hubbard TJ, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L, et al. Ensembl 2009. Nucleic Acids Res. (2009) 37:D690–D697.[Abstract/Free Full Text]

  32. Sayers EW, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. (2009) 37:D5–D15.[Abstract/Free Full Text]

  33. The Universal Protein Resource (UniProt) 2009. Nucleic Acids Res. (2009) 37:D169–D174.[Abstract/Free Full Text]

  34. Kerrien S, Orchard S, Montecchi-Palazzi L, Aranda B, Quinn A, Vinod N, Bader G, Xenarios I, Wojcik J, Sherman D, et al. Broadening the horizon—level 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol. (2007) 5:44.[CrossRef][Medline]

  35. Braun P, Tasan M, Dreze M, Barrios-Rodiles M, Lemmens I, Yu H, Sahalie JM, Murray RR, Roncari L, de Smet AS, et al. An experimentally derived confidence score for binary protein-protein interactions. Nat. Methods (2009) 6:91–97.[CrossRef][Web of Science][Medline]

  36. Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, et al. Integration of biological networks and gene expression data using Cytoscape. Nat. Protoc. (2007) 2:2366–2382.[CrossRef][Web of Science][Medline]

  37. Rual J-F, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature (2005) 437:1173–1178.[CrossRef][Medline]

  38. Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S, et al. A human protein–protein interaction network: a resource for annotating the proteome. Cell (2005) 122:957–968.[CrossRef][Web of Science][Medline]

  39. McDermott J, Bumgarner R, Samudrala R. Functional annotation from predicted protein interaction networks. Bioinformatics (2005) 21:3217–3226.[Abstract/Free Full Text]

  40. Rhodes DR, Tomlins SA, Varambally S, Mahavisno V, Barrette T, Kalyana-Sundaram S, Ghosh D, Pandey A, Chinnaiyan AM. Probabilistic model of the human protein-protein interaction network. Nat. Biotechnol. (2005) 23:951–959.[CrossRef][Web of Science][Medline]

  41. Persico M, Ceol A, Gavrila C, Hoffmann R, Florio A, Cesareni G. HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms. BMC Bioinformatics (2005) 6(Suppl. 4):S21.

  42. Brown KR, Jurisica I. Online Predicted Human Interaction Database. Bioinformatics (2005) 21:2076–2082.[Abstract/Free Full Text]

  43. Huang T-W, Tien A-C, Huang W.-S, Lee Y-CG, Peng C-L, Tseng H-H, Kao C-Y, Huang C-YF. POINT: a database for the prediction of protein–protein interactions based on the orthologous interactome. Bioinformatics (2004) 20:3273–3276.[Abstract/Free Full Text]

  44. Lehner B, Fraser AG. A first-draft human protein-interaction map. Genome Biol (2004) 5:R63.[CrossRef][Medline]

  45. Stein A, Panjkovich A, Aloy P. 3did update: domain–domain and peptide-mediated interactions of known 3D structure. Nucleic Acids Res. (2009) 37:D300–D304.[Abstract/Free Full Text]

  46. Finn RD, Marshall M, Bateman A. iPfam: visualization of protein–protein interactions in PDB at domain and amino acid resolutions. Bioinformatics (2005) 21:410–412.[Abstract/Free Full Text]

  47. Bordner AJ, Gorin AA. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces. BMC Bioinformatics (2008) 9:234.[CrossRef][Medline]

  48. Wang R-S, Wang Y, Wu L-Y, Zhang X-S, Chen L. Analysis on multi-domain cooperation for predicting protein–protein interactions. BMC Bioinformatics (2007) 8:391.[CrossRef][Medline]

  49. Riley R, Lee C, Sabatti C, Eisenberg D. Inferring protein domain interactions from databases of interacting proteins. Genome Biol. (2005) 6:R89.[CrossRef][Medline]

  50. Ng S-K, Zhang Z, Tan S-H. Integrative approach for computationally inferring protein domain interactions. Bioinformatics (2003) 19:923–929.[Abstract/Free Full Text]

  51. Schelhorn SE, Lengauer T, Albrecht M. An integrative approach for predicting interactions of protein regions. Bioinformatics (2008) 24:i35–i41.[Abstract/Free Full Text]

  52. Lee H, Deng M, Sun F, Chen T. An integrated approach to the prediction of domain–domain interactions. BMC Bioinformatics (2006) 7:269.[CrossRef][Medline]

  53. Liu Y, Liu N, Zhao H. Inferring protein–protein interactions through high-throughput interaction data from diverse organisms. Bioinformatics (2005) 21:3279–3285.[Abstract/Free Full Text]

  54. Guimarães KS, Jothi R, Zotenko E, Przytycka TM. Predicting domain–domain interactions using a parsimony approach. Genome Biol. (2006) 7:R104.[CrossRef][Medline]

  55. Jothi R, Cherukuri PF, Tasneem A, Przytycka TM. Co-evolutionary analysis of domains in interacting proteins reveals insights into domain–domain interactions mediating protein–protein interactions. J. Mol. Biol. (2006) 362:861–875.[CrossRef][Web of Science][Medline]

  56. Chen X-W, Liu M. Prediction of protein-protein interactions using random decision forest framework. Bioinformatics (2005) 21:4394–4400.[Abstract/Free Full Text]

  57. Wuchty S. Topology and weights in a protein domain interaction network—a novel way to predict protein interactions. BMC Genomics (2006) 7:122.[CrossRef][Medline]

  58. Schlicker A, Albrecht M. FunSimMat: a comprehensive functional similarity database. Nucleic Acids Res. (2008) 36:D434–D439.[Abstract/Free Full Text]

  59. Schlicker A, Domingues FS, Rahnenführer J, Lengauer T. A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinformatics (2006) 7:302.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2739K) Freely available
Right arrow Screen PDF (347K) Freely available
Right arrowOA All Versions of this Article:
37/suppl_2/W122    most recent
gkp438v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Blankenburg, H.
Right arrow Articles by Albrecht, M.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blankenburg, H.
Right arrow Articles by Albrecht, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?