Nucleic Acids Research, 2005, Vol. 33, Database issue D233-D237
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema
1 San Diego Supercomputer Center and 2 Department of Pharmacology, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA and 3 Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, NJ 08854-8087, USA
* To whom correspondence should be addressed. Tel: +1 858 534 8301; Fax: +1 858 822 0873; Email: bourne{at}sdsc.edu
Received September 15, 2004; Revised and Accepted October 1, 2004
| ABSTRACT |
|---|
|
|
|---|
The Protein Data Bank (PDB) is the central worldwide repository for three-dimensional (3D) structure data of biological macromolecules. The Research Collaboratory for Structural Bioinformatics (RCSB) has completely redesigned its resource for the distribution and query of 3D structure data. The re-engineered site is currently in public beta test at http://pdbbeta.rcsb.org. The new site expands the functionality of the existing site by providing structure data in greater detail and uniformity, improved query and enhanced analysis tools. A new key feature is the integration and searchability of data from over 20 other sources covering genomic, proteomic and disease relationships. The current capabilities of the re-engineered site, which will become the RCSB production site at http://www.pdb.org in late 2005, are described.
| INTRODUCTION |
|---|
|
|
|---|
The production version of the Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) (http://www.pdb.org) is mirrored at seven sites around the world and has been described previously (1,2). In order to improve the accessibility of the PDB's structure data, display the increased level of detail and improved consistency resulting from the RCSB PDB data uniformity project (3,4), and take advantage of advances in database and Web/Internet technologies, the RCSB PDB has re-engineered its database and redesigned the associated website. This site is now available for beta testing at http://pdbbeta.rcsb.org and will henceforth be referred to as PDB Beta. The following features of PDB Beta have been introduced: software architecture, database content and schema, data integration from other sources, and query and analysis capabilities expanded from those reported previously (5).
| CONTENT |
|---|
|
|
|---|
Software architecture, database content and schema
Using an Enterprise Java framework, the PDB Beta has been redesigned and it is composed of three tiers: an underlying relational database, a presentation tier designed in collaboration with users and an object-relational J2EE middle tier based on Hibernate. The PDB Beta has been tested with both MySQL and IBM DB2 relational database tiers. The current system uses MySQL, which enables unlimited distribution of the primary and secondary data in the RCSB PDB database. Moreover, the current system makes extensive use of freely distributable Java components (Table 1).
|
The database uses an mmCIF-based (6) schema derived from the PDB Exchange Dictionary (7). Data are loaded from either XML (8) or mmCIF data files with their associated remediated and extended content. Data file parser/structure loaders are SQL-92 compliant and therefore independent of the backend database. Data files are parsed and loaded weekly at the same time as the current production site is updated. The design of this system along with its local weekly data updates will facilitate local distribution and use.
Data integration from other sources
From a user's perspective, macromolecular structure does not exist in isolation, but is generally associated with an inquiry that might include relationships to genomic and proteomic sequence, biological function, cellular location and disease. While the focus of the RCSB PDB remains on fully exposing the features of macromolecular structures, a wider spectrum of inquiry is now possible. This is achieved through the weekly collection and integration (warehousing) of external data. For example, data from the Gene Ontology (GO) (9), Enzyme Commission (http://www.chem.qmul.ac.uk/iubmb/enzyme/), KEGG Pathways (10,11) and NCBI resources (including LocusLink, OMIM, SNP and BookShelf) (12) are mapped onto structures and loaded into the database. This is achieved as follows.
Our ongoing data uniformity efforts enable accurate assignment of external database references to structures in the RCSB PDB database; these include identifiers from Swiss-Prot (13), GenBank, PubMed, EC numbers and the taxonomy of the source organism. These references are used to locate information in a further set of databases. For example, Swiss-Prot identifiers were used to assign GO terms from the Gene Ontology Consortium to structures. Swiss-Prot and GenBank identifiers were also used to obtain genome information: gene name, chromosome location, structural genomics targets (14) and OMIM numbers for structures. Figure 1 summarizes the rich and varied linkages that have been established between structure data in the RCSB PDB database and data from external biological databases. Note that many of these data are related to structure through a one-to-many relationship since a structure consists of one or more components, such as multiple polypeptide chains. The representation of structures as a number of constituent components, each with external data assignments, is an ongoing effort at the RCSB PDB.
|
Data loaders written in Java access the external databases, parse the files and load relevant derived information into the database. In some instances additional external information is retrieved at query run time. For example, KEGG pathways associated with a given EC number are retrieved by issuing a Web service call to the KEGG database at query run time. Under an agreement with the US National Library of Medicine, PubMed identifiers for the primary citation associated with a structure are used to load the PubMed abstracts into the RCSB PDB database. In doing so, abstracts can be searched by keyword(s) as an alternative means to find structures of interest.
As a final example of how linkages between incorporated data were generated, consider the relationship between structure and disease (Figure 2). The OMIM text was searched for disease terms obtained from the chapter and section headings in the online book Genes and Disease from the NCBI Bookshelf (http://www.ncbi.nlm.nih.gov/books/bv.fcgi?call=bv.View..ShowSection&rid=gnd.preface.91). The disease term and the OMIM numbers returned for the term were loaded into a relational table, as was a mapping of structures to OMIM numbers from the Swiss-Prot site. The two tables were joined using OMIM numbers and the disease term was thus mapped onto structures. These relationships have enabled the implementation of a hierarchical disease tree suitable for browsing. For example, users can locate all the PDB structures identified as being associated with cancer and drill down to find only those associated with breast cancer.
|
Query and analysis
Browsing
A major feature of PDB Beta is the ability to browse database content. Much of the data now integrated with structure is hierarchical and lends itself to display via tree browsers. When each node in a browser tree is moused over, the number of associated structures at that branch is displayed. This gives the user a sense of the size of the result set, even before a query is made. As some data that are browsed are not strictly hierarchical, concessions are made. An example of this concession is a protein chain that has been associated with multiple GO molecular functions. This protein chain would therefore appear multiple times in the browser tree even though it is only associated with a single structure. Browsers are also useful in reverse; knowing the location of a structure in a tree reveals its place in the hierarchy. For example, entering homo sapiens in the taxonomy browser will locate the structures for which this has been identified as a source organism, and highlight where humans are located in the tree of life, at least according to the NCBI classification scheme. The results of browsing can be used as a starting point for query refinement using the SearchFields interface outlined below.
Searching
The PDB Beta has retained the search capability of the RCSB PDB production site but has provided, based on user feedback, more intuitive interfaces and added additional Website navigation tools. Sequence searching functionality via BLAST (15) has also been added.
SearchLite is based on the Lucene text indexing and search engine, and uses the content of the mmCIF versions of the PDB structures which provides indexes of terms not available in the original PDB files. For example, at the time of submission it may not have been known that the structure was associated with apoptosis. When the term is later added to the list of Swiss-Prot keywords, it will be accessible through SearchLite even though there was no reference in the original PDB file. SearchLite can be considered as an inclusive rather than an exclusive search engine, since it produces results that may require further query refinement. This feature is also available on the current production site.
StatusSearch continues to provide information on deposited but unreleased structures. Sequence data are available ahead of a structure's release for some entries, which is useful for theoretical modeling and avoiding duplication of effort.
SearchFields, an interface for performing advanced queries, has been enhanced to include the full extent of the experimental information that is collected and much of the integrated information outlined above. Particular attention has been paid to NMR structures. A new SearchFields option is to search for structures with specific NMR experimental parameters like refinement method, selection criteria, spectrometer details and sample conditions.
Searches performed during a session are recorded and can be recalled and rerun or modified and rerun. Since the result of one query may trigger a new line of inquiry, we have extended the notion of query by example in which a result from one query can be used as a search term in a subsequent query. For example, a search for the structure with the PDB identifier 1AEW [PDB] displays the Structure Explorer page for the iron storage molecule ferritin (16). According to the GO term for the single polypeptide chain in the asymmetric unit of this structure, it is assigned a molecular function of ferric iron binding. Clicking on this term on the Structure Explorer page will reveal all other structures in the PDB associated with the same molecular function as defined by the same GO term.
A histogram feature in the early stages of development is applied to quantitative data and is accessible from the results display. For example, an X-ray structure is reported with a resolution of 2.5 Å. A novice user may be interested in how that resolution compares with the contents of the complete database. An icon next to the resolution on the structure page can be selected to present a graphical distribution of the resolutions of all the structures in the PDB. Specific ranges can be expanded and the structures in a selected resolution range displayed in a report format.
The PDB Beta is also composed of
1000 curated Web pages, which have been made more accessible through site searching and indexing. All features are in the process of being better documented and made accessible through a context-sensitive help system based on the RoboHelp tool.
Results reporting
Query results are presented on the Structure Explorer page for a single structure or in the Query Results Browser for a set of structures. The Structure Explorer page presents data similar to the format of a scientific paper. This page can be printed as a PDF. Table 2 highlights the new features that are included beyond those available from the current production site. Of the new reports, the Materials and Methods section is customized based on the experiment type. Structure Explorer pages list crystallization, diffraction and refinement information for X-ray structures; and NMR experiment, refinement and ensemble information for NMR entries.
|
Molecular viewers
Recognizing the difficulties that users may have installing the existing molecular viewers, four new, general purpose molecular viewers have been added to the PDB Beta: KiNG, Jmol, SimpleViewer and WebMol applets. KiNG, Jmol and WebMol require Java-enabled browsers; SimpleViewer requires the installation of Java3D to provide high-quality rendering. SimpleViewer is an example of an application built from the Molecular Biology Toolkit (MBT) (http://mbt.sdsc.edu). The concept behind MBT is to deliver simple context-sensitive molecular graphics applications at the appropriate point in a query. So for example, a ligand viewer has been implemented that provides a detailed view of the interaction between a macromolecule and a ligand (HET group in original PDB terminology), but is not designed to be a general-purpose molecular viewer. A SNPviewer, indicating where non-synonymous single nucleotide polymorphisms (SNPs) are mapped onto structures, is another example of the use of a context-sensitive viewer that has been added to the PDB Beta.
Distribution system
The RCSB production and PDB Beta sites distribute data files in the PDB, mmCIF and XML formats. Data for sequences and complete structural descriptions are available in uncompressed as well as various compressed formats. The PDB data can also be obtained using CORBA and Web services. A CORBA server may be established using C++ (http://deposit.pdb.org/mmcif/FILM/) or the Java OpenMMS software (http://openmms.sdsc.edu) (17). Web services, which are currently in the early implementation stage, will allow users to use XML and SOAP to perform queries and retrieve results programmatically from the PDB Beta. The PDB Beta Web Services Definition Language (WSDL) is available at http://pdbbeta.rcsb.org/jboss-net/services/pdbWebService?wsdl.
| CONCLUSION |
|---|
|
|
|---|
While the RCSB PDB's primary mandate continues to be the delivery of high-quality structure data in a timely manner, our services are being expanded. Recognizing that structure exists as a point on a spectrum of biological inquiry, more integrated access to structure data is being provided. Using PDB Beta locally to manage private copies of the PDB data on a laptop or larger computer, and a web interface that can be customized for individual user preferences, either locally or on the RCSB's PDB servers are examples of forthcoming deliverables. Comments and suggestions are always welcome by sending email to betafeedback{at}rcsb.org. Upon completion of the beta testing, anticipated to be in late 2005 the re-engineered site will be available as the PDB production site (http://www.pdb.org).
| ACKNOWLEDGEMENTS |
|---|
The RCSB PDB is operated by Rutgers, The State University of New Jersey; the San Diego Supercomputer Center (SDSC) at the University of California San Diego (UCSD); and the Center for Advanced Research in Biotechnology (CARB/UMBI/NIST)three members of the Research Collaboratory for Structural Bioinformatics. This work is supported by grants from National Science Foundation (NSF), National Institute of General Medical Sciences (NIGMS), Office of Science, Department of Energy (DOE), National Library of Medicine (NLM), National Cancer Institute (NCI), National Center for Research Resources (NCRR), National Institute of Biomedical Imaging and Bioengineering (NIBIB) and National Institute of Neurological Disorders and Stroke (NINDS). The RCSB PDB is a member of wwwPDB. We wish to thank the many users who have provided input into the development of the re-engineered RCSB PDB.
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
|
|
|---|
- Berman,H.M., Westbrook,J., Feng,Z., Gilliland,G., Bhat,T.N., Weissig,H., Shindyalov,I.N. and Bourne,P.E. ( (2000) ) The Protein Data Bank. Nucleic Acids Res., , 28, , 235242.
[Abstract/Free Full Text] . - Bernstein,F.C., Koetzle,T.F., Williams,G.J.B., Meyer,E.F.,Jr, Brice,M.D., Rodgers,J.R., Kennard,O., Shimanouchi,T. and Tasumi,M. ( (1977) ) Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol., , 112, , 535542.[Web of Science][Medline] .
- Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S. et al. ( (2002) ) The Protein Data Bank: unifying the archive. Nucleic Acids Res., , 30, , 245248.
[Abstract/Free Full Text] . - Bhat,T.N., Bourne,P., Feng,Z., Gilliland,G., Jain,S., Ravichandran,V., Schneider,B., Schneider,K., Thanki,N., Weissig,H. et al. ( (2001) ) The PDB data uniformity project. Nucleic Acids Res., , 29, , 214218.
[Abstract/Free Full Text] . - Bourne,P.E., Addess,K.J., Bluhm,W.F., Chen,L., Deshpande,N., Feng,Z., Fleri,W., Green,R., Merino-Ott,J.C., Townsend-Merino,W., Weissig,H., Westbrook,J. and Berman,H.M. ( (2004) ) The distribution and query systems of the RCSB Protein Data Bank. Nucleic Acids Res., , 32, , D223D225.
[Abstract/Free Full Text] . - Bourne,P.E., Berman,H.M., Watenpaugh,K., Westbrook,J.D. and Fitzgerald,P.M.D. ( (1997) ) The macromolecular Crystallographic Information File (mmCIF). Methods Enzymol., , 277, , 571590.[Medline] .
- Westbrook,J., Henrick,K., Ulrich,E.L. and Berman,H.M. ( (2004) ) Definition and exchange of crystallographic data. In International Tables for Crystallography. Kluwer Academic Publishers, Dordrecht, The Netherlands, Vol. G (in press). .
- Westbrook,J., Ito,N., Nakamura,H., Henrick,K. and Berman,H.M. ( (2004) ) PDBML: the representation of archival macromolecular structure data in XML. Bioinformatics, , doi:10.1093/bioinformatics/bti082. .
- The Gene Ontology Consortium ( (2000) ) Gene Ontology: tool for the unification of biology. Nature Genet., , 25, , 2529.[CrossRef][Web of Science][Medline] .
- Kanehisa,M. ( (1997) ) A database for post-genome analysis. Trends Genet., , 13, , 375376.[CrossRef][Web of Science][Medline] .
- Kanehisa,M. and Goto,S. ( (2000) ) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res., , 28, , 2730.
[Abstract/Free Full Text] . - Wheeler,D.L., Church,D.M., Edgar,R., Federhen,S., Helmberg,W., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Sequeira,E. et al. ( (2004) ) Database resources of the National Center for Biotechnology Information: update. Nucleic Acids Res., , 32, , D35D40.
[Abstract/Free Full Text] . - Bairoch,A. and Apweiler,R. ( (2000) ) The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res., , 28, , 4548.
[Abstract/Free Full Text] . - Chen,L., Oughtred,R., Berman,H.M. and Westbrook,J. ( (2004) ) TargetDB: a target registration database for structural genomics projects. Bioinformatics, , 20, , 28602862.
[Abstract/Free Full Text] . - Altschul,S.F., Gish,W., Miller,W., Myers,E.W. and Lipman,D.J. ( (1990) ) Basic local alignment search tool. J. Mol. Biol., , 215, , 403410.[CrossRef][Web of Science][Medline] .
- Hempstead,P.D., Yewdall,S.J., Fernie,A.R., Lawson,D.M., Artymiuk,P.J., Rice,D.W., Ford,G.C. and Harrison,P.M. ( (1997) ) Comparison of the three-dimensional structures of recombinant human H and horse L ferritins at high resolution. J. Mol. Biol., , 268, , 424448.[CrossRef][Web of Science][Medline] .
- Greer,D.S., Westbrook,J.D. and Bourne,P.E. ( (2002) ) An ontology driven architecture for derived representations of macromolecular structure. Bioinformatics, , 18, , 12801281.
[Abstract/Free Full Text] . - Walther,D. ( (1997) ) WebMola Java-based PDB viewer. Trends Biochem. Sci., , 22, , 274275.[CrossRef][Web of Science][Medline] .
- Conte,L.L., Brenner,S.E., Hubbard,T.J., Chothia,C. and Murzin,A.G. ( (2002) ) SCOP database in 2002: refinements accommodate structural genomics. Nucleic Acids Res., , 30, , 264267.
[Abstract/Free Full Text] .
This article has been cited by other articles:
![]() |
C. L. Zirbel, J. E. Sponer, J. Sponer, J. Stombaugh, and N. B. Leontis Classification and energetics of the base-phosphate interactions in RNA Nucleic Acids Res., August 1, 2009; 37(15): 4898 - 4918. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Melamud and J. Moult Structural implication of splicing stochastics Nucleic Acids Res., August 1, 2009; 37(14): 4862 - 4872. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Xie, L. Xie, and P. E. Bourne A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery Bioinformatics, June 15, 2009; 25(12): i305 - i312. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ryan, M. Diekhans, S. Lien, Y. Liu, and R. Karchin LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures Bioinformatics, June 1, 2009; 25(11): 1431 - 1432. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Stombaugh, C. L. Zirbel, E. Westhof, and N. B. Leontis Frequency and isostericity of RNA base pairs Nucleic Acids Res., April 1, 2009; 37(7): 2294 - 2312. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Manning, A. Aggarwal, K. Gao, and G. Tucker-Kellogg Scaling the walls of discovery: using semantic metadata for integrative problem solving Brief Bioinform, March 1, 2009; 10(2): 164 - 176. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Karchin Next generation tools for the annotation of human SNPs Brief Bioinform, January 1, 2009; 10(1): 35 - 52. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Czerwoniec, S. Dunin-Horkawicz, E. Purta, K. H. Kaminska, J. M. Kasprzak, J. M. Bujnicki, H. Grosjean, and K. Rother MODOMICS: a database of RNA modification pathways. 2008 update Nucleic Acids Res., January 1, 2009; 37(suppl_1): D118 - D121. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Pieper, N. Eswar, B. M. Webb, D. Eramian, L. Kelly, D. T. Barkan, H. Carter, P. Mankoo, R. Karchin, M. A. Marti-Renom, et al. MODBASE, a database of annotated comparative protein structure models and associated resources Nucleic Acids Res., January 1, 2009; 37(suppl_1): D347 - D354. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Schmidt, S. Struck, B. Gruening, J. Hossbach, I. S. Jaeger, R. Parol, U. Lindequist, E. Teuscher, and R. Preissner SuperToxic: a comprehensive database of toxic compounds Nucleic Acids Res., January 1, 2009; 37(suppl_1): D295 - D299. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Morcos, C. Lamanna, M. Sikora, and J. Izaguirre Cytoprophet: a Cytoscape plug-in for protein and domain interaction networks inference Bioinformatics, October 1, 2008; 24(19): 2265 - 2266. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Lesburg and J. S. Duca soaPDB: a web application for searching the Protein Data Bank, organizing results, and receiving automatic email alerts Nucleic Acids Res., July 1, 2008; 36(suppl_2): W252 - W254. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Xie and P. E. Bourne Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments PNAS, April 8, 2008; 105(14): 5441 - 5446. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Arauzo-Bravo and A. Sarai Indirect readout in drug-DNA recognition: role of sequence-dependent DNA conformation Nucleic Acids Res., February 2, 2008; 36(2): 376 - 386. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Singh, A. Olowoyeye, P. H. Baenziger, J. Dantzer, M. G. Kann, P. Radivojac, R. Heiland, and S. D. Mooney MutDB: update on development of tools for the biochemical analysis of genetic variation Nucleic Acids Res., January 11, 2008; 36(suppl_1): D815 - D819. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Gunther, M. Kuhn, M. Dunkel, M. Campillos, C. Senger, E. Petsalaki, J. Ahmed, E. G. Urdiales, A. Gewiess, L. J. Jensen, et al. SuperTarget and Matador: resources for exploring drug-target relationships Nucleic Acids Res., January 11, 2008; 36(suppl_1): D919 - D922. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Rattei, P. Tischler, R. Arnold, F. Hamberger, J. Krebs, J. Krumsiek, B. Wachinger, V. Stumpflen, and W. Mewes SIMAP structuring the network of protein similarities Nucleic Acids Res., January 11, 2008; 36(suppl_1): D289 - D292. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Tung and J.-M. Yang fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies Nucleic Acids Res., July 13, 2007; 35(suppl_2): W438 - W443. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Via, D. Peluso, P. F. Gherardini, E. de Rinaldis, T. Colombo, G. Ausiello, and M. Helmer-Citterich 3dLOGO: a web server for the identification, analysis and use of conserved protein substructures Nucleic Acids Res., July 13, 2007; 35(suppl_2): W416 - W419. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-H. Wong, T.-Y. Lee, H.-K. Liang, C.-M. Huang, T.-Y. Wang, Y.-H. Yang, C.-H. Chu, H.-D. Huang, M.-T. Ko, and J.-K. Hwang KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns Nucleic Acids Res., July 13, 2007; 35(suppl_2): W588 - W594. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-C. Chen, Y.-S. Lo, W.-C. Hsu, and J.-M. Yang 3D-partner: a web server to infer interacting partners and binding models Nucleic Acids Res., July 13, 2007; 35(suppl_2): W561 - W567. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yu, P.-A. Genest, B. ter Riet, K. Sweeney, C. DiPaolo, R. Kieft, E. Christodoulou, A. Perrakis, J. M. Simmons, R. P. Hausinger, et al. The protein that binds to DNA base J in trypanosomatids has features of a thymidine hydroxylase Nucleic Acids Res., April 1, 2007; 35(7): 2107 - 2115. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wilson, M. Madera, C. Vogel, C. Chothia, and J. Gough The SUPERFAMILY database in 2007: families and functions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D308 - D313. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Combet, N. Garnier, C. Charavay, D. Grando, D. Crisan, J. Lopez, A. Dehne-Garcia, C. Geourjon, E. Bettler, C. Hulo, et al. euHCVdb: the European hepatitis C virus database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D363 - D366. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Berman, K. Henrick, H. Nakamura, and J. L. Markley The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data Nucleic Acids Res., January 12, 2007; 35(suppl_1): D301 - D303. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, K. J. Addess, J. Chen, L. Y. Geer, J. He, S. He, S. Lu, T. Madej, A. Marchler-Bauer, P. A. Thiessen, et al. MMDB: annotating protein sequences with Entrez's 3D-structure database Nucleic Acids Res., January 12, 2007; 35(suppl_1): D298 - D300. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Murcia, J. D. Faraldo-Gomez, F. R. Maxfield, and B. Roux Modeling the structure of the StART domains of MLN64 and StAR proteins in complex with cholesterol J. Lipid Res., December 1, 2006; 47(12): 2614 - 2630. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Friedrich, B. Pils, T. Dandekar, J. Schultz, and T. Muller Modelling interaction sites in protein domains with interaction profile hidden Markov models Bioinformatics, December 1, 2006; 22(23): 2851 - 2857. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Espadaler, E. Querol, F. X. Aviles, and B. Oliva Identification of function-associated loop motifs and application to protein function prediction Bioinformatics, September 15, 2006; 22(18): 2237 - 2243. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Reumers, S. Maurer-Stroh, J. Schymkowitz, and F. Rousseau SNPeffect v2.0: a new step in investigating the molecular phenotypic effects of human non-synonymous SNPs Bioinformatics, September 1, 2006; 22(17): 2183 - 2185. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-M. Yang and C.-H. Tung Protein structure database search and evolutionary classification Nucleic Acids Res., August 2, 2006; 34(13): 3646 - 3659. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. S. A. Maaty, A. C. Ortmann, M. Dlakic, K. Schulstad, J. K. Hilmer, L. Liepold, B. Weidenheft, R. Khayat, T. Douglas, M. J. Young, et al. Characterization of the Archaeal Thermophile Sulfolobus Turreted Icosahedral Virus Validates an Evolutionary Link among Double-Stranded DNA Viruses from All Domains of Life. J. Virol., August 1, 2006; 80(15): 7625 - 7635. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. R. Gabdoulline, S. Ulbrich, S. Richter, and R. C. Wade ProSAT2--Protein Structure Annotation Server. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W79 - W83. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-H. Hung, H.-D. Huang, and T.-Y. Lee ProKware: integrated software for presenting protein structural properties in protein tertiary structures. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W89 - W94. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-C. Chen, J.-K. Hwang, and J.-M. Yang (PS)2: protein structure prediction server. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W152 - W157. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Vullo, O. Bortolami, G. Pollastri, and S. C. E. Tosatto Spritz: a server for the prediction of intrinsically disordered regions in protein sequences using kernel machines. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W164 - W168. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Martin, V. Catherinot, and G. Labesse kinDOCK: a tool for comparative docking of protein kinase ligands. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W325 - W329. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L Marsden, J. A.G Ranea, A. Sillero, O. Redfern, C. Yeats, M. Maibaum, D. Lee, S. Addou, G. A Reeves, T. J Dallman, et al. Exploiting protein structure data to explore the evolution of protein function and biological complexity Phil Trans R Soc B, March 29, 2006; 361(1467): 425 - 440. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Dunin-Horkawicz, A. Czerwoniec, M. J. Gajda, M. Feder, H. Grosjean, and J. M. Bujnicki MODOMICS: a database of RNA modification pathways Nucleic Acids Res., January 1, 2006; 34(suppl_1): D145 - D149. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. H. Wu, R. Apweiler, A. Bairoch, D. A. Natale, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, et al. The Universal Protein Resource (UniProt): an expanding universe of protein information Nucleic Acids Res., January 1, 2006; 34(suppl_1): D187 - D191. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Castrignano, P. D. De Meo, D. Cozzetto, I. G. Talamo, and A. Tramontano The PMDB Protein Model Database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D306 - D309. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Hinrichs, D. Karolchik, R. Baertsch, G. P. Barber, G. Bejerano, H. Clawson, M. Diekhans, T. S. Furey, R. A. Harte, F. Hsu, et al. The UCSC Genome Browser Database: update 2006 Nucleic Acids Res., January 1, 2006; 34(suppl_1): D590 - D598. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-Y. Lee, H.-D. Huang, J.-H. Hung, H.-Y. Huang, Y.-S. Yang, and T.-H. Wang dbPTM: an information repository of protein post-translational modification Nucleic Acids Res., January 1, 2006; 34(suppl_1): D622 - D627. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Goldovsky, P. Janssen, D. Ahren, B. Audit, I. Cases, N. Darzentas, A. J. Enright, N. Lopez-Bigas, J. M. Peregrin-Alvarez, M. Smith, et al. CoGenT++: an extensive and extensible data environment for computational genomics Bioinformatics, October 1, 2005; 21(19): 3806 - 3810. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Ausiello, A. Zanzoni, D. Peluso, A. Via, and M. Helmer-Citterich pdbFun: mass selection and fast comparison of annotated PDB residues Nucleic Acids Res., July 1, 2005; 33(suppl_2): W133 - W137. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Kulkarni-Kale, S. Bhosle, and A. S. Kolaskar CEP: a conformational epitope prediction server Nucleic Acids Res., July 1, 2005; 33(suppl_2): W168 - W171. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Dantzer, C. Moad, R. Heiland, and S. Mooney MutDB services: interactive structural analysis of mutation data Nucleic Acids Res., July 1, 2005; 33(suppl_2): W311 - W314. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kleinjung and F. Fraternali POPSCOMP: an automated interaction analysis of biomolecular complexes Nucleic Acids Res., July 1, 2005; 33(suppl_2): W342 - W346. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. I. Olason Integrating protein annotation resources through the Distributed Annotation System Nucleic Acids Res., July 1, 2005; 33(suppl_2): W468 - W470. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Hao, W.-Z. He, Y. Huang, L.-X. Ma, Y. Xu, H. Xi, C. Wang, B.-S. Liu, J.-M. Wang, Y.-X. Li, et al. MPSS: an integrated database system for surveying a set of proteins Bioinformatics, May 1, 2005; 21(9): 2142 - 2143. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Brooksbank, G. Cameron, and J. Thornton The European Bioinformatics Institute's data resources: towards systems biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








