Skip Navigation

Nucleic Acids Research 2006 34(Database Issue):D204-D206; doi:10.1093/nar/gkj103
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (57K) Freely available
Right arrow Screen PDF (59K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kumar, M. D. S.
Right arrow Articles by Sarai, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kumar, M. D. S.
Right arrow Articles by Sarai, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, Database issue D204-D206
© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions

M. D. Shaji Kumar1, K. Abdulla Bava1, M. Michael Gromiha2, Ponraj Prabakaran3, Koji Kitajima1, Hatsuho Uedaira4 and Akinori Sarai*

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology (KIT) 680-4 Kawazu Iizuka, 820-8502 Japan 1Advanced Technology Institute, Inc. (ATI) 2-3-13-103 Tate, Shiki, Saitama 353-0006, Japan 2Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST) AIST Tokyo Waterfront Bio-IT Research Building, 2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan 3Laboratory of Experimental and Computational Biology, NCI, NIH Frederick, MD 21702, USA 4Tsukuba Materials Information Laboratory 3-23-4, Ninomiya, Tsukuba, 305-0051 Japan

*To whom correspondence should be addressed. Tel: +81 948 29 7811; Fax: +81 948 29 7841; Email: sarai{at}bse.kyutech.ac.jp

Received September 15, 2005. Revised October 14, 2005. Accepted October 14, 2005.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
ProTherm and ProNIT are two thermodynamic databases that contain experimentally determined thermodynamic parameters of protein stability and protein–nucleic acid interactions, respectively. The current versions of both the databases have considerably increased the total number of entries and enhanced search interface with added new fields, improved search, display and sorting options. As on September 2005, ProTherm release 5.0 contains 17 113 entries from 771 proteins, retrieved from 1497 scientific articles (~20% increase in data from the previous version). ProNIT release 2.0 contains 4900 entries from 273 research articles, representing 158 proteins. Both databases can be queried using WWW interfaces. Both quick search and advanced search are provided on this web page to facilitate easy retrieval and display of the data from these databases. ProTherm is freely available online at http://gibk26.bse.kyutech.ac.jp/jouhou/Protherm/protherm.html and ProNIT at http://gibk26.bse.kyutech.ac.jp/jouhou/pronit/pronit.html.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
Thermodynamic database for proteins and mutants (ProTherm) and thermodynamic database for protein–nucleic acid interactions (ProNIT) are two comprehensive, integrated databases that document experimentally determined thermodynamic parameters published in the literature. Both ProTherm (14) and ProNIT (5) include several thermodynamic parameters along with sequence and structural information, experimental methods and conditions, and literature information. Recent years have seen tremendous progress in studies on proteins owing to the development of various experimental methods to analyze proteins at the genome scale. The correlation between structure and thermodynamics of these key molecules provides valuable insights into the way in which they function. Even though the information is available in scientific journals, books (6,7) and literature databases, retrieving useful, specific data from these resources is time consuming and laborious. Our major goal in developing these databases is to provide the scientific community a single, comprehensive data repository, where all the thermodynamic data related to protein stability and protein–nucleic acid interactions are available. The availability of such thermodynamic databases would be a valuable resource for understanding the protein folding mechanism, protein stability, molecular recognition and gene expressions. This can lead to a wide spectrum of applications such as developing algorithms/methods for prediction systems, protein engineering and quantitative simulation of gene regulatory networks. The thermodynamic data available in ProTherm and ProNIT are widely used by researchers to study the underlying mechanisms of protein stability upon mutations and protein–nucleic acid interactions (see the reference sections on both the websites). This paper describes the major updates and enhancements to these databases for the last few years.


    CONTENT, ORGANIZATION AND DATA COLLECTION
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
Both the databases contain information on protein, mutational information, experimental methods and conditions, several thermodynamic parameters and literature information. Previous publications (15) explain in detail the content and organization of the databases. Table 1 summarizes the contents of ProTherm and ProNIT. ProTherm and ProNIT are implemented in 3DinSight (8), a relational database system for structure, function and property of biomolecules. This facilitates more efficient search and retrieval of data by flexible queries, and enables users to gain insight into the relationship among structure, thermodynamics and function of proteins. We have been collecting the thermodynamic data from published original articles, by searching the PubMed literature database with a combination of specific terms, as well as by searching online journals probably containing thermodynamic data. The database does not contain any predicted or computational interaction data. Researchers then extract the relevant data from the selected articles. The input data are checked automatically by checking programs and also manually to avoid errors. Then, we upload the data first to a test site, where expert curators check and verify the data. After this checking, we upload the data to the public site for users. Furthermore, an email notification for each input entry is sent to the corresponding author, which enables the authors to check their own data and thereby improve the data validation.


View this table:
[in this window]
[in a new window]
 
Table 1 Contents of ProTherm and ProNIT

 

    DATABASE STATISTICS
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
We update both the databases frequently. The current release, ProTherm 5.0 contains 17 113 entries from 771 proteins, retrieved from 1497 scientific articles, which is ~20% increase in data from the last version (4). Currently, the numbers of data for wild-type proteins, single, double and multiple mutants are 7014, 8202, 1277 and 620, respectively. Based on the solvent accessibility of mutants, 4426 mutations are buried, 2687 partially buried and 2751 exposed. In terms of secondary structures, 3993 mutations are in helix, 2622 in strand, 1227 in turn and 2467 in coil regions. Majority of data are obtained from CD (6825) and DSC (5294) experiments followed by fluorescence (3628). Further, 10 154 data are obtained by thermal denaturation, 3890 and 2796 data from GdnHCl and urea denaturation, respectively.

Currently, ProNIT 2.0 contains 4900 entries from 273 research papers. There are 158 different DNA-binding proteins with 3489 wild-type entries and 1411 mutant entries. Majority of data are obtained by gel shift (1316), fluorescence (1143) and filter binding (1053), followed by calorimetry (727), surface plasmon resonance (185) and footprinting (168). Although proteins from a variety of organisms are present in ProNIT, majority of interaction data are from Escherichia coli proteins (1625) followed by Mus musculus (637) and Homo sapiens (569).


    NEW FEATURES
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
There is a growing interest in the relationship between structure and thermodynamics of proteins. Thus, we try to provide link from thermodynamic data in ProTherm to structural information. So far, ProTherm data are connected to sequence and structural information of proteins through 3DinSight. We have added a new cross-link between ProTherm and STING (9), a comprehensive analysis tool for proteins with many structural descriptors. For given protein mutations searched within STING, each entry in STING report is connected to available thermodynamic data in ProTherm. Conversely, all the ProTherm data with available protein structure have pointers to the corresponding STING entry with detailed structural information. This cross-link will greatly facilitate the analysis of structure–thermodynamic relationship of proteins. The ProTherm page also provides cross-reference tables necessary for creating cross-links with PDB (10), PIR (11) and Swiss-Prot (12) databases. We have also added several new features in the search interface to make the search more efficient and convenient.

In the current release of ProNIT 2.0, we have included ~200 protein–RNA interaction data. To facilitate the retrieval of data based on DNA and RNA separately, we have added a new field called TYPE_NUC, where we provide the information about whether the nucleic acid sequence is DNA (single-stranded DNA, ssDNA, or double-stranded DNA, dsDNA) or RNA. Furthermore, a search option is added to retrieve data based on ssDNA, dsDNA or RNA. The protein nomenclature in the literature is not necessarily uniform. Hence, we have added a new field, SYNONYMS, in order to address this problem. Other additions of field are the SwissProt ID of the protein and ‘RELATED_ENTRIES’, which provides the list of entries that contain data from the same paper (the original paper usually contains multiple data and they are entered in different entries). We also provide a link to all homologous PDB codes with sequence identity of >95%. Also, display options and sorting options are significantly improved. We have supplied the lists of ProNIT entries, protein names, protein sources, PDB code, NDB codes (13), authors and references in the advanced search page, along with a new query help page to help users for easy retrieval of the data. Several entries are deleted because of duplication, co-operative binding and so on, and the database entries are now renumbered. A mapping table, which relates the old and new entry numbers, is provided to help old users of ProNIT.


    CITATION AND AVAILABILITY
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 
The URLs for ProTherm and ProNIT are http://gibk26.bse.kyutech.ac.jp/jouhou/Protherm/protherm.html and http://gibk26.bse.kyutech.ac.jp/jouhou/pronit/pronit.html, respectively. The users of ProTherm and ProNIT are requested to cite the references (4) and (5), respectively, in their publication including the above URLs. Users who use both the databases for their work may cite this article in their publications. Suggestions and other materials for inclusion in the databases are welcome and should be sent to either protherm{at}rtcmain.bse.kyutech.ac.jp or pronit{at}rtcmain.bse.kyutech.ac.jp.


    ACKNOWLEDGEMENTS
 
The database development is partially supported by a Grant-in-Aid for Publication Scientific Research Results from the Japan Society for the Promotion of Sciences (JSPS). We also thank Advanced Technology Institute Inc. (ATI) for support. Funding to pay the Open Access publication charges for this article was provided by JSPS.

Conflict of interest statement. None declared.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 CONTENT, ORGANIZATION AND DATA...
 DATABASE STATISTICS
 NEW FEATURES
 CITATION AND AVAILABILITY
 REFERENCES
 

  1. Gromiha, M.M., An, J., Kono, H., Oobatake, M., Uedaira, H., Sarai, A. (1999) ProTherm: Thermodynamic database for proteins and mutants Nucleic Acids Res, . 27, 286–288[Abstract/Free Full Text] .

  2. Gromiha, M.M., Uedaira, H., An, J., Selvaraj, S., Prabakaran, P., Sarai, A. (2002) ProTherm: thermodynamic database for proteins and mutants: developments in version 3.0 Nucleic Acids Res, . 30, 301–302[Abstract/Free Full Text] .

  3. Sarai, A., Gromiha, M.M., An, J., Prabakaran, P., Selvaraj, S., Kono, H., Oobatake, M., Uedaira, H. (2002) Thermodynamic databases for proteins and protein–nucleic acid interactions Biopolymers, 61, 121–126[CrossRef] .

  4. Abdulla Bava, K., Gromiha, M.M., Uedaira, H., Kitajima, K., Sarai, A. (2004) ProTherm, version 4.0: thermodynamic database for proteins and mutants Nucleic Acids Res, . 32, D120–D121[Abstract/Free Full Text] .

  5. Prabakaran, P., An, J., Gromiha, M., Selvaraj, S., Uedaira, H., Kono, H., Sarai, A. (2001) Thermodynamic database for protein–nucleic acid interactions (ProNIT) Bioinformatics, 17, 1027–1034[Abstract/Free Full Text] .

  6. Pfeil, W. Protein Stability and Folding: A Collection of Thermodynamic Data, (1998) NY Springer .

  7. Pfeil, W. Protein Stability and Folding, Supplement1: A Collection of Thermodynamic Data, (2001) NY Springer .

  8. An, J., Nakama, T., Kubota, Y., Sarai, A. (1998) 3DinSight: an integrated relational database and search tool for structure, function and property of biomolecules Bioinformatics, 14, 188–195[Abstract/Free Full Text] .

  9. Neshich, G., Borro, L.C., Higa, R.H., Kuser, P.R., Yamagishi, M.E., Franco, E.H., Krauchenco, J.N., Fileto, R., Ribeiro, A.A., Bezerra, G.B., et al. (2005) The Diamond STING server Nucleic Acids Res, . 33, W29–W35[Abstract/Free Full Text] .

  10. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E. (2000) The Protein Data bank Nucleic Acids Res, . 28, 235–242[Abstract/Free Full Text] .

  11. Cathy, H.W., Yeh, L.L., Huang, H., Arminski, L., Jorge, C.A., Chen, Y., Hu, Z.Z., Ledley, R.S., Kourtesis, P., Suzek, B.E., et al. (2003) The Protein Information Resource Nucleic Acids Res, . 31, 345–347[Abstract/Free Full Text] .

  12. Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M.-C., Estreicher, A., Gasteiger, E., Martin, M.J., Michoud, K., O'Donovan, C., Phan, I., et al. (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 Nucleic Acids Res, . 31, 365–370[Abstract/Free Full Text] .

  13. Berman, H.M., Zardecki, C., Westbrook, J. (1998) The Nucleic Acid Database: a resource for nucleic acid science Acta Crystallogr. D Biol. Crystallogr, . D54, 1095–1104[CrossRef][Medline] .


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
P. Chen and E. I. Shakhnovich
Lethal Mutagenesis in Viruses and Bacteria
Genetics, October 1, 2009; 183(2): 639 - 650.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Contreras-Moreira
3D-footprint: a database for the structural analysis of protein-DNA complexes
Nucleic Acids Res., September 18, 2009; (2009) gkp781v1.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
V. Potapov, M. Cohen, and G. Schreiber
Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details
Protein Eng. Des. Sel., September 1, 2009; 22(9): 553 - 560.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L.-T. Huang and M. M. Gromiha
Reliable prediction of protein thermostability change upon double mutation from amino acid sequence
Bioinformatics, September 1, 2009; 25(17): 2181 - 2187.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
N. Tuncbag, G. Kar, O. Keskin, A. Gursoy, and R. Nussinov
A survey of available tools and web servers for analysis of protein-protein interactions and interfaces
Brief Bioinform, May 1, 2009; 10(3): 217 - 232.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
S. Kang, G. Chen, and G. Xiao
Robust prediction of mutation-induced protein stability change by property encoding of amino acids
Protein Eng. Des. Sel., February 1, 2009; 22(2): 75 - 83.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Lonquety, Z. Lacroix, N. Papandreou, and J. Chomilier
SPROUTS: a database for the evaluation of protein stability upon point mutation
Nucleic Acids Res., January 1, 2009; 37(suppl_1): D374 - D379.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Ahmad, O. Keskin, A. Sarai, and R. Nussinov
Protein-DNA interactions: structural, thermodynamic and clustering patterns of conserved residues in DNA-binding proteins
Nucleic Acids Res., October 1, 2008; 36(18): 5922 - 5932.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Stout, J. Bacardit, J. D. Hirst, and N. Krasnogor
Prediction of recursive convex hull class assignments for protein residues
Bioinformatics, April 1, 2008; 24(7): 916 - 923.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Grandison and R. J. Morris
Biological pathway kinetic rate constants are scale-invariant
Bioinformatics, March 15, 2008; 24(6): 741 - 743.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Bhardwaj, R. V. Stahelin, G. Zhao, W. Cho, and H. Lu
MeTaDoR: a comprehensive resource for membrane targeting domains and their host proteins
Bioinformatics, November 15, 2007; 23(22): 3110 - 3112.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Deutsch and B. Krishnamoorthy
Four-Body Scoring Function for Mutagenesis
Bioinformatics, November 15, 2007; 23(22): 3009 - 3015.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. B. Zeldovich, P. Chen, and E. I. Shakhnovich
Protein stability imposes limits on organism complexity and speed of molecular evolution
PNAS, October 9, 2007; 104(41): 16152 - 16157.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. D. Bloom, A. Raval, and C. O. Wilke
Thermodynamics of Neutral Protein Evolution
Genetics, January 1, 2007; 175(1): 255 - 266.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
G. D. Rose, P. J. Fleming, J. R. Banavar, and A. Maritan
A backbone-based theory of protein folding
PNAS, November 7, 2006; 103(45): 16623 - 16633.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (57K) Freely available
Right arrow Screen PDF (59K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kumar, M. D. S.
Right arrow Articles by Sarai, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kumar, M. D. S.
Right arrow Articles by Sarai, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?