Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (70K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (36)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Ghosh, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ghosh, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2000, Vol. 28, No. 1 308-310
© 2000 Oxford University Press

Object-oriented Transcription Factors Database (ooTFD)

David Ghosh*

Institute for Transcriptional Informatics, PO Box 2556, Pittsburgh, PA 15230, USA

Received October 1, 1999; Accepted October 4, 1999.


    ABSTRACT
 TOP
 ABSTRACT
 ORGANIZATION AND CONTENTS OF...
 DATA RETRIEVAL TOOLS
 EXTERNAL DATA INTERFACES AND...
 REFERENCES
 
ooTFD (object-oriented Transcription Factors Database) is an object-oriented successor to TFD. This database is aimed at capturing information regarding the polypeptide interactions which comprise and define the properties of transcription factors. ooTFD contains information about transcription factor binding sites, as well as composite relationships within transcription factors, which frequently occur as multisubunit proteins that form a complex interface to cellular processes outside the transcription machinery through protein–protein interactions. In the past year, a few additions and changes were made to this database and associated tools, which are accessible through the IFTI-MIRAGE web site at http://www.ifti.org/


    ORGANIZATION AND CONTENTS OF THE DATABASE
 TOP
 ABSTRACT
 ORGANIZATION AND CONTENTS OF...
 DATA RETRIEVAL TOOLS
 EXTERNAL DATA INTERFACES AND...
 REFERENCES
 
This database is based on data structures derived from the tables of the original database (1,2), now referred to as rTFD (relational Transcription Factors Database). Data structures as described initially (3) for containment and composite relationships in the object-oriented version of the database are being further developed. The contents of main data structures are presented in Table 1, and the flow of information between this database, related data resources, and query and analysis tools is presented in Figure 1. The Mood (Materials object oriented database) and ozone (a pure java object database) systems have been used to maintain the object-oriented version of this database in the past year, although a number of other object database systems are now also being investigated. Entries in the Sites component of this database, which is used in conjunction with transcription factor binding sites sequence analysis tools, are also accessible through network-based interfaces using mysql, a standard relational database system.


View this table:
[in this window]
[in a new window]
 
Table 1. Primary ooTFD data structures
 


View larger version (19K):
[in this window]
[in a new window]
 
Figure 1. Organization and flow of information in ooTFD, rTFD and data access tools.

 

    DATA RETRIEVAL TOOLS
 TOP
 ABSTRACT
 ORGANIZATION AND CONTENTS OF...
 DATA RETRIEVAL TOOLS
 EXTERNAL DATA INTERFACES AND...
 REFERENCES
 
The two categories of retrieval tools and services, namely sequence analysis queries and database queries, are summarized in Table 2 and presented graphically in Figure 1. Protein sequence analyses can be performed against the Polypeptides/Domains dataset (tfdaa) through a standard BLASTP analysis. In cases where BLAST matches are to a polypeptide with known composite relationships in ooTFD, associated links indicating the composite information are provided. Sites analyses can be performed with Tfsitescan, a service which constructs an imagemap in association with sequence analysis results which is linked to individual Sites entries. Object database queries are currently possible through an interface to the ozone implementation of ooTFD, and these generally provide links to the individual database entries. A simple ‘knowledge based’ web tool, TF-Advisor, is available to a new user for selecting the most suitable approach to performing a particular query.


View this table:
[in this window]
[in a new window]
 
Table 2. ooTFD-associated analysis and retrieval tools
 

    EXTERNAL DATA INTERFACES AND FUTURE DIRECTIONS
 TOP
 ABSTRACT
 ORGANIZATION AND CONTENTS OF...
 DATA RETRIEVAL TOOLS
 EXTERNAL DATA INTERFACES AND...
 REFERENCES
 
The original relational database contained a table, X_POINTERS, which was used to map TFD entries to general sequence databases, including GenBank, EMBL, SWISS-PROT and PIR (2). Mainly, this table was used to refer to entries in these other databases which represented the same biological entity. Similar data structures are now being developed to allow the recording of a different type of database relationship, one that is representative of a molecular relationship. These types of pointers, in contrast to the hard links which indicate identical information in multiple databases, represent information not present in either database. For example, one of the ways in which transcription factors transmit cellular and extracellular information to the gene expression machinery is through phosphorylation and dephosphorylation events (4,5). The PKR (Protein Kinase Resource) is a database which contains a variety of information on protein kinases and their biologically relevant sites in their respective protein sequences (6). An interface currently in development links ooTFD entries to PKR entries, allowing end-users to explore the known protein kinase–transcription factor interactions that are a part of the interactions of the cellular machinery with the transcriptional machinery at a particular transcription factor binding site. A java applet in development similar to one presented in an earlier publication (3) would allow the viewing of these interactions with transcription factor polypeptides as a result that is returned from an ooTFD object query.

Another example of inter-database links representing information not present in either database involves low P-value sequence matches between ooTFD/Sites sequence entries and sequence entries in COMPEL, a database of composite regulatory elements (7). These mappings thus include sequence relationships other than, and in addition to, those that are represented in the COMPEL annotation, which is based on experimentally observed data for transcription factor interactions at a composite transcriptional regulatory element. Such links could suggest possible protein–protein interactions between two sequence-specific transcription factors other than those observed by the investigators who originally reported these sites; in these cases, associated retrieval tools would indicate that the suggested composite interaction is one that is predicted from computation.

Another precomputed dataset that has been developed as an accessory to ooTFD involves low P-value sequence matches of TFD/Sites sequences to sequences in EPD, Eukaryotic Promoters Database (8). Query tools for retrieving entries from this precomputed dataset are in development, and further versions of the promoter sequence analysis tool available at the IFTI-MIRAGE web site, Tfsitescan, would provide results which automatically link to this precomputed dataset.

The current consensus in the bioinformatics field is that transcription factor binding sites, especially when represented as matrices, do not contain sufficient information for predictive value for a new binding site from sequence analysis (9). Although there is some usefulness in this view, simple methods for organizing sequence analysis results based on P-value, as are currently performed by Tfsitescan, can serve as one approach to screen out spurious and statistically insignificant matches. As in the cases of homeodomain proteins, some transcription factors may simply be less specific for their target sequences than others, and the primary determinants for transcription factor access to DNA may in these cases be modulated at the level of chromatin organization. Although maintaining and updating information in an object database system is less straightforward than maintaining information as a relational database, the increasing complexity of the information being generated regarding gene regulation suggests a need for approaches to data management beyond that which is afforded by relational database management systems. The available dataset of co-regulated genes in complete genomes, their common upstream sequence elements, and reports regarding the role of combinatorial transcription factor interactions in gene regulation, is increasing. As it becomes available, automated systems to link such publicly available information to information derived from transcription factor data retrieval, would be developed in the context of this database resource.


    ACKNOWLEDGEMENTS
 
This work was supported in part by a gift from Procter & Gamble Co. to the University of California at San Diego, by the facilities of the National Biomedical Computation Resource (NIH RR-08605) at the San Diego Supercomputer Center, and by the facilities of the Biomolecular Engineering Research Center at Boston University.


    FOOTNOTES
 
* Tel/Fax: +1 412 682 0550; Toll-free Tel/Fax (USA only): +1 800 894 4770; Email: dghosh@ifti.org Back


    REFERENCES
 TOP
 ABSTRACT
 ORGANIZATION AND CONTENTS OF...
 DATA RETRIEVAL TOOLS
 EXTERNAL DATA INTERFACES AND...
 REFERENCES
 

    1 Ghosh,D. (1990) Nucleic Acids Res., 18, 1749–1756.[Abstract/Free Full Text]

    2 Ghosh,D. (1991) Trends Biochem. Sci., 16, 445–447.[Web of Science][Medline]

    3 Ghosh,D. (1998) Nucleic Acids Res., 26, 360–361.[Abstract/Free Full Text]

    4 Hunter,T. and Karin,M. (1992) Cell, 70, 375–387.[Web of Science][Medline]

    5 Karin,M. (1994) Curr. Opin. Cell. Biol., 6, 415–424.[Web of Science][Medline]

    6 Smith,C.M., Shindyalov,I.N., Veretnik,S., Gribskov,M., Taylor,S.S., TenEyck,L.F., Bourne,P.E. (1997) Trends Biochem. Sci., 22, 444–446.[Web of Science][Medline]

    7 Kel-Margoulis,O.V., Kel,A.E., Frisch,M., Romaschenko,A.G., Kolchanov,N.A. and Wingender,E. (1998) Proceedings of the First International Conference on Bioinformatics of Genome Regulation and Structure, (BGRS ’98). ICG, Novosibirsk, Vol. 1, 54–57.

    8 Perier,R.C., Junier,T., Bonnard,C. and Bucher,P. (1999) Nucleic Acids Res., 27, 307–309. Updated article in this issue: Nucleic Acids Res. (2000), 28, 302–303.[Abstract/Free Full Text]

    9 Bucher,P. (1999) Curr. Opin. Struct. Biol., 9, 400–407.[Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Arch Gen PsychiatryHome page
C. S. Barr, R. L. Dvoskin, Q. Yuan, R. H. Lipsky, M. Gupte, X. Hu, Z. Zhou, M. L. Schwandt, S. G. Lindell, M. McKee, et al.
CRH Haplotype as a Factor Influencing Cerebrospinal Fluid Levels of Corticotropin-Releasing Hormone, Hypothalamic-Pituitary-Adrenal Axis Activity, Temperament, and Alcohol Consumption in Rhesus Macaques
Arch Gen Psychiatry, August 1, 2008; 65(8): 934 - 944.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
O. L. Griffith, S. B. Montgomery, B. Bernier, B. Chu, K. Kasaian, S. Aerts, S. Mahony, M. C. Sleumer, M. Bilenky, M. Haeussler, et al.
ORegAnno: an open-access community-driven resource for regulatory annotation
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D107 - D113.
[Abstract] [Full Text] [PDF]


Home page
MicrobiologyHome page
T. D. Schoep and K. Gregg
Isolation and characterization of putative Pseudobutyrivibrio ruminis promoters
Microbiology, September 1, 2007; 153(9): 3071 - 3080.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
N. A. Kolchanov, T. I. Merkulova, E. V. Ignatieva, E. A. Ananko, D. Yu. Oshchepkov, V. G. Levitsky, G. V. Vasiliev, N. V. Klimova, V. M. Merkulov, and T. C. Hodgman
Combined experimental and computational approaches to study the regulatory elements in eukaryotic genes
Brief Bioinform, July 12, 2007; (2007) bbm027v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Pachkov, I. Erb, N. Molina, and E. van Nimwegen
SwissRegulon: a database of genome-wide annotations of regulatory sites
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D127 - D131.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Y. Galperin
The Molecular Biology Database Collection: 2005 update
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D5 - D24.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
S. W. Stumpferl, O. Stephan, and H. D. Osiewacz
Impact of a Disruption of a Pathway Delivering Copper to Mitochondria on Podospora anserina Metabolism and Life Span
Eukaryot. Cell, February 1, 2004; 3(1): 200 - 211.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes
Plant Physiology, July 1, 2003; 132(3): 1162 - 1176.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. A. Shahmuradov, A. J. Gammerman, J. M. Hancock, P. M. Bramley, and V. V. Solovyev
PlantProm: a database of plant promoter sequences
Nucleic Acids Res., January 1, 2003; 31(1): 114 - 117.
[Abstract] [Full Text] [PDF]


Home page
Endocr. Rev.Home page
C. P. Leo, S. Y. Hsu, and A. J. W. Hsueh
Hormonal Genomics
Endocr. Rev., June 1, 2002; 23(3): 369 - 381.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Praz, R. Perier, C. Bonnard, and P. Bucher
The Eukaryotic Promoter Database, EPD: new entry types and links to gene expression data
Nucleic Acids Res., January 1, 2002; 30(1): 322 - 324.
[Abstract] [Full Text] [PDF]


Home page
J. Immunol.Home page
E.-K. Jo, H. Kanegane, S. Nonoyama, S. Tsukada, J.-H. Lee, K. Lim, M. Shong, C.-H. Song, H.-J. Kim, J.-K. Park, et al.
Characterization of Mutations, Including a Novel Regulatory Defect in the First Intron, in Bruton's Tyrosine Kinase Gene from Seven Korean X-Linked Agammaglobulinemia Families
J. Immunol., October 1, 2001; 167(7): 4038 - 4045.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (70K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (36)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Ghosh, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ghosh, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?