Skip Navigation

Nucleic Acids Research 2006 34(Database Issue):D74-D81; doi:10.1093/nar/gkj131
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (722K) Freely available
Right arrow Screen PDF (191K) Freely available
Right arrow Supplementary Material
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Kummerfeld, S. K.
Right arrow Articles by Teichmann, S. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kummerfeld, S. K.
Right arrow Articles by Teichmann, S. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2006, Vol. 34, Database issue D74-D81
© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

DBD: a transcription factor prediction database

Sarah K. Kummerfeld* and Sarah A. Teichmann

MRC Laboratory of Molecular Biology Hills Road, Cambridge CB2 2QH, UK

*To whom correspondence should be addressed. Tel: +44 1223 402479; Fax: +44 1223 213556; Email: skk{at}mrc-lmb.cam.ac.uk

Received August 12, 2005. Revised October 24, 2005. Accepted October 24, 2005.

Regulation of gene expression influences almost all biological processes in an organism; sequence-specific DNA-binding transcription factors are critical to this control. For most genomes, the repertoire of transcription factors is only partially known. Hitherto transcription factor identification has been largely based on genome annotation pipelines that use pairwise sequence comparisons, which detect only those factors similar to known genes, or on functional classification schemes that amalgamate many types of proteins into the category of ‘transcription factor’. Using a novel transcription factor identification method, the DBD transcription factor database fills this void, providing genome-wide transcription factor predictions for organisms from across the tree of life. The prediction method behind DBD identifies sequence-specific DNA-binding transcription factors through homology using profile hidden Markov models (HMMs) of domains. Thus, it is limited to factors that are homologus to those HMMs. The collection of HMMs is taken from two existing databases (Pfam and SUPERFAMILY), and is limited to models that exclusively detect transcription factors that specifically recognize DNA sequences. It does not include basal transcription factors or chromatin-associated proteins, for instance. Based on comparison with experimentally verified annotation, the prediction procedure is between 95% and 99% accurate. Between one quarter and one-half of our genome-wide predicted transcription factors represent previously uncharacterized proteins. The DBD (www.transcriptionfactor.org) consists of predicted transcription factor repertoires for 150 completely sequenced genomes, their domain assignments and the hand curated list of DNA-binding domain HMMs. Users can browse, search or download the predictions by genome, domain family or sequence identifier, view families of transcription factors based on domain architecture and receive predictions for a protein sequence.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
F. Tian, P. K. Shah, X. Liu, N. Negre, J. Chen, O. Karpenko, K. P. White, and R. L. Grossman
Flynet: a genomic resource for Drosophila melanogaster transcriptional regulatory networks
Bioinformatics, November 15, 2009; 25(22): 3001 - 3004.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Perez-Rodriguez, D. M. Riano-Pachon, L. G. G. Correa, S. A. Rensing, B. Kersten, and B. Mueller-Roeber
PlnTFDB: updated content and new features of the plant transcription factor database
Nucleic Acids Res., October 25, 2009; (2009) gkp805v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Zhang, M. Xu, S. Li, and Z. Su
Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes
Nucleic Acids Res., June 1, 2009; 37(10): e72 - e72.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
B. Deplancke
Experimental advances in the characterization of metazoan gene regulatory networks
Brief Funct Genomic Proteomic, January 1, 2009; 8(1): 12 - 27.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
A. Yilmaz, M. Y. Nishiyama Jr., B. G. Fuentes, G. M. Souza, D. Janies, J. Gray, and E. Grotewold
GRASSIUS: A Platform for Comparative Regulatory Genomics across the Grasses
Plant Physiology, January 1, 2009; 149(1): 171 - 180.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Park, J. Park, S. Jang, S. Kim, S. Kong, J. Choi, K. Ahn, J. Kim, S. Lee, S. Kim, et al.
FTFD: an informatics pipeline supporting phylogenomic analysis of fungal transcription factors
Bioinformatics, April 1, 2008; 24(7): 1024 - 1025.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Zeng, J. Yan, T. Wang, D. Mosbrook-Davis, K. T. Dolan, R. Christensen, G. D. Stormo, D. Haussler, R. H. Lathrop, R. K. Brachmann, et al.
Genome wide screens in yeast to identify potential binding sites and target genes of DNA-binding proteins
Nucleic Acids Res., January 17, 2008; 36(1): e8 - e8.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. Molina and E. van Nimwegen
Universal patterns of purifying selection at noncoding positions in bacteria
Genome Res., January 1, 2008; 18(1): 148 - 160.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
V. Vermeirssen, M. I. Barrasa, C. A. Hidalgo, J. A. B. Babon, R. Sequerra, L. Doucette-Stamm, A.-L. Barabasi, and A. J.M. Walhout
Transcription factor modularity in a gene-centered C. elegans core neuronal protein-DNA interaction network
Genome Res., July 1, 2007; 17(7): 1061 - 1071.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. V. Morozov and E. D. Siggia
Connecting protein structure with predictions of regulatory sites
PNAS, April 24, 2007; 104(17): 7068 - 7073.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Tjong and H.-X. Zhou
DISPLAR: an accurate method for predicting DNA-binding sites on protein surfaces
Nucleic Acids Res., March 12, 2007; 35(5): 1465 - 1477.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Wilson, M. Madera, C. Vogel, C. Chothia, and J. Gough
The SUPERFAMILY database in 2007: families and functions
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D308 - D313.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. J.M. Walhout
Unraveling transcription regulatory networks by protein-DNA and protein-protein interaction mapping
Genome Res., December 1, 2006; 16(12): 1445 - 1454.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Gough
Genomic scale sub-family assignment of protein domains
Nucleic Acids Res., July 28, 2006; 34(13): 3625 - 3633.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.