Nucleic Acids Research, 2003, Vol. 31, No. 19 e116
© 2003 Oxford University Press
A non-parametric model for transcription factor binding sites
Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 250 Longwood Avenue, SGMB-322, Boston, MA 02115, USA
*To whom correspondence should be addressed. Tel: +1 617 432 3553; Fax: +1 617 432 3557; Email: oliver_king{at}hms.harvard.edu
We introduce a non-parametric representation of transcription factor binding sites which can model arbitrary dependencies between positions. As two parameters are varied, this representation smoothly interpolates between the empirical distribution of binding sites and the standard position-specific scoring matrix (PSSM). In a test of generalization to unseen binding sites using 10-fold cross-validation on known binding sites for 95 TRANSFAC transcription factors, this representation outperforms PSSMs on between 65 and 89 of the 95 transcription factors, depending on the choice of the two adjustable parameters. We also discuss how the non- parametric representation may be incorporated into frameworks for finding binding sites given only a collection of unaligned promoter regions.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Della Gatta, M. Bansal, A. Ambesi-Impiombato, D. Antonini, C. Missero, and D. di Bernardo Direct targets of the TRP63 transcription factor revealed by a combination of gene expression profiling and reverse engineering Genome Res., June 1, 2008; 18(6): 939 - 948. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tomovic and E. J. Oakeley Position dependencies in transcription factor binding sites Bioinformatics, April 15, 2007; 23(8): 933 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. T. Naughton, E. Fratkin, S. Batzoglou, and D. L. Brutlag A graph-based motif detection algorithm models complex nucleotide dependencies in transcription factor binding sites Nucleic Acids Res., November 6, 2006; 34(20): 5730 - 5739. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Johnson, R. J. Gamblin, L. Ooi, A. W. Bruce, I. J. Donaldson, D. R. Westhead, I. C. Wood, R. M. Jackson, and N. J. Buckley Identification of the REST regulon reveals extensive transposable element-mediated binding site duplication Nucleic Acids Res., September 1, 2006; 34(14): 3862 - 3877. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Gershenzon, G. D. Stormo, and I. P. Ioshikhes Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites Nucleic Acids Res., April 22, 2005; 33(7): 2290 - 2301. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Bielinska, J. Lu, D. Sturgill, and B. Oliver Core Promoter Sequences Contribute to ovo-B Regulation in the Drosophila melanogaster Germline Genetics, January 1, 2005; 169(1): 161 - 172. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Linnell, R. Mott, S. Field, D. P. Kwiatkowski, J. Ragoussis, and I. A. Udalova Quantitative high-throughput analysis of transcription factor binding specificities Nucleic Acids Res., February 27, 2004; 32(4): e44 - e44. [Abstract] [Full Text] [PDF] |
||||



