Nucleic Acids Research, 1995, Vol. 23, No. 8 1434-1440
© 1995
COMPUTATIONAL BIOLOGY |
Multi-alphabet consensus algorithm for identification of low specificity protein-DNA interactions
Department of Molecular, Cellular and Development Biology, University of Colorado at Boulder Campus Box 347, Colorado, Boulder, CO 80309-0347, USA
*To whom correspondence should be addressed at present address: EMBL, Postfach 10.2209, Meyerhofstrasse 1, D-6900 Heidelberg, Germany
Received September 16, 1994. Revised March 10, 1995. Accepted March 10, 1995.
A method for the identification and characterization of protein-DNA interactions is presented. We have developed an approach for finding unknown multiple patterns that occur imperfectly in a set of several sequences. The pattern may contain letters from the nucleotide alphabet (A, C, G and T) including ambiguous characters (A/C, A/G, A/T; A/C/G, etc.). This method reveals weak DNA signals on an unaligned set of DNA fragments known to be functionally related and assumes no prior information on the sequences' alignment. It determines the locations of the signals from only the information intrinsic to the sequences them selves. We have applied this method to analyze the binding sites of cAMP receptor protein (CRP). The consensus based on these data are discussed and a comparison of the consensus with the crystal structure of CAP-DNA complex is presented. We further show that in a mixture of DNA sequences, containing binding sites for two different proteins, both classes of binding sites can be discovered simultaneously by this method. The DNA sequences of nucleosome cores from chicken erythrocyte and a set of the other known nucleosomal sequences show existence of symmetrical features in nucleosome-binding DNA sequences. We also show multi-alphabet patterns that can play a role in the phasing signal on the nucleosome DNA molecule and have compared the results with existing models of nucleosome positioning.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Ioshikhes, E. N. Trifonov, and M. Q. Zhang Periodical distribution of transcription factor sites in promoter regions and connection with chromatin structure PNAS, March 16, 1999; 96(6): 2891 - 2895. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zucman-Rossi, P. Legoix, J.-M. Victor, B. Lopez, and G. Thomas Chromosome translocation based on illegitimate recombination in human tumors PNAS, September 29, 1998; 95(20): 11786 - 11791. [Abstract] [Full Text] [PDF] |
||||
