| Nucleic Acids Research | Pages |
A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment
Introduction
Computational Methods
Results And Discussion
Period-10 VWG variations correlate with nucleosome forming preferences in SV40 chromatin
Periodic oscillations in period-10 VWG predict nucleosome alignment and the value of the dinucleosome spacing periodicity
Conclusion
Acknowledgements
References
A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment
ABSTRACT
INTRODUCTION
The vertebrate genome is packaged into chromatin, consisting largely of nucleosomes. Each nucleosome contains 147 bp of tightly wrapped DNA (1), a variable length (generally ranging from 20 to 80 bp) of linker DNA, and linker histone H1 (2). Nucleosomes in chromatin aggregate into higher-order structures, the exact nature of which is currently unclear (3,4). In addition to its obvious role in organizing DNA, chromatin structure is thought to be an important factor in gene regulation (5-7). It is reasonable to suppose that the DNA base sequence influences the nucleosome arrangements in chromatin, and that this influence could be of functional significance. DNA sequences that position nucleosomes have been known for some time (8). However, the extent to which the DNA sequence contributes to the nucleosome arrangements in chromatin remains unknown.
This problem has been examined by a number of different approaches. There have been high resolution mapping studies performed on a few well-studied genes. From such studies, it seems clear, for example, that nucleosome positioning is involved in the regulation of the mouse mammary tumor virus long terminal repeat promoter (9-13) and in the Xenopus 5S rRNA genes (14). In another approach, sequence alignment studies of nucleosomal DNA have revealed a weak out-of-phase 10 bp periodicity of A·T and G·C bp (15-17) and a 10 bp periodicity in the occurrence of AA dinucleotides (18-20), consistent with in-phase DNA sequence-dependent anisotropic deformability playing a role in nucleosome positioning (21,22). Competitive reconstitution and sequence selection experiments using nucleosome-size DNA fragments have additionally demonstrated affinity differences among different genomic DNA sequences for core histones (23,24). Because >95% of bulk genomic DNA differed in free energy for nucleosome formation, under those conditions, by only ±0.2 kcal/mol relative to synthetic random DNA, it was concluded that the DNA sequence has no appreciable influence on nucleosome positioning for the vast majority of nucleosomes (23). However, this type of experiment does not include contributions to positioning preferences resulting from the low energy positional shifting along DNA, which can readily occur at physiological conditions, where histones do not dissociate from DNA to an appreciable extent (13,25). Pointing to the importance of such nucleosome sliding mechanisms is the demonstration that nucleosome reconstitution on a 4.4 kb DNA fragment containing the chicken adult [beta]-globin gene resulted in extensive variation in the affinity of core histones for different DNA regions. The range of variation was about two orders of magnitude greater than that of the initial binding of the histone octamer (26). In still another type of experiment, in vitro chromatin assembly using Drosophila embryo extracts, it was concluded that DNA sequence plays, at most, a minor role in nucleosome positioning (27). However, it is not clear that the ATP-dependent nucleosome mobilization that occurs in Drosophila early embryos (28) is reflective of most animal cells. Finally, the presence of linker histone H1, not included in most studies, can dramatically influence the nucleosome arrangement on DNA in vitro (29,30).
Despite the complexities of the experimental data, the preponderance of evidence obtained over many years from both in vitro and in vivo experiments favors the view that the DNA sequence does influence nucleosome positioning. However, it has been found that nucleosomes generally do not occupy unique positions on DNA. Rather, there appear to be preferred nucleosome locations. An example is provided by SV40 chromatin. In one experiment (31) native SV40 chromatin was isolated, digested with micrococcal nuclease, and the mononucleosome DNA fragments were isolated and cloned. By repeatedly selecting random clones from this library, and by sequencing the SV40 inserts, the extent to which nucleosomes overlapped each SV40 map position was determined. The peaks and valleys obtained for the clone overlap function (Fig.
In vitro chromatin assembly studies using a purified system have also revealed that there are preferred nucleosome arrangements on different DNA sequences. These experiments showed that some 2-3 kb vertebrate DNA sequences assembled into highly ordered, physiologically spaced, nucleosome arrays in the presence of linker histone, whereas other 2-3 kb sequences did not, under identical conditions (32-36). Moreover, an excellent correlation has been found for the tendency of different DNA sequences to package into ordered nucleosome arrays in vitro and in animal cell nuclei (34-36). These studies suggest that the DNA base sequence can influence nucleosome array formation, and it is of interest to know what base sequence patterns could be responsible for this phenomenon.
Recently, a fairly strong consensus sequence: non-T(A/T)G, which will be referred to as VWG (with complement CWB), with a period of 10 nt was found throughout human DNA. This unexpected discovery was made when a powerful machine learning approach was applied to a large set of human genes in order to train the computer to better recognize splice junctions and to find possible distinctive characteristics of intron and exon sequences (37). Because the 10 nt period could be reflecting an in phase bending potential, the authors suggested that period-10 VWGs might be a signal involved in nucleosome positioning. We have tested this hypothesis, and report results here which support it.
COMPUTATIONAL METHODS
Computer programs were written to scan along a DNA sequence to identify VWGs on both strands (or equivalently, to identify VWG/CWB on one strand) and to count the number of period-10 VWGs occurring in a window ± approximately 50 bp (5 periods) from each VWG position. Overlapping occurrences were relatively rare, and were counted separately. The University of Wisconsin GCG Sequence Analysis Software (running on a VAX system) was used to find the pattern VWG, or other patterns when desired. All other programs were written in VAX FORTRAN. The period was allowed to vary within a small specified range. For chicken genomic DNA this range was 10.00-10.33; for SV40 DNA the range was as stated. The period variation was implemented by counting as periodic only nucleotide numbers, which satisfy the relation
| PL [le] ¦nucleotide no.¦/period no. [le] PH, |
where the nucleotide no. is the integer number of nucleotides measured from any VWG/CWB position to another VWG/CWB position, the period no. is the integer 1, 2, 3, 4 or 5, PL is the low end of the period variation and PH is the high end. For example, for a period range of 10.00-10.33, PL = 10.00, PH = 10.33, and the periodic nucleotide numbers are: ±10; ±20; ±30, ±31; ±40, ±41; ±50, ±51. Thus, 41 is a periodic nucleotide number for this range because 41/4 = 10.25, whereas 42 is not periodic because 42/4 = 10.50.
The histogram data were then averaged in a sliding (5 bp increments) 100 bp window to generate a continuous curve of average period-10 VWG count versus GenBank nucleotide number. Closely similar results were obtained using window sizes over the range 50-150 bp. It should be pointed out that an average count of 10 would be obtained only for a perfectly periodic region of DNA (i.e., a 100 bp or larger region having a VWG every 10 bp and nowhere else, which is quite rare in vertebrate DNA). Moreover, because the VWG consensus includes six common nucleotide triplets, it occurs frequently (one every 5.6 bp in SV40 DNA on average), and this leads to an appreciable background level of the average period-10 VWG count throughout vertebrate DNA. The total number of VWG occurrences in a sliding (5 bp increments) 600 bp window was also computed. While deviations from the average value of VWG occurrences were usually <10%, occasionally VWG-rich or VWG-poor regions of DNA were encountered. Thus, the period-10 VWG counts were normalized by a factor equal to the average VWG count for 600 bp/the VWG count in the center of a 600 bp window surrounding the nucleotide of interest. The rationale for normalization is that DNA regions poor (or rich) in VWG will statistically have fewer (or greater) period-10 VWGs. The 600 bp window size is arbitrary, but large enough to accommodate several nucleosomes and to damp out fluctuations. It was found that this normalization had only a small effect on the computational results.
RESULTS AND DISCUSSION
Period-10 VWG variations correlate with nucleosome forming preferences in SV40 chromatin
When this analysis was performed with SV40 DNA using a periodicity range of 9.2-10.5 bp (Fig.
A
![]() |
B
![]() |
Figure 1. Experimental nucleosome positioning preferences in native SV40 chromatin, periodic VWG/CWB counts along SV40 DNA, and VWG occurrences in SV40 DNA. (A) Comparison of the variation in the clone overlap (solid curve), an experimental measure of nucleosome positioning preferences in native SV40 chromatin, with the variation in the window-averaged count of periodic VWG/CWB (dotted curve) along the SV40 DNA sequence (strain 776). Clone overlap denotes the fragment overlap function defined precisely in (31). It is essentially the number of clones with SV40 nucleosomal DNA inserts found in the 412 clone set that overlapped each site ±5 bp along the SV40 DNA sequence. The solid curve reproduces the curve shown in figure 5 of ref. 31. Periodic VWG/CWB denotes the number of VWG nucleotide triplets on both strands spaced at intervals consistent with a periodicity range of 9.2-10.5 bp ±52 bp from any VWG. The VWG counts were averaged in a 100 nt sliding window, and normalized as described in the text. The analysis was performed using a linear map of SV40 DNA; the computation of periodic VWG/CWB values began and ended ~300 bp in from each end. (B) VWG occurrences on both strands of SV40 DNA in a sliding 600 bp window.
For comparison, we performed the same analysis as was done with VWG using dinucleotides that are thought to contribute to the rotational positioning of DNA in the nucleosome. Figure
A
![]() |
B
![]() |
C
![]() |
Figure 2. Periodic dinucleotide counts along SV40 DNA compared with the experimentally determined nucleosome positioning preferences in native SV40 chromatin. The analysis was performed as described in Figure 1 for VWG/CWB. (A) Periodic AA/TT, the average number of AA or TT dinucleotides occurring periodically on both strands. (B) Periodic TT, the average number of TT dinucleotides occurring periodically on the GenBank forward strand. (C) Periodic AA, the average number of AA dinucleotides occurring periodically on the GenBank forward strand.
In order to provide a more objective and a quantitative measure of how well the experimental and theoretical curves correspond in Figures
Figure
Figure 3. Scatter plots. (A) The y coordinate values, periodic AA/TT and clone overlap, for the two curves presented in Figure 2A are plotted as points for each nucleotide position value. (B) The y coordinate values, periodic VWG/CWB and clone overlap, for the two curves presented in Figure 1A are plotted as points for each nucleotide position value. We next examined whether VWG oscillations can provide an explanation for why some, but not other, 2-3 kb regions of vertebrate DNA assemble into highly ordered, physiologically spaced, nucleosome arrays, as was shown for the chicken ovalbumin gene (32). Using in vitro chromatin assembly, it was shown that the 2-3 kb region within the gene assessed by probe 3 formed a highly regular nucleosome array with a 195 bp spacing periodicity when linker histone was present (32). Twelve multiples of this unit repeat were resolved for the micrococcal nuclease ladder using this probe. In contrast, the 2-3 kb region immediately downstream was considerably less ordered (32). The variations in the period-10 VWG count across the chicken ovalbumin gene are shown in Figure Figure 4. Variations in the average periodic VWG/CWB count throughout the chicken ovalbumin gene. Periodic VWG/CWB denotes the number of VWG triplets on both strands spaced at intervals consistent with a period range of 10.00-10.33 bp ±52 bp from any VWG. The VWG counts were averaged in a 100 nt sliding window, and normalized as described in the text. Position denotes GenBank locus CHKOVAL nucleotide numbers. Probe 3, from (32), was a 235 bp restriction fragment that spanned the intron E-exon 5 junction. Figure 5. Fourier analysis of the oscillations in periodic (10.00-10.33 bp) VWG/CWB in two regions of the chicken ovalbumin gene (Fig. 4). (A) The approximate ordered region assessed by probe 3 (32); GenBank (locus CHKOVAL) nucleotide positions 4200-7500. The arrow marks the period corresponding to 2× the value of the experimentally determined nucleosome repeat detected by probe 3 (2 × 195 bp = 390 bp). (B) Adjacent disordered downstream region; GenBank (locus CHKOVAL) nucleotide positions 6790-8790. We then analyzed the chicken [beta]-globin gene, for which the nucleosome ladders have been examined both in vitro (33) and in erythroid cells (38). In both cases, this DNA assembles into fairly well ordered chromatin with an ~180 bp nucleosome spacing periodicity. Interestingly, in erythrocytes the bulk nucleosome spacing periodicity is >200 bp, noticeably longer than on the [beta]-globin sequences (38), consistent with the [beta]-globin gene DNA base sequence being responsible for the shorter repeat. Fourier analysis revealed a predominant amplitude peak at a period very close to 2 × 180 bp = 360 bp throughout the [beta]-globin locus, again consistent with a dinucleosome period oscillation in the period-10 VWGs. The analysis of a 3 kb region from the 5[prime] end of the locus is shown in Figure Figure 6. Fourier analysis of the oscillations in periodic (10.00-10.33 bp) VWG/CWB. (A) DNA region just upstream of the chicken [beta]-globin gene cluster; GenBank (locus CHKHBBRE) nucleotide positions 6000-9000. The arrow marks the period corresponding to 2× the approximate value of the experimentally determined nucleosome repeat in this DNA region in nuclei of erythroid cells (38) (2 × 180 bp = 360 bp). (B) pBR322; nucleotide positions 300-3300. Another example of a correlation between a regular dinucleosome period oscillation in the period-10 VWG count and DNA sequence-dependent nucleosome alignment is provided by SV40 DNA. In vitro and in transfected cells, the SV40 early region assembles into a significantly more regular nucleosome array than the late region; the nucleosome spacing periodicity was 200 bp (34). Consistent with these findings, the SV40 early region (Fig. Figure 7. Fourier analysis of the oscillations in periodic (9.2-10.5) VWG/CWB for SV40 DNA. (A) SV40 early region (nucleotides 2600-4900). The arrow marks the period corresponding to 2× the value of the experimentally determined nucleosome repeat in this region (34) (2 × 200 bp = 400 bp). (B) SV40 late region (nucleotides 300-2600). The findings reported here suggest that the period-10 VWG signal, identified serendipitously by a machine learning approach does in fact contribute to nucleosome positioning and alignment, as postulated (37), and also that the dinucleosome may have some structural significance, as has been claimed previously (2). These findings also suggest that it may be possible to deduce information about chromatin structure computationally. Recent work by others has suggested that nucleosome arrangements might affect chromatin higher-order structure. In non-solenoid models (39,40), disordered nucleosome arrays should form more globular (superbead-like) chromatin fibers than highly ordered arrays, which should generally form extended (solenoid-like) structures (39). Variations in nucleosome arrangements along large contiguous regions of DNA could lead to chromatin structures of many different types and shapes. For example, some chromatin structures might dictate that distal enhancers be placed close to appropriate promoters in 3D-space, obviating the need for a solenoid to be disrupted to allow DNA looping to occur. It is plausible that the DNA base sequence, through its effect on nucleosome arrangements, contributes to functionally significant chromatin structure variations along the chromosome. We wish to thank Rick Westerman for help using the GCG software and VAX system, and Raúl Alvarez for assistance. This work was supported by US Public Health Service grants GM48341 (to A.S.) and AI29121 (to M.B.).
Periodic oscillations in period-10 VWG predict nucleosome alignment and the value of the dinucleosome spacing periodicity
Conclusion
ACKNOWLEDGEMENTS
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 15 Jan 1999
Copyright©Oxford University Press, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
X. Xu, K. Tsumagari, J. Sowden, R. Tawil, A. P. Boyle, L. Song, T. S. Furey, G. E. Crawford, and M. Ehrlich
DNaseI hypersensitivity at gene-poor, FSH dystrophy-linked 4q35.2
Nucleic Acids Res.,
December 1, 2009;
37(22):
7381 - 7393.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
G. A. Babbitt and Y. Kim
Inferring Natural Selection on Fine-Scale Chromatin Organization in Yeast
Mol. Biol. Evol.,
August 1, 2008;
25(8):
1714 - 1727.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. S. Song, X. Liu, X. S. Liu, and X. He
A high-resolution map of nucleosome positioning on a fission yeast centromere
Genome Res.,
July 1, 2008;
18(7):
1064 - 1072.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
H. E. Peckham, R. E. Thurman, Y. Fu, J. A. Stamatoyannopoulos, W. S. Noble, K. Struhl, and Z. Weng
Nucleosome positioning signals in genomic DNA
Genome Res.,
August 1, 2007;
17(8):
1170 - 1177.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. Cioffi, T. J. Fleury, and A. Stein
Aspects of large-scale chromatin structures in mouse liver nuclei can be predicted from the DNA sequence
Nucleic Acids Res.,
April 13, 2006;
34(7):
1974 - 1981.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Dalal, T. J. Fleury, A. Cioffi, and A. Stein
Long-range oscillation in a periodic DNA sequence motif may influence nucleosome array formation
Nucleic Acids Res.,
February 16, 2005;
33(3):
934 - 945.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
A. Stein, Y. Dalal, and T. J. Fleury
Circle ligation of in vitro assembled chromatin indicates a highly flexible structure
Nucleic Acids Res.,
December 1, 2002;
30(23):
5103 - 5109.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
Print PDF (108K)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (22)
![]()
Request Permissions ![]()
Scopus Links ![]()
Commercial Re-use Guidelines
for Open Access NAR Content
![]()
Google Scholar ![]()
![]()
Articles by Stein, A.
![]()
Articles by Bina, M.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Stein, A.
![]()
Articles by Bina, M.
![]()
Social Bookmarking ![]()
![]()
What's this?




