Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (153K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (20)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Law, A.
Right arrow Articles by Lis, J. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Law, A.
Right arrow Articles by Lis, J. T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research Pages 919-924


Direct cloning of DNA that interacts in vivo with a specific protein: application to RNA polymerase II and sites of pausing in Drosophila
Introduction
Materials And Methods
   Cloned DNAs
   Antibody
   UV cross-linking and protein-DNA adduct purification
   HaeIII digestion, CATCH linker ligation and XhoI digestion of concatemers
   Immunoprecipitation
   PCR
   Construction of libraries
   Nuclear run-on transcription assays
Results
Discussion
Acknowledgements
References


Direct cloning of DNA that interacts in vivo with a specific protein: application to RNA polymerase II and sites of pausing in Drosophila

Direct cloning of DNA that interacts in vivo with a specific protein: application to RNA polymerase II and sites of pausing in Drosophila Adam Law, Kazunori Hirayoshi, Thomas O'Brien and John T. Lis*

Section of Biochemsitry, Molecular and Cellular Biology, Biotechnology Building, Cornell University, Ithaca, NY 14853, USA

Received November 12, 1997; Revised and Accepted December 26, 1997

ABSTRACT

A new method is described for cloning DNA sequences occupied by a specific protein on chromatin in vivo. The approach uses UV cross-linking to couple proteins covalently to DNA and the resulting complexes are then purified under stringent conditions. Particular adducts are immunoprocipitated with antibody to the protein of interest. The resulting DNA (iDNA) is amplified by PCR, cloned and characterized. The model system used was RNA polymerase II (Pol II), whose density on particular DNAs under various conditions is well documented. Pol II can exist in several states on DNA. While Pol II can simply be bound to DNA, the bulk of DNA-associated Pol II is transcriptionally engaged in either the transcribing or paused states. Paused Pol IIs that have previously been characterized are found at promoters and have the distinctive property that their transcription in isolated nuclei is stimulated by sarkosyl or high salt. Here we isolate and sequence DNAs that cross-link to Pol II molecules. We identify by nuclear run-on assays those DNAs that have Pol II engaged in transcription. Twenty one percent of the iDNA clones that have detectable transcriptionally engaged Pol II appear to be paused, in that they display sarkosyl-stimulated trancription in a nuclear run-on transcription assay. At least some of these map to the 5'-ends of genes. These results suggest that transcriptional pausing of Pol II is a general phenomenon in vivo.

INTRODUCTION

Chromosomal DNA is covered with thousands of proteins that execute a broad spectrum of nuclear functions. Some of these proteins bind DNA with strong sequence specificity, others bind DNA non-specifically, still others bind DNA not on their own but by cooperating with other proteins of a complex and, finally, others bind and translocate along the DNA. Identifying where and at what level a protein interacts with DNA is critical in assessing its function. While binding assays in vitro permit examination of protein-DNA contacts in biochemical detail, methods that quantitatively examine specific protein-DNA interactions in vivo are critical in assessing their biological significance.

Well-documented strategies exist for analytically measuring the location and density of a protein on specific segments of DNA in vivo (1-8). Typically a protein can be cross-linked to DNA in cells and antibody to the protein of interest used to immunoprecipite protein-DNA adducts. The abundance of co-precipitated specific DNAs is quantified by hybridization or PCR methods to provide a measure of the relative density of the protein on particular DNA fragments. Early versions of these protocols used UV light as the cross-linking agent (2,3) and more recent versions have made use of formaldehyde as the cross-linker (4,6,8). Each approach has its advantages, e.g. flashes of UV light rapidly penetrate cells and allow kinetic analysis of proteins that are in direct contact with DNA (9), while formaldehyde is a more general cross-linker and works with a broad array of chromosomal proteins (4,5). Although both approaches have been used analytically to measure a protein on specific regions of DNA, the DNA sequences examined are limited to suspected target DNAs. In principle, the cross-linked DNAs that co-purify with a particular protein can be used to identify new DNA targets of the protein in vivo. Generating cross-links of protein to DNA in vivo ensures that proteins are binding true DNA targets and not artifacts of rearrangements that can occur on chromosomes during cell lysis or further handling of chromatin.

Here we describe an approach for cloning DNA sequences with which a specific protein associates in vivo. We developed this approach with RNA polymerase II (Pol II), which we have shown previously can cross-link efficiently to DNA when cells are irradiated with UV light (10). The distribution of Pol II on many genes is well defined by quantitative studies of transcription (11), providing a strong test of whether the new approach is working in a predictable manner. Also, non-transcribing Pol II has been found on the promoters of some genes, which can be detected by nuclear run-on assays done in the presence of sarkosyl or high salt (12,13). These Pol IIs are also cross-linked to DNA by UV light (2) and the DNAs associated with polymerase should therefore be clonable by this new approach.

One of the best characterized paused Pol II complexes is at the start of the Drosophila hsp70 gene. In the absence of heat shock a transcriptionally engaged Pol II molecule pauses 21-35 bp from the transcription start site of hsp70 (14,15). A survey of Drosophila genes showed evidence of paused Pol II on several but not all genes (13). Pol II pausing has also been reported in other eukaryotic organisms. These genes include the well-characterized pause in human c-myc (16,17) and human hsp70 (18), as well as other genes (19). The pause site in each of these cases occurs within 20-50 bases from the transcriptional start site. The accumulation of a promoter paused Pol II (in the case of hsp70 genes the level is one Pol II per gene) indicates that simple Pol II recruitment or initiation of transcription is not always rate limiting and that pausing may represent an important regulated step in expression of a variety of genes in diverse organisms.

To identify sites of Pol II pausing in an unbiased manner we used this UV cross-linking method to select and clone a collection of DNA sequences binding Pol II molecules in vivo. Transcribed genes are enriched in this collection of cloned DNAs. Analysis of these clones by nuclear run-on assay shows that many contain detectable, transcriptionally engaged Pol II. Moreover, the results indicate that a significant fraction of these engaged RNA polymerases are paused.

MATERIALS AND METHODS

Cloned DNAs

The probes used in Table 1 are the [beta]1-tubulin 1.4 and 2.7 kb AvaII fragments from pTu56-94 (20); the Hsp83 3.3 kb BamHI-SalI fragment from aDm4.46 (21); the ribosome 1.1 kb HindIII fragment from DmRyr22 (22). The control DNAs probed in nuclear run-on assays are histone H4 [a 896 bp StuI-AvaI fragment from the 4.8 kb histone repeat unit cloned into the SmaI site of pUC19; (23,23)]; hsp70 [p70X2.6 digested with AvaI and PstI (12) or with BanI and ScaI (23)]; the HSC5-ph6-8 genomic clone (24).

Antibody

Polyclonal goat anti-Drosophila melanogaster RNA Pol II large subunit was a kind gift from A.Greenleaf (25).

UV cross-linking and protein-DNA adduct purification

We followed the protocol previously described (26), with the following modifications. Drosophila Schneider line 2 (S2) cells were grown in spinner flasks containing 200 ml medium until they reached a density of 107/ml. Aliquots of 2 × 109 cells in mid-logarithmic growth were placed in a pre-chilled Pyrex lasagna dish inside an ice bath. These cells were irradiated for 10 min using an inverted shortwave transilluminator at 256 nm (UV Products) with the solarizer removed. These cells were aliquoted into four 50 ml polypropylene centrifuge tubes on ice and spun for 5 min in a clinical centrifuge. Cell lysis, nuclear preparations and cesium chloride gradients were performed as described (26). The gradient was fractionated using an 18 guage needle into 1 ml aliquots. The aliquots were checked for maximum genomic DNA concentration by agarose gel electrophoresis and these genomic DNA enriched fractions were pooled. The cesium chloride fraction was dialysed against four changes of 2 l buffer (100 mM Tris-HCl, pH 8, 2 mM EDTA and 0.2% sarkosyl).

HaeIII digestion, CATCH linker ligation and XhoI digestion of concatemers

The UV cross-linked DNA was digested with HaeIII in a 2 ml reaction mix using 1× React 2 buffer (Gibco BRL), 0.2% sarkosyl, 0.1% Triton X-100 and 20 U HaeIII and incubated overnight at 37°C. CATCH linkers (27) were ligated to the blunt-ended genomic DNA fragments using the following reaction conditions: 2 ml HaeIII digestion mixture and the following (final concentrations), 1× ligation buffer (Gibco BRL), 1 mM ATP, 5 µg/ml CATCH A and CATCH B linkers, 0.1% Sarkosyl in a final volume of 3 ml containing 100 U T4 DNA ligase (Gibco BRL). These ligation reactions were incubated overnight at 14°C. The CATCH linker concatemers were digested with XhoI, which cuts sites at the 3'-end of the CATCH linker sequence. The 3 ml of the above ligation reaction was brought to 4 ml with the following (final concentrations), 1× React 2 buffer (Gibco BRL), 0.2% sarkosyl, 0.1% Triton and 1000 U XhoI. The XhoI digest was incubated at 37°C for 8 h and a further 1000 U XhoI were added at 4 h.

Immunoprecipitation

Goat anti-Pol II affinity purified antibody (5 µl) was added to the experimental sample (+ immunoprecipitation) and incubated overnight at 4°C. A control sample was prepared by adding no antibody (- immunopreciptation), which was taken through all the subsequent stages in parallel with the + immunoprecipitation experiment. A slurry of 400 µl protein G-Sepharose (10% in ethanol) was washed with buffer 1 (0.2% sarkosyl, 100 mM Tris-HCl, pH 8, 2 mM EDTA) and resuspended in 500 µl buffer 1. An aliquot of 200 µl of this washed protein G-Sepharose suspension was added to the immunoprecipitation experiments. These reactions were incubated on a rotating platform for at least 4 h at 4°C. The immunoprecipitated Pol II-DNA adducts were spun for 1 min in a clinical centrifuge and the supernatant preserved, which we used later as a control called `total' genomic DNA (this is effectively total DNA, since only a small fraction of the DNA cross-links to Pol II). The protein G-Sepharose precipitate was transferred to 1.6 ml microcentrifuge tubes. The first four washes were in buffer 1 and the next eight washes in buffer 2 (100 mM Tris, pH 9, 0.5 M LiCl, 1% NP40, 1% sodium deoxycholate).

The Pol II-DNA adducts were eluted from the protein G-Sepharose beads by resuspension in 400 µl 0.5% SDS, 50 mM Tris-HCl, pH 8.5, and shaken for 30 min at room temperature. The protein G-Sepharose was separated by centrifugation and elution was repeated three times. The Pol II-DNA adducts were ethanol precipitated using a 1/10 vol 3 M NaOAC and 2.5 vol 100% ethanol and the visible pellets were washed with 70% ethanol. The ethanol precipitates were resuspended in 400 µl 1× proteinase K buffer (0.5% SDS, 10 mM Tris-HCl, pH 8.0, 10 mM EDTA), 4 µl 10 mg/ml proteinase K were added and the reaction mixture digested overnight at 65°C. The proteinase K was removed by phenol/ether extraction and the DNA fragments were ethanol precipitated and resuspended in 20 µl ddH2O. Samples of 5 µl of this product were used in subsequent PCR reactions.

PCR

We followed the amplification protocol for CATCH primers described by Kinzler and Vogelstein for their whole genomic PCR procedure (27) with the following modifications. We found increased yields of PCR products when we used only one CATCH primer in the PCR reaction. Optimal yields were obtained by a modified `hot start' procedure. The PCR reactions were assembled at 70°C and started by mixing in the Taq DNA polymerase and overlaying the mineral oil. An Erikcomp Thermocycler was programed as follows: 95°C 0.5 min, 50°C 2 min, 70°C 1.5 min repeated 25 times and 5% of this sample was reamplified. This amplified DNA was purified by PEG precipitation and two additional cycles of PCR were performed (93°C 2 min, 55°C 10 min, 70°C 10 min with a PEG purification of DNA between cycles) to fill in DNA ends more completely. The resulting amplified DNA was digested with EcoRI prior to the cloning step.


Figure 1. iDNA library construction. 1. Chilled Schneider line 2 cells were irradiated in culture for 10 min using an inverted shortwave transilluminator. Cross-linking and purification of Pol II-DNA adducts were performed as in Gilmour et al. (26) except where noted. 2. Protein-DNA adducts were purified using a CsCl step gradient and the fraction containing DNA was isolated and dialyzed. 3. The DNA-protein adducts were digested with HaeIII to provide small, blunt-ended genomic DNA fragments for linker ligation. 4. `CATCH linkers' (CATCH A and B) were ligated onto the blunt ends of the DNA and concatemers cleaved with XhoI as described by Kinzler and Vogelstein (27). 5. Pol II-DNA complexes were immunoprecipitated using goat anti-Pol II antibody and protein G-Sepharose. 6. The Pol II-DNA complexes were eluted from protein G-Sepharose with a buffer containing 0.5% SDS and digested with proteinase K, leaving fragments of double-stranded DNA with `CATCH linkers' on either end. 7. The DNA was amplified using CATCH primer. 8. The `CATCH linkers' were cleaved at their internal EcoRI site to facilitate cloning of the amplification products into a phagemid vector.

Construction of libraries

Three libraries were constructed: (i) the iDNA library, that was Pol II antibody selected; (ii) the control minus antibody library, from a mock selection with no antibody; (iii) a control `total' genomic library from the supernant of the Pol II immunoprecipitation. The library was constructed using [lambda]ZapII vector according to the manufacturer's (Stratagene) instructions.pBluescript containing the library inserts was excised following the Stratagene protocol. Lambda filter lifts and hybridization, plasmid DNA preparation, dideoxy DNA sequencing, restriction endonuclease digestion, DNA agarose electrophoresis and Southern blotting were all performed using standard methods.

Nuclear run-on transcription assays

Approximately 4 µg large scale plasmid DNA preparations of iDNA clones were digested with EcoRI, cutting sites within the CATCH linkers. These digested DNA samples were electrophoresed on duplicate 1% agarose gels and transferred to nylon membranes for Southern blots. Nuclear run-on transcription assays were carried out using Schneider line 2 cells (under non-heat shock conditions) according to our standard protocol (12). The reactions were labeled with [[alpha]-32P]UTP; the other nucleotides were cold. The nuclear run-on reaction time was 5 min for Figure 2 and 2 min for Figure 3A. Southern blots were hybridized with the nuclear run-on probes for 24 h in roller bottles containing 6× SSC, 5% SDS, 50% formamide, 10% dextran sulfate and 100 µg/ml sonicated salmon sperm at 42°C. The filters were washed three times in 2× SSC, 0.1% SDS at 65°C, twice in 0.2× SSC, 0.1% SDS at 65°C and once in 0.1× SSC, 0.1% SDS at 65°C. Autoradiography was carried out using both X-ray film and a Molecular Dynamics PhosphorImager.


Figure 2. Characterization of the transcriptional status of Pol II iDNA clones. (A) Nuclear run-on transcription assay of clones 1F-3H. Duplicate Southern blots of clones 1F-3H were hybridized with nuclear run-on RNA probes synthesized in the absence (top panel, -S) and presence (bottom panel, +S) of sarkosyl. The left panels show a 4 day PhosphorImager exposure and the right panels show the control lanes exposed as an autoradiogram overnight. Rib is a ribosomal RNA clone (HindIII/EcoRI-cut pDmrY22, a 870 bp, 300 bp, and a partial digestion product are visible), H70 is an hsp70 clone (BanI/ScaI-cut p70X2.6, a 573 bp BanI-BanI fragment is visible).

RESULTS

We developed a scheme to clone DNA sequences that are associated with a specific protein in vivo, as outlined in Figure 1A. This approach is an integration of our protein-DNA cross-linking methods (26) and other techniques employed to clone the target sequences that bind transcription factors in vitro (27). The UV light cross-linked DNA-protein adducts can be subjected to harsh purification procedures, including cell and nuclear lysis with detergents, CsCl ultracentrifugation, and immunoprecipitation. We have used a goat anti-Pol II polyclonal antibody (25) to immunoprecipitate UV-induced Pol II-DNA adducts (26) and have adapted the whole genome PCR procedure (26) to provide sufficient mass of immunoprecipitated DNA for cloning into a phagemid vector. We have termed the resulting library a Pol II immunoselected genomic DNA or iDNA library. Two control libraries were made in parallel with the Pol II iDNA library. The first, a `total' genomic library, was constructed by removing 0.5% of the total genomic DNA after the first immunoprecipitation step and subsequent amplification and cloning. The second, a `minus antibody' library, was constructed in an identical fashion to the iDNA library except that the goat anti-Pol II antibody was omitted.


Figure 3. Localization of the Pol II pause site on HSC5. (A) Nuclear run-on transcription assay in the absence (top panels) and presence of sarkosyl (lower panels). HSC5: lane 1, BglII and SphI (14); lane 2, BglII and AvaI; lane 3, SphI and HincII. Controls: lane 4, histone H4, HindIII and EcoRI; lane 5, hsp70, AvaI and PstI; lane 6, hsp70, BanI and ScaI. Sizes of hybridizing restriction fragments (kb) are to the right of controls. (B) Map of the HSC5 genomic clone (pH6-8). The arrow indicates the transcription start site and the first two introns are marked. Restriction sites are at the following positions relative to the transcription start site: BglII, 1055 bp; HincII, +2; AvaI, +62; SphI, +327; SphI, +868; AvaI, +950. Block diagrams illustrate restriction digests in lanes 1-3 and filled bars represent the fragments that hybridize to the run-on probe. Lettering (A-D) corresponds to positively hybridizing bands in the nuclear run-on assay.

The quality of the iDNA library was assessed by hybridization with sequences that should be enriched or depleted in the library. Table 1 demonstrates enrichment of the [beta]-tubulin and Hsp83 genes, which are constitutively transcribed by Pol II (20,21), in the Pol II iDNA library but not in the control libraries made in the selection minus antibody or from unselected total DNA. Table 1 also demonstrates that the rRNA genes, which are constitutively transcribed by RNA Pol I, are depleted in the Pol II iDNA library relative to the two control libraries.

Table 1 . Characterization of the Pol II iDNA library and two control libraries
Probes Positives (total number of plaques screened)
  iDNA library
selected
Pol II antibody
Control minus
library
antibody
Control `total'
DNA library
genomic
[beta]1-tubulin 71 (1.6 × 105) 0 (1.6 × 105) 2 (1.7 × 105)
hsp83 227 (1.2 × 105) 0 (1 × 105) 0 (1.2 × 105)
rDNA 6 (7.3 × 104) 45 (7.6 × 104) 118 (7.9 × 104)
Plaque counts of positively hybridizing iDNA clones employing probes specific for Pol II or Pol I constitutively transcribed genes.

In addition, we assessed the success of our cloning strategy by sequencing 24 iDNA library clones chosen at random. The phagemid sequences contain the predicted CATCH linker junction sequences between the vector cloning site and the Drosophila DNA inserts (data not shown). Approximately half of the clones had single inserts and the remainder had two and occasionally more. These inserts were always flanked by EcoRI sites and therefore are likely to represent ligation of EcoRI fragment inserts during ligation with the [lambda] vector. Table 2 contains the results of a GenBank search of these sequences. The majority of the identifiable clones are retrotransposons. This is expected, since these sequences are found in multiple copies in the Drosophila genome and are also highly transcribed. The microsatellite clone represents sequences that are very highly repeated and not efficiently trancribed by Pol II and probably represent non-specific background. The three single copy genes include calreticulin, HMG CoA reductase and the heat shock protein 70 cognate gene HSC5.

Table 2 . Eight matches to the database of 24 randomly selected Pol II iDNA clones
Clone Sequence name Accession no.a
1C Dm ORF 1 Z22588
1F Dm mgd1 retrotransposon X59545
1G Dm calreticulin X64461
2A Dm HSC5 L01503
2C Dm HMGcoA reductase M21329
2F Dm retrotransposon 297 X03431
2F Dm 1.688 g/ml satellite J01126
2G Dm copia X54147
aAccession number indicates the sequences to which the iDNA clones matched in the database.

After having established that the collection of clones in the Pol II iDNA library were enriched for sequences occupied by Pol II in vivo, their transcriptional status was determined. The same Drosophila cell line used to construct the Pol II iDNA library was used to make nuclear run-on probes in the presence or absence of the detergent sarkosyl. Sarkosyl treatment allows transcriptionally engaged (both transcribing and paused) Pol II to transcribe (12), but it prevents further rounds of initiation (28). Labeled run-on RNAs were made and hybridized to a representative group of iDNA library clones and control genes represented on the Southern blot shown in Figure 2. The rDNA control lane (Rib) shows that this RNA Pol I transcribed gene strongly hybridizes to run-on RNAs made in the presence and absence of sarkosyl. This is expected, as rDNA genes are repeated ~250 times/genome and have a high density of transcribing Pol I (29). The hsp70 control lane shows that a promoter-proximal fragment of hsp70 (BanI fragment 267-840 bp from the hsp70 transcriptional start site) hybridizes with the run-on probe made in the presence of sarkosyl, but not in its absence. This is predicted, as the hsp70 gene has been shown to have a promoter paused RNA Pol II whose transcription in nuclei is stimulated by sarkosyl. Two thirds of the tested iDNA clones showed no detectable hybridization with the run-on RNA probes and represent sequences that are transcribed at a level below detection by this assay or those that are not transcribed. Clones 1F and 2B show similar hybridization patterns in the presence and absence of sarkosyl and represent genes occupied by transcribing Pol II. 2F and 2G show a slight reduction in signal with run-on RNAs made in the presence of sarkosyl, suggesting that sarkosyl slightly impairs the processivity of Pol II on these two genes. Two restriction fragments in clone 2A and one restriction fragment in clone 3F hybridize much more strongly to run-on RNAs made in the presence, compared with the absence, of sarkosyl, suggesting the presence of transcriptionally paused Pol II. These hybridizing fragments were confirmed by hybridization with new run-on probes with fresh Southern blots of these cloned iDNAs.

Paused RNA Pol II has been found at the start of several genes, therefore iDNA sequences showing similar properties, i.e. sarkosyl-induced transcription, may represent new examples of this class of genes. Clone 2A (Fig. 2) contains two DNA fragments that hybridize with nuclear run-on probes in the presence of sarkosyl, a smaller strongly hybridizing fragment and a larger weaker hybridizing fragment. The smaller hybridizing fragment has not been identified, whereas the larger fragment is identical to the genomic sequence of the 5'-end of HSC5. The HSC5 gene is a single copy gene and encodes a 74 kDa heat shock cognate protein. It is also a member of the heat shock protein 70 multigene family, but is not itself induced by heat shock. The HSC5 fragment in clone 2A starts at a HaeIII site 3 bp downstream of the transcriptional start site and ends ~300 bp downstream.

We mapped the pause site on HSC5 in a nuclear run-on assay using a genomic clone (pH6-8) (24) containing the 5'-end of the gene (Fig. 3). Lanes 4-6 are controls: lane 4 is a fragment of highly transcribed, repeated histone H4 gene and is transcribed in nuclear run-on assays in the presence or absence of sarkosyl (9); lanes 5 and 6 show different restriction digests of hsp70, both of which demonstrate a positive signal on promotor-proximal restriction fragments only in the presence of sarkosyl (compare lower with upper panels), consistent with paused polymerase described in previous reports (9,12). There is some expression of HSC5 in the absence of sarkosyl, compatable with previous studies showing that the gene is constitutively transcribed, however, there is a marked increase in signal in the presence of sarkosyl. We have mapped the sarkosyl-induced signal by hybridization using different restriction digests of HSC5 (Fig. 3). Most telling is lane 3 (fragment D), which hybridizes the bulk of the HSC5-specific sequences in the sarkosyl-induced run-on RNA. This maps the paused Pol II to the extreme 5'-end of the gene.

In order to assess the frequencies with which sequences that contain transcriptionally engaged Pol II also have paused Pol II we examined another larger set of 126 iDNA clones by nuclear run-on assay. In this new set we found 191 inserts, indicating that on average there were 1.5 inserts/clone. By nuclear run-on assay (similar to those in Fig. 2) we find that 17% have reproducibly detectable engaged RNA polymerase. Of these, 21% have engaged Pol II whose transcription is stimulated by sarkosyl. These positives were tested independently by two of the authors (T.O'Brien and K.Hirayoshi). These estimates are of course limited by the sensitivity of the run-on assay and therefore are minimum estimates.

DISCUSSION

In previous studies we and others have demonstrated that in vivo cross-linking of proteins to DNA can be used to analytically measure the relative concentration of proteins on specific chromosomal DNA segments (1-8). The specificity of these methods is provided by stringent purification and washing of protein-DNA adducts and by immunoprecipitation with antibodies raised against particular chromosomal proteins. While the immunoprecipitation can be done without prior cross-linking (30,31), the cross-linking step with intact cells is particularly critical to ensure that the protein-DNA complexes are a result of true in vivo interactions and not an artifact of rearrangement during cell and nuclear lysis and subsequent purification. Here we present an extension of these methods that allows preparation of libraries of DNA sequences that are enriched for sequences that interact directly with a specific protein in vivo. This provides a means of isolating new DNA targets of nuclear proteins.

The methodology we have developed for making iDNA libraries for Pol II should be applicable to other DNA binding proteins that cross-link to DNA in vivo. Indeed, UV light treatment of cells and embryos has been shown to cross-link several proteins to specific DNAs. These proteins include RNA Pol II (10), topoisomerase I (3), B52 (32) GAGA factor (33), eve (8), ftz (8) and zeste (8). UV irradiation creates stable cross-links between protein and DNA at an efficiency of ~1 cross-link/60 kb (10). While the use of UV light to couple proteins to DNA is known to cross-link a variety of proteins to DNA, the approach in principle could be used with a variety of cross-linking reagents. Of particular interest is the use of formaldehyde, which has proven very effective at efficiently coupling a variety of proteins to DNA in vivo (4,6,8).

One concern in the cloning of DNA from cross-linked protein-DNA adducts is the damage to the DNA caused by the cross-linking agent, both damage to the DNA in general and, in particular, at the site of the protein cross-link. This can be minimized in several ways. First, the dosage of UV light (or chemical cross-linking agent) should be kept as low as possible. Second, the target DNA segment used for cloning should be as short as possible (here we use HaeIII fragments that are on average ~300 bp). Third, the protein-DNA cross-link should be to one strand of the DNA duplex, so that the complementary strand is available for amplification by the approach described here. Finally, while UV light cross-links are not readily reversible, some cross-linking agents, like formaldehyde, produce reversible cross-links.

The Drosophila genome is 1/20 the size of a mammalian genome. To apply this approach to more complex genomes may require additional steps to reduce background. This could include more washes of immunoprecipitated adducts or resuspension and re-immunoprecipitation with the same or a second antibody to a different epitope of the protein of interest.

We appreciate the limits to the conclusions about promoter pausing that can be drawn from our analysis of the Pol II iDNA library. First, the run-on assay used to analyze iDNA clones is of limited sensitivity and only allows examination of abundantly transcribed genes. Using the hsp70 gene as a standard [it is present at six copies/genome (34) and has one Pol II/gene in non-heat shocked cells (23)] we estimate we can only reproducibly detect >0.1 Pol II molecules on a single copy DNA sequence. Therefore, the many genes expressed at lower levels are undetectable and may in part account for the failure to detect signals in nuclear run-on assays in 83% of the cloned inserts in the iDNA library. Alternatively, some of these negative clones could represent chromatin-bound but non-engaged Pol II or background in the purification and cloning. Second, nuclear run-on assays are only one way of examining pausing. Several techniques [cross-linking (2), permanganate mapping (35) and transcript analysis (14,15)] in addition to run-on assays have been previously used to rigorously demonstrate and characterize promoter paused Pol II on several genes. While these other techniques do not lend themselves to large scale screening of the iDNA library, sarkosyl-stimulated run-on does and serves as a good first indicator of this class of Pol II. Third, there is no guarantee that the sarkosyl-stimulated Pol II complexes (suggestive of paused Pol II) are all situated in a promoter-proximal position. Indeed, pausing at other positions in transcription units will also be of interest. One of the iDNA cloned segements did match a known single copy gene, HSC5, and fine structure mapping located the Pol II pause site to a similar position relative to the promoter as the pause site for the hsp70 gene. Interestingly, sequence comparison of the hsp70 and HSC5 promoters reveals only the TATA box in common, i.e. there are no classical heat shock elements nor GAGA factor binding sites (which are critical for pausing on hsp70) in the HSC5 promoter (24). Although paused Pol II appears to be relatively common in Drosophila and mammals, the rules that specify pausing are not simply identified by sequence comparisons.

In this paper an iDNA library was generated to map the distribution of RNA Pol II on Drosophila chromatin. The use of Pol II provided a predictable test of the method. The constitutively active Hsp83 and [beta]1-tubulin genes were found to be greatly enriched in our library, whereas rDNA, which is transcribed by Pol I, was depleted. In addition to this test of the method, we have analyzed a random sample of the resulting clones by nuclear run-on assay to determine the relative distribution of transcribing and paused Pol II. While pausing was discovered to occur on over half of the small collection of previously cloned abundantly transcribed genes that were tested (13), we did not have an unbiased estimate of the frequency of paused Pol II. Our result that 21% of the detectable transcribing Pol II complexes are paused indicates that a high fraction of chromosome-associated Pol II is in this configuration. This is surprising, but not inconsistent with other results. Greenleaf and colleagues examined the distribution of hyper-phosphorylated and hypo-phosphorylated Pol II on Drosophila polytene chromosomes (36). While large developmental puffs contain transcribing hyper-phosphorylated Pol II, many interbands had predominantly hypo-phosphorylated Pol II. This hypo-phosphorylated Pol II has been demonstrated to be the form of Pol II at the promoter paused sites of a variety of genes (37). Therefore, these immunofluorescence studies would be consistent with the broad distribution of pausing we have seen in this iDNA library screen.

ACKNOWLEDGEMENTS

We thank Janis Werner and Jill Sangree for excellent technical assistance, Karen Palter for the HSC5 genomic clone and Arno Greenleaf for Pol II antibody. This work was supported by an NIH grant (GM25232) and an NIH postdoctoral fellowship to A.L.

REFERENCES

1 Gilmour,D.S. and Lis,J.T. (1984) Proc. Natl. Acad. Sci. USA, 81, 4275-4279. MEDLINE Abstract

2 Gilmour,D.S. and Lis,J.T. (1986) Mol. Cell. Biol., 6, 3984-3989. MEDLINE Abstract

3 Gilmour,D.S., Pflugfelder,G., Wang,J.C. and Lis,J.T. (1986) Cell, 44, 401-408. MEDLINE Abstract

4 Orlando,V. and Paro,R. (1993) Cell, 75, 1187-1198. MEDLINE Abstract

5 Aparicio,O.M., Weinstein,D.M. and Bell,S.P. (1997) Cell, 91, 59-69. MEDLINE Abstract

6 Solomon,M.J., Larsen,P.L. and Varshavsky,A. (1988) Cell, 53, 937-947. MEDLINE Abstract

7 Hecht,A., Strahl Bolsinger,S. and Grunstein,M. (1996) Nature, 383, 92-96. MEDLINE Abstract

8 Walter,J., Dever,C.A. and Biggin,M.D. (1994) Genes Dev., 8, 1678-1692. MEDLINE Abstract

9 O'Brien,T. and Lis,J.T. (1993) Mol. Cell. Biol., 13, 3456-3463. MEDLINE Abstract

10 Gilmour,D.S. and Lis,J.T. (1985) Mol. Cell. Biol., 5, 2009-2018. MEDLINE Abstract

11 Lis,J.T., Neckameyer,W., Dubensky,R. and Costlow,N. (1981) Gene, 15, 67-80. MEDLINE Abstract

12 Rougvie,A.E. and Lis,J.T. (1988) Cell, 54, 795-804. MEDLINE Abstract

13 Rougvie,A.E. and Lis,J.T. (1990) Mol. Cell. Biol., 10, 6041-6045. MEDLINE Abstract

14 Rasmussen,E.B. and Lis,J.T. (1993) Proc. Natl. Acad. Sci. USA, 90, 7923-7927. MEDLINE Abstract

15 Rasmussen,E.B. and Lis,J.T. (1995) J. Mol. Biol., 252, 522-535. MEDLINE Abstract

16 Strobl,L.J. and Eick,D. (1992) EMBO J., 11, 3307-3314. MEDLINE Abstract

17 Krumm,A., Meulia,T., Brunvand,M. and Groudine,M. (1992) Genes Dev., 6, 2201-2213. MEDLINE Abstract

18 Brown,S.A., Imbalzano,A.N. and Kingston,R.E. (1996) Genes Dev., 10, 1479-1490. MEDLINE Abstract

19 Krumm,A., Hickey,L.B. and Groudine,M. (1995) Genes Dev., 9, 559-572. MEDLINE Abstract

20 Bialojan,S., Falkenburg,D. and Renkawitz-Pohl,R. (1984) EMBO J., 3, 2543-2548. MEDLINE Abstract

21 O'Connor,D. and Lis,J.T. (1981) Nucleic Acids. Res., 9, 5075-5092. MEDLINE Abstract

22 Wellauer,P.K. and Dawid,I.B. (1978) J. Mol. Biol., 126, 769-782. MEDLINE Abstract

23 O'Brien,T. and Lis,J.T. (1991) Mol. Cell. Biol., 11, 5285-5290. MEDLINE Abstract

24 Rubin,D.M., Mehta,A.D., Zhu,J., Shoham,S., Chen,X., Wells,Q.R. and Palter,K.B. (1993) Gene, 128, 155-163. MEDLINE Abstract

25 Weeks,J.R., Coulter,D.E. and Greenleaf,A.L. (1982) J. Biol. Chem., 257, 5884-5891. MEDLINE Abstract

26 Gilmour,D., Rougvie,A. and Lis,J. (1991) In Hamkalo,B.A. and Elgin,S.C.R. (eds), Methods in Cell Biology. Functional Organization of the Nucleus: A Laboratory Guide. Academic Press Inc., San Diego, CA, Vol. 35, pp. 369-381.

27 Kinzler,K.W. and Vogelstein,B. (1989) Nucleic Acids Res., 17, 3645-3653. MEDLINE Abstract

28 Hawley,D.K. and Roeder,R.G. (1985) J. Biol. Chem., 260, 8163-8172. MEDLINE Abstract

29 Spradling,A.C. and Rubin,G.M. (1981) Annu. Rev. Genet., 15, 219-264. MEDLINE Abstract

30 Gould,A.P., Brookman,J.J., Strutt,D.I. and White,R.A.H. (1990) Nature, 348, 308-312. MEDLINE Abstract

31 Graba,Y., Aragnol,D., Laurenti,P., Garzino,V., Charmot,D., Berenger,H. and Pradel,J. (1992) EMBO J., 11, 3375-3384. MEDLINE Abstract

32 Champlin,D.T., Frasch,M., Saumweber,H. and Lis,J.T. (1991) Genes Dev., 5, 1611-1621. MEDLINE Abstract

33 O'Brien,T., Wilkins,R.C., Giardina,C. and Lis,J.T. (1995) Genes Dev., 9, 1098-1110. MEDLINE Abstract

34 Mirault,M.-E., Goldschmidt-Clermont,M., Artavanis-Tsakonas,S. and Schedl,P. (1979) Proc. Natl. Acad. Sci. USA, 76, 5254-5258. MEDLINE Abstract

35 Giardina,C., Perez Riba,M. and Lis,J.T. (1992) Genes Dev., 6, 2190-2200. MEDLINE Abstract

36 Weeks,J.R., Hardin,S.E., Shen,J., Lee,J.M. and Greenleaf,A.L. (1993) Genes Dev., 7, 2329-2344. MEDLINE Abstract

37 O'Brien,T., Hardin,S., Greenleaf,A. and Lis,J.T. (1994) Nature, 370, 75-77. MEDLINE Abstract


*To whom correspondence should be addressed. Tel: +1 607 255 2442; Fax: +1 607 255 2428; Email: jtl10@cornell.edu


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals Comments and feedback: www-admin{at}oup.co.uk
Last modification: 6 Feb 1998
Copyright© Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
K. Sanyal, M. Baum, and J. Carbon
Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique
PNAS, August 3, 2004; 101(31): 11374 - 11379.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
C.-H. Wu, Y. Yamaguchi, L. R. Benjamin, M. Horvat-Gordon, J. Washinsky, E. Enerly, J. Larsson, A. Lambertsson, H. Handa, and D. Gilmour
NELF and DSIF cause promoter proximal pausing on the hsp70 promoter in Drosophila
Genes & Dev., June 1, 2003; 17(11): 1402 - 1414.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. R. Eberhardy and P. J. Farnham
Myc Recruits P-TEFb to Mediate the Final Step in the Transcriptional Activation of the cad Promoter
J. Biol. Chem., October 11, 2002; 277(42): 40156 - 40162.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. R. Eberhardy and P. J. Farnham
c-Myc Mediates Activation of the cad Promoter via a Post-RNA Polymerase II Recruitment Mechanism
J. Biol. Chem., December 14, 2001; 276(51): 48562 - 48571.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
J. T. Lis, P. Mason, J. Peng, D. H. Price, and J. Werner
P-TEFb kinase recruitment and function at heat shock loci
Genes & Dev., April 1, 2000; 14(7): 792 - 803.
[Abstract] [Full Text]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (153K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (20)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Law, A.
Right arrow Articles by Lis, J. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Law, A.
Right arrow Articles by Lis, J. T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?