ABSTRACT
We have developed a high efficiency system in which mammalian extracts join DNA double-strand breaks with non-complementary termini. This system has been used to obtain a large number of junction sequences from a range of different break-end combinations, allowing the elucidation of the joining mechanisms. Using an extract of calf thymus it was found that the major mechanism of joining was by blunt-end ligation following removal or fill-in of the single-stranded bases. However, some break-end combinations were joined through an efficient mechanism using short repeat sequences and we have succeeded in separating this mechanism from blunt-end joining by the biochemical fractionation of extracts. Characterization of activities and sequence data in an extensively purified fraction that will join ends by the repeat mechanism led to a model where joining is initiated by 3' strand invasion followed by pairing to short repeat sequences close to the break site. Thus the joining of double-strand breaks by mammalian extracts is achieved by several mechanisms and this system will allow the purification of the factors involved in each by the judicial choice of the non-complementary ends used in the assay.
The processes of DNA replication and transcription rely on the presence of a continuous DNA template. For this reason double-strand breaks (dsb) in the DNA helix can have catastrophic consequences for the cell if they are not rapidly repaired. Previous work has shown that in lower eukaryotes the presence of a single unrepaired dsb can be lethal (1 ), even when the break is in a piece of DNA that is not essential for viability (2 ). Additionally, in higher eukaryotes, agents that cause a high proportion of dsbs, such as ionizing radiation, are effective in causing genetic changes such as complex mutations and chromosomal aberrations (3 ,4 ). For such reasons the efficient repair of dsbs is crucial for the maintenance of genomic stability in all organisms.
In lower eukaryotes such as the yeast Saccharomyces cerevisiae, the major pathway for the removal of dsbs appears to be through the mechanism of homologous recombination (5 ). However, in higher eukaryotes many studies have shown that dsbs are commonly joined through end-to-end mechanisms by a variety of processes termed non-homologous or illegitimate recombination. The majority of these studies have utilized site-specific DNA dsbs produced by cutting DNA with restriction enzymes to produce DNA termini with defined end structures. This has been achieved by either transfection of linear DNA into cultured cells (6 -9 ) or direct electroporation of enzymes into cells (10 ,11 ). Additionally, the rejoining of linear plasmids after incubation in cell-free extracts from either mammalian sources (12 -16 ) or from Xenopus laevis (17 -20 ) has been studied. In all cases, it has been shown that the majority of plasmids cut to produce complementary ends are joined faithfully by cells or cell-free extracts from normal tissue sources.
However, to examine the pathways of joining used when such ends are not available, as in the breaks caused by DNA damaging agents, junctions produced by the joining of non-complementary ends have also been examined. In general, three major mechanisms have been described. The first (fill-in mechanism) requires the filling in of single-stranded overhangs followed by blunt-end ligation. The second (overlap mechanism) appears to proceed through the pairing of one or more bases in the termini, followed by filling in of the missing bases combined with correction of the mismatched bases. The third mechanism (repeat mechanism) brings together short regions of homology with subsequent removal of the displaced single-stranded sequences. However, although these three mechanisms have been found in a variety of different systems, the proportions of each varies widely. In addition, no attempt has been made to date to identify or purify the proteins involved in any of these pathways.
Previous studies of non-complementary end joining have used a strategy where a single DNA molecule is restricted with two separate enzymes and the (circular) products analysed after transfection into bacteria. However, a large fraction of joined products occur as linear dimers and multimers rather than circular monomeric products and it is possible that the analysis of circular products alone may detect only joining mechanisms that occur in conjunction with the process of recircularization. In addition, our extract fractionation studies have shown that many partially purified fractions lost the ability to form circular products, while the formation of linear forms was not affected. Therefore, to avoid both these problems we developed a system based on the formation of linear dimers.
In this report we present a systematic analysis of the junctions in linear molecules produced from DNA with a variety of non-complementary end termini by mammalian extracts. In addition we present the extensive purification and characterization of an activity that joins non-complementary ends through a mechanism that utilizes short repeat sequences.
Thymus glands (500 g) from newly slaughtered calves (3-6 months old) were frozen in liquid nitrogen and stored at -70oC. Protease inhibitors (1 [mu]g/ml pepstatin A and 1 [mu]g/ml leupeptin) were added to all buffers directly before use and all steps were carried out at 4oC. The frozen thymus was disrupted in a Waring blender in 750 ml homogenization buffer (100 mM Tris-HCl, pH 7.5, 20 mM phosphate buffer, pH 7.5, 100 mM KCl, 1 mM EDTA, 1 mM DTT, 1 mM PMSF) until homogeneous. This was stirred for 30 min and centrifuged at 10 000 g for 30 min. The supernatant was filtered through gauze and added to 1 l phosphocellulose resin (100 g dry weight) which had been prepared according to the manufacturer's instructions (Whatman) and equilibrated in buffer A (25 mM Tris-HCl, pH 7.5, 1 mM EDTA, 0.01% Nonidet P-40, 10% glycerol, 1 mM DTT, 1 mM PMSF) containing 25 mM NaCl. The slurry was stirred for 15 min and filtered through Whatman No. 1 paper. The cake of resin was resuspended in ~600 ml buffer A/150 mM NaCl, restirred and filtered and the protein eluted with 600 ml buffer A/600 mM NaCl. To this wash was added solid ammonium sulphate to 40% saturation and the precipitated protein recovered by centrifugation at 10 000 g for 30 min. The supernatant was adjusted to 70% saturation and the proteins precipitated and recovered as before. For activity assays this pellet (Fraction I) was resuspended in buffer A/25 mM NaCl, dialysed overnight against the same buffer and stored as small aliquots at -70oC. For fractionation studies, the ammonium sulphate pellet was stored at -70oC.
All columns were run at 4oC and all dialysis steps were against buffer A/25 mM NaCl for 14-17 h. Fraction I was dissolved in buffer A/25 mM NaCl (~20 mg/ml) and loaded onto a Superose 6 FPLC column (Pharmacia). Pooled active fractions (II) were loaded onto a native DNA cellulose (US Biochemicals) column (4 mg/ml resin) in the same buffer and the protein eluted at 200 mM and 1 M NaCl. After dialysis the active fraction (1 M) was adjusted to buffer A/0.9 M ammonium sulphate/25 mM NaCl and loaded onto a FPLC Phenyl Superose column (Pharmacia) in the same buffer. The column was eluted in steps and the activity (Fraction IV) eluted in buffer A/25 mM NaCl. This was dialysed against the same buffer containing 20% sucrose and loaded onto a FPLC Mono S ion exchange column (Pharmacia). Proteins were eluted using a gradient of buffer A containing 25-400 mM NaCl. Activity from this column could only be detected after the addition of T4 DNA ligase to the joined products before electrophoresis (see below).
pUC18 plasmid was propagated in Escherichia coli HB101, isolated using alkali lysis and purified by double banding on caesium chloride/ethidium bromide gradients. Plasmids were linearized with restriction enzymes as indicated and the digestion monitored by agarose gel electrophoresis. Bacteriophage [lambda] DNA was restricted with KpnI, the unique 1.5 kb fragment isolated and cloned into the multicloning site of pUC18. This plasmid (pBK-1) was then grown in E.coli HB101 and isolated as described above. The 1.5 kb [lambda] fragment was released by digestion with KpnI and purified by gel isolation.
Reactions (25 [mu]l) contained 70 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 10 mM DTT, 4 mM ATP, 40 mM phosphocreatine, 1 [mu]g creatine phosphokinase, 100 [mu]M each dATP, dCTP, dGTP and dTTP, 45 pM both linear pUC18 (1 [mu]g) and [lambda]/KpnI (0.55 [mu]g) and protein extracts as indicated. Reactions were set up on ice and incubated for 15 min at 37oC unless otherwise stated. Reactions were terminated by addition of an equal volume of 100 mM EDTA/1% SDS and the DNA purified as described previously (14 ). Where indicated, the precipitated DNA products were resuspended in 1* One-Phor-All buffer (Pharmacia)and incubated with 2 U T4 DNA ligase for 10 min at 37oC before addition of gel loading buffer (2.5% Ficoll, 0.025% bromophenol blue). In all cases the joined products were separated by agarose gel electrophoresis (1%) in 1* TBE (89 mM Tris base, 89 mM orthoboric acid, 2 mM EDTA) containing 0.5 [mu]g/ml ethidium bromide for 15-17 h at 2.25 V/cm.
For calculation of specific activities (Table 2 ), an image of the gel was produced using a charge coupled device (CCD) camera and the image was stored as a TIFF file (Apple Macintosh). The relative amount of pUC/[lambda] DNA forms was calculated as a percentage of the unjoined pUC + pUC/[lambda] forms using the Collage software package (Fotodyne). For gels in which the joined product was not visible by this method, the gels were blotted onto reinforced nitrocellulose (Sartorius) and probed using the Enhanced Chemiluminescence (ECL; Amersham) kit with pUC18 labelled with horseradish peroxidase and hybridized according to the manufacturer's instructions. Blots were washed at a final stringency of 0.4% SDS at 55oC for 10 min and exposed to X-ray film at -70oC.
To obtain sufficient material for further analysis, multiple joining reactions (2 -8 ) were performed and the reactions combined before gel electrophoresis. Sequence analysis was performed on joined products produced by several different extracts in separate experiments. The product of non-complementary joining was excised from the gel and purified using Gene Clean II (Bio 101) according to the manufacturer's instructions.
As two linear molecules are used in the reaction, there are four different orientations in which the ends could combine (Fig. 1 ). To be able to study these four orientations separately, PCR primers were designed which would selectively amplify the junction in each orientation in each case. To amplify the junction produced in orientation 1, the forward pUC primer 196 (5'-CTTCGCTATTACGCCAGCTGGCGA) was used in combination with the [lambda] primer LK3 (5'-GCAATCCTCCGGCCTTTTCCCTGA). For orientation 2, primer 196 was used in combination with the [lambda] primer LK4 (5'-ACGTGCCGTCGTCGTTCTCACGA). For orientation 3, the reverse pUC primer RC (5'-TTGTGAGCGGATAACAATTTC) was used with LK3 and orientation 4 was amplified using the primers RC and LK4. PCR reactions (25 or 50 [mu]l) contained 10 mM Tris-HCl, pH 8.3, 1.5 mM MgCl2, 50 mM KCl, 200 [mu]M dATP, dCTP, dGTP and dTTP, 1.5% DMSO, 400 nM each primer, 1-5 ng DNA and 1 U Taq polymerase (Boehringer Mannheim). Routinely, the initial denaturation step was at 94oC for 5 min followed by 30 cycles of 55oC for 30 s, 72oC for 1 min, 94oC for 30 s, followed by a final cycle of 55oC for 30 s, 72oC for 5 min. Aliquots of 1-2 ng of PCR products were cloned into pCRII using a TA Cloning kit (Invitrogen), single transformants picked and the orientation of the cloned fragment checked by PCR analysis. Single-stranded DNA was prepared from suitable clones using 5 * 109 p.f.u. of the helper phage M13K07 (Pharmacia) and sequenced using a USB Sequenase 2 kit (Amersham) according to the manufacturer's instructions.
SDS-PAGE was performed according to Laemmli (21 ) and gels stained using a Silver Stain Plus kit (BioRad). Western blots were treated and immunodetected using the ECL system (Amersham); antibodies were checked against a positive control (human 293 cell extract) and shown to cross-react with Fraction I of calf thymus extracts. Protein concentrations were measured using the assay of Bradford (22 ) using BSA as a standard.
To detect 3' -> 5' exonuclease activity, labelled templates were made by filling in pUC18/XbaI with Klenow fragment and [[alpha]-32P]CTP. To detect 5' -> 3' exonuclease activity, aliquots of the pUC18/SacI or the [lambda]/KpnI fragment used for joining reactions were dephosphorylated with calf intestinal phosphatase and end-labelled with [[gamma]-32P]ATP and T4 polynucleotide kinase using standard protocols (23 ). Reactions (80 [mu]l) contained 20 000 c.p.m. of appropriately labelled template in 70 mM Tris-HCl, pH 7.5, 10 mM DTT, 10 mM MgCl2, 40 [mu]g/ml BSA and various volumes of column fractions. Aliquots of each reaction were removed at 0 and 60 min and stopped by addition of an equal volume of 20 mM EDTA/1 [mu]g/ml sheared salmon sperm DNA. The DNA was precipitated with 1 ml 8% TCA/1% sodium pyrophosphate on ice for 30 min, filtered onto GF-C filters and counted in a scintillation counter.
To ensure that joining was not occurring by homologous recombination, reactions contained two DNA substrates that were unrelated (Fig. 1 ). One substrate was pUC18 (pUC), chosen for the availability of unique restriction sites (SacI, EcoRI, HincII, PstI, HindIII and BamHI) in the multicloning site. The second substrate, a KpnI-KpnI fragment from bacteriophage [lambda], was selected because of its convenient size (1.5 kb) and absence of cross-reactivity to pUC as determined by Southern analysis. It was possible to monitor the joining of DNA with complementary and non-complementary ends concurrently from the different sizes of products. The major product of non-complementary joining is the [lambda]/pUC dimer (4.1 kb), while dimers of complementary joining were 5.2 and 3 kb for the pUC/pUC and [lambda]/[lambda] dimers respectively. Probing the resulting Southern blots with pUC increased the sensitivity of the assay and simplified the identification of the various joined forms (see Fig. 2 ).
Unfractionated extracts from calf thymus would not join substrates with non-complementary termini and the extract preparation was modified to include a batch elution step from phosphocellulose resin and a 40-70% precipitation using ammonium sulphate (Materials and Methods) to give Fraction I. This fraction was then incubated with [lambda]/KpnI and pUC restricted with various enzymes to give either 3'-3', 3'-5' or 3'-blunt combinations to estimate the amount of [lambda]/pUC dimer produced. The greatest amount of [lambda]/pUC dimer was produced by the combination of [lambda]/KpnI fragment with pUC/SacI and this combination was used to determine optimal joining conditions for Fraction I. Production of [lambda]/pUC dimer increased with protein concentration to 10 [mu]g and a time course study showed an optimal incubation time of 15 min at 37oC (data not shown). Although longer incubations and higher extract concentrations resulted in the formation of more product, nuclease activity also increased, making dimer production difficult to quantitate. Using these optimal conditions, Fraction I was found to join all the end combinations with similar efficiency; however, the KpnI/SacI combination routinely gave ~3-fold more non-complementary joined product than any of the other combinations tested (Fig. 2 a).
Dimers were gel purified, amplified by PCR and cloned to determine by sequencing the mechanisms of joining of the different end combinations by Fraction I (Table 1 a). The KpnI/SacI (3'-3'), KpnI/EcoRI (3'-5') and KpnI/HincII (3'-blunt) combinations were assayed in each of the four orientations (see Materials and Methods and Fig. 1 ), while KpnI/PstI (3'-3'), KpnI/HindIII (3'-5') and KpnI/BamHI (3'-5') termini were analysed in a single orientation. It is clear that the predominant mechanism used for the joining of non-complementary termini proceeded by a final step of blunt-end ligation. This was achieved by either removal of the single-stranded 3' regions and/or by the fill-in of the 5' overhangs respectively (shown schematically in Fig. 3 a). Moreover, this mechanism was observed in all four orientations for the 3'-5' (KpnI/EcoRI) and 3'-blunt (KpnI/HincII) combinations, indicating that the DNA sequence adjacent to the termini had little effect on the mechanismused. However, the combination that was joined with the greatest efficiency (KpnI/SacI), showed a more varied pattern, with only one of the four orientations (no. 3) joined primarily by this mechanism. For the remaining three orientations (nos 1, 2 and 4) the sequences obtained indicated that the majority of junctions (55-100%) had been joined through a homology-dependent mechanism involving the pairing of short (2-6 bp) repeat sequences. A model proposed for this repeat-driven mechanism (13 ) is shown in Figure 3 b. This variability in mechanism observed between the orientations indicates that for this combination, the sequence around the termini has a marked affect on the mechanism used. The sequences at each junction obtained for KpnI/SacI and the positions and lengths of the repeat sequences used in each orientation are shown schematically in Figure 4 . For all combinations of ends used, a small percentage of junctions was found that did not fall into either of these two categories (termed `other' in Table 1 ). Such sequences showed small deletions that had no homology at the junction site. Interestingly, using this system, no sequences were found that had been joined using the overlap mechanism.
Table 3
As seen in Figure 2 b, an optimal concentration of Fraction V produced a strong rejoin reaction for the KpnI/SacI combination, while only a small amount of product was observed for the other 3'-3' combination used (KpnI/PstI). No [lambda]/pUC dimer product was observed for either the 3'-5' (KpnI/EcoRI, KpnI/SalI) or the 3'-blunt (KpnI/HincII) combinations.
Sequence analysis was limited to the KpnI/SacI combination, since insufficient joined product was produced by the other combinations used. Compared with Fraction I, a larger proportion of orientations 1, 2 and 4 were joined through short repeats and orientation 3 now showed an increased use of the repeat mechanism (use of a 3 bp repeat and increased use of 1 bp `repeats') as well as random deletions (Fig. 4 ). A comparison of the repeat sequences used for all four orientations by Fractions I and V is summarized in Figure 6 a.
Using the KpnI/SacI combination, a comparison of Fractions I and V showed that both fractions were equally sensitive to NaCl and both showed maximum joining at 37oC, with no product at 16oC. Assay of Fraction V in the absence of ATP resulted in the formation of a small amount of [lambda]/pUC dimer, which could be increased to an optimum at 1 mM. Addition of the polymerase inhibitors aphidicolin or ddTTP had no effect. ATPase activity was not observed in Fraction V in the presence or absence of added DNA. DNA binding assays performed across the final fractions showed activity in a wider peak than that of non-complementary joining activity on both single- and double- stranded templates (data not shown). No DNA topoisomerase I and II activity was observed in Fraction V, as assayed by the relaxation of supercoiled plasmid or the decatenation of kDNA respectively (25 ). Immunoblots of fractions across the Mono S column showed no reaction with antibodies raised against the end-binding protein Ku70 or the structure-specific nucleases FEN-1, XP-G and ERCC1, although these proteins were detected in Fraction I and some were detected with partial purification (data not shown). Monitoring the conversion of closed circle to open circle pUC indicated the absence of any gross endonuclease activity. However, as the mechanism of joining suggests that a nuclease activity would be required to remove the single-stranded overhangs, fractions across the final Mono S column were tested on appropriately labelled templates to detect the presence of 3' -> 5' or 5' -> 3' exonucleases respectively (see Materials and Methods). In all preparations, weak 3' -> 5' and 5' -> 3' exonuclease activities which peaked with the joining activity could be detected (data not shown). Aliquots of the same fractions assayed for joining activity were separated by SDS-PAGE and detected by silver staining. Although several protein bands are contained in the most active fractions, the band that most closely follows the distribution of activity in separate preparations from calf thymus indicates a protein of ~73 kDa (Fig. 5 ).
Figure
Figure
We have developed an efficient cell-free system to examine the joining of non-complementary termini by mammalian enzymes. Two of the three major mechanisms of non-complementary end joining identified previously, the `fill-in' and `repeat' mechanisms (see Introduction), were found following examination of a total of 139 junctions joined by Fraction I. We did not observe a third (`overlap') mechanism, where one or more terminal bases can pair, followed by filling in and correction of mismatched bases. The latter mechanism has previously been found in systems relying on the formation of circular products in Xenopus extracts (17 ,18 ,20 ) or after mammalian cell transfection (6 -8 ). In these situations there is a high probability that the naked input DNA will be converted to a chromatin-like structure (26 ,27 ) and this may stabilize the required alignment of one or two bases (28 ). In our system, where there is the possibility of 1 or 2 bp overlaps in the KpnI/SacI and KpnI/PstI combinations respectively, chromatinization may not occur during the short incubation times used. Additionally, we cannot completely rule out some bias in the recovery of different joined products with PCR amplification, but the fact that the proportions of different joined products change with fractionation (Table 1 ) suggests that our methods do not strongly favour the amplification of one product relative to others.
In this system the major mechanism used for the joining of linear DNA substrates proceeds through a final step of blunt-end ligation. However, if the terminal bases on the 3' overhang can pair with a complementary sequence <10 bp from the break site, then joining occurs through a pathway utilizing short regions of homology, as described in previous studies with both mammalian cell extracts (13 ,16 ) and whole cells (7 ,10 ,11 ,29 ). A possible explanation why the KpnI/SacI combination showed the greatest proportion of joining using short repeats is that in the pUC polylinker the KpnI and SacI sites are adjacent (Fig. 6 a). Therefore, in orientations 1 and 2 (which differ in the orientation of the [lambda] sequence) the 3' [lambda] overhang has only to find a repeat 6 bp into the pUC sequence to form a 4 bp homology. In orientation 2 this same pairing distance fortuitously forms a potentially more stable 6 + 3 bp region of homology, which probably accounts for the 100% use of this mechanism as compared with the 55% usage for orientation 1. In orientations 3 and 4 the adjacent KpnI site is not present; however, in orientation 4 the 3' termini from pUC can form a 2 + 3 bp repeat, again situated 6 bp into the [lambda] sequence. This repeat is used exclusively by Fraction I, but a greater range of repeat sequences is used by the purified Fraction V, indicating that factors in the crude extract may be influencing which repeat sequences are used.
Surprisingly, with the other 3'-3' combination (KpnI/PstI) examined with Fraction I repeats of >1 bp were not used, despite the presence of a 5 bp homology present 2 bp from the ends of both substrates. This shows that for the repeat mechanism to occur efficiently under our conditions, the homology must be complete to the end of the 3' overhang, suggesting that this process is initiated by strand invasion of the free 3'-end into the helix of the DNA strand to be joined. In addition, the extensively purified Fraction V did not appear to be affected by the inclusion of polymerase inhibitors, indicating that DNA synthesis is not required and unpaired bases are removed at the point where the DNA is again correctly base paired. Taking these factors into account, we propose a new model for repeat-driven joining of non-complementary termini (Fig. 6 b) in which, following strand invasion, the single-stranded ends are removed by precise nuclease action, removing the need for a DNA polymerase.
We looked in Fraction V for the presence of several endonucleases that may cleave at single- to double-strand transitions, as predicted by the model for repeat-driven joining. Surprisingly, neither the flap endonuclease FEN-1 (also identified as DNase IV and MF-1; 30 ) nor the two structure-specific endonucleases involved in excision repair, ERCC1 and XP-G, were found to be present by Western analysis. This result indicates that the weak 3' -> 5' and 5' -> 3' exonuclease activities detected in the active fractions are sufficient for the removal of the single-stranded tails needed for this mechanism to occur.
A protein that has been shown to have a major role in the ability of mammalian cells to recover from DNA double-strand breaks is the DNA-dependent protein kinase complex (DNA-PK). This complex consists of a heterodimer of two subunits (Ku70 and Ku86) which can bind to the ends of broken DNA where it then recruits a further subunit (p450) (for a review see 31 ). It is of interest that we could not detect the Ku70 protein in Fraction V, indicating either that the repeat-driven joining mechanism does not require this protein or that Ku is not directly involved in the joining process but acts as part of a DNA damage signalling pathway.
Fractionation of extracts to purify the proteins involved in the repeat mechanism resulted in the appearance a 73 kDa protein after silver staining as the only common band observed in all active fractions. Recently, Jessberger et al. (32 )have identified a 72 kDa protein (SRSP) which stimulates homologous recombination in mammalian extracts. It will be important to see if these proteins are the same, suggesting an overlap in homologous and illegitimate recombination mechanisms.
We thank Profs T.Lindahl and S.Jackson and Drs R.Wood, S.Clarkson and M.Lieber for kindly providing antibodies. We thank Dr S.Bouffler for helpful comments on the manuscript. This study was supported in part by the Commission of the European Communities contract no. F13PCT920007.
*To whom correspondence should be addressed. Tel: +44 1235 834393; Fax: +44 1235 834776
+Present address: Department of Radiation Genetics and Chemical Mutagenesis, Sylvius Laboratory, Wassenaarseweg 72, 2333 AL Leiden, The Netherlands
Fraction
Column/step
Protein (mg)
Total activity (Ua)
Sp. act. (U/mg)
I
Ammonium sulphate (40-70%)
100
5000
50
II
Superose 6
7.5
1071
143
III
Native DNA cellulose
0.65
195b
300
IV
Phenyl Superose
0.115
115c
1 000
V
Mono S
0.017
1360d
80 000
REFERENCES
Return


