ABSTRACT
Although the human
[alpha]
-globin and
[beta]
-globin genes are co-regulated in adult life, they achieve the same end by very different
mechanisms. For example, a transfected
[beta]
-globin gene is expressed in an inducible manner in mouse erythroleukemia
(MEL) cells while a transfected
[alpha]
-globin gene is constitutively expressed at a high level in induced and
uninduced MEL cells. Interestingly, when the
[alpha]
-globin gene is transferred into MEL cells as part of human chromosome 16,
it is appropriately expressed in an inducible manner. We explored the basis for
the lack of erythroid-responsiveness of the proximal regulatory elements of the human
[alpha]
-globin gene. Since the
[alpha]
-globin gene is the only functional human globin gene that lacks CACCC and
GATA-1 motifs, we asked whether their addition to the
[alpha]
-globin promoter would make the gene erythroid-responsive in MEL cells. The addition of each of these binding sites
to the
[alpha]
-globin promoter separately did not result in inducibility in MEL cells.
However, when both sites were added together, the
[alpha]
-globin gene became inducible in MEL cells. This suggests that erythroid non-responsiveness of the
[alpha]
-globin gene results from the lack of erythroid binding sites and is not
necessarily a function of the constitutively active, GC rich promoter.
The different genes of the human [alpha]- and [beta]-globin gene clusters are coordinately regulated to
produce embryonic, fetal and adult hemoglobins at specific developmental stages
(
1
,
2
). In the adult stage, the [alpha]-globin and [beta]-globin genes express equivalent levels of globin chains
to produce Hb A, the major adult hemoglobin. The molecular basis of the
coordinate regulation of these two genes has been the subject of intense
scrutiny. These studies have identified a number of important and unexpected
differences between the [alpha]- and [beta]-globin genes: (i) the [alpha]- and [beta]-globin genes are present on
different chromosomes. The [alpha]-globin gene is present on chromosome 16 (
3
) while the [beta]-globin gene is present on chromosome 11 (
4
); (ii) there are four copies of the [alpha]-globin gene in the diploid human genome and only two copies of the [beta]-globin gene (
5
,
6
); (iii) whereas the general intron-exon structure of the [alpha]-globin and [beta]-globin genes is similar, there is very little
homology between the proximal regulatory elements of the two genes; (iv) the [alpha]-globin gene is located in a GC rich domain while the [beta]-globin gene is present in an AT rich domain (
7
); (v) the [alpha]-globin promoter is constitutively hypomethylated in all cell types
(
8
,
9
) while the [beta]-globin promoter is heavily methylated in non-expressing cells (
10
-
12
); (vi) the [beta]-like globin gene cluster is in an open chromatin domain only in erythroid cells while the [alpha]-like globin gene cluster is located within a larger
chromatin domain that is open in all cell types (
13
). (vii) The [beta]-globin gene is not expressed efficiently in heterologous cells
unless it is linked to a viral enhancer (
14
-
16
). In contrast, the [alpha]-globin gene is expressed constitutively in all cell types
regardless of the presence of a linked viral enhancer (
14
,
15
,
17
); (viii) a stably integrated human [beta]-globin gene is expressed in an inducible manner in mouse
erythroleukemia (MEL) cells (
18
,
19
) while the [alpha]-globin gene is highly expressed in a constitutive and non-inducible manner (
18
); (ix) the human [beta]-globin gene with its immediate flanking sequences is expressed in a
tissue and developmental stage-specific manner when inserted in the genome of a transgenic mouse (
20
,
21
). In contrast, the [alpha]-globin gene with its immediate flanking sequences is not expressed in any
tissue in transgenic mice (
22
-
24
); (x) both the [alpha]-globin and [beta]-globin genes are under the control of distant major
regulatory elements known as locus control regions (LCRs). The [beta]LCR consists of four hypersensitive sites (HS1-HS4) (
25
-
27
) that have different activities at different developmental stages (
28
). In contrast, all the activity of the [alpha]LCR appears to be localized to a single discreet hypersensitive site
known as HS-40 (
29
,
30
).
We had previously explored the molecular basis for the differences in the
behavior of the [alpha]- and [beta]-globin genes in transient expression systems. We showed
that [alpha]-globin promoter sequences upstream of the cap site are not
sufficient to account for the constitutive activity of the [alpha]-globin gene in erythroid and non-erythroid cells (
31
). We demonstrated that sequences immediately downstream from the cap site are essential for the full enhancer-independent activity of the [alpha]-globin gene promoter (
31
). We also showed that an extended [alpha]-globin promoter that includes sequences from the structural gene is
more responsive to the enhancing activity of the [alpha]LCR than a promoter element that is limited to 5' flanking sequences (
32
). James-Pederson
et al.
have recently made similar observations during their study of the regulation of
the rabbit [alpha]-globin gene (
33
). These studies of the human and rabbit [alpha]-globin genes highlight another feature that the [alpha]-globin promoter shares with promoters of many
housekeeping genes, i.e. extension of the promoter element into the structural sequences of the gene (
34
-
36
).
A number of erythroid-specific transcription factors play important roles in mediating the
activity of the proximal and distal regulatory elements of the different globin
genes. The binding sites for two of these erythroid transcription factors, GATA-1 and EKLF [an erythroid-specific transcription factor that binds CACCC sequences (
37
)] are present in the proximal regulatory element of every human globin gene
except the [alpha]-globin gene. It is not clear whether the lack of erythroid-responsiveness of the human [alpha]-globin gene in MEL cells is dictated by the
`housekeeping' properties of its promoter or by the lack of GATA-1 and CACCC sites in its promoter. We used inducibility in MEL cells as a surrogate
marker for erythroid-responsiveness. We asked whether the insertion of these binding sites in the [alpha]-globin promoter would allow the gene to respond to erythroid
differentiation of MEL cells by increasing its level of expression beyond the
constitutively high basal level of uninduced cells. The experiments described
in this report show that addition of GATA-1 and CACCC sequences, together but not separately, can make the [alpha]-globin promoter inducible in MEL cells. This suggests that the lack of
erythroid-responsiveness of the [alpha]-globin gene is not necessarily a result of its `housekeeping'
promoter which maintains it in an active state in homologous and heterologous
cells but more likely a result of the absence of binding sites for erythroid-specific transcription factors in its promoter. The implications of these
experimental observations will be discussed within the context of our current
understanding of the regulated expression of the [alpha]-globin gene.
A 1.5 kilobase (kb)
Pst
I fragment that carries the human [alpha]1-globin gene was cloned into the
Pst
I site of the Bluescript II KS
®
plasmid vector (Stratagene, La Jolla, CA). Upstream 5' flanking sequences were deleted from the cloned [alpha]-globin gene by exonuclease III digestion according to instructions provided with the Erase-a-Base
®
kit (Promega, Madison, WI). Initially, the plasmid carrying the [alpha]-globin gene was digested with
Xba
I and the 5' ends were protected by [alpha]-phosphorothioate incorporation. The plasmid DNA was then
digested with
Bam
HI followed by timed exonuclease III digestions. A plasmid library was generated
from these deletion mutants and several unique clones with different sized
deletions were identified. Expression from these truncated promoter constructs
was analyzed in a stable MEL cell expression system as previously described (
38
). Based on the results of this preliminary analysis, a construct which retained
142 bp of 5' flanking sequences ( [alpha]
-142
) was selected for use as a backbone for the generation of all constructs used
in this study (Fig.
1
).
Stable cell lines were generated by transfecting MEL cells with each of the five
plasmid expression constructs shown in Figure
1
. DNA (20 [mu]g) from each construct was co-transfected with 2 [mu]g of a plasmid carrying the neomycin resistance gene (
39
) by electroporation into MEL cells as previously described (
32
). The transfected cells were plated for two weeks in selective media which
contained G418 at an active concentration of 400 [mu]g/ml. Pools of transfected cells were established by expanding the G418-resistant cells that survived after 2 weeks in selective medium.
Single cell clones were isolated from these pools by limiting dilution.
Pools and clones of G418-resistant cells were induced to differentiate by exposure to 1.4% dimethyl
sulfoxide (DMSO) for 3 days as previously described (
32
). The success of each induction was confirmed by a marked increase in the level
of expression of the murine [beta]
maj
globin mRNA as previously described (
32
). DNA was isolated from these cells and used in Southern blot analyses to
quantify the number of integrated copies of the transfected [alpha]-globin genes. Ten micrograms of each DNA were digested with
Hin
dIII and probed with a 1.5 kb
Pst
I fragment that carries the human [alpha]-globin gene. Cytoplasmic RNAs were isolated from induced and
uninduced cells as described by Favaloro
et al.
(
40
) and analyzed for human [alpha]-globin gene expression by S1 nuclease assays as previously
described (
41
). The 3' end-labeled probe consisted of a 391 bp
Nco
I/
Hin
dIII fragment of the [alpha]-globin gene of which 97 nt are protected by [alpha]-globin mRNA (
42
). Since the basal level of expression of the human [alpha]-globin gene in MEL cells was very high, we only used 3-5 [mu]g of cytoplasmic RNA in each S1 assay to ensure that all
hybridizations will take place in DNA probe excess. We also included 5 [mu]g of RNA from human K562 cells as a control in every experiment to be able
to make comparisons across assays. Each S1 assay was repeated at least three
times with consistent findings. Autoradiographs of Southern blots and S1 assays
were analyzed by densitometric scanning to determine the number of integrated
copies and the level of expression of the [alpha]-globin gene respectively.
We also analyzed the effect of CACCC and GATA-1 sequences on the activity of the [alpha]-globin gene in fibroblasts. NIH 3T3 cells were transiently
transfected by calcium phosphate-DNA coprecipitation as previously described (
41
). Twenty micrograms of either the [alpha]
-142
or the CCGG[alpha]
-142
construct were co-transfected with a control construct consisting of a human [gamma]-globin gene linked to the SV40 enhancer (
42
). The levels of expression of the [alpha]- and [gamma]-globin genes were assessed by an S1 nuclease
protection assay as previously described (
42
). The relative activity of the two [alpha]-globin gene constructs in fibroblasts was expressed as the ratio of
the densitometric signals corresponding to [alpha]-globin mRNA over [gamma]-globin mRNA.
Autoradiographs were imaged using a Silverscan flatbed digital scanner (La Cie, Ltd, Beaverton, OR). Image processing was performed on a
Macintosh Quadra 800 (Apple, Cupertino, CA) using Adobe Photoshop (Adobe,
Mountain, CA) and Aldus Persuasion (Aldus, Seattle, WA) software. Figures were
printed using a XL7700 Digital Continuous Tone Printer (Kodak, Rochester, NY).
We first analyzed the activity of different truncations of the [alpha]-globin promoter by expression assays in stably transfected MEL
cells to identify a minimal promoter that retains full constitutive activity. A
previous study had shown that an [alpha]-globin promoter truncated to position -87 retains full activity in a transient expression assay in
COS cells (
17
). The activity of the [alpha]
-142
globin construct was found to be essentially identical to the non-truncated [alpha]-globin promoter in induced and uninduced MEL cells (data not shown). Thus, we selected this minimal [alpha]
-142
construct as a backbone for the generation of the other expression constructs
shown in Figure
1
. By design, these expression constructs carried two copies of the binding sites
for CACCC and GATA-1 since these sites are commonly duplicated in globin regulatory elements
in vivo
.
Figure
2
shows the results of a quantitative S1 assay of [alpha]-globin expression in one pool and six clones of MEL cells
transfected with the [alpha]
-142
plasmid. Both the pool and the individual clones expressed high levels of human
[alpha]-globin mRNA that did not increase further with induction. In
contrast, the expression of the endogenous mouse [beta]
maj
globin genes increased from 20- to 50-fold with induction (data not shown). We analyzed a total of four
pools and 10 clones of MEL cells transfected with the [alpha]
-142
globin construct. None of the pools or clones analyzed showed inducible
expression of the [alpha]-globin gene in MEL cells.
We then investigated the erythroid-responsiveness of an [alpha]-globin gene with two GATA-1 sites inserted in its promoter (GG[alpha]
-142
). Figure
3
shows an autoradiograph of an S1 assay of [alpha]-globin expression in one pool and six clones of MEL cells
transfected with the GG [alpha]
-142
expression construct. Although the pool and all clones expressed the
transfected [alpha]-globin gene at a relatively high basal level, [alpha]-globin mRNA increased with induction in two of the six
clones analyzed (Fig.
3
, lanes 2 and 5). The level of induction was 1.7-fold in clone 2 and 3-fold in clone 5. We analyzed a total of three pools and 11 clones of
MEL cells transfected with the same construct. Inducible expression was seen in
only two of the 11 clones and in none of the three pools analyzed. No
correlation was noted between the number of integrated copies of the [alpha]-globin gene and inducibility in MEL cells.
An [alpha]-globin gene with two CACCC sites inserted in its promoter (CC[alpha]
-142
) was analyzed in a similar manner. Figure
4
shows an autoradiograph of an S1 assay of [alpha]-globin expression in one pool and six clones of MEL cells transfected with CC[alpha]
-142
. The expression of the human [alpha]-globin gene was not inducible in the pool neither in any of the individual clones. We
analyzed a total of two pools and 17 individual clones of MEL cells transfected
with CC[alpha]
-142
. None of the pools or individual clones showed inducible expression of the
human [alpha]-globin gene.
The first experiment described above confirmed the original observations of
Charnay
et al.
that the expression of a transfected human [alpha]-globin gene is not inducible in MEL cells (
18
). When we inserted GATA-1 binding sites in the promoter of the [alpha]-globin gene, inducibility was seen in a small minority (two out of
11) of the MEL clones that were analyzed. The significance of this observation
is not clear. Inducibility resulting from the presence of GATA-1 sites must be very uncommon since [alpha]-globin expression was not inducible in three different pools,
each consisting of 500-1000 individual clones. When we inserted CACCC sites in the promoter of
the [alpha]-globin gene, inducibility was not seen in any of the pools or
clones that were analyzed. The lack of inducibility of the CC[alpha]
-142
and the GG[alpha]
-142
constructs is unlikely to be a result of the high basal levels of [alpha]-globin expression in the uninduced cells since the average level of
expression in the different uninduced cell lines varied by <20%. In contrast, when the CACCC and GATA-1 sites were added together to the [alpha]-globin promoter, the gene became inducible in seven out of
ten clones and four out of six pools. Although these pools were generally of
high complexity consisting of a large number of independent clones, it is not
clear why the [alpha]-globin gene was inducible in some of the pools but not in others.
For this reason, it is always desirable to analyze several pools and an even
larger number of individual clones before making statements about inducibility.
Figure
The proportion of inducible clones from cells transfected with the CCGG[alpha]
-142
globin gene is in the same range that was previously described for the human [beta]-globin gene in MEL cells (
18
,
19
). The magnitude of induction, however, appears to be significantly higher with
the [beta]-globin gene than with the CCGG[alpha]
-144
gene. This is most likely a reflection of the very high basal level of
expression of the [alpha]-globin gene in uninduced MEL cells compared to the low basal level
of expression of the [beta]-globin gene. In fact, Charnay
et al.
estimated the basal level of expression of the [alpha]-globin gene in uninduced MEL cells to be equivalent to the level of
expression of an induced [beta]-globin gene (
18
). The lack of inducibility of both the [alpha]- and [beta]-globin genes in a minority of the transfected cell
lines is probably a result of their integration in regions of the genome that
prevent them from responding to the erythroid environment of the differentiated
MEL cell.
Previous studies by other investigators had shown that the AT rich [beta]-globin promoter requires the presence of both CACCC and GATA-1 sites for inducibility in MEL cells. deBoer
et al.
first demonstrated that a minimal [beta]-globin promoter which includes CACCC, CAAT and TATA boxes is not
inducible in MEL cells (
44
). Addition of sequences that included binding sites for CP1 and GATA-1 resulted in inducibility in MEL cells (
44
). Walters and Martin refined this analysis further by demonstrating that a TATA
box-containing minimal promoter from the rabbit [beta]-globin gene becomes inducible in MEL cells upon the addition of GATA-1 sites linked to either CACCC or NF-E2 (
45
). It should be noted that both these studies demonstrated that the combination
of CACCC and GATA-1 sites results in inducibility in the context of a minimal promoter
originating from a [beta]-globin gene which itself is inducible in MEL cell. In contrast, the
experiments described in this report demonstrate that the same binding sites
can make the constitutively expressed [alpha]-globin gene inducible in MEL cells.
The level of expression of GATA-1 had been shown to increase with erythroid maturation in the developing
mouse
in vivo
(
46
) and with induction of MEL cell differentiation
in vitro
(
47
). However, the experiments described above and those of others (
45
) show that the addition of GATA-1 to non-inducible promoters is not sufficient to make them inducible in MEL
cells. CACCC sequences have previously been shown to bind both the erythroid
factor EKLF (
37
) as well as the ubiquitous factor Sp1 (
48
). Neither the expression of EKLF nor that of the related Sp1 increase when MEL
cells are induced to differentiate (
37
). Our experiments show that the addition of CACCC sequences to the [alpha]-globin gene promoter does not result in inducible expression in MEL
cells. However, when the CACCC and GATA-1 sequences are added together to the [alpha]-globin promoter, inducible expression is seen in about two
thirds of the cell lines. It is interesting to note that with the exception of
the [alpha]-globin gene, all the functional human globin genes (i.e. [epsilon],
G
[gamma],
A
[gamma], [beta] and [xi] have closely situated GATA-1 and CACCC binding sites. This raises the possibility
that physical interaction between GATA-1 and EKLF or GATA-1 and Sp1 may be required for inducibility in MEL cells. If this were indeed the case, then the level of GATA-1 may be the limiting factor in this interaction. When MEL
cells are induced to differentiate and the level of GATA-1 increases, the GATA-1-EKLF or the GATA-1-Sp1 complexes may reach the critical
concentration that is necessary for the activation of promoters which include
both binding sites. This speculation is compatible with previous observations relating to the role of GATA-1 and CACCC sequences in the regulation of the human porphobilinogen
deaminase (PBG-D) gene (
49
). Mutagenesis of either of these sites individually abolished the activity of
the promoter in erythroid cells. More interestingly, when the distance between
these sites was progressively increased, the activity of the promoter was
progressively reduced (
49
) suggesting the possibility of physical interaction between these factors. More
recently, Merika and Orkin demonstrated direct physical interaction and
functional synergy between GATA-1 and EKLF and also between GATA-1 and Sp1 (
50
).
Two other observations made in the course of this study deserve further comment. The first observation relates to the issue of copy-number dependence of globin gene expression in the presence of LCR
elements. Our experiments show that the LCR does not make the expression of a
linked [alpha]-globin gene copy-number dependent in MEL cells. This is in agreement with observations in transgenic mice that the [alpha]LCR (or HS-40) is not sufficient to make the expression of
a linked [alpha]-globin gene copy-number dependent (
51
,
52
). The second is that even though the addition of CACCC and GATA-1 sites can make the [alpha]-globin gene erythroid-responsive in MEL cells, their combined effects are
distinctly different from those of the [alpha]LCR. The addition of the [alpha]LCR to the [alpha]-globin gene resulted in inducible expression in a
position-independent manner (i.e. the gene was inducible in every clone regardless
of the position of integration). This was clearly not the case when CACCC and
GATA-1 were added together to the [alpha]-globin gene since inducible expression was seen in only two
thirds of the clones analyzed. In addition, the magnitude of induction was
significantly higher in the presence of the [alpha]LCR (3-49-fold) than in the presence of CACCC and GATA-1 (1.5-5-fold). Thus, even though LCR elements of
the [alpha]- and [beta]-globin gene clusters contain CACCC and GATA-1 sites (
29
,
53
-
56
), the presence of these sites together is not sufficient to reproduce the full
activity of the LCR. Since globin LCR elements also contain NF-E2 binding sites (
29
,
53
), it would be interesting to investigate whether the combination of GATA-1, CACCC and NF-E2 could account for the full activity of a locus control region.
One of the most important unanswered questions about the regulation of the [alpha]-globin gene relates to the mechanism responsible for its tissue-specific expression. In other words, what are the factors
that restrict the expression of the inherently promiscuous [alpha]-globin gene to erythroid cells
in vivo
? Although the [alpha]LCR has been shown to have powerful erythroid-specific enhancing activity (
32
,
43
,
57
), this does not provide an explanation for the lack of expression from the
constitutively active [alpha]-globin promoter in non-erythroid cells. Charnay
et al.
suggested many years ago that the state of chromatin (as detected by DNase I
hypersensitivity) may be very important for restricting the expression of the [alpha]-globin gene to erythroid cells (
18
). Unlike deletions of the [beta]LCR, deletions of the [alpha]LCR do not result in loss of DNase I hypersensitivity of the [alpha]-globin promoter in erythroid cells (
13
,
58
). This suggests that the mechanism(s) responsible for the DNase I
hypersensitivity of the [alpha]-globin promoter are independent of LCR activity. In addition,
unlike CpG islands of housekeeping genes that are typically present in an open
chromatin configuration in all cell types, the CpG rich [alpha]-globin promoter is present in a closed configuration in non-erythroid cells (
59
). We believe that the identification of the elusive factors which control the
chromatin configuration of the [alpha]-globin promoter may be crucial for understanding the mechanism of
regulated erythroid-specific expression of the [alpha]-globin gene
in vivo.
We would like to thank Dr James Bieker for helpful discussions during the course
of this research. We are grateful for the excellent secretarial assistance of
Migdalia Torrres. This research was supported by a PHS grant to G.F.A. from NIH
(HL42919) and by a GCRC grant (MO1-RR00071) from NIH to the Mount Sinai School of Medicine.

REFERENCES
Return



