Published online 9 March 2006
Article |
The Rhizobium etli
70 (SigA) factor recognizes a lax consensus promoter
Programa de Genómica Evolutiva, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México Apartado Postal 565-A, C.P 62210, Cuernavaca, Morelos, México
*To whom correspondence should be addressed. Tel: +52 777 329 16 90; Fax: +52 777 317 55 81; Email: mramirez{at}ccg.unam.mx
Received December 2, 2005. Revised February 15, 2006. Accepted February 15, 2006.
| ABSTRACT |
|---|
|
|
|---|
A collection of Rhizobium etli promoters was isolated from a genomic DNA library constructed in the promoter-trap vector pBBMCS53, by their ability to drive the expression of a gusA reporter gene. Thirty-seven clones were selected, and their transcriptional start-sites were determined. The upstream sequence of these 37 start-sites, and the sequences of seven previously identified promoters were compared. On the basis of sequence conservation and mutational analysis, a consensus sequence CTTGACN1623TATNNT was obtained. In this consensus sequence, nine on of twelve bases are identical to the canonical Escherichia coli
70 promoter, however the R.etli promoters only contain 6.4 conserved bases on average. We show that the R.etli sigma factor SigA recognizes all R.etli promoters studied in this work, and that E.coli RpoD is incapable of recognizing them. The comparison of the predicted structure of SigA with the known structure of RpoD indicated that regions 2.4 and 4.2, responsible for promoter recognition, are different only by a single amino acid, whereas the region 1 of SigA contains 72 extra residues, suggesting that the differences contained in this region could be related to the lax promoter recognition of SigA. | INTRODUCTION |
|---|
|
|
|---|
The Escherichia coli RNA polymerase holoenzyme consists of a core enzyme (
2ßß'), and one sigma factor (
) subunit, which directs the core enzyme to specific sites on DNA to initiate transcription. In general, bacteria contain a main or housekeeping sigma factor, also known as sigma-70 (
70) which controls the expression of most of the genes, in association with the RNApol core (14). Sequence alignments of several
70 factors from different bacteria reveal four conserved regions (1 to 4), that have been further divided into subregions, involved in different aspects of transcription initiation, including promoter recognition, promoter melting, initiaton of RNA syntesis, abortive initiation and promoter escape (5).
The
70 subunit of E.coli and Bacillus subtilis recognizes two types of promoter sequences: the first one, or canonical promoter, consisting of two hexamer sequences [TTGACA] and [TATAAT], centered respectively in the positions 10 and 35 relative to the transcription start-site (6,7). A helixturnhelix motif located in the C-terminus of
70 (subregion 4.2) recognizes the 35 box of the promoter, whereas a more centrally located
-helix (subregion 2.4) recognizes the 10 box (8). The second one, named extended10 promoter consists of an element [TATAAT] plus 2 nt [TG], located in the 15 and 14 positions. In this promoter type a 35 region or equivalent sequence does not exist. The N-terminal part of the subregion 3.0 (previously named 2.5) from
70 is involved in the recognition of the conserved [TG] motif (911).
Rhizobium etli is a soil
-proteobacterium with the ability to colonize the roots of bean-plants to form nitrogen fixing nodules. The whole genomic sequence of R.etli CFN42 was recently concluded [(12); GeneBank accession no CP000133
[GenBank]
], and consists of a circular chromosome and six large plasmids, with an average G + C content of 61.5%. R.etli contains a large number of sigma factors (one housekeeping
70 gene, two copies of rpoH, two copies of rpoN and 18 genes of the extracitoplasmic factor group), a feature shared with other nitrogen fixing organisms, like Bradyrhizobium japonicum, Mesorhizobium loti and Sinorhizobium meliloti (4). The physiological roles and the mechanisms of promoter recognition by these sigma factors remain unknown. Moreover, in these organisms only a few well-characterized promoters are available.
The R.etli housekeeping
70 gene (sigA) encodes a protein of 685 amino acid residues with a molecular weight of 77.18 kDa. The primary structure of SigA is very similar to RpoD (the
70 factor of E.coli), especially in regions 2, 3 and 4, however SigA contains an extended region 1 (13).
S.meliloti, Caulobacter crescentus and Rhodobacter sphaeroides also have a larger SigA region 1, and most of their
70 promoters that have been characterized are not transcribed by the E.coli RNApol in vivo or in vitro, but in contrast, the RNApol of these bacteria can initiate transcription of typical E.coli
70 promoters (1316). This observation suggests some differences between the transcriptional machineries of E.coli and the
-proteobacteria, perhaps at the level of promoter recognition by the
70 factor.
The knowledge of the promoter structures and their regulation in a particular organism is the first step towards the molecular understanding of the regulatory networks, an essential prerequisite to develop tools for genetic engineering.
To analyze the molecular basis of R.etli gene expression in this study, we identified, characterized and sequenced active promoters in R.etli under exponential growth conditions. A consensus promoter sequence was deduced, and possesses some differences to the promoter consensus recognized by
70 in E.coli and B.subtilis. We showed that the double helix stability and the stacking energy of the R.etli and E.coli promoter regions possess a minimal energy value around 10 box, suggesting that despite the differences in their promoter structures, the promoters of both bacteria preserve thermodynamic properties, which favor DNA denaturation and help the transcription initiation. We demonstrated that SigA specifically recognizes the R.etli promoters studied in this work, but in contrast, the E.coli RNApol holoenzyme is incapable of recognizing them. Finally, we propose that R.etli contains a less strict
70 that recognizes a lax consensus promoter structure.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Bacterial strains and growth conditions
The bacterial strains and plasmids used in this work are listed in Table 1. E.coli strains were grown at 37°C in LuriaBertani (LB) medium. R.etli strains were grown at 30°C in PY medium (17) or minimal medium containing 1.2 mM K2HPO4, 0.8 mM MgSO4, 10 mM succinic acid, 10 mM NH4Cl, 1.5 mM CaCl2 and 0.0005% FeCl3, with the pH adjusted to 6.8 (18). Microaerobic conditions during growth were set up by putting the cultures in 150 ml bottles that were flushed with several volumes of a 1% oxygen-99% argon mixture and closed with an airtight stopper. Antibiotics were added at the following final concentrations (in µgml1): gentamicin 30; chloramphenicol 25; ampicillin 100; nalidixic acid 20. When it was used, X-Gluc (5-Br-4-Cl-3-indolyl-ß-D-glucuronic acid, Research Organics) was added at a final concentration of 20 µgml1.
|
DNA and RNA isolation and manipulation
Genomic DNA was isolated using components and instructions from GenomicPrep cells and tissue DNA isolation kit (Amersham). Plasmid DNA was isolated as described by Sambrook et al. (19). DNA was restricted and ligated under the conditions specified by the enzyme manufacturer (Invitrogen). Taq DNA polymerase (Altaenzymes) was used for PCR. The PCR products were cloned using a pMOSblue blunt-ended kit (Amersham). RNA was isolated using components and instructions from the High pure RNA isolation kit (Roche).
Construction of a R.etli promoter library
To isolate DNA fragments containing promoters, 10 mg chromosomal DNA of CFN42 strain was disrupted by nebulization during 2 min to 10 psi. Fragments ranging from 0.5 to 1 kb were purified from an agarose gel, blunted with Klenow and T4 pol enzymes, and ligated into the alkaline phosphatase treated SmaI sites of the promoter-trap vector pBBMCS53, which contains a promoterless gusA gene (20). The ligation mixture was electropored into DH10B using the instructions from ElectroMAX DH10B cell kit (GibcoBRL). Single colonies were selected in LB plates containing gentamicin.
Bacterial matings
pBBMCS53 derivatives were introduced by conjugation into R.etli CFN42 using pRK2013 as a helper plasmid. Strains were grown in PY liquid medium to a stationary phase, mixed in a proportion of donorhelperreceptor of 1:1:2 on PY plates and incubated at 30°C overnight. Cells were resuspended in fresh PY medium, and serial dilutions were plated in PY nalidixic acid 20, gentamicin 30 and X-Gluc. Clones containing fragments with a promoter were selected by blue/white screening of transconjugants.
ß-Glucuronidase activity measurements
Overnight cultures of the strains carrying the desired constructs were grown in the presence of the appropriate selection. Cultures were diluted in fresh PY medium to an OD540 of 0.01 and grown to a final OD540 of 0.4. The culture (1 ml) was centrifuged and resuspended in a salt wash solution supplemented with chloramphenicol (100 mgml1). Quantitative ß-glucuronidase assays were performed with p-nitrophenyl glucuronide substrate as described previously (21). Data were normalized to total-cell protein concentration by the Lowry method (19). The results presented in the figures are the mean of three independent experiments.
Transcription start-sites determination
Transcription start-sites were mapped by means of 5' RACE kit version 2.0 (Invitrogen). Briefly, DNA single chains (scDNA) were synthesized from 5 to 10 mg of total RNA using a gene-specific primer (GUS-LW5, CGATCCAGACTGAATGCCCAC) complementary to the region located in the position 96 to117 from gusA, and using a derivative of Moloney Murine Leukemia Virus Reverse Transcriptase (M-MLV RT). After first strand cDNA synthesis, the original mRNA template was removed by treatment with a RNase Mix (mixture of RNase H and RNase T1) and the scDNA was purified using a GlassMax DNA column. A homopolymeric tail was then added to the 3' end of the scDNA using Terminal deoxynucleotidil transferase (TdT) and dCTP. In the next step, a PCR amplification was accomplished using Taq DNA polymerase, a second antisense primer (GUS-LW4, GTAACATAAGGGACTGACCTGC) that was complementary to gusA in the position 28 to 49, and a complementary homopolymer tail primer (AAP, GGCCACGCGTCGACTAGTACGGGIIGGGIIGGGIIG). When secondary bands were obtained, an additional PCR was done using an AAP primer and a third antisense primer (GUS-LW2, GCTTGGCGTAATCATGGTCAT), that was complementary to the DNA region located in the position 1 to 21 from gusA.
DNA sequencing
DNA sequencing reactions were performed using the GUS-LW primers and Big-Dye terminator kit version 3.1 in an automatic 310 DNA sequencer (Applied Biosystems).
Construction of promoter mutations
Mutant versions of the phemC and lac promoters were constructed by PCR using oligonucleotides carrying altered versions of the wild-type 35, 10 and both boxes simultaneously. Each PCR product was cloned in to pBBMCS53 vector and introduced into R.etli CFN42 by triparental conjugation.
Overexpression and purification of the R.etli SigA protein
A DNA fragment corresponding to the full-length sigA gene of R.etli was PCR amplified from total DNA, and cloned into pET-28a(+) (Novagen) to express sigA in BL21 (DE3)plysS strain (22). The induction in exponential growth phase (OD540 = 0.4) was obtained by the addition of isopropyl-ß-D-thiogalactopyranoside (IPTG) up to the final concentration 1 mM. Cells were harvested after 4 h of cultivation and stored at 70°C. A total of 10 ml of culture usually yielded 2.53 mg of protein. Histidine tagged sigma protein was purified using HisTrap Kit (Amersham Biosciences). The cell pellet was suspended in 0.5 ml of binding buffer, containing 8 M urea and 0.25 mM phenylmethlysulfonyl fluoride (PMSF) [1x phosphate buffer (pH 7.4), 20 mM imidazole], and incubated 30 min at room temperature with gentle mixing. After sedimentation of cell debris, lyzate was loaded on to Ni-column and equilibrated with a binding buffer. The desired protein was eluted by phosphate buffer, containing 400 mM imidazol, collected in 0.5 ml fractions, and analyzed in 10% SDSPAGE. The applied procedure provides protein of 95% purity. To avoid the influence of six-aminoacide histidine tagg, HisTrap purified protein was treated by thrombin (Thrombin CleanCleave Kit, Sigma) after buffer exchange by dialysis. After the thrombin treatment protein was dialyzed against TGED buffer [10 mM TrisHCl (pH 7.6); 0.1 mM EDTA; 0.1 mM DTT; 0.1 M NaCl; 5% (v/v) glycerol] and storage buffer [10 mM TrisHCl (pH 7.6); 0.1 mM EDTA; 0.1 mM DTT; 10 mM MgCl2; 0.2 M KCl; 50% (v/v) glycerol]. The final concentration of protein was evaluated by method of Bradford (Bio-Rad protein assay kit).
DNA binding shift assay
Linear DNA templates were randomly labeled with [
-P32]dCTP in a standard PCR, purified from 3.5% PAGE and used for DNAprotein complex formation. RNA polymerase was assembled from core and
70 protein as it was described by Tang et al. (23). Binary complexes were allowed to preform for 20 min at 30°C, challenged by heparin (200 mg/ml) and immediately loaded on to 3.5% native PAGE, prewarmed up to the same temperature. After electrophoresis, gels were dried and radioautographed.
Software tools
Multiple DNA alignments were done using the programs CLUSTALW version 1.8.1 (24) and MultiAlin version 5.4.1 (25). Alignments were visually inspected and adapted with respect to the known transcriptional start-sites. Consensus matrices were generated and selected using WCONSENSUS version v5c (26). Briefly, to identify the 10 box, the first 20 bp upstream from the transcription start-site (+1) were used. To identify the 35 box, the 20 bp upstream to the 20 position were used as an input for WCONSENSUS. Algorithms available from server http://www.icgeb.trieste.it/dna (27) were used to evaluate double helix stability (28) and stacking energy (29). Calculations for each parameter were carried out with a window size of 10 and 20 nt, providing similar results. For each promoter position, the average value was calculated for the whole set of analyzed sequences.
| RESULTS |
|---|
|
|
|---|
Isolation of R.etli promoter sequences
To determine the general structure of the R.etli promoters, a random genomic library with an average insert size of 0.5 kb was constructed in the promoter-test vector pBBMCS53 and introduced by conjugation into R.etli CFN42. Direct screening on PY plates containing X-Gluc allowed the selection of transconjugants with an active reporter gene (gusA), suggesting that the constructs contained R.etli promoter regions. The ß-glucuronidase activity of 100 transconjugans was determined in PY liquid cultures in exponential and stationary phases of growth. Sixty-eight of these clones showed ß-glucuronidase activity under both conditions, while the remaining 32 transconjugants expressed the reporter gene only in PY plates, and for this reason were not used in further experiments. The ability to express the reporter gene of the 68 strains described above was tested in different conditions: low oxygen (1%), minimal medium (MM) and at different temperatures (30, 42 or 10°C). All but three expressed the reporter gene in all conditions, but at different levels, suggesting that the transconjugants carry constructs with constitutive promoters. The DNA sequences of 36 inserts of these constructs were determined. These promoter sequences, plus seven R.etli promoters experimentally identified in other studies were selected for subsequent analyses (3035).
Thirty-three promoters were mapped within intergenic regions; eight within the coding regions of the immediately preceeding gene, and three in DNA sections that contained overlapped coding regions.
The genetic compartments of the 44 promoters of our collection were next: thirty-nine were located on the chromosome, two on plasmid p42d and one in each of these plasmids: p42b, p42e and p42f.
Identification of transcriptional start-sites
The transcription start-sites are good reference points to identify the upstream conserved sequence characteristics of the promoters. Thus, the 5' ends of the mRNAs synthesized from each of the selected 36 promoter regions selected, were successfully identified by 5' RACE using the total RNA isolated from PY cultures in exponential growth conditions of the transconjugants. In general, a single RTPCR product was obtained, but in one clone, two conspicuous amplification products were observed. In this case the bands were independently isolated, cloned and sequenced, showing that these products are transcripts associated to the same gene.
Thirty-eight of the transcription start-sites (86%) fell within the first 250 bp upstream from coding regions, and four transcription start-sites corresponded to the nucleotide A of the start codon ATG from their respective ORF (Table 2).
|
Determination of a promoter consensus sequence
The promoter consensus sequence was derived by aligning 50 bp upstream from the transcription start-site of each promoter region. The identification of 10 and 35 boxes was corroborated with the WCONSENSUS program as described in Materials and Methods. As shown in Figure 1a, over-represented nucleotides clearly appeared around the 35 region and, in a lesser degree, in the 10 region. Then, the putative 10 consensus box [TATNNT] showed four identical positions to the 10 box of the E.coli consensus promoter [TATAAT] recognized by
70 factor. The 3' end of the R.etli 10 consensus boxes was positioned from 3 to 10 bp upstream from the transcriptional start-site (Figure 1c). Six promoter regions (14%) showed the dinucleotide [TG] in positions 15 and 14, which resembled the structure of the E.coli 10 extended promoter. The 35 box consensus sequence [CTTGAC], is identical in five positions to the equivalent box of the E.coli consensus promoter [TTGACA] recognized by
70. The spacing between the 35 and 10 regions ranges from 16 to 23 bp (Figure 1b). The promoter sequences contain 6.4 conserved bases on average, of the 12 conforming 10 and 35.
|
Suboptimal promoters were identified in 11 of the 44 promoter regions as shown in Figure 1a, but they were not taken in account to generate the consensus promoter sequence for the following reasons: none of them contained both hexameric boxes recognized by the best statistic parameters of WCONSENSUS. Nevertheless, in six of these promoters the 35 box overlapped with one proposed by us. In one case the 10 box overlapped the region proposed here. Moreover, some of the suboptimal promoters contained the spacer region two long (up to 27 bp) or to short (11 or 12 bp). In two cases the transcription start-sites were located far of the 10 boxes (up to 20 bp).
The results presented here showed that the R.etli promoter consensus possesses some similarities with the E.coli promoter consensus recognized by the
70 factor, but the variability in the nucleotide frequency in each position of the consensus was greater than that observed in E.coli promoters (7).
Thermodynamic properties of the R.etli promoter regions
It was recently demonstrated in several bacteria genomes, that the intergenic regions containing promoters are generally less stable and less flexible than coding regions (36). To determine if these thermodynamic properties are also present in the promoter regions of R.etli, the stacking energy and the double helix stability were evaluated in our collection of 44 promoters. For this analysis the sequence of 200 bp upstream and 100 bp downstream from transcription start-site of all members of the promoter collection were taken in account. As an internal control, a set similar in size and length was constructed with the immediately downstream coding region of each member of the promoter collection. The same properties were predicted and contrasted with two E.coli equivalent sets: one containing 44 randomly chosen promoter sequences experimentally characterized and putatively recognized by
70, and the other of 44 coding region sequences that corresponded to the genes controlled by the selected promoters. As shown in Figure 2a, the distribution profiles of double helix stability for coding and no coding regions of R.etli were similar, however, a significative local minimum around the 10 element was detected in our collection of promoter regions. The average stability of the E.coli coding regions was higher than the average stability of their promoter regions, and like R.etli, a prominent feature of the promoter regions was a minimum stability around the 10 box (Figure 2b).
|
The stacking energy profiles of R.etli and E.coli promoter regions were variable, but with a tendency to low negative values (low stability), nevertheless local minimum values were located around the 10 box. In contrast, the stacking energy profiles of R.etli and E.coli coding regions were similar: both showed more negative values that corresponded to great stability (Figure 2a and b). These results suggest that despite the variability in the nucleotide composition of R.etli promoters, these regions possess thermodynamic and structural properties similar to the E.coli promoter regions.
Promoter sequence mutagenesis
To demonstrate that the promoter consensus proposed here is correct, the DNA sequence of three R.etli mutant promoters (from repA, the antisense RNA gene present in the p42d repABC operon and recA) with the reduced transcriptional activity previously characterized was located in our alignment. As shown in Figure 3, these mutations fall in the regions centred in 10 and 35 of the consensus, suggesting that these elements were correctly identified.
|
Additionally the promoter consensus was verified in the following way: the constitutive promoter (phemC) of hemC, a gene encoding a porphobilinogen deaminase protein, was selected for a mutagenic analysis. This promoter possessed five conserved positions from consensus and showed moderate activity. Three mutant derivatives were constructed that deviated the promoter sequence from the consensus. First, the putative 35 element was altered, changing sequence [ATTGCG] to [AAGACG] to generate the construct pGUSphemC-35. Second, the putative 10 element was changed from [TACGAG] to [GCGGAG] to produce construct pGUSphemC-10. Third, the putative wild-type 35 and 10 elements were changed at the same time to the sequences indicated before, to generate plasmid pGUSphemC-1035. These constructs were introduced into R.etli CFN42, and the ß-glucuronidase activities of the transconjugants were determined. As shown in Figure 3, all mutant constructs showed a low level of ß-glucuronidase activity, indicating that the 35 and 10 elements of the R.etli promoter consensus were correctly identified.
Activity of the R.etli promoters in E.coli
As described in the previous section, the R.etli
70 promoter consensus sequence is more variable than that described for E.coli. This observation suggests that the E.coli
70 is unable to recognize the R.etli promoters or recognizes them less efficiently. To prove this possibility, the 36 transcriptional fusions of the collection were introduced into the BW21038 (gusA) strain, and their ß-glucuronidase activities were evaluated in plates supplemented with X-Gluc. In this experiment only three promoters were functional, but showed low ß-glucuronidase activity. These results suggest that the 33 transcriptional fusions are poorly or unrecognized by the E.coli sigma factors. It is possible than the E.coli
70 is incapable of binding the R.etli promoters because it is more stringent in its recognition pattern, and imposes a strict functional barrier. If this hypothesis is true, then the R.etli
70 (SigA) activates the transcriptional fusions described above in E.coli. To demonstrate this, all strains containing the gusA fusions were transformed with a plasmid expressing the R.etli
70 gene under the control of the lac promoter and the ß-glucuronidase activity of the transformants were evaluated in LB plates containing X-Gluc. Representative results are shown in Figure 4. The expression of sigA allowed the gusA fusions to express their reporter gene in the 36 R.etli promoter regions studied here, however, the strains containing sigA constructs were unstable, and a mixure of blue and white colonies was observed in the plates (Figure 4). The results suggest that R.etli
70 is responsible for the transcriptional activity of these promoters, and that the structural variability of R.etli promoters cannot be recognized by E.coli
70.
|
The E.coli
70 is incapable of binding in vitro R.etli promotersTo verify if the R.etli promoters are recognized by the E.coli
70 factor, electrophoretic mobility shift assay (EMSA) experiments were done using E.coli RNApol containing E.coli
70 and a PCR product containing the promoter of hemC gene (phemC), plus the PCR products of four promoter regions selected at random: ureA, cycH, yhch00197 and yhch00468. As a negative control, a PCR product of a mutant derivative of promoter phemC containing the changes in the 10 and 35 elements described previously was used, and as a positive control, a PCR product containing the lac promoter was used (lac promoter is functional in R.etli). Representative results are shown in Figure 5a. In all experiments the E.coli RNApol holoenzyme was unable to bind the R.etli promoters tested, indicating that E.coli
70 cannot recognize R.etli promoters.
|
SigA binds in vitro R.etli promoters
To determine if SigA is capable of recognizing the R.etli promoters, binding shift experiments were done using a chimeric RNApol holoenzyme composed of E.coli RNApol core and R.etli
70 factor assembled in vitro. As a target DNA, the PCR products containing the promoters mentioned above (phemC and the promoters of ureA, cycH, yhch00197 and yhch00468 genes) were used in these experiments. Representative results are shown in Figure 5. The chimeric RNApol holoenzyme bound the five PCR products, but at low efficiency (Figure 5b). In contrast, no DNA binding shifts were observed using a PCR control with the mutant promoter phemC (Figure 5c). These results showed that the chimeric RNApol holoenzyme specifically recognizes the R.etli promoters.
Comparative analysis of the
70 consensus sequences from different bacteria
The R.etli
70 promoter consensus was compared with those of the: (i)
-proteobacteria C.crescentus (G + C% of 67) (37) and Rhodobacter capsulatus (G + C% of 67) (38); (ii) the
-proteobacteria E.coli (G + C% of 52) (6); and (iii) B.subtilis (G + C% of 43) (39). In the promoter consensus sequences of C.crescentus, R.capsulatus and R.etli, the 35 boxes are more conserved than the 10 boxes (Figure 6). Furthermore, the inspection of the all sequences showed that each bacterial species has a consensus sequence, with a different degree of variability. The R.etli and R.capsulatus promoters contain 6.4 and 5.0 conserved bases on average, respectively, less conserved in comparison with the promoter consensus of C.crescentus, that contains 8.0, E.coli 7.9 and B.subtilis 9.1 conserved nucleotides (7,39,40). Finally, the number of conserved nucleotides, especially in the 10 box, tends to diminish when genomic G + C content is above 60%. It is worth noting that the E.coli and B.subtilis 10 and 35 boxes are A + T rich and conserve 10 of 12 positions.
|
| DISCUSSION |
|---|
|
|
|---|
R.etli is a bacterium that inhabits diverse environments and has genome rich in genes related to transcriptional regulation, suggesting that it contains a complex genetic regulation system. To characterize the molecular basis of the transcription, and particulary, what kind of sequences act as promoters in R.etli, we identified and characterized a promoter consensus sequence recognized by the R.etli
70 factor (SigA) in this paper.
The R.etli
70 promoter consensus sequence [CTTGAC] N1623 [TATNNT] was deduced from 44 promoter regions. Using site-directed mutagenesis of one representative R.etli promoter isolated in this work, and a collection of mutants of three promoter regions described previously, we found that all changes fell within the 10 or 35 boxes identified in this work, suggesting that the consensus sequence is correct. Moreover, two different lines of evidence indicate that this promoter consensus sequence is recognized by the R.etli
70. The first one is that gusA fusions containing R.etli promoters are not functional in E.coli, but all regain their activity when complemented with SigA. Second, SigA is able to bind to the R.etli promoters but not to their mutant derivatives that partially eliminate the 35 or 10 boxes.
The location of the 44 promoter regions in the R.etli genome showed that 33 are located within intergenic regions, and that 8 (18%), fall within coding regions. This situation has also been observed in other bacteria, i.e. in E.coli 18% of the promoters are within genes (41), and around of 20% in Prochlorococcus marinus (42).
Of the promoters 86% analyzed here were located less than 250 bp from the translation start codons of their genes. This property has been widely studied in E.coli, where it is known that 90% of the promoters are located less than 250 bp upstream from the gene that they regulate (41).
Although the R.etli, B.subtilis and E.coli promoter consensus display a common structure, the R.etli consensus sequence contains only 6.4 conserved bases on average, revealing a more variable composition than in the E.coli and B.subtilis consensus, where 7.9 and 9.1 nt are conserved on average, respectively (7,39,40). Additionally, the R.etli promoter consensus shows that the sequence length between the 10 and 35 boxes varies from 16 to 23 nt, nevertheless, 25 of the promoters (56.81%) present a distance of 17 ± 1 nt. This variation range is similar to that found in E.coli (15 to 21 nt) and in B.subtilis (14 to 20 nt). The percentage of promoters with 17 ± 1 nt in length between the hexameric boxes, is quite low compared to E.coli (92%) and B.subtilis (84.5%) (6,39). The distance between +1 nt and 10 box 3' end fluctuates from 3 to 10 bases, as found in E.coli and B.subtilis (6,39). Six promoters (14%) were found with a putative 10 extended sequence, and their frequency is lower than those reported for this kind of promoter in E.coli (20%) and B.subtilis (45%) (39,43,44).
A computer analysis of DNA double helix stability and stacking energy showed that the promoter regions presented a minimum value for both parameters around 10 box, as found in E.coli promoters. Having these results in mind, it is reasonable to conclude that in spite of their sequence heterogeneity, the promoter regions preserve thermodynamic properties which have been selected to favor DNA denaturation and help to start the transcription. Similar conclusions have been drawn for other bacterial promoter regions (36,45).
The R.etli promoters tested are not recognized by E.coli
70, however, the E.coli lac promoter is recognized by R.etli
70, suggesting that R.etli
70 is more promiscuous than E.coli
70.
The deduced primary structure of SigA is 47% similar to the E coli
70, especially in regions 2.4 and 4.2, which are responsible for promoter recognition. The most notable difference between SigA and RpoD is the larger size of region 1 of SigA, which contains 72 extra amino acids. This region could be related to the lax promoter recognition of SigA. Thus, E. coli
70 being the strictest factor for promoter recognition, it allows less structural variation, which is reflected in a robust consensus sequence. The opposite case is observed in
-proteobacteria, where the promiscuous
70 allows a larger variation in its promoter structure.
A promiscuous
70 factor could be an adaptive advantage for bacteria like the
-proteobacteria that inhabit environments with high biological and enviromental diversity, because they can take the benefits of the genetic information acquired by horizontal transfer in a fast and efficient way, avoiding a barrier that could limit the acquisition of new functions required for the adaptation to new environmental conditions.
Our comparison of the 






consensus of several bacteria suggests that differences in the genomic G + C content are related to promoter structure: the bacteria with high G + C content tend to be more variable, and to have 10 boxes different in sequence to those from bacteria with low G + C content, and vice versa. Previous evidence in other organisms supports this conclusion (46).
| ACKNOWLEDGEMENTS |
|---|
The authors wish to thank Marco Zúñiga, Rosa Elena Gómez Barreto, Ismael Hernández, Patricia Bustos, Rosa Isela Santamaría and José Espíritu for their skillful technical support. The authors also thank Paul Gaytán and Eugenio López for the synthesis of oligonucleotides. This work was funded by CONACyT grants N-028, N-U46333-Q for emergent areas, and the Universidad Nacional Autónoma de México. Funding to pay the Open Access publication charges for this article was provided by Universidad Nacional Autónoma de México.
Conflict of interest statement. None declared.
| Footnotes |
|---|
Correspondence may also be addressed to Guillermo Dávila. Tel: +52 777 329 16 90; Fax: 52 777 317 55 81; Email: davila{at}ccg.unam.mx
| REFERENCES |
|---|
|
|
|---|
- Lonetto, M., Gribskov, M., Gross, C.A. (1992) The
70 family: sequence conservation and evolutionary relationships J. Bacteriol, . 174, 38433849[Free Full Text] . - Wosten, M.M.S.M. (1998) Eubacterial sigma-factors FEMS Microbiol. Rev, . 22, 127150[CrossRef][ISI][Medline] .
- Maeda, H., Fujita, N., Ishihama, A. (2000) Competition among seven Escherichia coli sigma subunits: relative binding affinities to the core RNA polymerase Nucleic Acids Res, . 28, 34973503
[Abstract/Free Full Text] . - Mittenhuber, G. (2002) An inventory of genes encoding RNA polymerase sigma factors in 31 completely sequenced eubacterial genomes J. Mol. Microbiol. Biotechnol, . 4, 7791[ISI][Medline] .
- Paget, M.S. and Helmann, J.D. (2003) The sigma70 family of sigma factors Genome Biol, . 4, 203.1203.6 .
- Harley, C.B. and Reynolds, R.P. (1987) Analysis of E.coli promoter sequences Nucleic Acids Res, . 15, 23432361
[Abstract/Free Full Text] . - Lisser, S. and Margalit, H. (1993) Compilation of E.coli mRNA promoter sequences Nucleic Acids Res, . 21, 15071516
[Abstract/Free Full Text] . - Murakami, K.S., Masuda, S., Darst, S.A. (2002) Structural basis of transcription initiation: RNA polymerase holoenzyme at 4 A resolution Science, 296, 12801284
[Abstract/Free Full Text] . - Kumar, A., Malloch, R.A., Fujita, N., Smillie, D.A., Ishihama, A., Hayward, R.S. (1993) The minus 35-recognition region of Escherichia coli sigma 70 is inessential for initiation of transcription at an Extended minus 10 promoter J. Mol. Biol, . 232, 406418[CrossRef][ISI][Medline] .
- Barne, K.A., Bown, J., Busby, S.J., Minchin, S.D. (1997) Region 2.5 of the Escherichia coli RNA polymerase sigma70 subunit is responsible for the recognition of the extended-10 motif at promoters EMBO J, . 16, 40344040[CrossRef][ISI][Medline] .
- Campbell, E.A., Muzzin, O., Chlenov, M., Sun, J.L., Olson, C.A., Weinman, O., Trester-Zedlitz, M.L., Darst, S.A. (2002) Structure of the bacterial RNA polymerase promoter especificity sigma subunit Mol. Cell, . 9, 527539[CrossRef][ISI][Medline] .
- González, S.A., et al. (2005) The partitioned Rhizobium etli genome: Genetic and metabolic redundancy in seven interacting replicons Proc. Natl Acad. Sci. USA, 103, 38343839 .
- Luka, S., Patriarca, E.J., Riccio, A., Iaccarino, M., Defez, R. (1996) Cloning of the rpoD analog from Rhizobium etli: sigA of R.etli is growth phase regulated J. Bacteriol, . 178, 71387143
[Abstract/Free Full Text] . - Peck, M.C., Gaal, T., Fisher, R.F., Gourse, R.L., Long, S.R. (2002) The RNA polymerase alpha subunit from Shinorhizobium meliloti can assemble with RNA polymerase subunits from Escherichia coli and function in basal and activated transcription both in vivo and in vitro J. Bacteriol, . 184, 38083814
[Abstract/Free Full Text] . - Cullen, P.J., Kaufman, C.K., Bowman, W.C., Kranz, R.G. (1997) Characterization of the Rhodobacter capsulatus housekeeping RNA polymerase. In vitro transcripcion of photosynthesis and other genes J. Biol. Chem, . 272, 2726627273
[Abstract/Free Full Text] . - Wu, J., Ohta, N., Benson, A.K., Ninfa, A.J., Newton, A. (1997) Purification, characterization, and reconstitution of DNA-dependent RNA polymerases from Caulobacter crescentus J. Biol. Chem, . 272, 2155821564
[Abstract/Free Full Text] . - Noel, K.D., Sánchez, A., Fernández, L., Leemans, J., Cevallos, M.A. (1984) Rhizobium phaseoli symbiotic mutants with transposon Tn5 insertions J. Bacteriol, . 158, 148155
[Abstract/Free Full Text] . - Bravo, A. and Mora, J. (1988) Ammonium assimilation in Rhizobium phaseoli by the glutamine synthetase-glutamate synthase pathway J. Bacteriol, . 170, 980984
[Abstract/Free Full Text] . - Sambrook, J., Fritsch, E.F., Maniatis, T. Molecular Cloning. A Laboratory Manual, (1989) 2nd edn Cold Spring Harbor, NY Cold Spring Harbor Laboratory Press .
- Corvera, A., Promé, D., Promé, J.C., Martínez-Romero, E., Romero, D. (1999) The nolL gene from Rhizobium etli determines nodulation efficiency by mediating the acetylation of the fucosyl residue in the nodulation factor Mol. PlantMicrobe Interact, . 12, 236246[Medline] .
- Wilson, K.J., Huges, S.G., Jefferson, R.A. (1992) The Escherichia coli gus operon, induction and expression of the gus operon in E.coli and the occurrence and use of GUS in other bacteria In Gallagher, S.R. (Ed.). Gus Protocols, Using the Gus Gene as a Reporter of Gene Expression, San Diego, CA Academic Press Vol. 1, pp. 723 .
- Studier, F.W., Rosenberg, A.H., Dunn, J.J., Dubendorff, J.W. (1990) Use of T7 RNA polymerase to direct expression of cloned genes Meth. Enzymol, . 185, 6089[Medline] .
- Tang, H., Severinov, K., Goldfarb, A., Ebright, H. (1995) Rapid RNA polymerase genetics: one-day, no column preparation of reconstituted recombinant Escherichia coli RNA polymerase Proc. Natl Acad. Sci. USA, 92, 49024906
[Abstract/Free Full Text] . - Higgins, D., Thompson, J., Gibson,Thompson, J.D., Higgins, D.G., Gibson, T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice Nucleic Acids Res, . 22, 46734680
[Abstract/Free Full Text] . - Corpet, F. (1988) Multiple sequence alignment with hierarchical clustering Nucleic Acids Res, . 16, 1088110890
[Abstract/Free Full Text] . - Hertz, G.Z. and Stormo, G.D. (1999) Identifying DNA and protein patterns with statistically significant alignments of multiple sequences Bioinformatics, 15, 563577
[Abstract/Free Full Text] . - Vlahovicek, K., Kaján, L., Pongor, S. (2003) DNA analysis servers: plot.it, bend.it, model.it and IS Nucleic Acids Res, . 31, 36863687
[Abstract/Free Full Text] . - Sugimoto, N., Nakano, S., Yoneyama, M., Honda, K. (1996) Improved thermodynamic parameters and helix initiation factors to predict stability of DNA duplexes Nucleic Acids Res, . 24, 45014505
[Abstract/Free Full Text] . - Ornstein, R.L., Rein, R., Breen, D.L., MacElroy, R.D. (1978) An optimized potential function for the calculation of nucleic acid interaction energies I. Base stacking Biopolymers, 17, 23412360[CrossRef][ISI] .
- Venkova-Canova, T., Soberón, N.E., Ramírez-Romero, M.A., Cevallos, Miguel A. (2004) Two discrete elements are required for the replication of repABC plasmid: an antisense RNA and a stemloop structure Mol. Microbiol, . 54, 14311444[CrossRef][ISI][Medline] .
- Ramírez-Romero, M.A., Téllez-Sosa, J., Barrios, H., Pérez-Oseguera, A., Rosas, V., Cevallos, M.A. (2001) RepA negatively autoregulates the transcription of the repABC operon of the Rhizobium etli symbiotic plasmid basic replicon Mol. Microbiol, . 42, 195204[CrossRef][ISI][Medline] .
- Miranda-Rios, J., Morera, C., Taboada, H., Dávalos, A., Encarnación, S., Mora, J., Soberón, M. (1997) Expressión of thiamin biosynthetic genes (thiCOGE) and production of symbiotic terminal oxidase cbb3 in Rhizobium etli J. Bacteriol, . 179, 68876893
[Abstract/Free Full Text] . - Tapias, A., Fernández de Henestrosa, A.R., Barbé, J. (1997) Characterization of the promoter of the Rhizobium etli recA gene J. Bacteriol, . 179, 15731579
[Abstract/Free Full Text] . - Taté, R., Riccio, A., Iaccarino, M., Patriarca, E.J. (1997) Cloning and transcriptional análisis of the lipA (lipoic acid synthetase) gene from Rhizobium etli FEMS Microbiol. Lett, . 149, 165172[Medline] .
- Patriarca, E.J., Riccio, A., Taté, R., Colonna-Romano, S., Iaccarino, M., Defez, R. (1993) The ntrBC genes of Rhizobium leguminosarum are part of a complex operon subject to negative regulation Mol Microbiol, . 9, 569577[ISI][Medline] .
- Pedersen, A.G., Jensen, L.J., Brunak, S., Staerfeldt, H.H., Ussery, D.W. (2000) A DNA structural atlas for Escherichia coli J. Mol. Biol, . 299, 907930[CrossRef][ISI][Medline] .
- Malakooti, J., Wang, S.P., Ely, B. (1995) A consensus promoter sequence for Caulobacter crescentus genes involved in biosynthetic and housekeeping functions J. Bacteriol, . 177, 43724376
[Abstract/Free Full Text] . - Richard, C.L., Tandon, A., Kranz, R.G. (2004) Rhodobacter capsulatus nifA1 promoter: High-GC10 regions in high-GC bacteria and basis for their transcription J. Bacteriol, . 186, 740749
[Abstract/Free Full Text] . - Helmann, J.D. (1995) Compilation and analysis of Bacillus subtilis
A-dependent promoter sequences: evidence for extended contact between RNA polymerase and upstream promoter DNA Nucleic Acids Res, . 23, 23512360[Abstract/Free Full Text] . - Ozoline, O.N., Deev, A.A., Arkhipova, M.V. (1997) Non-canonical sequence elements in the promoter structure. Cluster analysis of promoters recognized by Escherichia coli RNA polymerase Nucleic Acids Res, . 25, 47034709
[Abstract/Free Full Text] . - Huerta, M.A. and Collado-Vides, J. (2003) Sigma70 promoters in Escherichia coli: specific transcription in dense regions of overlapping promoter-like signals J. Mol. Biol, . 333, 261278[CrossRef][ISI][Medline] .
- Vogel, J., Axmann, I.M., Herzel, H., Hess, W.R. (2003) Experimental and computer analysis of transcriptional start sites in the cyanobacterium Prochlorococcus MED4 Nucleic Acids Res, . 31, 28902899
[Abstract/Free Full Text] . - Burr, T., Mitchell, J., Kolb, A., Minchin, S., Busby, S. (2000) DNA sequence elements located immediately upstream of the 10 hexamer in Escherichia coli promoters: a systematic study Nucleic Acids Res, . 28, 18641870
[Abstract/Free Full Text] . - Mitchell, J.E., Zheng, D., Busby, S.J., Minchin, S.D. (2003) Identification and analysis of extended 10 promoters in Escherichia coli Nucleic Acids Res, . 31, 46894695
[Abstract/Free Full Text] . - Lisser, S. and Margalit, H. (1994) Determination of common structural features in Escherichia coli promoters by computer analysis Eur. J. Biochem, . 223, 823830[ISI][Medline] .
- Morrison, D.A. and Jaurin, B. (1990) Sttreptococcus pneumoniae possesses canonical Escherichia coli (sigma 70) promoters Mol. Microbiol, . 4, 11431152[ISI][Medline] .
- Hanahan, D. (1983) Studies on transformation of Escherichia coli with plasmids J. Mol. Biol, . 166, 557580[ISI][Medline] .
- Metcalf, W.W. and Wanner, B.L. (1993) Construction of new ß-glucuronidase cassettes for making transcriptional fusions and their use with new methods for allele replacement Gene, 129, 1725[CrossRef][ISI][Medline] .
- Kovach, M.E., Elzer, P.H., Hill, D.S., Robertson, G.T., Farris, M.A., Roop, R.M., Peterson, K.M. (1995) Four new derivatives of the broad-host range cloning vector pBBR1MCS, carrying different antibiotic resistance cassettes Gene, 166, 175176[CrossRef][ISI][Medline] .
- Keen, N.T., Tamaki, S., Kobayashi, D., Trollinger, D. (1988) Improved broad-host range plasmids for DNA cloning in gram-negative bacteria Gene, 70, 191197[CrossRef][ISI][Medline]
.
This article has been cited by other articles:
![]() |
C. Bongiorni, T. Fukushima, A. C. Wilson, C. Chiang, M. C. Mansilla, J. A. Hoch, and M. Perego Dual Promoters Control Expression of the Bacillus anthracis Virulence Factor AtxA J. Bacteriol., October 1, 2008; 190(19): 6483 - 6492. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Sepulveda, D. Perez-Mendoza, M. A. Ramirez-Romero, M. J. Soto, I. M. Lopez-Lara, O. Geiger, J. Sanjuan, S. Brom, and D. Romero Transcriptional Interference and Repression Modulate the Conjugative Ability of the Symbiotic Plasmid of Rhizobium etli J. Bacteriol., June 15, 2008; 190(12): 4189 - 4197. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. I. Muglia, D. H. Grasso, and O. M. Aguilar Rhizobium tropici response to acidity involves activation of glutathione synthesis Microbiology, April 1, 2007; 153(4): 1286 - 1296. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Hadjifrangiskou, Y. Chen, and T. M. Koehler The Alternative Sigma Factor {sigma}H Is Required for Toxin Gene Expression by Bacillus anthracis J. Bacteriol., March 1, 2007; 189(5): 1874 - 1883. [Abstract] [Full Text] [PDF] |
||||
|
|

51% of the cases in any given position are marked in yellow. The transcription start-sites are marked with +1. Alternative promoters with suboptimal properties are underline. The consensus sequence derived from the alignment are indicated at the bottom of the alignment. The promoters identified were: 1, prepA; 2, pctRNA; 3, precA; 4, plipA; 5, pthiC; 6, pnifR promoter 1; 7, pnifR promoter 2; 8, psigA; 9, pypf00079; 10, pglyQ; 11, pyhch00468; 12, pyhch00334; 13, pyhch01154; 14, peda; 15, pyhch00528; 16, pengB; 17, p16S rRNA; 18, pyhch01141; 19, pyhch00226; 20, pyhch00739 pseudogene; 21, phemC; 22, pclpP; 23, prpoH1; 24, paroG; 25, pypch00196; 26, pureA; 27, pypch00131; 28, pndvB; 29, pype00047 pseudogene; 30, pyhch00197; 31, ppfs; 32, pcycH; 33, pypch00552; 34, pyhch00096; 35, pypch00132; 36, ppepN promoter 1; 37, ppepN promoter 2; 38, pyhch00326; 39, prpsP; 40, pyhch01115; 41, pyhch00280; 42, pispB; 43, pyhch00076; 44, pexoR2. (b) Distribution of spacer lengths for the 44 promoters of the (a) (c) Distribution of the distance between the 3' of the 10 element (TATNNT) and the +1




