|
|
|
Journal of Bacteriology, September 2004, p . 5699-5707, Vol . 186, No . 17 Transposon-Mediated Linker Insertion Scanning Mutagenesis of the Escherichia coli McrA EndonucleaseBrian P . Anton and Elisabeth A . Raleigh* New England Biolabs, Beverly, Massachusetts Received 30 March 2004/ Accepted 27 May 2004
All three endonucleases exhibit sequence specificity, although for the well-characterized McrBC enzyme, the specificity is weak (34) . The specificity of McrA is poorly known, since its in vitro action has not been studied . Furthermore, evidence that McrA actually cleaves DNA is indirect, consisting of the bacteriophage restriction phenomenon and the ability of a partially disabled mcrA gene to mediate induction of a DNA damage reporter in the presence of a sensitive methyltransferase gene (22) . Substrates sensitive to McrA restriction in vivo include those modified by M.HpaII (C5mCGG), M.Eco1831I (C5mCSGG), and M.SssI (5mCG) (14, 16, 27); however, a comprehensive survey of 5-methylcytosine methyltransferases with regard to McrA restriction has not yet been undertaken, so the precise recognition sequence remains unclear . Genetically, mcrA resides in the E . coli K-12 genome on e14 (26), a 14-kb defective prophage which self-excises following induction of the DNA damage-inducible (SOS) regulon; this property has proven useful in the construction of McrA strains more tractable for cloning methylated foreign DNA than wild-type E . coli K-12 . The mcrA gene has been cloned and found to encode a 31-kDa protein product with 277 residues (13, 28) . It is normally expressed at low levels in the cell, and regulation at the translational level has been suggested (29) . McrA has no close homologues in the public sequence databases (3), but two conserved motifs have been noted in its carboxy-terminal domain . First, four cysteine residues define a zinc finger-like motif common to the reverse transcriptase/intron nuclease-like proteins encoded in many group II introns (8) . Second, an H-N-H endonuclease motif, common to both group I and II intron-encoded proteins as well as phage proteins and bacteriocins, overlaps the zinc finger motif (10, 30) . As the zinc finger motif is not conserved among all of the H-N-H endonucleases, its presence or absence is not thought to affect the fold of the endonuclease domain (10) . In keeping with this, a recent sequence-threading model of McrA residues 159 to 272 shows the zinc finger stabilizing the arrangement of the presumptive catalytic histidine residues rather than participating directly in the active site of the protein (4) . Outside the modeled region, proposed to be the catalytic core of the enzyme, McrA exhibits weak similarity only to the group II intron maturases, which have not been structurally characterized . Thus, there remains little insight into the structure and function of the majority of the McrA protein . The relationship between the McrA and RglA phenotypes is also poorly understood . Of the mcrA mutants that have been described previously, several appeared to separate the McrA and RglA phenotypes, i.e., abolishing one activity but not the other (13, 29) . However, the published results are inconsistent: two very similar mutants isolated by independent groups were characterized as McrA+ RglA by one group and McrA RglA+ by the other . This inconsistency aside, such mutants may ultimately help characterize the recognition elements responsible for distinguishing methylated from hydroxymethylated substrates . The present study sought to characterize the functional domain architecture of McrA genetically by insertion scanning mutagenesis . These insertion mutants contain five additional contiguous amino acids interpolated at random locations within the polypeptide chain; the resulting structural perturbations will have different effects on the protein's activity depending on the location and composition of the inserted residues . Previous studies with this technique have yielded significant structure-function relationship information on proteins whose structure is known, including the XerD recombinase from Salmonella enterica serovar Typhimurium and ί-lactamase from pBR322 (5, 11) . As might be expected, insertions near a protein's active site as well as those that disrupt regions of secondary structure were in general shown to have more deleterious effects on activity than those occurring in linker regions and surface loops . In addition, some insertions affecting the substrate binding cleft of ί-lactamase significantly altered the substrate specificity of that enzyme while sparing its catalytic activity (11) . This study probed the domain structure of a protein of largely unknown structure by the generation of similar insertion mutants .
DNA damage assays were carried out in E . coli strain ER2171
[F fhuA2
Enzymes and general techniques. All enzymes were from New England Biolabs, Inc., and used according to the manufacturer's instructions . Plasmids were isolated with the QIAprep Spin miniprep kit (Qiagen, Inc.) or the Compass mini plasmid prep kit (American Bioanalytical, Inc.) . Plasmid pNEB193 was from New England Biolabs, Inc . Plasmid construction. pJEK8 was constructed from pER137, which has been described previously (26), in several steps . The resulting construct (3,789 bp) consists of a roughly 1.1-kb HpaI-BsrGI E . coli genomic DNA fragment containing mcrA fused to the 2.7-kb BsgI-EcoRI fragment of pBR322 . pACYC-MHpaII was constructed by inserting the 2.0-kb HindIII fragment containing the hpaIIM gene (6) into the HindIII site of pACYC184 . The orientation of the insert is unknown, but M.HpaII expression was confirmed by complete protection of the construct from cleavage by HpaII . DNA sequence. The nucleotide coordinates discussed in this work refer to the 1,040-bp DNA sequence of the E . coli genomic fragment containing mcrA from GenBank accession no. Z19104 (28) . pJEK8 contains nucleotides 35 to 1037 of this sequence, and the mcrA coding region is nucleotides 124 to 957 of this sequence . Bacteriophage restriction assays. All bacteriophage assays
were performed on phage medium plates supplemented with 100 ΅g of
ampicillin per ml . Cross streak and spot tests were performed as
described previously (26) . For full plate titers,
100 ΅l of cells prepared as for the spot tests was combined with 100
΅l of bacteriophage (T4gt for RglA assays or
Linker scanning mutagenesis. Mutagenesis was performed with the GPS-LS linker scanning system (New England Biolabs, Inc., Beverly, Mass.) according to the manufacturer's instructions . ER1793 cells were transformed by electroporation or chemical means with between 1 and 10 ΅l of the reaction and selected with ampicillin and either kanamycin or chloramphenicol, depending on the donor plasmid used . Transformants were assayed by cross-streaking, and plasmids were isolated from RglA clones; 20 ΅l of each plasmid was digested with PmeI for 2 h at 37°C in a 50-΅l volume, and the enzyme was heat killed by incubation for 20 min at 65°C . Half of each reaction was examined by gel electrophoresis to determine the completeness of the reaction and the correctness of the fragment sizes . The remainder was ligated overnight at 16°C with 400 U of T4 DNA ligase in a total volume of 100 ΅l . ER1793 cells were transformed again with 4 ΅l of each ligation reaction and selected with ampicillin . Transformants were assayed for RglA and/or McrA activity by cross-streaking, spot test, or full plate titer . In addition, loss of the PmeI fragment containing the selectable marker was confirmed by restriction analysis . Insertions were located by DNA sequencing with the GPS-N and GPS-S primers .
Transposon mutagenesis. The mcrA-containing plasmid pJEK8 was mutagenized in vitro with minitransposons in six separate transposition reactions: five reactions employed the transposon Transprimer-5, encoding resistance to kanamycin, and one used Transprimer-4, encoding chloramphenicol resistance . A total of 470 kanamycin- or chloramphenicol-resistant colonies were examined by cross-streaking analysis, and 129 (27%) were scored as RglA (Table 1) . We assumed that all mutants carrying insertions in the ampicillin resistance marker and origin of replication would be inviable, all mutants carrying transposon insertions in the mcrA coding region would be RglA, and all mutants carrying transposon insertions outside the mcrA coding region would be RglA+ . Given these assumptions, about 36% of transformants were expected to be RglA, given a random distribution of insertions, in good agreement with the 27% obtained . The small discrepancy between the expected and observed values may be attributed to the failure of any of the above assumptions in certain cases, misinterpretation of cross-streaking results, or stochastic variation . None of the RglA+ clones were sequenced .
Generation and analysis of linker mutants. The target plasmid pJEK8 contained no PmeI sites, so digestion of the transposon insertion plasmids with PmeI generated two fragments, 3.8 kb (including all of pJEK8 plus 15 bp of new DNA at the site of transposon insertion; see Fig . 1) and either 1.7 or 1.4 kb (including the bulk of Transprimer-5 or Transprimer-4, respectively) . Those of the PmeI-digested RglA clones which yielded the expected digestion pattern were religated, yielding 3,804-bp plasmids containing a functional origin of replication and encoding the mcrA linker mutants . Of the 129 RglA clones, 84 were used in the final analysis (Table 1) . The remaining 45 were eliminated for a variety of reasons, specifically: aberrant PmeI digest patterns either before or after loss of the transposon fragment (28), poor sequencing results, possibly due to double transposon insertion (3), location of the insertion outside the mcrA fragment (2), clerical errors specific to one experiment (8), and miscellaneous problems with strain growth, plasmid copy number, or spot test results (4) . The linker mutants were tested for RglA activity by spot test with phage T4gt . These showed a range of restriction of greater than 106-fold between the RglA+ phenotype from ER1793(pJEK8) cells expressing wild-type McrA and the RglA phenotype from ER1793(pBR322) cells not expressing McrA . Tables 2 and 3 show the degree of restriction and corresponding phenotypes of the 84 McrA mutants generated in this study with amino acid insertions and C-terminal truncations, respectively . The phenotypes of most mutants resembled either the archetypal RglA+ or RglA phenotype, but several exhibited clearly intermediate degrees of T4gt restriction, and these were collectively termed RglAi phenotypes .
RglA classification of linker mutants. Linker insertions generated by this method result in either in-frame amino acid insertions or premature stops, depending on the reading frame . The amino acid insertions will have various composition, dependent partly on the target sequence at the site of insertion (Fig . 1) . Fifty-six such mutants were isolated, representing 46 independent locations within the gene and 42 independent locations within the protein . Of the 46 distinct mutants, 17 restricted T4gt to a degree approximating that of wild-type McrA (RglA+), 23 exhibited no detectable restriction (RglA), and 6 restricted the phage but to a degree significantly less than the wild type (RglAi) . Figure 2 shows the distributions of these phenotypes across the gene graphically . Much of the protein is divided into tolerant and intolerant regions, judged by the clustering of black and gray RglA arrows in the upper half of the figure . A segment within the modeled region contained a cluster of mutations each giving partial activity .
A final group of mutants, of which six were isolated, have linker insertions immediately upstream of the mcrA coding sequence . Although the expressed proteins should all be identical to wild-type McrA, the original transposon insertions were no doubt RglA due to separation of the coding sequence from its promoter . In agreement with this, all corresponding linker mutants were RglA+ . Even clone 4-16, in which the distance between the putative ribosome-binding site and the mcrA coding sequence is increased by 15 bp, appeared to be fully RglA+ . McrA classification of linker mutants. In a surprisingly large number of cases (about one-third of all mutants tested), the McrA phenotype did not match the RglA phenotype, confirming the separation-of-phenotype phenomenon observed in earlier studies (13, 29) (see Discussion) . The relationship between the two phenotypes can be described by the following rules, with one exception: (i) mutants that are RglA+ are also McrA+, and vice versa; (ii) mutants that are RglAi are also McrAi; and (iii) mutants that are RglA can be either McrA or McrAi . In most cases, RglA mutants with insertions or truncations in the N-terminal half were McrA, and RglA mutants with insertions or truncations in the C-terminal half were McrAi . To confirm that the McrAi phenotype observed in the truncation mutants did not result from spurious translational readthrough of the inserted stop codon, the AfeI-PmeI mcrA fragments from mutants 2-24 (McrAi) and 4-13 (McrA) were subcloned into the PmeI site of pNEB193 . These subclones did not contain any mcrA-derived sequence downstream of the stop codon but still conferred the same McrA phenotypes as their respective parental plasmids, indicating that the McrAi phenotype is intrinsic to the mutant protein (data not shown) . The single exception to the rules above was mutant 1-75, which had
a 5-amino-acid insertion after residue 67 and was RglA but fully
McrA+ . All cases demonstrating separation of phenotype
indicated that RglA activity is more easily disrupted by mutation
than is McrA activity, or conversely,
Assessment of DNA damage. Piekarowicz and colleagues suggested that wild-type mcrA can mediate induction of the LexA regulon (the SOS response to DNA damage) in the presence of M.HpaII methylation (22) . We exploited this property to determine which of the McrA mutants could inflict DNA damage in the presence of M.HpaII methylation . Each of the linker mutant plasmids was introduced into ER2171(pACYC-MHpaII), an E . coli reporter strain with lacZ fused to the damage-inducible dinD locus and expressing M.HpaII from a compatible plasmid . This strain yields blue colonies on X-Gal plates at 37°C and below when the SOS response has been induced . Similar reporter strains have been used to screen for restriction endonucleases and methyltransferases (9, 12, 22) . The final columns of Tables 2 and 3 show
the colony colors obtained in this test . As expected, pBR322 yielded
white colonies, as did 12 of 15 derivatives with RglA
McrA phenotypes . No colonies were obtained from strains
transformed with the wild type or any derivatives with the RglA+
McrA+ phenotypes . This was anticipated also; strains
containing mcrA restrict plasmids containing hpaIIM (27),
so it is reasonable to expect that the opposite configuration of
genes would also be incompatible . All RglAi McrAi
mutants yielded blue colonies, suggesting the presence of
McrA-dependent DNA cleavage activity in these cases; 15 of 16 RglA
McrAi mutants yielded white colonies, suggesting the
absence of DNA cleavage activity . The partial restriction of
There were a few exceptions to the above generalizations among the mutants with 5-amino-acid insertions . One RglA McrAi mutant with an insertion in the middle of the protein yielded blue colonies . A more sensitive RglA assay indicated no more than threefold restriction of T4gt by this mutant . Although we consider restriction in this range to be RglA, the blue phenotype on X-Gal suggests some level of DNA cleavage competence . Finally, three RglA McrA mutants carrying insertions in the middle of the protein yielded pale blue colonies on X-Gal . This color did not appear until about 24 h after plating and was significantly less intense than that scored as blue . We grouped these with whites when considering them below (Discussion) . Distribution of insertions. The distribution of transposon insertions is shown in Fig . 2 . Not shown are duplicate insertions . Nine of 72 sites yielded multiple insertions: one site had four insertions, one had three insertions, and seven sites had two insertions . In 12 of the 16 possible pairs of duplicate insertions, the two members were generated by different transposition reactions and thus were independent . The distribution of transposons appears random and therefore is compatible with previous reports (1) . The longest region that did not receive an insertion was between nucleotides 345 and 413 (68 bp long) . Given a completely random distribution of 84 insertions in 872 bp (nucleotides 85 to 957 in our sequence), the longest run of sequence that we should expect to find without an insertion is approximately 67 bp, in excellent agreement with our observed value . (Expected run length was calculated as K = log1/[1P] N, where K is the longest expected run, P = 84/872 is the probability of an insertion at any given location, and N is the number of windows of size K, which we approximate as the sequence length 872.) Furthermore, no obvious biases were observed specific to either Transprimer-4 or Transprimer-5 (Tables 2 and 3) .
As expected from this model, all C-terminal truncations within this region abolished RglA activity and SOS induction with the exception of isolate 3-7 (Table 3), which lacks only the extreme C-terminal 9 residues . This mutant shows an intermediate phenotype (RglAi McrAi) . Furthermore, the phenotypes of amino acid insertion mutants are consistent with the proposed structure . Within the modeled region, only amino acid insertions after residues 159, 167, 174, and 177 resulted in full activity (RglA+ McrA+) . The structural elements proposed for these locations are loops or regions connecting secondary structural elements at the periphery of the structure . The region may represent the C-terminal end of a larger connector region extending outside of the modeled area . Six other amino acid insertion mutants retained partial restriction activity (after residues 203, 217, 224, 266, 267, and 271) and also induced the SOS response . While these insertions were also in proposed surface loop regions, they were more centrally located in the proposed tertiary structure . This could cause enough structural distortion to affect catalytic activity . The remaining nine insertions in this region, many of which were located within putative helices or proximal to metal ion-coordinating residues, abolished both RglA restriction and SOS induction completely . Outside of the modeled region, we found additional segments critical for restriction activity in the N-terminal domain of the protein . Between residues 28 and 124, a majority of insertions (12 of 18) were inactive in all tests . Since the active site likely resides in the modeled C-terminal region, DNA binding and sequence specificity are probable roles for these N-terminal intolerant segments . The extreme N terminus (residues 1 to 20) and the putative interdomain connector region (residues 128 to 177) were highly tolerant of the five-residue insertions generated in this study . Mutants with insertions in the extreme C terminus, on the other hand, suffered partial loss of both restriction activities . This may result from spatial proximity of this region to the cysteine residues coordinating the Zn2+ ion as per the threading model (4); distortion of this region by insertion may cause loss of or reduced affinity for the metal ion . Effect of amino acid identity in insertions. The insertion of five amino acid residues within a protein will affect the overall structure both by distorting the backbone and by introducing new noncovalent, and potentially covalent, interactions with the remainder of the protein via the new side chains . Among the mutants generated here, there were four cases in which different sets of amino acids were introduced at the same location, after residues 38, 174, 177, and 235 . In all four cases, the phenotypes of both insertions were identical . Based on this evidence, it seems that the backbone distortion is the more influential of the two structural effects, but side chain effects may predominate in other cases . Comparison with published mutants. The small number of nondeletion mutants of McrA which have been described previously are summarized in Table 4 . Entries include both characterized mutants, for which the precise changes relative to the wild-type sequence are known, and mutagenized E . coli strains, for which the sequence change(s) underlying the restriction phenotypes is unknown . The insertion mutation mcrA1272::Tn10 abolished RglA restriction activity, as did all of our full-length transposon insertions .
Separation of restriction phenotypes. All 17 mutants exhibiting phenotype separation were RglA; of these, only one retained full McrA+ activity, and this one will be discussed later . The remaining 16 showed partial activity (McrAi) and all carried mutations located in the C-terminal half of the protein . Six of these were truncation mutants, most comparable to the HindIII fill-in mutants reported earlier; 15 of these 16 McrAi mutants (including all of the truncations) were also white in the SOS-reporting strain, and therefore we conclude that they are not able to inflict DNA damage in the presence of HpaII-methylated DNA, the presumed target of restriction . We propose that for these 15 mutants, the separation of phenotype
reflects loss of cleavage activity but not DNA binding activity for
both targets . The residual restriction phenotype, we suggest, is due
to interference with phage development by binding rather than
cleavage . The RglA and McrA phenotype assays differ in two
fundamental ways, the type of bacteriophage (T4 or
This explanation for residual restriction of
A further inference can be drawn; there may be a privileged
location in
Other partial-restriction phenotypes. Mutants carrying
mutations that only partially disrupt the catalytic domain should be
capable of partially restricting both T4gt and
One RglA McrAi mutant carrying an insertion near the border of the proposed binding domain (residue 122) retained blue color in the SOS reporter strain, and three mutants carrying insertions within the proposed boundaries of the DNA binding domain were RglA McrA but gave a pale blue color on X-Gal . These mutants may retain limited cleavage activity but represent various degrees of debilitation of enzymatic activity or protein stability relative to the RglAi McrAi class . The last mutant showing phenotype separation is mutant 1-75 . This mutant did not display partial disability in any assay; it was fully RglA but also fully McrA+ . Moreover, it could not be stably maintained in the presence of M.HpaII expression; both blue and white colonies were observed on X-Gal in the SOS induction assay, presumably resulting from strong selection for further mutations that inactivate the enzyme completely . This mutation is a five-residue insertion in the proposed DNA binding domain . A reasonable proposal is that this mutation does indeed disable the recognition of hydroxymethylated but not methylated substrates . Utility of insertion scanning mutagenesis. The ease with which many mutants can be generated, coupled with a variety of phenotypic assays, has enabled us to propose a model for the functional domains of the McrA protein, an N-terminal DNA binding domain and a C-terminal cleavage domain . In addition, the convenient restriction sites associated with the mutations provide ready tools for further manipulation . A straightforward test of the model will be isolation of protein from truncation mutants that spare McrA activity partially; these should enable isolation of a DNA-binding protein . Characterization of the sole mutant that is fully McrA+ and fully RglA may enable better understanding of sequence recognition elements in this protein .
What Is Rhizobia?,
What Is Pcr?,
What Is Yeast?,
What Is Antibiotic?,
What Is Genetics?,
c,
Microbe,
a,
Bacteriology,
a,
Bacterium,
s,
Microbiology,
i,
Microorganisms,
n,
Multidrug resistant,
n,
Bacillus,
c,
S. cerevisiae,
e,
Salmonella typhimurium,
i,
S. cerevisiae,
o,
Salmonella,
c,
Bacteria,
o,
Campylobacter,
e,
Escherichia coli,
s,
Antimicrobial,
i,
Water treatment,
n,
Botulism,
n,
Erwinia,
i,
S. cerevisiae,
c,
S. cerevisiae,
o,
Penicillin,
r,
Antibiotic prophylaxis,
o,
Acinetobacter,
n,
Phage,
i,
Neisseria,
a,
Microorganism
|
© 2005
Transgalactic Ltd (manufacturer of Bioscreen C software) |
Privacy Statement | P.O. Box
1393, 00101 Helsinki, Finland,
Last modified: May 25, 2005
| ||||||