|
|
|
Journal of Bacteriology, May 2002, p . 2755-2766, Vol . 184, No . 10 Characterization of Two Cryptic Helicobacter pylori Plasmids: a Putative Source for Horizontal Gene Transfer and Gene Shuffling
Dirk Hofreuter, Max von Pettenkofer Institut für Hygiene und Medizinische Mikrobiologie, Ludwig-Maximilians Universität München, D-80336 Munich, Germany Received 29 November 2001/ Accepted 20 February 2002
Comparison of the two available genome sequences of H . pylori 26695 (42) and J99 (1) reveals that differences between the strains are the result of intragenomic rearrangements, resulting in deletion, inversion, or translocation of larger genome fragments (macrodiversity) (1) . Many H . pylori strains possess several copies of the insertion sequences IS605 and IS606 . Furthermore, H . pylori 26695 contains five so-called plasticity zones with G+C contents of 33% (zone 1), 35% (zone 2), 33% (zone 3), 43% (zone 4), and 33% (zone 5), which differ from the chromosomal G+C content of 39% (42) . Strain H . pylori J99 (1) carries several different plasticity zones which are not present in H . pylori 26695 . It is speculated that H . pylori received the plasticity zones by horizontal gene transfer . Transduction, conjugation, and natural transformation are the three common mechanisms of horizontal gene transfer in bacteria . Natural transformation competence was first described for H . pylori by Nedenskov-Sorensen et al . (32) . Several genes that are involved in the transformation process of H . pylori have been identified (2, 24, 39, 40) . We demonstrated recently that natural transformation of H . pylori is mediated by basic components of a type IV secretion system (23) . Plasmid conjugation has not been proven for H . pylori, but there is some evidence that horizontal transfer of genes in H . pylori could take place via a DNase-resistant, conjugation-like mechanism (3, 28) . Although about 50% of H . pylori strains carry cryptic plasmids ranging in size from 2 to about 100 kb (35), the role of these plasmids is not well understood . Several H . pylori plasmids of up to 6 kb have been analyzed in detail in the last few years (20, 27, 30) . They can be grouped into at least two separate classes . Plasmid pHPK255 (27) reveals homology to plasmids of gram-positive bacteria replicating via the "rolling-circle" mechanism, whereas other plasmids (pHPM180, pHel1, and pHPS1) fall into the group of iteron-containing plasmids and replicate via the theta mechanism (10) . In this study, we investigated which genetic information is encoded by larger H . pylori plasmids (6 to 20 kb) and whether plasmids might be a source for horizontal gene transfer and for generation of macrodiversity in H . pylori . The sequencing and comparison of two H . pylori plasmids, pHel4 and pHel5, identified both common and unique open reading frames (ORFs), suggesting that individual plasmids express distinct features . From database comparisons we identified putative genes involved in conjugative transfer (mob) as well as gene sequences homologous to that of an operon present in certain Escherichia coli strains responsible for microcin production . Furthermore, the H . pylori plasmids described in this study show a modular structure, which apparently enables them to integrate or delete complete functional modules, thus making them ideal candidates for an efficient gene-shuffling mechanism .
For selection of H . pylori mutant strains, serum plates supplemented with chloramphenicol (6 mg liter-1) were used . E . coli strains HB101 (8) and DH5 DNA manipulations and plasmid constructions. Cloning and DNA analysis procedures were performed as described by Sambrook et al . (37) . H . pylori chromosomal DNA was isolated with the QIAamp tissue kit (Qiagen) . Plasmid DNA was purified from E . coli by the boiling procedure, and electroporation-competent E . coli cells were prepared according to the protocol recommended for the Gene Pulser (Bio-Rad) . Plasmid preparation and plasmid library construction for DNA sequencing. Plasmid DNA of different Helicobacter strains was extracted from cells growing on plates or from cell pellets after cultivation in liquid media . The latter method resulted in general in plasmid DNA of better quality . Plasmid DNA was isolated by using either the Wizard Plus SV Minipreps purification system (Promega) according to the manufacturers protocol or the XSP buffer extraction method described by De Ungria et al . (11) . In brief, cell pellets were resuspended in 100 µl of TES (10 mM Tris-HCl [pH 7.4], 1 mM EDTA [pH 8.0], 15 mM NaCl) by adding 1.5 ml of prewarmed (65°C) XSP buffer . XSP buffer contains XS buffer (1% potassium ethyl xanthogenate, 100 mM Tris-HCl [pH 7.4], 20 mM EDTA [pH 8.0], 1% sodium dodecyl sulfate [SDS], and 800 mM ammonium acetate) with an equal volume of phenol . The resuspended pellets were incubated for 30 min at 65°C . After a short vortexing, incubation on ice for 5 min, and centrifugation at 14,000 x g for 15 min the aqueous phase was transferred to a fresh Eppendorf tube . The samples were purified with two phenol-chloroform-isoamyl alcohol (25:24:1) extractions, and RNA was digested with RNase A for 30 min at 37°C . One volume of isopropanol and 0.1 volume of 3 M sodium acetate, pH 4.8, were used to precipitate the DNA, which was washed with 70% ethanol once . After being dried the plasmid DNA was resuspended in 50 µl of distilled water . The isolation of plasmid DNA from H . pylori by the XSP method resulted in minor contamination with chromosomal DNA, which was removed by digestion with an exonuclease (Plasmid-Safe ATP-dependent DNase; Epicentre) . Purified plasmids pHel4 and pHel5 were mechanically dissected into 1.2- to 1.4-kb fragments, and the plasmid fragments were cloned into the Topo-blunt-II vector (Invitrogen) for DNA sequencing of 96 subclones . Every section of the plasmids was covered with at least two contigs . The analysis of the sequences with MAPSORT resulted in HindIII fragments corresponding in size to the restriction patterns, verifying the sequence assembly . Tetracycline susceptibility determination of H . pylori. Tetracycline resistance of H . pylori strains was determined by a plate diffusion test . Approximately 108 H . pylori cells were plated on serum plates, and an antibiotic disk with a tetracycline concentration of 30 µg/ml was placed in the middle of the plates . After incubation under microaerobic conditions for 5 days at 37°C the size of the zone of resistance around the antibiotic disk was measured . For quantitative tetracycline resistance determination an E-test strip with a tetracycline gradient of 0.016 to 256 mg/ml was used . Southern hybridization for detection of mob genes in H . pylori plasmids. Southern blotting and hybridizations with DNA fragments were performed using the ECL labeling and detection system according to the manufacturer's protocol (Amersham) . For hybridization 0.5 M NaCl was used; the washing buffer contained 0.5x SSPE (180 mM NaCl, 10 mM sodium phosphate [pH 7.5], 1 mM EDTA), 6 M urea, and 0.4% SDS at 42°C . Filters were washed with 2x SSPE-0.05% SDS at 42°C .
Computer analyses.
Predictions of ORFs were performed by the programs MAP of Genetics Computer Group (GCG) software (12), ARTEMIS (http://www.sanger.ac.uk/Software/Artemis), with a cutoff of 100 bp, and BLASTX (http://www.sanger.ac.uk/Software/Artemishttp://www.ncbi.nlm.nih.gov/BLAST) . The GC contents were determined by COMPOSITION (GCG), the GC profiles were created by ARTEMIS, and the terminator structures were identified with TERMINATOR (GCG) . Physical properties of proteins (Mr, pI, and net charge) were calculated with PEPTIDESORT (GCG), and structural properties were determined with PREDICTPROTEIN (http://www.embl-heidelberg.de/predictprotein/) . Membrane associations were determined by the programs PEPTIDESTRUCTURE (GCG), PSORT (http://psort.nibb.ac.jp), TMPRED (http://www.ch.embnet.org/software/TMPRED), and TMHMM (http://www.cbs.dtu.dk/services/TMHMM) . The program COILS (http://www.ch.embnet.org/software/COILS) was used for finding
Nucleotide sequence accession numbers. The complete plasmid sequences of pHel4 and pHel5 have been submitted to GenBank, and the following accession numbers have been provided: pHel4, AF469112; pHel5, AF469113 .
Plasmid maintenance regions of pHel4 and pHel5. Orf4I, encoded by pHel4, and Orf5A, encoded by pHel5, were identified as replication initiation proteins (Rep) showing significant sequence identity to the corresponding RepA proteins encoded by different H . pylori plasmids (Table 3) . This group of H . pylori Rep proteins supports replication according to the theta replication mechanism (10) . The upstream region of orf4I is highly conserved compared to the pHel1 repA gene, indicating that the same transcriptional start site as that experimentally determined for pHel1 (20) might be used in pHel4 . The upstream region of orf5A (pHel5) differs significantly from the corresponding regions of pHel1 and pHel4 but is identical to the upstream region of repA of pHPS1, suggesting that the repA genes of pHel5 and pHPS1 use the same transcriptional start site (data not shown) .
For pHel1 we demonstrated a low copy number of about 4 to 10 plasmid copies per cell in H . pylori (21) . In addition, pHel4 and pHel5 replicate stably in H . pylori growing in vitro on agar plates without any selection pressure . The putative gene product of Orf5L has significant homology to a group of ParA proteins involved in the correct partitioning of plasmids to the bacterial daughter cells during cell division, which is necessary for the maintenance of low-copy-number plasmids (17) . A protein identical to Orf5L, encoded by plasmid pHPM186 (the unpublished plasmid sequence is available in the database under accession no . AF077006), and one encoded by plasticity zone 3 in H . pylori 26695 (HP1000; 49.5% identity) were described . We also identified JHP0935, homologous to ParA and encoded by a plasticity zone of H . pylori J99; this protein is 45.5% identical to Orf5L . HP1000 was recently grouped in the new ParF subgroup of the ParA superfamily (19) . Further studies need to be conducted to show whether or not plasmids without parA or -F homologues, for example, pHel1, pHel4, pHPM180, and pHS1, use the chromosomally encoded plasmid partitioning systems for plasmid maintenance . Microcin MccC7 homology region. Orf4A (pHel4) revealed the best sequence homology to a putative TetA(P) tetracycline efflux membrane transporter protein of H . pylori strains 26695 (HP1165) (42) and J99 (JHP1092) (1) (Table 1) . Since H . pylori P8 did not tolerate higher concentrations of tetracycline than H . pylori strains 26695, J99, P1, P12, and P29 (0.05 to 0.6 µg/ml; data not shown), it was concluded that Orf4A and chromosomally encoded proteins HP1165 and JHP1092 are probably not involved in tetracycline resistance . Orf4A displayed a lower homology to several transporter proteins involved in protein secretion, especially E . coli microcin secretion protein MccC (23% identity) . Interestingly, the orf4B sequence of pHel4, located upstream of orf4A, seems to be organized with orf4A in an operon, and its product reveals in addition a sequence identity to the MccB protein (28% identity), encoded by the E . coli microcin operon (18) . In the E . coli system, MccB is involved in modification of MccA (MccC7) (J . E . Gonzalez-Pastor, J . L . San Millan, and F . Moreno, Letter, Nature 369:281, 1994) and microcin is exported by MccC . The modified secreted MccA peptide is taken up by related bacteria and acts there as an inhibitor of translation if no corresponding immunity protein is expressed . In plasmid pHel4 a candidate microcin structural gene upstream of mccB was also identified (D . Hofreuter and R . Haas, unpublished data) . Conjugation-like ORFs. Between two regions (bp 2380 to 2454 and bp 5043 to 5245) of pHel4 with very low G+C contents (21 and 25%, respectively) a gene cluster with homology to a conjugal mobilization (mob) region of colicinogenic plasmids was identified . The observed overlap of ORFs orf4C to orf4F (Fig . 4A) is very reminiscent of the structural organization of mobA, mobB, mobC, and mobD of colicin-encoding plasmids pColA (31), pColE1 (7), pColD157 (22), and pWQ799 (26) . Orf4C shows significant homology in its N terminus to different MobA proteins and the RLX protein encoded by plasmid pC223 of Staphylococcus aureus, which is also a mobilizing protein (Fig . 4B) . Orf4F, encoded by pHel4, reveals best homologies to MobC proteins encoded by colicinogenic plasmids .
Although primary sequence homologies between putative proteins encoded by orf4E and orf4D and MobB and MobD proteins are not obvious, there are several conserved motifs in both groups of proteins, indicating a conserved function . In the orf4D-encoded protein there are two leucine zipper motifs (L-X[6]-L-X [6]-L-X[6]-L), which are also found in MobD encoded by colicinogenic plasmids . For Orf4E best homologies with several viral and eucaryotic proteins with coiled-coil structures, as well as an identity of 21% with a mobilizing protein (MobB2) of Staphylococcus epidermidis, were found . Thus, similar structural and physical properties of these proteins imply similar functional properties of the Orf4C, Orf4D, and Orf4F proteins with mobilization proteins encoded by conjugative plasmids . We investigated the distribution of the mob region in H . pylori by analyzing plasmids from clinical H . pylori isolates for the presence of the mobA gene by (i) PCR amplification with specific primers DHO104 and DHO105 and (ii) Southern hybridizations with a mob-specific (orf4C) DNA probe (data not shown) . According to these data, about 35% of H . pylori plasmids investigated possess the putative mob region . A putative origin of transfer (oriT) but no tra functions. Essential for the mobilization of a plasmid is the presence of an origin of transfer (oriT) in the plasmid itself . The Mob and Tra proteins are usually active in trans and might be encoded by other conjugative plasmids or the bacterial chromosome . The oriT sequences in colicinogenic plasmids are usually located upstream of the mob regions, and the nic sites for plasmids pColE1 and pColA are identical . A corresponding sequence for pHel4 was not found at this position . A putative nic sequence identical to the well-characterized nic sequence of IncP plasmid RP4 was found in orf4M of pHel4 (TATCCTG/C [consensus sequence in bold type]); this sequence might act as a functional nic site of the plasmid . P-type nic sites in conjugative colicinogenic plasmids have not been described yet (29) . A further question relates to the tra functions for a conjugative transfer of pHel4, which are missing in the plasmid . Several putative proteins, which might be involved in DNA conjugation, are encoded by the genome sequence, especially in plasticity zones of H . pylori 26695 (42) and J99 (1) . They show significant homology to VirB and Trb proteins (Fig . 5) . The identification of putative IncP type relaxases encoded by hp0996 and hp1004 (5, 41) also supports the conjugative transfer of plasmids in H . pylori .
The second pair of homologous proteins, Orf4H and Orf5F, show best homologies to HP0892 and HP0894 of H . pylori 26695, as well as to JHP0825 and JHP0831 of H . pylori J99 (Tables 1 to 3) . These hypothetical proteins have well-conserved C termini . The BLASTP search also identified significant homology among Orf4H and Orf5F and gene products of other bacterial species, encoded either on plasmids or in the chromosome (Tables 1 and 2) . Most unexpected, orf4G and orf4H of plasmid pHel4 and orf5F and orf5G of pHel5 have the same organization as the corresponding chromosomal genes of H . pylori 26695 and H . pylori J99, suggesting an exchange of the gene clusters between chromosome and plasmid . For the chromosomal genes it was shown that the members of these two families interact with each other in a two-hybrid screen (36) . Homologies by BLASTP are as follows: HP0316 and HP0895, 1e-57; HP0894 and HP0895, 1e-71; HP0895 and HP0895, 1e-204 . ORFs in the chromosomes of H . pylori 26695 and J99 homologous to orf4G and orf4H and orf5F and orf5G are clustered in regions with high genetic diversity (9) . A switch-inducing repeat 1 sequence located in plasmids pHel4 and pHel5. Interestingly, genes homologous to orf4G and orf5G are part of the repeat 1 sequence (42), which is located at different positions in the chromosome and which is coupled with the 3' regions of different OMPs . The repeat 1 sequence can be considered a switch-inducing sequence, which might be supported by the concerted activity of Orf4G and Orf5G or Orf4H and Orf5F, acting as recombination-inducing proteins . Such a switch mechanism, which is related to an intrachromosomal recombination event at the repeat 1 sequence, is actually found when the 26695 and the J99 genomes are compared . This switch resulted in an exchange in the chromosomal locations of bacterial adhesin genes babA and babB (1, 42) . This mechanism leads to variation in the C termini of the corresponding OMPs (25, 34) . Such a genetic switch might result in an immune system escape mechanism by phase variation of immunodominant OMPs but might also result in functional variation of the adhesin binding properties of BabA or BabB (25) . The third pair of common genes found in pHel4 and pHel5 encode proteins with homology to products of chromosomal genes orf4M, orf4O, and orf5K (Table 3) . Orf4M and Orf5K have identities of between 38 and 46% to hypothetical proteins JHP651 of H . pylori J99 and HP0712 and HP0713 of H . pylori 26695 . JHP0651 might be a fusion between HP0712 (N terminus) and HP713 (C terminus) . The BLASTP analysis also identified homology to eucaryotic proteins, for instance, ZK593.8 of Caenorhabditis elegans and human huntingtin interacting protein E, an approximately 350-kDa protein of unknown function involved in the neuropathology of Huntington's disease (15) . Orf4M and Orf5K also possess homology to the Orf2 protein, described first as a product of H . pylori plasmid pHPM180 (30), and to Orf-2, encoded by the NBU1 element (nonreplicating Bacteroides units) (43) . The homology is clustered mainly in the N-terminal parts of Orf4M and Orf5K, which contain the conserved amino acid sequence PFSDGNGRTGRALMF (data not shown) . Thus, Orf4M, Orf4O, and Orf5K belong to a group of proteins whose coding sequences are widespread in bacteriophages, plasmids, and NBU-1 elements and also in eubacteria and some eucaryotic organisms . Further plasmid-encoded proteins also encoded by the H . pylori chromosome. As observed in pHPM186, plasmid pHel5 encodes proteins with homology to products of genes clustered in plasticity zone 3 of H . pylori 26695 . As described above, Orf5L belongs to the group of ParA proteins, as do the homologous proteins HP1000 and JHP0935 . Additionally, orf5P and orf5Q encode proteins with high identity to HP0993 and HP0994, hypothetical proteins encoded by genes in the H . pylori plasticity zones with unknown functions (Table 3) . The sequence of orf5Q in pHel5 carries a frameshift, resulting in a truncated HP0994 gene product . Further proteins encoded by pHel4 and pHel5 with homology to products of genes widespread in the chromosomes of H . pylori 26695 and J99 are Orf5B (similar to endonuclease HP1295) and Orf5O (best homologue to hypothetical protein HP1334) . Repeat sequences and site-specific recombination events determine size variation and the modular structure of H . pylori plasmids. The comparison of independent H . pylori plasmid sequences identified several sequence repeats termed R1, R2, and R3 (10) . The R1 and R3 repeats correspond to iteron sequences, located upstream of genes encoding replication initiation proteins RepA and RepB . It has been suggested that the R2 repeat, which consists, in pHP180, of two copies of 232 noncoding nucleotides, might act as a target sequence for recombination events (10, 30) . Our sequence analysis of pHel4 and pHel5 revealed for both plasmids incomplete sequences of the 232-bp R2 repeat, designated R2' (Fig . 6) . For pHel4 we found one nearly complete 232-bp sequence (bp 5665 to 5842) between orf4H and the putative origin of replication . This region also contains an intact 36-bp stretch (bp 5578 to 5613), which was first found between the two R2 repeats in pHPM180 . A second incomplete copy of the R2 repeat in pHel4 is between orf4O and orf4A (bp 10845 to 10954) . For pHel5, one identical stretch to the 232-bp sequence was found (bp 18120 to 18291) . The comparison of the locations of repeat sequences in H . pylori plasmids suggests a modular structure of H . pylori plasmids with insertion and deletion of sequence modules at different repeat sequences, which might act as hot spots for recombination or site-specific integration events (Fig . 6) . A novel R4 repeat in H . pylori plasmids is associated with gene shuffling. Further studies showed that in pHel4 a stretch of 36 bp located at the 3' end of the repA gene occurred twice and that the two copies are separated by about 1.7 kb (Fig . 7A) . We designated this 36-bp sequence (5'-CATTTGAAAAATTGGTTCAAGAAACACTACAGGTAA-3'), which codes for the C terminus of RepA (HLKNWFKKHYR), the novel R4 repeat . The R4 repeat is also well conserved in plasmids pHel5 (this study), pHel1 (20), pHPM180 (30), pHPS1 (10), and pHPM186 (accession no . AF077006) (Fig . 6) . We were interested to test whether or not the duplication of the 36-bp sequence, as in pHel4 and pHPM186, is a common event in H . pylori plasmids . PCR amplification with oligonucleotides DHO106 and DHO107 (Fig . 7B) revealed fragments of 1.7 and 0.8 kb in pHel4 and four plasmids isolated randomly without prior screening from independent H . pylori isolates, respectively . Since the PCR fragments can only be generated if the R4 repeat is duplicated (Fig . 7A), we postulate the site-specific integration of a DNA fragment at the 3' end of the repA gene for these plasmids, resulting in a duplication of the conserved R4 repeat . The 1.7-kb fragment of pHel4 codes for putative proteins Orf4J, Orf4K, Orf4L, and Orf4M . Orf4M has significant homology to HP712, HP713, and JHP651 (Table 3) . Direct sequencing of two 800-bp PCR products revealed sequences with coding capacity for a protein with identity to Orf4M (100%), Orf5K (97%), Orf6-pHPM8 (86%), and JHP0651 (41%) . These experimental data support our hypothesis that the R4 repeat might act as a site of integration of novel sequence modules into H . pylori plasmids (Fig . 7) .
Partial and complete IS elements on plasmids pHel4 and pHel5. Plasmids pHel4 and pHel5 do not carry any complete IS elements, unlike pHPM186 . Instead, we identified several regions in pHel5 identical to border sequences of IS elements, such as IS606 and IS608, and to the core sequence of mini-IS605 (42) . A stretch of 35 bp between orf5G and orf5H is identical to the left border sequence of IS606 . Between orf5C and orf5D there are 35 bp identical to the right border sequence of IS606 . This region also has homology to mini-IS605 . A second smaller stretch of this IS606 inverted repeat right (IRR) is located between orf5M and orf5N . This sequence is part of a sequence duplication (5'-TTTTGACATACTCCCCATAGCTAAAGCTAGAGACTTTGCGG-3') in pHel5 (bp 6997 to 7037 and 12353 to 12380) since a second copy of that sequence was identified between orf5E and orf5F (Fig . 6) . A single copy of this 41-bp sequence could also be found in pHel4 between the R4 repeat and orf4J . Again, the repeat and IS border sequences are always located at defined positions, flanking certain genes or groups of genes, such as repA (R2 and R4 repeats), the mcc-mob region in pHel4 and pHP186 (R2 repeat), and the orf4J-orf4M region of plasmid pHel4 (R4 repeat) . The advantage of such a modular organization is rather obvious (Fig . 8) . H . pylori can distribute a high number of diverse genetic modules in the population . Different combinations of modules might be created by recombination (deletion and insertion) and selected for by the needs of the bacteria in their individual hosts . Due to the modular structure, plasmids might either pick up chromosomal genes of H . pylori or integrate sequence modules from foreign plasmids, which are taken up by the bacteria during its natural transformation competence (gene shuffling) . For conjugative plasmids, as we postulate for pHel4, the novel sequences could be rapidly distributed within the H . pylori population and exchange novel plasmid sequences with the bacterial chromosome . Such events might help to explain the development of macrodiversity among H . pylori strains and the rapid generation of substrains .
This work was supported by the Deutsche Forschungsgemeinschaft (HA2697/3-1) .
What Is Biofilter?,
What Is Protein?,
What Is Genetic Engineering?,
What Is Biotechnology?,
What Is Fermentation?,
s,
Bacteria,
s,
Microbiology,
r,
Bacteriology,
s,
Microbe,
i,
Microorganism,
s,
Pseudomonas,
r,
Anaerobes,
n,
Escherichia coli,
s,
Bacteria,
r,
Streptococcal,
o,
Multidrug resistant,
a,
Cell cultures,
i,
Meningococcus,
n,
Enterobacters,
c,
Thermophile,
c,
Lactobacillus,
c,
Lactobacillus,
s,
Haemophilus,
r,
Microorganism,
r,
E coli O157,
a,
Antibiotics,
e,
Phage,
c,
Streptococci,
n,
S. cerevisiae,
n,
Antibiotic treatment,
n,
Biofilms
|
© 2005
Transgalactic Ltd (manufacturer of Bioscreen C software) |
Privacy Statement | P.O. Box
1393, 00101 Helsinki, Finland,
Last modified: May 25, 2005
| ||||||