|
|
|
Journal of Bacteriology, September 2002, p . 5158-5169, Vol . 184, No . 18 R391: a Conjugative Integrating Mosaic Comprised of Phage, Plasmid, and Transposon ElementsDietmar Böltner,1 Claire MacMahon,2 J . Tony Pembroke,2 Peter Strike,3 and A . Mark Osborn1* Department of Biological Sciences, University of Essex, Colchester,1 School of Biological Sciences, Donnan Laboratories, University of Liverpool, Liverpool, United Kingdom,3 Molecular Biochemistry Laboratory, Department of Chemical & Environmental Science, University of Limerick, Limerick, Ireland2 Received 1 April 2002/ Accepted 13 June 2002
The most important contributor to horizontal gene transfer is the heterogeneous group of mobile genetic elements that includes plasmids, insertion (IS) elements, transposons, integrons, phages, and genomic islands (10, 17) . They are "selfish" elements that promote their own maintenance and distribution and, in addition, can function as vectors for accessory DNA elements . These accessory elements commonly consist of genes that are nonessential for survival but confer a phenotype, which is advantageous under particular environmental conditions . Prominent examples of such traits are antibiotic and heavy metal resistances, degradation of xenobiotic compounds, symbiosis and virulence determinants, resistance to radiation, and increased mutation frequency (15) .
A group of related conjugative DNA elements has been identified as carriers of antibiotic resistance genes in members of the
More recently, the plasmid nature of these elements was challenged by the discovery of their transfer and stable integration into the host chromosome in recombination-deficient (recA) backgrounds, suggesting that they are conjugative transposons (37) . Furthermore, Hochhut et al . (21) have since reported a close relationship between R391 and the self-transmissible SXT element from Vibrio cholerae O139, which carries multiple antibiotic resistance cassettes (Sur Tmr Smr) . Both code for a nearly identical phage-like integrase, which mediates site-specific integration into the 5' end of the prfC gene of the Escherichia coli chromosome (23) .
In this study, the complete nucleotide sequence of R391 has been determined and analyzed . The main objective of the analysis was to elucidate the nature of R391 and, by inference, of related elements, to identify the genetic systems responsible for both the chromosomal integration of R391 and its conjugative transfer, and to determine whether R391 carries a recognizable autonomous replicon . Additionally, given that the chromosomal association of these elements makes detection difficult, it is possible that such elements are representatives of a much larger and important group of mobile genetic elements, potentially acting as vectors of antibiotic resistances and other phenotypes in the
Shotgun cloning and DNA sequencing. Shotgun cloning of R391 was performed with a TOPO shotgun subcloning kit (Invitrogen) in accordance with the manufacturer's protocol . Briefly, DNA was sheared with compressed air by using a nebulizer, generating fragments of about 1 to 2 kb in size . After blunt-end repair and dephosphorylation, the fragments were ligated to the vector pCR4Blunt-TOPO and transformed into E . coli TOP10, with subsequent blue/white screening used to detect transformants carrying an insert . Plasmid DNA from clones was isolated with a Qiaprep8 miniprep kit (Qiagen) . Library subclones were sequenced by the chain termination method (59) with sequencing kits provided by Applied Biosystems, Amersham, or Cambio . Electrophoresis and analysis were carried out on an ABI Prism 310 genetic analyzer (Applied Biosystems) or a Licor IR4200 . The complete nucleotide sequence of R391 has been determined by sequencing 750 shotgun clones, yielding a fourfold coverage in combination with primer walking for the closure of gaps and regions of insufficient coverage . Primers for gap filling were designed with PrimerSelect software (Lasergene) and used either directly for sequencing on shotgun clones or to sequence PCR amplicons . PCR amplification with the extrachromosomal form of R391 as the template DNA was performed according to standard protocols . Annotation and phylogenetic analysis. Sequences were assembled by using SeqMan software (Lasergene) . ORF Finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html) was employed to search the complete R391 sequence for open reading frames (ORFs), initially with a cutoff value of 50 amino acids, by using the bacterial genetic code . Subsequently, the ORFs were compared to sequences in the public sequence databases of GenBank and EMBL by using the BLAST (1) and FASTA (44) search tools . Of the original 885 ORFs identified, only those that had a minimum length corresponding to 60 amino acids and were not overlapping other ORFs with significant identity to ORFs in the databases were included in the results . Significant identity was defined as being greater than 20% identity over at least 60% of the ORF or, alternatively, over at least 50 corresponding amino acids . ORF products were further analyzed for the presence of conserved protein domains by using the Web-based tools Cognitor (66), PFAM Search (3), and ProDom (1) . Multiple sequence alignments and bootstrap neighbor-joining phylogenies (12, 56) were generated with ClustalX software (67) . Gaps in the alignment were excluded from the calculation of the phylogenetic tree . The G+C plot was created with the program Freak from the EMBOSS package (Human Genome Mapping Project Resource Centre, Cambridge, United Kingdom) with a window size setting of 300 nucleotides (nt) and a step size setting of 10 . Nucleotide sequence accession number. The nucleotide sequence of R391 has been deposited in the GenBank database under accession number AY090559 .
Remarkably, 67 of the ORFs (70%) are transcribed in the three forward frames (Table 1; Fig . 1) . The exceptions are, first, two clusters of eight and six ORFs lying, respectively, upstream and downstream of the attachment site attP, which is located at the end of the linear map (Fig . 1) . Second, a few ORFs belonging to the IS elements, as well as a number of dispersed ORFs, also lie on the reverse strand . This unidirectionality is even more surprising considering the different origins of this element . Given that it has the ability to transfer by conjugation and that numerous antibiotic resistance markers are carried by plasmids, R391 had initially been thought to be a plasmid . However BLASTP analysis of the 96 ORFs failed to show similarity to any known plasmid replication gene; likewise, no similarity was found between R391 and any known origin of replication, suggesting the absence of a plasmid replicon on R391 . It is likely, however, that major parts of R391 have been derived from a plasmid, as suggested by the high number of homologs to ORFs from Salmonella plasmids . If the ancestral R391 did possess a plasmid replicon, it might have been lost upon acquiring the integration mechanism . Integration and excision. Integration of R391 into the host chromosome and its excision from it share similarities to the integration and excision of bacteriophages and genomic islands . Integration into the prfC gene is mediated by a phage-like integrase and results in short direct repeats at both ends of the element and in the reconstitution of prfC by the replacement of its 5' end (20, 22) . However, this mechanism differs notably from those of the majority of genomic islands described to date in its use of an integration site that is not located in, or near, a tRNA gene and in the replacement of the 5' end of the interrupted gene . Other mobile elements using phage-like integrases normally carry a 3' terminus of the interrupted gene, presumably because upstream regulatory sequences such as the promoter region are more complex and difficult to replace (74) . The integrase gene int (ORF 5) is located from 2.7 to 4 kb from the left end of R391 (Fig . 1) and is translated towards the left junction attL of the element, unlike genomic islands, where the integrase gene is typically located directly adjacent to the junction with the chromosome and is translated divergently from the junction . The IncJ elements R997 and pMERPH probably possess an integration mechanism closely related to those of R391 and the SXT element . In both R997 and pMERPH PCR, amplification generates a PCR product of the expected size for the integrase gene and the integration sites are nearly identical (B . McGrath and J.T . Pembroke, unpublished data) . In addition, restriction digestion analysis revealed many restriction fragments common to both R391 and R997 (45) . R391 and the SXT element can integrate into the same chromosome in tandem fashion, with an incoming element probably using one of the junction sites of the resident element with the chromosome attL or attP as the integration site (21) . Conversely, in recA mutant strains containing both R391 and R997, the incoming element could be isolated in the extrachromosomal circular form (45) . The existence of a circular extrachromosomal form raises the possibility of mutual exclusion from integration between these two elements and the probable requirement for extrachromosomal replication . However, this study found no indication of a plasmid replicon, suggesting that the circular form may be a nonreplicative transposition or transfer intermediate .
Evolutionary relationships between phage-related integrases.
The R391 integrase belongs to a diverse family of integrase proteins within a wider group of tyrosine recombinases with a range of functions in DNA metabolism (71; http://mywebpages.comcast.net/domespo/trhome.html) . It has previously been reported that the closely related integrase from the SXT element was itself closely related to that from bacteriophage
To further investigate the evolutionary relationships within the integrase family, representative integrases from R391, the SXT element, phages, and pathogenicity and other genomic islands that were identified by a BLAST similarity search were aligned and a phylogenetic tree was constructed (Fig . 3) . From the several distinct clusters identified, the integrases from R391 and the SXT element belong to a group containing the prophage
Conjugative transfer. A total of 17 putative transfer genes, located in four clusters (TRA I to IV) spanning 24 kb of R391, were identified (Fig . 1) . Given the lack of consistency in the nomenclature of transfer genes among different incompatibility (Inc) groups, in this study the putative R391 transfer genes were named by their homologs in the IncF plasmids (14) and, for ORFs not having close relatives in F plasmids, according to their closest relative . The first cluster, TRA I, is 4 kb in size and is located 32 kb from the left end of the R391 sequence, following a region containing mainly ORFs whose putative function is related to DNA repair or modification . TRA I codes for two putative transfer proteins . The first (encoded by ORF 33) is a homolog of the traI product from plasmid R27 (63), which is thought to be a relaxase, responsible for nicking the DNA during conjugative transfer . ORF 34 is related to a putative coupling protein from R27 . Such proteins usually contain a transmembrane domain and an ATP-binding domain . The R391 homolog similarly carries motifs of this protein family . Separated from TRA I by four ORFs of unknown function, the second cluster, TRA II, is also about 4 kb in size and contains homologs of traL, traE, traK, traB, htdD, and traA (ORFs 39 to 44, respectively), which are probably all involved in pilus synthesis and assembly . TRA II and TRA III are separated by a 7-kb region containing a number of ORFs which are related to those found on transposons and which possibly represent the remnants of such an element . TRA III spans nearly 10 kb and contains homologs of htdT (ORF 56), traC (ORF 57), trhF (ORF 59), traW (ORF 60), traU (ORF 61), and traN (ORF 62) . From the function of their closest relatives, it can be assumed that the first five of these ORFs are involved in pilus synthesis and assembly while traN is probably involved in aggregate stability . Additionally, this cluster contains an ORF (ORF 58) of unknown function . The fourth cluster, TRA IV, is about 6 kb in size and is separated from TRA III by a 15-kb region . It contains homologs of traF (ORF 81) and traH (ORF 82), again putatively involved in pilus synthesis and assembly, and of traG (ORF 83), putatively involved in pilus assembly and mating pair stabilization . Generally, in terms of individual homology and overall structure, the R391 transfer genes are most closely related to homologs from large plasmids from Salmonella strains (Fig . 4) . Of the 17 transfer genes identified, 16 have homologs (average identity, 27%) in the Salmonella enterica serovar Typhi plasmid R27 (63) and its derivative pHCM1 (43) . Both of these plasmids are very large (180 and 218 kb, respectively) and carry multiple antibiotic resistance genes . The order and structural organization of transfer genes, however, are significantly different in R391 and R27 . The transfer genes might have been derived from a common ancestral plasmid, having subsequently diverged and undergone rearrangements . Other conjugative plasmids sharing transfer gene homologs with R391 are the 90-kb Salmonella enterica serovar Typhimurium plasmid pSLT (33), with nine homologs, and the 184-kb plasmid pNL1 from Novosphingobium aromaticivorans (55), having 10 homologs, all of them located within clusters TRA II, III, and IV of R391 . The F plasmid derivative R100 (2) shares seven homologs within these clusters . The IncHI2 plasmid R478 from Serratia marcescens (42) has a total of eight homologs, including all of the putative transfer genes from TRA II and the first two of TRA III . Furthermore, two genomic islands from Salmonella enterica serovar Typhimurium (6) and Neisseria gonorrhoeae (11) have three and two homologs, respectively .
Phenotypic traits . (i) Kanamycin resistance.
R391 confers a kanamycin resistance phenotype . This trait is conferred by an aminoglycoside phosphotransferase gene (ORF 20) which is 100% identical to a number of such kanamycin resistance genes from different strains (Table 1) . The putative gene is carried by a transposon flanked by one copy of IS15- (ii) The mercury resistance (mer) operon. Narrow-spectrum mercury resistance is mediated by a 3.5-kb mer operon consisting of four structural genes, merT, merP, merC, and merA (ORFs 85 to 88) and the regulatory gene merR (ORF 84) . The structural genes of the mer operon of R391 were found to be highly similar (97 to 100% nucleotide identity) to those of the IncJ element pMERPH from Shewanella putrefaciens (40) . The merR gene of pMERPH had not been sequenced previously, but PCR amplification and sequence analysis of the pMERPH merR gene in this study proved that the predicted amino acid sequence is 100% identical to the R391 MerR sequence (D . Böltner and A . M . Osborn, unpublished data) . Unusually for a mer operon from a gram-negative organism, there is no indication of the operon being carried by a transposon, and furthermore, the merR gene is transcribed unidirectionally with the structural genes, a characteristic of gram-positive mer operons (41) . (iii) Mutagenic DNA repair and UV sensitization. Previously, R391 had been shown to carry genes involved in mutagenic DNA repair . These genes, rumA and rumB (ORFs 11 and 10, respectively), had been cloned, sequenced, and shown to be homologs of the E . coli umuDC genes and to function in damage-inducible mutagenic DNA repair (27) . In contrast to these results, R391 more recently has been reported to confer a UV sensitizing phenotype, apparently compromising the host's DNA repair capability and leading to decreased postirradiation survival (46, 47, 70) . The UV-sensitizing phenotype is probably conferred by ORF 13, located adjacent to rumA, which encodes a putative exonuclease domain similar to the DNA polymerase III epsilon subunit . This protein is likely to interact with RumA and/or RumB, altering their activity and dramatically reducing the DNA repair capacity (C . MacMahon and J . T . Pembroke, unpublished data) . Sequence analysis of R391 revealed a number of additional ORFs whose homologs are involved in DNA modification or repair or which contain motifs corresponding to such proteins: ORF 23 (DNA mismatch repair), ORFs 25 and 26 (putative ATPases involved in DNA repair), ORFs 27 and 29 (putative type II restriction enzyme), ORF 64 (DNA ligase), ORF 67 (single-strand binding protein), and ORF 75 (DNA repair protein RadC) . While the restriction enzyme is probably not functional due to insertion of an IS element (see "Transposable elements" below), the role of the other ORFs remains unclear . Homology of ORFs 23 to 27 and 29 to 31 to a 10-kb region from Salmonella enterica serovar Typhimurium LT2 suggests that they may be derived from the same origin .
Transposable elements.
A number of transposable elements have been identified on R391 . The kanamycin resistance gene (ORF 20) is flanked by two copies of IS15- A novel IS element related to IS911 (82% identity) from Shigella dysenteriae (50) has been identified . Located 20 kb from the start of the sequence, the IS element is 1,223 nt in length and is flanked by 26-nt imperfect inverted repeats . The two ORFs upstream and downstream of the IS element (ORFs 27 and 29) are both related to the same putative restriction modification enzyme from Salmonella enterica serovar Typhimurium (GenBank accession no . AAL23313) . Insertion of the IS element has probably interrupted the original ORF in R391 . Although the right end of the IS element contains a new start codon for ORF 29, both ORF 27 and 29 are probably pseudogenes . Between the transfer regions TRA II and TRA III is a region containing the gene for a putative transposase, tnp391B (ORF 48, with 66% identity to the IS5 transposase gene) and, additionally, three ORFs (ORFs 50 to 52), one of which encodes a sulfate permease (ORF 51) and which is related to the Pseudomonas sp . strain R9 transposon Tn1404 (60) . This region might be a transposon or the remnant of such an element, although no inverted repeats, marking the ends of transposons, were found . Putative regulatory elements. ORF 96, located upstream of the right end of R391, is related to both the cI repressor from phage lambda and the lexA repressor . Both these repressors carry a DNA-binding helix-turn-helix motif and an autoprotease domain, which cleaves the protein in the presence of RecA and ATP . In the case of the cI repressor, cleavage inactivates the protein, resulting in prophage induction (25) . LexA controls the expression of a number of DNA repair proteins and is likewise inactivated upon cleavage . The rumA gene carried by R391 (ORF 11) is one such DNA repair gene, and it is preceded by a LexA binding site (27) . Interestingly, RumA itself also contains an autoprotease domain, but in this case the protein is activated when cleaved (25) . The putative repressor encoded by ORF 96 could possibly control the expression of the integrase and inhibit excision, given that both the repressor and the integrase are related to phage lambda proteins . However, the presence of a LexA binding site may also suggest a role in regulation of the rumAB operon . Alternatively, the repressor could belong to the remains of a coresident phage, for which a number of phage-like ORFs on R391 might be an indication (see below) . At present, the exact role of ORF 96 remains unknown but clearly deserves further attention . While being most closely related to ORFs of unknown function from Rhizobium sp . strain NGR234, ORFs 2 and 3 also show similarity to hipB and hipA, respectively, which in E . coli act as a pair of transcriptional regulators that affect lethality due to inhibition of peptidoglycan or DNA synthesis (4, 5) . ORFs 90 and 91 are homologous, respectively, to flhC and flhD,the transcriptional regulators of flagellar synthesis . These form a master operon at the top of the regulatory hierarchy of flagellar synthesis while themselves being regulated by a number of molecules, including the cyclic AMP-catabolite activator protein complex and OmpR . flhD is also a repressor of cell division (31, 51-53) . ORF 15 may be another transcriptional regulator, since it contains a motif encoding such proteins . Conclusions. Sequence analysis of R391 revealed a composite structure of elements of different origins (phage, plasmid, and transposable elements) . While the integrase is clearly phage related, there are a number of further phage-related ORFs, including the cI repressor homolog (ORF 96), an upstream ORF (ORF 94) whose function is unclear, ORF 67 (related to single-strand binding proteins, which are common to phages and plasmids), ORF 68 (related to phage recombination proteins), and ORF 75 (related to DNA primases) . Since these ORFs are dispersed within a 30-kb region, it is unclear if all or some of them are the remains of one prophage or if they are of multiple origins . The conjugative transfer functions are related to those from a number of plasmids and probably share a common origin with the plasmids R27, pHCM1, and pSLT from Salmonella and pNL1 from Novosphingobium . One-third of the 96 ORFs of R391 are related to homologs from Salmonella, in particular the transfer regions and a further 15-kb region containing 8 ORFs which is homologous to a chromosomal region of Salmonella enterica serovar Typhimurium LT2 . Further elements contributing to the mosaic nature of R391 are the transposable elements and the mercury resistance operon, whose origin remains unknown because of the versatility and widespread distribution of these elements . R391 and the related IncJ elements were originally described as transmissible resistance factors (8) and were initially assumed to be plasmids . They were reclassified as conjugative transposons (CTns) on the basis of their RecA-independent transfer and chromosomal integration (36, 37) . More recently, the demonstration of the chromosomal integration of R391 by its use of a phage-like integration system closely related to that of the SXT element from Vibrio cholerae led to a proposal that they should be included in the new group of constins (conjugative, self-transmitting, integrating elements) (21) . R391 and the SXT element share functional and structural similarities to genomic islands, conjugative transposons, and bacteriophages . R391 and its relatives carry specific phenotypes that are also found in genomic islands (including symbiosis and pathogenicity islands) . Moreover, many genomic islands similarly carry phage-related integrase determinants and terminal direct repeats (15, 16) . R391 likewise shares a number of common features with CTns (for example, Tn916 and the Bacteroides CTnDot) . CTns represent a diverse group of self-transmissible elements that are normally integrated into the chromosome but are capable of excision, integration, and transfer by conjugation . While exhibiting a transposon-like phenotype in terms of their ability to excise from, and integrate into, the chromosome, these elements have molecular mechanisms that clearly differ from those of classical transposons such as Tn5 and Tn10 and are instead related to those of lambdoid phages (57, 62) . With conjugative transfer functions related to those of plasmids, CTns are thus hybrid elements comprising phage integration and plasmid conjugation functions . The primary difference between CTns and R391 is that the latter shows a strong specificity with regard to integration site, integrating into the prfC gene of E . coli while CTns integrate relatively randomly into the chromosome (57, 62) . While genomic islands exhibit nonrandom target integration, as does R391, their preferred integration targets are tRNA genes (15, 16) . Finally, in contrast to prophages, R391 lacks the genes that are essential for the formation of a bacteriophage . Our future analysis will further investigate the relationships between, and evolutionary origins of, these different classes of mobile genetic elements .
We thank Barry McGrath for inspiring discussions and Angela Rosin for her contribution to the DNA sequencing of R391 .
What Is Protein?,
What Is Genetics?,
What Is Molecular Biology?,
What Is Staphylococcus Aureus?,
What Is Rhizobia?,
o,
Microbes,
s,
Bacterium,
r,
Bacteriology,
i,
Microbiology,
o,
Microorganisms,
r,
Microorganism,
i,
S. cerevisiae,
r,
Salmonella typhimurium,
n,
Salmonella typhimurium,
a,
Staphylococcus,
s,
Microbial,
e,
Cell cultures,
r,
Escherichia coli,
o,
Antimicrobial,
n,
Bactericidal,
r,
Botulism,
e,
Fermentations,
i,
S. cerevisiae,
s,
Salmonella typhimurium,
e,
Penicillin,
e,
Bactericidal,
a,
Antimicrobials,
s,
Microorganism,
a,
Neisseria,
a,
Antimicrobial,
s,
Prokaryotes
|
© 2005
Transgalactic Ltd (manufacturer of Bioscreen C software) |
Privacy Statement | P.O. Box
1393, 00101 Helsinki, Finland,
Last modified: May 25, 2005
| ||||||