Journal of Bacteriology, June 2004, p. 3980-3990, Vol. 186, No. 12
Intragenomic Heterogeneity and Intergenomic Recombination among Haloarchaeal rRNA Genes
Yan Boucher,1* Christophe J. Douady,2 Adrian K. Sharma,1 Masahiro Kamekura,3 and W. Ford Doolittle1
Genome Atlantic and Canadian Institute for Advanced Research, Dalhousie University, Halifax, Nova Scotia, B3H 4H7 Canada,1 Équipe Hydrobiologie et Écologie Souterraines, Université Lyon I, 69622 Villeurbanne cedex, France,2 Noda Institute for Scientific Research, 399 Noda, Noda-shi, Chiba-ken 278-0037, Japan3
Received 5 November 2003/Accepted 28 February 2004
In placing such reliance on SSU rRNA genes, microbial systematists
have assumed (i) that the sequences of multiple SSU rRNA genes often
found in any one genome do not differ significantly and (ii) that
interspecific (intergenomic) exchange of, or recombination between,
SSU rRNA genes does not occur. In the last few years, it has become
obvious that these assumptions are not true for all prokaryotes. Low
levels of intragenomic variability of the SSU rRNA gene seem to be
very common (6), and even greater
variability can be present in the ITS between the SSU and LSU genes (5).
Even these low levels of divergence can undermine attempts to
evaluate prokaryotic diversity at the molecular level, especially
when methods as sensitive as denaturing gradient gel electrophoresis
are used (7) . Also, when the SSU gene sequence is
used as a criterion to assign strains to a particular species, even a
low level of heterogeneity ( High levels (>5%) of heterogeneity between multiple SSU gene copies found in a single cell have also been determined for some organisms . In eukaryotes, intragenomic SSU gene variability has been reported for the apicomplexan Plasmodium berghei (harboring SSU gene copies that are divergent at 5.0% of their nucleotide positions [13]) and the metazoan Dugesia mediterranea (8.0% divergence [4]) . In prokaryotes, divergence levels of more than 5.0% have been found in two different instances: the thermophilic actinomycete Thermomonospora chromogena and the extremely halophilic archaeal genus Haloarcula . However, it seems that these two cases of intragenomic SSU gene variability have different evolutionary origins . T . chromogena harbors six rRNA operons, one of which differs from the others at 6.0% of the nucleotide positions in the SSU gene and at 10.0% of the nucleotide positions in the LSU gene (35) . This divergent operon was found to be a mosaic of the other operons of T . chromogena and an rRNA operon from the actinomycete species Thermobispora bispora . This mosaic operon was found in only one species, T . chromogena, and the authors hypothesized that it was originally acquired by lateral gene transfer from T . bispora and was subsequently recombined with the other operons of its new host by gene conversion (35) . Presumably, further conversion events ultimately homogenized these operons, and this lateral gene transfer or interspecific recombination event was detected only because it occurred recently . The situation in Haloarcula is different . Intragenomic rRNA
heterogeneity was first detected in Haloarcula marismortui,
which had two rRNA operons that were divergent at 5.0% of the
positions in the SSU gene and at 1.3% of the positions in the LSU
gene (18) . The processing of the rRNA product was shown
to be different in the two operons; one operon exhibited the
canonical archaeal processing, and the other had a unique processing
procedure (10) . Later, similar levels of divergence were
determined for the two to four SSU genes found in numerous species of
Haloarcula (12) . This showed that the
phenomenon is not an isolated event that affects only one species (as
was probably the case for T . chromogena) but rather is an
evolutionarily stable characteristic . Both of the H . marismortui
rRNA operons were shown to be expressed when the organism was grown
in rich media in the laboratory (1), suggesting
that they are both functional . The recent identification of SSU genes
that are
We isolated the rRNA operons of Natrinema sp . strain XA3-1 by using methods that do not involve PCR amplification, since this can lead to the formation of chimeric molecules if multiple heterogeneous rRNA operons are present in the DNA being amplified . In fact, as we discovered in this work, the three operons reported for H . carlsbadense are actually only two operons plus a PCR chimera (32) . We obtained the rRNA operons of H . carlsbadense through a combination of PCR-independent and chimera-limiting PCR methods, and the results clearly showed that only two operons are present and that the third operon that was originally identified is in fact a mosaic of these two operons . Other than H . marismortui, no haloarchaeon possessing more than one rRNA operon has had all of its copies completely sequenced until now . With the addition of H . carlsbadense and Natrinema sp . strain XA3-1, all heterogeneous rRNA operons of representatives of three divergent groups of the Halobacteriales are available to help us understand the origins and evolutionary importance of intragenomic rRNA variability .
Genomic DNA digests. The genomic DNA of Natrinema sp . strain XA3-1 was completely digested with ClaI, while H . carlsbadense genomic DNA was digested with both NotI and ScaI . The digestions were performed overnight at 37°C by following the manufacturer's recommendations (New England Biolabs) . The restriction endonucleases used were chosen because the average length of digestion products was between 5 and 10 kb and there was a low probability of cutting within the rRNA operons of the extreme halophiles being investigated . To determine which restriction endonucleases were unlikely to cut within these rRNA operons, we obtained the sequence of one rRNA operon from each species . These sequences were obtained from PCR-amplified fragments (obtained by using primers F1 and R1 [Fig . 1 ]) cloned in plasmid vector Topo-XL (Invitrogen) . One clone was sequenced for each strain by using the primers described in Table 1 . The resulting sequences were analyzed to identify enzymes that did not cut within them . Also, the enzymes were selected so that they did not cut within other Halobacteriales rRNA operons whose sequences are known, including those of Haloferax volcanii (http://www.ornl.gov/sci/techresources/Human_Genome/publicat/00santa/118.html/), Haloarcula marismortui (http://zdna2.umbi.umd.edu/cgi-bin/blast/blast.pl), Natrialba magadii (16), and Halobacterium sp . strain NRC-1 (19) .
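The enzyme selection described above (keeping only endonucleases with no recognition site inside any known rRNA operon) can be illustrated with a minimal sketch. It is not the procedure the authors ran; the function names and the toy sequences are hypothetical, and only the recognition sites of the three enzymes named in the text (ClaI, ATCGAT; NotI, GCGGCCGC; ScaI, AGTACT) are taken as given. Because all three sites are palindromic, scanning one strand is sufficient. Methylation sensitivity and fragment-length criteria are not modeled.

```python
# Recognition sites (5'->3') of the enzymes named in the text; all are palindromic,
# so a single-strand scan also covers the complementary strand.
SITES = {"ClaI": "ATCGAT", "NotI": "GCGGCCGC", "ScaI": "AGTACT"}


def count_sites(seq, site):
    """Count occurrences (overlapping ones included) of a recognition site in seq."""
    seq, site = seq.upper(), site.upper()
    count = 0
    start = seq.find(site)
    while start != -1:
        count += 1
        start = seq.find(site, start + 1)
    return count


def non_cutters(operons, sites=SITES):
    """Return, sorted by name, the enzymes with no site in any supplied operon sequence."""
    return sorted(
        name
        for name, site in sites.items()
        if all(count_sites(seq, site) == 0 for seq in operons.values())
    )


# Toy sequences for illustration only (not real operon data).
toy_operons = {"operon_1": "GGATCCATCGATTTAA", "operon_2": "CCCGGGAAATTT"}
print(non_cutters(toy_operons))
```

With real data, the dictionary would hold the sequenced operon of each strain plus the published Halobacteriales operons, and the enzymes returned would still need to be checked for the desired 5- to 10-kb average fragment length.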
Genomic DNA library construction. Southern hybridization revealed the locations of the rRNA operons relative to the DNA ladder (between 5 and 10 kb). For both Natrinema sp. strain XA3-1 and H. carlsbadense, genomic DNA was redigested and electrophoresed on an agarose gel under conditions identical to those used for Southern hybridization. Genomic DNA was then extracted from the regions of the gel corresponding to DNA fragments containing an rRNA operon (MinElute; QIAGEN). Before purified DNA fragments could be cloned, protruding 5' or 3' overhangs resulting from digestion had to be blunt ended and dephosphorylated in preparation for blunt end cloning according to the manufacturer's instructions (TOPO-Zeroblunt; Invitrogen). The resulting products were ligated into the Topo-Zeroblunt plasmid vector (Invitrogen) and transformed into chemically competent Escherichia coli TOP10 cells (Invitrogen), which were plated on kanamycin-containing media to select for positive transformants.
Library screening. The libraries were transferred to positively charged nylon membranes (Roche) and screened directly for the presence of rRNA genes by hybridization with a digoxigenin-dUTP-labeled probe (PCR-amplified SSU gene of the species from which the genomic DNA in the library originated). Positive clones were confirmed by end sequencing with the M13 forward and M13 reverse primers and direct sequencing with a universal Halobacteriales SSU gene primer, primer F1 (Table 1).
PCR amplification of rRNA genes not obtained in genomic libraries.
All four rRNA operons of Natrinema sp . strain XA3-1 and one of
the two operons of H . carlsbadense were obtained after repeated
screening of the libraries . A PCR strategy aimed at minimizing
the formation of chimeras was used to obtain the second rRNA operon
of H . carlsbadense . Compared to standard PCR conditions, we
used five times more of each primer, an extension time that was five
times longer, and fewer PCR cycles (25 cycles) . This resulted in the
following PCR conditions: a 50-µl (final volume) mixture containing 1
to 5 ng of template DNA, 1x PCR
buffer, 1 µl of a solution containing each deoxynucleoside
triphosphate at a concentration of 10 mM, 5 µl of a solution
containing each primer at a concentration of 10 µM, and 1 µl of
PfuTurbo DNA polymerase (Stratagene). The initial denaturation took
place at 95°C for 2 min, and this was followed by 25 cycles
consisting of denaturation at 95°C for 30 s, primer annealing at 55°C
for 30 s, and primer extension at 72°C for 5 min . The operon was
amplified in six fragments that were 0.8 to 1.0 kb long and
overlapped at their extremities, as chimera formation was less likely
for shorter products (Fig . 1) . After cloning of
each fragment (TOPO-ZeroBlunt; Invitrogen), a mixed population of
clones (from each of the two operons) was obtained . Twenty-four
clones were sequenced for each fragment, and subsequently the
sequences were compared to the sequence of the complete H.
carlsbadense rRNA operon obtained from the library. Sequences that
appeared only once (singletons) (
DNA sequencing, analysis, and assembly. Positive clones from the library found to carry an entire rRNA operon were sequenced by using MegaBACE technology and BigDye chemistry. Multiple primers were used for sequencing the entire rRNA operon, which gave overlapping data so that a reliable sequence could be obtained (Fig. 1). The sequencing primers targeted conserved regions of rRNA operons, as determined by examination of an alignment of all available Halobacteriales rRNA genes. The nucleotide sequences of these primers are shown in Table 1. Sequencher 4.1.2 (Gene Codes Corporation) was used to analyze sequence chromatograms and assemble sequence fragments.
Multiple-sequence alignment and phylogenetic analysis. All
available nucleotide sequences for SSU and LSU genes of culturable
Halobacteriales were retrieved from the GenBank database .
Sequences from the database were aligned with novel sequences
obtained in this study by using CLUSTALW (29) and were edited
manually to remove gaps and ambiguously aligned characters .
Phylogenetic analyses were performed with PAUP* 4.04b (28) by
using the heuristic search option and the TBR branch-swapping
algorithm . Maximum likelihood and maximum-likelihood distances were
the tree reconstruction methods used, and the nucleotide substitution
model, gamma rate parameter
Calculation of the among-site rate variation for the SSU gene in Halobacteriales. TREE-PUZZLE (http://www.tree-puzzle.de) was used to determine the relative evolutionary rates of all nucleotide positions of the SSU gene in Halobacteriales. Rates were calculated from an alignment of the SSU genes of 80 representative taxa belonging to the Halobacteriales under a gamma-distributed model with four rate categories. Such an analysis could not be performed for the LSU gene because sequence data were available for only eight taxa, which was insufficient for calculation of informative relative evolutionary rates.
Nucleotide sequence accession numbers. The nucleotide sequences determined in this study have been deposited in the EMBL database under accession numbers AJ586107 to AJ586112.
H . carlsbadense SSU genes were amplified by PCR as part of the
taxonomic identification of this organism by Vreeland et al . (32),
who used a DNA polymerase with relatively low fidelity (LA Taq;
Takara Shuzo, Kyoto, Japan) . Three divergent genes were initially
inferred, and one of these genes was different from the other two at
Determination of the number of rRNA operons in Natrinema sp . strain XA3-1 and H . carlsbadense. The number of rRNA operons in each of the two organisms in which heterogeneity was detected was determined by Southern hybridization (Fig . 2) . Natrinema sp . strain XA3-1 was found to have four rRNA operons . In contrast to previous claims that H . carlsbadense harbors three rRNA operons (based on PCR amplification of three divergent SSU genes from genomic DNA [32]), we found that this haloarchaeon had only two operons . This result was obtained when the genomic DNA was digested to completion with a combination of NotI and ScaI (Fig . 2), as well as with NotI/ClaI and ScaI/ClaI (results not shown) .
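The idea that a third PCR-derived sequence can be a mosaic of the two genuine operons can be made concrete with a small sketch. This is an illustration under stated assumptions, not the analysis performed in this study: it assumes three pre-aligned sequences of equal length and a single crossover point, and the function names (`mismatches`, `best_single_breakpoint`, `looks_chimeric`) are hypothetical. It searches for the breakpoint at which the candidate matches one parent on the left and the other parent on the right with the fewest mismatches.

```python
def mismatches(a, b):
    """Return the positions at which two equal-length aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


def best_single_breakpoint(candidate, parent_a, parent_b):
    """Return (mismatch_count, breakpoint, left_parent) for the best one-crossover model.

    candidate[:breakpoint] is compared with the left parent and
    candidate[breakpoint:] with the other parent. Ties keep the first model found.
    """
    diff_a = mismatches(candidate, parent_a)
    diff_b = mismatches(candidate, parent_b)
    best = None
    for left, d_left, d_right in (("A", diff_a, diff_b), ("B", diff_b, diff_a)):
        for k in range(len(candidate) + 1):
            cost = sum(i < k for i in d_left) + sum(i >= k for i in d_right)
            if best is None or cost < best[0]:
                best = (cost, k, left)
    return best


def looks_chimeric(candidate, parent_a, parent_b, max_mismatches=0):
    """True if the candidate differs from both parents yet fits a one-crossover mosaic."""
    differs_from_both = bool(mismatches(candidate, parent_a)) and bool(
        mismatches(candidate, parent_b)
    )
    cost, _, _ = best_single_breakpoint(candidate, parent_a, parent_b)
    return differs_from_both and cost <= max_mismatches


# Toy aligned sequences for illustration only.
operon_a = "AAAAAAAAAA"
operon_b = "AACAAAAGAA"
candidate = "AAAAAAAGAA"  # matches operon_a on the left, operon_b on the right
print(best_single_breakpoint(candidate, operon_a, operon_b))
print(looks_chimeric(candidate, operon_a, operon_b))
```

A sequence identical to one parent also fits a zero-mismatch model, which is why `looks_chimeric` additionally requires differences from both parents. Real chimeras may carry several crossovers and polymerase errors, so `max_mismatches` would need to be relaxed and the single-breakpoint assumption revisited.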
Locations of the variable positions of heterogeneous rRNA genes on the secondary structure of the rRNA product. Nucleotide substitutions for the multiple SSU and LSU genes in Halosimplex, Natrinema, and Haloarcula were mapped by using the secondary structures of their rRNA products . Haloarcula was included because it is the only genus of haloarchaea besides those examined in this study known to display intragenomic heterogeneity in its rRNA genes (18) . Most substitutions in the heterogeneous rRNA genes found in these haloarchaea are either compensatory mutations that occur in stems or are located in loop regions . This suggests that these substitutions should have little or no effect on the overall secondary structure of the rRNA . Furthermore, all of the ribosomal protein genes of H . marismortui have been cloned and sequenced (26), and all appear to be single-copy genes . This implies that the same protein should have the capacity to bind to rRNA from either of the two heterogeneous genes to form functional ribosomes . When grown in rich media, H . marismortui has indeed been shown (by using fluorescent in situ hybridization probes specific for each of the two SSU genes) to have a mixed ribosome population (1) . Nucleotide substitutions in heterogeneous SSU genes are almost always found in hypervariable regions (Fig . 3A) (that is, regions in which most interspecies divergence occurs) . Regions corresponding to helices 21, 22, and 26 (5' domain) in the SSU gene secondary structure are variable in all three haloarchaeal genera that display high intragenomic rRNA gene heterogeneity . The region corresponding to helices 7, 8, and 9 (5' domain) is highly variable in both Halosimplex and Natrinema but not in Haloarcula . Multiple regions are heterogeneous in only one genus (e.g., helix 41 is variable in Halosimplex, and helix 6 is variable in Haloarcula) .
The surprisingly high level of similarity between positions 270
and 370 of the strain XA3-1 LSU-B gene and the equivalent region in
the N . magadii homolog is most easily explained by
cross-species homologous recombination . Such a recombination event is
made even more likely by the fact that it is limited to a specific
structural region of the LSU molecule (helices 18, 19, and 20 in
domain I) . Most of the nucleotide substitutions caused by the
recombination, when they are in stem regions, are complementary to
each other, conserving the rRNA secondary structure (Fig.
4A) . The maintenance of the secondary structure is
important, given the complexity of the ribosome and the numerous
interactions of the LSU gene molecule with proteins and other RNA
components . To our knowledge, this is the first example of
interspecies recombination between rRNA genes in archaea . Also, the
recombination took place over one of the largest phylogenetic
distances ever reported for recombination of rRNA genes (between two
genera that differ at
Phylogenetic distribution of haloarchaea with intracellular heterogeneity in the rRNA genes. A phylogeny of the order Halobacteriales based on the SSU gene is shown in Fig. 5A . With the addition of data from this study, rRNA heterogeneity is now known to occur in three divergent lineages: Halosimplex, Haloarcula, and Natrinema . These lineages, according to the SSU gene data, do not exhibit specific phylogenetic affinity for each other and are spread across the Halobacteriales diversity (Fig . 5A) . Of course, each species with multiple different rRNA operons has many branches on the tree, but assignments at the genus level or higher (perhaps with the exception of Natrinema and Haloterrigena) are not altered .
When phylogenetic trees of the SSU and LSU genes of haloarchaea were compared to each other by using the same taxon sampling method, the overall structure was very similar (Fig . 6) . The LSU gene tree, which resulted from a data set with more informative positions, seems to be slightly more resolved . The only difference between the two trees is in the branching order of the SSU and LSU genes from Natrinema sp . strain XA3-1 . In the SSU gene tree, the SSU-A gene is by far the most divergent and strongly branches in a basal position in relation to the other three copies of the SSU gene . In the LSU gene tree, on the other hand, the most divergent copy is LSU-B, and LSU-A groups strongly with the other two copies .
The similarity observed in a 100-bp stretch between one of the LSU genes of Natrinema sp . strain XA3-1 and the N . magadii LSU gene, which is likely to be due to an interspecies recombination event, suggests an origin for intragenomic rRNA heterogeneity . As the other heterogeneous regions of Natrinema sp . strain XA3-1 LSU genes are also in 100- to 200-bp stretches and are found in only one or two of the rRNA operons, it is likely that they also originated by recombination (although this cannot be confirmed unless the source of the recombined fragment is identified) . The source of recombined fragments is often very difficult to identify, for several reasons . Recombination usually occurs in hypervariable regions, which means that the sequence of the recombined fragment rapidly diverges from the source sequence, making it hard to detect by eye or by using software (which is unable to find a statistically significant match with the source) . The limited number of rRNA gene sequences available in databases for Halobacteriales, compared to the diversity found in nature, also makes the search for a recombination donor difficult (especially for the LSU gene, for which less than a dozen sequences are available) . It is clear that the four LSU genes of Natrinema sp . strain XA3-1 did not evolve in coordination with the SSU genes and the ITS partners . First, all LSU gene copies differ from each other by roughly similar numbers of substitutions (0.9 to 1.9% of the nucleotide positions differ) . In contrast, only one of the strain XA3-1 SSU genes is really different from the others (it differs at 5.0% of the positions); the other three SSU genes differ from each other at only one or two positions . The ITS between the SSU and LSU genes are identical in all four operons . This is surprising, as the ITS is the fastest evolving part of rRNA operons (23) and usually displays high variability, even at the species level (11) . 
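The two kinds of comparison used in this paragraph, overall percent difference between gene copies and localized similarity over stretches of about 100 bp, can be sketched as follows. This is a minimal illustration that assumes two pre-aligned sequences of equal length with "-" as the gap character; the function names, the window size, and the step are assumptions for the example and are not parameters reported in this study.

```python
def percent_difference(a, b):
    """Percentage of aligned, ungapped positions at which sequences a and b differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * sum(x != y for x, y in pairs) / len(pairs)


def window_identity(a, b, window=100, step=10):
    """Return a list of (start, percent_identity) for windows slid along the alignment."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    results = []
    for start in range(0, len(a) - window + 1, step):
        diff = percent_difference(a[start:start + window], b[start:start + window])
        results.append((start, 100.0 - diff))
    return results


# Toy sequences for illustration only: one substitution in 20 positions.
copy_1 = "ACGT" * 5
copy_2 = "T" + copy_1[1:]
print(percent_difference(copy_1, copy_2))
print(window_identity(copy_1, copy_2, window=10, step=10))
```

Comparing a gene copy against a candidate donor from another genus in this way would highlight a window whose identity is much higher than the flanking regions, which is the pattern described for positions 270 to 370 of LSU-B. A window consisting only of gaps raises an error in this sketch, and identity alone does not establish statistical significance.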
There are two possible explanations (which are not mutually exclusive) for the unequal evolutionary rates of different parts of Natrinema sp . strain XA3-1 rRNA operons: intragenomic homologous recombination (gene conversion) among the four rRNA operons of XA3-1 does not happen uniformly across the length of the operons, and/or interspecies homologous recombination is frequent . Recombination between rRNA genes of different strains or species could occur at a much higher frequency than originally suspected . Like the intragenomic variability of Natrinema, the intragenomic variability between the LSU genes of Halosimplex and Haloarcula seems likely to have originated through recombination . For both of these taxa, heterogeneous positions are distributed nonrandomly in the LSU gene and usually occur in small patches . Also, although heterogeneity always occurs in hypervariable regions, the regions affected differ from one species to another . This suggests that the origins of regions displaying intragenomic heterogeneity are diverse . As no widespread survey of the Halobacteriales for intragenomic rRNA variability has been undertaken, the frequency of the phenomenon is still unknown . Such heterogeneity has been detected in only three lineages so far, but it is likely to be present in others . For most haloarchaeal species, only one SSU gene which was amplified by PCR has been sequenced . This approach is likely to miss intragenomic heterogeneity for several reasons: PCR amplification can be biased toward a particular copy of the SSU gene, the amplified gene could be a chimera of the multiple copies present in the organism, and in many studies only a few clones originating from a single PCR are sequenced (and therefore it is likely that heterogeneous PCR products are missed) . 
Phylogenetic analysis of rRNA genes recovers groups that display intragenomic heterogeneity for these genes as monophyletic entities (e.g., all Haloarcula SSU genes form a clade, and all Natrinema-Haloterrigena LSU genes group together to the exclusion of the LSU genes of other genera). These monophylies suggest that the heterogeneity could have originated in the ancestors of each of the clades, although information about the distribution of this phenomenon in haloarchaea is necessary to verify this claim. What is the source of the heterogeneity? It could be interspecific recombination, as mentioned above, or simply random divergence of paralogs. It is difficult to distinguish between these possibilities, which again are not necessarily mutually exclusive. Identification of clear traces of a between-genus recombination event in the LSU genes of Natrinema sp. strain XA3-1 established that at least part of the intragenomic rRNA heterogeneity found in some Halobacteriales is due to the former process. Could the intragenomic variability of rRNA operons be maintained by evolutionary pressure? Such a link between functionality and intragenomic rRNA divergence has been observed in the apicomplexan Plasmodium berghei. The two types of SSU genes (which differ at 5.0% of their nucleotide positions) are preferentially expressed in different stages of the life cycle of this eukaryotic parasite (13). It has been suggested that in extremely halophilic archaea, a selective advantage could be gained from differential expression of divergent rRNA operons depending on the salt concentration in the environment (10). Indeed, salinity has a significant influence on most biochemical reactions and is very variable in the environments occupied by halophiles, which are subjected to constant fluctuations caused by solubilization-precipitation and dilution-evaporation (9). There is even some experimental evidence of salinity dependence for rRNA expression.
Indeed, the promoters used for expression of the unique rRNA operon of Halobacterium cutirubrum vary according to the salt concentration in the medium in which the organism is grown (8) . Both rRNA operons of H . marismortui have been shown to be expressed under standard laboratory growth conditions (1), but differential expression under variable growth conditions has yet to be demonstrated . Intragenomic heterogeneity, as discussed above, causes problems at the level of rRNA data analysis (6) . More importantly, it can also cause artifacts at the data acquisition level . Indeed, PCR amplification is known to be susceptible to the formation of chimeric products if the template DNA contains multiple divergent copies of the target gene (25) . A certain rate of chimera formation is known to occur when the SSU gene is amplified directly from environmental DNA samples, which usually contain a great diversity of the target gene (31) . Intragenomic heterogeneity means that this type of artifact can also occur for PCR amplification of rRNA genes when DNA extracted from a pure culture of an organism is used . Although PCR conditions can be modified to reduce the risk of chimera formation, this problem can never be completely eliminated . The vast majority of the SSU gene sequences of Halobacteriales available in the database have been amplified by PCR without knowledge of the possibility of intragenomic heterogeneity occurring in lineages other than the genus Haloarcula . Several of the sequences found in the database could therefore be chimeric, and thus caution should be used when they are used for phylogenetic analysis (14) . Furthermore, rRNA recombination is certainly not a process limited to a specific phylogenetic group . It has been observed in several bacterial lineages at the species or subspecies level (3, 21, 22, 27) . 
The highly evolutionarily conserved stretches of DNA found in rRNA genes, often considered an advantage for tracing the phylogeny of organisms over large evolutionary distances, could also facilitate homologous recombination between divergent organisms (34) . The presence of intragenomic heterogeneity, which is at least partially caused by homologous recombination of rRNA genes, can have a significant impact on the acquisition of true (nonchimeric) gene sequences and their use in identification and classification of organisms through phylogenetic analysis . Therefore, efforts to develop non-PCR methods of acquiring rRNA genes and evaluating the frequency of intragenomic heterogeneity and homologous recombination among prokaryotes are needed to ensure that these genes are reliable molecular markers for use in microbiological disciplines .