|
|
|
Journal of Bacteriology, March 2004, p . 1448-1461, Vol . 186, No . 5 Transcriptional Profiling of Caulobacter crescentus during Growth on Complex and Minimal MediaAlison K . Hottes,1,2 Maliwan Meewan,2 Desiree
Yang,3, Departments of Electrical Engineering,1 Developmental Biology, Stanford University, Stanford, California 94305,2 Biology Department, Santa Clara University, Santa Clara, California 95053,3 Bioinformatics Research Group, SRI International, Menlo Park, California 940254 Received 14 August 2003/ Accepted 12 November 2003
The dimorphic life cycle of C . crescentus has served as a model for understanding prokaryotic development, cell cycle regulation, and asymmetric cell division [24] . At each division, C . crescentusdivides into two morphologically and physiologically distinctcells: motile swarmer progeny with a single polar flagellum,and sessile stalked progeny displaying an adhesive stalk . Theswarmer progeny, in which DNA replication is initially inhibited,is specialized for dispersal and can seek new nutrient sourcesby chemotactic sensing . After about a third of their cell cycle,swarmer progeny cells eject their flagellum and grow a stalk from the same pole . The stalk enhances nutrient acquisitionby increasing the effective membrane area available for transport systems without substantially increasing the cytoplasmic volume[20] . Stalked cells initiate chromosome replication, grow anew flagellum, and divide, yielding new swarmer and stalkedcells. Genome-scale transcriptional profiling has shown that the expression of over 500 C . crescentus genes varies significantly over the course of the cell cycle [32] . Among these are many genes involvedin energy metabolism, nucleotide and amino acid biosynthesis,and protein synthesis . As shown here, a substantial fractionof these genes are also regulated in response to nutrients, raising intriguing questions as to how regulatory networks controlling metabolism and cell cycle progression might interact. Caulobacter is obligately aerobic and heterotrophic and can
utilize a variety of carbon sources . Previous studies have demonstrated
the activity of Entner-Doudoroff pathway enzymes [14]
duringCaulobacter growth on glucose [48,
52], but there has been noquantitative analysis
of glucose flow into the Entner-Doudoroffpathway versus alternatives
such as the Embden-Meyerhof-Parnasglycolytic pathway . Related
Genome sequence analysis unexpectedly revealed that C . crescentus has the potential to degrade plant-derived biopolymers through the production of exoenzymes, including cellulases [endo-ß-1,4-glucanases and ß-glucosidases], xylanases and xylosidases, and polysaccharide deacetylases [45] . In freshwater ponds, lakes, and rivers, Caulobacter species could use live or decaying aquatic vegetation as surfaces for biofilm formation . The major monomeric products released by degradative exoenzymes acting on lignocellulose would be glucose [from cellulose] and xylose [from xylan] . The mechanism of xylose dissimilation in C . crescentus is unknown. The best-known bacterial pathway for xylose catabolism, isomerization to xylulose followed by phosphorylation to xylulose-5-phosphateand entry into the pentose phosphate pathway [23], is probably not operative in C . crescentus, given the absence of homologs for xylose isomerase and xylulose kinase in the genome . An inducible NADP-dependent xylose dehydrogenase activity has been reported in C . crescentus [48], but neither the gene encoding this activitynor the catabolic pathway in which it might participate is known.We identify here genes whose expression increases in the presenceof xylose . These data suggest that C . crescentus uses xyloseas an indicator of the availability of plant-derived polymers. We also identify a DNA sequence motif associated with many xylose-induciblegenes and show that the motif is important for regulation. The abundance of putative secreted peptidases in the C . crescentus genome suggests that extracellular degradation of proteins may also be important in natural habitats [45] . Environmental aminoacids support rapid growth of C . crescentus, leading us to comparegene expression in a complex amino acid-containing medium [PYE]to gene expression in defined minimal media . As expected, severalamino acid degradation pathways are upregulated during growth in PYE, in contrast to induction of amino acid biosynthetic pathways [along with ammonia and sulfate assimilation systems]in minimal media . Interestingly, however, expression of manyother genes encoding enzymes of key biosynthetic pathways doesnot change significantly between PYE and minimal media. As a gram-negative bacterium, C . crescentus must acquire nutrients via transport across both outer and inner membranes, and high-affinity nutrient uptake is presumably necessary to support an oligotrophic lifestyle . The C . crescentus genome seems to lack homologs of the OmpF-type outer membrane porins characteristic of many gram-negative species [45] but is remarkably rich in TonB-dependent receptors,with at least 65 gene products containing the TonB box . As analternative transport system, TonB-dependent receptors are thought to harness energy from the inner membrane proton motive force to the transport of macromolecular complexes, including vitaminB12 and siderophores, across the outer membrane [28, 41] . Withrespect to transport across the inner membrane, over 80 ABC[ATP binding cassette] transporter components [61] are encodedin the Caulobacter genome . We show here that numerous TonB-dependentreceptors and some ABC transporters are differentially expressedin each medium . Whether the deployment of such transportersis a key factor for Caulobacter and other oligotrophs to efficientlyacquire nutrients in diverse nutrient-limited environments clearlywarrants further investigation.
RNA isolation. For the microarrays, cells were harvested from a total of eightPYE cultures, eight M2G cultures, and five M2X cultures . Thecultures were in exponential growth at the time of harvest,with mean densities as follows: PYE, OD600 = 0.37 ± 0.12;M2G, OD600 = 0.34 ± 0.13; and M2X, OD600 = 0.27 ±0.05 . [The cultures were not synchronized, and thus containedcells from all stages of the cell cycle.] Cells were isolatedby centrifugation in a Beckman J2-21 M centrifuge at 7,740 xg at room temperature for 2 min . [In control experiments, approximately85% of the cells were recovered in the pellet, and all stagesof the cell cycle seemed to be recovered with similar efficiencybased on phase contrast microscopy.] The complete RNA extraction and purification protocol is described in the supplementary material [http://www.stanford.edu/group/caulobacter/metabolism] and described briefly here . Each cell pellet was resuspendedin Trizol [Invitrogen] and incubated at 65°C for 10 min.Chloroform was added, and the samples were centrifuged in amicrocentrifuge at 16,000 x g at 4°C for 15 min . The aqueouslayer was transferred to a new tube, mixed with isopropanol, and frozen overnight at -80°C . The next day, RNA was pelleted by centrifugation at 16,000 x g at 4°C for 30 min . Supernatantwas poured off, and the pellets were washed in 70% ethanol andresuspended in diethyl pyrocarbonate-treated water . ContaminatingDNA was digested with DNase I [Ambion] . RNA was extracted withacid phenol-chloroform, precipitated with ethanol and sodiumacetate, and then resuspended in RNase-free water . RNA qualitywas visualized on agarose gels, and RNA concentration was determinedspectrophotometrically. PCR microarrays. PCR products were spotted onto poly-L-lysine-coated glass microarrayslides with an OmniGrid printer [GeneMachines] with 32 Telechempins . The slides were postprocessed as described by Laub etal . [32] . Primer sequences are available in the supplementarymaterial and correspond largely to those used in Laub et al.[31] . Slides from three printings were used. Labeled indodicarbocyanine-dCTP [Cy3] and indocarbocyanine-dCTP [Cy5] cDNA probes were generated from 20 µg of total RNAby reverse transcription as described by Laub et al . [32], except that the amount of random hexamers was increased from 500 ngto 1 µg . Samples were hybridized competitively under coverslipsto the microarray slides overnight at 65°C, then washedas described by Laub et al . [32] . Hybridized arrays were scannedwith a GenePix 4000B scanner [Axon Instruments], and scannedspots were converted to ratios [red/green] with GenePix Pro4.0 software . Ratios and statistics on each spot were storedin an SQL Server database . The signal for each channel [redand green] was taken to be the mean of the foreground signalminus the median of the local background signal . Some spotswere excluded by the criteria described in the online supplementarymaterial . Array-wide normalization procedures are also describedin the supplementary material. Identifying genes differentially expressed between pairs of media. Thirty-four microarrays were used to assay relative RNA abundance. Eighteen compared M2G and PYE, nine compared PYE and M2X, andseven compared M2X and M2G . To reduce dye bias, samples fromeach medium were labeled with Cy3 about half the time and withCy5 about half of the time . We used least-squares techniques[59] to simultaneously estimate log2[M2G/M2X], log2[M2X/PYE], and log2[M2G/PYE] expression ratios for each gene . We used a correlated error model to account for dependencies introduced by hybridizing some RNA samples on multiple arrays . The varianceof each estimate was modeled by assuming a normal error distribution. For each gene, Student's t test was used to test the hypothesis that each log ratio, i.e., log2[M2G/M2X], log2[M2G/PYE], andlog2[M2X/PYE], was zero . A two-sided P value, which accountedfor the different number of samples available for each gene,was assigned to each ratio . For most genes, more arrays comparingM2G and PYE were done than were done for any other combinationof media . Thus, the confidence regions for log2[M2G/PYE] tendedto be smaller than those for log2[M2G/M2X] or for log2[M2X/PYE].To roughly equalize the number of false-negatives, significancelevels were set so that, under the noise assumptions, 10 false-positiveswere expected for each of the log2[M2G/M2X] and the log2[M2X/PYE]comparisons and five false-positives were expected for the log2[M2G/PYE] comparisons . To minimize the effect of possible biases, to be considered significant, gene expression also needed to changeat least 1.4-fold . Additionally, average changes of at leastthreefold were considered significant . Applying the above proceduresto a randomized version of the data set yielded 18 false-positives[all comparisons combined] . For more detail on the analysisprocedure, see the supplementary material. To aid in identifying patterns of coregulation, differentially expressed genes were sorted into several groups . For example,genes were assigned to the "up-in- PYE" set [equivalent to the"down-in-M2" set] if the increase in expression between PYEand both M2G and M2X was statistically significant or if theincrease in expression between PYE and at least one of the M2media was statistically significant, while the difference inexpression between PYE and the other M2 medium was at least1.75-fold . The down-in-PYE/up-in-M2, up-in- M2G, down-in-M2G,up-in-M2X, and down-in-M2X sets were constructed analogously. Metabolic pathway analysis and identification of differentially regulated potential isozymes. Functional analysis of microarray data was aided by CauloCyc[http://biocyc.org], a computationally derived pathway/genomedatabase of C . crescentus . A pathway/genome database describesthe genome of an organism, the product of each gene, the biochemicalreaction[s], if any, catalyzed by each gene product, the substratesof each reaction, and the organization of those reactions intopathways . CauloCyc was computationally derived from the annotatedgenome of C . crescentus by the PathoLogic software, part ofthe Pathway Tools software suite [26, 54]. Each metabolic pathway in CauloCyc consists of a set of enzymatic functions [biochemical reactions] and the gene[s] that codefor each function . When CauloCyc contained multiple genes fora given reaction, microarray data were used to construct 95%confidence intervals for each gene for each pairwise comparisonwith the noise model described above . This allowed pairs ofgenes whose products are annotated as catalyzing the same reaction,but which for one or more pairwise comparisons averaged at leasta twofold difference in expression and had disjoint confidenceintervals to be identified . Typically, we required at leasttwo comparisons to yield distinct confidence intervals . Forexample, in a case where one gene in the pair was up in PYEand the other was unchanged, the confidence intervals for bothPYE/M2G and PYE/M2X would need to be distinct . Also, if thePYE/M2G and PYE/M2X ratios for the first gene were 0.9 and 1.2,respectively, then the average PYE/M2G and PYE/M2X ratios forthe second gene would need to be at least 1.8 and 2.4, respectively. Gene annotations and categorization. Most C . crescentus open reading frame [ORF] annotations weretaken from GenBank accession number AE005673 [45] . For genesannotated in GenBank as hypothetical or conserved hypotheticalproteins, clusters of orthologous gene [COG] descriptions [57, 58] were used if they were more detailed . Gene names shown in the tables came from TIGR and generally refer to the homologous Escherichia coli gene [see www.tigr.org for more information].Annotations for CC2086 and CC1000 were obtained on the strengthof BLAST matches to the NCBI NR database [2] . BLAST was alsoused to identify homology between C . crescentus gene productsand the gene products of other organisms . Classifications weretaken from COG classifications when available; genes withoutCOG classifications were categorized based on their GenBankannotations. In order to create disjoint groups, genes assigned to multipleCOG categories were placed into a single category [see supplementary materials] . To facilitate comparisons, gene products involvedin transport [based on either their GenBank or their COG annotation] were grouped together . Some genes coding for amino acid metabolism proteins were further classified as biosynthetic or degradativebased on the metabolic pathways in CauloCyc . Nontransport genesresponsible for assimilating sulfate into cysteine were classifiedin the amino acid metabolism and biosynthesis groups . Uncharacterizedgenes annotated simply as hypothetical proteins which did nothave a COG number were placed in the hypothetical category;other uncharacterized genes were placed in the conserved hypothetical category. Mapping of hex mutants. C . crescentus mutants
unable to use glucose as their sole carbonsource [hex
phenotype] were isolated by Ely and colleagues [13,
25] . Mutations in eight such strains were mapped by
Construction of mutant strains. A nonpolar deletion removing nearly half of the coding regionof CC2054, encoding glucokinase, was generated through a PCR-basedstrategy . Flanking primers were designed that annealed approximately300 bp upstream and downstream of the CC2054 coding region.Complementary chimeric internal primers were designed that fusedsequences from the 5' and 3' portions of the coding region,deleting codons 114 through 250 [out of 332 total] but maintaininga continuous reading frame . [The sequences of all primers usedin this work are available on request.] Each of the chimeric internal primers was used in individual PCRs with the appropriate flanking primer . The products were isolated and combined ina subsequent PCR, with only the flanking primers for amplification. Fusion of the initial PCR products in the second PCR was mediated by complementarity of the terminal chimeric regions, resultingin a single final product with much of the CC2054 coding regionremoved . This construct was cloned into plasmid pCR2.1 withthe Topo Cloning System [Invitrogen], then subcloned into thekanamycin-resistant pNPTS138 vector for integration into theC . crescentus genome and sacB-mediated sucrose counterselection[60] . Sucrose-resistant, kanamycin-sensitive isolates lackingthe integrated plasmid were screened by colony PCR to determinewhether the wild-type or mutant gene remained on the chromosome.Deletion mutants were streaked on M2G, M2X, and PYE to determinewhether CC2054 is required for utilization of glucose as thesole carbon source. lacZ reporter fusions to confirm microarray data. The promoter regions of several genes were fused to a lacZ reporter to provide an alternative method of assessing regulation of expression of these promoters during growth on various media.The upstream regions of the CC0505, CC0823, and CC2057 geneswere amplified by PCR, generally including approximately 300bases upstream of the start codon for the indicated genes andup to several hundred bases downstream into the coding regions.PCR products were cloned into plasmid pCR2.1 with the Topo CloningSystem [Invitrogen], then subcloned as EcoRI fragments intothe pRKlac290 vector to generate a transcriptional fusion tothe lacZ gene [56] . Orientation in the pRKlac290 vector wasverified by PCR with an insert-specific primer and a primercomplementary to the coding sequence of the lacZ gene . pRK290constructs containing the various promoters were delivered byconjugation from E . coli S17.1 into C . crescentus, with selectionon PYE containing tetracycline and nalidixic acid . ß-Galactosidaseactivity was assayed in log-phase cells as described previously[56]. Motif identification. The MEME software [4] was used to identify matrix models describingDNA sequence motifs present upstream of genes whose expressionis elevated in M2X or [separately] in M2G . The 475 bases upstreamand the 125 bases downstream [600 bases total] of the startof each predicted C . crescentus open reading frame were searched.Bases interior to the predicted coding region were includedin the search set to provide robustness against possibly misannotatedstart codons . Each possible motif location was scored with themotif matrix as well as with a third-order Markov model of C.crescentus intergenic regions [35] . A location's score was thelog of the probability that the motif was present minus thelog of the probability that the sequence came from the background.Both strands of each sequence were searched . The cutoff scorefor the presence of the motif was taken as the score of thelowest scoring sequence used in generating the motif, exceptfor glucose motif 2, where the second lowest score was used. In Table 4, when overlapping instances of the xylose motif appearedon both DNA strands, only the highest scoring instance is shown.Both as a control and to estimate the number of false-positives,an equal number of 600-bp sequences were generated from thebackground distribution model and scored . One random sequencescored above the cutoff for the xylose motif, three were abovethe cutoff for glucose motif 1, and 22 were above the cutoff for glucose motif 2 . Matrices describing the motifs are available in the supplementary material.
To test the functionality of the motif associated with xylose-regulated genes, the sequences upstream of the CC0823 and CC0505 genes were altered by site-directed mutagenesis [60] . The templates for mutagenesis were the same promoter-containing fragments used to assay regulation of the wild-type promoters . The mutated promoter regions, with identical flanking sequences, were similarly subcloned into the pRKlac290 lacZ transcriptional fusion vectors and introduced into C . crescentus for assay.
Only glucose, xylose, and glutamate produced colonies of atleast 1 mm in diameter within 72 h, although colony growth wasonly slightly slower on sucrose . Trehalose generated 1-mm colonieswithin 5 days, while galactose, mannose, mannitol, arabinose,and ribose supported growth of 1-mm colonies within 6 to 10days at 30°C . Growth on M2 medium containing 0.2% fructose,acetate, glycerol, gluconate, glucuronic acid, or glucosaminewas barely detectable after 10 days at 30°C . No significantdifferences were observed between CB15 and CB15N during growthon these carbon sources. To more quantitatively assess growth rates, CB15N was grownin liquid batch culture on PYE [an amino acid-rich medium typicallyused for laboratory culture of C . crescentus], M2 with glucose [M2G], and M2 with xylose [M2X] . During the exponential growthphase of cultures incubated aerobically at 30°C, CB15N culturesexhibited a doubling time of 89 min on PYE, 114 min on M2G,and 127 min on M2X. Transcriptional profiling of C . crescentus grown on PYE and minimal media. To assess gene expression in cultures grown in PYE, M2G, andM2X, RNA was prepared from log-phase C . crescentus CB15N cultures.RNA samples were fluorescently labeled and competitively hybridizedon spotted arrays displaying PCR product probes representing3,528 predicted C . crescentus genes [93% of the 3,761 open readingframes predicted in the C . crescentus genome] [31, 32] . Analysisof the data produced M2X/M2G expression ratios for 3,074 genes,PYE/M2G ratios for 3,179 genes, and M2X/PYE ratios for 3,089genes . [Due to space constraints, only selected data are presentedand discussed here . The entire data set, as well as all tablesnumbered with an S prefix, are presented in the supplementarymaterials, which are available online at http://www.stanford.edu/group/caulobacter/metabolism.Additional information on probes, data collection, data processing,and statistical analysis can be found at the same site.] Expressionof 390 genes was found to vary significantly and reproducibly.Genes that were found to be differentially expressed betweeneach pair of media are shown by functional categories in Fig.1.
Degradation pathways induced in PYE. PYE supplies amino acids that can be employed in protein synthesis,used to fuel energy production, or transformed into other necessarymetabolites . The expression of a number of genes encoding componentsof amino acid degradation pathways was elevated in PYE [Fig.1D and 2A] . Particularly notable are components of pathways involved in degrading arginine, histidine, alanine, glycine, glutamate, and proline, which are abundant in peptone and inyeast extract, the main components of PYE [BD Diag-notic Systems, http://www.bd.com/diagnostics/microservices/regulatory/typicalanalysis/index.asp] [Table 1] . Arginine catabolism appears to occur via the argininesuccinyltransferase pathway, with enzymes from two potential transcription units [CC0581-0584 and CC1606-1608], yieldingCO2, NH3, NADH, glutamate, and succinate. Histidine degradation in C . crescentus [15] has
been shown toproceed through the initial steps of the classical Hut
[histidineutilization] pathway defined in Aerobacter [Klebsiella]
aerogenes[36] . That pathway yields
glutamate and formamide, but C . crescentusproduces formate
rather than formamide, suggesting that theterminal steps of the
pathway resemble that of Pseudomonas putida[34] .
The known C . crescentus Hut enzymes are products of asingle
PYE-induced locus [CC0957-0960] . CC0958 is annotatedin GenBank as
formiminoglutamase, which catalyzes the terminalstep of the K .
aerogenes Hut pathway, yielding formamide . Theannotation may be
erroneous in that the CC0958 polypeptide sequencealigns closely
[BLAST E value, 3 x 10-48] with
the P . putidaN-formylglutamate amidohydrolase [GenBank
accession number
AAB86969],which catalyzes the terminal step of the pathway
version yieldingformate . Conservation of this gene product in other
The induced pathways suggest mechanisms by which C . crescentus generates energy from amino acid degradation during growth in PYE . For example, induction of alanine dehydrogenase [CC3574]allows the conversion of alanine to pyruvate and generates NADH.The pyruvate is presumably fed into the tricarboxylic acid cycle,while NADH feeds respiratory electron transport . Similarly,a bifunctional proline dehyrogenase/pyrroline-carboxylate dehydrogenase[CC0804, homologous to E . coli putA] is also induced, whichgenerates FADH2 and NADH while producing glutamate from proline. Glutamate can serve as an excellent source of carbon and nitrogen
for C . crescentus [13] . Expression of CC0088, which
encodesa member of a recently identified class of glutamate
dehydrogenases[40], increased threefold during
growth in PYE relative to M2media . Glutamate dehydrogenase carries
out oxidative deaminationof glutamate, yielding
The yeast extract in PYE may contain significant levels of fatty acids, which C . crescentus utilizes as an energy source, based on the observation that growth on PYE induced expression ofseveral genes encoding enzymes in the ß-oxidationpathway for fatty acids . These include acyl-coenzyme A dehydrogenase[CC1350: PYE/M2G = 3.55, PYE/M2X = 3.55], enoyl-coenzyme A hydratase/isomerase[CC1352: PYE/M2G = 2.33, PYE/M2X = 2.34; CC2169: PYE/M2G = 3.54,PYE/M2X = 3.65], and acetyl/propionyl-coenzyme A carboxylase[CC2168: PYE/M2G = 2.17, PYE/M2X = 2.10] . The fatty acid oxidationgenes induced in PYE are a subset of a large family of similarlyannotated genes in the C . crescentus genome [Table S2] . Whetherthe induced gene products are functionally distinct from theuninduced ones is unknown, as is the mechanism of their differentialregulation. Assimilation and biosynthesis in minimal media. During growth in M2 medium, C . crescentus must synthesize eachamino acid . As expected, this leads to elevated expression ofgenes encoding components of several amino acid biosyntheticpathways, including those for leucine, isoleucine, valine, serine,threonine, cysteine, methionine, glutamine, and glutamate [Fig.2B and Table 1] . Interestingly, several pathways showed no apparentdifference in expression compared to growth on PYE, and in someonly a subset of the genes showed increased expression . For example, among the many enzymes included in the pathways for synthesis of the aromatic amino acids tryptophan, tyrosine,and phenylalanine, only CC2300 [encoding 3-deoxy-D-arabino-heptulosonate-7-phosphate synthase, the first enzyme of the chorismate pathway] and CC2222, encoding chorismate mutase, showed significant induction inM2 . The observed lack of induction of other aromatic amino acid biosynthesis genes in M2 is consistent with previous S1 nuclease mapping data [55], which found that trpF [CC3545], trpB [CC3544],and trpA [CC3543] mRNA levels did not change between cells grownon PYE and M2-sucrose . In contrast, microarray studies foundthat mRNA levels for genes in the E . coli trpEDCBA operon increasedtwo- to fivefold in minimal media compared to the same mediasupplemented with tryptophan [27]. In M2 medium, sulfate supplies sulfur to the cells, and genes
encoding components of the sulfate assimilation pathway arestrongly
induced relative to PYE [Table 1] . These include a
sulfate-specific ABC transport system [encoded by CC1596-1598,whose
products are homologous to the E . coli CysTWA sulfate
transporter], a periplasmic binding protein that probably delivers
sulfate to the ABC transport system [CC0286], and the enzymesthat
incorporate sulfate-derived sulfur into cysteine: sulfateadenylate
transferase/adenylylsulfate kinase [CC1482-1483],
phospho-adenylylsulfate reductase [CC1121], sulfite reductase
[CC1119], and cysteine synthase [CC1426 and CC3625] . Expressionof
CC3063 [cysJ, the
During growth in M2, ammonia serves as the sole nitrogen source. Ely et al . [13] showed that ammonia assimilation in C . crescentusoccurs via the combined action of glutamine synthetase and glutamatesynthase . They observed fivefold induction of glutamate synthaseactivity and fourfold induction of glutamine synthetase activityin M2G versus PYE medium . Similarly, our microarray data showelevated expression in M2 media [compared to PYE] for the genes encoding glutamine synthetase [CC1969: M2G/PYE = 2.78, M2X/PYE = 1.93] and glutamate synthase subunits [CC3606: M2G/PYE = 2.54, M2X/PYE = 3.07; CC3607: M2G/PYE = 4.06, M2X/PYE = 4.75]. Very few genes encoding enzymes involved in nucleotide biosynthesis showed significant changes in expression in M2 versus PYE . Some enzymes involved in the synthesis of other relevant metabolitesand cofactors, however, showed significant induction in M2.De novo purine synthesis and some reactions in amino acid synthesisdepend on formyl-tetrahydrofolate as a donor of one-carbon units.The formyl-tetrahydrofolate synthesis pathway in C . crescentushas not been clearly elucidated, but expression of genes encodingseveral enzymes that could be relevant, including CC2138 [5-methyl-tetrahydrofolate-homocysteine methyltransferase], CC2140[5,10-methylenetetrahydrofolate reductase], and CC3630[formyltetrahydrofolate deformylase],is induced during growth in M2 medium . With respect to one-carbonunit metabolism, expression of S-adenosylmethionine synthetase [CC0050], which produces the key methyl donor S-adenosylmethionine, was also increased in M2. Glucose catabolism. C . crescentus expresses enzymes of the Entner-Doudoroff pathwayduring growth on glucose, and it has been proposed that thisis the primary route of glucose catabolism in this organism[52] . Our microarray results show that the expression of CC1495,which codes for 2-dehydro-3-deoxyphosphogluconate aldolase [commonlyreferred to as KDPG aldolase], which carries out the last stepin the Entner-Doudoroff pathway, was increased in both M2G andM2X compared to PYE but did not vary significantly between M2Gand M2X . The microarray data also show that the genes encodingthe first four enzymes in the Entner-Doudoroff pathway [CC2054to CC2057] are more highly expressed in M2G than in either M2Xor PYE, although there is some elevation of expression in M2Xrelative to PYE. The CC2057 to CC2054 coding regions are closely spaced [3, 7,and 15 bp of separation, respectively] and probably form anoperon . To independently verify the regulation of this operon,the CC2057 promoter region [including 287 bases upstream ofthe predicted start codon] and coding sequence were cloned intoa low-copy-number lacZ fusion vector . Expression of the CC2057promoter fusion in strain CB15 grown in M2G [1,460 ±445 Miller units] and in PYE supplemented with glucose [1,460± 430 units] was higher than in M2X [917 ± 90 units] or PYE [693 ± 185 units] . The glucose-dependentincrease in expression was lower than observed in the microarrayexperiments [Table 2] but qualitatively consistent. C . crescentus hex [hexose] mutant strains that are unable to use glucose as their sole carbon source have been isolated [25]. We examined eight hex mutant strains to further characterize the phenotype . All of the strains grew normally on PYE and M2X agar plates . Most of these strains appeared completely incapableof growth on M2G, the exceptions being two strains that formedcolonies very slowly on M2G [see below] . To identify the basisfor this inability to utilize glucose, the hex mutations weremapped by transductional linkage to kanamycin resistance markers[60] . The specific genes in which the mutations resided werethen identified by complementation, based on restoration ofthe ability to grow on M2G . Three strains [SC333, SC337, andSC414] contained mutations that mapped to CC2055, a homologof the E . coli edd gene encoding phosphogluconate dehydratase,which catalyzes the Entner-Doudoroff pathway reaction in whichphosphogluconate is converted to 2-keto-3-deoxy-6-phosphogluconate.Another mutation [SC144] was mapped to CC1495, a homolog ofthe E . coli eda gene encoding KDPG aldolase, which cleaves 2-keto-3-deoxy-6-phosphogluconate to yield glyceraldehyde-3-phosphate and pyruvate. Four other hex mutant strains were examined, including two [SC114 and SC481] that showed no growth on M2 with glucose or galactose as the carbon source and two [SC155 and SC162] that grew very slowly on glucose or galactose . All four mutations mapped toCC1493 [ppc], which encodes phosphoenolpyruvate carboxylase,an enzyme that catalyzes the addition of CO2 to phosphoenolpyruvateto generate oxaloacetic acid . Expression of CC1493 was elevatedin M2G compared to both M2X and PYE . During growth on minimalmedia such as M2G, tricarboxylic acid cycle intermediates aresiphoned off for use in various biosynthetic pathways . In orderfor the cycle to continue, oxaloacetate levels must be replenished[17] . The difficulty experienced by ppc mutants in growing onM2 with glucose or galactose as the carbon source demonstratesthe critical role of this replenishment process . We suspectthat the mutations in SC155 and SC162 compromise the functionof the phosphoenolpyruvate carboxylase, reducing [but not eliminating]oxaloacetate production. Entry of glucose into the Entner-Doudoroff pathway typically requires the sugar to be phosphorylated, though exceptions insome archaea have been noted [9] . In E . coli, glucose is phosphorylatedas it enters the cell through a phosphoenolpyruvate-dependentphosphotransferase transporter system [50] . Interestingly, thoughthe C . crescentus genome encodes several putative phosphotransferasetransporter system-type transport systems, none were inducedn response to glucose in our data . However, the putative operonencoding several other components of the Entner-Doudoroff pathwayalso includes a glucokinase homolog [CC2054], which could accountfor production of glucose-6-phosphate . Since no mutations withthe hex phenotype were identified in this gene, a knockout ofCC2054 was constructed . The mutant strain showed no detectablegrowth with glucose as the sole carbon source, a phenotype consistentwith its being required for phosphorylation of glucose priorto entry into the Entner-Doudoroff pathway . This suggests thatin C . crescentus, glucose is not transported through or phosphorylatedby a phosphotransferase transporter system-type system . Basedon the microarray data, one candidate for transporting glucoseacross the inner membrane is encoded by CC1103, whose productexhibits significant similarity to the E . coli fucose permease. The MEME software [4] was used to identify conserved DNA motifsassociated with glucose-induced genes . Two palindromic motifs were found, associated with nonoverlapping sets of genes [Tables S3 and S4, supplementary materials] . The first motif appearsin 13 intergenic regions in the C . crescentus genome [TableS3] . Genes with the first motif upstream include CC2057 [thefirst gene of the operon encoding enzymes of the Entner-Doudoroffpathway], four genes encoding transporters [CC0970, CC1103,CC1500, and CC1754], two genes coding for 1,4-ß-D-glucanglucohydrolase homologs [CC0797 and CC2052], and a gene fora putative glucokinase family protein [CC3167] . The second motif,albeit less statistically significant [see above], appears in14 intergenic regions, including probable promoter regions for11 genes showing elevated expression in glucose [Table S4].These include CC1493 [encoding phosphoenolpyruvate carboxylase]and four genes encoding transporters [CC0214, CC1517, CC1518,and CC2928] . Though intriguing, the significance of these twomotifs with respect to expression and regulation of these genesremains to be investigated. Xylose regulon. A substantial fraction of the genes whose
expression increasedin xylose encode enzymes predicted to contribute
to the degradationof extracellular polymers [Table 3] .
This class includes twoxylosidase/arabinosidases [CC2802, CC3054],
an
With SignalP [43, 44], four additional nontransport, xylose-inducedgenes encoding proteins with probable signal peptides for exportwere identified . Based on homology, probable functions couldbe identified for only one of these [CC0979, peptidyl-prolylcis-trans isomerase A] . Another one [CC0925] is annotated asan OmpA-related protein, suggesting that it may be targetedto the outer membrane . Also targeted to the outer membrane area set of nine TonB-dependent receptors whose expression is inducedby xylose . As noted previously, these proteins could support transport across the outer membrane . Xylose also induced expression of a gene [CC0814] encoding a member of the major facilitator superfamily of inner membrane proteins . The CC0814 product is strikingly similar to the E . coli XylE xylose/H+ symporter [BLASTPE value, 2 x 10-70], suggesting a role in xylose import intothe cytoplasm . Given that E . coli, as an example, utilizes atleast one other xylose transport system in addition to XylE,it is possible that other inner membrane transporters aid inacquiring xylose in C . crescentus as well. The CC0823 through CC0819 genes probably form an operon, basedon
the close spacing of the coding regions and the coordinatedinduction
by xylose . The extent of induction observed for thesegenes ranged
from a low of 2.8-fold [M2X/M2G] for CC0820 toa high of 11.6-fold
for CC0822 . [No expression data were obtainedfor CC0821.] A
chromosomal Tn5-lacZ insertion previously isolatedin
the CC0823 coding region showed strong induction of ß-galactosidase
activity by xylose [ Regions upstream of the genes in the C . crescentus xylose regulon
were scanned with MEME [4] to identify conserved
sequences potentiallyrepresenting transcription factor binding
sites . A conserved20-bp palindromic motif strongly associated with
xylose-regulatedgenes was identified [Table 4] .
The motif appeared 25 timesin intergenic regions in the C .
crescentus genome . Fifteen ofthese sites were adjacent to at
least one gene whose expressionis significantly increased in xylose
[Table 4], including CC0823,where it overlapped
the predicted -10 region of the promoter[Fig . 3] .
Transcription factors controlling gene expressionin response to
xylose have been identified in several bacterialspecies, and binding
sites for such factors are sometimes conservedbetween species [38] .
However, the motif identified here differssubstantially from the
consensus binding site of the XylR repressorfound in the
Bacillus/Clostridium group [53], and it does not
resemble the recognition sequence for the XylR activator of
some
To experimentally examine the role of this motif in xylose-dependent regulation, the motifs associated with CC0823 and CC0505 were mutationally altered [Fig . 3] . CC0505 encodes a signal peptide-containingprotein, possibly secreted, whose function is unknown . CC0505is not necessary for xylose utilization [data not shown], butits expression showed substantial xylose induction in microarrayexperiments [M2X/M2G = 5.8, M2X/PYE = 6.2] . Wild-type and mutantpromoters were fused to the lacZ reporter gene on a low-copy-numberplasmid, and ß-galactosidase expression was assayed in the same media used in the microarray experiments as wellas PYE supplemented with xylose [Fig . 3] . There were clear differencesbetween the two promoters—the CC0823 promoter yielded notably higher expression under all conditions—but the consequences of altering the conserved motif were similar . Inboth cases, the mutant promoters showed strong constitutiveexpression in media lacking xylose . These data support a modelin which the conserved motif functions as a binding site fora repressor protein that obstructs these promoters in the absenceof xylose and releases in the presence of xylose, allowing accessto RNA polymerase . Definition of this regulatory motif is particularlysignificant in that the CC0823 promoter is widely used as aheterologous promoter directing inducible gene expression inC . crescentus [for example, see references 11 and 39]; definingthe target site for regulation should aid in identifying the regulator itself and in manipulating the system to improve its utility. Possible regulatory proteins. A number of putative transcription factor or proteins involvedin signal transduction pathways were differentially expressedin the media examined here . Regulators induced in M2 media relativeto PYE included a homolog of ArsR [CC2141: M2G/PYE = 2.23, M2X/PYE= 2.15], a response regulator [CC3477: M2G/PYE = 2.81, M2X/PYE= 2.40], a histidine kinase [CC0285: M2G/PYE = 1.88, M2X/PYE= 1.82] a hybrid histidine kinase/response regulator [CC0026:M2G/PYE = 2.09, M2X/PYE = 1.99], a LuxR-related protein [CC0782:M2G/PYE = 2.53, M2X/PYE = 3.08], and extracytoplasmic functionsigma factors CC2883 [M2G/PYE = 2.38, M2X/PYE = 2.07] and CC3475[M2G/PYE = 2.03, M2X/PYE = 1.89] . Two putative transcriptionfactors, both members of the LacI family [CC2053 and CC2355],appeared to be specifically induced by glucose [expression significantlygreater on M2G than M2X or PYE, Table 2] . Expression of a responseregulator [CC0247] and a sensor histidine kinase [CC1294] wasinduced during growth on xylose [Table 3] . A few potential transcriptional regulators showed relatively minor [less than twofold] but statistically significant increases in expression in PYE relative to the M2 media, including a homolog of the PupR anti-sigma factor [CC0982],a response regulator [CC1610], a putative cyclic AMP bindingprotein [CC2171], and an AsnC family member [CC3573] [see supplementary materials]. Cell cycle regulation of metabolic genes. The expression of several C . crescentus genes associated withroutine metabolic functions has been shown to be cell cycleregulated during growth in M2G medium [32] . A subset of thosecell cycle-regulated genes were differentially regulated inthe growth media tested here [supplementary material] . Twelvecell cycle-regulated genes were up and one was down in xylose,two were down and six were up in glucose, and 37 were down and21 were up in PYE . The genes regulated by both the growth mediaand the cell cycle did not seem to have a common time of peakexpression during the cell cycle . Cell cycle-regulated genescould appear to be medium regulated in microarray experimentswith mixed cultures if the cultures grown in various media haddifferent proportions of swarmer and stalk cells, but we sawno obvious evidence for that to be the case here. The faster growth rate of C . crescentus in PYE compared to the M2 media likely explains why some cell cycle-regulated genes, including dnaA [CC0008] and recA [CC1087], appear to be increased in PYE . Some of the cell cycle-regulated genes whose expression was elevated in PYE, including phnC, phnD, and phnE [CC0361 to CC0363], are involved in transport, while others, such as gcvT [CC3335] and gcvH [CC3334], are involved in degradation. Conversely, many of the cell cycle-regulated genes whose expression was down in PYE are involved in biosynthesis . For promoters regulated by both the cell cycle and nutrient composition, itremains to be determined whether the same transcription factorsare used in both forms of regulation.
The Entner-Doudoroff pathway has been shown to be active inC .
crescentus during growth on glucose [48,
52] . In the datapresented here, all of the genes encoding
enzymes of the Entner-Doudoroffpathway were found to be
significantly induced during growthon glucose-containing media
compared to PYE . Mutations in threegenes [CC1495, CC2054, and
CC2055] encoding enzymes of the Entner-Doudoroffpathway rendered
C . crescentus incapable of growth with glucoseas the sole carbon
source . The Entner-Doudoroff pathway is thusessential for glucose
utilization in C . crescentus, a situationpreviously seen only
in Pseudomonas aeruginosa [5] and Burkholderia
cepacia [1] . In contrast, in several free-living
The apparent absence of the enzyme phosphofructokinase in C. crescentus would preclude use of the upper branch of the Embden-Meyerhof-Parnaspathway . Thus, instead of generating glyceraldehyde-3-phosphateand dihydroxyacetone phosphate, glucose is processed into glyceraldehyde-3-phosphateand pyruvate . There appears to be little transcriptional regulationof the genes encoding enzymes for the latter steps of glycolysis[taking glyceraldehyde-3-phosphate to pyruvate], as mRNA levelsshowed minimal differences between media . In contrast, in E.coli, expression of all genes in the lower branch of the Embden-Meyerhof-Parnaspathway except pykA and pykF [pyruvate kinases] increases duringfermentation on glucose compared to fermentation on xylose [16]. The E . coli gene expression data agree with metabolic flux estimates in that all steps in the lower part of the glycolytic pathway are expected to have higher flux during fermentation of glucose compared to fermentation of xylose except for the conversionof phosphoenolpyruvate to pyruvate [16] . Whether similar expressionof glycolytic genes during growth of C . crescentus on xyloseand glucose likewise reflects comparable carbon flux throughthe lower part of the glycolytic pathway during growth on glucoseand xylose is uncertain, given that the route of xylose catabolismis unknown. There are many genes whose products have the same predicted enzymatic function in the current C . crescentus genome annotation but which exhibited distinct expression patterns in the microarray data [Table S2] . Some differentially regulated isozymes may participate in distinct pathways that are differentially induced depending on the medium . For example, in an oligotrophic Spirillum sp., the relative activities of three forms of lactate dehydrogenase vary as a function of the lactate concentration in the growth medium [37] . The exact roles of many of the C . crescentus isozymesremain to be determined . In such cases, differential regulationmay provide clues to distinct functions. One example is particularly relevant to glucose catabolism.Two genes [CC0784 and CC1495] are annotated as encoding KDPGaldolase, which catalyzes the final step in the Entner-Doudoroffpathway . Expression of CC1495 is elevated in M2 minimal mediawith either xylose or glucose as the carbon source comparedto PYE, whereas expression of CC0784 is unchanged . Subsequentmutational results demonstrated that CC1495 is essential forgrowth on glucose, implying that CC0784 [assuming that it isexpressed] is not capable of functionally substituting for CC1495in glucose catabolism . On the other hand, the CC1495 mutantstrains could still form colonies with galactose as the carbonsource . The CC0784 polypeptide sequence resembles gene productsannotated as 2-dehydro-3-deoxyphosphogalactonate aldolases fromseveral organisms, including Ralstonia solanacearum, Bradyrhizobiumjaponicum, and Brucella species; this resemblance is considerably stronger than that shown by the CC1495 polypeptide . It has been reported previously that C . crescentus may use some versionof the Entner-Doudoroff pathway for galactose catabolism [29]. Our results suggest that the CC0784 product, rather than participating in the glucose Entner-Doudoroff pathway, may be involved ina version of the Entner-Doudoroff pathway dedicated to galactose. The pathway of xylose degradation in C . crescentus may be distinct from known pathways in other microbes, as genes encoding key enzymes for those pathways [particularly xylose isomerase and xylulose kinase] are not apparent in the C . crescentus genome. An inducible xylose dehydrogenase activity has been demonstratedin C . crescentus, which may serve as the initial step in xylose catabolism [48] . The only known C . crescentus mutant strainthat is unable to grow on xylose resulted from a Tn5-lacZ insertionin the xylX [CC0823] gene [39] . This insertion showed stronglyxylose-inducible ß-galactosidase activity . Microarraydata presented here confirm xylose induction of CC0823, whichturns out to be the first gene in a potential five-gene operon[CC0823 to CC0819], the largest transcription unit whose expressionresponds to xylose. The CC0823 product does not resemble any gene of known function, but it is homologous to genes in other genomes, including Mesorhizobium loti, Streptomyces coelicolor, Agrobacterium tumefaciens, Pseudomonasputida, Bradyrhizobium japonicum, and Sinorhizobium meliloti.The xyl mutant phenotype of the Tn5 insertion could reflecta critical role for the CC0823 product in xylose metabolism[39]; alternatively, the xyl mutant phenotype could result frompolar effects of the insertion on expression of critical downstreamgenes in the operon . Two putative dehydrogenases [CC0822 andCC0821] of unknown substrate specificity are encoded in thisoperon, either of which could potentially be responsible forinducible xylose dehydrogenase activity. The ability to replenish intermediates of the tricarboxylicacid cycle can be critical for growth of microbes on a singlecarbon source [17] . Under such conditions, tricarboxylic acid cycle intermediates are continually siphoned off for various biosynthetic processes, compromising continued flux throughthe pathway . Expression data and genetic analysis suggest thatcarbon flux through the tricarboxylic acid cycle is manageddifferently during growth on glucose and xylose . The expressionof phosphoenolpyruvate carboxylase is induced during growthon glucose but not xylose, and genetic evidence is presentedabove that supplementation of the tricarboxylic acid cycle withoxalacetate via phosphoenolpyruvate carboxylase is criticalfor growth on glucose but not xylose. In contrast, expression of isocitrate lyase [CC1764] is specifically elevated during growth on xylose . Isocitrate lyase catalyzes the conversion of isocitrate to glyoxylate and succinate, initiating the glyoxylate bypass . The succinate generated by this processcan be drawn off for other biosynthetic pathways . Glyoxylate,meanwhile, can be combined with acetyl-coenzyme A by malatesynthase to produce malate, allowing a modified tricarboxylicacid cycle to continue . Interestingly, expression of malatesynthase, encoded by the gene [CC1765] adjacent to that encodingisocitrate lyase, was not observed to increase coordinatelyin response to xylose in the microarrays . Future studies arewarranted to examine the role of the glyoxylate bypass duringgrowth on xylose and other carbon sources. The ability to simultaneously import a variety of growth substrates
could be critical for survival in oligotrophic environments[21] .
As such, it seems unlikely that all of the putative transportersand
exoenzymes whose expression is induced by xylose are utilizedsolely,
or are even necessary, for the import and utilizationof this sugar .
The CC0505 gene, for example, is strongly inducedby xylose, but a
CC0505 knockout strain shows no defects ingrowth on xylose [data not
shown] . We hypothesize that C . crescentususes xylose as an
indicator that metabolites derived from thebreakdown of plant cell
walls are available . Other microorganisms,particularly soil microbes
such as Streptomyces species, areknown to use small molecules
resulting from extracellular polymerbreakdown as a positive feedback
signal to induce degradativeenzymes [22] .
Similarly, growth of Thermotoga maritima on xyloseinduces
genes for two xylanases and an
Since xylose in natural habitats would presumably be acquiredby breakdown of xylan, xylose may be used to trigger the expressionof extracellular hydrolytic enzymes relevant to breakdown of lignicellulose [of which xylan is a major constituent] as wellas the enzymes needed for the catabolism of xylose itself, asin the fungus Aspergillus niger [18] . Environments rich in xylan should contain cellulose as well; interestingly, however, xylose does not induce expression of endoglucanases or ß-glucosidases that could [at least theoretically] be used by C . crescentus for cellulose degradation. The abundance of TonB-dependent receptors induced by xylosecould provide cells with the ability to import products of extracellular lignocellulose degradation, but it remains to be determinedwhat metabolites each transporter is capable of bringing intothe periplasm . In an effort to further characterize the transporters induced by xylose, we used BLASTP to compare xylose-inducedC . crescentus transporters to the NCBI database . Somewhat surprisingly, many of the strongest homologies were to proteins encoded by Xanthomonas campestris, a plant pathogen that secretes exoenzymes to attack plant cell walls . The virulence of at least one Xanthomonas species [X . oryzae, which causes rice blast disease] is dependent on the ability to secrete xylanase [51] . The X . campestris genomecontains bidirectional best hits to four of the xylose-inducedTonB-dependent receptors [CC0442, CC0999, CC2832, and CC3336]as well as the putative inner membrane xylose transporter [CC0814]and the SapC peptide permease homolog [CC1000]. Although Caulobacter and Xanthomonas are not close phylogenetically,their genomic contents bear intriguing similarities . In additionto having sets of putative exoenzymes similar to C . crescentus,the sequenced genomes of the Xanthomonas species X . campestrisand X . citri also display an abundance [nearly 50] of TonB-dependentreceptors [10] . There are also hints that regulation by xylosemay be similar in X . campestris and C . crescentus . The X . campestris genome has several close matches to the DNA sequence motif associated with xylose regulation in C . crescentus [Table 4], with threeof the four strongest instances of this motif in the X . campestrisgenome located upstream of putative transcription units containingxylose-associated genes [xylose isomerase, xylosidase, and xylanase][Table S5] . Thus, although C . crescentus has not been reportedto act as a plant pathogen, it may have mechanisms for acquiringplant polymer-derived metabolites similar to those found inXanthomonas species. Environmentally responsive signal transduction pathways in C. crescentus have not been extensively explored . The collectionof large microarray data sets, as presented here, provides someinitial insight into regulatory responses to nutrient levelsin this organism . A set of genes whose expression responds tothe availability of xylose have been identified, along witha cis-acting site that appears to represent the target for axylose-responsive repressor . Two potential regulatory siteshave also been identified for glucose-responsive genes; theseneed to be tested, and the trans-acting factors controllingxylose and glucose regulation will be pursued in future work.Coregulation of many genes and potential operons encoding aminoacid synthesis or degradation functions has been observed. Future work will continue to characterize the traits that allow C . crescentus to be a successful oligotroph by exploring this organism's metabolism in more dilute media, in mixtures of substrates, and in continuous cultures . Finally, the stage is set for exploring the interface between environmental and cell cycle regulation. What environmental cues influence cell cycle progression? Arethere cell cycle checkpoints that determine if a cell has sufficient resources to proceed with division? How are individual promoters influenced by both the environmental and cell cycle state? Asjust one example, expression of the glycine cleavage system[gcv] genes [CC3355 to CC3352] is strongly induced in PYE mediumversus M2, presumably due to relatively high levels of exogenousglycine . During the cell cycle, expression of the gcv genespeaks in swarmer cells and then is repressed dramatically inpredivisional cells [32] . Is the same transcription factor usedin both forms of regulation? What controls cell cycle regulationof this promoter, and why? Further application of molecularand classical genetic approaches with microarray analysis canprovide insight into such questions.
We thank Bert Ely [University of South Carolina], who kindly provided us with several hex mutant strains for analysis, and Lisandra West, for support in generating the microarray dataset.
|