|
|
|
Journal of Bacteriology, July 2004, p . 4369-4375, Vol . 186, No . 13
Displacements of Prohead Protease Genes in the Late Operons of
Double-Stranded-DNA Bacteriophages
|
| ABSTRACT |
|---|
Most of the known prohead maturation proteases in double-stranded-DNA
bacteriophages are shown, by computational methods, to fall into two
evolutionarily independent clans of serine proteases, herpesvirus
assemblin-like and ClpP-like . Phylogenetic analysis suggests that
these two types of phage prohead protease genes displaced each other
multiple times while preserving their exact location within the late
operons of the phage genomes .
| TEXT |
|---|
Capsid maturation in double-stranded-DNA (dsDNA) bacteriophages
requires proteolytic cleavage by a prohead protease . The MEROPS
database (27;
http://merops.sanger.ac.uk) currently places phage prohead
proteases into four families, two of which exclusively consist of
phage and prophage prohead proteases with uncharacterized catalytic
mechanisms . They are family U9 (prohead proteases from T4-like
Myoviridae) and family U35 (prohead proteases from HK97 and
related phages of the Siphoviridae family) . The other two,
serine protease families S14 and S49, are typified by ClpP protease
and Escherichia coli peptidase IV, respectively, and contain
phage prohead proteases such as gpC protein of phage lambda and
prohead protease of Pseudomonas aeruginosa phage D3 (both
phages are members of the Siphoviridae) .
Prohead protease is an essential factor in phage capsid morphogenesis, as first demonstrated for phage T4 (2) . The maturational cleavage of the capsid protein precursors confers stability to the nucleic acid-free prohead and triggers a conformational switch of prohead, which allows phage genomic DNA to enter and initiate the DNA packaging process . A similar capsid maturation mechanism has been reported for various dsDNA phages from the Siphoviridae family, such as HK97 (8), PVL (7), and D3 (12), suggesting that the proteolytic cleavage of structural proteins during phage capsid assembly is common in many dsDNA phages and that the prohead protease is the key enzyme in the process . The prohead protease is generally encoded next to the gene for the capsid protein that it cleaves and may be fused to it in some cases (e.g., Siphoviridae psiM2 and psiM100) . Although the capsid assembly and maturation processes in phages HK97, from the lambda-like Siphoviridae group, and T4, from the Myoviridae family, are tightly regulated and share many common features, no sequence similarity or evolutionary connection between their prohead proteases has been reported . In this study, we use computational analysis to show that virtually all known prohead proteases from dsDNA phages belong to one of the two superfamilies of serine proteases and are unrelated to each other but apparently functionally interchangeable .
To examine the evolutionary relationship between the prohead proteases from phages T4 and HK97, we searched the NCBI nonredundant protein database (7 November 2003; 1,537,641 sequences; 502,645,420 total letters) with the PSI-BLAST program (1) using members of the U9 and U35 families as queries, with the threshold for inclusion set at a BLAST E value of 0.02 . Starting with the prohead protease from HK97 (gi 9634157; family U35) as a query, we collected the prohead proteases from HK97-related Siphoviridae and observed statistically significant matches to proteases from other phages . In the course of these iterative searches, members of the family U35 were found to share significant sequence similarity with herpesvirus proteases and UL26 assemblins from family S21, and they were all remote homologs of prohead proteases from T4-like Myoviridae, members of the family U9 . For example, when the prohead protease from Salmonella enterica serovar Typhimurium phage ST64B (Podoviridae; gi 23505450; family U35), was used as a query after phage prohead proteases from family U9 and herpesvirus proteases from family S21 were detected, the prohead core scaffold protein and protease gp21 from phage Aeh1 (gi 33414841; family U9), a T4-like member of the Myoviridae, were found in the ninth iteration, with an E value of 10–17 and without any false positives . In the PSI-BLAST searches that were initiated with herpesvirus proteases, a phage head maturation protease from Novosphingobium aromaticivorans (gi 23107536; COG3740) was always just below the threshold . When that protease was submitted as a query, a UL26 capsid maturation protease from meleagrid herpesvirus 1 was found in the second iteration, with an E value of 10–3 .
In Myovirus P2, the internal scaffolding protein gpO is indispensable for the cleavage of capsid protein gpN (29), but it is unknown whether gpO itself has protease activity or if the cleavage reaction is performed by a host protease . During sequence similarity searches, we found that P2 gpO shares significant sequence similarity with phage Mu putative head maturation protease gpI, and both of them are distantly related to herpesvirus assemblin UL26 . For instance, when a conserved hypothetical protein from Chromobacterium violaceum ATCC 12472 (gi 34496934) served as a query in a PSI-BLAST search, gpO from P2-like Myoviridae was detected, followed by the putative head maturation protease gpI from Mu-like phages . If we included the UL26 protein from gallid herpesvirus 3 (gi 10834896), whose similarity was just below the threshold in the seventh iteration in the later round of the PSI-BLAST search, we found many UL26 proteases from herpesviruses without any false positives until no new sequences could be retrieved after the 10th iteration .
Herpesviruses are enveloped dsDNA viruses that can cause severe
health problems in humans and other animals . As in dsDNA bacteriophages,
the proteolytic processing of capsid protein precursor by a
virus-encoded enzyme is essential for producing infectious virions (17,
20, 37-39), and several
structural and mechanistic parallels between the maturation of phage
heads and herpesvirus capsids have been noted (14,
31) . So far, the UL26 gene products are the only
known proteases encoded by herpesvirus genomes, and the
three-dimensional structures of UL26 proteases from all three
herpesvirus subfamilies have been determined (3,
6, 15, 25,
26, 28, 32, 35)
(Fig . 1A) . The central core of the enzyme is a
ß-barrel, consisting of two sheets held together at one edge by a
long ß-strand (ß3) passing between the sheets and forming a
triple-stranded corner and at the other edge by two hydrogen bonds
between strands ß5 and ß7 . All residues with a direct role in
catalysis, including the three members of the catalytic triad and two
residues predicted to form the oxyanion hole, are located within this
core . The core is surrounded by eight
-helices,
four of which play a role in the formation of the homodimer . The
similarity between phage proteases and UL26 proteases extends along
most of the ß-strand's rich central region of the UL26
sequence, except for the middle region of the herpesvirus sequences,
which consists of three
-helices
that appear to be lacking in phage proteases .
|
To further validate the sequence and structure similarities between
proteases from phages and herpesviruses, we queried the global
network of independent structure prediction servers via the 3D-Jury
Meta predictor (13;
http://bioinfo.pl/Meta/) with prohead proteases from HK97 and
ST64B . Herpesvirus proteases were ranked as the top-scoring matches,
with 3D-Jury scores greater than 90, which are typical of true
positives (13) . We concluded that the three
families are probably derived from a common ancestor and propose to
group them into a single protease clan, SH .
A multiple-sequence alignment of proteases from the newly defined clan SH is shown in Fig . 2 . The fully conserved serine residue in the middle of the fifth ß-strand suggests that the phage prohead proteases from families U9 and U35 and the internal scaffolding protein gpO from P2-like phages are serine proteases . The second member of the catalytic triad, histidine, located near the C terminus of strand 2, is also universally conserved . In UL26 proteases, a histidine residue (His157 in human cytomegalovirus [HCMV] protease) at the N terminus of strand 6 is predicted to be the third triad member according to its position in the three-dimensional structure (6, 25, 32, 35) . This position aligns with a well-conserved acidic residue in all phage proteases, which therefore appear to have a more conventional His-Ser-Asp(Glu) catalytic triad . Kinetic studies of HCMV protease with the His157 residue mutated to alanine (H157A) showed about a 10-fold loss in activity relative to that of the wild-type enzyme, while the H157D and H157E mutants were about three times less active than the wild type (16) . It would be interesting to see whether phage prohead proteases from families U9 and U35, which all have an aspartate or glutamate in the third position, display lower activities than their herpesvirus homologs or whether evolutionary changes elsewhere in the molecule have compensated for that .
|
Two other recognized families of phage prohead proteases are families
S14 and S49, which include proteases widely distributed in bacteria,
archaea, and eukarya . The founding members of the S14 and S49
families are, respectively, ClpP protease and E . coli
peptidase IV . Using lambda prohead protease gpC protein (gi 9626248)
as a query and after detecting many S49 family proteases, we also
retrieved ClpP proteases at the second iteration . Statistically
significant matches to many members of the large crotonase
superfamily were detected in later iterations, which is compatible
with the observation of a distant structural similarity between ClpP
protease and crotonases (11, 22) . The
computational fold recognition experiments further validated the
sequence and structure similarities between proteases from families
S14 and S49 . As expected, the best match from the 3D-Jury consensus
prediction (13) for lambda gpC protein is ClpP protease
(PDB code 1TYF), with a score of 126.33 . With an
/ß
fold composed of six repeats of the ß-ß-
unit (Fig . 1B) (36), ClpP is
distinct from the herpesvirus protease fold . An analysis of a
multiple-sequence alignment (Fig . 3) indicates that
most of the enzyme catalytic core is well conserved among proteases
from families S14 and S49; therefore, proteases from the two families
are expected to adopt a ClpP/crotonase fold and constitute clan SK .
|
Herpesvirus-like proteases and ClpP-like proteases represent two
distinct serine protease folds that are most likely evolutionarily
independent but appear to play the same essential function in the
phage life cycle . The only phage prohead protease with experimentally
demonstrated activity that does not seem to belong to these two
classes and lacks similarity to any other proteins is the product of
orf13 in pneumococcal phage Cp-1 (21) . While ClpP-like
proteases are found in both prokaryotic and eukaryotic organisms,
a striking feature of clan SH is that its members are found
only in the dsDNA viruses of bacteria and eukaryotes . Sensitive
probabilistic searches of sequence databases, such as with the HMMER
package (9) and Wise2 package (http://www.ebi.ac.uk/Wise2/),
as well as searches of the databases of unfinished prokaryotic
and eukaryotic genomes and expressed sequence tags, failed to reveal
clan SH members in any other viruses or cellular organisms . All
significant matches to bacterial genomes turned out to be in the
recently integrated prophages, which typically possess the full
complement of structural components of the virions (data not shown) .
Mechanistic similarities between the maturation of phage heads and
herpesvirus capsids are quite remarkable . Both procapsids are
assembled on an asymmetric
-helical
scaffold, which is cut into fragments by viral protease and leaves
behind an empty head shell . In both cases, replicated virus genome is
packaged into this shell until the associated virus-encoded terminase
enzyme cleaves genomic DNA . Concomitant with DNA packaging, the
capsid changes from nearly spherical to a much more stable
icosahedral form . These parallel relationships remained an evolutionary
puzzle without direct evidence for sequence relationships between
individual components . It has been noted, however, that the
orders of structural components and factors of their assembly in
virus genomes were similar and that large subunits of phage and
herpesvirus terminases are related ATP-hydrolyzing enzymes (5,
14, 31) . Statistically significant
sequence similarity between the SH-type proteases from phages and
herpesviruses strengthens the argument that the complement of genes
encoding capsid components and maturation factors in bacteriophages
and herpesviruses has evolved from a common ancestor .
In phages, the prohead protease gene is always located between genes that encode the portal protein and the major capsid protein . Since these three gene products interact intimately with each other during the phage capsid assembly and maturation process, the tandem arrangement appears to be under a selective constraint . Strikingly, however, the type of the maturation protease is not correlated with the extent of sequence similarity between the neighboring structural proteins . One such example is presented by phages Xp10, D3, and Pseudomonas putida prophage; their genes coding for the portal protein and the major capsid protein are closely related and must have diverged very recently (Fig . 4 and data not shown), yet in phages Xp10 and D3, the protease belongs to clan SK, and the gene in exactly the same location in P . putida prophage codes for a clan SH member (40) .
|
A prohead protease gene can thus be replaced in evolution by a
functionally equivalent gene coding for a structurally unrelated
protease . Such nonorthologous gene displacement is thought to be
common in the genomes of cellular life forms (18) and has
been noted to occur in some other phage genes, such as integrase
(33) and lysozyme (4) . In order to
trace the history of the mutual displacement of prohead proteases, we
built a phylogenetic tree of dsDNA phages based on the similarity of
their portal proteins and labeled the leaves of the resulting
consensus tree by the type of prohead protease observed in each
lineage . Analysis of the resulting tree (Fig . 4)
indicates that the SK-type protease may be a derived characteristic,
with the possessors of this protease typically being nested within a
group of phages with the SH-type protease . All of the deepest
branches of the tree, or all clades but one with bootstrap support of
more than 50%, are inferred to have the SH-type protease at the root
(Fig . 4 and data not shown) . Thus, the SH-type
protease may be evolutionarily ancient, predating the common ancestor
of present-day DNA phages, and several in situ gene replacements by
the ClpP-like protease may have occurred in different evolutionary
lineages . Interestingly, at lease in one case, such SH-to-SK
displacement appears to have been followed by a reverse displacement,
from SK to SH, in the P . putida prophage (Fig . 4) .
The phage phylogeny shown in Fig . 4 is inferred on the basis of a single gene, albeit one that has strong functional links to prohead protease . We also examined the distribution of two classes of portal proteases in a phylogenetic tree built on the basis of a whole-genome approach (30) . There are topological differences between that phage proteomic tree and our single-gene tree, but remarkably, the pattern of incongruity between the protease type and the whole-genome phylogeny still holds, with phages with the SK-type protease typically nested within a group of phages with the SH-type protease (details are available from the authors upon request) .
To conclude, the proteolytic cleavage of the scaffold during the assembly of capsids in dsDNA bacteriophages appears to be a virus-specific function provided by the proteolytic enzymes from at least two distinct families and folds . One of these folds, which is perhaps evolutionarily more ancient, is shared with the functionally very similar protease of herpesviruses . Phylogenetic analysis suggests that the two types of head maturation proteases in dsDNA bacteriophages have displaced each other on multiple occasions .
| ACKNOWLEDGMENTS |
|---|
We thank Galina Glazko and Helen Piontkivska for assistance with the
phylogenetic analysis and two anonymous reviewers for many
suggestions for improving the manuscript .
| FOOTNOTES |
|---|
* Corresponding author . Mailing address: Stowers Institute for
Medical Research, 1000 E . 50th St., Kansas City, MO 64110 . Phone: (816)
926-4425 . Fax: (816) 926-2073 . E-mail: jil@stowers-institute.org.
Supplemental material for this article may be found at http://jb.asm.org/ .
| REFERENCES |
|---|
What Is Genome?,
What Is Botulism?,
What Is Fermentation?,
What Is Salmonella?,
What Is Biofilm?,
o,
Microorganism,
s,
Microorganisms,
n,
Bacteriology,
c,
Bacteria,
a,
Microbe,
a,
Clostridia,
s,
Microbial,
r,
Microbiological,
s,
Functional genomics,
a,
Staphylococcus aureus,
o,
S. cerevisiae,
r,
Vibriosis,
r,
Eubacter,
e,
Yeasts,
n,
Yeasts,
e,
Salmonella typhimurium,
c,
Alcaligenes,
a,
Lactobacillus,
r,
Pseudomonas,
o,
Bacteriophages,
c,
Streptomycin,
c,
Microorganisms,
r,
Erythromycin,
e,
Antibiotics,
o,
Escherichia coli,
s,
Biofilms
© 2005
Transgalactic Ltd (manufacturer of Bioscreen C software) |
Privacy Statement | P.O. Box
1393, 00101 Helsinki, Finland,
Last modified: May 25, 2005
| ||||||