Microbiology Reader
Equipment to run microbiology work automatically

Growth Curves of any strain.
Microbiological calculations.

Microbiology Home
Microbioloy Reader
Growth Curves
Photo Album
Microorganisms
Software
Download
Purchasing
Contact Us

Scientific Publications - Work Done by Microbiology Reader Bioscreen C

 

Lecture notes, Spring 2003: BE 203 - Bioinformatics III (Systems Biology), BE 160C/203 Topic I.7, April 23rd, 2002, University of California, San Diego, Department of Bioengineering, Genetic Circuits Research Group

Data for reconstruction:  Proteomics

Steve Fong / Bernhard Palsson
 

INTRODUCTION

This topic will discuss mainly experimental methods for proteomics. Most of the methods have been developed in the last few years so that as a field proteomics is still in its infancy. However, there is great excitement about the prospects of proteomics research for understanding cellular function and reconstruction of genetic circuits. The hope is that just like genomics is today in a few years time we will be routinely be able to use data from proteomics technologies to reconstruct genetic circuits.

 

OVERVIEW

The proteomics part will be heavily focused on experimental methods for the simple reason that there is very little material available on proteomics data analysis and even less on reconstruction using this data.

 

1. Proteomics tasks:
a. Protein interaction mapping
b. Protein expression profiling
c. Protein activity profiling
d. Protein modification profiling

2. Proteomics methods:
a. Yeast two-hybrid screen
b. Mass spectrometry
c. Protein chips

3. Proteomics data analysis:

a. Protein interaction network analysis
b. Mass spec based protein identification

 

PROTEOMICS OVERVIEW

Proteomics is a vastly larger field in terms of different methods and applications than either genomics or transcriptomics. For this reason there are multiple ways in which one could organize an overview on proteomics. Because many of the experimental methods can be used for multiple tasks, this lecture is organized by the different experimental methods and the specific applications and data analysis techniques are described under each method.

 

What is proteomics?

• Proteome: Expressed protein complement of a genome
• Proteomics: Just about everything that has something to do with proteins and is “high-throughput” in some way
• The next big thing after genomics and transcriptomics
• Here we focus mainly on proteomics methods that could be useful for genetic circuit reconstruction, i.e. ignore structural proteomics/genomics

 

WHAT IS PROTEOMICS?

Proteomics is not a very well defined topic – it could be described as a largescale study of protein structure, expression, and function (including modifications and interactions). Proteomics is currently an extremely “hot” topic and people from many different fields – biology, bioengineering, bioinformatics, computer science, chemistry … - are actively developing ways to characterize the proteome. In this lecture we will ignore a major part of proteomics – large-scale structural characterization of proteins. Although knowing the structures of all proteins would certainly help in annotating their functions, this type of data is only indirectly connected to reconstruction of genetic circuits.

 

Challenges in proteomics

• The number of different proteins is much larger than the number of genes or mRNAs:
– Alternative splicing
– Post-translational modifications
– Proteolytic cleavage
• Changes in protein “states” (e.g. phosphorylation) occur typically much faster than changes in transcriptional regulation
• DNA/RNA has four different building blocks and a (mainly) uniform hydrophilic negatively charged structure – proteins have 20 building blocks, highly variable structure, hydrophobicity, and charge
• Proteins need to be kept in a functional correctly folded state for most proteomics applications

 

CHALLENGES IN PROTEOMICS

Proteins are much more difficult to characterize and work with than nucleic acids. This is basically the reason for the predominance of molecular biology over biochemistry in biological research over the last 30 years or so. This slide lists just some of the major reasons why characterizing the proteome is expected to be much more difficult than characterizing the genome or transcriptome. The flip side is that the proteome is vastly more interesting for a lot of practical applications than any of the other ‘omes’. For example most drugs act on proteins, not nucleic acids, and enzymes, not nucleic acids, are used as catalysts in bioprocessing applications.

 

Some proteomics tasks

1. Protein interaction mapping
– Best developed of the proteomics subfields
– Methods include yeast two-hybrid, co-immunoprecipitation with mass spec, and protein chips
2. Protein expression profiling
– Same as gene expression profiling, but for proteins
– Methods include 2DGE or LC coupled with mass spec and protein chips
3. Protein activity profiling
– For example kinase activity on different substrates
– Usually done using protein chips
4. Protein modification profiling
– For example phosphorylation
– Usually done using some mass spec-based approach

 

TYPICAL PROTEOMICS TASKS

There are many possible tasks in proteomics out of which only fo ur fairly well-developed ones are listed here. For most of these tasks there are more than one method and all the methods have their own advantages and disadvantages.

 

Yeast two-hybrid (Y2H) method

• Standard genetic screen for physical protein-protein interactions
• Benefits:
– Simple
– Inexpensive
– Scalable and automatable
• Problems:
– Non-physiological (done in the yeast nucleus)
– High false positive rate (>30%) requires replication and careful post-processing
– Cannot detect multiprotein complexes

 

YEAST TWO-HYBRID METHOD

Figure caption: Principle of two-hybrid library and array screens. (a) Typical two-hybrid screens use a library of random DNA or cDNA fused to a transcriptional activation domain (AD), expressed in yeast ('preys'; circles denote plasmids). The library clones are mated to a strain of opposite mating type that expresses a protein of interest ('bait', B) as a fusio n to a DNA-binding domain (DBD). If bait and prey interact in the resulting diploid cells, they reconstitute a transcription factor, which activates a reporter gene whose expression allows the diploid cell to grow on selective media (here, without histidine). As an alternative to mating, prey libraries can also be transformed into the bait strain in order to express bait and prey in the same cell. In any case, positive clones have to be picked, their DNA isolated and the encoded plasmids sequenced in order to identify interacting proteins. (b) Array screens use defined sets of cloned prey ORFs or fragments thereof that are mated systematically to a certain bait strain. Matings and two-hybrid tests can be automated when large sets of preys have to be assayed, as in the case of whole genomes.

-- Curr Opin Chem Biol 2002 Feb;6(1):57-62
Two-hybrid arrays.

 

Applications of Y2H screens

• Yeast: Two genome-wide Y2H screens have been performed resulting in >6000 interactions between >4500 proteins (Uetz et al. Nature 403:623 (2000) and Ito et al. PNAS 98:4569 (2001))
• H. pylori: A genome-wide Y2H screen using individual protein domains as baits was performed resulting in >1200 interactions (Rain et al. Nature 409:211 (2001))
• Mouse: Mammalian twohybrid method was used to detect 145 interactions (Suzuki et al. Genome Res 11:1758 (2001)) Uetz P Curr Opin ChemBiol 6:57 (2001)

 

APPLICATIONS OF Y2H-TYPE SCREENS

There have been three major Y2H screen projects: Two in yeast and one in H. pylori. In addition smaller Y2H screens have been performed for C. elegans and D. melanogaster. Mammalian two- hybrid screens have also been developed to avoid problems associated with doing interaction screens in yeast for non-yeast proteins.

--

Figure caption: Mammalian two-hybrid system used by Suzuki et al. About 3500 mouse cDNAs were amplified by PCR and these PCR products (ORF X and ORF Y) mixed with another PCR product that carried a cytomegalovirus (CMV) promoter and either a Gal4 DNA-binding domain (DBD) or a VP16 transcriptional activation domain (AD). Because the two PCR products have overlapping sequences, they can be fused into one DNA fragment by a secondary PCR reaction using primers at the ends of the individual fragments. The final PCR fragments were transfected into tissue culture cells (CHO-K1) together with a reporter plasmid that carried a luciferase gene (Luc). When the encoded proteins interact, the luciferase reporter gene is transcribed and its activity can be measured as fluorescence. All 3500 Χ 3500 protein combinations were tested. To speed up the screening procedure, various numbers of baits and preys were co-transfected (i.e. pooled), and positive signals were later deconvoluted to identify interacting proteins.

 

ANALYSIS OF INTERACTION DATA

Nature 2001 May 3;411(6833):41-2 Lethality and centrality in protein networks. Jeong H, Mason SP, Barabasi AL, Oltvai ZN.

 

Figure caption: Characteristics of the yeast proteome. a, Map of protein– protein interactions. The largest cluster, which contains 78% of all proteins, is shown. The colour of a node signifies the phenotypic effect of removing the corresponding protein (red, lethal; green, non- lethal; orange, slow growth; yellow, unknown). b, Connectivity distribution p(k) of interacting yeast proteins, giving the probability that a given protein interacts with k other proteins. The exponential cut-off6 indicates that the number of proteins with more than 20 interactions is slightly less than expected for pure scale-free networks. In the absence of data on the link directions, all interactions have been considered as bidirectional. The parameter controlling the short-length scale correction has value k0 1. c, The fraction of essential proteins with exactly k links versus their connectivity, k, in the yeast proteome. The list of 1,572 mutants with known phenotypic profile was obtained from the Proteome database13. Detailed statistical analysis, including r = 0.75 for Pearson's linear correlation coefficient, demonstrates a positive correlation between lethality and connectivity. For additional details, see http://www.nd.edu/~networks/cell.

 

Integration with other data

• Y2H with phage display-derived binding motif analysis to reduce false positives (Tong AHY et al. Science 295:321 (2002))
• Y2H with protein sequence/structure analysis ΰ Prediction of interactions from various features:
– Amino acid properties (Bock JR and Gough DA Bioinformatics 17:455 (2001))
– Domains/motifs (Gomez SM and Rzhetsky A Pac Symp Biocomp 413 (2002))
– Conserved amino acids (Pazos F and Valencia A Proteins 47:219 (2002))
• Y2H with gene expression:
– In general an interacting proteins do not tend to be significantly co-expressed except in permanent complexes such as ribosomes (Jansen R et al. Genome Res 12:37 (2002))
– Methods are being developed that try to utilize gene expression and interaction data to e.g. predict functional pathways/complexes (GeH et al. Nat Genet 29:482 (2001))

 INTEGRATION OF INTERACTION DATA WITH OTHER DATA TYPES

Science 2002 Jan 11;295(5553):321-4
A combined experimental and computational strategy to define protein interaction networks for peptide recognition modules.
Tong AH, Drees B, Nardelli G, Bader GD, Brannetti B, Castagnoli L, Evangelista M, Ferracuti S, Nelson B, Paoluzi S, Quondam M, Zucconi A, Hogue CW, Fields S, Boone C, Cesareni G

Bioinformatics 2001 May;17(5):455-60
Predicting protein--protein interactions from primary structure.
Bock JR, Gough DA

Pac Symp Biocomput 2002;:413-24
Towards the prediction of complete protein--protein interaction networks.
Gomez SM, Rzhetsky A

Proteins 2002 May 1;47(2):219-27
In silico two-hybrid system for the selection of physically interacting protein pairs.
Pazos F, Valencia A

Genome Res 2002 Jan;12(1):37-46
Relating whole-genome expression data with protein-protein interactions.
Jansen R, Greenbaum D, Gerstein M

Nat Genet 2001 Dec;29(4):482-6
Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae.
Ge H, Liu Z, Church GM, Vidal M

 

MASS SPECTROMETRY

Figure caption: The mass spectrometry approach. (a) A single-stage mass spectrometer. The instrument consists of three components: an io nization source, mass analyzer and ion detector. The mass analyzer that is shown is a time-of-flight (TOF) mass spectrometer. Mass-to-charge ratio (m/z) values are determined by measuring the time it takes ions to move from the ion source to the detector. The time that is required to move this distance can be directly correlated with the m/z value. A mass spectrum of a protein dige st is shown to the right of the figure. (b) The components of one type of tandem mass spectrometer. The instrument consists of an ion source, first mass analyzer, gas-phase collision cell, second mass analyzer and ion detector. The first mass analyzer can be used to isolate a particular m/z value for dissociation in the collision cell. The dissociation products are then analyzed in the second mass analyzer. A tandem mass spectrum for a peptide produces a ladder of fragment ions that represent amide bond cleavage. A peptide spectrum is shown to the right of the mass spectrometer.

 

Types of mass specs

• Number of stages:
– Standard single stage mass spec: Outputs a peptide mass spectrum ΰ Peptide fingerprinting ΰ Protein identification
– Tandem mass spec (MS/MS): Outputs mass spectrum of individual peptides ΰ Protein sequence determination ΰ Modification detection
• Method of ion generation:
– MALDI: Matrix-assisted laser desorption- ionization
– ESI: Electrospray ionization
• Types of mass analyzers:
– TOF: Time of flight
– FTICR: Fourier-transform ion-cyclotron resonance

 

TYPES OF MASS SPECTROMETERS

Mass spectrometry was originally developed as a method to characterize small (organic) molecules in analytical chemistry. In principle all the techniques developed in this application can be modified to work on proteins (which are just large organic molecules). However, in most cases this requires a lot of technology development to allow high reliability and throughput. This slide lists just some of the different types of techniques used in protein mass spectrometry.

 

PROTEIN IDENTIFICATION USING MASS SPEC

Figure caption: Mass spectrometric identification of a protein. A protein is digested using a highly specific protease (typically trypsin) (a) and the derived peptides are analysed in a mass spectrometer (b). One peptide species is selected for collision with an inert gas, such as argon, in the mass spectrometer. The derived fragments of this peptide are measured to give the product mass spectrum (c). Some of these fragments differ in the ir mass by individual amino acids ― leucine and isoleucine having identical masses. Part of the sequence can therefore be read out from a series of peaks in the spectrum. This sequence information is placed in the peptide by the mass of the fragments, and is used in the `peptide sequence tag' in conj unction with the mass of the peptide and the specificity of the protease (tryptic digest results in K or R at the C terminus of the peptide) to search for a match in the database. A single peptide sequence tag is usually sufficient to unambiguously link a database entry with the investigated protein (d). Alternatively, fragment masses are measured and automatically compared with the predicted fragment masses of all peptides derived from a database to find the best match. In any case, the confidence increases with the number of fragmented peptides matching the same entry. Typically 2―70% of the sequence of the entry is covered by the experimental data, depending on the sample amount. Abbreviations: m/z, mass: charge ratio; S. cerevisiae, Saccharomyces cerevisiae.

 

Mass spec data analysis
• Identifying proteins from their peptide spectrum (standard MS)
• Identifying short peptide fragments from their spectrum and matching against sequence databases (MS/MS)
• Accounting for covalent modifications (Pevzner PA et al. Genome Res 11:290 (2001))

 

ANALYSIS OF MASS SPEC SPECTRA

Figure caption: Protein identification using peptide mapping information. (a) In the experiment, the proteins are digested with an enzyme and the masses of the proteolytic peptides are measured with mass spectrometry. (b) In the database search, each protein sequence in the database is digested according to the specificity of the enzyme. The masses of the resulting peptides are calculated and a theoretical mass spectrum is constructed. The measured mass spectrum is compared with the theoretical mass spectrum.

Genome Res 2001 Feb;11(2):290-9
Efficiency of database search for identification of mutated and modified proteins via mass spectrometry.
Pevzner PA, Mulyukov Z, Dancik V, Tang CL

 

Mass spectrometry applications
• Mapping protein complexes
– Tagged bait proteins are used to affinity purify complexes associated with baits
– MS is used to identify proteins in each complex
– Over 700 different complexes were identified in two recent studies (Gavin A - C et al. Nature 405:141 (2002) and Ho Y et al.Nature 415:180 (2002))
– False positive rate estimated at 30%
• Protein expression profiling: identification and quantification
• Detecting protein modifications

MASS SPEC APPLICATIONS

IDENTIFYING PROTEIN COMPLEXES

Figure caption: Analysing protein interactions. In the 'co-precipitation/mass spectrometry' approach used by Gavin et al.1 and Ho et al.2, an 'affinity tag' is first attached to a target protein (the 'bait'; a). b, Bait proteins are systematically precipitated, along with any associated proteins, on an 'affinity column'. c, Purified protein complexes are resolved by one-dimensional SDS– PAGE, a technique that involves running an electric charge through the complexes on a gel, so that proteins become separated according to mass. d, Proteins are excised from the gel, digested with the enzyme trypsin, and analysed by mass spectrometry. Database- search algorithms (bioinformatics) are then used to identify specific proteins from their mass spectra.

 

Traditional 2DGE based profiling
• 2DGE: Two dimensional polyacrylamide gel electrophoresis
– 1st dimension: Isoelectric focusing (charge)
– 2nd dimension: SDS-PAGE (size)
• Problems with 2DGE:
– Detection of low abundance proteins: Typically need more than 1000 copies per cell
– Not really quantitative – Labor intensive and low throughput
– Resolution: co-migration and differential migration
– Membrane/large MW/extreme pI proteins cannot be reliably identified
• Application: Identification of 8767 proteins from the mouse brain

 

PROTEIN EXPRESSION PROFILING USING 2DGE

For a review on 2DGE in general see Lilley KS et al. Curr Opin Chem Biol 6:46 (2001)

 

Genetic analysis of the mouse brain proteome.

Klose J, Nock C, Herrmann M, Stuhler K, Marcus K, Bluggel M, Krause E, Schalkwyk LC, Rastan S, Brown SD, Bussow K, Himmelbauer H, Lehrach H.

Abstract: Proteome analysis is a fundamental step in systematic functional genomics. Here we have resolved 8,767 proteins from the mouse brain proteome by large-gel two -dimensional electrophoresis. We detected 1,324 polymorphic proteins from the European collaborative interspecific backcross. Of these, we mapped 665 proteins genetically and identified 466 proteins by mass spectrometry. Qualitatively polymorphic proteins, to 96%, reflect changes in conformation and/or mass. Quantitatively polymorphic proteins show a high frequency (73%) of allele -specific transmission in codominant heterozygotes. Variations in protein isoforms and protein quantity often mapped to chromosomal positions different from that of the structural gene, indicating that single proteins may act as polygenic traits. Genetic analysis of proteomes may detect the types of polymorphism that are most relevant in disease-association studies.

Figure caption: 2-DE brain protein pattern from a B6-SPR hybrid. From the whole 2-DE pattern, consisting of an acid half and a basic half (see Methods), only the acid half is shown. The three protein spot families show that spot families can be recognized in 2-DE patterns on the basis of genetic variation. The spot family of heat-shock 70-kD protein 4 (HSP70, red) consists of 52 'isospots', which occur in 52 double spots (hybrid spots). The two allelic forms vary in the vertical direction in all 52 double spots, with a spacing of 0.5 mm and the B6 spot on top. The -enolase 2 family (blue) shows 24 double spots, which vary again in the vertical direction with the B6 spot on top, but with a spacing of 1 mm. The lactate dehydrogenase 2 B chain spot family (LDH2, yellow) includes 17 double spots that form in each case horizontal spot pairs spaced at 25 mm, with the B6 spots always on the left side. This family also includes 10 paV spots, which are interpreted as a difference in degradation rate between the two allelic forms 11. Four spots were present on the basic half of the 2-DE pattern (data not shown). Nat Genet 2002 Apr;30(4):385-95.

 

Modern profiling techniques
- Multidimensional chromatography
- Isotope-coded affinity tags

 

MODERN PROTEIN EXPRESSION PROFILING TECHNIQUES

Figure captions:

MudPIT: Multidimensional chromatography for the separation of complex peptide mixtures. The peptides are first separated and fractionated by electrostatic charge using strong cation exchange high-performance liquid chromatography (SCX HPLC). Increasing salt concentrations are applied to the column, either in a stepwise manner or as a continuous gradient to the mobile phase. Each of the discrete peptide fractions is then further separated by hydrophobicity using RP- ΅LC with a gradient of increasing organic solvent. The eluting peptides are analyzed by ESI MS/MS.

ICAT: Quantitative analysis of protein abundance using stable- isotope dilution. Mixtures of proteins (e.g. those extracted from cells in two different biological states) are labeled separately such that the proteins exist in either an isotopically 'light' (L) or 'heavy' (H) form. This can be achieved by metabolic labeling during cell growth on stable- isotope-enriched or depleted media [24] or through chemical reaction with an externally introduced isotopically coded reagent [22,31,32] . The labeled protein mixtures are then combined, proteolyzed and the peptides detected by MS. Alternatively, separate protein mixtures can be first proteolyzed, the peptides isotopically labeled, combined and detected by MS [25– 27] . The relative abundances of labeled peptides, and hence the relative abundances of the proteins from which they were derived, are determined by comparison of the signal intensities between the light and heavy forms of the peptides. The signal intensities are simultaneously detected at discrete mass-to-charge (m/z) values in the mass spectrometer.

 

Profiling applications
• MudPIT: (Washburn MP et al. Nat Biotechnol 19: 242 (2001))
– 1484 yeast proteins were detected and identified
– Included low abundance proteins such as kinases and transcription factors
• ICAT: (Ideker et al. Science 292:929 (2001))
– Protein expression was compared with gene expression in +gal and –gal yeast cultures

 

PROTEIN EXPRESSION PROFILING APPLICATIONS

Nat Biotechnol 2001 Mar;19(3):242-7
Large-scale analysis of the yeast proteome by multidimensional protein identification technology.
Washburn MP, Wolters D, Yates JR 3rd

Science 2001 May 4;292(5518):929-34
Integrated genomic and proteomic analyses of a systematically perturbed metabolic network.
Ideker T, Thorsson V, Ranish JA, Christmas R, Buhler J, Eng JK, Bumgarner R, Goodlett DR, Aebersold R, Hood L

 

Figure captions:

Sensitivity of MudPIT to a wide variety of protein classes

The percentage of proteins identified in this study from a variety of protein classes is presented. The percentages were determined by dividing the number of proteins identified in the study in each category shown by the total number of predicted proteins from each category shown. MIPS (ref. 24) and the Yeast Proteome Database33 were used to obtain the predicted numbers of proteins from S. cerevisiae in each class. From left to right are the percentages identified of total proteins, proteins with a CAI <0.2, proteins with a pI < 4.3, proteins with a pI >11, proteins with a MW <10kDa, proteins with a MW >180 kDa, integral membrane proteins (IMPs) with three or more predicted transmembrane domains, and peripheral membrane proteins (PMPs).

Comparison of protein vs mRNA expression ratios

Scatter plot of protein expression versus mRNA expression ratios. Ratios of wt+gal to wt gal protein expression, measured for each of 289 genes using the ICAT technique, are plotted against the corresponding mRNA expression ratios measured by microarray. Many genes with elevated mRNA or protein expression in wt+gal were metabolic ( ) or ribosomal ( ), whereas genes involved in respiration ( ) almost always had reduced expression levels. Names of genes that were indistinguis hable in both mRNA and protein (due to high sequence similarity) are separated by a slash.

 

Phosphoprotein profiling
• Phosporylation of proteins at Ser/Thr/Tyr residues controls most cellular processes
• Phosphopeptides are enriched using fractionation, affinity chromatography (IMAC), anti-phosphoTyr antibodies, or chemical tagging
• Mass spec can be used to detect phosphoproteins, localize phosphorylation sites and quantify phosphorylation
• Ficarro et al. recently determined 383 phosphorylation sites in yeast using IMAC and MS/MS (Ficarro et al. Nat Biotechnol 20:301 (2002))

 

PHOSPHOPROTEIN PROFILING

Nat Biotechnol 2002 Mar;20(3):301

Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiae.

Ficarro SB, McCleland ML, Stukenberg PT, Burke DJ, Ross MM, Shabanowitz J, Hunt DF, White FM

Abstract:

Protein kinases are coded by more than 2,000 genes and thus constitute the largest single enzyme family in the human genome. Most cellular processes are in fact regulated by the reversible phosphorylation of proteins on serine, threonine, and tyrosine residues. At least 30% of all proteins are thought to contain covalently bound phosphate. Despite the importance and widespread occurrence of this modification, identification of sites of protein phosphorylation is still a challenge, even when performed on highly purified protein. Reported here is methodology that should make it possible to characterize most, if not all, phosphoproteins from a whole-cell lysate in a single experiment. Proteins are digested with trypsin and the resulting peptides are then converted to methyl esters, enriched for phosphopeptides by immobilized metalaffinity chromatography (IMAC), and analyzed by nanoflow HPLC/electrospray ionization mass spectrometry. More than 1,000 phosphopeptides were detected when the methodology was applied to the analysis of a whole-cell lysate from Saccharomyces cerevisiae. A total of 216 peptide sequences defining 383 sites of phosphorylation were determined. Of these, 60 were singly phosphorylated, 145 doubly phosphorylated, and 11 triply phosphorylated. Comparison with the literature revealed that 18 of these sites were previously identified, including the doubly phosphorylated motif pTXpYderived from the activation loop of two mitogen-activated protein (MAP) kinases. We note that the methodology can easily be extended to display and quantify differential expression of phosphoproteins in two different cell systems, and therefore demonstrates an approach for "phosphoprofiling" as a measure of cellular states.

Figure caption:

Enrichment of phosphopeptides from complex mixtures by chemical introduction of an affinity tag. Blue wavy lines represent proteins or peptides in a mixture. The wavy line with 'P' designates a phosphoprotein or phosphopeptide. The affinity tag (in this case, biotin) allows selective isolation of the species of interest.

 

Protein chips
• What is “spotted” on the chip:
– Antibodies
– Actual proteins of interest
– Substrates for proteins of interest
• Mode of attachment:
– Immobilization onto surfaces with high binding energy to proteins (e.g. hydrophobic plastics)
– Through one or a small number of specific strong bonds between protein and surface (e.g. covalent, biotin-streptavidin)
• What is the purpose of the chip:
– Protein expression profiling
– Protein-ligand (including protein-protein) binding screen
– Protein (enzyme) activity screen

 

PROTEIN CHIPS

Figure caption: Classes of capture molecules for protein microarrays. For specific interaction analysis, different classes of molecules can be immobilized on a planar surface to act as capture molecules in a microarray assay. (a) Illustrates antigen–antibody interaction and (b) shows a scheme of a Sandwich immunoassay. In (c), a specific protein–protein interaction is shown. A different class of binders is shown in (d), where synthetic molecules referred to as aptamers act as capture molecules. They can be composed of nucleotides, ribonucleotides or peptides. Interactions of enzymes with their specific substrates are shown in (e), where a substrate (s) for kinases is immobilized and phosphorylated (P) by the respective kinase. A typical example for a receptor–ligand interaction is given in (f), where synthetic low molecular mass compounds are immobilised as capture molecules.

 

Protein expression profiling
• Haab et al. spotted 115 antibodies onto a glass slide in array format
• Arrays were incubated with a mixture of fluorescently labeled protein antigens
• 60 % of antibodies detected qualitatively their antigen
• Only <30% gave quantitative data
• Experiments were repeated by spotting the antigens instead of antibodies
• Better signal amplification would improve results (See e.g. Schweitzer B et al. Nature Biotech 20:359 (2002))

 

PROTEIN CHIP APPLICATIONS: PROTEIN EXPRESSION PROFILING

Genome Biol 2001;2(2):RESEARCH0004

Protein microarrays for highly parallel detection and quantitation of specific proteins and antibodies in complex solutions.
Haab BB, Dunham MJ, Brown PO

Abstract:
BACKGROUND: We have developed and tested a method for printing protein microarrays and using these microarrays in a comparative fluorescence assay to measure the abundance of many specific proteins in complex solutions. A robotic device was used to print hundreds of specific antibody or antigen solutions in an array on the surface of derivatized microscope slides. Two complex protein samples, one serving as a standard for comparative quantitation, the other representing an experimental sample in which the protein quantities were to be measured, were labeled by covalent attachment of spectrally resolvable fluorescent dyes. RESULTS: Specific antibody-antigen interactions localized specific components of the complex mixtures to defined cognate spots in the array, where the relative intensity of the fluorescent signal representing the experimental sample and the reference standard provided a measure of each protein's abundance in the experimental sample. To test the specificity, sensitivity and accuracy of this assay, we analyzed the performance of 115 antibody/antigen pairs. 50% of the arrayed antigens and 20% of the arrayed antibodies provided specific and accurate measurements of their cognate ligands at or below concentrations of 0.34 microg/ml and 1.6 microg/ml, respectively. Some of the antibody/antigen pairs allowed detection of the cognate ligands at absolute concentrations below 1 ng/ml, and partial concentrations of 1 part in 106, sensitivities sufficient for measurement of many clinically important proteins in patient blood samples. CONCLUSIONS: These results suggest that protein microarrays can provide a practical means to characterize patterns of variation in hundreds of thousands of different proteins in clinical or research applications.
 

Figure caption: Antibody array detection of labeled antigens. 114 different antibodies were spotted onto poly-L-lysine coated slides 6-12 times each at a 375 m spacing. Six protein mixes were labeled and detected according to the Materials and methods section. The inset in each panel highlights anti-Flag and anti-IgG spots, and the labels indicate the concentration of the antigen applied to each array. The images were normalized (see the Materials and methods section) and contrast adjusted to better show bright features.

 

Protein interaction screening
• Zhu et al. expressed over 5800 yeast proteins fused to GST- and His-tags
• GST-tags were used to purify all proteins
• His-tags were used to spot proteins onto Ni2+ coated surface
• Protein chips were used to screen for:
– Calmoduling binding
– Phosphoinotisol lipid binding

 

APPLICATIONS OF PROTEIN CHIPS: PROTEIN INTERACTION SCRENING

Science 2001 Sep 14;293(5537):2101-5
Global analysis of protein activities using proteome chips.
Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, Bertone P, Lan N, Jansen R, Bidlingmaier S, Houfek T, Mitchell T, Miller P, Dean RA, Gerstein M, Snyder M

Abstract:

To facilitate studies of the yeast proteome, we cloned 5800 open reading frames and overexpressed and purified their corresponding proteins. The proteins were printed onto slides at high spatial density to form a yeast proteome microarray and screened for their ability to interact with proteins and phospholipids. We identified many new calmodulin- and phospholipid- interacting proteins; a common potential binding motif was identified for many of the calmodulin-binding proteins. Thus, microarrays of an entire eukaryotic proteome can be prepared and screened for diverse biochemical activities. The microarrays can also be used to screen protein-drug interactions and to detect posttranslational modifications.

Figure caption: GST::yeast proteins were purified in a 96-well format. (A) Sixty samples were examined by immunoblot analysis using anti-GST; 19 representative examples are shown. Greater than 80% of the preparations produce high yields of fusion protein. (B) 6566 protein samples representing 5800 unique proteins were spotted in duplicate on a single nickel-coated microscope slide. The slide was probed with anti-GST (10). (C) An enlarged image of one of the 48 blocks is depicted to the right of the proteome chip.

 

Protein activity profiling
• Protein kinase chip by Zhu et al. utilized a custom manufactured nanowell plate
• 119 yeast kinases were attached to the wells and screened against 14 substrates
• Kinase activity was quantified using radiolabeled ATP
• Advantages:
– Eliminates non-specific crosscontamination
– Only small amounts of reagents are needed
– Buffers can be changed easily

 

PROTEIN ACTIVITY PROFILING

Nat Genet 2000 Nov;26(3):283-9
Analysis of yeast protein kinases using protein chips.
Zhu H, Klemic JF, Chang S, Bertone P, CasamayorA, Klemic KG, Smith D, Gerstein M, Reed MA, Snyder M

Abstract:

We have developed a novel protein chip technology that allows the high-throughput analysis of biochemical activities, and used this approach to analyse nearly all of the protein kinases from Saccharomyces cerevisiae. Protein chips are disposable arrays of microwells in silicone elastomersheets placed on top of microscope slides. The high density and small size of the wells allows for high-throughput batch processing and simultaneous analysis of many individual samples. Only small amounts of protein are required. Of 122 known and predicted yeast protein kinases, 119 were overexpressed and analysed using 17 different substrates and protein chips. We found many novel activities and that a large number of protein kinases are capable of phosphorylating tyrosine. The tyrosine phosphorylating enzymes often share common amino acid residues that lie near the catalytic region. Thus, our study identified a number of novel features of protein kinases and demonstrates that protein chip technology is useful for high-throughput screening of protein biochemical activity.

Figure caption: Protein chip fabrication and kinase assays. a, Kinase activities were detected using protein chips. PDMS was poured over the acrylic mold. After curing, the chip containing the wells was peeled away and mounted on a glass slide. The next step included modification of the surface and then attachment of proteins to the wells. Wells were blocked with 1% BSA before kinase, 33P -ATP and buffer were added. After incubation for 30 min at 30 °C, the chips were washed extensively and exposed to both X-ray film and a phosphoimager, which has a resolution of 50 m and is quantitative. For 12 substrates each kinase assay was repeated at least twice; for the remaining 5 the assays were performed once. b , An enlarged picture of the protein chip.

 

Conclusions

• Proteomics refers to large-scale studies of the protein complement of a genome using a variety of experimental techniques
• Proteomics tasks are more diverse than transcriptomics tasks reflecting the wider diversity in protein function
• Methods include yeast two-hybrid, mass spectrometry, protein separation technologies, and protein chips
• Currently most of the challenges are still in developing experimental methods ΰ Data analysis methods development has only recently begun
• We don’t yet know how to utilize proteomics data in genetic circuit reconstruction, but the expectation is that at least for reconstructing signal transduction networks proteomics data will be useful

 

CONCLUSIONS

Proteomics is an exiting and complex field that holds great promise for biological, medical, and bioengineering applications. Right now proteomics methods have been developed to the point where their power can be demonstrated in large-scale studies of interesting biological systems. However, we are still a couple of years away from being able to use these technologies routinely the way genomics or transcriptomics methods are used. On the computational side, due to lack of published data only baby steps have been taken in proteomics data analysis. Also, the utilization of proteomics data in genetic circuit reconstruction appears to be a few years away, but eventually this type of data will be useful for reconstruction of at least systems where modifications of protein states play a central role such as signal transduction.

 

References
• For more on the experimental technologies: BIMM 150/BGGN 218 Post-genomic biology
• General:
– Zhu H and Snyder M: ‘Omic’ approaches for unraveling signaling networks. Curr Opin Cell Biol 14:173 (2002).
– Feb. 2002 issues of Current Opinion in Chemical Biology and Current Opinion in Biotechnology
• Local people developing proteomics technologies include
– John Yates at TSRI (multidimensional chromatography and mass spec)
– Pavel Pevzner at UCSD (protein identification from mass spec data)

 

(Full Text online - PDF)

 

Back to Automation in Microbiology main page

   

   Scientific Publications - Work Done by Microbiology Reader Bioscreen C

Agricultural Microbiology
Anaerobic Microbiology
Antimicrobial Susceptibility
Artificial Atmosphere
Bioassay of Antibiotics
Biofilm Microbiology
Bioreactor Technology
Biotechnology
Cell Biology
Clinical Microbiology
Environmental Microbiology
Experiments with Yeast
Fermentation
Food Microbiology
Functional Genomics
Gene Technology
Growth Media Development
Growth Rate and Lag Time
Industrial Microbiology
Medical/Pharmaceutical Field
Microbiological Assay
Microbiological Research
Microbiology of Cosmetics

go to a specific theme...

Military Microbiology
Molecular Microbiology
Mutagenicity and Genotoxicity
Oral Microbiology
Patents
Postantibiotic Studies
Soil Microbiology
Spore Microbiology
Veterinary Microbiology
Waste/Wastewater Treatment
Water Microbiology
Wine Microbiology

 


 

© 2005 Transgalactic Ltd (manufacturer of Bioscreen C software) | Privacy Statement | P.O. Box 1393, 00101 Helsinki, Finland, phone: +358 9 85172920, fax: +358 9 8749481, e-mail: microbiology@bionewsonline.com
 

 

 

Last modified: May 25, 2005