Microbiology Reader
Equipment to run microbiology work automatically

Growth Curves of any strain.
Microbiological calculations.

Microbiology Home
Microbioloy Reader
Growth Curves
Photo Album
Microorganisms
Software
Download
Purchasing
Contact Us

Scientific Publications - Work Done by Microbiology Reader Bioscreen C

 

United States Patent Application 2003 0104520
Kind Code A1
Ellington, Andrew D. ;   et al. June 5, 2003

Regulatable,  catalytically active  nucleic acids

 

Abstract

Compositions and methods are provided to make, isolate, characterize and use regulatable, catalytically active nucleic acids (RCANA). RCANA may be used for regulating gene expression and in assays to detect the presence of ligands or to detect activation by an effector of an RCANA bound to a solid support such as a chip or multi-well plate. Also disclosed are compositions and methods for automating the selection procedures of the present invention.

Inventors: Ellington, Andrew D.; (Austin, TX) ; Hesselberth, Jay; (Austin, TX) ; Marshall, Kristin A.; (Cambridge, MA) ; Robertson, Michael P.; (Austin, TX) ; Sooter, Letha; (Austin, TX) ; Davidson, Eric; (Austin, TX) ; Cox, J. Colin; (Austin, TX) ; Reidel, Timothy; (Austin, TX)
Correspondence Name and Address:
    Edwin S. Flores, Esq.
    GARDERE WYNNE SEWELL LLP
    3000 Thanksgiving Tower
    1601 Elm Street
    Dallas
    TX
    75201
    US
Serial No.: 883119
Series Code: 09
Filed: June 14, 2001

 

U.S. Current Class: 435/69.1; 435/320.1; 435/325; 514/7; 530/327; 530/328; 530/329; 536/23.5
U.S. Class at Publication: 435/69.1; 435/320.1; 435/325; 530/327; 530/328; 530/329; 514/7; 536/23.5
Intern'l Class: A61K 038/08; A61K 038/10; C07K 007/08; C07K 007/06; C07H 021/04; C12P 021/02; C12N 005/06

 

Claims


What is claimed is:

1. A polynucleotide that is regulated by a polypeptide comprising: a regulatable, catalytically active polynucleotide, wherein the peptide interacts with the polynucleotide to affect its catalytic activity.

2. The polynucleotide of claim 1, wherein the polypeptide is further defined as being a protein.

3. The polynucleotide of claim 1, wherein the polypeptide comprises a peptide of between about 7 and 20 amino acids.

4. The polynucleotide of claim 1, wherein the polypeptide comprises a peptide of between about 7 and 12 amino acids.

5. The polynucleotide of claim 1, wherein the catalytic activity of the nucleic acid is specific for a nucleic acid target sequence.

6. The polynucleotide of claim 1, wherein the catalytic activity of the nucleic acid is regulated by the interaction of the nucleic acid with an effector.

7. The polynucleotide of claim 1, wherein the polynucleotide comprises RNA.

8. The polynucleotide of claim 1, wherein the polynucleotide comprises DNA

9. The polynucleotide of claim 1, wherein the polynucleotide is at least partially single stranded.

10. The polynucleotide of claim 1, wherein the polynucleotide is at least partially double stranded.

11. The polynucleotide of claim 1, wherein the polynucleotide comprises at least one modified base.

12. The polynucleotide of claim 1, wherein the peptide is endogenous.

13. The polynucleotide of claim 1, wherein the peptide is exogenous.

14. The polynucleotide of claim 1, wherein the peptide comprises a phosphorylated peptide.

15. A nucleic acid that is regulated by an effector comprising: a regulatable, catalytically active nucleic acid, generated by the modification of at least one catalytic residue.

16. The nucleic acid of claim 15, wherein the catalytic activity of the nucleic acid is specific for a nucleic acid target sequence.

17. The nucleic acid of claim 15, wherein the catalytic activity of the nucleic acid is regulated by the interaction of the nucleic acid with an effector.

18. The nucleic acid of claim 15, wherein the nucleic acid comprises RNA.

19. The nucleic acid of claim 15, wherein the nucleic acid comprises DNA.

20. The nucleic acid of claim 15, wherein the nucleic acid is at least partially single stranded.

21. The nucleic acid of claim 15, wherein the nucleic acid is at least partially double stranded.

22. The nucleic acid of claim 15, wherein the nucleic acid comprises at least one modified base.

23. The nucleic acid of claim 15, wherein the effector is endogenous.

24. The nucleic acid of claim 15, wherein the effector is exogenous.

25. The nucleic acid of claim 15, wherein the effector comprises a protein.

26. The nucleic acid of claim 15, wherein the effector comprises a pharmaceutical agent.

27. The nucleic acid of claim 15, wherein the effector comprises a protein complex.

28. The nucleic acid of claim 15, wherein the effector comprises a peptide.

29. The nucleic acid of claim 15, wherein the effector a phosphorylated peptide.

30. The nucleic acid of claim 15, wherein the effector comprises a dephosphorylated peptide.

31. The nucleic acid of claim 15, wherein the nucleic acid catalyses a reaction that causes the expression of a target gene to be up-regulated.

32. The nucleic acid of claim 15, wherein the nucleic acid catalyses a reaction that causes the expression of a target gene to be down-regulated.

33. The nucleic acid of claim 15, wherein the nucleic acid is used to detect at least one exogenous effector from a library of candidate exogenous effector molecules.

34. The nucleic acid of claim 15, wherein the nucleic acid and the effector form a nucleic acid-effector complex.

35. The nucleic acid of claim 15, wherein the nucleic acid and the effector is a molecule that forms an nucleic acid-effector complex and the nucleic acid-effector complex acts synergistically to affect the catalytic activity of the nucleic acid-effector complex.

36. The nucleic acid of claim 15, wherein the nucleic acid catalyses a ligation reaction with an oligonucleotide substrate.

37. The nucleic acid of claim 15, wherein the nucleic acid catalyses a reaction that adds a non-oligonucleotide substrate.

38. The nucleic acid of claim 15, wherein the nucleic acid catalyses a reaction that adds biotin to the nucleic acid.

39. The nucleic acid of claim 15, wherein the nucleic acid catalyses a cleavage reaction with an oligonucleotide substrate.

40. The nucleic acid of claim 15, in which the kinetic parameters of nucleic acid catalysis are altered in the presence of one or more effector-effectors that acts on the effector molecule that interacts with the nucleic acid.

41. The nucleic acid of claim 15, in which the kinetic parameters of nucleic acid catalysis are altered in the presence of theophylline.

42. The nucleic acid of claim 15, in which the kinetic parameters of nucleic acid catalysis are altered in the presence of a supermolecular structure.

43. The nucleic acid of claim 15, in which the kinetic parameters of nucleic acid catalysis are altered in the presence of a supermolecular structure that comprises a virus particle.

44. The nucleic acid of claim 15, in which the kinetic parameters of nucleic acid catalysis are altered in the presence of a supermolecular structure that comprises a cell wall.

45. A nucleic acid comprising: a gene; a regulatable, catalytically active nucleic acid inserted within the gene; wherein the presence of an effector causes the nucleic acid to catalyze a reaction.

46. The nucleic acid of claim 45, wherein the catalytic reaction is a self-splicing reaction.

47. The nucleic acid of claim 45, wherein the catalytic reaction is a ligation reaction.

48. The nucleic acid of claim 45, wherein the catalytic reaction is a trans-cleavage reaction.

49. The nucleic acid of claim 45, wherein the catalytic activation of the nucleic acid leads to changes in expression of the gene.

50. The nucleic acid of claim 45, wherein the catalytic activation of the nucleic acid leads to changes in expression of one or more genes.

51. The nucleic acid of claim 45, wherein the catalytic activation of the nucleic acid leads to changes in expression of the mRNA of the gene.

52. The nucleic acid of claim 45, wherein the catalytic activation of the nucleic acid leads to changes in expression of the protein encoded by the gene.

53. A nucleic acid segment comprising: a regulatable, catalytically active nucleic acid comprising one or more catalytic nucleotides, selected from a pool of nucleic acids in which at least one of the catalytic residues has been randomized.

54. A regulatable, catalytically active nucleic acid segment comprising: an effector domain; and a nucleic acid catalyst domain in which one or more catalytic residues of the nucleic acid catalyst have been randomized; wherein the kinetic parameters of the catalytic domain are regulated by an effector that interacts with the effector domain.

55. A method of isolating a regulatable, catalytically active nucleic acid, comprising the steps of: randomizing at least one nucleotide in the catalytic domain of a catalytically active nucleic acid to create a nucleic acid pool; and removing from the nucleic acid pool those nucleic acids that interact with the catalytic target of the catalytic domain.

56. The method of claim 55, further comprising the step of adding an effector to the remaining pool of nucleic acids.

57. The method of claim 55, further comprising the steps of adding an effector to the remaining nucleic acids, wherein the effector acts on the nucleic acids to alter the catalytic activities of the nucleic acids.

58. The method of claim 55, further comprising the step of purifying the isolated nucleic acid.

59. The method of claim 55, further comprising the step of sequencing the isolated nucleic acid.

60. The method of claim 55, wherein the step of removing the nucleic acids is under high stringency conditions.

61. The method of claim 55, wherein the step of removing the nucleic acids is under moderate stringency conditions.

62. The method of claim 55, wherein the step of removing the nucleic acids is under low stringency conditions.

63. The method of claim 55, where the target is an mRNA molecule.

64. The method of claim 56, where the effector is a protein.

65. The method of claim 56, where the effector is a peptide.

66. The method of claim 56, where the effector is a phosphoprotein.

67. The method of claim 56, where the effector is a glycoprotein.

68. The method of claim 56, where the effector is light.

69. The method of claim 56, where the effector is visible light.

70. The method of claim 56, where the effector is a magnet.

71. The method of claim 55, where the target is a metabolic reaction.

72. The method of claim 55, in which nucleic acids with altered catalytic specificity are selected in the presence of an effector.

73. The method of claim 55, in which nucleic acids with altered catalytic activities are selected in the absence of an effector.

74. The method of claim 55, in which nucleic acids with altered catalytic activities are serially selected in the presence and the absence of an effector.

75. The method of claim 55, the effector domain comprises a random sequence pool.

76. The method of claim 55, the effector domain comprises a partially randomized sequence pool.

77. A method of making a regulatable, catalytically active nucleic acid, comprising the steps of: contacting a pool of nucleic acids, the nucleic acids having a catalytic and an effector domain, wherein at least one nucleotide in the catalytic domain of the nucleic acids has been randomized; removing from the nucleic acid pool those nucleic acids that interact with the catalytic target of the catalytic domain; adding an effector protein to the remaining nucleic acids; and isolating those nucleic acids that interact with the catalytic target of the catalytic domain.

78. A method of isolating a regulatable, catalytically active nucleic acid, comprising the steps of: randomizing at least one nucleotide in the catalytic domain of a catalytically active nucleic acid to create a nucleic acid pool; removing from the nucleic acid pool those nucleic acids that interact with the catalytic target of the catalytic domain; adding an effector molecule to the nucleic acids; and isolating those nucleic acids that interact with the catalytic target of the catalytic domain.

79. A method of isolating a regulatable, catalytically active nucleic acid having a catalytic and an effector domain, comprising the steps of: randomizing at least one nucleotide in the catalytic domain of the nucleic acid to create a nucleic acid pool; removing from the nucleic acid pool those randomized nucleic acids that interact with the catalytic target of the catalytic domain; adding an effector to the nucleic acids; and isolating the nucleic acids that interact with the catalytic target of the catalytic domain.

80. An automated method of isolating a regulatable, catalytically active nucleic acid having a catalytic and an effector domain, comprising the steps of: (a) randomizing at least one nucleotide in the catalytic domain of the nucleic acid to create a nucleic acid pool; (b) removing from the nucleic acid pool those randomized nucleic acids that interact with the catalytic target of the catalytic domain; (c) adding an effector to the nucleic acids; (d) adding an effector-effector that specifically interacts with the effector; and (e) isolating the nucleic acids that interact with the catalytic target of the catalytic domain; and (f) repeating steps (a) through (e).

81. A method of detection of a target using a regulatable, catalytically active nucleic acid comprising the steps of. contacting the a regulatable, catalytically active nucleic acid with the target; and measuring the effect of the interaction between the a regulatable, catalytically active nucleic acid and the target.

82. A method of modifying a target using a regulatable, catalytically active nucleic acid comprising the steps of: providing a regulatable, catalytically active nucleic acid capable of target specific modification; and modifying the target under conditions that cause a regulatable, catalytically active nucleic acid-specific activity.

83. A biosensor comprising: a solid support; and at least one regulatable, catalytically active nucleic acid, wherein the kinetic parameters of the nucleic acid on a target vary in response to the interaction of an effector molecule with the nucleic acid; wherein the at least one regulatable, catalytically active nucleic acid is immobilized on the support.

84. The biosensor of claim 83, wherein the reaction is machine readable.

85. The biosensor of claim 83, wherein the solid support comprises a multiwell plate.

86. The biosensor of claim 83, wherein the solid support comprises a surface plasmon resonance sensor.

87. The biosensor of claim 83, wherein the at least one regulatable, catalytically active nucleic acids is covalently immobilized on the solid support.

88. The biosensor of claim 83, wherein the catalytic reaction produces a detectable signal.

89. The biosensor of claim 83, wherein the catalytic reaction is the attachment of a tag to the immobilized nucleic acids to produce the signal.

90. The biosensor of claim 83, wherein the substrate is further defined as containing known nucleic acid sequences tags and the nucleic acids are sorted on the surface of the substrate based on non-covalent hybridization to sequence tags.

91. A biosensor comprising: a solid support; and at least one regulatable, catalytically active nucleic acids, wherein the kinetic parameters of the nucleic acids on a target vary in response to the interaction of an effector molecule with the nucleic acid; wherein catalytic targets of the catalytic domain is immobilized on the support.

92. A biosensor comprising: a solid support; and at least one regulatable, catalytically active nucleic acids, wherein the kinetic parameters of the nucleic acids on a target vary in response to the interaction of an effector molecule with the nucleic acid; wherein the effector is immobilized on the support.

93. A method of selecting a regulatable, catalytically active nucleic acid, comprising the steps of: contacting a pool of nucleic acids, the nucleic acids having a catalytic and an effector domain, wherein at least one nucleotide in the catalytic domain of the nucleic acids has been randomized; removing from the nucleic acid pool those nucleic acids that interact with the catalytic target of the catalytic domain; adding an effector to the remaining nucleic acids; and isolating those nucleic acids that interact with the catalytic target of the catalytic domain; introducing the nucleic acids into a host cell; and measuring the catalytic activity of the nucleic acid upon exposure of the host cell to the effector.

94. The method of claim 93, further comprising the step of purifying the isolated nucleic acid.

95. The method of claim 93, further comprising the step of sequencing the isolated nucleic acid.

96. The method of claim 93, wherein the step of removing the nucleic acids is under high stringency conditions.

97. The method of claim 93, wherein the step of removing the nucleic acids is under moderate stringency conditions.

98. The method of claim 93, wherein the step of removing the nucleic acids is under low stringency conditions.

99. The method of claim 93, where the target is an mRNA molecule.

100. The method of claim 93, where the effector is a protein.

101. The method of claim 93, where the effector is a peptide.

102. The method of claim 93, where the effector is a phosphoprotein.

103. The method of claim 93, where the effector is a glycoprotein.

104. The method of claim 93, where the effector is light.

105. The method of claim 93, where the effector is visible light.

106. The method of claim 93, where the effector is a magnet.

107. The method of claim 93, in which nucleic acids with altered catalytic activities are serially selected in the presence and the absence of the effector.

108. The method of claim 93, the effector domain comprises a completely random sequence pool.

109. The method of claim 93, the effector domain comprises a partially randomized sequence pool.

110. A method of selecting a regulatable, catalytically active nucleic acid, comprising the steps of: contacting a pool of nucleic acids, the nucleic acids having a catalytic and an effector domain, wherein at least one nucleotide in the catalytic domain of the nucleic acids has been randomized; removing from the nucleic acid pool those nucleic acids that interact with the catalytic target of the catalytic domain; adding an effector to the remaining nucleic acids; and isolating those nucleic acids that interact with the catalytic target of the catalytic domain; introducing the nucleic acids into a host cell; and measuring the catalytic activity of the nucleic acid upon exposure of the host cell to the effector.

111. The method of claim 110, further comprising the step of purifying the isolated nucleic acid.

112. The method of claim 110, further comprising the step of sequencing the isolated nucleic acid.

113. The method of claim 110, wherein the step of removing the nucleic acids is under high stringency conditions.

114. The method of claim 110, wherein the step of removing the nucleic acids is under moderate stringency conditions.

115. The method of claim 110, wherein the step of removing the nucleic acids is under low stringency conditions.

116. The method of claim 110, where the target is an mRNA molecule.

117. The method of claim 110, where the effector is a protein.

118. The method of claim 110, where the effector is a peptide.

119. The method of claim 110, where the effector is a phosphoprotein.

120. The method of claim 110, where the effector is a glycoprotein.

121. The method of claim 110, where the effector is light.

122. The method of claim 110, where the effector is visible light.

123. The method of claim 110, where the effector is a magnet.

124. The method of claim 110, in which nucleic acids with altered catalytic activities are serially selected in the presence and the absence of the effector.

125. The method of claim 110, the effector domain comprises a completely random sequence pool.

126. The method of claim 110, the effector domain comprises a partially randomized nucleotide sequence.

127. A method of detecting a regulatable, catalytically active nucleic acid, comprising the steps of: isolating a regulatable, catalytically active nucleic acid; creating a construct in which the nucleic acid is in position to regulate the expression of a reporter gene; introducing the construct into a host cell; and measuring the catalytic activity of the nucleic acid upon exposure of the host cell to an effector.

128. A vector comprising: a regulatable, catalytically active polynucleotide, wherein the peptide molecule interacts with the polynucleotide to affect its catalytic activity.

129. A vector comprising: a regulatable, catalytically active nucleic acid, generated by the modification of at least one catalytic residue.

130. A method of modulating expression of a nucleic acid, the method comprising providing a polynucleotide that is regulated by a peptide, the polynucleotide comprising a regulatable, catalytically active polynucleotide, wherein the peptide interacts with the polynucleotide to affect its catalytic activity; and contacting the polynucleotide with the peptide, thereby modulating expression of a nucleic acid.

131. The method of claim 130, wherein the polynucleotide is provided in a cell.

132. The method of claim 131, wherein the cell is provided in vitro.

133. The method of claim 131, wherein the cell is provided in vivo.

134. The method of claim 131, wherein the cell is a prokaryotic cell.

135. The method of claim 131, wherein the cell is a eukaryotic cell.

136. A method of modulating expression of a nucleic acid, the method comprising the steps of: providing a nucleic acid that is regulated by an effector, the nucleic acid comprising: a regulatable, catalytically active nucleic acid, wherein the regulatable, catalytically active nucleic acid molecule includes at least one modified catalytic residue; and contacting the nucleic acid with the effector, thereby modulating expression of a nucleic acid.

Description



[0001] This application is a continuation-in-part of U.S. Serial No. 60/212,097, filed Jun. 15, 2000.

FIELD OF THE INVENTION

[0002] The present invention relates generally to the field of catalytic nucleic acids and in particular to regulatable, catalytically active nucleic acids that modulate their kinetic parameters in response to the presence of an effector.

BACKGROUND OF THE INVENTION

[0003] Ribozymes are oligonucleotides of RNA that can act like enzymes and are sometimes called RNA enzymes. Generally, they have the ability to behave like an endoribonuclease, catalyzing the cleavage of RNA molecules. The location of the cleavage site is highly sequence specific, approaching the sequence specificity of DNA restriction endonucleases. By varying conditions, ribozymes can also act as polymerases or dephosphorylases.

[0004] Ribozymes were first described in connection with Tetrahymena thermophilia. The Tetrahymena rRNA was shown to contain an intervening sequence (IVS) capable of excising itself out of a large ribosomal RNA precursor. The IVS is a catalytic RNA molecule that mediates self-splicing out of a precursor, whereupon it converts itself into a circular form. The Tetrahymena IVS is more commonly known now as the Group I Intron.

[0005] Regulatable ribozymes have been described, wherein the activity of the ribozyme is regulated by a ligand-binding moiety. Upon binding the ligand, the ribozyme activity on a target RNA is changed. Regulatable ribozymes have only been described for small molecule ligands such as organic or inorganic molecules. Regulatable ribozymes that are controlled by proteins, peptides, or other macro-molecules.

SUMMARY OF THE INVENTION

[0006] The present invention includes a regulatable, catalytically active nucleic acids (RCANA), wherein the catalytic activity of the RCANA is regulated by an effector. The RCANA of the present invention are, therefore, regulatable in that their activity is under the control of a second portion of the RCANA. Just as allosteric protein enzymes undergo a change in their kinetic parameters or of their enzymatic activity in response to interactions with an effector, the catalytic abilities of the RCANA may similarly be modulated by the effector(s). Thus, the present invention is directed to RCANA that transduce molecular recognition into catalysis.

[0007] As will become apparent below, RCANA are more robust than allosteric protein enzymes in several ways: (1) they can be selected in vitro, which facilitates the engineering of particular constructs; (2) the levels of catalytic modulation are much greater for RCANA than for protein enzymes; and (3) since RCANA are nucleic acids, they can potentially interact with the genetic machinery in ways that protein molecules may not.

[0008] It should be noted that the methods described herein may include any type of nucleic acid. For example, these methods are not limited to RNA-based RCANA, but also encompass DNA RCANA and RNA or DNA RCANA. Furthermore, the methods can be applied to any catalytic activity the ribozymes are capable of carrying out. For example, the methods are not limited to ligases or splicing reactions, but could also encompass other ribozyme classes. The methods are also not limited to protein or peptide ligands, but also include other molecular species, such as ions, small molecules, organic molecules, metabolites, sugars and carbohydrates, lipids and nucleic acids. The methods may also be extended to effectors that are not molecules, such as heat or light or electromagnetic fields. Furthermore, the methods are not limited to ligand-induced conformational changes, but could also take into account `chimeric` catalysts in which residues essential for chemical reactivity were provided by both the nucleic acid and the ligand, in concert.

[0009] The effector may be a peptide, a polypeptide, a polypeptide complex, or a modified polypeptide or peptide. The effector may even be, e.g., an enzyme or even light (such as visible light) or even a magnet. The effector may be activated by a second effector that acts on the first effector (also referred to herein as an effector-effector), which may be an inorganic or an organic molecule. The polypeptide, peptide or polypeptide complex can be either endogenous, i.e., derived from the same cell type as the polynucleotide, or exogenous, i.e., derived from a cell type different than the cell from which the polynucleotide is derived.

[0010] The polypeptide or peptide may be phosphorylated or dephosphorylated. Alternatively, the effector may include a pharmaceutical agent. In some embodiments, the nucleic acid catalyzes a reaction that causes the expression of a target gene to be up-regulated. In other embodiments, the nucleic acid catalyzes a reaction that causes the expression of a target gene to be down-regulated. If desired, the nucleic acid may be used to detect at least one exogenous effector from a library of candidate exogenous effector molecules. In some embodiments, the nucleic acid and the effector form a nucleic acid-effector complex.

[0011] In some embodiments, the kinetic parameters of nucleic acid catalysis are altered in the presence of a supermolecular structure, e.g., a viral particle or a cell wall. The nucleic acid may further include a regulatory element that can recognize a target molecule of interest. The nucleic acid may in addition include a transducer element that transmits information from the regulatory element to the catalytically active region of the nucleic acid.

[0012] The invention also includes a biosensor that includes a solid support on which at least one regulatable, catalytically active nucleic acid is disposed. The kinetic parameters of the nucleic acid on a target vary in response to the interaction of an effector molecule with the nucleic acid. The regulatable, catalytically active nucleic acid may be immobilized on the support and the reaction may be machine-readable. The solid support may include, e.g., a multiwell plate, a surface plasmon resonance sensor. Regulatable, catalytically active nucleic acid may be covalently or non-covalently immobilized on the solid support. In some embodiments, the catalytic reaction produces a detectable signal. The substrate may include at least 10 regulatable, catalytically active nucleic acids, at least 100 regulatable, catalytically active nucleic acids, at least 1000 regulatable, catalytically active nucleic acids, at least 10,000 regulatable, catalytically active nucleic acids or even at least 100,000 regulatable, catalytically active nucleic acids.

[0013] Protein and Peptide RCANA. The present invention includes RCANA with catalytic activity that is regulated by a protein or peptide. One embodiment of the present invention involves the in vitro selection of RCANA that are regulated by proteins. A selection scheme for RCANA dependent on protein cofactors has been developed.

[0014] This invention allows the selection of protein-dependent RCANA, which are reagents that can be useful in a variety of applications. For example, protein-dependent RCANA can be used: (1) in chips for the acquisition of data about whole proteomes, (2) as in vitro diagnostic reagents to detect proteins specific to disease states, such as prostate-specific antigen (PSA) or viral proteins, (3) as sentinels for the detection of biological warfare agents, (4) as elements in cell-based assays or animal models for drug development studies or (5) as regulatory elements in gene therapies, as described herein. Initially, many protein targets may prove refractive to selection. However, many derivatives of the base method can be developed, to deal with novel targets or target classes.

[0015] Modification of Catalytic Residues of RCANA. In one embodiment of this invention, the RCANA is generated by the modification of at least one catalytic residue. One of the unique features of the present selection protocol relative to others that have previously been published is that the present invention randomizes not only a domain that is pendant on the catalytic core, but a portion of the catalytic core itself. Thus, the selection for ligand-dependence not only yields species that bind to ancillary regions of the RCANA, but that may help stabilize the catalytic core of the RCANA.

[0016] Also provided by the invention is a method of isolating a regulatable, catalytically active nucleic acid created by randomizing at least one nucleotide in the catalytic domain of a catalytically active nucleic acid to create a nucleic acid pool. The nucleic acid pool whose nucleic acids interact with the catalytic target of the catalytic domain are removed. The method further may also include the step of adding an effector to the remaining pool of nucleic acids. In some embodiments, the method may also include the step of adding an effector to the remaining nucleic acids, wherein the effector acts on the nucleic acids to alter the catalytic activities of the nucleic acids. The method may include optionally the step of purifying the isolated nucleic acid, and, if desired the step of sequencing the isolated nucleic acid. In various embodiments, the step of removing the nucleic acids is under high stringency conditions, moderate stringency conditions, or low stringency conditions.

[0017] Automated Selection of RCANA. The invention further includes the automation of in vitro selection, and a mechanized system that executes both common and modified in vitro selection procedures. Automation facilitates the execution of this procedure, accomplishing in hours-to-days what once necessitated weeks-to-months. Additionally, the mechanized system attends to other technical obstacles not addressed in "common" in vitro selection procedure (e.g., specialized robotic manipulation to avoid cross-contamination). The automation methods are generalizable to a number of different types of selections, including selections with DNA or modified RNA, selections for ribozymes and selections for phage-displayed or cell-surface displayed proteins.

[0018] Automating selection greatly diminishes human error in the actual pipetting and biological manipulations. While programming the robot is often not a trivial task, and can be time consuming, automated selection is far faster and more efficient than manual selection. Time is used preparing samples and analyzing data, rather than performing the actual selection. Additionally, automated selection may include real-time monitoring methods (e.g., molecular beacons, TaqMan.RTM.) into the selection procedure and software that can make intelligent decisions based on real-time monitoring.

[0019] In vitro sensing (or detection) applications. The current invention also provides for the use of RCANA for detection of a wide variety molecular species in vitro. For example, RCANA may be anchored to a chip, such as wells in a multi-well plate. Mixtures of analytes and fluorescently tagged substrates are added to each well. Where cognate effectors are present, the RCANA will covalently attach the fluorescent tags to themselves. Where RCANA have not been activated by effectors, the tagged substrates are washed out of the well. After reaction and washing, the presence and amounts of co-immobilized fluorescent tags are indicative of amounts of ligands that were present during the reaction. The reporter may be a fluorescent tag, but it may also be an enzyme, a magnetic particle, or any number of detectible particles. Additionally, the RCANA may be immobilized on beads, but they could also be directly attached to a solid support via covalent bonds.

[0020] One advantage of this embodiment is that covalent immobilization of reporters (as opposed to non covalent immobilization, as in ELISA assays) allows stringent wash steps to be employed. Additionally, ribozyme ligases have the unique property of being able to transduce effectors into templates that may be amplified, affording an additional boost in signal prior to detection.

[0021] Modified nucleotides may be introduced into the RCANA that substantially stabilize them from degradation in environments such as sera or urine. The analytical methods of the present invention do not rely on binding per se, but only on transient interactions. The present invention requires mere recognition rather that actual binding, providing a potential advantage of RCANA over antibodies. That is, even low affinities are sufficient for activation and subsequent detection, especially if individual immobilized RCANA are examined (i.e., by ligand-dependent immobilization of a quantum dot).

[0022] Expression of RCANA in cells. The RCANAs of the present invention may also be expressed inside cells. The RCANAs of the present invention that are expressed inside a cell are not only responsive to a given effector, but are also able to participate in genetic regulation or responsiveness. In particular, self-splicing introns can splice themselves out of genes in response to exogenous or endogenous effector molecules.

[0023] The present invention includes RCANA constructs that may be inserted into a gene of interest, e.g., a gene targeting expression vector. The RCANA sequence provides gene specific recognition as well as modulation of the RCANA's kinetic parameters. The kinetic parameters of the RCANA vary in response to an effector. Specifically, in the case of RCANA that performs self splicing in the presence of the effector, the RCANA may splices itself out of the gene in response to the effector to regulate expression of the gene.

[0024] In another aspect, the invention includes a method of modulating expression of a nucleic acid by providing a polynucleotide that is regulated by a peptide. The polynucleotide may be a regulatable, catalytically active polynucleotide, in which the peptide interacts with the polynucleotide to affect its catalytic activity. The polynucleotide is contacted with the peptide, thereby modulating expression of a nucleic acid. The polynucleotide may be provided in a cell, and the cell may be, e.g., provided in vitro or in vivo and may be a prokaryotic cell or a even a eukaryotic cell.

[0025] The present invention also includes an RCANA construct with a regulatable oligonucleotide sequence having a regulatory domain, such that the kinetic parameters of the RCANA on a target gene vary in response to the interaction of an effector with the regulatory domain.

[0026] In vivo sensing (or detection) applications. It is possible to activate or repress a reporter gene (e.g., luciferase) containing an engineered intron in response to an endogenous activator. In this way, luciferase-engineered intron constructs may be used to monitor intracellular levels of proteins or small molecules such as cyclic AMP. This method may be used for in vivo measurements in both cellular systems, such as cell culture, and in whole organisms, such as animal models. Such applications may be used for high-throughput screening. If a particular intracellular signal (e.g., the production of a tumor repressor) was desired, compound libraries for pharmacophores that induce the signal (the tumor repressor) are screened for activation of the reporter gene. Thus, the information desired is changed or morphed into the detection of glowing cells.

[0027] Gene therapy applications. Similarly, a gene can be activated or repressed in response to an exogenously introduced effector (drug) for gene therapy. The RCANA may be used for gene expression up regulation (increasing production of the gene product) or down regulation (decreasing the production of the gene product). The construct of one embodiment of the present invention provides a DNA oligonucleotide coding for a catalytic domain and effector binding domain. The advantages of the nucleic acid-based technology of the present invention include, e.g., the ability to continually modulate gene expression with a high degree of sensitivity without additional gene therapy interventions.

[0028] In another aspect, the invention includes a method of modulating expression of a nucleic acid in a cell by providing a polynucleotide that is regulated by an effector, e.g., a peptide. The polynucleotide may be a regulatable, catalytically active polynucleotide, in which the peptide interacts with the polynucleotide to affect its catalytic activity. The polynucleotide is contacted with the peptide, thereby modulating expression of a nucleic acid. The polynucleotide may be provided in a cell, and the cell may be, e.g., provided in vitro or in vivo and may be a prokaryotic cell or a even a eukaryotic cell.

BRIEF DESCRIPTION OF THE DRAWINGS

[0029] For a more complete understanding of the features and advantages of the present invention, reference is now made to the detailed description of the invention along with the accompanying figures in which corresponding numerals in different figures refer to corresponding parts and in which:

[0030] FIG. 1 is a depiction of the secondary structure of the Group 1 theophylline-dependent (td) intron of bacteriophage T4 (wild type);

[0031] FIG. 2a is a photograph of a gel showing activation of the GpITh1P6.131 aptamer construct, together with a graphical representation of the gel, of one embodiment of the present invention;

[0032] FIG. 2b is a photograph of a gel showing activation of GpITh2P6.133 aptamer construct, together with a graphical representation of the gel of one embodiment of the present invention.

[0033] FIG. 3 is a schematic depiction of an in vivo assay system for group I introns of one embodiment of the present invention.

[0034] FIG. 4a depicts a portion of the P6 region of the Group I ribozyme joined to the anti-theophylline aptamer by a short randomized region to generate a pool of aptazymes of the present invention.

[0035] FIG. 4b is a schematic depiction of a selection protocol for the Group I P6 Aptazyme Pool of FIG. 4a.

[0036] FIG. 5 is a diagram of one embodiment of the present invention depicting exogenous or endogenous activation of Group I intron splicing;

[0037] FIG. 6 is a diagram of another embodiment of the present invention depicting a strategy for screening libraries of exogenous activators;

[0038] FIG. 7 is a diagram of an alternative embodiment of the present invention for screening libraries of exogenous activators;

[0039] FIG. 8 is a diagram of yet another alternative embodiment of the present invention for screening libraries of exogenous activators;

[0040] FIG. 9 is a diagram of an embodiment of the present invention for screening for endogenous activators;

[0041] FIG. 10 is a diagram of an alternative to the embodiment of FIG. 9 of the present invention to screen for endogenous activators;

[0042] FIG. 11 is a diagram of another embodiment of the present invention to screen for compounds that perturb cellular metabolism;

[0043] FIG. 12 is a diagram of a further embodiment of the present invention that provides a non-invasive readout of metabolic states;

[0044] FIG. 13 is a diagram of yet a further embodiment of the present invention wherein endogenous suppressors provide a non-invasive readout of multiple metabolic states;

[0045] FIG. 14 is a schematic depiction of an example of a work surface for automatic selection procedures of one embodiment of the invention;

[0046] FIG. 15a is an illustration of the LI ligase aptazyme construct of one embodiment of the present invention;

[0047] FIG. 15b is an illustration of a modified LI ligase aptazyme construct of FIG. 15a of the present invention;

[0048] FIG. 15c is a schematic diagram of a selection protocol of one embodiment of the present invention;

[0049] FIG. 16 is a schematic diagram of a method to anchor an aptazyme construct of the present invention to a solid support in one embodiment of the present invention;

[0050] FIGS. 17(a-d) show the LI ligase was the starting point for pool design;

[0051] FIG. 18(a-d) shows the progression of the L1-N50 selections;

[0052] FIG. 19(a & b) shows protein-dependent regulatable, catalytically active nucleic acid sequences and structures;

[0053] FIG. 20 demonstrates the ribozyme activity with inactivated protein samples;

[0054] FIG. 21 demonstrates an aptamer competition assays;

[0055] FIG. 22 shows the binding and ligation activity as a function of protein concentration;

[0056] FIG. 23 is a flow chart of a method for negative and positive selection of RCANA;

[0057] FIG. 24 shows the progress of the L1-N50 Rev selection;

[0058] FIG. 25 (a & b) shows the theophylline-dependent td group I intron constructs of the present invention;

[0059] FIG. 26 shows the design of an FMN-dependent td nucleic acid intron splicing construct;

[0060] FIGS. 27(a-c) show the relative growth curves of theophylline-dependent in vivo growth;

[0061] FIG. 28 shows 3-Methylxanthine dependent in vivo growth;

[0062] FIG. 29 (a & b) shows a schematic of ribozyme ligase array;

[0063] FIG. 30 shows the results of a regulatable, catalytically active ligase array;

[0064] FIG. 31 shows the titrations of individual allosteric ribozyme ligases.

DETAILED DESCRIPTION OF THE INVENTION

[0065] While the making and using of various embodiments of the present invention are discussed in detail below, the present invention provides many applicable inventive concepts that may be embodied in a wide variety of specific contexts. The specific embodiments discussed herein are merely illustrative of specific ways to make and use the invention and do not delimit the scope of the invention.

[0066] The present invention includes compositions or matter, methods and automation that permit the creation, isolation, identification, characterization and optimization of regulatable catalytically active nucleic acids. Furthermore, it includes methods to use RCANA for in vitro sensing (or detection), in vivo sensing (or detection), and gene therapy. Regulatable, catalytically active nucleic acids selected by the method of the present invention also have advantages over other biopolymers that might be used for sensing or gene regulation. Regulatable, catalytically active nucleic acids are more robust than allosteric protein enzymes in several ways: (1) they can be selected in vitro (facilitating the engineering of particular constructs); (2) the levels of catalytic modulation are much greater than those typically observed with protein enzymes; and (3) since regulatable, catalytically active nucleic acids are nucleic acids, they can potentially interact with the genetic machinery in ways that protein molecules may not.

[0067] The method is not limited to RNA pools, but may also encompass DNA pools or modified RNA pools. Modified nucleotides may be introduced into the regulatable, catalytically active nucleic acids that substantially stabilize them from degradation in environments such as sera or urine. The method is not limited to ligases, but could also encompass other ribozyme classes. The method is not limited to protein-induced conformational changes, but could also take into account `chimeric` catalysts in which residues essential for chemical reactivity were provided by both the nucleic acid and the protein in concert. Initially, many protein targets may prove refractive to selection. Many derivatives of the base method can be developed, however, to deal with novel targets or target classes.

[0068] A. Protein Dependent RCANA

[0069] Effector-dependent ribozymes have been shown to be responsive to small organic compounds, such as ATP and theophylline. The present inventors recognized the need for effector-dependent ribozymes, or as used herein, "regulatable, catalytically active nucleic acids" that are responsive to larger molecules, such as, e.g., peptides or proteins. The peptides, proteins or other large molecules may be provided from endogenous sources (e.g., expressed within a cell or cell extract), or exogenous sources (added or expressed in a cell or cell extract).

[0070] In order to understand the present invention, it is important to understand that previous attempts to make catalytically active nucleic acids that interact and respond to a large effector, by the inventors and others, have failed. Initially, attempts were made to generate protein-dependent ribozymes by the addition of aptamers (known binding sequences) to ribozymes (catalytically active domains). This design and method was unsuccessful in providing regulatable nucleic acids. Next, attempts were made to generate protein-dependent ribozymes by adding random sequence regions between an aptamer (binding) and a ribozyme (catalytic) and selecting for effector-dependence. These attempts were also unsuccessful. Next, the inventors attempted to generate protein-dependent ribozymes by adding a large random sequence region to the catalytic cores of ribozymes and selecting for effector-dependence. These attempts were also unsuccessful. In other words, all previously detailed methods for the generation of ribozymes that were dependent on small organic compounds were unsuccessful for generating ribozymes that were dependent on proteins.

[0071] To date, the present inventors have selected a number of protein- and peptide-dependent ribozyme ligases. One example is the isolation of a protein-dependent, regulatable, catalytically active nucleic acid with an activity that was increased in a standard assay by 75,000-fold in the presence of its cognate protein effector, tyrosyl tRNA synthetase from Neurospora mitochondria (Cyt18). The Cyt18-dependent ribozyme was not activated by non-cognate proteins, including other tRNA synthetases.

[0072] A protein-dependent, regulatable, catalytically active nucleic acid was also created and selected with an activity that was increased by 3,500-fold in the presence of its cognate protein effector, hen egg white lysozyme. The lysozyme-dependent ribozyme was not activated by most non-cognate proteins, including T4 lysozyme, but was activated by a UVery closely related protein, turkey egg white lysozyme. Moreover, the protein-dependent ribozyme was inhibited by a RNA binding species that specifically bound to lysozyme. In other words, the activation of these protein-dependent ribozymes was highly specific.

[0073] A peptide-dependent, regulatable, catalytically active nucleic acids was also created and isolated with activity was increased by 18,000-fold in the presence of its cognate peptide effector, the arginine-rich motif (ARM) from the HIV-1 Rev protein. The Rev-dependent nucleic acid was not activated by other ARMs from other viral proteins, such as HTLV-I Rex. Using the present invention, regulatable, catalytically active nucleic acids may be developed that are regulated by any of a vast number of proteins.

[0074] As will be clear from the continued description, protein dependent RCANAs are useful in a variety of applications. For example, protein-regulated catalytically active nucleic acids can be used (1) for the acquisition of data about whole proteomes, (2) as in vitro diagnostic reagents to detect proteins specific to disease states, such as prostate-specific antigen or viral proteins, (3) as sentinels for the detection of biological warfare agents, or (4) as regulatory elements in gene therapies.

[0075] B. Modification of Residues in Catalytic Domain

[0076] In one embodiment, the present invention randomizes a portion of the catalytic core itself, not necessarily a domain that is pendant on the catalytic core. One example for selection using the present invention was using the L1 ligase. The catalytic core of the L1 ligase has been mapped by deletion analysis and by partial randomization and re-selection. FIG. 15a depicts the LI ligase that was the starting point for pool design. Stems A, B, and C are indicated. The shaded region contains the catalytic core and ligation junction. Primer binding sites are shown in lower case, an oligonucleotide effector required for activity is shown in italics, and the ligation substrate is bolded. The `tag` on the ligation substrate can be varied, but was biotin in the exemplary selection described herein. The LI pool contains 50 random sequence positions and overlaps with a portion of the ribozyme core. In FIG. 15b, Stem B was reduced in size and terminated with a stable GNRA tetraloop, but stem A was unchanged.

[0077] A pool was synthesized in which the random sequence region spanned the catalytic core. Protein-dependent ribozymes were selected from this random sequence pool by selecting for the ability to ligate an oligonucleotide tag in the presence of a protein effector followed by capturing the oligonucleotide tag on an affinity matrix, followed by amplification in vitro or in vivo. Because the catalytic core has been randomized, the selection for protein-dependence not only yields species that may bind to ancillary regions of the ribozyme, but species in which the protein effector actually helps to organize the catalytic core of the ribozyme.

[0078] Selection for protein-dependence from a pool in which at least a portion of the catalytic core of the ribozyme is randomized differs from selection for protein-dependence from a pool in which the catalytic core is not randomized. For example, the catalytic core of the protein-dependent ribozymes that was selected differed substantially from the catalytic core of the original ribozyme and the catalytic core of other, non-protein-dependent ribozymes selected based on the original ribozyme.

[0079] FIG. 15a depicts the LI ligase that was the starting point for pool design in the Cyt18 RCANA selection, as an example of a protein-activated regulated, catalytically active nucleic acid. Stems A, B, and C are indicated. The shaded region contains the catalytic core and ligation junction. Primer binding sites are shown in lower case, an oligonucleotide effector required for activity is shown in italics, and the ligation substrate is bolded. The `tag` on the ligation substrate can be varied, but was biotin in the exemplary selection described herein. The LI pool contains 50 random sequence positions and overlaps with a portion of the ribozyme core. In FIG. 15b, Stem B was reduced in size and terminated with a stable GNRA tetraloop, but stem A was unchanged.

[0080] Because one or more residues in the catalytic core have been randomized, the effectors may add essential catalytic residues for a given reaction. That is, both the effector molecule and the regulatable, catalytically active nucleic acids contribute a portion of the active site of the ribozyme. For example, using the method of the present invention a ribozyme and an effector molecule that would only carry-out poorly an enzymatic function independently may perform that enzymatic function upon interaction with one another. As such, a regulatable, catalytically active nucleic acid that contributes a guanosine and an adenosine and a protein effector that contributes a histidine together form a complex that has greater activity than either of the individual compounds. Using the methods disclosed herein it is possible to identify a chimeric effector:ribozyme (e.g., a protein:RNA complex) active site that would lead to catalysis. The invention describes ribozymes that have a detectable, basal chemical reactivity, and that the presence of the effector modulates this basal chemical reactivity. It is for this reason that the present invention differs significantly from other inventions which have claimed protein:RNA complexes in which no basal catalytic activity exists in the ribozyme or protein alone.

[0081] C. Selection of RCANA

[0082] FIG. 15c schematically shows the following selection scheme: the RNA pool was incubated with a biotinylated tag and reactive variants were removed from the population. The remaining species were again incubated with a biotinylated tag in the presence of the target (for example the protein Cyt18). Reactive variants were removed from the population and preferentially amplified by reverse transcription, PCR, and in vitro transcription.

[0083] Ligand-dependent, regulatable, catalytically active nucleic acids selected by this method differ from functional nucleic acids selected from random sequence pools. Selection for ligand-dependence requires a selection for catalytic activity as opposed to a selection for binding. Therefore, protein-dependent, regulatable, catalytically active nucleic acids are not aptamers. The composition of matter of a selected protein-dependent ribozyme will be different than the composition of matter of a selected aptamer. For example, the sequence of the lysozyme-dependent ribozyme is different from the sequence of anti-lysozyme aptamers. An important feature of the present invention is that the regulatable, catalytically active nucleic acids disclosed herein only required recognition rather than selected or enhanced binding ability. For example, the affinity of lysozyme for the naive, unselected RNA pool is identical to the affinity of lysozyme for the selected, regulatable, catalytically active nucleic acid. The only difference is that the way in which lysozyme is recognized by the regulatable catalytically active nucleic acids leads to activation, while for the pool as a whole non-specific binding does not lead to activation. In other words, binding is a concomitant but secondary function of selection for regulatability; that is, the regulatable ribozymes disclosed herein may bind the effector or target very poorly, but upon interaction the activity of the ribozyme may nonetheless be modulated.

[0084] D. Automated Selection of RCANA

[0085] Robotic workstations have become essential to high-throughput manipulations of biomolecules, such as in high-throughput screening for drugs with a particular mechanism of action. The invention also includes the automation of in vitro selection procedures, and a mechanized system that executes both common and modified in vitro selection procedures. Automation facilitates the execution of this procedure, accomplishing in hours to days what once necessitated weeks to months. In particular, the present inventors have adapted a robotic workstation to the selection of aptamers and ribozymes. However, the automation methods are generalizable to a number of different types of selections, including selections with DNA or modified RNA, selections for ribozymes, and selections for phage-displayed or cell-surface proteins.

[0086] In short, in vitro selection involves several components: generation of a random sequence pool, sieving the random sequence pool for nucleic acid species that bind a given target or catalyze a given reaction, amplification of the sieved species by a combination of reverse transcription, the polymerase chain reaction, and in vitro transcription. Beyond the generation of the random sequence pool, each of these steps can potentially be carried out by a robotic workstation. The pool can be pipetted together with a target molecule. If the target is immobilized on a magnetic bead, then the nucleic acid:target complex can be sieved from solution using an integrated magnetic bead collector. Finally, selected nucleic acid species can be eluted from the complex and amplified via a series of enzymatic steps that include the polymerase chain reaction carried out via an integrated thermal cycler.

[0087] There are many potential ways in which binding species can be sieved from a random sequence population. However, not all of these methods are amenable to automated selection. For example, to select aptamers, others have suggested that targets can be immobilized onto microtitre plates and binding species can be sieved by panning. The present inventors have had little success with this method, likely because panning is a relatively inefficient, low stringency method for sieving. Instead, the present inventors have discovered that when targets are immobilized on beads and mixed with a random sequence pool, binding species can be efficiently sieved from non-binding species by filtration of the beads. Beads can be readily manipulated by pipetting, allowing for the facile recovery and elution of the binding species, which are then amplified and carried into subsequent rounds of selection. This method differs from the magnetic bead capture method, and can be carried out with much higher stringency. This method is novel, and has not previously been used for in vitro selection experiments.

[0088] FIG. 14 depicts schematically an exemplary work surface for yet another embodiment of the present invention: automated selection. See, J. C. Cox, et al., Automated RNA Selection Biotechnol. Prog., 14, 845 850, 1998.

[0089] Base protocol. Automated selection involves several, sequential automated steps. Several modules are placed on the robotic work surface, including a magnetic bead separator, and enzyme cooler, and a thermal cycler. After manually preparing reagents and preloading those reagents (including random pool RNA, buffers, enzymes, streptavidin magnetic beads, and biotinylated target) and tips onto the robot, a program is run. The selection process, automated by the robot, goes as follows: RNA pool is incubated in the presence, of biotinylated target conjugated to streptavidin magnetic beads. After incubation, the magnets on the magnetic bead separator are raised, and the beads (now bound by pool RNA--the selected nucleic acids) are pulled out of solution. Thus, the beads can be washed, leaving only RNA bound to targets attached to beads. These RNA molecules are reverse transcribed, reamplified via PCR, and the PCR DNA is in vitro transcribed into RNA to be used in iterative rounds of selection.

[0090] The Bioworks method for in vitro selection. This scripted programming method contains all movements necessary in order to facilitate automated selection. This includes all physical movements to be coordinated, and also communication statements. For instance, five rounds of automated selection against a single target requires over 5,000 separate movements in x, y, z, t coordinate space. Additionally, the method also holds all relevant measurements, offsets, and integrated equipment data necessary to prevent physical collisions and permit concerted communication between devices.

[0091] "Beads on filter" selections. While the vast majority of manual selections have been performed on nitrocellulose-based filters, a small few have also been performed on solid surfaces, such as beads. A novel selection scheme was developed whereby selection is performed on magnetic beads that are placed on nitrocellulose filters, and washed as the bead is the selection target itself. This method allows for much greater specificity of selection, thereby promoting `winning` molecules to amplify in greater number, and thus reduce the overall amount of rounds necessary to complete the selection procedure. Manual selection does not involve a combination of surfaces to enhance selection. An alternative method is to take the magnetic beads, or nucleic acids attached to beads using methods other than beads, and running buffer over the beads and through a filter. It has been found that a complete filter washing step provides improved performance in the selection due to decreased background. One example of the automation of such a methods would be to remove, e.g., nucleic acids attached to the beads by placing the beads in a 96-well plate with a filtered bottom, the beads washed with buffer followed by subsequent elution of the target nucleic acids.

[0092] Cross-contamination avoidance. The introduction of contaminating species of nucleic acid strands in a manual selection may be commonplace. This is especially true if selection is done against multiple targets in parallel, and also when a researcher reuses the same pool for different selection tools. Contaminating species have been shown in the past to interfere with a manual selection such that it could not be completed. Automated in vitro selection takes steps to minimizing and/or eliminate the possibility of cross-contamination between pools and targets. Movement of the mechanical pod along the work surface is unidirectional when carrying potentially contaminating material. This movement away from `clean` things and only towards items that have already been exposed to replicons greatly diminishes the possibility of cross-contaminating reactions. The only circumstance in which the pod reverses its direction is to acquire a new, clean pipette tip. Additionally, the reagent trays were sealed with aluminum foil for a physical barrier between the environment and unexposed reagents. See FIG. 14, a layout of the robotic work surface that reduces cross-contamination.

[0093] Using this method the present inventors have successfully selected aptamers against a number of protein targets, including Cyt18, lysozyme, the signaling kinase MEK1, Rho from a thermophilic bacteria, and the Herpes virus US11 protein. The robot can perform 6 rounds of selection/day versus individual protein targets, and selections are typically completed within 12-18 rounds. In each instance, selected populations showed a substantially greater affinity for their cognate proteins than naive populations. In addition, when selected populations were sequenced one or more sequence families typically predominated. Sequence families are a hallmark of a successful selection, and indicate that the robotic method faithfully recapitulates manual selection methods.

[0094] The use of beads for target immobilization allows automated selection to be generalized to virtually any target class. For example, small organic molecules could be directly conjugated to beads. Similarly, antibodies could be conjugated to beads and in turn could be used to capture macromolecular structures, such as viruses or cells.

[0095] In another embodiment, the robotic workstation can be used for the selection of nucleic acid catalysts. For example, a DNA library was incubated that contained an iodine leaving group at its 5' end with a DNA oligonucleotide substrate containing a 3' phosphorothioate nucleophile and a 5' biotin. The biotin can be captured on beads bearing streptavidin, and the beads can in turn be captured either by magnetic separation or by filtration. Any molecules in the DNA pool that ligate themselves to the biotinylated substrate are co-immobilized with that substrate. Immobilized species can be directly amplified following transfer to the integrated thermal cycler. The inclusion of a biotin on one of the primers used for amplification allows single-stranded DNA to be prepared by denaturation of the non-biotinylated strand in base, followed by neutralization of the solution. While this method has proved successful for the selection of deoxyribozyme ligases, variations could also have been attempted. For example, the biotinylated DNA oligonucleotide substrate could have been pre-immobilized on beads, and the DNA pool incubated with the beads. In this instance, any molecules in the DNA pool that ligate themselves to the substrate will also be directly captured on the beads.

[0096] The use of beads for catalyst immobilization immediately suggests other selection protocols. For example, nucleic acid cleavases could be selected by first immobilizing a pool on the beads, then selecting for those species that cleave themselves away from the beads. Similarly, nucleic acid Diels-Alder synthetases may be selected by first immobilizing a diene on the beads, creating a nucleic acid pool that terminates in a dienophile, and selecting for those species that most efficiently conjugate the diene and dienophile.

[0097] This method can be applied to the selection of RCANAs. The ability to use a robotic workstation to select for ligases demonstrates that it is possible to select for regulatable ribozymes. For example, the selection protocols described in this invention can be altered so that ligases that immobilized themselves in the absence of a protein effector are removed from the random sequence population, while ligases that subsequently immobilized themselves once a protein effector were added are transferred to the integrated thermal cycler, amplified, and used for additional rounds of selection. This automated selection methods for regulatable ribozymes can readily be extended to other classes or catalysts than ligases, such as cleavases or Diels Alder synthetases by those skilled in the art.

[0098] Automating selection greatly diminishes human error in the actual pipetting and biological manipulations. While programming the robot is often not a trivial task, and can be time-consuming, automated selection is far faster and more efficient than manual selection. The scientist's time is thus put to better use preparing samples and analyzing data, rather than performing the actual selection. Additionally, automated selection may include real-time monitoring methods (e.g., molecular beacons, TaqMan) and software that can make intelligent decisions based on real-time monitoring.

[0099] E. Chip-based RCANA for in vitro detection applications

[0100] Regulatable catalytically active nucleic acids are especially useful for biosensor applications. For example, different protein-regulated catalytically active nucleic acids may be anchored to a surface, such as wells in a multi-well plate. Mixtures of analytes and fluorescently tagged substrates are added to each well. Where cognate effectors are present, the protein-regulated catalytically active nucleic acids will covalently attach the fluorescent tags to themselves. Where protein-regulated catalytically active nucleic acids have not been activated by effectors, the tagged substrates will be washed out of the well. After reaction and washing, the presence and amounts of co-immobilized fluorescent tags are indicative of amounts of ligands that were present during the reaction.

[0101] In one embodiment of the invention, the reporter may be a fluorescent tag, but it may also be an enzyme, a magnetic particle, or any number of detectable particles. Additionally, the protein-regulated catalytically active nucleic acids may be attached to beads or non-covalently linked to a surface rather than covalently linked to a surface.

[0102] One advantage of this method is that covalent immobilization of reporters (as opposed to non-covalent immobilization, as in ELISA assays) allows stringent wash steps to be employed. Additionally, ribozyme ligases have the unique property of being able to transduce effectors into nucleic acid templates that can be amplified, affording an additional boost in signal prior to detection.

[0103] Another advantage is that the analytical methods of the present invention do not rely on binding per se, but only on transient interactions. The present invention requires mere recognition rather than a binding event that must be physically isolated, providing a potential advantage of protein-regulated catalytically active nucleic acids over antibodies. That is, even low affinities are sufficient for activation and subsequent detection, especially if individual, immobilized protein-regulated catalytically active nucleic acids are examined (i.e., by ligand-dependent immobilization of a quantum dot).

[0104] FIG. 16 schematically depicts one way to anchor aptazymes to a chip for a particular embodiment of the present invention. In this schematic, different ribozyme ligases (indicated by different colored allosteric sites) are shown immobilized on beads in wells, and mixtures of analytes (differentiated by shape and color) and fluorescently tagged substrates have been added to each well. In the middle panel of this figure, where cognate effectors are present (same color analyte and allosteric site), the aptazymes will covalently attach the fluorescent tags to themselves. Where RCANA have not been activated by effectors, the tagged substrates are washed out of the well. In the last panel of FIG. 16, after reaction and washing, the presence and amounts of co-immobilized fluorescent tags are indicative of amounts of ligands that were present during the reaction.

[0105] In the embodiment of FIG. 16, the reporter may be a fluorescent tag, but it may also be an enzyme, a magnetic particle, or any number of detectible particles. Additionally, the RCANA could be immobilized on beads, but they could also be directly attached to a solid support via covalent bonds.

[0106] One advantage of this embodiment is that covalent immobilization of reporters allows stringent wash steps to be employed. This can be distinguished from to non covalent immobilization assays such as ELISA assays where stringent washing may destroy the signal. An additional advantage is that ribozyme ligases have the unique property of being able to transduce effectors into templates that can be amplified, affording an additional boost the in signal prior to detection.

[0107] Additionally, the method of the present invention contemplates that the RCANA construct may be amplified by polymerase chain reaction. Finally, the method contemplates that the RCANA oligonucleotide sequence of the construct may include RNA nucleotides, so that the method further includes reverse transcription of the RNA using reverse transcriptase to produce a DNA complementary to the RNA template.

[0108] Modified nucleotides may be introduced into the RCANA that substantially stabilize them from degradation in environments such as sera or urine. The analytical methods of the present invention do not rely on binding per se, but only on transient interactions. The present invention requires mere recognition rather that actual binding, thus providing a potential advantage of RCANA over antibodies. That is, even low affinities are sufficient for activation and subsequent detection, especially if individual immobilized RCANA are examined (i.e., by ligand-dependent immobilization of a quantum dot).

[0109] F. In Vitro Engineering and Selection of RCANAs for In Vivo Applications

[0110] The above discussion has disclosed methods for the in vitro creation of RCANAs, and has disclosed some of their in vitro applications. In the following section we describe the design, engineering, and in vitro selection of RCANAs for in vivo applications.

[0111] This invention utilizes ribozymes that can alter the level of mRNAs in a cellular system. In one embodiment, the ribozyme can be a self splicing intron, such as the group I intron. This ribozyme can be inserted into a gene. If the ribozyme is active, it will catalyze the a self-splicing reaction that removes itself from the gene, allowing accurate expression of the gene. In another embodiment, the ribozyme may be one that acts in trans to cleave a mRNA. Again, changing the activity of the ribozyme will lead to a change in the level of the mRNA in the cell, thereby altering the level of the protein coded by that gene. Those skilled in the art will recognize that other ribozyme activities may be used. For the purpose of illustration, the invention is now described in detail with the use of the self splicing intron.

[0112] The intron is first modified to function as an RCANA. Briefly, the methods described above can be used generate RCANA introns. A pool of potential RCANA introns is created by randomizing one or more regions of the intron. The randomized region optionally includes one or more residues from the catalytic core. A selection protocol is then developed that allows the active RCANA introns to be partitioned from the inactive ones. For example, the active RCANA introns can be partitioned from the inactive RCANA intron based on the mobility in gel electrophoresis. Other methods will be clear to those skilled in the art. Based on this partitioning method, an iterative procedure of partitioning and subsequent amplification of the RCANA introns is used to select RCANAs that are regulated by an effector. With the exception of the partitioning method, this procedure is essentially identical to the selection described about for RCANA ligases.

[0113] As an alternative to the selection of RCANA introns, it is also possible to engineer RCANA introns. For example, one of the stem-loop structures of the intron can be replaced by an aptamer for the desired effector. Interaction of the effector with this engineered RCANA intron-will result in a modulation of the RCANA intron activity. Because an aptamer is different from an regulatory element (as was detailed above), the present method will, in general, lead to RCANAs that are regulated by the effector. However, as will be shown in an example below, an important aspect of the current invention is that this level of regulation can be adequate for in vivo applications.

[0114] G. In Vivo Selection and Optimization of RCANAs.

[0115] Here we disclose methods to generate RCANAs by using in vivo selection. FIG. 4b shows a selection protocol for the Group I P6 RCANA Pool of FIG. 4a. Positive and negative selections are made in vitro to select Group I RCANA that are dependent on activator. The selections are described above in Example 2 for a specific embodiment of the present invention--a theophylline dependent RCANA. In vivo screens and selections are used to select Group I RCANA that exhibit strong theophylline-dependence. The selected RCANA are mixed at various ratios with mutant Group I ribozymes that splice at a low but continuous level to determine the level at which RCANA can be selected against background. Because activation domains are often in the form of a stem-loop, the mutations can be concentrated in a single stem loop structure of the RCANA intron. In an alternate embodiment, the mutations can include catalytic residues. In yet another embodiment, the mutations are randomly dispersed in the intron. Finally, the best theophylline-dependent Group I aptazymes that have been derived by any of the methods described herein may undergo further selection by partially randomizing their sequences and selecting for improved in vivo performance.

[0116] Strategies similar to those depicted in FIGS. 4a and 4b may be used to develop RCANA on any desired effector. Positive and negative in vitro selection such as depicted in FIG. 4b are described above in Example 2 for a specific embodiment of the present invention.

[0117] From 10.sup.6 to 10.sup.10 variants can be efficiently transformed as described herein, sufficient to encompass most variants in the populations discussed so far. This efficiency of transformation, however, is likely to be insufficient to encompass a significant fraction of a completely random pool. Nonetheless, sequences have been selected from completely random expressed pools that can protect bacteria from bacteriophage infection.

[0118] The above procedure described how to select in vivo RCANAs. A similar procedure can be used to optimize engineered RCANAs. Residues in the RCANA that might include the ligand binding region, structural stem-loops, or even catalytic residues can be mutated. The selection procedure described above is then used to select for optimized RCANAs.

[0119] Finally, since the rules that govern Group I intron splicing in different gene contexts are well known to those skilled in the art, the skilled artisan can remove RCANA introns from one context and modularly insert them into other genes. Should the efficiency or effector-dependence of intron splicing be compromised in the new gene, the intron may be reaccommodated to its new genetic environment by a selectable marker to the interrupted gene of interest and selecting for an effector-dependent phenotype.

[0120] To the extent that Group I aptazymes are self-sufficient, they should also function in eukaryotic cells, including human cells. Selecting for effector-dependence may also be performed in eukaryotic cells. Selection in eukaryotic systems may be performed, e.g., by fusing the gene of interest to a reporter gene such as GFP or luciferase. Variants of the RCANA that promote effector-dependent protein production may then be selected using a FACS. A pool of 10.sup.6 to 10.sup.10 variants may be screened by this procedure, a range comparable to the bacterial system previously described.

[0121] H. In Vivo Detection Applications

[0122] Using the present invention, it is possible to activate or repress a reporter gene (e.g., luciferase or GFP) containing an engineered intron in response to an endogenous protein activator, or a post-translationally modified form of an endogenous protein activator (e.g., protein kinases such as ERK 1 and phosphorylated ERK 1). It is also possible to activate or repress a reporter gene (e.g., luciferase or GFP) containing an engineered intron in response to small molecule effectors (e.g., cyclic AMP, glucose, bioactive peptides, bioactive nucleic acids, or low molecular weight drugs such as antibiotics, antineoplastics or the like.). Thus, reporter gene-engineered intron constructs may be used to monitor intracellular levels of proteins, post-translationally modified forms of proteins or small molecules such as cyclic AMP and the like. Such applications may be used for high-throughput cell-based assays and screens for drug leads or for drug optimization and development.

[0123] Bacterial strains such as E. coli, and B. subtilis, or yeast strains such as S. cerevisiae, and S. pombe may be transformed with an expression vector encoding a reporter gene regulated by an intron RCANA, and these engineered microbial cell lines may be used for cell-based assays and tests for drug discovery and development. Similarly, standard mammalian cell lines such as CHO, NIH3T3, 293, and 293T may be transfected with appropriate vectors (e.g., pcDNA, pCMV, or retrovirus), that are engineered to contain RCANA-regulatable reporter genes, and these re-engineered cell lines may be used subsequently for cell-based assays and tests. In another use of the RCANA reporter gene technology, tumorigenic cell lines such as LNCaP, MCF-7, MDA-MB-435, SK-Mel, DL1, PC3, T47D and the like, may be transfected in vitro with appropriate vectors encoding an RCANA-regulatable reporter gene. These re-engineered tumorigenic cell lines may be used in cell-based screens for the discovery and development anti-neoplastic drugs.

[0124] In another in vivo application, reporter gene--intron RCANA constructs (e.g., luciferase or GFP) may be used to generate live animal models for use in drug development. In one embodiment the RCANA-intron construct may be used in an engineered tumorigenic cell line to indicate the levels of a target molecule used to generate a tumor xenograft in nude mice. Mice bearing the tumors derived from the engineered cell line may then be used to screen for drugs that alter the level of the target molecule. For example, a transfected MDA-MB-435 line engineered to express a GFP gene under regulatable control by intron response to the protein activator phospho-ERK 1 is used to screen for drugs which both inhibit tumor growth and block formation of phospho-ERK. In another embodiment of the RCANA intron invention, transgenic mouse models may be generated in which tissue or cell type specific expression of the reporter gene is controlled by the effector activated RCANA intron. For example transgenic mice expressing a phospho-VEGF receptor tyrosine kinase (RTK) specific RCANA regulated GFP gene under control of the MMTV (mouse mammary tumor virus) promoter would show expression of GFP in mouse mammary tissue in a phospho-VEGF RTK dependent manner. Furthermore, these mice may be used to screen compounds for anti-VEGF RTK activity.

[0125] FIG. 5 is a diagrammatic representation of another embodiment of the present invention. Exogenous or endogenous activation of Group I intron splicing is depicted. A reporter gene such as Luciferase or beta-Gal is fused to the gene of interest which also contains the group I intron (td). Splicing-out of the Group I intron is induced by an effector, shown in the diagram as a protein, in this case Cyt18, by the shaded oval. Activation of the RCANA and auto-excision of the intron results in expression of the reporter gene to detect the desired reaction. The use of a reporter gene of this embodiment may be suitable for use in eukaryotic systems.

[0126] FIG. 6 is a diagram of another embodiment of the present invention. Libraries of candidate exogenous activators (E.sub.1-n) may be generated from a randomized RCANA pool indicated by the triangle. As in the embodiment of FIG. 5, a reporter gene is expressed in cells where the exogenous activator activates the RCANA to release the intron from the gene. As will be known to those of skill in the art any number of current and future libraries may be used with the present invention.

[0127] FIG. 7 depicts an alternative embodiment for screening libraries of exogenous activators. In the embodiment of the present invention of FIG. 7, Group I introns are induced into trans-splicing. Extracted and amplified introns are used to transform cells.

[0128] FIG. 8 shows yet another alternative embodiment for screening libraries of exogenous activators of the present invention. In the embodiment of FIG. 8, the effector (shaded oval), shown in this illustration as protein Cyt18, is mutagenized (triangle) to form an effector library. A second effector (E.sub.1-n) interacts with and activates one or more members of the effector library. The effector-effector complex is exposed to the gene containing both the Group I intron and a reporter gene. Cell sorting reveals the cells that express the reporter gene to indicate successful activation of the RCANA by the effector-effector complex.

[0129] FIG. 9 is a diagram of an embodiment of the present invention for screening for endogenous activators. In this embodiment, an endogenous effector, in this illustration shown as a protein activator from endogenous or transformed origin (shaded oval), activates self-splicing of the Group I intron. Cell sorting is used to reveal the expression of the reporter gene. To protect against spontaneous auto-excision of the intron, the gene may be transferred into a different background system such as yeast or E. coli, for example.

[0130] FIG. 10 depicts an alternative to the embodiment of FIG. 9 to screen for endogenous activators of the present invention. In FIG. 10, the activator that is being screened for may include, inter alia, a phosphorylated protein, a product of ubiquitination, or a protein-protein complex. For example, a protein activator, shown as the small shaded oval, may phosphorylate an effector such as Cyt18, shown as a large shaded oval with the phosphorylation indicated by the shaded rectangle. The phosphorylated effector activates intron self-splicing with concomitant expression of the reporter gene, shown here for illustration as Luciferase or beta-Galactosidase.

[0131] FIG. 11 shows yet another embodiment of the present invention to monitor compounds that perturb cellular metabolism. In this embodiment, a ribozyme similar to described in FIG. 6, and designated in this diagram by a line with a triangle is activated by a protein effector, shown as a shaded oval in FIG. 11. The protein effector may be a phosphoprotein, an induced protein, or a protein complex, for example. One or more second effectors, designated as a series of circles, alters the level of or degree of modification of the protein effector. The source of the second effectors may be endogenous or the effectors may be the product of a transformation construct used to transform a cell. Alteration of the level or modification of the protein effector results in an alteration in the expression of the reporter gene (shown as a dark circle with "lightning bolts"). The functioning of the gene of interest may thereby be perturbed, providing information about the function and/or regulation of the gene or gene product. FIG. 11 describes a method for taking the products of the screen described in FIGS. 8 and 10 and using them to monitor cellular or metabolic states.

[0132] FIG. 12 shows a further embodiment of the present invention that provides a non-invasive readout of metabolic states. An RCANA construct of the present invention may be introduced to a gene of interest. A protein suppressor from either an endogenous source from the product of cell transformation activates self-splicing of the RCANA, leading to expression of the endogenous gene, shown here again as a dark circle with lightning bolts. Whether or not the gene of interest is expressed upon release of the RCANA intron from the gene provides information about the metabolic state of the gene of interest. The embodiment of the present invention of FIG. 12 thus provides a non-invasive means to determine the metabolic state of an organism with regard to a gene of interest.

[0133] FIG. 13 depicts a further embodiment of the present invention wherein endogenous suppressors provide a non-invasive readout of multiple metabolic states. Multiple protein activators (endogenous or transformed) are exposed to a pool of Group I introns of the present invention. The pool comprises introns with length polymorphisms that are depicted in FIG. 13 by a discontinuity or break in the line representing the Group I intron (thick line) residing in a gene of interest (thin line). Activation of the RCANA leads to trans-splicing among the various polymorphisms. The products of trans-splicing may be extracted and amplified. Separation of the trans-splicing products by gel electrophoresis provides a read out of the protein function or the metabolic pathway. The readout may even be digitized for analysis.

[0134] I. In Vivo Uses of RCANAs for Gene Therapy

[0135] One important feature of using RCANAs and the method of the present invention for gene therapies is that regulated introns may be used to control gene expression, for any of a variety of genes, since the introns may be inserted into and be engineered to accommodate virtually any gene. Moreover, since the RCANAs may be engineered to respond to any of a variety of effectors, the characteristics of the effector (oral availability, synthetic accessibility, pharmacokinetic properties) may be chosen in advance. The drug is chosen prior to engineering the target of the drug. In part because of these extraordinary capabilities, RCANA provide perhaps the only viable route to medically successful and practical gene therapies. Drugs may be given throughout the treatment (or lifetime) of a patient who had undergone a single initial gene therapy. In addition, by making the gene therapy regulatable, the amount of a gene product may be easily increased or decreased in different individuals at different times during the treatment by increasing or decreasing the doses of effectors.

[0136] The present method also includes transforming a cell with the RCANA construct so that the construct is inserted into a gene of interest. An effector is provided to activate the RCANA so that administering to the cell an effective amount of the effector induces the RCANA to splice itself out of the gene to regulate expression of the gene.

[0137] The method of the present invention contemplates that the RCANA construct may be a plasmid. The method then further includes transforming the cell with the plasmid. The method of the present invention also contemplates ligating the RCANA construct into a vector and transforming the cell with the vector.

[0138] Definitions

[0139] As used herein, the term "regulatable, catalytically active nucleic acid" or "RCANA" means a ribozyme or nucleic acid enzyme that is regulated by an effector. The kinetic parameters of the RCANA may be varied in response to the amount of an effector, which may be an allosteric effector molecule. Just as allosteric protein enzymes undergo a change in their kinetic parameters or of their enzymatic activity in response to interactions with an effector molecule, the catalytic abilities of RCANAs may be similarly modulated by effectors. As demonstrated herein, the effectors may be small molecules, proteins, peptides or molecules that interact with proteins, peptides or other molecules. RCANAs transduce molecular recognition into catalysis upon interaction with an effector that interacts with a portion of the RCANA.

[0140] As used herein, the term "effector," "effector molecule", "allosteric effector" or "allosteric effector molecule" means a molecule or process that changes the kinetic parameters or catalytic activity of an RCANA.

[0141] As used herein, the term "catalytic residue" refers to residues that when mutated decrease the activity of the RCANA. Mutating a residue that affects the catalytic activity of a ribozyme following the selection of the RCANA, may cause different residues to become sensitive to mutation than in the original ribozyme. The relative mutational sensitivity of a given `catalytic residue` may change before and after the selection of the RCANA. These secondary mutations are also encompassed by the present invention.

[0142] As used herein, the term "aptamer" refers to a nucleic acid that has been specifically selected to optimally bind to a target ligand. As described above, it is important to recognize that an aptamer is fundamentally different than an RCANA.

[0143] As used herein, the term "kinetic parameters" refers to any aspect of the catalytic activity of the nucleic acid. Changes in the kinetic parameters of a catalytic RCANA produce changes in the catalytic activity of the RCANA such as a change in the rate of reaction or a change in substrate specificity. For example, self-splicing of an RCANA out of a gene environment may result from a change in the kinetic parameters of the RCANA.

[0144] As used herein, the term "catalytic" or "catalytic activity" refers to the ability of a substance to affect a change in itself or of a substrate under permissive conditions. As used herein, the term "protein-protein complex" or "protein complex" refers to an association of more than one protein. The proteins that make up a protein complex may be associated by functional, stereochemical, conformational, biochemical, or electrostatic mechanisms. It is intended that the term encompass associations of any number of proteins.

[0145] As used herein, the term "in vivo" refers to cellular systems and organisms, e.g., cultured cells, yeast, bacteria, plants and/or animals.

[0146] As used herein the terms "protein", "polypeptide" or "peptide" refer to compounds comprising amino acids joined via peptide bonds and are used interchangeably.

[0147] As used herein, the term "endogenous" refers to a substance the source of which is from within a cell, cell extract or reaction system. Endogenous substances are produced by the metabolic activity of, e.g., a cell. Endogenous substances, however, may nevertheless be produced as a result of manipulation of cellular metabolism to, for example, make the cell express the gene encoding the substance.

[0148] As used herein, the term "exogenous" refers to a substance the source of which is external to a cell, cell extract or reaction system. An exogenous substance may nevertheless be internalized by a cell by any one of a variety of metabolic or induced means known to those skilled in the art.

[0149] As used herein the term "modified base" refers to a non-natural nucleotide of any sort, in which a chemical modification may be found on the nucleobase, the sugar, or the polynucleotide backbone or phosphodiester linkage.

[0150] As used herein, the term "gene" means the coding region of a deoxyribonucleotide sequence encoding the amino acid sequence of a protein. The term includes sequences located adjacent to the coding region on both the 5, and 3, ends such that the deoxyribonucleotide sequence corresponds to the length of the full-length mRNA for the protein. The term "gene" encompasses both cDNA and genomic forms of a gene. A genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed "introns" or "intervening regions" or "intervening sequences." Introns are segments of a gene that are transcribed into nuclear RNA (hnRNA); introns may contain regulatory elements such as enhancers. Introns are removed, excised or "spliced out" from the nuclear or primary transcript; introns therefore are absent in the messenger RNA (mRNA) transcript. The mRNA functions during translation to specify the sequence or order of amino acids in a nascent polypeptide.

[0151] In addition to containing introns, genomic forms of a gene may also include sequences located on both the 5' and 3' end of the sequences that are present on the RNA transcript. These sequences are referred to as "flanking" sequences or regions (these flanking sequences are located 5' or 3' to the non-translated sequences present on the mRNA transcript). The 5' flanking region may contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene. The 3' flanking region may contain sequences that direct the termination of transcription, post-transcriptional cleavage and polyadenylation.

[0152] DNA molecules are said to have "5'ends" and "3'ends" because mononucleotides are reacted to make oligonucleotides in a manner such that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides referred to as the "5'end" if its 5' phosphate is not linked to the 3' oxygen of a mononucleotide pentose ring and as the "3'end" if its 3' oxygen is not linked to a 5' phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid sequence, even if internal to a larger oligonucleotide, also may be said to have 5' and 3' ends. In either a linear or circular DNA molecule, discrete elements are referred to as being "upstream" or 5' of the "downstream" or 3' elements. This terminology reflects the fact that transcription proceeds in a 5' to 3' fashion along the DNA strand.

[0153] The term "gene of interest" as used here refers to a gene, the function and/or expression of which is desired to be investigated, or the expression of which is desired to be regulated, by the present invention. In the present disclosure, the td gene of the T4 bacteriophage is an example of a gene of interest and is described herein to illustrate the invention. The present invention may be useful in regard to any gene of any organism, whether of a prokaryotic or eukaryotic organism.

[0154] The term "hybridize" as used herein, refers to any process by which a strand of nucleic acid binds with a complementary strand through base pairing. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acid strands) is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the melting temperature of the formed hybrid, and the G:C (or U:C for RNA) ratio within the nucleic acids.

[0155] The terms "complementary" or "complementarity" as used herein, refer to the natural binding of polynucleotides under permissive salt and temperature conditions by base-pairing. For example, for the sequence "A-G-T" binds to the complementary sequence "T-C-A". Complementarity between two single-stranded molecules may be partial, in which only some of the nucleic acids bind, or it may be complete when total complementarity exists between the single stranded molecules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, which depend upon binding between nucleic acids strands.

[0156] The term "homology," as used herein, refers to a degree of complementarity. There may be partial homology or complete homology (i.e., identity). A partially complementary sequence is one that at least partially inhibits an identical sequence from hybridizing to a target nucleic acid; it is referred to using the functional term "substantially homologous." The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous sequence or probe to the target sequence under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target sequence which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding, the probe will not hybridize to the second non-complementary target sequence. When used in reference to a single-stranded nucleic acid sequence, the term "substantially homologous" refers to any probe which can hybridize (i.e., it is the complement of) the single-stranded nucleic acid sequence under conditions of low stringency as described.

[0157] As known in the art, numerous equivalent conditions may be employed to comprise either low or high stringency conditions. Factors such as the length and nature (DNA, RNA, base composition) of the sequence, nature of the target (DNA, RNA, base composition, presence in solution or immobilization, etc.), and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate and/or polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency different from, but equivalent to, the above listed conditions.

[0158] As used herein the term "stringency" is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid selections are conducted. With "high stringency" conditions a relatively small number of nucleic acid catalysts will be selected from a random sequence pool, while under "low stringency conditions a larger number of nucleic acid catalysts will be selected from a random sequence pool.

[0159] Numerous equivalent conditions may be employed to comprise low or high stringency conditions; factors such as the length of incubation of the reaction, the presence of competitive inhibitors of the reaction, the buffer conditions under which the reaction is carried out, the temperature at which the reaction is carried out are considered and the hybridization solution may be varied to generate conditions of low stringency selection different from, but equivalent to, the above listed conditions.

[0160] The term "antisense," as used herein, refers to nucleotide sequences that are complementary to a specific DNA or RNA sequence. The term "antisense strand" is used in reference to a nucleic acid strand that is complementary to tile "sense" strand. Antisense molecules may be produced by any method, including synthesis by ligating the gene(s) of interest in a reverse orientation to a viral promoter that permits the synthesis of a complementary strand. Once introduced into a cell, the transcribed strand combines with natural sequences produced by the cell to form duplexes. These duplexes then block either the further transcription or translation. In this manner, mutant phenotypes may also be generated. The designation "negative" is sometimes used in reference to the antisense strand, and "positive" is sometimes used in reference to the sense strand. The term is also used in reference to RNA sequences that are complementary to a specific RNA sequence (e.g., mRNA). Included within this definition are antisense RNA ("asRNA") molecules involved in genetic regulation by bacteria.

[0161] Antisense RNA may be produced by any method, including synthesis by splicing the gene(s) of interest in a reverse orientation to a viral promoter that permits the synthesis of a coding strand. Once introduced into an embryo, this transcribed strand combines with natural mRNA produced by the embryo to form duplexes. These duplexes then block either the further transcription of the mRNA or its translation. In this manner, mutant phenotypes may be generated. The term "antisense strand" is used in reference to a nucleic acid strand that is complementary to the "sense" strand. The designation. (-) (i.e., "negative") is sometimes used in reference to the antisense strand with the designation (+) sometimes used in reference to the sense (i.e., "positive") strand.

[0162] A gene may produce multiple RNA species that are generated by differential splicing of the primary RNA transcript. cDNAs that are splice variants of the same gene will contain regions of sequence identity or complete homology (representing the presence of the same exon or portion of the same exon on both cDNAs) and regions of complete non-identity (for example, representing the presence of exon "A" on cDNA 1 wherein cDNA 2 contains exon "B" instead). Because the two cDNAs contain regions of sequence identity they will both hybridize to a probe derived from the entire gene or portions of the gene containing sequences found on both cDNAs; the two splice variants are therefore substantially homologous to such a probe and to each other.

[0163] "Transformation," as defined herein, describes a process by which exogenous DNA enters and changes a recipient cell. It may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being transformed and may include, but is not limited to, viral infection, electroporation, lipofection, and particle bombardment. Such "transformed" cells include stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome. The term "transfection" as used herein refers to the introduction of foreign DNA into eukaryotic cells.

[0164] Transfection may be accomplished by a variety of methods known to the art including, e.g., calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated transfection, polybrene-mediated transfection, electroporation, microinjection, liposome fusion, lipofection, protoplast fusion, retroviral infection, and biolistics. Thus, the term "stable transfection" or "stably transfected" refers to the introduction and integration of foreign DNA into the genome of the transfected cell. The term "stable transfectant" refers to a cell that has stably integrated foreign DNA into the genomic DNA. The term also encompasses cells that transiently express the inserted DNA or RNA for limited periods of time. Thus, the term "transient transfection" or "transiently transfected" refers to the introduction of foreign DNA into a cell where the foreign DNA fails to integrate into the genome of the transfected cell. The foreign DNA persists in the nucleus of the transfected cell for several days. During this time the foreign DNA is subject to the regulatory controls that govern the expression of endogenous genes in the chromosomes. The term "transient transfectant" refers to cells that have taken up foreign DNA but have failed to integrate this DNA.

[0165] As used herein, the term "selectable marker" refers to the use of a gene that encodes an enzymatic activity and which confers the ability to grow in medium lacking what would otherwise be an essential nutrient (e.g., the HIS3 gene in yeast cells); in addition, a selectable marker may confer resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed. A review of the use of selectable markers in mammalian cell lines is provided in Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, New York (1989) pp.16.9-16.15.

[0166] As used herein, the term "reporter gene" refers to a gene that is expressed in a cell upon satisfaction of one or more contingencies and which, upon expression, confers a detectable phenotype to the cell to indicate that the contingencies for expression have been satisfied. For example, the gene for Luciferase confers a luminescent phenotype to a cell when the gene is expressed inside the cell. In the present invention, the gene for Luciferase may be used as a reporter gene such that the gene is only expressed upon the splicing out of an intron in response to an effector. Those cells in which the effector activates splicing of the intron will express Luciferase and will glow.

[0167] As used herein, the term "vector" is used in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another. The term "vehicle" is sometimes used interchangeably with "vector." The term "vector" as used herein also includes expression vectors in reference to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism. Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.

[0168] As used herein, the term "amplify", when used in reference to nucleic acids refers to the production of a large number of copies of a nucleic acid sequence by any method known in the art. Amplification is a special case of nucleic acid replication involving template specificity. Template specificity is frequently described in terms of "target" specificity. Target sequences are "targets" in the sense that they are to be sorted out from other nucleic acid. Amplification techniques have been designed primarily for this sorting out.

[0169] As used herein, the term "primer" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer may be single stranded for maximum efficiency in amplification but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer and the use of the method.

[0170] As used herein, the term "probe" refers to an oligonucleotide (i.e., a sequence of nucleotides), whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to another oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular gene sequences. It is contemplated that any probe used in the present invention will be labeled with any "reporter molecule," so that is detectable in any detection system, including, but not limited to enzyme (e.g. ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.

[0171] As used herein, the term "target" when used in reference to the polymerase chain reaction, refers to the region of nucleic acid bounded by the primers used for polymerase chain reaction. Thus, the "target" is sought to be sorted oat from other nucleic acid sequences. A "segment" is defined as a region of nucleic acid within the target sequence.

[0172] As used herein, the term "polymerase chain reaction" ("PCR") refers to the method of K. B. Mullis U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,965,188, hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded target sequence.

[0173] To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule. Following annealing, the primers are extended with a polymerase so as to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one "cycle"; there can be numerous "cycles") to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the "polymerase chain reaction" (hereinafter "PCR"). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be "PCR amplified".

[0174] With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; incorporation of .sup.32P-labeled deoxynucleotide triphosphates, such as DCTP or DATP, into the amplified segment). In addition to genomic DNA, any oligonucleotide sequence can be amplified with the appropriate set of primer molecules. In particular the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications.

EXAMPLE 1

GPITH1P6

[0175] Engineering of an RCANA for In Vivo Detection Applications

[0176] The first example illustrates how to make an RCANA construct and demonstrates self-splicing of the RCANA out of a gene in response to an effector molecule.

1 Construction of a RCANA. Oligos GpIWt3.129: 5'-TAA TCT TAC CCC GGA ATT ATA TCC (SEQ ID NO:1) AGC TGC ATG TCA CCA TGC AGA GCA GAC TAT ATC TCC AAC TTG TTA AAG CAA GTT GTC TAT CGT TTC GAG TCA CTT GAC CCT ACT CCC CAA AGG GAT AGT CGT TAG-3' and GpITh1P6.131: 5'-GCC TGA GTA TAA GGT GAC TTA TAC (SEQ ID NO:2) TTG TAA TCT ATC TAA ACG GGG AAC CTC TCT AGT AGA CAA TCC CGT GCT AAA TTA TAC CAG CAT CGT CTT GAT GCC CTT GGC AGA TAA ATG CCT AAC GAC TAT CCC TT-3'

[0177] were annealed and extended in a 30 .mu.l reaction containing 100 pmoles of each oligo, 250 mM Tris-HCl (pH 8.3), 40 mM MgCl.sub.2, 250 mM NaCl, 5 mM DTT, 0.2 mM each dNTP, 45 units of AMV reverse transcriptase (RT: Amersham Pharmacia Biotech, Inc., Piscataway, N.J.) at 37.degree. C. for 30 minutes. The extension reaction was diluted 1 to 50 in H.sub.2O.

[0178] A PCR reaction containing 1 .mu.l of the extension dilution, 500 mM KCl, 100 mM Tris-HCl, (pH 9.0), 1% Triton.RTM. x-100, 15 mM MgCl.sub.2, 0.4 .mu.M of GpIWt1.75: 5'-GAT

2 0.4 .mu.M of GplWtl. 75: 5'-GAT AAT ACG ACT CAC TAT AGG GAT (SEQ ID NO:3) CAA CGC TCA GTA GAT GTT TTC TTG GGT TAA TTG AGG CCT GAG TAT AAG GTG-3', 0.4 .mu.M of GpIWt4.89: 5'-CTT AGC TAC AAT ATG AAC TAA CGT (SEQ ID NO:4) AGC ATA TGA CGC AAT ATT AAA CGG TAG CAT TAT GTT CAG ATA AGG TCG TTA ATC TTA CCC CGG AA-3',

[0179] NO:4), 0.2 mM each dNTP and 1.5 units of Taq polymerase (Promega, Madison, Wis.) was thermocycled 20 times under the following regime: 94.degree. C. for 30 seconds, 45.degree. C. for 30 seconds, 72.degree. C. for 1 minute. The PCR reaction was precipitated in the presence of 0.2 M NaCl and 2.5 volumes of ethanol and then quantitated by comparison with a molecular weight standard using agarose gel electrophoresis.

[0180] The RCANA construct was transcribed in a 10 .mu.l high yield transcription reaction (AmpliScribe from Epicentre, Madison, Wis. The reaction contained 500 ng PCR product, 3.3 pmoles of .sup.32P [.sup.32P]UTP, 1.times. AmpliScribe transcription buffer, 10 mM DTT, 7.5 mM each NTP, and 1 .mu.l AmpliScribe T7 polymerase mix. The transcription reaction was incubated at 37.degree. C. for 2 hours. One unit of RNase free-DNase was added and the reaction returned to 37.degree. C. for 30 minutes. The transcription was then purified on a 6% denaturing polyacrylamide gel to separate the full length RNA from incomplete transcripts and spliced products, eluted and quantitated spectrophotometrically.

[0181] In vitro Assay. The RNA (4 pmoles/12 .mu.l H.sub.2O) was heated to 94.degree. C. for 1 minute then cooled to 37.degree. C. over 2 minutes in a thermocycler. The RNA was divided into 2 splicing reactions (9 .mu.l each) containing 100 mM Tris-HCl (pH 7.45), 500 mM KCl and 15 MM MgCl.sub.2, plus or minus theophylline (2 mM). The reactions were immediately placed on ice for 30 minutes. GTP (1 mM) was added to the reactions (final volume of 10 .mu.l) and the reactions were incubated at 37.degree. C. for 2 hours.

[0182] The reactions were terminated by the addition of stop dye (10 .mu.l) (95% formamide, 20 mM EDTA, 0.5% xylene cyanol, and 0.5% bromophenol blue). The reactions were heated to 70.degree. C. for 3 minutes and 10 .mu.l was electrophoresed on a 6% denaturing polyacrylamide gel. The gel was dried, exposed to a phosphor screen and analyzed using a Molecular Dynamics Phosphorimager (Sunnyvale, Calif.).

[0183] Activation was determined from the amount of circular intron in each reaction. Circularized introns migrate slower than linear RNA and can be seen as the bands above the dark bands of linear RNA in the +Theo lanes of the gels of FIGS. 2a and 2b.

[0184] In vivo Screening of Group I Aptazymes. The RCANA constructs as well as the wild type and a negative control were ligated into a vector that contains the T4 td intron with Eco RI and Spe I flanking the P6 region, transformed and minipreped. The plasmids were then transformed into C600:Thy A Kan.sup.R cells (cells lacking thymidine synthetase). Individual colonies were picked and grown in rich media overnight. Theophylline (1 .mu.l: 6.6 mM) or H.sub.2O (1 .mu.l) was added to 2 .mu.l of the overnight growth and was spotted on either minimal media plates, or minimal media plates with thymine, see FIG. 3.

EXAMPLE 2

GPIP6THPOOL

[0185] In Vitro Selection to Optimize an RCANA for In Vivo Detection Applications

[0186] Example 2 illustrates how to generate a population of RCANA so that there is variation in the nucleotide sequence of the aptamer