WO1999030151A1 - Method and cells to screen combinatorial libraries - Google Patents

Method and cells to screen combinatorial libraries Download PDF

Info

Publication number
WO1999030151A1
WO1999030151A1 PCT/EP1998/007922 EP9807922W WO9930151A1 WO 1999030151 A1 WO1999030151 A1 WO 1999030151A1 EP 9807922 W EP9807922 W EP 9807922W WO 9930151 A1 WO9930151 A1 WO 9930151A1
Authority
WO
WIPO (PCT)
Prior art keywords
polypeptide
target compound
cells
interest
selection
Prior art date
Application number
PCT/EP1998/007922
Other languages
French (fr)
Inventor
Alexander Von Gabain
Aaron Hirsh
Original Assignee
Intercell Biomedizinische Forschungs- Und Entwicklungs Gmbh
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from EP97121451A external-priority patent/EP0922957B1/en
Application filed by Intercell Biomedizinische Forschungs- Und Entwicklungs Gmbh filed Critical Intercell Biomedizinische Forschungs- Und Entwicklungs Gmbh
Priority to AU18769/99A priority Critical patent/AU749042B2/en
Priority to JP2000524659A priority patent/JP2001526054A/en
Priority to CA002310381A priority patent/CA2310381A1/en
Priority to EP98963539A priority patent/EP1036321A1/en
Publication of WO1999030151A1 publication Critical patent/WO1999030151A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B30/00Methods of screening libraries
    • C40B30/04Methods of screening libraries by measuring the ability to specifically bind a target molecule, e.g. antibody-antigen binding, receptor-ligand binding
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1037Screening libraries presented on the surface of microorganisms, e.g. phage display, E. coli display
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1055Protein x Protein interaction, e.g. two hybrid selection
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/02Libraries contained in or displayed by microorganisms, e.g. bacteria or animal cells; Libraries contained in or displayed by vectors, e.g. plasmids; Libraries containing only microorganisms or vectors
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • G01N33/5014Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing toxicity
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • G01N33/502Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing non-proliferative effects
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/569Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
    • G01N33/56911Bacteria
    • G01N33/56916Enterobacteria, e.g. shigella, salmonella, klebsiella, serratia
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6845Methods of identifying protein-protein interactions in protein mixtures

Definitions

  • Combinatorial chemistry (reviewed in Janda, 1994), in which a diverse library of laboratory-constructed molecules is screened for structures that exhibit properties of interest, is a relatively novel and potentially powerful technology, promising useful application in three general areas.
  • combinatorial chemistry may be used to identify molecules with highly specific adhesive or enzymatic activities useful in biomolecular research, clinical therapy, or diagnostics.
  • Second, by screening a library for molecules that interact specifically with a known compound of significant interest, combinatorial chemistry may assist in identification of the known compound's native interactants (e.g. Phizicky.et al . , 1995) .
  • combinatorial chemistry may be used for epitope mapping, serving to elucidate the surface of structural interaction between two molecules of interest (e.g. Phizicky et al., 1995) .
  • polypeptide combinatorial chemistry and polypeptide interaction identification that resolves or circumnavigates many of the difficulties encountered by other technologies.
  • polypeptide is used herein to comprise all types of peptides or proteins, including glycoproteins, proteoglycans, lipoproteins, acetylated proteins or (other) processed peptide or protein forms.
  • a method for identifying and isolating at least one of the following: a polypeptide of interest, a target compound or a gene of a polypeptide of interest or a gene of a target compound, wherein at least one of said polypeptide of interest or said target compound is unknown comprising the steps a) constructing a cell population by engineering it to express a membrane polypeptide which has said polypeptide of interest in its extra- or intracellular domain, said extra- or intracellular domain being exhibited in one or more copies on each cell's surface directly or indirectly interacting with a selection site specific for a selection agent, such that upon binding of said target compound to said polypeptide of interest said selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified in a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding, b) exposing said cell population to said target compound or a population of said target compounds for a
  • a library of polypeptides is exhibited in one or more extra- or intracellular domains of a membrane polypeptide expressed by a cell population.
  • Selective pressures exerted by selection agents eliminate those cells whose polypeptide sequences fail to bind to a preselected target compound.
  • cells whose sequences bind the target compound are selected and e.g. amplified by growth.
  • polypeptide imprinting is used to refer to the use of the method according to the present invention to select cells exhibiting polypeptides that bind the target compound or exhibit specific properties.
  • prokaryotic and eukaryotic polypeptides may be imprinted, serving e.g. creation or discovery of polypeptide sequences with high affinity for a predetermined target, interaction studies, or epitope mapping.
  • polypeptides may be imprinted to exhibit the following specific properties: multivalent binding to a predetermined target; high affinity for a target and low affinity for one or more molecules closely related to the target; inhibitory activity in a biological pathway (antagonism) ; excitatory activity m a biological pathway (agonism) ; or enzymatic activity.
  • the present invention may be performed by displaying the polypeptide to be imprinted (the polypeptide of interest) in the extra- or intracellular domain of a membrane protein.
  • This invention is the product of the confluence of three distinct areas of research: i. the study of prokaryotic and eukaryotic cellular membranes and their complex constituents; ii. the study of the evolution of microbial populations under the selective pressures of toxins, bacteriocins, or viral parasites; and iii. the design and screening of polypeptide combinatorial libraries.
  • a polypeptide with promising properties can immediately be identified, amplified and isolated, making the present method extremely sensitive. It is possible to select one polypeptide that binds to a ligand out of a background of 10 6 non specific polypeptides as a minimum, but in there is no evidence for an upper limit on sensitivity regarding the present method per se.
  • the present method all cells of the combinatorial library of polypeptides and/or target compounds are screened at once, rather than serially, providing results in far less time than many alternative technologies.
  • the present invention is easy to use, because the important parameters are easily adjustable for the man skilled in the art. Once a set of cells is selected, the genes for the polypeptides or the target compounds (if these are also expressed by the cells) are easily isolated.
  • the polypeptide of interest may be displayed by any of the wide ranges of cell types known and used for displaying foreign proteins in extra- or intracellular domains (e.e. on the cell surface) . This allows the present method to avoid the biases of any one expression system.
  • the polypeptide is preferably expressed in a membrane protein which is biochemically appropriate for the target compound or for the system wherein the isolated polypeptide/target pair is intended to be applied (e.g. as a vaccine or as a therapeutic) .
  • Combinatorial chemistry which is one of the major components of the present invention, consists of two or three steps, often performed in iterative cycles.
  • the first step is the generation of a diverse population of molecules, the so- called combinatorial library.
  • the molecules in the combinatorial library are partitioned according to their exhibition of properties sought by researchers, such as affinity for a particular compound or enzymatic activity.
  • the molecules found to exhibit the desired properties are replicated or resynthesized.
  • the cycle may then be repeated, beginning either at the first step, with the introduction of new variation to the molecular population, or at the second step, with the partitioning of the population according to the exhibition of desired properties.
  • the immune system is essentially a genetically coded combinatorial library, complete with steps of variation, selection, and replication: A vast diversity of antibodies is displayed on the surface of B-cells (variation) , and specific recognition of antigen by any given cell triggers that cell's proliferation (partitioning and replication) . It is not surprising, then, that the first in vitro genetically-coded combinatorial libraries were directly appropriated from the immune system: B-cells were immortalized through hybridization to mouse myeloma lines (Marks et al . , 1991) . Later, in order to circumnavigate difficulties of hybridization, and to utilize a greater part of the antibody repertoire, E. coli were engineered to express and secrete the variable chains of antibodies.
  • Combinatorial libraries of fusion-phage are composed of filamentous bacteriophage or phagemid that have been engineered to bear semi-random exogenous proteins as sandwich fusions near the end of the adsorption protein pill (Janda, 1994, Marks et al., 1991) .
  • the method of screening these libraries for phage that exhibit affinity for a target compound is known as "affinity purification": the target is immobilized on a solid surface or column, over which the phage library is washed; phage bound to the target are eluted, amplified by growth in bacterial culture, and then subjected to yet another round of partitioning. This cycle is usually iterated until a tractable number of clones are isolated.
  • Phage libraries have been used for epitope mapping (Phizicky et al., 1995) -viz. pinpointing the region of a larger protein that is involved in binding a target compound of interest-as well as for deriving novel ligands to targets. Entire single- chain variable-fragment antibodies (scFv's) have been inserted into phagemid pill (Marks et al . , 1991), and libraries of scFv's or semi-random oligopeptides have been screened for binding to a variety of medically significant polypeptides, nucleic acids, and molecules not previously known to bind proteins .
  • scFv's single-chain variable-fragment antibodies
  • fusion phage technology has been responsible for significant strides forward in combinatorial chemistry - such as the recognition that "tagged" libraries would permit screening of a vastly increased number of sequences - phage libraries themselves have not escaped several besetting difficulties.
  • affinity purification of phage libraries factors besides affinity for the target compound contribute significantly to each clone's selection.
  • the collection of clones selected from a single library by affinity purification can vary dramatically with the technique used for eluting phage from the solid substrate of target compound (Phizicky et al . , 1995) .
  • phage adsorption protein, pill may fail to provide exhibited proteins with a sufficiently stable platform. Many polypeptides acquire structural rigidity only in certain constraining conformations; otherwise they remain extremely plastic. Therefore, to be functionally active structures, many polypeptides will require a foundation other than pill .
  • LamB intercalates the outer membrane even when exogenous polypeptide sequences have been inserted into one or more of the protein's extracellular domains.
  • the E. coli population was placed in a suspension of iron oxide particles, binding was allowed to occur, and a magnet was used to recover bacteria that had adhered to metal (Brown, 1992) .
  • the selected portion of the population was amplified by bacterial growth, and the partitioning procedure was repeated. After four cycles of selection and amplification, fresh variation was introduced according to a strategy that would not erase sequence- information accrued thus far. Another four cycles of selection and amplification then yielded four clones, three of which bore a consensus sequence for binding iron oxide.
  • the combinatorial library m LamB introduced the promising possibility of using bacterial outer membrane proteins as platforms for presenting libraries of diverse polypeptides
  • the selection procedure applied to the library was labor-intensive and very limited in its applicability.
  • the target compound, iron oxide was conveniently chosen to allow screening for magnetism - a procedure that could not be applied to screening for the ma-jonty of interesting interactions that polypeptides exhibit.
  • the present invention offers a user-friendly, efficient, and precise method of selection, capable of screening libraries of all sorts of polypeptides presented on the platform of a membrane polypeptide .
  • membrane polypeptides bounding any prokaryotic or eukaryotic cell present an extremely complex surface to the extra- or intracellular environment.
  • These molecules - referred to here as membrane polypeptides - exhibit a vast range of distinctive conformational, chemical, and electrical properties.
  • the membrane polypeptides are here categorized into several structural families. The conformational and functional differences between these families are relevant to this invention because membrane polypeptides provide the underlying structures that are used to display exogenous sequences, and are selected for interaction with the target compound.
  • this invention often capitalizes on the intrinsic properties of membrane polypeptides by imprinting them to exhibit properties or perform functions that are similar to their natural ones.
  • the deliniation of structural families presented here should also provide a profile of the vast range of properties exhibited by polypeptides on the cell membranes, for it is in part this range that makes the capabilities of this invention so broad.
  • Single-pass transmembrane polypeptides (reviewed in Singer, 1990) , also called bitopic polypeptides, have one intra- and one extra-cellular domain, and a midsection that passes through the membrane. The terminal domains fold into tertiary structures, while the transmembrane section often forms a hydropathic ⁇ -helix, in which no more than 2 of the 20 transmembrane residues may be ionic. Singer (Singer, 1990) divides bitopic polypeptides into types la, lb, and II (figure 1) . Types la and lb exhibit the amino terminus on the extracellular side of the outer membrane.
  • Type la polypeptides are ubiquitous, and in eukaryotes they are the predominant type. Because the transmembrane sequence of type la polypeptides is typically closer to the carboxyl terminus, the extracellular domain is often large. Consequently, type la polypeptides are competent for functions involving high-affinity adhesion or highly specific binding to especially extracellular targets.
  • type la polypeptides are receptors for growth factors and for peptide hormones; immune system antigen recognition molecules; and molecules responsible for cell adhesion (Singer, 1990, Staunton et al . , 1990) . Proclivity for adhesion or highly specific binding to extracellular targets make type la polypeptides suitable for use in many embodiments of this invention, especially those in which a polypeptide is imprinted to bind to an extracellular target.
  • Type lb polypeptides do not have a cleavable signal sequence.
  • the transmembrane stretch is typically close to the amino terminus, and therefore there are few extracellular residues (Singer, 1990) . This makes type lb polypeptides suitable for sophisticated cytoplasmic functions, and interactions, but poorly suited for interactions with targets in the extracellular medium.
  • Type II bitopic polypeptides are oriented with their carboxyl terminus on the exterior side of the membrane (figure 1) . As in type la bitopic polypeptides, the bulk of type II molecules is usually on the exterior of the cell. Type II polypeptides are typically enzymes with extracellular active sites, such as glycosyl transferases, or highly specific extracellular receptors, such as the asialoglycoprotein receptor (Singer, 1990) . Competence for extracellular enzymatic or receptor activity makes type II bitopic polypeptides suitable for use in certain embodiments of this invention, in which extracellular domains are imprinted to bind or enzymatically modify targets in the extracellular medium.
  • Multipass transmembrane polypeptides (reviewed in Singer, 1990), also called polytopic polypeptides, traverse the lipid bilayer in two to fifteen or more passes. Like thread stitched into cloth, the polypeptide begins on one side of the membrane, passes through it, loops into the space on the other side of the membrane, then once again traverses the membrane to expose either its end or yet another loop leading to more transmembrane passes (figure 1) . Although referred to and often illustrated as "a loop", each exposed region of a multipass polypeptide exhibits not only a unique sequence of amino acids, but also a distinctive tertiary structure.
  • multipass ones are suspended in the lipid bilayer by the hydrophobicity of transmembrane strands.
  • these strands do not invariably twist into ⁇ -helices, because when multiple strands neighbor one another, they can adopt an alternative strategy for hydrogen bonding between peptide bonds: if each strand adopts the secondary structure of a ⁇ - strand, these strands can curl together into a closed cylinder, allowing hydrogen bonds to form between the peptide bonds on separate strands.
  • the resultant tertiary structure is called a ⁇ -barrel (figure 1) . It is extremely stable, and can be released from the outer membrane without denaturing tertiary structure.
  • the extremely stable, polytopic ⁇ -barrel OmpA is imprinted to bind to various targets.
  • Polytopic polypeptides that traverse the membrane in ⁇ - helical conformation often function as specific receptors for messengers or hormones, such as yeast mating pheromone and many others. These and other ⁇ -helical polytopic receptors are categorized by Singer (Singer, 1990) as type III polypeptides. Their capacity to bind target molecules specifically, and sometimes multivalently, makes ⁇ -helical polytopic receptors ideal for use in many embodiments of this invention. Moreover, selection systems according to the present invention are easy to construct when the membrane polypeptides involved in the selection are polytopic.
  • An anchor consists of a lipid molecule, such as dimyristyl glycerol or alkylacyl glycerol, adorned by a multi-residue sugar complex, which is, in turn, covalently linked to the membrane polypeptide.
  • glycolipid anchor a well-defined ammo terminal signal sequence, which enables transport into the lumen of the endoplasmic reticulum, and a carboxy terminal glycolipid- addition sequence, which is not precisely defined but generally consists of a short hydrophobic domain located 10-12 residues carboxy terminal to a pair of small residues (Cross, 1990) .
  • carboxy terminal glycolipid- addition sequence which is not precisely defined but generally consists of a short hydrophobic domain located 10-12 residues carboxy terminal to a pair of small residues (Cross, 1990) .
  • glycolipid-anchored outer membrane proteins cannot perform cytosolic or periplasmic functions. Instead, they provide protection from the extracellular environment, or function as enzymes or adhesms recognizing exclusively extracellular substrates.
  • At least six leukocyte surface proteins, many other eukaryotic proteins belonging to the immunoglobulin superfamily (Cross, 1990) , and the trypanosome variable surface glycoprote (the protective protein that the trypanosome sequentially alters to avoid detection by the host immune system) are anchored by glycolipids.
  • the diverse array of surface- exposed proteins is adorned by yet another tier of complex molecules with distinctive properties, namely, the saccha ⁇ des (reviewed Lis et al . , 1993).
  • the oligosaccharides adorning glycoprote s generally consist of less than 15 residues. However, owing to the diversity of residues and covalent linkages that permit assembly into branched as well as linear structures, oligosaccharides exhibit thousands of distinct structures.
  • the polysaccharides of integral membrane proteoglycans are larger than oligosaccharides, and can contain dozens of residues.
  • the oligo- or polysaccharide molecule attached to a particular site in a polypeptide is not a simple function of the local ammo acid sequence.
  • the same polypeptide sequence receives distinct saccharide chains when expressed in different cells, and separate copies of a polypeptide expressed in a single cell type can have different sugars attached at a given site. Because sugar residues are sequentially appended, each by a particular enzyme to which the protein is exposed in the course of the processes of translation, folding, and export to the outer membrane, the glycosylation at a site is largely a product of the extent and duration of the site's exposure to each enzyme.
  • Exposure of a site is, in turn, affected by the way a protein folds and the rate at which it moves through the steps from translation to insertion. Consequently, the glycosylation' at a site can be altered in unpredictable ways by changes to the local sequence or to the sequence elsewhere in the polypeptide (Lis et al . , 1993) .
  • glycoproteins and proteoglycans are involved in the intercellular recognition and adhesion underpinning sperm-egg interactions, blood clotting, and inflammation (Lis et al., 1993, Singer, 1990). The roles they play, via adhesion, in these important processes make the glycoproteins and proteoglycans ideal for use in certain embodiments of this invention.
  • the membrane protein is an ⁇ -helical polytopic receptor, a glycolipid-anchored polypeptide, a polytopic integral membrane polypeptide, a membrane-bound isoform of a secreted polypeptide, a bitopic integral membrane polypeptide, a glycoprotein or a proteoglycan .
  • insertion of polypeptides into the membrane and translocation of secreted polypeptides across the membrane are performed by the same cellular machinery (Singer, 1990) .
  • integral membrane polypeptides inserted into the endoplasmic reticulum (ER) have the same cleavable or noncleavable amino terminal signal sequences as completely translocated polyeptides. This is consistent with the fact that insertion and secretion depend on the same cytoplasmic signal recognition protein (SRP) , which transfers both inserted and secreted polypeptides to the SRP receptor on the ER membrane.
  • SRP cytoplasmic signal recognition protein
  • type III polypeptide intercalation into the membrane also has implications that are significant to this invention.
  • the cellular machinery that inserts bitopic polypeptides and translocates secreted ones is also responsible for intercalating ⁇ -helical multipass polypeptides into the outer membrane. This is clear from studies that show that l) type III polypeptides sometimes have the same signal sequences as bitopic and secreted polypeptides, and n) membrane intercalation of type III polyeptides requires the same gene products as insertion of bitopic polypeptides.
  • inducting gene fusion have shown that the cellular machinery for inserting polypeptides in the outer membrane intercalates most type III polypeptides by translocating one hydrophilic extracellular loop at a time. The polypeptide is thus intercalated as a series of independently inserted segments (Smger, 1990) .
  • Most embodiments of this invention require a topological map of the imprinted polypeptide, and a technique for introducing new sequences to specified extracellular domains of the imprinted polypeptide. Both of these needs are met by gene fusion techniques.
  • each embodiment of this invention also requires a technique for inserting exogenous peptide sequences into one or more extra- or intracellular sites in the imprinted polypeptide.
  • exogenous peptide sequences may replace existingsections of the extra- or intracellular domains of membrane.
  • oligonucleotides rich m restriction endonuclease sites were inserted into the ompR gene at sites corresponding to protein loops II and IV, and the altered protein was expressed and exhibited normally (Freudl, 1989) .
  • influenza haemaglutmm, beta-galactosidase, foot and mouth disease viral protein 1 (FMDV-VP1), malaria antigens, and several other antigenic polypeptides up to 56 ammo acids in length were successfully displayed m OmpA loops III and IV (Georgiou et al., 1997).
  • two lines of evidence demonstrated that these exogenous polypeptides adopted stable folded structures.
  • extracellular domains of a number of membrane proteins accept exogenous peptide sequences without exhibiting disturbance to normal sorting and insertion (Boyd, 1994, Boder and Wittrup, 1997).
  • Higher eukaryotic membrane polypeptides have also exhibited the capacity to accept foreign inserts at extracellular domains.
  • gene fusion studies may insert extra glycosylation sites to function as reporters: only polypeptide domains on the luminal side of the ER are exposed to glycosylating enzymes, and luminal domains become extracellular domains.
  • Extra glycosylation sites were inserted into extramembranous regions of mouse muscle nicotinic acetylcholine receptor: the inserted sites did not disrupt normal sorting or insertion of the outer membrane polypeptide; and, sites inserted in extracellular receptor domains were in fact glycosylated (Chavez et al . , 1991).
  • gene fusion mapping techniques contribute to this invention in four ways.
  • these techniques have been widely used to determine the topology of countless outer membrane polypeptides, many of which are described above and may be imprinted by this invention.
  • these techniques may be used to provide topological maps for polypeptides that have not yet been mapped, but may be candidates for imprinting by this invention.
  • gene fusion techniques provide this invention with commonly practiced methods for inserting an exogenous peptide sequence into a specified region of a membrane polypeptide; the use of these techniques in the invention, as well as the straightforward extension that enables the insertion of variable sequences, is described below.
  • secreted and membrane-bound isomorphs can use the same amino terminal signal sequence, so in many cases, the only alteration required for conversion of a polypeptide to its isomorph is a change at the polypeptide ' s carboxy terminal. Attachment of a 20 residue hydrophobic sequence to the carboxy terminal of a secreted polypeptide is often sufficient to construct an isomorphic polypeptide that is anchored by a transmembrane span; this technique for converting a- secreted polypeptide into a membrane-bound one is described in the literature (Singer, 1990) .
  • transmembrane or secreted polypeptides can be converted to glycolipid anchored variants by addition of carboxy terminal signal sequences from the precursors of glycolipid anchored polypeptides; this technique is also described in the literature (Cross, 1990).
  • a glycolipid anchor not only attaches an exogenous polypeptide to a cell membrane, but also directs sorting of the anchored alien polypeptide.
  • a single glycolipid anchor sequence is all that is required to direct an exogenous polypeptide to the membrane, translocate it, and anchor it (Cross, 1990) .
  • two or more outer or inner membrane polypeptides are fused to construct a chimeric multipass polypeptide comprising extra- and/or intracellular domains of both constituent polypeptides.
  • This manipulation has been successfully performed with a variety of prokaryotic and eukaryotic polypeptides.
  • E. coli ompF-ompC chimeric protein 3601 the region of ompF comprising extracellular loops 1 through 5 is fused to the region of o pC comprising extracellular loops 6 and 7.
  • the extra- or intracellular domain of the membrane polypeptide contains a highly variable region, which is, upon binding to a target compound, identified and isolated as the polypeptide of interest or a section thereof.
  • a target compound identified and isolated as the polypeptide of interest or a section thereof.
  • the membrane polypeptide can be a homologous protein for the cell (adapted for the present method) or - which is preferred - a heterologous polypeptide for the host cells.
  • the membrane polypeptide is genetically engineered to contain the variable domain and the selection site residing adjacent to each other on the cellular surface.
  • the target compound can be any material, e.g. chemically produced substances, proteins, nucleic acids, complex products derived from biological sources or mixtures or (combinatorial) libraries of these materials.
  • Preferred substance classes for the target compound are known drugs (or derivatives thereof) (e.g. to investigate further binding patterns of such compounds to polypeptides) , polypeptides (to identify and isolate protein/protein interactions) , antibodies or antigens, vaccines or pathogens, etc..
  • the present invention is performed using an intracellular domain of the membrane protein, i.e. the polypeptide of interest is displayed at the inner surface (including the periplasma) , it is essential for the target compound to be present inside the cell. This can be achieved by transportation or diffusion of the target compound from outside into the cell (which is, however, only possible with a few or small target compounds.
  • An alternative way of bringing also the target compound into the cell is by coexpression of the polypeptide of interest and the target compound the same cell. Systems to enable such a coexpression of two (or more) exogenous proteins one and the same cell are known m the art. It is clear that this way of performing the invention is restricted to target compounds that can be produced by the host cell, e.g. proteins or nucleic acids.
  • any suitable selection system known m the art may be adapted to the method according to the present invention.
  • These selection systems are known to the skilled man in the art in principle, the adaption of any suitable selection system to the system according to the present invention will be explained below.
  • a critical point for the present invention is how the binding of the target compound with the polypeptide of interest influences the selection site:
  • the binding between the polypeptide of interest-portion of the membrane polypeptide and the target compound results m a signal which influences the status of the selection site.
  • This influence may be a direct or an indirect interaction of the membrane polypeptide or the bound target compound with the selection site.
  • a direct interaction may e.g. be performed by a ste ⁇ cal hindrance of the selection site upon binding of the target molecule, i.e. the site, where the selection agent should bind, is not accessible anymore, because of the physical presence of the target molecule.
  • such a sterical hindrance may be induced by a conformational change of the membrane polypeptide upon binding of the target compound to the polypeptide of interest-portion of the membrane polypeptide.
  • the selection site is present m the vicinity of the membrane polypeptide, e.g. by covalent bond to the membrane polypeptide.
  • the selection site can also be a part of the membrane polypeptide itself. In such a case, the sterical properties of the system are clearly defined and easily adjustable by common protein engineering.
  • a preferred embodiment of the present invention is a method, wherein the interaction between the extra- or intracellular domain with the polypeptide of interest and the selection site is a sterical interaction, especially a sterical interaction which does not allow the selection agent to interact with the selection site after binding of the target compound to the polypeptide of interest.
  • the selection site is immediately adjacent (especially via covalent bonds) to the extra- or intracellular domain of the polypeptide of interest.
  • the selection site is an adhesion domain for the selection agent, e.g. an adhesion domain which is also part of the membrane polypeptide displaying the polypeptide of interest.
  • the membrane protein exhibits an additional domain on the surface of the cell, which additional domain is adjacent to both, the variable domain and the selection site and this additional domain also interacts, preferably by binding, with the target compound.
  • Indirect interaction can be performed e.g. by messenger molecules, which are released upon binding and induces a change of the selection site by binding or by contact. It is essential that the change in selection specificity of the cell is in direct correlation to the binding of the target compound to the polypeptide of interest, i.e. the binding immediately induces the (direct or indirect) interaction with the selection site.
  • the mode of selection in the method according to the present invention is not critical, positive and/or negative selection protocols are applicable.
  • the selection agent is a toxic agent.
  • the polypeptides of the (outer) membrane are ports of adhesion for an array of toxic agents that kill or retard multiplication of the cells they enter.
  • the term "toxic agent” denotes any agent that impairs or inhibits growth of the cells.
  • Most of the classical selection systems applied molecular biology fall under this type of selection system. Due to its nature, this selection system is preferably applied the pupilextracellular" embodiment of the present invention, wherein the polypeptide of interest is located m an extracellular domain of the membrane polypeptide.
  • Binding by the agent to the adhesion domain is the first step m a process that culminates in cell lysis or inhibition of cell growth and multiplication. If this first step is impeded by mutations m the adhesion domain, by repression of the receptor, or by the interference of an inhibitor occluding the adhesion domain, a selection agent's invasion of the cell is blocked.
  • the cell is rendered immune.
  • the cell may become resistant but not immune: reduction of the number of receptors may slow the entry of agents, and thus reduce the rapidity or intensity of the agent's harmful effects.
  • a selection agent may be employed that has been alterered such that it is different from the corresponding naturally occurmg agent with regard to specificity and mode of action on the cell.
  • pathogenic agents like bacteriocins and bacteriophages - for which the above mentioned pathway via an adhesion domain is quite common - can be altered such that they change their preference for a specific adhesion domain.
  • design strategies to make pathogens adhere to a great variety of adhesion domains.
  • the selection agent is a pathogenic agent, such as a bacteriocm, a bacteriophage or a toxin or virus of an eukaryotic cell.
  • Bacteriocins are bacterially produced proteins that are harmless to the strains that express and secrete them, but fatal to other, often closely related, bacteria (bacteriocins are reviewed m Pattus et al., 1990) .
  • the most extensively studied of the bacteriocins are the colicins, which are expressed by E . coli , and are encoded by plasmids that also code for the proteins that confer colicm immunity upon colicm-producmg cells.
  • the colicins are composed of three structural domains, each of which is responsible for one of three consecutive steps of colicm activity: l. recognition of an adhesion domain; n. translocation across the cellular membrane; and in. cell destruction (Pattus et al . , 1990).
  • the colicins kill by two principal mechanisms: l. the formation of a pore m the cytoplasmic membrane, resulting m membrane permeabilization and de-energization (colicins El, A, B, la, lb, and K) ; n. enzymatic cleavage of DNA or 16S ribosomal RNA (colicins E2 and E3) (Pattus et al . , 1990).
  • the kinetics of cell death when bacteria are exposed to colicins suggest that the toxic molecules enter the cell in discrete, fatal doses, which may comprise only a single molecule, but may also consist of a number of active molecules imported one concerted step (Pattus et al . , 1990) .
  • the important point here is that a single dose kills a single cell.
  • the adhesion domain of colicm A is somewhat different. Loops 1 through 5 are required for colicinA to bind its receptor, while L6 and L7 are superfluous (Fourel et al., 1990).
  • the chimeric ompF-ompC protein 3601 which was described above, comprises loops 1-5 from ompF and loops 6 and 7 from the distinct ompC .
  • E. coli SM1005 expressing 3601 are totally susceptible to colicm A, but totally immune to colicm N.
  • the number of receptors per cell is important for an easy handling of this invention's selection of specific cells from a highly variable population.
  • the functional receptors of colicins are generally present on the surface of the cell thousands of copies.
  • Bacteriophages are a ubiquitous and extraordinarily diverse group of viral parasites of bacteria. There are twelve families of phage, comprising 3500 distinct isolates of known morphology, and infecting over 100 bacterial genera. Their vast variation is evident even at the level of nucleic acid: depending on species, phage contain single stranded DNA, double stranded DNA, single stranded RNA, or double stranded RNA. A wide range of morphologies is usually divided into four categories: tailed; cubic; filamentous; and pleomorphic. Extensive reviews of morphology, genetics, ecology, and life- cycle are available in the literature (Calendar, 1988).
  • phages bind to specific adhesion domains of receptors on the host surface.
  • a single outer membrane protein may be the receptor for multiple phage species, whose adhesion domains overlap to varying extents.
  • Figure 3 shows the adhesion domains of various phages, as well as one of the colicins, that utilize E. coli outer membrane protein A as their receptor. The figure also shows the receptor domains that are manifestly not utilized by the phages: alterations to these regions of the receptor do not affect phage adsorption, and thus do not confer phage- resistance to the bacterial cell.
  • Phage receptors like those of bacteriocins, are often present m 100s to 1000s of copies per cell, but this number can be manipulated through use of a plasmid-encoded receptor regulated by an mducible heterologous promoter. As was stressed above, regulation of the level of expression of the imprinted polypeptide is important to this invention. Protocols for the construction and manipulation of plasmids with inducible promoters are described m the literature.
  • phage multiplication commences.
  • the strategies for multiplication, and the effects of multiplication upon the host vary widely across phage species. Immediately following infection, some but not all species of phage confer immunity to subsequent infections by other individuals of the same phage species. If the infecting phage is a virulent one, it quickly modifies normal bacterial syntheses in order to appropriate them to its own multiplication and release into the extracellular space. Distinct strategies for release have very different effects upon the host.
  • the Inoviridae (a family of filamentous phages) are liberated by extrusion: new phage are secreted through the bacterial membrane (Calendar, 1988).
  • the bacteriophages exert such a wide range of effects upon their hosts that even among phage species using the same receptor there may be several different consequences of infection. Moreover, the effects of a phage upon its host can be manipulated. Mutagenesis followed by artificial selection for phage virulence or benevolence can intensify or mitigate a phage ' s deleterious effect on its host. Insertion or deletion of phage genes, including insertion of genes that confer metabolic capabilities or antibiotic resistance upon infected bacteria, are commonly practiced means of manipulating the impact that phage have upon their bacterial hosts. Another way to modify a phage ' s effects upon its host is to alter the host. For example, certain groEL mutations m E .
  • coli produce bacterial strains that are susceptible to T4, T5, lambda, and 186 phage infection, but do not permit the intracellular assembly of mature phage particles (Zeilestra-Ryalls et al . , 1993) .
  • infected bacteria are retarded, or even killed, but they do not yield infectious phage particles; m short, these bacteria are a dead-end for the phage that infect them.
  • the effects of phage upon their hosts can be modified by the preparation of phage ghosts (phage depleted of DNA) , by methods described the literature (Boulanger et al . , 1988) . Because they do not carry a genome, phage ghosts do not infect cells.
  • Toxins and viruses of eukaryotic cells recognize specific adhesion domains of their receptors on eukaryotic cell- surfaces.
  • the receptors of at least thirty different species of virus of eukaryotic cell lines, representing eleven different viral families, are described in the literature, and for most of these pathogens, the viral adhesion domains, which may comprise oligosaccharides, glycosylated peptide sequences, or sequences unadorned by sugars, have been characterized (Tyler et al., 1996).
  • the receptors of eukaryotic pathogenic agents like those of bacteriocins and bacteriophage, are commonly cloned onto vectors and expressed from inducible promoters, allowing manipulation of the number of receptors expressed per cell (Staunton et al . , 1990).
  • HRVs Major group human rhinoviruses
  • HRVs provide an example of the pathogenic agents that affect eukaryotic cells and are suitable for use in this invention.
  • HRVs are members of Pi cornaviridae, small protein-encapsidated viruses with RNA genomes. Of 100 serotypes of rhinovirus, 89 use an extremely well-characterized glycoprotein, ICAM-1 (CD-54) as their receptor. HRV serotype 14 is well studied , easily propagated in the lab, and therefore suitable for use in this invention (Staunton et al . , 1990, Couch, 1996).
  • Major group HRVs grow efficiently in many different lines of human cell culture. The most commonly used tissues are WI-38 diploid fibroblast, fetal tonsil, MRC, HeLa and COS cell lines.
  • HRVs form plaques on monolayers of a variety of cell lines exposed by any one of the described techniques for exposure to viral agents. Infected cells yield 10 to 200 plaque forming units per cell after 10 to 12 hours of incubation (Couch, 1996) .
  • the structure of HRV14 has been solved by X-ray crystallography (Staunton et al., 1990) .
  • the capsid is 30 n in diameter, exhibiting icosahedral symmetry and a canyon, 12- 30 A wide and 25 A deep, that runs along the 5-fold axis and contains the pathogen's receptor binding site.
  • the receptor is the well-characterized member of the immunoglobulin superfamily, ICAM-1 (CD-54), a 532 residue, 19nm x 2-3 nm rod comprising 5 immunoglobulin-like domains arranged in a line with a hinge between domains 2 and 3 (figure 4) .
  • Viral binding involves insertion of the distal immunoglobulin domain (Dl) into the viral canyon.
  • ICAM-1 bears multiple N-linked oligosaccharides, and variation in glycosylation between cell-types results in ICAM- l's molecular mass heterogeneity of 76 to 114 kD. It functions as the cell-surface ligand for lymphocyte function-associated antigen 1 (LFA-1), and binding between ICAM-1 and LFA-1 contributes to leukocyte adhesion in several immunological and inflammatory processes (Staunton et al . , 1990). Cytokines secreted by T-lymphocytes and monocytes, including IFN-g, TNF- a and b, and IL-a and b, induce expression of 3.5 x 10 6 copies per endothelial cell. However, uninduced cells exhibit much lower levels of ICAM-1, and in the CDM8 expression system,
  • ICAM-1 can be exogenously regulated through control of an inducible promoter (Staunton et al . , 1990) which makes this system excellentely suitable for the function of this invention.
  • the method is carried out in the form of a high-throughput screening assay wherein said cell population is exposed to a multiplicity of target compounds from which an unknown binding partner for a polypeptide of interest is to be identified and isolated.
  • the target compound in this embodiment may be a single definite compound, a mixture of compounds or, to have a large variety of binding possibilities tested, a (combinatorial) library of compounds .
  • Such a high-throughput assay is especially suited for screening of organisms for which comprehensive or complete genomic information is available, such as Mycobacterium tuberculosis or Helicobacter .
  • the method according to the present invention is usually carried out by performing steps a) to f) as described above. It may be necessary to repeat the selection process, e.g., with an increase in the selection pressure. This is suitable if, e.g., the number of isolated, positive clones is too large to handle.
  • steps b) through e) at least once, preferably comprising an increase in the selection pressure in the repeated steps.
  • the method can be carried out by steps b) through e) until the cell population comprises a desired number of distinct clones .
  • background is used to refer to cells that fail to bind the target compound but nonetheless remain in the selected population. In general, it is desireable to minimize the number of background cells, so that CISTEM reliably eliminates cells whose imprinted domains fail to bind the target compound, while selecting those cells whose imprinted domains do bind the target.
  • the CISTEM cell population is supplemented with "helper cells", whose function is propagation and amplification of the selective agent. This amplification of the selective agent increases the intensity of selection, and thereby reduces residual background cells.
  • helper cells lack a selectable marker that is present in the rest of the CISTEM cell population; this permits efficient elimination of the helper cells after they have served their purpose of amplifying the selective agent.
  • Supplemental cells may also be used in other ways to regulate the action of the selective agent.
  • heat-killed cells bearing the agent's receptor may be added to the CISTEM cell population. These cells bind the agent, removing them from selective activity in the remaining CISTEM cell population.
  • step d) When repeating the steps, new sequence variations can be introduced in the variable regions of clones selected in step d) before steps b) to e) are repeated.
  • the screening test may also be (additionally) performed by functional parameters.
  • the polypeptide of interest binds to the target compound and affects its function. Then, the polypeptides exhibited by such clones selected in step d) and amplified in step e) are screened by their ability to affect the function of the target compound.
  • the method according to the present invention comprises a step wherein the cell population is placed in a nutrient medium in which each cell's supply of a vital compound depends upon that cell's variable region acting enzymatically upon a compound structurally similar, but not identical to the target compound .
  • This invention provides a method for identifying and isolating a polypeptide of interest or a target compound or a gene of said polypeptide or target using the detection of the interaction between a polypeptide of interest and a target compound.
  • at least one of the interactants is unknown.
  • the method allows for identification of the unknown interaction partner, i.e. a polypeptide of interest and/or the target compound.
  • both interactants may be unknown, e.g. when an unknown ligand, upon binding at a receptor site, exerts a certain effect on the cell which needs to be antagonized or interfered with and both the ligand and the receptor domain to which it binds need to be identified.
  • CISTEMs an acronym for Cellular Imprinting by Selection: Teleological Evolution of Molecules.
  • a CISTEM applies selective pressure to a population of genetically engineered cells in order to amplify the frequency of cells in which a membrane polypeptide shows prescribed properties. This process is - as stated above - termed "imprinting" the polypeptide.
  • the binding of the target compound prevents the action of the selection agent via intramolecular interaction (in the membrane polypeptide), e.g. by occluding the selection agent from its adhesion domain.
  • This may comprise binding, adhesion, structural changes or chemical modifications of the variable domain which has an impact on the adhesion domain.
  • Such impact on the adhesion domain may, inter alia, be a conformational change of the adhesion domain that prevents the harmful action of the selection agent.
  • the invention is useful in at least three general areas.
  • the invention may be employed to identify polypeptides with properties that are useful in medical diagnostics, therapeutics, research, laboratory procedures, or other applications.
  • polypeptides that may be identified by the method disclosed here are polypeptides that interact with, in particular bind with high affinity and specificity to, virtually any given target compound, polypeptides that adhere to their respective target compounds at multiple epitopes, polypeptides that bind their respective target compounds but show no affinity for compounds closely related to the targets, enzymatic polypeptides, and polypeptides that show biological activities, such as antagonism or agonism of a receptor.
  • this invention may assist in the identification of the known compound's native interactants.
  • this invention may be used to analyse the interaction between a known polypeptide of interest and a known target molecule. By identifying which sequences in the polypeptide of interest interact with the target molecule, the invention assists in mapping the polypeptide epitopes that play a role in the interaction.
  • the method of the invention may be used to screen a population of target compounds and to identify those compounds that bind to a known polypeptide.
  • cells displaying a given section in the extra- or intracellular domain of the membrane polypeptide are used in a screening assay as test cells to screen for e.g. pharmaceutically active compounds that interact with the polypeptide of interest.
  • This invention may also be used to identify polypeptides with more sophisticated properties of interaction with the target compound than prescribed binding affinity.
  • the molecules that may be identified by this method are the following: polypeptides that exhibit multivalent binding; polypeptides that bind tightly to one molecule but show no affinity for a second, closely related molecule; polypeptides that affect the functional activity of the target compound to which they bind; and enzymatic polypeptides.
  • the invention is based on the interaction of three CISTEM components: (1) a genetically engineered population of prokaryotic or eukaryotic cells; (2) a target compound; and (3) a selection agent (figure 5) .
  • a genetically engineered population of prokaryotic or eukaryotic cells On the surface of each cell in the population, there are - at least - two polypeptide domains involved in the operation of the CISTEM: the cellular domain carrying either a given polypeptide whose interaction with an unknown target compound is to be determined or, in the case of an unknown polypeptide, the imprinted domain, and the selection site, e.g. the adhesion domain.
  • the imprinted domain is highly variable, because the CISTEM cell population is engineered to display up to at least 10 12 distinct oligopeptides at the imprinted domain.
  • the oligopeptides may be adorned by diverse oligosaccharides or other structures resulting from post- translational processing of the polypeptide
  • the imprinted domain on each cell's surface is neighboring the selection site, e.g. on an adhesion domain, the structure to which the CISTEM' s selection agent must bind in order to exert its effects upon the cell.
  • this may be an organism or molecule, and it may be fatal to cells or merely retardative of their growth, but the first step in the agent's destruction or retardation of a cell is necessarily binding between the agent and its adhesion domain.
  • the two important domains - namely, the imprinted domain and the adhesion domain - are expressed in multiple copies on the surface of each cell, but they are - in the case of direct interaction - normally adjacent to each other: more specifically, they must be sufficiently close to each other that the target compound, when interacting with the imprinted domain, either extends far enough over the adhesion domain that adhesion of the selection agent is at least partially impeded, or exerts its effect on the selection site in a different way (e.g. by intramolecular steric alteration of the selection site) .
  • the target compound is the molecule or set of molecules with which the polypeptide sequences identified by the CISTEM must interact in the prescribed manner in order to exhibit the properties or perform the functions desired of them.
  • the CISTEM identifies polypeptides that interact with, e.g. bind to, and optionally also structurally or functionally alter, the target compound.
  • a target compound can be any molecule, especially a protein, carbohydrate, saccharide, glycoprotein, proteoglycan, hormone, receptor, antigen, antibody, pathogen, substrate, metabolite, transition state analog, cofactor, inhibitor, drug, dye, nutrient, small molecular compound, etc., without limitation.
  • Each CISTEM places only one requirement on the target compound: it must be effective in affecting the change in the selection site upon binding to the polypeptide of interest, e.g. that it is sufficiently large that its attachment to the imprinted domain results in occlusion of the adjacent adhesion domain or that its binding affects the structure of the membrane protein such that the selection site is sufficiently changed to prevent or to allow action of the selective agent.
  • each selected clone increases in frequency and in number as it continues to divide, while unprotected clones are afflicted by the selection agent.
  • the CISTEM selects and amplifies clones expressing structures that bind the target compound (figure 8) .
  • CISTEMs permit selection for a prescribed affinity or nature of interaction of the imprinted domain with the target. Multivalent binding may be selected, as may highly specific binding, binding at predetermined sites, or binding by specified structural motifs. Moreover, CISTEMs may be extended to select not only for binding, but also for functional activity; these extended CISTEMs may be used to create enzymes and other functionally active molecules.
  • CISTEMs for Cellular Imprinting by Selection: Teleological Evolution of Molecules.
  • a CISTEM consists of three components. They are: (1) an engineered population of cells; (2) a target compound; and (3) a selection agent (figure 5).
  • the CISTEM cell population may be prokaryotic or eukaryotic.
  • the unknown interactant is the polypeptide of interest
  • in the CISTEM cell population one extra- or intracellular domain (the "imprinted domain") of a membrane polypeptide (the "imprinted polypeptide") is engineered to exhibit highly variable peptide sequences.
  • the CISTEM cell population is eukaryotic, glycosylation sites within the imprinted domain may be adorned by variable complex saccharides. High sequence variability at the imprinted domain is achieved through the insertion of variable oligonucleotides at the sites in the polypeptide gene that correspond to translated regions in which variation is desired.
  • variable oligonucleotides may be cDNAs, random sequences, or any other form of oligonucleotides.
  • Techniques for inserting a gene for an exogenous oligopeptide at a targeted site in an outer membrane protein gene were described above, and an extension of these techniques, which allows the insertion of variable oligonucleotides (at least 10 5 to 1012 different sequences, instead of a single, stereotyped nucleotide sequence) , will be described in detail below.
  • a CISTEM' s selection agent may bind to an adhesion domain that is located immediately adjacent to the imprinted domain on the surface of the cells in the CISTEM population (figure 5) .
  • the adjacent extra- or intracellular domains in most of the membrane proteins used in this invention are separated by 3 - 30 A.
  • this arrangement of domains is the result of carefully choosing an imprinted polypeptide and a selection agent such that the imprinted domain is immediately adjacent to the adhesion domain.
  • membrane proteins are genetically altered in order to construct a hybrid polypeptide in which the imprinted domain is placed beside the adhesion domain.
  • the techniques for engineering membrane proteins, including the fusion of two separate polypeptides to form a chimeric polytopic polypeptide, were described above; their specific applications to embodiments of this invention are described below.
  • the target compound is any molecule or set of multiple, distinct molecules, which are simply added to the cell population according to the present invention or, in some embodiments of this invention, are secreted by the cells themselves.
  • the present invention identifies and selects cells in which the displayed polypeptide sequence interacts with the target compound, or some subset of molecules included in the target compound, and this interaction event is necessary to the functions to which the polypeptides are ultimately applied, such as enzymatic activity or alteration of the target compound's biological activity.
  • Each cell in the CISTEM population expresses multiple copies of its imprinted polypeptide, and binding is a dynamic process balancing on-rate and off-rate. Therefore even when a cell's imprinted polypeptide exhibits high affinity for the target compound, some copies of the imprinted polypeptide are left unprotected some of the time. Consequently, every cell in a CISTEM population, no matter how high its affinity for the target compound, has some probability of being hit by a selection agent. Nonetheless, this probability is lower when a cell's affinity for the target compound is higher.
  • each clone in a CISTEM is present in multiple copies (the CISTEM cell population is redundant) the probability that a single cell will be hit is also the fraction of that cell's clonal population that will be hit, and the differential elimination of clonal populations is the very basis of selection.
  • each selected clone increases in frequency and in number as it continues to divide, while unprotected clones are destroyed.
  • the CISTEM selects and amplifies clones expressing structures that bind the target compound (figure 8) .
  • the effect of the target compound relevant for the preferred embodiment of the invention, wherein the selection site is located on an adhesion domain is that, due to its size, it extends, upon interaction with the imprinted domain, from the imprinted domain to the adhesion domain.
  • This is 3 - 30 A for many imprinted polypeptides imprinted by this invention.
  • the interaction between the target compound and the imprinted domain may lead to a conformational and/or chemical change of the variable domain and/or the adhesion domain which prevents the selection agent from attachment.
  • the target compounds, their source, preparation, and use of CISTEMs with particular applications in this specific embodiment of the invention are described hereinafter. While the concentration of target compound in a CISTEM is also important to its function, this concentration depends on the nature of other CISTEM components, and is therefore discussed below.
  • the target compound is simply the molecule that is to be tagged and/or partitioned by the polypeptide.
  • the target compound is HIV gpl20, the extracellular subunit of a major HIV glycoprotein (Fantini et al., 1997).
  • the production of recombinant gpl20 in a soluble form that is suitable for use in CISTEMs is described in the literature (Fantini et al . , 1997) .
  • Another embodiment of the invention is used to design a diagnostic polypeptide for the direct detection of a different virus, Human T-cell Leukemia Virus type I (HTLV-1); here the target compound is the HTLV-1 surface glycoprotein gp46.
  • HTLV-1 Human T-cell Leukemia Virus type I
  • the target compound is the HTLV-1 surface glycoprotein gp46.
  • the production of authentic, glycosylated HTLV-1 gp46 in a recombinant vaccinia virus/T7 polymerase system is described in the literature (Arp et al . , 1996). Diagnostic assays for detecting human viral infection often bind not to the virus itself, but rather to the antibodies raised against the infectious agent.
  • One preferred embodiment of this invention imprints a polypeptide to exhibit high affinity for a monoclonal antibody, MAbT7.
  • the target compound in this CISTEM, MAbT7 itself, is commercially available (Novagene) .
  • the invention is also used to imprint polypeptides to serve as antagonists, agonists, or modulators of the functional action of biologically active molecules.
  • the target compound may be either of the molecules involved in the interaction to be antagonized.
  • HIV gpl20 is the viral glycoprotein that binds to the CD4 receptor, and thus mediates viral invasion of the human cell (Fantini et al . , 1997) .
  • the embodiment of this invention that imprints a polypeptide (ICAM-1, which is described above) to bind specifically to HIVgpl20 provides a polypeptide that could serve as an antagonist of the interaction between the viral glycoprotein and its host cell receptor.
  • polypeptides selected by a CISTEM are more likely to functionally antagonize a given molecular interaction when the CISTEM' s target compound is pared down, so far as is possible, to one interactant ' s active site. Structures on either interactant that are distinct and topologically distant from the active site are less promising as target compounds because a polypeptide that binds such structures may not interfere with the normal interaction of the active molecules.
  • one embodiment of this invention imprints a polypeptide (E. coli OmpA, which is depicted in figure 13/, and described further below) to function as an antagonist of the interaction between the eukaryotic cell-signalling proteins Ras and Raf (Block et al . , 1996).
  • Raf Activation of Raf, a protein kmase, by the immediately upstream protein, Ras, initiates pathways leading to the phosphorylation of transcription factors, the stimulation of numerous genes, and, ultimately, to cell proliferation and differentiation.
  • Constitutively active oncogenic mutants of Ras or Raf occur in a large number of human tumors (Block et al . , 1996) .
  • the target compound is the domain of Raf (ammo acids 51-131) that is known to be directly involved m binding to Ras (Raf's so-called Ras-bmdmg domain, or Raf-RBD) (Block et al., 1996).
  • Raf-RBD Ras-bmdmg domain
  • Raf-RBD rather than Raf m its entirety, as the target compound ensures that the polypeptides selected by the CISTEM bind to precisely that domain of Raf which interacts with Ras, rather than to Raf epitopes that are extraneous to the important cell-signalling interaction.
  • the target compound is the receptor or biomolecule that the imprinted polypeptide is intended to activate.
  • the target compound is a solubilized form of IL-6R: the expression of soluble human IL-6R in CHO cells, and the protein's purification, are described in the literature (Yasukawa et al . , 1990).
  • CISTEM selects polypeptides with affinity for IL-6R
  • a secondary screen is used to isolate polypeptides whose binding to the receptor results in transduction of the signal that is normally initiated by IL-6. This secondary screen is described in the passages relating to extensions of the basic CISTEM.
  • the target compound When this invention is used to create agonists, the target compound is pared, so far as is possible, to the active site of the receptor or biomolecule that is to be agonized. This is done because removal of extraneous epitopes from the target compound increases the probability that an imprinted polypeptide that binds the target will also be active as an agonist .
  • CISTEM is used to identify, or to create mimics of, specific antigens that are present in infectious pathogens or cancer. These antigens or antigen mimics are then often used to provoke a specific immune response against the pathogen or tumor.
  • the target compound is composed of more than one molecule: multiple monoclonal antibodies, specific immune receptors, such as cytotoxic T-lymphocyte receptors or MHC-I molecules, or polyclonal antisera derived from individuals exposed to a specific antigen, such as meningicoccal Opa proteins, to a specific pathogen, such as meningicocci, or to a tumor, such as melanoma.
  • Polyclonal antisera are divided from animals, human volunteers or actual patients. This composite target is used as an impression - a "negative", or “footprint", of the antigen - and from this footprint, the CISTEM is used as described in example 2 to identify the antigen or to create a mimic of the antigen.
  • An antigen-mimic is of interest, because a polypeptide that is imprinted to bind the antibodies composing sera against an antigen is likely to elicit an immune response very similar to the one provoked by the antigen.
  • Antibodies suitable for use as the target compound in a CISTEM for imprinting E are examples of the target compound in a CISTEM for imprinting E .
  • coli OmpA to mimic Neisserial protein Opa 5c include mAb B306, mAb A222/5b, and others described in the literature (Olyhoek et al . , 1991).
  • Antibodies for use as the target compound in a CISTEM for identifying tumor antigens are derived from the polyclonal serum of a cancer patient, the isolation of which is widely described in the literature.
  • polypeptides are imprinted to bind to transition state analogs.
  • the preparation and structure of these transition state analogs are described in the literature (e.g. Charbonnier et al . , 1995).
  • a phosphonate transition-state analog (structure and preparation described in Charbonnier et al . , 1995) is used as the target compound for imprinting E. coli OmpA to function as a catalyst of the hydrolysis of p-nitrophenyl esters.
  • the target compound is not a transition state analog, but rather a modified substrate.
  • the target is a molecule that is structurally very similar, but not identical, to the substrate upon which the imprinted polypeptide will ultimately act.
  • the reason for using a slightly modified structure, rather than the substrate itself, is the following: if the substrate itself were used as the target compound, imprinted polypeptides that not only bound the substrate, but also performed the desired enzymatic activity, would alter the substrate and then release it, rendering the underlying cell sensible to the selection agent. Thus, if the substrate itself were used as the target compound, the CISTEM would select against clones expressing polypeptides with the desired enzymatic activity.
  • the target compound must be an effective structural mimic of the substrate (i.e. polypeptides that bind the target must also be able to bind the substrate) , but it must be invulnerable to the prescribed enzymatic activity (i.e. polypeptides that act enzymatically on the substrate must be unable to act on the target compound) .
  • Imprinted polypeptides that show high affinity for a different peptide motif bind at most only a tmy fraction of the target polypeptides, and are therefore left vulnerable to the selection agent.
  • Several methods for the synthesis of this oligopeptide target compound are described m the literature
  • a secondary screen may be applied to isolate imprinted polypeptides that not only bind the target, but also perform the desired enzymatic reaction. Secondary screens, including the one used in the CISTEM that imprints polypeptides to hydrolyze the bond between two methionines, are described below.
  • this invention In addition to the creation of functionally active polypeptides, such as antagonists and enzymes, this invention's applications include interaction analysis (current techniques reviewed m Phizicky et al . , 1995). By screening a diverse library of polypeptides for sequences that interact specifically with a target compound of significant interest, this invention assists m the identification of the compound's native interactants. For example, although the structural interaction between Ras and Raf-RBD has been well- characterized through crystallography and mutational analysis
  • This CISTEM involves the following components: Loop IV of E.coli OmpA is used to display the polypeptides translated from fragments of cDNAs derived from a cell expressing Ras and Raf (the insertion of foreign oligonucleotides into the gene for OmpA is described below) ; Raf-RBD (see above for a more detailed description of this target compound and its preparation) serves as the target compound; and phage K3 serves as the (pathogenic) selection agent.
  • the appropriate concentrations of the target compound and the selection agent can easily be determined by the skilled man in the art after reading the present specification and are described below in the examples of OmpA imprinting CISTEMs.
  • This invention is also used for epitope mapping (Phizicky et al., 1995), serving to elucidate the surface of structural interaction between two molecules.
  • the target compound is one of the two molecules whose interaction is to be mapped.
  • the interactant that is most easily prepared in soluble form that serves as the target compound.
  • Raf-RBD serves as the target compound.
  • it is readily prepared in soluble form (Block et al., 1996).
  • one of the interacting proteins is dissected into multiple smaller domains which are displayed in the imprinted domain and selected by the CISTEM for their capability to bind to the other interactant.
  • multiple separate molecules compose the target compound.
  • Each molecule must meet the CISTEM' s size requirement and must be present in the CISTEM at a concentration sufficient to confer protection upon clones that bind to it with high affinity. (Selection by CISTEMs with multiple distinct target molecules strongly favors binding by imprinted domains to multiple distinct molecules, because the effective concentration of target compound rises significantly with each additional compound that the imprinted domain is capable of binding.)
  • the CISTEM assists m the identification of interacting pairs of molecules: polypeptides expressed m the imprinted domain are selected for their affinity for particular members of the library of molecules that composes the target compound.
  • the target compound consists of a library of polypeptides or phage-displayed polypeptides that are secreted by the CISTEM cell population itself.
  • the target compound is the interaction partner that is to be identified due to its interaction with a given polypeptide of interest, e.g. a section of a naturally occuring cell surface receptor which is displayed the extracellular domain.
  • the target compound is the unknown interactant.
  • the cells presenting the receptor domain of interest may be incubated with chemical compounds from a pool of substances. Surviving cells are indicators that the compound that has interacted with these cells is a candidate substance for interaction with the receptor domain.
  • cells are protected from the (pathogenic) selection agent m an autocrme fashion, due to binding between the imprinted domain and a polypeptide target that is secreted by the cell itself.
  • two separate libraries of polypeptides expressed from two separate libraries of genes, are screened for interaction with each other.
  • the cell population displays one library on the cell surface, exactly like the standard CISTEM cell population.
  • the cell population carries also a second library of genes. These genes reside in a vector that encodes a secreted polypeptide. Such vectors are described m the literature (Mollenkopf et al . , 1996) .
  • Each cell m the autocrme population displays one foreign polypeptide on its surface and secretes a second foreign polypeptide into the extracellular medium. In the event that these two polypeptides bind to each other, a cell protects itself from the pathogenic agent and is therefore positively selected.
  • polypeptide relates to all types of peptides or proteins synthesizeable by a host cell.
  • a polypeptide must comprise at least one extra- or intracellular domain that accepts exogenous oligopeptide insertions without disturbing translation or export of the polypeptide to the cell surface.
  • This capacity to accept insertions has been observed in numerous secreted polypeptides, which can be attached to the outer membrane by a genetically engineered anchor that is described m the literature (Smger, 1990; Cross, 1990) , as well as in each one of the dozens of bacterial, yeast, and mammalian membrane polypeptides that have been topologically mapped by gene fusion techniques.
  • a number of these polypeptides such as E. coli OmpA, E. coli LamB, and mammalian ICAM-1, their properties, and their capacities to exhibit exogenous sequences m extracellular domains, are described m the literature.
  • the membrane polypeptide may be autologous or heterologous for the cell.
  • the membrane polypeptide is heterologous.
  • all copies of the membrane protein gene a given clone must be altered m some wayFor example, if a plasmid- borne copy of the gene for the imprinted polypeptide bears a foreign insert at the imprinted domain, whereas the chromosomal copy of the gene remains wildtype, then proper selection by the CISTEM is hindered, because the selection agent and/or the target compound can bind to the wildtype copy of the polypeptide, as well as to the engineered copy.
  • the gene encoding the membrane protein is inactivated, e.g. by deletion. Standard molecular methods are available to the person skilled in the art.
  • Exogenous insertions at an extra- or intracellular domain are involved in two stages of each CISTEM.
  • an oligopeptide tag is inserted into the imprinted domain, and cell populations exhibiting this tag are used m preliminary rt ⁇ T ⁇ tr t*J 3 TJ o 3 ⁇ ⁇ Hi 3 P- ⁇ - ⁇ ⁇ TJ ⁇ cn C ⁇ PJ TJ ⁇ - O rt tr o ⁇ - TJ TJ TJ cn TJ ⁇ - rt
  • an amber-suppressing host such as E. coli UH203
  • CDM8 cDNA expression vector pCDl.8 may be used in conjunction with the COS cell expression system (Staunton et al . , 1990).
  • the vectors bearing the gene for the imprinted polypeptide are replicated and isolated. They may be replicated by a cell population, by PCR techniques, or by any other commonly employed methods described in the literature.
  • 10 to 10 copies of the vector are isolated. The exact number depends on the number of distinct clones that will compose the CISTEM cell population, and on the technique used in the step described below, the insertion of the variable oligonucleotides into the vectors .
  • the gene for a well-characterized oligopeptide tag is inserted into the vector at a genetic site corresponding to the imprinted domain of the polypeptide.
  • the oligonucleotide tag may replace a nucleotide sequence of the naturally occuring sequence. Replacement is accomplished by excising a natural coding region and by inserting the oligonucleotide tag in place thereof.
  • the tag may be any peptide sequence that is known to bind with high affinity and specificity to a ligand that is commercially available or easily produced in the laboratory by techniques described in the literature.
  • One example of a commercially available tag/ligand pair is the T7 sequence (Met -Ala -Ser-Met -Thr-Gly-
  • any common technique for inserting an oligonucleotide at a specific site m a vector can be used (e.g. Boyd, 1994). Several such techniques are referenced above, and one preferred procedure is described here m more detail.
  • the vector is cleaved at the two designated sites by digestion with restriction endonucleases Rl and R2, following the protocols and under the conditions specified by the suppliers of the enzymes. This yields a linearized vector with non- complementary sticky ends (RSI and RS2) and an excized fragment.
  • the linearized vector is isolated by standard electrophoretic size fractionation .
  • oligonucleotide template comprising the gene for the tag is synthesized using conventional technology, such as phosphoramitide chemistry, or can be obtained from commercial laboratories offering custom oligonucleotides.
  • the oligonucleotide is synthesized to read 5 ' -P--Rl--tag gene--R2- -3', in which P is the primer sequence for a proof-reading polymerase, such as Vent polymerase (New England Biolabs) , Rl and R2 are the restriction endonuclease sites mentioned above, and --tag gene-- is the genetic sequence encoding the oligopeptide tag.
  • oligonucleotide template is used to generate a double stranded oligonucleotide.
  • This oligonucleotide is digested with restriction endonucleases Rl and R2 under conditions described by the enzyme suppliers, yielding an oligonucleotide with sticky ends RSI' and RS2 ' .
  • This oligonucleotide which has sticky-ends complementary to the linearized vector, is ligated into the vector by a common ligation procedure familiar to those practiced in the art.
  • the result is that the vector's gene for the imprinted polypeptide bears the gene for the oligopeptide tag at a genetic site corresponding to the imprinted domain.
  • Oligonucleotide 1 5' - CG[ATGGCTAGCATGACTGGTGGACAGCAAATGGGT] - 3'
  • Oligonucleotide 2 5' - CTAG[ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT]CGAGCT - 3'
  • the sequence in brackets comprises the gene segment encoding the oligopeptide that is recognized by the T7 antibody.
  • These two single-stranded oligonucleotides were annealed in MBI Fermentas Klenow buffer, forming the double stranded T7-tag gene segment with sticky ends complementary to the sticky ends on pEV218, linearized by Sad and Xbal, as described in the previous paragraph.
  • the double-stranded oligonucleotide was ligated into the linearized plasmid, the T7 tag gene segment was in frame with the ompA gene on pEV218.
  • the new construct was designated pEV218T7.
  • the vector is used to transform a population of cells lacking a chromosomal copy of the gene for the imprinted polypeptide.
  • Strains lacking chromosomal genes for outer membrane polypeptides are described in the literature (e.g., Brown, 1992, Freudl, 1989, and Staunton et al . , 1990), as well as below, in the passages relating to cell lines and media.
  • Conventional methods of transformation are used, and transformants are selected by their expression of the vector- borne selectable marker, such as a gene for ampicillin resistance.
  • E in which E .
  • coli OmpA was engineered to display the T7 polypeptide tag at loopIV, pEV218T7 was used to transform E . coli strain UH203 ⁇ l a c, s upF, ompA, recA, proAB, rpsL) (Freudl, 1989), according to a common method of electrotransformation, as follows.
  • variable oligonucleotides that are inserted into the vector, and, ultimately, translated into the variable regions exhibited by the CISTEM cell population.
  • the variable oligonucleotides must have conserved sequences at their termini, just as the tag gene described in the previous section was given defined termini RSI and RS2.
  • variable oligonucleotides with defined sequences at their termini are easily synthesized using solid-phase phosphoramidite chemistry (Brown, 1992): at defined terminal positions, only one of the four nucleotides is made available for addition to the strands being synthesized, whereas at variable positions, a mixture of multiple nucleotides is presented to the strands.
  • end addition of bases catalyzed by terminal transferase m the presence of nonlimitmg concentrations of multiple nucleotides, can append a highly variable sequence to a defined terminal.
  • Highly variable sequences may also be obtained by selecting fragments of defined length from digestions of natural cellular DNA or RNA preparations.
  • variable oligonucleotides are the cDNA library genes.
  • the size of the cDNA library that is screened by such a CISTEM is in some embodiments of this invention reduced by limiting the screen only to genes of interest, as determined by comparison of the cDNA library genes to sequences catalogued in genomic libraries.
  • cDNAs are prepared from tumor cells according to protocols described in the literature.
  • variable oligonucleotides are sections of the gene encoding the known polypeptide.
  • the oligonucleotides inserted into pEV218 are sections of the sequence encoding Ras.
  • stop codons are excluded from variable oligonucleotide inserts in order to prevent truncation of the imprinted polypeptide. This is achieved by synthesizing oligonucleotides to read Dl(NNP) n D2, where Dl and D2 represent the defined sequences that facilitate insertion of oligonucleotides into vectors; n represents the number of triplet codons that compose the variable section of each oligonucleotide; N represents all four nucleotides, each with equal probability; and P represents a subset of nucleotides, each with equal probability. NNP encodes all 20 amino acids, but precludes the occurrence of stop codons recognized by the CISTEM cell population.
  • the stop codons TAA and TGA are excluded from the oligonucleotide inserts. It is not necessary to preclude the occurrence of all three stop codons in inserted oligonucleotides, because the CISTEM cell population may be composed of stop codon- suppressing mutants, which are capable of translating one or more of the stop codons. Numerous suppressor strains are described in the literature (e.g. Brown, 1992, Freudl, 1989).
  • the inserted oligonucleotides may include consensus sequences for N-linked glycosylation (Chavez et al . , 1991) . These are synthesized as defined sequences interrupting the variable sequence of each oligonucleotide.
  • variable oligonucleotide inserts may include a mixture of defined and variable positions, sequences coding for fixed motifs, or statistical biases toward certain sequences or structures.
  • variable oligonucleotides are inserted at a prescribed site m the vectors constructed and amplified by the methods described above. Two preferred procedures for precisely targeted insertion are described here-one m which the number of oligonucleotides inserted into each vector is defined, and another in which the number of inserts is variable.
  • the available procedures for inserting oligonucleotides at specific vector sites are certainly not limited to the two described here. In fact, any of the gene fusion techniques used to insert a gene coding for a reporter moiety into the gene for a membrane protein may be used to the same effect as the procedures that will be described.
  • the site of insertion m the vector is located such that oligonucleotides inserted there are translated as part of the imprinted domain.
  • the site of insertion may be further specified (i.e. withm the sequence encoding the imprinted domain) according to structural analysis or prescriptions for a particular nature of interaction between the imprinted domain and the target compound.
  • both of the procedures outlined here make use of restriction endonuclease sites that are unique m the vector, and are located at or near the prescribed site of insertion. If a unique restriction site does not already exist at the site of insertion, one may be engineered using any one of a number of procedures described in the literature (e.g. Boyd, 1994, Brown, 1992 and Freudl, 1989) .
  • variable oligonucleotide To insert one and only one variable oligonucleotide per vector, two distinct restriction sites, each unique m the vector, must be located such that they flank a short restriction fragment comprising the prescribed site of insertion. If the sticky ends left by cleaving the vector with one endonuclease (Rl) at its unique site are RSI and RSI', while cleaving with the other endonuclease (R2) at its unique site leaves sticky ends RS2 and RS2 ' , then the variable oligonucleotides generated at step 3 are given defined and conserved sticky ends RSI and RS2 ' .
  • the vectors are digested by both restriction nucleases (Rl and R2 ) , and the digestion products are size-fractionated to isolate the vectors, each of which has non-complementary sticky ends RSI' and RS2.
  • the population of restriction-digested vectors is combined with a slightly smaller population of variable oligonucleotide inserts (vectors/insert » 1.5 - 2.0), and ligation is allowed to occur. Because the sticky ends of the vectors are non- complementary to each other, each vector can neither ligate shut, nor ligate to other vectors.
  • oligonucleotide inserts have non-complementary sticky ends RSI and RS2', they can neither self-ligate into loops, nor ligate to each other to form concatamers of multiple oligonucleotides.
  • the only ligation that can occur is the insertion of one variable oligonucleotide into each vector. After ligation has taken place, the vector population is size- fractionated to discard vectors that did not receive an insert. In the resultant vector population, each vector has one variable oligonucleotide at the prescribed site in the region encoding the imprinted domain.
  • This plasmid is digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas), under conditions described by the suppliers, yielding a fragment of the restriction site polylmker and the linearized plasmid.
  • the linearized plasmid is isolated by standard gel electrophoresis .
  • Single stranded oligonucleotides are synthesized by conventional technology to read 5'GGTCTAGA(VNN) n CGAGCTC3 ' , in which N represents all four nucleotides, each with equal probability; and V represents C, G, or A, each with equal probability.
  • a 5' primer and a proof reading polymerase such as the Vent polymerase, are used to generate double-stranded oligonucleotides from the single-stranded templates.
  • a proof reading polymerase such as the Vent polymerase.
  • These double-stranded, highly variable oligonucleotides are then digested with Sad and Xbal, under conditions described by the suppliers, yielding highly variable oligonucleotides with non-complementary sticky ends 5'-CTAG and AGCT-3'.
  • the highly-variable oligonucleotides are then ligated into the linearized plasmid (MBI ligation buffer, conditions described by supplier) ; they ligate only one oligo per plasmid, and in only one orientation.
  • Conventional electrophoretic size fractionation is used to isolate ligated plasmids containing an insert.
  • the product of this procedure is a population of plasmids (pEV218I) bearing, at the genetic site corresponding to loop IV of OmpA, a highly variable exogenous sequence free of TAA and TGA stop codons.
  • a restriction site that is unique in the vector is located at the prescribed site of insertion, i.e., at a genetic site corresponding to the imprinted domain of the imprinted polypeptide.
  • a unique restriction site does not already exist at the desired site, one may be engineered by conventional methods described m the literature, such as oligonucleotide-directed mutagenesis (e.g. Brown, 1992 and Freudl, 1989) .
  • variable oligonucleotides generated by the procedures described above are given defined and conserved complementary sticky ends RS and RS ' .
  • the vectors bearing the gene for the imprinted polypeptide are cleaved by R.
  • the vectors are then combined with an excess of variable oligonucleotide inserts. Several distinct ligation events can occur. First, the vectors, which have complementary sticky ends RS and RS ' , can ligate to other vectors or self-ligate, closing without an oligonucleotide insert.
  • variable oligonucleotides which also have complementary sticky ends RS and RS ' can l) self-ligate as a loop, n) ligate into a vector with sticky ends RS and RS ' , m) ligate to each other, forming a multi- oligonucleotide concatenation, which itself can perform ligations 1, 11, or m.
  • size fractionation can be used to discard all inserts that did not ligate into a vector and all vectors that did not receive inserts. This yields a population of vectors, each of which harbors between 1 and n variable oligonucleotides at the prescribed site m the gene encoding the imprinted polypeptide.
  • the vectors bearing variable oligonucleotide inserts are used to transform a population of cells that lack a gene for the imprinted polypeptide. Conventional methods of transformation may be used. Transformants are selected by their expression of the vector-borne selectable marker, such as a gene for antibiotic resistance. (Efficient transformation
  • the product of transformation is a population of cells, each of which harbors a distinctive oligonucleotide insertion at a genetic site corresponding to a point m the imprinted domain. After transformation, the cell population is amplified by growth a suitable medium.
  • m which the CISTEM cell population is engineered to display variable oligopeptides at loop IV of E. coli OmpA
  • transformation is performed m the following manner.
  • the variable plasmid population, pEV218I whose construction was described above, is used to transform E . col i strain UH203 ⁇ lac, supF, ompA, recA, proAB, rpsL) , according to a common method of transformation described in the literature (Freudl, 1989) .
  • the UH203 population is grown Luna broth supplemented with 100 mg Ap/ml, selecting for Ap resistance of transformants.
  • the imprinted polypeptide is induced with the compound appropriate to the vector's promoter.
  • the population of UH203 transformed with pEV218I is induced with lm IPTG) .
  • Each clone displays a distinctive oligopeptide at the imprinted domain. If the imprinted polypeptide is a glycoprotein or a proteoglycan, glycosylation sites in the imprinted domain bear diverse and unpredictable oligosaccharides.
  • a CISTEM is constructed wherein the imprinted polypeptide is a polytopic membrane protein in which the domain that accepts insertions is directly adjacent to the adhesion domain of a selection agent suitable for use in the CISTEM.
  • This preferred configuration is depicted in figure 5, and several of the polypeptide/selection agent combinations that exhibit this configuration are listed in table 1, and in the literature (Tyler et al . , 1996).
  • the CISTEM is assembled with a minimum of genetic manipulation of the imprinted polypeptide; the only necessary manipulation is the introduction of an oligopeptide tag, as well as variable peptide sequences, to the imprinted domain, as described above.
  • the list of polypeptides that may be imprinted is by no means limited to polytopic proteins exhibiting adjacent adhesion and imprinted domains.
  • two genetic manipulations are employed.
  • secreted proteins When secreted proteins are imprinted, they are first converted to their glycolipid- or transmembrane-anchored isoforms. Because translocation of secreted proteins and membrane insertion of integral proteins are handled by the same cellular machinery (reviewed in Singer, 1990, and above) , many secreted proteins may be converted to membrane-bound isoforms simply by carboxyl-terminal attachment of a hydrophobic transmembrane span or a signal sequence for glycolipid addition.
  • the gene-fragment encoding the 37 carboxyl terminal amino acids of decay accelerating factor can be fused to the gene for a secreted protein, such as human growth hormone, in order to target and anchor the protein to the apical surface of human intestinal cell lines.
  • a secreted protein such as human growth hormone
  • heparin-binding epidermal growth factor-like growth factor can be expressed as a longer, membrane-spanning isoform (HB-EGFTMII) comprising an extracellular domain that is structurally identical to the secreted polypeptide (Onu et al., 1994).
  • HB-EGFTMII membrane-spanning isoform
  • Other examples of and techniques for the conversion of secreted polypeptides into isoforms anchored by transmembrane spans are described in the literature (Singer, 1990) .
  • a second genetic manipulation of the imprinted polypeptide is performed when the domain that is to be imprinted lacks an adjacent adhesion domain. This occurs e.g. when the polypeptide chosen for imprinting is not polytopic, or when the selection agents that adhere to domains adjacent to the imprinted domain are unknown or poorly suited for use in the CISTEM.
  • the sequence for a functional adhesion domain may be fused to the gene for the extra- or intracellular domain, creating a gene for a chimeric polypeptide comprising both the functional adhesion domain and the variable domain.
  • the colicin A adhesion domain from E.
  • coli OmpF when fused to separate domains from another outer membrane protein, OmpC, remains functional as a colicin A receptor (Fourel et al . , 1990) .
  • a group of eukaryotic viral systems which are described in the literature (Chu et al . , 1995), provide further examples of the insertion of functional adhesion domains. In their application to embodiments of this invention, these systems operate as follows.
  • the antigen-binding site of an antibody (scA) is displayed on the surface of a virus, such as spleen necrosis virus, as described in the literature (Chu et al., 1995).
  • the antigen that is recognized by the scA is then expressed as an exogenous insert in the extracellular domain that will serve rt ⁇ TJ tr ⁇ ⁇
  • type la polypeptides are generally competent for functions involving high-affinity adhesion or specific binding to extracellular targets. They are therefore well-suited to be imprinted for use as highly specific markers or antagonists of small polypeptides, such as small peptide hormones.
  • Type II polypeptides are often enzymes with extracellular active sites, such as glycosyl transferases . Some type II polypeptides, then, are well suited to be imprinted to exhibit enzymatic activity. When this invention is used to imprint a polypeptide to serve as a functionally active effector, the preferable imprinted polypeptide is usually a membrane-bound isoform of a secreted protein, such as human growth hormone.
  • the imprinted polypeptide is a molecule that naturally exhibits functional or structural homology with molecules that perform functions similar or identical to the one desired of the imprinted polypeptide.
  • structural homologues to E . coli OmpA have been found in Haemophil us f Shi gella , Nei sseri a , Bordetell a , and Streptococcus . In these virulent bacteria, the OmpA homologues play a role in adhesion to host tissues, and are often recognized by the host's specific immune response.
  • OmpA The structural homology between OmpA and these antigenic adhesins makes OmpA well-suited for imprinting for use in applications requiring mimicry of homologous adhesins. Two such applications are competitive inhibition of microbial adhesion, and vaccination, described in greater detail below.
  • this invention can screen libraries of polypeptides expressed by eukaryotic cells.
  • the oligo- and polysaccharides adorning the imprinted domain are not varied in the same systematic manner as the underlying polypeptide sequence, because, as was described above, the saccharide molecule attached to a particular site on a polypeptide is not a simple function of the local amino acid sequence.
  • glycosylation at a site is altered in complex and unpredictable ways by changes to the local sequence or to the sequence elsewhere in the polypeptide (Lis et al., 1993). Therefore, when the glycosylation sites within the imprinted domain are preserved while immediately flanking regions are varied by the method described above, a semi- random diversity of sugar structures is appended to a systematic diversity of polypeptide sequences, creating a complex glycoprotein or proteoglycan library.
  • CISTEM' s target compound When a CISTEM' s target compound naturally interacts with processed, e.g. glycosylated polypeptides (i.e., in its natural setting, the target compound binds to the processed forms, e.g. to the glycoproteins or proteoglycans), it is preferred that the imprinted polypeptide be processed, especially glycosylated.
  • the CISTEM cell population be eukaryotic.
  • the adhesions that underpin inflammation, blood clotting, sperm-egg interactions, and intercellular recognition are all mediated by glycoproteins or proteoglycans. Therefore, for the identification of molecules for use as antagonists or agonists in these processes, CISTEMs with eukaryotic cell populations and glycosylated imprinted domains are preferred.
  • the CISTEM' s selection (pathogenic) agent must not require a specific glycosylation on its adhesion domain. Otherwise, unexpected changes in the sugars adorning the adhesion domain renders some cells immune to the pathogenic agent even if they do not bind the target compound at their imprinted domains.
  • the selection site especially an adhesion domain of a pathogenic agent, used in a CISTEM is well-characterized. In most cases, this characterization reveals whether or not the agent requires glycosylations for adhesion. This may be verified, moreover, by removing glycosylations from the receptor and testing for susceptibility to the agent; methods for removing glycosylations are described in the literature (Lis et al., 1993) .
  • polypeptides that exhibit structural integrity are preferred. It is particularly preferred that they not only maintain their properties when liberated from the cell surface, but also exhibit stability in the environmental- perhaps in vivo-conditions in which the imprinted molecules are ultimately applied.
  • prokaryotic polypeptides ⁇ -barrels are exceptionally stable; they can be denatured only by boiling in strong detergents.
  • glycosylation often adds structural stability to otherwise malleable protein structures (Lis et al . , 1993).
  • the structural integrity of imprinted domains may be heightened by genetic manipulations. Such manipulations may be performed before or after imprinting, and they may be used to increase the stability of structures that will be attached to, or liberated from, the membrane.
  • genetic manipulations of the imprinted polypeptide may also be used to introduce an element of rational structural design to a CISTEM.
  • Structural analysis of the target compound, its native ligands, or molecules that naturally perform functions similar to that desired of the imprinted polypeptide may guide the insertion of predetermined motifs into the imprinted polypeptide.
  • a tendency toward the formation of certain structural motifs, such as ⁇ -helices may be incorporated into imprinted domains by inserting variable oligopeptides that are not entirely random, but instead are statistically biased toward sequences that form the desired motifs.
  • Predetermined motifs may also be inserted into the conserved regions of imprinted domains.
  • polyproline or a-helices may be used to extend a variable region away from the cell surface, making it more accessible to a sunken epitope on the target compound.
  • genetic manipulation informed by structural analysis is by no means necessary to CISTEMs, it may be utilized in combination with CISTEMs to design polypeptides that are well-suited to the performance of desired functions.
  • the CISTEM cell population consists of cells that lack a chromosomal copy of the gene for the imprinted polypeptide, but are capable, upon transfection with an episomal or vector-borne copy of this gene, of translating, exporting, and inserting the imprinted polypeptide in the membrane.
  • CISTEMs which imprint E.
  • E. coli OmpA to bind to any one of several targets E. coli strain UH203 (Freudl, 1989), which is ompA ⁇ , composes the CISTEM cell population.
  • E. coli strain S1755 (Brown, 1992), which lacks the lamB gene, composes the cell population.
  • a eukaryotic example is provided by several preferred eukaryotic CISTEMs, which imprint human ICAM-1 to bind to any one of several targets.
  • COS-7 cells are used; this strain lacks ICAM-1, but is easily transfected with an ICAM-1 gene borne on the CDM8 expression vector (the COS-7/CDM8 expression system is described in Staunton et al., 1990).
  • the strains that compose CISTEM cell populations are preferably stop-codon suppressors; that is, they recognize one of the stop-codons, such as UAG, as a translatable codon.
  • stop-codon suppressors that is, they recognize one of the stop-codons, such as UAG, as a translatable codon.
  • any stop-codons recognized by the strain composing the CISTEM cell population must be excluded from the variable oligonucleotides that are inserted into the vector-borne gene for the imprinted polypeptide. (The synthesis of variable oligonucleotides free of certain stop- codons, and their insertion into the gene for the imprinted polypeptide, are described above) . Therefore, the synthesis of variable oligonucleotides is simplified when the CISTEM cell population is able to recognize one of the stop-codons. For example, E.
  • coli UH203 which is used in several preferred CISTEMs for imprinting OmpA, is supF, i.e., capable of translating the UAG stop-codon. Consequently, variable oligonucleotides inserted into the OmpA vector (pEV218), must be free of the stop codons UAA and UGA, but not of UAG. This is easily achieved by the method of synthesis described above.
  • S1755 the strains used in CISTEMs imprinting LamB, is supE44, i.e., capable of translating UAG.
  • the medium in which the CISTEM cell population is propagated depends simply on the strain composing the population. For example, E.
  • coli UH203 is grown in Luria broth or agar. When a strain contains a plasmid with an antibiotic resistance gene, the appropriate antibiotics are added to the medium or agar. Standard media for all commonly used strains of prokaryotic and eukaryotic cells are described in the literature and familiar to those practiced in the art. The conditions for transformation of CISTEM cells with the vector bearing the imprinted polypeptide also depend upon the cell type, and are described m more detail above, as well as in the literature. Induction of expression of the imprinted polypeptide requires incubation of the cell population with the compound that activates the imprinted polypeptide' s mducible promoter. As is described above, however, certain inducible promoters, such as the lac promoter/operator elements, are "leaky", allowing for the appropriate level of expression of the imprinted polypeptide without any induction.
  • the medium When the CISTEM cell population is incubated with a suspension or solution of the target compound, the medium generally contains only the minimal requisites for cellular sustenance m addition to the target compound.
  • the presence of abundant macromolecules other than the target compound can result m selection for clones whose imprinted domain bind not to the target, but rather to one of the constituents of the medium.
  • imprinting OmpA E . coli UH203 are incubated m the presence of any one of several target compounds in ATCC Medium 52.
  • m some CISTEMs, Luna broth was used.
  • CISTEMs for imprinting ICAM-1
  • COS cells are incubated with target compound m TC Minimal Medium Eagle or Minimal TC Medium 99.
  • target compound m TC Minimal Medium Eagle or Minimal TC Medium 99.
  • Minimal media for other commonly used prokaryotic and eukaryotic cell lines are described m the literature. The concentration of target compound and duration of incubation are important to CISTEM function, and are discussed m detail below.
  • the procedure demonstrates that the agent is competent to perform CISTEM selection.
  • this procedure is used to determine the experimental conditions that will allow reliable CISTEM selection.
  • multiple parameters may be varied, including the following: concentration of the target compound ( [T] ) with which the CISTEM cell population is incubated; duration of incubation with the target compound; initial multiplicity of infection (Mol)or concentration of the selection agent; duration of incubation with the selection agent; temperature; level of expression of the imprinted polypeptide.
  • concentration of the target compound ( [T] ) with which the CISTEM cell population is incubated duration of incubation with the target compound
  • initial multiplicity of infection (Mol)or concentration of the selection agent concentration of the selection agent
  • duration of incubation with the selection agent duration of incubation with the selection agent
  • temperature level of expression of the imprinted polypeptide.
  • concentration of target compound, [T] and initial multiplicity of infection, Mol, are the two parameters that are varied over a number of trials.
  • two different parameters - namely, level of expression of the imprinted polypeptide and duration of incubation with the selection agent - are varied.
  • the level of expression of the imprinted polypeptide is one of the varied parameters.
  • the method by which regulation of the imprinted polypeptide is achieved depends upon which heterologous promoter controls the gene for the imprinted polypeptide.
  • heterologous promoter controls the gene for the imprinted polypeptide.
  • ompA is under the control of the lac regulatory elements, which are induced by isopropyl-b-D-thiogalactopyranoside (IPTG) .
  • the ompA gene on pEV218T7 comprises a TAG stop codon corresponding to amino acid 7 (Freudl, 1989) of the polypeptide; therefore, when expressed in the amber-suppressing (supF) host E. coli UH203, ompA expression is regulated not only by the level of induction with IPTG, but also by the concentration of the suppressor tRNA, which is expressed from a plasmid. Low level expression of OmpA is achieved with 1.0 mM IPTG induction. On pSB1649, lamB is under the control of the lac regulatory elements, and expression is regulated as described in the literature (Brown, 1992) . Regulation of the expression of ICAM-1 from pCDl.8, the CDM8 vector construct used in a preferred eukaryotic embodiment of this invention, is also described in the literature (Staunton et al., 1990).
  • Two populations of cells (populations A and B) exhibiting the oligopeptide tag in the imprinted domain are grown in a medium appropriate to the cell type and the expression vector.
  • E. coli UH203 transformed by pEV218T7 are grown at 37°C in Luria broth supplemented with lOO ⁇ g/ml ampicillin, and 1.0 mM IPTG for induction of pEV218T7 expression; conditions for growth of other cell types are described above and in the literature.
  • a third population of cells (C) of the same strain as populations A and B, transformed by the same vector as populations A and B, but without the oligopeptide tag, is grown in the same medium.
  • E. coli UH203 transformed by pEV218, rather than pEV218T7 are grown in the conditions described above.
  • Populations B and C are each grown to generate three clonal subpopulations of 10 cells, Bl, B2, B3, and Cl, C2, C3.
  • Population A is grown to generate nine clonal subpopulations, Al , A2...A9, each comprising 10 cells.
  • Populations Al ...9 and C1...3 are incubated with the target compound, while populations B1...3 are incubated under identical conditions, but without the target.
  • Optimal conditions and time required for binding between the oligopeptide tag and the target are prescribed by the supplier of the tag/target system or in the literature (Phizicky et al., 1995).
  • the cell population expressing pEV218T7 is incubated with mAbT7 for approximately 60 minutes.
  • the concentration of target compound with which cells are incubated is systematically varied across populations Al ...9 and Cl ...3 in order to identify the concentration that is optimal for CISTEM selection (table 3) .
  • Populations A7...9 and C1...3 are incubated with the highest concentration of target, such as lOOx the concentration used with populations Al ...3.
  • C1...3 are exposed to the selection agent.
  • the technique of exposure depends upon the nature of the agent, and, for all well-known agents, is described in the literature. Methods of exposing prokaryotic cell populations to bacteriophages are described by Calendar (Calendar, 1988); methods of exposing prokaryotic cell populations to colicins are described by
  • Pattus et al . (Pattus et al., 1990); and methods of exposing eukaryotic cell populations to viruses are described in the literature (e.g. Staunton et al . , 1990, Tyler et al . , 1996).
  • concentration of agent is varied across the cell populations in order to identify optimal conditions for the CISTEM' s selection (table 3).
  • the concentration of target is not one of the parameters that is varied. Instead, this concentration is fixed, while two other parameters are varied.
  • Cell populations A2,5,8, B2, and C2 are exposed to a doubled concentration of the agent, and populations A3, 6, 9, B3, and C3 are exposed to yet another doubling of agent concentration.
  • Table 3 summarizes the variation of target and agent concentrations over the cell populations in this procedure for optimizing CISTEM selection.
  • the concentrations in this procedure may vary depending on the nature of the selection agent and the method of exposure, but several fundamental aspects of the procedure do not change from one CISTEM to another.
  • the selection agent has unimpeded access to its receptor, and therefore affects every cell.
  • Population B is susceptible to the agent because, although cells bear the tag at the imprinted domain, the population is not exposed to the tag's ligand. Therefore, the selection agent's site, e.g. the adhesion domain, remains exposed and functional.
  • Population C is susceptible because, although the cell population is exposed to the ligand, the ligand is not bound by the imprinted polypeptide, which lacks the ligand' s tag.
  • populations Al ...9 exhibit a range of resistance, extending even to immunity.
  • the imprinted domain harbors the oligopeptide tag, which binds the target ligand during incubation.
  • the adjacent adhesion domain is occluded from the selection agent, and the cells are protected.
  • Populations Al ...9 exhibit a range of resistance because they are exposed to a range of concentrations of target ligand. When the range of concentrations is broad enough, the range of resistance extends from partial susceptibility to complete immunity.
  • the concentration of selection agent that allows for functional CISTEM selection is found as follows. Ubiquitous pathogenic effect--!. e. every cell in a population is affected by the agent--is easily observed when cell populations are exposed to agents according to protocols described in the literature. For example, in exposing prokaryotic populations to virulent lytic phage, ubiquitous pathogenic effect results in confluent lysis of the population, with zero surviving colonies (Calendar, 1988) . The lowest concentration of selection agent at which every cell in the control populations (B and C) is affected by the agent is an appropriate level for CISTEM selection.
  • the minimum concentration required for ubiquitous effect could be more exactly determined by further trials with Mol's ranging from 5 to 10.
  • the threshold for a CISTEM can be set by adjusting the concentration of the target compound.
  • the target concentration used in the CISTEM is the lowest level that provides complete protection to population Ai in the presence of selection agent at the CISTEM' s Mol. (Complete immunity to the agent is easily observed when cell populations are exposed to agents according to common protocols described in the literature.)
  • population A5 exhibits complete immunity to the selection agent while population A2
  • CISTEM to select for cells whose imprinted polypeptides bind the target with an affinity equal to or greater than the affinity exhibited by the oligopeptide tag for its ligand.
  • 3 10 molecules per imprinted polypeptide is used to select for cells with imprinted polypeptides that exhibit an affinity for the target that is significantly higher than the affinity exhibited by the oligopeptide tag for its ligand.
  • target compound such as 10 molecules per imprinted polypeptide
  • CISTEM The application of CISTEM to cell populations yields cell clones that survive the selection procedure by virtue of the interaction between the polypeptide of interest and the target (s).
  • the sequence of the polypeptide of interest is encoded by the DNA sequence which has been inserted into the display domain.
  • the surviving cell population can be monoclonal or polyclonal. In the latter instance, surviving cells need to be plated out on solid medium plates to recover colonies derived from single cells (state of the art bacterial genetics) .
  • Amplification of monoclonal cell populations and the subsequent preparation of their relevant episomal or vector DNA is performed according to standard protocols of recombinant DNA methodology.
  • DNA sequence analysis of the inserted DNA tag provides the DNA sequence of the insert and thereby allows the prediction of the ammo acid sequence of the polypeptide of interest which has been expressed and displayed at the surface as integral part of the membrane protein.
  • the inserted DNA is derived from a pool of randomly synthesized oligonucleotides or from DNA fragments derived from natural occurmg mRNA (cDNA) or genomic DNA.
  • cDNA mRNA
  • genomic DNA The selected DNA and cognate polypeptide sequences can be scrutinized to match sequences of known proteins and their cognate genes.
  • sequence analysis can be employed to identify the polypeptide of interest and to attribute and/or relate its composition to known genes or DNA sequences and their predicted proteins.
  • Modifications and elaborations of the basic CISTEM achieve several extensions of its capabilities, including the following: 1. augmentation of the maximum affinity of selected polypeptides for the target compound; 11. specification of the number of binding sites joining a multivalent imprinted polypeptide to the target compound; 111. selection for imprinted polypeptides that exhibit both high affinity for the target compound and low affinity for a second compound; IV. selection not only for affinity, but also for functional activity; v. screening the polypeptides expressed from a cDNA library for binding to a target compound; vi . selection for imprinted polypeptides that exhibit affinity for one or more of a set of multiple, distinct target compounds.
  • variation is introduced not as an extension of the original oligonucleotide inserts, but rather as an oligomeric rearrangement of the original inserts (the procedure of oligomeric rearrangement of oligonucleotide inserts is described in Brown, 1992) or a limited mutagenesis of the original inserts (techniques for the limited mutation of oligonucleotide inserts are described m Balmt et al . , 1993; insertion of mutated oligonucleotides is performed according to the procedure described above for insertion of variable oligonucleotides into a targeted site in a vector) .
  • oligopeptide "sequence space" may be searched by a number of strategies, some of which have been applied to screening other combinatorial libraries (Balmt et al., 1993, Brown, 1992, Janda, 1994).
  • the vectors bearing fresh variation at a site corresponding to the imprinted domain are then used to transform a population of cells, according to common protocols cited above, creating a new CISTEM cell population. This population is then subjected to yet another round of selection by the CISTEM (figure 10).
  • TJ TJ ⁇ 3 3 ⁇ - ⁇ a ⁇ ⁇ ⁇ - ⁇ PJ ⁇ - ⁇ ⁇ tr ⁇ 3 ⁇ - O a ⁇ 3 O ⁇ 3 ⁇ 3 O ⁇ - 1 ⁇ 3 - O 3 J ⁇ cn cn ⁇ - a ⁇ 3 ⁇ cn Hi ⁇ a ⁇ TJ C ⁇ 3 X rt ttl ⁇ X C ⁇ cn ⁇ - 3 3 3 3 3 3
  • Certain embodiments of this invention select for imprinted polypeptides that exhibit high on-rate in their binding with the target compound.
  • the CISTEM cell population is not pre-mcubated with the target compound prior to introduction of the selection agent. Rather, the selection agent is introduced simultaneously with the target compound.
  • concentrations of agent and target compound are varied over multiple trials, as is described in the optimization procedures detailed elsewhere in this document. Concentrations are adjusted such that, upon simultaneous introduction of agent and target to the CISTEM cell population, clones whose imprinted polypeptides bind to the target relatively rapidly are protected, whereas clones whose imprinted polypeptide binds to the target more slowly are eliminated by the selection agent.
  • a CISTEM that imprints a polypeptide to show high affinity for a target compound but extremely low affinity for a second compound, which is here called the "repelled compound” is constructed as follows.
  • the size of the target compound, the size of the repelled compound, and the distance between the imprinted domain and the adhesion domain are altered, if necessary, to meet the following condition: when the target compound is bound to the imprinted domain, it occludes the adhesion domain, whereas when the repelled compound is bound to the imprinted domain, it is too small to occlude the adhesion domain. If this criterion is not met fortuitously (i.e.
  • the diameter of the target compound and the distance between the domains may be altered, as follows.
  • the target compound may be enlarged by creating a multimer. (The procedure for creating a multimer depends upon the nature of the target compound, but many techniques for joining multiple copies of a compound are commonly practiced and described m the literature.)
  • the distance between the imprinted domain and the adhesion domain may be increased by moving either domain to a site on the imprinted polypeptide that is topologically farther from the other domain. Transposing the imprinted domain simply involves inserting variable oligonucleotides, according to the procedure described above, at a genetic site corresponding to a different extracellular domain of the imprinted polypeptide.
  • the adhesion domain may be transposed to a different extracellular domain of the imprinted polypeptide without affecting its function as a receptor for the selection agent.
  • the procedure for transposing an adhesion domain is described above as well as m the literature (Chu et al . , 1995) .
  • the superfluous domains may be replaced by polyprolme linkers m order to ensure that extra- or intracellular structures do not impede proper occlusion of the adhesion domain by a target compound bound to the imprinted domain.
  • the CISTEM cell population is incubated with both the target compound and the repelled compound. Then the CISTEM' s selection agent is introduced, and this creates two selective pressures.
  • One pressure favors cells expressing imprinted polypeptides with higher affinity for the target compound (figure 12) : a target compound bound to an imprinted domain occludes the adjacent adhesion domain, and thus serves to protect the underlying cell from the selection agent.
  • a second selective pressure favors cells with lower affinity for the repelled compound; a repelled compound bound to an imprinted domain not only fails to occlude the adjacent adhesion domain, but also prevents binding by a target compound (figure 12) .
  • an imprinted polypeptide ' s affinity for the repelled compound effectively reduces its affinity for the target compound.
  • selection favors cells that exhibit imprinted domains with high affinity for the target compound and low affinity for the repelled compound (figure 12) .
  • a basic CISTEM may be coupled to a secondary screen for activity.
  • a CISTEM selects clones whose imprinted polypeptides bind the target compound m the prescribed fashion. From this population of clones, or from the imprinted domains extracted from such a population, a secondary screen selects the structures whose binding to the target compound exhibits the desired functional activity.
  • Imprinted domains liberated from the cell surface by detergent or by expression as a secreted isomorph (lsomorphic forms and their mterconversion are described above) may be subjected to any commonly practiced screens or assays for activity or modification of the target compound's activity.
  • one CISTEM imprints E. coli OmpA to bind with high affinity to soluble IL-6-Receptor (This target compound is described in more detail above) .
  • the imprinted polypeptides freed from the cell membrane by a conventional gentle detergent, which is known not to denature OmpA, are then subjected to a second screen for their capacity to agonize the cell-bound, functional IL-6-Receptor .
  • This screen for agonistic activity is described m the literature (Hibi et al . , 1990).
  • extremely strong selection for functional activity may be exerted upon the CISTEM' s selected set of cellular clones, and does not require liberation of the imprinted domains from the cellular membranes.
  • CISTEMs can be coupled to selection regimes very similar to genetic complementation.
  • each cell's supply of a vital compound such as a nutrient
  • the imprinted domain's functional activity either converts the target compound to a compound vital to the underlying cell, or secures the vital compound for the cell.
  • the target compound consists of a library of separate and distinct molecules, which is here called cryosomal target library".
  • a CISTEM m which a target library (rather than a single target compound) is used serves to select interacting pairs m which one member of each selected pair is the oligopeptide displayed at the imprinted domain of a protected clone, and the other member is a target library molecule that binds to this oligopeptide and thus serves to protect the clone.
  • the target library is large enough to meet the size requirement for CISTEM function, and is present in the medium at a concentration adequate to confer protection upon cells whose imprinted polypeptides bind to one or more molecules m the target library, the target library may comprise as many distinct molecules as is desired, and may be constructed according to any known procedure for making or isolating libraries of molecules.
  • CISTEMs involving target libraries two strategies are used, if necessary, to increase the concentration of each molecule m the target library to the minimum level required for effective CISTEM selection, i . e . , to achieve the concentration required to confer protection from the selection agent upon those cells whose imprinted polypeptides bind to one or more of the molecules m the target library.
  • the CISTEM cell population and the target library are each amplified, divided into fractions, and exposed to one another m microtiter wells. This allows the concentration of target molecules to be sufficiently high to confer protection upon cells whose imprinted polypeptides bind to one or more molecules in the target library.
  • the target library is secreted by the CISTEM cell population itself.
  • libraries of variable oligopeptides can be displayed as fusions to gene protein 3 of E. coli filamentous phage (M13, fl, or fd) (filamentous phage-display technology is described in Janda, 1994 and Phizicky et al . , 1995).
  • Filamentous phage create relatively avirulent infections m which phage particles are liberated from the host cell by extrusion, leaving the cell in tact. Therefore, a CISTEM cell population that secretes its own target library can be engineered as follows.
  • a cell population that lacks a gene for the imprinted polypeptide is transformed with a library of exogenous copies of this gene, m which variable oligonucleotides have been inserted at the genetic site corresponding to the imprinted domain.
  • E. coli UH203 is transformed with pEV218I, a library of ompA plasmids with variable oligonucleotides at the genetic site corresponding to extracellular loop IV of the membrane protein.
  • the resultant cell population displays a library of exogenous oligopeptides at the imprinted domain.
  • this cell population is then transformed with a second extra-chromosomal genetic element that encodes a secreted library of oligopeptides.
  • the OmpA- imprinting cell population for example, is transfected with a library of filamentous bacteriophage engineered to display exogenous oligopeptides at gene protein 3. (Many such libraries are described in the literature, reviewed m Janda, 1994 and Phizicky et al .
  • the CISTEM cell population expressing two extra- chromosomal genetic elements is then exposed to the CISTEM' s selection agent, following the same protocols for exposure as were described for the basic CISTEM.
  • a clone m the CISTEM' s population is protected only if the target library molecule that is secreted by the clone binds specifically to the clone' s imprinted domain. That is, a clone is protected by an autocrme loop.
  • K3 serves as the selection agent (figure 13) .
  • coli clone is protected from K3 only if the exogenous polypeptide displayed at loop IV of OmpA binds specifically to the M13 that the clone itself secretes.
  • the genes for both members of this interacting pair of oligopeptides are easily recovered from the protected cell.
  • a CISTEM with a target library selects for specifically interacting pairs of oligopeptides, as well as for the genes that encode them.
  • a CISTEM is constructed according to the procedure ust described, but a hemolysin library is used m place of the phage M13 library. Hemolysms are secreted bacterial polypeptides that tolerate exogenous insertions. Thus, the hemolysin gene can be used to carry exogenous genes, which encode polypeptides that are then secreted from the cell.
  • CISTEM Two possibilities for an intracellular CISTEM are depicted m figure 20.
  • the protective partner :partner interaction occurs within the cell.
  • Version A shows a typical CISTEM, wherein the two partners are transported to the cell surface.
  • version B the positive selection occurs, because the two partners cannot be transported to the cell surface and e.g. accumulate withm the cell and/or become substrates for protein degradation pathways .
  • a CISTEM for the identification and isolation of cancer antigens may be constructed as follows: First, a cDNA library is derived from a patient's tumor cells. Protocols for isolation of cells and derivation of the cDNA library are described m the literature. The tumor-derived cDNAs serve as the variable inserts m the construction of the CISTEM cell population: they are inserted into the imprinted polypeptide vector at a genetic site corresponding to the imprinted domain of the polypeptide. The cDNAs are inserted m all three reading frames m order to guarantee that polypeptides associated with the tumor are expressed by the CISTEM cell population.
  • Methods for inserting cDNAs into vectors are available m the art. These vectors are used to transform a population of cells lacking an endogenous copy of the gene for the imprinted polypeptide. When expression of the vector-borne gene is induced, each clone the CISTEM cell population displays one tumor-associated polypeptide, fused to the imprinted domain on the cell surface.
  • the target compound in a CISTEM for the identification of tumor antigens preferably consists of polyclonal antibodies, usually isolated from the same patient whose cells are used as the source for the cDNA library.
  • the tumor cells that supply cDNA and the antibodies that comprise the target compound are preferably autologous.
  • any clone m which the imprinted domain displays a tumor-associated polypeptide that binds autologous antibodies has a different status concerning the selection agent than a cell with no such binding. E.g. in case of binding, these cells may be protected from the selection agent.
  • any clones m which the imprinted domain displays a tumor polypeptide that is not recognized by autologous antibodies is eliminated.
  • this CISTEM identifies tumor-associated polypeptides that interact specifically with autologous antibodies. It is shown m the literature that such polypeptides are likely to be lmmunogenic, eliciting a tumor-specifIC humoral and cellular immune response.
  • Certain embodiments of this invention are used to determine whether certain molecules interact specifically with an imprinted polypeptide that has not been altered by the insertion of variable oligopeptides.
  • Any outer membrane polypeptide that is used as a receptor by a pathogenic agent including polypeptides engineered by the manipulations described above, may be used m these embodiments of the invention.
  • the imprinted polypeptide is not engineered to display highly variable oligopeptides, there is no Uber ⁇ mprmted domain", and the selection agent's adhesion domain may be located anywhere on the imprinted polypeptide.
  • the imprinted polypeptide is expressed from an mducible promoter.
  • the cell population, induced to express the imprinted polypeptide, is exposed to a target library, according to one of the methods described above for constructing target libraries and introducing them to CISTEMs. Either microtiter wells or the autocrme loop methods may be used to increase the concentration of target library molecules.
  • the CISTEM' s selection agent is then introduced according to procedures described for the basic CISTEM. Cells are protected only if molecules m the target library bind to the imprinted polypeptide, and thus prevent adhesion by the selection agent.
  • these embodiments of this invention provide an easy and straightforward method for assaying a set of molecules' affinity for the imprinted polypeptide.
  • preferred areas, for which the present method is especially suitable is the discovery and isolation of novel antigens for use m vaccines, tumors, novel targets for therapeutic intervention, novel diagnostic markers, or molecules useful for monitoring treatment efficacy.
  • novel antigens for use m vaccines, tumors, novel targets for therapeutic intervention, novel diagnostic markers, or molecules useful for monitoring treatment efficacy is also subject matter of the present invention.
  • the present invention relates to the cells/the cell population to be used m the present invention with a) a membrane polypeptide which has the polypeptide of interest m its extra- or intracellular domain and b) a selection site being specific for a selection agent, such that upon binding of a target compound to the polypeptide of interest the selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified m a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding.
  • the present invention also relates to a kit comprising the cells/cell population according to the present invention, the target molecule (and/or a library of target compounds) and one or more selection agents.
  • kits may be provided &J H- M o IVi O 3 T) ⁇ - ⁇ 3 3 cn 3 ⁇ ⁇ ⁇ - ⁇ ⁇ 3 3 PJ rt TJ ⁇ a ⁇ 3 Hi n
  • E. coli Outer membrane protein A is a 325-res ⁇ due polypeptide comprising an ammo-termmal, eight-stranded ⁇ - barrel with four extra-cellular domains, three periplasmic loops, and a roughly 150-residue periplasmic carboxyl-termmal moiety (figure 3) (Freudl, 1989, Morona et al . , 1985).
  • the fa- barrel conformation is extremely stable, and can be denatured only by boiling in strong detergents.
  • OmpA is among the most abundant proteins m the E. coli outer membrane. It is also widely conserved m bacterial evolution.
  • exogenous oligonucleotides inserted into the ompA gene at sites corresponding to extracellular domains of the polypeptide, are normally translated as part of the polypeptide, and do not disturb the polypeptide ' s insertion in the outer membrane.
  • oligopeptides encoded by these exogenous genetic insertions are displayed on the surface of the cell, adopt stable folded structures, and can exhibit enzymatic, agonistic, or antagonistic activities.
  • an E. coli population lacking wild-type OmpA was engineered to express a plasmid-encoded copy of OmpA, m which extracellular loop IV of the polypeptide displayed the T7 tag oligopeptide (Met-Ala-Ser- Met-Thr-Gly-Gly-Gln-Gln-Met-Gly) , which binds to the commercially available T7 antibody (Novagene) .
  • pEV218 In pEV218 (Freudl, 1989) , the ompA gene, which is under the IPTG- mducible control of the lac promoter, has been engineered to bear a polylmker rich in restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide (figure 9) .
  • This plasmid was digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas), yielding a fragment of the restriction site polylmker and the linearized plasmid with sticky ends left by the restriction enzymes.
  • the linearized plasmid with sticky ends was isolated by standard gel electrophoresis.
  • Two single stranded oligonucleotides were synthesized by conventional technology (custom oligonucleotides are available from commercial laboratories, and methods for synthesizing single-stranded oligonucleotides are commonly use and described m the literature) to read as follows :
  • Oligonucleotide 1 5' - CG[ATGGCTAGCATGACTGGTGGACAGCAAATGGGT] - 3'
  • Oligonucleotide 2 5' - CTAG[ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT]CGAGCT - 3'
  • the sequence m brackets composes the T7 tag gene sequence.
  • These two single-stranded oligonucleotides were annealed m Klenow reaction buffer (MBI catalogue) , forming the double stranded T7-tag gene with sticky ends complementary to the sticky ends on pEV218, linearized by Sad and Xbal as described m the previous paragraph.
  • MBI catalogue m Klenow reaction buffer
  • the double-stranded oligonucleotide was ligated (MBI Fermentas ligation buffer) into the linearized plasmid
  • the T7 tag gene was inserted, in frame, into the ompA gene on pEV218.
  • the new construct was designated pEV218T7.
  • the plasmid pEV218T7 was used to transform a population of the strain UH203 (lac, supF, ompA, recA, proAB, rpsL) (Freudl, 1989) , according to common methods of transformation described m detail above.
  • Coliphage K3 invades E. col i by binding to loops I, II, and III of OmpA. Loop IV is not used by the phage, and, indeed, exogenous oligopeptides at loopIV do not affect K3 susceptibility of the cell (the K3 adhesion domains are shown m figure 3; bacteriophages are reviewed briefly m the specification, and bacteriophage K3, as well as its receptor requirements, are described m Freudl, 1989 and Morona et al . , 1985) .
  • the following procedure was performed to fulfill two objectives: a) to conclusively demonstrate that cells whose imprinted polypeptides bind the target compound are protected from K3 attack, whereas cells whose imprinted polypeptides fail to bind the target are eliminated by the phage, and b) to achieve preliminary optimization of the duration of incubation of the CISTEM cell population with phage K3.
  • the principle of this experiment is illustrated m figure 14.
  • E . coli MC4100 (pEGFP-Nl), a kanamycm-resistant, ompA wild-type strain, which will here be referred to as U (for thoughUntagged"), was cultivated overnight m Luna broth (LB) supplemented with 25 ⁇ g kanamycm/ml .
  • E. coli UH203 (pEV218T7), which will here be referred to as T (for perhapsTagged”), is an ampicill -resistant, ompA strain that has been transformed, as described m the specification, by a plasmid encoding OmpA bearing the T7 tag sequence at the fourth extracellular loop of the polypeptide.
  • Mol Multiplicity of Infection
  • This experiment also serves m the preliminary optimization of CISTEM selection by phage K3.
  • m table 4 and figure 15 m the mixed population of U and T cells that were incubated with mAbT7, U cells were completely eliminated from the population after 30 mm. incubation with phage K3.
  • T cells by contrast, maintained a roughly stable population well beyond 30 mm. of incubation with the phage. This suggested that when starting with Mol - 10, the minimum time period for phage K3 to eliminate all cells that failed to bind the target would be 30 - 60 mm.
  • the preliminary, single-parameter optimizations of incubation time with mAbT7 and the duration of K3 selection allowed an initial test of the CISTEM' s capacity to select rare cells whose imprinted polypeptides bind the target from a background of cells whose imprinted polypeptides fail to bind the target.
  • the following procedure was performed.
  • E . coli strains U (Untagged) and T (Tagged) are described above.
  • U was cultivated overnight m LB supplemented with 25 ⁇ g kanamycm/ml.
  • T was cultivated overnight m LB supplemented with lOO ⁇ g ampicillm/ml and ImM IPTG to induce expression of pEV218T7, the plasmid encoding OmpA with the T7 tag at loopIV of the polypeptide.
  • Each overnight culture was diluted with LB to OD600 - 0.005.
  • a 50 ⁇ l aliquot of the U culture was combined with a 50 ⁇ l aliquot of the T culture to create a mixed population.
  • the mixed population was created with equal sized aliquots of U and T cultures, it was composed predominantly of U cells. The reason for this was probably that the very low level of OmpA expression induced in the T strain (only ImM IPTG was used) made this strain significantly less viable than the U strain under standard conditions of cultivation.
  • the mixed population was incubated at 37C with 68nM mAbT7 (described above) diluted in PBS for 60 min.. Following incubation, 100 ⁇ l aliquots of the cell population were plated on LB plates supplemented with 25 ⁇ g kanamycin/ml or lOO ⁇ g ampicillin/ml . Colonies were counted after overnight growth, showing that the population consisted of approximately 1 T cell for every 120 U cells
  • T cells 10 8 U to 10 4 T; 10 8 U to 10 3 T; 10 8 U to 10 2 T; and 10 9 U to 102 T. Each population is incubated for 60 min. at
  • T cells are created by combining aliquots, as described above. 10 nM mAbT7 is added to each population, and the populations are incubated for 60 mm. at 37°C. During the 60 mm. incubation, the population composition is monitored, as described above, by taking sample aliquots of 5 ⁇ l every 20 mm., plating, growing, and counting the numbers of U and T cells. After the removal of an aliquot at 60 mm., coliphage K3 is introduced at an Mol of 2. Over the next 180 mm., the populations are monitored by 5 ⁇ l aliquot sampling every 20 mm.. The [IPTG] and duration of incubation with K3 that result m the elimination of all U cells, with concomitant elimination of fewest T cells, are optimal parameter values for use in the CISTEM.
  • OmpA-imprinting CISTEM cell population OmpA plasmid pEV218 (described above, shown m figure 9, construction described m Freudl, 1989) , bears a polylinker rich in restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide.
  • This plasmid is digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas) , under conditions described by the suppliers, yielding a fragment of the restriction site polylmker and the linearized plasmid.
  • the linearized plasmid is isolated by standard gel electrophoresis. 10 distinct, single-stranded oligonucleotides are synthesized by conventional technology (synthesis of variable oligonucleotides is described above) to read as follows:
  • N represents all four nucleotides, each with equal probability; and V represents C, G, or A, each with equal probability.
  • V represents C, G, or A, each with equal probability.
  • variable plasmid population pEV218I
  • UH203 ⁇ lac, supF, ompA, recA, proAB, rpsL This strain, its propagation and its transformation, are described more completely above, and in Freudl, 1989.
  • the UH203 population is grown in Luria broth supplemented with
  • the UH203 (pEV218I) population of 10 8 cells plus approximately 100 UH203 (pEV218T7) cells is incubated at 37C for 15 min. in 100ml LB medium supplemented with 100 ⁇ g Ap/ml and IPTG between 0.25 and 2.0 mM, as determined in the optimization trial described above. This yields a population of 10 8 clones, comprising 106 distinct clones, each of which displays a distinct, exogenous, oligopeptide sequence at loop
  • lOnM mAbT7 is added to the 100ml of cells. The cells are incubated with the mAb for 60 min. at 37°C.
  • the cell population is centrifuged to separate cells from phage and the products of lysis, and cells are resuspended in 100ml NB, without lOOmM
  • This cell population represents the set of clones selected by the CISTEM for binding to the target compound mAbT7. To amplify the numbers of selected clones, the selected cell population is grown to 10 cells at 37°C in 100ml NB .
  • OmpA The structural homology between OmpA and a wide range of antigenic adhesins makes OmpA well-suited for imprinting for use m applications requiring mimicry of homologous adhesins.
  • One such application is vaccination.
  • a polypeptide in this case, OmpA
  • OmpA polypeptide
  • the imprinted polypeptide may elicit an immune response very similar to the one provoked by the antigen (Folgori et al . , 1994).
  • coli OmpA to mimic Nei sseria meningi tidi s Opa proteins (Olyhoek et al., 1991) include mAb B33, which reacts with all Opa proteins, mAbs P322 and P514, which react with a broad subset of Opa proteins, and mAb B306, mAb A222/5b (Olyhoek et al . , 1991), and others described in the literature.
  • this CISTEM like the one described in Example 1, uses phage K3 to select from a cell population displaying variable inserts 20 amino acids in length at loop IV of OmpA.
  • this CISTEM imprints OmpA to bind to mAbT7
  • this CISTEM imprints OmpA to bind to antibodies against Neisseria meningi tidi s Opa proteins.
  • E. coli strains U (Untagged) and T (Tagged) are described above (in Example 1).
  • U was cultivated overnight in LB supplemented with 25 ⁇ g kanamycin/ml.
  • T was cultivated overnight in LB supplemented with lOO ⁇ g amplicillin/ml and 2mM IPTG to induce expression of pEV218T7, the plasmid encoding OmpA with the T7 tag at extracellular loopIV of the mature polypeptide.
  • Figure 17 shows the dynamics of phage titer and Mol over the course of CISTEM selection. Mol climbs exponentially and reaches a maximum near 100,000.
  • Figure 18 shows the dynamics of cell growth and death during CISTEM selection. The rapid exponential growth of phage results in a monotonic decline of U cells, irrespective of the presence of absence of mAbT7. By contrast, in the presence of mAbT7, T cells are protected from the extremely high Mol, and consequently they expand in number. Thus, specific interaction between the antibody and the epitope displayed at OmpA loopIV resulted in selection of cells exhibiting the epitope, while cells that did not exhibit the epitope were eliminated. This result is summarized in figure 19. iv: EXAMPLE 4: REDUCTION OF BACKGROUND THROUGH THE USE OF HELPER CELLS
  • background is used to refer to cells that fail to bind the target compound but nonetheless remain in the selected population. In general, it is desireable to minimize the number of background cells, such that CISTEM reliably eliminates cells whose imprinted domains fail to bind the target compound, while selecting those cells whose imprinted domains do bind the target compound.
  • the following experiment illustrates one strategy for reducing background.
  • the CISTEM cell population is supplemented with "helper cells", the function of which is propagation and amplification of the selective agent. This amplification of the selective agent increases the intensity of selection, and thereby reduces residual background cells.
  • helper cells lack a selectable marker that is present in the rest of the CISTEM cell population; this permits efficient elimination of the helper cells after they have served their purpose of amplifying the selective agent.
  • UH203 (pEV218T7) , the "T" strain, was cultivated overnight in LB supplemented with lOO ⁇ g ampicillin/ml and lmM IPTG to induce expression of the ompA plasmid.
  • UHFl(pEV218) was used as a background strain that does not exhibit the T7 epitope at loopIV of OmpA.
  • UHF1 is virtually isogenic with UH203, except that it carries a gene for Chloramphenicol resistance.
  • the plasmid pEV218 is identical to pEV218T7, except that it carries a restriction-enzyme site polylmker in place of the gene for the T7 tag.
  • UHF1 pEV218
  • UHF1 the "U” strain
  • pEV218 was cultivated overnight in LB supplemented with lOO ⁇ g ampicillin/ml and lmM IPTG to induce expression of pEV218.
  • To create mixed populations aliquots of the "T" and "U” strains were combined to compose populations with a ratio of T to U cells of approximately 1 to 100 (table 9) .
  • helper cells MC4100 (pEGFPNl) were used as helper cells.
  • lOO ⁇ l of MC4100 (pEGFPNl) at OD 600 0.5 were added to two mixed populations. Two other mixed populations received no helper cells.
  • the background proportion defined as U cells/ (U cells + T cells) is reduced from 34% in the absence of helper cells to 16% with the addition of lOO ⁇ l of helper cells at OD 6 oo.
  • This experiment thus illustrates the principle of helper cells: they are added to CISTEM cell population to modify the activity of the selective agent.
  • the E. coli strains UH203 (pEV218T7) , here called “T” cells, and UHF1 (pEV218) , here called “U” cells, were cultured overnight with lmM IPTG to induce expression of the ompA plasmids.
  • T cells E. coli strains UH203 (pEV218T7)
  • UHF1 pEV2128
  • U UHF1
  • Helper cells were added as described in the previous example.
  • One set of five mixed populations representing each of the serial dilutions of T cells (see table 10), was incubated for 1 hour at 37C with 340 nM mAbT7.
  • Two replicate sets of five mixed populations (a total of 10 populations, comprising two of each of the serial dilutions of T cells) were incubated under the same conditions without the antibodies.
  • phage K3 was added to the set of five mixed populations that had been incubated with the antibodies, as well as to one of the two sets of five mixed populations that were not incubated with the antibodies.
  • the presence of antibodies or phage is indicated by a +, whereas the absence of antibodies or phage is indicated by a -.
  • aliquots from each population were plated on agar containing the appropriate antibiotics, as described above. After overnight culture, colonies were counted to determine the composition of the CISTEM cell populations.
  • Table 10 shows not only the final numbers of cells, but also several indices of CISTEM' s performance in selecting T cells from a background of U cells. Enrichment, an index of the selection system's capacity for amplifying clones that bind the target compound while eliminating those that do not, is defined as ( T Cel l s /U Cel ls ) after selerr mn/ ( T Cel ls /U Cells ) befDre selection -
  • Yield an index of the proportion of all cells that bind the target compound that are successfully selected by the CISTEM, is estimated by comparing the number of T cells m the K3- populations with the number of T cells m the K3+, mAb+ populations. Yield is defined as (T cells) ⁇ +, Ab+/ (T cells) 3 -.
  • CISTEM One application of CISTEM is in mapping the epitopes involved in protem-protem interactions. To illustrate this application, CISTEM was used to select cells displaying the phage T7 polymerase epitope that is recognized by the T7-polymerase specific mAbT7 (described above) .
  • the plasmid pET17xb (Novagene) contains the entire gene for phage T7 polymerase. This plasmid was digested with micrococcal nuclease, according to protocols provided by the supplier, to yield fragments of a range of sizes from approximately 50 basepairs to 300bp.
  • the band corresponding to fragments between 60 and 75 bp in length was excized with a razorblade and the fragments were resuspended.
  • the fragments were treated with SI nuclease (conditions according to supplier) to remove the 3' phosphates left by micrococcal nuclease.
  • the fragments were then phosphorylated by T4 polynucleotidkmase .
  • the ompA plasmid pEV218 (described above) was cleaved by the restriction enzymes Ecll36II and Sail (conditions of digestion according to supplier) , at unique sites in the genetic region corresponding to the fourth extracellular loop of the mature polypeptide.
  • the linearized vector was dephosphorylated by calf intestinal phosphatase. Combining lOOng linearized vector with 200ng fragments, the fragments were ligated into the linearized vector by standard blunt-end ligation using T4 ligase. PEG was added to the ligation buffer to favor circularization. Circularization of the vector without insertion of a fragments creates a Sail site. Therefore, the products of ligation were treated with Sail to linearize vectors that did not contain a fragment. The circularized vectors containing fragments were isolated by electrophoresis, and used to transform the highly competent, commercially available strain XL-1 blue. Cells were plated on LB supplemented with lOO ⁇ g/ml ampicillin to select for transformants.
  • Colonies were grown in 250ml LB supplemented with ampicillin, as well as glucose for catabolic repression of low- level expression of the ompA plasmid. Plasmid DNA was extracted using the Qiagene kit. 7 ⁇ g of plasmid DNA was cleaved by Sail and used for transformation of UH203 (an ompA deficient strain described above) by electroporation and CaMg transformation. Cells were grown on LB supplemented with lOO ⁇ g ampicillin to select for transformants. This population of transformants will be refered to as UH203 (pEV218T7L) .
  • the UH203 (pEV218T7L) population was grown in overnight culture with lmM IPTG to induce expression of the ompA plasmid.
  • the cells were incubated for one hour with 680nM mAbT7, allowing for binding between the monocolonal antibody and the epitope presented at OmpA loopIV by certain members of the cell population.
  • Each polypeptide (column A) is listed with the selection agent (column B) that could be used in a CISTEM to imprint the polypeptide; the respective domains that compose the agent's adhesion domains (column C; numbering begins at the amino terminal) ; a domain that is interacting (e.g. for sterical reasons) with the adhesion domain and could be imprinted to bind a target compound (column D)
  • Table 3 Serial dilutions of selection agent and target concentrations in preliminary optimization of CISTEM selection. The cell populations listed in the table are described in the text.
  • Rhinoviruses propagation in cell culture.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Immunology (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • Organic Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • General Physics & Mathematics (AREA)
  • Food Science & Technology (AREA)
  • Pathology (AREA)
  • Cell Biology (AREA)
  • Analytical Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Toxicology (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Biophysics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Plant Pathology (AREA)
  • Virology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

This invention is a method for identifying and isolating at least one of the following: a polypeptide of interest and/or a gene thereof; a target compound and/or a gene thereof, wherein the polypeptide of interest, or the target compound, or both are unknown. The method is based on the steps of: (1) displaying a library of a polypeptide of interest on the surface of a population of prokaryotic or eukaryotic cells; (2) exposing the population to a target compound; and (3) exposing the population to a selection agent. Interaction among the three components 'imprints' the polypeptide: that is, the interaction selects for those cells in which the polypeptide interacts with the target compound in a prescribed manner.

Description

METHOD AND CELLS TO SCREEN COMBINATORIAL LIBRARIES
Combinatorial chemistry (reviewed in Janda, 1994), in which a diverse library of laboratory-constructed molecules is screened for structures that exhibit properties of interest, is a relatively novel and potentially powerful technology, promising useful application in three general areas. First, combinatorial chemistry may be used to identify molecules with highly specific adhesive or enzymatic activities useful in biomolecular research, clinical therapy, or diagnostics. Second, by screening a library for molecules that interact specifically with a known compound of significant interest, combinatorial chemistry may assist in identification of the known compound's native interactants (e.g. Phizicky.et al . , 1995) . Third, combinatorial chemistry may be used for epitope mapping, serving to elucidate the surface of structural interaction between two molecules of interest (e.g. Phizicky et al., 1995) .
The current combinatorial chemistry of proteins, powerful though it may be, has shown two distinct limitations. In many libraries, the diverse proteins are presented on a platform that fails to confer much structural integrity; as a result, the proteins can be too plastic to exhibit a stable shape and consistent properties. Equally as important, the screening procedure can be labor-intensive, and factors outside of a protein's exhibition of the properties of interest often influence its isolation by a screen. This occurs in most screens reported in the literature (e.g. Phizicky et al., 1995) . A related technology, two-hybrid systems, has proven useful in the latter two applications mentioned above, namely, interaction studies and epitope mapping (Phizicky et al . , 1995) . However, two hybrid systems place a very strict limit on the kinds of interaction that may be studied: both components of the interaction must be proteins, and they must be readily transported into the nucleus.
It was an object of the present invention to provide a method of polypeptide combinatorial chemistry and polypeptide interaction identification that resolves or circumnavigates many of the difficulties encountered by other technologies. (The term polypeptide is used herein to comprise all types of peptides or proteins, including glycoproteins, proteoglycans, lipoproteins, acetylated proteins or (other) processed peptide or protein forms) . It was a further object to provide a fast and reliable screening method for combinatorial or other peptide libraries, which is also versatile and broadly applicable .
These objectives are met by a method for identifying and isolating at least one of the following: a polypeptide of interest, a target compound or a gene of a polypeptide of interest or a gene of a target compound, wherein at least one of said polypeptide of interest or said target compound is unknown comprising the steps a) constructing a cell population by engineering it to express a membrane polypeptide which has said polypeptide of interest in its extra- or intracellular domain, said extra- or intracellular domain being exhibited in one or more copies on each cell's surface directly or indirectly interacting with a selection site specific for a selection agent, such that upon binding of said target compound to said polypeptide of interest said selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified in a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding, b) exposing said cell population to said target compound or a population of said target compounds for a period of time sufficient for said target compound to bind to said polypeptide of interest, c) exposing said cell population simultaneously with or after step b) to said selection agent for a period of time sufficient for said agent to interact with said selection site, d) selecting for clones which show said binding of said target compound to said polypeptide of interest , e) optionally allowing said selected clones to multiply and f) isolating said polypeptide of interest and/or said target compound and/or said gene of said polypeptide of interest and/or said gene of said target compound.
In this invention, a library of polypeptides is exhibited in one or more extra- or intracellular domains of a membrane polypeptide expressed by a cell population. Selective pressures exerted by selection agents eliminate those cells whose polypeptide sequences fail to bind to a preselected target compound. Thus, from a cell population that may exhibit a diversity of polypeptide sequences, cells whose sequences bind the target compound are selected and e.g. amplified by growth. The term "polypeptide imprinting" is used to refer to the use of the method according to the present invention to select cells exhibiting polypeptides that bind the target compound or exhibit specific properties.
With this invention, a wide variety of prokaryotic and eukaryotic polypeptides may be imprinted, serving e.g. creation or discovery of polypeptide sequences with high affinity for a predetermined target, interaction studies, or epitope mapping. In more sophisticated embodiments of the invention, polypeptides may be imprinted to exhibit the following specific properties: multivalent binding to a predetermined target; high affinity for a target and low affinity for one or more molecules closely related to the target; inhibitory activity in a biological pathway (antagonism) ; excitatory activity m a biological pathway (agonism) ; or enzymatic activity.
It is also possible to screen not only for a polypeptide library but also comcidently or alternatively, for a population of target molecules.
As mentioned above, the present invention may be performed by displaying the polypeptide to be imprinted (the polypeptide of interest) in the extra- or intracellular domain of a membrane protein.
Preferred embodiments of the present invention are described hereinafter and in the examples and figures. This invention is the product of the confluence of three distinct areas of research: i. the study of prokaryotic and eukaryotic cellular membranes and their complex constituents; ii. the study of the evolution of microbial populations under the selective pressures of toxins, bacteriocins, or viral parasites; and iii. the design and screening of polypeptide combinatorial libraries.
With the present invention, a polypeptide with promising properties can immediately be identified, amplified and isolated, making the present method extremely sensitive. It is possible to select one polypeptide that binds to a ligand out of a background of 106 non specific polypeptides as a minimum, but in there is no evidence for an upper limit on sensitivity regarding the present method per se.
With the present method all cells of the combinatorial library of polypeptides and/or target compounds are screened at once, rather than serially, providing results in far less time than many alternative technologies. The present invention is easy to use, because the important parameters are easily adjustable for the man skilled in the art. Once a set of cells is selected, the genes for the polypeptides or the target compounds (if these are also expressed by the cells) are easily isolated.
The polypeptide of interest may be displayed by any of the wide ranges of cell types known and used for displaying foreign proteins in extra- or intracellular domains (e.e. on the cell surface) . This allows the present method to avoid the biases of any one expression system. The polypeptide is preferably expressed in a membrane protein which is biochemically appropriate for the target compound or for the system wherein the isolated polypeptide/target pair is intended to be applied (e.g. as a vaccine or as a therapeutic) .
Combinatorial chemistry, which is one of the major components of the present invention, consists of two or three steps, often performed in iterative cycles. The first step is the generation of a diverse population of molecules, the so- called combinatorial library. In the second step, the molecules in the combinatorial library are partitioned according to their exhibition of properties sought by researchers, such as affinity for a particular compound or enzymatic activity. In a third step that is practiced in some but not all embodiments of combinatorial chemistry, the molecules found to exhibit the desired properties are replicated or resynthesized. The cycle may then be repeated, beginning either at the first step, with the introduction of new variation to the molecular population, or at the second step, with the partitioning of the population according to the exhibition of desired properties. When a combinatorial approach is successful, the cycle of generating molecular diversity, whittling down diversity in favor of the few molecules that display the desired properties, and, in some cases, amplifying the numbers of these molecules, ultimately yields molecules exhibiting the desired specific activities.
The immune system is essentially a genetically coded combinatorial library, complete with steps of variation, selection, and replication: A vast diversity of antibodies is displayed on the surface of B-cells (variation) , and specific recognition of antigen by any given cell triggers that cell's proliferation (partitioning and replication) . It is not surprising, then, that the first in vitro genetically-coded combinatorial libraries were directly appropriated from the immune system: B-cells were immortalized through hybridization to mouse myeloma lines (Marks et al . , 1991) . Later, in order to circumnavigate difficulties of hybridization, and to utilize a greater part of the antibody repertoire, E. coli were engineered to express and secrete the variable chains of antibodies. However, because secreted antibodies were "untagged" (Janda, 1994) -viz. not physically linked to their genetic codes-screens for antibodies that bound antigen had to be performed according to techniques that allowed researchers to trace from each selected antibody to the gene that encoded it. This severely limited the number of clones that could feasibly be screened. The advent of phage-display, or "fusion- phage", libraries made it possible to screen variable proteins that were "tagged" (Janda, 1994), and thus competent for immediate replication. Combinatorial libraries of fusion-phage are composed of filamentous bacteriophage or phagemid that have been engineered to bear semi-random exogenous proteins as sandwich fusions near the end of the adsorption protein pill (Janda, 1994, Marks et al., 1991) . The method of screening these libraries for phage that exhibit affinity for a target compound is known as "affinity purification": the target is immobilized on a solid surface or column, over which the phage library is washed; phage bound to the target are eluted, amplified by growth in bacterial culture, and then subjected to yet another round of partitioning. This cycle is usually iterated until a tractable number of clones are isolated. Phage libraries have been used for epitope mapping (Phizicky et al., 1995) -viz. pinpointing the region of a larger protein that is involved in binding a target compound of interest-as well as for deriving novel ligands to targets. Entire single- chain variable-fragment antibodies (scFv's) have been inserted into phagemid pill (Marks et al . , 1991), and libraries of scFv's or semi-random oligopeptides have been screened for binding to a variety of medically significant polypeptides, nucleic acids, and molecules not previously known to bind proteins .
However, while fusion phage technology has been responsible for significant strides forward in combinatorial chemistry - such as the recognition that "tagged" libraries would permit screening of a vastly increased number of sequences - phage libraries themselves have not escaped several besetting difficulties. First, in affinity purification of phage libraries, factors besides affinity for the target compound contribute significantly to each clone's selection. For example, the collection of clones selected from a single library by affinity purification can vary dramatically with the technique used for eluting phage from the solid substrate of target compound (Phizicky et al . , 1995) . Second, phage adsorption protein, pill, may fail to provide exhibited proteins with a sufficiently stable platform. Many polypeptides acquire structural rigidity only in certain constraining conformations; otherwise they remain extremely plastic. Therefore, to be functionally active structures, many polypeptides will require a foundation other than pill .
A promising solution to problems with the platform for polypeptide display was introduced when a combinatorial library of proteins was displayed in an extracellular loop of a bacterial outer membrane protein (Brown, 1992) . To construct this library, between 1 and 20 semi-random oligonucleotides, each one 33 bases long, were inserted into the genetic site encoding an extracellular loop of the outer membrane porin LamB. Like many membrane proteins that have been the subjects of topological (Boyd, 1994) and antigen presentation (Georgiou et al . , 1997) studies, LamB intercalates the outer membrane even when exogenous polypeptide sequences have been inserted into one or more of the protein's extracellular domains. Thus, when lamB genes bearing insertions were used to transform E. coli , each transformant expressed LamB m which one extracellular domain exhibited a semi-random, exogenous oligopeptide between 11 and 220 ammo acids long.
To screen the library for proteins exhibiting affinity for the target compound iron oxide, the E. coli population was placed in a suspension of iron oxide particles, binding was allowed to occur, and a magnet was used to recover bacteria that had adhered to metal (Brown, 1992) . The selected portion of the population was amplified by bacterial growth, and the partitioning procedure was repeated. After four cycles of selection and amplification, fresh variation was introduced according to a strategy that would not erase sequence- information accrued thus far. Another four cycles of selection and amplification then yielded four clones, three of which bore a consensus sequence for binding iron oxide.
While the combinatorial library m LamB introduced the promising possibility of using bacterial outer membrane proteins as platforms for presenting libraries of diverse polypeptides, the selection procedure applied to the library was labor-intensive and very limited in its applicability. The target compound, iron oxide, was conveniently chosen to allow screening for magnetism - a procedure that could not be applied to screening for the ma-jonty of interesting interactions that polypeptides exhibit. The present invention offers a user-friendly, efficient, and precise method of selection, capable of screening libraries of all sorts of polypeptides presented on the platform of a membrane polypeptide .
The membranes bounding any prokaryotic or eukaryotic cell, referred to here as the outer or inner membrane, present an extremely complex surface to the extra- or intracellular environment. These molecules - referred to here as membrane polypeptides - exhibit a vast range of distinctive conformational, chemical, and electrical properties. Surveyed elsewhere in greater detail (Singer, 1990) , the membrane polypeptides are here categorized into several structural families. The conformational and functional differences between these families are relevant to this invention because membrane polypeptides provide the underlying structures that are used to display exogenous sequences, and are selected for interaction with the target compound. Moreover, this invention often capitalizes on the intrinsic properties of membrane polypeptides by imprinting them to exhibit properties or perform functions that are similar to their natural ones. Finally, the deliniation of structural families presented here should also provide a profile of the vast range of properties exhibited by polypeptides on the cell membranes, for it is in part this range that makes the capabilities of this invention so broad.
Single-pass transmembrane polypeptides (reviewed in Singer, 1990) , also called bitopic polypeptides, have one intra- and one extra-cellular domain, and a midsection that passes through the membrane. The terminal domains fold into tertiary structures, while the transmembrane section often forms a hydropathic α-helix, in which no more than 2 of the 20 transmembrane residues may be ionic. Singer (Singer, 1990) divides bitopic polypeptides into types la, lb, and II (figure 1) . Types la and lb exhibit the amino terminus on the extracellular side of the outer membrane. Distinguishing type la is an amino terminal signal sequence that directs export to the membrane, and is cleaved before final membrane insertion. Type la polypeptides are ubiquitous, and in eukaryotes they are the predominant type. Because the transmembrane sequence of type la polypeptides is typically closer to the carboxyl terminus, the extracellular domain is often large. Consequently, type la polypeptides are competent for functions involving high-affinity adhesion or highly specific binding to especially extracellular targets. Among type la polypeptides are receptors for growth factors and for peptide hormones; immune system antigen recognition molecules; and molecules responsible for cell adhesion (Singer, 1990, Staunton et al . , 1990) . Proclivity for adhesion or highly specific binding to extracellular targets make type la polypeptides suitable for use in many embodiments of this invention, especially those in which a polypeptide is imprinted to bind to an extracellular target.
Type lb polypeptides do not have a cleavable signal sequence. The transmembrane stretch is typically close to the amino terminus, and therefore there are few extracellular residues (Singer, 1990) . This makes type lb polypeptides suitable for sophisticated cytoplasmic functions, and interactions, but poorly suited for interactions with targets in the extracellular medium.
Type II bitopic polypeptides are oriented with their carboxyl terminus on the exterior side of the membrane (figure 1) . As in type la bitopic polypeptides, the bulk of type II molecules is usually on the exterior of the cell. Type II polypeptides are typically enzymes with extracellular active sites, such as glycosyl transferases, or highly specific extracellular receptors, such as the asialoglycoprotein receptor (Singer, 1990) . Competence for extracellular enzymatic or receptor activity makes type II bitopic polypeptides suitable for use in certain embodiments of this invention, in which extracellular domains are imprinted to bind or enzymatically modify targets in the extracellular medium.
Multipass transmembrane polypeptides (reviewed in Singer, 1990), also called polytopic polypeptides, traverse the lipid bilayer in two to fifteen or more passes. Like thread stitched into cloth, the polypeptide begins on one side of the membrane, passes through it, loops into the space on the other side of the membrane, then once again traverses the membrane to expose either its end or yet another loop leading to more transmembrane passes (figure 1) . Although referred to and often illustrated as "a loop", each exposed region of a multipass polypeptide exhibits not only a unique sequence of amino acids, but also a distinctive tertiary structure. Like single-pass transmembrane polypeptides, multipass ones are suspended in the lipid bilayer by the hydrophobicity of transmembrane strands. However, in multipass polypeptides, these strands do not invariably twist into α-helices, because when multiple strands neighbor one another, they can adopt an alternative strategy for hydrogen bonding between peptide bonds: if each strand adopts the secondary structure of a β- strand, these strands can curl together into a closed cylinder, allowing hydrogen bonds to form between the peptide bonds on separate strands. The resultant tertiary structure is called a β-barrel (figure 1) . It is extremely stable, and can be released from the outer membrane without denaturing tertiary structure. In several preferred embodiment of this invention, the extremely stable, polytopic β-barrel OmpA is imprinted to bind to various targets.
Polytopic polypeptides that traverse the membrane in α- helical conformation often function as specific receptors for messengers or hormones, such as yeast mating pheromone and many others. These and other α-helical polytopic receptors are categorized by Singer (Singer, 1990) as type III polypeptides. Their capacity to bind target molecules specifically, and sometimes multivalently, makes α-helical polytopic receptors ideal for use in many embodiments of this invention. Moreover, selection systems according to the present invention are easy to construct when the membrane polypeptides involved in the selection are polytopic.
Many prokaryotic and eukaryotic cell-surface polypeptides are fixed to the lipid bilayer not by a hydrophobic transmembrane strand, but rather by a glycolipid anchor (reviewed in Cross, 1990) (figure 1) . An anchor consists of a lipid molecule, such as dimyristyl glycerol or alkylacyl glycerol, adorned by a multi-residue sugar complex, which is, in turn, covalently linked to the membrane polypeptide. Among eukaryotes, two signaling elements of a protein direct the addition of a glycolipid anchor: a well-defined ammo terminal signal sequence, which enables transport into the lumen of the endoplasmic reticulum, and a carboxy terminal glycolipid- addition sequence, which is not precisely defined but generally consists of a short hydrophobic domain located 10-12 residues carboxy terminal to a pair of small residues (Cross, 1990) . Once appended to the carboxy terminus of a protein, the glycolipid anchor itself can direct polypeptide insertion m a targeted region of the outer membrane.
Lacking an intracellular domain, glycolipid-anchored outer membrane proteins cannot perform cytosolic or periplasmic functions. Instead, they provide protection from the extracellular environment, or function as enzymes or adhesms recognizing exclusively extracellular substrates. At least six leukocyte surface proteins, many other eukaryotic proteins belonging to the immunoglobulin superfamily (Cross, 1990) , and the trypanosome variable surface glycoprote (the protective protein that the trypanosome sequentially alters to avoid detection by the host immune system) are anchored by glycolipids. These examples illustrate the characteiistics of glycolipid-anchored proteins that are important to their use embodiments of this invention: capacity for highly specific adhesion and plasticity that allows variation of the anchored protein without disturbing a robust system for export to and insertion the outer membrane.
Among eukaryotic cells, the diverse array of surface- exposed proteins is adorned by yet another tier of complex molecules with distinctive properties, namely, the sacchaπdes (reviewed Lis et al . , 1993). The oligosaccharides adorning glycoprote s generally consist of less than 15 residues. However, owing to the diversity of residues and covalent linkages that permit assembly into branched as well as linear structures, oligosaccharides exhibit thousands of distinct structures. The polysaccharides of integral membrane proteoglycans are larger than oligosaccharides, and can contain dozens of residues.
The oligo- or polysaccharide molecule attached to a particular site in a polypeptide is not a simple function of the local ammo acid sequence. The same polypeptide sequence receives distinct saccharide chains when expressed in different cells, and separate copies of a polypeptide expressed in a single cell type can have different sugars attached at a given site. Because sugar residues are sequentially appended, each by a particular enzyme to which the protein is exposed in the course of the processes of translation, folding, and export to the outer membrane, the glycosylation at a site is largely a product of the extent and duration of the site's exposure to each enzyme. Exposure of a site is, in turn, affected by the way a protein folds and the rate at which it moves through the steps from translation to insertion. Consequently, the glycosylation' at a site can be altered in unpredictable ways by changes to the local sequence or to the sequence elsewhere in the polypeptide (Lis et al . , 1993) .
Diversity of molecular structure and relatively complete exposure to the extracellular environment make the saccharides appended to proteins effective receptors of specific substrates outside the cell. Glycoproteins and proteoglycans are involved in the intercellular recognition and adhesion underpinning sperm-egg interactions, blood clotting, and inflammation (Lis et al., 1993, Singer, 1990). The roles they play, via adhesion, in these important processes make the glycoproteins and proteoglycans ideal for use in certain embodiments of this invention.
Therefore, in a preferred embodiment of the present invention, the membrane protein is an α-helical polytopic receptor, a glycolipid-anchored polypeptide, a polytopic integral membrane polypeptide, a membrane-bound isoform of a secreted polypeptide, a bitopic integral membrane polypeptide, a glycoprotein or a proteoglycan .
It is significant to some embodiments of this invention that insertion of polypeptides into the membrane and translocation of secreted polypeptides across the membrane are performed by the same cellular machinery (Singer, 1990) . Among eukaryotes, integral membrane polypeptides inserted into the endoplasmic reticulum (ER) have the same cleavable or noncleavable amino terminal signal sequences as completely translocated polyeptides. This is consistent with the fact that insertion and secretion depend on the same cytoplasmic signal recognition protein (SRP) , which transfers both inserted and secreted polypeptides to the SRP receptor on the ER membrane. Similarly, among bacteria, membrane insertion and translocation seem to require the same gene products, namely secA and secY (Singer, 1990) . Among prokaryotes and eukaryotes alike, there are many examples of secreted and membrane-bound polypeptide lsoforms. Often the only difference between the isoforms is the existence of a hydrophobic stretch of residues or a glycolipid attachment site near the carboxy terminus of the membrane-bound polypeptide (Smger, 1990) .
The process of type III polypeptide intercalation into the membrane also has implications that are significant to this invention. The cellular machinery that inserts bitopic polypeptides and translocates secreted ones is also responsible for intercalating α-helical multipass polypeptides into the outer membrane. This is clear from studies that show that l) type III polypeptides sometimes have the same signal sequences as bitopic and secreted polypeptides, and n) membrane intercalation of type III polyeptides requires the same gene products as insertion of bitopic polypeptides. Several experimental approaches, inducting gene fusion, have shown that the cellular machinery for inserting polypeptides in the outer membrane intercalates most type III polypeptides by translocating one hydrophilic extracellular loop at a time. The polypeptide is thus intercalated as a series of independently inserted segments (Smger, 1990) .
Most embodiments of this invention require a topological map of the imprinted polypeptide, and a technique for introducing new sequences to specified extracellular domains of the imprinted polypeptide. Both of these needs are met by gene fusion techniques.
A vast variety of polypeptides, especially the polypeptides reviewed above have already been topologically mapped using gene-fusion techniques. In these studies, the gene for a membrane polypeptide was ]omed to a gene coding for a "reporter", a protein that is functionally active only when it is located on a particular side of the cell membrane. This genetic construct coded for a hybrid polypeptide in which the membrane polypeptide bore the reporter moiety as a mid- sequence insertion or as a C-terminal extension. Provided the foreign moiety disrupted neither sorting of the polypeptide to the appropriate membrane, nor insertion of the polypeptide into the membrane, the reporter was localized in the same way as the immediately adjacent region of the membrane polypeptide. This region's location on the exterior or interior of the membrane was then inferred from the reporter's activity, or lack thereof. The entire polypeptide ' s topology was deduced by assaying reporter activity in a set of hybrid polypeptides, each of which bore the reporter at a different place in the membrane polypeptide ' s sequence (these gene- fusion mapping techniques are reviewedin Boyd, 1994). Not only has this approach provided topological maps of most of the polypeptides commonly used in gene technology, especially the proteins described above; it can also be used to determine the topology of any membrane polypeptide that has not yet been mapped, but is a candidate for imprinting by this invention.
In addition to a topological map of the imprinted polypeptide, each embodiment of this invention also requires a technique for inserting exogenous peptide sequences into one or more extra- or intracellular sites in the imprinted polypeptide. Alternatively, exogenous peptide sequences may replace existingsections of the extra- or intracellular domains of membrane.
Because gene-fusion topology studies used a set of hybrid polypeptides, each bearing the reporter in a region whose location relative to the membrane was to be mapped, several techniques were developed to enable insertion of the reporter gene at precisely specified sites in the membrane protein gene. These techniques include the following: i. the use of vectors with restriction sites that occur at unique locations in the membrane protein gene; ii. an oligonucleotide-directed deletion mutagenesis method; and iii. polymerase chain reaction methods (Boyd, 1994) . Each of these techniques can be used in this invention to insert exogenous sequences into a specified site in the imprinted polypeptide; an extension of these techniques, which is used in this invention to introduce new variation to extra- or intracellular domains, is described below.
Besides providing topological maps and techniques for inserting exogenous peptide sequences, gene-fusion studies have collectively demonstrated that introduction of an exogenous sequence of peptides to an extra- or intracellular domain of a membrane polypeptide does not interfere with the sorting of the polypeptide to its appropriate membrane, or with the polypeptide ' s normal insertion into the membrane. This capacity of extra- or intracellular domains to accommodate foreign oligopeptides is critical to the construction of combinatorial libraries of exogenous oligopeptides m the extra- or intracellular domains of membrane polypeptides. This capacity is also, therefore, critical to certain embodiments of this invention. It has been observed m various bacterial, yeast, and mammalian polypeptides, including the examples described the present specification.
An extracellular domain of lamB, an outer membrane porm of E. coli , has been shown to accept exogenous polypeptides up to 165 ammo acids m length without disturbing normal sorting or insertion of the protein (Brown, 1992) . Foreign sequences have also been inserted into extracellular domains of E. coli PhoE (phosphate-limitation- ducible outer membrane pore) , and FhuA (ferrichrome uptake receptor), and the insertions had no effect on protein sorting or insertion (Georgiou et al.,1997). In the specific case of E. coli OmpA, one of the most abundant polypeptides of the bacterial outer membrane (Morona et al . , 1985) , oligonucleotides rich m restriction endonuclease sites were inserted into the ompR gene at sites corresponding to protein loops II and IV, and the altered protein was expressed and exhibited normally (Freudl, 1989) . Subsequently, influenza haemaglutmm, beta-galactosidase, foot and mouth disease viral protein 1 (FMDV-VP1), malaria antigens, and several other antigenic polypeptides up to 56 ammo acids in length were successfully displayed m OmpA loops III and IV (Georgiou et al., 1997). Furthermore, two lines of evidence demonstrated that these exogenous polypeptides adopted stable folded structures. First, ompA fusion proteins exhibiting viral antigens were shown to elicit intense virus-specific immune response in rabbits (Georgiou et al . , 1997) . Second, Cex exoglucanase displayed on an extracellular domain of OmpA hydrolyzed cellulose as effectively as conventionally immobilized enzymes (Francisco et al . , 1993).
In yeast, extracellular domains of a number of membrane proteins accept exogenous peptide sequences without exhibiting disturbance to normal sorting and insertion (Boyd, 1994, Boder and Wittrup, 1997). Higher eukaryotic membrane polypeptides have also exhibited the capacity to accept foreign inserts at extracellular domains. When a polypeptide is expressed in eukaryotic cells, gene fusion studies may insert extra glycosylation sites to function as reporters: only polypeptide domains on the luminal side of the ER are exposed to glycosylating enzymes, and luminal domains become extracellular domains. Extra glycosylation sites were inserted into extramembranous regions of mouse muscle nicotinic acetylcholine receptor: the inserted sites did not disrupt normal sorting or insertion of the outer membrane polypeptide; and, sites inserted in extracellular receptor domains were in fact glycosylated (Chavez et al . , 1991).
In summary, gene fusion mapping techniques contribute to this invention in four ways. First, these techniques have been widely used to determine the topology of countless outer membrane polypeptides, many of which are described above and may be imprinted by this invention. Second, these techniques may be used to provide topological maps for polypeptides that have not yet been mapped, but may be candidates for imprinting by this invention. Third, gene fusion techniques provide this invention with commonly practiced methods for inserting an exogenous peptide sequence into a specified region of a membrane polypeptide; the use of these techniques in the invention, as well as the straightforward extension that enables the insertion of variable sequences, is described below. Fourth, gene fusion topology studies have demonstrated that the introduction of an exogenous sequence to an extra- or intracellular domain of many mapped polypeptides does not interfere with the polypeptides' export to or insertion in the membrane . Certain embodiments of this invention involve alteration of a polypeptide ' s mode of attachment to the outer or inner membrane. Because insertion into the membrane and translocation across the membrane are handled by the same cellular machinery, secreted, transmembrane, and glycolipid anchored isomorphs of a single polypeptide are often interconvertible. Moreover, secreted and membrane-bound isomorphs can use the same amino terminal signal sequence, so in many cases, the only alteration required for conversion of a polypeptide to its isomorph is a change at the polypeptide ' s carboxy terminal. Attachment of a 20 residue hydrophobic sequence to the carboxy terminal of a secreted polypeptide is often sufficient to construct an isomorphic polypeptide that is anchored by a transmembrane span; this technique for converting a- secreted polypeptide into a membrane-bound one is described in the literature (Singer, 1990) . Alternatively, transmembrane or secreted polypeptides can be converted to glycolipid anchored variants by addition of carboxy terminal signal sequences from the precursors of glycolipid anchored polypeptides; this technique is also described in the literature (Cross, 1990). In some cases, a glycolipid anchor not only attaches an exogenous polypeptide to a cell membrane, but also directs sorting of the anchored alien polypeptide. In such cases, a single glycolipid anchor sequence is all that is required to direct an exogenous polypeptide to the membrane, translocate it, and anchor it (Cross, 1990) .
In certain embodiments of this invention, two or more outer or inner membrane polypeptides are fused to construct a chimeric multipass polypeptide comprising extra- and/or intracellular domains of both constituent polypeptides. This manipulation has been successfully performed with a variety of prokaryotic and eukaryotic polypeptides. For example, in E. coli ompF-ompC chimeric protein 3601 (Fourel et al., 1990), the region of ompF comprising extracellular loops 1 through 5 is fused to the region of o pC comprising extracellular loops 6 and 7. When expressed in E . coli SM1005 ( F~ D2acU169 rpsL relA t iA x~i.bB gyrA ompC ompF) , this chimeric polypeptide is properly inserted in the outer membrane (Fourel et al . , 1990) . In a striking example of the construction of eukaryotic chimeric polypeptides, novel transmembrane polypeptides were created as follows. A region of the human asialoglycoprotein receptor HI, which included the polypeptide ' s transmembrane domain (20 hydrophobic residues, 40-61 in the wild-type protein), was tandemly repeated two, three, or four times. In each case the HI repeats were joined by relatively hydrophilic linkers, such as the hexapeptide HRPSLG. These constructs were normally inserted into membrane, and luminal domains were normally glycosylated.
The general principle that often makes fusion of membrane proteins possible has already been outlined: the above description of the process of membrane protein insertion emphasizes that α-helical transmembrane strands of polytopic membrane proteins are inserted sequentially and independently of one another. An implication of this mechanism of intercalation of transmembrane strands is that almost any chimeric polypeptide that is constructed of an amino-terminal signal sequence followed by multiple @20-residue-long hydrophobic stretches, alternating with stretches of more hydrophilic residues, can be normally inserted, provided each hydrophobic stretch is independently stable in the membrane.
In a preferred embodiment of the present invention, the extra- or intracellular domain of the membrane polypeptide contains a highly variable region, which is, upon binding to a target compound, identified and isolated as the polypeptide of interest or a section thereof. With the gene technology methods described above, in combination with combinatorial genetics, the skilled man in the art can make a library of cells displaying a huge variety of variable regions, which can be identified and isolated with the present matter in a very short time. It is possible to provide the gene for the polypeptide of interest simultaneously with the polypeptide of interest. The highly variable domain may comprise polypeptides encoded by a cDNA library or sections thereof.
The membrane polypeptide can be a homologous protein for the cell (adapted for the present method) or - which is preferred - a heterologous polypeptide for the host cells. Preferably, the membrane polypeptide is genetically engineered to contain the variable domain and the selection site residing adjacent to each other on the cellular surface.
The target compound can be any material, e.g. chemically produced substances, proteins, nucleic acids, complex products derived from biological sources or mixtures or (combinatorial) libraries of these materials. Preferred substance classes for the target compound are known drugs (or derivatives thereof) (e.g. to investigate further binding patterns of such compounds to polypeptides) , polypeptides (to identify and isolate protein/protein interactions) , antibodies or antigens, vaccines or pathogens, etc..
If the present invention is performed using an intracellular domain of the membrane protein, i.e. the polypeptide of interest is displayed at the inner surface (including the periplasma) , it is essential for the target compound to be present inside the cell. This can be achieved by transportation or diffusion of the target compound from outside into the cell (which is, however, only possible with a few or small target compounds. An alternative way of bringing also the target compound into the cell is by coexpression of the polypeptide of interest and the target compound the same cell. Systems to enable such a coexpression of two (or more) exogenous proteins one and the same cell are known m the art. It is clear that this way of performing the invention is restricted to target compounds that can be produced by the host cell, e.g. proteins or nucleic acids.
For selection of the interaction between target compound and polypeptide of interest, any suitable selection system known m the art may be adapted to the method according to the present invention. These selection systems are known to the skilled man in the art in principle, the adaption of any suitable selection system to the system according to the present invention will be explained below.
A critical point for the present invention is how the binding of the target compound with the polypeptide of interest influences the selection site: According to the present invention, the binding between the polypeptide of interest-portion of the membrane polypeptide and the target compound results m a signal which influences the status of the selection site. This influence may be a direct or an indirect interaction of the membrane polypeptide or the bound target compound with the selection site. A direct interaction may e.g. be performed by a steπcal hindrance of the selection site upon binding of the target molecule, i.e. the site, where the selection agent should bind, is not accessible anymore, because of the physical presence of the target molecule. Alternatively, such a sterical hindrance may be induced by a conformational change of the membrane polypeptide upon binding of the target compound to the polypeptide of interest-portion of the membrane polypeptide. Preferably, the selection site is present m the vicinity of the membrane polypeptide, e.g. by covalent bond to the membrane polypeptide. The selection site can also be a part of the membrane polypeptide itself. In such a case, the sterical properties of the system are clearly defined and easily adjustable by common protein engineering.
Therefore, a preferred embodiment of the present invention is a method, wherein the interaction between the extra- or intracellular domain with the polypeptide of interest and the selection site is a sterical interaction, especially a sterical interaction which does not allow the selection agent to interact with the selection site after binding of the target compound to the polypeptide of interest. Preferably, the selection site is immediately adjacent (especially via covalent bonds) to the extra- or intracellular domain of the polypeptide of interest.
In a preferred embodiment of the present invention, the selection site is an adhesion domain for the selection agent, e.g. an adhesion domain which is also part of the membrane polypeptide displaying the polypeptide of interest.
In another preferred embodiment, the membrane protein exhibits an additional domain on the surface of the cell, which additional domain is adjacent to both, the variable domain and the selection site and this additional domain also interacts, preferably by binding, with the target compound.
Indirect interaction can be performed e.g. by messenger molecules, which are released upon binding and induces a change of the selection site by binding or by contact. It is essential that the change in selection specificity of the cell is in direct correlation to the binding of the target compound to the polypeptide of interest, i.e. the binding immediately induces the (direct or indirect) interaction with the selection site.
The mode of selection in the method according to the present invention is not critical, positive and/or negative selection protocols are applicable. In a preferred embodiment, the selection agent is a toxic agent.
Many of the polypeptides of the (outer) membrane are ports of adhesion for an array of toxic agents that kill or retard multiplication of the cells they enter. For the purpose of the present invention, the term "toxic agent" denotes any agent that impairs or inhibits growth of the cells. Most of the classical selection systems applied molecular biology fall under this type of selection system. Due to its nature, this selection system is preferably applied the „extracellular" embodiment of the present invention, wherein the polypeptide of interest is located m an extracellular domain of the membrane polypeptide.
Many toxic agents recognize a specific region (the adhesion domain) of a specific polypeptide (the receptor) on the surface of a specific cell type (the host) . Binding by the agent to the adhesion domain is the first step m a process that culminates in cell lysis or inhibition of cell growth and multiplication. If this first step is impeded by mutations m the adhesion domain, by repression of the receptor, or by the interference of an inhibitor occluding the adhesion domain, a selection agent's invasion of the cell is blocked. When all of an agent's possible receptor molecules on a cell are, for whatever reason, inaccessible or dysfunctional for agent entry, the cell is rendered immune. When some but not all receptors are inaccessible or dysfunctional, the cell may become resistant but not immune: reduction of the number of receptors may slow the entry of agents, and thus reduce the rapidity or intensity of the agent's harmful effects.
For the purpose of the present invention, a selection agent may be employed that has been alterered such that it is different from the corresponding naturally occurmg agent with regard to specificity and mode of action on the cell. For example, pathogenic agents like bacteriocins and bacteriophages - for which the above mentioned pathway via an adhesion domain is quite common - can be altered such that they change their preference for a specific adhesion domain. Thus it is easily possible for the man skilled the art to design strategies to make pathogens adhere to a great variety of adhesion domains.
In a preferred embodiment of the present invention, the selection agent is a pathogenic agent, such as a bacteriocm, a bacteriophage or a toxin or virus of an eukaryotic cell.
Bacteriocins are bacterially produced proteins that are harmless to the strains that express and secrete them, but fatal to other, often closely related, bacteria (bacteriocins are reviewed m Pattus et al., 1990) . The most extensively studied of the bacteriocins are the colicins, which are expressed by E . coli , and are encoded by plasmids that also code for the proteins that confer colicm immunity upon colicm-producmg cells. The colicins are composed of three structural domains, each of which is responsible for one of three consecutive steps of colicm activity: l. recognition of an adhesion domain; n. translocation across the cellular membrane; and in. cell destruction (Pattus et al . , 1990).
The colicins kill by two principal mechanisms: l. the formation of a pore m the cytoplasmic membrane, resulting m membrane permeabilization and de-energization (colicins El, A, B, la, lb, and K) ; n. enzymatic cleavage of DNA or 16S ribosomal RNA (colicins E2 and E3) (Pattus et al . , 1990). The kinetics of cell death when bacteria are exposed to colicins suggest that the toxic molecules enter the cell in discrete, fatal doses, which may comprise only a single molecule, but may also consist of a number of active molecules imported one concerted step (Pattus et al . , 1990) . The important point here is that a single dose kills a single cell.
Almost every major outer membrane protein of E. coli is the receptor for at least one colicm. When multiple colicins recognize a single receptor, their adhesion domains may overlap partially. Two colicins that adhere to OmpF, a major outer membrane porm, furnish an example. As is illustrated m figure 2, OmpF exhibits seven extra-cellular loops, LI through L7, where LI is ammo proximal and L7 is carboxy proximal. The adhesion domain of colicm N includes LI, L3, L4, L5, and L6, but not L2 or L7. Thus, alterations m the peptide sequence of LI result in total immunity to colicm N, whereas alterations to L2 do not confer even slight resistance. The adhesion domain of colicm A, another colicm that recognizes ompF, is somewhat different. Loops 1 through 5 are required for colicinA to bind its receptor, while L6 and L7 are superfluous (Fourel et al., 1990). The chimeric ompF-ompC protein 3601, which was described above, comprises loops 1-5 from ompF and loops 6 and 7 from the distinct ompC . As would be expected, E. coli SM1005 expressing 3601 are totally susceptible to colicm A, but totally immune to colicm N.
The number of receptors per cell is important for an easy handling of this invention's selection of specific cells from a highly variable population. The functional receptors of colicins are generally present on the surface of the cell thousands of copies. However, it is important for an optimal performance of this invention that the level of expression of most receptors is easily manipulated: the gene encoding the receptor has already been, or may easily be, cloned into a plasmid with an mducible heterologous promoter, permitting external regulation of expression. Protocols for the construction and manipulation of plasmids with mducible promoters, permitting regulation of expression, are described m the literature (Brown, 1992, Freudl, 1989) . Protocols for the construction and manipulation of plasmids with mducible promoters, permitting regulation of expression are available to those practiced m the art.
Bacteriophages are a ubiquitous and extraordinarily diverse group of viral parasites of bacteria. There are twelve families of phage, comprising 3500 distinct isolates of known morphology, and infecting over 100 bacterial genera. Their vast variation is evident even at the level of nucleic acid: depending on species, phage contain single stranded DNA, double stranded DNA, single stranded RNA, or double stranded RNA. A wide range of morphologies is usually divided into four categories: tailed; cubic; filamentous; and pleomorphic. Extensive reviews of morphology, genetics, ecology, and life- cycle are available in the literature (Calendar, 1988).
Very much like bacteriocins, phages bind to specific adhesion domains of receptors on the host surface. A single outer membrane protein may be the receptor for multiple phage species, whose adhesion domains overlap to varying extents. Figure 3 shows the adhesion domains of various phages, as well as one of the colicins, that utilize E. coli outer membrane protein A as their receptor. The figure also shows the receptor domains that are manifestly not utilized by the phages: alterations to these regions of the receptor do not affect phage adsorption, and thus do not confer phage- resistance to the bacterial cell. Phage receptors, like those of bacteriocins, are often present m 100s to 1000s of copies per cell, but this number can be manipulated through use of a plasmid-encoded receptor regulated by an mducible heterologous promoter. As was stressed above, regulation of the level of expression of the imprinted polypeptide is important to this invention. Protocols for the construction and manipulation of plasmids with inducible promoters are described m the literature.
Following binding to the receptor, the viral nucleic acid enters the bacterial host, and phage multiplication commences. The strategies for multiplication, and the effects of multiplication upon the host, vary widely across phage species. Immediately following infection, some but not all species of phage confer immunity to subsequent infections by other individuals of the same phage species. If the infecting phage is a virulent one, it quickly modifies normal bacterial syntheses in order to appropriate them to its own multiplication and release into the extracellular space. Distinct strategies for release have very different effects upon the host. The Inoviridae (a family of filamentous phages) are liberated by extrusion: new phage are secreted through the bacterial membrane (Calendar, 1988). While this process is not lethal to the bacterium, it is metabolically taxing, and therefore infected bacteria divide at a slower rate than un fected bacteria. Transmission also takes place vertically, as daughter cells inherit the infection from their parents. A strategy for release similar to extrusion is exhibited by the Plasmaviridae (a family of pleomorphic phages) . These phages, which consist only of a double stranded DNA molecule surrounded by a lipoprotem envelope, are liberated from the host by budding from the cellular membrane. Again, the process is not lethal but retards multiplication. By contrast, lysis, a means of release for tailed and cubic phages, destroys the host. After intracellular phage multiply to numbers of 20- 1000, they weaken the bacterial cell from within, causing it to burst and liberate the infective particles (Calendar, 1988) .
The bacteriophages exert such a wide range of effects upon their hosts that even among phage species using the same receptor there may be several different consequences of infection. Moreover, the effects of a phage upon its host can be manipulated. Mutagenesis followed by artificial selection for phage virulence or benevolence can intensify or mitigate a phage ' s deleterious effect on its host. Insertion or deletion of phage genes, including insertion of genes that confer metabolic capabilities or antibiotic resistance upon infected bacteria, are commonly practiced means of manipulating the impact that phage have upon their bacterial hosts. Another way to modify a phage ' s effects upon its host is to alter the host. For example, certain groEL mutations m E . coli produce bacterial strains that are susceptible to T4, T5, lambda, and 186 phage infection, but do not permit the intracellular assembly of mature phage particles (Zeilestra-Ryalls et al . , 1993) . Thus, infected bacteria are retarded, or even killed, but they do not yield infectious phage particles; m short, these bacteria are a dead-end for the phage that infect them. Finally, the effects of phage upon their hosts can be modified by the preparation of phage ghosts (phage depleted of DNA) , by methods described the literature (Boulanger et al . , 1988) . Because they do not carry a genome, phage ghosts do not infect cells. They do, however, adhere to cells, pierce the cytoplasmic membrane, and form ion channels, which would normally serve to transport the phage genomes. Whereas normal phage close these channels behind them, phage ghosts leave the channels open indefinitely. The rate ion efflux through these channels in linearly related to the number of ghosts that bind to the cell; gradually, the efflux depolarizes the cell, which eventually dies .
Toxins and viruses of eukaryotic cells recognize specific adhesion domains of their receptors on eukaryotic cell- surfaces. The receptors of at least thirty different species of virus of eukaryotic cell lines, representing eleven different viral families, are described in the literature, and for most of these pathogens, the viral adhesion domains, which may comprise oligosaccharides, glycosylated peptide sequences, or sequences unadorned by sugars, have been characterized (Tyler et al., 1996). The receptors of eukaryotic pathogenic agents, like those of bacteriocins and bacteriophage, are commonly cloned onto vectors and expressed from inducible promoters, allowing manipulation of the number of receptors expressed per cell (Staunton et al . , 1990).
Major group human rhinoviruses (HRVs) provide an example of the pathogenic agents that affect eukaryotic cells and are suitable for use in this invention. HRVs are members of Pi cornaviridae, small protein-encapsidated viruses with RNA genomes. Of 100 serotypes of rhinovirus, 89 use an extremely well-characterized glycoprotein, ICAM-1 (CD-54) as their receptor. HRV serotype 14 is well studied , easily propagated in the lab, and therefore suitable for use in this invention (Staunton et al . , 1990, Couch, 1996). Major group HRVs grow efficiently in many different lines of human cell culture. The most commonly used tissues are WI-38 diploid fibroblast, fetal tonsil, MRC, HeLa and COS cell lines. Optimal conditions and protocols for growth are described in the literature (Couch, 1996) . HRVs form plaques on monolayers of a variety of cell lines exposed by any one of the described techniques for exposure to viral agents. Infected cells yield 10 to 200 plaque forming units per cell after 10 to 12 hours of incubation (Couch, 1996) .
The structure of HRV14 has been solved by X-ray crystallography (Staunton et al., 1990) . The capsid is 30 n in diameter, exhibiting icosahedral symmetry and a canyon, 12- 30 A wide and 25 A deep, that runs along the 5-fold axis and contains the pathogen's receptor binding site. The receptor is the well-characterized member of the immunoglobulin superfamily, ICAM-1 (CD-54), a 532 residue, 19nm x 2-3 nm rod comprising 5 immunoglobulin-like domains arranged in a line with a hinge between domains 2 and 3 (figure 4) . Viral binding involves insertion of the distal immunoglobulin domain (Dl) into the viral canyon. This mode of interaction is extremely vulnerable to disturbance by steric inhibition (Staunton et al . , 1990). Thus, although the lower three immunoglobulin-like domains (D3-D5) are not directly involved in viral binding, and mutations within them do not disturb adhesion, their complete deletion does inhibit adhesion by reducing accessibility of Dl .
ICAM-1 bears multiple N-linked oligosaccharides, and variation in glycosylation between cell-types results in ICAM- l's molecular mass heterogeneity of 76 to 114 kD. It functions as the cell-surface ligand for lymphocyte function-associated antigen 1 (LFA-1), and binding between ICAM-1 and LFA-1 contributes to leukocyte adhesion in several immunological and inflammatory processes (Staunton et al . , 1990). Cytokines secreted by T-lymphocytes and monocytes, including IFN-g, TNF- a and b, and IL-a and b, induce expression of 3.5 x 10 6 copies per endothelial cell. However, uninduced cells exhibit much lower levels of ICAM-1, and in the CDM8 expression system,
ICAM-1 can be exogenously regulated through control of an inducible promoter (Staunton et al . , 1990) which makes this system excellentely suitable for the function of this invention.
In a preferred embodiment of the present invention, the method is carried out in the form of a high-throughput screening assay wherein said cell population is exposed to a multiplicity of target compounds from which an unknown binding partner for a polypeptide of interest is to be identified and isolated. The target compound in this embodiment may be a single definite compound, a mixture of compounds or, to have a large variety of binding possibilities tested, a (combinatorial) library of compounds .
Such a high-throughput assay is especially suited for screening of organisms for which comprehensive or complete genomic information is available, such as Mycobacterium tuberculosis or Helicobacter .
The method according to the present invention is usually carried out by performing steps a) to f) as described above. It may be necessary to repeat the selection process, e.g., with an increase in the selection pressure. This is suitable if, e.g., the number of isolated, positive clones is too large to handle.
In such cases, it is preferred to repeat steps b) through e) at least once, preferably comprising an increase in the selection pressure in the repeated steps.
The method can be carried out by steps b) through e) until the cell population comprises a desired number of distinct clones .
The term "background" is used to refer to cells that fail to bind the target compound but nonetheless remain in the selected population. In general, it is desireable to minimize the number of background cells, so that CISTEM reliably eliminates cells whose imprinted domains fail to bind the target compound, while selecting those cells whose imprinted domains do bind the target. In one strategy for the reduction of background, the CISTEM cell population is supplemented with "helper cells", whose function is propagation and amplification of the selective agent. This amplification of the selective agent increases the intensity of selection, and thereby reduces residual background cells. The helper cells lack a selectable marker that is present in the rest of the CISTEM cell population; this permits efficient elimination of the helper cells after they have served their purpose of amplifying the selective agent.
Supplemental cells may also be used in other ways to regulate the action of the selective agent. For example, to stop the action of the agent, heat-killed cells bearing the agent's receptor may be added to the CISTEM cell population. These cells bind the agent, removing them from selective activity in the remaining CISTEM cell population.
When repeating the steps, new sequence variations can be introduced in the variable regions of clones selected in step d) before steps b) to e) are repeated. The screening test may also be (additionally) performed by functional parameters. In a specific embodiment of the present invention, the polypeptide of interest binds to the target compound and affects its function. Then, the polypeptides exhibited by such clones selected in step d) and amplified in step e) are screened by their ability to affect the function of the target compound.
An additional step of screening is possible, if the polypeptide of interest exhibits an enzymatic activity. In such a case, the method according to the present invention comprises a step wherein the cell population is placed in a nutrient medium in which each cell's supply of a vital compound depends upon that cell's variable region acting enzymatically upon a compound structurally similar, but not identical to the target compound .
This invention provides a method for identifying and isolating a polypeptide of interest or a target compound or a gene of said polypeptide or target using the detection of the interaction between a polypeptide of interest and a target compound. In this interaction, at least one of the interactants is unknown. The method allows for identification of the unknown interaction partner, i.e. a polypeptide of interest and/or the target compound. For some embodiments, both interactants may be unknown, e.g. when an unknown ligand, upon binding at a receptor site, exerts a certain effect on the cell which needs to be antagonized or interfered with and both the ligand and the receptor domain to which it binds need to be identified. (The words "identify" and "identification" are used here, as in the rest of the specification, to denote both the discovery of polypeptides of interest that exist in nature, and the design of polypeptides that have no natural occurrence.) Embodiments of this method are called CISTEMs, an acronym for Cellular Imprinting by Selection: Teleological Evolution of Molecules. A CISTEM applies selective pressure to a population of genetically engineered cells in order to amplify the frequency of cells in which a membrane polypeptide shows prescribed properties. This process is - as stated above - termed "imprinting" the polypeptide. The range of polypeptides that may be imprinted by this invention is broad, comprising proteins, glycoproteins, proteoglycans, etc., presented in a wide range of sizes, on a diverse array of underlying structures, and by any one of several prokaryotic or eukaryotic cell cultures (figure 1) .
The terms „interaction" and „interact" for the purpose of the present invention denote any action of the target compound upon the polypeptide of interest (in the extra- or intracellular domain of the membrane protein) and that alters or occludes the selection sitesuch that the selection agent is unable to utilize said selection site.
In the specific embodiment wherein the selection site is located on an adhesion domain, the binding of the target compound prevents the action of the selection agent via intramolecular interaction (in the membrane polypeptide), e.g. by occluding the selection agent from its adhesion domain. This may comprise binding, adhesion, structural changes or chemical modifications of the variable domain which has an impact on the adhesion domain. Such impact on the adhesion domain may, inter alia, be a conformational change of the adhesion domain that prevents the harmful action of the selection agent.
The invention is useful in at least three general areas. First, the invention may be employed to identify polypeptides with properties that are useful in medical diagnostics, therapeutics, research, laboratory procedures, or other applications. Among the polypeptides that may be identified by the method disclosed here are polypeptides that interact with, in particular bind with high affinity and specificity to, virtually any given target compound, polypeptides that adhere to their respective target compounds at multiple epitopes, polypeptides that bind their respective target compounds but show no affinity for compounds closely related to the targets, enzymatic polypeptides, and polypeptides that show biological activities, such as antagonism or agonism of a receptor. Second, by identifying polypeptide sequences that interact specifically with a target that is a known compound of significant interest, this invention may assist in the identification of the known compound's native interactants. Third, this invention may be used to analyse the interaction between a known polypeptide of interest and a known target molecule. By identifying which sequences in the polypeptide of interest interact with the target molecule, the invention assists in mapping the polypeptide epitopes that play a role in the interaction. In a further embodiment, the method of the invention may be used to screen a population of target compounds and to identify those compounds that bind to a known polypeptide. In this embodiment, cells displaying a given section in the extra- or intracellular domain of the membrane polypeptide are used in a screening assay as test cells to screen for e.g. pharmaceutically active compounds that interact with the polypeptide of interest.
This invention may also be used to identify polypeptides with more sophisticated properties of interaction with the target compound than prescribed binding affinity. Among the molecules that may be identified by this method are the following: polypeptides that exhibit multivalent binding; polypeptides that bind tightly to one molecule but show no affinity for a second, closely related molecule; polypeptides that affect the functional activity of the target compound to which they bind; and enzymatic polypeptides.
The invention is based on the interaction of three CISTEM components: (1) a genetically engineered population of prokaryotic or eukaryotic cells; (2) a target compound; and (3) a selection agent (figure 5) . On the surface of each cell in the population, there are - at least - two polypeptide domains involved in the operation of the CISTEM: the cellular domain carrying either a given polypeptide whose interaction with an unknown target compound is to be determined or, in the case of an unknown polypeptide, the imprinted domain, and the selection site, e.g. the adhesion domain. The imprinted domain is highly variable, because the CISTEM cell population is engineered to display up to at least 10 12 distinct oligopeptides at the imprinted domain. (If the population is eukaryotic, the oligopeptides may be adorned by diverse oligosaccharides or other structures resulting from post- translational processing of the polypeptide) . In the case of direct interaction of the target compound with the selection site, the imprinted domain on each cell's surface is neighboring the selection site, e.g. on an adhesion domain, the structure to which the CISTEM' s selection agent must bind in order to exert its effects upon the cell. In the case of a pathogenic agent, this may be an organism or molecule, and it may be fatal to cells or merely retardative of their growth, but the first step in the agent's destruction or retardation of a cell is necessarily binding between the agent and its adhesion domain. The two important domains - namely, the imprinted domain and the adhesion domain - are expressed in multiple copies on the surface of each cell, but they are - in the case of direct interaction - normally adjacent to each other: more specifically, they must be sufficiently close to each other that the target compound, when interacting with the imprinted domain, either extends far enough over the adhesion domain that adhesion of the selection agent is at least partially impeded, or exerts its effect on the selection site in a different way (e.g. by intramolecular steric alteration of the selection site) .
The target compound is the molecule or set of molecules with which the polypeptide sequences identified by the CISTEM must interact in the prescribed manner in order to exhibit the properties or perform the functions desired of them. In other words, the CISTEM identifies polypeptides that interact with, e.g. bind to, and optionally also structurally or functionally alter, the target compound. A target compound can be any molecule, especially a protein, carbohydrate, saccharide, glycoprotein, proteoglycan, hormone, receptor, antigen, antibody, pathogen, substrate, metabolite, transition state analog, cofactor, inhibitor, drug, dye, nutrient, small molecular compound, etc., without limitation. Each CISTEM places only one requirement on the target compound: it must be effective in affecting the change in the selection site upon binding to the polypeptide of interest, e.g. that it is sufficiently large that its attachment to the imprinted domain results in occlusion of the adjacent adhesion domain or that its binding affects the structure of the membrane protein such that the selection site is sufficiently changed to prevent or to allow action of the selective agent. Ω Ω Ω J CΛ O CΛ 3 3 CΛ μ- ; r 5 3 * * ε > rt cn Ω PJ J o μ- i-i ι-i μ- 3 l-i μ- ft rt ε •n PJ Φ
O o Φ X Φ rt PJ PJ μ- 3 3 μ- 3 3 μ- tr CΛ tr φ ρ cn a Hi 3 Φ PJ 3 o J 3 o J μ- μ- 3 3
3 3 μ^ φ μ-- Φ μ 3 > rt 3 o 3 ** O 3 Φ Φ I-1 3 cn a Ω J rt TJ 3 TJ l-i rt ι-i a tr
TJ rt \—> 3 Φ 1-1 3 3 Φ Φ ε TJ 3 ε TJ 3 a φ l Φ Φ tr 3 Ω Φ 1-1 Φ X ι-ι rt X tr cn o o ι-i •» rt Ω Ω Φ tr a 3 i-i 0 3 U Φ rt Ω φ a PJ tr rt CΛ μ- Ω Φ μ- tr φ rt μ- a
3 J - rt PJ rt ι-1 Φ μ- μ- μ- ε μ- a CΛ PJ rt 3 & Ω PJ μ- - 3 3 3 Φ rt o »» 3 μ- ι-3
3 CΛ J cn μ- TJ 3 • CΛ PJ TJ 3 3 TJ 3 Φ Ω t-i μ- 3 Ω rt rt rt O rt o rt μ- 3 o
CL rt 3 O i-i TJ rt o rt O rt CΛ l-i X < rt φ ι o φ μ- 3 O φ Φ Hi Φ rt Ω CΛ n rt φ
0- 3 rt Φ π Φ O Φ μ-* Φ rt φ Ω μ- Φ φ Φ cn r-i O cn Hi a cn a ) O H PH μ- 3 PJ ε rt r o l-i 3 a PJ >< a l-i tr rt ι-i • O rt μ- 3 Hi . μ- hi 3 t? CΛ PJ rt ω μ- Ω J rt CΛ Φ 3 Hi 3 TJ ι-i TJ μ- Φ PJ PJ Hi tr o "> ε 1 a 3 TJ tQ TJ s ι-3 rt cn rt Φ i— J μ- 0 CΛ o rt Φ φ TJ X! φ a tr a Ω a Ω Φ Ω μ- t-i O H rt o Φ o W φ ε φ tr — Ω rt CΛ Ω Φ 1-1 tr μ- TJ o Φ TJ o μ- O <! rt rt μ- O rt PJ 3 tr Φ — rt 3 Ω S tr μ-1 o tr Φ 3 PJ X 3 Φ X rt μ-1 rt rt 3 3 PJ 3 PJ μ- tr Ω 3 l-l tr rt PJ PJ ι-1 *< 3 0 cn Φ &
PJ cn ε 3 • i-i rt 3 Φ tr μ- * μ- PJ X tr TJ 3 O Φ φ ~ CΛ Φ μ- rt PJ TJ Ω a 3 Ω Φ l-i y-' a μ- Φ Hi Φ Φ a J σ a TJ Ω a μ- o O rt 3 μ- ' μ3 rt cn 3 Ω φ O TJ O Φ φ
Φ ε 3 3 >-3 PJ CL 3 a O φ φ O φ 3 CΛ < 3 PJ tr tr M tr " CΛ μ- rt TJ 3 o 3 Φ μ-
X 3* X rt tr O rt μ- tr 1-1 TJ 3 rt Φ 3 X PJ PJ cn PJ s Φ cn μ- rt TJ σ 3 TJ Ω 3 rt
3 O μ- Φ J μ-1 3 Φ CΛ o rt TJ o ι-i a Φ 3 cn Ω PJ O -* O μ- O 3 Φ o rt tr
PJ cn μ- rt cn rt >< CΛ Hi μ- O Hi μ- 3 O μ- PJ rt Ω rt 3 Hi 3 a 3 ι-i 3 3 μ- rt φ rt Φ rt 0 0 J μ- o a 3 CΛ Ω μ- . — . 3 3 Ω 3 Φ O PJ a rt CΛ φ 3 μ- rt Φ O tr
Φ O Hi rt o 3 μ- φ 3 μ- rt Hi Hi TJ X a l-i 3 H cn tr cn a 3 3 3 φ 3 μ- rt rt < tr ε < 3 3 ; a 3 PJ 3 μ- O n μ- TJ Q a φ Φ tr - •>» X ^~. rt o J 3 0 3* Φ rt Φ tr Φ rt rt rt H rt X 3 rt rt O o Φ μ- T Φ IV) 3 CΛ
Hi T Φ 1-1 tr Φ μ a tr Φ 3 μ- Φ CΛ PJ tr 3 tr CΛ PJ TJ 3 rt cn PJ a rt <! rt φ ^^ ^~. rt 3 rt
Hi μ Ω Φ μ- 3 o Φ ι-i PJ cn H o Φ t-i Ω Φ 3 ι-i tr Φ cn ι-i μ- ε PJ tr X ^ -» tr - μ- μ- o QJ J 3 3 φ Φ φ Φ '. M X J 3 O PJ ω Φ l-i φ TJ Φ 3 tr
3 3 3 a TJ J TJ rt a PJ Ω CΛ rt CΛ 3 Hi μ- μ-- rt S Φ X rt Hi Ω It rt Φ μ- o PJ O PJ μ- rt rt tr α i-i tr Φ μ- Φ rt PJ O rt CΛ Φ 3 -J tr rt Φ μ- μ- φ μ- 3 J cn cn Hi ε CΛ rt Φ μ- Φ μ- LJ- μ- Φ Hi 3 μ-1 -— ' μ-1 φ PJ TJ — - cn ι-i μ- ^ X PJ 3 tr rt 3 CΛ rt O 3 μ- : a 3 CΛ 3 J 3 μ- ~ CΛ tr ε a rt t-i ' Φ CΛ Ω > 3 rt Ω Ω μ- j H ι-1 o 3** Ω
3 μ- rt Ω rt rt 3 PJ o Φ tr 3 μ- ε Φ O < — ' ι-i μ- Φ rt 3 φ 3 Φ -1 Φ μ-1 μ- i-i α Φ O Φ Φ Φ PJ μ- rt CΛ 3 o t-h i-i 3 ι-3 tr Ω 3 μ- φ O TJ Ω 3 O 3
Φ o 3 ι-i 3 a H rt tr 3 a a a CΛ o Φ rt tr o o o TJ μ- 3 μ-1 CΛ l-f ι-i rt O rt Ω ε rt
3 3 α Hi rt X Φ Φ ι-i Φ Φ Φ i-i CΛ Φ μ- ω 3 o 3 cn CTi ω rt μ- Φ 3 Hi μ- Φ μ- Φ J J μ- a Φ α Φ Hi rt 3 rt a CΛ φ CΛ TJ 3 cn - — Ω i-i 3 X l-i O ]—• 3 i-i μ- μ- < o μ J o rt PJ μ- PJ tr O Φ μ- rt o t-3 3 μ- • O φ 3 rt μ- J rt 3 \—> X PJ M
3 3 μ- 3 Φ a 3 3 3 Ω Φ rt ι-i 3 tr a rt μ- M Φ a 3 3 X Ω Φ O μ-1 3"* Ω X cn α J cn tr J Ω a rt φ Φ 3 rt Φ o PJ 3 rt rt • O rt > CΛ tr rt a 3 Φ O TJ cn rt
3 μ- μ- Φ μ- O Φ » CΛ μ- Φ rt 3 ? TJ Φ l-i ι-i Hi rt μ- 3 cn a ι-i O rt PJ . — .
3 α 3 3 ε cn 3 3 Φ i-i φ TJ 3 H a tr J Φ l-i Ω a s o rt PJ tr ι-1 a μ- O TJ Φ 3 φ
TJ o X . μ- μ- ^ TJ PJ PJ φ l-i Φ PJ o Φ μ- CΛ μ- O μ- rt a Φ 3 μ- Φ o i-i < M cn 3 TJ rt X i-i rt O o CΛ Ω • φ o a Ω 3 3 3 3 rt O 3 ι-( rt rt CΛ 3 Φ φ CΛ 3 μ^ CΛ φ o 3 . — . rt tr 3 rt 3 μ- rt Ω rt • rt PJ μ- TJ rt TJ PJ tr X Ω CΛ PJ Λ ι-3 cn J μ- 3 rt o Hi tr tr 3 μ- rt Φ μ- μ- 3 Φ O 3 μ- Φ O Ω a μ- 3 cn M TJ rt PJ CΛ TJ
Φ rt μ- 3 rt a Φ a o μ- μ- μ- tc O 3 < μ- PJ a 3 a rt 3 a ι-1 μ- tr μ- 3 μ-1 μ- S Φ μ- l-i
Ω X CΛ tr o 3 H) o 3 φ 3 Φ 3 Ω Φ ^ •• 3 PJ CΛ cn rt rt 3 O φ rt μ- rt tr 3 Φ 3 Ω tr PJ 3 CΛ 3 Ω 3 Φ a 3 cn μ- TJ a l-i TJ cn *< Ω cn 3 tr h
Φ μ- l-i TJ PJ O μ- CΛ 3 rt Ω ε PJ rt rt o rt Φ cn PJ PJ 3 rt PJ -> J Φ μ- • TJ Φ μ- α 3 φ ι-i CΛ μ- 3 3 CΛ TJ 3* cn Ω Φ μ- 1-1 μ- tr PJ 3 cn μ^ i-i l-i tr Ω PJ 3 μ- O μ-> O φ φ a o φ 3 TJ a Φ O Φ μ- PJ rt i-i O Φ CΛ PJ φ cn rt TJ μ- Φ rt *< a 3 Hi -1 3 μ- 1-1 TJ a
-J rt μ-1 o CΛ 3 3 rt l-i tr 3 μ- Ω Ω rt μ- o 3 μ- Φ φ cn Hi o J rt — Φ φ ε 3 tr CΛ Φ l-i rt μ- - Hi Hi 3 O rt Φ Ω TJ X! i-i l-i a rt PJ rt TJ 0 O μ-' Hi
3 tr • Ω Ω μ- 3 rt h-" tr Φ ^ tr PJ 3 o O cn 3 μ- TJ H" 3 μ- J r o t-h μ- 1-1 *< o α Φ rt rt rt a o Φ μ- J μ- Φ 3 X rt *—> ]— Hi o Φ *—• σ ω PJ μ ε Φ TJ 3 3 TJ H to μ- μ- tr a 3 Φ ι-i 3 tr *—• tr Φ 3 cn PJ μ- rt rt X μ- 3 rt Ω φ φ
PJ rt 3 o Ω J a Ω Φ X rt J Φ o O μ- H rt 3 μ- Φ a -1 tr 3 a TJ rt ι-( PJ X 3 rt J μ- rt Φ ε ε 3 cn μ- μ- a Ω rt rt φ J Φ tr rt tr
Φ l-i tr 3 3 μ- CΛ H rt μ- CΛ a cn O μ- tr rt PJ μ- Φ X rt Φ X o 3 3 Φ 3 PJ 3 3 O Φ μ- rt a φ tr 3 Ω ι-i X X 3 o Φ φ rt φ tr 3 1 3 a
therefore selected, e.g. killed or retarded, by the selection agent (figure 7) . Thus, each selected clone increases in frequency and in number as it continues to divide, while unprotected clones are afflicted by the selection agent. From the cell population displaying a great diversity of structures at the imprinted domain, the CISTEM selects and amplifies clones expressing structures that bind the target compound (figure 8) .
Modifications and elaborations of the basic CISTEM permit selection for a prescribed affinity or nature of interaction of the imprinted domain with the target. Multivalent binding may be selected, as may highly specific binding, binding at predetermined sites, or binding by specified structural motifs. Moreover, CISTEMs may be extended to select not only for binding, but also for functional activity; these extended CISTEMs may be used to create enzymes and other functionally active molecules.
As stated above, the embodiments of the present invention are called CISTEMs, for Cellular Imprinting by Selection: Teleological Evolution of Molecules. In general, a CISTEM consists of three components. They are: (1) an engineered population of cells; (2) a target compound; and (3) a selection agent (figure 5).
The CISTEM cell population may be prokaryotic or eukaryotic. In case the unknown interactant is the polypeptide of interest, in the CISTEM cell population, one extra- or intracellular domain (the "imprinted domain") of a membrane polypeptide (the "imprinted polypeptide") is engineered to exhibit highly variable peptide sequences. If the CISTEM cell population is eukaryotic, glycosylation sites within the imprinted domain may be adorned by variable complex saccharides. High sequence variability at the imprinted domain is achieved through the insertion of variable oligonucleotides at the sites in the polypeptide gene that correspond to translated regions in which variation is desired. These variable oligonucleotides may be cDNAs, random sequences, or any other form of oligonucleotides. Techniques for inserting a gene for an exogenous oligopeptide at a targeted site in an outer membrane protein gene were described above, and an extension of these techniques, which allows the insertion of variable oligonucleotides (at least 10 5 to 1012 different sequences, instead of a single, stereotyped nucleotide sequence) , will be described in detail below.
As was described above, many pathogenic agents can kill or retard the growth of cells only if they can first bind to a specific domain on the cell surface. A CISTEM' s selection agent may bind to an adhesion domain that is located immediately adjacent to the imprinted domain on the surface of the cells in the CISTEM population (figure 5) . The adjacent extra- or intracellular domains in most of the membrane proteins used in this invention are separated by 3 - 30 A. In the preferred embodiments of this invention, which are described in greater detail below, this arrangement of domains is the result of carefully choosing an imprinted polypeptide and a selection agent such that the imprinted domain is immediately adjacent to the adhesion domain. As was described above, almost every membrane polypeptide serves as the receptor for multiple selection agents, each of which utilizes a distinctive adhesion domain. However, in other embodiments of this invention, membrane proteins are genetically altered in order to construct a hybrid polypeptide in which the imprinted domain is placed beside the adhesion domain. The techniques for engineering membrane proteins, including the fusion of two separate polypeptides to form a chimeric polytopic polypeptide, were described above; their specific applications to embodiments of this invention are described below.
The target compound is any molecule or set of multiple, distinct molecules, which are simply added to the cell population according to the present invention or, in some embodiments of this invention, are secreted by the cells themselves. The present invention identifies and selects cells in which the displayed polypeptide sequence interacts with the target compound, or some subset of molecules included in the target compound, and this interaction event is necessary to the functions to which the polypeptides are ultimately applied, such as enzymatic activity or alteration of the target compound's biological activity. In the application of μ- < 3 μ- o rt μ- i CΛ φ TJ PJ Ω < rt rt Ω CΛ σ tr P CΛ Hi 3 f-J a a CΛ ε TJ rt ε J TJ rt rt rt
3 PJ φ 3 til PJ 3 μ- φ g o a O μ- tr tr O μ- Φ PJ X> TJ 3 φ O μ- Φ φ μ- o PJ tr CΛ O 3* PJ tr rt l-i a l-i < l-i X o** LJ. 3 H Φ PJ 3 rt cn Φ φ 3 3 μ ι-i ]—• X rt H* ι-i μ- CΛ O ι-i μ-
Φ μ- μ- 3 μ- X φ cn 3 O 3 § 3 rt TJ φ Ω 3 Ω Ω rt Φ μ- 3 tr • XI Ω μ- •< 3 X cn ι-i PJ PJ o 3 Φ 3 rt φ a TJ CΛ o cn O O PJ rt μ- rt μ- Φ Ω 3 Φ TJ Φ tr cn TJ X! Φ
PJ tr . μ Ω rt rt 3 μ- φ H It 3 φ Φ μ- 3 ε 3 • 3 Hi μ- O PJ rt Φ 3 PJ φ rt rt Φ tr rt μ-
Ω Φ 3 μ- rt 3 TJ tr 3 3 μ-1 CΛ 3 tr rt l-i tr P- 0 3 Ω PJ Ω TJ rt Φ TJ rt 3 rt φ σ tr Ω o rt μ- φ rt Φ Φ μ-1 Ω φ a μ- μ- Φ ι-3 φ Ω 3 φ tr •< rt Φ TJ rt Ω tr a rt Ω < μ- 3 a PJ o 3 3* P 3 μ- H 3 PJ Φ Ω PJ • Ω 3 PJ - CΛ a μ- CΛ o μ- O Φ μ- rt O Φ
O l-i μ Φ rt 3 Φ \-> rt a Φ rt tr rt a tr X a Φ rt rt *•• o μ- 3 μ-1 a 3 tr a O 3 3
3 φ μ- rt μ- TJ -> CΛ Φ o PJ μ- H Φ tr PJ tr Hi 3 X rt ^ φ TJ rt •< Φ TJ rt
CΛ X 3 J o O μ- π * PJ tr ι-( 3 O PJ tr μ- 3 3 μ- μ μ- Φ < tr TJ ' — ' O PJ cn tr O μ- μ- X μ- 3 3 cn -» O CΛ μ J PJ a 3 Ω μ- cn rt tr O n X 3 i- rt O rt PJ Φ "• 3 l-i rt • J 3 0 tr o μ-> 3 CΛ Hi Φ Φ < rt φ CΛ Φ CΛ a φ rt Φ tr tr rt TJ 3 X tr < 3 3
Φ 3 μ- μ- a μ- ι-3 rt X Φ o Ω J 3 3 3 Φ rt rt Φ Φ < Φ rt ε a φ μ- φ a rt cn 3 μ- ω 3 M tr rt 3 rt i-i O X rt CΛ O TJ 3 Ω ι-i 1 Φ tr μ- tr rt CΛ a rt ε Ω 3 Ω £ o tr Φ ε PJ 3 Φ φ a O μ- O Ω J rt a tl μ- a μ- μ- Φ μ- 3 o
Φ H 3 J Ω 3 3 μ- 3 o Ω 3 rt μ- 3 cn 3 3 O Ω μ- 3** Φ 3 φ Ω cn Ω μ- 3 3 J
Φ Φ σ rt O tr Ω X CΛ Ω ι-( TJ Φ rt O φ Hi rt T TJ 3 rt a μ μ- X a tr O 3 rt TJ * rt
3 cn PJ tr cn 3 J Φ tr Φ CΛ φ ι-i 3 Ω μ- rt PJ O o TJ μ- φ Φ 3 μ- o PJ 3 < μ- o tr
3 rt Φ rt TJ rt μ-1 μ- CΛ rt PJ O rt rt rt φ tr rt i-i 3 O O 3 Φ O rt Hi μ- TJ Φ Hi l-i tr φ μ- μ-< μ- PJ o Φ rt 3 Φ a Ω t-i Φ 3* μ- a Φ Φ rt φ 3 3 rt μ- 3 tr cn 3 O 3 μ- rt Φ
3 rt o TJ 3 3 a tr < rt TJ Φ J • Φ O a P 3 3 μ- PJ rt ω Φ μ- o 3 rt Ω PJ J
TJ cn 3 PJ a Φ TJ Φ φ tr CΛ a rt X 3 3 PJ • • 3 rt a PJ Hi l-i cn 3 * μ-1 3 μ- PJ 3 J 3
H -» CΛ 0) 3 ε O ι< 3 PJ tr 3 μ- • μ- TJ X rt CΛ 3 μ- Φ O rt rt 3 φ a 0 rt rt PJ μ- μ- cn μ rt μ- TJ rt rt μ- Φ ι-i O 3 CΛ o Φ rt O PJ Ω PJ μ- Hi PJ Φ o Ω 3 μ- * h-"
3 3 rt PJ a rt 3 3 μ- 3 Φ Φ 3 J TJ μ- 3 3 tr i-i h PJ *— PJ cn 3 i-i ι-i ε 3 μ- - O μ- 3 rt tr X tr μ^ PJ O tr 3 CΛ *> 3 ι-i rt rt Φ Φ t-i rt rt rt X Φ 3 μ-1 3 cn 3 3 O CΛ
Φ J Φ Φ 3 [\J J 3 μ- rt • μ- φ rt - X rt Φ CΛ μ- o Φ tr Φ cn φ rt rt ε μ- a cn Φ PJ rt » 3 tr a 3 T 3 tr cn PJ 3 tr μ- O l-h μ Φ rt rt rt Φ μ- O φ 3 CΛ ε CΛ a μ- μ- a Φ Φ ι-3 PJ PJ rt μ- Φ X μ- Φ a cn 3 PJ ~ • O o ι-i a t-h ι-i a μ- rt i-i μ- o CΛ O 3 rt cn tr >< rt Φ cn PJ Φ ι-i Φ PJ Ω ** Hi PJ φ PJ Ω o o a H Φ 3 Hi 3 3 3* rt φ Ω φ tr a tr tr 3 φ μ- CΛ O TJ rt 3 rt ω σ Ω 3 rt Ω O
3 Φ 3 μ-1 3 cn - cn Φ 3* X ι-i cn tr o T μ- μ- rt 3 3 Ω PJ Hi TJ μ- O tr Φ μ- rt rt tr rt 3 J Ω PJ rt TJ o CΛ Φ φ μ- φ Φ X a i-i 3 μ-1 - φ < μ 3 {—• O ε μ- 3 cn μ- φ μ- TJ g μ- μ rt rt l-h tr Φ Ω 3 Φ Ω σ Φ o Φ a μ- CΛ 3 Φ μ- a cn μ- 3 3 cn μ- μ- rt . Hi cn O O
3 J 3 μ- o Φ 3 O Φ rt 3 Φ J J 3 3 t-h μ- rt rt 3 tr Φ Ω a 3 Φ μ- φ 3 3 ° ω 3 ι-i 3 1-1 CΛ 3 CΛ J rt a rt h-1 μ- PJ Φ 3 Φ rt Φ Φ X PJ ε T μ- Φ < ι-i μπ Ω cn 3 φ
X J X μ- μ- TJ Ω rt H μ- • rt rt Ω μ- 1-1 X Hi O μ- a TJ 3 rt μ- O 3 3 O Φ 3 J 3 a O J Φ Ω 3 o o J Φ X O i-i Φ 3 J rt Hi Hi O μ- Φ μ- rt ]—• < rt μ-1 cn rt 3 ε 3
3 rt Φ < 3 3 CΛ TJ Φ 3 μ- ι-i J • tr φ O Φ 3 tr rt 3 o tr Φ μ- < rt Φ μ- ?? μ- o a o a o μ-1 φ φ Φ CΛ rt tr Φ X < Ω rt Φ O Ω 3 TJ 3 ttl φ TJ 0 3 rt Hi PJ
Hi μ- μ-1 3 O 3 CΛ O 3 a Φ O *< φ tr rt tr μ- l-i TJ Φ cn rt Φ rt >< a . — . μ- 3 O tr l-i rt < Ω rt ι-i rt PJ Ω Hi rt ^ 3 rt 3 μ- CΛ Φ CΛ Φ Φ cn tr TJ μ- μ- 3 rt ε μ-
PJ tr φ Φ X μ- tr l-i O Φ rt tr J rt 3 μ- rt Φ rt O 3 μ- O O O 3 3 3 μ- ι-i μ- H ι-ι O Cn Φ φ 3 CΛ CΛ μ- Φ 3 a 3 cn rt 3 3 ε tr μ- 3 X 3 rt TJ Hi 3 rt 3
X 3 CΛ O 3 O I—1 T Φ Hi μ O TJ Φ 3* PJ PJ μ- P rt a Φ T Φ rt
Φ a μ- ε • μ-1 CΛ TJ 0 PJ CΛ PJ Hi rt O HJ φ ttl TJ rt rt PJ Φ PJ T PJ 3 cn O 3 ι-i Φ rt μ- rt μ-1 rt 3 μ- φ 3 Φ 3 3 3 J a O 3 Φ rt TJ tr l-i CΛ 0 3 Φ 3 Φ μ-> O Φ ι-i
3 *< μ- tr rt O 3 ι-( 3 Ω a φ Ω rt tr rt Ω CΛ φ μ- 3- X rt CΛ μ-' Ω PJ X ! ε cn PJ
3 X 3 "• tr μ- Hi 3 l-h a rt Ω tr rt Φ tr PJ Ω rt φ H 3 3 PJ Φ tr μ- μ- Φ TJ 3 TJ 3 rt Ω
O O φ Φ O O • μ- rt Φ ι-i Λ Φ Φ μ- • X cn < rt J CΛ TJ 3 CΛ TJ Φ φ rt — Hi cn a 3 ft rt l-i o tr cn PJ μ- μ- cn O φ Φ Φ rt rt φ rt cn μ- 3 TJ rt μ-
Φ Φ ω tr J 3 H 3 Φ cn CΛ tr O rt Φ μ-1 3 Ω 1 Hi Ω CΛ T Φ J 3 Ω rt tr O
Ω rt PJ cn o O Φ 3 φ 3 μ- PJ 3 3 PJ μ-1 CΛ rt 3 tr O PJ rt ι-i i-i X Φ μ- PJ 3
3 tr 3 Ω Hi Φ a l-h μ i-i rt ι-i Φ PJ μ- μ-1 Φ 3 l-i μ- μ- J μ- • cn a rt cn
Φ a ι-i 3 O 3 o * Φ a X Ω 3 X O Φ TJ φ 3 a Ω —• Φ
Φ μ- rt rt 3 o i-i "« CΛ o Φ rt 3 Φ 3 3 O φ rt ~< rt tr μ- tr CΛ CΛ 3 rt μ- CΛ 3 3 μ- tr μ ε cn μ-
CΛ - CΛ rt
•• φ o Φ • rt tr o PJ O rt rt 3 O PJ Φ rt P- tr a 3 * Hi μ- 3 a 3 3** cn Φ
PJ rt TJ Φ O PJ Ω CΛ μ- Ω PJ Ω PJ Φ PJ 3 PJ s μ- PJ PJ μ- PJ 3 Ω rt PJ rt TJ rt tr Φ tr rt a CΛ rt tr tr Φ 3 μ a o rt 3 o 3 Φ μ X TJ Φ tr 3 ) X) cn Φ o tr μ J O tr 0 XI Φ J μ- Φ 3**
3 Φ TJ σ tr 3 i-i TJ 3 a Φ TJ TJ Ω l-i Φ φ φ tr rt 3 Φ Φ H Φ 3 3 rt ι-i CΛ TJ Φ rt rt o μ- Φ TJ 3 ι-i TJ o μ φ Φ 3 ε 3 3 Φ J tr TJ X5 1-1 μ- ε X CΛ PJ rt μ- μ- a 3 CΛ o Ω μ- 0 CΛ cn μ- CΛ o cn TJ tr rt rt X Ω o o μ- a Φ TJ TJ -' Φ Φ O μ a μ- 3 a μ- rt μ- 3 rt 3 3 Φ > 3 3 TJ cn ι-< rt μ- TJ rt a 3 3 > Φ rt Φ Φ μ- μ- Φ rt Ω PJ μ-
3 TJ Φ 3 μ o 3 3 rt φ PJ a t-i ι-i PJ o tr Ω μ- μ- o φ 3 < Hi CΛ TJ l-i CΛ tr 3 P- rt CΛ
X l-i φ J 3 a μ Φ 3 Φ 3 a rt Φ μ- μ a Φ tr Λ CΛ cn l-i PJ a Φ rt Ω rt μ- μ PJ φ rt μ- tr 3 Ω φ a rt Ω a μ- tr *» PJ *< 3 Φ μ- TJ • 3 Φ l-i tr μ- O cn μ- μ- Hi rt μ- J 3 o rt Φ a CΛ rt rt Φ rt Ω cn rt TJ PJ a O TJ rt μ μ- 3 a a 3 3 3 μ- μ- Ω 3 a rt μ CΛ μj o rt a μ- rt μ- PJ Φ rt μ- Φ tr Φ T H M μ- σ rt φ Hi 3 TJ X O φ Ω tr Φ 3 μ-1 3 tr o o Ω O PJ o X 3 o 3 —> Φ i-( Ω rt tr o X 0 μ- Φ cn O Hi μ 3 3 μ-1 rt
Φ a o o 3 J Φ Ω 3 O 3 μ 3 φ a Ω X Φ l-h O o PJ TJ T 3 3 a PJ Hi μ- μ- ι-i
CΛ 3 Hi μ^ μ- J PJ 3 XI 3 O CΛ Ω Ω O μ-1 X l-i o « Ω rt Ω φ 3 Φ Ω CΛ CΛ μ- a Φ PJ 3 Ω rt μ- Hi Ω φ O Φ TJ 3 rt 0 rt Φ 1-1 μ- Ω Φ μ- cn 3 tr CΛ tr μ- μ- <J rt O rt
O O rt μ • O Φ 3 Φ O rt l-h ι-i 1-1 Ω O μ μ- ]—• 3 Ω PJ 3 μ- σ Φ o PJ 3 φ φ φ σi 3 Φ μ
3 3 μ- t 3 a CΛ μ 3 PJ o Φ TJ X O μ-1 φ μ- PJ rt i-i CΛ PJ rt Ω 3 a ^ CΛ X 3 J cn μ- a W TJ CΛ 3 Ω rt 1— ' Ω 3 PJ 3 CΛ a 3 cn rt Φ Φ rt o φ 3 rt cn . rt 3** Ω a μ- cn o Φ o J tr φ O tr φ rt μ- 3 . cn tr PJ H- ε X Φ tr . • a PJ μ- rt o 3 J 3 Ω 3 rt μ- PJ 3 3 φ a ι-i rt μ- PJ PJ n μ- Φ rt o a O . Ω X PJ o 3 tr 3
3 ~ μ- PJ PJ 3 3 Ω TJ 3 3 PJ cn cn X l-i μ- φ Ω o Hi a 3 Φ tr rt ^ 3 3 rt μ- ι-i
PJ n 3 μ- 3 a (t a CΛ Φ O cn O i-i rt 3 Φ Φ 3 X PJ cn Φ ^ TJ P- μ- 3 o PJ Ω CΛ rt Φ μ- ε 3 < 3 cn tr Φ cn 3 Φ ε φ μ- PJ 3 TJ TJ μ-1 3 rt rt a rt tr O μ P- 3 cn
3 tr t, Φ cn Φ o Φ rt μ^ . 3 μ-1 CΛ o Ω rt μ- o cn tr tr rt μ- P- 3 Ω 3 tr μ- Ω
• Φ t-h 3 < tr φ a Φ φ 3 rt CΛ 3 * CΛ TJ TJ Φ Φ rt tr O rt Φ 3 cn PJ 3 tr a μ μ- rt O rt Φ μ- φ Ω Ω a Hi μ- 3 μ- Ω 3 tr Φ o φ 3 μ- μ O rt PJ μ-
Φ Ω μ- l-h tr μ 3 rt 3 Ω rt X o O o Ω 3 3 tr μ J 3 PJ Ω J 3 CΛ rt cn PJ μ- rt μ cn tr μ- o Φ μ-1 TJ rt μ- rt O P- Φ μ ttl 3 tr tr ι< Φ X cn X φ rt CΛ μ-1 O PJ rt 3 O tr PJ TJ μ- μ- Φ 3 rt PJ ι-ι PJ < Φ 3 o rt PJ Φ μ- Φ μ-1 tr Φ μ-1 rt Ω μ- a 3 Φ Ω
CΛ rt 3 tr a TJ μ- i-i Φ μ TJ 3 O tr TJ J PJ tr rt PJ o O 3 Φ μ-* ^ 3 3 Φ μ- rt PJ rt μ- Φ μ- ω 3 X PJ Hi PJ PJ t-h CΛ Φ Φ a t-h μ- 3 rt φ O rt 3 ft Ω μ φ cn *< 1
O μ- (—> ω rt φ PJ Ω φ PJ μ-- rt rt a a 3 • TJ CΛ Ω tr 3 Φ O PJ O μ Φ '
< cn TJ rt rt Φ rt a rt rt X rt rt tr Φ PJ Hi μ- rt Hi O o 3 rt μ- μ^ CΛ g o l-i 3 tr μ-
Φ O o J tr a < μ- φ Φ tr μ- O μ o ε 3 tr φ Hi ^ TJ CΛ μ- XI •< tr - a X! rt μ- CΛ μ tr I—1 3 ^ 3 Φ Ω PJ O cn 3 Φ 3 X tr i-i μ- X Φ Ω o 3 TJ o tr o o μ- φ μ- 3 rt PJ μ-1 O PJ -1 Ω a O 3 3 rt X φ ) PJ 3 rt rt Ω μ φ 3 J l-i a 3 rt 3 a μ- cn rt 1
PJ 3 ι-i * TJ Φ cn o 3 rt PJ 3 Ω tr rt Ω μ- 3 o 1
Φ PJ rt H- φ 3 μ- Ω
TJ 3 X φ 3 3 T PJ J cn rt rt μ- TJ rt o tr φ O {—• Φ rt CΛ PJ 3 tr 3 3 Ω Φ 3 rt a Φ TJ tr μ J O X 3 cn O μ- tr Ω Φ Φ Hi rt Φ h-1 3 μ-1 X μ- μ- X 1 P- μ- φ rt O cn X o tr μ- rt Φ Hi P- 3 Φ o Φ Φ μ μ tr • CΛ PJ O O Φ μ X 3 3 Cn 3 3 Φ
3 rt 1 μ- rt J 3 3 3 3 rt μ-1 PJ μ- μ- PJ Φ Ω 3 3 3 3 3 tr rt TJ 3 μ 1 rt rt PJ a ε o a 3 X σ tr φ PJ X O o o T > PJ TJ rt rt O CΛ o O 3 φ μ μ-
Φ O i-i Φ Φ Φ tr • • TJ μ-' φ a Ω Φ a TJ TJ rt O μ-> 3 t-1 μ- o φ PJ H t-h 3 rt J PJ 3 μ X O CΛ Φ μ- O rt > rt 3 tr PJ PJ μ- TJ rt a Φ cn Hi CΛ Hi a O 3 μ- Ω rt TJ
Hi O φ 3 o 3 Φ 3 tr cn 3 μ- rt 0 J rt ι-i Ω 3 Φ - Ω • Hi Φ Hi rt a rt φ l-i
Φ < rt >' 3 Hi a • φ o 3 PJ o ^ Hi X tr X μ- {—• ι-i rt TJ Ω 0 P- μ tr cn μ- CΛ μ- μ Φ CΛ PJ CΛ X Ω Hi cn 3 3 Φ o Φ 3 PJ 3 PJ μ- φ Φ 3 rt 3 rt Φ rt O >.
• 3
Φ i-i rt Φ Φ a rt Φ rt TJ T φ ^ X rt rt PJ i-i Hi l-i μ-1 TJ 3** P- rt tr PJ tr 3 rt
CΛ h-1 o < a v_ι. tr rt tr rt Φ μ- o Φ X Φ rt μ- rt XI Hι 0 Φ rt O Φ μ- TJ Φ CΛ O φ
PJ Φ J Φ tr ε μ-> l-i tr 3 3 Hi ι-1 TJ μ- 3 Ω O 0 μ- Φ rt O cn 3 CΛ >< 3 TJ t-h a ε TJ ι-i 3 μ- Ω Φ tr CΛ φ Φ cn μ- o rt μ- O 3 < rt tr hi Φ φ Hi μ- < hi tr ε Hi μ- Φ 3 Φ J Φ φ P- rt rt O CΛ Ω 3 rt Φ Φ 3 PJ 3 Hi PJ 3 φ o μ- μ- 1 rt rt a
PJ PJ 3 a rt 3 ε π O tr tr a 3 3 TJ tr rt -1 Ω Φ 3 rt n O < 3 PJ 3 rt t-i o tr tr Ω TJ rt PJ tr n 3 Φ Φ l-i PJ J 0 Φ 0 •< O PJ a a μ O Φ rt Ω tr cn a t PJ 3
Φ l-i PJ μ J o CΛ 0 Φ *< X 3 3 X . — . CΛ H 3 μ- tr μ- rt PJ rt 3 Φ Φ o X CΛ CΛ H O PJ PJ Hi • φ 3 3 rt TJ Φ tr rt ω rt rt o φ 3 rt Φ μ- tr Hi PJ Hi X φ φ φ 3 M Hi μ X tr 3 a Φ tr H- O 3 • P) — - M tr μ- 3 cn X tr CΛ w S rt 3
Φ 1-1 μ- Φ rt 3 rt φ m Φ rt • a Φ Hi 3 rt μ S φ o •^ Φ "* cn o μ-1 μ μ rt s *> • 3 tr μ- 3 PJ X o en 3
3 μ J rt Φ μ- 3 rt a μ- Φ Hi ^* O PJ O Φ 1 cn 3 tr cn rt 3 3 Hi a φ Φ a
selection agent's attachment, protecting the cell and allowing it to multiply (figure 7) . Cells whose imprinted domains do not bind the target compound with adequate affinity remain unprotected, and, consequently, they are killed or retarded by the pathogenic (selection) agent (figure 7).
Each cell in the CISTEM population expresses multiple copies of its imprinted polypeptide, and binding is a dynamic process balancing on-rate and off-rate. Therefore even when a cell's imprinted polypeptide exhibits high affinity for the target compound, some copies of the imprinted polypeptide are left unprotected some of the time. Consequently, every cell in a CISTEM population, no matter how high its affinity for the target compound, has some probability of being hit by a selection agent. Nonetheless, this probability is lower when a cell's affinity for the target compound is higher. Furthermore, because each clone in a CISTEM is present in multiple copies (the CISTEM cell population is redundant) the probability that a single cell will be hit is also the fraction of that cell's clonal population that will be hit, and the differential elimination of clonal populations is the very basis of selection. Thus, each selected clone increases in frequency and in number as it continues to divide, while unprotected clones are destroyed. In short, the CISTEM selects and amplifies clones expressing structures that bind the target compound (figure 8) .
The effect of the target compound relevant for the preferred embodiment of the invention, wherein the selection site is located on an adhesion domain is that, due to its size, it extends, upon interaction with the imprinted domain, from the imprinted domain to the adhesion domain. This is 3 - 30 A for many imprinted polypeptides imprinted by this invention. Alternatively, independent of its size, the interaction between the target compound and the imprinted domain may lead to a conformational and/or chemical change of the variable domain and/or the adhesion domain which prevents the selection agent from attachment. The target compounds, their source, preparation, and use of CISTEMs with particular applications in this specific embodiment of the invention are described hereinafter. While the concentration of target compound in a CISTEM is also important to its function, this concentration depends on the nature of other CISTEM components, and is therefore discussed below.
In CISTEMs designed to imprint polypeptides to serve as highly specific markers or partitioning agents in diagnostic assays, the target compound is simply the molecule that is to be tagged and/or partitioned by the polypeptide. For example, in an embodiment of the invention that is used in the design of a diagnostic polypeptide that binds tightly and specifically to Human Immunodeficiency Virus (HIV) , the target compound is HIV gpl20, the extracellular subunit of a major HIV glycoprotein (Fantini et al., 1997). The production of recombinant gpl20 in a soluble form that is suitable for use in CISTEMs is described in the literature (Fantini et al . , 1997) . Another embodiment of the invention is used to design a diagnostic polypeptide for the direct detection of a different virus, Human T-cell Leukemia Virus type I (HTLV-1); here the target compound is the HTLV-1 surface glycoprotein gp46. The production of authentic, glycosylated HTLV-1 gp46 in a recombinant vaccinia virus/T7 polymerase system is described in the literature (Arp et al . , 1996). Diagnostic assays for detecting human viral infection often bind not to the virus itself, but rather to the antibodies raised against the infectious agent. One preferred embodiment of this invention imprints a polypeptide to exhibit high affinity for a monoclonal antibody, MAbT7. The target compound in this CISTEM, MAbT7 itself, is commercially available (Novagene) .
The invention is also used to imprint polypeptides to serve as antagonists, agonists, or modulators of the functional action of biologically active molecules. When this invention is used to imprint a polypeptide to serve as an antagonist, the target compound may be either of the molecules involved in the interaction to be antagonized. HIV gpl20 is the viral glycoprotein that binds to the CD4 receptor, and thus mediates viral invasion of the human cell (Fantini et al . , 1997) . The embodiment of this invention that imprints a polypeptide (ICAM-1, which is described above) to bind specifically to HIVgpl20, provides a polypeptide that could serve as an antagonist of the interaction between the viral glycoprotein and its host cell receptor.
The polypeptides selected by a CISTEM are more likely to functionally antagonize a given molecular interaction when the CISTEM' s target compound is pared down, so far as is possible, to one interactant ' s active site. Structures on either interactant that are distinct and topologically distant from the active site are less promising as target compounds because a polypeptide that binds such structures may not interfere with the normal interaction of the active molecules. For example, one embodiment of this invention imprints a polypeptide (E. coli OmpA, which is depicted in figure 13/, and described further below) to function as an antagonist of the interaction between the eukaryotic cell-signalling proteins Ras and Raf (Block et al . , 1996). (Activation of Raf, a protein kmase, by the immediately upstream protein, Ras, initiates pathways leading to the phosphorylation of transcription factors, the stimulation of numerous genes, and, ultimately, to cell proliferation and differentiation. Constitutively active oncogenic mutants of Ras or Raf occur in a large number of human tumors (Block et al . , 1996) .) In this CISTEM, the target compound is the domain of Raf (ammo acids 51-131) that is known to be directly involved m binding to Ras (Raf's so-called Ras-bmdmg domain, or Raf-RBD) (Block et al., 1996). The construction of a recombinant expression vector for Raf-RBD, expression m E. coli, and purification of the protein are described m the literature (Block et al . , 1996) . Using Raf-RBD, rather than Raf m its entirety, as the target compound ensures that the polypeptides selected by the CISTEM bind to precisely that domain of Raf which interacts with Ras, rather than to Raf epitopes that are extraneous to the important cell-signalling interaction.
When this invention is used to imprint polypeptides to serve as agonists, rather than antagonists, of a receptor or biological pathway, the target compound is the receptor or biomolecule that the imprinted polypeptide is intended to activate. For example, m an embodiment of this invention that imprints a polypeptide ( E . col i OmpA) to mimic mterleukm 6 and function as an agonist of the mterleukm 6 receptor (IL- 6R) , the target compound is a solubilized form of IL-6R: the expression of soluble human IL-6R in CHO cells, and the protein's purification, are described in the literature (Yasukawa et al . , 1990). After this CISTEM selects polypeptides with affinity for IL-6R, a secondary screen is used to isolate polypeptides whose binding to the receptor results in transduction of the signal that is normally initiated by IL-6. This secondary screen is described in the passages relating to extensions of the basic CISTEM.
When this invention is used to create agonists, the target compound is pared, so far as is possible, to the active site of the receptor or biomolecule that is to be agonized. This is done because removal of extraneous epitopes from the target compound increases the probability that an imprinted polypeptide that binds the target will also be active as an agonist .
In another example of this invention's application to the creation of agonists, CISTEM is used to identify, or to create mimics of, specific antigens that are present in infectious pathogens or cancer. These antigens or antigen mimics are then often used to provoke a specific immune response against the pathogen or tumor. In these embodiments of the invention, the target compound is composed of more than one molecule: multiple monoclonal antibodies, specific immune receptors, such as cytotoxic T-lymphocyte receptors or MHC-I molecules, or polyclonal antisera derived from individuals exposed to a specific antigen, such as meningicoccal Opa proteins, to a specific pathogen, such as meningicocci, or to a tumor, such as melanoma. Polyclonal antisera are divided from animals, human volunteers or actual patients. This composite target is used as an impression - a "negative", or "footprint", of the antigen - and from this footprint, the CISTEM is used as described in example 2 to identify the antigen or to create a mimic of the antigen. An antigen-mimic is of interest, because a polypeptide that is imprinted to bind the antibodies composing sera against an antigen is likely to elicit an immune response very similar to the one provoked by the antigen. Antibodies suitable for use as the target compound in a CISTEM for imprinting E . coli OmpA to mimic Neisserial protein Opa 5c include mAb B306, mAb A222/5b, and others described in the literature (Olyhoek et al . , 1991). Antibodies for use as the target compound in a CISTEM for identifying tumor antigens are derived from the polyclonal serum of a cancer patient, the isolation of which is widely described in the literature.
In order to create catalysts of chemical reactions, such as ester hydrolysis, polypeptides are imprinted to bind to transition state analogs. The preparation and structure of these transition state analogs are described in the literature (e.g. Charbonnier et al . , 1995). In one embodiment of this invention, a phosphonate transition-state analog (structure and preparation described in Charbonnier et al . , 1995) is used as the target compound for imprinting E. coli OmpA to function as a catalyst of the hydrolysis of p-nitrophenyl esters.
In other embodiments of this invention that are used to create polypeptides with specific enzymatic activity, the target compound is not a transition state analog, but rather a modified substrate. The target is a molecule that is structurally very similar, but not identical, to the substrate upon which the imprinted polypeptide will ultimately act. The reason for using a slightly modified structure, rather than the substrate itself, is the following: if the substrate itself were used as the target compound, imprinted polypeptides that not only bound the substrate, but also performed the desired enzymatic activity, would alter the substrate and then release it, rendering the underlying cell sensible to the selection agent. Thus, if the substrate itself were used as the target compound, the CISTEM would select against clones expressing polypeptides with the desired enzymatic activity. Therefore, the target compound must be an effective structural mimic of the substrate (i.e. polypeptides that bind the target must also be able to bind the substrate) , but it must be invulnerable to the prescribed enzymatic activity (i.e. polypeptides that act enzymatically on the substrate must be unable to act on the target compound) .
For example, in an embodiment of this invention that imprints E. coli OmpA to function as a sequence-specific protease that hydrolyzes the bond between two methionines, the target compound is the oligopeptide NH2-R1-R2-R3-M1=M2-R4-R5- COOH. Ml and M2 represent methionme residues; the bond between them, which is represented by "=", is a nonhydrolyzable methylene-group bond. Normal amide bonds are represented by "-". Rl ...5 are variable residues, which are randomly chosen from the 20 amino acids. These residues are varied so that only imprinted polypeptides that bind to the consistent M1=M2 motif of the target compound are selected by the CISTEM. Imprinted polypeptides that show high affinity for a different peptide motif bind at most only a tmy fraction of the target polypeptides, and are therefore left vulnerable to the selection agent. Several methods for the synthesis of this oligopeptide target compound are described m the literature
(Janda, 1994) .
After the CISTEM has selected clones expressing imprinted polypeptides that bind the modified substrate, a secondary screen may be applied to isolate imprinted polypeptides that not only bind the target, but also perform the desired enzymatic reaction. Secondary screens, including the one used in the CISTEM that imprints polypeptides to hydrolyze the bond between two methionines, are described below.
In addition to the creation of functionally active polypeptides, such as antagonists and enzymes, this invention's applications include interaction analysis (current techniques reviewed m Phizicky et al . , 1995). By screening a diverse library of polypeptides for sequences that interact specifically with a target compound of significant interest, this invention assists m the identification of the compound's native interactants. For example, although the structural interaction between Ras and Raf-RBD has been well- characterized through crystallography and mutational analysis
(Block et al . , 1996), the activation of Raf by Ras is a complex process that is not yet completely understood, and probably involves interaction with other molecules. The CISTEM that screens polypeptides displayed m OmpA for interaction with Raf-RBD assists m the identification of polypeptides involved in Raf activation. This CISTEM involves the following components: Loop IV of E.coli OmpA is used to display the polypeptides translated from fragments of cDNAs derived from a cell expressing Ras and Raf (the insertion of foreign oligonucleotides into the gene for OmpA is described below) ; Raf-RBD (see above for a more detailed description of this target compound and its preparation) serves as the target compound; and phage K3 serves as the (pathogenic) selection agent. The appropriate concentrations of the target compound and the selection agent can easily be determined by the skilled man in the art after reading the present specification and are described below in the examples of OmpA imprinting CISTEMs.
This invention is also used for epitope mapping (Phizicky et al., 1995), serving to elucidate the surface of structural interaction between two molecules. In CISTEMs for epitope mapping, the target compound is one of the two molecules whose interaction is to be mapped. Usually it is the interactant that is most easily prepared in soluble form that serves as the target compound. Thus, if one of the two interactants is a membrane protein, it is often the other molecule that serves as the target. In a CISTEM used to map the epitopes of interaction between Ras and Raf-RBD, Raf-RBD serves as the target compound. As noted above, it is readily prepared in soluble form (Block et al., 1996). Thus, one of the interacting proteins is dissected into multiple smaller domains which are displayed in the imprinted domain and selected by the CISTEM for their capability to bind to the other interactant.
In certain embodiments of this invention, multiple separate molecules compose the target compound. Each molecule must meet the CISTEM' s size requirement and must be present in the CISTEM at a concentration sufficient to confer protection upon clones that bind to it with high affinity. (Selection by CISTEMs with multiple distinct target molecules strongly favors binding by imprinted domains to multiple distinct molecules, because the effective concentration of target compound rises significantly with each additional compound that the imprinted domain is capable of binding.) When the target compound is a library of moleculesand the imprinted domain displays variable regions, the CISTEM assists m the identification of interacting pairs of molecules: polypeptides expressed m the imprinted domain are selected for their affinity for particular members of the library of molecules that composes the target compound. Those cells that express an imprinted polypeptide that binds to one or more molecules in the target compound are selected, while cells expressing an imprinted polypeptide that fails to bind any component of the target compound are eliminated by the CISTEM' s selection agent. In certain embodiments of this invention, which are described below, the target compound consists of a library of polypeptides or phage-displayed polypeptides that are secreted by the CISTEM cell population itself.
In an embodiment of the invention, the target compound is the interaction partner that is to be identified due to its interaction with a given polypeptide of interest, e.g. a section of a naturally occuring cell surface receptor which is displayed the extracellular domain. In this case, the target compound is the unknown interactant. In this example, the cells presenting the receptor domain of interest may be incubated with chemical compounds from a pool of substances. Surviving cells are indicators that the compound that has interacted with these cells is a candidate substance for interaction with the receptor domain.
In a further embodiment of the invention cells are protected from the (pathogenic) selection agent m an autocrme fashion, due to binding between the imprinted domain and a polypeptide target that is secreted by the cell itself. In this embodiment, two separate libraries of polypeptides, expressed from two separate libraries of genes, are screened for interaction with each other. The cell population displays one library on the cell surface, exactly like the standard CISTEM cell population. In this embodiment, however, the cell population carries also a second library of genes. These genes reside in a vector that encodes a secreted polypeptide. Such vectors are described m the literature (Mollenkopf et al . , 1996) . Each cell m the autocrme population displays one foreign polypeptide on its surface and secretes a second foreign polypeptide into the extracellular medium. In the event that these two polypeptides bind to each other, a cell protects itself from the pathogenic agent and is therefore positively selected.
As was noted above, the term "polypeptide" relates to all types of peptides or proteins synthesizeable by a host cell. To be imprinted by this invention, a polypeptide must comprise at least one extra- or intracellular domain that accepts exogenous oligopeptide insertions without disturbing translation or export of the polypeptide to the cell surface. This capacity to accept insertions has been observed in numerous secreted polypeptides, which can be attached to the outer membrane by a genetically engineered anchor that is described m the literature (Smger, 1990; Cross, 1990) , as well as in each one of the dozens of bacterial, yeast, and mammalian membrane polypeptides that have been topologically mapped by gene fusion techniques. A number of these polypeptides, such as E. coli OmpA, E. coli LamB, and mammalian ICAM-1, their properties, and their capacities to exhibit exogenous sequences m extracellular domains, are described m the literature.
The membrane polypeptide may be autologous or heterologous for the cell. Preferably, the membrane polypeptide is heterologous. In preferred embodiments of this invention, all copies of the membrane protein gene a given clone must be altered m some wayFor example, if a plasmid- borne copy of the gene for the imprinted polypeptide bears a foreign insert at the imprinted domain, whereas the chromosomal copy of the gene remains wildtype, then proper selection by the CISTEM is hindered, because the selection agent and/or the target compound can bind to the wildtype copy of the polypeptide, as well as to the engineered copy. In case the cell contains the autologous membrane polypeptide but otherwise has the desired properties for its use m the present invention, the gene encoding the membrane protein is inactivated, e.g. by deletion. Standard molecular methods are available to the person skilled in the art.
Exogenous insertions at an extra- or intracellular domain are involved in two stages of each CISTEM. First, an oligopeptide tag is inserted into the imprinted domain, and cell populations exhibiting this tag are used m preliminary rt Ω T μ tr t*J 3 TJ o 3 φ Ω Hi 3 P- μ- μ Ω TJ μ cn CΛ PJ TJ μ- O rt tr o μ- TJ TJ TJ cn TJ μ- rt
O o h-1 Φ PJ cn Hi CΛ • μ-1 PJ PJ 3 CΛ Φ o O Φ φ O ι-( O cn 3 - Φ o 3 Φ o o Φ o 3 μ
3 PJ X cn Φ J φ X O Ω μ rt O X 3 -> X μ-1 3 Φ TJ rt Φ μ 3 cn TJ TJ TJ Ω TJ cn H- J rt CΛ 3 o a CΛ rt a • 3 μ- Φ TJ 3 TJ * ; 3 Φ Φ 3 μ- O Φ cn Φ rt 3 3 rt 3 φ PJ
3 μ 3 μj tr o 3 tr - » μ- Φ μ μ H1 μ TJ *—• Ω Φ μ^ 3 J • rt μ P- —- μ-1 μ- μ-1 μ μ- o μ- PJ φ μ-. μ- P- μ- ^ 3 μ- μ Hi O PJ P- Φ J rt Ω X J T PJ μ ω μ rt a PJ PJ O PJ rt CΛ
3 a rt φ μ. 3 a CΛ J O 00 X rt ^ Φ TJ rt CΛ TJ rt μ- PJ tr rt O 3 rt * 3 Φ φ rt rt 3 rt P- o μ-1 0 3 cn 3 μ μ PJ μ O μ- rt μ- O cn μ- μ- ι-i ^ Ω a μ- μ- * μ- O rt
Φ TJ μ O J . μ- a o < rt CΛ O μ 3 μ- O 3 Φ tr O rt J Ω rt CΛ O O O 3 0
PJ a G *< TJ 3 3 TJ ε Φ Φ 3 3 1 >< X a 3 • cn μ- 3 PJ TJ rt o μ- P- φ 3 3 CΛ 3
Ω n TJ TJ ^ < rt μ 3 Ω Ω σ Φ rt cn 3 TJ tr 3 O 3 X Φ cn P- o μ- tr sΩ φ J > J o φ tr o rt rt 3* T 1 φ 3 O Φ Φ rt μ Φ < 3 rt 3 μ- tr Ω 3 TJ a • μ-' o μ μ 3 Φ ? O μ μ α —• 3 μ- Hi 3 X a O φ O Φ cn J 0 Φ rt rt φ φ . — . rt rt PJ h-1 μ PJ J O 1 φ CΛ TJ μ rt TJ X! 3 O 3 < 3 X O P-
- rt π 3 a tq μ- Φ μ- TJ μ O CΛ 3 cn 3 rt 3 μ- rt TJ μ TJ o O μ Φ rt Hi rt Ω Ω Φ tr o X O o VO CΛ O tr φ a 3
3* X Φ 3 3 tr μ φ Φ 3 μ- 3 μ- tr Φ 0 p- rt P- o Φ T 3 3 3 PJ 3 μ-1 o NJ PJ t-h PJ rt μ- 3 a PJ Φ Φ cn μ X n J Φ o n Φ CΛ 3 tr tr tr N
Hi μ rt 3 n μj 3 rt μ O Φ O rt 3 a Hi CΛ tr M rt 3 μπ cn Φ o P- Φ Φ
TJ Φ cn a o PJ TJ tr TJ μ- Φ μ X μ X CΛ Ω Φ μ-1 Φ μ- Ω CΛ Φ Φ J CΛ P- rt Φ 3 rt rt μ cn Φ μ- μ (— PJ Φ Ω TI 3 Φ • PJ μ- Φ μ O Φ > H 3 h-1 3 M μ 3 Ω P- P- rt tr Φ cn μ μ μ. μ-1 Φ < T μ Φ PJ 3 μ-1 ^—. σ T < μ 3 μ-1 w < Ω M TJ PJ 3 Φ 3 3 tr
Φ CΛ μ- Φ - Φ rt TJ Φ X rt Φ ι-3 PJ •n μ^ O Φ Φ — μ-1 s Φ O 3 S μ rt Ω TJ X TJ φ
Φ o CΛ rt O μ- μ 3 rt μ- tr Ω μ Φ CΛ μ-1 a μ- • o Ω o a Φ P- tr rt φ TJ μ
TJ 3 3 μ- tr 3 TJ J a o a Φ o Hi Φ rt Φ CΛ cn o Hi rt μ- rt Ω 3 φ Φ μ μ rt μ- o n
Ω a Φ rt μ 3 μ-< Φ rt μ-1 3 3 o O 3 TJ μ- O Φ t o 3 O 3 3^ Φ rt μ μ a Hi Φ tr 3 μ-1 Φ O μ- Φ Φ μ cn Φ "• CΛ < T a μ tr t-h 3 PJ o 3 μ X O μ-> Φ O Φ rt CΛ
<< Hi 3 μ- μ Hi T Φ μ- μ- ^-^ Φ *< μ-1 o tr a ε Ω O • a *—• a rt μ μ- Φ H
TJ O X 3 Φ PJ J μ- 3 μ-> < CΛ J Ω μ 3 Φ Φ o ι_i- Φ O rt rt CΛ tr μ- 3 3 rt a W
Φ Hi O a 3 μ < a 3 CΛ V Φ Φ 3 rt J o X a 3 < TJ H- ι-3 tr TJ a PJ 3 Φ μ- PJ
TJ 3 O 3 Φ μ l\J ι< TJ ^ OD μ- ' 1— ' rt O 3 μ-> rt rt TJ μ- cn Φ μ- O tr Φ Hi O o s
3 CL 3 X a rt PJ T 3 Ω 3 Φ μ-1 μ VO <: Φ μ- μ o D Φ tr μ 3 rt μ φ 3 Φ PJ . TJ 3 rt PJ o μ- > μ- tr a OD σ μ- Ω Ω tr cn OD μ μ Φ φ Φ cn . P- 3 3 PJ PJ tr μ 3 a •-3 rt tr μ Φ 3 O • a rt μ- PJ μ- ( ) O n 3 o tr μ- PJ Φ . — . Φ rt 3 Φ a < 3 μ- -> P- 3 μ- rt *< J PJ
Φ > P) O μ^ a — ' 3 cn rt PJ O Φ TJ PJ 3 cn tr μ P- ^
• • cn 3 Φ Φ 3 3 ^ 3 Φ 3 CΛ tr rt cn φ "" CΛ X μ- CΛ rt cn Hi 3 Ω μ μ- rt • o Φ rt φ 3 »
Φ tr μ a O Ω μ-1 μ- o 3 tr O O rt μ- PJ μ- μ-1 TJ μ ε CΛ TJ Ω Ω o Φ T 3 μ Φ O . — . ε Ω 3 o μ- rt 3 O 3 μ o μ- O PJ μ- PJ ° tr rt W O O TJ a 3 μ-' μ- μ- Ω h-i tr tr rt Hi J cn tr O μ rt 3 tr X TJ Ω PJ μ-- 3 H-
Φ o < 3 3 μ P- a PJ 3 Φ σ 3 μ 0 TJ μ- tr O Ω Φ cn - Φ rt cn Φ o 3 rt μ-1 3 X
3 TJ I\J 3 rt o 3 μ-> Ω X Φ J Φ 3 ι-3 Ω PJ Φ Hi rt tr a rt CΛ a o TJ 3 CΛ μ-> O μ rt φ Φ TJ TJ a μ cn rt a o o tr Cn tr μ- φ μ- PJ cn Φ PJ PJ a TJ ft Ω OD 3 O φ 3 a μ μ P μ- PJ — - 3 rt μ- Φ cn 3 n TJ rt μ rt TJ rt μ-1 ε Φ Ω φ tr O ~ *' μ- rt μ-> Φ Φ μ- Φ CΛ μ- >- PJ rt CΛ tr CΛ < Ω TJ φ O 3** Φ Φ rt P- μ- cn o TJ
Φ a μ- 3 ^D o Hi CΛ 3 μ rt 3 μ tr Φ φ Φ μ μ CΛ < μ-1 o TJ μ- O cn n rt Ω 3 ft
O " o O CO 3 Φ CΛ cn J CΛ o Φ Φ μ- μ- μ- φ CΛ J cn a 3 tr μ n μ-
TJ 3 μ- t- > ttl O μ μ- rt 3 μ O μ- 3 TJ σ 3 W μ-1 TJ Φ H" Φ CΛ μ- rt
Hi X CΛ a ι-i O tr J Ω PJ μ- TJ l-h 3 < μ φ rt s Φ cn P- H rt tr μ Φ
PJ Ω 3 tr rt μ- rt ^ 3 Φ 3 φ 3 Φ rt 3 μ TJ φ 0 a φ O TJ TJ 0 3 rt < M tr φ 3 cn o μ 1 tr cn tr μ- a a ^ CΛ tr a o PJ μ 3 TJ a Ω t-h rt μ PJ PJ S Φ
O Φ μ- rt rt TJ μ- a Ω rt
3 μ rt < φ Φ 3 P μ- rt Φ tr Φ P- PJ a rt X μ rt PJ μ- μ tr o a cn tr PJ Φ — μ- rt rt μ-1 Ω J < 3 μ- μ Φ TJ Φ a Ω Φ tr μ- Ω rt P- P- a XI
Φ φ TJ Φ φ tr 3 PJ rt tr O φ Φ P- μ Φ rt O ^ o μ-1 X Φ rt cn Φ PJ Φ PJ 3 O
CΛ μ J cn μ- μ-1 tr cn Φ Φ Ω 3 tr ^ Ω Φ 3 π o h-1 TJ μ- Ω J tr X 3 μ- TJ 1 Ω Ω 3 TJ φ o 3 μ μ- rt * o rt a ε» • μ μ- Ω μ μ XI rt P-
CΛ o 3 μ < 1— ' a μ- PJ μ J PJ φ rt O CΛ ^ TJ Φ cn Φ μ- Φ Φ φ tr O ft
3 3 μ- Φ J μ- a rt tr μ μ- μ 3 φ CΛ a tr a 3 σ a μ- Hi CΛ
3 3 cn 3 cn 3 3 μ-1 > ε Ω W μ- TJ CΛ Ω φ CΛ
CΛ μ- tr Φ rt 3 φ μ CΛ φ O P- S 3 rt μ- h-" μ- a μ- μ- 3 PJ Φ Φ rt rt μ- O O 3 a Ω Ω
Φ 3 Φ φ Φ a X μ O a rt i-i μ- tr PJ 3 3 CΛ φ μ- Ω Ω a 3
Φ Φ μ-1 3 a
to transform an amber-suppressing host, such as E. coli UH203
(supF), expression is regulated not only by induction with IPTG, but also by the concentration of the suppressor tRNA. The plasmid also comprises a gene for ampicillin resistance. Similarly, on pSB1649 (Brown, 1992), E. coli lamB is regulated by the inducible lac promoter on the common plasmid pACl. For eukaryotic polypeptides, common viral vectors are suitable, and many polypeptides have been cloned into appropriate vectors. For example, for the polytopic mammalian glycoprotein
(ICAM-1) (Intercellular Adhesion Molecule-1 is described further below) , which is imprinted in a preferred eukaryotic embodiment of this invention, the CDM8 cDNA expression vector pCDl.8 may be used in conjunction with the COS cell expression system (Staunton et al . , 1990).
The vectors bearing the gene for the imprinted polypeptide are replicated and isolated. They may be replicated by a cell population, by PCR techniques, or by any other commonly employed methods described in the literature.
8 1 In preferred embodiments of the invention, 10 to 10 copies of the vector are isolated. The exact number depends on the number of distinct clones that will compose the CISTEM cell population, and on the technique used in the step described below, the insertion of the variable oligonucleotides into the vectors .
The gene for a well-characterized oligopeptide tag is inserted into the vector at a genetic site corresponding to the imprinted domain of the polypeptide. Alternatively, the oligonucleotide tag may replace a nucleotide sequence of the naturally occuring sequence. Replacement is accomplished by excising a natural coding region and by inserting the oligonucleotide tag in place thereof. The tag may be any peptide sequence that is known to bind with high affinity and specificity to a ligand that is commercially available or easily produced in the laboratory by techniques described in the literature. One example of a commercially available tag/ligand pair is the T7 sequence (Met -Ala -Ser-Met -Thr-Gly-
Gly-Gln-Gln-Met -Gly) , which binds to the T7 antibody
(available from Novagene, TB15 protocol) . An example of a tag/ligand pair that is described in the literature is amino acid sequence 36-41 of the eukaryotic polypeptide Ras and the Ras-b dmg domain (amino acid sequence 51-131) of Raf (Block et al., 1996) . The production and purification of Ras-RBD by commonly practiced techniques is described in the literature
(Block et al., 1996). Many other tag/ligand pairs, such as peptides that bind to specific antibodies, are also described m the literature and available to those familiar with the art
(Phizicky et al . , 1995) . To insert the gene for the oligopeptide tag at a genetic site encoding the imprinted domain, any common technique for inserting an oligonucleotide at a specific site m a vector can be used (e.g. Boyd, 1994). Several such techniques are referenced above, and one preferred procedure is described here m more detail.
Many membrane polypeptides have been topologically mapped, and these maps are available m the literature. Using a topological map of a membrane polypeptide ' s sequence, two restriction endonuclease sites (site Rl and site R2) each of which is unique m the vector and is located in the genetic region coding for the imprinted domain, are identified. If two unique restriction sites do not already exist at the desired site of insertion, they may be engineered using any one of a number of procedures described the literature (examples of unique restriction sites being engineered at a desired site of insertion are described in Brown, 1992; Freudl, 1989). The vector is cleaved at the two designated sites by digestion with restriction endonucleases Rl and R2, following the protocols and under the conditions specified by the suppliers of the enzymes. This yields a linearized vector with non- complementary sticky ends (RSI and RS2) and an excized fragment. The linearized vector is isolated by standard electrophoretic size fractionation .
An oligonucleotide template comprising the gene for the tag is synthesized using conventional technology, such as phosphoramitide chemistry, or can be obtained from commercial laboratories offering custom oligonucleotides. The oligonucleotide is synthesized to read 5 ' -P--Rl--tag gene--R2- -3', in which P is the primer sequence for a proof-reading polymerase, such as Vent polymerase (New England Biolabs) , Rl and R2 are the restriction endonuclease sites mentioned above, and --tag gene-- is the genetic sequence encoding the oligopeptide tag. (It may be necessary to include one or two extra-oligonucleotides flanking the tag gene so that the tag is translated in frame with respect to the imprinted polypeptide.) Using the polymerase whose primer is P, the oligonucleotide template is used to generate a double stranded oligonucleotide. This oligonucleotide is digested with restriction endonucleases Rl and R2 under conditions described by the enzyme suppliers, yielding an oligonucleotide with sticky ends RSI' and RS2 ' . This oligonucleotide, which has sticky-ends complementary to the linearized vector, is ligated into the vector by a common ligation procedure familiar to those practiced in the art. The result is that the vector's gene for the imprinted polypeptide bears the gene for the oligopeptide tag at a genetic site corresponding to the imprinted domain.
An alternative procedure for the construction of the tag gene with sticky ends RSI' and RS2 ' is used in some preferred embodiments of this invention. In this procedure, two semi- complementary single stranded oligonucleotides are synthesized by conventional technology or acquired from a laboratory offering custom oligonucleotides. The two single stranded oligonucleotides are synthesized such that, when they are annealed, they form a double-stranded oligonucleotide with sequence RSI '--tag gene--RS2 ' . This double stranded oligonucleotide is then ligated into the linearized vector described above.
An example of this procedure is provided by a preferred embodiment of this invention, which involved the construction of an E. coli population displaying the T7 tag (figure 9) at loop IV of OmpA. In pEV218 (Freudl, 1989), the ompA gene has been engineered to bear a string of restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide (figure 9) . This plasmid was digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas), yielding a fragment of the restriction site polylinker and the linearized plasmid. The linearized plasmid was isolated by standard gel electrophoresis . Two single stranded oligonucleotides were synthesized by conventional technology to read as follows:
Oligonucleotide 1: 5' - CG[ATGGCTAGCATGACTGGTGGACAGCAAATGGGT] - 3'
Oligonucleotide 2: 5' - CTAG[ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT]CGAGCT - 3'
The sequence in brackets comprises the gene segment encoding the oligopeptide that is recognized by the T7 antibody. These two single-stranded oligonucleotides were annealed in MBI Fermentas Klenow buffer, forming the double stranded T7-tag gene segment with sticky ends complementary to the sticky ends on pEV218, linearized by Sad and Xbal, as described in the previous paragraph. When the double-stranded oligonucleotide was ligated into the linearized plasmid, the T7 tag gene segment was in frame with the ompA gene on pEV218. The new construct was designated pEV218T7.
After the tag gene has been inserted into the vector, the vector is used to transform a population of cells lacking a chromosomal copy of the gene for the imprinted polypeptide. Strains lacking chromosomal genes for outer membrane polypeptides are described in the literature (e.g., Brown, 1992, Freudl, 1989, and Staunton et al . , 1990), as well as below, in the passages relating to cell lines and media. Conventional methods of transformation are used, and transformants are selected by their expression of the vector- borne selectable marker, such as a gene for ampicillin resistance. In a particularly preferred embodiment of this invention, in which E . coli OmpA was engineered to display the T7 polypeptide tag at loopIV, pEV218T7 was used to transform E . coli strain UH203 { l a c, s upF, ompA, recA, proAB, rpsL) (Freudl, 1989), according to a common method of electrotransformation, as follows.
25 ml of overnight culture of UH203 (Luria broth, 37°C) was suspended in 11 of LB. Cells were cultivated at 37°C, with shaking. When the culture reached OD600"= 0.5, the cells were chilled for 15 min. on ice, and then centrifuged in a cold rotor at 4000 x g for 15 min at 4°C. Supernatant was removed. Sediment was gently resuspended in 11 of ice-cold 10% glycerol and centrifuged as before. This washing procedure was repeated three times and cells were resuspended in 1 ml of ice-cold 10% glycerol. 42μl of the cell suspension was mixed with 2μl of DNA (pEV218T7 described above) solution (lpg dissolved in TE) and incubated 10 mm on ice. 40μl of this suspension was pipetted to an electroporation cuvette (BIO-RAD Gene Pulser with Pulse Controller was used. Apparatus settings were as follows: 2,5 kV; 200 W; 25μF; time constant = 4,9 ms . Immediately following electroporation, cells were washed in 600μl of SOC medium and transferred to a cultivation tube. Cells were then allowed to recover for 1 hour at 37°C in a shaking flask. Finally, cells were plated on LB plates with lOOμg a pici11in/ml .
There are several ways to generate the variable oligonucleotides that are inserted into the vector, and, ultimately, translated into the variable regions exhibited by the CISTEM cell population. To facilitate insertion into the vectors, the variable oligonucleotides must have conserved sequences at their termini, just as the tag gene described in the previous section was given defined termini RSI and RS2. For example, m preferred embodiments of the invention, variable oligonucleotides with defined sequences at their termini are easily synthesized using solid-phase phosphoramidite chemistry (Brown, 1992): at defined terminal positions, only one of the four nucleotides is made available for addition to the strands being synthesized, whereas at variable positions, a mixture of multiple nucleotides is presented to the strands. Alternatively, end addition of bases, catalyzed by terminal transferase m the presence of nonlimitmg concentrations of multiple nucleotides, can append a highly variable sequence to a defined terminal. Highly variable sequences may also be obtained by selecting fragments of defined length from digestions of natural cellular DNA or RNA preparations.
In a CISTEM for screening the polypeptides encoded by a cDNA library, the variable oligonucleotides are the cDNA library genes. The size of the cDNA library that is screened by such a CISTEM is in some embodiments of this invention reduced by limiting the screen only to genes of interest, as determined by comparison of the cDNA library genes to sequences catalogued in genomic libraries. In a CISTEM for identifying cancer antigens, cDNAs are prepared from tumor cells according to protocols described in the literature.
In a CISTEM for analyzing the interaction between a known polypeptide and a second known compound of interest, the variable oligonucleotides are sections of the gene encoding the known polypeptide. For example, in a preferred embodiment of this invention, in which the interaction between the eukaryotic cell-signalling proteins Ras and Raf is analyzed, the oligonucleotides inserted into pEV218 are sections of the sequence encoding Ras.
In preferred embodiments of the invention, stop codons are excluded from variable oligonucleotide inserts in order to prevent truncation of the imprinted polypeptide. This is achieved by synthesizing oligonucleotides to read Dl(NNP)nD2, where Dl and D2 represent the defined sequences that facilitate insertion of oligonucleotides into vectors; n represents the number of triplet codons that compose the variable section of each oligonucleotide; N represents all four nucleotides, each with equal probability; and P represents a subset of nucleotides, each with equal probability. NNP encodes all 20 amino acids, but precludes the occurrence of stop codons recognized by the CISTEM cell population. Thus, when P is allowed to be G, C, or T, the stop codons TAA and TGA are excluded from the oligonucleotide inserts. It is not necessary to preclude the occurrence of all three stop codons in inserted oligonucleotides, because the CISTEM cell population may be composed of stop codon- suppressing mutants, which are capable of translating one or more of the stop codons. Numerous suppressor strains are described in the literature (e.g. Brown, 1992, Freudl, 1989).
When the imprinted polypeptide is a glycoprotein or proteoglycan, the inserted oligonucleotides may include consensus sequences for N-linked glycosylation (Chavez et al . , 1991) . These are synthesized as defined sequences interrupting the variable sequence of each oligonucleotide. In CISTEMs that incorporate an element of structural design, variable oligonucleotide inserts may include a mixture of defined and variable positions, sequences coding for fixed motifs, or statistical biases toward certain sequences or structures.
The variable oligonucleotides are inserted at a prescribed site m the vectors constructed and amplified by the methods described above. Two preferred procedures for precisely targeted insertion are described here-one m which the number of oligonucleotides inserted into each vector is defined, and another in which the number of inserts is variable. The available procedures for inserting oligonucleotides at specific vector sites are certainly not limited to the two described here. In fact, any of the gene fusion techniques used to insert a gene coding for a reporter moiety into the gene for a membrane protein may be used to the same effect as the procedures that will be described.
The site of insertion m the vector is located such that oligonucleotides inserted there are translated as part of the imprinted domain. The site of insertion may be further specified (i.e. withm the sequence encoding the imprinted domain) according to structural analysis or prescriptions for a particular nature of interaction between the imprinted domain and the target compound. To insert oligonucleotides, both of the procedures outlined here make use of restriction endonuclease sites that are unique m the vector, and are located at or near the prescribed site of insertion. If a unique restriction site does not already exist at the site of insertion, one may be engineered using any one of a number of procedures described in the literature (e.g. Boyd, 1994, Brown, 1992 and Freudl, 1989) .
To insert one and only one variable oligonucleotide per vector, two distinct restriction sites, each unique m the vector, must be located such that they flank a short restriction fragment comprising the prescribed site of insertion. If the sticky ends left by cleaving the vector with one endonuclease (Rl) at its unique site are RSI and RSI', while cleaving with the other endonuclease (R2) at its unique site leaves sticky ends RS2 and RS2 ' , then the variable oligonucleotides generated at step 3 are given defined and conserved sticky ends RSI and RS2 ' . The vectors are digested by both restriction nucleases (Rl and R2 ) , and the digestion products are size-fractionated to isolate the vectors, each of which has non-complementary sticky ends RSI' and RS2. The population of restriction-digested vectors is combined with a slightly smaller population of variable oligonucleotide inserts (vectors/insert » 1.5 - 2.0), and ligation is allowed to occur. Because the sticky ends of the vectors are non- complementary to each other, each vector can neither ligate shut, nor ligate to other vectors. Similarly, because oligonucleotide inserts have non-complementary sticky ends RSI and RS2', they can neither self-ligate into loops, nor ligate to each other to form concatamers of multiple oligonucleotides. The only ligation that can occur is the insertion of one variable oligonucleotide into each vector. After ligation has taken place, the vector population is size- fractionated to discard vectors that did not receive an insert. In the resultant vector population, each vector has one variable oligonucleotide at the prescribed site in the region encoding the imprinted domain.
An example of this procedure for inserting one and only one variable oligonucleotide per vector is furnished by a preferred embodiment of this invention, m which a cell population is engineered to display variable oligopeptides at loop IV of E . coli ompA. In pEV218 (Freudl, 1989), the ompA gene has been engineered to bear a polylmker rich m restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide (figure 14) . This plasmid is digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas), under conditions described by the suppliers, yielding a fragment of the restriction site polylmker and the linearized plasmid. The linearized plasmid is isolated by standard gel electrophoresis . Single stranded oligonucleotides are synthesized by conventional technology to read 5'GGTCTAGA(VNN) nCGAGCTC3 ' , in which N represents all four nucleotides, each with equal probability; and V represents C, G, or A, each with equal probability. In a procedure commonly used to generate double stranded oligonucleotides from single stranded templates, a 5' primer and a proof reading polymerase, such as the Vent polymerase, are used to generate double-stranded oligonucleotides from the single-stranded templates. These double-stranded, highly variable oligonucleotides are then digested with Sad and Xbal, under conditions described by the suppliers, yielding highly variable oligonucleotides with non-complementary sticky ends 5'-CTAG and AGCT-3'. The highly-variable oligonucleotides are then ligated into the linearized plasmid (MBI ligation buffer, conditions described by supplier) ; they ligate only one oligo per plasmid, and in only one orientation. Conventional electrophoretic size fractionation is used to isolate ligated plasmids containing an insert. The product of this procedure is a population of plasmids (pEV218I) bearing, at the genetic site corresponding to loop IV of OmpA, a highly variable exogenous sequence free of TAA and TGA stop codons.
To construct a vector population m which each vector harbors between 0 and n variable oligonucleotides at a prescribed site, the following procedure may be used. First, a restriction site that is unique in the vector is located at the prescribed site of insertion, i.e., at a genetic site corresponding to the imprinted domain of the imprinted polypeptide. (If a unique restriction site does not already exist at the desired site, one may be engineered by conventional methods described m the literature, such as oligonucleotide-directed mutagenesis (e.g. Brown, 1992 and Freudl, 1989) . If the endonuclease (R) that cleaves at this unique site leaves complementary sticky ends RS and RS ' , then the variable oligonucleotides generated by the procedures described above are given defined and conserved complementary sticky ends RS and RS ' . The vectors bearing the gene for the imprinted polypeptide are cleaved by R. The vectors are then combined with an excess of variable oligonucleotide inserts. Several distinct ligation events can occur. First, the vectors, which have complementary sticky ends RS and RS ' , can ligate to other vectors or self-ligate, closing without an oligonucleotide insert. Second, the variable oligonucleotides, which also have complementary sticky ends RS and RS ' can l) self-ligate as a loop, n) ligate into a vector with sticky ends RS and RS ' , m) ligate to each other, forming a multi- oligonucleotide concatenation, which itself can perform ligations 1, 11, or m. After ligation, size fractionation can be used to discard all inserts that did not ligate into a vector and all vectors that did not receive inserts. This yields a population of vectors, each of which harbors between 1 and n variable oligonucleotides at the prescribed site m the gene encoding the imprinted polypeptide. A specific example of this method of constructing a vector population in which each vector harbors between 0 and n variable oligonucleotides at a prescribed site may be found in the literature (the construction of such a library of vectors is described m detail in Brown, 1992) .
The vectors bearing variable oligonucleotide inserts are used to transform a population of cells that lack a gene for the imprinted polypeptide. Conventional methods of transformation may be used. Transformants are selected by their expression of the vector-borne selectable marker, such as a gene for antibiotic resistance. (Efficient transformation
9 of competent prokaryotic cells achieves up to 10 transformants/ ml.) The product of transformation is a population of cells, each of which harbors a distinctive oligonucleotide insertion at a genetic site corresponding to a point m the imprinted domain. After transformation, the cell population is amplified by growth a suitable medium.
In a preferred embodiment of this invention, m which the CISTEM cell population is engineered to display variable oligopeptides at loop IV of E. coli OmpA, transformation is performed m the following manner. The variable plasmid population, pEV218I, whose construction was described above, is used to transform E . col i strain UH203 { lac, supF, ompA, recA, proAB, rpsL) , according to a common method of transformation described in the literature (Freudl, 1989) . The UH203 population is grown Luna broth supplemented with 100 mg Ap/ml, selecting for Ap resistance of transformants.
Expression of the imprinted polypeptide is induced with the compound appropriate to the vector's promoter. Thus, for example, m the preferred embodiment mentioned above, the population of UH203 transformed with pEV218I is induced with lm IPTG) . This yields a population ot cells comprising as many clones as there were transformants in the previous step. Each clone displays a distinctive oligopeptide at the imprinted domain. If the imprinted polypeptide is a glycoprotein or a proteoglycan, glycosylation sites in the imprinted domain bear diverse and unpredictable oligosaccharides. (Commonly practiced procedures for verifying that an extracellular domain-in this case, the imprinted domain-has been glycosylated are described in the literature (Lis et al . , 1993).) In short, the CISTEM cell population displays a vast combinatorial library of polypeptides.
In a preferred embodiment, a CISTEM is constructed wherein the imprinted polypeptide is a polytopic membrane protein in which the domain that accepts insertions is directly adjacent to the adhesion domain of a selection agent suitable for use in the CISTEM. This preferred configuration is depicted in figure 5, and several of the polypeptide/selection agent combinations that exhibit this configuration are listed in table 1, and in the literature (Tyler et al . , 1996). When these polypeptide/pathogen combinations, or others like them, are used, the CISTEM is assembled with a minimum of genetic manipulation of the imprinted polypeptide; the only necessary manipulation is the introduction of an oligopeptide tag, as well as variable peptide sequences, to the imprinted domain, as described above. However, the list of polypeptides that may be imprinted is by no means limited to polytopic proteins exhibiting adjacent adhesion and imprinted domains. To further expand the diversity of polypeptides that may be imprinted by CISTEMs, two genetic manipulations are employed.
When secreted proteins are imprinted, they are first converted to their glycolipid- or transmembrane-anchored isoforms. Because translocation of secreted proteins and membrane insertion of integral proteins are handled by the same cellular machinery (reviewed in Singer, 1990, and above) , many secreted proteins may be converted to membrane-bound isoforms simply by carboxyl-terminal attachment of a hydrophobic transmembrane span or a signal sequence for glycolipid addition. For example, the gene-fragment encoding the 37 carboxyl terminal amino acids of decay accelerating factor, a eukaryotic glycophospholipid-anchored protein, can be fused to the gene for a secreted protein, such as human growth hormone, in order to target and anchor the protein to the apical surface of human intestinal cell lines. This procedure for converting a secreted protein to its glycolipid- anchored isoform is described in the literature (Cross, 1990) . Conversion of secreted polypeptides to isoforms that are anchored by transmembrane spans is also described in the literature. For example, heparin-binding epidermal growth factor-like growth factor (HB-EGF) , can be expressed as a longer, membrane-spanning isoform (HB-EGFTMII) comprising an extracellular domain that is structurally identical to the secreted polypeptide (Onu et al., 1994). Other examples of and techniques for the conversion of secreted polypeptides into isoforms anchored by transmembrane spans are described in the literature (Singer, 1990) .
A second genetic manipulation of the imprinted polypeptide is performed when the domain that is to be imprinted lacks an adjacent adhesion domain. This occurs e.g. when the polypeptide chosen for imprinting is not polytopic, or when the selection agents that adhere to domains adjacent to the imprinted domain are unknown or poorly suited for use in the CISTEM. In either situation, the sequence for a functional adhesion domain may be fused to the gene for the extra- or intracellular domain, creating a gene for a chimeric polypeptide comprising both the functional adhesion domain and the variable domain. For example, the colicin A adhesion domain from E. coli OmpF, when fused to separate domains from another outer membrane protein, OmpC, remains functional as a colicin A receptor (Fourel et al . , 1990) . A group of eukaryotic viral systems, which are described in the literature (Chu et al . , 1995), provide further examples of the insertion of functional adhesion domains. In their application to embodiments of this invention, these systems operate as follows. The antigen-binding site of an antibody (scA) is displayed on the surface of a virus, such as spleen necrosis virus, as described in the literature (Chu et al., 1995). The antigen that is recognized by the scA is then expressed as an exogenous insert in the extracellular domain that will serve rt ε TJ tr Φ μ
Φ Φ
3 μ-1 Hi
1 Φ cn μ
^^ 3 μ
»-3 μ- Φ tr rt a
Φ Φ a Φ cn 3 rt rt tr μ O o
3 a
Ω rt μ- rt tr 3
3 Φ φ μ 3 J TJ rt μ-1 Φ CΛ μ J Hi μ-
3 O 3 a μ TJ
3 μ
Hi PJ μ-
3 3 3
3 Ω rt
O Φ rt J μ- O O
O t-h
3 * J rt TJ ^ tr Φ
Φ TJ
Ω rt
PJ Hi P- rt 3 a
Φ 3 Φ
X Ω CΛ
O rt μ μ- rt μ- O tr
Φ 3 J
CΛ cn rt a a PJ
Φ μ h-1 cn Φ μ- p-
3 μ 3
Φ Φ PJ J a rt rt 3
Φ O μ a Hi PJ
!<
Figure imgf000062_0001
above provide an outline of the proclivities displayed by distinct families of outer membrane polypeptides) . For example, type la polypeptides are generally competent for functions involving high-affinity adhesion or specific binding to extracellular targets. They are therefore well-suited to be imprinted for use as highly specific markers or antagonists of small polypeptides, such as small peptide hormones. Type II polypeptides, on the other hand, are often enzymes with extracellular active sites, such as glycosyl transferases . Some type II polypeptides, then, are well suited to be imprinted to exhibit enzymatic activity. When this invention is used to imprint a polypeptide to serve as a functionally active effector, the preferable imprinted polypeptide is usually a membrane-bound isoform of a secreted protein, such as human growth hormone.
Furthermore, in preferred embodiments of this invention, the imprinted polypeptide is a molecule that naturally exhibits functional or structural homology with molecules that perform functions similar or identical to the one desired of the imprinted polypeptide. For example, structural homologues to E . coli OmpA have been found in Haemophil us f Shi gella , Nei sseri a , Bordetell a , and Streptococcus . In these virulent bacteria, the OmpA homologues play a role in adhesion to host tissues, and are often recognized by the host's specific immune response. The structural homology between OmpA and these antigenic adhesins makes OmpA well-suited for imprinting for use in applications requiring mimicry of homologous adhesins. Two such applications are competitive inhibition of microbial adhesion, and vaccination, described in greater detail below.
Unlike many other forms of combinatorial chemistry, this invention can screen libraries of polypeptides expressed by eukaryotic cells. This greatly expands the range of molecular properties accessible to this invention, for eukaryotic polypeptides are adorned by saccharides, a second a tier of extremely complex, chemically distinctive molecules. The oligo- and polysaccharides adorning the imprinted domain are not varied in the same systematic manner as the underlying polypeptide sequence, because, as was described above, the saccharide molecule attached to a particular site on a polypeptide is not a simple function of the local amino acid sequence. Instead, the glycosylation at a site is altered in complex and unpredictable ways by changes to the local sequence or to the sequence elsewhere in the polypeptide (Lis et al., 1993). Therefore, when the glycosylation sites within the imprinted domain are preserved while immediately flanking regions are varied by the method described above, a semi- random diversity of sugar structures is appended to a systematic diversity of polypeptide sequences, creating a complex glycoprotein or proteoglycan library.
When a CISTEM' s target compound naturally interacts with processed, e.g. glycosylated polypeptides (i.e., in its natural setting, the target compound binds to the processed forms, e.g. to the glycoproteins or proteoglycans), it is preferred that the imprinted polypeptide be processed, especially glycosylated. This requires that the CISTEM cell population be eukaryotic. The adhesions that underpin inflammation, blood clotting, sperm-egg interactions, and intercellular recognition are all mediated by glycoproteins or proteoglycans. Therefore, for the identification of molecules for use as antagonists or agonists in these processes, CISTEMs with eukaryotic cell populations and glycosylated imprinted domains are preferred.
The unpredictable effects of peptide sequence variation upon glycosylation throughout the polypeptide merit a cautionary note. Because changes to a peptide sequence can alter glycosylation on entirely different domains of a polypeptide, the CISTEM' s selection (pathogenic) agent must not require a specific glycosylation on its adhesion domain. Otherwise, unexpected changes in the sugars adorning the adhesion domain renders some cells immune to the pathogenic agent even if they do not bind the target compound at their imprinted domains. As was noted above, the selection site, especially an adhesion domain of a pathogenic agent, used in a CISTEM is well-characterized. In most cases, this characterization reveals whether or not the agent requires glycosylations for adhesion. This may be verified, moreover, by removing glycosylations from the receptor and testing for susceptibility to the agent; methods for removing glycosylations are described in the literature (Lis et al., 1993) .
For use as imprinted polypeptides in embodiments of this invention, polypeptides that exhibit structural integrity are preferred. It is particularly preferred that they not only maintain their properties when liberated from the cell surface, but also exhibit stability in the environmental- perhaps in vivo-conditions in which the imprinted molecules are ultimately applied. Among prokaryotic polypeptides, β -barrels are exceptionally stable; they can be denatured only by boiling in strong detergents. And in eukaryotic polypeptides, glycosylation often adds structural stability to otherwise malleable protein structures (Lis et al . , 1993). In some cases, the structural integrity of imprinted domains may be heightened by genetic manipulations. Such manipulations may be performed before or after imprinting, and they may be used to increase the stability of structures that will be attached to, or liberated from, the membrane.
Genetic manipulations of the imprinted polypeptide may also be used to introduce an element of rational structural design to a CISTEM. Structural analysis of the target compound, its native ligands, or molecules that naturally perform functions similar to that desired of the imprinted polypeptide, may guide the insertion of predetermined motifs into the imprinted polypeptide. A tendency toward the formation of certain structural motifs, such as α-helices, may be incorporated into imprinted domains by inserting variable oligopeptides that are not entirely random, but instead are statistically biased toward sequences that form the desired motifs. Predetermined motifs may also be inserted into the conserved regions of imprinted domains. For example, polyproline or a-helices may be used to extend a variable region away from the cell surface, making it more accessible to a sunken epitope on the target compound. While genetic manipulation informed by structural analysis is by no means necessary to CISTEMs, it may be utilized in combination with CISTEMs to design polypeptides that are well-suited to the performance of desired functions. In a preferred embodiment, the CISTEM cell population consists of cells that lack a chromosomal copy of the gene for the imprinted polypeptide, but are capable, upon transfection with an episomal or vector-borne copy of this gene, of translating, exporting, and inserting the imprinted polypeptide in the membrane. Thus, for example, in several preferred CISTEMs, which imprint E. coli OmpA to bind to any one of several targets, E. coli strain UH203 (Freudl, 1989), which is ompA~ , composes the CISTEM cell population. Similarly, in CISTEMs that imprint E. coli LamB to bind to a target, E. coli strain S1755 (Brown, 1992), which lacks the lamB gene, composes the cell population. A eukaryotic example is provided by several preferred eukaryotic CISTEMs, which imprint human ICAM-1 to bind to any one of several targets. In these CISTEMs, COS-7 cells are used; this strain lacks ICAM-1, but is easily transfected with an ICAM-1 gene borne on the CDM8 expression vector (the COS-7/CDM8 expression system is described in Staunton et al., 1990).
The strains that compose CISTEM cell populations are preferably stop-codon suppressors; that is, they recognize one of the stop-codons, such as UAG, as a translatable codon. The reason for this is that any stop-codons recognized by the strain composing the CISTEM cell population must be excluded from the variable oligonucleotides that are inserted into the vector-borne gene for the imprinted polypeptide. (The synthesis of variable oligonucleotides free of certain stop- codons, and their insertion into the gene for the imprinted polypeptide, are described above) . Therefore, the synthesis of variable oligonucleotides is simplified when the CISTEM cell population is able to recognize one of the stop-codons. For example, E. coli UH203, which is used in several preferred CISTEMs for imprinting OmpA, is supF, i.e., capable of translating the UAG stop-codon. Consequently, variable oligonucleotides inserted into the OmpA vector (pEV218), must be free of the stop codons UAA and UGA, but not of UAG. This is easily achieved by the method of synthesis described above. Similarly, S1755, the strains used in CISTEMs imprinting LamB, is supE44, i.e., capable of translating UAG. The medium in which the CISTEM cell population is propagated depends simply on the strain composing the population. For example, E. coli UH203 is grown in Luria broth or agar. When a strain contains a plasmid with an antibiotic resistance gene, the appropriate antibiotics are added to the medium or agar. Standard media for all commonly used strains of prokaryotic and eukaryotic cells are described in the literature and familiar to those practiced in the art. The conditions for transformation of CISTEM cells with the vector bearing the imprinted polypeptide also depend upon the cell type, and are described m more detail above, as well as in the literature. Induction of expression of the imprinted polypeptide requires incubation of the cell population with the compound that activates the imprinted polypeptide' s mducible promoter. As is described above, however, certain inducible promoters, such as the lac promoter/operator elements, are "leaky", allowing for the appropriate level of expression of the imprinted polypeptide without any induction.
When the CISTEM cell population is incubated with a suspension or solution of the target compound, the medium generally contains only the minimal requisites for cellular sustenance m addition to the target compound. The presence of abundant macromolecules other than the target compound can result m selection for clones whose imprinted domain bind not to the target, but rather to one of the constituents of the medium. Thus, for example, in preferred CISTEMs imprinting OmpA, E . coli UH203 are incubated m the presence of any one of several target compounds in ATCC Medium 52. However, m some CISTEMs, Luna broth was used. In several preferred CISTEMs for imprinting ICAM-1, COS cells are incubated with target compound m TC Minimal Medium Eagle or Minimal TC Medium 99. Minimal media for other commonly used prokaryotic and eukaryotic cell lines are described m the literature. The concentration of target compound and duration of incubation are important to CISTEM function, and are discussed m detail below.
When the invention is used for the identification of functionally active polypeptides, such as enzymes or inhibitors of a target compound's biological activity, μ- Ω Ω Φ Ω Hi PJ μ- 3 X Hi PJ μ- μ- μ Ω a o Hi rt TJ Ω 3 r-υ Ω cn rt ε 3 3 rt μ-
3 O Φ CΛ o O μ 3 3 Φ O 3 3 a Φ tr μ- Hi o 3 O tr 3 ~— O O tr μ- n TJ 3 tr 3
TJ 3 rt 3 μ Φ TJ 3 3 3 TJ Φ cn 3 CΛ * μ * CΛ Φ PJ • 3 {—• Φ rt _S TJ a Φ Ω μ TJ PJ TJ μ μ- Φ 3 3 μ 3 μ- ι-< rt Φ CΛ 3 μ-* Ω 3 tr s φ μ 3 μ- o CΛ tr o Ω TJ μ- TJ rt a a μ- rt a 3 μ- ft 3 cn μ- μ- * Φ σ Ω 1 μ- μ 3 tr
3 3 3 Φ φ 3 o 3 μ- tr 3 μ- Φ Ω 3 tr 3 £ tr Ω cn 3 3 —> φ rt μ^ Φ 3 3 rt 3 μ- μ- 3 μ rt 3 μ-> Ω Φ rt t-h CΛ rt Ω Φ *—• 3 TJ {—• 3 μ- rt Φ tr a ε -- rt
Φ a 3 cn a μ-1 Hi Φ Ω PJ PJ ω Φ μ- Φ rt rt 3 ,—• φ cn rt μ Φ rt • tr μ- a tr CΛ O a φ rt 3 μ- a φ tr μ μ- Ω μ- *< 3 tr 3 w O μ- Ω O ε Φ μ μ- PJ Hi o a Φ μ- < Φ TJ Ω rt 3 O Φ rt M TJ μ TI Ω O 3 a ε tr cn 3 μ- 3 a PJ O 3 3 3 μ- • CΛ N φ μ-1 O Φ O 3 μ μ- < o Φ cn o tr 3 o tr μ- tr 3 Φ o 3 μ- 3 CΛ μ- PJ -* φ 3 • a PJ O X*! TJ Ω Φ μ a ε
3 μ- Ω rt μ- a 3 cn CΛ TJ Ω a M a rt 3 n rt rt φ TJ 3 TJ 3 Φ μ rt μ- μ- J tr tr Λ ε J Φ 3 rt o tr 3 Φ μ- a PJ Φ tr rt TJ TJ cn μ-> TJ < Φ tr rt rt μ- Φ 3 tr rt μ- μ-1 PJ {—• μ- 3 μ- 0 tr rt X μ O tr μ- μ-1 M J rt φ X Φ μ- tr
3 rt rt TJ P- o 3 Φ μ PJ O PJ rt rt rt 3 Φ tr φ cn Φ rt μ- 3 O rt O 3 O -1 tr μ Ω o Φ rt 3 μ- O tr tr CΛ ω φ 3 3 Φ μ o Ω TJ μ- μ 3 3 μ- 3 rt φ φ Φ rt o tr φ tr tt μ- p) 3 TJ Φ Φ μ- rt Φ 3 TI PJ TJ O CΛ TJ 3 cn tr t-h PJ 3*" Ω CΛ J μ- a O μ-1 μ- o cn 3 μ- 3 φ rt μ α 3 n TJ Φ l-h < μ- Φ Φ rt rt cn O Φ 3 PJ Ω Φ μ- PJ 3 Φ u* 3 μ-1 μ- O Φ σ 3 φ μ rt
Φ μ- 3 a tr J 3 cn cn PJ a < 3 3 rt μ 3 O X cn μ- 1 3 ^ μ- tr rt
Ω 3 TJ cn 3 Φ tr tr Ω a Φ TJ CΛ a rt tr 3 ε Ω 3 3 μ- Ω cn j-> 3 3 PJ rt X μ Φ μ h-J Φ PJ μ O tr PJ O 3 μ Φ o Φ PJ 3 tr o TJ 3 μ 3 o rt rt μ μ- μ- *—• Φ μ- μ- Φ X μ- Hi Φ Ω μ rt μ- μ-1 3 μ rt Φ μ- 3 TJ O PJ μ- μ- — . 3 3 Φ X
< 3 3 Φ 3 cn 3 Φ tr CΛ Φ 3 Φ PJ • Ω a μ- μ rt tr 3 TI rt φ a PJ Φ φ 3 rt Ω cn TJ tr 3 Φ ft μ- 3 ε rt rt Ω μ- ?T TJ tr μ- 3 μ- φ Ω 3 PJ μ rt J Φ rt φ μ μ- rt a tr O rt tr tr Φ rt 3 w μ- O rt X rt 3 a 3 3 X TJ TJ Φ
"< Hi a P- μ μ- J a Φ 3 Φ 3 a μ- PJ rt μ- ~ tr X! tr rt O μ o Ω
Hi o < 3 3 Φ ε PJ rt 3 rt o o Ω μ-1 • tr O Φ PJ PJ μ- 3 Φ l-1 PJ O σ Φ a 3 Φ rt a 3 μ- tr μ- a O a 3 3 tr " TJ Φ 3 rt tr tr rt 3 μ- Hi *< CΛ 3 μ- Ω o cn Φ rt rt o 3 o PJ rt o Φ CΛ tr μ- 3 o Φ μ- cn Φ TJ TJ
3 rt 3 PJ a o μ- tr < TJ 3 rt tr 3 PJ rt 0 o TJ CΛ Φ 3 3 < a rt μ φ CΛ O a Φ 3 X rt TJ t-h Φ μ PJ tr CΛ φ PJ X tr Hi μ rt rt rt < 3 Φ φ μ TJ μ- 3
CΛ a μ- Φ ε a rt μ- J • μ- μ- Φ 3 μ- φ Φ μ- 3 tr μ- Φ 3 ε rt rt Φ rt 3 3
3 3 o O μ- Φ 3 3 3 μ- TJ 3 3 rt μ a a PJ 3 3 μ- O a μ- μ- a rt rt rt 3 3 a rt μ- rt o rt TJ tr φ Φ μ- rt Ω rt Φ μ- rt 3 a μ-1 tr tr Hi Hi J μ- ^ 3 Φ μ- 3 PJ μ-1 ^-. o Φ rt cn φ 3 μ- X 3 tr tα Φ Φ J O φ o PJ CΛ 3 μ- N a a CΛ TJ tr PJ ε μ-1 CΛ PJ a PJ tr O rt . 3 μ Ω
CΛ P- μ- 3 3 Φ rt tr μ μ-1 TJ CΛ tr < Φ μ 3 μ PJ 3 μ rt rt < ε Ω rt Φ X o tr Φ TJ 3 μ- φ Φ o TJ a μ μ- Φ rt 3 Φ tr X PJ μ- 3 3
PJ CΛ 3 rt tr rt Φ cn o TJ 3 TJ CΛ CΛ φ 3 φ 3 μ- rt Ω 3 Φ μ-> TJ μ μ-1 Λ μ μ Ω P- μ- μ- tr μ- μ-1 TJ rt cn rt tr Φ TJ X rt rt 3 O O Φ TJ VD μ-> *< μ-' ω
X Φ ft ttl O 3 Φ t-h O ^< Φ Φ Φ μ- o rt Φ tr TJ Φ CΛ 3 μ-1 Φ rt VD IV) 0 TJ
Φ —> O μ- 3 a o 3 TJ 3 a μ-1 a ε PJ μ- 3 Φ o μ μ- μ 3 -J o rt 3 o 3 rt Ω cn Λ PJ μ-1 Φ a Φ φ 3 a a rt μ PJ cn O 3 3 PJ μ — ' - μ- CΛ 3
• CΛ tr J • X μ-1 a TJ Φ a Ω tr φ CΛ 3 rt Ω μ- Ω rt μ^ rt X . cn Ω rt cn a μ- 3 rt Φ 0 o rt a o rt rt μ- Φ 3 cn rt 3 Ω Φ 3 3 φ P- μ- Φ
M μ- 3 rt TI tr 3 ε 3 μ- 3 μ- tr 3 CΛ μ μ h-1 μ- μ- 3 μ μ μ rt M μ- n 3 0J μ
3 3 a μ- Φ rt μ- J a o J O PJ μ- Φ Φ rt O O μ-1 μ 3 Φ 3 3 M 3 *—•
• μ - 3 μ- φ μ μ- 3 rt Hi O Ω Ω μ- Hi 3 3 Cn Ω Hi Ω rt CΛ rt φ Ω
O ε rt CΛ rt ω X 3 • 3 μ- 3 φ o TJ μ rt μ-1 3 O rt φ H Φ tr rt tr 3"* μ- rt PJ μ- PJ μ- X TJ X PJ 3 3 μ- 3 3 3 tr μ W rt Φ tr μ- Φ 3 *• μ cn TJ tr rt 3 Ω X cn 3 a rt 3 μ- a PJ 3 3 O μ- a T μ- J £ > O 3
Φ Ω TJ X φ μ φ tr CΛ PJ Φ μ o O μ- Ω tr rt cn a 3 a O CΛ Ω μ- μ tr rt PJ μ- φ V-1 o cn Φ φ 3 3 rt Φ 3 μ N PJ Φ 3 Φ μ- 3 rt μ- tr rt Ω
PJ μ- rt rt φ Ω μ- cn μ 3 rt O PJ φ rt cn μ μ — . o 3 n μ- 3 Φ tr 3 ε rt μ μ Ω Φ a Φ rt O Cπ μ- CΛ μ- μ- 3 rt 3 a I-H o TJ O μ-1 o tr X CΛ rt a Φ Φ rt ε tr "-^ 3 0 O μ-< 3 ^ CΛ 3 μ cn μ Φ Φ μ- 3 a μ- φ 3 3 3 μ-1 σ H H- Φ 3 a rt o μ rt tr rt μ- ^ W 3 3 Λ 3 Φ tr tr φ tr CΛ
Figure imgf000068_0001
Φ S rt a
~ cn φ •< CΛ
the procedure demonstrates that the agent is competent to perform CISTEM selection.
Second, this procedure is used to determine the experimental conditions that will allow reliable CISTEM selection. To adjust CISTEM selection, multiple parameters may be varied, including the following: concentration of the target compound ( [T] ) with which the CISTEM cell population is incubated; duration of incubation with the target compound; initial multiplicity of infection (Mol)or concentration of the selection agent; duration of incubation with the selection agent; temperature; level of expression of the imprinted polypeptide. Each of these parameters may be optimized by varying the parameter value over a number of CISTEM trials, as will be described here. However, every parameter need not be optimized for every embodiment of this invention. In preferred embodiments of the invention, at least two parameters are optimized by multiple trials, as described here, while other parameters are set by common laboratory procedure or precedents described here. In the optimization described here, concentration of target compound, [T] , and initial multiplicity of infection, Mol, are the two parameters that are varied over a number of trials. In the examples below, an optimization is described in which two different parameters - namely, level of expression of the imprinted polypeptide and duration of incubation with the selection agent - are varied.
The procedures for demonstrating that the selection agent is competent to perform CISTEM selection, and for optimizing CISTEM parameters, make use of the cell population whose construction is described above. In this cell population, an oligopeptide tag is exhibited in the imprinted domain. This tag binds with high affinity and specificity to a readily available ligand. (The T7 tag/T7 antibody is a preferred example of a tag/ligand pair; this pair and others are described above and in the literature) . In the optimization described here, expression of the imprinted polypeptide bearing the oligopeptide tag is regulated to the relatively low level estimated to be roughly 1000 copies per cell. As was noted above, however, in the optimization of some embodiments of this invention, such as that described in the examples, the level of expression of the imprinted polypeptide is one of the varied parameters. The method by which regulation of the imprinted polypeptide is achieved depends upon which heterologous promoter controls the gene for the imprinted polypeptide. For example, on pEV218T7, the ompA plasmid used in a preferred embodiment of this invention, ompA is under the control of the lac regulatory elements, which are induced by isopropyl-b-D-thiogalactopyranoside (IPTG) . In addition, the ompA gene on pEV218T7 comprises a TAG stop codon corresponding to amino acid 7 (Freudl, 1989) of the polypeptide; therefore, when expressed in the amber-suppressing (supF) host E. coli UH203, ompA expression is regulated not only by the level of induction with IPTG, but also by the concentration of the suppressor tRNA, which is expressed from a plasmid. Low level expression of OmpA is achieved with 1.0 mM IPTG induction. On pSB1649, lamB is under the control of the lac regulatory elements, and expression is regulated as described in the literature (Brown, 1992) . Regulation of the expression of ICAM-1 from pCDl.8, the CDM8 vector construct used in a preferred eukaryotic embodiment of this invention, is also described in the literature (Staunton et al., 1990).
Two populations of cells (populations A and B) exhibiting the oligopeptide tag in the imprinted domain are grown in a medium appropriate to the cell type and the expression vector. (For instance, in a particularly preferred embodiment of the invention, E. coli UH203 transformed by pEV218T7 are grown at 37°C in Luria broth supplemented with lOOμg/ml ampicillin, and 1.0 mM IPTG for induction of pEV218T7 expression; conditions for growth of other cell types are described above and in the literature.) A third population of cells (C) of the same strain as populations A and B, transformed by the same vector as populations A and B, but without the oligopeptide tag, is grown in the same medium. (Thus, for example, in the particularly preferred embodiment mentioned above, E. coli UH203 transformed by pEV218, rather than pEV218T7, are grown in the conditions described above.) Populations B and C are each grown to generate three clonal subpopulations of 10 cells, Bl, B2, B3, and Cl, C2, C3. Population A is grown to generate nine clonal subpopulations, Al , A2...A9, each comprising 10 cells.
Populations Al ...9 and C1...3 are incubated with the target compound, while populations B1...3 are incubated under identical conditions, but without the target. Optimal conditions and time required for binding between the oligopeptide tag and the target are prescribed by the supplier of the tag/target system or in the literature (Phizicky et al., 1995). In the particularly preferred embodiment, in which the T7 tag is displayed at loop IV of E. coli OmpA, and the T7 antibody (Novagen) serves as the target compound, the cell population expressing pEV218T7 is incubated with mAbT7 for approximately 60 minutes. The concentration of target compound with which cells are incubated is systematically varied across populations Al ...9 and Cl ...3 in order to identify the concentration that is optimal for CISTEM selection (table 3) .
Populations Al ...3 are each incubated with the lowest concentration of target compound. This minimum concentration varies, depending upon the properties of the agent and the
3 availability of the target, but is generally approximately 10 x concentration of cells x number of imprinted polypeptides per cell. Thus, for example, for a population of 10 E. coli grown under the conditions described above to express roughly
1000 copies of OmpA per cell, 0.001 mg mAbT7 is used in 1 ml
3 of medium. This is approximately 4 x 10 target molecules
(mAbs) per imprinted polypeptide. Populations A4...6 are incubated with a higher concentration of target compound, such as lOx the concentration used with populations Al ....3.
Populations A7...9 and C1...3 are incubated with the highest concentration of target, such as lOOx the concentration used with populations Al ...3.
Following incubation, populations A1...9, B1...3, and
C1...3 are exposed to the selection agent. The technique of exposure depends upon the nature of the agent, and, for all well-known agents, is described in the literature. Methods of exposing prokaryotic cell populations to bacteriophages are described by Calendar (Calendar, 1988); methods of exposing prokaryotic cell populations to colicins are described by
Pattus et al . (Pattus et al., 1990); and methods of exposing eukaryotic cell populations to viruses are described in the literature (e.g. Staunton et al . , 1990, Tyler et al . , 1996). Like the concentration of target compound, in the optimization described here, the concentration of agent is varied across the cell populations in order to identify optimal conditions for the CISTEM' s selection (table 3). (As was noted above, however, not every parameter is varied in optimizing each embodiment of this invention. In the optimization described below, for example, the concentration of target is not one of the parameters that is varied. Instead, this concentration is fixed, while two other parameters are varied) . Populations Al,4,7, Bl, and Cl are exposed to the lowest concentration of the selection agent. This concentration will vary from one CISTEM to another, depending upon the properties of the agent, the method and duration of exposure, and the availability (and thus the concentration) of the target compound, but it will generally be roughly the minimum concentration required to affect every cell in a fully susceptible population during a CISTEM run. Thus, for many lytic bacteriophages, such as coliphages K3 or λ, when the method of exposure is classical phage infection (Calendar, 1988), and the duration of incubation with the bacteriophage is 60 minutes at 37°C, the concentration will be 2 - 5 x the cell concentration, or Mol » 2 - 5. Cell populations A2,5,8, B2, and C2 are exposed to a doubled concentration of the agent, and populations A3, 6, 9, B3, and C3 are exposed to yet another doubling of agent concentration. Table 3 summarizes the variation of target and agent concentrations over the cell populations in this procedure for optimizing CISTEM selection.
The concentrations in this procedure may vary depending on the nature of the selection agent and the method of exposure, but several fundamental aspects of the procedure do not change from one CISTEM to another. First, in the control populations, B and C, the selection agent has unimpeded access to its receptor, and therefore affects every cell. Population B is susceptible to the agent because, although cells bear the tag at the imprinted domain, the population is not exposed to the tag's ligand. Therefore, the selection agent's site, e.g. the adhesion domain, remains exposed and functional. Population C is susceptible because, although the cell population is exposed to the ligand, the ligand is not bound by the imprinted polypeptide, which lacks the ligand' s tag. Because the selection agent's adhesion domain remains exposed and functional in populations B and C alike, the results of exposing these populations to the agent are the same. Thus, in these control trials, exposure to the agent yields the same results for populations Bl and Cl; likewise for populations B2 and C2; and for populations B3 and C3. These controls confirm two conclusions critical to the CISTEM: 1. the presence of the target ligand does not affect the selection agent's activity, provided the ligand is not bound to the imprinted polypeptide; 2. the presence of the tag in the imprinted domain does not affect the selection agent's activity, provided the tag is not bound to a target ligand.
Whereas control populations B and C are entirely susceptible to the selection agent, populations Al ...9 exhibit a range of resistance, extending even to immunity. In the cells of population A, the imprinted domain harbors the oligopeptide tag, which binds the target ligand during incubation. As a result, the adjacent adhesion domain is occluded from the selection agent, and the cells are protected. Populations Al ...9 exhibit a range of resistance because they are exposed to a range of concentrations of target ligand. When the range of concentrations is broad enough, the range of resistance extends from partial susceptibility to complete immunity.
Based on these preliminary trials, the concentration of selection agent that allows for functional CISTEM selection is found as follows. Ubiquitous pathogenic effect--!. e. every cell in a population is affected by the agent--is easily observed when cell populations are exposed to agents according to protocols described in the literature. For example, in exposing prokaryotic populations to virulent lytic phage, ubiquitous pathogenic effect results in confluent lysis of the population, with zero surviving colonies (Calendar, 1988) . The lowest concentration of selection agent at which every cell in the control populations (B and C) is affected by the agent is an appropriate level for CISTEM selection. For example, if some cells from Bl and Cl (table 3) escape the selection agent, but the cells of B2 and C2 are all affected by the agent, then the appropriate concentration of selection agent for the CISTEM is Mol = 10. In this example, the minimum concentration required for ubiquitous effect could be more exactly determined by further trials with Mol's ranging from 5 to 10.
After the CISTEM' s concentration of selection agent is found, the optimal concentration of target compound is determined as follows. Each CISTEM has an approximate threshold affinity, Ka = t, such that clones whose imprinted domains bind the target with an affinity below this threshold (Ka < t) are eliminated from the population by the selection agent, whereas clones whose imprinted domains exhibit an affinity above the threshold (Ka > t) are protected from the agent, and thus selected. The threshold for a CISTEM can be set by adjusting the concentration of the target compound. To set the threshold to approximately the affinity that the oligopeptide tag exhibited by populations A and B shows for its ligand, the target concentration used in the CISTEM is the lowest level that provides complete protection to population Ai in the presence of selection agent at the CISTEM' s Mol. (Complete immunity to the agent is easily observed when cell populations are exposed to agents according to common protocols described in the literature.) Thus, for example, if the CISTEM' s concentration of selection agent, determined as described above, is Mol = 10, and population A5 exhibits complete immunity to the selection agent while population A2
4 is slightly reduced by the agent (table 3) , then 10 molecules of target compound per imprinted polypeptide is used in a
CISTEM to select for cells whose imprinted polypeptides bind the target with an affinity equal to or greater than the affinity exhibited by the oligopeptide tag for its ligand. A slightly lower concentration of target compound, such as
3 10 molecules per imprinted polypeptide, is used to select for cells with imprinted polypeptides that exhibit an affinity for the target that is significantly higher than the affinity exhibited by the oligopeptide tag for its ligand. A higher
5 concentration of target compound, such as 10 molecules per imprinted polypeptide is used to set the CISTEM' s threshold affinity well below the affinity exhibited by the tag for its ligand.
The application of CISTEM to cell populations yields cell clones that survive the selection procedure by virtue of the interaction between the polypeptide of interest and the target (s). The sequence of the polypeptide of interest is encoded by the DNA sequence which has been inserted into the display domain. The surviving cell population can be monoclonal or polyclonal. In the latter instance, surviving cells need to be plated out on solid medium plates to recover colonies derived from single cells (state of the art bacterial genetics) . Amplification of monoclonal cell populations and the subsequent preparation of their relevant episomal or vector DNA is performed according to standard protocols of recombinant DNA methodology. Since the episomal DNA codes for the display protein and its inserted polypeptide of interest, DNA sequence analysis of the inserted DNA tag (state of the art recombinant DNA technology) provides the DNA sequence of the insert and thereby allows the prediction of the ammo acid sequence of the polypeptide of interest which has been expressed and displayed at the surface as integral part of the membrane protein. The inserted DNA is derived from a pool of randomly synthesized oligonucleotides or from DNA fragments derived from natural occurmg mRNA (cDNA) or genomic DNA. The selected DNA and cognate polypeptide sequences can be scrutinized to match sequences of known proteins and their cognate genes. Thus, sequence analysis can be employed to identify the polypeptide of interest and to attribute and/or relate its composition to known genes or DNA sequences and their predicted proteins.
Modifications and elaborations of the basic CISTEM achieve several extensions of its capabilities, including the following: 1. augmentation of the maximum affinity of selected polypeptides for the target compound; 11. specification of the number of binding sites joining a multivalent imprinted polypeptide to the target compound; 111. selection for imprinted polypeptides that exhibit both high affinity for the target compound and low affinity for a second compound; IV. selection not only for affinity, but also for functional activity; v. screening the polypeptides expressed from a cDNA library for binding to a target compound; vi . selection for imprinted polypeptides that exhibit affinity for one or more of a set of multiple, distinct target compounds.
To increase the maximum affinity attainable in a CISTEM, iterative CISTEM runs are performed (figure 10) . Once a population of clones exhibiting affinity for the target compound has been selected by a CISTEM, the vectors bearing the gene for the imprinted polypeptide are isolated from this population by common techniques appropriate to the vector, and described in the literature. New variable oligonucleotides, such as those described above, are then inserted into the vectors alongside the original variable insertions. This is performed by the same procedure that was used to achieve the original insertion of variable oligonucleotides at a genetic site corresponding to the imprinted domain. (One such procedure is described above) . Alternatively, variation is introduced not as an extension of the original oligonucleotide inserts, but rather as an oligomeric rearrangement of the original inserts (the procedure of oligomeric rearrangement of oligonucleotide inserts is described in Brown, 1992) or a limited mutagenesis of the original inserts (techniques for the limited mutation of oligonucleotide inserts are described m Balmt et al . , 1993; insertion of mutated oligonucleotides is performed according to the procedure described above for insertion of variable oligonucleotides into a targeted site in a vector) . In other words, oligopeptide "sequence space" may be searched by a number of strategies, some of which have been applied to screening other combinatorial libraries (Balmt et al., 1993, Brown, 1992, Janda, 1994). The vectors bearing fresh variation at a site corresponding to the imprinted domain are then used to transform a population of cells, according to common protocols cited above, creating a new CISTEM cell population. This population is then subjected to yet another round of selection by the CISTEM (figure 10). This cycle of introducing variation, selecting for affinity for the target compound, and allowing selected clones to multiply, is J Ω tr Hi cn rt rt cn Ω PJ cn μ ε TJ cn cn Hi rt μ- < cn o μ- a TJ μ- CΛ tr tr TJ CΛ TJ CΛ Hi rt μ- rt Φ μ- μ- Φ tr tr 3 μ rt Φ o tr O μ- μ- O μ- Φ cn Φ φ Hi 3 o o 3 μ- Φ μ- Φ rt PJ Φ o J rt μ-1 3 μ Φ Φ tr Φ μ-1 3 μ- μ- rt rt μ rt Ω O Ω T 3 TJ TJ 3 3 μ μ μ X μ μ Φ
3 a CΛ Φ J J φ 3 Ω *< Φ φ Φ tr μ-1 rt 3 rt rt 3 3 μ X μ- a Hi PJ PJ 3 vQ μ
3 CΛ cn rt Ω TJ o φ rt 3 Ω a tr T rt μ ti PJ O μ tr tr μ- μ-1 μ- μ-1 3 μ- o rt H1 Φ 3 Φ 3 rt μ o Ω μ- O ft φ Ω μ- tr PJ μ- rt μ 3 Φ Φ 3 3 3 φ TJ 3 μ Φ (- 3 Hi μ rt rt rt Φ rt μ μ- Φ 3 rt 3 rt φ o < TJ o 3 Φ rt X Φ cn rt "• rt rt μ iQ O 3 Q Φ Ω Hi tr Φ μ- X tr o O < Ω Φ X tr a Hi 3 rt μ 3 3 a Φ Ό μ- μ- Φ μ μ- μ- μ- μ-1 Φ μ- Φ Ω a
TJ TJ Φ 3 3 μ- φ a Φ μ μ- μ PJ μ- μ Φ tr μ 3 μ- O a Φ 3 O Ω 3 Φ 3 O μ-1 μ 3 - O 3 J μ cn cn μ- a φ 3 Φ cn Hi Φ a Φ TJ CΛ 3 X rt ttl μ X CΛ cn μ- 3 3 3
Φ Φ rt a 3 rt rt Φ 3 Φ cn X cn • μ 3 o < μ rt μ- φ Φ ** tr TJ rt 3 TJ 3 cn J rt CΛ μ O 3 CΛ X CΛ tr • T φ Φ PJ O μ 3 μ- μ- φ o O O a rt 3 n *< PJ *< 3 O rt a cn μ o ~ J Φ μ- 3 >^ -* o 3 μ z TJ 3 μ- PJ o 3 3 t-h 3 tr rt tr Ω tr 3 μ- μ μ X μh μ ft J ε rt Φ TI φ HI 3 Φ rt φ TJ 3 μ- 3 rt X tr tr Φ Φ CΛ r . Ω Φ Hi φ 3 *— cn 3 Φ tr 0 μ 3 Φ 3 £ tr a μ- X μ rt X 3 CΛ Φ μ- Ω μ- O ^ ι-3 o o μ a rt X rt ω μ 3 o O o "• Ω O μ- μ- o rt O tr • μ-1 a 3 Φ 3 μ rt μ- TI TJ 3 μ- μ σ μ Φ X 3 3 rt Φ cn μ-1 CΛ 3 < 3 -• TJ Φ rt >< φ μ-1 a CΛ PJ 3 £ Φ tr CΛ o PJ μ-
3 μ O μ^ tr a tr CΛ PJ CΛ Φ μ- X PJ μ tr ι-3 1 TJ φ -' a Φ μ TJ CΛ μ μ- rt Hi rt 3
O 3 o Φ φ - o Φ μ l-* X CΛ μ o 3 μ- n Φ tr μ- o μ CΛ rt o X XI μ Hi 3 μ- tr rt a rt TJ 3 n μ Hi μ J* Φ J Φ O μ- rt μ- l-h cn PJ Φ μ- 3 μ-1 φ ^ tr 3 3 φ μ- μ- o μ- 3 Φ o PJ μ- μ T rt • 3 £ rt Ω 3 rt O 3 μ- rt -> X cn TJ ^< a φ PJ Φ rt 3 3 μ 3 Ω μ- 3
Φ μ o μ J ft μ 3 rt 3 φ tr 3 Φ Φ Φ μ TJ ε μ- 3 rt 3 X μ rt μ- 3 X
TJ 3 3 O •-3 J o Ω cn O μ- Ω J μ-1 J X Ω 3 H- μ- Φ rt tr rt 3 rt Ω Φ PJ μ- Φ PJ X CΛ μ rt 3 3 tr μ- μ 3 Φ Φ 3 o μ-1 μ- 3 Φ μ rt rt Φ CΛ 3 TJ O μ- j μ- O a 3 n 3 μ O tr rt Φ a ~ Φ 3 X 3 }—• a Φ 3 φ n μ- O O Φ rt rt Ω μ o PJ 3 Ω X CΛ X TJ rt Φ
O a - X Φ a μ-1 • O Φ cn PJ 3 a Hi PJ Φ μ- Φ tr X ttl μ-> TJ TJ O CΛ Φ Φ φ tr rt
TJ Φ μ rt rt cn rt a X Φ tr Φ rt o Ω a a X Φ O O 3 n H ø. rt TJ Φ
Φ TJ J < Φ rt o TJ O μ- H μ- μ- rt ε
X *—• tr TJ μ tr Φ tr Φ rt rt ΓJ 3 -' rt TI rt Φ
CΛ O 3 Φ J tr o t-h O rt tr a cn μ 3 Φ O Φ o μ- a I μ- X tr M 3 ^ μ- CΛ £ Ω μ- a Φ
. μ^ a μ cn φ o TJ rt Φ φ φ rt J Φ Hi TJ rt Φ o Φ σ tr Ω Φ CΛ a TJ 3 -3 CΛ tr 0 a Φ 3 < < O 3 O 3 μ CΛ μ- Ω 3 o < 3 tr < 3 μ- μ- μ- O ι-3 " φ 3 M 3 φ CΛ
S T rt 3 rt TJ J PJ < 3 Φ Ω μ-1 rt Φ μ-1 Φ Φ PJ rt rt σ 3 μ- TI TJ O £ ε TJ μ- rt μ Φ tr O tr o CΛ J 3 3 Φ ε O μ-' Φ μ- tr Ω PJ a μ- tr μ- TJ 3 £ 3 rt 3 cn μ- TJ O cn μ tr rt TJ φ Φ t-h μ 3 ι-3 rt CΛ a Ω Φ rt *— cn X φ rt rt μ- 3 φ PJ rt O T 3 μ- CΛ rt Φ 3 Φ Φ Φ tr rt μj O Φ 3 tn μ- Hi rt μ " 3 O O μ- 3 PJ - μ 3 μ μ μ^ a 1 O tr μ 3 X μ o μ-1 μ CΛ a a £ O O < O Φ μ- ' J 3 TJ μ O TJ CΛ < rt 3 μ- 3 rt Φ rt 3 l-h a 3 μ- φ a μπ 3* 3 μ PJ μ tr J μ 3 μ 3 μ o μ- 3 tr a 3 3 μ- CΛ tr tr o Φ μ rt O μ- cn • a 3 J Φ CΛ 3 μ CΛ μ- o μ Φ Ω o PJ μ- Hi μ 3 μ Φ • rt CΛ TJ μ PJ μ- μ μ- 3 Φ TJ
Ω CΛ H 3 tr μ-- CΛ Φ μ- 3 ε μ-1 Ω 3 O 3 o μ- Φ μ-j rt o X 3 CΛ Ω X μ tr H μ- a K3 J PJ CΛ Φ a μ- φ φ a Hi rt μ- PJ PJ TJ ≥, a . — φ tr 3 μ-j tr μ- Φ μ μ- rt £ rt CΛ H- φ tr tr ε φ < o 3 o a Φ μ-1 3 3 tr μ φ Hi PJ X) PJ 1 3 μ- cn Φ 3 ti tr tr J O Ω μ- TJ μ-1 tr μ 3 cn rt 3 a Ω a 0 μ-1 φ X TJ μ- < rt tr μ rt X 3 Φ rt
Φ PJ 3 Φ μ- Hi ε rt CΛ o Φ μ- rt μ PJ Φ μ- μ Φ Φ ε PJ 3 φ < rt o X! 3 TJ X tr Ω ω Φ ε rt 3 cn Hi Φ μ- T Ω Φ μ- μ a Φ cn h-1 TJ CΛ 3 3 μ- ~ 3 μ Φ 3 Φ μ n μ Ω o a
CΛ π μ- μ O T 3 0 tr a 3 rt φ cn Ω -> o • μ- μ 0 ^ μ μ- X rt 1 o Φ μ t-h μ tr rt rt 3 3 O \— μ-1 Hi Φ cn μ cn -> 3 3 Φ 3 rt TJ Φ 3 tr cn 3 CΛ PJ φ a
Φ μ- CΛ 3* μ- rt • TJ PJ μ- 3 Hi μ O a a μ- !< TI μ-1 TJ X cn tr Φ tr P- Ω Ω X 3 cn φ PJ o
X 3 3 ι-3 P rt tr 3 rt X O o o Hi PJ φ σ tr TJ μ- rt μ μ- μ-1 Φ TJ μ-> μ^ tr tr 3 tr M Φ 3 Hi 3 μ- a O R rt *< J M μ-1 μ- O ε μ 3 3 rt cn Φ φ μ O μ- o ^< rt μ-1 φ μ- φ TJ £
•» 3 3 3 O 3 a Φ Hi J
O ε £ rt rt Ω a TJ CΛ X 3 3 1 cn μ- —' rt 3 Φ 3 CΛ a μ- μ-
3 rt Hi rt 3 3 tr rt rt tr PJ μ Ω rt ft Φ rt μ- Φ a • μ o rt 3 3 tr CΛ TJ J o μ- rt μ- Ω Φ tr tr Φ PJ μ- μ- O μ- rt Φ Φ 3 i~> Φ Φ 3 CΛ μ- μ- tr μ-
O Φ Φ o Hi μ rt tr O O μ-1 PJ Φ Φ cn tr 3 3 a tr a μ-1 TJ Φ TI X! 3 rt O μ 3 . rt PJ
Hi TJ ft μ- 3 t-h Φ μ μ- X TJ Φ r Φ rt φ CΛ μ Ω μ- μ- μ- PJ μ o φ ε • 3 rt Φ 3 φ cn O t-h cn 3 Φ Φ a rt O tr μ μ φ μ- rt CΛ μ o rt rt tr PJ a rt PJ Ω μ J μ- Ω rt μ- μ- TJ 3 Ω tr 3 J Φ Φ ε 3 Φ CΛ 3 μ- . tr o ^ tr μ ft J CΛ μ cn Φ μ- μ rt μ Φ μ- PJ φ μ X - rt a rt ω < > rt o rt
Φ X rt rt O μ-1 a CΛ Φ P- rt t-h tr Φ μ- Φ Φ ^ J μ- M tr φ O μ- tr μ- 3 φ rt 3 μ- μ- o o μ a 3 Ω CΛ φ ft 3 O Φ 3 3 cn cn 3 rt Ω Ω < 3 Φ PJ J φ CΛ 1-3 μ-1 3 a rt Φ Φ 3 a rt £
imprinted polypeptide that is imprinted, a new epitope on the target compound is bound.
Certain embodiments of this invention select for imprinted polypeptides that exhibit high on-rate in their binding with the target compound. In these CISTEMs, the CISTEM cell population is not pre-mcubated with the target compound prior to introduction of the selection agent. Rather, the selection agent is introduced simultaneously with the target compound. The concentrations of agent and target compound are varied over multiple trials, as is described in the optimization procedures detailed elsewhere in this document. Concentrations are adjusted such that, upon simultaneous introduction of agent and target to the CISTEM cell population, clones whose imprinted polypeptides bind to the target relatively rapidly are protected, whereas clones whose imprinted polypeptide binds to the target more slowly are eliminated by the selection agent.
A CISTEM that imprints a polypeptide to show high affinity for a target compound but extremely low affinity for a second compound, which is here called the "repelled compound", is constructed as follows. The size of the target compound, the size of the repelled compound, and the distance between the imprinted domain and the adhesion domain are altered, if necessary, to meet the following condition: when the target compound is bound to the imprinted domain, it occludes the adhesion domain, whereas when the repelled compound is bound to the imprinted domain, it is too small to occlude the adhesion domain. If this criterion is not met fortuitously (i.e. if the repelled compound does not happen to be extremely small), then the diameter of the target compound and the distance between the domains may be altered, as follows. The target compound may be enlarged by creating a multimer. (The procedure for creating a multimer depends upon the nature of the target compound, but many techniques for joining multiple copies of a compound are commonly practiced and described m the literature.) The distance between the imprinted domain and the adhesion domain may be increased by moving either domain to a site on the imprinted polypeptide that is topologically farther from the other domain. Transposing the imprinted domain simply involves inserting variable oligonucleotides, according to the procedure described above, at a genetic site corresponding to a different extracellular domain of the imprinted polypeptide. In many prokaryotic and eukaryotic embodiments of this invention, the adhesion domain may be transposed to a different extracellular domain of the imprinted polypeptide without affecting its function as a receptor for the selection agent. The procedure for transposing an adhesion domain is described above as well as m the literature (Chu et al . , 1995) . If, as a result of moving the adhesion domain and the imprinted domain farther away from each other, one or more extra- or intracellular domains that are not involved in CISTEM function occur between the two, topologically separated domains that are critical to CISTEM function, the superfluous domains may be replaced by polyprolme linkers m order to ensure that extra- or intracellular structures do not impede proper occlusion of the adhesion domain by a target compound bound to the imprinted domain.
Once the three critical distances - the distance between the imprinted domain and the adhesion domain, the diameter of the target compound, and the diameter of the repelled compound - are m accordance with the criterion stated above, the CISTEM cell population is incubated with both the target compound and the repelled compound. Then the CISTEM' s selection agent is introduced, and this creates two selective pressures. One pressure favors cells expressing imprinted polypeptides with higher affinity for the target compound (figure 12) : a target compound bound to an imprinted domain occludes the adjacent adhesion domain, and thus serves to protect the underlying cell from the selection agent. Among cells that exhibit some affinity for the target compound, a second selective pressure favors cells with lower affinity for the repelled compound; a repelled compound bound to an imprinted domain not only fails to occlude the adjacent adhesion domain, but also prevents binding by a target compound (figure 12) . In short, an imprinted polypeptide ' s affinity for the repelled compound effectively reduces its affinity for the target compound. Thus, selection favors cells that exhibit imprinted domains with high affinity for the target compound and low affinity for the repelled compound (figure 12) .
To select not only for affinity but also for functional activity of imprinted polypeptides, a basic CISTEM may be coupled to a secondary screen for activity. In the first step of coupled selection, a CISTEM selects clones whose imprinted polypeptides bind the target compound m the prescribed fashion. From this population of clones, or from the imprinted domains extracted from such a population, a secondary screen selects the structures whose binding to the target compound exhibits the desired functional activity. Imprinted domains liberated from the cell surface by detergent or by expression as a secreted isomorph (lsomorphic forms and their mterconversion are described above) may be subjected to any commonly practiced screens or assays for activity or modification of the target compound's activity. For example, one CISTEM imprints E. coli OmpA to bind with high affinity to soluble IL-6-Receptor (This target compound is described in more detail above) . The imprinted polypeptides, freed from the cell membrane by a conventional gentle detergent, which is known not to denature OmpA, are then subjected to a second screen for their capacity to agonize the cell-bound, functional IL-6-Receptor . This screen for agonistic activity is described m the literature (Hibi et al . , 1990). Alternatively, extremely strong selection for functional activity may be exerted upon the CISTEM' s selected set of cellular clones, and does not require liberation of the imprinted domains from the cellular membranes. In fact, precisely because they yield replicating cell populations, CISTEMs can be coupled to selection regimes very similar to genetic complementation. In these selection regimes, each cell's supply of a vital compound, such as a nutrient, is dependent upon the imprinted domain's performance of the function desired of it, such as enzymatic activity. In other words, the imprinted domain's functional activity either converts the target compound to a compound vital to the underlying cell, or secures the vital compound for the cell. In certain embodiments of this invention, the target compound consists of a library of separate and distinct molecules, which is here called „the target library". A CISTEM m which a target library (rather than a single target compound) is used serves to select interacting pairs m which one member of each selected pair is the oligopeptide displayed at the imprinted domain of a protected clone, and the other member is a target library molecule that binds to this oligopeptide and thus serves to protect the clone. Provided each molecule m the target library is large enough to meet the size requirement for CISTEM function, and is present in the medium at a concentration adequate to confer protection upon cells whose imprinted polypeptides bind to one or more molecules m the target library, the target library may comprise as many distinct molecules as is desired, and may be constructed according to any known procedure for making or isolating libraries of molecules.
In CISTEMs involving target libraries, two strategies are used, if necessary, to increase the concentration of each molecule m the target library to the minimum level required for effective CISTEM selection, i . e . , to achieve the concentration required to confer protection from the selection agent upon those cells whose imprinted polypeptides bind to one or more of the molecules m the target library. In one strategy, the CISTEM cell population and the target library are each amplified, divided into fractions, and exposed to one another m microtiter wells. This allows the concentration of target molecules to be sufficiently high to confer protection upon cells whose imprinted polypeptides bind to one or more molecules in the target library.
In the second strategy for increasing the concentration of each molecule m the target library, the target library is secreted by the CISTEM cell population itself. As was described above, libraries of variable oligopeptides can be displayed as fusions to gene protein 3 of E. coli filamentous phage (M13, fl, or fd) (filamentous phage-display technology is described in Janda, 1994 and Phizicky et al . , 1995). Filamentous phage create relatively avirulent infections m which phage particles are liberated from the host cell by extrusion, leaving the cell in tact. Therefore, a CISTEM cell population that secretes its own target library can be engineered as follows.
As in the basic CISTEM described above, a cell population that lacks a gene for the imprinted polypeptide is transformed with a library of exogenous copies of this gene, m which variable oligonucleotides have been inserted at the genetic site corresponding to the imprinted domain. For example, in the construction of an E. coli CISTEM cell population described above, E. coli UH203 is transformed with pEV218I, a library of ompA plasmids with variable oligonucleotides at the genetic site corresponding to extracellular loop IV of the membrane protein. The resultant cell population displays a library of exogenous oligopeptides at the imprinted domain. In an extension of the basic CISTEM, this cell population is then transformed with a second extra-chromosomal genetic element that encodes a secreted library of oligopeptides. The OmpA- imprinting cell population, for example, is transfected with a library of filamentous bacteriophage engineered to display exogenous oligopeptides at gene protein 3. (Many such libraries are described in the literature, reviewed m Janda, 1994 and Phizicky et al . , 1995.) Each cell m the population now has two extra-chromosomal genetic elements: one encodes OmpA, in which an extracellular loop displays an exogenous oligopeptide; the other encodes a stably extruded M13 phage, which gp3 displays a second exogenous oligopeptide, distinct from the one displayed on the cell surface (figure 14) .
The CISTEM cell population expressing two extra- chromosomal genetic elements is then exposed to the CISTEM' s selection agent, following the same protocols for exposure as were described for the basic CISTEM. Upon exposure to the selection agent, a clone m the CISTEM' s population is protected only if the target library molecule that is secreted by the clone binds specifically to the clone' s imprinted domain. That is, a clone is protected by an autocrme loop. In the specific CISTEM m which E . coli displays an OmpA library and secretes an M13 library, K3 serves as the selection agent (figure 13) . An E. coli clone is protected from K3 only if the exogenous polypeptide displayed at loop IV of OmpA binds specifically to the M13 that the clone itself secretes. The genes for both members of this interacting pair of oligopeptides are easily recovered from the protected cell. In sum, through autocrme protection of clones, a CISTEM with a target library selects for specifically interacting pairs of oligopeptides, as well as for the genes that encode them.
In another example of autocrme protection, a CISTEM is constructed according to the procedure ust described, but a hemolysin library is used m place of the phage M13 library. Hemolysms are secreted bacterial polypeptides that tolerate exogenous insertions. Thus, the hemolysin gene can be used to carry exogenous genes, which encode polypeptides that are then secreted from the cell.
Two possibilities for an intracellular CISTEM are depicted m figure 20. In both possibilities, the protective partner :partner interaction occurs within the cell. Version A shows a typical CISTEM, wherein the two partners are transported to the cell surface. In version B, the positive selection occurs, because the two partners cannot be transported to the cell surface and e.g. accumulate withm the cell and/or become substrates for protein degradation pathways .
Certain embodiments of this invention are used to identify antigens associated with tumors. A CISTEM for the identification and isolation of cancer antigens may be constructed as follows: First, a cDNA library is derived from a patient's tumor cells. Protocols for isolation of cells and derivation of the cDNA library are described m the literature. The tumor-derived cDNAs serve as the variable inserts m the construction of the CISTEM cell population: they are inserted into the imprinted polypeptide vector at a genetic site corresponding to the imprinted domain of the polypeptide. The cDNAs are inserted m all three reading frames m order to guarantee that polypeptides associated with the tumor are expressed by the CISTEM cell population. Methods for inserting cDNAs into vectors are available m the art. These vectors are used to transform a population of cells lacking an endogenous copy of the gene for the imprinted polypeptide. When expression of the vector-borne gene is induced, each clone the CISTEM cell population displays one tumor-associated polypeptide, fused to the imprinted domain on the cell surface.
The target compound in a CISTEM for the identification of tumor antigens preferably consists of polyclonal antibodies, usually isolated from the same patient whose cells are used as the source for the cDNA library. In other words, the tumor cells that supply cDNA and the antibodies that comprise the target compound are preferably autologous. Methods for the collection of cancer patient serum and the isolation of polyclonal antibodies, which are used as the CISTEM target compound, are described in the literature.
Following incubation of the CISTEM cell population with the target antibodies, the selection agent is added. Any clone m which the imprinted domain displays a tumor-associated polypeptide that binds autologous antibodies has a different status concerning the selection agent than a cell with no such binding. E.g. in case of binding, these cells may be protected from the selection agent. By contrast, any clones m which the imprinted domain displays a tumor polypeptide that is not recognized by autologous antibodies is eliminated. Thus, this CISTEM identifies tumor-associated polypeptides that interact specifically with autologous antibodies. It is shown m the literature that such polypeptides are likely to be lmmunogenic, eliciting a tumor-specifIC humoral and cellular immune response.
Certain embodiments of this invention are used to determine whether certain molecules interact specifically with an imprinted polypeptide that has not been altered by the insertion of variable oligopeptides. Any outer membrane polypeptide that is used as a receptor by a pathogenic agent, including polypeptides engineered by the manipulations described above, may be used m these embodiments of the invention. Because the imprinted polypeptide is not engineered to display highly variable oligopeptides, there is no „ιmprmted domain", and the selection agent's adhesion domain may be located anywhere on the imprinted polypeptide. As in the basic CISTEM cell populations described above, the imprinted polypeptide is expressed from an mducible promoter. The cell population, induced to express the imprinted polypeptide, is exposed to a target library, according to one of the methods described above for constructing target libraries and introducing them to CISTEMs. Either microtiter wells or the autocrme loop methods may be used to increase the concentration of target library molecules. The CISTEM' s selection agent is then introduced according to procedures described for the basic CISTEM. Cells are protected only if molecules m the target library bind to the imprinted polypeptide, and thus prevent adhesion by the selection agent. Thus, these embodiments of this invention provide an easy and straightforward method for assaying a set of molecules' affinity for the imprinted polypeptide.
Therefore, preferred areas, for which the present method is especially suitable is the discovery and isolation of novel antigens for use m vaccines, tumors, novel targets for therapeutic intervention, novel diagnostic markers, or molecules useful for monitoring treatment efficacy. Such new products, identified, isolated and prepared by the method according to the present invention is also subject matter of the present invention.
According to another aspect, the present invention relates to the cells/the cell population to be used m the present invention with a) a membrane polypeptide which has the polypeptide of interest m its extra- or intracellular domain and b) a selection site being specific for a selection agent, such that upon binding of a target compound to the polypeptide of interest the selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified m a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding.
The present invention also relates to a kit comprising the cells/cell population according to the present invention, the target molecule (and/or a library of target compounds) and one or more selection agents. These kits may be provided &J H- M o IVi O 3 T) μ- π 3 3 cn 3 μ Φ μ- Ω Ω 3 3 PJ rt TJ ε a ε 3 Hi n
. V 3 μ- > TJ CΛ H- 3 3 tr O a Φ X CΛ tr O tr a 3 O tr tr μ- μ- φ O TJ rr ft μ- X rt CΛ tr a o t? tr CΛ 3 0 Φ 3 φ C . X PJ μ- μ rt rt μ φ t? W 2 cn μ- 3 • Φ H 3 ε g Φ rt 3 μ- 3 TJ 3 μ- rt μ Ω Φ tr tr Ω v TJ TI tr 3 X I—1 μ M Ω rt CΛ 1 cn μ TJ 3 μ- J 3 < tr 3 tr Ω 0 3 H-
* f μ- O μ- CO PJ £ rt tr μ-> μ- TI μ- μ-1 H rt Ω μ O J μ- Φ 3 rt > tr a φ Hi
2 M X ε N tr Ω Φ Φ ft o μ- Ω φ 3* Φ J μ- < 3 3 Ω Ω Φ Ω μ- cn a μ-
0 TJ ω . PJ 3 cn rt μ tr T 3 X rt CΛ Φ a μ-1 3 φ rt μ TJ Φ o a Ω X ~ μ- Ω
1- tr1 rt rt Ω tr μ- I μ- rt Φ μ • μ- X -' CΛ ^< μ 3 3 o tr Ω 3
H- w IV) r μ- ft o < μ- O 3 o a μ-> 3 3 μ- Ω t-h rt ^ Φ rt TJ rt μ rt PJ H- o Φ o Φ ε φ X TJ μ tr rt o X 3 3 O o cn "< Ω CΛ μ- μ O a ^< μ h-1 μ-1 o μ» 3 μ cn • tr X μ- Φ 3 cn a < 3 μ rt Φ o Φ Ω μ- μ- Φ ^ d .. cn 3 H- π VO PJ Φ 3 μ- 3 tr μ- φ TJ 3 μ O rt 3 3 3 CΛ PJ 3 < 3 TJ rt tr o o O rt X rt a 3 μ- O rt rt 3 o 3 3 Hi Ω TJ rt φ X 3 rt 3 ttl
(0 jj O 3 Hi TJ tr CΛ cn Φ μ- 3 ε tr rt 3 Ω -> O \-> CΛ 3 μ μ- μ O ti 2J ε μ- tr Φ ι-3 tr ^. 3 3 cn cn rt Φ μ- 3 3 rt rt 3 μ- : Φ rt μ- 3 TJ μ n rt o 3 TI O o X TI tr O a 3 tr PJ φ 3 rt rt o PJ X O
3 n O μπ X TI £ ε TI μ- O rt Φ 3 3 μ φ 3 3 < 3 3** tr tr CΛ Φ
CD . rt μ CΛ φ • CΛ μ- φ μ- X Hi "< μ Ω < 2 3 a rt φ Ω Φ o 3 μ-1 μ- Φ PJ ε μ- ι-3 Ω μ X Ω 3 • TJ Φ Ω μ- μ- § μ^ μ- CΛ 3 rt a 3 Φ 3 CΛ n cr n o 3 M O 3 o • 3 rt Cn 3 Φ rt O CΛ PJ to CΛ Ω ^ rt μ- CΛ o Hi "• l-i o X £ co 3 3 CO φ φ O 3 a O o μ- < rt Hi rt TJ Φ
3 tr1 TJ . μ- • TJ Φ μ Λ . TJ t-h a . — . PJ —> 3 CΛ O Φ Φ o tr PJ Ω cn a
3 H o O n > CΛ CΛ PJ tr cr 1 PJ 3 J o rt PJ TJ 3 3 TJ μ Φ rt rt 3 Φ
(D CΛ t-h Φ TI o TI tr >• Ω o μ 3 μ rt μ PJ rt Ω *• Ω CΛ μ tr μ- Ω ttl o CΛ μ- 3 μ- TJ o rt ε J < *< rt PJ Φ φ tr O TJ o O tr μ-
TJ £ μ- rt Φ X TJ xj μ^ ε TI μ- cn 3 μ- tr tr o -— . a rt 3 3 O μ 3 X 3 3 ti trj σ tr Ω • > • PJ cn μ- o Φ TJ 3 Φ PJ rt PJ μ- PJ a TJ Hi o CΛ Φ CΛ PJ 3
0 μ- Φ rt μ-> 1 μ-1 CΛ X 3 rt O X μ CΛ tr 3 Ω 3 CΛ a O a TJ 3 cn tr ft y-1 μ- Cn o tV) 3 CΛ • cn tr TJ h-1 Φ Φ μ- O X μ- 3 μ- 3 Φ cn a -"
ID n μ- a O M μ- Φ φ o >< Hi Φ CΛ μ 3 3 tr 3 3 a Ω Ω • μ- rt φ μ- o rt •< 3 PJ CΛ cn a tr μ-1 TJ μ- X O TJ it¬ 3 X a φ μ- rt CΛ μ
3 t-* μ- 3 3 ι-3 tr Φ O Φ Ω • φ X TJ ε cn 0 s' t-h 3 3 ">, φ Φ TJ
Φ J tr a TI o T o tr rt O TJ TJ 3 ,—> μ- TJ O 3 Φ ttl tr rt X PJ 3 μ
> TJ CΛ 3 £ ε TI rt 0 ε 3 Φ rt μ PJ rt φ μ 3 Φ TJ *< μ- rt CΛ rt o μ- μ-> cn g < μ- ε TJ TJ μ- Φ μ- tr Ω a TJ μ tr Hi PJ tr φ μ- J.
O Ω TJ σi IV) O cn Φ o rt a CΛ 3 rt 3 tr PJ rt ; Φ CΛ 3 φ
Ω t-h cn PJ TI PJ μ-> 3 3 3 μ- φ φ rt μ 3 Ω PJ CΛ μ tr μ- TJ X Ω
H X cn μ- 3 00 J Φ a cn ε a tr o Ω 3 μ 3 3 Φ 3 tr TJ i-i ft μ- O φ tr X ... J rt 3 Φ -»• μ- Φ cn 3 3 tr PJ X J μ φ Ω cn
^ 3 Hi o • μ- 3 π tr rt cn rt tr Ω Φ PJ CΛ Ω n μ Φ CΛ 3 ω rt ε μ-> 3 TI a Φ CΛ -*• TI tr * i 1 o μ- Φ Ω rt Φ 3 3 cn 3 3 3
\ μ rt co TJ μ- CΛ μ- O ti TJ Ω 3 Φ PJ 3 CΛ 3 3 Φ μ^ Ω 3 g J tr • rt μ X 3 β μ- O TI X 3 rt o ! CΛ 3 3 rt H
5 a Ω 3 rt Φ a
Ω Φ tr CΛ μ- • 3 TI 3 ttl μ- • rt tr ε 3 μ- rt Ω μ- TI Φ rt μ- μ
CT" φ TI Φ tr 3 μ- ' TJ £ TJ xt tv> - Φ 3 a Ω μ- Φ Ω £ μ- 3 3 cn
•H π μ- o rt o μ-1 μ PJ • 3 μ- Ω CΛ 3 CΛ rt μ- X ε rt
-j μ-1 M X TJ ε μ- μ- cn μ- 3 tr ttl Ω rt a 3 "» μ-1 3 0 μ- 3 μ- 3
3 CΛ • μ cn 3 PJ Hi Φ 3 o 3 o 0 O J a μ-" Φ μ-1 Ω < ttl rt 3 o ι-3 Φ X 3 μ- μ-> rt cn a ε -1 3 — ' cn Φ cn >< rt PJ 3 Φ μ tr μ-> a
H J TI J a Ω φ φ CΛ tr μ-1 TJ -» Φ rt TJ rt tr rt μ-1 3 O 3 w μ £ -J μ- φ J Ω a t-3 O O < O O X φ μ 3 PJ o μ- rt 3 μ- μ 3 3 n X μ-> rt rt TI ε Φ ε 3 Φ 3 μ Φ tr Ω a 3 TJ μ- 3 a
M n ω rt μ- H 3 μ-1 μ- μ- a £ cn cn μ μ- 3 rt Φ 3 TJ μ- Ω X μ O μ- μ-
S M Φ O 3 CΛ 3 o o o • tr "> 3 a Ω 3 μ- 3 μ-1 Φ 3 φ 3 3 3 N
CΛ 3 3 TJ CΛ 3 3 3 3 O X cn • Ω 3 i-i μ- TJ Ω PJ TJ Hi 3 Φ
Φ μ-> μ TI —~ tr 3 TI 3 ε • — - Φ φ 3 N rt Ω 3 μ- φ O a I Ω VO £ φ o μ- 3 μ- μ- ~ a rt φ 3 o 3 μ cn Ω
£ rt ε 3 cn 3 X O μ tr μ O PJ rt O Φ μ- PJ μ- cn ttl tr O
1 a <! rt μ- X 3 3 3 φ μ- Φ μ- O μ- X a 3 X rt * 3 3 o 3 Ω tr X 3 cn 3
Φ
E. coli Outer membrane protein A is a 325-resιdue polypeptide comprising an ammo-termmal, eight-stranded β- barrel with four extra-cellular domains, three periplasmic loops, and a roughly 150-residue periplasmic carboxyl-termmal moiety (figure 3) (Freudl, 1989, Morona et al . , 1985). The fa- barrel conformation is extremely stable, and can be denatured only by boiling in strong detergents. Naturally present in
5 approximately 2-3 x 10 copies per cell, OmpA is among the most abundant proteins m the E. coli outer membrane. It is also widely conserved m bacterial evolution.
As was described m the specification, exogenous oligonucleotides, inserted into the ompA gene at sites corresponding to extracellular domains of the polypeptide, are normally translated as part of the polypeptide, and do not disturb the polypeptide ' s insertion in the outer membrane.
Furthermore, the oligopeptides encoded by these exogenous genetic insertions are displayed on the surface of the cell, adopt stable folded structures, and can exhibit enzymatic, agonistic, or antagonistic activities.
b) Construction of a cell population for testing and optimizing CISTEM selection
To construct a cell population for use m testing and optimizing CISTEM selection, an E. coli population lacking wild-type OmpA was engineered to express a plasmid-encoded copy of OmpA, m which extracellular loop IV of the polypeptide displayed the T7 tag oligopeptide (Met-Ala-Ser- Met-Thr-Gly-Gly-Gln-Gln-Met-Gly) , which binds to the commercially available T7 antibody (Novagene) . In pEV218 (Freudl, 1989) , the ompA gene, which is under the IPTG- mducible control of the lac promoter, has been engineered to bear a polylmker rich in restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide (figure 9) . This plasmid was digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas), yielding a fragment of the restriction site polylmker and the linearized plasmid with sticky ends left by the restriction enzymes. The linearized plasmid with sticky ends was isolated by standard gel electrophoresis. Two single stranded oligonucleotides were synthesized by conventional technology (custom oligonucleotides are available from commercial laboratories, and methods for synthesizing single-stranded oligonucleotides are commonly use and described m the literature) to read as follows :
Oligonucleotide 1: 5' - CG[ATGGCTAGCATGACTGGTGGACAGCAAATGGGT] - 3'
Oligonucleotide 2: 5' - CTAG[ACCCATTTGCTGTCCACCAGTCATGCTAGCCAT]CGAGCT - 3'
The sequence m brackets composes the T7 tag gene sequence. These two single-stranded oligonucleotides were annealed m Klenow reaction buffer (MBI catalogue) , forming the double stranded T7-tag gene with sticky ends complementary to the sticky ends on pEV218, linearized by Sad and Xbal as described m the previous paragraph. When the double-stranded oligonucleotide was ligated (MBI Fermentas ligation buffer) into the linearized plasmid, the T7 tag gene was inserted, in frame, into the ompA gene on pEV218. The new construct was designated pEV218T7.
The plasmid pEV218T7 was used to transform a population of the strain UH203 (lac, supF, ompA, recA, proAB, rpsL) (Freudl, 1989) , according to common methods of transformation described m detail above.
c) Preliminary optimization of CISTEM selection by coliphage K3
Coliphage K3 invades E. col i by binding to loops I, II, and III of OmpA. Loop IV is not used by the phage, and, indeed, exogenous oligopeptides at loopIV do not affect K3 susceptibility of the cell (the K3 adhesion domains are shown m figure 3; bacteriophages are reviewed briefly m the specification, and bacteriophage K3, as well as its receptor requirements, are described m Freudl, 1989 and Morona et al . , 1985) . The following procedure was performed to fulfill two objectives: a) to conclusively demonstrate that cells whose imprinted polypeptides bind the target compound are protected from K3 attack, whereas cells whose imprinted polypeptides fail to bind the target are eliminated by the phage, and b) to achieve preliminary optimization of the duration of incubation of the CISTEM cell population with phage K3. The principle of this experiment is illustrated m figure 14.
E . coli MC4100 (pEGFP-Nl), a kanamycm-resistant, ompA wild-type strain, which will here be referred to as U (for „Untagged"), was cultivated overnight m Luna broth (LB) supplemented with 25μg kanamycm/ml . E. coli UH203 (pEV218T7), which will here be referred to as T (for „Tagged"), is an ampicill -resistant, ompA strain that has been transformed, as described m the specification, by a plasmid encoding OmpA bearing the T7 tag sequence at the fourth extracellular loop of the polypeptide. This strain was cultivated overnight m LB supplemented with 100 μg/ml ampicillm/ml and 5 mM IPTG for induction of pEV218T7 expression. Each overnight culture was diluted with LB to OD600 = 0.001. To create each of two mixed populations, 50μl aliquots of the U culture were combined with 50μl aliquots of the T culture. One mixed population was incubated for 20 mm. at 37C with 20μl PBS alone, while the other incubated under the same conditions with 20μl PBS plus 0.1 μg antι-T7 tag mAb (Novagene mAbT7). (In table 4, the presence or absence of mAbT7 is indicated by a + or -.) Phage K3 diluted in 1 ml LB were then introduced to each mixed population at a Multiplicity of Infection (Mol) = 10. (Phage stock was prepared by the conventional method for preparation of T-even phages (Calendar, 1988).) Sample aliquots of 100 μl were taken from each population at the appointed times shown in table 4, namely, immediately preceding introduction of phage (shown m the table as T = 0, Phage -), immediately following introduction of phage (shown m the table as T = 0, Phage +), and at 2 mm., 10 mm., 30 mm., 60 mm., 90 mm., 120 mm., 180 mm., and 240 mm. after the introduction of phage. Aliquots were plated on LB plates supplemented with 100 μg ampicillin/ml and on LB plates supplemented with 25 μg kanamycm/ml. Colonies were counted after incubation of the plates overnight at 37°C. The results of this experiment demonstrated that interaction between mAbT7 and the T7 tag at loopIV of OmpA protected cells from phage K3. This conclusion is summarized most concisely in table 5. Cells exhibiting the T7 tag at loopIV of OmpA were protected from phage K3 infection, and this protection was dependent upon incubation of the cells with mAbT7. Cells that expressed wild-type OmpA, rather than the OmpA-T7 tag fusion were susceptible to phage K3, irrespective of the presence or absence of mAbT7. Thus, cells whose imprinted polypeptides bind the target compound are protected from K3 attack, whereas cells whose imprinted polypeptides fail to bind the target are eliminated by the phage .
This experiment also serves m the preliminary optimization of CISTEM selection by phage K3. As is shown m table 4 and figure 15, m the mixed population of U and T cells that were incubated with mAbT7, U cells were completely eliminated from the population after 30 mm. incubation with phage K3. T cells, by contrast, maintained a roughly stable population well beyond 30 mm. of incubation with the phage. This suggested that when starting with Mol - 10, the minimum time period for phage K3 to eliminate all cells that failed to bind the target would be 30 - 60 mm.
The initial precipitous drop m T cells followed by abrupt stabilization (see figure 15) suggested that the antibody may not yet have come to equilibrium m binding the T7 tag at the time the phage were introduced. Pre-mcubation with the antibody was 20 mm. m duration , and complete protection of T cells was apparently achieved at approximately 30 mm. after the introduction of phage, suggesting that the time required for the antibody to come to equilibrium binding the tag was approximately 50 mm.. This suggestion was confirmed by the following preliminary optimization of the duration of the cell population's incubation with the target.
Strains U and T were cultivated overnight under the conditions described above. Each of five mixed populations of U and T cells was created by taking a 50μl aliquot of the U culture (OD600 = 0.001) and combining it with a 50μl aliquot of the T culture (OD600 = 0.001) . As is shown m table 6 and figure 16, each mixed population was incubated at 37°C with 7nM mAbT7 diluted in PBS for a different time period: 1 mm.; 10 mm.; 20 mm; 30 mm; or 60 mm.. Phage K3 stock, diluted with LB to 1 ml and Mol = 10, was added to each mixed cell population immediately following the population' s incubation with the antibody. Each population was incubated with phage for 30 mm. Aliquots were then taken and their composition of U and T cells determined as described above. As is shown in table 6, cells exhibiting wildtype OmpA remained unprotected from phage irrespective of the duration of incubation with mAbT7. By contrast, cells exhibiting the T7 tag at loop IV of OmpA received increasing protection from the antibody as the duration of incubation was lengthened. As was expected from the results the preliminary optimization of phage K3 incubation (described above) the antibody confered complete protection upon T cells only after an incubation time between 30 and 60 mm.
d) Initial test of CISTEM selection
The preliminary, single-parameter optimizations of incubation time with mAbT7 and the duration of K3 selection allowed an initial test of the CISTEM' s capacity to select rare cells whose imprinted polypeptides bind the target from a background of cells whose imprinted polypeptides fail to bind the target. To demonstrate the selection of rare cells, the following procedure was performed.
E . coli strains U (Untagged) and T (Tagged) are described above. U was cultivated overnight m LB supplemented with 25μg kanamycm/ml. T was cultivated overnight m LB supplemented with lOOμg ampicillm/ml and ImM IPTG to induce expression of pEV218T7, the plasmid encoding OmpA with the T7 tag at loopIV of the polypeptide. Each overnight culture was diluted with LB to OD600 - 0.005. A 50μl aliquot of the U culture was combined with a 50μl aliquot of the T culture to create a mixed population. As is shown in table 7, although the mixed population was created with equal sized aliquots of U and T cultures, it was composed predominantly of U cells. The reason for this was probably that the very low level of OmpA expression induced in the T strain (only ImM IPTG was used) made this strain significantly less viable than the U strain under standard conditions of cultivation. The mixed population was incubated at 37C with 68nM mAbT7 (described above) diluted in PBS for 60 min.. Following incubation, 100 μl aliquots of the cell population were plated on LB plates supplemented with 25 μg kanamycin/ml or lOOμg ampicillin/ml . Colonies were counted after overnight growth, showing that the population consisted of approximately 1 T cell for every 120 U cells
(table 7) . Phage K3 stock diluted in LB was then added to the cell population at Mol = 2, bringing the total volume to 1 ml. The cells were incubated with phage for 120 min. at 37C. Following incubation with phage, 100 μl aliquots of the cell popμlation were plated on LB plates supplemented with 25 μg kanamycin/ml or lOOμg ampicillin/ml. Colonies were counted after overnight growth.
Control experiments were performed according to the same protocol, except that PBS was added without mAbT7 and/or LB was added without phage K3, as shown in table 7. The results in table 7 demonstrated that rare cells (1 in > 100) exhibiting an imprinted domain that bound the target were completely protected from phage K3 while all cells that failed to bind the target were quickly eliminated from the population .
A systematic test of the CISTEM' s sensitivity in selecting rare cells that bind the target from a vast excess of cells that fail to bind the target is performed as follows. Overnight cultures of E. coli strains U (Untagged) and T
(Tagged) are described above. 50 μl - 1 ml aliquots of the U and T overnight cultures are combined to create four populations, each representing a distinct ratio of U cells to
T cells: 108 U to 104 T; 108 U to 103 T; 108 U to 102 T; and 10 9 U to 102 T. Each population is incubated for 60 min. at
37C with lOnM mAbT7. To monitor the population composition
(i.e. the numbers of U and T cells), every 30 min. (i.e., at
30 min. and 60 min.) two aliquots of 5 μl are taken, one aliquot is plated on LB plates supplemented with lOOμg Ap/ml, and the other aliquot is plated on LB supplemented with 25μg
Km/ml. The plates are then incubated overnight and colonies counted to determine population composition. Immediately after removal of aliquots at 60 mm., bacteriophage K3 (described above) are introduced to the cell population at Mol = 2. The admixture is incubated at 37°C for 180 mm.. Again, to monitor the population composition, a sample aliquot of 5 μl is taken by micropipette every 20 mm., plated, grown, and U and T cells counted as described above. After incubation with phage, a sample of approximately 1000 cells is removed, plated, grown, and its composition determined as described above.
Demonstration of the CISTEM' s capacity to select rare cells that bind the target compound from a background of at least 100 times as many cells that fail to bind the target permits a final, two-parameter optimization of CISTEM performance. Each of five populations of T is induced for 15 mm. by a distinct amount of IPTG m the medium: 0 mM; 0.25 mM; 0.5 mM; 1.0 mM; or 2.0 mM. Five mixed populations of 10 U
3 and 10 T, each representing a different level of induction of
T cells, are created by combining aliquots, as described above. 10 nM mAbT7 is added to each population, and the populations are incubated for 60 mm. at 37°C. During the 60 mm. incubation, the population composition is monitored, as described above, by taking sample aliquots of 5 μl every 20 mm., plating, growing, and counting the numbers of U and T cells. After the removal of an aliquot at 60 mm., coliphage K3 is introduced at an Mol of 2. Over the next 180 mm., the populations are monitored by 5 μl aliquot sampling every 20 mm.. The [IPTG] and duration of incubation with K3 that result m the elimination of all U cells, with concomitant elimination of fewest T cells, are optimal parameter values for use in the CISTEM.
The optimization trials described here provide parameter- values ([IPTG], duration of incubation with K3, [mAbT7], duration of incubation with mAbT7) that allow the CISTEM to destroy all cells that fail to bind the target, while eliminating as few high-affmity cells as possible.
e) Construction of an OmpA-imprinting CISTEM cell population OmpA plasmid pEV218 (described above, shown m figure 9, construction described m Freudl, 1989) , bears a polylinker rich in restriction endonuclease sites at a genetic region corresponding to extracellular loopIV of the polypeptide. This plasmid is digested with Sad (enzyme and reaction buffer from New England Biolabs) and Xbal (enzyme and reaction buffer from MBI Fermentas) , under conditions described by the suppliers, yielding a fragment of the restriction site polylmker and the linearized plasmid. The linearized plasmid is isolated by standard gel electrophoresis. 10 distinct, single-stranded oligonucleotides are synthesized by conventional technology (synthesis of variable oligonucleotides is described above) to read as follows:
5 ' NNGGTCTAGA (VNN) 20CGAGCTCNN3 '
m which N represents all four nucleotides, each with equal probability; and V represents C, G, or A, each with equal probability. (£. coli strain UH203 is supF, capable of translating the UAG stop-codon. Therefore, variable oligonucleotides inserted into the OmpA vector pEV218 must be free of the stop codons UAA and UGA, but not of UAG.) Using a standard protocol described m the literature (Brown, 1992), a 5' primer and a proof reading polymerase (such as the Vent polymerase) are used to generate double-stranded oligonucleotides from the single-stranded templates. These double-stranded, highly variable oligonucleotides are then digested with Sad and Xbal, under conditions described by the suppliers (New England Biolabs and MBI Fermentas, respectively) , yielding highly variable oligonucleotides with non-complementary sticky ends. The highly-variable oligonucleotides are then ligated (MBI ligation buffer, conditions described by supplier) into pEV218, linearized as described m the previous paragraph. Because the Sad and Xbal sticky ends are non-complementary, they ligate m only one oligonucleotide per plasmid, and in only one orientation. Conventional electrophoretic size fractionation is used to isolate ligated plasmids containing an insert. The product of this procedure is a population of plasmids (collectively designated pEV218I) bearing, at the genetic site corresponding to loop IV of OmpA, a highly variable exogenous sequence free of UAA and UGA stop codons.
The variable plasmid population, pEV218I, is used to transform E. coli strain UH203 { lac, supF, ompA, recA, proAB, rpsL) (this strain, its propagation and its transformation, are described more completely above, and in Freudl, 1989) . The UH203 population is grown in Luria broth supplemented with
O r lOOμg Ap/ml, yielding a population of 10 cells comprising 10 clones, each one of which carries a distinct member of the pEV218I population.
f) CISTEM-selection for OmpA exhibiting high-affinity for mAbT7
The UH203 (pEV218I) population of 108 cells plus approximately 100 UH203 (pEV218T7) cells is incubated at 37C for 15 min. in 100ml LB medium supplemented with 100 μg Ap/ml and IPTG between 0.25 and 2.0 mM, as determined in the optimization trial described above. This yields a population of 10 8 clones, comprising 106 distinct clones, each of which displays a distinct, exogenous, oligopeptide sequence at loop
IV of OmpA. One of these clones displays the T7 tag sequence.
After the initial 15 min. induction of pEV218I/pEV218T7 expression, lOnM mAbT7 is added to the 100ml of cells. The cells are incubated with the mAb for 60 min. at 37°C.
Coliphage K3 is then introduced at Mol = 2 (the procedure for growth and handling of K3 is described above, and in Freudl,
1989, 126, and Morona et al . , 1985). Incubation with K3 proceeds for the minimum amount of time required, as determined in the optimization trial described above.
Following incubation with phage, the cell population is centrifuged to separate cells from phage and the products of lysis, and cells are resuspended in 100ml NB, without lOOmM
NaCl supplement, to guarantee complete cessation of infection by K3. This cell population represents the set of clones selected by the CISTEM for binding to the target compound mAbT7. To amplify the numbers of selected clones, the selected cell population is grown to 10 cells at 37°C in 100ml NB .
Expression of OmpA is then heavily induced by 2.5 mM IPTG for
4 15 mm.. Following induction, 10 cells from the selected population are plated onto NB agar containing 0 . 5% sodium deoxycholate . (Sodium deoxycholate prevents growth of cells that fail to insert OmpA m the outer membrane; such cells are
"false positives", selected by the CISTEM simply because they display no OmpA, and not because of their capacity to bind mAbT7 at loop IV of OmpA. The sodium deoxycholate supplement also prevents K3 infection, providing a further guarantee against continuing infection.) Plated cells are incubated at
37°C to grow colonies, which are composed of cells in which loop IV of OmpA contains an exogenous oligopeptide with high affinity for the target compound, mAbT7.
11. EXAMPLE 2 : AN E . COLI OMPA/COLIPHAGE K3/NEISSERIA MENINGITIDIS ANTIBODY CISTEM
a) E. coli Outer membrane protein A and Neissenal Opa proteins
Several recent findings show that one important OmpA function explaining such high levels of expression and evolutionary conservation of the polypeptide is adhesion t : :cco and invasion of host tissues. It has recently been demonstrated that E. coli OmpA contributes significantly t : ;<co invasion of brain microvascular endothelial cells a critical step m pathogenesis of neonatal E. coli meningitis. Similarly, OmpA homologs are directly involved microbial adherence, invasion of host cells, and pathogenesis by Haemophil us mfl uenzae, Neisseria menmgi tidis, Neisseria gonorrhoeae, and other virulent microbes.
The structural homology between OmpA and a wide range of antigenic adhesins makes OmpA well-suited for imprinting for use m applications requiring mimicry of homologous adhesins. One such application is vaccination. When a polypeptide (in this case, OmpA) is imprinted to bind to antibodies composing the sera raised against an antigen, the imprinted polypeptide may elicit an immune response very similar to the one provoked by the antigen (Folgori et al . , 1994). Antibodies suitable for use as the target compound in a CISTEM for imprinting E. coli OmpA to mimic Nei sseria meningi tidi s Opa proteins (Olyhoek et al., 1991) include mAb B33, which reacts with all Opa proteins, mAbs P322 and P514, which react with a broad subset of Opa proteins, and mAb B306, mAb A222/5b (Olyhoek et al . , 1991), and others described in the literature.
b) CISTEM-selection for OmpA exhibiting high-affinity for antibodies against
Neisseria meningitidis Opa proteins
The construction of a cell population for testing and optimizing CISTEM selection, the optimization and demonstration of CISTEM function, and the construction of an OmpA-imprinting CISTEM cell population are performed exactly as described above: this CISTEM, like the one described in Example 1, uses phage K3 to select from a cell population displaying variable inserts 20 amino acids in length at loop IV of OmpA. However, whereas Example 1 imprints OmpA to bind to mAbT7, this CISTEM imprints OmpA to bind to antibodies against Neisseria meningi tidi s Opa proteins. Therefore, where the protocol described in Example 1, part f, calls for 10 nM mAbT7, this CISTEM calls for 10 nM of each of the following antibodies: B33; P322; P514; B306; and A222/5b. Aside from this substitution, the protocol for imprinting OmpA to bind anti-Opa antibodies is as described in Example 1, part f.
iii. EXAMPLE 3: MONITORING THE DYNAMICS OF CISTEM SELECTION
To monitor the process by which cells that fail to bind the target compound are eliminated, while those cells that bind the target compound specifically are selected, the following experiment was performed. E. coli strains U (Untagged) and T (Tagged) are described above (in Example 1). U was cultivated overnight in LB supplemented with 25μg kanamycin/ml. T was cultivated overnight in LB supplemented with lOOμg amplicillin/ml and 2mM IPTG to induce expression of pEV218T7, the plasmid encoding OmpA with the T7 tag at extracellular loopIV of the mature polypeptide. Each overnight culture was diluted with LB to OD600 = 0.005. To create each of three mixed populations, aliquots of the two cultures were combined to compose a population in which the ratio of T cells to U cells was approximately 1 to 200. Sample aliquots were taken from the populations and plated on LB plates supplemented with lOOμg amplicillin/ml, and on LB plates supplemented with 25μg kanamycin/ml. After overnight incubation, colonies were counted, showing that the ratio of Tagged to Untagged cells was approximately IT to 190U (table 8, t= -60) . Two of the three mixed populations were incubated for 60 min. at 37C without mAb, whereas the third population was incubated under the same conditions with 68nM mAbT7 (described above) . (In table 8, the presence or absence of mAbT7 is indicated by a + or a -, respectively.) After one hour of incubation (i.e., at time t=0 in table Al), Phage K3 was introduced to two of the three mixed populations at a Multiplicity of Infection of 6. (In table 8, the presence of absence of phage K3 is indicated by a + or -, respectively.) Over the next 180 min., sample aliquots were taken from the populations at the times indicated in table 8, namely, immediately preceding the addition of phage, and then every 15 minutes thereafter. Aliquots were used to estimate phage titer, and were plated on the appropriate antibiotics to determine numbers of U and T cells.
Figure 17 shows the dynamics of phage titer and Mol over the course of CISTEM selection. Mol climbs exponentially and reaches a maximum near 100,000. Figure 18 shows the dynamics of cell growth and death during CISTEM selection. The rapid exponential growth of phage results in a monotonic decline of U cells, irrespective of the presence of absence of mAbT7. By contrast, in the presence of mAbT7, T cells are protected from the extremely high Mol, and consequently they expand in number. Thus, specific interaction between the antibody and the epitope displayed at OmpA loopIV resulted in selection of cells exhibiting the epitope, while cells that did not exhibit the epitope were eliminated. This result is summarized in figure 19. iv: EXAMPLE 4: REDUCTION OF BACKGROUND THROUGH THE USE OF HELPER CELLS
The term "background" is used to refer to cells that fail to bind the target compound but nonetheless remain in the selected population. In general, it is desireable to minimize the number of background cells, such that CISTEM reliably eliminates cells whose imprinted domains fail to bind the target compound, while selecting those cells whose imprinted domains do bind the target compound. The following experiment illustrates one strategy for reducing background. The CISTEM cell population is supplemented with "helper cells", the function of which is propagation and amplification of the selective agent. This amplification of the selective agent increases the intensity of selection, and thereby reduces residual background cells. The helper cells lack a selectable marker that is present in the rest of the CISTEM cell population; this permits efficient elimination of the helper cells after they have served their purpose of amplifying the selective agent.
UH203 (pEV218T7) , the "T" strain, was cultivated overnight in LB supplemented with lOOμg ampicillin/ml and lmM IPTG to induce expression of the ompA plasmid. As a background strain that does not exhibit the T7 epitope at loopIV of OmpA, UHFl(pEV218) was used. UHF1 is virtually isogenic with UH203, except that it carries a gene for Chloramphenicol resistance. The plasmid pEV218 is identical to pEV218T7, except that it carries a restriction-enzyme site polylmker in place of the gene for the T7 tag. UHF1 (pEV218 ) , the "U" strain, was cultivated overnight in LB supplemented with lOOμg ampicillin/ml and lmM IPTG to induce expression of pEV218. To create mixed populations, aliquots of the "T" and "U" strains were combined to compose populations with a ratio of T to U cells of approximately 1 to 100 (table 9) . As helper cells, MC4100 (pEGFPNl) were used. lOOμl of MC4100 (pEGFPNl) at OD600 = 0.5 were added to two mixed populations. Two other mixed populations received no helper cells. One population with helper cells, and one without helper cells, were incubated for 1 hour at 37C with 680nM mAbT7, while the other two populations were incubated under the same conditions without antibodies. After incubation (i.e., at time t=0 on table 9), phage K3 were added to all populations at an Mol = 2. All populations were then incubated for two hours. Aliquots were taken and plated on agar supplemented with ampicillin alone, or with ampicillin plus chloramphenicol . The numbers of U and T cells in the population at the end of CISTEM selection are shown in table Bl. The background proportion, defined as U cells/ (U cells + T cells), is reduced from 34% in the absence of helper cells to 16% with the addition of lOOμl of helper cells at OD6oo. This experiment thus illustrates the principle of helper cells: they are added to CISTEM cell population to modify the activity of the selective agent.
v. EXAMPLE 5: THE SENSITIVITY OF CISTEM
The following experiment was performed to demonstrate the sensitivity of CISTEM selection. The E. coli strains UH203 (pEV218T7) , here called "T" cells, and UHF1 (pEV218) , here called "U" cells, were cultured overnight with lmM IPTG to induce expression of the ompA plasmids. To create mixed populations, aliquots of the T culture were diluted 10, 100, 1000, or 10,000 times, and then combined with undiluted aliquots of the U culture to compose populations with T to U ratios as shown in table 10. Helper cells were added as described in the previous example. One set of five mixed populations, representing each of the serial dilutions of T cells (see table 10), was incubated for 1 hour at 37C with 340 nM mAbT7. Two replicate sets of five mixed populations (a total of 10 populations, comprising two of each of the serial dilutions of T cells) were incubated under the same conditions without the antibodies. Immediately following incubation, phage K3 was added to the set of five mixed populations that had been incubated with the antibodies, as well as to one of the two sets of five mixed populations that were not incubated with the antibodies. In table 10, the presence of antibodies or phage is indicated by a +, whereas the absence of antibodies or phage is indicated by a -. After two hours of incubation, aliquots from each population were plated on agar containing the appropriate antibiotics, as described above. After overnight culture, colonies were counted to determine the composition of the CISTEM cell populations.
Table 10, shows not only the final numbers of cells, but also several indices of CISTEM' s performance in selecting T cells from a background of U cells. Enrichment, an index of the selection system's capacity for amplifying clones that bind the target compound while eliminating those that do not, is defined as ( T Cel l s /U Cel ls ) after selerr mn/ ( T Cel ls /U Cells ) befDre selection -
Yield, an index of the proportion of all cells that bind the target compound that are successfully selected by the CISTEM, is estimated by comparing the number of T cells m the K3- populations with the number of T cells m the K3+, mAb+ populations. Yield is defined as (T cells) ι +, Ab+/ (T cells) 3-.
vi.: EXAMPLE 6: MAPPING THE EPITOPE RECOGNIZED BY mAbT7
One application of CISTEM is in mapping the epitopes involved in protem-protem interactions. To illustrate this application, CISTEM was used to select cells displaying the phage T7 polymerase epitope that is recognized by the T7-polymerase specific mAbT7 (described above) . The plasmid pET17xb (Novagene) contains the entire gene for phage T7 polymerase. This plasmid was digested with micrococcal nuclease, according to protocols provided by the supplier, to yield fragments of a range of sizes from approximately 50 basepairs to 300bp. The band corresponding to fragments between 60 and 75 bp in length was excized with a razorblade and the fragments were resuspended. The fragments were treated with SI nuclease (conditions according to supplier) to remove the 3' phosphates left by micrococcal nuclease. The fragments were then phosphorylated by T4 polynucleotidkmase . The ompA plasmid pEV218 (described above) was cleaved by the restriction enzymes Ecll36II and Sail (conditions of digestion according to supplier) , at unique sites in the genetic region corresponding to the fourth extracellular loop of the mature polypeptide. The linearized vector was dephosphorylated by calf intestinal phosphatase. Combining lOOng linearized vector with 200ng fragments, the fragments were ligated into the linearized vector by standard blunt-end ligation using T4 ligase. PEG was added to the ligation buffer to favor circularization. Circularization of the vector without insertion of a fragments creates a Sail site. Therefore, the products of ligation were treated with Sail to linearize vectors that did not contain a fragment. The circularized vectors containing fragments were isolated by electrophoresis, and used to transform the highly competent, commercially available strain XL-1 blue. Cells were plated on LB supplemented with lOOμg/ml ampicillin to select for transformants. Colonies were grown in 250ml LB supplemented with ampicillin, as well as glucose for catabolic repression of low- level expression of the ompA plasmid. Plasmid DNA was extracted using the Qiagene kit. 7μg of plasmid DNA was cleaved by Sail and used for transformation of UH203 (an ompA deficient strain described above) by electroporation and CaMg transformation. Cells were grown on LB supplemented with lOOμg ampicillin to select for transformants. This population of transformants will be refered to as UH203 (pEV218T7L) .
The UH203 (pEV218T7L) population was grown in overnight culture with lmM IPTG to induce expression of the ompA plasmid. The cells were incubated for one hour with 680nM mAbT7, allowing for binding between the monocolonal antibody and the epitope presented at OmpA loopIV by certain members of the cell population. Following incubation, phage K3 were added at an Mol = 2, and the cells were incubated with phage for two hours. The cells were then plated, and the results of selection (shown in fig.21) are summarized in table 11. 25 colonies were sampled, their plasmids were isolated as above, and the plasmids were subjected to cleavage by the restriction endonuclease Nhel. The genetic sequence encoding the epitope recognized by mAbT7 (i.e., the T7 tag) contains a Nhel site; no other Nhel site appears in pEV218. Therefore, only plasmids comprising the sequence encoding the epitope of interest are vulnerable to Nhel. 24 of the 25 clones sampled were vulnerable to Nhel. TABLES
Table 1: Each polypeptide (column A) is listed with the selection agent (column B) that could be used in a CISTEM to imprint the polypeptide; the respective domains that compose the agent's adhesion domains (column C; numbering begins at the amino terminal) ; a domain that is interacting (e.g. for sterical reasons) with the adhesion domain and could be imprinted to bind a target compound (column D)
Figure imgf000103_0001
Table 2
Figure imgf000104_0001
Table 3: Serial dilutions of selection agent and target concentrations in preliminary optimization of CISTEM selection. The cell populations listed in the table are described in the text.
Mol or [Selection Agent] / [Cells]
Figure imgf000104_0003
Figure imgf000104_0002
Table 4
Figure imgf000105_0001
Presence or Absence of
Table 5 mAbT7 in Medium
mAbT7 + mAbT7 -
K3 Susceptible
Presence or Absence Tag + K3 Resistant of T7 Tag at OmpA Loop IV Tag - K3 Susceptible K3 Susceptible
Table 6
Figure imgf000106_0001
Table 7
Figure imgf000106_0002
Table 8
Figure imgf000107_0001
Figure imgf000108_0001
Figure imgf000108_0002
Figure imgf000109_0001
Table 10
Figure imgf000110_0001
Table 11
Figure imgf000111_0001
BIBLIOGRAPHY
Arp, J., et al. (1996) A source of glycosylated human T-cell lymphotropic virus type 1 envelope protein: expression of gp46 by the vaccinia vιrus/T7 polymerase system. Journal of Virology 70: 7349-7359.
Balmt, R.F. and J.W. Larrick (1993) Antibody engineering by parsimonious mutagenesis. Gene 137: 109-118
Block, C. et al . (1996) Quantitative structure-activity analysis correlating Ras/Raf interaction m vitro to Raf activation m vivo. Nature Structural Biology 3 (3): 244-251.
Boder, E.T. and K.D. Wittrup (1997) Yeast surface display for screening combinatorial polypeptide libraries. Nature Biotechnology 15: 553-557.
Boulanger and Letellier (1988) Characterization of ion channels involved m the penetration of phage T4 into E. coli cells. Journal of Biological Chemistry 263 (20): 9767-9775.
Boyd, D. (1994) Use of gene fusion to determine membrane protein topology, pp. 144-163 m White, S.H., ed. Membrane Protein Structure: Experimental Approaches. Oxford, New York.
Brown, S. (1992) Engineered iron oxide-adhesion mutants of the Escheri chia coli phage 1 receptor. Proceedings of the National Academy of Science, USA 89: 8651-8655.
Calendar, R. (1988) ed. The Bacteriophages vols. 1 and 2. Plenum, New York.
Charbonnier, J.B. et al . (1995) Crystal structure of the complex of a catalytic antibody Fab fragment with a transition state analog: structural similarities in esterase-like catalytic antibodies. Proceedings of the National Academy of Sciences, USA 92(25): 11721-5. - Ill - Chavez, R.A., and Z.A. Hall (1991) The transmembrane topology of the amino terminus of the alpha subunit of the nicotine acetylcholine receptor. Journal of Biological Chemistry 266: 15532-15538.
Chu, T.H., and R. Dornburg (1995) Retroviral vector particles displaying the antigen binding site of antibody enable cell- type specific gene transfer. Journal of Virology 69: 2659- 2663.
Couch, R.B. (1996) Rhinoviruses : propagation in cell culture. In Fields, B.N. et al.: 714-717
Cross, G.A. (1990) Glycolipid anchoring of plasma membrane proteins. Annual Review of Cell Biology 6: 1-39.
Fantini, J. et al . (1997) Synthetic soluble analogs of galactosylceramide (GalCer) bind to the V3 domain of HIV-1 gpl20 and inhibit HIV-1-induced fusion and entry. Journal of Biological Chemistry 272: 7245-7252.
Folgori, A., R. Tafi, A. Meola, F. Felici, G. Galfre, R. Cortese, P. Monaci, and A. Nicosia (1994) A general strategy to identify mimotopes of pathological antigens using only random peptide libraries and human sera. EMBO 13: 2236-2243.
Fourel, D., C. Hikita, J.M. Bolla, S. Mizushima, and J.M. Pages (1990) Characterization of OmpF domains involved in Escheri chi a col i K-12 sensitivity to colicins A and N. Journal of Bacteriology 172: 3675-3680.
Francisco, J.A., et al. (1993) Specific adhesion and hydrolysis of cellulose by intact Escheri chia coli expressing surface anchored cellulase or cellulose binding domains. Biotechnology 11: 491-495.
Freudl, R. (1989) Insertion of peptides into cell-surface- exposed areas of the Escheri chia coli OmpA protein does not interfere with export and membrane assembly. Gene 82: 229-236. Georgiou, G. et al . (1997) Display of heterologous proteins on the surface of microorganisms: from the screening of combinatorial libraries to libe recombinant vaccines. Nature Biotechnology 15: 29-34.
Hibi, M. et al. (1990) Molecular cloning and expression of an IL-6 signal transducer, gpl30. Cell 63: 1149-1157.
Janda, K.D. (1994) Tagged versus untagged libraries: methods for the generation and screening of combinatorial chemical libraries. Proceedings of the National Academy of Science, USA: 10779-10785.
Lis, H., and N. Sharon (1993) Protein glycosylation: Structural and functional aspects. European Journal of Biochemistry 218: 1-27
Marks, J.D., H.R. Hoogenbom, T.P. Bonnert, J. McCafferty, A.D.
Griffiths, and G. Winter (1991) By-passing immunization: human antibodies from V-gene libraries displayed on phage . Journal of Molecular Biology 222: 581-597.
Mollenkopf, H.-J. et al. (1996) Conversion of bacterial gene products to secretion-competent fusion proteins. Biotechnique 21: 854-860.
Moran, P., and I.W. Caras (1991) Fusion of sequence elements from non-anchored proteins to generate a fully functional signal for glycophosphatidylinositol membrane anchor attachment. Journal of Cell Biology 115: 1595-1600.
Morona, R., C. Kramer, and U. Henning (1985) Bacteriophage receptor area of outer membrane protein ompA of Escheri chi a coli K-12. Journal of Bacteriology 164: 539-543.
Olyhoek, A.J. et al . (1991) Cloning and expression in E . coli of ope, the gene for an unusual Class 5 outer membrane protein from Neisseria meningitides . Microbial Pathogenesis 11: 249- 257.
Onu, M., G. Raab, K. Lau, J.A. Abraham, and M. Klagsbrun (1994) Purification and Characterization of Transmembrane Forms of Heparin-binding EGF-like Growth Factor. Journal of Biological Chemistry 269(49): 31315-21.
Pattus, F., D. Massotte, H.U. Wilmsen, J. Lakey, D. Tsernoglou, A. Tucker, and M.W. Parker (1990) Colicins: prokaryotic killer-pores. Experientia 46: 180 192.
Phizicky, E.M. and S. Fields (1995) Protein-protein interactions: methods for detection and analysis. Microbiological Reviews 59 (1): 94-123.
Singer, S.J. (1990) The structure and insertion of integral proteins in membranes. Annual Review of Cell Biology 6: 247- 296.
Staunton, D.E., M.L. Dustin, H.P. Erickson, and T.A. Springer (1990) The arrangement of the immunoglobulin-like domains of ICAM-1 and the binding sites for LFA-1 and rhinovirus. Cell : 61: 243-354.
Tiireci, 0., U. Sahin, et al . (1997) Serological analysis of human tumor antigens: molecular definition and implications. Mol.Med. Today 3: 342-349.
Tyler, K.L. and B.N. Fields (1996) Pathogenesis of viral infections. In Fields, B.N. et al . : 173-218.
Yasukawa, K. et al . (1990) Purification and characterization of soluble human IL-6 receptor expressed in CHO cells. Journal of Biochemistry 108: 673-676.
Zeilestra-Ryalls, J. et al. (1993) Sequence analysis and phenotypic characterization of groEL mutations that block lambda and T4 Bacteriophage growth. Journal of Bacteriology 175 (4) : 1134-43.

Claims

1.: A method for identifying and isolating at least one of a polypeptide of interest, a target compound or a gene of said polypeptide of interest or a gene of said target compound, wherein at least one of said polypeptide of interest and said target compound is unknown comprising the steps a) constructing a cell population by engineering it to express a membrane polypeptide which has said polypeptide of interest in its extra- or intracellular domain, said extra- or intracellular domain being exhibited in one or more copies on each cell's surface directly or indirectly interacting with a selection site specific for a selection agent, such that upon binding of said target compound to said polypeptide of interest said selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified in a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding, b) exposing said cell population to said target compound or a population of said target compounds for a period of time sufficient for said target compound to bind to said polypeptide of interest, c) exposing said cell population simultaneously with or after step b) to said selection agent for a period of time sufficient for said agent to interact with said selection site, d) selecting for clones which show said binding of said target compound to said polypeptide of interest , e) optionally allowing said selected clones to multiply and f) isolating said polypeptide of interest and/or said target compound and/or said gene of said polypeptide of interest and/or said gene of said target compound.
2.: The method according to claim 1, wherein said interaction between said extra- or intracellular domain with said polypeptide of interest and said selection site is a sterical interaction, especially a sterical interaction which does not allow said selection agent to interact with said selection site after binding of said target compound to said polypeptide of interest .
3.: The method according to claim 1 or 2, wherein said selection site is immediately adjacent to the said extra- or intracellular domain with said polypeptide of interest.
4.: The method according to any one of claims 1 to 3, wherein said selection agent is a toxic agent.
5.: The method according to any one of claims 1 to 4, wherein said selection site is an adhesion domain for said selection agent.
6.: The method according to any one of claims 1 to 5, wherein said membrane polypeptide is selected from glycolipid-anchored polypeptides, ╬▒-helical polytopic receptors, bitopic integral membrane polypeptides, polytopic integral membrane polypeptides or membrane-bound isoforms of a secreted polypeptide.
7.: The method according to any one of claims 1 to 6, wherein said polypeptide of interest is identified and isolated.
8.: The method according to any one of claims 1 to 7, wherein said extra- or intracellular domain of said membrane polypeptide contains a highly variable region which is, upon binding to said target compound, identified and isolated as said polypeptide of interest or a section thereof.
9.: The method according to claim 8, wherein said highly variable domain comprises polypeptides encoded by a cDNA library or sections thereof.
10.: The method according to any one of claims 1 to 9, wherein said membrane polypeptide is heterologous for the cells.
11.: The method according to any one of claims 1 to 10, wherein said membrane polypeptide is genetically engineered to contain said extra- or intracellular domain and said selection site residing adjacent to each other on the cellular surface.
12.: The method according to any one of claims 1 to 11, wherein said target compound is composed of a mixture of compounds.
13.: The method according to any one of claims 1 to 12, characterised in that it is carried out in the form of a high- throughput screening assay wherein said cell population is exposed to a multiplicity of target compounds from which an unknown binding partner for a polypeptide of interest is to be identified and isolated.
14.: The method according to any one of claims 1 to 13, wherein said target compound is secreted by the cells expressing said membrane polypeptide.
15.: The method according to any one of claims 1 to 14, wherein said selection agent is a pathogenic agent.
16.: The method according to any one of claims 1 to 15, wherein said selection agent is a virus, preferably a bacteriophage.
17.: The method according to any one of claims 15 to 16, wherein said pathogenic agent is a modified pathogenic agent with a different effect on the cells than the corresponding naturally occurring pathogenic agent, preferably a stronger effect than the naturally occurring pathogenic agent.
18.: The method according to any one of claims 1 to 17, wherein steps b) through e) are repeated at least once, preferably comprising an increase in the selection pressure in the repeated steps .
19.: The method according to claim 18, wherein steps b) through e) are repeated until said cell population comprises a desired number of distinct clones.
20.: The method according to claim 18 or 19, wherein new sequence variations in said variable regions of clones selected in step d) are introduced before steps b) to e) are repeated.
21.: The method according to any one of claims 1 to 20, wherein said membrane protein exhibits an additional domain on the surface of the cell, said additional domain being adjacent to both, the variable domain and said selection site and said additional domain also interacts, preferably by binding, with said target compound.
22.: The method according to any one of claims 1 to 21, wherein said polypeptide of interest bind to said target compound and affect its function, comprising screening polypeptides exhibited by said clones selected m step d) and amplified in step e) by their ability to affect the function of said target compound.
23.: The method according to any one of claims 1 to 22, wherein said polypeptide of interest exhibits an enzymatic activity, comprising a step wherein said cell population is placed m a nutrient medium which each cell' s supply of a vital compound depends upon that cell's variable region acting enzymatically upon a compound structurally similar, but not identical to said target compound.
24.: The method according to any one of claims 1 to 23, wherein said membrane polypeptide is an ╬▒-helical polytopic receptor, a glycolipid-anchored protein, a glycoprotein or a proteoglycan.
25.: Cell population for use in a method according to any one of claims 1 to 24 comprising a library of cells, each cell exhibiting a) a membrane polypeptide which has said polypeptide of interest m its extra- or intracellular domain and b) a selection site being specific for a selection agent, such that upon binding of a target compound to the polypeptide of interest said selection site of cells which exhibit such a binding of said target compound to said polypeptide of interest is modified in a manner that said selection agent may be used for selecting those cells which exhibit said binding of said target compound to said polypeptide of interest from cells which do not exhibit said binding.
26.: Kit comprising a) a cell population according to claim 25, b)a target compound and/or a library of target compounds and c)one or more selection agent (s) .
27.: A method for producing a pharmaceutical preparation, comprising identifying and isolating a novel pharmaceutically active compound according to a method according to any one of claims 1 to 24, adding pharmaceutically acceptable auxiliary components and completing said pharmaceutical preparation
PCT/EP1998/007922 1997-12-05 1998-12-07 Method and cells to screen combinatorial libraries WO1999030151A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU18769/99A AU749042B2 (en) 1997-12-05 1998-12-07 Method and cells to screen combinatorial libraries
JP2000524659A JP2001526054A (en) 1997-12-05 1998-12-07 Methods and cells for screening combinatorial libraries
CA002310381A CA2310381A1 (en) 1997-12-05 1998-12-07 Method and cells to screen combinatorial libraries
EP98963539A EP1036321A1 (en) 1997-12-05 1998-12-07 Method and cells to screen combinatorial libraries

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP97121451A EP0922957B1 (en) 1997-12-05 1997-12-05 Method to determine biomolecular interaction
EP97121451.5 1998-07-23
EP98890217 1998-07-23
EP98890217.7 1998-07-23

Publications (1)

Publication Number Publication Date
WO1999030151A1 true WO1999030151A1 (en) 1999-06-17

Family

ID=26145960

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP1998/007922 WO1999030151A1 (en) 1997-12-05 1998-12-07 Method and cells to screen combinatorial libraries

Country Status (5)

Country Link
EP (1) EP1036321A1 (en)
JP (1) JP2001526054A (en)
AU (1) AU749042B2 (en)
CA (1) CA2310381A1 (en)
WO (1) WO1999030151A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000045839A2 (en) * 1999-02-01 2000-08-10 Cistem Biotechnologies Gmbh A method for selecting and producing vaccine and diagnostic preparations
WO2002102855A2 (en) * 2000-11-17 2002-12-27 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
AT410798B (en) * 2001-01-26 2003-07-25 Cistem Biotechnologies Gmbh METHOD FOR IDENTIFYING, ISOLATING AND PRODUCING ANTIGENS AGAINST A SPECIFIC PATHOGEN
US6706477B2 (en) 1997-09-22 2004-03-16 University Of Rochester Methods for producing polynucleotide libraries in vaccinia virus
WO2005001130A1 (en) * 2003-06-09 2005-01-06 Applera Corporation Methods for selecting protein binding moieties
US7067251B2 (en) 2000-03-28 2006-06-27 University Of Rochester Methods of directly selecting cells expressing inserts of interest
EP1830190A2 (en) * 2000-11-17 2007-09-05 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
E.T. BODER ET AL.: "Yeast surface display for screening combinatorial polypeptide libraries", NATURE BIOTECHNOLOGY, vol. 15, no. 6, 1 June 1997 (1997-06-01), WASHINGTON DC USA, pages 553 - 557, XP002064402 *
M.C. KIEKE ET AL.: "Isolation of anti-T cell receptor scFv mutants by yeast surface display.", PROTEIN ENINEERING, vol. 10, no. 11, 1 November 1997 (1997-11-01), OXFORD UK, pages 1303 - 1310, XP002064403 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6706477B2 (en) 1997-09-22 2004-03-16 University Of Rochester Methods for producing polynucleotide libraries in vaccinia virus
US6872518B2 (en) 1997-09-22 2005-03-29 University Of Rochester Methods for selecting polynucleotides encoding T cell epitopes
WO2000045839A3 (en) * 1999-02-01 2001-10-04 Cistem Biotechnologies Gmbh A method for selecting and producing vaccine and diagnostic preparations
WO2000045839A2 (en) * 1999-02-01 2000-08-10 Cistem Biotechnologies Gmbh A method for selecting and producing vaccine and diagnostic preparations
US7067251B2 (en) 2000-03-28 2006-06-27 University Of Rochester Methods of directly selecting cells expressing inserts of interest
WO2002102855A3 (en) * 2000-11-17 2003-06-26 Univ Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
US7858559B2 (en) 2000-11-17 2010-12-28 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
WO2002102855A2 (en) * 2000-11-17 2002-12-27 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
US8741810B2 (en) 2000-11-17 2014-06-03 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
CN1306272C (en) * 2000-11-17 2007-03-21 罗切斯特大学 In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
EP1830190A2 (en) * 2000-11-17 2007-09-05 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
EP1830190A3 (en) * 2000-11-17 2008-07-09 University Of Rochester In vitro methods of producing and identifying immunoglobulin molecules in eukaryotic cells
AT410798B (en) * 2001-01-26 2003-07-25 Cistem Biotechnologies Gmbh METHOD FOR IDENTIFYING, ISOLATING AND PRODUCING ANTIGENS AGAINST A SPECIFIC PATHOGEN
WO2005001130A1 (en) * 2003-06-09 2005-01-06 Applera Corporation Methods for selecting protein binding moieties
JP2007501636A (en) * 2003-06-09 2007-02-01 アプレラ コーポレイション Method for selecting protein binding moieties

Also Published As

Publication number Publication date
AU1876999A (en) 1999-06-28
CA2310381A1 (en) 1999-06-17
EP1036321A1 (en) 2000-09-20
AU749042B2 (en) 2002-06-20
JP2001526054A (en) 2001-12-18

Similar Documents

Publication Publication Date Title
US7772189B2 (en) Phage displayed cell binding peptides
Rakonjac et al. Filamentous bacteriophage: biology, phage display and nanotechnology applications
Rodi et al. Phage-display technology–finding a needle in a vast molecular haystack
EP1904634B1 (en) Novel phage display technologies
Russel et al. Filamentous phage
AU761648B2 (en) Methods and reagents for isolating biologically active peptides
AU5122499A (en) Improved transformation efficiency in phage display through modification of a coat protein
Gupta et al. High-density functional display of proteins on bacteriophage lambda
US20020025536A1 (en) Methods and reagents for isolating biologically active antibodies
AU749042B2 (en) Method and cells to screen combinatorial libraries
Johnsson et al. Phage display of combinatorial peptide and protein libraries and their applications in biology and chemistry
JP2002503453A (en) Cytotoxic heteromeric protein combination library
Rakonjac et al. Structure, biology, and applications of filamentous bacteriophages
EP0922957B1 (en) Method to determine biomolecular interaction
Kay Biologically displayed random peptides as reagents in mapping protein-protein interactions
Nakajima et al. Expression of random peptide fused to invasin on bacterial cell surface for selection of cell-targeting peptides
US8063178B2 (en) Phage displayed Trp cage ligands
JP2007511218A (en) Mycolactone locus: assembly line for the production of novel polyketides, therapeutic and protective uses
WO2003061596A2 (en) Methods and reagents for generating chimeric serum peptide carri ers
Phage F ilamentous phages constitute a large family of bacterial viruses that infect many Gram-negative bacteria and even a Gram-positive bacterium (18). These long, slender viruses contain a circular, single-stranded DNA genome encased in a somewhat flexible tube composed of thousands of copies of a single major coat protein (figure 12-1). Two minor proteins at one end and two others at the other end
Orsolya Bacteriological Applications of Random Phage Display Libraries
LECTURE Symposium on Plasmid Biology
Purdy To assemble, or not to assemble: the initiation of retroviral capsid assembly
AU2004201825A1 (en) Improved transformation efficiency in phage display through modification of a coat protein

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 18769/99

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2310381

Country of ref document: CA

Ref country code: CA

Ref document number: 2310381

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 1998963539

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: KR

WWP Wipo information: published in national office

Ref document number: 1998963539

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWE Wipo information: entry into national phase

Ref document number: 09555623

Country of ref document: US

WWG Wipo information: grant in national office

Ref document number: 18769/99

Country of ref document: AU

WWW Wipo information: withdrawn in national office

Ref document number: 1998963539

Country of ref document: EP