WO1987003300A1 - Enhanced secretion of heterologous proteins by hosts using substituted promoters - Google Patents

Enhanced secretion of heterologous proteins by hosts using substituted promoters

Info

Publication number
WO1987003300A1
Authority
WO
WIPO (PCT)
Prior art keywords
host
promoter
secretion
dna
dna sequence
Prior art date
Application number
PCT/EP1986/000675
Other languages
French (fr)
Inventor
Joachim F. Ernst
Ursula Schmeissner
Original Assignee
Biogen N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Biogen N.V.
Priority to EP86906826A (patent EP0283475B1)
Priority to DE3689846T (patent DE3689846T2)
Publication of WO1987003300A1

Classifications

    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K14/00 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575 Hormones
    • C07K14/65 Insulin-like growth factors, i.e. somatomedins, e.g. IGF-1, IGF-2
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K14/00 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/37 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi
    • C07K14/39 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts
    • C07K14/395 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from fungi from yeasts from Saccharomyces
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67 General methods for enhancing the expression
    • C12N15/69 Increasing the copy number of the vector
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79 Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/80 Vectors or expression systems specially adapted for eukaryotic hosts for fungi
    • C12N15/81 Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K2319/00 Fusion polypeptide
    • C07K2319/01 Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/02 Fusion polypeptide containing a localisation/targetting motif containing a signal sequence

Definitions

  • This invention relates to expression systems and recombinant DNA molecules that facilitate enhanced secretion of heterologous proteins by hosts, to hosts comprising such recombinant DNA molecules and to methods of producing desired proteins using such hosts.
  • Proteins prepared by recombinant DNA methods are sometimes difficult to isolate because the protein must be extracted from the transformed microorganism by means such as cell lysis that typically destroy the microorganism. Obtaining sufficiently pure protein in high yield from such extraction procedures is difficult because of the large number of different organic compounds liberated when lysis takes place.
  • Secreted protein is more easily separated from the culture medium, and the microorganism can survive the separation process to produce and to secrete more of the desired protein.
  • Most proteins naturally secreted by prokaryotes and eukaryotes are initially synthesized in the form of precursors containing an amino-terminal extension several amino acids long. This extension, called a secretion leader or signal sequence, allows the precursor protein to cross the cell membrane of the microorganism and enter the culture medium or periplasmic space of the cell. During secretion, the secretion leader is cleaved from the protein, leading to the presence of mature protein in the culture medium or periplasmic space.
  • The present invention solves the foregoing problems by providing expression systems that facilitate enhanced secretion of heterologous proteins.
  • Using a promoter of at most intermediate strength and a heterologous DNA secretion signal sequence in a multicopy vector results in a higher yield of a desired protein from the host than is obtained from hosts transformed by a multicopy vector using a strong promoter.
  • One embodiment of the present invention comprises a DNA sequence that is an expression and secretion control sequence for producing and secreting a selected protein from a host in which the protein is made, said sequence comprising (a) a promoter that is active in said host and is of at most intermediate strength, and (b) a heterologous DNA secretion signal sequence, recognized by said host, beginning with a start codon, operatively linked to said promoter.
  • The expression and secretion control sequence may then be operatively linked to a DNA sequence coding for a desired protein and employed to transform a host to produce and secrete the desired protein.
  • Yeast is a preferred host, and the actin (ACT) and iso-1-cytochrome c (CYC1) promoters are preferred promoters for yeast hosts.
  • S. cerevisiae is the most preferred host.
  • The secretion signal sequence is preferably the secretion signal sequence of the MFα1 gene.
  • the present invention also relates to hosts transformed by the recombinant DNA molecules discussed above and to methods of using transformed hosts in the preparation and secretion of desired proteins.
  • Figure 1A shows the structure of the MFα1 secretion precursor. Stippled regions indicate spacer regions separating the MFα1 repeats. During secretion the secretion leader is cleaved off at the position indicated by the arrow.
  • Figure 1B shows a general scheme for the construction of a yeast secretion precursor comprising the alpha mating factor (MFα1) secretion leader fused to the desired heterologous protein. During secretion, the secretion leader is cleaved at the position indicated by the arrow.
  • Figure 2 shows the construction of a gene fusion between the MFα1 expression control sequence and the SMC gene and insertion of the MFα1/SMC fusion into a yeast expression vector.
  • Figure 3 shows two graphs: growth (as reflected by optical density at 600 nm) and SMC levels in the culture fluid of yeast strain BJ1991 transformed with URA3-type expression vectors.
  • The CYC1 promoter construction (p336/1) is shown by ( ⁇ ); the ACT promoter construction (p364/1) by ( ⁇ ); and the MFα1 promoter construction (p446/1) by ( ⁇ ).
  • Figure 4A shows the construction of actin promoter fragments ending in EcoRI sites.
  • Figure 4B shows the DNA structure of promoter ends for actin and the pEX-5, pEX-7, and pEX-8 vectors.
  • Figure 5 shows the construction of SMC secretion vectors based on the ACT and CYC1 promoters.
  • Figure 6 shows the structure of SMC secretion vectors based on the yeast LEU2 gene.
  • Figure 7 shows two graphs: growth (as reflected by optical density at 600 nm) and SMC levels in the culture fluid of yeast strain BJ1991 transformed with LEU2-type expression vectors.
  • The CYC1 promoter construction (504/1) is shown by ( ⁇ ); the ACT promoter construction (p482/18) by ( ⁇ ); and the MFα1 promoter construction (pMF-SMC, rearranged) by ( ⁇ ).
  • Figure 8 shows the construction of secretion vectors for TNF.
  • Figure 9 shows the construction of secretion vectors for TPA.
  • Cloning - The process of obtaining a population of organisms or DNA sequences derived from one such organism or sequence by asexual reproduction. Recombinant DNA Molecule or Hybrid DNA -
  • Protein - A polypeptide containing a linear series of more than fifty amino acids, e.g., proinsulin, serum albumin, human growth hormone, parathyroid hormone, and interferon. As used herein, however, a protein also comprises a polypeptide chain of fewer than fifty amino acids.
  • Polypeptide - A linear series of amino acids connected one to the other by peptide bonds between the amino and carboxy groups of adjacent amino acids.
  • Expression - The process undergone by a gene to produce a polypeptide or protein. It is a combination of transcription and translation.
  • Promoter - The region of DNA responsible for binding RNA polymerase to initiate transcription.
  • a promoter is located before the ribosome binding site.
  • a strong promoter is defined as one that has a strong affinity for RNA polymerase and that would accordingly be expected to aid in obtaining high expression rates.
  • a promoter of intermediate strength has a lower affinity for RNA polymerase.
  • the strength of a promoter may be affected by the host, the desired protein and the expression vector of the recombinant system. The effects of these factors are more fully discussed below.
  • levels of mRNA and levels of expressed protein are proportional to promoter strength.
  • promoter strength may affect the number of copies of a multicopy expression vector produced within a given host.
  • a host will produce many copies of a multicopy vector that contains an intermediate or weak promoter but will produce fewer copies of a multicopy vector that contains a strong promoter.
  • An estimate of the strength of a promoter may be based on the percent of total mRNA in a cell produced by the expression control sequences of that promoter.
  • Table 1 shows a list of genes, including their associated promoters and associated mRNA, as described in the literature. A characterization of the promoter strength is also included.
  • Promoters of glycolytic genes are considered “strong.” Those skilled in the art will understand, however, that promoters may not have the same strength within different strains of a host, with different desired proteins or in different multicopy vectors. In S. cerevisiae, for example, the MFα1 promoter is active in MATα strains, but inactive in MATa strains. Those skilled in the art will appreciate that promoter strength depends upon many factors.
  • Promoters of the present invention should be selected so that the combination of promoter strength and copy number of the multicopy vector provides optimum yields of desired protein. Promoters of intermediate strength provide the best combination of promoter strength with copy number. Weaker promoters, in combination with high copy numbers, are also within the scope of the invention. Strong promoters adversely affect the copy number of the multicopy vectors within the transformed host, and unacceptable yields are obtained from those transformants.
  • a promoter should be selected that has maximal strength without significantly reducing plasmid copy number. Generally, a promoter may be considered of at most intermediate strength with respect to a given combination of host, vector and desired protein if the percent of total mRNA in the host corresponding to the promoter is less than 0.3%, and preferably is at most 0.15%.
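
The mRNA-share criterion above can be sketched as a small classifier. This is only an illustration, not part of the patent: the function name and the labels are assumptions, while the two thresholds (less than 0.3%, preferably at most 0.15%) are the ones stated in the text. The two sample values are the mRNA levels quoted elsewhere in this document for CYC1 (about 0.05%) and URA3 (about 0.008%).

```python
# Thresholds taken from the text: a promoter is "at most intermediate" if its
# share of total mRNA is below 0.3%, and preferably at most 0.15%.
INTERMEDIATE_LIMIT = 0.3
PREFERRED_LIMIT = 0.15


def classify_promoter(percent_of_total_mrna: float) -> str:
    """Return a coarse, illustrative label for a promoter's mRNA share (in %)."""
    if percent_of_total_mrna < 0:
        raise ValueError("percentage cannot be negative")
    if percent_of_total_mrna <= PREFERRED_LIMIT:
        return "at most intermediate (preferred)"
    if percent_of_total_mrna < INTERMEDIATE_LIMIT:
        return "at most intermediate"
    return "too strong"


# mRNA levels quoted in this document for two yeast promoters.
for name, pct in [("CYC1", 0.05), ("URA3", 0.008)]:
    print(name, classify_promoter(pct))
```

As the text stresses, such a label applies only to a given combination of host, vector and desired protein, so a single number per promoter is a simplification.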
  • Ribosome Binding Site - The region of DNA which codes for a site on mRNA which helps the mRNA bind to the ribosome, so that translation can begin.
  • a ribosome binding site is located after (downstream from) the promoter and before (upstream from) the translational start signal of the DNA sequence to be expressed to produce the desired protein.
  • Gene - A DNA sequence which encodes, as a template for mRNA, a sequence of amino acids characteristic of a specific polypeptide or protein.
  • Expression Control Sequence - A DNA sequence that controls and regulates expression of genes when operatively linked to those genes.
  • Such sequences include the lac system, the β-lactamase system, the trp system, the tac and trc systems, the major operator and promoter regions of phage λ, the control region of fd coat protein, the early and late promoters of SV40, promoters derived from polyoma virus and adenovirus, metallothionein promoters, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast α-mating factors, and other sequences known in the art to control the expression of genes in prokaryotic or eukaryotic cells and their viruses or combinations thereof.
  • the gene can be linked to a eukaryotic promoter, such as that for the SV40 early region, coupled to the gene encoding dihydrofolate reductase and selectively amplified in Chinese hamster ovary cells to produce a cell line containing many copies of actively transcribed eukaryotic genes.
  • The precursor is synthesized within a host cell, e.g., preproinsulin, preserum albumin, prehuman growth hormone, preparathyroid hormone, and preinterferon.
  • a mature protein is obtained by secreting the precursor through the cell membrane of a host with an attendant loss or clipping of the signal sequence of its precursor.
  • Nucleotide - A monomeric unit of DNA or RNA consisting of a sugar moiety (pentose), a phosphate, and a nitrogenous heterocyclic base.
  • The base is linked to the sugar moiety via the glycosidic carbon (1' carbon of the pentose), and that combination of base and sugar is called a nucleoside.
  • the base characterizes the nucleotide.
  • The four DNA bases are adenine ("A"), guanine ("G"), cytosine ("C") and thymine ("T").
  • The four RNA bases are A, G, C and uracil ("U").
  • DNA Sequence - A linear series of nucleotides connected one to the other by phosphodiester bonds between the 3' and 5' carbons of adjacent pentoses.
  • Codon - A DNA sequence of three nucleotides (a triplet) which encodes, through messenger RNA ("mRNA"), an amino acid, a translational start signal or a translational termination signal.
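
The relationship between DNA bases, RNA bases and codons defined above can be illustrated with a short sketch. The function names and the example sequence are hypothetical and chosen only for illustration; the only facts relied on are that RNA uses uracil where DNA uses thymine and that codons are read as consecutive triplets.

```python
def transcribe(coding_strand: str) -> str:
    """Return the mRNA that has the same sequence as the DNA coding strand (T -> U)."""
    dna = coding_strand.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("DNA may contain only A, C, G and T")
    return dna.replace("T", "U")


def codons(sequence: str) -> list[str]:
    """Split a sequence into consecutive triplets, dropping any incomplete tail."""
    return [sequence[i:i + 3] for i in range(0, len(sequence) - 2, 3)]


# Hypothetical 12-base coding sequence beginning with the start codon ATG.
example_dna = "ATGAAAAGATAA"
example_mrna = transcribe(example_dna)
print(example_mrna)          # AUGAAAAGAUAA
print(codons(example_mrna))  # ['AUG', 'AAA', 'AGA', 'UAA']
```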
  • Plasmid - A non-chromosomal double-stranded DNA sequence comprising an intact "replicon" such that the plasmid is replicated in a host cell.
  • the characteristics of that organism are changed or transformed as a result of the DNA of the plasmid.
  • A plasmid carrying the gene for tetracycline resistance (TetR) transforms a host cell previously sensitive to tetracycline into one which is resistant to it.
  • A host cell transformed by a plasmid is called a "transformed host" or a "transformant."
  • Phage or Bacteriophage - Bacterial virus which may include DNA sequences contained in a protein envelope or coat ("capsid").
  • Cloning Vehicle - A plasmid, phage DNA or other DNA sequence which is able to replicate in a host cell, characterized by one or a small number of endonuclease recognition sites at which its DNA sequence may be cut in a determinable fashion without attendant loss of an essential biological function of the DNA, e.g., replication, production of coat proteins or loss of promoter or binding sites, and which contains a marker suitable for use in the identification of transformed cells, e.g., tetracycline resistance or ampicillin resistance.
  • a cloning vehicle is also known as a vector. In the present invention, the vector must be a "multicopy" vector, i.e., capable of producing multiple copies of itself within the host. Single copy vectors, incapable of producing copies, are not within the scope of this invention.
  • DNA Signal Sequence - A DNA sequence which encodes, as a template for mRNA, a sequence of typically hydrophobic amino acids at the amino terminus of the polypeptide or protein, i.e., a "signal sequence" or "secretion leader sequence" of the protein.
  • A DNA signal sequence is operatively linked to and located immediately before the DNA sequence of the desired protein and after its translational start signal (e.g., ATG). It is believed that only a portion of a signal sequence of a precursor of a protein is essential for the precursor of the protein to be transported through the cell membrane of a host and for proper clipping of the precursor's signal sequence to free the protein during secretion.
  • DNA signal sequence includes those DNA sequences that code for that portion of a signal sequence required for secretion.
  • An expression system of this invention is operatively linked to a DNA sequence coding for the desired protein and used to transform an appropriate host. The host may then be cultured under appropriate conditions of growth. The desired protein may then be isolated from the culture. Optimum results will, of course, depend on a number of variables, including the appropriate choice of host, promoter, vector, and growth conditions.
  • hosts have been used in recombinant DNA technology and are well known in the art.
  • A wide variety of hosts may be useful in the method of this invention. These hosts include, for example, bacteria, such as E. coli (for example E. coli HB101 or E. coli MC1061), Bacillus, Streptomyces, and Pseudomonas, fungi, such as yeasts, animal cells, such as CHO cells, mouse, swine, bovine, fowl or fish cells, plant cells in tissue culture, human tissue cells, or other hosts known in the art.
  • The selection of an appropriate host is controlled by a number of factors recognized by the art. These include, for example, compatibility with the chosen vector, ease of recovery of the desired protein, expression characteristics, biosafety and costs.
  • the host must be able to recognize both the promoter used and the secretion signal sequence used.
  • The toxicity of the desired protein to the host must be considered, as well as the viability of the host when transformed by a multicopy vector. No absolute choice of host may be made for a particular recombinant DNA molecule or polypeptide from any of these factors alone. Instead, a balance of these factors must be struck with the realization that not all hosts may be equally effective for expression of a particular DNA sequence operatively linked to a particular expression control sequence.
  • yeast is a preferred host, and S. cerevisiae is especially preferred.
  • Various promoters may be used in the expression systems of this invention, provided, however, that the promoter is active in the chosen host and is of at most intermediate strength.
  • the toxicity of the desired protein to the host affects the promoter choice; for secretion of a less toxic protein, a relatively stronger promoter can be used than for secretion of a more toxic protein.
  • the coding region attached to a promoter in a multicopy vector also affects the strength of the promoter.
  • the pgk promoter is very strong when attached to its own coding region (PGK), but weak when attached to an alpha interferon gene. J. Mellor et al., "Factors Affecting Heterologous Gene Expression in Saccharomyces cerevisiae," Gene, 33, pp. 215-26 (1985).
  • a promoter In choosing a promoter it should also be appreciated that the junction of a strong promoter to a heterologous gene will considerably weaken the strong promoter, as described by Mellor et al., supra. Because promoter strength is affected by so many variables, the fact that a promoter is too strong for use in one host-vector-desired protein combination does not exclude the use of that promoter in a different combination.
  • Promoters that may be considered for use in recombinant DNA molecules made from the present invention include, for the yeast S. cerevisiae, the promoters of the ACT (actin) gene, the CYC1 (iso-1-cytochrome c) gene and the URA3 gene. Kim and Warner, J. Mol. Biol., 165, pp. 79-89 (1983) (mRNA level about 0.008%). For secretion expression systems, strong promoters not suitable for use with S. cerevisiae include, for example, the G3PDH and the enolase promoters.
  • Preferred promoters in secretion expression systems for use with bacteria such as E. coli and B. subtilis should be of at most intermediate strength. Strong promoters, such as the trp, trc and lac promoters, should generally be avoided. Intermediate promoter strength might, however, conveniently be adjusted with regulated promoters, such as the trp or lac promoters, if intermediate inducing growth conditions are chosen.
  • promoters that are active during all phases of growth. If the promoters were to be shut off during growth, the adjustment of vector copy number would presumably not take place unless the promoters were induced. Regulated promoters, however, would probably not provide an advantage. Presumably, turning on a strong promoter in a high-copy secretion construction would immediately lyse the host, as in insulin secretion by E.coli.
  • A host makes few copies (i.e., low copy numbers) of multicopy vectors if the promoter is strong in the host-vector-desired protein system, and more copies (i.e., higher copy numbers) are made of a multicopy vector having a weaker promoter, resulting in improved yields of desired protein.
  • a promoter weaker than the CYC1 promoter discussed in Example 3, e.g., CYC7 or trp1
  • Lowest strength promoters are limited by the promoter strength, and the strongest promoters give only limited copy numbers.
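
The trade-off described above (weak promoters are limited by their own strength, strong promoters by reduced copy number) can be made concrete with a toy model. Everything in this sketch is an assumption made purely for illustration: the functional form, the parameter names `max_copies` and `half_point`, and their values are hypothetical and are not taken from the patent or from any measurement.

```python
def copy_number(strength: float, max_copies: float = 50.0, half_point: float = 1.0) -> float:
    """Hypothetical copy number that falls as promoter strength rises (assumed form)."""
    return max_copies / (1.0 + (strength / half_point) ** 2)


def relative_yield(strength: float) -> float:
    """Toy yield: per-copy expression (taken as the strength) times copy number."""
    return strength * copy_number(strength)


# Scan arbitrary strength units from 0.1 to 5.0 and pick the best one.
strengths = [i / 10 for i in range(1, 51)]
best = max(strengths, key=relative_yield)
print(best, relative_yield(best))  # 1.0 25.0
```

In this assumed model the yield peaks at an intermediate strength (the `half_point`) rather than at either extreme, which mirrors the qualitative argument in the text; the numbers themselves carry no biological meaning.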
  • the multicopy vector and, in particular, the sites chosen therein for insertion of the expression systems of this invention are determined by a variety of factors, e.g., number of sites susceptible to a particular restriction enzyme, size of the protein to be expressed, expression characteristics such as the location of start and stop codons relative to the vector sequences, and other factors recognized by those of skill in the art.
  • the choice of a vector is determined by a balance of these factors, not all selections being equally effective for a given case.
  • The preferred vectors are plasmids, such as those containing origins of replication derived from chromosomal and non-chromosomal DNA. In the yeast S. cerevisiae, the various ARS- and 2μ-type vectors (Botstein et al.) are preferred.
  • multicopy plasmids such as ColE1, pBR322 and RP4 plasmids, or M13 and lambda phage vectors
  • non-integrating multicopy vectors such as vectors based on the bovine papilloma virus
  • The choice of vector will also be influenced by the chosen promoter. Optimal promoter and vector systems will lead to less toxicity of the secreted protein (due to too high a copy number or overproduction based on too strong a promoter) and, consequently, higher levels of secretion may be obtained.
  • the heterologous secretion signal sequences of this invention must be recognized and correctly processed by the host.
  • the signal sequence may comprise a naturally occurring signal sequence, an effective portion of a naturally occurring signal sequence, or a combination of two or more signal sequences or effective portions of signal sequences.
  • the signal sequence must be operatively linked to both the promoter and the start signal, ATG.
  • the signal sequence begins with the translation start signal.
  • The signal sequence is selected from signal sequences already used by the host to aid secretion of its own proteins.
  • the desired protein may include any polypeptide or protein, and may include fusions, preproteins, immature proteins or any desired sequence of amino acids.
  • desired proteins of this invention include proteins, and other amino acid sequences, that are secretable by some microorganism or cell.
  • Secretable proteins and other amino acid sequences include serum proteins, analgesic polypeptides such as β-endorphin, somatostatin, insulin, growth hormone (human and bovine), luteinizing hormone, ACTH, pancreatic polypeptide preproteins, preproinsulin, proinsulin, and the A and B chains of insulin.
  • the desired proteins of the present invention may comprise proteins and other amino acid sequences naturally secreted into the medium by the selected host.
  • The desired protein of the present invention need not be secreted by the selected host. As will be seen in Example 6, the host need only manufacture the desired protein operatively linked to the chosen signal sequence and correctly process the desired protein into the endoplasmic reticulum. If the desired protein enters the secretion pathway of the host, benefits may be obtained for the desired protein, such as glycosylation. Secretion is, however, the preferred result for the desired protein.
  • the present invention may be useful in mammalian cells, if non-integrating, multicopy vectors, such as vectors derived from bovine papilloma virus (D. DiMaio et al., "Bovine PapillomaVirus Vector that Propagates as a Plasmid in Both Mouse and Bacterial Cells," Proc. Nat. Acad. Sci. USA, 79, pp. 30-34 (1982)) are used in conjunction with an intermediate strength promoter.
  • This example describes the preparation of a transformed yeast which is used in a later example for comparative purposes.
  • The transformed yeast of this example has a plasmid containing an MFα1 promoter (which is a strong promoter in yeast as shown below) operatively linked to an MFα1 secretion leader and a DNA sequence coding for SMC.
  • Other examples describe the substitution of the ACT and CYC1 promoters for the MFα1 promoter, and a comparison of the secretion levels of the variously transformed yeasts.
  • other examples describe the preparation of transformed yeasts having plasmids characterized by DNA sequences coding for desired proteins other than SMC.
  • The MFα1 secretion precursor, comprising a secretion leader and four MFα1 repeats, is shown in Figure 1A.
  • The stippled regions indicate spacer regions between the MFα1 repeats.
  • The pre-MFα1 gene and associated expression control sequence, which resided on a 1.7 kb EcoRI fragment, were subcloned into pUC18, the selected cloning vector.
  • The resulting plasmid (p220/3) now included an MFα1 promoter, an MFα1 signal sequence and an MFα1 DNA coding sequence.
  • modified oligomer: (3') TA TTT TCT CTA GAA CTT CGA ACC (5'); original: (3') TA TTT TCT CTC CGA CTT CGA ACC (5'); sequence: lys arg glu ala glu ala trp
  • The resulting plasmid was called p254 (Figure 2). It was further modified to prevent unintended cleavage at a BglII site found at the 5' end of the MFα1 promoter. We altered this undesired BglII site by cleaving with BglII and filling in the protruding ends using the large (Klenow) fragment of DNA polymerase I and deoxynucleotide triphosphates and subsequent ligation using T4 ligase.
  • A StuI site was introduced, in a procedure similar to the mutagenesis described above, at the position corresponding to the same lys-arg cleavage site: modified oligomer: (5') TA AAA AGG CCT CTT GAA GC (3')
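
The mutagenesis oligomers quoted above can be checked mechanically. The sketch below is an illustration rather than the authors' procedure: it base-pairs the 3'→5' oligomers to obtain the strand read 5'→3', translates that strand from its third base, and locates the bases that were changed. The helper names are hypothetical, and the codon table is only the subset of the standard genetic code needed here. AGATCT and AGGCCT are the recognition sequences commonly given for BglII and StuI; the excerpt names only the StuI site explicitly.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Subset of the standard genetic code covering the codons that occur here.
CODON_TO_AA = {
    "AAA": "lys", "AGA": "arg", "GAG": "glu", "GCT": "ala",
    "GAA": "glu", "TGG": "trp", "GAT": "asp", "CTT": "leu",
}

# Oligomers as printed in the text; the first two are written 3' -> 5'.
ORIGINAL_3_TO_5 = "TA TTT TCT CTC CGA CTT CGA ACC".replace(" ", "")
MODIFIED_3_TO_5 = "TA TTT TCT CTA GAA CTT CGA ACC".replace(" ", "")
STUI_OLIGO_5_TO_3 = "TA AAA AGG CCT CTT GAA GC".replace(" ", "")


def paired_strand(oligo_3_to_5: str) -> str:
    """Base-pair a 3'->5' oligomer; the result reads 5'->3'."""
    return "".join(COMPLEMENT[base] for base in oligo_3_to_5)


def translate(strand_5_to_3: str, offset: int) -> list[str]:
    """Translate the complete codons that start at `offset`."""
    body = strand_5_to_3[offset:]
    return [CODON_TO_AA[body[i:i + 3]] for i in range(0, len(body) - 2, 3)]


def mismatches(a: str, b: str) -> list[int]:
    """0-based positions at which two equal-length sequences differ."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


original = paired_strand(ORIGINAL_3_TO_5)
modified = paired_strand(MODIFIED_3_TO_5)

print(translate(original, 2))   # ['lys', 'arg', 'glu', 'ala', 'glu', 'ala', 'trp']
print(mismatches(ORIGINAL_3_TO_5, MODIFIED_3_TO_5))  # [10, 11, 12]
print("AGATCT" in modified)     # True
print("AGGCCT" in STUI_OLIGO_5_TO_3)  # True
```

The translation of the original strand reproduces the lys arg glu ala glu ala trp sequence printed with the oligomers, and the three changed bases sit directly after the lys-arg codons.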
  • Example 2 Preparation of ACT/MFα1/SMC DNA Sequences
  • Levels of the mRNA coding for actin (D. Gallwitz et al., "The Actin Gene in the Yeast Saccharomyces cerevisiae: 5' and 3' End Mapping, Flanking and Putative Regulatory Sequences," Nucl. Ac. Res., 9, pp. 6339-50 (1981)) suggest that the ACT promoter is a promoter of intermediate strength; see Table 1. See also Himmelfarb, supra.
  • Plasmid pYA301, shown in Figure 4, contains the ACT gene and its associated expression control sequences on a 4 kb EcoRI-BamHI fragment.
  • Plasmid pYA301 is a 4 kb EcoRI-BamHI subclone of pYA208 (Gallwitz and Sures, supra) inserted into pBR322.
  • The plasmid was cut at the single XhoI site and treated with Bal31. Following an empirically determined time of Bal31 treatment, the plasmid was cut with EcoRI and its protruding ends were filled in using the large (Klenow) fragment of DNA polymerase I.
  • the treated plasmid was religated under dilute conditions. Approximately half of the resulting plasmids, identified collectively as p ⁇ 4, contained an EcoRI site. These EcoRI sites had been formed by joining the filled-in EcoRI end to the Bal31 ACT promoter end. Structures of various p ⁇ 4 promoter ends in the vicinity of the ATG start codon of the religated plasmid are shown in Figure 4B.
  • The modification was carried out by cutting plasmid p309/1 with PstI and HindIII. The large fragment was isolated and joined to a fragment of pS30/25 resulting from partial digestion of that plasmid with PstI and HindIII.
  • The resulting plasmid, p355/1 (ACT promoter/MFα1 secretion leader/SMC structural DNA sequence), was transferred to a yeast shuttle vector, as shown in Figure 5, which resulted in formation of plasmid p364/1. SMC secretion rates by yeast transformed with this plasmid are shown in Figure 3.
  • Iso-1-cytochrome c amounts to approximately 0.2% of the total cellular protein of yeast, and the encoding mRNA amounts to about 0.05% of the total poly(A) RNA.
  • The CYC1 promoter is thus a weak promoter.
  • Tripartite SMC expression vectors based on the MFα1 secretion leader and the MFα1, the ACT, and the CYC1 promoters have been described above.
  • Each of these vectors contained the yeast URA3 gene for selection in yeast.
  • the expression of the plasmid LEU2 gene is low due to a partial deletion; higher copy numbers of the LEU2 type expression vectors are needed to allow the growth of transformants on leucine-free media.
  • E. Erhart and C. P. Hollenberg, "The Presence of a Defective LEU2 Gene on 2μ DNA Recombinant Plasmids of Saccharomyces cerevisiae Is Responsible for Curing and High Copy Number," J. Bacteriol.,
  • yeasts having the URA3 gene or the LEU2 gene in recombinant DNA experiments, see D. Botstein et al., "Sterile Host Yeast (SHY): a Eukaryotic System for Biological Containment for Recombinant DNA Experiments," Gene, 8, pp. 17-24 (1979).
  • The LEU2-type vectors were constructed by inserting the chosen promoter/MFα1 secretion leader/SMC structural sequence into plasmid pJDB207, using procedures similar to those used with the URA3 plasmids. The structures of the resulting plasmids are shown in Figure 6.
  • the plasmid pJDB207 is described in J. D. Beggs, "Multiple-copy Yeast Vectors," Molecular Genetics in Yeast, Alfred Benzon Symposium 16, 383-95 (1981).
  • YE439 is a LEU2-type vector with the ACT promoter/MFα1 secretion leader/SMC structural sequence. It was deposited on October 30, 1985 and is identified as DSM 3578.
  • YE466 is a LEU2-type vector with the CYC1 promoter/MFα1 secretion leader/SMC structural sequence. It was deposited on October 30, 1985 and is identified as DSM 3579. Transformants were grown in the "SD" medium described in F.
  • URA3 transformants A high percentage of URA3 transformants actually have few or no URA3 expression plasmids. These unproductive transformants contribute little to overall SMC secretion. Conceivably, a high percentage of URA3 transformants also have high copy numbers and high promoter strength. These transformants would contribute little to SMC secretion because of slow growth and reduced cell viability.
  • This suggested mechanism does not apply to LEU2 vectors because the minimum copy number of the LEU2-type vectors in transformants grown in leucine-free media is about 35 (Erhart and Hollenberg, supra).
  • the production medium used contains leucine (but no uracil), elevated copy numbers are not required for growth of LEU2 vector-transformants in this medium. Thus, yeast strains transformed with LEU2 appear more homogeneous than URA3 vectors with respect to copy number and mRNA content.
  • Table 2 does not give the actual ratio of SMC to actin transcripts, and the numbers are for comparative purposes only. Transformation frequencies for all six plasmid constructions are also shown in Table 2. Approximately similar transformation frequencies were obtained with all the URA3-type vectors.
  • vector pTNFll (see Figure 8): (1) a
  • TNF secretion vector based on the ACT promoter, instead of the MFα1 promoter, is also shown in Figure 8.
  • pACT-TNF-EX we ligated the following fragments to construct pACT-TNF-EX: (1) a 1.2 kb PstI-HindIII fragment from pTNFll carrying most of the MFα1 secretion leader fused to the TNF gene (described supra); (2) a 0.5 kb BamHI-PstI fragment from p355/1, shown in Figure 5, carrying the ACT promoter and part of the MFα1 leader (described supra); and (3) the 9 kb BamHI-HindIII fragment of pEX-2 (described supra).
  • the gene coding for human tissue plasminogen activator carries a convenient BglII site at a position corresponding to the first amino acid of TPA.
  • D. Pennica et al. "Cloning and Expression of Human Tissue-type Plasminogen Activator cDNA in E. coli," Nature, 301, pp. 214-21 (1983).
  • a convenient BglII site at the MFα1 position corresponding to the lys-arg junction.
  • the structure of the MFα1/TPA fusion point in the following constructions is as follows: (5')...AAA AGA TCT TAC...(3'), encoding lys arg ser tyr, where lys-arg derives from MFα1 and ser-tyr from TPA.
  • TPA expressed was biologically active, indicating correct folding. Attempts to express TPA in yeast without a secretion signal sequence produced non-active TPA; and (2) the cell associated TPA was glycosylated. Secretion at least into the lumen of the endoplasmic reticulum is necessary for glycosylation to occur.
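
The MFα1/TPA fusion point quoted above can be checked with a short script. The following is a minimal sketch, not part of the original disclosure: it assumes the standard BglII recognition sequence AGATCT and uses only the four codons shown at the junction. The names `JUNCTION`, `find_site` and `translate` are hypothetical, introduced for illustration.

```python
# Illustrative check of the MFalpha1/TPA junction quoted in the text.
# Assumption: BglII recognizes the palindromic hexamer AGATCT.
BGLII_SITE = "AGATCT"

# The four junction codons from the text: AAA AGA TCT TAC
JUNCTION = "AAAAGATCTTAC"

# Only the codons that appear at the junction (standard genetic code).
CODON_TO_AA = {"AAA": "lys", "AGA": "arg", "TCT": "ser", "TAC": "tyr"}


def find_site(seq: str, site: str = BGLII_SITE) -> int:
    """Return the 0-based index of the first occurrence of site, or -1."""
    return seq.find(site)


def translate(seq: str) -> list[str]:
    """Translate an in-frame sequence using the small junction codon table."""
    return [CODON_TO_AA[seq[i:i + 3]] for i in range(0, len(seq), 3)]


if __name__ == "__main__":
    print(find_site(JUNCTION))  # 3
    print(translate(JUNCTION))  # ['lys', 'arg', 'ser', 'tyr']
```

The site begins at index 3, so it covers exactly the AGA (arg) and TCT (ser) codons. This agrees with the text: the BglII site lies at the lys-arg junction of MFα1 and at the first amino acid of TPA.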

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Mycology (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medicinal Chemistry (AREA)
  • Diabetes (AREA)
  • Endocrinology (AREA)
  • Toxicology (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)
  • Lubricants (AREA)
  • Medicines Containing Material From Animals Or Micro-Organisms (AREA)

Abstract

Improved secretion of heterologous proteins by hosts such as yeast by using promoters of at most intermediate strength with heterologous DNA secretion signal sequences. A promoter of at most intermediate strength, such as the actin (ACT) or iso-1-cytochrome c (CYC1) promoter in S. cerevisiae, is operatively linked to a DNA signal sequence, such as the MFα1 signal sequence. A DNA sequence for a selected protein, such as somatomedin C (SMC), tissue plasminogen activator (TPA) or tumor necrosis factor (TNF), may be operatively linked to the DNA signal sequence.

Description

ENHANCED SECRETION OF HETEROLOGOUS PROTEINS BY HOSTS USING SUBSTITUTED PROMOTERS
Technical Field Of The Invention
This invention relates to expression systems and recombinant DNA molecules that facilitate enhanced secretion of heterologous proteins by hosts, to hosts comprising such recombinant DNA molecules and to methods of producing desired proteins using such hosts.
Background Of The Invention
Proteins prepared by recombinant DNA methods are sometimes difficult to isolate because the protein must be extracted from the transformed microorganism by means such as cell lysis that typically destroy the microorganism. Obtaining sufficiently pure protein in high yield from such extraction procedures is difficult because of the large number of different organic compounds liberated when lysis takes place.
One attractive alternative to typical separation procedures is to have the host secrete the selected protein into its environment. A secreted protein is more easily separated from the culture medium, and the microorganism can survive the separation process to produce and to secrete more of the desired protein. Most proteins naturally secreted by prokaryotes and eukaryotes are initially synthesized in the form of precursors containing an amino-terminal extension several amino acids long. This extension, called a secretion leader or signal sequence, allows the precursor protein to cross the cell membrane of the microorganism and enter the culture medium or periplasmic space of the cell. During secretion, the secretion leader is cleaved from the protein, leading to the presence of mature protein in the culture medium or periplasmic space.
Although the rate of secretion in bacteria is typically very low, bacterial secretion has been used with recombinant DNA technology. See, for example, United States Patents 4,411,994 and 4,338,397, and Villa-Komaroff et al., Proc. Natl. Acad. Sci. USA, 75, 3727-31 (1978).
The general safety of yeasts and human experience with yeast fermentation have made yeasts desirable candidates for use as hosts in recombinant DNA technology. However, reported attempts to obtain secreted mature proteins from recombinant yeasts have suffered from low yields. See, for example, C. N. Chang et al., "Recognition and Cleavage of Hybrid Invertase Signals and Mature Forms of Human Interferon (IFN-α2) in Yeast," Meeting Abstracts, The Molecular Biology of Yeast, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, p. 393 (1983); B. Meyhack and A. Hinnen, "High Levels of Expression of Foreign Genes Under the Control of the Yeast PHO5 Promoter," Meeting Abstracts, The Molecular Biology of Yeast, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, p. 156 (1983); S. D. Emr, "An MFα1-SUC2 (α-Factor-Invertase) Gene Fusion for Study of Protein Localization and Gene Expression in Yeast," Proc. Natl. Acad. Sci. USA, 80, pp. 7080-84 (1983); A. Brake et al., "α-Factor-Directed Synthesis and Secretion of Mature Foreign Proteins in Saccharomyces cerevisiae," Proc. Natl. Acad. Sci. USA, 81, pp. 4642-46 (1984); G. A. Bitter et al., "Secretion of Foreign Proteins from Saccharomyces cerevisiae Directed by α-Factor Gene Fusions," Proc. Natl. Acad. Sci. USA, 81, pp. 5330-34 (1984); A. Singh et al., "Synthesis, Secretion and Processing of α-Factor-Interferon Fusion Proteins in Yeast," Nucl. Ac. Res., 12, pp. 8927-38 (1984); European Patent Application 123,228; European Patent Application 128,733; and G. Bitter et al., "Secretion of Foreign Proteins from Saccharomyces cerevisiae Directed by α-Factor Pheromone," Nucl. Ac. Res., 11, pp. 4049-63 (1983).
Summary of the Invention
The present invention solves the foregoing problems by providing expression systems that facilitate enhanced secretion of heterologous proteins. We have found that using a promoter of at most intermediate strength and a heterologous DNA secretion signal sequence in a multicopy vector results in a higher yield of a desired protein from the host compared to hosts transformed by a multicopy vector using a strong promoter. Thus, one embodiment of the present invention comprises a DNA sequence that is an expression and secretion control sequence for producing and secreting a selected protein from a host in which the protein is made, said sequence comprising (a) a promoter that is active in said host and is of at most intermediate strength, and (b) a heterologous DNA secretion signal sequence, recognized by said host, beginning with a start codon, operatively linked to said promoter. The expression and secretion control sequence may then be operatively linked to a DNA sequence coding for a desired protein and employed to transform a host to produce and secrete the desired protein. Yeast is a preferred host, and the actin (ACT) and iso-1-cytochrome c (CYC1) promoters are preferred promoters for yeast hosts. S. cerevisiae is the most preferred host. When the host is S. cerevisiae, the secretion signal sequence is preferably the secretion signal sequence of the MFα1 gene.
The present invention also relates to hosts transformed by the recombinant DNA molecules discussed above and to methods of using transformed hosts in the preparation and secretion of desired proteins.
Brief Description Of The Drawings
Figure 1A shows the structure of the MFα1 secretion precursor. Stippled regions indicate spacer regions separating the MFα1 repeats. During secretion the secretion leader is cleaved off at the position indicated by the arrow.
Figure 1B shows a general scheme for the construction of a yeast secretion precursor comprising the alpha mating factor (MFα1) secretion leader fused to the desired heterologous protein. During secretion, the secretion leader is cleaved at the position indicated by the arrow.
Figure 2 shows the construction of a gene fusion between the MFα1 expression control sequence and the SMC gene and insertion of the MFα1/SMC fusion into a yeast expression vector.
Figure 3 shows two graphs: growth (as reflected by optical density at 600 nm) and SMC levels in the culture fluid of yeast strain BJ1991 transformed with URA3-type expression vectors. The CYC1 promoter construction (p336/1) is shown by (●); the ACT promoter construction (p364/1) by (▲); and the MFα1 promoter construction (p446/1) by (■).
Figure 4A shows the construction of actin promoter fragments ending in EcoRI sites.
Figure 4B shows the DNA structure of promoter ends for actin and the pEX-5, pEX-7, and pEX-8 vectors.
Figure 5 shows the construction of SMC secretion vectors based on the ACT and CYC1 promoters.
Figure 6 shows the structure of SMC secretion vectors based on the yeast LEU2 gene.
Figure 7 shows two graphs: growth (as reflected by optical density at 600 nm) and SMC levels in the culture fluid of yeast strain BJ1991 transformed with LEU2-type expression vectors. The CYC1 promoter construction (504/1) is shown by (●); the ACT promoter construction (p482/18) by (▲); and the MFα1 promoter construction (pMF-SMC, rearranged) by (■).
Figure 8 shows the construction of secretion vectors for TNF.
Figure 9 shows the construction of secretion vectors for TPA.
Detailed Description Of The Invention
In order that the invention may be more fully understood, the following detailed description is provided. In this specification, some of the following terms are used:
Cloning - The process of obtaining a population of organisms or DNA sequences derived from one such organism or sequence by asexual reproduction.
Recombinant DNA Molecule or Hybrid DNA - A molecule consisting of sequences of DNA from different genomes which have been joined end-to-end outside of living cells and that can be maintained in living cells.
Protein - A polypeptide containing a linear series of more than fifty amino acids, e.g., proinsulin, serum albumin, human growth hormone, parathyroid hormone, and interferon. As used herein, however, a protein also comprises a polypeptide chain of fewer than fifty amino acids.
Polypeptide - A linear series of amino acids connected one to the other by peptide bonds between the amino and carboxy groups of adjacent amino acids.
Expression - The process undergone by a gene to produce a polypeptide or protein. It is a combination of transcription and translation.
Transcription - The process of producing mRNA from a gene.
Translation - The process of producing a protein or polypeptide from mRNA.
Promoter - The region of DNA responsible for binding RNA polymerase to initiate transcription. In bacterial expression systems a promoter is located before the ribosome binding site. A strong promoter is defined as one that has a strong affinity for RNA polymerase and that would accordingly be expected to aid in obtaining high expression rates. A promoter of intermediate strength has a lower affinity for RNA polymerase. In recombinant systems, the strength of a promoter may be affected by the host, the desired protein and the expression vector of the recombinant system. The effects of these factors are more fully discussed below. In systems that do not secrete the desired protein, i.e., intracellular expression systems, levels of mRNA and levels of expressed protein are proportional to promoter strength. In intracellular expression systems, no changes of plasmid copy number have been observed with variation of promoter strength. J. Mellor et al., "Factors Affecting Heterologous Gene Expression in Saccharomyces cerevisiae," Gene, 33, pp. 215-26 (1985). In secretion expression systems, we have discovered that promoter strength may affect the number of copies of a multicopy expression vector produced within a given host. In secretion expression systems, a host will produce many copies of a multicopy vector that contains an intermediate or weak promoter but will produce fewer copies of a multicopy vector that contains a strong promoter. An estimate of the strength of a promoter may be based on the percent of total mRNA in a cell produced by the expression control sequences of that promoter. Table 1 shows a list of genes, including their associated promoters and associated mRNA, as described in the literature. A characterization of the promoter strength is also included.
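[Table 1 (image not reproduced): genes, associated promoters, associated mRNA levels, and characterization of promoter strength]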
1. J. L. Bennetzen and B. D. Hall, "Codon Selection in Yeast," J. Biol. Chem., 257, pp. 3026-31 (1982).
2. C. H. Kim and J. R. Warner, "Messenger RNA for Ribosomal Proteins in Yeast," J. Mol. Biol., 165, pp. 79-89 (1983).
3. H. J. Himmelfarb et al., "Isolation of the SUP45 Omnipotent Suppressor Gene of Saccharomyces cerevisiae and Characterization of its Gene Product," Mol. Cell. Biol., 5, pp. 816-22 (1985).
4. M. J. Dobson et al., "Expression in Saccharomyces cerevisiae of Human Interferon-Alpha Directed by TRP1 5' Region," Nucl. Ac. Res., 11, pp. 2287-2302 (1983).
Generally, promoters of glycolytic genes are considered "strong." Those skilled in the art will understand, however, that promoters may not have the same strength within different strains of a host, with different desired proteins or in different multicopy vectors. In S. cerevisiae, for example, the MFα1 promoter is active in MATα strains, but inactive in MATa strains. Those skilled in the art will appreciate that promoter strength depends upon many factors. Merely because a promoter is unacceptable with respect to one combination of host, vector and desired protein does not mean that the promoter will not be acceptable for another combination of host, vector and desired protein.
Promoters of the present invention should be selected so that the combination of promoter strength and copy number of the multicopy vector provides optimum yields of desired protein. Promoters of intermediate strength provide the best combination of promoter strength with copy number. Weaker promoters, in combination with high copy numbers, are also within the scope of the invention. Strong promoters adversely affect the copy number of the multicopy vectors within the transformed host, and unacceptable yields are obtained from those transformants. A promoter should be selected that has maximal strength without significantly reducing plasmid copy number.
Generally, a promoter may be considered of at most intermediate strength with respect to a given combination of host, vector and desired protein if the percent of total mRNA in the host corresponding to the promoter is less than 0.3%, and preferably is at most 0.15%.
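
The numerical criterion above can be expressed as a simple classification rule. The following is a minimal sketch, assuming only the two thresholds stated in the text (less than 0.3%, preferably at most 0.15%, of total mRNA). The function name `classify_promoter` and its return labels are hypothetical, and the rule ignores the host, vector and desired-protein dependence that the text emphasizes.

```python
# Thresholds taken from the text; names and labels are illustrative only.
INTERMEDIATE_MAX_PCT = 0.3   # "less than 0.3%" of total mRNA
PREFERRED_MAX_PCT = 0.15     # "preferably is at most 0.15%"


def classify_promoter(mrna_pct: float) -> str:
    """Classify a promoter by the percent of total mRNA it accounts for."""
    if mrna_pct < 0:
        raise ValueError("mRNA percentage cannot be negative")
    if mrna_pct <= PREFERRED_MAX_PCT:
        return "at most intermediate (preferred range)"
    if mrna_pct < INTERMEDIATE_MAX_PCT:
        return "at most intermediate"
    return "stronger than intermediate"


if __name__ == "__main__":
    # 0.05 is the CYC1 mRNA level quoted elsewhere in this document
    # (as a percent of total poly(A) RNA).
    print(classify_promoter(0.05))  # at most intermediate (preferred range)
    print(classify_promoter(0.2))   # at most intermediate
    print(classify_promoter(0.3))   # stronger than intermediate
```

Because the text defines the criterion with respect to a given combination of host, vector and desired protein, any such value would have to be measured for that combination rather than taken from a general table.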
Ribosome Binding Site - The region of DNA which codes for a site on mRNA which helps the mRNA bind to the ribosome, so that translation can begin. In bacterial expression systems, a ribosome binding site is located after (downstream from) the promoter and before (upstream from) the translational start signal of the DNA sequence to be expressed to produce the desired protein.
Gene - A DNA sequence which encodes, as a template for mRNA, a sequence of amino acids characteristic of a specific polypeptide or protein.
Expression Control Sequence - A DNA sequence that controls and regulates expression of genes when operatively linked to those genes. Such sequences include the lac system, the β-lactamase system, the trp system, the tac and trc systems, the major operator and promoter regions of phage λ, the control region of fd coat protein, the early and late promoters of SV40, promoters derived from polyoma virus and adenovirus, metallothionein promoters, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast α-mating factors, and other sequences known in the art to control the expression of genes in prokaryotic or eukaryotic cells and their viruses or combinations thereof. For mammalian cells, the gene can be linked to a eukaryotic promoter, such as that for the SV40 early region, coupled to the gene encoding dihydrofolate reductase and selectively amplified in Chinese hamster ovary cells to produce a cell line containing many copies of actively transcribed eukaryotic genes. Those skilled in the art will appreciate that not every expression control sequence listed above as an example may be suitable for use with each host, desired protein and vector combination.
Precursor of a Protein - A protein with a signal sequence operatively linked to the protein. Typically, the precursor is synthesized within a host cell, e.g., preproinsulin, preserum albumin, prehuman growth hormone, preparathyroid hormone, and preinterferon. In accordance with this invention, a mature protein is obtained by secreting the precursor through the cell membrane of a host with an attendant loss or clipping of the signal sequence of its precursor.
Nucleotide - A monomeric unit of DNA or RNA consisting of a sugar moiety (pentose), a phosphate, and a nitrogenous heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon (1' carbon of the pentose) and that combination of base and sugar is called a nucleoside. The base characterizes the nucleotide. The four DNA bases are adenine ("A"), guanine ("G"), cytosine ("C") and thymine ("T"). The four RNA bases are A, G, C and uracil ("U").
DNA Sequence - A linear series of nucleotides connected one to the other by phosphodiester bonds between the 3' and 5' carbons of adjacent pentoses.
Codon - A DNA sequence of three nucleotides (a triplet) which encodes, through messenger RNA ("mRNA"), an amino acid, a translational start signal or a translational termination signal. For example, the nucleotide triplets TTA, TTG, CTT, CTC, CTA and CTG encode for the amino acid leucine ("Leu"), TAG, TAA and TGA are translational stop signals and ATG is a translational start signal.
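
The codon examples in this definition can be illustrated with a short script. The following is a minimal sketch covering only the codons named in the definition (six leucine codons, three stop signals and the ATG start signal). The test sequence and the names `split_codons` and `annotate` are hypothetical, chosen for illustration.

```python
# Codon sets exactly as listed in the definition above.
LEU_CODONS = {"TTA", "TTG", "CTT", "CTC", "CTA", "CTG"}
STOP_CODONS = {"TAG", "TAA", "TGA"}
START_CODON = "ATG"  # in the standard genetic code ATG also encodes methionine


def split_codons(dna: str) -> list[str]:
    """Split an in-frame DNA string into triplets, ignoring spaces and case."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def annotate(dna: str) -> list[str]:
    """Label each codon as start, stop, Leu, or other."""
    labels = []
    for codon in split_codons(dna):
        if codon == START_CODON:
            labels.append("start")
        elif codon in STOP_CODONS:
            labels.append("stop")
        elif codon in LEU_CODONS:
            labels.append("Leu")
        else:
            labels.append("other")
    return labels


if __name__ == "__main__":
    # Hypothetical five-codon sequence, not taken from the specification.
    print(annotate("ATG TTA CTG GCT TAA"))
    # ['start', 'Leu', 'Leu', 'other', 'stop']
```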
Plasmid - A non-chromosomal double-stranded DNA sequence comprising an intact "replicon" such that the plasmid is replicated in a host cell. When the plasmid is placed within a host organism, the characteristics of that organism are changed or transformed as a result of the DNA of the plasmid. For example, a plasmid carrying the gene for tetracycline resistance (TetR) transforms a host cell previously sensitive to tetracycline into one which is resistant to it. A host cell transformed by a plasmid is called a "transformed host" or a "transformant."
Phage or Bacteriophage - Bacterial virus which may include DNA sequences contained in a protein envelope or coat ("capsid").
Cloning Vehicle - A plasmid, phage DNA or other DNA sequence which is able to replicate in a host cell, characterized by one or a small number of endonuclease recognition sites at which its DNA sequence may be cut in a determinable fashion without attendant loss of an essential biological function of the DNA, e.g., replication, production of coat proteins or loss of promoter or binding sites, and which contains a marker suitable for use in the identification of transformed cells, e.g., tetracycline resistance or ampicillin resistance. A cloning vehicle is also known as a vector. In the present invention, the vector must be a "multicopy" vector, i.e., capable of producing multiple copies of itself within the host. Single copy vectors, incapable of producing copies, are not within the scope of this invention.
Host - An organism which on transformation by a cloning vehicle enables the cloning vehicle to replicate and to accomplish its other biological functions, e.g., the production of polypeptides or proteins through expression of the genes of a plasmid.
DNA Signal Sequence - A DNA sequence which encodes, as a template for mRNA, a sequence of typically hydrophobic amino acids at the amino terminus of the polypeptide or protein, i.e., a "signal sequence" or "secretion leader sequence" of the protein. For secretion, such a DNA signal sequence is operatively linked to and located immediately before the DNA sequence of the desired protein and after its translational start signal (e.g., ATG). It is believed that only a portion of a signal sequence of a precursor of a protein is essential for the precursor of the protein to be transported through the cell membrane of a host and for proper clipping of the precursor's signal sequence to free the protein during secretion. Combinations of signal sequences or parts of signal sequences may also be used provided that proper processing takes place in the host. Hence, the term "DNA signal sequence" as used herein includes those DNA sequences that code for that portion of a signal sequence required for secretion.
In accordance with this invention, to obtain expression of a desired protein, an expression system of this invention is operatively linked to a DNA sequence coding for the desired protein and used to transform an appropriate host. The host may then be cultured under appropriate conditions of growth. The desired protein may then be isolated from the culture. Optimum results will, of course, depend on a number of variables, including the appropriate choice of host, promoter, vector, and growth conditions.
Many hosts have been used in recombinant DNA technology and are well known in the art. A wide variety of hosts may be useful in the method of this invention. These hosts include, for example, bacteria, such as E. coli (for example E. coli HB101 or E. coli MC1061), Bacillus, Streptomyces, and Pseudomonas; fungi, such as yeasts; animal cells, such as CHO cells, mouse, swine, bovine, fowl or fish cells; plant cells in tissue culture; human tissue cells; or other hosts known in the art.
The selection of an appropriate host is controlled by a number of factors recognized by the art. These include, for example, compatibility with the chosen vector, ease of recovery of the desired protein, expression characteristics, biosafety and costs. In the present invention, the host must be able to recognize both the promoter used and the secretion signal sequence used. In addition, the toxicity of the desired protein on the host must be considered, as well as the viability of the host when transformed by a multicopy vector. No absolute choice of host may be made for a particular recombinant DNA molecule or polypeptide from any of these factors alone. Instead, a balance of these factors must be struck with the realization that not all hosts may be equally effective for expression of a particular DNA sequence operatively linked to a particular expression control sequence. While the present invention is not limited to S. cerevisiae, yeast is a preferred host, and S. cerevisiae is especially preferred.
Various promoters may be used in the expression systems of this invention, provided, however, that the promoter is active in the chosen host and is of at most intermediate strength.
The toxicity of the desired protein to the host affects the promoter choice; for secretion of a less toxic protein, a relatively stronger promoter can be used than for secretion of a more toxic protein. The coding region attached to a promoter in a multicopy vector also affects the strength of the promoter. For example, the pgk promoter is very strong when attached to its own coding region (PGK), but weak when attached to an alpha interferon gene. J. Mellor et al., "Factors Affecting Heterologous Gene Expression in Saccharomyces cerevisiae," Gene, 33, pp. 215-26 (1985). In choosing a promoter it should also be appreciated that the junction of a strong promoter to a heterologous gene will considerably weaken the strong promoter, as described by Mellor et al., supra. Because promoter strength is affected by so many variables, the fact that a promoter is too strong for use in one host-vector-desired protein combination does not exclude the use of that promoter in a different combination.
Promoters that may be considered for use in recombinant DNA molecules made from the present invention include, for the yeast S. cerevisiae, the promoters of the ACT (actin) gene, the CYC1 (iso-1-cytochrome c) gene and the URA3 gene. Kim and Warner, J. Mol. Biol., 165, pp. 79-89 (1983) (mRNA level about 0.008%). For secretion expression systems, strong promoters not suitable for use with S. cerevisiae include, for example, the G3PDH and the enolase promoters.
Preferred promoters in secretion expression systems for use with bacteria such as E. coli and B. subtilis should be of at most intermediate strength. Strong promoters, such as the trp, trc and lac promoters, should generally be avoided. Intermediate promoter strength might, however, conveniently be adjusted with regulated promoters, such as the trp or lac promoters, if intermediate inducing growth conditions are chosen.
In the examples, we have used promoters that are active during all phases of growth. If the promoters were to be shut off during growth, the adjustment of vector copy number would presumably not take place unless the promoters were induced. Regulated promoters, however, would probably not provide an advantage. Presumably, turning on a strong promoter in a high-copy secretion construction would immediately lyse the host, as in insulin secretion by E. coli.
As discussed above, we have discovered that, for secretion expression systems, a host makes few copies (i.e., low copy numbers) of multicopy vectors if the promoter is strong in the host-vector-desired protein system, and more copies (i.e., higher copy numbers) of multicopy vectors having a weaker promoter, resulting in improved yields of desired protein. However, the use of a promoter weaker than the CYC1 promoter discussed in Example 3 (e.g., CYC7 or trp1) will not improve yields through higher copy numbers, because maximal copy numbers are seen for the CYC1 promoter construction (Table 2). Yields from the weakest promoters are limited by promoter strength, while the strongest promoters permit only limited copy numbers.
Though not wishing to be bound by any theory, it appears that high copy numbers combined with strong promoters result in high levels of mRNA, oversecretion of the desired protein or both. In a population of transformants containing cells transformed by vectors having strong promoters, a mixture of copy numbers occurs in the population. Those cells having high copy numbers with strong promoters will stop growing and eventually lyse, while those transformants with lower copy numbers and the same strong promoter will survive. The population will eventually contain only low copy number transformants. See generally T. J. Silhavy et al., "Mechanisms of Protein Localization," Microbiol. Rev., 47, pp. 313-34 (1983).
The multicopy vector and, in particular, the sites chosen therein for insertion of the expression systems of this invention are determined by a variety of factors, e.g., number of sites susceptible to a particular restriction enzyme, size of the protein to be expressed, expression characteristics such as the location of start and stop codons relative to the vector sequences, and other factors recognized by those of skill in the art. The choice of a vector is determined by a balance of these factors, not all selections being equally effective for a given case. The preferred vectors are plasmids, such as those containing origins of replication derived from chromosomal and non-chromosomal DNA. In the yeast S. cerevisiae, the various ARS- and 2μ-type vectors (Botstein et al.) are preferred. In E. coli, multicopy plasmids, such as ColE1-, pBR322 and RP4 plasmids, or M13 and lambda phage vectors, can be used. For animal cells, non-integrating multicopy vectors, such as vectors based on the bovine papilloma virus, are preferred. The choice of vector will also be influenced by the chosen promoter; optimal promoter and vector systems will lead to less toxicity of the secreted protein (due to too high copy number or overproduction based on too strong a promoter) and, consequently, higher levels of secretion may be obtained.
The heterologous secretion signal sequences of this invention must be recognized and correctly processed by the host. The signal sequence may comprise a naturally occurring signal sequence, an effective portion of a naturally occurring signal sequence, or a combination of two or more signal sequences or effective portions of signal sequences. The signal sequence must be operatively linked to both the promoter and the start signal, ATG. Preferably, the signal sequence begins with the translation start signal. In a preferred embodiment, the signal sequences is selected from signal sequences already used by the host to aid secretion of its own proteins.
The desired protein may include any polypeptide or protein, and may include fusions, preproteins, immature proteins or any desired sequence of amino acids. Preferably, however, desired proteins of this invention include proteins, and other amino acid sequences, that are secretable by some microorganism or cell. Examples of such secretable proteins and other amino acid sequences include serum proteins, analgesic polypeptides such β-endorphin, somatostatin, insulin, growth hormone (human and bovine), luteinizing hormone, ACTH, pancreatic polypeptide preproteins, preproinsulin, proinsulin, and the A and B chains of insulin. The desired proteins of the present invention may comprise proteins and other amino acid sequences naturally secreted into the medium by the selected host.
The desired protein of the present invention need not be secreted by the selected host. As will be seen in Example 6, the host need only manufacture the desired protein operatively linked to the chosen signal sequence and correctly process the desired protein into the endoplasmic reticulum. If the desired protein enters the secretion pathway of the host, benefits may be obtained for the desired protein, such as glycosylation. Secretion is, however, the preferred result for the desired protein.
The present invention may be useful in mammalian cells, if non-integrating, multicopy vectors, such as vectors derived from bovine papilloma virus (D. DiMaio et al., "Bovine Papillomavirus Vector that Propagates as a Plasmid in Both Mouse and Bacterial Cells," Proc. Natl. Acad. Sci. USA, 79, pp. 30-34 (1982)) are used in conjunction with an intermediate strength promoter. The following non-limiting examples will serve to further illustrate the present invention.
EXAMPLE 1
Preparation Of MFα1/SMC DNA Sequences
This example describes the preparation of a transformed yeast which is used in a later example for comparative purposes. The transformed yeast of this example has a plasmid containing a MFα1 promoter (which is a strong promoter in yeast as shown below) operatively linked to a MFα1 secretion leader and a DNA sequence coding for SMC. Other examples describe the substitution of the ACT and CYC1 promoters for the MFα1 promoter, and a comparison of the secretion levels of the variously transformed yeasts. In addition, other examples describe the preparation of transformed yeasts having plasmids characterized by DNA sequences coding for desired proteins other than SMC.
We first isolated the gene for pre-MFα1 and its associated expression control systems and we then operatively linked the MFα1 promoter and signal sequence to a DNA sequence for SMC.
The DNA sequence coding for the precursor of MFα1 has been sequenced by J. Kurjan and I. Herskowitz, "Structure of a Yeast Pheromone Gene (MFα): A Putative α-Factor Precursor Contains Four Tandem Copies of Mature α-Factor," Cell, 30, pp. 933-43 (1982). We isolated the gene coding for the pre-MFα1 and its associated expression control sequences from S. cerevisiae using the library constructed by K. A. Nasmyth and S. I. Reed, "Isolation of Genes by Complementation in Yeast: Molecular Cloning of a Cell-Cycle Gene," Proc. Natl. Acad. Sci. USA, 77, pp. 2119-23 (1980). Instead of using the cdc28 mutant described in Nasmyth and Reed for isolation and cloning of the cdc28 gene, we used the following oligonucleotide corresponding to amino acids 97 to 102 of the published pre-MFα1:
(5') GTA CAT TGG TTG C/GCC G/A/TGG (3')
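The slashes in the oligonucleotide above can be read as mixed positions, so that the probe is a pool of closely related sequences rather than a single sequence. The following is a minimal sketch, assuming that "C/G" and "G/A/T" each mark one position at which any of the listed bases may occur. The names DEGENERATE_OLIGO and expand are illustrative only and do not come from the specification.

```python
from itertools import product

# One entry per position; an entry with several letters is a mixed position.
# Reading of the printed probe: GTA CAT TGG TTG [C/G]CC [G/A/T]GG (assumption).
DEGENERATE_OLIGO = [
    "G", "T", "A", "C", "A", "T", "T", "G", "G", "T", "T", "G",
    "CG", "C", "C", "GAT", "G", "G",
]

def expand(positions):
    """Return every concrete sequence encoded by a degenerate oligonucleotide."""
    return ["".join(combo) for combo in product(*positions)]

pool = expand(DEGENERATE_OLIGO)
for seq in pool:
    print(seq)
print(len(pool), "sequences in the pool")
```

Under this reading the pool holds 2 x 3 = 6 sequences of 18 bases each.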
The MFα1 secretion precursor, comprising a secretion leader and four MFα1 repeats, is shown in Figure 1A. The stippled regions indicate spacer regions between the MFα1 repeats. Before substituting a desired heterologous protein for the four MFα1 repeats, as shown in Figure 1B, the pre-MFα1 gene and associated expression control sequence, which resided on a 1.7 kb EcoRI fragment, were subcloned into pUC18, the selected cloning vector. The resulting plasmid (p220/3) now included an MFα1 promoter, an MFα1 signal sequence and an MFα1 DNA coding sequence. In order to isolate the MFα1 coding sequence from the MFα1 promoter and MFα1 signal sequence, we subjected plasmid p220/3 to mutagenesis using the procedure described in B. A. Oostra et al., "Transforming Activity of Polyoma Virus Middle-T Antigen Probed by Site-Directed Mutagenesis," Nature, 304, pp. 456-59 (1983). As a result of the mutagenesis, a convenient BglII site was introduced at the junction between the secretion leader and the coding sequence for the repeats in the MFα1 DNA sequence as follows:
modified oligomer: (3') TA TTT TCT CTA GAA CTT CGA ACC (5')
original sequence: (3') TA TTT TCT CTC CGA CTT CGA ACC (5')
amino acids:               lys arg glu ala glu ala trp
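The effect of this mutagenesis can be checked by complementing the two printed strands and searching for the BglII recognition sequence (AGATCT). The sketch below uses only the two oligomer sequences given above. The helper name complement and the variable names are illustrative.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
BGLII = "AGATCT"  # BglII recognition sequence

def complement(strand):
    """Complement each base; a strand written 3'->5' yields its partner written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in strand)

# Strands as printed in the text (written 3'->5'), spaces removed.
modified_3to5 = "TATTTTCTCTAGAACTTCGAACC"
original_3to5 = "TATTTTCTCTCCGACTTCGAACC"

modified = complement(modified_3to5)
original = complement(original_3to5)

# Positions (0-based, on the 5'->3' partner strand) changed by the mutagenesis.
changed = [i for i, (a, b) in enumerate(zip(original, modified)) if a != b]

print(original, "BglII site present:", BGLII in original)
print(modified, "BglII site present:", BGLII in modified)
print("changed positions:", changed)
```

Three adjacent base changes are enough to create the site immediately after the lys-arg codons (AAA AGA), while the flanking sequence, including the downstream AAGCTT (HindIII) sequence, is unchanged.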
The resulting plasmid was called p254 (Figure 2). It was further modified to prevent unintended cleavage at a BglII site found at the 5' end of the MFα1 promoter. We altered this undesired BglII site by cleaving with BglII, filling in the protruding ends using the large (Klenow) fragment of DNA polymerase I and deoxynucleotide triphosphates, and then ligating using T4 ligase. To allow for a different cleavage in the plasmid (Examples 5 and 6), a StuI site was introduced, in a procedure similar to the mutagenesis described above, at the position corresponding to the same lys-arg cleavage site:
modified oligomer: (5') TA AAA AGG CCT CTT GAA GC (3')
BglII sequence:    (5') TA AAA AGA GAT CTT GAA GC (3')
                           lys arg
We then made a MFα1/SMC (secretion leader/desired protein) fusion by inserting a 500 bp HindIII fragment carrying a synthetic SMC gene starting with a unique NcoI site into the HindIII site of plasmid p254 (described above), which contained the MFα1 promoter and secretion leader, and obtained plasmid F-9 as shown in Figure 2. The synthetic SMC gene is described in G. Buell et al., "Optimizing the Expression in E. coli of a Synthetic Gene Encoding Somatomedin-C (IGF-I)," Nucl. Ac. Res., 13, pp. 1923-38 (1985). We then cut plasmid F-9 with BglII and NcoI, with simultaneous S1 treatment and religation, to remove the Δ portion of plasmid F-9 and obtain pS30/25, shown in Figure 2. The correct fusion of the MFα1 secretion leader and the SMC structural DNA sequence had the following sequence (glycine is the first amino acid of SMC):
(secretion leader)-AAA-AGA-GGT-CCA-(SMC)
                   lys arg gly pro
As a result, the SMC DNA sequence was operatively linked to the MFα1 secretion leader so that the SMC protein would be properly secreted by yeast. As outlined in Figure 2, we introduced the MFα1/SMC fusion into a yeast shuttle vector carrying origins of replication for E. coli and yeast (ori and the 2μ origin of replication, respectively), as well as selectable markers for both organisms (E. coli: bla; yeast: URA3). SMC secretion rates by S. cerevisiae transformed with this expression vector are shown in Figure 3.
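The reading frame across the leader/SMC junction given in this example (AAA-AGA-GGT-CCA) can be confirmed by translating it with the standard genetic code. The following is a minimal sketch. The table is built from the usual TCAG codon ordering, and the function name translate is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in one-letter amino acid symbols ('*' = stop),
# listed in codon order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence is not a whole number of codons")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

# Junction as printed in the text, hyphens removed.
junction = "AAA-AGA-GGT-CCA".replace("-", "")
print(translate(junction))  # K R G P = lys arg gly pro
```

The lys-arg pair closes the secretion leader and gly-pro opens SMC, in agreement with the amino acids printed under the junction.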
EXAMPLE 2
Preparation Of ACT/MFα1/SMC DNA Sequences
The DNA sequence encoding actin in yeast is described in D. Gallwitz and I. Sures, "Structure of a Split Yeast Gene: Complete Nucleotide Sequence of the Actin Gene in Saccharomyces cerevisiae," Proc. Natl. Acad. Sci. USA, 77, pp. 2546-50 (1980). Levels of the mRNA coding for actin (D. Gallwitz et al., "The Actin Gene in the Yeast Saccharomyces cerevisiae: 5' and 3' End Mapping, Flanking and Putative Regulatory Sequences," Nucl. Ac. Res., 9, pp. 6339-50 (1981)) suggest that the ACT promoter is a promoter of intermediate strength; see Table 1. See also Himmelfarb, supra.
In order to use the ACT promoter in the constructions of this and the following examples, we placed convenient restriction sites at the end of the promoter. Plasmid pYA301, shown in Figure 4, contains the ACT gene and its associated expression control sequences on a 4 kb EcoRI-BamHI fragment. Plasmid pYA301 is a 4 kb EcoRI-BamHI subclone of pYA208 (Gallwitz and Sures, supra) inserted into pBR322. We cut plasmid pYA301 at the single XhoI site and treated it with Bal31. Following an empirically determined time of Bal31 treatment, we cut the plasmid with EcoRI and filled in its protruding ends using the large (Klenow) fragment of DNA polymerase I. Finally, the treated plasmid was religated under dilute conditions. Approximately half of the resulting plasmids, identified collectively as pΔ4, contained an EcoRI site. These EcoRI sites had been formed by joining the filled-in EcoRI end to the Bal31 ACT promoter end. Structures of various pΔ4 promoter ends in the vicinity of the ATG start codon of the religated plasmid are shown in Figure 4B.
To operatively link the isolated ACT promoter to the MFα1 signal sequence/SMC fusions, we matched the 3' end of the ACT promoter to the amino terminal end of the MFα1/SMC fusion. We accomplished this by introducing an EcoRI site upstream (i.e., earlier in the transcription reading direction) of the ATG start codon of the MFα1 secretion leader by carrying out the following mutagenesis using the procedure described in Oostra et al., supra:
modified oligomer: (5') A ATA TAA ACG AAT TCA AGA ATG AG (3')
original sequence: (5') A ATA TAA ACG ACC AAA AGA ATG AG (3')
We named the resulting plasmid p269/20.
We joined the BamHI-EcoRI fragment of pEX-7 (Fig. 4) isolated above and containing the ACT promoter to the EcoRI-SalI fragment of plasmid p269/20 (which carried the MFα1 secretion leader) and inserted the fragment into a BamHI- and SalI-cut pUC8 vector as shown in Figure 5. We further modified the resulting plasmid (p309/1) by replacing its MFα1 secretion leader region with the secretion leader region, joined to the SMC gene, of plasmid pS30/25. Plasmid pS30/25 is shown in Figure 2. The modification was carried out by cutting plasmid p309/1 with PstI and HindIII. The large fragment was isolated and joined to a fragment of pS30/25 resulting from partial digestion of that plasmid with PstI and HindIII. The resulting plasmid, p355/1 (ACT promoter/MFα1 secretion leader/SMC structural DNA sequence), was transferred to a yeast shuttle vector, as shown in Figure 5, which resulted in formation of plasmid p364/1. SMC secretion rates by yeast transformed with this plasmid are shown in Figure 3.
EXAMPLE 3
Preparation Of CYC1/MFα1/SMC Plasmids
Under inducing conditions, iso-1-cytochrome c (CYC1) amounts to approximately 0.2% of the total cellular protein of yeast, and the encoding mRNA amounts to about 0.05% of the total poly(A) RNA. R. S. Zitomer and B. D. Hall, "Yeast Cytochrome c Messenger RNA. In Vitro Translation and Specific Immunoprecipitation of the CYC1 Gene Product," J. Biol. Chem., 251, pp. 6320-26 (1976). The CYC1 promoter is thus a weak promoter.
In order to construct a CYC1 promoter/MFα1 secretion leader/SMC coding sequence, we isolated an EcoRI-HindIII fragment carrying the MFα1 secretion leader correctly fused to the SMC gene from plasmid p355/1, which is shown in Figure 5. We inserted this fragment between the EcoRI and HindIII sites of pEX-2 to obtain plasmid p446/1 (see Figure 5). pEX-2 is described in J. F. Ernst and R. C. Chan, "Characterization of S. cerevisiae Mutants Supersensitive to Aminoglycoside Antibiotics," J. Bacteriol., 163, pp. 8-14 (1985).
The vector p446/1 expressed the MFα1/SMC fusion under the control of the CYC1 promoter. SMC secretion rates from yeast transformed by this vector are shown in Figure 3.
EXAMPLE 4
Comparison Of Secretion Of SMC Using The MFα1, ACT And CYC1 Promoters With Both The URA3 Gene And The LEU2 Gene On Expression Vectors
The construction of tripartite SMC expression vectors based on the MFα1 secretion leader and the MFα1, the ACT, and the CYC1 promoters has been described above. Each of these vectors contained the yeast URA3 gene for selection in yeast. We also constructed vectors based on selection using the yeast LEU2 gene. The expression of the plasmid LEU2 gene is low due to a partial deletion; higher copy numbers of the LEU2-type expression vectors are needed to allow the growth of transformants on leucine-free media. E. Erhart and C. P. Hollenberg, "The Presence of a Defective LEU2 Gene on 2μ DNA Recombinant Plasmids of Saccharomyces cerevisiae is Responsible for Curing and High Copy Number," J. Bacteriol., 156, pp. 625-35 (1983). For a discussion of the use of yeasts having the URA3 gene or the LEU2 gene in recombinant DNA experiments, see D. Botstein et al., "Sterile Host Yeast (SHY): a Eukaryotic System for Biological Containment for Recombinant DNA Experiments," Gene, 8, pp. 17-24 (1979).
We constructed the LEU2-type vectors by inserting the chosen promoter/MFα1 secretion leader/SMC structural sequence into plasmid pJDB207, using procedures similar to those used with the URA3 plasmids. The structures of the resulting plasmids are shown in Figure 6. The plasmid pJDB207 is described in J. D. Beggs, "Multiple-copy Yeast Vectors," Molecular Genetics in Yeast, Alfred Benzon Symposium 16, pp. 383-95 (1981). We transformed the three URA3-type vectors derived from plasmid pEX-2 (which is described in J. F. Ernst and R. C. Chan, supra) and the three LEU2-type vectors into yeast strain BJ1991.
We deposited two of the recombinant yeast strains in the Deutsche Sammlung von Mikroorganismen (West German Culture Collection). One strain, YE439, carries a LEU2-type vector with the ACT promoter/MFα1 secretion leader/SMC structural sequence. It was deposited on October 30, 1985 and is identified as DSM 3578. Another strain, YE466, carries a LEU2-type vector with the CYC1 promoter/MFα1 secretion leader/SMC structural sequence. It was deposited on October 30, 1985 and is identified as DSM 3579.
Transformants were grown in the "SD" medium described in F. Sherman et al., "Methods in Yeast Genetics," Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1981), containing tryptophan and leucine (for URA3-type vectors), or tryptophan and uracil (for LEU2-type vectors), to an optical density of 2 at 600 nm. We used these cultures to inoculate production medium, consisting of SD medium containing 4% casamino acids and tryptophan (inoculum was 10% of final volume of production medium). At appropriate times during growth, 0.5 ml of the cultures were pelleted using a microfuge. We redissolved the cell pellets in 1/10 the original culture volume by boiling 5 min in sodium dodecyl sulphate (SDS) sample buffer, which is described in U. K. Laemmli, "Cleavage of the Structural Proteins During the Assembly of the Head of Bacteriophage T4," Nature, 227, pp. 680-85 (1970). We then determined the SMC levels in the redissolved cell pellets and in the culture fluid after dilution, using a radioimmune assay (Nichols Institute Diagnostics, San Juan Capistrano, California).
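Because the cell pellet is redissolved in 1/10 of the original culture volume, the pellet reading is ten-fold concentrated relative to the culture and has to be scaled back before it is compared with the culture fluid. The following is a minimal sketch of that arithmetic. The function name, parameter names and the example readings are hypothetical and are not data from this specification.

```python
def cell_associated_fraction(pellet_conc, medium_conc, concentration_factor=10.0):
    """Fraction of total SMC that is cell associated.

    pellet_conc: SMC concentration measured in the redissolved pellet.
    medium_conc: SMC concentration in the culture fluid, already corrected
                 for any dilution made before the assay.
    concentration_factor: 10 when the pellet is redissolved in 1/10 of the
                 original culture volume, as described in the text.
    Both readings must be in the same units.
    """
    pellet_per_culture = pellet_conc / concentration_factor
    total = pellet_per_culture + medium_conc
    if total <= 0:
        raise ValueError("no SMC detected in either fraction")
    return pellet_per_culture / total

# Hypothetical readings in arbitrary units, for illustration only.
fraction = cell_associated_fraction(pellet_conc=50.0, medium_conc=95.0)
print(f"{fraction:.1%} of total SMC is cell associated")
```

With these illustrative numbers the pellet contributes 5 units per culture volume against 95 units in the medium, i.e. 5% of the total.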
With all six constructions, less than 10% of the total SMC was cell associated. We purified the SMC present in the culture medium and analyzed it, finding that the amino terminal and carboxyl terminal amino acids were identical to those of human SMC and that the biological activity of the secreted SMC was the same as that of human SMC. Surprisingly, the weakest promoter used, CYC1, resulted in the highest SMC secretion values shown in Figures 3 and 7 and Table 2. This was unexpected because the strongest promoter would have been expected to give the highest secretion values. Unexpectedly, the transformants carrying the CYC1 constructions grew slowest and showed a pronounced optimum curve of SMC secretion, indicating the occurrence of cell lysis and SMC degradation (see Figure 7).
To determine the reason for the unexpected results, we analyzed SMC plasmid copy number and transcript levels in the transformants described herein after three days in production medium. The results are shown in Table 2. We discovered an inverse relationship between copy number and promoter strength in secretion systems. The secretion system with the weakest promoter (CYC1) proved to have the highest copy numbers. The MFα1 constructs had the lowest copy numbers. The average SMC transcript level proved to be proportional to copy numbers for both URA3 and LEU2 constructs. In turn, with LEU2 constructions, mRNA levels were proportional to secretion levels. With URA3 constructions, no significant increase of SMC secretion levels with increased mRNA was observed. While not wishing to be bound by theory, the URA3 constructs do not appear to have the same effect on the SMC secretion levels because of a high percentage of URA3 plasmid loss.
A high percentage of URA3 transformants actually have few or no URA3 expression plasmids. These unproductive transformants contribute little to overall SMC secretion. Conceivably, a high percentage of URA3 transformants also have high copy numbers and high promoter strength. These transformants would contribute little to SMC secretion because of slow growth and reduced cell viability. This suggested mechanism does not apply to LEU2 vectors because the minimum copy number of the LEU2-type vectors in transformants grown in leucine-free media is about 35 (Erhart and Hollenberg, supra). In addition, since the production medium used contains leucine (but no uracil), elevated copy numbers are not required for growth of LEU2 vector-transformants in this medium. Thus, yeast strains transformed with LEU2 vectors appear more homogeneous with respect to copy number and mRNA content than those transformed with URA3 vectors.
With the URA3 transcripts, mRNA levels increased as promoter strength weakened, even though secretion levels remained fairly constant. This trend appears to reflect the lower copy numbers for URA3 constructs. It appears that, for LEU2 transcripts, a copy number of at least 40 is required to obtain minimum levels of mRNA and SMC secretion.
Table 2 does not give the actual ratio of SMC to actin transcripts, and the numbers are for comparative purposes only. Transformation frequencies for all six plasmid constructions are also shown in Table 2. Approximately similar transformation frequencies were obtained with all the URA3-type vectors. However, dramatic differences were seen for the LEU2-type vectors: the MFα1 promoter constructions could not be transformed into yeast at all; relatively high frequencies were observed for the ACT promoter construction; and lower frequencies of transformation, compared to the control (pJDB207) and the ACT construction, were characteristic of the CYC1 construction. We did obtain one rare LEU2-type vector transformant with the MFα1 construction; however, this transformant contained a rearranged plasmid that directed the synthesis of low amounts of SMC, as shown parenthetically in Table 2. This result suggests that the MFα1 promoter does not permit a sufficiently high copy number to overcome the effects of the LEU2 defect. Thus, secretion rate, cell viability, promoter strength and vector copy number appear to be interrelated.
Figure imgf000032_0001
EXAMPLE 5
Secretion of TNF
We constructed fusions of the MFα1 secretion leader to the gene encoding human tumor necrosis factor (TNF) as shown in Figure 8 by using the procedure described above for MFα1/SMC fusions. We then analyzed the effect of promoter replacement on TNF secretion.
We ligated the following three fragments to construct vector pTNF11 (see Figure 8): (1) a 1.2 kb EcoRI-StuI fragment carrying the promoter and the secretion leader of MFα1 ending in a StuI site (described supra); (2) a 9 kb EcoRI-HindIII fragment of pEX-2 carrying the yeast 2μ origin of replication and the URA3 gene for selection in yeast (described supra); and (3) a blunt-ended HindIII fragment (0.9 kb) carrying the TNF gene. Isolation procedures for this third fragment are disclosed in D. Pennica et al., "Human Tumor Necrosis Factor: Precursor Structure, Expression and Homology to Lymphotoxin," Nature, 312, pp. 724-29 (1984). We generated the blunt-ended HindIII fragment by using synthetic oligonucleotide linkers to recreate the 5' end of the TNF gene by extension from an AvaI site situated close to the 5' end of the TNF gene (valine is the first amino acid of TNF). The junction of MFα1 to the TNF gene had the following sequence:
(5') ... AAA AGG GTA CGT TCT TCC ... (3')
         lys arg val arg ser ser
         MFα1    TNF
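The junction above is what a blunt cut at the StuI site (AGG^CCT) followed by blunt-end ligation to the 5' end of the TNF fragment would be expected to give. The sketch below simulates that join using the StuI-modified leader sequence given in Example 1. It assumes that the blunt-ended TNF fragment begins exactly at the GTA (valine) codon, which is inferred from the printed junction rather than stated. The function and variable names are illustrative.

```python
STUI = "AGGCCT"       # StuI recognition sequence
STUI_CUT_OFFSET = 3   # StuI cuts between AGG and CCT, leaving blunt ends

def cut_blunt(seq, site, offset):
    """Cut seq at the first occurrence of site; return (left, right) pieces."""
    pos = seq.find(site)
    if pos < 0:
        raise ValueError("recognition site not found")
    return seq[:pos + offset], seq[pos + offset:]

# End of the MFalpha1 leader carrying the StuI site (Example 1, spaces removed).
leader_end = "TAAAAAGGCCTCTTGAAGC"
# Assumed 5' end of the blunt-ended TNF fragment (first four codons).
tnf_start = "GTACGTTCTTCC"

left, _ = cut_blunt(leader_end, STUI, STUI_CUT_OFFSET)
fusion = left + tnf_start  # blunt-end ligation of leader to TNF

print(left)
print(fusion)
```

The joined sequence ends in AAA AGG GTA CGT TCT TCC, matching the printed junction, and the StuI site is not regenerated by the ligation.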
The construction of a TNF secretion vector based on the ACT promoter, instead of the MFα1 promoter, is also shown in Figure 8. We ligated the following fragments to construct pACT-TNF-EX: (1) a 1.2 kb PstI-HindIII fragment from pTNF11 carrying most of the MFα1 secretion leader fused to the TNF gene (described supra); (2) a 0.5 kb BamHI-PstI fragment from p355/1, shown in Figure 5, carrying the ACT promoter and part of the MFα1 leader (described supra); and (3) the 9 kb BamHI-HindIII fragment of pEX-2 (described supra).
We transformed yeast strain BJ1991 with pTNF11 or pACT-TNF-EX and selected for Ura+ prototrophs. Transformants were grown selectively in liquid SD medium lacking uracil. Shake flasks containing YPD medium, which is disclosed in F. Sherman et al., supra, were inoculated with the minimal-medium cultures (10% of final volume) and incubated at 30°C on a rotary shaker. At appropriate times during incubation, we centrifuged 1 ml samples of the yeast culture for 1 min using a microfuge. We then treated the cell pellet with Zymolyase to generate spheroplasts. We then lysed the spheroplasts using 1% Triton X-100 at 20% of the original culture volume, followed by removal of cell debris by a short centrifugation step. We electrophoresed samples containing TNF on 12.5% SDS-acrylamide gels and transferred the proteins to nitrocellulose by standard methods. We probed the protein blot with rabbit antibodies for human TNF, and we visualized regions of the blot containing TNF/anti-TNF complexes using peroxidase-coupled swine anti-rabbit antibodies. Maximal TNF expression results in the culture fluid and in cell extracts are shown in Table 3. The results indicated that a significant improvement of secretion of TNF was obtained when the MFα1 promoter was replaced by the ACT promoter. Only 10-20% of the total TNF produced was cell associated.
Figure imgf000035_0001
EXAMPLE 6
Secretion of TPA
The gene coding for human tissue plasminogen activator (TPA) carries a convenient BglII site at a position corresponding to the first amino acid of TPA. D. Pennica et al., "Cloning and Expression of Human Tissue-type Plasminogen Activator cDNA in E. coli," Nature, 301, pp. 214-21 (1983). In order to allow correct fusion to the MFα1 secretion leader, we introduced a convenient BglII site at the MFα1 position corresponding to the lys-arg junction. The structure of the MFα1/TPA fusion point in the following constructions is as follows:
(5')...AAA AGA TCT TAC...(3')
       lys arg ser tyr
       MFα1    TPA
We inserted a BglII fragment carrying the MFα1 promoter and secretion leader into the single BglII site of the previously constructed vector pPAY4 as shown in Figure 9. In this manner, the PHO5 promoter and secretion leader were replaced by the MFα1 promoter and secretion leader. We further modified the resulting expression vector, pMF-TPA, by replacing the PHO5 and MFα1 promoters by the ACT promoter, as shown in Figure 9, to obtain expression vector pACT-MF-TPA. The PHO5 promoter present in both TPA constructions is repressed in regular (high-phosphate) production media. Accordingly, the expression results described below are due to the activity of the MFα1 promoter because the tests were carried out in regular (high-phosphate) media.
We used each of the expression plasmids, pACT-MF-TPA and pMF-TPA, to transform yeast cells of strain BJ1991, as described above, selecting for Leu+ prototrophs. We selectively grew transformants in SD medium. We inoculated shake flasks containing YPD medium with the SD cultures (10% of final volume) and incubated at 30°C. At appropriate times during incubation, we prepared cell extracts as described above in Example 4.
We determined TPA activity in cell fractions by halo formation on fibrinogen-plasminogen-agar plates as described in A. Granelli-Piperno and E. Reich, "A Study of Proteases and Protease-Inhibitor Complexes in Biological Fluids", J. Exp. Med., 148, pp. 223-34 (1978). The results shown in Table 4 indicate that replacement of the MFα1 promoter by the ACT promoter is advantageous in obtaining secretion of heterologous proteins by yeast using alpha mating factor fusions.
Figure imgf000036_0001
As shown in Table 4, most of the TPA produced was cell associated; only 5% of the total TPA produced appeared in the medium. However, the following evidence suggested that yeast TPA had entered the secretion pathway:
(1) the TPA expressed was biologically active, indicating correct folding. Attempts to express TPA in yeast without a secretion signal sequence produced non-active TPA; and
(2) the cell associated TPA was glycosylated. Secretion at least into the lumen of the endoplasmic reticulum is necessary for glycosylation to occur.
It will be apparent to those skilled in the art that various modifications may be made in the invention without departing from its spirit or scope, and our basic construction can be altered to provide other embodiments which utilize the processes and compositions of this invention. Therefore, it will be appreciated that the scope of this invention is to be defined by the claims appended hereto rather than the specific embodiments which have been presented as examples.

Claims

WHAT IS CLAIMED IS:
1. A DNA sequence comprising:
(a) a promoter of at most intermediate strength; and (b) a DNA secretion signal sequence, beginning with a start codon, heterologous with respect to said promoter and operatively linked to said promoter.
2. The DNA sequence of claim 1, wherein said start codon is ATG.
3. The DNA sequence of claim 1 or 2, wherein said promoter is selected from the group consisting of: the ACT, CYC1, CYC7, trp1, URA3, LEU2, trp and lac promoters.
4. The DNA sequence of any one of the preceding claims, wherein said DNA secretion signal sequence is the MFα1 secretion signal sequence.
5. A DNA sequence according to any one of the preceding claims further comprising a DNA sequence coding for a desired protein operatively linked to said DNA secretion signal sequence.
6. The DNA sequence of claim 5, wherein said desired protein is capable of being secreted by a microorganism or cell.
7. The DNA sequence of claim 5 or 6, wherein said desired protein is selected from the group consisting of serum proteins, analgesic polypeptides, β-endorphin, somatostatin, SMC, insulin, human growth hormone, bovine growth hormone, luteinizing hormone, ACTH, pancreatic polypeptide preproteins, TPA, TNF, preproinsulin, proinsulin, the A chain of insulin and the B chain of insulin.
8. A multicopy expression vector comprising a DNA sequence according to any one of claims 1-7.
9. The multicopy expression vector of claim 8, wherein said multicopy expression vector is selected from the group consisting of plasmids and multicopy phages.
10. A host transformed by the DNA sequence of any one of claims 1-7.
11. A host transformed by the multicopy expression vector of any one of claims 8-9.
12. The host of any one of claims 10-11, wherein said multicopy expression vector is selected from the group consisting of: ARS-type plasmids, 2μ-type plasmids, ColE1-type plasmids, pBR322, RP4 plasmid, M13 phage and lambda phage.
13. The host of any one of claims 10-12, wherein said promoter does not materially reduce the copy number of said multicopy expression vector in said host.
14. The host of any one of claims 10-13, wherein the expression control sequence of said promoter produces less than 0.3 percent of the total mRNA within said host.
15. The host of claim 14, wherein at most 0.15 percent of the total mRNA within said host is the product of said promoter.
16. The host of any of claims 10-15, wherein said host is selected from the group consisting of: bacteria, fungi, yeasts, animal cells, plant cells in tissue culture or human tissue cells.
17. The host of any one of claims 10-16, wherein said host is selected from the group consisting of: E. coli, Bacillus, Streptomyces, Pseudomonas, S. cerevisiae, CHO cells, mouse cells, swine cells, bovine cells, fowl cells or fish cells.
18. The host of any one of claims 10-17, wherein said host is yeast.
19. The host of claim 18, wherein said yeast is S. cerevisiae.
20. A method for producing a desired protein, comprising culturing the host of any one of claims 10-19.
PCT/EP1986/000675 1985-11-25 1986-11-24 Enhanced secretion of heterologous proteins by hosts using substituted promoters WO1987003300A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP86906826A EP0283475B1 (en) 1985-11-25 1986-11-24 Enhanced secretion of heterologous proteins by hosts using substituted promoters
DE3689846T DE3689846T2 (en) 1985-11-25 1986-11-24 INCREASED EXHIBITION OF HETEROLOGICAL PROTEINS BY HOST CELLS THROUGH THE USE OF SUBSTITUTED PROMOTORS.

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB858529014A GB8529014D0 (en) 1985-11-25 1985-11-25 Enhanced secretion of heterologous proteins
GB8529014 1985-11-25

Publications (1)

Publication Number Publication Date
WO1987003300A1 true WO1987003300A1 (en) 1987-06-04

Family

ID=10588758

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP1986/000675 WO1987003300A1 (en) 1985-11-25 1986-11-24 Enhanced secretion of heterologous proteins by hosts using substituted promoters

Country Status (8)

Country Link
US (1) US5082783A (en)
EP (1) EP0283475B1 (en)
JP (2) JP2528849B2 (en)
AT (1) ATE105865T1 (en)
CA (1) CA1318617C (en)
DE (1) DE3689846T2 (en)
GB (1) GB8529014D0 (en)
WO (1) WO1987003300A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0281246A2 (en) * 1987-02-04 1988-09-07 Invitron Corporation Method to enhance amplification and expression of foreign genes
US5030563A (en) * 1987-07-07 1991-07-09 Genetics Institute, Inc. Bacterial hypersecretion using mutant repressor sequence
EP0436597A1 (en) * 1988-09-02 1991-07-17 Protein Eng Corp Generation and selection of recombinant varied binding proteins.
CN1325650C (en) * 1998-10-16 2007-07-11 联邦科学及工业研究组织 Delivery system for porcine somatotropin
US8067198B2 (en) 2003-06-25 2011-11-29 Prolume Ltd. Protein expression system

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5326700A (en) * 1990-11-06 1994-07-05 Eli Lilly And Company DNA sequences encoding t-PA derivatives with cleavable sites
DE4107612A1 (en) * 1991-03-09 1992-09-10 Behringwerke Ag RECOMBINANT PROTEINS WITH THE IMMUNE REACTIVITY OF HEPATITIS B VIRUS E ANTIGENS (HBEAG), METHODS FOR THEIR PRODUCTION AND THEIR USE IN IMMUNOASSAYS AND VACCINE SUBSTANCES
US5298422A (en) * 1991-11-06 1994-03-29 Baylor College Of Medicine Myogenic vector systems
US5925564A (en) * 1991-11-06 1999-07-20 Baylor College Of Medicine Expression vector systems and method of use
WO1995016772A1 (en) 1993-12-14 1995-06-22 Cornell Research Foundation, Inc. Adenovirus gene expression system
EP0822984A4 (en) 1995-04-27 2000-05-03 Human Genome Sciences Inc Human tumor necrosis factor receptors
US5925351A (en) * 1995-07-21 1999-07-20 Biogen, Inc. Soluble lymphotoxin-β receptors and anti-lymphotoxin receptor and ligand antibodies as therapeutic agents for the treatment of immunological disease
EP0878552A1 (en) * 1997-05-13 1998-11-18 Erasmus Universiteit Rotterdam Molecular detection of chromosome aberrations
WO1999057309A1 (en) * 1998-05-04 1999-11-11 Dako A/S Method and probes for the detection of chromosome aberrations
US9453251B2 (en) 2002-10-08 2016-09-27 Pfenex Inc. Expression of mammalian proteins in Pseudomonas fluorescens
WO2005000898A2 (en) * 2003-06-27 2005-01-06 Biogen Idec Ma Inc. Use of hydrophobic-interaction-chromatography or hinge-region modifications for the production of homogeneous antibody-solutions
PL2336153T3 (en) 2003-11-21 2016-08-31 Pfenex Inc Improved expression systems with SEC-system secretion
CA2560742A1 (en) * 2004-03-23 2005-10-06 Biogen Idec Ma Inc. Receptor coupling agents and therapeutic uses thereof
EP2412816B1 (en) 2004-07-26 2014-12-03 Pfenex Inc. Process for improved protein expression by strain engineering
WO2006074399A2 (en) * 2005-01-05 2006-07-13 Biogen Idec Ma Inc. Multispecific binding molecules comprising connecting peptides
SG154441A1 (en) * 2006-10-20 2009-08-28 Biogen Idec Inc Treatment of demyelinating disorders
US8338376B2 (en) * 2006-10-20 2012-12-25 Biogen Idec Ma Inc. Compositions comprising variant LT-B-R-IG fusion proteins
MX2009004718A (en) 2006-11-02 2009-06-19 Acceleron Pharma Inc Alk1 receptor and ligand antagonists and uses thereof.
CA2677179C (en) 2007-01-31 2016-02-16 Dow Global Technologies Inc. Bacterial leader sequences for increased expression
US9580719B2 (en) 2007-04-27 2017-02-28 Pfenex, Inc. Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins
EP2142651B1 (en) 2007-04-27 2013-05-22 Pfenex Inc. Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins
US20110020830A1 (en) * 2008-03-31 2011-01-27 Schneider Jane C Design for rapidly cloning one or more polypeptide chains into an expression system
AU2009241755B2 (en) 2008-05-02 2015-10-01 Acceleron Pharma Inc. Methods and compositions based on ALK1 antagonists for modulating angiogenesis and pericyte coverage
JP2010183885A (en) * 2009-02-13 2010-08-26 Kobe Univ Method for producing protein and expression vector used therefor
CN108178789B (en) 2011-04-20 2021-11-02 Acceleron Pharma Inc. Endoglin polypeptides and uses thereof
KR20140123558A (en) 2012-02-02 2014-10-22 Acceleron Pharma Inc. Alk1 antagonists and their uses in treating renal cell carcinoma
BR112015022181A8 (en) 2013-03-12 2018-01-23 Massachusetts Eye & Ear Infirmary mullerian inhibiting substance proteins (mis) and their uses for treating diseases
CN105658672A (en) 2013-08-22 2016-06-08 Acceleron Pharma Inc. TGF-beta receptor type II variants and uses thereof
EA035481B1 (en) 2013-10-25 2020-06-23 Acceleron Pharma, Inc. Endoglin peptides to treat fibrotic diseases
WO2015089321A2 (en) 2013-12-11 2015-06-18 The General Hospital Corporation Use of mullerian inhibiting substance (mis) proteins for contraception and ovarian reserve preservation
AU2016301380B2 (en) 2015-08-04 2021-07-01 Acceleron Pharma Inc. Methods for treating myeloproliferative disorders
SI3628049T1 (en) 2017-05-04 2023-10-30 Acceleron Pharma Inc. Tgf-beta receptor type ii fusion proteins and uses thereof

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0200590A1 (en) * 1985-03-25 1986-11-05 Genetica Method for the microbiological synthesis of human serum albumin

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4411994A (en) * 1978-06-08 1983-10-25 The President And Fellows Of Harvard College Protein synthesis
US4338397A (en) * 1980-04-11 1982-07-06 President And Fellows Of Harvard College Mature protein synthesis
US4546082A (en) * 1982-06-17 1985-10-08 Regents Of The Univ. Of California E. coli/Saccharomyces cerevisiae plasmid cloning vector containing the alpha-factor gene for secretion and processing of hybrid proteins
EP0116201B1 (en) * 1983-01-12 1992-04-22 Chiron Corporation Secretory expression in eukaryotes
ATE220101T1 (en) * 1983-04-25 2002-07-15 Chiron Corp HYBRID DNA SYNTHESIS OF MATURE INSULIN-LIKE GROWTH FACTORS
US4588684A (en) * 1983-04-26 1986-05-13 Chiron Corporation a-Factor and its processing signals
IL71991A (en) * 1983-06-06 1994-05-30 Genentech Inc Preparation of mature human IGF and EGF via prokaryotic recombinant DNA technology

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Gene, Volume 35, 1985, M.A. PETERS et al.: "Expression of a Biologically Active Analogue of Somatomedin-C/Insulin-like Growth Factor I", pages 83-89; see the whole document *
Nature, Volume 287, 2 October 1980, D.V. GOEDDEL et al.: "Human Leukocyte Interferon Produced by E. coli is Biologically Active", pages 411-416; see page 414, column 2 *
Nucleic Acids Research, Volume 11, No. 8, 1983, M.J. DOBSON et al.: "Expression in Saccharomyces cerevisiae of Human Interferon-alpha Directed by the TRP1 5' Region", pages 2287-2302; see page 2295, paragraph 2; page 2296 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0281246A2 (en) * 1987-02-04 1988-09-07 Invitron Corporation Method to enhance amplification and expression of foreign genes
EP0281246A3 (en) * 1987-02-04 1989-07-12 Invitron Corporation Method to enhance amplification and expression of foreign genes
US5030563A (en) * 1987-07-07 1991-07-09 Genetics Institute, Inc. Bacterial hypersecretion using mutant repressor sequence
EP0436597A1 (en) * 1988-09-02 1991-07-17 Protein Eng Corp Generation and selection of recombinant varied binding proteins.
EP0436597A4 (en) * 1988-09-02 1992-05-20 Protein Engineering Corporation Generation and selection of recombinant varied binding proteins
CN1325650C (en) * 1998-10-16 2007-07-11 Commonwealth Scientific and Industrial Research Organisation Delivery system for porcine somatotropin
US8067198B2 (en) 2003-06-25 2011-11-29 Prolume Ltd. Protein expression system
US9115364B2 (en) 2003-06-25 2015-08-25 Prolume Ltd. Protein expression system

Also Published As

Publication number Publication date
JP2572952B2 (en) 1997-01-16
ATE105865T1 (en) 1994-06-15
DE3689846D1 (en) 1994-06-23
DE3689846T2 (en) 1994-09-22
CA1318617C (en) 1993-06-01
GB8529014D0 (en) 1986-01-02
US5082783A (en) 1992-01-21
EP0283475B1 (en) 1994-05-18
JPS63502717A (en) 1988-10-13
EP0283475A1 (en) 1988-09-28
JPH07222595A (en) 1995-08-22
JP2528849B2 (en) 1996-08-28

Similar Documents

Publication Publication Date Title
US5082783A (en) Enhanced secretion of heterologous proteins by hosts using substituted promoters
JP2793215B2 (en) Improved expression and secretion of heterologous proteins in yeast using truncated α-factor leader sequences
US6642029B1 (en) Hybrid DNA synthesis of mature insulin-like growth factors
DE3308215C2 (en) Expression, processing and secretion of heterologous protein by yeast
CA2090969C (en) Production of insulin-like growth factor-1 in methylotrophic yeast cells
EP0123544A2 (en) Process for expressing heterologous protein in yeast, expression vehicles and yeast organisms therefor
ERNST Improved secretion of heterologous proteins by Saccharomyces cerevisiae: effects of promoter substitution in alpha-factor fusions
EP0347928A2 (en) Pichia pastoris alcohol oxidase II regulatory region
US5187261A (en) Process for the preparation of mature human serum albumin
JPH07110233B2 (en) Releasable yeast promoter
AU667852B2 (en) Novel DNA molecules and hosts
JP2511251B2 (en) Gene system for the production of mature insulin
WO1992013951A1 (en) Production of human serum albumin in methylotrophic yeast cells
AU665034B2 (en) Production of human parathyroid hormone from microorganisms
EP0127304A1 (en) Process for producing heterologous protein in yeast, expression vehicle therefor, and yeast transformed therewith
JPS6344890A (en) Development system of filamentous fungi
JPH03504561A (en) Production and purification of recombinant human interleukin-3 and its mutant proteins
NO813759L (en) PLASMID, PROCEDURE FOR MANUFACTURING SUCH A PRODUCT, AND USE THEREOF
Ludwig et al. High-level heterologous gene expression in Saccharomyces cerevisiae from a stable 2μm plasmid system
US5521093A (en) Yeast vector coding for heterologous gene fusions linked via KEX2 cleavage site and coding for truncated KEX2 genes
US6183989B1 (en) Process for making desired polypeptides in yeast
US5104795A (en) Shortened phosphoglycerate kinase promoter
EP0310137B1 (en) BAR1 secretion signal
US5879926A (en) Yeast strains for the production of mature heterologous proteins, especially hirudin
EP0245479B1 (en) Shortened phosphoglycerate kinase promoter

Legal Events

Date Code Title Description
AK Designated states Kind code of ref document: A1; Designated state(s): JP
AL Designated countries for regional patents Kind code of ref document: A1; Designated state(s): AT BE CH DE FR GB IT LU NL SE
WWE WIPO information: entry into national phase Ref document number: 1986906826; Country of ref document: EP
WWP WIPO information: published in national office Ref document number: 1986906826; Country of ref document: EP
WWG WIPO information: grant in national office Ref document number: 1986906826; Country of ref document: EP