WO2018201050A1 - Tryptathionine biosynthesis by flavin mono-oxygenase 1 (fmo1) - Google Patents

Tryptathionine biosynthesis by flavin mono-oxygenase 1 (fmo1) Download PDF

Info

Publication number
WO2018201050A1
WO2018201050A1 PCT/US2018/029954 US2018029954W WO2018201050A1 WO 2018201050 A1 WO2018201050 A1 WO 2018201050A1 US 2018029954 W US2018029954 W US 2018029954W WO 2018201050 A1 WO2018201050 A1 WO 2018201050A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
peptide
expression
fmol
promoter
Prior art date
Application number
PCT/US2018/029954
Other languages
French (fr)
Inventor
Jonathan D. WALTON
R. Michael SGAMBELLURI
Original Assignee
Board Of Trustees Of Michigan State University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Board Of Trustees Of Michigan State University filed Critical Board Of Trustees Of Michigan State University
Publication of WO2018201050A1 publication Critical patent/WO2018201050A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P21/00Preparation of peptides or proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/13Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen (1.14.13)
    • C12Y114/13008Flavin-containing monooxygenase (1.14.13.8), i.e. dimethylaniline-monooxygenase

Definitions

  • Cyclic and bicyclic peptides have been found in plants, bacteria, fungi, and animals (Arnison et al., Nat. Prod. Rep. 30:108-160 (2013); Umemura et al., Fung. Genet. Biol. 68:23-30 (2014)).
  • macrocyclized peptides include the cyanobactins from marine bacteria (Donia et al., Nat. Chem. Biol. 2:729-735 (2006)) and the orbitides and cyclotides from plants (Craik & Malik, Curr. Opin. Chem. Biol. 17:546-554 (2013)).
  • Cyclization confers several pharmacologically advantageous attributes to peptides, including increased rigidity, enhanced membrane permeability, and resistance to proteolysis and denaturation (Craik & Allewell, Biol. Chem. 287 :26999-27000 (2012)).
  • many cyclic peptides have potent biological activities against a wide range of medically and ecologically significant targets (Bockus et al., Curr. Top. Med. Chem 13:821-836 (2013); Giordanetto & Kihlberg, J. Med. Chem. 57:278-295 (2014); Walton, Plant Cell. 8:1723-1733 (1996)).
  • compositions and methods involving expression cassettes, enzymes, and methods useful for generating cross-bridges in peptides can involve using nucleic acids encoding a flavin mono-oxygenase (FMO) enzyme. Methods are also described for generating cross-bridges in peptides within in vitro (e.g., cell free) reaction mixtures.
  • FMO flavin mono-oxygenase
  • Methods are also described for generating cross-bridges in peptides within in vitro (e.g., cell free) reaction mixtures.
  • the cross-bridge enzymes can be used in conjunction with other enzymes such as enzymes that cyclize peptides. These cross-bridge and cyclization enzymes are useful for making a variety of crosslinked, bicyclic and/or cyclic peptides.
  • Described herein are expression systems that can include at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
  • FMO flavin mono-oxygenase
  • Host cells containing such expression systems are also described herein, and methods for using such expression systems and host cells.
  • the methods can include incubating a host cell that has one or more expression systems that can include at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, S, 6, or 7, for a time and under conditions sufficient for generation of a
  • FMO flavin mono-oxygenase
  • the expression systems, host cells, and methods can also include use of at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase with at least 95% sequence identity to SEQ ID NO:ll, 13, 14, or 15.
  • tryptathionine cross-bridges between tryptophan (Tip) and cysteine
  • (Cys) residues in the peptide substrate are formed in vivo (within a host cell) by FMOl while in other cases tryptathionine cross-bridges between tryptophan (Tip) and cysteine (Cys) residues in the peptide substrate are formed in vitro (not within a host) by FMOl.
  • additional cellular enzymes and other factors can help optimize cross- bridge formation and/or can facilitate cyclization of peptides. Accordingly, in some cases the cross-bridges are prepared in vivo.
  • the FMOl enzyme can be incubated in vitro with a peptide that includes tryptophan (Tip) and cysteine (Cys) residues to generate a peptide with a tryptathionine cross-bridge between the Tip and Cys residues.
  • Tip tryptophan
  • Cys cysteine
  • the FMOl proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes.
  • FIG. 1A-1B illustrates cyclization of peptides.
  • FIG. 1A illustrates conversion of a 35- amino acid precursor peptide (SEQ ID NO: 19) to a cyclic octapeptide by a prolyl oligopeptidase B from Galerina marginata (GmPOPB) enzyme (SEQ ID NOs: 19 and 27- 29).
  • GmPOPB catalyzes two reactions to convert the linear 35mer precursor peptide (SI) to cyclo(IWGIGCNP, SEQ ID NO:l), an intermediate in amanitin biosynthesis.
  • Product P3 is then cross-linked (Tip to Cys) as illustrated in FIG.
  • FIG. IB illustrates the reaction catalyzed by flavin mono-oxygenase (FMOl) to form tryptathionine.
  • the substrate is the product of POPB.
  • FIG. 2A-2B illustrates how hydroxylation of an indole ring can activate the indole for further reaction including cross-linking to cysteine.
  • FIG. 2A illustrates 3 -hydroxylation of indole as catalyzed by an FMO from Methylophaga.
  • FIG. 2B illustrates reaction via Savige- Fontana synthesis of tryptathionine from 3a-hydroxy-pyrrolo[2,3-b]-indole (Hpi), in which 3- hydoxyTrp is an intermediate. See, Cho et al., J Struct Biol 175:39-48 (2011); May JP, Perrin DM (2007) Tryptathionine bridges in peptide synthesis. Biopolymers 88:714-724.)
  • FIGs. 3A-3C illustrate in vitro expression of FMOl and its catalysis of a tryptathionine crossbridge product.
  • FIG 3 A shows expression of FMOl as a 60-kDa protein fused to a myc tag. Left panel: Coomassie stained gel. Right panel: western blot of the same gel with anti-myc antibodies.
  • FIG. 3B shows LC/MS analysis of substrate (monocyclic cyclo[IWGIGCNP], bottom) and product after reaction with FMOl (bicyclic[IWGIGCNP], top).
  • FIG. 3C shows a mass spectrum of the product eluting at 12.0 min as shown in FIG. 3B, illustrating the parent ion at M+H+ 839 (tallest peak) as predicted for bicyclic compound.
  • FIGs. 4A-4B illustrate that targeted gene disruption of GmFMOl abolishes amanitin biosynthesis.
  • FIG. 4A shows an HPLC trace (UV absorbance at 280 nm) illustrating amanitin made by wild type G. marginata (solid trace) and not by deletion mutant FMOl G.
  • FIG. 4B shows a Southern blot of genomic DNA from deletion mutant FMOl G. marginata cells. The Southern blot shows that the FMOl gene has been deleted in the mutant. The results show that the FMOl gene is important for amanitin production.
  • Cyclic peptides, bicyclic peptides and peptides with cross-bridges are useful as drugs due to their stability, high surface area, membrane permeability, and resistance to proteases.
  • the bicyclic peptide toxins of Amanita mushrooms, the amatoxins such as amanitin and the phallotoxins such as phalloidin include a thioether cross-bridge between tryptophan (Trp) and cysteine (Cys). This cross bridge, called tryptathionine, contributes to the activities of these powerful compounds. Tryptathionine is not present in any other natural product, and its biosynthetic pathway is unknown.
  • Amatoxins currently cannot be chemically synthesized and Amanita cannot be cultured, so the only current source is extraction from mushrooms collected in the wild.
  • the enzyme that catalyzes tryptathione biosynthesis can be used for in vitro or in vivo production of a new class of bicyclic peptides with potential pharmaceutical or industrial applications, ultimately including the amatoxins themselves.
  • a gene encoding a flavin mono-oxygenase is clustered with two genes in the amanitin biosynthetic pathway, prolyl oligopeptidase B (POPB) and AMAl.
  • FMO flavin mono-oxygenase
  • POPB prolyl oligopeptidase B
  • AMAl a gene encoding a flavin mono-oxygenase
  • AMAl encodes the 35- amino acid pro-protein for amanitin
  • POPB encodes the macrocyclase that converts the 35mer precursor peptide to a cyclic octapeptide, cyclo (IWGIGCNP, SEQ ID NO:l)
  • Macrocyclases are enzymes that can convert a linear substrate into a circular (cyclic) product (e.g., a cyclic peptide). Genes for such enzymes are also present in Amanita bisporigera and Amanita phalloides (Pulman et al. BMC Genomics 17:1038 (2016).
  • FMOl catalyzes tryptathionine biosynthesis as shown by the following reaction.
  • step (A) involves 3-hydroxylation of indole, like the reaction catalyzed by an FMO from Methylophaga.
  • step (B) is a reaction pathway for the Savige-Fontana synthesis of tryptathionine from the synthetic compound 3a-hydroxy-pyrrolo[2,3-b]-indole (Hpi), in which 3-hydoxyTrp is an intermediate.
  • peptides After formation of the tryptathionine cross bridge by FMOl, peptides can undergo one or more hydroxylations and sulfoxidations. For example, after formation of a tryptathionine cross bridge, the remaining steps in amanitin biosynthesis include two to four hydroxylations and sulfoxidation. Identification of the tryptathionine forming enzyme FMOl is a key missing link.
  • Flavin mono -oxygenase (FMO) nucleic acids and enzymes are described herein that are useful for generating crosslinked and/or cyclized peptides.
  • FMO Flavin mono -oxygenase
  • a nucleic acid encoding a Galerina marginata FMOl (GmFMOl) enzyme was isolated as a cDNA.
  • Galerina marginata FMOl (GmFMOl) protein encoded by such a cDNA is shown below SEQ ID NO:2.
  • the region that includes amino acid positions 34-105 of the SEQ ID NO:2 FMOl protein includes a Rossmann-fold NAD(P)(+)-binding protein domain.
  • FMOl is shown below as SEQ ID NO:3.
  • This FM01 -related protein from Amanita muscaria has accession number KJL66345.1 and the following sequence (SEQ ID NO:7).
  • a cDNA clone encoding a FMOl protein is synthesized, isolated, and/or obtained from a selected cell.
  • cDNA clones from other species that encode a FMOl -related protein
  • the nucleic acid encoding a FMOl protein can be any nucleic acid with a coding region that hybridizes to SEQ ID NO:3.
  • Such a FMOl -related nucleic acid can encode a protein that has FMOl activity.
  • the FMOl nucleic acid can encode a FMOl protein with an amino acid sequence that has at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:2, 4, 5, 6, or 7.
  • restriction endonucleases the entire coding sequence for the FMOl nucleic acid is subcloned downstream of the promoter in a 5 ' to 3 ' sense orientation.
  • the FMOl proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes.
  • the FMOl proteins in these compositions, mixtures, and processes can have amino acid sequences that have at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:2, 4, 5, 6, or 7.
  • the FMOl nucleic acids and/or FMOl enzymes can be used in conjunction with other enzymes.
  • the FMOl nucleic acids and/or FMOl enzymes can be co-expressed or combined in vitro with other enzymes that can cychze peptides.
  • prolyl oligopeptidase or "POP” refers to a member of a family of enzymes classified and referred to as EC 3.4.21.26-enzymes that are capable of cleaving a peptide sequence, for example via hydrolysis of Pro-l-Xaa» Ala-I-Xaa in oligopeptides.
  • Prolyl oligopeptidases are also referred to as "post-proline cleaving enzyme,” “proline- specific endopeptidase,” “post-proline endopeptidase,” “proline endopeptidase,” “endoprolyl peptidase,” “prolyl endopeptidase,” “post-proline cleaving enzyme,” “post-proline endopeptidase,” and “prolyl endopeptidase.”
  • POP A enzymes are found in the majority of mushrooms. POPB enzymes generally have approximately about 50% to 60% (e.g. 55%) amino acid identity to POP A enzymes. The POPB enzymes are primarily found in Amanita,
  • Galerina and Lepiota peptide-producing mushroom species.
  • POP A Amanita bisporigera prolyl oligopeptidase sequence is shown below as SEQ ID NO:8.
  • cDNA nucleic acid sequence that encodes the Amanita bisporigera prolyl oligopeptidase (POPA) protein is shown below as SEQ ID NO:9.
  • POPA prolyl oligopeptidase
  • prolyl oligopeptidase (POP A) sequences have about 65% sequence identity.
  • POP A from Amanita bisporigera and Galerina marginata do not have significant macrocyclase activity, and instead POPA enzymes often just cut small peptides at proline residues.
  • POPB Prolyl oligopeptidase B
  • cDNA nucleic acid sequence that encodes the Amanita bisporigera prolyl oligopeptidase (POPB) protein is shown below as SEQ ID NO:12.
  • a cDNA clone encoding a cyclization enzyme such as POPB enzyme is synthesized, isolated, and/or obtained from a selected cell.
  • cDNA clones from other species are isolated from selected fungal tissues.
  • the nucleic acid encoding an enzyme e.g. with cyclization activity
  • Such a cyclization-related nucleic acid can encode a protein that has POP activity.
  • the POP nucleic acid can encode a POP protein with an amino acid sequence that has at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15.
  • restriction endonucleases the entire coding sequence for a selected POP nucleic acid segment is subcloned downstream of the promoter in a 5' to 3' sense orientation.
  • the cyclization proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes.
  • the cyclization proteins in these compositions, mixtures, and processes can have amino acid sequences that have at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 11 , 13, 14, or 15.
  • the cross-bridge and cyclization enzymes described herein can be expressed from expression cassettes.
  • Such an expression cassette (also referred to as a transgene) can include a promoter operably linked to a nucleic acid segment that encodes any of the FMOl or other enzymes described herein.
  • expression vector and "expression cassette” refer to a recombinant nucleic acid molecule containing a desired nucleic acid coding segment and appropriate nucleic acid sequences for the expression of the operably linked nucleic acid coding segment in a particular host cell or organism.
  • Nucleic acid sequences for expression of proteins in prokaryotes usually include a promoter, an operator (optional), and a ribosome-binding site, often along with other sequences.
  • Eukaryotic cells can utilize promoters, enhancers, and termination and polyadenylation signals.
  • Nucleic acid sequences for expression of proteins can be heterologous to the nucleic acid coding segment.
  • a “heterologous nucleic acid,” “heterologous gene” and “heterologous protein” indicate a nucleic acid, gene or protein, respectively, which has come from a source other than its native source.
  • the terms “heterologous” and “exogenous” are sometimes used interchangeably with “recombinant.”
  • Such heterologous nucleic acid, heterologous genes, and/or heterologous proteins may have been artificially supplied to the biological system or host cell.
  • endogenous protein refers to a protein or gene that is native to the biological system, cell, species or chromosome under study.
  • a “native” or “endogenous” nucleic acid is a nucleic acid that does not contain nucleotides or other features normally present in the chromosome in which it is normally found in nature.
  • Promoters provide for expression of mRNA from the FMOl and other nucleic acids.
  • the promoter can be a FMOl native promoter or a promoter that expresses a native cyclization enzyme.
  • the promoter can in some cases be heterologous to the FMOl nucleic acid segment.
  • Cyclization enzyme can also be expressed from heterologous promoters.
  • such a heterologous promoter may not be naturally linked to such a FMOl and/or other nucleic acid segment.
  • some expression cassettes and expression vectors have been recombinantly engineered to include a FMOl or other nucleic acid coding segment operably linked to a heterologous promoter.
  • a FMOl or other nucleic acid coding segment is operably linked to the promoter, for example, when it is located downstream from the promoter.
  • promoters can be included in the expression cassettes and/or expression vectors.
  • the endogenous FMOl promoter, or an endogenous promoter that expresses a native cyclization enzyme can be employed.
  • Promoter regions are typically found in the flanking DNA upstream from the coding sequence in both prokaryotic and eukaryotic cells.
  • a promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about SO to about 2,000 nucleotide base pairs.
  • Promoter sequences can also contain regulatory sequences such as enhancer sequences that can influence the level of gene expression.
  • Some isolated promoter sequences can provide for gene expression of heterologous DNAs, that is a DN A different from the native or homologous DNA.
  • Promoters can be strong or weak, or inducible.
  • a strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a very low level of gene expression.
  • An inducible promoter is a promoter that provides for the turning on and off gene expression in response to an exogenously added agent, or to an environmental or developmental stimulus.
  • a bacterial promoter such as the P ⁇ promoter can be induced to vary levels of gene expression depending on the level of isothiopropylgalactoside added to the transformed cells.
  • Promoters can also provide for cell specific, tissue specific or developmental regulation.
  • a strong promoter for heterologous DNAs can be advantageous because it provides for a sufficient level of gene expression for easy detection and selection of transformed cells and provides for a high level of gene expression when desired.
  • the promoter within such expression cassettes / vectors can be functional in a fungal cell.
  • Expression cassettes / vectors can include, but are not limited to, fungal promoters of genes such as isocytochrome CI (CYC1), alcohol dehydrogenase (ADH1), transcription elongation factor (TEF), yeast glyceraldehyde phosphodehydrogenase (GAPDH), yeast phosphoglycerate kinase (PGK below), yeast cytochrome b5, Yb5 promoter, or a combination thereof.
  • fungal promoters of genes such as isocytochrome CI (CYC1), alcohol dehydrogenase (ADH1), transcription elongation factor (TEF), yeast glyceraldehyde phosphodehydrogenase (GAPDH), yeast phosphoglycerate kinase (PGK below), yeast cytochrome b5, Yb5 promoter, or a combination thereof.
  • promoters examples include fungal and bacterial promoters. Some specific nonlimiting examples include; the aprE promoter or a mutant aprE promoter (see, WO 01/51643); the aph promoter of the Streptomycesfradiae aminoglycoside 3'-phosphotransferase gene; an Aspergillus niger glucoamylase (glaA) promoter, the glucose isomerase (GI) promoter of Actinoplanes missouriensis and the derivative GI (GIT) promoter (U.S. Pat. No.
  • the glucose isomerase (GI) promoter from Streptomyces lividans e.g., SEQ ID NO: 1 in WO 03/089621
  • the short wild-type GI promoter SEQ ID NO: 33 in WO 03/089621
  • the 1.5 GI promoter SEQ ID NO: 31 in WO 03/089621
  • the 1.20 GI promoter SEQ ID NO: 32 in WO 03/089621
  • any of the variant GI promoters SEQ ID NOs: 9-28 in WO 03/089621 as disclosed in WO 03/089621
  • the cbhl, cbh2, egll and egl2 promoters from filamentous fungi and specifically the Trichoderma reesei cellobiohydrolase promoter (GenBank Accession No.
  • Streptomyces phage promoters are also disclosed in Labes et al., 1997, Microbiol. 143:1503- 1512. Other promoters mat can be used are described in Deuschle et aL, 1986 EMBO J. 5:2987-2994 and WO 96/00787.
  • Plant promoters can also be used such as the CaMV 35S promoter (Odell et al., Nature. 313:810-812 (1985)), or others such as CaMV 19S (Lawton et al., Plant Molecular Biology. 9:315-324 (1987)), nos (Ebert et al., Proa Natl. Acad. ScL USA. 84:5745-5749 (1987)), Adhl (Walker et aL, Proc. Natl Acad. Sci. USA. 84:6624-6628 (1987)), sucrose synthase (Yang et al., Proc. Natl. Acad Sci. USA.
  • promoters include the poplar xylem-specific secondary cell wall specific cellulose synthase 8 promoter, cauliflower mosaic virus promoter, the Z10 promoter from a gene encoding a 10 kD zein protein, a Z27 promoter from a gene encoding a 27 kD zein protein, inducible promoters, such as the light inducible promoter derived from the pea rbcS gene (Coruzzi et al., EMBO J. 3:1671 (1984)) and the actin promoter from rice
  • Seed specific promoters such as the phaseolin promoter from beans, may also be used (Sengupta-Gopalan, Proc. Natl. Acad. Sci. USA. 83:3320-3324 (1985).
  • a FMOl nucleic acid or another enzyme-encoding nucleic acid segment can be combined with a promoter by standard methods to yield an expression cassette or transgene, for example, as described in Sambrook et al. (MOLECULAR CLONING: A LABORATORY MANUAL. Second Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (1989);
  • plasmid containing a promoter can be constructed as described in Jefferson ⁇ Plant Molecular Biology Reporter 5:387-405 (1987)) or obtained from Takara/Clontech Lab in Mountain View, California (e.g., pBI121 or pBI221). Typically, these plasmids are constructed to have multiple cloning sites having specificity for different restriction enzymes downstream from the promoter.
  • the FMOl or other enzyme encoding nucleic acids can be subcloned downstream from the promoter region using restriction enzymes and positioned to ensure mat the nucleic acid segment is inserted in proper orientation with respect to the promoter so that the coding segment can be expressed as sense or antisense RNA.
  • the expression cassette so formed can be subcloned into a plasmid or other vector (e.g., an expression vector, such as the pESC-HIS vector illustrated herein for expression of GmFMOl).
  • an endogenous FMOl gene can be modified to generate modified host cells and the host cells can express a modified FMOl protein. Mutations can be introduced into FMOl -expressing cell genomes by introducing targeting vectors, T-DNA, transposons, nucleic acids encoding TALENS, CRISPR, or ZFN nucleases, and combinations thereof into a recipient host cell to create a transformed cell.
  • the endogenous FMOl or cyclization enzyme gene can be deleted and host cells with such a deleted endogenous FMOl or cyclization enzyme gene can be transformed to include a modified FMOl or cyclization enzyme transgene, for example, by transformation of the host cells with a FMOl or cyclization enzyme expression cassette or expression vector.
  • the FMOl and/or cyclization enzyme can be encoded within expression cassettes and expressed in host cells.
  • the host cells can be any eukaryotic or prokaryotic cell.
  • a host cell can be any fungal cell or microorganism.
  • fungi are available for use as host cells.
  • a host cell may be located in a transgenic mushroom or other fungus such as Saccharomyces cerevisiae (yeast).
  • Prokaryotes include but are not limited to gram negative or positive bacterial cells. Numerous cell lines and cultures are available for use as host cells.
  • Host cells can be obtained through the American Type Culture Collection (ATCC), an organization that serves as an archive for living cultures and genetic materials (atcc.org).
  • ATCC American Type Culture Collection
  • An appropriate host can be determined by one of skill in the art based on the vector nucleic acid sequence and the desired result.
  • Bacterial cells used as host cells for expression vector replication and/or expression include, among those listed elsewhere herein, DH5 -alpha, JM109, and KC8, as well as a number of commercially available bacterial hosts such as SURETM. Competent Cells and SOLOPACK.TM Gold Cells (Stratagene, La Jolla). Alternatively, bacterial cells such as E. coli LE392 can be used as host cells.
  • a host cell is used as a recipient for heterologous nucleic acids.
  • a host cell may be "transfected” or “transformed,” which refers to a process by which exogenous nucleic acid is transferred or introduced into the host cell.
  • An expression cassette, plasmid or cosmid for example, can be introduced into a host cell, where the expression cassette, plasmid or cosmid can be integrated into the host cell genome or autonomously replicated.
  • a transformed cell includes the primary subject cell and its progeny.
  • fungal cells that can serve as hosts include yeast cells, mold cells, and mushroom cells.
  • yeast cells examples include yeast cells, mold cells, and mushroom cells.
  • the following types of cells can serve as host cells:
  • Peptides are substrates for the FMOl and/or cyclization enzymes described herein.
  • a variety of different peptide substrate sequences can be cross-linked and/or cyclized.
  • the product of POPB, cyclo(IWGIGCNP) may be a natural substrate for FMOl, FMOl expressed in a cell-free in vitro system exhibited nonspecific FMO activity, indicating that the FMOl enzyme can be active on a variety of peptide substrates.
  • Peptide substrates can be of different sizes or lengths, especially when cyclization enzymes are present with FMOl enzymes.
  • peptide substrates can have about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about SO amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids.
  • Peptide substrates can contain conventional and/or nonconventional amino acids.
  • the peptide substrates can contain hydroxylated amino acids, D-amino acids, atypical amino acids (e.g., ornithine, amino-isobutyric acid), N-methylated amino acids (e.g., N-methyl glycine), beta- amino acids (beta- alanine), or combinations thereof.
  • cyclo(IWGIGCNP) can be a substrate for FMOl crosslinking, where the lie, the Tip, and/or the Pro are pre-hydroxylated.
  • POPB can cyclize peptides containing hydroxylated, D-amino acids, N-methyl-amino acids, and beta-amino acids.
  • FMOl can also bicyclize cyclic peptides forms.
  • Precursor peptides that are cyclized can serve as peptide substrates for FMO enzymes.
  • the precursor peptides can be linear and one or more cyclization enzymes can be used with the FMO enzymes.
  • the peptide substrates can be made by biosynthesis of peptide substrate precursors in a host cell such as E. coli. In other cases, peptide substrates with or without unusual amino acids can be chemically synthesized.
  • synthetic peptide substrates can be modified.
  • peptide substrates can be synthesized by recombinant host cells that have expression cassettes with modified coding region segments.
  • selected nucleotides of synthetic G. marginata alpha-amanitin sequences can be changed to make other types of peptide toxins and peptides.
  • replacing the codon AAC (Asn) with GAC (Asp) provides beta-amanitin instead of alpha-amanitin. Beta- amanitin production in G. marginata can readily be detected by reverse-phase HPLC.
  • Alpha-amanitin for example,
  • Beta-amanitin has the following structure.
  • Phallacidin for example, is c
  • Non-natural amanitin derivatives can involve replacing Gly with Ala by replacing GGT with GCT in a nucleic acid segment encoding a precursor peptide for amanitin.
  • the peptide substrates can contain Trp and Cys at positions convenient for forming tryptathionine cross-bridges (e.g., Trp and Cys may in some cases not be adjacent in the primary structure or sequence of the peptide substrates).
  • Linear and cyclic peptides of at least six, seven, eight, nine, ten or more amino acids can be made that include the following sequence: XWXXXCXP (SEQ ID NO:16), where X is any amino acid.
  • the proline can be retained in these peptides for correct processing by POP, and the presence of Trp (W) and Cys (C) allow for biosynthesis of tryptathionine, a unique hallmark of the Amanita toxin peptides.
  • Peptide sequences mat include MDSTN (SEQ ID NO:17), TRIPL (SEQ ID NO:18), or other peptide sequences with prolines in conserved positions can be used in the peptide substrates, for example, when forming a cyclized peptide product.
  • the peptide substrates can have an amino acid sequence that includes a start site (e.g., methionine), a leader sequence (which is cleaved by POPB), a follower peptide that is removed when POPB cyclizes the core region (IWGIGCNP; SEQ ID NO:l) by
  • a smaller peptide representing the amino acid sequence of the final peptide product (the core, IWGIGCNP; SEQ ID NO:l).
  • a peptide substrate an exemplary precursor peptide that includes an amino acid sequence for mature ama toxins and phallotoxins is illustrated as: MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC (SEQ ID NO: 19), where the peptide mat is cyclized is underlined. See also FIG. 1 A. This is an example of an excellent peptide substrate for POPB.
  • peptide products from endogenous peptides and proteins can be by standard techniques including but not limited to HPLC methods.
  • Isolated peptide products can be evaluated in assays for assessing structure and/or biological activity. Peptide structures can be assessed by high resolution MS and NMR.
  • the toxicity of peptide products such as synthetic amanitin toxins can be determined in assays that measure inhibition of transcription in eukaryotic cells, such as capability to inhibit RNA Polymerase II or other assays such as binding to actin, immunosuppression, or blocking the mitochondrial permeability transition pore.
  • Useful peptide products can be produced in bulk (either within host cells or by reaction mixtures containing crosslinking and/or cyclization enzymes).
  • Table 1 illustrates exemplary sequences encoded by AMA1 and PHA1 genes.
  • cyclic and/or crosslinked peptides that can be made using the enzymes described herein can have useful properties.
  • cyclic, crosslinked peptide toxins of Amanita mushrooms are chemically and biologically unique among natural products. Such toxins have uses as biochemical reagents and as anti-cancer therapeutics, for example, in targeted therapeutics such as antibody-drug conjugates.
  • the enzymes described herein can be used to biosynthcsize many types of cyclic / crosslinked peptides.
  • Cyclic and/or crosslinked peptides can be made in vivo (e.g., in host cells or host organisms) or in ⁇ dtro (e.g., in a reaction mixture containing one or more enzymes, a peptide substrate, and any other cofactors or other reactants that may facilitate the cyclization or crosslinking reaction).
  • peptide substrates can be co- expressed in a host cell with a FMOl and/or other cyclization enzyme.
  • an expression cassette can be used that has a promoter operably linked to a nucleic acid segment encoding a peptide substrate.
  • Such peptides can then be expressed with FMOl and/or cyclization enzymes so that cyclization and/or cross-bridge formation can occur in a host cell.
  • the expression cassette encoding the peptide substrate can have any of the promoters described herein.
  • the FMOl and/or cyclization enzymes can be expressed from expression cassettes within the host cells. After incubation of the host cells in a culture medium that supports the growth and functioning of the host cells, the cyclic / crosslinked peptides can be isolated from the cell culture.
  • peptide substrates can be mixed with one or more FMOl and/or other cyclization enzymes in a reaction mixture.
  • the reaction mixture can contain other factors to enhance the formation of cyclic / crosslinked peptide products.
  • the reaction can contain one or more of the following:
  • NADPH NADPH
  • FAD FAD
  • ATP oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • dithiothreitol oxygen
  • the reaction temperature that allows synthesis of peptide is 0 to 60° C, or 5 to 40° C.
  • reaction pH that allows synthesis of peptide is 6.S to 10.S, and preferably 6.S to 9.5.
  • bicyclic peptides can be produced using the enzymes and methods described herein.
  • Host cells such as yeast, bacterial (e.g., E. coli), or Calerina cells can be transformed with one or more copies of an expression cassette encoding a substrate precursor peptide useful for making a cyclic or bicyclic peptide, and one or more expression cassettes encoding FMOl and/or cyclization enzymes.
  • a reaction mixture of peptide substrates for such cyclic or bicyclic peptides as well as FMOl and/or cyclization enzymes can be incubated in vitro to generate the cyclic or bicyclic product. Definitions
  • peptide refers to compounds containing two or more amino acids linked by the carboxyl group of one amino acid to the amino group of another, i.e. "peptidyl linkages" to form an amino acid sequence. It is contemplated that peptides may be purified and/or isolated from natural sources or prepared by recombinant or synthetic methods. Amino acid sequences may be encoded by naturally or non-naturally occurring nucleic acid sequences or synthesized by recombinant nucleic acid sequences or artificially synthesized. A peptide may be a linear peptide or a cyclopeptide, i.e. cyclic peptide including bicyclic feature.
  • cyclic peptide or “cyclopeptide” in general refers to a peptide comprising at least one internal bond attaching nonadjacent amino acids of the peptide, such as when the end amino acids of a linear sequence are attached to form a circular peptide.
  • a "bicyclic peptide” may have at least two internal bonds forming a cyclopeptide of the present inventions, such as when the end amino acids of a linear sequence are attached to form a circular peptide in addition to another internal bond attaching two nonadjacent amino acids, for example, see e.g. FIG. 1.
  • the term "Amanita peptide” or “Amanita toxin” or “Amanita peptide toxin” refers to any linear or cyclic peptide produced by a mushroom, not restricted to a biologically active toxin. It is not intended that the present invention be limited to a toxin or a peptide produced by an Amanita mushroom and includes similar peptides and toxins produced by other fungi, including but not limited to species of Lepiota, Conocybe, Galerina, and the like.
  • An Amanita peptide toxin resembles any of the amatoxins and phallotoxins, such as similarity of amino acid sequences, matching toxin motifs as shown herein, encoded between the conserved regions (A and B) of their proproteins, encoded by hypervariable regions of their proproteins (P) and the like.
  • the Amanita peptides include, but are not restricted to, amatoxins such as the amanitins, and phallotoxins such as phalloidin and phallacidin.
  • an exemplary Amanita peptide in one embodiment ranges from 6- 17 amino acids in length.
  • an Amanita peptide toxin ranges from 7-11 amino acids in length.
  • an Amanita peptide is linear. In another embodiment, an Amanita peptide is a bicyclic peptide. It is not meant to limit an Amanita peptide to a naturally produced peptide. In some embodiments, an Amanita peptide has an artificial sequence, in other words a nucleic acid encoding an artificial peptide sequence was not naturally found in a fungus or found encoded by a nucleic acid sequence isolated from a fungus.
  • biologically active refers to a peptide that when contacted with a cell, tissue or organ induces a biological activity, such as stimulating a cell to divide, causing a cell to alter its function (e.g., altering T cell function, causing a cell to change expression of genes, etc.)
  • peptide in general refers to a primary sequence of amino acids that are joined by covalent “peptide linkages.
  • Polypeptides may encompass either peptides or proteins. In general, a peptide consists of a few amino acids, and is shorter than a protein.
  • amino acid sequence and like terms, such as “polypeptide” or “protein” are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
  • synthetic or “artificial” in relation to a peptide sequence refers to a peptide made either artificially from covalently bonding amino acids, such as by Peptide Synthesizer, (for example, Applied Biosystems) or a peptide derived from an amino acid sequence encoded by a recombinant nucleic acid sequence.
  • toxin refers to a substance (for example, a cyclopeptide) that is detrimental (Le., poisonous) to cells and/or organisms, in particular a human organism.
  • a substance for example, a cyclopeptide
  • the peptides made by the enzymes and methods described herein are not toxins.
  • amatoxin generally refers to a family of peptide compounds, related to and including the amanitins.
  • An amatoxin refers to any small peptide, linear and cyclic, comprising an exemplary chemical structure (e.g., as illustrated herein, see also FIG. 1) or encoded by a nucleic acid sequence.
  • Such an amatoxin nucleic acid sequence and/or proprotein can have higher sequence homology to amanitin (or e.g., an AMAl nucleic acid) than to an analogous sequence of a phallotoxin (or e.g., a PHAl nucleic acid).
  • the peptides made by the enzymes and methods described herein are not amatoxins or phallotoxins. However, in other cases, the enzymes and methods described herein can efficiently make amatoxins and/or phallotoxins.
  • phallotoxin generally refers to a family of peptide compounds, related to and including phallacidin and phalloidin.
  • a phallotoxin refers to any small peptide encoded by nucleic acid sequences where the nucleic acid sequence and/or proprotein has a higher sequence homology to a phallotoxin (or e.g., a PHA1 nucleic acid) than to an analogous sequence of an amatoxin (or e.g., an AMA1 nucleic acid).
  • microorganism refers to microscopic organisms and taxonomically related macroscopic organisms within the categories of algae, bacteria, fungi (including lichens), protozoa, viruses, and subviral agents.
  • eukaryotic and "eukaryote” are used in the broadest sense. It includes, but is not limited to, any organisms containing membrane bound nuclei and membrane bound organelles. Examples of eukaryotes include but are not limited to animals, plants, algae, diatoms, and fungi.
  • prokaryote and “prokaryotic” are used in the broadest sense. It includes, but is not limited to, any organisms without a distinct nucleus. Examples of prokaryotes include but are not limited to bacteria, bhie-green algae (cyanobacteria), archaebacteria, actinomycetes and mycoplasma. In some cases, a host cell is any microorganism.
  • fungi is used in reference to eukaryotic organisms such as mushrooms, rusts, molds and yeasts, including dimorphic fungi “Fungus” or “fungi” also refers to a group of lower organisms lacking chlorophyll and dependent upon other organisms for source of nutrients.
  • muscle refers to the fruiting body of a fungus.
  • fruiting body refers to a reproductive structure of a fungus which produces spores, typically comprising the whole reproductive structure of a mushroom including cap, gills and stem, for example, a prominent fruiting body produced by species of Ascomycota and Basidiomycota, examples of fruiting bodies are "mushrooms,"
  • fruiting body cell refers to a cell of a cap or stem which may be isolated or part of the structure.
  • spore refers to a microscopic reproductive cell or cells.
  • mycelium refers to a mass of fungus hyphae, otherwise known as a vegetative portion of a fungus.
  • Basidiomycota in reference to a Phylum or Division refers to a group of fungi whose sexual reproduction involves fruiting bodies comprising basidiospores formed on club-shaped cells known as basidia.
  • Basidiomycetes in reference to a class of Phylum Basidiomycota refers to a group of fungi. Basidiomycetes include mushrooms, of which some are rich in cyclopeptides and/or toxins, and includes certain types of yeasts, rust and smut fungi, gilled- mushrooms, puffballs, polypores, jelly fungi, brackets, coral, mushrooms, boletes, puffballs, stinkhorns, etc.
  • Ascomycota or “ascomycetes” in reference to members of a fungal Phylum or Division refers to a “sac fungus” group.
  • Ascomycota a class of fungus
  • “Ascomycetes” includes Candida albicans, unicellular yeast, Morchella esculentum, the morel, and Neuwspom crassa. Some ascomycetes cause disease, for example.
  • Candida albicans causes thrush and vaginal infections; or produce chemical toxins associated with diseases, for example, Aspergillus flavus produces a contaminant of nuts and stored grain called aflatoxin, that acts both as a toxin and a deadly natural carcinogen.
  • Amanita refer to a genus of fungus whose members comprise poisonous mushrooms, e.g., Amanita (A.) bisporigera, A. virosa, A. ocreata, A. suballiacea, and A. tenuifolia which are collectively referred to as "death angels” or “Destroying Angels” and "Amanita phalloides” or “A. phalloides var. alba” or “A. phalloides var. verna” or “A. verna”, referred to as "death cap.”
  • the toxins of these mushrooms frequently cause death through liver and kidney failure in humans. Not all species of this genus are deadly, for example, Amanita muscaria, the fly agaric, induces gastrointestinal distress and/or hallucinations while others do not induce detectable symptoms.
  • nonribosomal peptide synthetase is an enzyme that catalyzes the biosynthesis of a small (20 or fewer amino acids) peptide or depsipeptide, linear or circular, and is composed of one or more domains (modules) typical of this class of enzyme. Each domain is responsible for aminoacyl adenylation of one component amino acid.
  • NRPSs can also contain auxiliary domains catalyzing, e.g., N-methylation and amino acid epimerization (Walton, et al., in Advances in Fungal Biotechnology for Industry, Agriculture, and Medicine, et al., Eds. (Kluwer Academic/Plenum, N.Y., 2004, pp.
  • Examples are gramicidin synthetase, HC-toxin synthetase, cyclosporin synthetase, and enniatin synthetase.
  • cell As used herein, the terms “cell,” “cell line,” “cell population,” and “cell culture” may be used interchangeably. All of these terms also include their progeny, which are any and all subsequent generations. It is understood that all progeny may not be identical due to deliberate or inadvertent mutations.
  • host fungus cell refers to any fungal cell, for example, a yeast cell, a mold cell, and a mushroom cell (such as Neurospora crassa, Aspergillus nidulans,
  • the term "Fungal expression system” refers to a system using fungi to produce (express) enzymes and other proteins and peptides.
  • filamentous fungi which are currently used or proposed for use in such processes are Neurospora crassa, Acremonium chrysogenum, Tofypocladium geodes, Mucor circineUoides, Trichoderma reesei, Aspergillus nidul ns, Aspergillus niger, Coprinus cinereus, Aspergillus oryzfle, etc.
  • basidiomycete genes for example, Gola. et al., (2003) J Basic Microbiol. 43(2):104-12; herein incorporated by reference
  • fungal expression systems using, for example, a monokaryotic laccase-deficient Pycnoporus cinnabarinus strain BRFM 44 (Banque de Resources Fongiques de Marseille, Marseille, France), and SchizpphyUum commune, (for example, Alexandra, et al., (2004) Appl Environ Microbiol. 70(ll):6379-638; Lugones, et al., (1999) Mol. Microbiol. 32:681-700; Schuren, et al., (1994) Curr. Genet. 26:179-183; all of which are herein incorporated by reference).
  • transgene refers to a foreign gene, such as a heterologous gene, that is placed into an organism by, for example, introducing the foreign gene into cells or primordial tissue.
  • foreign gene refers to any nucleic acid (e.g., gene segment) that is introduced into the genome of a host cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene does not reside in the same location as does the naturally-occurring gene.
  • vector is used in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another.
  • the term “vehicle” is sometimes used interchangeably with “vector.”
  • a vector “backbone” comprises those parts of the vector which mediate its maintenance and enable its intended use (e.g., the vector backbone may contain sequences necessary for replication, genes imparting drug or antibiotic resistance, a multiple cloning site, and possibly operably linked promoter and/or enhancer elements which enable the expression of a cloned nucleic acid).
  • the cloned nucleic acid e.g., such as a cDNA coding sequence, or an amplified PCR product
  • a "recombinant vector” indicates that the nucleotide sequence or arrangement of its parts is not a native configuration, and has been manipulated by molecular biological techniques.
  • the term implies that the vector is comprised of segments of DNA that have been artificially joined.
  • recombinant nucleic acid or “recombinant gene” or “recombinant DNA molecule” or “recombinant nucleic acid sequence” indicates that the nucleotide sequence or arrangement of its parts is not a native configuration, and has been manipulated by molecular biological techniques. The term implies that the DNA molecule is comprised of segments of DNA that have been artificially joined together. Protocols and reagents to manipulate nucleic acids are common and routine in the art (See e.g., Maniatis et al. (eds.), Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY.
  • a "recombinant protein” or “recombinant polypeptide” refers to a protein molecule that is expressed from a recombinant DNA molecule. Use of these terms indicates that the primary amino acid sequence, arrangement of its domains or nucleic acid elements which control its expression are not native, and have been manipulated by molecular biology techniques. As indicated above, techniques to manipulate recombinant proteins are also common and routine in the art.
  • exogenous and heterologous are sometimes used interchangeably with “recombinant.”
  • An "exogenous nucleic acid,” “exogenous gene” and “exogenous protein” indicate a nucleic acid, gene or protein, respectively, that has come from a source other than its native source, and has been artificially supplied to the biological system.
  • endogenous protein refers to a protein or gene that is native to the biological system, species or chromosome under study.
  • a “native” or “endogenous” polypeptide does not contain amino acid residues encoded by recombinant vector sequences; that is, the native protein contains only those amino acids found in the polypeptide or protein as it occurs in nature.
  • a “native” polypeptide may be produced by recombinant means or may be isolated from a naturally occurring source.
  • a “native” or “endogenous” gene is a gene that does not contain nucleic acid elements encoded by sources other than the chromosome on which it is normally found in nature.
  • heterologous nucleic acid or “heterologous polypeptide” refers to a nucleic acid or polypeptide that is not in its natural environment.
  • a heterologous coding region includes a coding region from one species introduced into another species.
  • a heterologous nucleic acid or heterologous polypeptide also includes a nucleic acid or polypeptide native to an organism that has been altered in some way (e.g., mutated, added in multiple copies, linked to non-native regulatory sequences, etc.).
  • Heterologous nucleic acids or heterologous polypeptides are distinguished from endogenous nucleic acids in that the heterologous nucleic acids are typically joined to nucleic acid segments that are not found naturally associated with the nucleic acids in the chromosome or are associated with portions of the chromosome not found in nature (e.g., coding regions expressed in loci where the corresponding gene is not normally expressed).
  • genomic forms of a gene may also include sequences located on both the 5' and 3' end of the sequences that are present on the RNA transcript. These sequences are referred to as "flanking" sequences or regions (these flanking sequences are located 5' or 3' to the untranslated sequences present on the mRNA transcript).
  • the 5' flanking region may contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene.
  • the 3' flanking region may contain sequences that direct the termination of transcription, post-transcriptional cleavage and polyadenylation.
  • regulatory element refers to a genetic element that controls some aspect of the expression of nucleic acid sequences.
  • a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region.
  • Other regulatory elements include splicing signals, polyadenylation signals, termination signals, etc.
  • operable combination when used in reference to nucleic acid segments herein are used to refer to the linkage of nucleic acid segments in such a manner that a nucleic acid molecule capable of directing the transcription of a given coding region and/or the synthesis of a desired protein molecule is produced.
  • the term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
  • a “promoter” is a control sequence that is a region of a nucleic acid sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase and other transcription factors.
  • the phrases “operably linked,” “operatively positioned,” “operatively linked,” “under control,” and “under transcriptional control” mean that a promoter is in a correct functional location and/or orientation in relation to a nucleic acid segment (e.g., a nucleic acid segment encoding an enzyme) to control transcriptional initiation and/or expression of that nucleic acid segment.
  • a promoter may or may not be used in conjunction with an “enhancer,” which refers to a cis-acting regulatory sequence involved in the transcriptional activation of a nucleic acid segment.
  • a promoter may be one naturally associated with a gene or nucleic acid segment, and may be obtained by isolating the 5' non-coding sequences located upstream of the coding segment and/or exon. Such a promoter can be referred to as "endogenous.”
  • an enhancer may be one naturally associated with a nucleic acid sequence, located either downstream or upstream of that sequence.
  • certain advantages will be gained by positioning the coding nucleic acid segment under the control of a recombinant or heterologous promoter, which refers to a promoter that is not normally associated with a nucleic acid sequence in its natural environment.
  • a recombinant or heterologous enhancer refers also to an enhancer not normally associated with a nucleic acid in its natural environment.
  • promoters or enhancers may include promoters or enhancers of other genes, and promoters or enhancers isolated from any other prokaryotic, viral, or eukaryotic cell, and promoters or enhancers not "naturally occurring," e.g., containing different elements of different transcriptional regulatory regions, and/ or mutations that alter expression.
  • sequences may be produced using recombinant cloning and/or nucleic acid amplification technology, including PCR, in connection with the compositions disclosed herein (see U.S. Pat. No. 4,683,202, U.S. Pat. No. 5,928,906, each incorporated herein by reference). It is further contemplated that control sequences that direct transcription and/or expression of sequences within non-nuclear organelles such as mitochondria, chloroplasts, and the like, can be employed as well.
  • a promoter and/or enhancer can be used that effectively directs the expression of a nucleic acid segment (e.g., including a nucleic acid segment encoding an enzyme) in the cell type, organelle, and organism chosen for expression.
  • a nucleic acid segment e.g., including a nucleic acid segment encoding an enzyme
  • Those of skill in the art of microbiology and molecular biology generally know the use of promoters, enhancers, and cell type combinations for protein expression, for example, see Sambrook et al. (1989); herein incorporated by reference.
  • the promoters employed may be constitutive, tissue-specific, inducible, and/or useful under the appropriate conditions to direct the desired level of expression of the introduced nucleic acid segment encoding a target protein (e.g., high levels of expression that are advantageous in the large-scale production of recombinant proteins and/or peptides).
  • the promoter may be heterologous or endogenous.
  • Suitable control elements such as enhancers/promoters, splice junctions,
  • polyadenylation signals, etc. may be placed in close proximity to the coding region of a coding region to facilitate proper initiation of transcription and/or correct processing of the primary RNA transcript.
  • the coding region utilized in the expression cassettes may contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both endogenous and exogenous control elements.
  • hybridization is used in reference to the pairing of complementary nucleic acids. Hybridization can be demonstrated using a variety of hybridization assays (Southern blot, Northern Blot, slot blot, phage plaque hybridization, and other techniques). These protocols are available in the art (See e.g., Sambrook et al. (eds.), Molecular Cloning: A Laboratory Manual, Second Edition, Volumes 1-3, Cold Spring Harbor Laboratory Press, NY, [1989]; Ausubel et aL (eds.), Current Protocols in Molecular Biology, Vol. 1-4, John Wiley & Sons, Inc., New York [1994]; all of which are herein incorporated by reference).
  • Hybridization is the process of one nucleic acid pairing with an antiparallel counterpart that may or may not have 100% complementarity. Two nucleic acids that contain 100% antiparallel complementarity will show strong hybridization. Two antiparallel nucleic acids which contain no antiparallel complementarity (generally considered to be less than 30%) will not hybridize. Two nucleic acids which contain between 31-99% complementarity will show an intermediate level of hybridization. A single molecule that contains pairing of complementary nucleic acids within its structure is said to be "self-hybridized.”
  • a "substantially homologous" nucleotide sequence is one that at least partially inhibits a completely complementary probe sequence from hybridizing to a target nucleic acid under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction.
  • the absence of non-specific binding may be verified by the use of a second target that lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
  • substantially homologous refers to any probe that can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of high stringency.
  • Hybridization and the strength of hybridization is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.
  • the term “stringency” is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acids hybridize.
  • Low or weak stringency conditions are reaction conditions which favor the complementary base pairing and annealing of two nucleic acids.
  • “High stringency” conditions are those conditions which are less optimal for complementary base pairing and annealing.
  • Several variables affect the strength of hybridization, including the length and nature of the probe and target (DNA, RNA, base composition, present in solution or immobilized, the degree of complementary between the nucleic acids, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids).
  • Conditions may be manipulated to define low or high stringency conditions: factors such as the concentration of salts and other components in the hybridization solution (e.g., the presence or absence of formamide, dextran sulfate, and polyethylene glycol) as well as temperature of the hybridization and/or wash steps. Conditions of "low” or “high” stringency are specific for the particular hybridization technique used.
  • stringency is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid hybridizations are conducted. Those skilled in the art will recognize that “stringency” conditions may be altered by varying the parameters just described either individually or in concert. With “high stringency” conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences (e.g., hybridization under “high stringency” conditions may occur between homologs with about 85-100% identity, preferably about 70-100% identity). With medium stringency conditions, nucleic acid base pairing will occur between nucleic acids with an intermediate frequency of complementary base sequences (e.g.
  • hybridization under "medium stringency” conditions may occur between homologs with about 50-70% identity).
  • conditions of "weak” or 'low” stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less.
  • “High stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 65° C in a solution consisting of 5 and 1.85 g/1 EDTA, pH adjusted to 7.4 with
  • “Medium stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 55° C. in a solution consisting of 5 pH adjusted to 7.4 with NaOH), 0.5% SDS, 5xDenhardt's reagent and 100 pg/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.OxSSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed. Because the washing step is most responsible for the stringency of hybridization, "medium stringency conditions” can be described as washing in 1. OxSSPE, 1.0% SDS at 42° C.
  • Low stringency conditions comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of SxSSPE (43.8 g/1 NaCl, 6.9 g/1
  • Tm is used in reference to the “melting temperature.”
  • the melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated "denatures") into single strands.
  • sequence identity is a measure of strict amino acid or nucleotide conservation.
  • Percent similarity is a measure of amino acid conservation which incorporates both strictly conserved amino acids, as well as
  • nucleic acid conservation can pertain to either proteins or nucleic acids. Two proteins can be described as “homologous” or “non-homologous,” but the degree of amino acid conservation is quantitated by percent identity and percent similarity. Nucleic acid conservation is measured by the strict conservation of the bases adenine, thymine, guanine and cytosine in the primary nucleotide sequence. When describing nucleic acid conservation, conservation of the nucleic acid primary sequence is sometimes expressed as percent sequence identity or percent homology.
  • one region may show a high percentage of nucleotide sequence conservation, while a different region can show no or poor conservation. Nucleotide sequence conservation may not necessarily be inferred from an amino acid similarity score. Two proteins may show domains that in one region are homologous, while other regions of the same protein are clearly non-homologous.
  • Example 1 cDNA for Galerina marginata FMOl
  • Galerina marginata CBS 339.88
  • Galerina badipes CBS 268.50
  • Galerina venenata CBS 924.72
  • Galerina hybrida CBS 335.88
  • Galerina marginata CBS 339.88 is monokaryotic and was confirmed to make alpha- amanitin.
  • G. venenata is considered synonymous with G. marginata (Gulden et aL, 2001, 2005).
  • the cultures were maintained on potato dextrose agar.
  • the medium was HSV-2C, which contains (per liter) 1 g yeast extract, 2 g glucose, 0.1 g NH «CL 0.1 g CaSQ4-5 3 ⁇ 40, 1 mg thiamine-HCL and 0.1 mg biotin, pH 52 (Muraoka and
  • the media had the same formulation, except mat the media contained high carbon (HSV-5Q and low carbon (HSV-1Q amounts of glucose (ix.5 g ghicoae and 1 g glucose, respectively) (see, Muraoka and Shinozawa, 2000).
  • Lyophilized fungal mycelia were ground in liquid nitrogen with a mortar and pestle.
  • High molecular weight DNA was isolated using genomic-tip 100/O (Qiagen, Germantown, MD; catalog #10,243), RNA was extracted with TRIzol (mvitrogen, Carlsbad, CA), and cDNA was synthesized following the manufacturers' protocols.
  • Transformation of G. marginata was performed as described previously by the inventors (Luo et aL, 2014).
  • candidate genes 1.5 to 1.8 kb upstream and downstream sequences were used for homologous recombination.
  • the fragments were cloned into the pHg vector (Kemppainen and Pardo, 2010) and transformed into Agrobacterium tumefaciens strain LBA1100 (Kemppainen et aL, 2005), which were then used m die Glolerina transformation.
  • This Example illustrates that the Galerina marginata FMOl enzyme can form a trvpeathionine (sulfide) cross bridge.
  • FMOl catalyzes tryptanuonine biosynthesis is based on reactions catalyzed by other FMOs.
  • a bacterial FMO catalyzes Tip hydroxylation at the three (3) position (May ft Perrin, Biopolymers 88:714-724 (2007)).
  • a chemical route to tryptathionine biosynthesis can involve hydroxylation of Tip at the 3 position, which activates the 2-position for spontaneous reaction with Cys.
  • One mechanism of FMOl action is hydroxylation of the Tip at the 3 position (FIG.2A), so that conjugation to Cys occurs spontaneously (FIG. 2B). In the course of the second step, the hydroxyl group leaves.
  • FMOl catalyzes "cryptic” (i.e., transient) hydroxylation.
  • recombinant FMOl was incubated with cyclo(lWGIGCNP, SEQ ID NO:l), NADPH, and FAD, and the reaction products were analyzed by LC/MS.
  • FIGs. 3B shows extracted ion counts (EIC) at m/z 839 of (top) product, and the (bottom) substrate. As illustrated by FIG. 3B, when FMOl was incubated with
  • cyclo(IWGIGCNP), NADPH, FAD, and an NADPH-regenerating system a new product was observed with a mass 2 Da smaller than the substrate, which is consistent with oxidative coupling of Cys and Trp.
  • the m/z of cyclo(IWGIGCNP) is 841 (M + IT)
  • the predicted mass of cyclo(IWGIGCNP) with a sulfide cross-bridge between Trp and Cys is 839 (M + H + ).
  • the substrate, cyclo(IWGIGCNP), m/z 841 elutes at 13.0 min.
  • FIG. 3C shows that the compound with m/z 839 is the dominant component eluting at 12.0 min.
  • the peak at m/z 861 in FIG. 3C corresponds to the sodium adduct.
  • the change in the UV absorption spectrum (ratio of 29S to 280 nm) of the new, FMOl -catalyzed product was consistent with the structure being the bicyclic compound. Coupling of Cys to Trp shifts the UV absorption maximum of Trp from 280 nm to 295 nm (Wieland, 1986).
  • FIG. 4A illustrates LC-MS analysis of amanitin from wild type G. marginata (solid line) and mutant G. marginata (dashed line).
  • FIG. 4B illustrates Southern analysis blot of G. marginata wild type and G. marginata deletion mutant genomic DNA. The probes employed were for GmFMOl; Hyg stands for hygromycin B resistance (the selection for transformation). The southern blots show that the FMOl gene has been deleted in the mutant. The gene disruption method employed is described in Luo et aL Chemistry and Biology 21 :1610 (2014). The results show that GmFMOl is needed for biosynthesis of amanitin. References
  • An expression system comprising at least one expression cassette comprising a
  • FMO flavin mono-oxygenase
  • peptide substrate contains one or more D-amino acid, N-methyl amino acid, beta amino acids, other atypical amino acid (e.g., ornithine, amino-isobutyric acid), or combinations thereof.
  • expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase that has macrocyclase activity.
  • prolyl oligopeptidase has at least 90%, or at least 91 %, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15.
  • a host cell comprising an expression system comprising at least one expression
  • FMO flavin mono -oxygenase
  • the host cell of statement 13, wherein the flavin mono-oxygenase has at least 90%, or at least 91 %, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
  • prolyl oligopeptidase has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15.
  • the host cell of statement 13-15 or 16 further comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a peptide substrate.
  • the host cell of statement 17 wherein the peptide substrate has about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about 50 amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids. 19.
  • a method comprising incubating the host cell of statement 13-18 or 19, for a time and under conditions sufficient for generation of a tryptathionine cross-bridge between a tryptophan (Tip) and a cysteine (Cys) in the peptide substrate.
  • a method comprising contacting a peptide substrate with a flavin mono-oxygenase that has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
  • peptide substrate has about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about 50 amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids.
  • peptide substrate is one or more D- amino acid, N-methyl amino acid, beta amino acids atypical amino acid (e.g., ornithine, amino-isobutyric acid), or combinations thereof.
  • reaction mixture further comprises
  • NADPH nicotinamide adenine dinucleotide phosphate
  • FAD flavin adenine dinucleotide
  • a flavin mono-oxygenase that has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7, and with at least one amino acid substitution.
  • An expression system comprising at least one expression cassette comprising a
  • FMO flavin mono-oxygenase
  • a host cell comprising the expression system of statement 37.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Microbiology (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

Flavin mono-oxygenase (FMO) enzymes, as well as expression cassettes and host cells for expressing such enzymes, and methods for using such FMO enzymes are described herein. The flavin mono-oxygenase enzymes are useful for generating peptidyl cross-bridges.

Description

Trvptathionine Biosynthesis bv Flavin Mono-Oxveenase 1 (FMOl)
This application claims benefit of priority to the filing date of U.S. Provisional Application Ser. No. 62/491,632, filed April 28, 2017, the contents of which are specifically incorporated herein by reference in their entity.
Government Funding
This invention was made with government support under GM088274 awarded by the
National Institutes of Health. The government has certain rights in the invention.
Background of the Invention
Cyclic and bicyclic peptides have been found in plants, bacteria, fungi, and animals (Arnison et al., Nat. Prod. Rep. 30:108-160 (2013); Umemura et al., Fung. Genet. Biol. 68:23-30 (2014)). Examples of macrocyclized peptides include the cyanobactins from marine bacteria (Donia et al., Nat. Chem. Biol. 2:729-735 (2006)) and the orbitides and cyclotides from plants (Craik & Malik, Curr. Opin. Chem. Biol. 17:546-554 (2013)). Cyclization confers several pharmacologically advantageous attributes to peptides, including increased rigidity, enhanced membrane permeability, and resistance to proteolysis and denaturation (Craik & Allewell, Biol. Chem. 287 :26999-27000 (2012)). As a result, many cyclic peptides have potent biological activities against a wide range of medically and ecologically significant targets (Bockus et al., Curr. Top. Med. Chem 13:821-836 (2013); Giordanetto & Kihlberg, J. Med. Chem. 57:278-295 (2014); Walton, Plant Cell. 8:1723-1733 (1996)).
Summary
Described herein are compositions and methods involving expression cassettes, enzymes, and methods useful for generating cross-bridges in peptides. The expression cassettes, and methods can involve using nucleic acids encoding a flavin mono-oxygenase (FMO) enzyme. Methods are also described for generating cross-bridges in peptides within in vitro (e.g., cell free) reaction mixtures. The cross-bridge enzymes can be used in conjunction with other enzymes such as enzymes that cyclize peptides. These cross-bridge and cyclization enzymes are useful for making a variety of crosslinked, bicyclic and/or cyclic peptides.
Described herein are expression systems that can include at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
Host cells containing such expression systems are also described herein, and methods for using such expression systems and host cells. For example, the methods can include incubating a host cell that has one or more expression systems that can include at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, S, 6, or 7, for a time and under conditions sufficient for generation of a
tryptathionine cross-bridge between a tryptophan (Tip) and a cysteine (Cys) in the peptide substrate. The expression systems, host cells, and methods can also include use of at least one expression cassette having a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase with at least 95% sequence identity to SEQ ID NO:ll, 13, 14, or 15.
In some cases, tryptathionine cross-bridges between tryptophan (Tip) and cysteine
(Cys) residues in the peptide substrate are formed in vivo (within a host cell) by FMOl while in other cases tryptathionine cross-bridges between tryptophan (Tip) and cysteine (Cys) residues in the peptide substrate are formed in vitro (not within a host) by FMOl. For example, in some cases additional cellular enzymes and other factors can help optimize cross- bridge formation and/or can facilitate cyclization of peptides. Accordingly, in some cases the cross-bridges are prepared in vivo.
However, the FMOl enzyme can be incubated in vitro with a peptide that includes tryptophan (Tip) and cysteine (Cys) residues to generate a peptide with a tryptathionine cross-bridge between the Tip and Cys residues. Hence, the FMOl proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes.
Description of the Figures
FIG. 1A-1B illustrates cyclization of peptides. FIG. 1A illustrates conversion of a 35- amino acid precursor peptide (SEQ ID NO: 19) to a cyclic octapeptide by a prolyl oligopeptidase B from Galerina marginata (GmPOPB) enzyme (SEQ ID NOs: 19 and 27- 29). As illustrated, GmPOPB catalyzes two reactions to convert the linear 35mer precursor peptide (SI) to cyclo(IWGIGCNP, SEQ ID NO:l), an intermediate in amanitin biosynthesis. Product P3 is then cross-linked (Tip to Cys) as illustrated in FIG. 2 and hydro xylated (3-4 times) to make bicyclic peptide (see also Luo et aL Chemistry and Biology 21:1610 (2014)). FIG. IB illustrates the reaction catalyzed by flavin mono-oxygenase (FMOl) to form tryptathionine. The substrate is the product of POPB.
FIG. 2A-2B illustrates how hydroxylation of an indole ring can activate the indole for further reaction including cross-linking to cysteine. FIG. 2Aillustrates 3 -hydroxylation of indole as catalyzed by an FMO from Methylophaga. FIG. 2B illustrates reaction via Savige- Fontana synthesis of tryptathionine from 3a-hydroxy-pyrrolo[2,3-b]-indole (Hpi), in which 3- hydoxyTrp is an intermediate. See, Cho et al., J Struct Biol 175:39-48 (2011); May JP, Perrin DM (2007) Tryptathionine bridges in peptide synthesis. Biopolymers 88:714-724.)
FIGs. 3A-3C illustrate in vitro expression of FMOl and its catalysis of a tryptathionine crossbridge product. FIG 3 A shows expression of FMOl as a 60-kDa protein fused to a myc tag. Left panel: Coomassie stained gel. Right panel: western blot of the same gel with anti-myc antibodies. FIG. 3B shows LC/MS analysis of substrate (monocyclic cyclo[IWGIGCNP], bottom) and product after reaction with FMOl (bicyclic[IWGIGCNP], top). FIG. 3C shows a mass spectrum of the product eluting at 12.0 min as shown in FIG. 3B, illustrating the parent ion at M+H+ 839 (tallest peak) as predicted for bicyclic compound.
FIGs. 4A-4B illustrate that targeted gene disruption of GmFMOl abolishes amanitin biosynthesis. FIG. 4A shows an HPLC trace (UV absorbance at 280 nm) illustrating amanitin made by wild type G. marginata (solid trace) and not by deletion mutant FMOl G.
marginata (dotted trace) cells. FIG. 4B shows a Southern blot of genomic DNA from deletion mutant FMOl G. marginata cells. The Southern blot shows that the FMOl gene has been deleted in the mutant. The results show that the FMOl gene is important for amanitin production.
Detailed Description
Cyclic peptides, bicyclic peptides and peptides with cross-bridges are useful as drugs due to their stability, high surface area, membrane permeability, and resistance to proteases. For example, the bicyclic peptide toxins of Amanita mushrooms, the amatoxins such as amanitin and the phallotoxins such as phalloidin, include a thioether cross-bridge between tryptophan (Trp) and cysteine (Cys). This cross bridge, called tryptathionine, contributes to the activities of these powerful compounds. Tryptathionine is not present in any other natural product, and its biosynthetic pathway is unknown. Amatoxins currently cannot be chemically synthesized and Amanita cannot be cultured, so the only current source is extraction from mushrooms collected in the wild. The enzyme that catalyzes tryptathione biosynthesis can be used for in vitro or in vivo production of a new class of bicyclic peptides with potential pharmaceutical or industrial applications, ultimately including the amatoxins themselves.
In the amanitin-producing fungus Galerina marginata, a gene encoding a flavin mono-oxygenase (FMO) is clustered with two genes in the amanitin biosynthetic pathway, prolyl oligopeptidase B (POPB) and AMAl. The specific versions of these three genes in G. marginata are also known as GmAMAl, GmFMOl, and GmPOPB. AMAl encodes the 35- amino acid pro-protein for amanitin, and POPB encodes the macrocyclase that converts the 35mer precursor peptide to a cyclic octapeptide, cyclo (IWGIGCNP, SEQ ID NO:l) (Luo et al., Fung Genet Biol 49:123 (2012); Luo et al. Chemistry and Biology 21:1610 (2014); Sgambelluri et al. ACS Synth Biol, 7 (1): 145-152 (2017)). Macrocyclases are enzymes that can convert a linear substrate into a circular (cyclic) product (e.g., a cyclic peptide). Genes for such enzymes are also present in Amanita bisporigera and Amanita phalloides (Pulman et al. BMC Genomics 17:1038 (2016).
FMOl catalyzes tryptathionine biosynthesis as shown by the following reaction.
Figure imgf000006_0001
The process by which this reaction proceeds is believed to proceed as illustrated below, where step (A) involves 3-hydroxylation of indole, like the reaction catalyzed by an FMO from Methylophaga. Step (B) is a reaction pathway for the Savige-Fontana synthesis of tryptathionine from the synthetic compound 3a-hydroxy-pyrrolo[2,3-b]-indole (Hpi), in which 3-hydoxyTrp is an intermediate.
Figure imgf000006_0002
After formation of the tryptathionine cross bridge by FMOl, peptides can undergo one or more hydroxylations and sulfoxidations. For example, after formation of a tryptathionine cross bridge, the remaining steps in amanitin biosynthesis include two to four hydroxylations and sulfoxidation. Identification of the tryptathionine forming enzyme FMOl is a key missing link.
Flavin mono -oxygenase (FMO) nucleic acids and enzymes are described herein that are useful for generating crosslinked and/or cyclized peptides. For example, a nucleic acid encoding a Galerina marginata FMOl (GmFMOl) enzyme was isolated as a cDNA. The
Galerina marginata FMOl (GmFMOl) protein encoded by such a cDNA is shown below SEQ ID NO:2.
Figure imgf000007_0001
The region that includes amino acid positions 34-105 of the SEQ ID NO:2 FMOl protein includes a Rossmann-fold NAD(P)(+)-binding protein domain.
A nucleic acid sequence (cDNA) encoding the SEQ ID NO:2 Galerina marginata
FMOl is shown below as SEQ ID NO:3.
Figure imgf000007_0002
Figure imgf000008_0001
Figure imgf000009_0001
Figure imgf000010_0001
Figure imgf000011_0001
Figure imgf000012_0001
This FM01 -related protein from Amanita muscaria has accession number KJL66345.1 and the following sequence (SEQ ID NO:7).
Figure imgf000012_0002
In some embodiments, a cDNA clone encoding a FMOl protein is synthesized, isolated, and/or obtained from a selected cell. In other embodiments, cDNA clones from other species (that encode a FMOl -related protein) are isolated from selected fungal tissues. For example, the nucleic acid encoding a FMOl protein can be any nucleic acid with a coding region that hybridizes to SEQ ID NO:3. Such a FMOl -related nucleic acid can encode a protein that has FMOl activity. In another example, the FMOl nucleic acid can encode a FMOl protein with an amino acid sequence that has at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:2, 4, 5, 6, or 7. Using restriction endonucleases, the entire coding sequence for the FMOl nucleic acid is subcloned downstream of the promoter in a 5 ' to 3 ' sense orientation.
In some cases, the FMOl proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes. The FMOl proteins in these compositions, mixtures, and processes can have amino acid sequences that have at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:2, 4, 5, 6, or 7.
Peptide Cyclization Enzymes
In some cases, the FMOl nucleic acids and/or FMOl enzymes can be used in conjunction with other enzymes. For example, the FMOl nucleic acids and/or FMOl enzymes can be co-expressed or combined in vitro with other enzymes that can cychze peptides.
One example of an enzyme that can cyclize peptides is a prolyl oligopeptidase enzyme. As used herein, "prolyl oligopeptidase" or "POP" refers to a member of a family of enzymes classified and referred to as EC 3.4.21.26-enzymes that are capable of cleaving a peptide sequence, for example via hydrolysis of Pro-l-Xaa» Ala-I-Xaa in oligopeptides. Prolyl oligopeptidases are also referred to as "post-proline cleaving enzyme," "proline- specific endopeptidase," "post-proline endopeptidase," "proline endopeptidase," "endoprolyl peptidase," "prolyl endopeptidase," "post-proline cleaving enzyme," "post-proline endopeptidase," and "prolyl endopeptidase." POP A enzymes are found in the majority of mushrooms. POPB enzymes generally have approximately about 50% to 60% (e.g. 55%) amino acid identity to POP A enzymes. The POPB enzymes are primarily found in Amanita,
Galerina, and Lepiota peptide-producing mushroom species.
One example of wa. Amanita bisporigera prolyl oligopeptidase (POP A) sequence is shown below as SEQ ID NO:8.
Figure imgf000013_0001
Figure imgf000014_0001
An example of a nucleic acid (cDNA) sequence that encodes the Amanita bisporigera prolyl oligopeptidase (POPA) protein is shown below as SEQ ID NO:9.
Figure imgf000014_0002
Figure imgf000015_0001
The following comparison of Amanita bisporigera (SEQ ID NO:8) and Galerina marginata (SEQ ID NO: 10) prolyl oligopeptidase (POPA) sequences is shown below, illustrating conserved and non-conserved sequence regions or domains, where the asterisks illustrate which amino acids are the same between these two proteins.
Figure imgf000016_0001
As illustrated, the Amanita bisporigera (SEQ ID N0:8) and Galerina marginata (SEQ ID NO: 10) prolyl oligopeptidase (POP A) sequences have about 65% sequence identity.
However, POP A from Amanita bisporigera and Galerina marginata do not have significant macrocyclase activity, and instead POPA enzymes often just cut small peptides at proline residues.
Prolyl oligopeptidase B (POPB) is an enzyme that can cyclize peptides. An example of an Amanita bisporigera prolyl oligopeptidase (POPB) sequence is shown below as SEQ
ID N :ll .
Figure imgf000017_0001
An example of a nucleic acid (cDNA) sequence that encodes the Amanita bisporigera prolyl oligopeptidase (POPB) protein is shown below as SEQ ID NO:12.
Figure imgf000017_0002
Figure imgf000018_0001
Figure imgf000019_0001
Figure imgf000020_0001
Figure imgf000021_0001
Figure imgf000022_0001
Figure imgf000023_0001
In some embodiments, a cDNA clone encoding a cyclization enzyme such as POPB enzyme is synthesized, isolated, and/or obtained from a selected cell. In other embodiments, cDNA clones from other species (mat encode a POP-related protein) are isolated from selected fungal tissues. For example, the nucleic acid encoding an enzyme (e.g. with cyclization activity) can be any nucleic acid with a coding region that hybridizes to SEQ ID NO:l 1, 13, 14, or 15. Such a cyclization-related nucleic acid can encode a protein that has POP activity.
In another example, the POP nucleic acid can encode a POP protein with an amino acid sequence that has at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15. Using restriction endonucleases, the entire coding sequence for a selected POP nucleic acid segment is subcloned downstream of the promoter in a 5' to 3' sense orientation.
In some cases, the cyclization proteins can be used in in vitro compositions, assay mixtures, and/or enzymatic manufacturing processes. The cyclization proteins in these compositions, mixtures, and processes can have amino acid sequences that have at least 90%, or at least 95%, or at least 96%, or at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 11 , 13, 14, or 15.
Expression Cassettes
The cross-bridge and cyclization enzymes described herein can be expressed from expression cassettes. Such an expression cassette (also referred to as a transgene) can include a promoter operably linked to a nucleic acid segment that encodes any of the FMOl or other enzymes described herein. The terms "expression vector" and "expression cassette" refer to a recombinant nucleic acid molecule containing a desired nucleic acid coding segment and appropriate nucleic acid sequences for the expression of the operably linked nucleic acid coding segment in a particular host cell or organism. Nucleic acid sequences for expression of proteins in prokaryotes usually include a promoter, an operator (optional), and a ribosome-binding site, often along with other sequences. Eukaryotic cells can utilize promoters, enhancers, and termination and polyadenylation signals.
Nucleic acid sequences for expression of proteins can be heterologous to the nucleic acid coding segment. A "heterologous nucleic acid," "heterologous gene" and "heterologous protein" indicate a nucleic acid, gene or protein, respectively, which has come from a source other than its native source. The terms "heterologous" and "exogenous" are sometimes used interchangeably with "recombinant." Such heterologous nucleic acid, heterologous genes, and/or heterologous proteins may have been artificially supplied to the biological system or host cell. In contrast, the terms "endogenous protein," "native protein," "endogenous gene," and "native gene" refer to a protein or gene that is native to the biological system, cell, species or chromosome under study. A "native" or "endogenous" nucleic acid is a nucleic acid that does not contain nucleotides or other features normally present in the chromosome in which it is normally found in nature.
Promoters provide for expression of mRNA from the FMOl and other nucleic acids. In some cases, the promoter can be a FMOl native promoter or a promoter that expresses a native cyclization enzyme. However, the promoter can in some cases be heterologous to the FMOl nucleic acid segment. Cyclization enzyme can also be expressed from heterologous promoters. In other words, such a heterologous promoter may not be naturally linked to such a FMOl and/or other nucleic acid segment. Instead, some expression cassettes and expression vectors have been recombinantly engineered to include a FMOl or other nucleic acid coding segment operably linked to a heterologous promoter. A FMOl or other nucleic acid coding segment is operably linked to the promoter, for example, when it is located downstream from the promoter.
A variety of promoters can be included in the expression cassettes and/or expression vectors. In some cases, the endogenous FMOl promoter, or an endogenous promoter that expresses a native cyclization enzyme, can be employed. Promoter regions are typically found in the flanking DNA upstream from the coding sequence in both prokaryotic and eukaryotic cells. A promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about SO to about 2,000 nucleotide base pairs. Promoter sequences can also contain regulatory sequences such as enhancer sequences that can influence the level of gene expression. Some isolated promoter sequences can provide for gene expression of heterologous DNAs, that is a DN A different from the native or homologous DNA.
Promoters can be strong or weak, or inducible. A strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a very low level of gene expression. An inducible promoter is a promoter that provides for the turning on and off gene expression in response to an exogenously added agent, or to an environmental or developmental stimulus. For example, a bacterial promoter such as the P^ promoter can be induced to vary levels of gene expression depending on the level of isothiopropylgalactoside added to the transformed cells. Promoters can also provide for cell specific, tissue specific or developmental regulation. A strong promoter for heterologous DNAs can be advantageous because it provides for a sufficient level of gene expression for easy detection and selection of transformed cells and provides for a high level of gene expression when desired. In some cases, the promoter within such expression cassettes / vectors can be functional in a fungal cell.
Expression cassettes / vectors can include, but are not limited to, fungal promoters of genes such as isocytochrome CI (CYC1), alcohol dehydrogenase (ADH1), transcription elongation factor (TEF), yeast glyceraldehyde phosphodehydrogenase (GAPDH), yeast phosphoglycerate kinase (PGK below), yeast cytochrome b5, Yb5 promoter, or a combination thereof.
Examples of other promoters, which may be useful include fungal and bacterial promoters. Some specific nonlimiting examples include; the aprE promoter or a mutant aprE promoter (see, WO 01/51643); the aph promoter of the Streptomycesfradiae aminoglycoside 3'-phosphotransferase gene; an Aspergillus niger glucoamylase (glaA) promoter, the glucose isomerase (GI) promoter of Actinoplanes missouriensis and the derivative GI (GIT) promoter (U.S. Pat. No. 6,562,612 and EPA 351029); the glucose isomerase (GI) promoter from Streptomyces lividans (e.g., SEQ ID NO: 1 in WO 03/089621), the short wild-type GI promoter (SEQ ID NO: 33 in WO 03/089621), the 1.5 GI promoter (SEQ ID NO: 31 in WO 03/089621), the 1.20 GI promoter (SEQ ID NO: 32 in WO 03/089621 ), or any of the variant GI promoters (SEQ ID NOs: 9-28 in WO 03/089621) as disclosed in WO 03/089621; the cbhl, cbh2, egll and egl2 promoters from filamentous fungi and specifically the Trichoderma reesei cellobiohydrolase promoter (GenBank Accession No. D86235); the lacZ and tac promoters (Bagdasarion et al., 1983, Gene 26:273-282); the ermE promoter (Ward et aL, 1986, Mol. Gen. Genet. 203:468-478 and Schmitt-John et al., 1992, Appl. Microbiol BiotechnoL 36:493-498); and the Bacillus subtilis phage 029 promoters (Pulido et al., 1986, Gene 49:377-382). Promoters effective in Streptomyc.es are listed in Hopwood et al., (Hopwood et al., Regulation of Gene Expression in Antibiotic-producing Streptomyces. In Booth, 1. and Higgins, C. (Eds) SYMPOSIUM OF THE SOCIETY FOR GENERAL MICROBIOLOGY, REGULATION OF GENE EXPRESSION, Cambridge University Press, 1986 pgs. 251-276).
Streptomyces phage promoters are also disclosed in Labes et al., 1997, Microbiol. 143:1503- 1512. Other promoters mat can be used are described in Deuschle et aL, 1986 EMBO J. 5:2987-2994 and WO 96/00787.
Plant promoters can also be used such as the CaMV 35S promoter (Odell et al., Nature. 313:810-812 (1985)), or others such as CaMV 19S (Lawton et al., Plant Molecular Biology. 9:315-324 (1987)), nos (Ebert et al., Proa Natl. Acad. ScL USA. 84:5745-5749 (1987)), Adhl (Walker et aL, Proc. Natl Acad. Sci. USA. 84:6624-6628 (1987)), sucrose synthase (Yang et al., Proc. Natl. Acad Sci. USA. 87:4144-4148 (1990)), a-tubulin, ubiquitin, actin (Wang et al., Mol. Cell. Biol. 12:3399 (1992)), cab (Sullivan et al., Mol Gen. Genet. 215:431 (1989)), PEPCase (Hudspeth et al., Plant Molecular Biology. 12:579-589 (1989)) or those associated with the R gene complex (Chandler et al., The Plant Cell. 1:1175-1183 (1989)). Further suitable promoters include the poplar xylem-specific secondary cell wall specific cellulose synthase 8 promoter, cauliflower mosaic virus promoter, the Z10 promoter from a gene encoding a 10 kD zein protein, a Z27 promoter from a gene encoding a 27 kD zein protein, inducible promoters, such as the light inducible promoter derived from the pea rbcS gene (Coruzzi et al., EMBO J. 3:1671 (1984)) and the actin promoter from rice
(McElroy et al., The Plant Cell. 2:163-171 (1990)). Seed specific promoters, such as the phaseolin promoter from beans, may also be used (Sengupta-Gopalan, Proc. Natl. Acad. Sci. USA. 83:3320-3324 (1985).
A FMOl nucleic acid or another enzyme-encoding nucleic acid segment can be combined with a promoter by standard methods to yield an expression cassette or transgene, for example, as described in Sambrook et al. (MOLECULAR CLONING: A LABORATORY MANUAL. Second Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (1989);
MOLECULAR CLONING: A LABORATORY MANUAL. Third Edition (Cold Spring Harbor, NY: Cold Spring Harbor Press (2000)). Briefly, a plasmid containing a promoter can be constructed as described in Jefferson {Plant Molecular Biology Reporter 5:387-405 (1987)) or obtained from Takara/Clontech Lab in Mountain View, California (e.g., pBI121 or pBI221). Typically, these plasmids are constructed to have multiple cloning sites having specificity for different restriction enzymes downstream from the promoter. The FMOl or other enzyme encoding nucleic acids can be subcloned downstream from the promoter region using restriction enzymes and positioned to ensure mat the nucleic acid segment is inserted in proper orientation with respect to the promoter so that the coding segment can be expressed as sense or antisense RNA. Once the FMOl or other enzyme nucleic acid is operably linked to a promoter, the expression cassette so formed can be subcloned into a plasmid or other vector (e.g., an expression vector, such as the pESC-HIS vector illustrated herein for expression of GmFMOl).
In some cases, an endogenous FMOl gene can be modified to generate modified host cells and the host cells can express a modified FMOl protein. Mutations can be introduced into FMOl -expressing cell genomes by introducing targeting vectors, T-DNA, transposons, nucleic acids encoding TALENS, CRISPR, or ZFN nucleases, and combinations thereof into a recipient host cell to create a transformed cell.
In some cases, the endogenous FMOl or cyclization enzyme gene can be deleted and host cells with such a deleted endogenous FMOl or cyclization enzyme gene can be transformed to include a modified FMOl or cyclization enzyme transgene, for example, by transformation of the host cells with a FMOl or cyclization enzyme expression cassette or expression vector.
Accordingly, the FMOl and/or cyclization enzyme can be encoded within expression cassettes and expressed in host cells.
The host cells can be any eukaryotic or prokaryotic cell. In some cases, a host cell can be any fungal cell or microorganism. Several types of fungi are available for use as host cells. For example, a host cell may be located in a transgenic mushroom or other fungus such as Saccharomyces cerevisiae (yeast). Prokaryotes include but are not limited to gram negative or positive bacterial cells. Numerous cell lines and cultures are available for use as host cells. Host cells can be obtained through the American Type Culture Collection (ATCC), an organization that serves as an archive for living cultures and genetic materials (atcc.org).
An appropriate host can be determined by one of skill in the art based on the vector nucleic acid sequence and the desired result.
Bacterial cells used as host cells for expression vector replication and/or expression include, among those listed elsewhere herein, DH5 -alpha, JM109, and KC8, as well as a number of commercially available bacterial hosts such as SURE™. Competent Cells and SOLOPACK.TM Gold Cells (Stratagene, La Jolla). Alternatively, bacterial cells such as E. coli LE392 can be used as host cells.
In some embodiments, a host cell is used as a recipient for heterologous nucleic acids. A host cell may be "transfected" or "transformed," which refers to a process by which exogenous nucleic acid is transferred or introduced into the host cell. An expression cassette, plasmid or cosmid, for example, can be introduced into a host cell, where the expression cassette, plasmid or cosmid can be integrated into the host cell genome or autonomously replicated. A transformed cell includes the primary subject cell and its progeny.
Examples of fungal cells that can serve as hosts include yeast cells, mold cells, and mushroom cells. For example, the following types of cells can serve as host cells:
Acremonium chrysogenum, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Cochliobolus carbonum, Coprinus cinereus, Mucor circinelloides, Neurospora crassa, Saccharomyces cerevisiae, Tolypocladium geodes, Trichoderma reesei, Ustilago maydis, and the like.
Peptide Substrates
Peptides are substrates for the FMOl and/or cyclization enzymes described herein. A variety of different peptide substrate sequences can be cross-linked and/or cyclized. For example, although the product of POPB, cyclo(IWGIGCNP), may be a natural substrate for FMOl, FMOl expressed in a cell-free in vitro system exhibited nonspecific FMO activity, indicating that the FMOl enzyme can be active on a variety of peptide substrates.
Peptide substrates can be of different sizes or lengths, especially when cyclization enzymes are present with FMOl enzymes. For example, peptide substrates can have about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about SO amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids.
Peptide substrates can contain conventional and/or nonconventional amino acids. In some cases, the peptide substrates can contain hydroxylated amino acids, D-amino acids, atypical amino acids (e.g., ornithine, amino-isobutyric acid), N-methylated amino acids (e.g., N-methyl glycine), beta- amino acids (beta- alanine), or combinations thereof. For example, cyclo(IWGIGCNP) can be a substrate for FMOl crosslinking, where the lie, the Tip, and/or the Pro are pre-hydroxylated. POPB can cyclize peptides containing hydroxylated, D-amino acids, N-methyl-amino acids, and beta-amino acids. FMOl can also bicyclize cyclic peptides forms. Precursor peptides that are cyclized can serve as peptide substrates for FMO enzymes. Alternatively, the precursor peptides can be linear and one or more cyclization enzymes can be used with the FMO enzymes.
The peptide substrates can be made by biosynthesis of peptide substrate precursors in a host cell such as E. coli. In other cases, peptide substrates with or without unusual amino acids can be chemically synthesized.
In some cases, synthetic peptide substrates can be modified. To produce cyclized / crosslinked peptides with different sequences, peptide substrates can be synthesized by recombinant host cells that have expression cassettes with modified coding region segments. For example, selected nucleotides of synthetic G. marginata alpha-amanitin sequences can be changed to make other types of peptide toxins and peptides. In one example, replacing the codon AAC (Asn) with GAC (Asp) provides beta-amanitin instead of alpha-amanitin. Beta- amanitin production in G. marginata can readily be detected by reverse-phase HPLC.
Figure imgf000029_0002
Alpha-amanitin, for example,
Figure imgf000029_0003
(also referred to as c
Figure imgf000029_0005
Figure imgf000029_0004
Figure imgf000029_0001
Beta-amanitin has the following structure.
Figure imgf000030_0001
Phallacidin, for example, is c
Figure imgf000030_0004
Figure imgf000030_0003
and it has the following structure.
Figure imgf000030_0002
Various amino acids in precursor peptides can be modified to make non-natural amanitin derivatives. One example of a non-natural amanitin derivative can involve replacing Gly with Ala by replacing GGT with GCT in a nucleic acid segment encoding a precursor peptide for amanitin.
The peptide substrates can contain Trp and Cys at positions convenient for forming tryptathionine cross-bridges (e.g., Trp and Cys may in some cases not be adjacent in the primary structure or sequence of the peptide substrates). Linear and cyclic peptides of at least six, seven, eight, nine, ten or more amino acids can be made that include the following sequence: XWXXXCXP (SEQ ID NO:16), where X is any amino acid. The proline can be retained in these peptides for correct processing by POP, and the presence of Trp (W) and Cys (C) allow for biosynthesis of tryptathionine, a unique hallmark of the Amanita toxin peptides. Peptide sequences mat include MDSTN (SEQ ID NO:17), TRIPL (SEQ ID NO:18), or other peptide sequences with prolines in conserved positions can be used in the peptide substrates, for example, when forming a cyclized peptide product.
The peptide substrates can have an amino acid sequence that includes a start site (e.g., methionine), a leader sequence (which is cleaved by POPB), a follower peptide that is removed when POPB cyclizes the core region (IWGIGCNP; SEQ ID NO:l) by
transpeptidation, and a smaller peptide representing the amino acid sequence of the final peptide product (the core, IWGIGCNP; SEQ ID NO:l). One example of a peptide substrate (an exemplary precursor peptide) that includes an amino acid sequence for mature ama toxins and phallotoxins is illustrated as: MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC (SEQ ID NO: 19), where the peptide mat is cyclized is underlined. See also FIG. 1 A. This is an example of an excellent peptide substrate for POPB.
Expression of synthetic peptides and peptide toxins can be monitored by assays including but not limited to, LC / MS and HPLC. Neither the precursor peptide nor the cyclic toxin peptide products are secreted. Hence, the cyclic and/or cross-linked products can be found inside host cells.
Separation of peptide products from endogenous peptides and proteins (e.g. peptides and proteins produced from genomic DNA originally contained in host cells) can be by standard techniques including but not limited to HPLC methods. Isolated peptide products can be evaluated in assays for assessing structure and/or biological activity. Peptide structures can be assessed by high resolution MS and NMR. The toxicity of peptide products such as synthetic amanitin toxins can be determined in assays that measure inhibition of transcription in eukaryotic cells, such as capability to inhibit RNA Polymerase II or other assays such as binding to actin, immunosuppression, or blocking the mitochondrial permeability transition pore. Useful peptide products can be produced in bulk (either within host cells or by reaction mixtures containing crosslinking and/or cyclization enzymes).
Table 1 illustrates exemplary sequences encoded by AMA1 and PHA1 genes.
Table 1: Exemplary AMA1 and PHA1 comparisons
Figure imgf000031_0001
Figure imgf000032_0001
Methods
The cyclic and/or crosslinked peptides that can be made using the enzymes described herein can have useful properties. For example, cyclic, crosslinked peptide toxins of Amanita mushrooms are chemically and biologically unique among natural products. Such toxins have uses as biochemical reagents and as anti-cancer therapeutics, for example, in targeted therapeutics such as antibody-drug conjugates. The enzymes described herein can be used to biosynthcsize many types of cyclic / crosslinked peptides.
Cyclic and/or crosslinked peptides can be made in vivo (e.g., in host cells or host organisms) or in \dtro (e.g., in a reaction mixture containing one or more enzymes, a peptide substrate, and any other cofactors or other reactants that may facilitate the cyclization or crosslinking reaction).
For in vivo production of cyclic / crosslinked peptides, peptide substrates can be co- expressed in a host cell with a FMOl and/or other cyclization enzyme. For example, an expression cassette can be used that has a promoter operably linked to a nucleic acid segment encoding a peptide substrate. Such peptides can then be expressed with FMOl and/or cyclization enzymes so that cyclization and/or cross-bridge formation can occur in a host cell. The expression cassette encoding the peptide substrate can have any of the promoters described herein. Similarly, the FMOl and/or cyclization enzymes can be expressed from expression cassettes within the host cells. After incubation of the host cells in a culture medium that supports the growth and functioning of the host cells, the cyclic / crosslinked peptides can be isolated from the cell culture.
For in vitro production of cyclic / crosslinked peptides, peptide substrates can be mixed with one or more FMOl and/or other cyclization enzymes in a reaction mixture. The reaction mixture can contain other factors to enhance the formation of cyclic / crosslinked peptide products. For example, the reaction can contain one or more of the following:
NADPH, FAD, ATP, oxygen, dithiothreitol to maintain free sulfhydryls, buffer (to maintain the pH), or metal cofactors.
The reaction temperature that allows synthesis of peptide is 0 to 60° C, or 5 to 40° C.
In addition, the reaction pH that allows synthesis of peptide is 6.S to 10.S, and preferably 6.S to 9.5.
For example, bicyclic peptides can be produced using the enzymes and methods described herein. Host cells such as yeast, bacterial (e.g., E. coli), or Calerina cells can be transformed with one or more copies of an expression cassette encoding a substrate precursor peptide useful for making a cyclic or bicyclic peptide, and one or more expression cassettes encoding FMOl and/or cyclization enzymes. Alternatively, for example, a reaction mixture of peptide substrates for such cyclic or bicyclic peptides as well as FMOl and/or cyclization enzymes can be incubated in vitro to generate the cyclic or bicyclic product. Definitions
The use of the article "a" or "an" is intended to include one or more.
As used herein, terms defined in the singular are intended to include those terms defined in the plural and vice versa.
As used herein, "peptide" refers to compounds containing two or more amino acids linked by the carboxyl group of one amino acid to the amino group of another, i.e. "peptidyl linkages" to form an amino acid sequence. It is contemplated that peptides may be purified and/or isolated from natural sources or prepared by recombinant or synthetic methods. Amino acid sequences may be encoded by naturally or non-naturally occurring nucleic acid sequences or synthesized by recombinant nucleic acid sequences or artificially synthesized. A peptide may be a linear peptide or a cyclopeptide, i.e. cyclic peptide including bicyclic feature.
As used herein, "cyclic peptide" or "cyclopeptide" in general refers to a peptide comprising at least one internal bond attaching nonadjacent amino acids of the peptide, such as when the end amino acids of a linear sequence are attached to form a circular peptide. A "bicyclic peptide" may have at least two internal bonds forming a cyclopeptide of the present inventions, such as when the end amino acids of a linear sequence are attached to form a circular peptide in addition to another internal bond attaching two nonadjacent amino acids, for example, see e.g. FIG. 1.
As used herein, the term "Amanita peptide" or "Amanita toxin" or "Amanita peptide toxin" refers to any linear or cyclic peptide produced by a mushroom, not restricted to a biologically active toxin. It is not intended that the present invention be limited to a toxin or a peptide produced by an Amanita mushroom and includes similar peptides and toxins produced by other fungi, including but not limited to species of Lepiota, Conocybe, Galerina, and the like. An Amanita peptide toxin resembles any of the amatoxins and phallotoxins, such as similarity of amino acid sequences, matching toxin motifs as shown herein, encoded between the conserved regions (A and B) of their proproteins, encoded by hypervariable regions of their proproteins (P) and the like. The Amanita peptides include, but are not restricted to, amatoxins such as the amanitins, and phallotoxins such as phalloidin and phallacidin. For example, an exemplary Amanita peptide in one embodiment ranges from 6- 17 amino acids in length. In another embodiment an Amanita peptide toxin ranges from 7-11 amino acids in length. In one embodiment, an Amanita peptide is linear. In another embodiment, an Amanita peptide is a bicyclic peptide. It is not meant to limit an Amanita peptide to a naturally produced peptide. In some embodiments, an Amanita peptide has an artificial sequence, in other words a nucleic acid encoding an artificial peptide sequence was not naturally found in a fungus or found encoded by a nucleic acid sequence isolated from a fungus.
As used herein, "biologically active" refers to a peptide that when contacted with a cell, tissue or organ induces a biological activity, such as stimulating a cell to divide, causing a cell to alter its function (e.g., altering T cell function, causing a cell to change expression of genes, etc.)
The terms "peptide," "polypeptide," "propeptide," "propolypeptide," "prepropeptide," "prepropolypeptide," "precursor peptide", and "protein" in general refer to a primary sequence of amino acids that are joined by covalent "peptide linkages. " Polypeptides may encompass either peptides or proteins. In general, a peptide consists of a few amino acids, and is shorter than a protein. "Amino acid sequence" and like terms, such as "polypeptide" or "protein" are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
As used herein, the term "synthetic" or "artificial" in relation to a peptide sequence refers to a peptide made either artificially from covalently bonding amino acids, such as by Peptide Synthesizer, (for example, Applied Biosystems) or a peptide derived from an amino acid sequence encoded by a recombinant nucleic acid sequence.
As used herein, the term "toxin" refers to a substance (for example, a cyclopeptide) that is detrimental (Le., poisonous) to cells and/or organisms, in particular a human organism In some cases, the peptides made by the enzymes and methods described herein are not toxins.
As used herein, "amatoxin" generally refers to a family of peptide compounds, related to and including the amanitins. An amatoxin refers to any small peptide, linear and cyclic, comprising an exemplary chemical structure (e.g., as illustrated herein, see also FIG. 1) or encoded by a nucleic acid sequence. Such an amatoxin nucleic acid sequence and/or proprotein can have higher sequence homology to amanitin (or e.g., an AMAl nucleic acid) than to an analogous sequence of a phallotoxin (or e.g., a PHAl nucleic acid). In some cases, the peptides made by the enzymes and methods described herein are not amatoxins or phallotoxins. However, in other cases, the enzymes and methods described herein can efficiently make amatoxins and/or phallotoxins.
As used herein, "phallotoxin" generally refers to a family of peptide compounds, related to and including phallacidin and phalloidin. A phallotoxin refers to any small peptide encoded by nucleic acid sequences where the nucleic acid sequence and/or proprotein has a higher sequence homology to a phallotoxin (or e.g., a PHA1 nucleic acid) than to an analogous sequence of an amatoxin (or e.g., an AMA1 nucleic acid).
As used herein the term "microorganism" refers to microscopic organisms and taxonomically related macroscopic organisms within the categories of algae, bacteria, fungi (including lichens), protozoa, viruses, and subviral agents.
The terms "eukaryotic" and "eukaryote" are used in the broadest sense. It includes, but is not limited to, any organisms containing membrane bound nuclei and membrane bound organelles. Examples of eukaryotes include but are not limited to animals, plants, algae, diatoms, and fungi.
The terms "prokaryote" and "prokaryotic" are used in the broadest sense. It includes, but is not limited to, any organisms without a distinct nucleus. Examples of prokaryotes include but are not limited to bacteria, bhie-green algae (cyanobacteria), archaebacteria, actinomycetes and mycoplasma. In some cases, a host cell is any microorganism.
As used herein, the term "fungi" is used in reference to eukaryotic organisms such as mushrooms, rusts, molds and yeasts, including dimorphic fungi "Fungus" or "fungi" also refers to a group of lower organisms lacking chlorophyll and dependent upon other organisms for source of nutrients.
As used herein, "mushroom" refers to the fruiting body of a fungus.
As used herein, "fruiting body" refers to a reproductive structure of a fungus which produces spores, typically comprising the whole reproductive structure of a mushroom including cap, gills and stem, for example, a prominent fruiting body produced by species of Ascomycota and Basidiomycota, examples of fruiting bodies are "mushrooms,"
"carpophores," " toadstools," "puffballs", and the like.
As used herein, "fruiting body cell" refers to a cell of a cap or stem which may be isolated or part of the structure.
As used herein, "spore" refers to a microscopic reproductive cell or cells.
As used herein, "mycelium" refers to a mass of fungus hyphae, otherwise known as a vegetative portion of a fungus.
As used herein, "Basidiomycota" in reference to a Phylum or Division refers to a group of fungi whose sexual reproduction involves fruiting bodies comprising basidiospores formed on club-shaped cells known as basidia.
As used herein, "Basidiomycetes" in reference to a class of Phylum Basidiomycota refers to a group of fungi. Basidiomycetes include mushrooms, of which some are rich in cyclopeptides and/or toxins, and includes certain types of yeasts, rust and smut fungi, gilled- mushrooms, puffballs, polypores, jelly fungi, brackets, coral, mushrooms, boletes, puffballs, stinkhorns, etc.
As used herein, "Ascomycota" or "ascomycetes" in reference to members of a fungal Phylum or Division refers to a "sac fungus" group. Of the Ascomycota, a class
"Ascomycetes" includes Candida albicans, unicellular yeast, Morchella esculentum, the morel, and Neuwspom crassa. Some ascomycetes cause disease, for example. Candida albicans causes thrush and vaginal infections; or produce chemical toxins associated with diseases, for example, Aspergillus flavus produces a contaminant of nuts and stored grain called aflatoxin, that acts both as a toxin and a deadly natural carcinogen.
As used herein "Amanita" refer to a genus of fungus whose members comprise poisonous mushrooms, e.g., Amanita (A.) bisporigera, A. virosa, A. ocreata, A. suballiacea, and A. tenuifolia which are collectively referred to as "death angels" or "Destroying Angels" and "Amanita phalloides" or "A. phalloides var. alba" or "A. phalloides var. verna" or "A. verna", referred to as "death cap." The toxins of these mushrooms frequently cause death through liver and kidney failure in humans. Not all species of this genus are deadly, for example, Amanita muscaria, the fly agaric, induces gastrointestinal distress and/or hallucinations while others do not induce detectable symptoms.
As used herein, nonribosomal peptide synthetase (NRPS) is an enzyme that catalyzes the biosynthesis of a small (20 or fewer amino acids) peptide or depsipeptide, linear or circular, and is composed of one or more domains (modules) typical of this class of enzyme. Each domain is responsible for aminoacyl adenylation of one component amino acid. NRPSs can also contain auxiliary domains catalyzing, e.g., N-methylation and amino acid epimerization (Walton, et al., in Advances in Fungal Biotechnology for Industry, Agriculture, and Medicine, et al., Eds. (Kluwer Academic/Plenum, N.Y., 2004, pp. 127-162; Finking, et al., (2004) Annu Rev Microbiol 58:453-488, all of which are herein incorporated by reference). Examples are gramicidin synthetase, HC-toxin synthetase, cyclosporin synthetase, and enniatin synthetase.
As used herein, the terms "cell," "cell line," "cell population," and "cell culture" may be used interchangeably. All of these terms also include their progeny, which are any and all subsequent generations. It is understood that all progeny may not be identical due to deliberate or inadvertent mutations.
As used herein, "host fungus cell" refers to any fungal cell, for example, a yeast cell, a mold cell, and a mushroom cell (such as Neurospora crassa, Aspergillus nidulans,
Cochliobolus carbonum, Coprinus cinereus. Ustilago maydis, and the like). As used herein, the term "Fungal expression system" refers to a system using fungi to produce (express) enzymes and other proteins and peptides. Examples of filamentous fungi which are currently used or proposed for use in such processes are Neurospora crassa, Acremonium chrysogenum, Tofypocladium geodes, Mucor circineUoides, Trichoderma reesei, Aspergillus nidul ns, Aspergillus niger, Coprinus cinereus, Aspergillus oryzfle, etc. Further examples include an expression system for basidiomycete genes (for example, Gola. et al., (2003) J Basic Microbiol. 43(2):104-12; herein incorporated by reference) and fungal expression systems using, for example, a monokaryotic laccase-deficient Pycnoporus cinnabarinus strain BRFM 44 (Banque de Resources Fongiques de Marseille, Marseille, France), and SchizpphyUum commune, (for example, Alexandra, et al., (2004) Appl Environ Microbiol. 70(ll):6379-638; Lugones, et al., (1999) Mol. Microbiol. 32:681-700; Schuren, et al., (1994) Curr. Genet. 26:179-183; all of which are herein incorporated by reference).
The term "transgene" as used herein refers to a foreign gene, such as a heterologous gene, that is placed into an organism by, for example, introducing the foreign gene into cells or primordial tissue. The term "foreign gene" refers to any nucleic acid (e.g., gene segment) that is introduced into the genome of a host cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene does not reside in the same location as does the naturally-occurring gene.
As used herein, the term "vector" is used in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another. The term "vehicle" is sometimes used interchangeably with "vector." A vector "backbone" comprises those parts of the vector which mediate its maintenance and enable its intended use (e.g., the vector backbone may contain sequences necessary for replication, genes imparting drug or antibiotic resistance, a multiple cloning site, and possibly operably linked promoter and/or enhancer elements which enable the expression of a cloned nucleic acid). The cloned nucleic acid (e.g., such as a cDNA coding sequence, or an amplified PCR product) is inserted into the vector backbone using common molecular biology techniques.
A "recombinant vector" indicates that the nucleotide sequence or arrangement of its parts is not a native configuration, and has been manipulated by molecular biological techniques. The term implies that the vector is comprised of segments of DNA that have been artificially joined.
As used herein, "recombinant nucleic acid" or "recombinant gene" or "recombinant DNA molecule" or "recombinant nucleic acid sequence" indicates that the nucleotide sequence or arrangement of its parts is not a native configuration, and has been manipulated by molecular biological techniques. The term implies that the DNA molecule is comprised of segments of DNA that have been artificially joined together. Protocols and reagents to manipulate nucleic acids are common and routine in the art (See e.g., Maniatis et al. (eds.), Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY.
[1982]; Sambrook et al. (eds.), Molecular Cloning: A Laboratory Manual, Second Edition, Volumes 1-3, Cold Spring Harbor Laboratory Press, NY, [1989]; and Ausubel et al. (eds.), Current Protocols in Molecular Biology, Vol. 1-4, John Wiley & Sons. Inc., New York
[1994]; all of which are herein incorporated by reference). Similarly, a "recombinant protein" or "recombinant polypeptide" refers to a protein molecule that is expressed from a recombinant DNA molecule. Use of these terms indicates that the primary amino acid sequence, arrangement of its domains or nucleic acid elements which control its expression are not native, and have been manipulated by molecular biology techniques. As indicated above, techniques to manipulate recombinant proteins are also common and routine in the art.
The terms "exogenous" and "heterologous" are sometimes used interchangeably with "recombinant." An "exogenous nucleic acid," "exogenous gene" and "exogenous protein" indicate a nucleic acid, gene or protein, respectively, that has come from a source other than its native source, and has been artificially supplied to the biological system. In contrast, the terms "endogenous protein," "native protein," "endogenous gene," and "native gene" refer to a protein or gene that is native to the biological system, species or chromosome under study. A "native" or "endogenous" polypeptide does not contain amino acid residues encoded by recombinant vector sequences; that is, the native protein contains only those amino acids found in the polypeptide or protein as it occurs in nature. A "native" polypeptide may be produced by recombinant means or may be isolated from a naturally occurring source.
Similarly, a "native" or "endogenous" gene is a gene that does not contain nucleic acid elements encoded by sources other than the chromosome on which it is normally found in nature.
As used herein, the term "heterologous nucleic acid" or "heterologous polypeptide" refers to a nucleic acid or polypeptide that is not in its natural environment. For example, a heterologous coding region includes a coding region from one species introduced into another species. A heterologous nucleic acid or heterologous polypeptide also includes a nucleic acid or polypeptide native to an organism that has been altered in some way (e.g., mutated, added in multiple copies, linked to non-native regulatory sequences, etc.). Heterologous nucleic acids or heterologous polypeptides are distinguished from endogenous nucleic acids in that the heterologous nucleic acids are typically joined to nucleic acid segments that are not found naturally associated with the nucleic acids in the chromosome or are associated with portions of the chromosome not found in nature (e.g., coding regions expressed in loci where the corresponding gene is not normally expressed).
In addition to containing introns, genomic forms of a gene may also include sequences located on both the 5' and 3' end of the sequences that are present on the RNA transcript. These sequences are referred to as "flanking" sequences or regions (these flanking sequences are located 5' or 3' to the untranslated sequences present on the mRNA transcript). The 5' flanking region may contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene. The 3' flanking region may contain sequences that direct the termination of transcription, post-transcriptional cleavage and polyadenylation.
As used herein, the term "regulatory element" refers to a genetic element that controls some aspect of the expression of nucleic acid sequences. For example, a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region. Other regulatory elements include splicing signals, polyadenylation signals, termination signals, etc.
The terms "in operable combination," "in operable order," "operably linked" and similar phrases when used in reference to nucleic acid segments herein are used to refer to the linkage of nucleic acid segments in such a manner that a nucleic acid molecule capable of directing the transcription of a given coding region and/or the synthesis of a desired protein molecule is produced. The term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
A "promoter" is a control sequence that is a region of a nucleic acid sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase and other transcription factors. The phrases "operably linked," "operatively positioned," "operatively linked," "under control," and "under transcriptional control" mean that a promoter is in a correct functional location and/or orientation in relation to a nucleic acid segment (e.g., a nucleic acid segment encoding an enzyme) to control transcriptional initiation and/or expression of that nucleic acid segment. A promoter may or may not be used in conjunction with an "enhancer," which refers to a cis-acting regulatory sequence involved in the transcriptional activation of a nucleic acid segment.
A promoter may be one naturally associated with a gene or nucleic acid segment, and may be obtained by isolating the 5' non-coding sequences located upstream of the coding segment and/or exon. Such a promoter can be referred to as "endogenous." Similarly, an enhancer may be one naturally associated with a nucleic acid sequence, located either downstream or upstream of that sequence. Alternatively, certain advantages will be gained by positioning the coding nucleic acid segment under the control of a recombinant or heterologous promoter, which refers to a promoter that is not normally associated with a nucleic acid sequence in its natural environment. A recombinant or heterologous enhancer refers also to an enhancer not normally associated with a nucleic acid in its natural environment. Such promoters or enhancers may include promoters or enhancers of other genes, and promoters or enhancers isolated from any other prokaryotic, viral, or eukaryotic cell, and promoters or enhancers not "naturally occurring," e.g., containing different elements of different transcriptional regulatory regions, and/ or mutations that alter expression. In addition to producing nucleic acid sequences of promoters and enhancers synthetically, sequences may be produced using recombinant cloning and/or nucleic acid amplification technology, including PCR, in connection with the compositions disclosed herein (see U.S. Pat. No. 4,683,202, U.S. Pat. No. 5,928,906, each incorporated herein by reference). It is further contemplated that control sequences that direct transcription and/or expression of sequences within non-nuclear organelles such as mitochondria, chloroplasts, and the like, can be employed as well.
A promoter and/or enhancer can be used that effectively directs the expression of a nucleic acid segment (e.g., including a nucleic acid segment encoding an enzyme) in the cell type, organelle, and organism chosen for expression. Those of skill in the art of microbiology and molecular biology generally know the use of promoters, enhancers, and cell type combinations for protein expression, for example, see Sambrook et al. (1989); herein incorporated by reference. The promoters employed may be constitutive, tissue-specific, inducible, and/or useful under the appropriate conditions to direct the desired level of expression of the introduced nucleic acid segment encoding a target protein (e.g., high levels of expression that are advantageous in the large-scale production of recombinant proteins and/or peptides). The promoter may be heterologous or endogenous.
Suitable control elements such as enhancers/promoters, splice junctions,
polyadenylation signals, etc. may be placed in close proximity to the coding region of a coding region to facilitate proper initiation of transcription and/or correct processing of the primary RNA transcript. Alternatively, the coding region utilized in the expression cassettes may contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both endogenous and exogenous control elements.
As used herein, the term "hybridization" is used in reference to the pairing of complementary nucleic acids. Hybridization can be demonstrated using a variety of hybridization assays (Southern blot, Northern Blot, slot blot, phage plaque hybridization, and other techniques). These protocols are available in the art (See e.g., Sambrook et al. (eds.), Molecular Cloning: A Laboratory Manual, Second Edition, Volumes 1-3, Cold Spring Harbor Laboratory Press, NY, [1989]; Ausubel et aL (eds.), Current Protocols in Molecular Biology, Vol. 1-4, John Wiley & Sons, Inc., New York [1994]; all of which are herein incorporated by reference).
Hybridization is the process of one nucleic acid pairing with an antiparallel counterpart that may or may not have 100% complementarity. Two nucleic acids that contain 100% antiparallel complementarity will show strong hybridization. Two antiparallel nucleic acids which contain no antiparallel complementarity (generally considered to be less than 30%) will not hybridize. Two nucleic acids which contain between 31-99% complementarity will show an intermediate level of hybridization. A single molecule that contains pairing of complementary nucleic acids within its structure is said to be "self-hybridized."
During hybridization of two nucleic acids under high stringency conditions, complementary base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences. Thus, conditions of "weak" or "low" stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less. As used herein, two nucleic acids which are able to hybridize under high stringency conditions are considered "substantially homologous." Whether sequences are "substantially homologous" may be verified using hybridization competition assays. For example, a "substantially homologous" nucleotide sequence is one that at least partially inhibits a completely complementary probe sequence from hybridizing to a target nucleic acid under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be verified by the use of a second target that lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target. When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic DNA, the term "substantially homologous" refers to any probe that can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of high stringency.
Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.
As used herein, the term "stringency" is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acids hybridize. "Low or weak stringency" conditions are reaction conditions which favor the complementary base pairing and annealing of two nucleic acids. "High stringency" conditions are those conditions which are less optimal for complementary base pairing and annealing. Several variables affect the strength of hybridization, including the length and nature of the probe and target (DNA, RNA, base composition, present in solution or immobilized, the degree of complementary between the nucleic acids, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids). Conditions may be manipulated to define low or high stringency conditions: factors such as the concentration of salts and other components in the hybridization solution (e.g., the presence or absence of formamide, dextran sulfate, and polyethylene glycol) as well as temperature of the hybridization and/or wash steps. Conditions of "low" or "high" stringency are specific for the particular hybridization technique used.
As used herein the term "stringency" is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid hybridizations are conducted. Those skilled in the art will recognize that "stringency" conditions may be altered by varying the parameters just described either individually or in concert. With "high stringency" conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences (e.g., hybridization under "high stringency" conditions may occur between homologs with about 85-100% identity, preferably about 70-100% identity). With medium stringency conditions, nucleic acid base pairing will occur between nucleic acids with an intermediate frequency of complementary base sequences (e.g. , hybridization under "medium stringency" conditions may occur between homologs with about 50-70% identity). Thus, conditions of "weak" or 'low" stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less. "High stringency conditions" when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 65° C in a solution consisting of 5 and 1.85 g/1 EDTA, pH adjusted to 7.4 with
Figure imgf000044_0002
NaOH), 0.5% sodium dodecyl sulfate (SDS), 5xDenhardt's reagent and 100 Mg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1 xSSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed. However, the washing step is most responsible for the stringency of hybridization. Hence, "high stringency conditions" can be described as washing in 0.1 xSSPE, 1.0% SDS at 42° C.
"Medium stringency conditions" when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 55° C. in a solution consisting of 5
Figure imgf000044_0001
pH adjusted to 7.4 with NaOH), 0.5% SDS, 5xDenhardt's reagent and 100 pg/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.OxSSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed. Because the washing step is most responsible for the stringency of hybridization, "medium stringency conditions" can be described as washing in 1. OxSSPE, 1.0% SDS at 42° C.
"Low stringency conditions" comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of SxSSPE (43.8 g/1 NaCl, 6.9 g/1
NaH2P04H20 and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5xDenhardfs reagent (50xDenhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharamcia), 5 g BSA
(Fraction V; Sigma)) and 100 pg/ml denatured salmon sperm DNA followed by washing in a solution comprising SxSSPE, 0.1% SDS at 42° C when a probe of about 500 nucleotides in length is employed. Because the washing step is most responsible for the stringency of hybridization, "low stringency conditions" can be described as washing in 5xSSPE, 1.0% SDS at 42° C.
As used herein, the term "Tm" is used in reference to the "melting temperature." The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated "denatures") into single strands. An estimate of the Tm value may be calculated by the equation: Tm = 81.5+0.41 (% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl (see e.g., Anderson and Young, Quantitative Filter
Hybridization, in Nucleic Acid Hybridization (1985)). Other references include more sophisticated computations that take structural as well as sequence characteristics into account for the calculation of Tm. As used herein, the terms "antiparallel complementarity" and "complementarity" are synonymous. Complementarity can include the formation of base pairs between any type of nucleotides, including non-natural bases, modified bases, synthetic bases and the like.
The following definitions are the commonly accepted definitions of the terms "sequence identity," "similarity" and "homology." Percent sequence identity is a measure of strict amino acid or nucleotide conservation. Percent similarity is a measure of amino acid conservation which incorporates both strictly conserved amino acids, as well as
"conservative" amino acid substitutions, where one amino acid is substituted for a different amino acid having similar chemical properties (Le. a "conservative" substitution). The term "homology" can pertain to either proteins or nucleic acids. Two proteins can be described as "homologous" or "non-homologous," but the degree of amino acid conservation is quantitated by percent identity and percent similarity. Nucleic acid conservation is measured by the strict conservation of the bases adenine, thymine, guanine and cytosine in the primary nucleotide sequence. When describing nucleic acid conservation, conservation of the nucleic acid primary sequence is sometimes expressed as percent sequence identity or percent homology. In the same nucleic acid, one region may show a high percentage of nucleotide sequence conservation, while a different region can show no or poor conservation. Nucleotide sequence conservation may not necessarily be inferred from an amino acid similarity score. Two proteins may show domains that in one region are homologous, while other regions of the same protein are clearly non-homologous.
The following non-limiting Examples illustrate some aspects of the enzymes, enzymatic reactions, and methods provided herein. Example 1 : cDNA for Galerina marginata FMOl
This Example illustrates isolation of a cDNA for Galerina marginata (CBS 339.88)
FMOl.
Four species of Galerina were obtained from Centraalbureau voor Schimmelcultures (CBS), Utrecht, Netherlands, including Galerina marginata (CBS 339.88), Galerina badipes (CBS 268.50), Galerina venenata (CBS 924.72), and Galerina hybrida (CBS 335.88).
Galerina marginata CBS 339.88 is monokaryotic and was confirmed to make alpha- amanitin. G. venenata is considered synonymous with G. marginata (Gulden et aL, 2001, 2005). The cultures were maintained on potato dextrose agar. For DNA isolation, the isolates were cultured in liquid medium for 15-30 d with rotary shaking at 120 rpm at 23 °C. The medium was HSV-2C, which contains (per liter) 1 g yeast extract, 2 g glucose, 0.1 g NH«CL 0.1 g CaSQ4-5 ¾0, 1 mg thiamine-HCL and 0.1 mg biotin, pH 52 (Muraoka and
Shinozawa, J. BioecL Bioeng. 89, 73-76 (2000)). For induction experiments, the media had the same formulation, except mat the media contained high carbon (HSV-5Q and low carbon (HSV-1Q amounts of glucose (ix.5 g ghicoae and 1 g glucose, respectively) (see, Muraoka and Shinozawa, 2000).
Lyophilized fungal mycelia were ground in liquid nitrogen with a mortar and pestle. High molecular weight DNA was isolated using genomic-tip 100/O (Qiagen, Germantown, MD; catalog #10,243), RNA was extracted with TRIzol (mvitrogen, Carlsbad, CA), and cDNA was synthesized following the manufacturers' protocols.
Transformation of G. marginata was performed as described previously by the inventors (Luo et aL, 2014). For candidate genes, 1.5 to 1.8 kb upstream and downstream sequences were used for homologous recombination. The fragments were cloned into the pHg vector (Kemppainen and Pardo, 2010) and transformed into Agrobacterium tumefaciens strain LBA1100 (Kemppainen et aL, 2005), which were then used m die Glolerina transformation.
Analysis of the genomic location of the Galerina marginata FMOl gene indicated that this gene is tightly clustered in the genome of Galerina between AMAJ, which the inventors have shown encodes the "core" 35-amino acid precursor peptide (Luo et aL, 2012), and FOPB, which the inventors have shown is a unique, dedicated prolyl oligopeptidase that catalyzes macrocyclization (FIG. 1; Luo et aL, 2014). Genes for natural products biosynthetic pathways are typically clustered in fungi. Hence, such clustering of the FMOl gene with known genes in a pathway is evidence that the new gene has a function in die amanitin biosynthetic pathway.
Example 2: GmFMOl catalyzes tryptatMonine biosynthesis
This Example illustrates that the Galerina marginata FMOl enzyme can form a trvpeathionine (sulfide) cross bridge.
One reason to believe that FMOl catalyzes tryptanuonine biosynthesis is based on reactions catalyzed by other FMOs. A bacterial FMO catalyzes Tip hydroxylation at the three (3) position (May ft Perrin, Biopolymers 88:714-724 (2007)). A chemical route to tryptathionine biosynthesis can involve hydroxylation of Tip at the 3 position, which activates the 2-position for spontaneous reaction with Cys. One mechanism of FMOl action is hydroxylation of the Tip at the 3 position (FIG.2A), so that conjugation to Cys occurs spontaneously (FIG. 2B). In the course of the second step, the hydroxyl group leaves. Thus, the inventors hypothesized that FMOl catalyzes "cryptic" (i.e., transient) hydroxylation. To test die theory that FMOl catalyzes tryptathionine biosynthesis, recombinant FMOl was incubated with cyclo(lWGIGCNP, SEQ ID NO:l), NADPH, and FAD, and the reaction products were analyzed by LC/MS.
FMOl was successfully expressed in a wheat germ cell free system (ENDEXT
Technology, see website at fsciences.com/eg/tech.html) and it exhibited detectable activity consistent with tryptathionine biosynthesis (FIG 3A). In particular, a new product was found with a mass two daltons lower than the substrate, cyclo(IWGIGCNP) (FIG. 3B). This is the expected mass for cyclo(IWGIGCNP) (SEQ ID NO: 1) plus the Trp-Cys cross bridge.
Expression in E. coli and yeast is also an option.
FIGs. 3B shows extracted ion counts (EIC) at m/z 839 of (top) product, and the (bottom) substrate. As illustrated by FIG. 3B, when FMOl was incubated with
cyclo(IWGIGCNP), NADPH, FAD, and an NADPH-regenerating system, a new product was observed with a mass 2 Da smaller than the substrate, which is consistent with oxidative coupling of Cys and Trp. In particular, the m/z of cyclo(IWGIGCNP) is 841 (M + IT), and the predicted mass of cyclo(IWGIGCNP) with a sulfide cross-bridge between Trp and Cys is 839 (M + H+). The substrate, cyclo(IWGIGCNP), m/z 841, elutes at 13.0 min. FIG. 3C shows that the compound with m/z 839 is the dominant component eluting at 12.0 min. The peak at m/z 861 in FIG. 3C corresponds to the sodium adduct. The change in the UV absorption spectrum (ratio of 29S to 280 nm) of the new, FMOl -catalyzed product was consistent with the structure being the bicyclic compound. Coupling of Cys to Trp shifts the UV absorption maximum of Trp from 280 nm to 295 nm (Wieland, 1986).
Example 3: Gene knockout in G. marginata
This Example illustrates that targeted gene disruption of GmFMOl abolishes amanitin biosynthesis, demonstrating that FMOl is needed for amanitin biosynthesis.
Deletion of GmFMOl in G. marginata results in total loss of a-amanitin. FIG. 4A illustrates LC-MS analysis of amanitin from wild type G. marginata (solid line) and mutant G. marginata (dashed line). FIG. 4B illustrates Southern analysis blot of G. marginata wild type and G. marginata deletion mutant genomic DNA. The probes employed were for GmFMOl; Hyg stands for hygromycin B resistance (the selection for transformation). The southern blots show that the FMOl gene has been deleted in the mutant. The gene disruption method employed is described in Luo et aL Chemistry and Biology 21 :1610 (2014). The results show that GmFMOl is needed for biosynthesis of amanitin. References
Arnison PG, et al. (2013) Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat Prod Rep 30:108-160. doi: 10.1039/c2np20085f. PMID: 23165928.
Bockus et al. (2013) Form and function in cyclic peptide natural products: a pharmacokinetic perspective. Curr Top Med Chem 13:821.
Bouchard et al. (2014) Antibody-drug conjugates - a new wave of cancer drugs. Bioorg Med Chem Lett 24:5357.
Craik & AllewelL Biol. Chem. 287:26999-27000 (2012)
Craik & Malik, Curr. Opin. Chem Biol. 17:546-554 (2013)
Donia et aL, Nat. Chem. BioL 2:729-735 (2006)
Hallen et al (2007) Gene family encoding the major toxins of lethal Amanita mushrooms. Proc Natl Acad Sci 104:19097.
Liu et al. (2015) TP53 loss creates therapeutic vulnerability in colorectal cancer. Nature 520:697.
Luo et aL Biosynthesis of alpha-amanitin in Galerina marginata. Fung Genet Biol 49:123 (2012).
Luo H, Hong SY, Sgambelluri RM, Angelos E, Li X, Walton JD (2014) Peptide macrocyclization catalyzed by a prolyl oligopeptidase involved in a-amanitin biosynthesis. Chem Biol 21:1610-1617. PMID: 25484237.
May M (2014) Drug development: time for teamwork. Nature 509:S4-S5.
May JP, Perrin DM (2007) Tryptathionine bridges in peptide synthesis. Biopolymers 88:714-724.
May JP, Founder P, Patrick BO, Perrin DM (2008) Synthesis, characterisation, and in vitro evaluation of Pro2-Ile3-S deoxo-amaninamide and Pro2-D-allo-Ile3-S-deoxo- amaninamide. Chemistry 14:3410-3417.
Pulman JA, Childs KL, Sgambelluri RM, Walton JD (2016) Expansion and diversification of the MSDIN family of cyclic peptide genes in the poisonous agarics Amanita phalloides and A. bisporigera. BMC Genomics 17:1038. PMID: 27978833.
Riley, R-> A. et al. (2014). Extensive sampling of basidiomycete genomes demonstrates inadequacy of the white rot/brown rot paradigm for wood decay fungi. Proc Natl Acad Sci USA 111 :9923-9928. PMID: 24958869.Sgambelluri RM, Smith MO, Walton JD (2017) Versatility of prolyl oligopeptidase B in peptide macrocyclization. ACS Synth Biol, doi: 10.1021/acssynbio.7b00264. Smith et al. (2013) Design, synthesis, and diversification of ribosomaliy derived peptide macrocycles. Our Opin Struct Biol 23:571 -580. Umemura et aL, Fung. Genet. Biol. 68:23-30 (2014).
Walton, J.D. (1996) Host-selective toxins: agents of compatibility. Plant Cell 8: 1723- 1733.
Wieland T (1986) Peptides of poisonous Amanita mushrooms. Springer, New York.
All patents and publications referenced or mentioned herein are indicative of the levels of skill of those skilled in the art to which the invention pertains, and each such referenced patent or publication is hereby specifically incorporated by reference to the same extent as if it had been incorporated by reference in its entirety individually or set forth herein in its entirety. Applicants reserve the right to physically incorporate into this specification any and all materials and information from any such cited patents or publications.
The following statements of the invention are intended to describe and summarize various embodiments of the invention according to the foregoing description in the specification.
Statements:
1. An expression system comprising at least one expression cassette comprising a
promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO).
2. The expression system of statement 1 , wherein the flavin mono-oxygenase has at least 90%, or at least 91 %, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
3. The expression system of statement 1 or 2, wherein the flavin mono-oxygenase can catalyze a tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in a peptide substrate.
4. The expression system of statement 1 , 2, or 3, wherein the flavin mono-oxygenase can catalyze a tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in a peptide substrate in the presence of nicotinamide adenine dinucleotide phosphate (NADPH), flavin adenine dinucleotide (FAD), or a combination thereof.
5. The expression system of statement 1-3 or 4, wherein the peptide substrate has about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about 50 amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids. 6. The expression system of statement 1-4 or 5, wherein the peptide substrate has
conventional and/or nonconventional amino acids. 7. The expression system of statement 1 -5 or 6, wherein the peptide substrate has one or more hydroxylated amino acids.
8. The expression system of statement 1-6 or 7, wherein the peptide substrate contains one or more D-amino acid, N-methyl amino acid, beta amino acids, other atypical amino acid (e.g., ornithine, amino-isobutyric acid), or combinations thereof.
9. The expression system of statement 1-7 or 8, wherein the peptide substrate has the following sequence: XWXXXCXP (SEQ ID NO: 16), where X is any amino acid.
10. The expression system of statement 1 -8 or 9, wherein one or more lie, Tip, or Pro amino acids are hydroxylated in the peptide substrate.
11. The expression system of statement 1-9 or 10, further comprising at least one
expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase that has macrocyclase activity.
12. The expression system of statement 11, wherein the prolyl oligopeptidase has at least 90%, or at least 91 %, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15.
13. A host cell comprising an expression system comprising at least one expression
cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono -oxygenase (FMO).
14. The host cell of statement 13, wherein the flavin mono-oxygenase has at least 90%, or at least 91 %, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
15. The host cell of statement 13 or 14, further comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase.
16. The host cell of statement 15, wherein the prolyl oligopeptidase has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO:l 1, 13, 14, or 15.
17. The host cell of statement 13-15 or 16, further comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a peptide substrate.
18. The host cell of statement 17, wherein the peptide substrate has about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about 50 amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids. 19. The host cell of statement 17 or 18, wherein the peptide substrate has the following sequence: XWXXXCXP (SEQ ID NO:16), where X is any amino acid.
20. A method comprising incubating the host cell of statement 13-18 or 19, for a time and under conditions sufficient for generation of a tryptathionine cross-bridge between a tryptophan (Tip) and a cysteine (Cys) in the peptide substrate.
21. The method of statement 20, wherein the conditions sufficient for generation of a tryptathionine cross-bridge comprise culture conditions for maintenance of the host cell.
22. The method of statement 20 or 21, wherein the conditions sufficient for generation of a tryptathionine cross-bridge comprise conditions where the host cell expresses the flavin mono-oxygenase and the peptide substrate.
23. A method comprising contacting a peptide substrate with a flavin mono-oxygenase that has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
24. The method of statement 23, wherein the flavin mono-oxygenase catalyzes a
tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in the peptide substrate.
25. The method of statement 23 or 24, wherein the flavin mono-oxygenase catalyzes a tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in a peptide substrate in the presence of nicotinamide adenine dinucleotide phosphate (NADPH), flavin adenine dinucleotide (FAD), or a combination thereof.
26. The method of statement 23, 24, or 25, wherein the peptide substrate has about 6 to about 100 amino acids, or about 7 to about 75 amino acids, or about 8 to about 50 amino acids, or about 8 to about 35 amino acids, or about 10 to about 30 amino acids.
27. The method of statement 23-25 or 26, wherein the peptide substrate has conventional and/or nonconventional amino acids.
28. The method of statement 23-26 or 27, wherein the peptide substrate is one or more D- amino acid, N-methyl amino acid, beta amino acids atypical amino acid (e.g., ornithine, amino-isobutyric acid), or combinations thereof.
29. The method of statement 23-27 or 28, wherein the peptide substrate has the following sequence: XWXXXCXP (SEQ ID NO: 16), where X is any amino acid.
30. The method of statement 23-28 or 29, wherein the peptide substrate hasone or more hydroxylated amino acids. 31. The method of statement 23-29 or 30, performed in vitro.
32. The method of statement 23-30 or 31, performed in a cell-free reaction mixture.
33. The method of statement 32, wherein the reaction mixture further comprises
nicotinamide adenine dinucleotide phosphate (NADPH), flavin adenine dinucleotide (FAD), or a combination thereof.
34. A flavin mono-oxygenase that has at least 90%, or at least 91%, or at least 93%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7, and with at least one amino acid substitution.
35. The flavin mono-oxygenase of statement 34 with less than 100% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
36. The flavin mono-oxygenase of statement 34 or 35 with less than 99.5% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
37. An expression system comprising at least one expression cassette comprising a
promoter operably linked to a heterologous nucleic acid segment encoding the flavin mono-oxygenase (FMO) of statement 34, 35, or 36.
38. A host cell comprising the expression system of statement 37.
The specific methods and compositions described herein are representative of preferred embodiments and are exemplary and not intended as limitations on the scope of the invention. Other objects, aspects, and embodiments will occur to those skilled in the art upon consideration of this specification, and are encompassed within the spirit of the invention as defined by the scope of the claims. It will be readily apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention. The invention illustratively described herein suitably may be practiced in the absence of any element or elements, or limitation or limitations, which is not specifically disclosed herein as essential.
The methods and processes illustratively described herein suitably may be practiced in differing orders of steps, and the methods and processes are not necessarily restricted to the orders of steps indicated herein or in the claims. As used herein and in the appended claims, the singular forms "a," "an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a compound," "a nucleic acid" or "a promoter" includes a plurality of such compounds, nucleic acids or promoters (for example, a solution of compounds or nucleic acids, or a series of promoters), and so forth. Under no circumstances may the patent be interpreted to be limited to the specific examples or embodiments or methods specifically disclosed herein. Under no circumstances may the patent be interpreted to be limited by any statement made by any Examiner or any other official or employee of the Patent and Trademark Office unless such statement is specifically and without qualification or reservation expressly adopted in a responsive writing by Applicants.
The terms and expressions that have been employed are used as terms of description and not of limitation, and there is no intent in the use of such terms and expressions to exclude any equivalent of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention as claimed. Thus, it will be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the appended claims and statements of the invention.

Claims

What is Claimed:
1. An expression system comprising at least one expression cassette comprising a
promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
2. The expression system of claim 1 , further comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase with at least 95% sequence identity to SEQ ID NO:ll, 13, 14, or 15.
3. The expression system of claim 1 , further comprising at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a peptide substrate.
4. The expression system of claim 1 , wherein the peptide substrate has one or more hydroxylated amino acids.
5. A host cell comprising an expression system comprising at least one expression
cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a flavin mono-oxygenase (FMO) with at least 95% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
6. The host cell of claim 5, where the expression system further comprises at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a peptide substrate.
7. The host cell of claim 5, where the expression system further comprises at least one expression cassette comprising a promoter operably linked to a heterologous nucleic acid segment encoding a prolyl oligopeptidase with at least 95% sequence identity to SEQ ID NO: 11, 13, 14, or 15.
8. A method comprising incubating the host cell of claim 5, for a time and under
conditions sufficient for generation of a tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in the peptide substrate.
9. A method comprising contacting a peptide substrate with a flavin mono-oxygenase that has at least 95% sequence identity to SEQ ID NO: 2, 4, 5, 6, or 7.
10. The method of claim 9, wherein the peptide substrate has one or more hydroxylated amino acids.
11. The method of claim 9, wherein the peptide substrate is one or more D- amino acid, N- methyl amino acid, beta amino acids atypical amino acid, or combinations thereof.
12. The method of claim 9, wherein the flavin mono-oxygenase catalyzes a tryptathionine cross-bridge between a tryptophan (Trp) and a cysteine (Cys) in the peptide substrate.
PCT/US2018/029954 2017-04-28 2018-04-27 Tryptathionine biosynthesis by flavin mono-oxygenase 1 (fmo1) WO2018201050A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762491632P 2017-04-28 2017-04-28
US62/491,632 2017-04-28

Publications (1)

Publication Number Publication Date
WO2018201050A1 true WO2018201050A1 (en) 2018-11-01

Family

ID=63919292

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2018/029954 WO2018201050A1 (en) 2017-04-28 2018-04-27 Tryptathionine biosynthesis by flavin mono-oxygenase 1 (fmo1)

Country Status (1)

Country Link
WO (1) WO2018201050A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023168442A1 (en) * 2022-03-03 2023-09-07 Transfoam Llc D.B.A. Ourobio Methods for the coproduction of polyhydroxyalkanoates and indigoid derivatives from whey protein and lactose

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012068236A2 (en) * 2010-11-16 2012-05-24 Dyadic International (Usa) Inc. Novel fungal oxidoreductases
US20130084601A1 (en) * 2010-06-14 2013-04-04 Amorepacific Corporation Novel soil microorganism, novel oxidoreductase separated from the soil microorganism, gene encoding the oxidoreductase, and method for producing aglycones using the microorganism, the oxidoreductase and the gene
US20150080549A1 (en) * 2011-12-28 2015-03-19 Chugai Seiyaku Kabushiki Kaisha Peptide-compound cyclization method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130084601A1 (en) * 2010-06-14 2013-04-04 Amorepacific Corporation Novel soil microorganism, novel oxidoreductase separated from the soil microorganism, gene encoding the oxidoreductase, and method for producing aglycones using the microorganism, the oxidoreductase and the gene
WO2012068236A2 (en) * 2010-11-16 2012-05-24 Dyadic International (Usa) Inc. Novel fungal oxidoreductases
US20150080549A1 (en) * 2011-12-28 2015-03-19 Chugai Seiyaku Kabushiki Kaisha Peptide-compound cyclization method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
SGAMBELLURI ET AL.: "Biosynthesis of Cyclic Peptide Natural Products in Mushrooms", THESIS, 13 December 2017 (2017-12-13), pages 1 - 162, XP055536172, Retrieved from the Internet <URL:https://d.lib.msu.edu/etd/6906/datastream/OBJ/download/Biosynthesis_of_Cyclic_Peptide_Natural_Products_in_Mushrooms.pdf> [retrieved on 20180712] *
TRUMAN, A: "Cyclisation mechanisms in the biosynthesis of ribosomally synthesised and post-translationally modified peptides", BEILSTEIN JOURNAL OF ORGANIC CHEMISTRY, vol. 12, 20 June 2016 (2016-06-20), pages 1250 - 1268, XP055536169 *
WALTON, J: "Biosynthesis of the Amanita Cyclic Peptide Toxins", THE CYCLIC PEPTIDE TOXINS OF AMANITA AND OTHER POISONOUS MUSHROOMS, 10 May 2018 (2018-05-10), pages 93 - 130 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023168442A1 (en) * 2022-03-03 2023-09-07 Transfoam Llc D.B.A. Ourobio Methods for the coproduction of polyhydroxyalkanoates and indigoid derivatives from whey protein and lactose

Similar Documents

Publication Publication Date Title
US9932571B2 (en) Use of galerina marginata genes and proteins for peptide production
CA2731760C (en) Pyripyropene a biosynthetic gene
AU2012267241B2 (en) Genetic manipulation and expression systems for Pucciniomycotina and Ustilaginomycotina subphyla
CN106661585A (en) Microbial ergothioneine biosynthesis
CN110408635B (en) Application of nucleic acid construct containing streptavidin element in protein expression and purification
Blecher et al. A novel plant peptidyl-prolyl-cis-trans-isomerase (PPIase): cDNA cloning, structural analysis, enzymatic activity and expression
KR102345759B1 (en) Methods for modulating biosynthetic activity in vitro by knock-out of nuclease systems
EP3999640A2 (en) Fungus claviceps purpurea with increased ergot alkaloid production, method of preparation thereof, expression cassette
KR20190117511A (en) Gene related to cyclic peptide compound, method for preparing cyclic peptide compound using same and transformant having same
WO2018201050A1 (en) Tryptathionine biosynthesis by flavin mono-oxygenase 1 (fmo1)
Suvarna et al. Molecular analysis of the LYS2 gene of Candida albicans: homology to peptide antibiotic synthetases and the regulation of the⋅-aminoadipate reductase
Cardoza et al. Characterization and nitrogen-source regulation at the transcriptional level of the gdhA gene of Aspergillus awamori encoding an NADP-dependent glutamate dehydrogenase
CN110938649A (en) Protein synthesis system for improving expression quantity of foreign protein and application method thereof
Galvis et al. Cloning and characterisation of a pea mitochondrial NDPK
Goodwin et al. Evidence for sulfate derepression of an arylsulfatase gene of Colletotrichum gloeosporioides f. sp. malvae during infection of round-leaved mallow, Malva pusilla
US9181542B2 (en) Method for controlling the chaperone activity of peroxiredoxins using irradiation
JP5210530B2 (en) RrhJ1I restriction / modification enzyme and its gene
WO2019216248A1 (en) Peptide macrocyclase
EP1231266B1 (en) Arabidopsis-origin gdp-4-keto-6-deoxy-d-mannose-3,5-epimerase-4-reductase gene
JP2011120508A (en) Method for producing succinic acid
JP3819429B2 (en) Phenylacetyl-CoA ligase derived from Penicillium chrysogenum
CN117802055B (en) Deoxyarteannuin B synthase and application thereof
KR20120127617A (en) Method for producing pyripyropene derivative by enzymatic process
Antelo et al. The fatty acid synthase of the basidiomycete Omphalotus olearius is a single polypeptide
JP5626263B2 (en) RrhJ1I modifying enzyme and gene thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18792046

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18792046

Country of ref document: EP

Kind code of ref document: A1