CN115491361A - 扩环酶及其突变体在生产g-7-adca中的应用 - Google Patents
扩环酶及其突变体在生产g-7-adca中的应用 Download PDFInfo
- Publication number
- CN115491361A CN115491361A CN202110676893.1A CN202110676893A CN115491361A CN 115491361 A CN115491361 A CN 115491361A CN 202110676893 A CN202110676893 A CN 202110676893A CN 115491361 A CN115491361 A CN 115491361A
- Authority
- CN
- China
- Prior art keywords
- amino acid
- ala
- thr
- leu
- asp
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 101710104123 Deacetoxycephalosporin C synthase Proteins 0.000 title claims abstract description 53
- 238000004519 manufacturing process Methods 0.000 title description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 claims abstract description 81
- 229940056360 penicillin g Drugs 0.000 claims abstract description 33
- 230000000694 effects Effects 0.000 claims abstract description 27
- 238000006049 ring expansion reaction Methods 0.000 claims abstract description 4
- 108090000623 proteins and genes Proteins 0.000 claims description 91
- 102000004169 proteins and genes Human genes 0.000 claims description 75
- 235000018102 proteins Nutrition 0.000 claims description 73
- 150000001413 amino acids Chemical group 0.000 claims description 64
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 44
- 229960004441 tyrosine Drugs 0.000 claims description 44
- 235000002374 tyrosine Nutrition 0.000 claims description 44
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 44
- 125000000539 amino acid group Chemical group 0.000 claims description 42
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 40
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 36
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 36
- 229960002433 cysteine Drugs 0.000 claims description 36
- 235000018417 cysteine Nutrition 0.000 claims description 36
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 36
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 32
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 32
- 229960000310 isoleucine Drugs 0.000 claims description 32
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 30
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 29
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 28
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 28
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 28
- 235000014304 histidine Nutrition 0.000 claims description 28
- 229960002885 histidine Drugs 0.000 claims description 28
- 229960004295 valine Drugs 0.000 claims description 28
- 239000004474 valine Substances 0.000 claims description 28
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 24
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 24
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 24
- 238000006243 chemical reaction Methods 0.000 claims description 24
- 229960003136 leucine Drugs 0.000 claims description 24
- 229930182817 methionine Natural products 0.000 claims description 24
- 229960004452 methionine Drugs 0.000 claims description 24
- 238000000034 method Methods 0.000 claims description 24
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 20
- 239000004475 Arginine Substances 0.000 claims description 20
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 20
- 239000004471 Glycine Substances 0.000 claims description 20
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 20
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 20
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 20
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 20
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 20
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 20
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 20
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 20
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 20
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 20
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 20
- 239000004472 Lysine Substances 0.000 claims description 20
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 20
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 20
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 20
- 239000004473 Threonine Substances 0.000 claims description 20
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 20
- 229960003767 alanine Drugs 0.000 claims description 20
- 235000004279 alanine Nutrition 0.000 claims description 20
- 229960003121 arginine Drugs 0.000 claims description 20
- 235000009697 arginine Nutrition 0.000 claims description 20
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 20
- 229960001230 asparagine Drugs 0.000 claims description 20
- 235000009582 asparagine Nutrition 0.000 claims description 20
- 229960005261 aspartic acid Drugs 0.000 claims description 20
- 235000003704 aspartic acid Nutrition 0.000 claims description 20
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 20
- 235000013922 glutamic acid Nutrition 0.000 claims description 20
- 229960002989 glutamic acid Drugs 0.000 claims description 20
- 239000004220 glutamic acid Substances 0.000 claims description 20
- 235000004554 glutamine Nutrition 0.000 claims description 20
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 20
- 229960002743 glutamine Drugs 0.000 claims description 20
- 229960002449 glycine Drugs 0.000 claims description 20
- 235000018977 lysine Nutrition 0.000 claims description 20
- 229960003646 lysine Drugs 0.000 claims description 20
- 235000008729 phenylalanine Nutrition 0.000 claims description 20
- 229960005190 phenylalanine Drugs 0.000 claims description 20
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 20
- 229960002429 proline Drugs 0.000 claims description 20
- 229960001153 serine Drugs 0.000 claims description 20
- 235000004400 serine Nutrition 0.000 claims description 20
- 235000008521 threonine Nutrition 0.000 claims description 20
- 229960002898 threonine Drugs 0.000 claims description 20
- 229960004799 tryptophan Drugs 0.000 claims description 20
- 229930182555 Penicillin Natural products 0.000 claims description 15
- 229940049954 penicillin Drugs 0.000 claims description 15
- 230000014509 gene expression Effects 0.000 claims description 10
- 108020004707 nucleic acids Proteins 0.000 claims description 9
- 102000039446 nucleic acids Human genes 0.000 claims description 9
- 150000007523 nucleic acids Chemical class 0.000 claims description 9
- 238000000855 fermentation Methods 0.000 claims description 8
- 230000004151 fermentation Effects 0.000 claims description 8
- 239000013598 vector Substances 0.000 claims description 8
- 229940024606 amino acid Drugs 0.000 claims description 7
- 235000001014 amino acid Nutrition 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 241000894006 Bacteria Species 0.000 claims description 6
- 238000002360 preparation method Methods 0.000 claims description 4
- 230000009261 transgenic effect Effects 0.000 claims description 4
- 238000010367 cloning Methods 0.000 claims 1
- 244000005700 microbiome Species 0.000 claims 1
- 102000004190 Enzymes Human genes 0.000 abstract description 21
- 108090000790 Enzymes Proteins 0.000 abstract description 21
- NVIAYEIXYQCDAN-CLZZGJSISA-N 7beta-aminodeacetoxycephalosporanic acid Chemical compound S1CC(C)=C(C(O)=O)N2C(=O)[C@@H](N)[C@@H]12 NVIAYEIXYQCDAN-CLZZGJSISA-N 0.000 abstract description 15
- 238000005516 engineering process Methods 0.000 abstract description 9
- 108010073038 Penicillin Amidase Proteins 0.000 abstract description 7
- 230000009471 action Effects 0.000 abstract description 4
- 239000000758 substrate Substances 0.000 abstract description 4
- 238000009412 basement excavation Methods 0.000 abstract description 2
- 125000003275 alpha amino acid group Chemical group 0.000 abstract 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 24
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 19
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 19
- 108010000761 leucylarginine Proteins 0.000 description 19
- 108010061238 threonyl-glycine Proteins 0.000 description 19
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 18
- 108010012581 phenylalanylglutamate Proteins 0.000 description 17
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 14
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 14
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 14
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 14
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 14
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 14
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 14
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 14
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 14
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 14
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 14
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 14
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 14
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 14
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 14
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 14
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 14
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 14
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 14
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 14
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 14
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 14
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 14
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 14
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 14
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 14
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 14
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 14
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 14
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 14
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 14
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 14
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 14
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 14
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 14
- 108010044940 alanylglutamine Proteins 0.000 description 14
- 108010013835 arginine glutamate Proteins 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 108010081551 glycylphenylalanine Proteins 0.000 description 14
- 108010071207 serylmethionine Proteins 0.000 description 14
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 13
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 13
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 13
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 13
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 12
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 12
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 12
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 12
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 11
- 108010049041 glutamylalanine Proteins 0.000 description 11
- KPGXRSRHYNQIFN-UHFFFAOYSA-N 2-oxoglutaric acid Chemical compound OC(=O)CCC(=O)C(O)=O KPGXRSRHYNQIFN-UHFFFAOYSA-N 0.000 description 10
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 238000004321 preservation Methods 0.000 description 10
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 9
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 9
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 9
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 9
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 9
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 9
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 9
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 9
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 9
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 9
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 9
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 9
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 9
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 9
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 9
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 9
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 9
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 9
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 9
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 9
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 9
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 9
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 8
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 8
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 8
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 8
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 8
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 7
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 7
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 7
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 108010054813 diprotin B Proteins 0.000 description 7
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 6
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 6
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 6
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 5
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 5
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 5
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 5
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 5
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 5
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 5
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 5
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 5
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 5
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 5
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 5
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 5
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 5
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 5
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 5
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 5
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 5
- JTMZSIRTZKLBOA-NWLDYVSISA-N Trp-Thr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTMZSIRTZKLBOA-NWLDYVSISA-N 0.000 description 5
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 5
- HWXBTNAVRSUOJR-UHFFFAOYSA-N alpha-hydroxyglutaric acid Natural products OC(=O)C(O)CCC(O)=O HWXBTNAVRSUOJR-UHFFFAOYSA-N 0.000 description 5
- 229940009533 alpha-ketoglutaric acid Drugs 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 108010010147 glycylglutamine Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 4
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 4
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 4
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 4
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 4
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 4
- 241000187433 Streptomyces clavuligerus Species 0.000 description 4
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 4
- 229960005070 ascorbic acid Drugs 0.000 description 4
- 235000010323 ascorbic acid Nutrition 0.000 description 4
- 239000011668 ascorbic acid Substances 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 239000008363 phosphate buffer Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 3
- 229930186147 Cephalosporin Natural products 0.000 description 3
- 108010025076 Holoenzymes Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241000228417 Sarocladium strictum Species 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- 229940124587 cephalosporin Drugs 0.000 description 3
- 150000001780 cephalosporins Chemical class 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- WNLRTRBMVRJNCN-UHFFFAOYSA-N adipic acid Chemical compound OC(=O)CCCCC(O)=O WNLRTRBMVRJNCN-UHFFFAOYSA-N 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 239000004197 calcium inosinate Substances 0.000 description 2
- 239000004195 dipotassium inosinate Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000009790 rate-determining step (RDS) Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- CIPQGGYPCPIDBB-IUODEOHRSA-N (6r,7r)-3-methyl-8-oxo-7-[(2-phenylacetyl)amino]-5-thia-1-azabicyclo[4.2.0]oct-2-ene-2-carboxylic acid Chemical compound N([C@H]1[C@@H]2N(C1=O)C(=C(CS2)C)C(O)=O)C(=O)CC1=CC=CC=C1 CIPQGGYPCPIDBB-IUODEOHRSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- JBMKAUGHUNFTOL-UHFFFAOYSA-N Aldoclor Chemical group C1=C(Cl)C(S(=O)(=O)N)=CC2=C1NC=NS2(=O)=O JBMKAUGHUNFTOL-UHFFFAOYSA-N 0.000 description 1
- 241000187390 Amycolatopsis lactamdurans Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 241000228150 Penicillium chrysogenum Species 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical group C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000001361 adipic acid Substances 0.000 description 1
- 235000011037 adipic acid Nutrition 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002924 anti-infective effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- IYNDLOXRXUOGIU-LQDWTQKMSA-M benzylpenicillin potassium Chemical compound [K+].N([C@H]1[C@H]2SC([C@@H](N2C1=O)C([O-])=O)(C)C)C(=O)CC1=CC=CC=C1 IYNDLOXRXUOGIU-LQDWTQKMSA-M 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 229960004841 cefadroxil Drugs 0.000 description 1
- NBFNMSULHIODTC-CYJZLJNKSA-N cefadroxil monohydrate Chemical compound O.C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CC=C(O)C=C1 NBFNMSULHIODTC-CYJZLJNKSA-N 0.000 description 1
- 229960002588 cefradine Drugs 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229940106164 cephalexin Drugs 0.000 description 1
- ZAIPMKNFIOOWCQ-UEKVPHQBSA-N cephalexin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CC=CC=C1 ZAIPMKNFIOOWCQ-UEKVPHQBSA-N 0.000 description 1
- RDLPVSKMFDYCOR-UEKVPHQBSA-N cephradine Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@@H]3N(C2=O)C(=C(CS3)C)C(O)=O)=CCC=CC1 RDLPVSKMFDYCOR-UEKVPHQBSA-N 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000002742 combinatorial mutagenesis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 208000012839 conversion disease Diseases 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 108010093176 deacetoxycephalosporin C synthetase Proteins 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004134 energy conservation Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- -1 hnRNA Proteins 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 231100000053 low toxicity Toxicity 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000002351 wastewater Substances 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P35/00—Preparation of compounds having a 5-thia-1-azabicyclo [4.2.0] octane ring system, e.g. cephalosporin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/20—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with 2-oxoglutarate as one donor, and the other dehydrogenated (1.14.20)
- C12Y114/20001—Deacetoxycephalosporin-C synthase (1.14.20.1)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明提供一种扩环酶及其突变体在生产G‑7‑ADCA中的应用。其氨基酸序列如SEQ ID No.5‑13之一所示。本发明通过NCBI序列比对、挖掘,以及定向进化等技术产生了一系列的具有扩环活性的酶,这些酶能够将底物青霉素G一步扩环为G‑7‑ADCA;G‑7‑ADCA可以再通过青霉素酰化酶的作用,脱去侧链,转化为7‑ADCA。因此,这些扩环酶在工业上具有重要的应用价值。
Description
技术领域
本发明涉及扩环酶及其突变体在生产G-7-ADCA中的应用,属于生物技术领域。
背景技术
头孢类抗生素是目前临床上广泛应用的一类抗感染药物,具有低毒性、广谱性等优点。7-氨基-3-脱乙酰氧基头孢烷酸(7-ADCA)是重要的头孢类抗生素合成的母核之一,可以用于合成头孢氨苄、头孢拉定及头孢羟氨苄等,市场用量巨大。
目前,工业上生产7-ADCA主要有两种方法:化学酶法和发酵法。化学酶法制备7-ADCA的工艺以青霉素G为原料,采用化学方法进行氧化、扩环、重排后生成苯乙酰-7-氨基去乙酰氧基头孢烯酸(G-7-ADCA),然后在青霉素酰化酶的作用下脱去侧链生成7-ADCA。该工艺比较复杂,成本较高,且产生大量的废水和污染,对环境的破坏很大(Wei CL et al,.Applied and Environmental Mirobiology,2005)。在国家节能减排和环保压力下,该工艺正逐步被淘汰。
工业上生产7-ADCA还可以采用发酵法。1998年,Cho等研究发现了扩环酶(DAOCS)能够对青霉素G具有一定的活性,使从青霉素G生产G-7-ADCA成为可能(Cho et al.,PNAS,1998)。因此,目前一些企业中常用的方法是将扩环酶(DAOCS)基因,克隆至产黄青霉菌中,通过在发酵过程中添加己二酸,直接发酵生产G-7-ADCA,然后利用青霉素酰化酶脱去侧链制备7-ADCA。然而,发酵法存在工艺复杂、发酵周期长等问题,且核心技术主要掌握在国外公司手中,目前国内未见采用发酵法生产7-ADCA的工业方法。
全酶法制备7-ADCA具有绿色环保、成本低等优点。全酶法从廉价的青霉素G钾盐出发,经青霉素扩环酶的催化扩环反应生成G-7-ADCA,再经青霉素酰化酶的作用水解除去酰基侧链,生成7-ADCA(如图1所示)。青霉素酰化酶的研究已经很成熟,且已经在工业上应用了很多年。现在用的较多的是固定化的青霉素酰化酶;而扩环酶的活性较低,在从青霉素G合成7-ADCA的路径中是限速步骤。因此,迫切需要提高扩环酶的活性。
青霉素扩环酶(DAOCS),也称为脱乙酰氧基头孢菌素C合成酶,是一种依赖于Fe2+、O2和α-酮戊二酸的酶,主要催化青霉素的五元噻唑环向六元噻嗪环的转化(扩环),是催化棒状链霉菌和其他原核生物合成头孢类抗生素的关键步骤。因此,DAOCS具有重要的工业应用价值。
但在实际应用中,发现DAOCS对青霉素G的生物活性较低,无法满足工业应用,这成为了使用全酶法合成7-ADCA的限速步骤。因此,迫切需要对扩环酶进行改造,以期能够大幅提高其对青霉素G的转化效率,为工业应用奠定基础。
目前顶头孢霉菌(C.acremonium),棒状链霉菌(S.clavuligerus)和诺卡氏菌(N.lactamdurans)来源的DAOCS已经表达纯化出来,并且来源于棒状链霉菌的scDAOCS已经获得了晶体结构,这使得通过蛋白质工程技术改造scDAOCS,提高其催化活性、拓宽其底物谱成为可能,并且已经取得了一些进展。
发明内容
本发明要解决的技术问题是如何提高扩环酶对青霉素G的扩环催化活性。
为了解决上述技术问题,本发明首先提供了一种蛋白质。
本发明提供的蛋白质如下(a)或(b)或(c)或(d):
(a)其氨基酸序列如SEQ ID No.4所示(命名为H7),或在(a)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(a)衍生的蛋白质;
(b)其氨基酸序列如SEQ ID No.5所示(命名为E727),或在(b)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(b)衍生的蛋白质;
(c)其氨基酸序列如SEQ ID No.6所示(命名为E735),或在(c)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(c)衍生的蛋白质;
(d)其氨基酸序列如SEQ ID No.13(命名为scDAOCS-V303K)所示,或在(d)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(d)衍生的蛋白质。
进一步地,所述蛋白质为如下(1)—(6)中的任一种:
(1)第155位为n1,第184位为n2的SEQ ID NO:5所示的氨基酸序列表示的蛋白质;
(2)第275位为n3,第281位氨基酸残基为n4,第305位为n5的SEQ ID NO:5所示的氨基酸序列表示的蛋白质;
(3)第155位为n1,第184位为n2,第275位为n3,第281位为n4,第305位为n5的SEQID NO:5所示的氨基酸序列表示的蛋白质;
(4)第155位为m1,第184位为m2的SEQ ID NO:6所示的氨基酸序列表示的蛋白质;
(5)第275位为m3,第281位为m4,第305位为m5的SEQ ID NO:6所示的氨基酸序列表示的蛋白质;
(6)第155位为m1,第184位为m2,第275位为m3,第281位为m4,第305位为m5的SEQID NO:6所示的氨基酸序列表示的蛋白质;
具体地,SEQ ID NO:5所示的氨基酸序列中第155位的n1可为:半胱氨酸、酪氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第184位的n2可为:酪氨酸、组氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸或精氨酸,第275位的n3可为:缬氨酸、异亮氨酸、甘氨酸、丙氨酸、亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第281位的n4可为:半胱氨酸、酪氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第305位的n5可为:亮氨酸、甲硫氨酸、甘氨酸、丙氨酸、缬氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸;
SEQ ID NO:6所示的氨基酸序列中第155位的m1可为苯丙氨酸、赖氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、精氨酸或组氨酸,第184位的m2可为天冬氨酸、谷氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、赖氨酸、精氨酸或组氨酸,第275位的m3可谷氨酸、组氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、赖氨酸或精氨酸,第281位的m4可为:酪氨酸、脯氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、色氨酸、丝氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第305位的m5可为异亮氨酸、甲硫氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸。
进一步地,所述蛋白质为如下(1)—(6)中的任一种:
(1)将SEQ ID NO:5所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,且将第184位氨基酸残基由酪氨酸突变为组氨酸得到的蛋白质,命名为E727-M2(C155Y/Y184H),其氨基酸序列如SEQ ID NO:7所示;
(2)将SEQ ID NO:5所示的氨基酸序列的第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由亮氨酸突变为甲硫氨酸得到的蛋白质,命名为E727-M3(V275I/C281Y/L305M),其氨基酸序列如SEQID NO:8所示;
(3)将SEQ ID NO:5所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,将第184位氨基酸残基由酪氨酸突变为组氨酸,将第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由亮氨酸突变为甲硫氨酸得到的蛋白质,命名为E727-M5(C155Y/Y184H/V275I/C281Y/L305M),其氨基酸序列如SEQ ID NO:9所示;
(4)将SEQ ID NO:6所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,且将第184位氨基酸残基由酪氨酸突变为组氨酸得到的蛋白质,命名为E735-M2(C155Y/Y184H),其氨基酸序列如SEQ ID NO:10所示;
(5)将SEQ ID NO:6所示的氨基酸序列的第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由异亮氨酸突变为甲硫氨酸得到的蛋白质,命名为E735-M3(V275I/C281Y/I305M),其氨基酸序列如SEQ ID NO:11所示;
(6)将SEQ ID NO:6所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,将第184位氨基酸残基由酪氨酸突变为组氨酸,将第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由异亮氨酸突变为甲硫氨酸得到的蛋白质,命名为E735-M5(C155Y/Y184H/V275I/C281Y/I305M),其氨基酸序列如SEQ ID No.12所示;
编码上述蛋白质的核酸分子也属于本发明的保护范围。
所述核酸分子可以是DNA,如cDNA、基因组DNA或重组DNA;所述核酸分子也可以是RNA,如mRNA、hnRNA或tRNA等。
所述核酸分子具体为编码上述蛋白质的基因。
含上述核酸分子的重组载体、表达盒、转基因细胞系或重组菌也属于本发明的保护范围。
所述重组载体可为重组表达载体,也可为重组克隆载体。
所述重组表达载体可用现有的表达载体构建。所述表达载体还可包含外源基因的3’端非翻译区域,即包含聚腺苷酸信号和任何其它参与mRNA加工或基因表达的DNA片段。所述聚腺苷酸信号可引导聚腺苷酸加入到mRNA前体的3’端。使用所述基因构建重组表达载体时,在其转录起始核苷酸前可加上任何一种增强型、组成型、组织特异型或诱导型启动子,它们可单独使用或与其它的启动子结合使用;此外,使用本发明的基因构建重组表达载体时,还可使用增强子,包括翻译增强子或转录增强子,这些增强子区域可以是ATG起始密码子或邻接区域起始密码子等,但必需与编码序列的阅读框相同,以保证整个序列的正确翻译。所述翻译控制信号和起始密码子的来源是广泛的,可以是天然的,也可以是合成的。翻译起始区域可以来自转录起始区域或结构基因。
所述表达盒由能够启动所述基因表达的启动子,所述基因,以及转录终止序列组成。
所述蛋白质在作为青霉素扩环酶中的应用也属于本发明的保护范围。
所述蛋白质在作为青霉素扩环酶催化青霉素G扩环生成G-7-ADCA提高青霉素G的转化率中的应用也属于本发明的保护范围。
所述蛋白质或所述核酸分子或所述重组载体、表达盒、转基因细胞系或重组菌在如下任一中的应用也属于本发明的保护范围:
(b1)制备具有青霉素扩环酶活性的产品;
(b2)制备G-7-ADCA。
本发明还保护一种制备G-7-ADCA的方法。
本发明所提供的制备G-7-ADCA的方法,具体可包括步骤:制备所述蛋白质;以所述蛋白质为青霉素扩环酶催化青霉素G扩环生成G-7-ADCA。
本发明还保护一种在青霉素扩环酶催化青霉素G扩环生成G-7-ADCA反应中提高青霉素G的转化率的方法的方法。
本发明所提供的青霉素扩环酶催化青霉素G扩环生成G-7-ADCA反应中提高青霉素G的转化率的方法,其中以所述蛋白质为青霉素扩环酶催化青霉素G扩环生成G-7-ADCA。
本发明通过NCBI序列比对、挖掘,以及定向进化等技术产生了一系列的具有扩环活性的酶,这些酶能够将底物青霉素G一步扩环为G-7-ADCA;G-7-ADCA可以再通过青霉素酰化酶的作用,脱去侧链,转化为7-ADCA。因此,这些扩环酶在工业上具有重要的应用价值。
附图说明
图1为扩环酶催化的7-ADCA反应流程图。
图2为SCHEMA技术用到的序列比对信息。
图3为依据Rossetta Design方法所获Loop环相关突变位点位置示意。
具体实施方式
下述实施例中所使用的实验方法如无特殊说明,均为常规方法。
下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
实施例1:基于SCHEMA技术产生的扩环酶突变体
1.基因获得以及载体构建
利用SCHEMA技术,分析模拟DNA shuffling进行重组。根据计算模拟的结果,筛选出20条序列。经过密码子优化后,利用全基因合成的方式获得全长基因,并将其构建到pET-24a表达载体上。SCHEMA技术用到的三个扩环酶序列(E632,SEQ ID No.1,8MUT SEQ IDNo.2,及E633 SEQ ID No.3)及比对如图2所示。
经SCHEMA技术比对,获得如表1序列,根据所获序列进行基因合成,表达后根据催化效率进行筛选。
2.蛋白的表达以及活性检测
将上述表达载体转化宿主菌BL21(DE3)中,同时设置阴性对照组(转pET-24a的空载体)和阳性对照组(pET-24a-scDAOCS),37℃条件下过夜培养;挑取单菌落,转接到含有100μg/mL卡那霉素的LB培养基中,37℃、220rpm条件下培养8-10h;然后,按照1/100的接种量转接到含有100μg/mL的50mLTB培养基中,在37℃下培养至OD600达到0.6左右时,加入0.1mM IPTG诱导蛋白表达,诱导时间16h,获得表达扩环酶的重组菌体。
3.反应体系
收集菌体。将上述菌体倒入50mL离心管中,8000rpm离心10min,获取菌体;菌体用50mM pH 7.4的磷酸盐缓冲液重悬洗涤一次;再用8000rpm离心10min收集菌体;称重。
反应液配方:50mM pH 7.4的磷酸盐缓冲液,50μg/mL FeSO4,0.4mM抗坏血酸,5mM或10mMα-酮戊二酸,5mM或10mM青霉素G。
反应。按照1g湿菌体加入10mL液比例,在菌体中加入反应液,重悬。置于25℃摇床,220rpm,反应2h。
检测G-7-ADCA的生成量。反应2h后,12000rpm,离心1min,取上清进行HPLC检测。
液相色谱检测条件为:色谱柱:Agilent ZORBAX SB-C18 StableBondAnalytical4.6×250mm;流动相:20mM pH 3.0磷酸钠缓冲液:甲醇=55:45。流速:1mL/min,检测波长215nm。
4.催化青霉素G转化的转化率数据结果
表1利用SCHEMA技术产生的扩环酶突变体转化率
反应条件:
50mM pH 7.4的磷酸盐缓冲液,50μg/mL FeSO4,0.4mM抗坏血酸,10mMα-酮戊二酸,10mM青霉素G,200μL粗酶液。置于25℃摇床,220rpm,反应30min。
5.酶的改造
将活性较高的E727及E735进行双突变、三突变及五突变体的改造,选取高活性突变体H7中的突变位点,移入E727以及E735,获得E727-M2(C155Y/Y184H)、E727-M3(V275I/C281Y/L305M)、E727-M5(C155Y/Y184H/V275I/C281Y/L305M)、E735-M2(C155Y/Y184H)、E735-M3(V275I/C281Y/I305M)、E727-M5(C155Y/Y184H/V275I/C281Y/I305M)突变体。其活性如下:
表2利用SCHEMA程序产生的扩环酶突变体转化率
由表2可知,E727-M2催化青霉素G转化的转化率最高并优于H7。
6.纯酶反应及其HPLC检测
1mL体系进行纯酶活性验证。pH 7.4的PBS缓冲溶液,50μg/mL FeSO4,0.4mM抗坏血酸,5mMα-酮戊二酸,5mM青霉素G,100μL酶液。置于25℃摇床,220rpm,反应30min。HPLC检测检测结果见表3。
表3利用扩环酶突变体催化的酶活
酶活(U)定义为25℃下,在1分钟内能转化1微摩尔底物的酶量;比酶活(U/mg)为每mg蛋白的酶活。
经实验验证,突变体E727-M2的酶活比scH7提高了25%,具有较好的应用前景。
实施例2、基于Rosetta Design方法针对扩环酶H7进行改造所获突变体
1.基因的获得。
所有计算设计都使用美国华盛顿大学David Baker团队开发的Rosetta design软件完成。选择满足所有必需条件的前20个构象进行分析及实验验证。以扩环酶7突变体scDAOCS H7为模板,经过动力学模拟loop环相关突变位点,构建了4个组合突变库,突变库如下表4。突变位点在扩环酶上的位置如图3所示。
表4依据RosettaDesign方法所建立的突变体库
2.突变体库的建立。
用所选残基的相应引物进行位点组合突变,相应的引物如表5。
表5引物序列表
以质粒scDAOCS H7为模板,将表5中引物T72-F1、T72-F2、T72-F3、T72-F4、T73-F1、T73-F2、T73-F3、T73-F4按3:1:1:1:3:1:1:1比例混合,取2μL上述的混合液作为上游引物,取表5中T72-T73-R74-R 2μL作为下游引物,扩增基因片段L1。以质粒scDAOCS H7为模板,将表5中Y94-D95-Y97-F1,Y94-D95-Y97-F2,Y94-D95-Y97-F3按2:1:1比例混合,取2μL作为上游引物,取表4中Y94-D95-Y97-R2μL作为下游引物,扩增基因片段L2。以质粒scDAOCS H7为模板,将表5中L179-F1,L179-F2,L179-F3按2:1:1比例混合,取2μL作为上游引物,取表5中R242-H244-R1,R242-H244-R2按2:1混合,取混合液2μL作为下游引物,扩增基因片段L3。以质粒scDAOCS H7为模板,将表4中N301-F1,N301-F2,Y302-F1,Y302-F2,Y302-F3,V303-F1,V303-F2按2:2:6:6:3:6:2比例混合,取2μL作为上游引物,取表5中N301-Y302-V303-R 2μL作为下游引物,扩增基因片段L4。第一次轮PCR程序:94℃保温2min,(98℃保温15s,55℃保温30s,72℃保温30s)×28个循环,72℃保温5min。
分别取2μL第一轮PCR产物L1,L2,L3,L4为引物,以质粒scDAOCS H7为模板,进行第二PCR,构建构建ML1,ML2,ML3,ML4突变库。第二轮PCR程序:94℃保温2min,(98℃保温15s,60℃保温30s,72℃保温5min)×28个循环,72℃保温10min。
将获得的第二轮PCR产物进行如下操作:
向第二轮PCR的产物中加入1μL的Dpn I酶用于消化质粒模板,37℃恒温条件下处理3h。取2μL酶消化后的第二轮PCR产物电转化进大肠杆菌BL21(DE3)中,将电转后的大肠杆菌BL21(DE3)菌液均匀的涂布在卡那霉素抗性(浓度为50μg/mL)的LB平板上,37℃恒温条件下培养14h后长出单菌落,即突变体库。将一个平板上的所有单菌落刮下送测序验证突变体库的多样性。
3.突变体筛选及反应体系
将突变体库中的单菌落分别接种于96个深孔板中含终浓度50μg/mL卡那霉素的300μL LB培养基中,37℃培养8-12h,同时设置阴性对照组(pET-24a的空载体)和阳性对照组(pET-24a-scDAOCSH7),然后将200μL种子液接种于含终浓度0.1-0.2mM IPTG和50μg/mL卡那霉素的800μLTB培养基中。在25℃下表达16h后,离心收集细胞菌体,加入400μL的50mM,含50μg/mL FeSO4,0.4mM抗坏血酸,5mM或10mMα-酮戊二酸,5mM或10mM青霉素G的pH7.4磷酸盐缓冲液,25℃反应2h。反应2h后,12000rpm,离心1min,取上清进行HPLC检测。液相色谱检测条件见实施例一。获得酶催化反应转化率数据如下:
表6.突变体转化率
以pET24a为阴性对照,scDAOCS-H7、为阳性对照,scDAOCS-V303K突变体进行摇瓶复筛,同时,对该突变体进行测序。
结果见表7,突变体scDAOCS-H7-V303K全细胞和粗酶相较于scDAOCS-H7均有活力提升。
表7.复筛高活性突变体库的转化率
综上所述,本发明构建了扩环酶突变体,相比scDAOCS-H7扩环酶,酶活力得到了较大的提升,转化率提升了25%,具有广阔的应用前景。
序列表
野生型WT-scDAOCS
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTMRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDCEPLLRFRYFPQVPEHRSAEEQPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVGGAFTDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSVPLARECGFDVSLDGETATFQDWIGGNYVNIRRTSKA
SEQ IDNo.1(E632)
MDTTVPTFSLDELQEGLHQDEFRRCLTEKGVFYLTDSGLSEADHKSAKDVAVDFFEHGTEEEKRAVTSPIPTIRRGFSGLESESTAQITNTGTYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVAAPGRDQRAGSSRTSSVFFLRPKSDFSFSVPLARECGFDVSLDGETATFGDWIGGNYVNIRRTSKA
SEQ IDNo.2(8MUT)
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLLMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYVNMRRTSKA
SEQ IDNo.3(E633)
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVVDFFEHGTEEEKRAVTSPVPTIRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVGGAFTDLPYRPDAVLVFCGAIATLVTGGQVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.4(ScDAOCSH7):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYVNMRRTSKA
SEQ ID No.5(E727):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.6(E735):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.7(E727-M2):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.8(E727-M3):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.9(E727-M5):
MDTTVPTFHLAELQEGLHQDEFRSCLMEKGVFYLTGSSLSEADQKSAKDVVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEH
RSAEEEPLRMAPHHDLSIVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPRHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.10(E735-M2):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSVPLAKRCGFDIGLDGDTAAFQDWIAGNYVNLRTKTKA
SEQ ID No.11(E735-M3):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDCEPLLRFRYFPEVPEHRSAEEEPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.12(E735-M5):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYTDYSMCYSMGTSDNLFPTADFERVWTHYFDRMYDASREVARQVLKATGTEPDGGVDAFLDYEPLLRFRYFPEVPEHRSAEEEPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVDGTFVDLPARPDAVLVFCGAVATLVTGGKVKAPKHHVVAPARDRIAGSSRTSSVFFLRPNADFTFSIPLAKRYGFDIGLDGDTAAFQDWIAGNYVNMRTKTKA
SEQ ID No.13(scDAOCS-V303K):
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTTRRGFTGLESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDYEPLLRFRYFPQVPEHRSAEEQPLRMAPHHDLSMVTLIQQTPCANGFVSLQAEVGGAFVDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSIPLAREYGFDVSLDGETATFQDWIGGNYKNMRRTSKA。
SEQUENCE LISTING
<110> 中国科学院天津工业生物技术研究所
<120> 扩环酶及其突变体在生产G-7-ADCA中的应用
<130> GNCAQ211831
<160> 1
<170> PatentIn version 3.5
<210> WT-scDAOCS
<211> 311
<212> PRT
<213> C. acremonium
<400> WT-scDAOCS
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Met Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Thr Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Arg Glu Cys Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Ile Arg Arg Thr Ser Lys Ala
305 310
<210> 1
<211> 311
<212> PRT
<213> Artificial sequence
<400> 1
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Asp Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Thr Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Asp Ser Gly Leu Ser Glu Ala Asp His Lys Ser Ala Lys
35 40 45
Asp Val Ala Val Asp Phe Phe Glu His Gly Thr Glu Glu Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Ile Pro Thr Ile Arg Arg Gly Phe Ser Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Thr Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Gly Arg Asp Gln Arg Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Lys Ser Asp Phe Ser
260 265 270
Phe Ser Val Pro Leu Ala Arg Glu Cys Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gly Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Ile Arg Arg Thr Ser Lys Ala
305 310
<210> 2
<211> 311
<212> PRT
<213> Artificial sequence
<400> 2
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Leu Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
<210> 3
<211> 311
<212> PRT
<213> Artificial sequence
<400> 3
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Val Asp Phe Phe Glu His Gly Thr Glu Glu Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Ile Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Thr Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 4
<211> 311
<212> PRT
<213> Artificial sequence
<400> 4
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Val Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
<210> 5
<211> 311
<212> PRT
<213> Artificial sequence
<400> 5
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 6
<211> 311
<212> PRT
<213> Artificial sequence
<400> 6
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 7
<211> 311
<212> PRT
<213> Artificial sequence
<400> 7
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 8
<211> 311
<212> PRT
<213> Artificial sequence
<400> 8
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 9
<211> 311
<212> PRT
<213> Artificial sequence
<400> 9
Met Asp Thr Thr Val Pro Thr Phe His Leu Ala Glu Leu Gln Glu Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Ser Cys Leu Met Glu Lys Gly Val Phe
20 25 30
Tyr Leu Thr Gly Ser Ser Leu Ser Glu Ala Asp Gln Lys Ser Ala Lys
35 40 45
Asp Val Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Ile Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Arg His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 10
<211> 311
<212> PRT
<213> Artificial sequence
<400> 10
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Val Pro Leu Ala Lys Arg Cys Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Leu Arg Thr Lys Thr Lys Ala
305 310
<210> 11
<211> 311
<212> PRT
<213> Artificial sequence
<400> 11
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Cys Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His Tyr Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 12
<211> 311
<212> PRT
<213> Artificial sequence
<400> 12
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Thr Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ser Asp Asn Leu Phe Pro Thr
100 105 110
Ala Asp Phe Glu Arg Val Trp Thr His Tyr Phe Asp Arg Met Tyr Asp
115 120 125
Ala Ser Arg Glu Val Ala Arg Gln Val Leu Lys Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Asp Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Glu Val Pro Glu His Arg Ser Ala Glu Glu Glu
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Asp Gly Thr Phe Val Asp Leu Pro Ala Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Val Ala Thr Leu Val Thr Gly Gly Lys Val Lys Ala
225 230 235 240
Pro Lys His His Val Val Ala Pro Ala Arg Asp Arg Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Lys Arg Tyr Gly Phe Asp Ile Gly Leu Asp
275 280 285
Gly Asp Thr Ala Ala Phe Gln Asp Trp Ile Ala Gly Asn Tyr Val Asn
290 295 300
Met Arg Thr Lys Thr Lys Ala
305 310
<210> 13
<211> 311
<212> PRT
<213> Artificial sequence
<400> 13
Met Asp Thr Thr Val Pro Thr Phe Ser Leu Ala Glu Leu Gln Gln Gly
1 5 10 15
Leu His Gln Asp Glu Phe Arg Arg Cys Leu Arg Asp Lys Gly Leu Phe
20 25 30
Tyr Leu Thr Asp Cys Gly Leu Thr Asp Thr Glu Leu Lys Ser Ala Lys
35 40 45
Asp Leu Val Ile Asp Phe Phe Glu His Gly Ser Glu Ala Glu Lys Arg
50 55 60
Ala Val Thr Ser Pro Val Pro Thr Thr Arg Arg Gly Phe Thr Gly Leu
65 70 75 80
Glu Ser Glu Ser Thr Ala Gln Ile Thr Asn Thr Gly Ser Tyr Ser Asp
85 90 95
Tyr Ser Met Cys Tyr Ser Met Gly Thr Ala Asp Asn Leu Phe Pro Ser
100 105 110
Gly Asp Phe Glu Arg Ile Trp Thr Gln Tyr Phe Asp Arg Gln Tyr Thr
115 120 125
Ala Ser Arg Ala Val Ala Arg Glu Val Leu Arg Ala Thr Gly Thr Glu
130 135 140
Pro Asp Gly Gly Val Glu Ala Phe Leu Asp Tyr Glu Pro Leu Leu Arg
145 150 155 160
Phe Arg Tyr Phe Pro Gln Val Pro Glu His Arg Ser Ala Glu Glu Gln
165 170 175
Pro Leu Arg Met Ala Pro His His Asp Leu Ser Met Val Thr Leu Ile
180 185 190
Gln Gln Thr Pro Cys Ala Asn Gly Phe Val Ser Leu Gln Ala Glu Val
195 200 205
Gly Gly Ala Phe Val Asp Leu Pro Tyr Arg Pro Asp Ala Val Leu Val
210 215 220
Phe Cys Gly Ala Ile Ala Thr Leu Val Thr Gly Gly Gln Val Lys Ala
225 230 235 240
Pro Arg His His Val Ala Ala Pro Arg Arg Asp Gln Ile Ala Gly Ser
245 250 255
Ser Arg Thr Ser Ser Val Phe Phe Leu Arg Pro Asn Ala Asp Phe Thr
260 265 270
Phe Ser Ile Pro Leu Ala Arg Glu Tyr Gly Phe Asp Val Ser Leu Asp
275 280 285
Gly Glu Thr Ala Thr Phe Gln Asp Trp Ile Gly Gly Asn Tyr Lys Asn
290 295 300
Met Arg Arg Thr Ser Lys Ala
305 310
Claims (10)
1.蛋白质,为如下(b)或(c)或(d):
(b)其氨基酸序列如SEQ ID No.5所示,或在(b)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(b)衍生的蛋白质;
(c)其氨基酸序列如SEQ ID No.6所示,或在(c)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(c)衍生的蛋白质;
(d)其氨基酸序列如SEQ ID No.13所示,或在(d)中的氨基酸序列经过取代、缺失或添加一个或几个氨基酸且具有扩环酶活性的由(d)衍生的蛋白质。
2.根据权利要求1所述的蛋白质,其特征在于:所述蛋白质为如下(1)—(6)中的任一种:
(1)第155位为n1,第184位为n2的SEQ ID NO:5所示的氨基酸序列表示的蛋白质;
(2)第275位为n3,第281位氨基酸残基为n4,第305位为n5的SEQ ID NO:5所示的氨基酸序列表示的蛋白质;
(3)第155位为n1,第184位为n2,第275位为n3,第281位为n4,第305位为n5的SEQ IDNO:5所示的氨基酸序列表示的蛋白质;
(4)第155位为m1,第184位为m2的SEQ ID NO:6所示的氨基酸序列表示的蛋白质;
(5)第275位为m3,第281位为m4,第305位为m5的SEQ ID NO:6所示的氨基酸序列表示的蛋白质;
(6)第155位为m1,第184位为m2,第275位为m3,第281位为m4,第305位为m5的SEQ IDNO:6所示的氨基酸序列表示的蛋白质。
3.根据权利要求2所述的蛋白质,其特征在于:SEQ ID NO:5所示的氨基酸序列中第155位的n1为:半胱氨酸、酪氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第184位的n2为:酪氨酸、组氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸或精氨酸,第275位的n3为:缬氨酸、异亮氨酸、甘氨酸、丙氨酸、亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第281位的n4为:半胱氨酸、酪氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第305位的n5为:亮氨酸、甲硫氨酸、甘氨酸、丙氨酸、缬氨酸、异亮氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸;
SEQ ID NO:6所示的氨基酸序列中第155位的m1为苯丙氨酸、赖氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、精氨酸或组氨酸,第184位的m2为天冬氨酸、谷氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、赖氨酸、精氨酸或组氨酸,第275位的m3谷氨酸、组氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、赖氨酸或精氨酸,第281位的m4为:酪氨酸、脯氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、色氨酸、丝氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸,第305位的m5为异亮氨酸、甲硫氨酸、甘氨酸、丙氨酸、缬氨酸、亮氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸或组氨酸。
4.根据权利要求1-3中任一项所述的蛋白质,其特征在于:所述蛋白质为如下(1)—(6)中的任一种:
(1)将SEQ ID NO:5所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,且将第184位氨基酸残基由酪氨酸突变为组氨酸得到的蛋白质,其氨基酸序列如SEQ IDNO:7所示;
(2)将SEQ ID NO:5所示的氨基酸序列的第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由亮氨酸突变为甲硫氨酸得到的蛋白质,其氨基酸序列如SEQ ID NO:8所示;
(3)将SEQ ID NO:5所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,将第184位氨基酸残基由酪氨酸突变为组氨酸,将第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由亮氨酸突变为甲硫氨酸得到的蛋白质,其氨基酸序列如SEQ ID NO:9所示;
(4)将SEQ ID NO:6所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,且将第184位氨基酸残基由酪氨酸突变为组氨酸得到的蛋白质,其氨基酸序列如SEQ IDNO:10所示;
(5)将SEQ ID NO:6所示的氨基酸序列的第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由异亮氨酸突变为甲硫氨酸得到的蛋白质,其氨基酸序列如SEQ ID NO:11所示;
(6)将SEQ ID NO:6所示的氨基酸序列的第155位氨基酸残基由半胱氨酸突变为酪氨酸,将第184位氨基酸残基由酪氨酸突变为组氨酸,将第275位氨基酸残基由缬氨酸突变为异亮氨酸,将第281位氨基酸残基由半胱氨酸突变为酪氨酸,且将第305位氨基酸残基由异亮氨酸突变为甲硫氨酸得到的蛋白质,其氨基酸序列如SEQ ID No.12所示。
5.编码权利要求1-4中任一项所述蛋白质的核酸分子,所述核酸分子为编码权利要求1-4中任一项所述蛋白质的基因。
6.含有权利要求5所述核酸分子的重组载体、表达盒、转基因细胞系或重组菌。
7.权利要求1-4中任一项所述的蛋白质在作为青霉素扩环酶中的应用。
8.权利要求1-4中任一项所述的蛋白质在作为青霉素扩环酶催化青霉素G扩环生成G-7-ADCA提高青霉素G的转化率中的应用。
9.权利要求1-4中任一项所述的蛋白质或权利要求5所述的核酸分子或权利要求6所述的重组载体、表达盒、转基因细胞系或重组菌在如下任一中的应用:
(b1)制备具有青霉素扩环酶活性的产品;
(b2)制备G-7-ADCA。
10.一种制备G-7-ADCA的方法或一种在青霉素扩环酶催化青霉素G扩环生成G-7-ADCA反应中提高青霉素G的转化率的方法,为下述方法中的一种:
1)制备权利要求1-4中任一项所述的蛋白质;以所述蛋白质为青霉素扩环酶催化青霉素G扩环生成G-7-ADCA;
2)发酵含有编码权利要求权利要求1-4中任一项所述蛋白质基因的微生物,以获得的发酵物催化青霉素G扩环生成G-7-ADCA;
3)将编码权利要求1-4中任一项所述蛋白质的基因克隆到产青霉素G的菌体中得到重组菌,发酵该重组菌催化青霉素G扩环得到G-7-ADCA。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110676893.1A CN115491361A (zh) | 2021-06-18 | 2021-06-18 | 扩环酶及其突变体在生产g-7-adca中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110676893.1A CN115491361A (zh) | 2021-06-18 | 2021-06-18 | 扩环酶及其突变体在生产g-7-adca中的应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115491361A true CN115491361A (zh) | 2022-12-20 |
Family
ID=84465268
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110676893.1A Pending CN115491361A (zh) | 2021-06-18 | 2021-06-18 | 扩环酶及其突变体在生产g-7-adca中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115491361A (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IN188444B (zh) * | 1996-03-29 | 2002-09-28 | Werner Grabher | |
CN1448506A (zh) * | 2002-04-01 | 2003-10-15 | 骏翰生化股份有限公司 | 突变的青霉素扩环酶以及用其制备7-adca的方法 |
US20090087893A1 (en) * | 2004-04-22 | 2009-04-02 | Orchid Chemicals & Pharmaceutical Ltd. | Modified Expandase Enzyme and its use |
CN107304418A (zh) * | 2016-04-18 | 2017-10-31 | 百瑞全球有限公司 | 青霉素扩环酶突变体、编码该突变体的dna、含有该突变体的试剂盒及其应用 |
CN112852913A (zh) * | 2020-04-17 | 2021-05-28 | 中国科学院天津工业生物技术研究所 | 脱乙酰氧头孢菌素C合成酶突变体及其在β-内酰胺抗生素母核合成中的应用 |
-
2021
- 2021-06-18 CN CN202110676893.1A patent/CN115491361A/zh active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IN188444B (zh) * | 1996-03-29 | 2002-09-28 | Werner Grabher | |
CN1448506A (zh) * | 2002-04-01 | 2003-10-15 | 骏翰生化股份有限公司 | 突变的青霉素扩环酶以及用其制备7-adca的方法 |
US20090087893A1 (en) * | 2004-04-22 | 2009-04-02 | Orchid Chemicals & Pharmaceutical Ltd. | Modified Expandase Enzyme and its use |
CN107304418A (zh) * | 2016-04-18 | 2017-10-31 | 百瑞全球有限公司 | 青霉素扩环酶突变体、编码该突变体的dna、含有该突变体的试剂盒及其应用 |
CN112852913A (zh) * | 2020-04-17 | 2021-05-28 | 中国科学院天津工业生物技术研究所 | 脱乙酰氧头孢菌素C合成酶突变体及其在β-内酰胺抗生素母核合成中的应用 |
Non-Patent Citations (2)
Title |
---|
LIN等: "Reconstitution of TCA cycle with DAOCS to engineer Escherichia coli into an efficient whole cell catalyst of penicillin G", PNAS, vol. 112, no. 32, 11 August 2015 (2015-08-11), pages 9855 * |
季俊杰;张红梅;: "定点突变改造棒状链霉菌青霉素扩环酶", 江苏农业科学, no. 08, 25 August 2013 (2013-08-25), pages 53 - 54 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Altenbuchner et al. | Hydantoinases and related enzymes as biocatalysts for the synthesis of unnatural chiral amino acids | |
CN112877307B (zh) | 一种氨基酸脱氢酶突变体及其应用 | |
WO2017143944A1 (zh) | 一种青霉素g酰化酶突变体 | |
Yang et al. | Expression and purification of extracellular penicillin G acylase in Bacillus subtilis | |
CN111778229B (zh) | 环己烯甲酸酯水解酶及其突变体、编码基因、表达载体、重组菌与应用 | |
CN113355299B (zh) | 酮酸还原酶、基因、工程菌及在合成手性芳香2-羟酸中的应用 | |
CN112852913B (zh) | 脱乙酰氧头孢菌素C合成酶突变体及其在β-内酰胺抗生素母核合成中的应用 | |
Forney et al. | Alteration of the catalytic efficiency of penicillin amidase from Escherichia coli | |
CN115491361A (zh) | 扩环酶及其突变体在生产g-7-adca中的应用 | |
Luo et al. | Cloning and co-expression of D-amino acid oxidase and glutaryl-7-aminocephalosporanic acid acylase genes in Escherichia coli | |
CN110699345A (zh) | 一种卤醇脱卤酶突变体及其应用 | |
CN114196659B (zh) | 酰胺酶突变体、编码基因、工程菌及其应用 | |
CN117603930B (zh) | 一种表达突变型西罗血红素合酶的重组菌 | |
CN113755539B (zh) | 一种二氢嘧啶氨基水解酶及其应用 | |
CN115505581B (zh) | 一种芳胺-n乙酰转移酶的突变体及其应用 | |
CN112011523B (zh) | 一种提高乙酰丙酮合成效率的乙酰丙酮裂解酶突变体、其基因、表达载体、细胞及应用 | |
CN112852912B (zh) | 一种合成7-氨基去乙酰氧基头孢烷酸的方法 | |
CN109913431A (zh) | 一种来源于白曲霉的单胺氧化酶用于手性胺中间体的制备 | |
CN115029329B (zh) | 一种羰基还原酶突变体及其在制备r-扁桃酸中的应用 | |
CN117947137A (zh) | 用于合成头孢克洛的青霉素酰胺酶 | |
CN117247928A (zh) | 一种头孢菌素c酰化酶催化的酰化反应及其应用 | |
CN117247925A (zh) | 头孢菌素c酰化酶突变体及其在头孢菌素母核合成中的应用 | |
CN117247927A (zh) | 一种头孢菌素c酰化酶的人工合成及其突变体应用 | |
CN118185913A (zh) | 一种副球菌来源的酰胺酶及其突变体的应用 | |
CN118272331A (zh) | 一种烯还原酶突变体及其在(r)-香茅醛合成中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |