CN114591940A - Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof - Google Patents
- Publication number
- CN114591940A (application number CN202210352448.4A)
- Authority
- CN
- China
- Prior art keywords
- ala
- gly
- leu
- asp
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Concepts (machine-extracted)
- BJHIKXHVCXFQLS-PUFIMZNGSA-N D-psicose Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)C(=O)CO BJHIKXHVCXFQLS-PUFIMZNGSA-N 0.000 title claims abstract description 60
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 title claims abstract description 50
- 239000008103 glucose Substances 0.000 title claims abstract description 47
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 40
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 40
- 238000010276 construction Methods 0.000 title claims abstract description 12
- 238000006243 chemical reaction Methods 0.000 claims abstract description 29
- 108700040099 Xylose isomerases Proteins 0.000 claims abstract description 15
- 239000000758 substrate Substances 0.000 claims abstract description 15
- 108030002106 D-psicose 3-epimerases Proteins 0.000 claims abstract description 14
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 14
- 230000003197 catalytic effect Effects 0.000 claims abstract description 3
- 108090000623 proteins and genes Proteins 0.000 claims description 31
- 238000000034 method Methods 0.000 claims description 16
- 239000013612 plasmid Substances 0.000 claims description 13
- 241000588724 Escherichia coli Species 0.000 claims description 9
- 230000014509 gene expression Effects 0.000 claims description 8
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims description 5
- 238000004255 ion exchange chromatography Methods 0.000 claims description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 claims description 5
- 239000008055 phosphate buffer solution Substances 0.000 claims description 5
- 229910021580 Cobalt(II) chloride Inorganic materials 0.000 claims description 4
- 150000002500 ions Chemical class 0.000 claims description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L magnesium chloride Substances [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 claims description 4
- 229910001629 magnesium chloride Inorganic materials 0.000 claims description 4
- 230000037361 pathway Effects 0.000 claims description 4
- 241001041760 Acidothermus cellulolyticus 11B Species 0.000 claims description 3
- 241000883281 [Clostridium] cellulolyticum H10 Species 0.000 claims description 3
- 239000001963 growth medium Substances 0.000 claims description 3
- 230000000284 resting effect Effects 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 230000004927 fusion Effects 0.000 claims 3
- 230000009466 transformation Effects 0.000 claims 2
- 241000894006 Bacteria Species 0.000 claims 1
- 238000006555 catalytic reaction Methods 0.000 claims 1
- 239000013604 expression vector Substances 0.000 claims 1
- 102000004190 Enzymes Human genes 0.000 abstract description 10
- 108090000790 Enzymes Proteins 0.000 abstract description 10
- 238000003786 synthesis reaction Methods 0.000 abstract description 6
- 230000015572 biosynthetic process Effects 0.000 abstract description 5
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 5
- 238000010364 biochemical engineering Methods 0.000 abstract description 2
- 239000013067 intermediate product Substances 0.000 abstract 3
- 238000009792 diffusion process Methods 0.000 abstract 2
- 108010075254 C-Peptide Proteins 0.000 abstract 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 38
- 241000269331 Ambystoma Species 0.000 description 32
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 20
- 108010005233 alanylglutamic acid Proteins 0.000 description 20
- RFSUNEUAIZKAJO-VRPWFDPXSA-N D-Fructose Natural products OC[C@H]1OC(O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-VRPWFDPXSA-N 0.000 description 19
- 241000205573 Jeffersonia Species 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 16
- 239000004816 latex Substances 0.000 description 16
- 229920000126 latex Polymers 0.000 description 16
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 14
- 108010047857 aspartylglycine Proteins 0.000 description 14
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 14
- 108010025306 histidylleucine Proteins 0.000 description 13
- 108010018006 histidylserine Proteins 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 11
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 11
- 108010049041 glutamylalanine Proteins 0.000 description 11
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 10
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 9
- 108010011559 alanylphenylalanine Proteins 0.000 description 9
- 108010037850 glycylvaline Proteins 0.000 description 9
- 108010092114 histidylphenylalanine Proteins 0.000 description 9
- 241000880493 Leptailurus serval Species 0.000 description 8
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 7
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 7
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 7
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 7
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 7
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 7
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 7
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 7
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 7
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 7
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 7
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 7
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 7
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 7
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 7
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 7
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 7
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 7
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 7
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 7
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 7
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 7
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 7
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 7
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 7
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 7
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 7
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 7
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 7
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 7
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 7
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 7
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 7
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 7
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 7
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 7
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 7
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 7
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 7
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 7
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 7
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 7
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 7
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 7
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 7
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 7
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 7
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 7
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 7
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 7
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 7
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 7
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 7
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 7
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 7
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 7
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 7
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 7
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 7
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 7
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 7
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 7
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 7
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 7
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 7
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 7
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 7
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 7
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 7
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 7
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 7
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 7
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 7
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 7
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 7
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 7
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 7
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 7
- IDCKUIWEIZYVSO-WFBYXXMGSA-N Ser-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C)C(O)=O)=CNC2=C1 IDCKUIWEIZYVSO-WFBYXXMGSA-N 0.000 description 7
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 7
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 7
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 7
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 7
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 7
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 7
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 7
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 7
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 7
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 7
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 7
- XLVRTKPAIXJYOH-HOCLYGCPSA-N Trp-His-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)NCC(=O)O)N XLVRTKPAIXJYOH-HOCLYGCPSA-N 0.000 description 7
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 7
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 7
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 7
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 7
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 7
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 7
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 7
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 7
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 7
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 7
- 108010062796 arginyllysine Proteins 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 7
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 7
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 7
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 108010036413 histidylglycine Proteins 0.000 description 7
- 108010028295 histidylhistidine Proteins 0.000 description 7
- 108010085325 histidylproline Proteins 0.000 description 7
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 7
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 7
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 7
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 7
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 6
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 6
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 6
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 238000001502 gel electrophoresis Methods 0.000 description 5
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 5
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 5
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 4
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 4
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 4
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 4
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 4
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 4
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 4
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 4
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 4
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 4
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 4
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 4
- 241000672609 Escherichia coli BL21 Species 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 4
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 4
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 4
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 4
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 4
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 4
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 4
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 4
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 4
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 4
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 4
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 4
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 4
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 4
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 4
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 4
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 4
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 4
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 4
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 4
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 4
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 4
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 4
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 4
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 4
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 4
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 4
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 4
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 4
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 4
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 3
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 3
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 3
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 3
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 3
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 3
- QDFBJJABJKOLTD-FXQIFTODSA-N Cys-Asn-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QDFBJJABJKOLTD-FXQIFTODSA-N 0.000 description 3
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 3
- 229930091371 Fructose Natural products 0.000 description 3
- 239000005715 Fructose Substances 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 3
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 3
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 3
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 3
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 3
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 3
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 3
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 3
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 3
- 108090000769 Isomerases Proteins 0.000 description 3
- 102000004195 Isomerases Human genes 0.000 description 3
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 3
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 3
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 3
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 241001052560 Thallis Species 0.000 description 3
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 3
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 3
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 3
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 3
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 3
- LYPKCSYAKLTBHJ-ILWGZMRPSA-N Tyr-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N)C(=O)O LYPKCSYAKLTBHJ-ILWGZMRPSA-N 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 3
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 3
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 3
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 230000036983 biotransformation Effects 0.000 description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 2
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 2
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 2
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 2
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 2
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 2
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 2
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 2
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 2
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 2
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 2
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 2
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 2
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 2
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 2
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- YREOLPGEVLLKMB-UHFFFAOYSA-N 3-methylpyridin-1-ium-2-amine bromide hydrate Chemical compound O.[Br-].Cc1ccc[nH+]c1N YREOLPGEVLLKMB-UHFFFAOYSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- LKDRXBCSQODPBY-JDJSBBGDSA-N D-allulose Chemical compound OCC1(O)OC[C@@H](O)[C@@H](O)[C@H]1O LKDRXBCSQODPBY-JDJSBBGDSA-N 0.000 description 1
- LKDRXBCSQODPBY-VRPWFDPXSA-N D-fructopyranose Chemical compound OCC1(O)OC[C@@H](O)[C@@H](O)[C@@H]1O LKDRXBCSQODPBY-VRPWFDPXSA-N 0.000 description 1
- BJHIKXHVCXFQLS-UYFOZJQFSA-N D-fructose group Chemical group OCC(=O)[C@@H](O)[C@H](O)[C@H](O)CO BJHIKXHVCXFQLS-UYFOZJQFSA-N 0.000 description 1
- 125000002353 D-glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- WHUUTDBJXJRKMK-GSVOUGTGSA-N D-glutamic acid Chemical compound OC(=O)[C@H](N)CCC(O)=O WHUUTDBJXJRKMK-GSVOUGTGSA-N 0.000 description 1
- 208000004930 Fatty Liver Diseases 0.000 description 1
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- 206010019708 Hepatic steatosis Diseases 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- 108010009384 L-Iditol 2-Dehydrogenase Proteins 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- PBXYXOAEQQUVMM-ULQDDVLXSA-N Phe-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PBXYXOAEQQUVMM-ULQDDVLXSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- 108090001066 Racemases and epimerases Proteins 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- 102100026974 Sorbitol dehydrogenase Human genes 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000003912 environmental pollution Methods 0.000 description 1
- 238000006345 epimerization reaction Methods 0.000 description 1
- 208000010706 fatty liver disease Diseases 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 150000002574 ketohexoses Chemical class 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 235000013615 non-nutritive sweetener Nutrition 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 231100000240 steatosis hepatitis Toxicity 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
- C12N9/92—Glucose isomerase (5.3.1.5; 5.3.1.9; 5.3.1.18)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/02—Monosaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/24—Preparation of compounds containing saccharide radicals produced by the action of an isomerase, e.g. fructose
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y501/00—Racemaces and epimerases (5.1)
- C12Y501/03—Racemaces and epimerases (5.1) acting on carbohydrates and derivatives (5.1.3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y503/00—Intramolecular oxidoreductases (5.3)
- C12Y503/01—Intramolecular oxidoreductases (5.3) interconverting aldoses and ketoses (5.3.1)
- C12Y503/01018—Glucose isomerase (5.3.1.18)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention provides a fusion protein for catalyzing glucose to synthesize D-psicose and a construction method thereof, relating to the field of biochemical engineering. In order to avoid the problems of low conversion efficiency and the like caused by diffusion of intermediate products in a double-enzyme catalytic system, the invention constructs fusion proteins connected by connecting peptides with different lengths and improves the conversion efficiency by constructing a substrate channel. Specifically, the glucose isomerase gene and D-psicose 3-epimerase were linked using flexible linker peptides and rigid linker peptides of different lengths, and the results showed that the efficiency of D-psicose synthesis increased with the increase in the length of the linker peptide, and that the rigid linker peptide was superior to the flexible linker peptide. The fusion protein provided by the invention transfers the intermediate product to the next enzyme through the substrate channel, thereby reducing the diffusion of the intermediate product and improving the synthesis efficiency of D-psicose.
Description
Technical Field
The invention belongs to the field of biochemical engineering, and particularly relates to a fusion protein for catalyzing glucose to synthesize D-psicose and a construction method thereof.
Background
D-psicose (also called D-allulose) is a six-carbon sugar that occurs in nature only in very small amounts and is the C-3 epimer of D-fructose. It is poorly digested and absorbed and provides little energy for life activities, which makes it a useful low-calorie sweetener. In the field of medical health, D-psicose can inhibit hepatic lipogenic enzymes and intestinal alpha-glycosidase, thereby reducing fat accumulation in vivo and suppressing increases in blood glucose concentration. In food applications, D-psicose offers high sweetness, good solubility, low calorie content and a low glycemic response, and is considered one of the most ideal sucrose substitutes. D-psicose has been recognized as safe by the U.S. Food and Drug Administration (FDA). Therefore, D-psicose can be applied as a safe additive in the fields of medical care and food processing.
D-psicose was initially synthesized chemically. As research progressed, the chemical route proved to have drawbacks such as high cost, difficult processing, complex purification and a tendency to cause environmental pollution, and it has gradually been replaced by biotransformation. The biotransformation of D-psicose was first proposed by Professor Izumori of Kagawa University, Japan, whose strategy interconverts rare ketohexoses using hexitols as intermediates and involves epimerases, polyol dehydrogenases and aldose isomerases. Among these enzymes, D-psicose 3-epimerase catalyzes the interconversion of D-fructose and D-psicose, and D-fructose can in turn be produced from glucose by the isomerization reaction catalyzed by glucose isomerase. Producing D-psicose from glucose in one step with a two-enzyme system of glucose isomerase and D-psicose 3-epimerase has the additional advantage that the proximity of the two enzymes creates a substrate-rich microenvironment for the second enzyme, greatly reducing the time the substrate needs to diffuse to the second enzyme. Therefore, the co-expression of glucose isomerase and D-psicose 3-epimerase provides an excellent opportunity for innovation in the one-step production of D-psicose.
Disclosure of Invention
In view of the above, the present invention aims to provide a method for synthesizing D-psicose from glucose by using a fusion protein as a catalyst, thereby realizing direct synthesis of D-psicose from glucose which is an inexpensive substrate.
In order to achieve the above object, the present invention provides the following technical solutions:
The invention provides a fusion protein for catalyzing glucose to synthesize D-psicose and a construction method thereof, the fusion protein being expressed in Escherichia coli.
Optionally, the Escherichia coli is Escherichia coli BL21(DE3).
Optionally, the construction method of the fusion protein is as follows: the glucose isomerase-encoding gene derived from Acidothermus cellulolyticus 11B (AcceGI, SEQ ID NO:15) and the D-psicose 3-epimerase-encoding gene derived from Clostridium cellulolyticum H10 (CcDPEase, SEQ ID NO:16) were cloned between the BamHI and HindIII sites of the vector pET28a (PB) N or into the BamHI site of the vector pET28a (PB) N-CcDPEase, respectively (see Table 1 for primers), using a one-step cloning kit (Nanjing Vazyme Biotech Co., Ltd.) to obtain the recombinant plasmids pET28a (PB) N-GP, pET28a (PB) N-GS1P, pET28a (PB) N-GS2P, pET28a (PB) N-GS3P, pET28a (PB) N-GE1P, pET28a (PB) N-GE2P and pET28a (PB) N-GE3P (plasmid maps are shown in FIG. 3). The expression of all pathway genes is controlled by a T7 promoter and a T7 terminator. The recombinant plasmids are transformed into Escherichia coli BL21(DE3) to obtain the recombinant Escherichia coli strains BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P. The recombinant strains can be used for catalyzing the direct synthesis of D-psicose from glucose.
Optionally, the conditions for converting glucose into D-psicose by using the recombinant strains BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P are as follows: the recombinant E. coli is cultured to an OD600 of 0.6-0.8, IPTG is added to a final concentration of 0.5mM, and expression is induced at 25 ℃ and 200rpm for 8 hours. The cells are collected by centrifugation at 4 ℃ and 8000rpm, washed with 20mM phosphate buffer solution (pH 4.5-8.5) to remove the culture medium, and then resuspended in phosphate buffer solution of the corresponding pH to obtain resting cells, which can be used as a biocatalyst for the subsequent reaction.
Optionally, the concentration of the substrate glucose is 100 g/L.
Optionally, the catalytic ions are supplied as CoCl2 at a concentration of 1mM and MgCl2 at a concentration of 5mM.
Optionally, the conversion condition is reaction for 2 hours at 45-85 ℃.
Compared with the prior art, the invention has the following beneficial effects:
(1) The method uses the cheap raw material glucose as a substrate, which greatly reduces the production cost of D-psicose and makes the method suitable for large-scale production.
(2) The product D-psicose is easy to separate from fructose and glucose and to purify by an ion exchange resin method; the process is safe and environment-friendly and is unlikely to cause pollution.
(3) The co-expression of the two enzymes in the fusion protein creates a substrate-rich microenvironment for the second enzyme.
(4) The proximity of the two enzymes in the fusion protein reduces the time for the substrate to diffuse to the second enzyme, greatly increasing the rate of reaction.
Drawings
FIG. 1. reaction process for converting glucose into D-psicose. The reaction is to convert glucose into D-fructose by using glucose isomerase (AcceGI), and then convert the generated D-fructose into D-psicose by using D-psicose 3-epimerase (CcDPEase).
FIG. 2 functional verification of glucose isomerase and D-psicose 3-epimerase. (A) The function of glucose isomerase AcceGI was verified by using glucose as a substrate. (B) The function of D-psicose 3-epimerase CcDPEase was verified by using fructose as a substrate. Wherein BL21/AcceGI is Escherichia coli expressing glucose isomerase AcceGI; BL21/CcDPEase is Escherichia coli expressing D-psicose 3-epimerase CcDPEase; BL21/pET28a (PB) N is E. coli containing the empty plasmid pET28a (PB) N as a blank control. D-Glu is D-glucose, D-Fru is D-fructose, and D-Psi is D-psicose.
FIG. 3 plasmid maps of the fusion proteins. (A) pET28a (PB) N-GP, (B) pET28a (PB) N-GS1P, (C) pET28a (PB) N-GS2P, (D) pET28a (PB) N-GS3P, (E) pET28a (PB) N-GE1P, (F) pET28a (PB) N-GE2P, (G) pET28a (PB) N-GE3P. Each recombinant plasmid contains a glucose isomerase-encoding gene (AcceGI), a D-psicose 3-epimerase-encoding gene (CcDPEase) and a linker peptide; the genes are connected in series in a monocistronic mode to form a fusion protein, and their expression is controlled by a T7 promoter and a T7 terminator.
FIG. 4 protein expression of fusion proteins. Strain BL21/pET28a (PB) N containing empty plasmid pET28a (PB) N was a blank control.
FIG. 5 comparison of the activities of seven fusion proteins. (A) Comparison of conversion rates of seven fusion proteins to glucose, and (B) comparison of conversion rates of seven fusion proteins to fructose.
FIG. 6 optimal temperature and pH for the fusion protein to catalyze the production of D-psicose from glucose. (A) Temperature optimization of glucose to D-psicose conversion, (B) pH optimization of glucose to D-psicose conversion.
Detailed description of the invention
The invention provides a fusion protein for catalyzing glucose to synthesize D-psicose and a construction method thereof.
The following examples are provided to illustrate the method for producing D-psicose directly from glucose by using the fusion protein and the construction method of the fusion protein, but they should not be construed as limiting the scope of the present invention.
Detailed Description
Example 1: functional verification of glucose isomerase and D-psicose isomerase
E. coli BL21/AcceGI and BL21/CcDPEase, containing the glucose isomerase-encoding gene (AcceGI, SEQ ID NO:15) and the D-psicose 3-epimerase-encoding gene (CcDPEase, SEQ ID NO:16) respectively, were cultured overnight in LB medium at 37 ℃ and 200rpm, transferred at an inoculum size of 4% (v/v) to a 250mL Erlenmeyer flask containing 25mL of fresh LB medium, and cultured at 37 ℃ and 200rpm to logarithmic phase. IPTG was then added to a final concentration of 500 μM, and the cells were cultured at 25 ℃ and 200rpm for a further 8 hours to express the glucose isomerase gene and the D-psicose 3-epimerase gene. Glucose and D-fructose were then added, respectively, to verify the functions of glucose isomerase and D-psicose 3-epimerase. E. coli BL21/pET28a (PB) N, carrying the empty plasmid pET28a (PB) N, was used as a blank control under otherwise identical operating conditions.
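The culture volumes implied by the procedure above can be worked out with a short calculation. The following is a minimal sketch only: the function names are hypothetical, and the 1 M IPTG stock concentration is an assumption for illustration (the text states only the final concentration of 500 μM). The small volume added by the inducer is ignored.

```python
def inoculum_volume_ml(medium_ml: float, inoculum_fraction: float) -> float:
    """Volume of overnight culture needed for a given inoculum size (v/v)."""
    return medium_ml * inoculum_fraction


def inducer_volume_ul(culture_ml: float, final_um: float, stock_mm: float) -> float:
    """Volume of inducer stock (in microlitres) from C1*V1 = C2*V2.

    The dilution caused by the added stock itself is neglected.
    """
    final_mm = final_um / 1000.0          # micromolar -> millimolar
    culture_ul = culture_ml * 1000.0      # millilitres -> microlitres
    return culture_ul * final_mm / stock_mm


# Values from the text: 25 mL fresh LB, 4% (v/v) inoculum, 500 uM IPTG.
# ASSUMPTION: a 1 M (1000 mM) IPTG stock; the source does not state the stock.
seed_ml = inoculum_volume_ml(25.0, 0.04)
iptg_ul = inducer_volume_ul(25.0, 500.0, 1000.0)
print(f"overnight culture: {seed_ml:.2f} mL, IPTG stock: {iptg_ul:.1f} uL")
```

With a different stock concentration only the last argument changes; the inoculum volume depends solely on the values given in the text.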
To the system containing BL21/AcceGI were added glucose at a final concentration of 5g/L, 1mM CoCl2 and 5mM MgCl2, and the mixture was reacted at 80 ℃ for 1h; to the system containing BL21/CcDPEase were added D-fructose at a final concentration of 5g/L, 1mM CoCl2 and 5mM MgCl2, and the mixture was reacted at 60 ℃ for 1h. The reaction supernatant was analyzed by ion chromatography.
D-fructose and D-psicose were analyzed by ion chromatography as follows: the instrument was a Thermo Fisher ICS-6000 high-pressure ion chromatograph with an electrochemical detector and a PA-20 column; mobile phase A was ultrapure water and mobile phase B was 200mM aqueous NaOH; the gradient elution program (B%) was 0min 10%, 15min 10%, 15.1min 100%, 25min 100%, 25.1min 10%, 35min 10%; the flow rate was constant at 0.5mL/min, the injection volume was 25 μL, and the column oven and the detector were both held at 30 ℃.
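The gradient program described above can be written down as a table of time points and used to look up the eluent composition at any moment of the run. This is an illustrative sketch, not the instrument method itself: the names are hypothetical, and linear interpolation between the listed time points is an assumption, since the text gives only the set points.

```python
# (time in min, % mobile phase B) set points taken from the text.
GRADIENT = [
    (0.0, 10.0),
    (15.0, 10.0),
    (15.1, 100.0),
    (25.0, 100.0),
    (25.1, 10.0),
    (35.0, 10.0),
]

NAOH_STOCK_MM = 200.0  # mobile phase B is 200 mM NaOH; phase A is water


def percent_b(t_min: float) -> float:
    """Percent B at time t, ASSUMING linear ramps between set points."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time is outside the 0-35 min program")
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise AssertionError("unreachable: set points cover the whole program")


def naoh_mm(t_min: float) -> float:
    """Effective NaOH concentration delivered to the column at time t."""
    return NAOH_STOCK_MM * percent_b(t_min) / 100.0


for t in (5.0, 20.0, 30.0):
    print(f"{t:5.1f} min: {percent_b(t):5.1f}% B, {naoh_mm(t):6.1f} mM NaOH")
```

Under this reading, the first 15 minutes run isocratically at 10% B (20 mM NaOH), followed by a 100% B wash and a return to 10% B for re-equilibration.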
The ion chromatography results show that the retention times of D-fructose and D-psicose are 7.863min and 9.475min, respectively.
The blank (E.coli BL21/pET28a (PB) N) synthesized neither D-fructose nor D-psicose; the experimental groups (E.coli BL21/AcceGI and BL21/CcDPEase) synthesized D-fructose and D-psicose, respectively (FIG. 2). The results show that glucose isomerase can convert D-glucose into D-fructose; d-psicose 3-epimerase can convert D-fructose into D-psicose.
Example 2: construction and functional analysis of fusion proteins
The glucose isomerase-encoding gene derived from Acidothermus cellulolyticus 11B (AcceGI, SEQ ID NO:15) and the D-psicose 3-epimerase-encoding gene derived from Clostridium cellulolyticum H10 (CcDPEase, SEQ ID NO:16) were cloned between the BamHI and HindIII sites of the vector pET28a (PB) N or into the BamHI site of the vector pET28a (PB) N-CcDPEase, respectively (see Table 1 for primers), using a one-step cloning kit (Nanjing Vazyme Biotech Co., Ltd.) to obtain the recombinant plasmids pET28a (PB) N-GP, pET28a (PB) N-GS1P, pET28a (PB) N-GS2P, pET28a (PB) N-GS3P, pET28a (PB) N-GE1P, pET28a (PB) N-GE2P and pET28a (PB) N-GE3P (plasmid maps are shown in FIG. 3; the linker peptides are listed in Table 2). The expression of all pathway genes is controlled by a T7 promoter and a T7 terminator. The recombinant plasmids were transformed into Escherichia coli BL21(DE3) to obtain the recombinant Escherichia coli strains BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P. The recombinant strains can be used for catalyzing the synthesis of D-psicose from glucose.
TABLE 1 primers used in the present invention
TABLE 2 linker peptides used in the construction of fusion proteins of the invention
Recombinant Escherichia coli BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P are cultured overnight in LB medium at 37 ℃ and 200rpm, transferred to a 250mL triangular flask containing 25mL of fresh LB medium with an inoculum size of 4% (v/v), and cultured at 37 ℃ and 200rpm to logarithmic phase. Then, IPTG was added to the resulting mixture to a final concentration of 500. mu.M, and the mixture was further cultured at 25 ℃ and 200rpm for 8 hours to express glucose isomerase and D-psicose 3-epimerase genes in the fusion protein. Protein expression was examined by protein gel electrophoresis (SDS-PAGE).
Protein expression was detected by protein gel electrophoresis as follows: a Thermo Fisher polyacrylamide gel electrophoresis apparatus was used, and the gel was run at 150V for 30 min.
Protein gel electrophoresis analysis results show that the molecular weights corresponding to the fusion proteins GP, GS1P, GS2P, GS3P, GE1P, GE2P and GE3P are 81.475kDa, 81.79kDa, 82.105kDa, 82.420kDa, 81.945kDa, 82.416kDa and 82.886kDa respectively.
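The reported molecular weights can be compared pairwise to see how much mass each additional linker unit contributes. The sketch below uses only the values listed above; the dictionary and function names are hypothetical, and the interpretation that GS1P to GS3P and GE1P to GE3P carry one, two and three copies of a linker unit is an assumption based on the construct names, since Table 2 is not reproduced here.

```python
# Molecular weights (kDa) as reported in the text.
REPORTED_KDA = {
    "GP": 81.475,
    "GS1P": 81.79,
    "GS2P": 82.105,
    "GS3P": 82.420,
    "GE1P": 81.945,
    "GE2P": 82.416,
    "GE3P": 82.886,
}


def stepwise_increments(series):
    """Mass difference (kDa) between consecutive constructs in a series."""
    return [
        round(REPORTED_KDA[b] - REPORTED_KDA[a], 3)
        for a, b in zip(series, series[1:])
    ]


# ASSUMPTION: each series adds one more linker unit per step.
flexible_steps = stepwise_increments(["GP", "GS1P", "GS2P", "GS3P"])
rigid_steps = stepwise_increments(["GP", "GE1P", "GE2P", "GE3P"])

print("flexible series increments (kDa):", flexible_steps)
print("rigid series increments (kDa):   ", rigid_steps)
```

A near-constant increment within each series is what one would expect if every step adds another copy of the same linker unit, with the rigid unit being heavier than the flexible one.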
The blank control (E.coli BL21/pET28a (PB) N) had no obvious protein band; the experimental group (Escherichia coli BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P) has obvious protein bands (FIG. 4). This result indicates that the genes of the fusion proteins are all expressed.
The induced system was divided into two groups. To one group were added glucose at a final concentration of 50g/L, 1mM Co2+ and 5mM Mg2+, and the mixture was reacted at 80 ℃ for 1h; to the other group were added D-fructose at a final concentration of 50g/L, 1mM Co2+ and 5mM Mg2+, and the mixture was reacted at 60 ℃ for 1h. The conversion rates of the seven fusion proteins for glucose and for D-fructose were determined by ion chromatography. Among the seven fusion proteins, GS3P and GE3P showed the higher conversion rates of D-glucose into D-fructose (FIG. 5A); GS3P and GE3P also showed the higher conversion rates of D-fructose into D-psicose (FIG. 5B). This result indicates that, for fusion proteins built with the same type of linker peptide, the conversion rate increases with the number of linker peptide units used.
Example 3: optimization of reaction conditions for catalyzing glucose to generate D-psicose by fusion proteins GE3P and GS3P
In order to improve the yield and the conversion rate of the D-psicose generated by one-step conversion of glucose catalyzed by the fusion protein, the invention further optimizes the reaction conditions for converting the D-psicose.
The invention optimizes the reaction conditions for converting glucose into D-psicose: the reaction temperature range tested is 45-85 ℃, the reaction pH range tested is 4.5-8.5, and the concentration of the substrate glucose is 100 g/L. The optimum reaction temperature was found to be 65 ℃ (FIG. 6A), and the optimum reaction pH was 7.5 (FIG. 6B).
Under the optimal conditions, the yield of the D-psicose can reach 5.69 g/L. The result shows that the method for generating D-psicose by catalyzing glucose through the fusion protein can be smoothly carried out, and has great application potential.
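The reported titer can be put in relation to the substrate load with a short calculation. This is a minimal sketch with hypothetical function names; the molar mass of 180.16 g/mol for a C6H12O6 hexose is a standard value and not taken from the source. Because glucose and D-psicose are isomers, the mass yield and the molar yield are numerically the same.

```python
HEXOSE_MOLAR_MASS = 180.16  # g/mol for C6H12O6 (glucose, fructose, psicose)


def mass_yield_percent(product_g_per_l: float, substrate_g_per_l: float) -> float:
    """Product formed per substrate supplied, as a percentage by mass."""
    return 100.0 * product_g_per_l / substrate_g_per_l


def molar_concentration_mm(g_per_l: float, molar_mass: float = HEXOSE_MOLAR_MASS) -> float:
    """Convert a mass concentration (g/L) to millimolar."""
    return 1000.0 * g_per_l / molar_mass


# Values from the text: 5.69 g/L D-psicose from 100 g/L glucose.
yield_pct = mass_yield_percent(5.69, 100.0)
psicose_mm = molar_concentration_mm(5.69)
glucose_mm = molar_concentration_mm(100.0)

print(f"yield: {yield_pct:.2f}% of the supplied glucose")
print(f"D-psicose: {psicose_mm:.2f} mM from {glucose_mm:.1f} mM glucose")
```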
The foregoing is only an alternative embodiment of the present invention. It should be noted that those skilled in the art can make modifications and refinements without departing from the principle of the present invention, and these shall also be regarded as falling within the protection scope of the present invention.
Sequence listing
<110> Zhengzhou University
<120> Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof
<160> 16
<170> SIPOSequenceListing 1.0
<210> 1
<211> 733
<212> PRT
<213> Artificial sequence
<400> 1
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Lys His Gly Ile Tyr Tyr Ala
435 440 445
Tyr Trp Glu Gln Glu Trp Glu Ala Asp Tyr Lys Tyr Tyr Ile Glu Lys
450 455 460
Val Ala Lys Leu Gly Phe Asp Ile Leu Glu Ile Ala Ala Ser Pro Leu
465 470 475 480
Pro Phe Tyr Ser Asp Ile Gln Ile Asn Glu Leu Lys Ala Cys Ala His
485 490 495
Gly Asn Gly Ile Thr Leu Thr Val Gly His Gly Pro Ser Ala Glu Gln
500 505 510
Asn Leu Ser Ser Pro Asp Pro Asp Ile Arg Lys Asn Ala Lys Ala Phe
515 520 525
Tyr Thr Asp Leu Leu Lys Arg Leu Tyr Lys Leu Asp Val His Leu Ile
530 535 540
Gly Gly Ala Leu Tyr Ser Tyr Trp Pro Ile Asp Tyr Thr Lys Thr Ile
545 550 555 560
Asp Lys Lys Gly Asp Trp Glu Arg Ser Val Glu Ser Val Arg Glu Val
565 570 575
Ala Lys Val Ala Glu Ala Cys Gly Val Asp Phe Cys Leu Glu Val Leu
580 585 590
Asn Arg Phe Glu Asn Tyr Leu Ile Asn Thr Ala Gln Glu Gly Val Asp
595 600 605
Phe Val Lys Gln Val Asp His Asn Asn Val Lys Val Met Leu Asp Thr
610 615 620
Phe His Met Asn Ile Glu Glu Asp Ser Ile Gly Gly Ala Ile Arg Thr
625 630 635 640
Ala Gly Ser Tyr Leu Gly His Leu His Thr Gly Glu Cys Asn Arg Lys
645 650 655
Val Pro Gly Arg Gly Arg Ile Pro Trp Val Glu Ile Gly Glu Ala Leu
660 665 670
Ala Asp Ile Gly Tyr Asn Gly Ser Val Val Met Glu Pro Phe Val Arg
675 680 685
Met Gly Gly Thr Val Gly Ser Asn Ile Lys Val Trp Arg Asp Ile Ser
690 695 700
Asn Gly Ala Asp Glu Lys Met Leu Asp Arg Glu Ala Gln Ala Ala Leu
705 710 715 720
Asp Phe Ser Arg Tyr Val Leu Glu Cys His Lys His Ser
725 730
<210> 2
<211> 738
<212> PRT
<213> Artificial sequence
<400> 2
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Gly Gly Gly Gly Ser Lys His
435 440 445
Gly Ile Tyr Tyr Ala Tyr Trp Glu Gln Glu Trp Glu Ala Asp Tyr Lys
450 455 460
Tyr Tyr Ile Glu Lys Val Ala Lys Leu Gly Phe Asp Ile Leu Glu Ile
465 470 475 480
Ala Ala Ser Pro Leu Pro Phe Tyr Ser Asp Ile Gln Ile Asn Glu Leu
485 490 495
Lys Ala Cys Ala His Gly Asn Gly Ile Thr Leu Thr Val Gly His Gly
500 505 510
Pro Ser Ala Glu Gln Asn Leu Ser Ser Pro Asp Pro Asp Ile Arg Lys
515 520 525
Asn Ala Lys Ala Phe Tyr Thr Asp Leu Leu Lys Arg Leu Tyr Lys Leu
530 535 540
Asp Val His Leu Ile Gly Gly Ala Leu Tyr Ser Tyr Trp Pro Ile Asp
545 550 555 560
Tyr Thr Lys Thr Ile Asp Lys Lys Gly Asp Trp Glu Arg Ser Val Glu
565 570 575
Ser Val Arg Glu Val Ala Lys Val Ala Glu Ala Cys Gly Val Asp Phe
580 585 590
Cys Leu Glu Val Leu Asn Arg Phe Glu Asn Tyr Leu Ile Asn Thr Ala
595 600 605
Gln Glu Gly Val Asp Phe Val Lys Gln Val Asp His Asn Asn Val Lys
610 615 620
Val Met Leu Asp Thr Phe His Met Asn Ile Glu Glu Asp Ser Ile Gly
625 630 635 640
Gly Ala Ile Arg Thr Ala Gly Ser Tyr Leu Gly His Leu His Thr Gly
645 650 655
Glu Cys Asn Arg Lys Val Pro Gly Arg Gly Arg Ile Pro Trp Val Glu
660 665 670
Ile Gly Glu Ala Leu Ala Asp Ile Gly Tyr Asn Gly Ser Val Val Met
675 680 685
Glu Pro Phe Val Arg Met Gly Gly Thr Val Gly Ser Asn Ile Lys Val
690 695 700
Trp Arg Asp Ile Ser Asn Gly Ala Asp Glu Lys Met Leu Asp Arg Glu
705 710 715 720
Ala Gln Ala Ala Leu Asp Phe Ser Arg Tyr Val Leu Glu Cys His Lys
725 730 735
His Ser
<210> 3
<211> 743
<212> PRT
<213> Artificial sequence
<400> 3
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Gly Gly Gly Gly Ser Gly Gly
435 440 445
Gly Gly Ser Lys His Gly Ile Tyr Tyr Ala Tyr Trp Glu Gln Glu Trp
450 455 460
Glu Ala Asp Tyr Lys Tyr Tyr Ile Glu Lys Val Ala Lys Leu Gly Phe
465 470 475 480
Asp Ile Leu Glu Ile Ala Ala Ser Pro Leu Pro Phe Tyr Ser Asp Ile
485 490 495
Gln Ile Asn Glu Leu Lys Ala Cys Ala His Gly Asn Gly Ile Thr Leu
500 505 510
Thr Val Gly His Gly Pro Ser Ala Glu Gln Asn Leu Ser Ser Pro Asp
515 520 525
Pro Asp Ile Arg Lys Asn Ala Lys Ala Phe Tyr Thr Asp Leu Leu Lys
530 535 540
Arg Leu Tyr Lys Leu Asp Val His Leu Ile Gly Gly Ala Leu Tyr Ser
545 550 555 560
Tyr Trp Pro Ile Asp Tyr Thr Lys Thr Ile Asp Lys Lys Gly Asp Trp
565 570 575
Glu Arg Ser Val Glu Ser Val Arg Glu Val Ala Lys Val Ala Glu Ala
580 585 590
Cys Gly Val Asp Phe Cys Leu Glu Val Leu Asn Arg Phe Glu Asn Tyr
595 600 605
Leu Ile Asn Thr Ala Gln Glu Gly Val Asp Phe Val Lys Gln Val Asp
610 615 620
His Asn Asn Val Lys Val Met Leu Asp Thr Phe His Met Asn Ile Glu
625 630 635 640
Glu Asp Ser Ile Gly Gly Ala Ile Arg Thr Ala Gly Ser Tyr Leu Gly
645 650 655
His Leu His Thr Gly Glu Cys Asn Arg Lys Val Pro Gly Arg Gly Arg
660 665 670
Ile Pro Trp Val Glu Ile Gly Glu Ala Leu Ala Asp Ile Gly Tyr Asn
675 680 685
Gly Ser Val Val Met Glu Pro Phe Val Arg Met Gly Gly Thr Val Gly
690 695 700
Ser Asn Ile Lys Val Trp Arg Asp Ile Ser Asn Gly Ala Asp Glu Lys
705 710 715 720
Met Leu Asp Arg Glu Ala Gln Ala Ala Leu Asp Phe Ser Arg Tyr Val
725 730 735
Leu Glu Cys His Lys His Ser
740
<210> 4
<211> 748
<212> PRT
<213> Artificial sequence
<400> 4
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Gly Gly Gly Gly Ser Gly Gly
435 440 445
Gly Gly Ser Gly Gly Gly Gly Ser Lys His Gly Ile Tyr Tyr Ala Tyr
450 455 460
Trp Glu Gln Glu Trp Glu Ala Asp Tyr Lys Tyr Tyr Ile Glu Lys Val
465 470 475 480
Ala Lys Leu Gly Phe Asp Ile Leu Glu Ile Ala Ala Ser Pro Leu Pro
485 490 495
Phe Tyr Ser Asp Ile Gln Ile Asn Glu Leu Lys Ala Cys Ala His Gly
500 505 510
Asn Gly Ile Thr Leu Thr Val Gly His Gly Pro Ser Ala Glu Gln Asn
515 520 525
Leu Ser Ser Pro Asp Pro Asp Ile Arg Lys Asn Ala Lys Ala Phe Tyr
530 535 540
Thr Asp Leu Leu Lys Arg Leu Tyr Lys Leu Asp Val His Leu Ile Gly
545 550 555 560
Gly Ala Leu Tyr Ser Tyr Trp Pro Ile Asp Tyr Thr Lys Thr Ile Asp
565 570 575
Lys Lys Gly Asp Trp Glu Arg Ser Val Glu Ser Val Arg Glu Val Ala
580 585 590
Lys Val Ala Glu Ala Cys Gly Val Asp Phe Cys Leu Glu Val Leu Asn
595 600 605
Arg Phe Glu Asn Tyr Leu Ile Asn Thr Ala Gln Glu Gly Val Asp Phe
610 615 620
Val Lys Gln Val Asp His Asn Asn Val Lys Val Met Leu Asp Thr Phe
625 630 635 640
His Met Asn Ile Glu Glu Asp Ser Ile Gly Gly Ala Ile Arg Thr Ala
645 650 655
Gly Ser Tyr Leu Gly His Leu His Thr Gly Glu Cys Asn Arg Lys Val
660 665 670
Pro Gly Arg Gly Arg Ile Pro Trp Val Glu Ile Gly Glu Ala Leu Ala
675 680 685
Asp Ile Gly Tyr Asn Gly Ser Val Val Met Glu Pro Phe Val Arg Met
690 695 700
Gly Gly Thr Val Gly Ser Asn Ile Lys Val Trp Arg Asp Ile Ser Asn
705 710 715 720
Gly Ala Asp Glu Lys Met Leu Asp Arg Glu Ala Gln Ala Ala Leu Asp
725 730 735
Phe Ser Arg Tyr Val Leu Glu Cys His Lys His Ser
740 745
<210> 5
<211> 738
<212> PRT
<213> Artificial sequence
<400> 5
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Glu Ala Ala Ala Lys Lys His
435 440 445
Gly Ile Tyr Tyr Ala Tyr Trp Glu Gln Glu Trp Glu Ala Asp Tyr Lys
450 455 460
Tyr Tyr Ile Glu Lys Val Ala Lys Leu Gly Phe Asp Ile Leu Glu Ile
465 470 475 480
Ala Ala Ser Pro Leu Pro Phe Tyr Ser Asp Ile Gln Ile Asn Glu Leu
485 490 495
Lys Ala Cys Ala His Gly Asn Gly Ile Thr Leu Thr Val Gly His Gly
500 505 510
Pro Ser Ala Glu Gln Asn Leu Ser Ser Pro Asp Pro Asp Ile Arg Lys
515 520 525
Asn Ala Lys Ala Phe Tyr Thr Asp Leu Leu Lys Arg Leu Tyr Lys Leu
530 535 540
Asp Val His Leu Ile Gly Gly Ala Leu Tyr Ser Tyr Trp Pro Ile Asp
545 550 555 560
Tyr Thr Lys Thr Ile Asp Lys Lys Gly Asp Trp Glu Arg Ser Val Glu
565 570 575
Ser Val Arg Glu Val Ala Lys Val Ala Glu Ala Cys Gly Val Asp Phe
580 585 590
Cys Leu Glu Val Leu Asn Arg Phe Glu Asn Tyr Leu Ile Asn Thr Ala
595 600 605
Gln Glu Gly Val Asp Phe Val Lys Gln Val Asp His Asn Asn Val Lys
610 615 620
Val Met Leu Asp Thr Phe His Met Asn Ile Glu Glu Asp Ser Ile Gly
625 630 635 640
Gly Ala Ile Arg Thr Ala Gly Ser Tyr Leu Gly His Leu His Thr Gly
645 650 655
Glu Cys Asn Arg Lys Val Pro Gly Arg Gly Arg Ile Pro Trp Val Glu
660 665 670
Ile Gly Glu Ala Leu Ala Asp Ile Gly Tyr Asn Gly Ser Val Val Met
675 680 685
Glu Pro Phe Val Arg Met Gly Gly Thr Val Gly Ser Asn Ile Lys Val
690 695 700
Trp Arg Asp Ile Ser Asn Gly Ala Asp Glu Lys Met Leu Asp Arg Glu
705 710 715 720
Ala Gln Ala Ala Leu Asp Phe Ser Arg Tyr Val Leu Glu Cys His Lys
725 730 735
His Ser
<210> 6
<211> 743
<212> PRT
<213> Artificial sequence
<400> 6
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Glu Ala Ala Ala Lys Glu Ala
435 440 445
Ala Ala Lys Lys His Gly Ile Tyr Tyr Ala Tyr Trp Glu Gln Glu Trp
450 455 460
Glu Ala Asp Tyr Lys Tyr Tyr Ile Glu Lys Val Ala Lys Leu Gly Phe
465 470 475 480
Asp Ile Leu Glu Ile Ala Ala Ser Pro Leu Pro Phe Tyr Ser Asp Ile
485 490 495
Gln Ile Asn Glu Leu Lys Ala Cys Ala His Gly Asn Gly Ile Thr Leu
500 505 510
Thr Val Gly His Gly Pro Ser Ala Glu Gln Asn Leu Ser Ser Pro Asp
515 520 525
Pro Asp Ile Arg Lys Asn Ala Lys Ala Phe Tyr Thr Asp Leu Leu Lys
530 535 540
Arg Leu Tyr Lys Leu Asp Val His Leu Ile Gly Gly Ala Leu Tyr Ser
545 550 555 560
Tyr Trp Pro Ile Asp Tyr Thr Lys Thr Ile Asp Lys Lys Gly Asp Trp
565 570 575
Glu Arg Ser Val Glu Ser Val Arg Glu Val Ala Lys Val Ala Glu Ala
580 585 590
Cys Gly Val Asp Phe Cys Leu Glu Val Leu Asn Arg Phe Glu Asn Tyr
595 600 605
Leu Ile Asn Thr Ala Gln Glu Gly Val Asp Phe Val Lys Gln Val Asp
610 615 620
His Asn Asn Val Lys Val Met Leu Asp Thr Phe His Met Asn Ile Glu
625 630 635 640
Glu Asp Ser Ile Gly Gly Ala Ile Arg Thr Ala Gly Ser Tyr Leu Gly
645 650 655
His Leu His Thr Gly Glu Cys Asn Arg Lys Val Pro Gly Arg Gly Arg
660 665 670
Ile Pro Trp Val Glu Ile Gly Glu Ala Leu Ala Asp Ile Gly Tyr Asn
675 680 685
Gly Ser Val Val Met Glu Pro Phe Val Arg Met Gly Gly Thr Val Gly
690 695 700
Ser Asn Ile Lys Val Trp Arg Asp Ile Ser Asn Gly Ala Asp Glu Lys
705 710 715 720
Met Leu Asp Arg Glu Ala Gln Ala Ala Leu Asp Phe Ser Arg Tyr Val
725 730 735
Leu Glu Cys His Lys His Ser
740
<210> 7
<211> 748
<212> PRT
<213> Artificial sequence
<400> 7
Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro
1 5 10 15
Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30
Gly Ser Met Ser Leu Thr Thr Ala Ser Ser Lys Thr Ile Glu Val Ala
35 40 45
Thr Pro Ser Lys Glu Asp Arg Phe Ser Phe Gly Leu Trp Thr Val Gly
50 55 60
Trp Gln Ala Arg Asp Pro Phe Gly Glu Ala Thr Arg Pro Pro Leu Asp
65 70 75 80
Pro Val Glu Ala Val His Lys Leu Ala Glu Leu Gly Ala Tyr Gly Val
85 90 95
Thr Phe His Asp Asp Asp Leu Val Pro Phe Gly Ser Ser Asp Ala Glu
100 105 110
Arg Ala Arg Leu Ile Asp Arg Phe Lys Lys Ala Leu Ala Asp Thr Gly
115 120 125
Leu Val Val Pro Met Met Thr Thr Asn Leu Phe Thr His Pro Ile Phe
130 135 140
Lys Asp Gly Ala Phe Thr Ala Asn Asp Arg Ser Ile Arg Arg Tyr Ala
145 150 155 160
Ile Arg Lys Val Met Arg Asn Leu Asp Leu Ala Ala Glu Leu Gly Ala
165 170 175
Arg Thr Tyr Val Phe Trp Gly Gly Arg Glu Gly Ser Glu Ile Asp Ala
180 185 190
Ala Lys Asp Ile Arg Ala Ala Leu Asp Arg Tyr Arg Glu Ala Ile Asp
195 200 205
Thr Leu Ala Gln Tyr Val Lys Asp Gln Gly Tyr Gly Ile Arg Phe Ala
210 215 220
Leu Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp Ile Phe Leu Pro Thr
225 230 235 240
Ile Gly His Ala Leu Ala Phe Ile Asn Ser Leu Glu His Ser Asp Ile
245 250 255
Val Gly Leu Asn Pro Glu Val Gly His Glu Gln Met Ser Asn Leu Asn
260 265 270
Phe Val His Gly Ile Ala Gln Ala Leu Trp His Gly Lys Leu Phe His
275 280 285
Ile Asp Leu Asn Gly Gln His Gly Pro Lys Tyr Asp Gln Asp Leu Val
290 295 300
Phe Gly His Gly Asp Leu Leu Ser Ala Phe Phe Leu Val Asp Leu Leu
305 310 315 320
Glu Asn Gly Phe Pro Gly Gly Gly Pro Val Tyr Asp Gly Pro Arg His
325 330 335
Phe Asp Tyr Lys Pro Met Arg Thr Glu Asp Ile Asp Gly Val Trp Ala
340 345 350
Ser Ala Ala Ala Asn Met Arg Thr Tyr Leu Leu Leu Lys Gln Arg Ala
355 360 365
Lys Ala Phe Arg Ala Asp Pro Glu Val Gln Ala Ala Leu Thr Ala Ser
370 375 380
Arg Val Pro Glu Leu Ala Val Pro Thr Leu Gly Glu Gly Glu Ser Tyr
385 390 395 400
Ala Asp Leu Leu Ala Asp Arg Ser Ala Trp Glu Glu Phe Asp Val Asp
405 410 415
Arg Ala Ala Asn Gln Gly Tyr Gly Tyr Ala Arg Leu Asp Gln Leu Ala
420 425 430
Ile Glu His Leu Leu Gly Ala Arg Gly Glu Ala Ala Ala Lys Glu Ala
435 440 445
Ala Ala Lys Glu Ala Ala Ala Lys Lys His Gly Ile Tyr Tyr Ala Tyr
450 455 460
Trp Glu Gln Glu Trp Glu Ala Asp Tyr Lys Tyr Tyr Ile Glu Lys Val
465 470 475 480
Ala Lys Leu Gly Phe Asp Ile Leu Glu Ile Ala Ala Ser Pro Leu Pro
485 490 495
Phe Tyr Ser Asp Ile Gln Ile Asn Glu Leu Lys Ala Cys Ala His Gly
500 505 510
Asn Gly Ile Thr Leu Thr Val Gly His Gly Pro Ser Ala Glu Gln Asn
515 520 525
Leu Ser Ser Pro Asp Pro Asp Ile Arg Lys Asn Ala Lys Ala Phe Tyr
530 535 540
Thr Asp Leu Leu Lys Arg Leu Tyr Lys Leu Asp Val His Leu Ile Gly
545 550 555 560
Gly Ala Leu Tyr Ser Tyr Trp Pro Ile Asp Tyr Thr Lys Thr Ile Asp
565 570 575
Lys Lys Gly Asp Trp Glu Arg Ser Val Glu Ser Val Arg Glu Val Ala
580 585 590
Lys Val Ala Glu Ala Cys Gly Val Asp Phe Cys Leu Glu Val Leu Asn
595 600 605
Arg Phe Glu Asn Tyr Leu Ile Asn Thr Ala Gln Glu Gly Val Asp Phe
610 615 620
Val Lys Gln Val Asp His Asn Asn Val Lys Val Met Leu Asp Thr Phe
625 630 635 640
His Met Asn Ile Glu Glu Asp Ser Ile Gly Gly Ala Ile Arg Thr Ala
645 650 655
Gly Ser Tyr Leu Gly His Leu His Thr Gly Glu Cys Asn Arg Lys Val
660 665 670
Pro Gly Arg Gly Arg Ile Pro Trp Val Glu Ile Gly Glu Ala Leu Ala
675 680 685
Asp Ile Gly Tyr Asn Gly Ser Val Val Met Glu Pro Phe Val Arg Met
690 695 700
Gly Gly Thr Val Gly Ser Asn Ile Lys Val Trp Arg Asp Ile Ser Asn
705 710 715 720
Gly Ala Asp Glu Lys Met Leu Asp Arg Glu Ala Gln Ala Ala Leu Asp
725 730 735
Phe Ser Arg Tyr Val Leu Glu Cys His Lys His Ser
740 745
<210> 8
<211> 2202
<212> DNA
<213> Artificial sequence
<400> 8
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtaaacacg gtatctacta cgcttactgg gaacaagaat gggaagctga ttacaaatac 1380
tacatcgaaa aagtagctaa attaggtttc gacatcttag aaatcgctgc ttctccactt 1440
ccattctact ctgatatcca aattaacgaa ttaaaagcat gcgctcacgg taacggtatt 1500
acacttactg ttggtcacgg cccatctgct gaacaaaact tatcttctcc tgacccagac 1560
attcgtaaaa acgctaaagc attctacact gatcttttaa aacgtttata caaattagat 1620
gtacacttaa ttggtggtgc tctttactct tactggccaa tcgactacac taaaactatc 1680
gataaaaaag gtgactggga acgttctgtt gaatcagtac gtgaagttgc taaagttgct 1740
gaggcttgtg gtgttgattt ctgtttagaa gtattaaacc gtttcgaaaa ctacttaatc 1800
aacactgctc aagaaggtgt tgatttcgtt aaacaagttg atcacaataa cgttaaagta 1860
atgttagata cattccatat gaacatcgaa gaagattcta tcggtggtgc aatccgtact 1920
gctggttctt acttaggtca ccttcacact ggtgaatgta accgtaaagt acctggtcgt 1980
ggtcgtatcc catgggttga gatcggtgaa gcattagctg acatcggtta caacggttct 2040
gtagtaatgg aaccattcgt acgtatgggt ggtactgttg gttctaacat taaagtttgg 2100
cgtgacatct ctaacggtgc tgatgaaaaa atgttagatc gtgaagctca agctgcatta 2160
gatttctctc gttacgttct tgaatgtcac aaacactctt aa 2202
<210> 9
<211> 2217
<212> DNA
<213> Artificial sequence
<400> 9
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtggtggag gcggaagtaa acacggtatc tactacgctt actgggaaca agaatgggaa 1380
gctgattaca aatactacat cgaaaaagta gctaaattag gtttcgacat cttagaaatc 1440
gctgcttctc cacttccatt ctactctgat atccaaatta acgaattaaa agcatgcgct 1500
cacggtaacg gtattacact tactgttggt cacggcccat ctgctgaaca aaacttatct 1560
tctcctgacc cagacattcg taaaaacgct aaagcattct acactgatct tttaaaacgt 1620
ttatacaaat tagatgtaca cttaattggt ggtgctcttt actcttactg gccaatcgac 1680
tacactaaaa ctatcgataa aaaaggtgac tgggaacgtt ctgttgaatc agtacgtgaa 1740
gttgctaaag ttgctgaggc ttgtggtgtt gatttctgtt tagaagtatt aaaccgtttc 1800
gaaaactact taatcaacac tgctcaagaa ggtgttgatt tcgttaaaca agttgatcac 1860
aataacgtta aagtaatgtt agatacattc catatgaaca tcgaagaaga ttctatcggt 1920
ggtgcaatcc gtactgctgg ttcttactta ggtcaccttc acactggtga atgtaaccgt 1980
aaagtacctg gtcgtggtcg tatcccatgg gttgagatcg gtgaagcatt agctgacatc 2040
ggttacaacg gttctgtagt aatggaacca ttcgtacgta tgggtggtac tgttggttct 2100
aacattaaag tttggcgtga catctctaac ggtgctgatg aaaaaatgtt agatcgtgaa 2160
gctcaagctg cattagattt ctctcgttac gttcttgaat gtcacaaaca ctcttaa 2217
<210> 10
<211> 2232
<212> DNA
<213> Artificial sequence
<400> 10
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtggtggag gcggaagtgg cggtggtggc agcaaacacg gtatctacta cgcttactgg 1380
gaacaagaat gggaagctga ttacaaatac tacatcgaaa aagtagctaa attaggtttc 1440
gacatcttag aaatcgctgc ttctccactt ccattctact ctgatatcca aattaacgaa 1500
ttaaaagcat gcgctcacgg taacggtatt acacttactg ttggtcacgg cccatctgct 1560
gaacaaaact tatcttctcc tgacccagac attcgtaaaa acgctaaagc attctacact 1620
gatcttttaa aacgtttata caaattagat gtacacttaa ttggtggtgc tctttactct 1680
tactggccaa tcgactacac taaaactatc gataaaaaag gtgactggga acgttctgtt 1740
gaatcagtac gtgaagttgc taaagttgct gaggcttgtg gtgttgattt ctgtttagaa 1800
gtattaaacc gtttcgaaaa ctacttaatc aacactgctc aagaaggtgt tgatttcgtt 1860
aaacaagttg atcacaataa cgttaaagta atgttagata cattccatat gaacatcgaa 1920
gaagattcta tcggtggtgc aatccgtact gctggttctt acttaggtca ccttcacact 1980
ggtgaatgta accgtaaagt acctggtcgt ggtcgtatcc catgggttga gatcggtgaa 2040
gcattagctg acatcggtta caacggttct gtagtaatgg aaccattcgt acgtatgggt 2100
ggtactgttg gttctaacat taaagtttgg cgtgacatct ctaacggtgc tgatgaaaaa 2160
atgttagatc gtgaagctca agctgcatta gatttctctc gttacgttct tgaatgtcac 2220
aaacactctt aa 2232
<210> 11
<211> 2247
<212> DNA
<213> Artificial sequence
<400> 11
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtggggggg gaggttcagg tggaggcgga agtggcggtg gtggcagcaa acacggtatc 1380
tactacgctt actgggaaca agaatgggaa gctgattaca aatactacat cgaaaaagta 1440
gctaaattag gtttcgacat cttagaaatc gctgcttctc cacttccatt ctactctgat 1500
atccaaatta acgaattaaa agcatgcgct cacggtaacg gtattacact tactgttggt 1560
cacggcccat ctgctgaaca aaacttatct tctcctgacc cagacattcg taaaaacgct 1620
aaagcattct acactgatct tttaaaacgt ttatacaaat tagatgtaca cttaattggt 1680
ggtgctcttt actcttactg gccaatcgac tacactaaaa ctatcgataa aaaaggtgac 1740
tgggaacgtt ctgttgaatc agtacgtgaa gttgctaaag ttgctgaggc ttgtggtgtt 1800
gatttctgtt tagaagtatt aaaccgtttc gaaaactact taatcaacac tgctcaagaa 1860
ggtgttgatt tcgttaaaca agttgatcac aataacgtta aagtaatgtt agatacattc 1920
catatgaaca tcgaagaaga ttctatcggt ggtgcaatcc gtactgctgg ttcttactta 1980
ggtcaccttc acactggtga atgtaaccgt aaagtacctg gtcgtggtcg tatcccatgg 2040
gttgagatcg gtgaagcatt agctgacatc ggttacaacg gttctgtagt aatggaacca 2100
ttcgtacgta tgggtggtac tgttggttct aacattaaag tttggcgtga catctctaac 2160
ggtgctgatg aaaaaatgtt agatcgtgaa gctcaagctg cattagattt ctctcgttac 2220
gttcttgaat gtcacaaaca ctcttaa 2247
<210> 12
<211> 2217
<212> DNA
<213> Artificial sequence
<400> 12
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtgaagcag ctgccaagaa acacggtatc tactacgctt actgggaaca agaatgggaa 1380
gctgattaca aatactacat cgaaaaagta gctaaattag gtttcgacat cttagaaatc 1440
gctgcttctc cacttccatt ctactctgat atccaaatta acgaattaaa agcatgcgct 1500
cacggtaacg gtattacact tactgttggt cacggcccat ctgctgaaca aaacttatct 1560
tctcctgacc cagacattcg taaaaacgct aaagcattct acactgatct tttaaaacgt 1620
ttatacaaat tagatgtaca cttaattggt ggtgctcttt actcttactg gccaatcgac 1680
tacactaaaa ctatcgataa aaaaggtgac tgggaacgtt ctgttgaatc agtacgtgaa 1740
gttgctaaag ttgctgaggc ttgtggtgtt gatttctgtt tagaagtatt aaaccgtttc 1800
gaaaactact taatcaacac tgctcaagaa ggtgttgatt tcgttaaaca agttgatcac 1860
aataacgtta aagtaatgtt agatacattc catatgaaca tcgaagaaga ttctatcggt 1920
ggtgcaatcc gtactgctgg ttcttactta ggtcaccttc acactggtga atgtaaccgt 1980
aaagtacctg gtcgtggtcg tatcccatgg gttgagatcg gtgaagcatt agctgacatc 2040
ggttacaacg gttctgtagt aatggaacca ttcgtacgta tgggtggtac tgttggttct 2100
aacattaaag tttggcgtga catctctaac ggtgctgatg aaaaaatgtt agatcgtgaa 2160
gctcaagctg cattagattt ctctcgttac gttcttgaat gtcacaaaca ctcttaa 2217
<210> 13
<211> 2232
<212> DNA
<213> Artificial sequence
<400> 13
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtgaagcag ctgccaagga ggcagctgcg aagaaacacg gtatctacta cgcttactgg 1380
gaacaagaat gggaagctga ttacaaatac tacatcgaaa aagtagctaa attaggtttc 1440
gacatcttag aaatcgctgc ttctccactt ccattctact ctgatatcca aattaacgaa 1500
ttaaaagcat gcgctcacgg taacggtatt acacttactg ttggtcacgg cccatctgct 1560
gaacaaaact tatcttctcc tgacccagac attcgtaaaa acgctaaagc attctacact 1620
gatcttttaa aacgtttata caaattagat gtacacttaa ttggtggtgc tctttactct 1680
tactggccaa tcgactacac taaaactatc gataaaaaag gtgactggga acgttctgtt 1740
gaatcagtac gtgaagttgc taaagttgct gaggcttgtg gtgttgattt ctgtttagaa 1800
gtattaaacc gtttcgaaaa ctacttaatc aacactgctc aagaaggtgt tgatttcgtt 1860
aaacaagttg atcacaataa cgttaaagta atgttagata cattccatat gaacatcgaa 1920
gaagattcta tcggtggtgc aatccgtact gctggttctt acttaggtca ccttcacact 1980
ggtgaatgta accgtaaagt acctggtcgt ggtcgtatcc catgggttga gatcggtgaa 2040
gcattagctg acatcggtta caacggttct gtagtaatgg aaccattcgt acgtatgggt 2100
ggtactgttg gttctaacat taaagtttgg cgtgacatct ctaacggtgc tgatgaaaaa 2160
atgttagatc gtgaagctca agctgcatta gatttctctc gttacgttct tgaatgtcac 2220
aaacactctt aa 2232
<210> 14
<211> 2247
<212> DNA
<213> Artificial sequence
<400> 14
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggtgaggcag ctgcgaaaga agcagctgcc aaggaggcag ctgcgaagaa acacggtatc 1380
tactacgctt actgggaaca agaatgggaa gctgattaca aatactacat cgaaaaagta 1440
gctaaattag gtttcgacat cttagaaatc gctgcttctc cacttccatt ctactctgat 1500
atccaaatta acgaattaaa agcatgcgct cacggtaacg gtattacact tactgttggt 1560
cacggcccat ctgctgaaca aaacttatct tctcctgacc cagacattcg taaaaacgct 1620
aaagcattct acactgatct tttaaaacgt ttatacaaat tagatgtaca cttaattggt 1680
ggtgctcttt actcttactg gccaatcgac tacactaaaa ctatcgataa aaaaggtgac 1740
tgggaacgtt ctgttgaatc agtacgtgaa gttgctaaag ttgctgaggc ttgtggtgtt 1800
gatttctgtt tagaagtatt aaaccgtttc gaaaactact taatcaacac tgctcaagaa 1860
ggtgttgatt tcgttaaaca agttgatcac aataacgtta aagtaatgtt agatacattc 1920
catatgaaca tcgaagaaga ttctatcggt ggtgcaatcc gtactgctgg ttcttactta 1980
ggtcaccttc acactggtga atgtaaccgt aaagtacctg gtcgtggtcg tatcccatgg 2040
gttgagatcg gtgaagcatt agctgacatc ggttacaacg gttctgtagt aatggaacca 2100
ttcgtacgta tgggtggtac tgttggttct aacattaaag tttggcgtga catctctaac 2160
ggtgctgatg aaaaaatgtt agatcgtgaa gctcaagctg cattagattt ctctcgttac 2220
gttcttgaat gtcacaaaca ctcttaa 2247
<210> 15
<211> 1326
<212> DNA
<213> Artificial sequence
<400> 15
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgtctct tactactgct 120
tcttctaaaa ctatcgaagt tgcaactcca tcaaaagaag atcgtttctc tttcggtctt 180
tggactgttg gttggcaagc tcgtgaccca ttcggtgaag ctactcgccc accacttgac 240
ccagttgaag ctgtacacaa acttgctgaa ttaggtgcat acggtgttac tttccacgat 300
gatgaccttg ttccattcgg ttcttctgat gctgaacgcg ctcgcttaat cgatcgtttc 360
aaaaaagcac tagctgatac tggtttagtt gtaccaatga tgacaacaaa cttattcact 420
catcctatct tcaaagatgg tgctttcact gctaacgatc gttctatccg tcgttatgct 480
atccgcaaag taatgcgtaa cttagattta gcggctgaat taggtgctcg tacatacgta 540
ttctggggtg gtcgtgaagg ttctgaaatc gatgctgcaa aagacatccg tgctgcttta 600
gatcgttacc gtgaagctat tgacactctt gctcaatacg ttaaagatca aggttacggt 660
attcgtttcg cattagaacc aaaaccaaac gaaccacgcg gtgacatctt cttaccaact 720
atcggtcacg ctttagcttt catcaactct ttagaacact ctgatattgt tggtcttaac 780
cctgaagtag gtcacgaaca aatgtctaac ttaaacttcg tgcacggtat cgctcaagct 840
ctttggcacg gtaaattatt ccacattgat cttaacggtc aacacggtcc taaatacgac 900
caagacttag ttttcggtca cggtgattta ttatctgctt tcttccttgt agacttactt 960
gaaaacggtt tcccaggtgg aggaccagta tacgatggac ctcgtcactt cgattacaaa 1020
cctatgcgta ctgaagatat tgatggtgtt tgggcatcag cagcagctaa catgcgtact 1080
tacttacttt taaaacaacg tgcaaaagca ttccgtgctg atcctgaagt tcaagcagct 1140
ttaacagctt cacgcgttcc tgaattagct gttccaacat taggtgaagg tgaatcttac 1200
gctgatttat tagctgatcg ttcagcttgg gaagaatttg atgttgatcg tgctgctaac 1260
caaggttacg gttacgctcg tcttgatcaa ttagctatcg aacacttatt aggtgctcgt 1320
ggttaa 1326
<210> 16
<211> 984
<212> DNA
<213> Artificial sequence
<400> 16
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60
atggcaagca tgactggtgg acagcaaatg ggtcgcggat ccatgaaaca cggtatctac 120
tacgcttact gggaacaaga atgggaagct gattacaaat actacatcga aaaagtagct 180
aaattaggtt tcgacatctt agaaatcgct gcttctccac ttccattcta ctctgatatc 240
caaattaacg aattaaaagc atgcgctcac ggtaacggta ttacacttac tgttggtcac 300
ggcccatctg ctgaacaaaa cttatcttct cctgacccag acattcgtaa aaacgctaaa 360
gcattctaca ctgatctttt aaaacgttta tacaaattag atgtacactt aattggtggt 420
gctctttact cttactggcc aatcgactac actaaaacta tcgataaaaa aggtgactgg 480
gaacgttctg ttgaatcagt acgtgaagtt gctaaagttg ctgaggcttg tggtgttgat 540
ttctgtttag aagtattaaa ccgtttcgaa aactacttaa tcaacactgc tcaagaaggt 600
gttgatttcg ttaaacaagt tgatcacaat aacgttaaag taatgttaga tacattccat 660
atgaacatcg aagaagattc tatcggtggt gcaatccgta ctgctggttc ttacttaggt 720
caccttcaca ctggtgaatg taaccgtaaa gtacctggtc gtggtcgtat cccatgggtt 780
gagatcggtg aagcattagc tgacatcggt tacaacggtt ctgtagtaat ggaaccattc 840
gtacgtatgg gtggtactgt tggttctaac attaaagttt ggcgtgacat ctctaacggt 900
gctgatgaaa aaatgttaga tcgtgaagct caagctgcat tagatttctc tcgttacgtt 960
cttgaatgtc acaaacactc ttaa 984
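As a reading aid for the listing above, the following Python sketch strips the spacing and position counters from sequence-listing lines and translates the codons at the fusion junction. It is only an illustration: the helper names `clean` and `translate` are hypothetical and not part of the application, and the codon table is the standard genetic code. The two input lines are copied from SEQ ID NO:9 and SEQ ID NO:12 (the lines ending at position 1380), which begin in frame at nucleotide 1321.

```python
import re

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(listing_line):
    """Keep only nucleotide letters, dropping spaces and the position counter."""
    return re.sub(r"[^acgt]", "", listing_line.lower())

def translate(dna):
    """Translate complete codons of an in-frame DNA string to one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Lines covering nucleotides 1321-1380 of SEQ ID NO:9 (GS1P) and SEQ ID NO:12 (GE1P).
gs1p_line = "ggtggtggag gcggaagtaa acacggtatc tactacgctt actgggaaca agaatgggaa 1380"
ge1p_line = "ggtgaagcag ctgccaagaa acacggtatc tactacgctt actgggaaca agaatgggaa 1380"

gs1p_junction = translate(clean(gs1p_line)[:30])
ge1p_junction = translate(clean(ge1p_line)[:30])
print(gs1p_junction)  # GGGGGSKHGI
print(ge1p_junction)  # GEAAAKKHGI
```

In both results the first Gly is the last residue encoded by SEQ ID NO:15 before its stop codon, the next five residues are the linker (GGGGS or EAAAK), and KHGI corresponds to the start of the SEQ ID NO:16 coding region after its initiator Met.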
Claims (8)
1. A fusion protein for catalyzing the synthesis of D-psicose from glucose, and a construction method thereof, characterized in that glucose is used as the substrate and the fusion protein (GP, SEQ ID NO:1; GS1P, SEQ ID NO:2; GS2P, SEQ ID NO:3; GS3P, SEQ ID NO:4; GE1P, SEQ ID NO:5; GE2P, SEQ ID NO:6; GE3P, SEQ ID NO:7) directly converts the glucose into D-psicose (the reaction process is shown in Figure 1).
2. The fusion protein according to claim 1, wherein the gene encoding the fusion protein (GP, SEQ ID NO:8; GS1P, SEQ ID NO:9; GS2P, SEQ ID NO:10; GS3P, SEQ ID NO:11; GE1P, SEQ ID NO:12; GE2P, SEQ ID NO:13; GE3P, SEQ ID NO:14) is a fusion of a glucose isomerase-encoding gene (AcceGI, SEQ ID NO:15), a D-psicose 3-epimerase-encoding gene (CcDPEase, SEQ ID NO:16) and a linker peptide-encoding gene.
3. The fusion protein according to claim 1 and claim 2, wherein S denotes the flexible linker peptide (GGGGS), E denotes the rigid alpha-helical linker peptide (EAAAK), and the number indicates the number of tandem copies of the linker peptide used; for example, GS2P indicates that the glucose isomerase-encoding gene and the D-psicose 3-epimerase-encoding gene are fused through two copies of the flexible linker peptide. The glucose isomerase-encoding gene is derived from Acidothermus cellulolyticus 11B. The D-psicose 3-epimerase-encoding gene is derived from Clostridium cellulolyticum H10.
4. The method for synthesizing D-psicose according to claim 1, wherein the plasmid pET28a (PB) N (SEQ ID NO:17) is used as the expression vector for the conversion pathway, and the recombinant plasmids pET28a (PB) N-GP, pET28a (PB) N-GS1P, pET28a (PB) N-GS2P, pET28a (PB) N-GS3P, pET28a (PB) N-GE1P, pET28a (PB) N-GE2P and pET28a (PB) N-GE3P, containing all pathway genes, are constructed (the plasmid structure is shown in Figure 3).
5. According to claim 1, claim 2, claim 3 and claim 4, Escherichia coli BL21(DE3) is used as the expression host, the recombinant plasmids are transformed into Escherichia coli BL21(DE3), and the engineered strains BL21/GP, BL21/GS1P, BL21/GS2P, BL21/GS3P, BL21/GE1P, BL21/GE2P and BL21/GE3P containing the fusion genes are obtained.
6. The method for synthesizing D-psicose from glucose as the substrate using the fusion protein according to claim 1 and the recombinant engineered strains of claim 5, wherein the conversion conditions are as follows: (1) the recombinant Escherichia coli engineered strain is cultured in LB medium to an OD600 of 0.6-0.8, isopropyl-β-D-thiogalactoside (IPTG) is added to a final concentration of 0.5 mM, and gene expression is induced at 25 °C and 200 rpm for 8 h; (2) the cells are collected by centrifugation at 4 °C and 8000 rpm, washed free of culture medium with 20 mM phosphate buffer solution (PBS) at pH 4.5-8.5, and resuspended in phosphate buffer of the corresponding pH to obtain resting cells; (3) glucose is added to the resting cells as the substrate at a final concentration of 100 g/L, with 1 mM CoCl2 and 5 mM MgCl2 as catalytic ions, and the reaction is carried out at 45-85 °C for 2 h; (4) the yield of D-psicose is analyzed by ion chromatography.
7. According to claim 1 and claim 6, the optimal temperature for the fusion protein-catalyzed direct conversion of glucose into D-psicose in the present invention is 65 °C, and the optimal pH is 7.5.
8. According to claim 1 and claim 7, in the method for directly producing D-psicose from glucose by fusion protein catalysis, the yield of D-psicose is not less than 5.69 g/L under the optimal temperature and pH conditions.
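The construct naming convention of claim 3 can be sketched in Python as follows. This is a minimal illustration, not part of the application: the function names are hypothetical, and the split of each fusion gene into a 1323-nt glucose isomerase part and an 879-nt D-psicose 3-epimerase part is an assumption inferred from SEQ ID NO:15 and SEQ ID NO:16 in the sequence listing (SEQ ID NO:15 without its stop codon; SEQ ID NO:16 without its 102-nt N-terminal tag and start codon).

```python
import re

# Linker peptides named in claim 3.
LINKERS = {"S": "GGGGS", "E": "EAAAK"}

# Assumption inferred from the sequence listing, not stated in the claims:
# glucose isomerase part = SEQ ID NO:15 (1326 nt) minus its stop codon;
# epimerase part = SEQ ID NO:16 (984 nt) minus the 102-nt tag and the start codon.
GI_NT = 1326 - 3
DPE_NT = 984 - 102 - 3

def linker_peptide(name):
    """Return the linker amino acid sequence for names such as GP, GS2P or GE3P."""
    match = re.fullmatch(r"G(?:([SE])([1-9]))?P", name)
    if match is None:
        raise ValueError(f"unrecognized construct name: {name}")
    kind, copies = match.groups()
    return LINKERS[kind] * int(copies) if kind else ""

def fusion_gene_length(name):
    """Predicted fusion gene length in nucleotides, stop codon included."""
    return GI_NT + 3 * len(linker_peptide(name)) + DPE_NT

for name in ["GS1P", "GS2P", "GS3P", "GE1P", "GE2P", "GE3P"]:
    print(name, linker_peptide(name), fusion_gene_length(name))
```

Under these assumptions each additional linker copy adds 15 nucleotides, and the predicted lengths (2217, 2232 and 2247 nt for one, two and three copies) agree with the lengths given in the sequence listing for SEQ ID NO:9 to SEQ ID NO:14.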
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210352448.4A CN114591940A (en) | 2022-04-04 | 2022-04-04 | Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210352448.4A CN114591940A (en) | 2022-04-04 | 2022-04-04 | Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114591940A (en) | 2022-06-07 |
Family
ID=81811808
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN202210352448.4A | Pending | CN114591940A (en) | 2022-04-04 | 2022-04-04 | Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114591940A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023242877A1 (en) * | 2022-06-16 | 2023-12-21 | Fertis India Pvt Ltd | Enzymatic synthesis of d-allulose and its derivatives from fusion enzymes |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103710329A (en) * | 2013-12-24 | 2014-04-09 | 山西天骄食业有限公司 | Method for preparation of co-expressed recombinase with genetic engineering technology |
CN107723307A (en) * | 2017-10-09 | 2018-02-23 | 中国科学院天津工业生物技术研究所 | A kind of method and its application for efficiently preparing the epimerase of D psicoses 3 |
CN113801240A (en) * | 2021-08-26 | 2021-12-17 | 华南理工大学 | D-psicose-3-epimerase activity aggregate and preparation method and application thereof |
CN113980880A (en) * | 2021-09-24 | 2022-01-28 | 吉林中粮生化有限公司 | Genetically engineered bacterium, application thereof and method for producing psicose by taking glucose as raw material |
Non-Patent Citations (3)
Title |
---|
WANMENG MU ET AL.: "Characterization of a thermostable glucose isomerase with an acidic pH optimum from Acidothermus cellulolyticus", 《FOOD RESEARCH INTERNATIONAL》, vol. 47, pages 364 - 367, XP028926041, DOI: 10.1016/j.foodres.2011.09.006 * |
WANMENG MU ET AL.: "Cloning, expression, and characterization of a D-Psicose 3-Epimerase from Clostridium cellulolyticum H10", 《J. AGRIC. FOOD CHEM.》, vol. 59, pages 7785 - 7792 * |
YAN MEN ET AL.: "Co-expression of d-glucose isomerase and d-psicose 3-epimerase: Development of an efficient one-step production of d-psicose", 《ENZYME AND MICROBIAL TECHNOLOGY》, vol. 64, pages 212 - 127 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11203742B2 (en) | Sucrose phosphorylase mutant with improved enzyme activity and construction method thereof and use thereof | |
US9988618B2 (en) | Psicose epimerase and psicose production method using same | |
CN108034648B (en) | D-psicose 3-epimerase mutant with improved thermal stability | |
CN108018278B (en) | D-psicose 3-epimerase mutant with improved catalytic efficiency | |
CN108624576B (en) | Mutant of L-amino acid deaminase and preparation method and application thereof | |
CN112695021B (en) | Alpha-glycosidase gene mutant and application thereof in preparation of 2-O-alpha-D-glucosyl-L-ascorbic acid | |
CN113817763B (en) | Directed evolution method, mutant and application of beta-galactosidase family genes | |
CN114591939B (en) | High-heat-resistance D-psicose-3-epimerase mutant and application thereof | |
CN114591940A (en) | Fusion protein for catalyzing glucose to synthesize D-psicose and construction method thereof | |
CN111394410B (en) | High-catalytic-activity neuraminic acid synthase and application thereof | |
CN113151337A (en) | Method for expressing trehalose synthase by using EF-Tu promoter in corynebacterium glutamicum and application | |
US9752170B2 (en) | Method of production of monosaccharides | |
CN112746061A (en) | Meso-diaminopimelate dehydrogenase mutants and uses thereof | |
CN111455003A (en) | Method for preparing D-psicose from microalgae | |
CN111411066A (en) | Double-way composite neuraminic acid-producing bacillus subtilis and construction method thereof | |
CN114317509B (en) | Cellobiose epimerase mutant and application thereof | |
CN115896206A (en) | Synthesis method of D-mannose | |
CN112831532B (en) | Method for enzymatic synthesis of D-leucine | |
CN111471667B (en) | Chitosanase Csn-PT and application thereof | |
CN108251406B (en) | L-rhamnose-1-phosphate aldolase and application thereof in catalytic synthesis of rare sugar D-psicose | |
CN115011622A (en) | Screening method and application of D-psicose 3-epimerase mutant | |
CN106906192B (en) | Glucosyltransferase and application thereof in synthesis of crocetin glucose ester | |
CN111534498A (en) | Cyclodextrin glucosyltransferase mutant with improved disproportionation specific activity and AA-2G yield | |
CN112522343B (en) | Method for producing D-psicose by using NAD (P) -dependent alcohol dehydrogenase as catalyst | |
CN114958894B (en) | Construction method and application of spermidine synthetic multienzyme complex based on CcmK2 fibrous protein |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20220607 |