WO1998031703A9 - Methods for altering three-dimensional protein structure and compositions produced thereby - Google Patents
Methods for altering three-dimensional protein structure and compositions produced therebyInfo
- Publication number
- WO1998031703A9 WO1998031703A9 PCT/US1998/000853 US9800853W WO9831703A9 WO 1998031703 A9 WO1998031703 A9 WO 1998031703A9 US 9800853 W US9800853 W US 9800853W WO 9831703 A9 WO9831703 A9 WO 9831703A9
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protein
- leu
- altered
- val
- pro
- Prior art date
Links
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 141
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 141
- 239000000203 mixture Substances 0.000 title abstract description 15
- 238000006384 oligomerization reaction Methods 0.000 claims abstract description 59
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 34
- 125000001165 hydrophobic group Chemical group 0.000 claims abstract description 25
- 229920001850 Nucleic acid sequence Polymers 0.000 claims abstract description 24
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 4
- 102000037240 fusion proteins Human genes 0.000 claims abstract description 4
- 102100019730 TP53 Human genes 0.000 claims description 122
- 230000002209 hydrophobic Effects 0.000 claims description 37
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 36
- 238000006467 substitution reaction Methods 0.000 claims description 29
- 150000001413 amino acids Chemical class 0.000 claims description 27
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 18
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 17
- 102000008300 Mutant Proteins Human genes 0.000 claims description 17
- 108010021466 Mutant Proteins Proteins 0.000 claims description 17
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 claims description 15
- 239000008194 pharmaceutical composition Substances 0.000 claims description 13
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 claims description 10
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 5
- 102000004190 Enzymes Human genes 0.000 claims description 3
- 108090000790 Enzymes Proteins 0.000 claims description 3
- 239000003937 drug carrier Substances 0.000 claims description 3
- 102000018146 globin family Human genes 0.000 claims description 3
- 108060003196 globin family Proteins 0.000 claims description 3
- LZDNBBYBDGBADK-KBPBESRZSA-N (2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-KBPBESRZSA-N 0.000 claims description 2
- 108010066427 N-valyltryptophan Proteins 0.000 claims description 2
- 230000001105 regulatory Effects 0.000 claims description 2
- BVRPESWOSNFUCJ-LKTVYLICSA-N BNC210 Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 BVRPESWOSNFUCJ-LKTVYLICSA-N 0.000 claims 1
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 claims 1
- MUFXDFWAJSPHIQ-XDTLVQLUSA-N Ile-Tyr Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 MUFXDFWAJSPHIQ-XDTLVQLUSA-N 0.000 claims 1
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 claims 1
- JMCOUWKXLXDERB-WMZOPIPTSA-N Phe-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 JMCOUWKXLXDERB-WMZOPIPTSA-N 0.000 claims 1
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 claims 1
- VEYJKJORLPYVLO-RYUDHWBXSA-N Val-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VEYJKJORLPYVLO-RYUDHWBXSA-N 0.000 claims 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 claims 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 claims 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 claims 1
- 108010009962 valyltyrosine Proteins 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 112
- 235000001014 amino acid Nutrition 0.000 description 29
- 108090001123 antibodies Proteins 0.000 description 19
- 102000004965 antibodies Human genes 0.000 description 19
- 210000004027 cells Anatomy 0.000 description 18
- 239000000539 dimer Substances 0.000 description 18
- 108010087924 alanylproline Proteins 0.000 description 17
- 229920003013 deoxyribonucleic acid Polymers 0.000 description 17
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 16
- 238000000034 method Methods 0.000 description 16
- 108010026333 seryl-proline Proteins 0.000 description 16
- 125000003275 alpha amino acid group Chemical group 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 241000880493 Leptailurus serval Species 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 12
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 12
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 12
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 12
- 230000027455 binding Effects 0.000 description 12
- 201000011510 cancer Diseases 0.000 description 12
- 108010054155 lysyllysine Proteins 0.000 description 12
- 108020004707 nucleic acids Proteins 0.000 description 12
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 11
- 230000004568 DNA-binding Effects 0.000 description 11
- 108010068380 arginylarginine Proteins 0.000 description 11
- 108010077112 prolyl-proline Proteins 0.000 description 11
- 108020003175 receptors Proteins 0.000 description 10
- 102000005962 receptors Human genes 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 9
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 9
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 9
- 108010060199 cysteinylproline Proteins 0.000 description 9
- 108010018006 histidylserine Proteins 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- FSHURBQASBLAPO-WDSKDSINSA-N Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)N FSHURBQASBLAPO-WDSKDSINSA-N 0.000 description 8
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 8
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 8
- VHOLZZKNEBBHTH-YUMQZZPRSA-N His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 VHOLZZKNEBBHTH-YUMQZZPRSA-N 0.000 description 8
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 8
- LDEBVRIURYMKQS-UHFFFAOYSA-N Serinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CO LDEBVRIURYMKQS-UHFFFAOYSA-N 0.000 description 8
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 8
- 239000003153 chemical reaction reagent Substances 0.000 description 8
- 230000000875 corresponding Effects 0.000 description 8
- KZNQNBZMBZJQJO-YFKPBYRVSA-N gly pro Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 8
- 108010077515 glycylproline Proteins 0.000 description 8
- 108010028295 histidylhistidine Proteins 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 7
- WYVKPHCYMTWUCW-UHFFFAOYSA-N Cysteinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CS WYVKPHCYMTWUCW-UHFFFAOYSA-N 0.000 description 7
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 7
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 7
- 238000005481 NMR spectroscopy Methods 0.000 description 7
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 7
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 7
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 102000035365 modified proteins Human genes 0.000 description 7
- 108091005569 modified proteins Proteins 0.000 description 7
- 230000001225 therapeutic Effects 0.000 description 7
- 230000003612 virological Effects 0.000 description 7
- CXRCVCURMBFFOL-FXQIFTODSA-N (2S)-1-[(2S)-2-[[(2S)-2-azaniumylpropanoyl]amino]propanoyl]pyrrolidine-2-carboxylate Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 6
- RDIKFPRVLJLMER-BQBZGAKWSA-N Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)N RDIKFPRVLJLMER-BQBZGAKWSA-N 0.000 description 6
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 6
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 6
- 102100015262 MYC Human genes 0.000 description 6
- HGCNKOLVKRAVHD-RYUDHWBXSA-N Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-RYUDHWBXSA-N 0.000 description 6
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 6
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 6
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 6
- 229940014598 TAC Drugs 0.000 description 6
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 6
- 238000004166 bioassay Methods 0.000 description 6
- 238000004132 cross linking Methods 0.000 description 6
- 238000001415 gene therapy Methods 0.000 description 6
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 6
- 238000003929 heteronuclear multiple quantum coherence Methods 0.000 description 6
- 229910052739 hydrogen Inorganic materials 0.000 description 6
- 239000001257 hydrogen Substances 0.000 description 6
- 108010068488 methionylphenylalanine Proteins 0.000 description 6
- 238000002560 therapeutic procedure Methods 0.000 description 6
- JSLGXODUIAFWCF-UHFFFAOYSA-N Arginyl-Asparagine Chemical compound NC(N)=NCCCC(N)C(=O)NC(CC(N)=O)C(O)=O JSLGXODUIAFWCF-UHFFFAOYSA-N 0.000 description 5
- OSASDIVHOSJVII-UHFFFAOYSA-N Arginyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CCCNC(N)=N OSASDIVHOSJVII-UHFFFAOYSA-N 0.000 description 5
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 5
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 5
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 5
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 230000003247 decreasing Effects 0.000 description 5
- 238000006471 dimerization reaction Methods 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000006011 modification reaction Methods 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 238000000159 protein binding assay Methods 0.000 description 5
- 239000001488 sodium phosphate Substances 0.000 description 5
- 229910000162 sodium phosphate Inorganic materials 0.000 description 5
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 5
- LQJAALCCPOTJGB-YUMQZZPRSA-N (2S)-1-[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-carboxybutanoyl]amino]-4-carboxybutanoyl]amino]pentanedioic acid Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- QXRNAOYBCYVZCD-BQBZGAKWSA-N (2S)-6-amino-2-[[(2S)-2-aminopropanoyl]amino]hexanoic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN QXRNAOYBCYVZCD-BQBZGAKWSA-N 0.000 description 4
- GJSURZIOUXUGAL-UHFFFAOYSA-N 2-((2,6-Dichlorophenyl)imino)imidazolidine Chemical compound ClC1=CC=CC(Cl)=C1NC1=NCCN1 GJSURZIOUXUGAL-UHFFFAOYSA-N 0.000 description 4
- XUUXCWCKKCZEAW-YFKPBYRVSA-N 2-[[(2S)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 4
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 4
- JQDFGZKKXBEANU-UHFFFAOYSA-N Alanyl-Cysteine Chemical compound CC(N)C(=O)NC(CS)C(O)=O JQDFGZKKXBEANU-UHFFFAOYSA-N 0.000 description 4
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 4
- BNODVYXZAAXSHW-UHFFFAOYSA-N Arginyl-Histidine Chemical compound NC(=N)NCCCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 BNODVYXZAAXSHW-UHFFFAOYSA-N 0.000 description 4
- XNSKSTRGQIPTSE-UHFFFAOYSA-N Arginyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CCCNC(N)=N XNSKSTRGQIPTSE-UHFFFAOYSA-N 0.000 description 4
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 description 4
- PSZNHSNIGMJYOZ-WDSKDSINSA-N Asp-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PSZNHSNIGMJYOZ-WDSKDSINSA-N 0.000 description 4
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 4
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 4
- NPDLYUOYAGBHFB-UHFFFAOYSA-N Asparaginyl-Arginine Chemical compound NC(=O)CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N NPDLYUOYAGBHFB-UHFFFAOYSA-N 0.000 description 4
- HXWUJJADFMXNKA-UHFFFAOYSA-N Asparaginyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC(N)=O HXWUJJADFMXNKA-UHFFFAOYSA-N 0.000 description 4
- QJMCHPGWFZZRID-UHFFFAOYSA-N Asparaginyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC(N)=O QJMCHPGWFZZRID-UHFFFAOYSA-N 0.000 description 4
- 102000031025 DNA-Binding Proteins Human genes 0.000 description 4
- 108091000102 DNA-Binding Proteins Proteins 0.000 description 4
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 4
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 4
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 4
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 4
- HTOOKGDPMXSJSY-STQMWFEESA-N His-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 HTOOKGDPMXSJSY-STQMWFEESA-N 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- DVCSNHXRZUVYAM-BQBZGAKWSA-N Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 4
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 4
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 4
- NTISAKGPIGTIJJ-IUCAKERBSA-N Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C NTISAKGPIGTIJJ-IUCAKERBSA-N 0.000 description 4
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 4
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 4
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 4
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 4
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 4
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 4
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 4
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 4
- NDYNTQWSJLPEMK-UHFFFAOYSA-N Methionyl-Cysteine Chemical compound CSCCC(N)C(=O)NC(CS)C(O)=O NDYNTQWSJLPEMK-UHFFFAOYSA-N 0.000 description 4
- OZILORBBPKKGRI-RYUDHWBXSA-N Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 OZILORBBPKKGRI-RYUDHWBXSA-N 0.000 description 4
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N Phenylalanyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 4
- BEPSGCXDIVACBU-UHFFFAOYSA-N Prolyl-Histidine Chemical compound C1CCNC1C(=O)NC(C(=O)O)CC1=CN=CN1 BEPSGCXDIVACBU-UHFFFAOYSA-N 0.000 description 4
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 4
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 4
- RZEQTVHJZCIUBT-UHFFFAOYSA-N Serinyl-Arginine Chemical compound OCC(N)C(=O)NC(C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-UHFFFAOYSA-N 0.000 description 4
- FFOKMZOAVHEWET-UHFFFAOYSA-N Serinyl-Cysteine Chemical compound OCC(N)C(=O)NC(CS)C(O)=O FFOKMZOAVHEWET-UHFFFAOYSA-N 0.000 description 4
- SBMNPABNWKXNBJ-UHFFFAOYSA-N Serinyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CO SBMNPABNWKXNBJ-UHFFFAOYSA-N 0.000 description 4
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 4
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 4
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 4
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 4
- IMMPMHKLUUZKAZ-WMZOPIPTSA-N Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 IMMPMHKLUUZKAZ-WMZOPIPTSA-N 0.000 description 4
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 4
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 4
- 108010064997 VPY tripeptide Proteins 0.000 description 4
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 4
- WPSXZFTVLIAPCN-UHFFFAOYSA-N Valyl-Cysteine Chemical compound CC(C)C(N)C(=O)NC(CS)C(O)=O WPSXZFTVLIAPCN-UHFFFAOYSA-N 0.000 description 4
- GVRKWABULJAONN-UHFFFAOYSA-N Valyl-Threonine Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(O)=O GVRKWABULJAONN-UHFFFAOYSA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000004663 cell proliferation Effects 0.000 description 4
- 238000007796 conventional method Methods 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 230000002068 genetic Effects 0.000 description 4
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000001965 increased Effects 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 239000003446 ligand Substances 0.000 description 4
- 239000006166 lysate Substances 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010044655 lysylproline Proteins 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 238000002922 simulated annealing Methods 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 238000002424 x-ray crystallography Methods 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N (3S)-3-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-(carboxymethylamino)-4-oxobutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- SOYWRINXUSUWEQ-DLOVCJGASA-N (4S)-4-amino-5-[[(2S)-1-[[(1S)-1-carboxy-2-methylpropyl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-oxopentanoic acid Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 3
- TUTIHHSZKFBMHM-UHFFFAOYSA-N 4-amino-5-[(3-amino-1-carboxy-3-oxopropyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(CC(N)=O)C(O)=O TUTIHHSZKFBMHM-UHFFFAOYSA-N 0.000 description 3
- XZWXFWBHYRFLEF-FSPLSTOPSA-N Ala-His Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XZWXFWBHYRFLEF-FSPLSTOPSA-N 0.000 description 3
- BFNBIHQBYMNNAN-UHFFFAOYSA-N Ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 3
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 3
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 3
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 3
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 3
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 3
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 3
- CZVQSYNVUHAILZ-UWVGGRQHSA-N His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 CZVQSYNVUHAILZ-UWVGGRQHSA-N 0.000 description 3
- 108090000745 Immune Sera Proteins 0.000 description 3
- 125000002068 L-phenylalanino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 3
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 3
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 3
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 3
- 125000000534 N(2)-L-lysino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C([H])([H])C(C([H])([H])N([H])[H])([H])[H] 0.000 description 3
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 3
- LZLREEUGSYITMX-UHFFFAOYSA-N Serinyl-Tryptophan Chemical compound C1=CC=C2C(CC(NC(=O)C(CO)N)C(O)=O)=CNC2=C1 LZLREEUGSYITMX-UHFFFAOYSA-N 0.000 description 3
- 108010070144 Single-Chain Antibodies Proteins 0.000 description 3
- 102000005632 Single-Chain Antibodies Human genes 0.000 description 3
- CUTPSEKWUPZFLV-UHFFFAOYSA-N Threoninyl-Cysteine Chemical compound CC(O)C(N)C(=O)NC(CS)C(O)=O CUTPSEKWUPZFLV-UHFFFAOYSA-N 0.000 description 3
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 3
- ZSXJENBJGRHKIG-UHFFFAOYSA-N Tyrosyl-Serine Chemical compound OCC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UHFFFAOYSA-N 0.000 description 3
- IBIDRSSEHFLGSD-YUMQZZPRSA-N Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-YUMQZZPRSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 150000001408 amides Chemical class 0.000 description 3
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 3
- 235000011130 ammonium sulphate Nutrition 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- -1 and Leu344 for He Chemical compound 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 108091006028 chimera Proteins 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 229940079593 drugs Drugs 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 238000000990 heteronuclear single quantum coherence spectrum Methods 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 239000002502 liposome Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- RZVAJINKPMORJF-UHFFFAOYSA-N p-acetaminophenol Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 3
- 230000022983 regulation of cell cycle Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- IOUPEELXVYPCPG-UHFFFAOYSA-N val-gly Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 3
- 206010051113 Arterial restenosis Diseases 0.000 description 2
- 206010003210 Arteriosclerosis Diseases 0.000 description 2
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 2
- 206010003816 Autoimmune disease Diseases 0.000 description 2
- 206010063659 Aversion Diseases 0.000 description 2
- 229920000062 Coding strand Polymers 0.000 description 2
- 229920002676 Complementary DNA Polymers 0.000 description 2
- 108060001039 GCN4 Proteins 0.000 description 2
- 102000000588 Interleukin-2 Human genes 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 2
- 206010025135 Lupus erythematosus Diseases 0.000 description 2
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 2
- 101700075357 MYC Proteins 0.000 description 2
- 108020004999 Messenger RNA Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 2
- 102000002067 Protein Subunits Human genes 0.000 description 2
- 108010001267 Protein Subunits Proteins 0.000 description 2
- 206010039073 Rheumatoid arthritis Diseases 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 2
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 2
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 description 2
- 230000001594 aberrant Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000038129 antigens Human genes 0.000 description 2
- 108091007172 antigens Proteins 0.000 description 2
- 229960000070 antineoplastic Monoclonal antibodies Drugs 0.000 description 2
- 201000001320 atherosclerosis Diseases 0.000 description 2
- 125000004429 atoms Chemical group 0.000 description 2
- 230000001580 bacterial Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000002708 enhancing Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000000833 heterodimer Substances 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 239000002609 media Substances 0.000 description 2
- 229920002106 messenger RNA Polymers 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 108010045030 monoclonal antibodies Proteins 0.000 description 2
- 102000005614 monoclonal antibodies Human genes 0.000 description 2
- 229960000060 monoclonal antibodies Drugs 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000007911 parenteral administration Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 201000004681 psoriasis Diseases 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 210000004881 tumor cells Anatomy 0.000 description 2
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N α-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- BVZABQIRMYTKCF-JSGCOSHPSA-N (2S)-2-[[(2S)-2-azaniumyl-3-(1H-indol-3-yl)propanoyl]amino]-4-methylsulfanylbutanoate Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 BVZABQIRMYTKCF-JSGCOSHPSA-N 0.000 description 1
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N 2-[[(2S)-2-[[(2S)-2-azaniumyl-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]acetate Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N 2-mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
- MGSKVZWGBWPBTF-UHFFFAOYSA-N AEBSF Chemical compound NCCC1=CC=C(S(F)(=O)=O)C=C1 MGSKVZWGBWPBTF-UHFFFAOYSA-N 0.000 description 1
- 241000432074 Adeno-associated virus Species 0.000 description 1
- 206010059512 Apoptosis Diseases 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010071919 Bispecific Antibodies Proteins 0.000 description 1
- 102000003858 Chymases Human genes 0.000 description 1
- 108090000227 Chymases Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- ZSRSLWKGWFFVCM-WDSKDSINSA-N Cys-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O ZSRSLWKGWFFVCM-WDSKDSINSA-N 0.000 description 1
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 229940088598 Enzyme Drugs 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 230000035693 Fab Effects 0.000 description 1
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 1
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 1
- 238000001535 HNCA Methods 0.000 description 1
- 238000002130 HNCOCA Methods 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- 102000015434 Humanized Monoclonal Antibodies Human genes 0.000 description 1
- 108010064750 Humanized Monoclonal Antibodies Proteins 0.000 description 1
- 210000004408 Hybridomas Anatomy 0.000 description 1
- 229960000310 ISOLEUCINE Drugs 0.000 description 1
- 102000010789 Interleukin-2 Receptors Human genes 0.000 description 1
- 108010038453 Interleukin-2 Receptors Proteins 0.000 description 1
- 229920002459 Intron Polymers 0.000 description 1
- 108020004391 Introns Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- 206010024324 Leukaemias Diseases 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 210000004698 Lymphocytes Anatomy 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- KPNNXHVGOKRBEF-UHFFFAOYSA-N N-hydroxy-7-naphthalen-2-ylsulfanylheptanamide Chemical compound C1=CC=CC2=CC(SCCCCCCC(=O)NO)=CC=C21 KPNNXHVGOKRBEF-UHFFFAOYSA-N 0.000 description 1
- 229920002957 Naked DNA Polymers 0.000 description 1
- 229920002248 Nuclear DNA Polymers 0.000 description 1
- 229920000272 Oligonucleotide Polymers 0.000 description 1
- 206010025310 Other lymphomas Diseases 0.000 description 1
- FAXGPCHRFPCXOO-LXTPJMTPSA-N Pepstatin Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 1
- 229950000964 Pepstatin Drugs 0.000 description 1
- 102000035443 Peptidases Human genes 0.000 description 1
- 108091005771 Peptidases Proteins 0.000 description 1
- 229960005190 Phenylalanine Drugs 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- GVUVRRPYYDHHGK-UHFFFAOYSA-N Prolyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C1CCCN1 GVUVRRPYYDHHGK-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000009092 Proto-Oncogene Proteins c-myc Human genes 0.000 description 1
- 108010087705 Proto-Oncogene Proteins c-myc Proteins 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 1
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 1
- 108091008153 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 206010052779 Transplant rejections Diseases 0.000 description 1
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Tris Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- DXYQIGZZWYBXSD-UHFFFAOYSA-N Tryptophyl-Proline Chemical compound C=1NC2=CC=CC=C2C=1CC(N)C(=O)N1CCCC1C(O)=O DXYQIGZZWYBXSD-UHFFFAOYSA-N 0.000 description 1
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000003466 anti-cipated Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 238000005460 biophysical method Methods 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 230000000711 cancerogenic Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 231100000315 carcinogenic Toxicity 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000001413 cellular Effects 0.000 description 1
- 230000002032 cellular defenses Effects 0.000 description 1
- 230000000973 chemotherapeutic Effects 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 210000000038 chest Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000000295 complement Effects 0.000 description 1
- 239000012468 concentrated sample Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated Effects 0.000 description 1
- 238000005100 correlation spectroscopy Methods 0.000 description 1
- 230000001808 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 230000001809 detectable Effects 0.000 description 1
- 229910052805 deuterium Inorganic materials 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 201000009910 diseases by infectious agent Diseases 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 239000003262 industrial enzyme Substances 0.000 description 1
- 230000002458 infectious Effects 0.000 description 1
- 230000002452 interceptive Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl β-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 102000004882 lipase Human genes 0.000 description 1
- 108090001060 lipase Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 210000004962 mammalian cells Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000006151 minimal media Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 101700034118 myca Proteins 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 229920002113 octoxynol Polymers 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 108010091212 pepstatin Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000002798 polar solvent Substances 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N pro glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 230000002062 proliferating Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000001711 protein immunostaining Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000002285 radioactive Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000452 restraining Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000707 stereoselective Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 210000001519 tissues Anatomy 0.000 description 1
- 238000001551 total correlation spectroscopy Methods 0.000 description 1
- 108090000464 transcription factors Proteins 0.000 description 1
- 102000003995 transcription factors Human genes 0.000 description 1
- 230000001131 transforming Effects 0.000 description 1
- 102000027575 transmembrane receptors Human genes 0.000 description 1
- 108091007901 transmembrane receptors Proteins 0.000 description 1
- 231100000588 tumorigenic Toxicity 0.000 description 1
- 230000000381 tumorigenic Effects 0.000 description 1
- 238000002096 two-dimensional nuclear Overhauser enhancement spectroscopy Methods 0.000 description 1
- 238000004104 two-dimensional total correlation spectroscopy Methods 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 231100000402 unacceptable toxicity Toxicity 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
Definitions
- This relates generally to the field of protein structure and protein design.
- a second important determinant of structure is amino acid helical propensity, which reflects the entropic cost of incorporating a residue into an ordered secondary structure element [P. C. Lyu et al, Science. 250: 669 (1990); K. T. O'Neil and W. F. DeGrado, Science. 250: 646 (1990); S. Padmanabhan et al, Nature. 344: 268 (1990); T. P. Creamer and G. D. Rose, Proc. Natl. Acad. Sci. USA. 89: 5937 (1992)].
- these designed proteins may have novel functions, or a change in their functional properties relative to the native protein from which they were derived.
- these designed proteins retain the function of the native protein from which they are derived, but have some other advantage, such as enhanced stability, improved binding, lower molecular mass, or the like.
- the present invention provides a method for altering the three-dimensional structure of proteins.
- the method involves the steps of: identifying a native protein with three-dimensional structure to be altered; identifying the hydrophobic residues within this protein; distinguishing the hydrophobic residues on the basis of side chain size into large and small hydrophobic residues; providing mutants of the protein having substitutions in the hydrophobic residues distinguished above; and assaying the mutant proteins for a switch in three-dimensional structure.
- the method of the invention further involves the steps of generating mutants of the selected protein having at least one of the large hydrophobic residues within its hydrophobic core substituted by hydrophobic residues with small side chains and assaying the mutant proteins for ability to at least maintain the biological function of the native protein.
- the method of the invention involves substituting at least one of the small hydrophobic residues within the hydrophobic core of the protein with large hydrophobic residues.
- the present invention provides an altered protein comprising p53 oligomerization domain altered according to the above method, wherein the side chain size of the hydrophobic amino acid with the largest side chain in the p53 protein hydrophobic core (Phe341) has been decreased and the side chain of another hydrophobic amino acid in the hydrophobic core (Leu344) has been increased.
- the invention provides an altered p53 protein oligomerization domain designed as described above, which has further been modified to contain a Lys at amino acid position 340.
- the invention provides a p53 protein containing the altered p53 oligomerization domains as described above in place of the native p53 oligomerization domain.
- the invention provides p53 fusion proteins comprising an altered p53 oligomerization domain fused to a heterologous protein.
- the present invention provides nucleic acid sequences encoding the altered proteins according to the present invention.
- the invention provides vectors comprising nucleic acid sequences of the invention under the control of suitable regulatory sequences.
- the invention provides host cells transformed with the vectors of the invention. Also provided are pharmaceutical compositions containing the nucleic acid sequence of the invention and method of administering same. Other aspects and advantages of the present invention are described further in the following detailed description of the preferred embodiments thereof.
- Fig. 1 illustrates the three-dimensional structure of the p53wt oligomerization domain corresponding to residues 325-355 of SEQ ID NO: 2.
- Residue type is indicated by the single letter code: F, Phe; L, Leu.
- Fig. 2 illustrates the three-dimensional structure of the p53KIY oligomerization domain corresponding to residues 327-353 of human p53 [SEQ ID NO: 2]. Residue type is indicated by the single letter code: Y, Tyr; I, He.
- Fig. 3 A illustrates the packing of residues with large hydrophobic side chains, specifically, Phe341 in p53wt [SEQ ID NO: 2].
- the ⁇ -helix of one of the subunits is not shown for clarity, also only two of the four subunits are shown.
- Residue type is abbreviated: L, Leu; F, Phe; N, Asn.
- Fig. 3B illustrates the packing of Tyr344 in p53KIY [SEQ ID NO: 4].
- Residue type is abbreviated: L, Leu; Y, Tyr; N, Asn.
- This invention provides a general method for changing the three-dimensional structure of proteins.
- the invention further provides modified proteins generated by this method and nucleic acid sequences encoding them. These modified proteins and nucleic acid sequences are particularly useful in pharmaceutical compositions and therapeutic regimens, and also in biotechnology and other industrial applications.
- the altered proteins of the invention have an altered three-dimensional structure and are further characterized by maintaining substantially the same level of a desired biological function of the native protein.
- the altered proteins of the invention advantageously can be designed or selected such that they are also characterized by improved functional properties relative to the native (wild-type) domain, e.g., improved binding ability, improved stability, or the like.
- the present invention provides a method for altering the three-dimensional structure of proteins without denaturing the proteins.
- the examples provided herein demonstrate alteration of the three-dimensional structure of the oligomerization domain of wild-type p53.
- other proteins may be readily altered.
- the method of the invention is performed upon a protein that has a hydrophobic core (i.e., upon essentially all proteins). Most preferably, however, the protein is useful for industrial, therapeutic or diagnostic purposes.
- non-linear proteins including, without limitation, industrial enzymes, such as proteases, lipases, chymases, etc., and therapeutically useful proteins, such as the members of the globin family, in which the side chain sizes of specific hydrophobic residues are highly conserved [D. Bashford et al, J. Mol. Biol.. 196: 199 (1987)].
- the three-dimensional (or crystalline) structure of the protein to be altered is known.
- the three-dimensional structure may be determined using known techniques, e.g., NMR spectroscopy, x-ray crystallography and the like.
- hydrophobic residues within this protein are identified and distinguished into large and small hydrophobic residues on the basis of side chain size.
- amino acid residues Phe, Tyr and Trp are considered large and amino acid residues Ala, Val and He are considered small.
- Leu is intermediate in size and may be considered either large or small.
- the total surface area not only is the total surface area, but also the surface area buried upon folding with is relevant, as the latter determines the strength of the hydrophobic effect [J.R. Livingstone et al, Biochemistry. 30:4237-4244 (1991)].
- the total surface areas of residues [G.D. Rose et al, Science. 229:834-838 (1985)] and the surface areas buried upon folding are provided in the table below. All values are in square Angstroms
- mutant proteins are generated by increasing the size of one or more of the small residues and/or decreasing the size of one or more of the large residues. Since Leu cannot be unambiguously classified as large or small, mutants are generated which replace Leu with both larger and smaller amino acids.
- it is preferable to decrease the size of the largest residue in the hydrophobic core e.g., Phe is substituted with He, Val or Ala
- increase the size of one of the smaller residues in the hydrophobic core e.g., Val is substituted with Phe or Trp. All substitutions are made in such a manner as to preserve the hydrophobic character of the native residue.
- one or two substitutions are made for each secondary structure element whose packing in the three-dimensional structure of the protein is to be altered.
- two substitutions are performed, one decreases the side chain size of a large hydrophobic amino acid and the other increases the side chain size of a small hydrophobic amino acid.
- the mutant proteins may be generated using conventional techniques.
- the peptides may be synthesized using a commercially available automatic synthesizer according to standard procedures.
- other standard techniques may be utilized. See, e.g., Merrifield, J. Amer. Chem. Soc. 85:2149-2154 (1963).
- the mutant proteins of the invention are generated recombinantly, making use of a variety of well-known techniques (such as site- directed mutagenesis [see, Gillman & Smith, Gene. 8:81-97 (1979) and S. Roberts et al, Nature. 328:731-734 (1987)]) and, desirably, the nucleic acid sequences of the invention. See, e.g., Sambrook et al., Molecular Cloning. A Laboratory Manual.. 2d Edit., Cold Spring Harbor Laboratory, New York (1989).
- mutant proteins generated by the method of the invention may be assayed for biological function as a preliminary screening step. In this manner, mutant proteins having the desired biological function (e.g., binding ability) may be selected.
- the assays for examining the function of the protein will depend on the particular function that needs to be assayed. Such assays are well known to those of skilled in the art and are not a limitation on this invention. For example, if the protein altered according to the invention is a DNA binding protein, then one could use a DNA binding assay to examine the function of the mutant proteins. If engineering involves a protein that forms oligomers, e.g., the oligomerization domain of p53, then one could use a glutaraldehyde crosslinking assay [W.
- the mutant proteins generated according to the method of the invention may then be screened for a change in the three-dimensional structure.
- Biophysical methods to probe protein structure include NMR spectroscopy, X-ray crystallography, [G. M. Clore et al, Science. 265: 386 (1994); W. Lee et al, Nature Structural Biol.. ⁇ : 877 (1994); G. M. Clore et al, Nature Structural Biol.. 2: 321 (1995); P. D. Jeffrey et al, Science, 267: 1498 (1995)], among other techniques.
- the applications of a DNA binding assay, a glutaraldehyde crosslinking assay and NMR spectroscopy are illustrated in the examples of engineered p53 oligomerization domains appropriate for determining three-dimensional conformation.
- the p53 oligomerization domain has been altered according to the method of the invention.
- the method of the invention may involve generating mutant of fragments of a useful protein which is responsible for biological activity. As described above, similar mutants may be generated using the hydrophobic core of other selected proteins, or full-length proteins, as desired.
- the inventor has altered the oligomerization domain of p53 to provide a p53 protein with an altered three-dimensional structure and oligomerization stoichiometry relative to the native (wild-type) domain.
- All references to p53 residue numbers herein refer to the numbering scheme provided by Zakut-Houri et al, EMBO J, 4: 1251-1255 (1985) [GenBank Code Hsp53] for human p53.
- the nucleotide and amino acid sequences of human p53 are reproduced as SEQ ID NOS: 1 and 2, respectively.
- SEQ ID NOS: 1 and 2 respectively.
- the p53 tumor suppressor protein is a sequence-specific transcription factor with fundamental significance to the pathogenesis and therapy of human cancer [C. C. Harris, Science. 262: 1980 (1993); L. J. Ko and C. Prives, Genes Dev.. 10: 1054 (1996)].
- the tumor suppressor activity of p53 requires homo-oligomerization [M. J. F. Waterman et al, Cancer Res., 56: 158 (1996)], which is mediated by a thirty residue domain at the C-terminus of the protein [H. Sakamoto et al, Proc. Natl. Acad. Sci. USA. 91 : 8974 (1994); P. Wang et al, Mol. Cell. Biol.. 14: 5182 (1994); J. L.
- the native thirty residue p53 oligomerization domain has a ⁇ -strand, a tight turn and an ⁇ -helix in each subunit. [Clore et al, cited above (1994); Lee et al, cited above; Clore et al, cited above, (1995); Jeffrey et al, cited above]. This oligomerization domain folds independently with antiparallel packing of its ⁇ -helices.
- the altered p53 generated according to the invention contains substitutions of residues Phe341 and Leu344 [SEQ ID NO: 2] in the ⁇ -helix by other hydrophobic amino acids, that decreased the side chain size at position 341 and increased the side chain size at position 344 [corresponding to SEQ ID NO: 2], resulting in an altered p53 domain that assembles as a dimer instead of a tetramer.
- the three-dimensional structure of a mutant dimeric domain determined in solution by NMR spectroscopy differs substantially from the wild-type structure, since the ⁇ -helices are packed parallel, rather than antiparallel, and are rotated significantly relative to each other and to the ⁇ -strands.
- the engineered p53 oligomerization domain was found to drive the sequence-specific DNA binding function of the modified p53 protein.
- the present invention provides altered p53 oligomerization domains that assemble as dimers, rather than tetramers, and which have a different three-dimensional structure relative to wild-type p53.
- These altered p53 oligomerization domains of the invention desirably contain the following residues, which differ from the residues in native human p53 [SEQ ID NO: 2].
- the altered p53 oligomerization domains described above may be further modified to contain Lys at position 340 [SEQ ID NO: 4].
- the inventor has found that the latter modification increases the solubility of the mutant p53 oligomerization domains with altered three-dimensional structure. For example, at a 1-2 mM concentration a mutant p53 domain with He at position 341 and Tyr at position 344 [SEQ ID NO: 3] precipitates within one hour when heated to 40°C. In contrast, a mutant p53 domain [SEQ ID NO: 4] with Lys at position 340, He at position 341 and Tyr at position 344 remains soluble under the same conditions. Based on the functional activities of other dimeric p53 proteins
- the altered p53 oligomerization domain has a different three-dimensional structure than the wild-type p53 domain, the two types of domains will not hetero-oligomerize. Accordingly, a p53 tumor suppressor protein will not be sequestered into inactive hetero-oligomers with tumor-derived p53 mutant proteins.
- a p53 protein of the invention can be delivered by gene therapy vectors and not be suppressed by the mutant p53 protein present in tumor cells.
- the altered p53 oligomerization domains of the invention may be engineered on an otherwise unmodified p53wt protein.
- the altered p53 oligomerization domain may also be fused to a selected heterologous protein.
- the altered p53 oligomerization domains may be engineered on p53 proteins bearing additional modifications.
- One suitable modification is substitution of residue threonine 284 [SEQ ID NO: 2] with Arginine. This substitution enhances the tumor suppressor function of wild-type p53 5- to 7-fold [Wieczorek et al, Nature Medicine. 2: 1143 (1996)].
- Suitable heterologous proteins include those which in the past have been fused to a leucine zipper.
- Leucine zippers have the disadvantage that they may interact with host leucine-zipper bearing proteins, which interaction may compromise the biological activity of the chimeric protein bearing the leucine zipper.
- the altered p53 oligomerization domains described in this application do not exist in nature, and do not form oligomers with any host proteins.
- both leucine zippers and the altered p53 oligomerization domain of the invention have similar topologies, i.e., both types of domains have parallel ⁇ -helices, they may be used in many similar applications.
- heterologous proteins include single chain antibody variable chains (scFv antibodies).
- the prior art has described scFv antibodies fused to Jun and Fos leucine zippers to produce dimeric antibodies that have higher affinity for their ligands, because they are bivalent [Kruif and Logtenberg, J. Biol. Chem., 271 :7630 (1996)].
- scFv antibodies can be fused to the altered p53 oligomerization domain of the invention by using the sequences encoding the altered p53 oligomerization domain in place of the leucine zipper sequences of the prior art. This will lead to homodimeric (hence monospecific), bivalent (high affinity) antibodies.
- a suitable heterologous protein includes a soluble interleukin-2 (IL-2) receptor.
- IL-2 soluble interleukin-2
- Wu et al, J. Biol. Chem.. 270: 16039 (1995) has described a soluble IL-2 receptor complex formed by attaching leucine zippers to the C-terminus of the extracellular domain of the receptor.
- the leucine zipper domain of the prior art is replaced by the altered p53 oligomerization domain of the invention.
- Such soluble domains are useful for screening ligands (drugs) that bind to the native receptors.
- Such soluble domains are also useful therapeutically as decoys competing for ligand binding with the endogenous receptors of patients.
- the extracellular domains of the T-cell receptor can be isolated in a soluble form, then fused to the modified p53 domains, using a modified version of the technique described in Chang et al., Proc. Natl. Acad. Sci. USA. 91: 11408 (1994).
- heterodimers can be isolated from homodimers by conventional protein chromatography or other suitable techniques.
- the altered p53 oligomerization domains of the invention may be fused to a transmembrane receptor. Many such receptors become physiologically activated by ligand-induced dimerization. Fusion of a dimerization domain to such receptors can therefore constitutively activate them.
- a transmembrane receptor such as the Trp-Met fusion receptor, in which the Trp protein provides a leucine zipper which induces dimerization and activation of the Met receptor [Rodriques and Park, Mol. Cell Biol. 13:6711 (1993)].
- the altered p53 oligomerization domains of the invention may be used to activate a receptor of choice. Depending upon the type of receptor to which the modified p53 oligomerization domain of the invention is fused, the outcome could be cell proliferation or cell death.
- the altered p53 oligomerization domain of the invention can also be used to induce dimerization of DNA binding proteins.
- Many DNA binding proteins for example c-Myc, bind DNA as dimers.
- c-Myc will not homodimerize, but will bind DNA as a hetero-dimer with a protein called Max.
- c-Myc will homo- dimerize if its native leucine zipper is replaced by the leucine zipper of GCN4, since the latter zipper has a high tendency to homo-oligomerize.
- a c-Myc fusion bearing a GCN4 leucine zipper binds DNA with a very high affinity [Halazonetis and Kandil, Science. 255:464 (1992)].
- a c-Myc protein whose native C-terminus is fused to the modified p53 oligomerization domain of this invention, would bind to DNA with high affinity and could compete for DNA binding of the native Myc/Max heterodimer without interfering with native proteins which contain leucine zippers. Furthermore, if the chimeric Myc protein lacked the N-terminal domain of p53, which is required for carcinogenic transformation of cells overexpressing c-Myc, then it can be used to revert the tumorigenic phenotype of cells overexpressing c-Myc, such as many leukemia and lymphoma cells.
- EWS-ATF-1 proteins that bind DNA and are implicated in cancer development, such as EWS-ATF-1 [Fujimura et al., Oncogene. 12: 159 (1996)] and the E2A-HLF [Yoshihara et al., Mol. Cell. Biol.. 15:3247 (1995)], can be similarly modified.
- altered p53 produced according to the method of the invention is used by way of example only.
- Other proteins e.g., enzymes, antibodies and members of the globin family
- the present invention further provides nucleic acid sequences encoding the altered proteins of this invention.
- the nucleic acid sequences of the invention include the complementary DNA sequence representing the non-coding strand, the messenger RNA sequence, the corresponding cDNA sequence and the RNA sequence complementary to the messenger RNA sequence.
- Variants of these nucleic acids of the invention include variations due to the degeneracy of the genetic code and are encompassed by this invention. Such variants may be readily identified and/or constructed by one of skill in the art. In certain cases specific codon usage may be employed to optimize expression.
- the above nucleotide sequences can be included within larger DNA or RNA fragments, or may be interrupted by introns.
- the nucleic acids encoding the proteins of the invention are present in the context of vectors suitable for amplification in prokaryotic or eukaryotic cells.
- vectors suitable for amplification in prokaryotic or eukaryotic cells Many such vectors are known and many of these are commercially available.
- plasmids with bacterial or yeast replication origins allow amplification in bacteria or yeast, respectively.
- Such vectors allow the production of large quantities of nucleic acids encoding the proteins of the invention, which nucleic acids can be used for gene therapy or for expression of the proteins of the invention, e.g., p53.
- the nucleic acids encoding the proteins of the invention are present in the context of vectors suitable for expression in cell-free extracts or lysates or in prokaryotic or eukaryotic cells.
- vectors are known [Ausubel et al, Current Protocols in Molecular Biology. Greene Publishing Associates and John Wiley & Sons, NY (1997)] and many of these are commercially available.
- the vector pGEM4 Promega, Madison, WI
- the vector pSV2 [ATCC] is suitable for expression in mammalian cells.
- Such vectors allow the production of the proteins of the invention in vitro for analysis of their functional properties or for delivery to patients.
- the nucleic acid sequences of the invention may be inserted into a vector capable of targeting and infecting a desired cell, either in vivo or ex vivo for gene therapy, and causing the encoded modified protein of this invention to be expressed by that cell.
- viral vectors are useful for this purpose, e.g., adenoviruses, retroviruses and adeno-associated viruses (AAV) [Schreiber et al. , Biotechniques. 14: 818-823 (1993); Davidson et al, Nature Genetics. 3: 219-223 (1993); Roessler et al, J. Clin. Invest.. 92: 1085-1092 (1993); Smythe et al, Ann. Thorac.
- AAV adeno-associated viruses
- these viral vectors containing nucleic acid sequences encoding a protein of the invention are prepared by one of skill in the art with resort to conventional techniques (see references mentioned above).
- a recombinant viral vector, e.g. an adenovirus, of the present invention comprises DNA of at least that portion of the viral genome which is capable of infecting the target cells operatively linked to the nucleic acid sequences of the invention.
- infection is generally meant the process by which a virus transfers genetic material to its host or target cell.
- the virus used in the construction of a vector of the invention is rendered replication-defective to remove the effects of viral replication on the target cells.
- the replication-defective viral genome can be packaged by a helper virus in association with conventional techniques.
- the vector(s) containing the nucleic acids encoding an altered protein of the invention is suspended in a pharmaceutically acceptable carrier, such as saline, and administered parenterally (or by other suitable means) in sufficient amounts to infect the desired cells and provide sufficient levels of modified protein to achieve the desired therapeutic or prophylactic effect, e.g., sufficient p53 activity to arrest abnormal cellular proliferation.
- a pharmaceutically acceptable carrier such as saline
- Other pharmaceutically acceptable carriers are well known to those of skill in the art.
- a suitable amount of the vector containing the chimeric nucleic acid sequences is between about 10 6 to 10 9 infectious particles per mL carrier.
- the delivery of the vector may be repeated as needed to sustain satisfactory levels of biological activity. For example, where modified p53 is administered, activity may be determined by monitoring clinical symptoms.
- this therapy may be combined with other therapies for the disease or condition being treated.
- therapy involving the administration of a vector capable of expressing an altered p53 protein of the invention is well suited for use in conjunction with conventional cancer therapies, including surgery, radiation and chemotherapy.
- Nucleic acid sequences driving expression of a protein of the invention may also be introduced by "carriers" other than viral vectors, such as liposomes, nucleic acid-coated gold beads or can simply be injected in situ [Fujiwara et al (1994b), cited above; Fynan et al, Proc. Natl. Acad. Sci. USA. 90: 11478-11482 (1993); Cohen, Science. 259: 1691-1692 (1993); Wolff et al, Biotechniques. U : 474-485 (1991)].
- carriers other than viral vectors, such as liposomes, nucleic acid-coated gold beads
- compositions The altered proteins and nucleic acid sequences of this invention may also be formulated into pharmaceutical compositions and administered using a therapeutic regimen compatible with the particular formulation.
- the composition may contain "naked” DNA, or a vector containing the nucleic acid sequences.
- naked DNA means substantially pure DNA which is not associated with a protein, lipid, carbohydrate or contained within a cell or an artificial delivery system such as a liposome.
- compositions within the scope of the present invention include compositions containing an altered protein of the invention (or a nucleic acid sequence encoding a modified protein) in an effective amount to have the desired physiological effect, e.g. to arrest the growth of cancer cells without causing unacceptable toxicity for the patient.
- Suitable formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble or water-dispersible form, e.g. saline.
- suspensions of the active compounds may be administered in suitable conventional lipophilic carriers or in liposomes.
- compositions may be supplemented by active pharmaceutical ingredients, where desired.
- Optional antibacterial, antiseptic, and antioxidant agents in the compositions can perform their ordinary functions.
- the pharmaceutical compositions of the invention may further contain any of a number of suitable viscosity enhancers, stabilizers, excipients and auxiliaries which facilitate processing of the active compounds into preparations that can be used pharmaceutically.
- these preparations, as well as those preparations discussed below, are designed for parenteral administration.
- compositions designed for oral or rectal administration are also considered to fall within the scope of the present invention.
- suitable amount or “effective amount” means an amount which is effective to treat the conditions referred to below.
- a preferred dose of a pharmaceutical composition containing a protein of this invention is generally effective above about 0.1 mg modified protein per kg of body weight (mg/kg), and preferably from about 1 mg/kg to about 100 mg/kg. These doses may be administered with a frequency necessary to achieve and maintain satisfactory activity levels. Although a preferred range has been described above, determination of the effective amounts for treatment of each type of tumor or other condition may be determined by those of skill in the art.
- Dosage units of such pharmaceutical compositions containing the proteins of this invention preferably contain about 1 mg to 5 g of the protein.
- nucleic acids encoding altered p53 proteins and the altered p53 proteins themselves can be introduced into human patients for therapeutic benefits in conditions characterized by insufficient wild-type p53 activity. Such conditions have been described in the art. See, e.g., PCT/US95/15353 (June 6, 1996).
- the pharmaceutical compositions of the invention including the gene therapy vectors, may be employed to induce the cellular defense to DNA damaging agents such as sunlight UV irradiation, as well as radiation and chemotherapeutics used for cancer treatment.
- the therapeutic indications include inducing apoptosis of specific cells, such as proliferating lymphocytes, the prevention of transplant rejection, and the treatment of autoimmune diseases, e.g., systemic lupus erythrematosis, rheumatoid arthritis and the like.
- compositions of this invention may also be employed to restore p53 function in tumor cells and to suppress cell proliferation in diseases other than cancers, which are characterized by aberrant cell proliferation.
- diseases include psoriasis, atherosclerosis and arterial restenosis.
- Pharmaceutical compositions containing other altered proteins of the invention (or nucleic acids encoding them) may also be readily prepared and used for a variety of indications which will be readily apparent to one of skill in the art.
- altered proteins of the invention are useful for generating antibodies, which may be used as diagnostic reagents, for example, to monitor the presence of modified protein or protein domain.
- Specific antisera may be generated using known techniques. See,
- antibodies of the invention may be produced by conventional methods, including the Kohler and Milstein hybridoma technique, recombinant techniques, such as described by Huse et al, Science, 246: 1275-1281 (1 88), or any other techniques known to the art.
- the invention further encompasses functional fragments of the antibodies of the invention, including, Fab, F v , and F(ab') 2 fragments, the binding site of the antibodies, and the complementarity determining regions (CDRs).
- the binding site and/or CDRs may be contained in a synthetic molecule which provides antibody framework regions.
- these functional fragments may be used in the production of recombinant antibodies, including bifunctional antibodies, chimeric antibodies, and humanized antibodies, which preferably retain the antigen binding specificity of the antibodies of the invention.
- Such recombinant antibodies may be constructed and produced according to known techniques [see, e.g., S. D. Gillies et al, J. Immunol. Meth..
- the altered proteins of the invention may be used therapeutically or as diagnostic reagents. These reagents may optionally be labeled using diagnostic labels, such as radioactive labels, colorimetric enzyme label systems and the like conventionally used in diagnostic or therapeutic methods. Alternatively, the N- or C- terminus of an altered protein of the invention may be tagged with a detectable label which can be recognized by a specific antisera.
- diagnostic labels such as radioactive labels, colorimetric enzyme label systems and the like conventionally used in diagnostic or therapeutic methods.
- diagnostic labels such as radioactive labels, colorimetric enzyme label systems and the like conventionally used in diagnostic or therapeutic methods.
- the N- or C- terminus of an altered protein of the invention may be tagged with a detectable label which can be recognized by a specific antisera.
- the reagents derived from p53 may be used in diagnosis of a variety of conditions associated with p53 and/or aberrant cell proliferation, including autoimmune diseases, e.g., systemic lupus erythrematosis, rheumatoid arthritis and the like, cancers, psoriasis, atherosclerosis and arterial restenosis.
- autoimmune diseases e.g., systemic lupus erythrematosis, rheumatoid arthritis and the like
- cancers psoriasis
- atherosclerosis e.g., atherosclerosis and arterial restenosis
- reagents produced from other altered proteins of the invention e.g., antibodies and the like, may similarly be utilized as diagnostic reagents.
- the selection of the appropriate assay format and label system is within the skill of the art and may readily be chosen without requiring additional explanation by resort to the wealth of art in the diagnostic area.
- Example 1 Method for altering the three-dimensional structure of wild-type p53 Wild-type p53 was identified as the native protein with three-dimensional structure to be altered, according to the method of the invention.
- wtp53 [SEQ ID NO: 2] was found to have a structure which consists of a ⁇ -strand, a tight turn and an ⁇ -helix.
- Four identical subunits assemble as a dimer of dimers (Fig. 1).
- the ⁇ -strands form an antiparallel ⁇ -sheet and the ⁇ -helices also pack antiparallel.
- Two primary dimers form a tetramer by packing their ⁇ -helices at an 81 ° angle.
- the residues of the hydrophobic core were distinguished into large and small hydrophobic residues.
- the center of the hydrophobic core of the domain is formed primarily by residues Phe341 and Leu344, Phe341 is positioned at the interface of the two p53 monomers that form the primary dimers, whereas Leu344 forms the interface between the primary dimers
- Mutants of p53 were then generated according to the method of the invention and assayed for a switch in conformation. Plasmids encoding mutant p53 proteins were generated by PCR-directed mutagenesis of pGEMhump53wtB, as described in Waterman et al, EMBO J.. 14:512-519 (1995) which is incorporated by reference herein. The names of the mutants indicate the hydrophobic residues at positions 341 and 344 [SEQ ID NO: 3], respectively, using the single letter amino acid code. For example, p53FL is wild-type p53 [SEQ ID NO: 2]. Additional mutants made include: p53A344, Ala344 [SEQ ID NO: 3]; p53IF, Ile341 and Phe344
- DNA binding of the mutants was assayed using 32 P-labeled oligonucleotide BC.S10 and in vitro translated p53 (Waterman et al, cited above)]. DNA binding is an indirect measure of the function of the p53 oligomerization domain. This assay was performed to screen the mutants whose function is similar to wtp53, as a preliminary to assaying for conformational switch. Table 1 shows the effect of amino acid substitutions targeting residues 341 and 344 of human p53 on the subunit stoichiometry of p53 as assayed by the electrophoretic migration of its complexes with DNA. The names of the mutants [SEQ ID NO: 3] indicate the hydrophobic residues at positions 341 and 344, respectively, using the single letter amino acid code.
- p53FL is wild-type p53 [SEQ ID NO: 2].
- Wild-type p53 (p53wt; SEQ ID NO: 2) and these mutant p53 oligomerization domains [SEQ ID NOS: 3 and 4] were expressed in E. c ⁇ li, purified to homogeneity and assayed for subunit stoichiometry by glutaraldehyde crosslinking using the techniques described below. 1. Expression and Purification
- the cells were pelleted, stirred on ice for 20 min. in glycerol, 0.7% v/v Triton-X and 0.4% v/v ⁇ -mercaptoethanol, and then for another 15 min. in lysis buffer (10 mM Tris [pH 8.0], 500 mM NaCl, 5 mM EDTA, 1 mM DTT, 0.6 mg/ml lysozyme) containing Pefabloc SC (Pentapharm, Basel, Switzerland) and pepstatin (Sigma, St. Louis, MO). 120 U/ml DNase I (Sigma) was added and stirring continued for another 30 min.
- lysis buffer 10 mM Tris [pH 8.0], 500 mM NaCl, 5 mM EDTA, 1 mM DTT, 0.6 mg/ml lysozyme
- Pefabloc SC Pentapharm, Basel, Switzerland
- pepstatin Sigma, St. Louis, MO
- p53 was purified in three steps. In a first step, the p53 was run on a 50 ml Phenyl-Sepharose column (Pharmacia, Piscataway, NJ) equilibrated with 1.5 M ammonium sulfate, 50 mM sodium phosphate [pH 7.0], 5 mM EDTA and eluted by decreasing salt concentration on a GradiFrac system (Pharmacia).
- a second step the eluate from the first step was run on an 8 ml Phenyl-Superose column (Pharmacia) equilibrated with 1.7 M ammonium sulfate, 50 mM sodium phosphate [pH 7.0], 5mM EDTA and eluted by decreasing salt concentration on a SMART system (Pharmacia).
- the eluate from the second step was run on a 1 ml Resource S column (Pharmacia) equilibrated with 50 mM sodium phosphate [pH 7.0], 50 mM NaCl, 0.01 mM EDTA and eluted by increasing NaCl concentration on the SMART system.
- 0.1% v/v glutaraldehyde Sigma, St. Louis, MO
- Table 2 illustrates subunit stoichiometry of p53wt [SEQ ID NO: 2], p53IY (Ile341 and Tyr344) [SEQ ID NO: 2] and p53KIY (He341 and Tyr344) [SEQ ID NO: 4] oligomerization domains as determined by glutaraldehyde (Gl.) crosslinking and SDS-gel electrophoresis.
- Multidimensional solution NMR spectroscopy was performed on uniformly 15 N- and 15 N, 13 C-labeled samples to determine the structural basis for the switch in oligomerization stoichiometry observed in the mutant proteins as follows.
- p53IY SEQ ID NO: 3
- p53KIY SEQ ID NO: 4
- p53wt SEQ ID NO: 2
- NOE restraints were obtained from 2D NOESY (unlabeled sample in D20), 3D 15 N-edited NOESY-HMQC ( 15 N-labeled sample) and 13 C-edited HSQC-NOESY ( 15 N/ 13 C-labeled sample) experiments [A. Majumdar and E. R. P. Zuiderweg, J. Magn. Reson.. 102B: 242 (1993)].
- Inter- and intrasubunit NOEs were differentiated with 12 C-filtered/ 13 C-edited and 13 C-filtered/ 13 C- edited HMQC-NOESY-HMQC experiments performed on an equilibrated 1 : 1 mixture of 12 C- and 13 C-labeled samples [W.
- Glu346, Ala347 and Lys351 [corresponding to SEQ ID NO: 2] differed by more than 0.4 ppm in the proton frequency and/or by more than 1.2 ppm in the nitrogen frequency between p53KIY [SEQ ID NO: 4] and p53wt [SEQ ID NO: 2] (Table 3).
- Such large chemical shift differences are suggestive of changes in three-dimensional structure [K. Wuthrich, cited above], especially since some residues, such as Gln331 and Arg333 are 10 and 7 A, respectively, from the nearest substituted residue in the established p53wt structure [G. M. Clore et al, Science. 265: 386 (1994); W. Lee et al, Nature Structural Biol..
- Table 3 illustrates shifts in the amide resonance frequencies of Gln331 and Arg333 by amino acid substitutions targeting Phe341 and Leu344 related to p53wt [SEQ ID NO: 2]. Proteins are labelled as in Tables 1 and 2.
- P53K340 [SEQ ID NO: 4] has Lys at position 340.
- P53KIF [SEQ ID NO: 4] has Lys340, Ile341 and Phe344.
- p53FF [SEQ ID NO: 3] and p53wt [SEQ ID NO: 2] are tetramers
- p53KIY [SEQ ID NO: 4] are dimers (Table 2).
- NOE intensities were classified as strong, medium and weak, corresponding to distance restraints of 1.8-3.2, 1.8-4.0 and 1.8-5.0 A, respectively.
- Dihedral angles were restrained to -140 ⁇ 60° or to 60 ⁇ 55° for 3J HNH ⁇ measurements of >8.5 or ⁇ 5 Hz, respectively.
- Hydrogen bond restraints were incorporated as two NOEs restraining O-NH to 1.7-2.3 A and N-O to 2.8-3.3 A. Pseudo atom restraints were used whenever stereospecific assignments could not be made.
- the average structure was calculated from thirty simulated annealing structures and was refined using restrained minimization and a repulsive term to stimulate the van der Waal's potential [Br ⁇ nger, cited above].
- Geometry was evaluated with PROCHECK and PROMOTIF [R.A. Laskowski et al, J. Appl. Cryst.. 26:283 (1993)].
- Intraresidue (122) 0.001 ⁇ 0.0003 0.000
- the p53KIY oligomerization domain [SEQ ID NO: 4] is a dimer with two-fold cyclic symmetry.
- Each subunit consists of three secondary structure elements: a ⁇ -strand forms an antiparallel ⁇ -sheet and the two ⁇ -helices pack parallel to each other (Fig. 2).
- Parallel packing of the ⁇ -helices is stabilized by hydrophobic interactions substantially involving Tyr344, which interacts with Ile341 of the same subunit and Tyr344 of the other (Fig. 2).
- the structural switch between p53KIY [SEQ ID NO: 4] and p53wt [SEQ ID NO: 2] can be evaluated by comparing their structures (Figs. 1 and 2). Such comparison reveals differences in the orientation of the secondary structure elements.
- the interhelical angle changes from 155° in the primary dimer of p53wt to 83° in p53KIY [SEQ ID NO: 4], similar to the 81 ° angle with which the ⁇ -helices pack across primary dimers in p53wt.
- p53wt [SEQ ID NO: 2] and p53KIY [SEQ ID NO: 4] differ by only three amino acid substitutions.
- the changes in the sizes of the side chains at positions 341 and 344 appear to be necessary and sufficient for the structural switch.
- p53KIY In p53KIY [SEQ ID NO: 3], the decrease in side chain size of residue 341 results in fewer interactions being required to shield its surface from solvent, while the increase in side chain size of residue 344 requires new interactions.
- the switch in three dimensional structure between p53wt [SEQ ID NO: 2] and p53KIY [SEQ ID NO: 4] can be explained in terms of the loss of contacts that bury the side chain of residue 341 and gain in contacts that bury the side chain of residue 344.
- the tip of the Phe341 ring lies in a hydrophobic pocket formed by the side chains of Leu344, Asn345 and Leu348, all from the other subunit (Fig. 3A).
- p53KIY [SEQ ID NO: 4] oligomerization domains is probably secondary to the altered packing of the ⁇ -helices.
- residues Leu344 form a hydrophobic patch for assembly of two primary dimers into a tetramer (Fig. 1).
- Tyr 344 is involved in parallel packing of the ⁇ -helices and does not allow two dimers to form a tetramer.
- MOLECULE TYPE DNA (genomic)
- FEATURE FEATURE
- CAC TCC AGC CAC CTG AAG TCC AAA AAG GGT CAG TCT ACC TCC CGC CAT 1275 His Ser Ser His Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His 365 370 375 380
Abstract
Methods for altering the three-dimensional structure of a selected protein without denaturing the protein are described. The method involves identifying the hydrophobic residues within the protein to be altered, distinguishing the hydrophobic residues in the protein into large and small residues. Mutants of the protein are then generated and assayed for a switch in three-dimensional structure. Also described are modified p53 oligomerization domains, produced according to the method of the invention, and fusion proteins containing these modified oligomerization domains. Also described are nucleic acid sequences encoding these proteins and compositions containing the proteins and nucleic acid sequences.
Description
METHODS FOR ALTERING THREE-DIMENSIONAL PROTEIN STRUCTURE AND COMPOSITIONS PRODUCED THEREBY
Field of the Invention
This relates generally to the field of protein structure and protein design.
Background of the Invention
The three-dimensional structure of a protein is determined by its amino acid sequence. However, previous attempts to predict and design protein structures have not been successful [D. Jones and J. Thornton, J. Computer-Aided Mol. Design. 7:439 (1993), J. W. Bryson et al, Science. 270: 935 (1995)]. Comparisons among mutant proteins suggest that not all residues contribute equally to three-dimensional structure [M. H. Hecht et al, Proc. Natl. Acad. Sci. USA. 80: 2676 (1983); B. W. Matthews, Biochemistry, 26: 6885 (1987); J. F. Reidhaar-Olson and R. T. Sauer, Science. 241 :53 (1988); D. D. Loeb et al, Nature. 340:397 (1989); D. W. Heinz et al, Proc. Natl. Acad. Sci. USA, 89: 3751 (1992); M. E. Milla et α/., Nature Structural Biol., 1: 518 (1994).]. Protein native structures are stabilized primarily by the aversion of hydrophobic side chains to polar solvent [W. Kauzmann, Adv. Protein Chem.. 14: 1 (1959); C. Chothia, Nature. 248: 338 (1974); F. M. Richards, Annu. Rev. Biophys. Bioeng.. 6: 151 (1977); K. A. Dill, Biochemistry. 24: 1501 (1985); K. A. Dill, Biochemistry. 29: 7133 (1990); J. R. Livingstone et al, Biochemistry. 30: 4237 (1991); K. A. Sharp et al, Biochemistry. 30: 9686 (1991); C. N. Pace, J. Mol. Biol.. 226: 29 (1992)]. This aversion, which is referred to as the hydrophobic effect, distributes the hydrophobic and polar residues in the protein interior and on the surface, respectively, such that the pattern of hydrophobic and polar residues in the linear sequence of a protein is a significant determinant of three-dimensional structure [Jones and Thornton, cited above; Bryson, cited above]. A second important determinant of structure is amino acid helical propensity, which reflects the entropic cost of incorporating a residue into an ordered secondary structure element [P. C. Lyu
et al, Science. 250: 669 (1990); K. T. O'Neil and W. F. DeGrado, Science. 250: 646 (1990); S. Padmanabhan et al, Nature. 344: 268 (1990); T. P. Creamer and G. D. Rose, Proc. Natl. Acad. Sci. USA. 89: 5937 (1992)]. Protein design efforts, sequence analysis of structurally related and unrelated proteins and theoretical models of proteins consisting of only hydrophobic and polar residues suggest that a number of protein conformations are compatible with a specific hydrophobic/polar and helical propensity profile [Jones and Thornton, cited above; Bryson, cited above; K. A. Dill et al, Protein Science. 4: 561 (1995)]. A unique structure is therefore attained by additional determinants, for example, in a few native proteins and in simple designed proteins by incorporating binding sites for metal ions or other prosthetic groups [J.
Miller et al, EMBO J.. 4: 1609 (1985); H. Paulsen et al, Eur. J. Biochem.. 215: 809 (1993); T. M. Handel et al, Science. 261 : 879 (1993); D. E. Robertson et al, Nature. 368: 425 (1994)]. In the majority of proteins, hydrogen bonds and electrostatic interactions have been proposed to stabilize unique native structures [Jones and Thornton, cited above; Bryson et al, cited above; D. F. Stickle et al, J. Mol. Biol. 226: 1143 (1992); Z. S. Hendsch and B. Tidor, Protein Science. 3: 211 (1994)]. Yet, polar and charged residues are not highly conserved in evolution [D. J. Barlow and J. M. Thornton, J. Mol. Biol. 168: 867 (1983)] and in at least one example all of the buried salt bridges can be eliminated with no effect on structure [C D. Waldburger et al. Nature Structural Biol.. 2: 122 (1995)]. Since the determinants that allow proteins to adopt unique well-packed structures are not understood, it has been virtually impossible to modify the three-dimensional structure of a protein such that it exhibits novel properties. Thus, even though it is well-appreciated that the amino acid sequence of a protein determines its three-dimensional structure, no amino acid substitutions have been described that change the three-dimensional structure of a protein.
What is needed are methods for altering protein structures to provide designed proteins. Desirably, these designed proteins may have novel functions, or a change in their functional properties relative to the native protein from which they were derived. Preferably, these designed proteins retain the function of the native protein from
which they are derived, but have some other advantage, such as enhanced stability, improved binding, lower molecular mass, or the like.
Summary of the Invention
In one aspect, the present invention provides a method for altering the three-dimensional structure of proteins. The method involves the steps of: identifying a native protein with three-dimensional structure to be altered; identifying the hydrophobic residues within this protein; distinguishing the hydrophobic residues on the basis of side chain size into large and small hydrophobic residues; providing mutants of the protein having substitutions in the hydrophobic residues distinguished above; and assaying the mutant proteins for a switch in three-dimensional structure. In another aspect, the method of the invention further involves the steps of generating mutants of the selected protein having at least one of the large hydrophobic residues within its hydrophobic core substituted by hydrophobic residues with small side chains and assaying the mutant proteins for ability to at least maintain the biological function of the native protein. Optionally, the method of the invention involves substituting at least one of the small hydrophobic residues within the hydrophobic core of the protein with large hydrophobic residues.
In yet another aspect, the present invention provides an altered protein comprising p53 oligomerization domain altered according to the above method, wherein the side chain size of the hydrophobic amino acid with the largest side chain in the p53 protein hydrophobic core (Phe341) has been decreased and the side chain of another hydrophobic amino acid in the hydrophobic core (Leu344) has been increased.
In another aspect, the invention provides an altered p53 protein oligomerization domain designed as described above, which has further been modified to contain a Lys at amino acid position 340.
In still another aspect, the invention provides a p53 protein containing the altered p53 oligomerization domains as described above in place of the native p53 oligomerization domain.
In a further aspect, the invention provides p53 fusion proteins comprising an altered p53 oligomerization domain fused to a heterologous protein.
In still a further aspect, the present invention provides nucleic acid sequences encoding the altered proteins according to the present invention. In yet a further aspect, the invention provides vectors comprising nucleic acid sequences of the invention under the control of suitable regulatory sequences.
In another aspect, the invention provides host cells transformed with the vectors of the invention. Also provided are pharmaceutical compositions containing the nucleic acid sequence of the invention and method of administering same. Other aspects and advantages of the present invention are described further in the following detailed description of the preferred embodiments thereof.
Brief Description of the Drawings
Fig. 1 illustrates the three-dimensional structure of the p53wt oligomerization domain corresponding to residues 325-355 of SEQ ID NO: 2. Residue type is indicated by the single letter code: F, Phe; L, Leu.
Fig. 2 illustrates the three-dimensional structure of the p53KIY oligomerization domain corresponding to residues 327-353 of human p53 [SEQ ID NO: 2]. Residue type is indicated by the single letter code: Y, Tyr; I, He.
Fig. 3 A illustrates the packing of residues with large hydrophobic side chains, specifically, Phe341 in p53wt [SEQ ID NO: 2]. The α-helix of one of the subunits is not shown for clarity, also only two of the four subunits are shown. Residue type is abbreviated: L, Leu; F, Phe; N, Asn.
Fig. 3B illustrates the packing of Tyr344 in p53KIY [SEQ ID NO: 4]. Residue type is abbreviated: L, Leu; Y, Tyr; N, Asn.
Detailed Description of the Invention
This invention provides a general method for changing the three-dimensional structure of proteins. The invention further provides modified proteins generated by this method and nucleic acid sequences encoding them. These modified proteins and
nucleic acid sequences are particularly useful in pharmaceutical compositions and therapeutic regimens, and also in biotechnology and other industrial applications.
Suitably, the altered proteins of the invention, have an altered three-dimensional structure and are further characterized by maintaining substantially the same level of a desired biological function of the native protein. However, the altered proteins of the invention advantageously can be designed or selected such that they are also characterized by improved functional properties relative to the native (wild-type) domain, e.g., improved binding ability, improved stability, or the like.
The methods, altered proteins, nucleic acids, and pharmaceutical compositions of the invention are discussed in more detail below.
I. Method For Altering Three-Dimensional Structure of Proteins
In one aspect, the present invention provides a method for altering the three-dimensional structure of proteins without denaturing the proteins. The examples provided herein demonstrate alteration of the three-dimensional structure of the oligomerization domain of wild-type p53. However, other proteins may be readily altered. Desirably, the method of the invention is performed upon a protein that has a hydrophobic core (i.e., upon essentially all proteins). Most preferably, however, the protein is useful for industrial, therapeutic or diagnostic purposes. Included among desirable proteins for application of the method of the invention are non-linear, proteins including, without limitation, industrial enzymes, such as proteases, lipases, chymases, etc., and therapeutically useful proteins, such as the members of the globin family, in which the side chain sizes of specific hydrophobic residues are highly conserved [D. Bashford et al, J. Mol. Biol.. 196: 199 (1987)].
Desirably, the three-dimensional (or crystalline) structure of the protein to be altered is known. Alternatively, the three-dimensional structure may be determined using known techniques, e.g., NMR spectroscopy, x-ray crystallography and the like.
Once selected, the hydrophobic residues within this protein are identified and distinguished into large and small hydrophobic residues on the basis of
side chain size. For the purposes of this specification, amino acid residues Phe, Tyr and Trp are considered large and amino acid residues Ala, Val and He are considered small. Leu is intermediate in size and may be considered either large or small.
More particularly, for the performance of the method of the invention, not only is the total surface area, but also the surface area buried upon folding with is relevant, as the latter determines the strength of the hydrophobic effect [J.R. Livingstone et al, Biochemistry. 30:4237-4244 (1991)]. By way of example, the total surface areas of residues [G.D. Rose et al, Science. 229:834-838 (1985)] and the surface areas buried upon folding are provided in the table below. All values are in square Angstroms
Amino acid Total Surface Area Surface Area Bu
Alanine 118 80-86
Valine 164 127-141
Isoleucine 181 154-158
Leucine 193 156-164
Tyrosine 236 137-177
Phenylalanine 222 168-194
Tryptophan 266 177-224
Once so distinguished, mutant proteins are generated by increasing the size of one or more of the small residues and/or decreasing the size of one or more of the large residues. Since Leu cannot be unambiguously classified as large or small, mutants are generated which replace Leu with both larger and smaller amino acids. As a first step, it is preferable to decrease the size of the largest residue in the hydrophobic core (e.g., Phe is substituted with He, Val or Ala) and increase the size of one of the smaller residues in the hydrophobic core (e.g., Val is substituted with Phe or Trp). All substitutions are made in such a manner as to preserve the hydrophobic character of the native residue. In general, one or two substitutions are made for each secondary structure element whose packing in the three-dimensional structure of the
protein is to be altered. When two substitutions are performed, one decreases the side chain size of a large hydrophobic amino acid and the other increases the side chain size of a small hydrophobic amino acid.
The mutant proteins may be generated using conventional techniques. For example, the peptides may be synthesized using a commercially available automatic synthesizer according to standard procedures. Alternatively, other standard techniques may be utilized. See, e.g., Merrifield, J. Amer. Chem. Soc. 85:2149-2154 (1963). Preferably, however, the mutant proteins of the invention are generated recombinantly, making use of a variety of well-known techniques (such as site- directed mutagenesis [see, Gillman & Smith, Gene. 8:81-97 (1979) and S. Roberts et al, Nature. 328:731-734 (1987)]) and, desirably, the nucleic acid sequences of the invention. See, e.g., Sambrook et al., Molecular Cloning. A Laboratory Manual.. 2d Edit., Cold Spring Harbor Laboratory, New York (1989).
The mutant proteins generated by the method of the invention may be assayed for biological function as a preliminary screening step. In this manner, mutant proteins having the desired biological function (e.g., binding ability) may be selected. The assays for examining the function of the protein will depend on the particular function that needs to be assayed. Such assays are well known to those of skilled in the art and are not a limitation on this invention. For example, if the protein altered according to the invention is a DNA binding protein, then one could use a DNA binding assay to examine the function of the mutant proteins. If engineering involves a protein that forms oligomers, e.g., the oligomerization domain of p53, then one could use a glutaraldehyde crosslinking assay [W. Lee et al, Nature Structural Biology. 1 :877 (1994)] to examine the function of the mutant proteins. Other suitable functional assays will be readily apparent to one of skill in the art, based on the function of the native protein which has been altered by the method of the invention.
The mutant proteins generated according to the method of the invention may then be screened for a change in the three-dimensional structure. Biophysical methods to probe protein structure include NMR spectroscopy, X-ray
crystallography, [G. M. Clore et al, Science. 265: 386 (1994); W. Lee et al, Nature Structural Biol.. \: 877 (1994); G. M. Clore et al, Nature Structural Biol.. 2: 321 (1995); P. D. Jeffrey et al, Science, 267: 1498 (1995)], among other techniques. The applications of a DNA binding assay, a glutaraldehyde crosslinking assay and NMR spectroscopy are illustrated in the examples of engineered p53 oligomerization domains appropriate for determining three-dimensional conformation.
In a currently preferred embodiment, the p53 oligomerization domain has been altered according to the method of the invention. Thus, the method of the invention may involve generating mutant of fragments of a useful protein which is responsible for biological activity. As described above, similar mutants may be generated using the hydrophobic core of other selected proteins, or full-length proteins, as desired.
II. Altered Proteins of the Invention
Using the method of the invention, the inventor has altered the oligomerization domain of p53 to provide a p53 protein with an altered three-dimensional structure and oligomerization stoichiometry relative to the native (wild-type) domain. All references to p53 residue numbers herein refer to the numbering scheme provided by Zakut-Houri et al, EMBO J, 4: 1251-1255 (1985) [GenBank Code Hsp53] for human p53. The nucleotide and amino acid sequences of human p53 are reproduced as SEQ ID NOS: 1 and 2, respectively. Although reference is made by way of example to human p53, one of skill in the art could readily substitute other non-human p53 sequences. Alignment of the highly conserved p53 sequences is provided in Soussi et al, Oncogene. 5_:945-952 (1990).
The p53 tumor suppressor protein is a sequence-specific transcription factor with fundamental significance to the pathogenesis and therapy of human cancer [C. C. Harris, Science. 262: 1980 (1993); L. J. Ko and C. Prives, Genes Dev.. 10: 1054 (1996)]. The tumor suppressor activity of p53 requires homo-oligomerization [M. J. F. Waterman et al, Cancer Res., 56: 158 (1996)], which is mediated by a thirty residue domain at the C-terminus of the protein [H. Sakamoto et al, Proc. Natl.
Acad. Sci. USA. 91 : 8974 (1994); P. Wang et al, Mol. Cell. Biol.. 14: 5182 (1994); J. L. F. Waterman et al, EMBO J.. 14: 512-519 (1995)]. The native thirty residue p53 oligomerization domain has a β-strand, a tight turn and an α-helix in each subunit. [Clore et al, cited above (1994); Lee et al, cited above; Clore et al, cited above, (1995); Jeffrey et al, cited above]. This oligomerization domain folds independently with antiparallel packing of its α-helices.
In a preferred embodiment, the altered p53 generated according to the invention contains substitutions of residues Phe341 and Leu344 [SEQ ID NO: 2] in the α-helix by other hydrophobic amino acids, that decreased the side chain size at position 341 and increased the side chain size at position 344 [corresponding to SEQ ID NO: 2], resulting in an altered p53 domain that assembles as a dimer instead of a tetramer. The three-dimensional structure of a mutant dimeric domain determined in solution by NMR spectroscopy differs substantially from the wild-type structure, since the α-helices are packed parallel, rather than antiparallel, and are rotated significantly relative to each other and to the β-strands. The engineered p53 oligomerization domain was found to drive the sequence-specific DNA binding function of the modified p53 protein.
Thus, in another aspect, the present invention provides altered p53 oligomerization domains that assemble as dimers, rather than tetramers, and which have a different three-dimensional structure relative to wild-type p53. These altered p53 oligomerization domains of the invention [SEQ ID NO: 3] desirably contain the following residues, which differ from the residues in native human p53 [SEQ ID NO: 2].
Position 341 Position 344
Val Phe
Val Trp
Val Tyr
He Phe
He Trp
He Tyr
Leu Phe
Leu Trp
Leu Tyr
— Trp
Optionally, the altered p53 oligomerization domains described above may be further modified to contain Lys at position 340 [SEQ ID NO: 4]. The inventor has found that the latter modification increases the solubility of the mutant p53 oligomerization domains with altered three-dimensional structure. For example, at a 1-2 mM concentration a mutant p53 domain with He at position 341 and Tyr at position 344 [SEQ ID NO: 3] precipitates within one hour when heated to 40°C. In contrast, a mutant p53 domain [SEQ ID NO: 4] with Lys at position 340, He at position 341 and Tyr at position 344 remains soluble under the same conditions. Based on the functional activities of other dimeric p53 proteins
[constructed by replacing the native p53 oligomerization domain with a leucine zipper, which is a dimerization domain found in certain nuclear DNA binding proteins; Pietenpol et al., Proc. Natl. Acad. Sci. USA. 91 : 1998 (1996)], it is anticipated that the altered p53 proteins described herein will retain their tumor suppressor function. Because these p53 proteins are dimers, rather than tetramers, their molecular mass is half of that of wild-type p53 and therefore have certain advantages over wild-type p53. For example, they can be introduced more easily into cells. In addition, because the altered p53 oligomerization domain has a different three-dimensional structure than the wild-type p53 domain, the two types of domains will not hetero-oligomerize.
Accordingly, a p53 tumor suppressor protein will not be sequestered into inactive hetero-oligomers with tumor-derived p53 mutant proteins. Thus, a p53 protein of the invention can be delivered by gene therapy vectors and not be suppressed by the mutant p53 protein present in tumor cells. As desired, the altered p53 oligomerization domains of the invention may be engineered on an otherwise unmodified p53wt protein. The altered p53 oligomerization domain may also be fused to a selected heterologous protein. Alternatively, the altered p53 oligomerization domains may be engineered on p53 proteins bearing additional modifications. One suitable modification is substitution of residue threonine 284 [SEQ ID NO: 2] with Arginine. This substitution enhances the tumor suppressor function of wild-type p53 5- to 7-fold [Wieczorek et al, Nature Medicine. 2: 1143 (1996)].
Suitable heterologous proteins include those which in the past have been fused to a leucine zipper. Leucine zippers have the disadvantage that they may interact with host leucine-zipper bearing proteins, which interaction may compromise the biological activity of the chimeric protein bearing the leucine zipper. The altered p53 oligomerization domains described in this application do not exist in nature, and do not form oligomers with any host proteins. However, because both leucine zippers and the altered p53 oligomerization domain of the invention have similar topologies, i.e., both types of domains have parallel α-helices, they may be used in many similar applications.
An example of suitable heterologous proteins include single chain antibody variable chains (scFv antibodies). The prior art has described scFv antibodies fused to Jun and Fos leucine zippers to produce dimeric antibodies that have higher affinity for their ligands, because they are bivalent [Kruif and Logtenberg, J. Biol. Chem., 271 :7630 (1996)]. According to the present invention, scFv antibodies can be fused to the altered p53 oligomerization domain of the invention by using the sequences encoding the altered p53 oligomerization domain in place of the leucine zipper sequences of the prior art. This will lead to homodimeric (hence monospecific), bivalent (high affinity) antibodies.
Another example of a suitable heterologous protein includes a soluble interleukin-2 (IL-2) receptor. Wu et al, J. Biol. Chem.. 270: 16039 (1995) has described a soluble IL-2 receptor complex formed by attaching leucine zippers to the C-terminus of the extracellular domain of the receptor. According to the present invention, the leucine zipper domain of the prior art is replaced by the altered p53 oligomerization domain of the invention. Such soluble domains are useful for screening ligands (drugs) that bind to the native receptors. Such soluble domains are also useful therapeutically as decoys competing for ligand binding with the endogenous receptors of patients. The methods for attaching the altered p53 oligomerization domain are readily apparent to those of skill in the art. For example, the extracellular domains of the T-cell receptor can be isolated in a soluble form, then fused to the modified p53 domains, using a modified version of the technique described in Chang et al., Proc. Natl. Acad. Sci. USA. 91: 11408 (1994). Where desired or needed, heterodimers can be isolated from homodimers by conventional protein chromatography or other suitable techniques.
As another example, the altered p53 oligomerization domains of the invention may be fused to a transmembrane receptor. Many such receptors become physiologically activated by ligand-induced dimerization. Fusion of a dimerization domain to such receptors can therefore constitutively activate them. Such an example is provided by the Trp-Met fusion receptor, in which the Trp protein provides a leucine zipper which induces dimerization and activation of the Met receptor [Rodriques and Park, Mol. Cell Biol. 13:6711 (1993)]. Similarly, the altered p53 oligomerization domains of the invention may be used to activate a receptor of choice. Depending upon the type of receptor to which the modified p53 oligomerization domain of the invention is fused, the outcome could be cell proliferation or cell death.
The altered p53 oligomerization domain of the invention can also be used to induce dimerization of DNA binding proteins. Many DNA binding proteins, for example c-Myc, bind DNA as dimers. c-Myc will not homodimerize, but will bind DNA as a hetero-dimer with a protein called Max. However, c-Myc will homo- dimerize if its native leucine zipper is replaced by the leucine zipper of GCN4, since
the latter zipper has a high tendency to homo-oligomerize. A c-Myc fusion bearing a GCN4 leucine zipper binds DNA with a very high affinity [Halazonetis and Kandil, Science. 255:464 (1992)]. Thus, a c-Myc protein whose native C-terminus is fused to the modified p53 oligomerization domain of this invention, would bind to DNA with high affinity and could compete for DNA binding of the native Myc/Max heterodimer without interfering with native proteins which contain leucine zippers. Furthermore, if the chimeric Myc protein lacked the N-terminal domain of p53, which is required for carcinogenic transformation of cells overexpressing c-Myc, then it can be used to revert the tumorigenic phenotype of cells overexpressing c-Myc, such as many leukemia and lymphoma cells. Other proteins that bind DNA and are implicated in cancer development, such as EWS-ATF-1 [Fujimura et al., Oncogene. 12: 159 (1996)] and the E2A-HLF [Yoshihara et al., Mol. Cell. Biol.. 15:3247 (1995)], can be similarly modified.
As described above, altered p53 produced according to the method of the invention is used by way of example only. Other proteins (e.g., enzymes, antibodies and members of the globin family) may be similarly altered and used for a variety of purposes, as described herein.
Ill . Nucleic A cid Sequences
The present invention further provides nucleic acid sequences encoding the altered proteins of this invention. In addition to the coding strand, the nucleic acid sequences of the invention include the complementary DNA sequence representing the non-coding strand, the messenger RNA sequence, the corresponding cDNA sequence and the RNA sequence complementary to the messenger RNA sequence. Variants of these nucleic acids of the invention include variations due to the degeneracy of the genetic code and are encompassed by this invention. Such variants may be readily identified and/or constructed by one of skill in the art. In certain cases specific codon usage may be employed to optimize expression. The above nucleotide sequences can be included within larger DNA or RNA fragments, or may be interrupted by introns.
A. Expression Vectors
In another embodiment, the nucleic acids encoding the proteins of the invention are present in the context of vectors suitable for amplification in prokaryotic or eukaryotic cells. Many such vectors are known and many of these are commercially available. For example plasmids with bacterial or yeast replication origins allow amplification in bacteria or yeast, respectively. Such vectors allow the production of large quantities of nucleic acids encoding the proteins of the invention, which nucleic acids can be used for gene therapy or for expression of the proteins of the invention, e.g., p53. In yet another embodiment the nucleic acids encoding the proteins of the invention are present in the context of vectors suitable for expression in cell-free extracts or lysates or in prokaryotic or eukaryotic cells. Many such vectors are known [Ausubel et al, Current Protocols in Molecular Biology. Greene Publishing Associates and John Wiley & Sons, NY (1997)] and many of these are commercially available. For example, the vector pGEM4 (Promega, Madison, WI) is suitable for expression of the proteins in cell-free lysates, while the vector pSV2 [ATCC] is suitable for expression in mammalian cells. Such vectors allow the production of the proteins of the invention in vitro for analysis of their functional properties or for delivery to patients.
B, Gene Therapy Vectors
The nucleic acid sequences of the invention may be inserted into a vector capable of targeting and infecting a desired cell, either in vivo or ex vivo for gene therapy, and causing the encoded modified protein of this invention to be expressed by that cell. Many such viral vectors are useful for this purpose, e.g., adenoviruses, retroviruses and adeno-associated viruses (AAV) [Schreiber et al. , Biotechniques. 14: 818-823 (1993); Davidson et al, Nature Genetics. 3: 219-223 (1993); Roessler et al, J. Clin. Invest.. 92: 1085-1092 (1993); Smythe et al, Ann. Thorac. Surg.. 57: 1395-1401 (1994); Kaplitt et al, Nature Genetics. 8: 148-154 (1994)]. There has already been success using viral vectors driving expression of
wild-type p53 [Fujiwara et al, Cancer Res.. 53: 4129-4133 (1993); Fujiwara et al, Cancer Res.. 54: 2287-2291 (1994); Friedmann, Cancer. 70(6 Suppl): 1810-1817 (1992); Fujiwara et al, Curr. Opin. Oncol.. 6: 96-105 (1994b); Roth et al, Nature Medicine. 2:985-991 (1996)]. For use in gene therapy, these viral vectors containing nucleic acid sequences encoding a protein of the invention, e.g., an altered p53 protein, are prepared by one of skill in the art with resort to conventional techniques (see references mentioned above). For example, a recombinant viral vector, e.g. an adenovirus, of the present invention comprises DNA of at least that portion of the viral genome which is capable of infecting the target cells operatively linked to the nucleic acid sequences of the invention. By "infection" is generally meant the process by which a virus transfers genetic material to its host or target cell. Preferably, the virus used in the construction of a vector of the invention is rendered replication-defective to remove the effects of viral replication on the target cells. In such cases, the replication-defective viral genome can be packaged by a helper virus in association with conventional techniques.
Briefly, the vector(s) containing the nucleic acids encoding an altered protein of the invention is suspended in a pharmaceutically acceptable carrier, such as saline, and administered parenterally (or by other suitable means) in sufficient amounts to infect the desired cells and provide sufficient levels of modified protein to achieve the desired therapeutic or prophylactic effect, e.g., sufficient p53 activity to arrest abnormal cellular proliferation. Other pharmaceutically acceptable carriers are well known to those of skill in the art. A suitable amount of the vector containing the chimeric nucleic acid sequences is between about 106 to 109 infectious particles per mL carrier. The delivery of the vector may be repeated as needed to sustain satisfactory levels of biological activity. For example, where modified p53 is administered, activity may be determined by monitoring clinical symptoms.
As desired, this therapy may be combined with other therapies for the disease or condition being treated. For example, therapy involving the administration of a vector capable of expressing an altered p53 protein of the
invention is well suited for use in conjunction with conventional cancer therapies, including surgery, radiation and chemotherapy.
Nucleic acid sequences driving expression of a protein of the invention may also be introduced by "carriers" other than viral vectors, such as liposomes, nucleic acid-coated gold beads or can simply be injected in situ [Fujiwara et al (1994b), cited above; Fynan et al, Proc. Natl. Acad. Sci. USA. 90: 11478-11482 (1993); Cohen, Science. 259: 1691-1692 (1993); Wolff et al, Biotechniques. U : 474-485 (1991)].
IV. Pharmaceutical Compositions The altered proteins and nucleic acid sequences of this invention may also be formulated into pharmaceutical compositions and administered using a therapeutic regimen compatible with the particular formulation. When administered in the form of nucleic acid sequences, the composition may contain "naked" DNA, or a vector containing the nucleic acid sequences. As used herein, the term "naked DNA" means substantially pure DNA which is not associated with a protein, lipid, carbohydrate or contained within a cell or an artificial delivery system such as a liposome.
Pharmaceutical compositions within the scope of the present invention include compositions containing an altered protein of the invention (or a nucleic acid sequence encoding a modified protein) in an effective amount to have the desired physiological effect, e.g. to arrest the growth of cancer cells without causing unacceptable toxicity for the patient.
Suitable formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble or water-dispersible form, e.g. saline. Alternatively, suspensions of the active compounds may be administered in suitable conventional lipophilic carriers or in liposomes.
The compositions may be supplemented by active pharmaceutical ingredients, where desired. Optional antibacterial, antiseptic, and antioxidant agents in the compositions can perform their ordinary functions. The pharmaceutical
compositions of the invention may further contain any of a number of suitable viscosity enhancers, stabilizers, excipients and auxiliaries which facilitate processing of the active compounds into preparations that can be used pharmaceutically. Preferably, these preparations, as well as those preparations discussed below, are designed for parenteral administration. However, compositions designed for oral or rectal administration are also considered to fall within the scope of the present invention.
Those of skill in the pharmaceutical art should be able to derive suitable dosages and schedules of administration. As used herein, the terms "suitable amount" or "effective amount" means an amount which is effective to treat the conditions referred to below. A preferred dose of a pharmaceutical composition containing a protein of this invention is generally effective above about 0.1 mg modified protein per kg of body weight (mg/kg), and preferably from about 1 mg/kg to about 100 mg/kg. These doses may be administered with a frequency necessary to achieve and maintain satisfactory activity levels. Although a preferred range has been described above, determination of the effective amounts for treatment of each type of tumor or other condition may be determined by those of skill in the art.
Dosage units of such pharmaceutical compositions containing the proteins of this invention preferably contain about 1 mg to 5 g of the protein.
V. Therapeutic Indications
The nucleic acids encoding altered p53 proteins and the altered p53 proteins themselves can be introduced into human patients for therapeutic benefits in conditions characterized by insufficient wild-type p53 activity. Such conditions have been described in the art. See, e.g., PCT/US95/15353 (June 6, 1996). For example, the pharmaceutical compositions of the invention, including the gene therapy vectors, may be employed to induce the cellular defense to DNA damaging agents such as sunlight UV irradiation, as well as radiation and chemotherapeutics used for cancer treatment. The therapeutic indications include inducing apoptosis of specific cells,
such as proliferating lymphocytes, the prevention of transplant rejection, and the treatment of autoimmune diseases, e.g., systemic lupus erythrematosis, rheumatoid arthritis and the like.
The pharmaceutical compositions of this invention may also be employed to restore p53 function in tumor cells and to suppress cell proliferation in diseases other than cancers, which are characterized by aberrant cell proliferation. Among such diseases are included psoriasis, atherosclerosis and arterial restenosis. Pharmaceutical compositions containing other altered proteins of the invention (or nucleic acids encoding them) may also be readily prepared and used for a variety of indications which will be readily apparent to one of skill in the art.
VI. Antibodies
The altered proteins of the invention are useful for generating antibodies, which may be used as diagnostic reagents, for example, to monitor the presence of modified protein or protein domain. Specific antisera may be generated using known techniques. See,
Sambrook, cited above, Chapter 18, generally, incorporated by reference. Similarly, antibodies of the invention, both polyclonal and monoclonal, may be produced by conventional methods, including the Kohler and Milstein hybridoma technique, recombinant techniques, such as described by Huse et al, Science, 246: 1275-1281 (1 88), or any other techniques known to the art.
The invention further encompasses functional fragments of the antibodies of the invention, including, Fab, Fv, and F(ab')2 fragments, the binding site of the antibodies, and the complementarity determining regions (CDRs). Optionally, the binding site and/or CDRs may be contained in a synthetic molecule which provides antibody framework regions. Further, these functional fragments may be used in the production of recombinant antibodies, including bifunctional antibodies, chimeric antibodies, and humanized antibodies, which preferably retain the antigen binding specificity of the antibodies of the invention. Such recombinant antibodies may be constructed and produced according to known techniques [see, e.g., S. D. Gillies et
al, J. Immunol. Meth.. 125: 191-202 (1989); and G. E. Mark and E. A. Pladlan, "Humanization of Monoclonal Antibodies", The Handbook of Experimental Pharmacology. Vol. 113, Chapter 4, pp. 105-133, Springer- Verlag (June, 1994)]. These functional fragments and recombinant antibodies may be used for a variety of purposes, including any of those described herein for the antibodies of the invention. In general, polyclonal antisera, monoclonal antibodies and other antibodies which bind to an altered protein as antigen are useful as research tools, as diagnostic reagents, as therapeutic agents, and for producing other antibodies (as described above) which are similarly useful.
VII. Diagnostic Reagents
The altered proteins of the invention may be used therapeutically or as diagnostic reagents. These reagents may optionally be labeled using diagnostic labels, such as radioactive labels, colorimetric enzyme label systems and the like conventionally used in diagnostic or therapeutic methods. Alternatively, the N- or C- terminus of an altered protein of the invention may be tagged with a detectable label which can be recognized by a specific antisera. For example, the reagents derived from p53 may be used in diagnosis of a variety of conditions associated with p53 and/or aberrant cell proliferation, including autoimmune diseases, e.g., systemic lupus erythrematosis, rheumatoid arthritis and the like, cancers, psoriasis, atherosclerosis and arterial restenosis. For example, in tissue biopsies, the presence of p53 could be directly verified by RT-PCR or immunostaining. Reagents produced from other altered proteins of the invention, e.g., antibodies and the like, may similarly be utilized as diagnostic reagents. The selection of the appropriate assay format and label system is within the skill of the art and may readily be chosen without requiring additional explanation by resort to the wealth of art in the diagnostic area.
These examples illustrate the method of the invention as performed in connection with p53 and the preparation of modified p53 proteins of the invention. These examples are illustrative only and do not limit the scope of the invention.
Example 1 - Method for altering the three-dimensional structure of wild-type p53 Wild-type p53 was identified as the native protein with three-dimensional structure to be altered, according to the method of the invention.
The three-dimensional structure of this protein was studied by NMR spectroscopy and X-ray crystallography [G. M. Clore et al, Science. 265: 386 (1994); W. Lee et al,
Nature Structural Biol.. 1: 877 (1994); G. M. Clore et al, Nature Structural Biol.. 2:
321 (1995); P. D. Jeffrey et al, Science. 267: 1498 (1995)].
Using these techniques, wtp53 [SEQ ID NO: 2] was found to have a structure which consists of a β-strand, a tight turn and an α-helix. Four identical subunits assemble as a dimer of dimers (Fig. 1). In the primary dimer, the β-strands form an antiparallel β-sheet and the α-helices also pack antiparallel. Two primary dimers form a tetramer by packing their α-helices at an 81 ° angle.
According to the method of the invention, the residues of the hydrophobic core were distinguished into large and small hydrophobic residues. The center of the hydrophobic core of the domain is formed primarily by residues Phe341 and Leu344, Phe341 is positioned at the interface of the two p53 monomers that form the primary dimers, whereas Leu344 forms the interface between the primary dimers
[Fig. 1; SEQ ID NO: 2].
Mutants of p53 were then generated according to the method of the invention and assayed for a switch in conformation. Plasmids encoding mutant p53 proteins were generated by PCR-directed mutagenesis of pGEMhump53wtB, as described in Waterman et al, EMBO J.. 14:512-519 (1995) which is incorporated by reference herein. The names of the mutants indicate the hydrophobic residues at positions 341 and 344 [SEQ ID NO: 3], respectively, using the single letter amino acid code. For example, p53FL is wild-type p53 [SEQ ID NO: 2]. Additional mutants made include: p53A344, Ala344 [SEQ ID NO: 3]; p53IF, Ile341 and Phe344
[SEQ ID NO: 3]; p53IY, Ile341 and Tyr344; p53KIY, Lys340, Ile341 and Tyr344
[SEQ ID NO: 4]; and p53K340, Lys340 [SEQ ID NO: 4],
Once generated, DNA binding of the mutants was assayed using 32P-labeled oligonucleotide BC.S10 and in vitro translated p53 (Waterman et al, cited
above)]. DNA binding is an indirect measure of the function of the p53 oligomerization domain. This assay was performed to screen the mutants whose function is similar to wtp53, as a preliminary to assaying for conformational switch. Table 1 shows the effect of amino acid substitutions targeting residues 341 and 344 of human p53 on the subunit stoichiometry of p53 as assayed by the electrophoretic migration of its complexes with DNA. The names of the mutants [SEQ ID NO: 3] indicate the hydrophobic residues at positions 341 and 344, respectively, using the single letter amino acid code. p53FL is wild-type p53 [SEQ ID NO: 2].
Table 1
Protein Subunit Stoichiometry p53VI tetramer p53VL tetramer p53VF dimer p53VW dimer p53II tetramer p53IL tetramer p53IF dimer p53IW dimer p53LI tetramer p53LL tetramer p53LF dimer/tetramer mixture p53LW dimer p53FI tetramer p53FL tetramer p53FF tetramer p53FW dimer
A. DNA Binding Assay
Full-length p53 proteins with Phe341 substituted for Val, He or Leu, and Leu344 for He, Phe or Trp, and all combinations thereof, retain their ability to bind DNA (Table 1) [SEQ ID NO: 3]. Since monomeric full-length p53 does not bind DNA [M.J. F. Waterman et al, Cancer Res.. 56: 158 (1996; T. D. Halazonetis et al, EMBO J.. 12:5057 (1993); P. Hainaut et al, Oncogene. 9:299 (1994)], these substitutions do not denature the oligomerization domain. Some complexes of the mutant proteins with DNA migrate in gel electrophoresis faster than those with the wild-type protein, a behavior that in general correlated with the side chain size of residue 344 being greater than that of residue 341 (Table 1). Electrophoretic migration on native gels depends on molecular size [J. L. Hedrick and A.J. Smith, Archs. Biochem. Biophys.. 126:155 (1968)], suggesting an effect of the substitutions on oligomerization stoichiometry. Indeed, the DNA complex of p53IF [SEQ ID NO: 3] comigrated with that of p53 Ala344, a known dimeric mutant [J.L.F. Waterman and T.D. Halazonetis, EMBO J. 14:512-519 (1995)]. Furthermore, DNA binding of p53IF [SEQ ID NO: 3] was unaffected by substitutions of Met 340 with Lys and/or of Phe344 with Tyr. Since these residues are at the dimer-dimer interface in wild-type p53 (Fig. 1), tolerance to the substitutions in p53IF [SEQ ID NO: 3] is consistent with it not having a dimer-dimer interface. In contrast, substitution of Met340 with Lys dissociated wild-type p53 into dimers.
B. Assay for Stoichiometry
Since DNA binding is only an indirect measure of the function of the p53 oligomerization domain, certain mutants whose complexes with DNA migrated faster than wild-type p53 were selected for further study. Wild-type p53 (p53wt; SEQ ID NO: 2) and these mutant p53 oligomerization domains [SEQ ID NOS: 3 and 4] were expressed in E. cυli, purified to homogeneity and assayed for subunit stoichiometry by glutaraldehyde crosslinking using the techniques described below.
1. Expression and Purification
A PCR fragment with 5' Bam HI and 3' Hind III restriction sites encoding a methionine and amino acids 304-363 of human wild-type p53 [SEQ ID NO: 2] was cloned in the vector pTST [S.P. Eisenberg et al, Nature. 343:341 (1990)]. Equivalent plasmids were constructed for the mutants. Proteins were expressed in E. coli (BL21) grown at 30°C in unlabeled or isotopically labeled 15N- or 15N- and 13C-labeled minimal media (Isotec, Miamisburg, OH). After overnight induction with IPTG [Sigma, St. Louis, MO, the cells were pelleted, stirred on ice for 20 min. in glycerol, 0.7% v/v Triton-X and 0.4% v/v β-mercaptoethanol, and then for another 15 min. in lysis buffer (10 mM Tris [pH 8.0], 500 mM NaCl, 5 mM EDTA, 1 mM DTT, 0.6 mg/ml lysozyme) containing Pefabloc SC (Pentapharm, Basel, Switzerland) and pepstatin (Sigma, St. Louis, MO). 120 U/ml DNase I (Sigma) was added and stirring continued for another 30 min. The lysate was cleared by centrifugation at 300,000 g for 70 min., adjusted to 1.5 M ammonium sulfate and cleared again by centrifugation at 300,000 for 15 min. p53 was purified in three steps. In a first step, the p53 was run on a 50 ml Phenyl-Sepharose column (Pharmacia, Piscataway, NJ) equilibrated with 1.5 M ammonium sulfate, 50 mM sodium phosphate [pH 7.0], 5 mM EDTA and eluted by decreasing salt concentration on a GradiFrac system (Pharmacia). In a second step, the eluate from the first step was run on an 8 ml Phenyl-Superose column (Pharmacia) equilibrated with 1.7 M ammonium sulfate, 50 mM sodium phosphate [pH 7.0], 5mM EDTA and eluted by decreasing salt concentration on a SMART system (Pharmacia). In a third step, the eluate from the second step was run on a 1 ml Resource S column (Pharmacia) equilibrated with 50 mM sodium phosphate [pH 7.0], 50 mM NaCl, 0.01 mM EDTA and eluted by increasing NaCl concentration on the SMART system. The eluate from the third step, purified p53, was adjusted to 50 mM sodium phosphate [pH 7.0], 50 mM NaCl, 0.1 mM deuterated EDTA (Isotec), 0.1 mM sodium azide by gel filtration on a Sephadex G-50 column (NAP, Pharmacia) and concentrated to 0.25 ml by ultrafiltration (Amicon, Beverly, MA) to a final concentration of about 2 mM.
2. Glutaraldehyde Crosslinking
Mutants p53He341Tyr344 (p53IY) [SEQ ID NO: 3] and p53Lys340He341Tyr344 (p53KIY; SEQ ID NO: 4) were studied, because they function like p53IF [SEQ ID NO: 3] in the DNA binding assay, but are less hydrophobic and therefore less likely to aggregate non-specifically in concentrated samples.
Concentrations (0.1 mM protein) of the p53 oligomerization domains, purified as described above, were incubated with 0.1% v/v glutaraldehyde (Sigma, St. Louis, MO) for 15 min. at 37°C in a buffer containing 200 mM sodium phosphate and 50 mM Tris-HCl [pH 7.5] and immediately analyzed by SDS-PAGE.
Table 2 illustrates subunit stoichiometry of p53wt [SEQ ID NO: 2], p53IY (Ile341 and Tyr344) [SEQ ID NO: 2] and p53KIY (He341 and Tyr344) [SEQ ID NO: 4] oligomerization domains as determined by glutaraldehyde (Gl.) crosslinking and SDS-gel electrophoresis.
TABLE 2 Protein Subunit Stoichiometry p53wt Tetramer p53IY Dimer p53KIY Dimer
The wild-type domain crosslinked into tetramers, confirming its known tetrameric subunit stoichiometry. In contrast, p53IY [SEQ ID NO: 3] and p53KIY [SEQ ID NO: 4] formed only dimers. These dimers were quite stable, since the efficiency of crosslinking was undiminished at temperatures as high as 55°C.
Thus, the oligomerization stoichiometries of p53wt [SEQ ID NO: 2] and the p53 mutants [SEQ ID NOS: 3 and 4] were found to correlate with the relative sizes of their hydrophobic side chains at positions 341 and 344.
Example 2 - NMR Structure Determination
Multidimensional solution NMR spectroscopy was performed on uniformly 15N- and 15N,13C-labeled samples to determine the structural basis for the switch in oligomerization stoichiometry observed in the mutant proteins as follows. At high concentration and temperature, p53IY [SEQ ID NO: 3] aggregated in less than one hour. In contrast, p53KIY [SEQ ID NO: 4] (and p53wt; SEQ ID NO: 2) could be maintained at 40 °C at high concentrations for months without aggregation. Detailed structural analysis of p53 KIY was then pursued. All experiments were performed at 40 °C on two Bruker DMX instruments operating at 750 and 600 MHz and the spectra processed using FELIX (Biosym Inc., San Diego, CA). Backbone sequential assignments were determined from HNCA, HNCOCA, CBCACONH and HNCACB 3D triple (1H 13C, 15N) resonance and TOCSY-HMQC and NOESY-HMQC 3D double (1H 15N) resonance experiments [D. Marion et al, Biochemistry, 28: 6150 (1989); S. Grzesiek and A. Bax, J. Magn. Reson.. 96: 432 (1992); S. Grzesiek and A. Bax, J. Am. Chem. Soc. 114: 6291 (1992); M. Wittekind and L. Mueller, J. Magn. Reson.. 101B: 201 (1993); F. Lohr and H. Ruterjans, J. Biomolec NMR. 5: 25 (1995)]. Side chain assignments were made using 2D COSY, 2D TOCSY, 3D TOCSY-HMQC and 3D HCCH-TOCSY experiments [R. R. Ernst et al, Principles of Nuclear Magnetic Resonance in One and Two Dimensions (Oxford University Press, New York, 1986); A. Bax et al, J. Magn. Reson.. 88: 425 (1990)]. NOE restraints were obtained from 2D NOESY (unlabeled sample in D20), 3D 15N-edited NOESY-HMQC (15N-labeled sample) and 13C-edited HSQC-NOESY (15N/13C-labeled sample) experiments [A. Majumdar and E. R. P. Zuiderweg, J. Magn. Reson.. 102B: 242 (1993)]. Inter- and intrasubunit NOEs were differentiated with 12C-filtered/13C-edited and 13C-filtered/13C- edited HMQC-NOESY-HMQC experiments performed on an equilibrated 1 : 1 mixture of 12C- and 13C-labeled samples [W. Lee et al, FEBS Letters. 350: 87 (1994)]. Coupling constants were obtained from HMQC-J, HNHA and HNCA-J measurements
and hydrogen bonds from proton-deuterium exchange [K. Wuthrich, NMR of proteins and nucleic acids (Wiley, New York, 1986), G. Vuister and A. Bax, J. Am. Chem. Soc. 115: 7777 (1993)].
The backbone *H and 15N amide resonance frequencies of residues Gly325, Leu330, Gln331, Arg333, Arg337, Phe338, Glu339, Glu343, Asn345,
Glu346, Ala347 and Lys351 [corresponding to SEQ ID NO: 2] differed by more than 0.4 ppm in the proton frequency and/or by more than 1.2 ppm in the nitrogen frequency between p53KIY [SEQ ID NO: 4] and p53wt [SEQ ID NO: 2] (Table 3). Such large chemical shift differences are suggestive of changes in three-dimensional structure [K. Wuthrich, cited above], especially since some residues, such as Gln331 and Arg333 are 10 and 7 A, respectively, from the nearest substituted residue in the established p53wt structure [G. M. Clore et al, Science. 265: 386 (1994); W. Lee et al, Nature Structural Biol.. 1: 877 (1994); G. M. Clore et al, Nature Structural Biol.. 2: 321 (1995); P. D. Jeffrey et al, Science. 267: 1498 (1995). Using amide resonance frequencies (two-dimensional HSQC spectra) as monitors of three-dimensional structure [K. Wuthrich, cited above], verification that this pattern of chemical shift changes was due to the amino acids at positions 341 and 344 was obtained by studying p53IF [SEQ ID NO: 3], p53Lys340Ile341Phe344 (p53KIF; SEQ ID NO: 4), p53K340 [SEQ ID NO: 4] and p53Phe344 (p53FF; SEQ ID NO: 3) proteins. The first three mutant proteins were examined as crude bacterial lysates [A. M. Gronenborn and G. M. Clore, Protein Science. 5:174 (1996)], whereas p53FF [SEQ ID NO: 3] was purified. A number of well-resolved resonances in a small region of the HSQC spectra, corresponding to the β-strand, indicated that the shifts in resonance frequencies characteristic of p53KIY [SEQ ID NO: 4] occurred in p53KIF and p53IF [SEQ ID NO: 3] (mutants with a larger hydrophobic side chain at position 344 relative to 341), but not in p53K340 [SEQ ID NO: 4] or in p53FF [SEQ ID NO: 3] (Table 3) which did not differ from p53wt [SEQ ID NO: 2],
Table 3 illustrates shifts in the amide resonance frequencies of Gln331 and Arg333 by amino acid substitutions targeting Phe341 and Leu344 related to
p53wt [SEQ ID NO: 2]. Proteins are labelled as in Tables 1 and 2. P53K340 [SEQ ID NO: 4] has Lys at position 340. P53KIF [SEQ ID NO: 4] has Lys340, Ile341 and Phe344.
Table 3
Shift > 1.5 ppm Shift > 0.5 ppm in the 15N amide in the 'H amide
Protein frequency of Gln331 frequency of Arg333 p53wt No No p53KIY Yes Yes p53KIF Yes Yes p53IF Yes Not determined p53K340 No No p53FF No No
While proteins with either Tyr or Phe at position 344 yield similar spectra [SEQ ID NO: 3], the substitution of Met340 for Lys [SEQ ID NO: 4], useful in preventing aggregation, does not affect the spectra. A double amino acid substitution involving residues 341 and 344 was required to elicit changes in resonance frequencies, since p53FF [SEQ ID NO: 3], which contains a substitution of only Leu344 for Phe, displays HSQC spectra very similar to p53wt [SEQ ID NO: 2] (Table 3). Notably, in the DNA binding assays, p53FF [SEQ ID NO: 3] and p53wt [SEQ ID NO: 2] are tetramers, whereas p53IF [SEQ ID NO: 3], p53KIY [SEQ ID NO: 4], and p53KIF [SEQ ID NO: 4] are dimers (Table 2).
To monitor intersubunit NOEs an equilibrated 1 : 1 mixture of 15N- and 15N, 13C-labeled p53KIY polypeptides [SEQ ID NO: 3] was subjected to a 12C- filtered, 13C-edited HMQC-NOESY-HMQC experiment. The intersubunit contacts involving residues Thr329 and Leu330 were consistent with an antiparallel β-sheet, as in p53wt [SEQ ID NO: 2]. However, the short-range contacts involving residues Ile341, Ala347, Leu348 and Leu350 suggested parallel packing of the regions
corresponding to residues 341 to 350 [SEQ ID NO: 2]. This was inconsistent with antiparallel packing of the α-helices as observed in the primary dimer of p53wt [SEQ ID NO: 2] (Fig. 1).
The surprising magnitude of structural change was confirmed in the well-defined three-dimensional structure for p53KIY [SEQ ID NO: 4] (Table 4) calculated from experimentally derived distance, dihedral angle and hydrogen bond restraints as follows. The structures were folded from both random and distance geometry embedded starting points using simulated annealing minimization, a total of 678 experimental distance, dihedral angle and hydrogen bond restraints for residues 327-353 [SEQ ID NOS: 2 and 3] and a non-crystallographic symmetry term [A. T. Brϋnger, X-PLOR. A system for X-ray crystallography and NMR (Yale University Press, New Haven and London, 1992)]. NOE intensities were classified as strong, medium and weak, corresponding to distance restraints of 1.8-3.2, 1.8-4.0 and 1.8-5.0 A, respectively. Dihedral angles were restrained to -140±60° or to 60±55° for 3JHNHα measurements of >8.5 or <5 Hz, respectively. Hydrogen bond restraints were incorporated as two NOEs restraining O-NH to 1.7-2.3 A and N-O to 2.8-3.3 A. Pseudo atom restraints were used whenever stereospecific assignments could not be made. The average structure was calculated from thirty simulated annealing structures and was refined using restrained minimization and a repulsive term to stimulate the van der Waal's potential [Brϋnger, cited above]. Geometry was evaluated with PROCHECK and PROMOTIF [R.A. Laskowski et al, J. Appl. Cryst.. 26:283 (1993)].
The results are provided in Table 4. In this table, <SA> is the ensemble of the 30 final simulated annealing structures and (SA)r is the refined average structure. The number of restraints is given in parentheses and applies to the entire dimer. The Lennard-Jones Energy was calculated using the CHARMM22 force field [B.R. Brooks et al, J. Comput. Chem., 4: 187 (1983)].
Table 4 - Structure statistics
RMSD from Experimental
Distance Restraints(A)1 <SA> (SA)r
Intrasubunit (384)
Intraresidue (122) 0.001±0.0003 0.000
Sequential (120) 0.053±0.012 0.097
Short Range (124) 0.031±0.013 0.035
Long Range (18) 0.064+0.020 0.061
Intersubunit (200) 0.044±0.007 0.077
Hydrogen Bonds (38) 0.073±0.01 1 0.100
RMSD from Experimental Angle Restraints(°)
Dihedral Angles (56) 0.62±0.21 0.87
RMSD from Idealized Geometry
Bonds (A) 0.0028+0.0001 0.0062
Angles (°) 0.55±0.026 0.69
Impropers (°) 0.47±0.078 0.66
Lennard-Jones Energy
(kcal/mol) -407±23 -437
Atomic RMSD from Average (A)
Backbone 0.51
All Atoms 1.41
Core Sidechains 1.05
1 None of the simulated annealing structures was found to have distance violations greater than 0.3 A or dihedral angle violations greater than 3 °.
Example 3 - Three-Dimensional Structure
The p53KIY oligomerization domain [SEQ ID NO: 4] is a dimer with two-fold cyclic symmetry. Each subunit consists of three secondary structure elements: a β-strand forms an antiparallel β-sheet and the two α-helices pack parallel to each other (Fig. 2). Parallel packing of the α-helices is stabilized by hydrophobic interactions substantially involving Tyr344, which interacts with Ile341 of the same subunit and Tyr344 of the other (Fig. 2).
The structural switch between p53KIY [SEQ ID NO: 4] and p53wt [SEQ ID NO: 2] can be evaluated by comparing their structures (Figs. 1 and 2). Such comparison reveals differences in the orientation of the secondary structure elements. The interhelical angle changes from 155° in the primary dimer of p53wt to 83° in p53KIY [SEQ ID NO: 4], similar to the 81 ° angle with which the α-helices pack across primary dimers in p53wt.
The switch between two well-defined structures observed here for the p53 oligomerization domain is remarkable. p53wt [SEQ ID NO: 2] and p53KIY [SEQ ID NO: 4] differ by only three amino acid substitutions. The changes in the sizes of the side chains at positions 341 and 344 appear to be necessary and sufficient for the structural switch.
The effect of changing the side chain sizes of residues 341 and 344 on protein three-dimensional structure may be attributed to the hydrophobic effect [W.
Kauzmann, Adv. Protein Chem.. 14: 1 (1959); C. Chothia, Nature. 248: 338 (1974); F. M. Richards, Annu. Rev. Biophys. Bioeng.. 6: 151 (1977); K. A. Dill, Biochemistry. 24:1501 (1985); K. A. Dill, Biochemistry. 29: 7133 (1990); J. R. Livingstone et al, Biochemistry. 30: 4237 (1991); K. A. Sharp et al, Biochemistry. 30: 9686 (1991); C. N. Pace, J. Mol. Biol.. 226: 29 (1992). In p53wt, Phe341 is protected from exposure to solvent by being in the protein interior, where it interacts with multiple amino acids. In p53KIY [SEQ ID NO: 3], the decrease in side chain size of residue 341 results in fewer interactions being required to shield its surface from solvent, while the increase in side chain size of residue 344 requires new interactions. The switch in three dimensional structure between p53wt [SEQ ID NO: 2] and p53KIY [SEQ ID NO: 4]
can be explained in terms of the loss of contacts that bury the side chain of residue 341 and gain in contacts that bury the side chain of residue 344. Specifically in p53wt, the tip of the Phe341 ring lies in a hydrophobic pocket formed by the side chains of Leu344, Asn345 and Leu348, all from the other subunit (Fig. 3A). In p53KIY, Tyr344 not Ile341, interacts with Tyr344, Asn345 and Leu348, from the other subunit (Fig. 3B). The intersubunit interaction between Leu348 and Phe341 in p53wt [SEQ ID NO: 2] stabilizes antiparallel packing of the α-helices, whereas in p53KIY [SEQ ID NO: 4] the intersubunit interaction between Leu348 and Tyr344 stabilizes parallel packing. The change in subunit stoichiometry between the p53wt [SEQ ID NO:
2] and p53KIY [SEQ ID NO: 4] oligomerization domains is probably secondary to the altered packing of the α-helices. In p53wt, residues Leu344 form a hydrophobic patch for assembly of two primary dimers into a tetramer (Fig. 1). In p53KIY [SEQ ID NO: 4], Tyr 344 is involved in parallel packing of the α-helices and does not allow two dimers to form a tetramer.
All references referred to above are incorporated by reference herein. Numerous modifications and variations of the present invention are included in the above-identified specification and are expected to be obvious to one of skill in the art. Such modifications and alterations to the compositions and processes of the present invention are believed to be encompassed in the scope of the claims appended hereto.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Wistar Institute of Anatomy &, Biology
Halazonetis, Thanos D.
(ii) TITLE OF INVENTION: Methods for Altering Three-Dimensional
Protein Structure and Compositions Produced Thereby
(iii) NUMBER OF SEQUENCES: 4
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Howson and Howson
(B) STREET: Spring House Corporate Cntr., PO Box 457
(C) CITY: Spring House
(D) STATE: Pennsylvania
(E) COUNTRY: USA
(F) ZIP: 19477
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: WO
(B) FILING DATE:
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 60/035,458
(B) FILING DATE: 17-JAN-1997
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Kodroff, Cathy A.
(B) REGISTRATION NUMBER: 33,980
(C) REFERENCE/DOCKET NUMBER: WST74APCT
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 215-540-9200
(B) TELEFAX: 215-540-5818
(2) INFORMATION FOR SEQ ID NO:l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1317 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
( ix ) FEATURE :
(A) NAME/KEY: CDS
(B) LOCATION: 136..1314
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
GTCTAGAGCC ACCGTCCAGG GAGCAGGTAG CTGCTGGGCT CCGGGGACAC TTTGCGTTCG 60
GGCTGGGAGC GTGCTTTCCA CGACGGTGAC ACGCTTCCCT GGATTGGCAG CCAGACTGCC 120
TTCCGGGTCA CTGCC ATG GAG GAG CCG CAG TCA GAT CCT AGC GTC GAG CCC 171 Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro 1 5 10
CCT CTG AGT CAG GAA ACA TTT TCA GAC CTA TGG AAA CTA CTT CCT GAA 219 Pro Leu Ser Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu 15 20 25
AAC AAC GTT CTG TCC CCC TTG CCG TCC CAA GCA ATG GAT GAT TTG ATG 267 Asn Asn Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met 30 35 40
CTG TCC CCG GAC GAT ATT GAA CAA TGG TTC ACT GAA GAC CCA GGT CCA 315 Leu Ser Pro Asp Asp lie Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro 45 50 55 60
GAT GAA GCT CCC AGA ATG CCA GAG GCT GCT CCC CCC GTG GCC CCT GCA 363 Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala 65 70 75
CCA GCA GCT CCT ACA CCG GCG GCC CCT GCA CCA GCC CCC TCC TGG CCC 411 Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro 80 85 90
CTG TCA TCT TCT GTC CCT TCC CAG AAA ACC TAC CAG GGC AGC TAC GGT 459 Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly Ser Tyr Gly 95 100 105
TTC CGT CTG GGC TTC TTG CAT TCT GGG ACA GCC AAG TCT GTA ACT TGC 507 Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys 110 115 120
ACG TAC TCC CCT GCC CTC AAC AAG ATG TTT TGC CAA CTG GCC AAG ACC 555 Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys Gin Leu Ala Lys Thr 125 130 135 140
TGC CCT GTG CAG CTG TGG GTT GAT TCC ACA CCC CCG CCC GGC ACC CGC 603 Cys Pro Val Gin Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg 145 150 155
GTC CGC GCC ATG GCC ATC TAC AAG CAG TCA CAG CAC ATG ACG GAG GTT 651 Val Arg Ala Met Ala He Tyr Lys Gin Ser Gin His Met Thr Glu Val 160 165 170
GTG AGG CGC TGC CCC CAC CAT GAG CGC TGC TCA GAT AGC GAT GGT CTG 699 Val Arg Arg Cys Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu 175 180 185
GCC CCT CCT CAG CAT CTT ATC CGA GTG GAA GGA AAT TTG CGT GTG GAG 747 Ala Pro Pro Gin His Leu He Arg Val Glu Gly Asn Leu Arg Val Glu 190 195 200
TAT TTG GAT GAC AGA AAC ACT TTT CGA CAT AGT GTG GTG GTG CCC TAT 795 Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr 205 210 215 220
GAG CCG CCT GAG GTT GGC TCT GAC TGT ACC ACC ATC CAC TAC AAC TAC 843 Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr He His Tyr Asn Tyr 225 230 235
ATG TGT AAC AGT TCC TGC ATG GGC GGC ATG AAC CGG AGA CCC ATC CTC 891 Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn Arg Arg Pro He Leu 240 245 250
ACC ATC ATC ACA CTG GAA GAC TCC AGT GGT AAT CTA CTG GGA CGG AAC 939 Thr He He Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn 255 260 265
AGC TTT GAG GTG CGT GTT TGT GCC TGT CCT GGG AGA GAC CGG CGC ACA 987 Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr 270 275 280
GAG GAA GAG AAT CTC CGC AAG AAA GGG GAG CCT CAC CAC GAG CTG CCC 1035 Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro 285 290 295 300
CCA GGG AGC ACT AAG CGA GCA CTG CCC AAC AAC ACC AGC TCC TCT CCC 1083 Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro 305 310 315
CAG CCA AAG AAG AAA CCA CTG GAT GGA GAA TAT TTC ACC CTT CAG ATC 1131 Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gin He 320 325 330
CGT GGG CGT GAG CGC TTC GAG ATG TTC CGA GAG CTG AAT GAG GCC TTG 1179 Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu 335 340 345
GAA CTC AAG GAT GCC CAG GCT GGG AAG GAG CCA GGG GGG AGC AGG GCT 1227 Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala 350 355 360
CAC TCC AGC CAC CTG AAG TCC AAA AAG GGT CAG TCT ACC TCC CGC CAT 1275 His Ser Ser His Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His 365 370 375 380
AAA AAA CTC ATG TTC AAG ACA GAA GGG CCT GAC TCA GAC TGA 1317
Lys Lys Leu Met Phe Lys Thr Glu Gly Pro Asp Ser Asp 385 390
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gin 1 5 10 15
Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 20 25 30
Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu Ser Pro Asp 35 40 45
Asp He Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 50 55 60
Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 65 70 75 80
Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 85 90 95
Val Pro Ser Gin Lys Thr Tyr Gin Gly Ser Tyr Gly Phe Arg Leu Gly 100 105 110
Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 115 120 125
Ala Leu Asn Lys Met Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin 130 135 140
Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 145 150 155 160
Ala He Tyr Lys Gin Ser Gin His Met Thr Glu Val Val Arg Arg Cys 165 170 175
Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin 180 185 190
His Leu He Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 195 200 205
Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 210 215 220
Val Gly Ser Asp Cys Thr Thr He His Tyr Asn Tyr Met Cys Asn Ser 225 230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro He Leu Thr He He Thr 245 250 255
Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 260 265 270
Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 275 280 285
Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 290 295 300
Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gin Pro Lys Lys 305 310 315 320
Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gin He Arg Gly Arg Glu 325 330 335
Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp 340 345 350
Ala Gin Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 355 360 365
Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His Lys Lys Leu Met 370 375 380
Phe Lys Thr Glu Gly Pro Asp Ser Asp 385 390
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 341
(D) OTHER INFORMATION: /note= "Amino acid in position 341 can be He, Leu, Phe or Val"
(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 344
(D) OTHER INFORMATION: /note= "Amino acid in position 344 can be Ala, He, Leu, Phe, Trp or Tyr"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gin 1 5 10 15
Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 20 25 30
Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu Ser Pro Asp 35 40 45
Asp He Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 50 55 60
Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 65 70 75 80
Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 85 90 95
Val Pro Ser Gin Lys Thr Tyr Gin Gly Ser Tyr Gly Phe Arg Leu Gly 100 105 110
Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 115 120 125
Ala Leu Asn Lys Met Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin 130 135 140
Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 145 150 155 160
Ala He Tyr Lys Gin Ser Gin His Met Thr Glu Val Val Arg Arg Cys 165 170 175
Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin 180 185 190
His Leu He Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 195 200 205
Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 210 215 220
Val Gly Ser Asp Cys Thr Thr He His Tyr Asn Tyr Met Cys Asn Ser 225 230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro He Leu Thr He He Thr 245 250 255
Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 260 265 270
Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 275 280 285
Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 290 295 300
Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gin Pro Lys Lys 305 310 315 320
Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gin He Arg Gly Arg Glu 325 330 335
Arg Phe Glu Met Xaa Arg Glu Xaa Asn Glu Ala Leu Glu Leu Lys Asp 340 345 350
Ala Gin Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 355 360 365
Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His Lys Lys Leu Met 370 375 380
Phe Lys Thr Glu Gly Pro Asp Ser Asp 385 390
(2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 341
(D) OTHER INFORMATION: /note= "Amino acid in position 341 can be He, Leu, Phe or Val"
(ix) FEATURE:
(A) NAME/KEY: Modified-site
(B) LOCATION: 344
(D) OTHER INFORMATION: /note= "Amino acid in position 344 can be Ala, He, Leu, Phe, Trp or Tyr"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 :
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gin 1 5 10 15
Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 20 25 30
Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu Ser Pro Asp 35 40 45
Asp He Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 50 55 60
Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 65 70 75 80
Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 85 90 95
Val Pro Ser Gin Lys Thr Tyr Gin Gly Ser Tyr Gly Phe Arg Leu Gly 100 105 110
Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 115 120 125
Ala Leu Asn Lys Met Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin 130 135 140
Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 145 150 155 160
Ala He Tyr Lys Gin Ser Gin His Met Thr Glu Val Val Arg Arg Cys 165 170 175
Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin 180 185 190
His Leu He Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 195 200 205
Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 210 215 220
Val Gly Ser Asp Cys Thr Thr He His Tyr Asn Tyr Met Cys Asn Ser 225 230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro He Leu Thr He He Thr 245 250 255
Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 260 265 270
Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 275 280 285
Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 290 295 300
Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gin Pro Lys Lys 305 310 315 320
Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gin He Arg Gly Arg Glu 325 330 335
Arg Phe Glu Lys Xaa Arg Glu Xaa Asn Glu Ala Leu Glu Leu Lys Asp 340 345 350
Ala Gin Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 355 360 365
Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His Lys Lys Leu Met 370 375 380
Phe Lys Thr Glu Gly Pro Asp Ser Asp 385 390
Claims
1. A method for altering the three-dimensional structure of a selected protein without denaturing the protein, comprising the steps of:
(a) providing a selected protein with three-dimensional structure to be altered;
(b) identifying the hydrophobic residues within the protein;
(c) distinguishing the hydrophobic residues on the basis of side chain size into large and small hydrophobic residues;
(d) providing mutants of the protein having substitutions in the hydrophobic residues distinguished in (c); and
(e) assaying the mutant proteins for a switch in three-dimensional structure.
2. The method according to claim 1(c) wherein the hydrophobic residues distinguished are in a hydrophobic core.
3. The method according to claim 2 wherein step (d) wherein the mutants have a substitution of at least one large hydrophobic amino acid with a small hydrophobic amino acid.
4. The method according to claim 3 wherein the substitution is selected from the group consisting of: Trp with Phe; Trp with Tyr; Trp with Leu; Trp with He; Trp with Val; Phe with Leu; Phe with He; Phe with Val; Tyr with Leu; Tyr with He; Tyr with Val; Leu with He; and Leu with Val.
5. The method according to claim 3 wherein the mutants of (d) have a substitution of at least one small hydrophobic amino acid with a large hydrophobic amino acid.
6. The method according to claim 5 wherein the substitution is selected from the group consisting of: Val with Leu; Val with Phe; Val with Tyr; Val with Trp; He with Leu; He with Phe; He with Tyr; He with Trp; Leu with Phe; Leu with Tyr; and Leu with Trp.
7. A method for altering the three-dimensional structure of a selected non-linear protein having a hydrophobic core without denaturing the protein, comprising the steps of:
(a) providing a selected protein with three-dimensional structure to be altered;
(b) identifying the hydrophobic residues within the protein;
(c) distinguishing the hydrophobic residues on the basis of side chain size into large and small hydrophobic residues;
(d) providing mutants of the protein having substitutions in the hydrophobic residues distinguished in (c); and
(e) assaying the mutant proteins for a switch in three-dimensional structure.
8. The method according to any of claims 1 to 7, wherein the protein is selected from the group consisting of enzymes and members of the globin family.
9. The method according to any of claims 1 to 7, wherein the protein contains a p53 oligomerization domain.
10. An altered p53 oligomerization domain, wherein said domain has an altered three-dimensional structure produced according to the method of any of claims 1 to 7.
11. The altered p53 oligomerization domain according to claim 10 containing a substitution of Phe341 with a small amino acid and a substitution of a small amino acid with a large amino acid.
12. The altered p53 oligomerization domain to claim 10 or 11 wherein Leu344 is substituted with a large amino acid in the altered p53 domain.
13. The altered p53 oligomerization domain according to claim 10, wherein the altered p53 domain contains a pair of amino acids at positions 341 and 344, respectively, selected from the group consisting of: Val-Phe, Val-Trp, Ile-Phe, Ile-Trp, Leu-Phe, Leu-Trp, Phe-Trp, Val-Tyr, and Ile-Tyr.
14. The altered p53 oligomerization domain according to claim 10, wherein the three-dimensional structure of the altered p53 domain contains a substitution of Met at position 340 for Lys.
15. A fusion protein comprising an altered p53 oligomerization domain according to any of claims 10 to 14 fused to a heterologous protein.
16. A modified p53 protein comprising an altered p53 oligomerization domain according to any of claims 10 to 14.
17. The modified p53 protein according to claim 16, wherein p53 is modified to contain a substitution of residue Thr284 with Arg.
18. A nucleic acid sequence encoding any of the proteins of claims 10 to 17.
19. A vector comprising a nucleic acid sequence according to claim 18, wherein expression of the encoded protein is directed by regulatory control sequences.
20. A host cell transformed with the vector according to claim 19.
21. A pharmaceutical composition comprising a protein according to any of claims 10 to 17, a nucleic acid sequence according to claim 18, or a vector according to claim 19, and a pharmaceutically acceptable carrier.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU60278/98A AU6027898A (en) | 1997-01-17 | 1998-01-15 | Methods for altering three-dimensional protein structure and compositions produced thereby |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US3545897P | 1997-01-17 | 1997-01-17 | |
US60/035,458 | 1997-01-17 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO1998031703A1 WO1998031703A1 (en) | 1998-07-23 |
WO1998031703A9 true WO1998031703A9 (en) | 1998-12-17 |
Family
ID=21882801
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1998/000853 WO1998031703A1 (en) | 1997-01-17 | 1998-01-15 | Methods for altering three-dimensional protein structure and compositions produced thereby |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU6027898A (en) |
WO (1) | WO1998031703A1 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6388062B1 (en) | 1998-05-08 | 2002-05-14 | The Wistar Institute Of Anatomy And Biology | Modified p53 tetramerization domains having hydrophobic amino acid substitutions |
AU4849200A (en) * | 1999-05-12 | 2000-11-21 | Xencor, Inc. | Novel nucleic acids and proteins with p53 activity and altered tetramerization domains |
-
1998
- 1998-01-15 AU AU60278/98A patent/AU6027898A/en not_active Abandoned
- 1998-01-15 WO PCT/US1998/000853 patent/WO1998031703A1/en active Application Filing
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5932546A (en) | Peptides and compounds that bind to the thrombopoietin receptor | |
Sewell et al. | Bioactivity of metallothionein-3 correlates with its novel. beta. Domain sequence rather than metal binding properties | |
Geyer et al. | Structure of the anchor-domain of myristoylated and non-myristoylated HIV-1 Nef protein | |
Whiteheart et al. | SNAP family of NSF attachment proteins includes a brain-specific isoform | |
AU2002248609B2 (en) | IGF antagonist peptides | |
JP2008521759A (en) | Fusion protein containing BH3-domain sequence of BH3-only protein | |
JP2007259856A (en) | SrcSH3 BINDING PEPTIDE, METHOD FOR ISOLATING AND METHOD FOR UTILIZING THE SAME | |
JPH10503523A (en) | ASP ▲ B1 ▼ Insulin analog | |
WO2009155556A2 (en) | Crkl targeting peptides | |
Vidal et al. | Molecular and cellular analysis of Grb2 SH3 domain mutants: interaction with Sos and dynamin | |
JPH08508884A (en) | Macrophage inflammatory protein variant | |
JP2003530867A (en) | Thrombopoietin receptor modulating peptide | |
JP3995884B2 (en) | Isolated peptide corresponding to the amino acid sequence of NY-ESO-1 and binding to MHC class I and MHC class II molecules and method of use thereof | |
AU733964B2 (en) | Novel flt-3 receptor agonists | |
JP2000083689A (en) | Sialoadhesin family member-2 (saf-2) | |
US20050033035A1 (en) | Mutants of igf binding proteins and methods of production of antagonists thereof | |
US6030942A (en) | Peptides peptide analogs peptidomimetics and other small molecules useful for inhibiting the activity of ribonucleotide reductase | |
US20090087445A1 (en) | Method for the Redox Potential-Dependent Detection of Target Molecules by Interacting Polypeptides | |
WO1998031703A9 (en) | Methods for altering three-dimensional protein structure and compositions produced thereby | |
WO1998031703A1 (en) | Methods for altering three-dimensional protein structure and compositions produced thereby | |
Tzeng et al. | Solution structure of the human BTK SH3 domain complexed with a proline-rich peptide from p120cbl | |
WO2003014144A2 (en) | Molecule | |
AU777774B2 (en) | Novel method for identifying antibacterial compounds | |
JP2000125888A (en) | Sialoadhesin family member-3 | |
JP2002527037A (en) | Cytokine family member EF-7 |