DK2422201T3 - STRUCTURE OF THE C-terminal region of the insulin receptor ALPHA CHAIN AND THE INSULIN-LIKE GROWTH FACTOR RECEPTOR ALPHA CHAIN

Publication number
DK2422201T3
Authority
DK
Denmark
Prior art keywords
igf
leu
chain
glu
ser
Application number
DK10766481.5T
Other languages
Danish (da)
Inventor
Michael Colin Lawrence
Brian John Smith
John Gerbrandt Tasman Menting
Colin Wesley Ward
Original Assignee
Inst Medical W & E Hall
Application filed by Inst Medical W & E Hall

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00 ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/30 Drug targeting using structural data; Docking or binding prediction
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P19/00 Drugs for skeletal disorders
    • A61P19/08 Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease
    • A61P19/10 Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease for osteoporosis
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00 Drugs for disorders of the nervous system
    • A61P25/28 Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00 Drugs for disorders of the metabolism
    • A61P3/04 Anorexiants; Antiobesity agents
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00 Drugs for disorders of the metabolism
    • A61P3/08 Drugs for disorders of the metabolism for glucose homeostasis
    • A61P3/10 Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00 Antineoplastic agents
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00 Drugs for disorders of the cardiovascular system
    • A HUMAN NECESSITIES
    • A61 MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61P SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00 Drugs for disorders of the cardiovascular system
    • A61P9/10 Drugs for disorders of the cardiovascular system for treating ischaemic or atherosclerotic diseases, e.g. antianginal drugs, coronary vasodilators, drugs for myocardial infarction, retinopathy, cerebrovascular insufficiency, renal arteriosclerosis
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K14/00 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435 Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705 Receptors; Cell surface antigens; Cell surface determinants
    • C07K14/72 Receptors; Cell surface antigens; Cell surface determinants for hormones
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01N INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00 Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48 Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50 Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53 Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/566 Immunoassay; Biospecific binding assay; Materials therefor using specific carrier or receptor proteins as ligand binding reagents where possible specific carrier or receptor proteins are classified with their target compounds
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00 ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Organic Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Molecular Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Biophysics (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Hematology (AREA)
  • Biomedical Technology (AREA)
  • Theoretical Computer Science (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Diabetes (AREA)
  • Urology & Nephrology (AREA)
  • Cell Biology (AREA)
  • Biochemistry (AREA)
  • Endocrinology (AREA)
  • Physical Education & Sports Medicine (AREA)
  • Orthopedic Medicine & Surgery (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Toxicology (AREA)
  • Zoology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Rheumatology (AREA)
  • Crystallography & Structural Chemistry (AREA)

Description

Field of the Invention
[0001] The present invention relates generally to structural studies of the insulin binding site of the insulin receptor (IR) and the insulin-like growth factor 1 receptor (IGF-1R). More particularly, the present invention relates to methods of using the crystal structure of the low affinity insulin binding site of the IR ectodomain comprising the C-terminal region of the IR α-chain, as well as the corresponding region of IGF-1R, and related structural information to screen for and design compounds that interact with or modulate the function of IR and/or IGF-1R.
Background to the Invention
[0002] The insulin receptor (IR) and its homologue, the type 1 insulin-like growth factor receptor (IGF-1R), are closely related members of the tyrosine kinase receptor family and are large, transmembrane, glycoprotein dimers consisting of several structural domains.
[0003] The key role of the insulin receptor (IR) is in glucose uptake and metabolism by muscle and fat. Mouse knockout studies have also shown IR to be important in adipogenesis, neovascularization, the regulation of hepatic glucose synthesis and glucose-induced pancreatic insulin secretion (Kitamura et al., 2003). IR signalling is also important in the brain, being involved in the regulation of food intake, peripheral fat deposition and the reproductive endocrine axis as well as in learning and memory (Wada et al., 2005). Dysfunctional IR signalling has been implicated in diseases including type I and type II diabetes, dementia and cancer.
[0004] IR exists as two splice variant isoforms, IR-A and IR-B, which respectively lack or contain the 12 amino acids coded by exon 11. The longer variant, IR-B, is the isoform responsible for signalling metabolic responses. In contrast, IR-A signals predominantly mitogenic responses, is the preferentially expressed isoform in several cancers (Denley et al., 2003) and is capable of binding insulin-like growth factor 2 (IGF-II) with high affinity (Denley et al., 2004).
[0005] The sequence of IR is highly homologous to the sequence of IGF-1R, indicating that the three-dimensional structures of both receptors are most likely closely similar. The mature human IR and IGF-1R molecules are each homodimers comprising two α-chains and two β-chains, the α- and β-chains arising from the post-translational cleavage at the furin cleavage site at residues 720-723 (IR-A numbering with the mature N-terminal residue numbered 1) or 707-710 (IGF-1R). The structural organization of IR and IGF-1R has been reviewed extensively (Adams et al., 2000; De Meyts and Whittaker, 2002; Ward et al., 2003; Lawrence et al., 2007; Ward and Lawrence, 2009). The sequence relationship and domain organization of these receptors are presented in Figure 1.
[0006] The extracellular part of each IR or IGF-1R monomer contains (sequentially from N- to C-terminus) a leucine-rich repeat domain (L1), a cysteine-rich region (CR) and a second leucine-rich repeat domain (L2), followed by three fibronectin type III domains (FnIII-1, -2 and -3). The FnIII-2 domain contains a large insert domain (ID) of approximately 120 residues, within which lies the α-β cleavage site. Intracellularly, each monomer contains a tyrosine kinase catalytic domain flanked by two regulatory regions that contain the phosphotyrosine binding sites for signalling molecules. Each α-chain is linked to its partner β-chain via a disulphide bond between residues Cys647 and Cys860 (Sparrow et al., 1997) in the case of IR, or Cys633-Cys849 in the case of IGF-1R. The α-chains of both IR and IGF-1R are cross-linked by disulphide bonds in two places. The first is at Cys524 (IR) or Cys514 (IGF-1R) in the FnIII-1 domain, cross-linked to its counterpart in the opposite monomer, and the second involves one or more of the residues Cys682, Cys683 and Cys685 (IR) or Cys669, Cys670 and Cys672 (IGF-1R) in the insert region of each FnIII-2 domain, cross-linked to their counterparts in the opposite monomer (Sparrow et al., 1997).
[0007] The domains of IR and IGF-1R exhibit high (47-67%) amino acid sequence identity indicative of high conservation of three-dimensional structure. The crystal structure of the first three domains of IGF-1R (L1-CR-L2) has been determined (Garrett et al., 1998) and revealed that the L domains consist of a single-stranded right-handed β-helix (a helical arrangement of β-strands), while the cysteine-rich region is composed of eight related disulfide-bonded modules. The crystal structure of the first three domains of IR (L1-CR-L2) has also been determined (WO 07/147213, Lou et al., 2006) and as anticipated is closely similar to that of its IGF-1R counterpart. Other evidence for the close structural similarity of IR and IGF-1R arises from: (i) electron microscopic analyses (Tulloch et al., 1999); (ii) the fact that hybrid receptors (heterodimers of one IR monomer disulphide-bonded to one IGF-1R monomer) exist naturally and are commonly found in tissues expressing both receptors (Bailyes et al., 1997); and (iii) the fact that receptor chimeras can be constructed which have whole domains or smaller segments of polypeptide from one receptor replaced by the corresponding domain or sequence from the other (reviewed in Adams et al., 2000).
[0008] The current model for insulin binding proposes that, in the basal state, the IR homodimer contains two identical pairs of binding sites (referred to as Site 1 and Site 2) on each monomer (De Meyts and Whittaker, 2002; Schaffer, 1994; De Meyts, 1994; De Meyts, 2004; Kiselyov et al., 2009). Binding of insulin to a low affinity site (Site 1) on one α-subunit is followed by a second binding event between the bound insulin and a different region of the second IR α-subunit (Site 2). This ligand-mediated bridging between the two α-subunits generates the high affinity state that results in signal transduction. In contrast, soluble IR ectodomain, which is not tethered at its C-terminus, cannot generate the high affinity receptor-ligand complex. The soluble IR ectodomain can bind two molecules of insulin simultaneously at its two Site 1s, but only with low affinity (Adams et al., 2000). The model for IGF-I or IGF-II binding to IGF-1R is the same as that just described for insulin binding to IR and involves IGF-I (or IGF-II) binding to an initial low affinity site (Site 1) and subsequent cross-linking to a second site (Site 2) on the opposite monomer to form the high affinity state. However, the values of the kinetic parameters describing these events are somewhat different in the two systems (Surinya et al., 2008; Kiselyov et al., 2009).
[0009] While similar in structure, IGF-1R and IR serve different physiological functions. IGF-1R is expressed in almost all normal adult tissue except for liver, which is itself the major site of IGF-I production (Buttel et al., 1999). A variety of signalling pathways are activated following binding of IGF-I or IGF-II to IGF-1R, including Src and Ras, as well as downstream pathways, such as the MAP kinase cascade and the PI3K/AKT axis (Chow et al., 1998). IR is primarily involved in metabolic functions whereas IGF-1R mediates growth and differentiation. Consistent with this, ablation of IGF-I (i.e. in IGF-I knock-out mice) results in embryonic growth deficiency, impaired postnatal growth, and infertility. In addition, IGF-1R knock-out mice were only 45% of normal size and died of respiratory failure at birth (Liu et al., 1993). However, both insulin and IGF-I can induce both mitogenic and metabolic effects.
[0010] Various non-crystallographic 3-D structural analyses of the IR and the interaction of insulin with the IR have been undertaken using electron microscopic techniques (Luo et al., 1999; Ottensmeyer et al., 2000, 2001; Yip and Ottensmeyer, 2001). However, due to the low resolution information obtained (>20 angstrom), the conclusions of these studies have been questioned (De Meyts and Whittaker, 2002).
[0011] Crystal structures of the ectodomain of IR have been presented previously (WO 07/147213, McKern et al., 2006; Lou et al., 2006) and have elucidated some potential ligand/IR interactions, in particular part of the low affinity site on the surface of IR L1. However, an area of ambiguous electron density on the surface of the IR L1 domain could not be resolved (WO 07/147213, McKern et al., 2006). Accordingly, there is a need in the art to more fully resolve the structures of both IR and IGF-1R in order to elucidate all potential ligand/receptor interactions. This information would provide a more complete understanding of the mechanisms of action of both IR and IGF-1R necessary for the development of IR and IGF-1R agonists/antagonists.
Summary of the Invention
[0012] The present inventors have determined the crystal structure of the low affinity insulin binding site of human IR. In particular, the crystal structure of the low affinity insulin binding site of human IR ectodomain comprising the C-terminal region of the insulin receptor α-chain has been determined. This structure allows visualisation, for the first time, of the intact low affinity insulin receptor binding site region controlling the initial binding of insulin and the subsequent formation of the high affinity insulin-IR complex that leads to signal transduction. The structure shows, for the first time, the way in which the C-terminal region of the insulin receptor α-chain associates with the first leucine-rich repeat (L1) domain of the receptor to form the complete low affinity insulin binding site. The structure also provides direct insight, for the first time, into the way the so-called Site 1 insulin mimetic peptides bind to the low affinity binding site of the insulin receptor and also provides a basis for designing insulin mimetic peptides that interact with the low affinity insulin binding site of IR. The structural information presented also indicates, by analogy, the corresponding regions in the closely related IGF-1R that are involved in insulin-like growth factor (IGF) binding.
[0013] The identification of molecular structures having a high degree of specificity for only one of IR or IGF-1R is important in the development of efficacious and safe therapeutics. For example, a molecule developed as an insulin agonist should have little or no IGF-I activity in order to avoid the mitogenic activity of IGF-I and a potential for facilitating neoplastic growth. The determination of which regions of IR and IGF-1R have sufficient differences to confer selectivity for their respective ligands or for therapeutic molecules such as chemical entities or biological reagents is therefore an important and significant advancement. Similarly, it is believed that the ability to identify molecular structures that mimic the active binding regions of insulin and/or IGF-I and which impart selective agonist or antagonist activity will also aid and advance the development of new drugs.
[0014] To assist in the design of agonists/antagonists of IR and/or IGF-1R, the present inventors have used the structure of human IR ectodomain comprising the C-terminal region of the insulin receptor α-chain (Appendix I) to place a model of the C-terminal region of the insulin receptor α-chain in the 3D structure of IGF-1R ectodomain (Appendix II). The present inventors have also used these models to place a model of the C-terminal region of the IGF-1R α-chain in the 3D structure of IR ectodomain and IGF-1R ectodomain (Appendixes III and IV, respectively). The present inventors used all of these structures to place a model of an insulin mimetic peptide (S519C16) in the binding site of IR and IGF-1R (Appendixes V and VI, respectively). The models, with coordinates in Appendixes II to VI, are oriented relative to atomic coordinates found in Appendix I and may be used in conjunction with atomic coordinates of Appendix I to design a compound which binds to the insulin binding site of IR and/or a compound which binds to the IGF binding site of IGF-1R.
[0015] With regard to defining structures by combining subsets of coordinates from Appendix I to Appendix VI, such combinations may be achieved by methods such as assembling combinations of complete domains from each set, assembling combinations of complete domains from each set wherein the coordinates and corresponding amino acid sequence from one structure are transposed onto those of the other, or refining less resolved regions of one crystal using the corresponding coordinates of the other.
[0016] Accordingly, the present invention provides a computer assisted method of identifying, designing or screening for a compound that can potentially interact with insulin receptor (IR) and/or insulin-like growth factor-1 receptor (IGF-1R), comprising performing structure-based identification, design or screening of a compound based on the compound's interactions with a structure defined by the atomic coordinates of one or more of Appendixes I to VI, or a subset of atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R.
[0017] In one embodiment, the method comprises identifying, designing or screening for a compound which interacts with the three-dimensional structure of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendixes I, III and V, and/or (ii) the low affinity insulin-like growth factor (IGF) binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendixes II, IV and VI, wherein interaction of the compound with the structure is favoured energetically.
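To illustrate what "favoured energetically" could mean in practice, the following is a minimal sketch of a pairwise non-bonded interaction energy (a Lennard-Jones 12-6 term plus a Coulomb term). It is not the scoring function of the present disclosure, which is not specified here; the function names, the single shared `epsilon` and `sigma` values and the dielectric constant are assumptions chosen only for illustration.

```python
import math
from typing import Sequence, Tuple

# Each atom is (x, y, z, partial_charge); coordinates in angstrom, charge in e.
PointCharge = Tuple[float, float, float, float]

# Coulomb constant in kcal*angstrom/(mol*e^2).
COULOMB_KCAL = 332.06


def pair_energy(r: float, q1: float, q2: float,
                epsilon: float = 0.1, sigma: float = 3.5,
                dielectric: float = 4.0) -> float:
    """Lennard-Jones 12-6 plus Coulomb energy (kcal/mol) for one atom pair.

    epsilon, sigma and dielectric are illustrative placeholders, not
    parameters taken from the source text. r must be greater than zero.
    """
    lj = 4.0 * epsilon * ((sigma / r) ** 12 - (sigma / r) ** 6)
    coulomb = COULOMB_KCAL * q1 * q2 / (dielectric * r)
    return lj + coulomb


def interaction_energy(ligand: Sequence[PointCharge],
                       receptor: Sequence[PointCharge]) -> float:
    """Sum the pair energy over every ligand-receptor atom pair."""
    total = 0.0
    for lx, ly, lz, lq in ligand:
        for rx, ry, rz, rq in receptor:
            r = math.dist((lx, ly, lz), (rx, ry, rz))
            total += pair_energy(r, lq, rq)
    return total


def is_favoured(ligand: Sequence[PointCharge],
                receptor: Sequence[PointCharge]) -> bool:
    """Treat a negative total interaction energy as energetically favoured."""
    return interaction_energy(ligand, receptor) < 0.0
```

Under these assumptions a candidate pose would be retained when `is_favoured` returns `True`; a practical screen would use a validated force field or docking score instead of these placeholder parameters.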
[0018] In another embodiment, the method further comprises synthesising or obtaining an identified or designed candidate compound and determining the ability of the candidate compound to interact with IR and/or IGF-1R.
[0019] The atomic coordinates define one or more regions of the low affinity binding site of IR for insulin, and/or the low affinity binding site of IGF-1R for IGF, comprising the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R.
[0020] The C-terminal region of the α-chain of IR comprises amino acids 693 to 710 of IR α-chain (SEQ ID NO: 13).
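A subset of atomic coordinates representing this region could be extracted as sketched below. The sketch assumes the coordinates are available as PDB-format ATOM records (an assumption; the appendixes are not reproduced here), and the helper names `pdb_line`, `parse_atoms` and `select_residue_range` are hypothetical. The coordinates used in the usage note are placeholders, not values from any appendix.

```python
from typing import Iterable, List, NamedTuple, Optional


class Atom(NamedTuple):
    name: str
    res_name: str
    chain: str
    res_seq: int
    x: float
    y: float
    z: float


def pdb_line(serial: int, name: str, res_name: str, chain: str,
             res_seq: int, x: float, y: float, z: float) -> str:
    """Build a fixed-column PDB ATOM record (used here only for demo data)."""
    return (f"ATOM  {serial:5d} {name:<4s} {res_name:>3s} {chain}"
            f"{res_seq:4d}    {x:8.3f}{y:8.3f}{z:8.3f}")


def parse_atoms(lines: Iterable[str]) -> List[Atom]:
    """Parse ATOM/HETATM records using the standard PDB column positions."""
    atoms = []
    for line in lines:
        if not line.startswith(("ATOM  ", "HETATM")):
            continue
        atoms.append(Atom(
            name=line[12:16].strip(),
            res_name=line[17:20].strip(),
            chain=line[21],
            res_seq=int(line[22:26]),
            x=float(line[30:38]),
            y=float(line[38:46]),
            z=float(line[46:54]),
        ))
    return atoms


def select_residue_range(atoms: Iterable[Atom], first: int, last: int,
                         chain: Optional[str] = None) -> List[Atom]:
    """Keep atoms whose residue number lies in [first, last] (inclusive)."""
    return [a for a in atoms
            if first <= a.res_seq <= last
            and (chain is None or a.chain == chain)]
```

For example, `select_residue_range(parse_atoms(lines), 693, 710)` would return the atoms of residues 693 to 710, provided the file uses the same residue numbering as the text.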
[0021] In another preferred embodiment, the atomic coordinates defining the low affinity insulin binding site of IR further comprise the leucine-rich repeat 1 (L1) domain and/or the cysteine-rich (CR) domain of the IR ectodomain.
[0022] In yet another preferred embodiment, the atomic coordinates define portions of the molecular surface of the central β-sheet of the L1 domain and portions of the molecular surface of the second leucine-rich repeat (LRR) which contain Phe39 and/or the loop in the fourth LRR rung of the L1 domain.
[0023] In yet another preferred embodiment, the atomic coordinates define module 6 of the CR domain of IR.
[0024] In another embodiment, the atomic coordinates further define one or more amino acid sequences selected from IR amino acid residues 1-156, 157-310, 594 and 794.
[0025] In a preferred embodiment, the one or more amino acids selected from IR amino acid residues 1-156 comprise at least one amino acid selected from Arg14, Asn15, Gln34, Leu36, Leu37, Phe39, Pro43, Phe46, Leu62, Phe64, Leu87, Phe88, Phe89, Asn90, Phe96, Glu97, Arg118, Glu120 and His144.
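One way such a residue list might be derived from a coordinate set is a simple distance-based contact search between the receptor and a bound segment. The sketch below is illustrative only: the 4.0 angstrom cutoff, the `Atom` tuple layout and the function name `contact_residues` are assumptions, and the source does not state how the listed residues were selected.

```python
import math
from typing import List, NamedTuple, Sequence


class Atom(NamedTuple):
    res_name: str  # three-letter residue name, e.g. "PHE"
    res_seq: int   # residue number
    x: float
    y: float
    z: float


def contact_residues(receptor: Sequence[Atom], segment: Sequence[Atom],
                     cutoff: float = 4.0) -> List[str]:
    """Return receptor residues with any atom within `cutoff` angstrom of
    any atom of the segment, labelled like "Phe39" and sorted by number."""
    found = set()
    for a in receptor:
        for b in segment:
            if math.dist((a.x, a.y, a.z), (b.x, b.y, b.z)) <= cutoff:
                found.add((a.res_seq, a.res_name))
                break  # one close atom is enough for this receptor atom
    return [f"{name.capitalize()}{seq}" for seq, name in sorted(found)]
```

The all-pairs loop is adequate for a single binding site; for whole-ectodomain coordinate sets a spatial grid or k-d tree would be the usual choice.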
[0026] In another embodiment, the one or more amino acids selected from IR amino acid residues 157-310 comprise at least one of the amino acid sequences selected from 192-310, 227-303 and 259-284.
[0027] The crystal structure of the first three domains of the ectodomain of IGF-1R has been previously reported (WO 99/028347). The crystal structure of the first three domains of the ectodomain of IR was subsequently reported (WO 07/147213), enabling, for the first time, direct comparison of the regions controlling ligand specificity in the closely related IGF-1R and IR. However, the structure of the intact low affinity insulin binding site (i.e. inclusive of the C-terminal region of the receptor α-chain) could not be elucidated. As will be evident to the skilled person, the findings presented here on the intact insulin binding site of IR ectodomain structure, shape and orientation can be transposed onto the IGF binding site of IGF-1R ectodomain structure, shape and orientation.
[0028] The present invention has enabled the identification of previously unrecognised regions of the insulin binding site of IR ectodomain. By analogy, the present invention also identifies the equivalent regions in the IGF-1R, given the structural organisation of domains in the two receptors is effectively the same. The present invention has identified the critical regions of IR involved in the binding of insulin and in mediating the subsequent formation of the high affinity insulin-IR complex that leads to signal transduction. Once again, it will be evident to the skilled person that these findings can be transposed onto IGF-1R.
[0029] The present invention is therefore also useful in the identification and/or design of compounds which bind to the low affinity IGF binding site of IGF-1R.
[0030] In one embodiment, the atomic coordinates defining one or more regions of the low affinity binding site of IGF-1R for IGF comprise the C-terminal region of the α-chain of IGF-1R. The C-terminal region of the α-chain of IGF-1R comprises amino acids 681 to 697 of IGF-1R α-chain (SEQ ID NO: 15).
[0031] In another embodiment, the atomic coordinates defining the low affinity IGF binding site of IGF-1R further comprise the L1 domain and/or the CR domain of IGF-1R ectodomain.
[0032] In a preferred embodiment, the atomic coordinates define the central β-sheet of the L1 domain, and/or that part of the second LRR containing Ser35, and/or the loop in the fourth LRR rung of the L1 domain.
[0033] In another preferred embodiment, the atomic coordinates define module 6 of the CR domain of IGF-1R.
[0034] The mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R is S519C16 (SEQ ID NO: 18).
[0035] In a further embodiment, the compound substitutes for the C-terminal region of the α-chain of IR and/or the C-terminal region of the α-chain of IGF-1R in the formation of the low affinity binding site of IR or IGF-1R. Such compounds may act as either agonists or antagonists of these receptors. In one alternative of this embodiment, insulin and/or IGF-1R binds the low affinity binding site of IR and/or IGF-1R in the presence of the compound. In another alternative of this embodiment, insulin and/or IGF-1R does not bind, or has reduced binding to, the low affinity binding site of IR and/or IGF-1R in the presence of the compound.
[0036] In another embodiment, a candidate compound for interacting with IR and/or IGF-1R is chemically modified as a result of structure-based evaluation.
[0037] In a further embodiment, the chemical modification is designed to either: (i) reduce the potential for the candidate compound to bind to IR whilst maintaining binding to IGF-1R; or (ii) reduce the potential for the candidate compound to bind to IGF-1R, whilst maintaining binding to IR.
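Alternative (ii) can be sketched as a filter over predicted binding scores for a parent compound and its modified variants. Everything in this sketch is hypothetical: the `Scores` type, the convention that lower scores mean tighter predicted binding, and the `keep_tolerance` and `min_loss` thresholds are assumptions for illustration and are not taken from the source.

```python
from typing import Dict, List, NamedTuple


class Scores(NamedTuple):
    ir: float     # predicted binding score against IR (lower = tighter)
    igf1r: float  # predicted binding score against IGF-1R (lower = tighter)


def ir_selective_variants(parent: Scores, variants: Dict[str, Scores],
                          keep_tolerance: float = 0.5,
                          min_loss: float = 1.0) -> List[str]:
    """Return names of variants that keep IR binding within `keep_tolerance`
    of the parent while worsening the IGF-1R score by at least `min_loss`."""
    selected = []
    for name, s in variants.items():
        keeps_ir = s.ir <= parent.ir + keep_tolerance
        loses_igf1r = s.igf1r >= parent.igf1r + min_loss
        if keeps_ir and loses_igf1r:
            selected.append(name)
    return sorted(selected)
```

Alternative (i) follows by swapping the roles of the two score fields. In either case the predicted scores would need experimental confirmation, as contemplated in paragraph [0018].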
[0038] Candidate compounds and compounds identified or designed using a method of the present invention may be any suitable compound, including naturally occurring compounds, de novo designed compounds, library generated compounds (chemically or recombinantly generated), mimetics etc., and include organic compounds, new chemical entities, antibodies, binding proteins other than antibody-based molecules (nonimmunoglobulin proteins) including, for example, protein scaffolds such as lipocalins, designed ankyrin repeat proteins (DARPins, Stumpp et al., 2007) and protein A domains (reviewed in Binz et al., 2005), avimers (Silverman et al., 2005), and other new biological entities such as nucleic acid aptamers (reviewed in Ulrich, 2006).
[0039] The present invention is also useful for improving the properties of known ligands for the low affinity binding sites of IR and/or IGF-1R. For example, existing IR or IGF-1R low affinity binding site ligands can be screened against the 3D structure of the insulin binding site of IR ectodomain or a region of the insulin binding site of IR ectodomain defined by the atomic coordinates of Appendix I or a portion thereof (optionally utilising the atomic coordinates given in Appendixes II to VI to further refine the screen and/or the assessment of the potential to energetically interact with IR), and an assessment made of the potential to energetically interact with the insulin binding site of IR.
[0040] Thus, the present invention also provides a computer assisted method for redesigning a compound which is known to bind to IR and/or IGF-1R comprising performing structure-based evaluation of the compound based on the compound's interactions with a structure defined by the atomic coordinates of one or more of Appendixes I to VI, or a subset of atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, and redesigning or chemically modifying the compound as a result of the evaluation.
[0041] In one embodiment, the compound which is known to bind to IR and/or IGF-1R is redesigned or chemically modified to (i) improve affinity for binding to IR, and/or (ii) lower affinity for binding to IGF-1R.
[0042] In another embodiment, the compound which is known to bind to IR and/or IGF-1R is redesigned or chemically modified to (i) improve affinity for binding to IGF-1R, and/or (ii) lower affinity for binding to IR.
[0043] When screening potential ligands or compounds for selectivity for binding to the insulin binding site of IR or IGF-1R, it will be important to concentrate on those areas of difference in the 3D structure between the low affinity binding site of ectodomains of IR and IGF-1R. Such areas are identified and described herein. In particular, it will be important to concentrate on those areas of difference which are identified as being potentially important in the binding of insulin to the receptors.
[0044] Accordingly, in a further embodiment the compound is redesigned or modified so as to lower the affinity to IR or IGF-1R by virtue of the structural differences between IR and IGF-1R at or in the vicinity of the C-terminal region of the α-chain of IR and the C-terminal region of the α-chain of IGF-1R.
[0045] Also described herein is a computer system for identifying one or more compounds that can potentially interact with IR and/or IGF-1R, the system containing data representing the structure of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendixes I, III and V; (ii) the low affinity IGF binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendixes II, IV and VI; and/or (iii) the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, the structure being defined by a subset of atomic coordinates shown in one or more of Appendixes I to VI.
[0046] Also described is a computer-readable medium having recorded thereon data representing a model and/or the atomic coordinates as shown in one or more of Appendixes I to VI, or a subset of atomic coordinates of one or more thereof at least representing: (i) the C-terminal region of the α-chain of IR; (ii) the C-terminal region of the α-chain of IGF-1R; and/or (iii) a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, as any one of (i) to (iii) associates with IR and/or IGF-1R.
[0047] Further described is a set of coordinates as shown in one or more of Appendixes I to VI, or a subset of atomic coordinates of one or more thereof at least representing: (i) the C-terminal region of the α-chain of IR; (ii) the C-terminal region of the α-chain of IGF-1R; and/or (iii) a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, as any one of (i) to (iii) associates with IR and/or IGF-1R.
[0048] The three-dimensional structure of the C-terminal region of the IR and/or IGF-1R α-chain may be used to develop models useful for drug design and/or in silico screening of candidate compounds that interact with and/or modulate IR and/or IGF-1R. Other physicochemical characteristics may also be used in developing the model, e.g. bonding, electrostatics, etc.
[0049] Generally the term "in silico" refers to the creation in a computer memory, i.e., on a silicon or other like chip. Stated otherwise "in silico" means "virtual". When used herein the term "in silico" is intended to refer to screening methods based on the use of computer models rather than in vitro or in vivo experiments.
[0050] Accordingly, the present invention also provides a computer-assisted method of identifying a compound that potentially interacts with IR and/or IGF-1R, which method comprises fitting the structure of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendixes I, III and V; (ii) the low affinity IGF binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendixes II, IV and VI; and/or (iii) the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, the structure being defined by a subset of atomic coordinates shown in one or more of Appendixes I to VI, to the structure of a candidate compound.
[0051] Also provided by the present invention is a computer-assisted method for identifying a compound able to interact with IR and/or IGF-1R using a programmed computer comprising a processor, which method comprises the steps of: (a) generating, using computer methods, a set of atomic coordinates of a structure that possesses energetically favourable interactions with the atomic coordinates of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendixes I, III and V; (ii) the low affinity IGF binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendixes II, IV and VI; and/or (iii) the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, the structure being defined by a subset of atomic coordinates shown in one or more of Appendixes I to VI, which coordinates are entered into the computer thereby generating a criteria data set; (b) comparing, using the processor, the criteria data set to a computer database of chemical structures; (c) selecting from the database, using computer methods, chemical structures which are complementary or similar to a region of the criteria data set; and optionally, (d) outputting, to an output device, the selected chemical structures which are complementary to or similar to a region of the criteria data set.
[0052] The present invention further provides a computer-assisted method for identifying potential mimetics of IR and/or IGF-1R using a programmed computer comprising a processor, the method comprising the steps of: (a) generating a criteria data set from a set of atomic coordinates of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendixes I, III and V; (ii) the low affinity IGF binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendixes II, IV and VI; and/or (iii) the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, the structure being defined by a subset of atomic coordinates shown in one or more of Appendixes I to VI, which coordinates are entered into the computer; (b) (i) comparing, using the processor, the criteria data set to a computer database of chemical structures stored in a computer data storage system and selecting from the database, using computer methods, chemical structures having a region that is structurally similar to the criteria data set; or (ii) constructing, using computer methods, a model of a chemical structure having a region that is structurally similar to the criteria data set; and, optionally, (c) outputting to an output device: (i) the selected chemical structures from step (b)(i) having a region similar to the criteria data set; or (ii) the constructed model from step (b)(ii).
[0053] The present invention further provides a method for evaluating the ability of a compound to interact with IR and/or IGF-1R, the method comprising the steps of: (a) employing computational means to perform a fitting operation between the compound and the binding surface of a computer model of the low affinity binding site for insulin on IR ectodomain, and/or the low affinity binding site for IGF on IGF-1R ectodomain, using atomic coordinates wherein the root mean square deviation between the atomic coordinates and a subset of atomic coordinates of one or more of Appendixes I to VI or a subset of atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, is not more than 1.5 Å; and (b) analysing the results of the fitting operation to quantify the association between the compound and the binding surface model.
[0054] The present invention also provides a method of using molecular replacement to obtain structural information about a molecule or a molecular complex of unknown structure, comprising the steps of: (i) generating an X-ray diffraction pattern of the crystallized molecule or molecular complex; and (ii) applying the atomic coordinates of one or more of Appendixes I to VI, or a subset of atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, to the X-ray diffraction pattern to generate a three-dimensional electron density map of at least a region of the molecule or molecular complex whose structure is unknown.
[0055] Also described herein is a compound that binds to IR and/or IGF-1R ectodomain designed, redesigned or modified using the methods of the invention. Preferably, such compounds have an affinity (Kd) for IR and/or IGF-1R of less than 10⁻⁵ M. In a particularly preferred example, the compound binds to the low affinity binding site of IR and/or to the low affinity binding site of IGF-1R.
[0056] Further described is an isolated peptide or mimetic thereof which binds the L1 domain of IR and/or the L1 domain of IGF-1R, the peptide comprising: (i) an amino acid sequence as provided in SEQ ID NO: 13 or SEQ ID NO: 15; (ii) an amino acid sequence which is at least 50% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, to SEQ ID NO: 13 and/or SEQ ID NO: 15; or (iii) a fragment of i) or ii) which binds the L1 domain of IR and/or the L1 domain of IGF-1R, wherein the peptide has a helical structure.
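By way of illustration only, the percentage identity thresholds recited above could be checked along the following lines. The sketch assumes that the candidate and reference sequences have already been aligned to equal length (gaps written as "-") and that identity is counted over the alignment length; the function name, that choice of denominator and the example sequences are hypothetical and are not taken from SEQ ID NO: 13 or SEQ ID NO: 15.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of alignment columns holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    # A gap aligned against a gap is not counted as a match.
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


# Hypothetical pre-aligned 10-residue peptides differing at one position.
reference = "ACDEFGHIKL"
candidate = "ACDEYGHIKL"
identity = percent_identity(reference, candidate)
print(identity)          # 90.0
print(identity >= 90.0)  # True: meets the "at least 90% identical" level
print(identity >= 95.0)  # False: does not meet the "at least 95% identical" level
```

Other conventions (for example, dividing by the length of the shorter sequence) would give different values for gapped alignments, so the convention in use should be stated alongside any reported figure.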
[0057] Further described is an isolated polynucleotide encoding the isolated peptide or mimetic thereof, as well as a vector comprising said polynucleotide and a host cell comprising said vector.
[0058] Additionally described herein is a composition comprising a compound, a peptide or mimetic, and/or a polynucleotide as described herein, and optionally an acceptable carrier or diluent, more preferably a pharmaceutically acceptable carrier or diluent.
[0059] Further described herein is a method for preventing or treating a disease associated with aberrant IR and/or IGF-1R functioning and/or signalling, the method comprising administering to a subject in need thereof a compound, a peptide or mimetic, and/or a polynucleotide as described herein.
[0060] Also described herein is use of a compound, a peptide or mimetic, and/or a polynucleotide as described herein, for the manufacture of a medicament for treating a disease in a subject associated with aberrant IR and/or IGF-1R functioning and/or signalling.
[0061] Examples of diseases associated with aberrant IR and/or IGF-1R functioning and/or signalling include, but are not limited to, obesity, type I and type II diabetes, cardiovascular disease, osteoporosis, dementia and cancer.
[0062] Also described are manufacturing steps such as incorporating the compound, such as a peptide, into a pharmaceutical composition in the manufacture of a medicament.
[0063] Throughout this specification, preferred aspects and embodiments apply, as appropriate, separately, or in combination, to other aspects and embodiments, mutatis mutandis, whether or not explicitly stated as such.
[0064] The present invention will now be described further with reference to the following examples, which are illustrative only and non-limiting.
Brief Description of the Figures [0065] Some figures contain colour representations or entities. Coloured versions of the figures are available from the Patentee upon request or from an appropriate Patent Office. A fee may be imposed if obtained from a Patent Office.
Figure 1. Sequence alignment of the ectodomains of human insulin receptor (IR, exon 11- isoform) and human IGF-1 receptor (IGF-1R). Residues conserved between the sequences are indicated by vertical bars and potential N-linked glycosylation sites are indicated by shading. Disulphide links are indicated by square braces above the alignment. Sequence sources were: IR (Ullrich et al., 1985), human type 1 IGF receptor (Ullrich et al., 1986).
Figure 2. ITC curves for the titration of (a) IR classical aCT peptide against IR485, (b) IGF-1R classical aCT peptide against IR485, and (c) IR classical aCT.714A peptide against IR485.
Figure 3. ITC curves for the titration of (a) ZFP-insulin against IR485 pre-complexed with a 10-fold molar ratio of IR classical aCT peptide, (b) ZFP-insulin against IR485 pre-complexed with a 10-fold molar ratio of IGF-1R classical aCT peptide, (c) IGF-1R against IR485 pre-complexed with a 10-fold molar ratio of IR classical aCT peptide, and (d) IGF-1R against IR485 pre-complexed with a 10-fold molar ratio of IGF-1R classical aCT peptide.
Figure 4. ITC curves for the titration of (a) S519C16 against IR485, (b) S519C16 against IR485 pre-complexed with a 10-fold molar ratio of IR classical aCT peptide, (c) S519N20 against IR485, and (d) S519 against IR485.
Figure 5. Dynamic light scattering volume distribution curves obtained from samples of (a) IR485 at 6 mg/ml, (b) IR485 at 0.5 mg/ml; (c) IR485 at 6 mg/ml plus a 3-fold molar ratio of IR aCT peptide, (d) IR485 at 6 mg/ml plus a 3-fold molar ratio of IR classical aCT peptide and a 2-fold molar ratio of ZFP-insulin.
Figure 6. The crystal structure of IR ectodomain comprising the C-terminal region of the α-chain of IR. (a) Negative B-factor enhanced (Fo-Fc) electron density overlaid with the final model of IR residues 693-710; (b) Detail of the interaction between IR residues 693-710 (yellow backbone, green carbons, non-bold numbering) and the surface of L1-β2 (pink backbones, cyan carbons, bold numbering); (c) Sequence alignment of the C-terminal regions of the α-chains of IR and IGF-1R and the S519C16 peptide (Menting et al., 2009). Shaded regions show conservation between the three sequences and boxed regions show segments predicted to be helical in conformation (Menting et al., 2009).
Figure 7. Model structure of the C-terminal region of IR α-chain bound to the L1 domain of IGF-1R (Appendix II) generated from the crystal structure of IR ectodomain inclusive of residues 693-710. The backbone of the respective L1 domain is shown as an orange coil, the side chains of residues within the L1 domain that interact with the respective bound peptide are shown with green carbon atoms, red oxygen atoms and blue nitrogen atoms, and the backbone of the bound peptide helix is shown as a blue coil. Selected peptide residues that interact with the L1 domain are shown with cyan carbon atoms, red oxygen atoms and blue nitrogen atoms. The remaining peptide residues, which have more limited or no interaction with the L1 domain, are represented only by their α-carbon atoms shown as spheres embedded in the peptide coil, with other atoms within these residues omitted for clarity. Residues that lie in the C-terminal region of IR α-chain are underlined for clarity.
Figure 8. Model structure of the C-terminal region of IGF-1R α-chain bound to the L1 domain of IR (Appendix III). Colouring and style is as described above for Figure 7.
Figure 9. Model structure of the C-terminal region of IGF-1R α-chain bound to the L1 domain of IGF-1R (Appendix IV). Colouring and style is as described above for Figure 7.
Figure 10. Model structure of the S519C16 peptide bound to the L1 domain of IR (Appendix V). Colouring and style is as described above for Figure 7.
Figure 11. Model structure of the S519C16 peptide bound to the L1 domain of IGF-1R (Appendix VI). Colouring and style is as described above for Figure 7.
Figure 12. Sample isothermal titration calorimetry curves obtained for the titration against insulin mini-receptor IR485 of N-terminally biotinylated aCT peptide 698-719 containing the following respective mutations: (A) wild type, (B) T704Y, (C) R702W, (D) R702Y, (E) T704W and (F) R702Y / T704W.
Key to the Sequence Listing [0066]
SEQ ID NO: 1 - Amino acid sequence of mature human insulin receptor ectodomain (isoform A).
SEQ ID NO: 2 - Amino acid sequence of mature human insulin receptor ectodomain (isoform B).
SEQ ID NO: 3 - Amino acid sequence of mouse insulin receptor.
SEQ ID NO: 4 - Amino acid sequence of rhesus monkey insulin receptor, predicted.
SEQ ID NO: 5 - Amino acid sequence of bovine insulin receptor, predicted.
SEQ ID NO: 6 - Amino acid sequence of mature human insulin-like growth factor receptor 1 (IGF-1R) ectodomain.
SEQ ID NO: 7 - Amino acid sequence of mouse insulin-like growth factor receptor 1 (IGF-1R).
SEQ ID NO: 8 - Amino acid sequence of rhesus monkey insulin-like growth factor receptor 1 (IGF-1R), predicted.
SEQ ID NO: 9 - Amino acid sequence of bovine insulin-like growth factor receptor 1 (IGF-1R), predicted.
SEQ ID NO: 10 - Amino acid sequence of IR485.
SEQ ID NO: 11 - Amino acid sequence of the classical α-chain C-terminal peptide (aCT) of human IR.
SEQ ID NO: 12 - Amino acid sequence of the F714A mutant of the classical α-chain C-terminal peptide (aCT) of human IR.
SEQ ID NO: 13 - Amino acid sequence of the C-terminal region of the α-chain of human IR.
SEQ ID NO: 14 - Amino acid sequence of the classical α-chain C-terminal peptide (aCT) of human IGF-1R.
SEQ ID NO: 15 - Amino acid sequence of the C-terminal region of the α-chain of human IGF-1R.
SEQ ID NO: 16 - Amino acid sequence of the S519 peptide.
SEQ ID NO: 17 - Amino acid sequence of the S519N20 peptide.
SEQ ID NO: 18 - Amino acid sequence of the S519C16 peptide.
SEQ ID NO: 19 - FYXWF motif.
Detailed Description of the Invention [0067] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art (e.g. in molecular biology, biochemistry, structural biology, and computational biology). Standard techniques are used for molecular and biochemical methods (see generally, Sambrook et al., 2001, and Ausubel et al., 1999) and chemical methods.
[0068] Throughout this specification the word "comprise", or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.
IR ectodomain crystals and crystal structure [0069] The present invention relates to use of a crystal comprising a C-terminal region of the IR α-chain based on the IRΔβ construct (see Examples).
[0070] As used herein, the term "crystal" means a structure (such as a three dimensional (3D) solid aggregate) in which the plane faces intersect at definite angles and in which there is a regular structure (such as internal structure) of the constituent chemical species. The term "crystal" refers in particular to a solid physical crystal form such as an experimentally prepared crystal.
[0071] Crystals may be prepared using any IR ectodomain, i.e. the IR polypeptide containing the extracellular domain and lacking the transmembrane domain and the intracellular tyrosine kinase domain. Typically, the extracellular domain comprises residues 1 to 917 (mature receptor numbering) of human IR, or the equivalent thereof together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
[0072] In a preferred embodiment the IR polypeptide is human IR (SEQ ID NOs: 1 and 2). However, the IR polypeptide may also be obtained from other species, such as other mammalian, vertebrate or invertebrate species. Examples of IR polypeptides from other species are given in SEQ ID NOs: 3 to 5.
[0073] Crystals may be constructed with wild-type IR polypeptide ectodomain sequences or variants thereof, including allelic variants and naturally occurring mutations as well as genetically engineered variants. Typically, variants have at least 95 or 98% sequence identity with a corresponding wild-type IR ectodomain polypeptide.
[0074] Optionally, the crystal of IR ectodomain may comprise one or more molecules which bind to the ectodomain, or which are otherwise soaked into the crystal or cocrystallised with IR ectodomain. Such molecules include ligands or small molecules, which may be candidate pharmaceutical agents intended to modulate the interaction between IR and its biological targets. The crystal of IR ectodomain may also be a molecular complex with other receptors of the IGF receptor family such as IGF-1R. The complex may also comprise additional molecules such as the ligands to these receptors.
[0075] The production of IR ectodomain crystals is described below.
[0076] In a preferred embodiment, an IR ectodomain crystal comprising the C-terminal segment of the IR α-chain has the atomic coordinates set forth in Appendix I. As used herein, the term "atomic coordinates" or "set of coordinates" refers to a set of values which define the position of one or more atoms with reference to a system of axes. It will be understood by those skilled in the art that atomic coordinates may be varied, without affecting significantly the accuracy of models derived therefrom.
[0077] It will be understood that any reference herein to the atomic coordinates or subset of the atomic coordinates shown in Appendix I shall include, unless specified otherwise, atomic coordinates having a root mean square deviation of backbone atoms of not more than 1.5 Å, preferably not more than 1 Å, when superimposed on the corresponding backbone atoms described by the atomic coordinates shown in Appendix I. Also, any reference to the atomic coordinates or subset of the atomic coordinates shown in Appendixes II to VI shall include, unless specified otherwise, atomic coordinates having a root mean square deviation of backbone atoms of not more than 2.5 Å when superimposed on the corresponding backbone atoms described by the atomic coordinates shown in Appendixes II to VI.
[0078] The following defines what is intended by the term "root mean square deviation (RMSD)" between two data sets. For each element in the first data set, its deviation from the corresponding item in the second data set is computed. The squared deviation is the square of that deviation, and the mean squared deviation is the mean of all these squared deviations. The root mean square deviation is the square root of the mean squared deviation.
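The definition above can be sketched in a few lines of code. This is a minimal illustration, assuming each data set is a list of (x, y, z) positions in Å with atoms listed in corresponding order; the function name and the example coordinates are hypothetical, and no superposition is performed before the deviations are taken.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally sized lists of (x, y, z) points."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    # Squared deviation of each atom from its counterpart in the other set.
    squared = [
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    ]
    # Mean of the squared deviations, then the square root of that mean.
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical coordinates in Å: every atom is displaced by exactly 1 Å along z.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
displaced = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (3.0, 0.0, 1.0)]
print(rmsd(reference, displaced))  # 1.0
```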
[0079] Preferred variants are those in which the RMSD of the x, y and z coordinates for all backbone atoms other than hydrogen is less than 1.5 Å (preferably less than 1 Å, 0.7 Å or less than 0.3 Å) compared with the coordinates given in Appendix I. It will be readily appreciated by those skilled in the art that a 3D rigid body rotation and/or translation of the atomic coordinates does not alter the structure of the molecule concerned.
[0080] In a highly preferred embodiment, the crystal has the atomic coordinates as shown in Appendix I.
[0081] The present invention also relates to a crystal structure of the low affinity insulin binding site of IR ectodomain polypeptide comprising the C-terminal region of the IR α-chain, or a region thereof.
[0082] The atomic coordinates obtained experimentally for amino acids 4 to 655, 693 to 710 (the "C-terminal region of the IR α-chain"), and 755 to 909 of human IR-A (mature receptor numbering; SEQ ID NO: 1) are shown in Appendix I. However, a person skilled in the art will appreciate that a set of atomic coordinates determined by X-ray crystallography is not without standard error. Accordingly, any set of structure coordinates for an IR ectodomain polypeptide comprising the C-terminal region of the IR α-chain that has a root mean square deviation of protein backbone atoms of less than 0.75 Å when superimposed (using backbone atoms) on the atomic coordinates listed in Appendix I shall be considered identical.
[0083] The present invention also comprises the atomic coordinates of the C-terminal region of the IR α-chain that substantially conforms to the atomic coordinates listed in Appendix I.
[0084] A structure that "substantially conforms" to a given set of atomic coordinates is a structure wherein at least about 50% of such structure has an RMSD of less than about 1.5 Å for the backbone atoms in secondary structure elements in each domain, and more preferably, less than about 1.3 Å for the backbone atoms in secondary structure elements in each domain, and, in increasing preference, less than about 1.0 Å, less than about 0.7 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å for the backbone atoms in secondary structure elements in each domain.
[0085] In a more preferred embodiment, a structure that substantially conforms to a given set of atomic coordinates is a structure wherein at least about 75% of such structure has the recited RMSD value, and more preferably, at least about 90% of such structure has the recited RMSD value, and most preferably, about 100% of such structure has the recited RMSD value.
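One possible reading of the "substantially conforms" criterion is sketched below, for illustration only. It assumes that the two structures have already been superimposed and that a per-atom deviation (in Å) is available for each matched backbone atom; the criterion is then taken to be met if the best-fitting stated fraction of those atoms has an RMSD below the cutoff. The function names, this interpretation of "at least about 50% of such structure" and the example deviations are assumptions, not definitions taken from the specification.

```python
import math


def best_fraction_rmsd(deviations, fraction):
    """RMSD over the best-fitting `fraction` of atoms, given per-atom deviations in Å."""
    if not deviations or not 0.0 < fraction <= 1.0:
        raise ValueError("need at least one deviation and a fraction in (0, 1]")
    count = math.ceil(fraction * len(deviations))
    # Keep the `count` smallest squared deviations, i.e. the best-fitting atoms.
    best = sorted(d * d for d in deviations)[:count]
    return math.sqrt(sum(best) / count)


def substantially_conforms(deviations, fraction=0.5, cutoff=1.5):
    """True if the best-fitting fraction of atoms has an RMSD below the cutoff (Å)."""
    return best_fraction_rmsd(deviations, fraction) < cutoff


# Hypothetical per-atom deviations: four well-fitted atoms and two outliers.
deviations = [0.2, 0.4, 0.3, 0.5, 4.0, 6.0]
print(substantially_conforms(deviations, fraction=0.5))  # True
print(substantially_conforms(deviations, fraction=1.0))  # False
```

Raising `fraction` to 0.75, 0.9 or 1.0 and lowering `cutoff` corresponds to the progressively more preferred embodiments recited above.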
[0086] In an even more preferred embodiment, the above definition of "substantially conforms" can be extended to include atoms of amino acid side chains. As used herein, the phrase "common amino acid side chains" refers to amino acid side chains that are common to both the structure which substantially conforms to a given set of atomic coordinates and the structure that is actually represented by such atomic coordinates.
[0087] According to the present invention the C-terminal region of the IR ectodomain α-chain spans residues 693 to 710 (SEQ ID NO: 13).
[0088] As used herein, the term "IR ectodomain" refers to the extracellular domain of IR lacking the transmembrane domain and the intracellular tyrosine kinase domain of IR, typically comprising residues 1 to 917 (mature IR-A receptor numbering) of human IR, or the equivalent thereof, together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
[0089] As used herein, the term "low affinity binding site" for IR means the regions of IR involved in forming the low affinity binding site (also known as "Site 1") of IR for insulin, comprising the C-terminal region of the IR α-chain and additionally one or both of the L1 domain of IR and the CR domain of IR. Insulin binding to the low affinity binding site of IR induces formation of the high affinity insulin binding site of IR and subsequent signal transduction.
[0090] As used herein, the term "C-terminal region" of the IR α-chain refers to amino acids 693-710 of isoform A (IR-A) of the human IR α-chain as given in SEQ ID NO: 13, with numbering according to mature isoform A of human IR (SEQ ID NO: 1).
[0091] As used herein, the term "classical α-chain C-terminal peptide", or "aCT", refers in IR to a region of the C-terminal α-chain of IR previously described in the literature as being important for insulin binding (Kurose et al., 1994; Kristensen et al., 2002), and comprising amino acids 704-719 (mature IR-A receptor numbering) as given in SEQ ID NO: 11.
[0092] As used herein, the term "leucine-rich repeat domain 1" or "L1 domain" refers in IR to a leucine-rich domain comprising amino acids 1-156 of mature human IR (SEQ ID NO: 1). The L1 domain of IR comprises a central β-sheet, which comprises amino acids selected from 10-15, 32-37, 60-65, 88-97, 116-121 and 142-147 of mature human IR (SEQ ID NO: 1).
[0093] As used herein, the term "leucine-rich repeat domain 2" or "L2 domain" refers in IR to a leucine-rich domain comprising amino acids 310-469 of mature human IR (SEQ ID NO: 1).
[0094] As used herein, the term "loop in the fourth leucine-rich repeat (LRR) rung of the L1 domain", or variations thereof, refers in IR to a leucine-rich domain comprising amino acids 85-91 of mature human IR (SEQ ID NO: 1).
[0095] As used herein, the term "cysteine-rich domain" or "CR domain" refers in IR to a cysteine-rich domain comprising amino acids 157-309 of mature human IR (SEQ ID NO: 1). The CR domain contains many different modules. As used herein, the term "module 6 of the CR domain" refers in IR to amino acids 256-286 of mature human IR (SEQ ID NO: 1).
IGF-1R ectodomain structure [0096] Due to the high sequence homology and structural similarity between IR and IGF-1R, the present invention also provides a model for the C-terminal region of IGF-1R α-chain as it associates with IGF-1R to form the low affinity IGF binding site. According to the present invention the C-terminal region of the IGF-1R ectodomain α-chain comprises residues 681-697 (SEQ ID NO: 15).
[0097] As used herein, the term "IGF-1R ectodomain" refers to the extracellular domain of IGF-1R lacking the transmembrane domain and the intracellular tyrosine kinase domain of IGF-1R, typically comprising residues 1 to 905 (mature receptor numbering) of human IGF-1R, or the equivalent thereof, together with any post-translational modifications of these residues such as N- or O-linked glycosylation.
[0098] As used herein, the term "low affinity binding site" for IGF-1R means the regions of IGF-1R involved in forming the low affinity binding site (also known as "Site 1") of IGF-1R for IGF, comprising the C-terminal region of the IGF-1R α-chain and additionally one or both of the L1 domain of IGF-1R and the CR domain of IGF-1R. IGF binding to the low affinity binding site of IGF-1R induces formation of the high affinity IGF binding site of IGF-1R and subsequent signal transduction.
[0099] As used herein, the term "C-terminal region" of the IGF-1R α-chain refers to amino acids 681-697 of human IGF-1R α-chain as given in SEQ ID NO: 15, with numbering according to mature human IGF-1R (SEQ ID NO: 6).
[0100] As used herein, the term "classical α-chain C-terminal peptide", or "aCT", refers in IGF-1R to a region of IGF-1R corresponding to the C-terminal α-chain of IR previously described in the literature as being important for insulin binding (Kurose et al., 1994; Kristensen et al., 2002), and comprising amino acids 691-706 of IGF-1R (mature IGF-1R numbering) as given in SEQ ID NO: 14.
[0101] As used herein, the term "leucine-rich repeat domain 1" or "L1 domain" refers in IGF-1R to a leucine-rich domain comprising amino acids 1-149 of mature human IGF-1R (SEQ ID NO: 6).
[0102] As used herein, the term "leucine-rich repeat domain 2" or "L2 domain" refers in IGF-1R to a leucine-rich domain comprising amino acids 300-459 of mature human IGF-1R (SEQ ID NO: 6).
[0103] As used herein, the term "that part of the second LRR containing Ser35" refers in IGF-1R to amino acids 35-41 of mature human IGF-1R (SEQ ID NO: 6).
[0104] As used herein, the term "cysteine-rich domain" or "CR domain" refers in IGF-1R to a cysteine-rich domain comprising amino acids 150-299 of mature human IGF-1R (SEQ ID NO: 6). The CR domain contains many different modules. As used herein, the term "module 6 of the CR domain" refers to amino acids 249-275 of mature human IGF-1R (SEQ ID NO: 6).
Manipulation of the atomic coordinates [0105] It will be appreciated that a set of atomic coordinates for a polypeptide is a relative set of points that define a shape in three dimensions. Thus, it is possible that an entirely different set of coordinates could define a similar or identical shape. Moreover, slight variations in the individual coordinates will have little effect on overall shape.
[0106] The variations in coordinates may be generated due to mathematical manipulations of the atomic coordinates. For example, the atomic coordinates set forth in Appendix I could be manipulated by crystallographic permutations of the atomic coordinates, fractionalisation of the atomic coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the atomic coordinates, or any combination thereof.
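Some of the manipulations listed above can be illustrated with a short sketch. It assumes, for simplicity, an orthogonal unit cell (all cell angles 90°) so that fractionalisation reduces to dividing each Cartesian coordinate by the corresponding cell edge; the cell dimensions, function names and atom positions are hypothetical. The example applies fractionalisation, an integer addition and an inversion in turn, and shows that the distance between two atoms is unchanged.

```python
import math

CELL = (50.0, 60.0, 70.0)  # hypothetical orthogonal unit cell edges a, b, c in Å


def to_fractional(xyz, cell=CELL):
    """Cartesian (Å) to fractional coordinates for an orthogonal cell."""
    return tuple(v / edge for v, edge in zip(xyz, cell))


def to_cartesian(frac, cell=CELL):
    """Fractional to Cartesian (Å) coordinates for an orthogonal cell."""
    return tuple(f * edge for f, edge in zip(frac, cell))


def add_integers(frac, cells):
    """Shift fractional coordinates by whole unit cells."""
    return tuple(f + n for f, n in zip(frac, cells))


def invert(xyz):
    """Invert coordinates through the origin."""
    return tuple(-v for v in xyz)


def manipulate(xyz):
    """Fractionalise, shift by (1, 0, -2) cells, return to Å, then invert."""
    shifted = add_integers(to_fractional(xyz), (1, 0, -2))
    return invert(to_cartesian(shifted))


atoms = [(5.0, 12.0, 7.0), (8.0, 16.0, 7.0)]  # hypothetical positions in Å
moved = [manipulate(a) for a in atoms]
print(round(math.dist(*atoms), 6), round(math.dist(*moved), 6))  # 5.0 5.0
```

Interatomic distances are preserved by each of these operations. Inversion, however, produces the mirror-image arrangement, so a comparison that is sensitive to handedness would need to take this into account.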
[0107] Alternatively, modification in the crystal structure due to mutations, additions, substitutions, and/or deletions of amino acids, or other changes in any of the components that make up the crystal could also account for variations in atomic coordinates.
[0108] Various computational analyses are used to determine whether a molecular complex or a portion thereof is sufficiently similar to all or parts of the structure of the extracellular domain of IR described above. Such analyses may be carried out in current software applications, such as the Sequoia program (Bruns et al., 1999).
[0109] The Molecular Similarity program permits comparisons between different structures, different conformations of the same structure, and different parts of the same structure.
[0110] Comparisons typically involve calculation of the optimum translations and rotations required such that the root mean square deviation of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number is given in Angstroms.
[0111] Accordingly, atomic coordinates of an IR and/or IGF-1R ectodomain comprising the low affinity binding site of the present invention include atomic coordinates related to the atomic coordinates listed in Appendixes I to VI by whole body translations and/or rotations. Accordingly, RMSD values listed above assume that at least the backbone atoms of the structures are optimally superimposed, which may require translation and/or rotation to achieve the required optimal fit from which to calculate the RMSD value.
[0112] A three dimensional structure of an IR and/or IGF-1R ectodomain polypeptide or region thereof which substantially conforms to a specified set of atomic coordinates can be modelled by a suitable modelling computer program such as MODELLER (Sali & Blundell, 1993), using information, for example, derived from the following data: (1) the amino acid sequence of the human IR and/or IGF-1R ectodomain polypeptide; (2) the amino acid sequence of the related portion(s) of the protein represented by the specified set of atomic coordinates having a three dimensional configuration; and, (3) the atomic coordinates of the specified three dimensional configuration. A three dimensional structure of an IR and/or IGF-1R ectodomain polypeptide which substantially conforms to a specified set of atomic coordinates can also be calculated by a method such as molecular replacement, which is described in detail below.
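The optimal superposition referred to in paragraphs [0110] and [0111] can be sketched, for example, with the Kabsch algorithm, which finds the rotation minimising the RMSD between two centred sets of equivalent atoms. This technique is chosen here purely for illustration and is not stated to be the method used by any program named in this specification. The sketch assumes the NumPy library is available; the function name and coordinates are hypothetical.

```python
import numpy as np


def superimposed_rmsd(mobile, target):
    """Minimum RMSD (Å) between two (N, 3) sets of equivalent atoms after rigid-body fit."""
    p = np.asarray(mobile, dtype=float)
    q = np.asarray(target, dtype=float)
    if p.shape != q.shape or p.ndim != 2 or p.shape[1] != 3:
        raise ValueError("expected two (N, 3) arrays of equal shape")
    # Optimal translation: move both centroids to the origin.
    p = p - p.mean(axis=0)
    q = q - q.mean(axis=0)
    # Optimal rotation from the SVD of the covariance matrix (Kabsch).
    u, _, vt = np.linalg.svd(p.T @ q)
    sign = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0  # exclude reflections
    rotation = vt.T @ np.diag([1.0, 1.0, sign]) @ u.T
    diff = p @ rotation.T - q
    return float(np.sqrt((diff ** 2).sum() / len(p)))


# Hypothetical target atoms (Å) and a copy rotated 90° about z, then translated.
target = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0), (0.0, 0.0, 2.0)]
mobile = [(-y + 10.0, x - 5.0, z + 3.0) for x, y, z in target]
print(f"{superimposed_rmsd(mobile, target):.3f}")  # 0.000
```

Because the fit removes any whole body translation and rotation, the two coordinate sets in the example are reported as identical even though no individual coordinate value coincides.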
[0113] Atomic coordinates are typically loaded onto a machine-readable medium for subsequent computational manipulation. Thus models and/or atomic coordinates are advantageously stored on machine-readable media, such as magnetic or optical media and random-access or read-only memory, including tapes, diskettes, hard disks, CD-ROMs and DVDs, flash memory cards or chips, servers and the internet. The machine is typically a computer.
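As an illustration of such computational manipulation, the sketch below reads atom records from a text file and selects a subset of residues, for example residues 693-710 representing the C-terminal region of the IR α-chain. It assumes that the coordinates are held in the widely used fixed-column PDB ATOM record layout, which is an assumption about the file format rather than a statement about the Appendixes; the helper names, residue names and coordinate values are hypothetical.

```python
def format_atom(serial, name, res_name, chain, res_seq, x, y, z):
    """Build a simplified PDB-style ATOM record (atom name placed in columns 14-16)."""
    return (
        f"{'ATOM':<6}{serial:>5}  {name:<3} {res_name:>3} {chain}{res_seq:>4}"
        f"    {x:8.3f}{y:8.3f}{z:8.3f}"
    )


def parse_atom(line):
    """Extract atom name, residue, chain and coordinates from fixed columns."""
    return {
        "name": line[12:16].strip(),
        "res_name": line[17:20].strip(),
        "chain": line[21],
        "res_seq": int(line[22:26]),
        "xyz": (float(line[30:38]), float(line[38:46]), float(line[46:54])),
    }


def select_residues(lines, first, last):
    """Return parsed ATOM records whose residue number lies in [first, last]."""
    atoms = (parse_atom(line) for line in lines if line.startswith("ATOM"))
    return [atom for atom in atoms if first <= atom["res_seq"] <= last]


# Hypothetical records: only residues 693 and 710 fall inside the selected range.
lines = [
    format_atom(1, "CA", "GLY", "A", 692, 1.0, 2.0, 3.0),
    format_atom(2, "CA", "GLY", "A", 693, 12.345, -5.25, 0.5),
    format_atom(3, "CA", "GLY", "A", 710, 20.0, 21.0, 22.0),
    format_atom(4, "CA", "GLY", "A", 711, 30.0, 31.0, 32.0),
]
subset = select_residues(lines, 693, 710)
print([atom["res_seq"] for atom in subset])  # [693, 710]
```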
[0114] The atomic coordinates may be used in a computer to generate a representation, e.g. an image, of the three-dimensional structure of the IR and/or IGF-1R ectodomain crystal which can be displayed by the computer and/or represented in an electronic file.
[0115] The atomic coordinates and models derived therefrom may also be used for a variety of purposes such as drug discovery, biological reagent (binding protein) selection and X-ray crystallographic analysis of other protein crystals.
Molecular replacement/binding
[0116] The structure coordinates of IR and/or IGF-1R comprising the C-terminal region of the α-chain, such as those set forth in Appendixes I to IV, can also be used for determining the three-dimensional structure of a molecular complex which contains at least the C-terminal region of the α-chain of IR and/or IGF-1R. In particular, structural information about another crystallised molecular complex may be obtained. This may be achieved by any of a number of well-known techniques, including molecular replacement.
[0117] Methods of molecular replacement are generally known by those of skill in the art (generally described in Brunger, 1997; Navaza & Saludjian, 1997; Tong & Rossmann, 1997; Bentley, 1997; Lattman, 1985; Rossmann, 1972; McCoy, 2007).
[0118] Generally, X-ray diffraction data are collected from the crystal of a crystallised target structure. The X-ray diffraction data are transformed to calculate a Patterson function. The Patterson function of the crystallised target structure is compared with a Patterson function calculated from a known structure (referred to herein as a search structure). The Patterson function of the search structure is rotated on the target structure Patterson function to determine the correct orientation of the search structure in the crystal. A translation function is then calculated to determine the location of the search structure with respect to the crystal axes. Once the search structure has been correctly positioned in the unit cell, initial phases for the experimental data can be calculated. These phases are necessary for calculation of an electron density map from which structural differences can be observed and for refinement of the structure. Preferably, the structural features (e.g., amino acid sequence, conserved disulphide bonds, and beta-strands or beta-sheets) of the search molecule are related to the crystallised target structure.
[0119] The electron density map can, in turn, be subjected to any well-known model building and structure refinement techniques to provide a final, accurate structure of the unknown (i.e. target) crystallised molecular complex (e.g. see Jones et al., 1991; Brunger et al., 1998).
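The reason a Patterson function can be calculated before any phases are known may be seen from a deliberately simplified example. The sketch below assumes a one-dimensional unit cell containing point atoms at fractional positions; it is not a molecular replacement program, and the function names and the toy cell are assumptions made for illustration. Intensities are computed from the atoms, the phases are discarded, and the Patterson function obtained from the intensities alone shows peaks at the interatomic vectors.

```python
import math

def intensities(atoms, h_max):
    """|F(h)|^2 for h = 0..h_max, for point atoms given as (fractional x, weight)
    in a one-dimensional unit cell."""
    out = []
    for h in range(h_max + 1):
        re = sum(w * math.cos(2.0 * math.pi * h * x) for x, w in atoms)
        im = sum(w * math.sin(2.0 * math.pi * h * x) for x, w in atoms)
        out.append(re * re + im * im)   # the phase atan2(im, re) is discarded here
    return out

def patterson(intens, n_grid):
    """Patterson function P(u) sampled at u = k / n_grid, using intensities only."""
    values = []
    for k in range(n_grid):
        u = k / n_grid
        # Friedel symmetry, |F(-h)| = |F(h)|, lets the sum run over h > 0 only.
        p = intens[0] + 2.0 * sum(
            i_h * math.cos(2.0 * math.pi * h * u)
            for h, i_h in enumerate(intens) if h > 0
        )
        values.append(p)
    return values
```

With two equal atoms at x = 0.1 and x = 0.4, the strongest non-origin peaks fall at u = 0.3 and u = 0.7, i.e. at plus and minus the interatomic vector. Rotation and translation searches compare maps of this kind for the target crystal and the search structure.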
[0120] Obtaining accurate values for the phases, by methods other than molecular replacement, is a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a homologous portion has been solved, the phases from the known structure provide a satisfactory starting estimate of the phases for the unknown structure.
[0121] By using molecular replacement, all or part of the structure coordinates of IR and/or IGF-1R comprising the C-terminal region of the α-chain provided herein (and set forth in Appendixes I to IV) can be used to determine the structure of a crystallised molecular complex whose structure is unknown more rapidly and efficiently than attempting to determine such information ab initio. This method is especially useful in determining the structure of IR and/or IGF-1R mutants and homologues.
[0122] The structure of any portion of any crystallised molecular complex that is sufficiently homologous to any portion of the extracellular domain of IR and/or IGF-1R can be solved by this method.
[0123] Such structure coordinates are also particularly useful to solve the structure of crystals of IR and/or IGF-1R co-complexed with a variety of molecules, such as chemical entities. For example, this approach enables the determination of the optimal sites for the interaction between chemical entities, and the interaction of candidate IR and/or IGF-1R agonists or antagonists.
[0124] All of the complexes referred to above may be studied using well-known X-ray diffraction techniques and may be refined against 1.5-3.5 Å resolution X-ray data to an R value of about 0.25 or less using computer software, such as X-PLOR (Yale University, distributed by Molecular Simulations, Inc.; see Brunger, 1996). This information may thus be used to optimize known IR and/or IGF-1R agonists/antagonists, such as anti-IR and/or anti-IGF-1R antibodies, and more importantly, to design new or improved IR and/or IGF-1R agonists/antagonists.
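For reference, the R value mentioned above is conventionally the sum of the absolute differences between observed and calculated structure factor amplitudes divided by the sum of the observed amplitudes. The following is a minimal sketch of that definition; it assumes the calculated amplitudes have already been placed on the same scale as the observed ones, and the function name is hypothetical.

```python
def r_factor(f_obs, f_calc):
    """Conventional crystallographic R value:
    sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|), over matching reflections."""
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty, equally long amplitude lists")
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0.0:
        raise ValueError("observed amplitudes sum to zero")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    return numerator / denominator
```

A model would meet the criterion stated above when `r_factor(f_obs, f_calc)` is about 0.25 or less.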
Target sites for compound identification, design or screening
[0125] The three-dimensional structure of the low affinity binding site of IR and/or IGF-1R provided by the present invention (Appendixes I to IV) can be used to identify potential target binding sites in the low affinity insulin binding site of IR and/or IGF-1R (i.e. to identify those regions of the low affinity binding site of IR and/or IGF-1R involved in and important to the binding of insulin and/or IGF and subsequent signal transduction) as well as in methods for identifying or designing compounds which interact with the low affinity binding site of IR and/or IGF-1R, e.g. potential modulators of IR and/or IGF-1R.
[0126] The three-dimensional structure of IR and/or IGF-1R provided by the present invention (Appendixes I to IV) can be used to identify potential target binding sites in the L1 domain of IR and/or IGF-1R important for binding to the C-terminal region of the IR and/or IGF-1R α-chain (i.e. to identify those regions of the L1 domain of IR and/or IGF-1R involved in and important to the binding of the C-terminal region of the IR and/or IGF-1R α-chain) as well as in methods for identifying or designing compounds which interact with the L1 domain of IR and/or IGF-1R in a manner similar to the C-terminal region of the IR and/or IGF-1R α-chain, e.g. potential modulators of IR and/or IGF-1R.
[0127] The low affinity binding site of IR is a region of IR ectodomain involved in insulin docking to the receptor. Preferred low affinity target binding sites comprise the C-terminal region of the α-chain and one or more regions from the L1 domain and/or the CR domain of IR ectodomain. With regard to the L1 domain, the target binding site preferably comprises portions of the molecular surface of the central β-sheet of L1 and portions of the molecular surface of the second leucine-rich repeat (LRR) which contain Phe39 or the loop in the fourth LRR rung of L1, or preferably both, as defined above. With regard to the CR domain, the target binding site preferably comprises module 6 of the CR domain, as defined above.
[0128] The low affinity target binding site in IR may comprise amino acids 693-710 (encompassing the C-terminal region of the IR α-chain) plus one or more of the following amino acid sequences: (i) amino acids 1-156; (ii) amino acids 157-310; and (iii) amino acids 594 and 794.
[0129] With regards to amino acids 1-156, the target binding site preferably comprises at least one amino acid selected from Arg14, Asn15, Gln34, Leu36, Leu37, Phe39, Pro43, Phe46, Leu62, Phe64, Leu87, Phe88, Phe89, Asn90, Phe96, Glu97, Arg118, Glu120 or His144.
[0130] With regards to amino acids 157-310, the target binding site preferably comprises at least one amino acid from the amino acid sequence 192-310, more preferably at least one amino acid from the sequence 227-303, yet more preferably at least one amino acid selected from the sequence 259-284.
[0131] With regards to amino acids 594 and 794, the target binding site preferably comprises at least one amino acid selected from Asn594 or Arg794.
[0132] In a preferred embodiment, van der Waals and/or hydrophobic interactions account for the major portion of the binding energy between a compound and a low affinity insulin binding site of IR.
[0133] The three-dimensional structure of the C-terminal region of the IR α-chain provided by the present invention can also be used to identify or more clearly elucidate potential target binding sites on IGF-1R ectodomain (i.e. to identify those regions, or at least more accurately elucidate those regions, of IGF-1R ectodomain involved in and important to the binding of IGF and signal transduction) as well as in methods used for identifying or designing compounds which interact with potential target binding sites of IGF-1R ectodomain, e.g. potential modulators of IGF-1R.
[0134] Preferred target binding sites are those governing specificity, i.e. those regions of IGF-1R ectodomain involved in the initial low affinity binding of IGF (i.e. the initial binding of IGF to IGF-1R).
[0135] The low affinity binding site of IGF-1R is a region of IGF-1R ectodomain involved in IGF-I binding to the receptor. Preferred low affinity target binding sites comprise the C-terminal region of IGF-1R α-chain and one or more regions from the L1 domain and/or the CR domain of IGF-1R ectodomain. With regard to the L1 domain, the target binding site preferably comprises the central β-sheet of the L1 domain, and/or that part of the second LRR containing Ser35, and/or the loop in the fourth LRR rung of the L1 domain, or preferably all of these, as defined above. With regard to the CR domain, the target binding site preferably comprises module 6 of the CR domain, as defined above.
[0136] The low affinity IGF binding site may comprise amino acids 681-697 (encompassing the C-terminal region of the IGF-1R α-chain) plus one or more amino acids from the following amino acid sequences: (i) amino acids 1-149; and (ii) amino acids 150-298.
[0137] With regards to amino acids 1-149, the target binding site preferably comprises at least one amino acid from the amino acid sequence 1-62, preferably 1-49, and more preferably amino acid sequence 23-49. With regards to amino acids 150-298, the target binding site preferably comprises at least one amino acid from the amino acid sequence 185-298, more preferably at least one amino acid from the sequence 220-294, yet more preferably at least one amino acid selected from the sequence 252-273. The target binding site preferably comprises at least one amino acid selected from Arg10, His30, Leu32, Leu33, Leu56, Phe58, Arg59, Phe82, Tyr83, Asn84, Tyr85, Val88, Phe90, Arg112 and Asp136.
[0138] In a preferred embodiment, van der Waals and/or hydrophobic interactions account for the major portion of the binding energy between a compound and a low affinity binding site of IGF-1R.
[0139] Additional preferred binding sites in the case of both IR and IGF-1R, particularly for biological macromolecules such as proteins or aptamers, are those that are devoid of glycosylation or devoid of steric hindrance from glycan covalently attached to the polypeptide at sites in the spatial vicinity.
Design, selection, fitting and assessment of chemical entities that bind IR and/or IGF-1R
[0140] Using a variety of known modelling techniques, the crystal structure can be used to produce a model for the low affinity binding site of IR and/or IGF-1R, or at least part of the C-terminal region of the α-chain of IR or IGF-1R.
[0141] As used herein, the term "modelling" includes the quantitative and qualitative analysis of molecular structure and/or function based on atomic structural information and interaction models. The term "modelling" includes conventional numeric-based molecular dynamic and energy minimisation models, interactive computer graphic models, modified molecular mechanics models, distance geometry and other structure-based constraint models.
[0142] Molecular modelling techniques can be applied to the atomic coordinates of the low affinity binding site of IR and/or IGF-1R, or at least part of the C-terminal region of the α-chain of IR or IGF-1R, or a region thereof to derive a range of 3D models and to investigate the structure of binding sites, such as the binding sites of monoclonal antibodies, nonimmunoglobulin binding proteins and inhibitory peptides.
[0143] These techniques may also be used to screen for or design small and large chemical entities which are capable of binding IR and modulating the ability of IR to interact with extracellular biological targets, such as insulin or members of the IGF receptor family e.g. which modulate the ability of IR to heterodimerise. The screen may employ a solid 3D screening system or a computational screening system.
[0144] Such modelling methods are used to design or select chemical entities that possess stereochemical complementarity to the low affinity binding site of IR and/or IGF-1R, or to the regions of the L1 domain of IR and/or IGF-1R with which the C-terminal region of the α-chain of IR and/or IGF-1R interacts. By "stereochemical complementarity" we mean that the compound or a portion thereof makes a sufficient number of energetically favourable contacts with the receptor as to have a net reduction of free energy on binding to the receptor.
[0145] Such stereochemical complementarity is characteristic of a molecule that matches intra-site surface residues lining the groove of the receptor site as enumerated by the coordinates set out in Appendix I, optionally also utilising the coordinates set out in Appendixes II to VI. By "match" we mean that the identified portions interact with the surface residues, for example, via hydrogen bonding or by non-covalent van der Waals and Coulomb interactions (with surface or residue) which promote desolvation of the molecule within the site, in such a way that retention of the molecule at the binding site is favoured energetically.
[0146] It is preferred that the stereochemical complementarity is such that the compound has a Kd for the receptor site of less than 10⁻⁴ M, more preferably less than 10⁻⁵ M and more preferably 10⁻⁶ M. In a most preferred embodiment, the Kd value is less than 10⁻⁸ M and more preferably less than 10⁻⁹ M.
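A dissociation constant may be related to the net reduction of free energy on binding through the standard relation ΔG° = RT ln(Kd), taken relative to a 1 M standard state. The sketch below applies that relation; the function name and the default temperature of 298.15 K are assumptions made for the example and are not values taken from this specification.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def binding_free_energy(kd_molar, temperature_k=298.15):
    """Standard binding free energy in kcal/mol for a dissociation constant
    given in mol/L, relative to a 1 M standard state (more negative = tighter)."""
    if kd_molar <= 0.0:
        raise ValueError("Kd must be positive")
    return R_KCAL * temperature_k * math.log(kd_molar)
```

Under these assumptions a Kd of 10⁻⁹ M corresponds to roughly -12.3 kcal/mol, and each tenfold decrease in Kd lowers the free energy by about 1.4 kcal/mol.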
[0147] Chemical entities which are complementary to the shape and electrostatics or chemistry of the receptor site characterised by amino acids positioned at atomic coordinates set out in Appendixes I to IV will be able to bind to the receptor, and when the binding is sufficiently strong, substantially prohibit the interaction of the IR and/or IGF-1R ectodomain with biological target molecules such as insulin or IGF.
[0148] It will be appreciated that it is not necessary that the complementarity between chemical entities and the receptor site extend over all residues of the receptor site in order to inhibit binding of a molecule or complex that naturally interacts with IR and/or IGF-1R ectodomain.
[0149] A number of methods may be used to identify chemical entities possessing stereochemical complementarity to the low affinity binding site of IR and/or IGF-1R, or to the regions of the L1 domain of IR and/or IGF-1R with which the C-terminal region of the α-chain of IR and/or IGF-1R interacts. For instance, the process may begin by visual inspection of the entire low affinity insulin binding site comprising the C-terminal region of the α-chain of IR, or the equivalent region in IGF-1R, on the computer screen based on the coordinates in Appendixes I to IV generated from the machine-readable storage medium. Alternatively, selected fragments or chemical entities may then be positioned in a variety of orientations, or docked, within the low affinity binding site of IR and/or IGF-1R, or within the L1 domain of IR and/or IGF-1R in a manner similar to the C-terminal region of the α-chain of IR or IGF-1R, as defined supra. Similar methods could be used to identify chemical entities or compounds that may interact with the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R.
[0150] Modelling software that is well known and available in the art may be used (Guida, 1994). These include Discovery Studio (Accelrys Software Inc., San Diego), SYBYL (Tripos Associates, Inc., St. Louis, Mo., 1992), Maestro (Schrödinger LLC, Portland), MOE (Chemical Computing Group Inc., Montreal, Canada). This modelling step may be followed by energy minimization with standard molecular mechanics force fields such as AMBER (Weiner et al., 1984), OPLS (Jorgensen and Tirado-Rives, 1988) and CHARMM (Brooks et al., 1983). In addition, there are a number of more specialized computer programs to assist in the process of selecting the binding moieties of this invention.
[0151] Specialised computer programs may also assist in the process of selecting fragments or chemical entities. These include, inter alia:
1. GRID (Goodford, 1985). GRID is available from Molecular Discovery Ltd., Italy.
2. AUTODOCK (Goodsell & Olson, 1990). AUTODOCK is available from Scripps Research Institute, La Jolla, CA.
3. DOCK (Kuntz et al., 1982). DOCK is available from University of California, San Francisco, CA.
4. GLIDE (Friesner et al., 2004). GLIDE is available from Schrödinger LLC, Portland.
5. GOLD (Cole et al., 2005). GOLD is available from The Cambridge Crystallographic Data Centre, Cambridge, UK.
[0152] Once suitable chemical entities or fragments have been selected, they can be assembled into a single compound. In one embodiment, assembly may proceed by visual inspection of the relationship of the fragments to each other on the three-dimensional image displayed on a computer screen in relation to the structure coordinates of the low affinity binding site of IR and/or IGF-1R, or the L1 domain to which the C-terminal region of the α-chain of IR or IGF-1R binds. This is followed by manual model building using software such as Discovery Studio, Maestro, MOE or Sybyl. Alternatively, fragments may be joined to additional atoms using standard chemical geometry.
[0153] The above-described evaluation process for chemical entities may be performed in a similar fashion for chemical compounds.
[0154] Useful programs to aid one skilled in the art in connecting the individual chemical entities or fragments include:
1. CAVEAT (Bartlett et al., 1989). CAVEAT is available from the University of California, Berkeley, CA.
2. GANDI (Dey and Caflisch, 2008). GANDI is available from the University of Zurich.
[0155] Other molecular modeling techniques may also be employed in accordance with this invention, see, e.g., Cohen et al. (1990) and Navia & Murcko (1992).
[0156] There are two preferred approaches to designing a molecule that complements the stereochemistry of the low affinity binding site of IR and/or IGF-1R, or the L1 domain to which the C-terminal region of the α-chain of IR or IGF-1R binds. The first approach is to dock molecules from a three-dimensional structural database directly to the target binding site in silico, using mostly, but not exclusively, geometric criteria to assess the goodness-of-fit of a particular molecule to the site. In this approach, the number of internal degrees of freedom (and the corresponding local minima in the molecular conformation space) is reduced by considering only the geometric (hard-sphere) interactions of two rigid bodies, where one body (the active site) contains "pockets" or "grooves" that form binding sites for the second body (the complementing molecule).
[0157] Flexibility of the receptor, IR or IGF-1R, can be incorporated into the in silico screening by the application of multiple conformations of the receptor (Totrov and Abagyan, 2008). The multiple conformations of the receptor can be generated from the coordinates listed in Appendixes I to VI computationally by use of molecular dynamics simulation or similar approaches.
[0158] This approach is illustrated by Kuntz et al. (1982) and Ewing et al. (2001), whose algorithm for ligand design is implemented in a commercial software package, DOCK version 4.0, distributed by the Regents of the University of California and further described in a document, provided by the distributor, which is entitled "Overview of the DOCK program suite", the contents of which are hereby incorporated by reference. Pursuant to the Kuntz algorithm, the shape of the cavity in which the C-terminal region of the α-chain of IR or IGF-1R fits is defined as a series of overlapping spheres of different radii. One or more extant databases of crystallographic data, such as the Cambridge Structural Database System (The Cambridge Crystallographic Data Centre, Cambridge, U.K.), the Protein Data Bank maintained by the Research Collaboratory for Structural Bioinformatics (Rutgers University, N.J., U.S.A.), LeadQuest (Tripos Associates, Inc., St. Louis, MO), Available Chemicals Directory (Symyx Technologies Inc.), and the NCI database (National Cancer Institute, U.S.A.) is then searched for molecules which approximate the shape thus defined.
[0159] Molecules identified on the basis of geometric parameters can then be modified to satisfy criteria associated with chemical complementarity, such as hydrogen bonding, ionic interactions and van der Waals interactions. Different scoring functions can be employed to rank and select the best molecule from a database. See for example Bohm & Stahl (1999). The software package FlexX, marketed by Tripos Associates, Inc. (St. Louis, MO) is another program that can be used in this direct docking approach (see Rarey et al., 1996).
[0160] The second preferred approach entails an assessment of the interaction of respective chemical groups ("probes") with the active site at sample positions within and around the site, resulting in an array of energy values from which three-dimensional contour surfaces at selected energy levels can be generated. The chemical-probe approach to ligand design is described, for example, by Goodford (1985), and is implemented in several commercial software packages, such as GRID (product of Molecular Discovery Ltd., Italy).
[0161] Pursuant to this approach, the chemical prerequisites for a site-complementing molecule are identified at the outset, by probing the active site with different chemical probes, e.g., water, a methyl group, an amine nitrogen, a carboxyl oxygen, or a hydroxyl. Favoured sites for interaction between the active site and each probe are thus determined, and from the resulting three-dimensional pattern of such sites a putative complementary molecule can be generated. This may be done either by programs that can search three-dimensional databases to identify molecules incorporating desired pharmacophore patterns or by programs which use the favoured sites and probes as input to perform de novo design. Suitable programs for determining and designing pharmacophores include CATALYST (Accelrys Software, Inc.), and CERIUS2, DISCO (Abbott Laboratories, Abbott Park, IL; distributed by Tripos Associates Inc.).
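The purely geometric, rigid-body assessment described above may be pictured with the following minimal sketch. It is not the DOCK algorithm or any other published scoring scheme: atoms are assumed to be hard spheres given as (x, y, z, radius) in Angstroms, a pose is rejected if any pair of spheres overlaps, and an accepted pose is scored by the number of site and ligand spheres whose surfaces lie within an arbitrarily chosen margin. All names and the 1.0 Å default margin are assumptions made for the example.

```python
import math

def hard_sphere_score(site_atoms, ligand_atoms, contact_margin=1.0):
    """Score one rigid ligand pose against a rigid site by sphere geometry only.

    Returns None if any site/ligand spheres overlap (a steric clash); otherwise
    the number of site/ligand pairs whose surfaces are within contact_margin.
    """
    contacts = 0
    for sx, sy, sz, s_radius in site_atoms:
        for lx, ly, lz, l_radius in ligand_atoms:
            gap = math.dist((sx, sy, sz), (lx, ly, lz)) - (s_radius + l_radius)
            if gap < 0.0:
                return None          # spheres interpenetrate: pose rejected
            if gap <= contact_margin:
                contacts += 1        # surfaces are close: counted as a contact
    return contacts

def best_pose(site_atoms, poses):
    """Index of the clash-free pose with the most contacts, or None if every
    pose clashes. Ties are resolved in favour of the earliest pose."""
    best_index, best_score = None, -1
    for index, ligand_atoms in enumerate(poses):
        score = hard_sphere_score(site_atoms, ligand_atoms)
        if score is not None and score > best_score:
            best_index, best_score = index, score
    return best_index
```

Because both bodies are held rigid, each pose is evaluated with a single pass over atom pairs, which is what makes searching a large structural database by geometric criteria tractable.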
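The probe-based mapping described above may be sketched as follows. This is an illustration only and not the GRID program: it assumes a single neutral, methyl-like probe interacting with every site atom through a 12-6 Lennard-Jones term with invented parameters (well depth 0.15 kcal/mol at 4.0 Å), and it ignores electrostatics and hydrogen bonding. All function names are hypothetical.

```python
import math

def lennard_jones(r, epsilon=0.15, r_min=4.0):
    """12-6 Lennard-Jones energy (kcal/mol) with a minimum of -epsilon at
    separation r_min (Angstroms). Parameter values are illustrative only."""
    ratio = (r_min / r) ** 6
    return epsilon * (ratio * ratio - 2.0 * ratio)

def probe_energy_grid(site_atoms, origin, spacing, counts):
    """Energy of the probe at every point of a regular grid, summed over all
    site atoms. Returns a dict mapping grid index (i, j, k) to energy."""
    grid = {}
    for i in range(counts[0]):
        for j in range(counts[1]):
            for k in range(counts[2]):
                point = (origin[0] + i * spacing,
                         origin[1] + j * spacing,
                         origin[2] + k * spacing)
                energy = 0.0
                for atom in site_atoms:
                    # Clamp very short distances to avoid division by zero.
                    r = max(math.dist(point, atom), 0.5)
                    energy += lennard_jones(r)
                grid[(i, j, k)] = energy
    return grid

def favoured_points(grid, threshold):
    """Grid indices whose probe energy is at or below the chosen contour level."""
    return sorted(index for index, energy in grid.items() if energy <= threshold)
```

Repeating the calculation with different probe types and collecting the favoured points for each would give the three-dimensional pattern of sites from which a pharmacophore or a de novo design could then be derived.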
[0162] The pharmacophore can be used to screen in silico compound libraries/three-dimensional databases, using a program such as CATALYST (Accelrys Software, Inc.) and Sybyl/3DB Unity (Tripos Associates, Inc., St. Louis, MO).
[0163] Databases of chemical structures are available from a number of sources including Cambridge Crystallographic Data Centre (Cambridge, U.K.), Molecular Design, Ltd., (San Leandro, CA), Tripos Associates, Inc. (St. Louis, MO), Chemical Abstracts Service (Columbus, OH), the Available Chemical Directory (Symyx Technologies, Inc.), the Derwent World Drug Index (WDI), BioByte MasterFile, the National Cancer Institute database (NCI), Medchem Database (BioByte Corp.), and the Maybridge catalogue.
[0164] De novo design programs include LUDI (Accelrys Software Inc., San Diego, CA), Leapfrog (Tripos Associates, Inc.), and LigBuilder (Peking University, China).
[0165] Once an entity or compound has been designed or selected by the above methods, the efficiency with which that entity or compound may bind to IR and/or IGF-1R can be tested and optimised by computational evaluation. For example, a compound that has been designed or selected to function as an IR binding compound must also preferably traverse a volume not overlapping that occupied by the binding site when it is bound to the native IR. An effective IR binding compound must preferably demonstrate a relatively small difference in energy between its bound and free states (i.e., a small deformation energy of binding). Thus, the most efficient IR and/or IGF-1R binding compound should preferably be designed with a deformation energy of binding of not greater than about 10 kcal/mole, preferably, not greater than 7 kcal/mole. IR and/or IGF-1R binding compounds may interact with IR and/or IGF-1R in more than one conformation that are similar in overall binding energy. In those cases, the deformation energy of binding is taken to be the difference between the energy of the free compound and the average energy of the conformations observed when the compound binds to the protein.
[0166] A compound designed or selected as binding to IR and/or IGF-1R may be further computationally optimised so that in its bound state it would preferably lack repulsive electrostatic interaction with the target protein.
[0167] Such non-complementary (e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic interactions between the compound and the protein, when the compound is bound to IR and/or IGF-1R, preferably makes a neutral or favourable contribution to the enthalpy of binding.
[0168] Once an IR and/or IGF-1R-binding compound has been optimally selected or designed, as described above, substitutions may then be made in some of its atoms or side groups to improve or modify its binding properties. Generally, initial substitutions are conservative, i.e., the replacement group will have approximately the same size, shape, hydrophobicity and charge as the original group. It should, of course, be understood that components known in the art to alter conformation should be avoided. Such substituted chemical compounds may then be analysed for efficiency of fit to IR by the same computer methods described in detail above.
[0169] Specific computer software is available in the art to evaluate compound deformation energy and electrostatic interaction. Examples of programs designed for such uses include: Gaussian 03 (Frisch, Gaussian, Inc., Pittsburgh, PA); GAMESS (Gordon et al., Iowa State University); Jaguar (Schrödinger LLC, Portland); AMBER, version 9.0 (Case et al., University of California at San Francisco); CHARMM (Accelrys Software, Inc., San Diego, CA); and GROMACS version 4.0 (van der Spoel et al.).
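The deformation energy criterion may be expressed in a few lines. The sketch below assumes that conformational energies of the compound have already been computed elsewhere (for example with one of the force field or quantum chemistry programs known in the art) and are supplied in kcal/mole. The sign convention, average bound energy minus free energy so that strain is positive, and the function names are assumptions of this example.

```python
def deformation_energy(free_energy, bound_energies):
    """Deformation energy of binding (kcal/mole): the average energy of the
    conformations adopted when bound, minus the energy of the free compound."""
    if not bound_energies:
        raise ValueError("at least one bound conformation is required")
    mean_bound = sum(bound_energies) / len(bound_energies)
    return mean_bound - free_energy

def passes_deformation_filter(free_energy, bound_energies, limit=10.0):
    """True if the deformation energy does not exceed the chosen limit."""
    return deformation_energy(free_energy, bound_energies) <= limit
```

For instance, a compound with a free-state energy of -50 kcal/mole and bound conformations at -44 and -42 kcal/mole has a deformation energy of 7 kcal/mole under this convention and would pass a 10 kcal/mole limit.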
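A simple way to check whether the summed electrostatic interaction is neutral or favourable is a pairwise Coulomb sum over point charges. The sketch below is illustrative only: it assumes partial charges in units of the elementary charge placed at atomic positions in Angstroms, a uniform dielectric constant (the default of 4.0 is an arbitrary choice), and the commonly used conversion factor of approximately 332.06 kcal·Å/(mol·e²). The function name is hypothetical.

```python
import math

COULOMB_KCAL = 332.06  # approximate conversion factor, kcal*Angstrom/(mol*e^2)

def electrostatic_energy(compound, protein, dielectric=4.0):
    """Sum of pairwise Coulomb energies (kcal/mol) between two sets of point
    charges, each given as (x, y, z, charge)."""
    total = 0.0
    for cx, cy, cz, c_charge in compound:
        for px, py, pz, p_charge in protein:
            r = math.dist((cx, cy, cz), (px, py, pz))
            if r == 0.0:
                raise ValueError("two charges occupy the same position")
            total += COULOMB_KCAL * c_charge * p_charge / (dielectric * r)
    return total
```

A result at or below zero indicates that, within the limits of this crude model, attractive interactions at least balance repulsive ones.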
[0170] The screening/design methods may be implemented in hardware or software, or a combination of both. However, preferably, the methods are implemented in computer programs executing on programmable computers each comprising a processor, a data storage system (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. Program code is applied to input data to perform the functions described above and generate output information. The output information is applied to one or more output devices, in known fashion. The computer may be, for example, a personal computer, microcomputer, or workstation of conventional design.
[0171] Each program is preferably implemented in a high level procedural or object-oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.
[0172] Each such computer program is preferably stored on a storage medium or device (e.g., ROM or magnetic diskette) readable by a general or special purpose programmable computer, for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. The system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner to perform the functions described herein.
Compounds
[0173] Compounds described herein include both those designed or identified using a screening method of the invention and those which are capable of recognising and binding to the low affinity binding sites of IR and/or IGF-1R, as defined above. Also described are compounds that bind to the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R, i.e. compounds which mimic the C-terminal region of the α-chain of IR and/or IGF-1R.
[0174] Compounds capable of recognising and binding to the low affinity binding site of IR and/or IGF-1R may be produced using a screening method based on use of the atomic coordinates corresponding to the 3D structure of the low affinity binding site of IR and/or IGF-1R, or alternatively may be identified by screening against a specific target molecule which is indicative of the capacity to bind to the low affinity binding site of IR and/or IGF-1R.
[0175] Compounds capable of recognising and binding to the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R (i.e. compounds which mimic the C-terminal region of the α-chain of IR and/or IGF-1R) may be produced using a screening method based on use of the atomic coordinates corresponding to the 3D structure of the C-terminal region of the α-chain of IR and/or IGF-1R in isolation or as it associates with IR and/or IGF-1R, or alternatively may be identified by screening against a specific target molecule which is indicative of the capacity to bind to the low affinity binding site of IR and/or IGF-1R.
[0176] The candidate compounds and/or compounds identified or designed using a method of the present invention may be any suitable compound, synthetic or naturally occurring, preferably synthetic. In one embodiment, a synthetic compound selected or designed by the methods of the invention preferably has a molecular weight equal to or less than about 5000, 4000, 3000, 2000, 1000 or 500 daltons. A compound as described herein is preferably soluble under physiological conditions.
[0177] The compounds may encompass numerous chemical classes, though typically they are organic molecules, preferably small organic compounds having a molecular weight of more than 50 and less than about 2,500 daltons, preferably less than 1500, more preferably less than 1000 and yet more preferably less than 500. Such compounds can comprise functional groups necessary for structural interaction with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. The compound may comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures substituted wth one or more of the above functional groups. Compounds can also comprise biomolecules including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs, or combinations thereof.
[0178] Compounds may include, for example: (1) Peptides such as soluble peptides, including lg-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et al., 1991; Fbughten et al., 1991) and combinatorial chemistry-derived molecular libraries made of D-and/or L-configuration amino acids; (2) Phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide libraries, see, e.g., Songyang ei al., 1993); (3) Antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single chain antibodies as well as Fab, (Fab)2', Fab expression library and epitopebinding fragments of antibodies); (4) Nonimmunoglobulin binding proteins such as but not restricted to avimers, DARPins and lipocalins; (5) Nucleic acid-based aptamers; and (6) Small organic and inorganic molecules.
[0179] Ligands can be obtained from a wide variety of sources including libraries of synthetic or natural compounds. Synthetic compound libraries are commercially available from, for example, Maybridge Chemical Co. (Tintagel, Cornwall, UK), AMRI (Budapest, Hungary), ChemDiv (San Diego, CA) and Specs (Delft, The Netherlands).
[0180] Natural compound libraries comprising bacterial, fungal, plant or animal extracts are available from, for example, Pan Laboratories (Bothell, WA) and TimTec (Newark, DE). In addition, numerous means are available for random and directed synthesis of a wide variety of organic compounds and biomolecules, including expression of randomized oligonucleotides.
[0181] Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts can be readily produced. Methods for the synthesis of molecular libraries are readily available (see, e.g., DeWitt et al., 1993; Erb et al., 1994; Zuckermann et al., 1994; Cho et al., 1993; Carell et al., 1994a; Carell et al., 1994b; and Gallop et al., 1994). In addition, natural or synthetic compound libraries and compounds can be readily modified through conventional chemical, physical and biochemical means (see, e.g., Blondelle and Houghten, 1996), and may be used to produce combinatorial libraries. In another approach, previously identified pharmacological agents can be subjected to directed or random chemical modifications, such as acylation, alkylation, esterification and amidification, and the analogs can be screened for IR and/or IGF-1R-modulating activity.
[0182] Numerous methods for producing combinatorial libraries are known in the art, including those involving biological libraries; spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the "one-bead one-compound" library method; and synthetic library methods using affinity chromatography selection. The biological library approach is limited to polypeptide or peptide libraries, while the other four approaches are applicable to polypeptide, peptide, non-peptide oligomer, or small molecule libraries of compounds (Lam, 1997).
[0183] Compounds also include those that may be synthesized from leads generated by fragment-based drug design, wherein the binding of such chemical fragments is assessed by soaking or co-crystallizing such screen fragments into crystals provided by the invention and then subjecting these to an X-ray beam and obtaining diffraction data. Difference Fourier techniques are readily applied by those skilled in the art to determine the location within the IR ectodomain and/or IGF-1R ectodomain structure at which these fragments bind, and such fragments can then be assembled by synthetic chemistry into larger compounds with increased affinity for the receptor.
Isolated peptides or mimetics thereof
[0184] Compounds identified or designed using the methods of the invention can be a peptide or a mimetic thereof. In one example, there is described an isolated peptide or mimetic thereof which binds the L1 domain of IR and/or the L1 domain of IGF-1R, the peptide comprising:
i) an amino acid sequence as provided in SEQ ID NO: 13 or SEQ ID NO: 15;
ii) an amino acid sequence which is at least 50% identical to SEQ ID NO: 13 and/or SEQ ID NO: 15; or
iii) a fragment of i) or ii) which binds the L1 domain of IR and/or the L1 domain of IGF-1R,
wherein the peptide has a helical structure.
[0185] The isolated peptides or mimetics may be conformationally constrained molecules or alternatively molecules which are not conformationally constrained such as, for example, non-constrained peptide sequences. The term "conformationally constrained molecules" means conformationally constrained peptides and conformationally constrained peptide analogues and derivatives.
[0186] The term "analogues" refers to molecules having a chemically analogous structure to naturally occurring α-amino acids. Examples include molecules containing gem-diaminoalkyl groups or alkylmalonyl groups.
[0187] The term "derivatives" includes α-amino acids wherein one or more side groups found in the naturally occurring α-amino acids have been modified. Thus, for example the amino acids may be replaced with a variety of uncoded or modified amino acids such as the corresponding D-amino acid or N-methyl amino acid. Other modifications include substitution of hydroxyl, thiol, amino and carboxyl functional groups with chemically similar groups.
[0188] With regard to peptides and mimetics thereof, other examples of unnatural amino acids or chemical amino acid analogues/derivatives which can be introduced as a substitution or addition include, but are not limited to, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid analogues in general.
[0189] The mimetic may be a peptidomimetic. A "peptidomimetic" is a molecule that mimics the biological activity of a peptide but is no longer peptidic in chemical nature. By strict definition, a peptidomimetic is a molecule that no longer contains any peptide bonds (that is, amide bonds between amino acids). However, the term peptide mimetic is sometimes used to describe molecules that are no longer completely peptidic in nature, such as pseudo-peptides, semi-peptides and peptoids. Whether completely or partially non-peptide, peptidomimetics provide a spatial arrangement of reactive chemical moieties that closely resembles the three-dimensional arrangement of active groups in the peptide on which the peptidomimetic is based. As a result of this similar active-site geometry, the peptidomimetic has effects on biological systems which are similar to the biological activity of the peptide.
[0190] There are sometimes advantages for using a mimetic of a given peptide rather than the peptide itself, because peptides commonly exhibit two undesirable properties: (1) poor bioavailability; and (2) short duration of action. Peptide mimetics offer an obvious route around these two major obstacles, since the molecules concerned are small enough to be both orally active and have a long duration of action. There are also considerable cost savings and improved patient compliance associated with peptide mimetics, since they can be administered orally compared with parenteral administration for peptides. Furthermore, peptide mimetics are generally cheaper to produce than peptides.
[0191] Suitable peptidomimetics based on the C-terminal region of the α-chain of IR and/or IGF-1R can be developed using readily available techniques. Thus, for example, peptide bonds can be replaced by non-peptide bonds that allow the peptidomimetic to adopt a similar structure, and therefore biological activity, to the original peptide. Further modifications can also be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. The development of peptidomimetics derived from peptides of the C-terminal region of the IR and/or IGF-1R α-chain can be aided by reference to the three-dimensional structure of these residues as provided in Appendixes I to IV. This structural information can be used to search three-dimensional databases to identify molecules having a similar structure, using programs such as Sybyl/3DB Unity (Tripos Associates, St. Louis, MO).
[0192] Those skilled in the art will recognize that the design of a peptidomimetic may require slight structural alteration or adjustment of a chemical structure designed or identified using the methods of the invention. In general, chemical compounds identified or designed using the methods of the invention can be synthesized chemically and then tested for ability to modulate IR and/or IGF-1R activity using any of the methods described herein. The methods of the invention are particularly useful because they can be used to greatly decrease the number of potential mimetics which must be screened for their ability to modulate IR and/or IGF-1R activity.
[0193] The peptides or peptidomimetics can be used in assays for screening for candidate compounds which bind to regions of IR and/or IGF-1R and potentially interfere with the binding of insulin to IR and/or signal transduction and/or the binding of IGF to IGF-1R and/or signal transduction. Peptides or peptidomimetics which mimic target binding sites are particularly useful as specific target molecules for identifying potentially useful ligands for IR and/or IGF-1R.
[0194] Standard solid-phase ELISA assay formats are particularly useful for identifying compounds that bind to the receptor. In accordance with this embodiment, the peptide or peptidomimetic is immobilized on a solid matrix, such as, for example, an array of polymeric pins or a glass support. Conveniently, the immobilized peptide or peptidomimetic is a fusion polypeptide comprising Glutathione-S-transferase (GST; e.g. a CAP-ERK fusion), wherein the GST moiety facilitates immobilization of the protein to the solid phase support. This assay format can then be used to screen for candidate compounds that bind to the immobilized peptide or peptidomimetic and/or interfere with binding of a natural binding partner of IR and/or IGF-1R to the immobilized peptide or peptidomimetic.
[0195] As used herein a "fragment" is a portion of a peptide which maintains a defined activity of the "full-length" peptide, namely the ability to bind to the low affinity binding site of IR and/or IGF-1R, or to bind to the L1 domain of IR and/or IGF-1R. Fragments can be any size as long as they maintain the defined activity. Preferably, the fragment maintains at least 50%, more preferably at least 75%, of the activity of the full length polypeptide.
[0196] The % identity of a peptide is determined by GAP (Needleman and Wunsch, 1970) analysis (GCG program) with a gap creation penalty=5, and a gap extension penalty=0.3. The query sequence is at least 10 amino acids in length, and the GAP analysis aligns the two sequences over a region of at least 10 amino acids. More preferably, the GAP analysis aligns two sequences over their entire length.
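By way of illustration only, the global alignment underlying such a % identity figure may be sketched as follows. This is not the GCG GAP program: it is a minimal Needleman-Wunsch alignment with affine gap penalties, using the gap creation penalty of 5 and gap extension penalty of 0.3 stated above. The match and mismatch scores (4 and -2), the function names, and the choice of alignment length as the denominator for % identity are assumptions made for the example, and the 18-residue test sequence is simply the IR segment 693-710 spelled out from the residue labels given elsewhere in this description.

```python
NEG_INF = float("-inf")


def global_align(a, b, match=4.0, mismatch=-2.0, gap_open=5.0, gap_extend=0.3):
    """Global alignment with affine gaps (a gap of length k costs gap_open + k * gap_extend).

    match/mismatch are placeholder scores (assumption), not the GCG scoring matrix.
    """
    n, m = len(a), len(b)
    # M: residue aligned to residue; X: residue of a opposite a gap; Y: residue of b opposite a gap.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0.0
    for i in range(1, n + 1):
        X[i][0] = -(gap_open + gap_extend * i)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_open + gap_extend * j)
    new_gap = gap_open + gap_extend
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + s
            X[i][j] = max(M[i - 1][j] - new_gap, X[i - 1][j] - gap_extend, Y[i - 1][j] - new_gap)
            Y[i][j] = max(M[i][j - 1] - new_gap, Y[i][j - 1] - gap_extend, X[i][j - 1] - new_gap)

    # Traceback from the best-scoring end state.
    mats = {"M": M, "X": X, "Y": Y}
    i, j = n, m
    state = max("MXY", key=lambda s: mats[s][i][j])
    out_a, out_b = [], []
    while i > 0 or j > 0:
        if state == "M":
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
            state = max("MXY", key=lambda s: mats[s][i][j])
        elif state == "X":
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
            cand = {"M": M[i][j] - new_gap, "X": X[i][j] - gap_extend, "Y": Y[i][j] - new_gap}
            state = max(cand, key=cand.get)
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
            cand = {"M": M[i][j] - new_gap, "Y": Y[i][j] - gap_extend, "X": X[i][j] - new_gap}
            state = max(cand, key=cand.get)
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b, **scoring):
    """Identical aligned residues as a percentage of alignment columns (one convention; assumption)."""
    al_a, al_b = global_align(a, b, **scoring)
    if not al_a:
        return 0.0
    same = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * same / len(al_a)


if __name__ == "__main__":
    ir_693_710 = "LKELEESSFRKTFEDYLH"   # spelled out from the residue labels in this description
    variant = "LKELEESSARKTFEDYLH"      # hypothetical single substitution, for illustration
    print(percent_identity(ir_693_710, variant))
```

Other definitions of % identity (for example, dividing by the length of the shorter sequence) give different figures for gapped alignments, so the denominator should be fixed before figures are compared.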
[0197] With regard to a defined peptide, it will be appreciated that % identity figures higher than those provided above will encompass preferred embodiments. Thus, where applicable, in light of the minimum % identity figures, it is preferred that the peptide comprises an amino acid sequence which is at least 50%, more preferably at least 55%, more preferably at least 60%, more preferably at least 65%, more preferably at least 70%, more preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 91%, more preferably at least 92%, more preferably at least 93%, more preferably at least 94%, more preferably at least 95%, more preferably at least 96%, more preferably at least 97%, more preferably at least 98%, more preferably at least 99%, more preferably at least 99.1%, more preferably at least 99.2%, more preferably at least 99.3%, more preferably at least 99.4%, more preferably at least 99.5%, more preferably at least 99.6%, more preferably at least 99.7%, more preferably at least 99.8%, and even more preferably at least 99.9% identical to the relevant nominated SEQ ID NO.
[0198] Amino acid sequence mutants of the peptides identified or designed using the methods of the invention can be prepared by introducing appropriate nucleotide changes into a nucleic acid as described herein, or by in vitro synthesis of the desired peptide. Such mutants include, for example, deletions, insertions or substitutions of residues within the amino acid sequence. A combination of deletion, insertion and substitution can be made to arrive at the final construct, provided that the final peptide product possesses the desired characteristics.
[0199] In designing amino acid sequence mutants, the location of the mutation site and the nature of the mutation will depend on characteristic(s) to be modified. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting other residues adjacent to the located site.
[0200] Substitution mutants have at least one amino acid residue in the peptide removed and a different residue inserted in its place. Sites of interest are those in which particular residues obtained from various strains or species are identical. These sites, especially those falling within a sequence of at least three other identically conserved sites, are preferably substituted in a relatively conservative manner. Such conservative substitutions are shown in Table 1 under the heading of "exemplary substitutions".
Table 1 - Exemplary substitutions.
[0201] In a preferred embodiment a mutant/variant peptide has one or two or three or four conservative amino acid changes when compared to a peptide defined herein. Details of conservative amino acid changes are provided in Table 1.
[0202] Also described herein are peptides which are differentially modified during or after synthesis, e.g., by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. These modifications may serve to increase the stability and/or bioactivity of the peptide.
[0203] The residues that form the IR segment 693-710 can be grouped into three classes: Class A: those whose side chains are completely buried in the interface with L1 (viz. F701 and F705); Class B: those whose side chains lie at the periphery of the interface with L1 (viz. K694, E697, E698, S700, R702, T704, Y708 and L709); and Class C: those whose side chains appear to have no interaction with L1 (viz. L693, E695, L696, S699, K703, E706, D707 and H710). In terms of using the 693-710 peptide itself as a scaffold for mimetics that might compete with the C-terminal region of the IR α-chain peptide in its binding to the L1 domain, design might focus in the first instance on substitution of those residues belonging to Class B, given that the two residues lying in Class A are relatively optimally packed within the interface and already have few degrees of freedom. Higher affinity binding might be achieved by substitution of one or more of the Class B residues with either naturally occurring amino acids or non-natural amino acids. For example, the substitution of one or more of the charged residues K694, E697, E698 and R702 with residues that have reduced rotameric degrees of freedom (i.e. reduced entropy) may lead to higher affinity binding or altered physicochemical properties of the compound. Such modification may, for example, include substitution by the naturally occurring amino acids Phe, Tyr or Trp. As a further example, in the case of K694 and R702, it may be possible to substitute these residues by more bulky non-natural amino acids that retain the terminal cationic character, for example substitution by the basic phenyl propanoic acid derivatives App (L-2-amino-3-(4-aminophenyl)propanoic acid) and/or Gpp (L-2-amino-3-(4-guanidinophenyl)propanoic acid) (Svenson et al., 2009). A further strategy for design might involve substitutions to improve the overall stability of the helical structure (for example helix stapling - see Danial et al., 2008). Such substitutions would likely be made within Class C residues. A yet further strategy to improve affinity or physicochemical properties might involve truncation of the helical segment and/or attaching an N- or C-terminal group also designed to improve affinity. Similar principles of design may be applied to generate modified peptides based on the native IGF-1R peptide 681-697 as outlined above for the IR peptide.
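The three-class grouping of the IR segment 693-710 set out above lends itself to a simple lookup when shortlisting substitution sites. The following is a minimal sketch only: the 18-residue sequence string is spelled out from the residue labels given in the paragraph above, and the names `CLASSES`, `labelled` and `charged_class_b` are hypothetical helpers introduced for the example.

```python
IR_SEGMENT_START = 693
IR_SEGMENT = "LKELEESSFRKTFEDYLH"  # residues 693-710, from the labels in the text

# Class membership exactly as listed in the description.
CLASSES = {
    "A": [701, 705],                                   # buried in the L1 interface
    "B": [694, 697, 698, 700, 702, 704, 708, 709],     # periphery of the L1 interface
    "C": [693, 695, 696, 699, 703, 706, 707, 710],     # no apparent interaction with L1
}


def residue_at(position):
    """One-letter code of the residue at a given IR position."""
    return IR_SEGMENT[position - IR_SEGMENT_START]


def class_of(position):
    """Return 'A', 'B' or 'C' for a position in 693-710."""
    for name, members in CLASSES.items():
        if position in members:
            return name
    raise ValueError(f"position {position} is outside the 693-710 segment")


def labelled(class_name):
    """Residue labels (e.g. 'F701') for one class."""
    return [f"{residue_at(p)}{p}" for p in CLASSES[class_name]]


def charged_class_b():
    """Class B residues with charged side chains (K, R, D, E)."""
    return [f"{residue_at(p)}{p}" for p in CLASSES["B"] if residue_at(p) in "KRDE"]


if __name__ == "__main__":
    print("First-pass substitution sites (Class B):", labelled("B"))
    print("Charged Class B residues:", charged_class_b())
    print("Candidate helix-stabilising sites (Class C):", labelled("C"))
```

Because the sequence string and the class lists are entered independently, the labels produced by `labelled` serve as a consistency check against the residue names quoted in the text.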
[0204] In a particularly preferred example, a peptide or mimetic thereof does not comprise any one of the following described in US 7,173,005:
a) X1 X2 X3 X4 X5 wherein X1, X2, X4 and X5 are aromatic amino acids, and X3 is any polar amino acid;
b) X6 X7 X8 X9 X10 X11 X12 X13 wherein X6 and X7 are aromatic amino acids, X8, X9, X11 and X12 are any amino acid, and X10 and X13 are hydrophobic amino acids;
c) X14 X15 X16 X17 X18 X19 X20 X21 wherein X14 and X17 are hydrophobic amino acids, X15, X16, X18 and X19 are any amino acid, and X20 and X21 are aromatic amino acids;
d) X22 X23 X24 X25 X26 X27 X28 X29 X30 X31 X32 X33 X34 X35 X36 X37 X38 X39 X40 X41 wherein X22, X25, X28, X29, X30, X33, X34, X35, X36, X37, X38, X40 and X41 are any amino acid, X35 and X37 may be any amino acid for binding to IR, whereas X35 is preferably a hydrophobic amino acid and X37 is preferably glycine for binding to IGF-1R and possess agonist or antagonist activity. X23 and X26 are hydrophobic amino acids. This sequence further comprises at least two cysteine residues, preferably at X25 and X40. X31 and X32 are small amino acids;
e) X42 X43 X44 X45 X46 X47 X48 X49 X50 X51 X52 X53 X54 X55 X56 X57 X58 X59 X60 X61 wherein X42, X43, X44, X45, X53, X55, X56, X58, X60 and X61 may be any amino acid, X43, X46, X49, X50, X54 are hydrophobic amino acids, X47 and X59 are preferably cysteines, X48 is a polar amino acid, and X51, X52 and X57 are small amino acids;
f) X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 X81 wherein X62, X65, X68, X69, X71, X73, X76, X77, X78, X80 and X81 may be any amino acid; X63, X70, X74 are hydrophobic amino acids; X64 is a polar amino acid, X67 and X75 are aromatic amino acids and X72 and X79 are preferably cysteines capable of forming a loop;
g) H X82 X83 X84 X85 X86 X87 X88 X89 X90 X91 X92 wherein X82 is proline or alanine, X83 is a small amino acid, X84 is selected from leucine, serine or threonine, X85 is a polar amino acid, X86, X88, X89 and X90 are any amino acid, and X87, X91 and X92 are an aliphatic amino acid;
h) X104 X105 X106 X107 X108 X109 X110 X111 X112 X113 X114 wherein at least one of the amino acids of X106 through X111, and preferably two, are tryptophan separated by three amino acids, and wherein at least one of X104, X105 and X106 and at least one of X112, X113 and X114 are cysteine;
i) an amino acid sequence comprising the sequence JBA5: DYKDLCQSWGVRIGWLAGLCPKK or JBA5 minus FLAG® tag and terminal lysines: LCQSWGVRIGWLAGLCP (Formula 9); and
j) W X123 G Y X124 W X125 X126 wherein X123 is selected from proline, glycine, serine, arginine, alanine or leucine, but more preferably proline; X124 is any amino acid, but preferably a charged or aromatic amino acid; X125 is a hydrophobic amino acid, preferably leucine or phenylalanine, and most preferably leucine; and X126 is any amino acid, but preferably a small amino acid.
[0205] In a further preferred example, a peptide or mimetic thereof is more structurally similar to the native C-terminal region of the α-chain of IR and/or the C-terminal region of the α-chain of IGF-1R than it is to any one of a) to j) above (such as the peptides provided as SEQ ID Nos: 16 to 18).
[0206] The design of synthetic non-peptide mimetics of α-helices is an established art (see for example Davis et al., 2006). In particular, methods of mimicry of i, i+4, i+7 motifs (such as those identified within the C-terminal helical region of the α-chain of IR and IGF-1R and which interact with the respective L1 domains, i.e. IR residues Phe701, Phe705 and Tyr708 and IGF-1R residues Tyr688, Phe692 and Phe695) are known. For example, these motifs may be mimicked by terphenyl, oligophenyl, chalcone or 1,4-benzodiazepine-2,5-dione scaffolds (Davis et al., 2006) or by benzoylurea scaffolds (US 2008153802). Non-peptide mimetics of α-helices have been investigated as therapeutics in a number of disease contexts, for example HIV-1 infection (disruption of the assembly of the hexameric helical bundle (Ernst et al., 2001)) and cancer (disruption of the assembly of the HDM2-p53 complex (Yin et al., 2005); inhibitors of Bcl-2 family heterodimerisation (Degterev et al., 2001)).
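The reason an i, i+4, i+7 motif presents its side chains on a single face of a helix can be checked with a helical-wheel calculation. The sketch below assumes an ideal α-helix with 3.6 residues per turn (100° of rotation per residue); the function name `wheel_angles` is hypothetical, and the residue numbers are those quoted above for IR and IGF-1R.

```python
def wheel_angles(positions, degrees_per_residue=100.0):
    """Angular position of each residue around the helix axis, relative to the first residue.

    Angles are reported in the range (-180, 180] degrees; an ideal alpha-helix
    (3.6 residues per turn) is assumed.
    """
    reference = positions[0]
    angles = []
    for position in positions:
        angle = ((position - reference) * degrees_per_residue) % 360.0
        if angle > 180.0:
            angle -= 360.0
        angles.append(angle)
    return angles


if __name__ == "__main__":
    motifs = {
        "IR (Phe701, Phe705, Tyr708)": [701, 705, 708],
        "IGF-1R (Tyr688, Phe692, Phe695)": [688, 692, 695],
    }
    for name, positions in motifs.items():
        angles = wheel_angles(positions)
        spread = max(angles) - min(angles)
        print(f"{name}: angles {angles}, spread {spread} degrees")
```

Under this idealisation the three residues of each motif fall within a 60° arc, consistent with their presenting a single contiguous surface that a rigid scaffold could mimic.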
[0207] With regard to redesigning compounds using a method of the invention, in an embodiment the compound is redesigned to be more structurally similar to the native C-terminal region of the α-chain of IR and/or the C-terminal region of the α-chain of IGF-1R. Examples of peptides which could be redesigned in this manner include, but are not limited to, those described by Schaffer et al. (2003) and/or US 7,173,005 (see above).
Interaction of compounds with IR and/or IGF-1R
[0208] A compound may interact with the low affinity binding site of IR and/or IGF-1R by binding either directly or indirectly to that region. A compound which binds directly binds to the specified region. A compound which binds indirectly binds to a region in close proximity to or adjacent to the low affinity binding site of IR and/or IGF-1R with the result that it interferes with the ability of IR to bind to insulin, or IGF-1R to bind IGF, either antagonistically or agonistically. Such interference may be steric, electrostatic, or allosteric. Preferably, a compound interacts with the low affinity binding site of IR and/or IGF-1R by binding directly to the specified region. In the case of compounds that bind to specific target molecules, such compounds bind directly to the specific target molecule.
[0209] A compound may alternatively interact with the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R by binding either directly or indirectly to that region. A compound which binds directly binds to the specified region. A compound which binds indirectly binds to a region in close proximity to or adjacent to the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R with the result that it interferes with the ability of IR to bind to insulin, or IGF-1R to bind IGF, either antagonistically or agonistically. Such interference may be steric, electrostatic, or allosteric. Preferably, a compound interacts with the L1 domain of IR and/or IGF-1R in a manner similar to that of the C-terminal region of the α-chain of IR and/or IGF-1R by binding directly to the specified region. In the case of compounds that bind to specific target molecules, such compounds bind directly to the specific target molecule.
[0210] Binding can be either by covalent or non-covalent interactions, or both. Examples of non-covalent interactions include electrostatic interactions, van der Waals interactions, hydrophobic interactions and hydrophilic interactions.
[0211] When a compound interacts with IR and/or IGF-1R, it preferably "modulates" IR or IGF-1R, respectively. By "modulate" we mean that the compound changes an activity of IR or IGF-1R by at least 10%. Suitably, a compound modulates IR or IGF-1R by increasing or decreasing signal transduction via IR or IGF-1R, respectively. The phrase "decreases signal transduction" is intended to encompass partial or complete inhibition of signal transduction via IR or IGF-1R. The ability of a candidate compound to increase or decrease signal transduction via IR or IGF-1R can be assessed by any one of the IR or IGF-1R cell-based assays described herein.
[0212] Compounds may act as antagonists or agonists for insulin binding to IR or as antagonists or agonists for IGF binding to IGF-1R.
[0213] Compounds preferably have an affinity for IR or IGF-1R sufficient to provide adequate binding for the intended purpose. Suitably, such compounds and compounds which bind to specific target molecules of IR or IGF-1R have an affinity (Kd) of from 10⁻⁵ to 10⁻¹⁵ M. For use as a therapeutic, the compound suitably has an affinity (Kd) of from 10⁻⁷ to 10⁻¹⁵ M, preferably from 10⁻⁸ to 10⁻¹² M and more preferably from 10⁻¹⁰ to 10⁻¹² M. Where a compound is to be used as a reagent in a competitive assay to identify other ligands, the compound suitably has an affinity (Kd) of from 10⁻⁵ to 10⁻¹² M.
[0214] As will be evident to the skilled person, the crystal structures presented herein have enabled, for the first time, direct comparison of the regions controlling insulin or IGF binding in the closely related IR and IGF-1R. The structures have enabled the identification of the C-terminal region of the α-chain of IR and IGF-1R, critical for the initial binding of insulin or IGF, respectively, and in the subsequent formation of the high affinity insulin-IR or IGF-IGF-1R complex that leads to signal transduction.
[0215] In one preferred embodiment, a compound has a high specificity for IR and/or a specific target molecule of IR but not for IGF-1R, i.e. a compound selectively binds to IR or has enhanced selectivity for IR over IGF-1R. In this respect, a compound suitably has an affinity (Kd) for IR and/or a specific target molecule of IR of no more than 10⁻⁵ M, preferably no more than 10⁻⁷ M, and an affinity for IGF-1R of at least 10⁻⁵ M, preferably at least 10⁻³ M. Such compounds are desirable as, for example, IR agonists where the propensity to interact with IGF-1R and thus, for example, promote undesirable cell proliferation, is reduced.
[0216] In a preferred embodiment, the (IR or specific target molecule of IR)/IGF-1R binding affinity ratio for a compound is at least 10 and preferably at least 100.
[0217] In another preferred embodiment, a compound has a high specificity for IGF-1R and/or a specific target molecule of IGF-1R but not for IR, i.e. a compound selectively binds to IGF-1R or has enhanced selectivity for IGF-1R over IR. In this respect, a compound suitably has an affinity (Kd) for IGF-1R and/or a specific target molecule of IGF-1R of no more than 10⁻⁵ M, preferably no more than 10⁻⁷ M, and an affinity for IR of at least 10⁻⁵ M, preferably at least 10⁻³ M. Such compounds are desirable as, for example, IGF-1R agonists where the propensity to interact with IR and thus, for example, promote glucose uptake and metabolism, is reduced.
[0218] In a preferred embodiment, the (IGF-1R or specific target molecule of IGF-1R)/(IR) binding affinity ratio for a compound is at least 10 and preferably at least 100.
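The selectivity criteria of the preceding paragraphs can be expressed as a short calculation. The sketch below rests on an interpretive assumption: since affinity varies inversely with the dissociation constant, the "binding affinity ratio" for a target receptor over an off-target receptor is taken to be Kd(off-target)/Kd(target). The function names and the example Kd values are hypothetical.

```python
def fold_selectivity(kd_target, kd_off_target):
    """Affinity ratio target/off-target, computed as Kd(off-target) / Kd(target).

    Assumes affinity is proportional to 1/Kd (interpretive assumption).
    """
    return kd_off_target / kd_target


def is_selective(kd_target, kd_off_target, min_ratio=10.0,
                 max_kd_target=1e-5, min_kd_off_target=1e-5):
    """Check the Kd thresholds (in M) and minimum ratio described for a selective compound."""
    return (
        kd_target <= max_kd_target
        and kd_off_target >= min_kd_off_target
        and fold_selectivity(kd_target, kd_off_target) >= min_ratio
    )


if __name__ == "__main__":
    # Hypothetical compound: Kd of 10 nM for IR and 100 uM for IGF-1R.
    kd_ir, kd_igf1r = 1e-8, 1e-4
    print("IR-selective:", is_selective(kd_ir, kd_igf1r))
    print("IGF-1R-selective:", is_selective(kd_igf1r, kd_ir))
    print("Fold selectivity for IR:", fold_selectivity(kd_ir, kd_igf1r))
```

The same function covers both embodiments by swapping which receptor is treated as the target; the preferred thresholds (for example a ratio of at least 100) can be passed through the keyword arguments.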
Screening assays and confirmation of binding
[0219] Compounds described herein may be subjected to further confirmation of binding to IR and/or IGF-1R by cocrystallization of the compound with IR and/or IGF-1R and structural determination, as described herein.
[0220] Compounds designed or selected according to the methods of the present invention are preferably assessed by a number of in vitro and in vivo assays of IR and/or IGF-1R function to confirm their ability to interact with and modulate IR and/or IGF-1R activity. For example, compounds may be tested for their ability to bind to IR and/or IGF-1R and/or for their ability to modulate, e.g. disrupt, IR and/or IGF-1R signal transduction.
[0221] Libraries may be screened in solution by methods generally known in the art for determining whether ligands competitively bind at a common binding site. Such methods may include screening libraries in solution (e.g., Houghten, 1992), or on beads (Lam, 1991), chips (Fodor, 1993), bacteria or spores (U.S. 5,223,409), plasmids (Cull et al., 1992), or on phage (Scott & Smith, 1990; Devlin, 1990; Cwirla et al., 1990; Felici, 1991; U.S. 5,223,409).
[0222] Where the screening assay is a binding assay, IR or IGF-1 R may be joined to a label, where the label can directly or indirectly provide a detectable signal. Various labels include radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like. Specific binding molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the complementary member would normally be labeled with a molecule that provides for detection, in accordance with known procedures.
[0223] A variety of other reagents may be included in the screening assay. These include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which are used to facilitate optimal protein-protein binding and/or reduce non-specific or background interactions. Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, antimicrobial agents, etc., may be used. The components are added in any order that produces the requisite binding. Incubations are performed at any temperature that facilitates optimal activity, typically between 4 and 40 °C.
[0224] Direct binding of compounds to IR or IGF-1R can also be measured by Surface Plasmon Resonance (BIAcore) (reviewed in Morton & Myszka, 1998). Here the receptor is immobilized on a CM5 or other sensor chip by either direct chemical coupling using amine or thiol-disulphide exchange coupling (Nice & Catimel, 1999) or by capturing the receptor ectodomain as an Fc fusion protein to an appropriately derivatised sensor surface (Morton & Myszka, 1998). The potential binding molecule (called an analyte) is passed over the sensor surface at an appropriate flow rate and a range of concentrations. The classical method of analysis is to collect responses for a wide range of analyte concentrations. A range of concentrations provides sufficient information about the reaction, and by using a fitting algorithm such as CLAMP (see Morton & Myszka, 1998), rate constants can be determined (Morton & Myszka, 1998; Nice & Catimel, 1999). Normally, the ligand surface is regenerated at the end of each analyte binding cycle. Surface regeneration ensures that the same number of ligand binding sites is accessible to the analyte at the beginning of each cycle.
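To illustrate how rate constants follow from responses collected over a range of analyte concentrations, the sketch below uses the simple 1:1 interaction model, in which the observed association rate is k_obs = ka·C + kd, so that a straight-line fit of k_obs against concentration C yields ka (slope), kd (intercept) and KD = kd/ka. This linearised analysis is an assumption chosen for brevity; it is not the global fitting performed by CLAMP, and the function names and numerical values are hypothetical.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rate_constants(concentrations, k_obs_values):
    """Estimate ka (1/(M*s)), kd (1/s) and KD (M) from k_obs = ka * C + kd."""
    ka, kd = fit_line(concentrations, k_obs_values)
    return ka, kd, kd / ka


def association_response(t, conc, ka, kd, r_max):
    """Response at time t (s) during analyte injection for a 1:1 interaction."""
    k_obs = ka * conc + kd
    r_eq = r_max * ka * conc / k_obs
    return r_eq * (1.0 - math.exp(-k_obs * t))


if __name__ == "__main__":
    # Hypothetical analyte concentrations (M) and observed rates (1/s).
    concs = [1e-8, 2e-8, 5e-8, 1e-7]
    k_obs = [2.0e-3, 3.0e-3, 6.0e-3, 1.1e-2]
    ka, kd, kd_eq = rate_constants(concs, k_obs)
    print(f"ka = {ka:.3g} 1/(M*s), kd = {kd:.3g} 1/s, KD = {kd_eq:.3g} M")
```

In practice each k_obs value would itself be obtained by fitting the association phase of a sensorgram, and effects such as mass transport limitation would call for a fuller model.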
[0225] Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid high-throughput screening. Normally, between 0.1 and 1 hour will be sufficient. In general, a plurality of assay mixtures is run in parallel with different test agent concentrations to obtain a differential response to these concentrations. Typically, one of these concentrations serves as a negative control, i.e. at zero concentration or below the level of detection.
[0226] The basic format of an in vitro competitive receptor binding assay as the basis of a heterogeneous screen for small organic molecular replacements for insulin may be as follows: occupation of the low affinity binding site of IR and/or IGF-1R is quantified by time-resolved fluorometric detection (TRFD) as described by Denley et al., 2004. R⁻IR-A, R⁻IR-B and P6 cells are used as sources of IR-A, IR-B and IGF-1R respectively. Cells are lysed with lysis buffer (20 mM HEPES, 150 mM NaCl, 1.5 mM MgCl2, 10% (v/v) glycerol, 1% (v/v) Triton X-100, 1 mM EGTA, pH 7.5) for 1 hour at 4°C. Lysates are centrifuged for 10 minutes at 3500 rpm and then 100 μl is added per well to a white Greiner Lumitrac 600 plate previously coated with anti-insulin receptor antibody 83-7 or anti-IGF-1R antibody 24-31. Neither capture antibody interferes with receptor binding by insulin, IGF-I or IGF-II. Approximately 100,000 fluorescent counts of europium-labelled insulin or europium-labelled IGF-I are added to each well along with various amounts of unlabelled competitor and incubated for 16 hours at 4°C. Wells are washed with 20 mM Tris, 150 mM NaCl, 0.05% (v/v) Tween 20 (TBST) and DELFIA enhancement solution (100 μl/well) is added. Time-resolved fluorescence is measured using 340 nm excitation and 612 nm emission filters with a BMG Lab Technologies Polarstar™ Fluorimeter or a Wallac Victor II (EG & G Wallac, Inc.).
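Data from a competitive binding assay of this kind are commonly summarised as the competitor concentration that displaces half of the labelled ligand (IC50). The following is a minimal sketch under stated assumptions: counts are expressed as a percentage of the signal without competitor, background subtraction is omitted, and the IC50 is estimated by linear interpolation on a log-concentration axis rather than by curve fitting. The function names and the example data are hypothetical.

```python
import math


def percent_bound(counts, counts_without_competitor):
    """Express fluorescence counts as a percentage of the no-competitor signal."""
    return [100.0 * c / counts_without_competitor for c in counts]


def ic50_by_interpolation(concentrations, pct_bound):
    """Estimate IC50 by interpolating between the two points that bracket 50% bound.

    Interpolation is linear in log10(concentration); returns None if 50% is never crossed.
    """
    pairs = sorted(zip(concentrations, pct_bound))
    for (c1, p1), (c2, p2) in zip(pairs, pairs[1:]):
        if p1 >= 50.0 >= p2 and p1 != p2:
            fraction = (p1 - 50.0) / (p1 - p2)
            log_ic50 = math.log10(c1) + fraction * (math.log10(c2) - math.log10(c1))
            return 10.0 ** log_ic50
    return None


if __name__ == "__main__":
    # Hypothetical competitor concentrations (M) and time-resolved fluorescence counts.
    concs = [1e-10, 1e-9, 1e-8, 1e-7, 1e-6]
    counts = [9800.0, 8900.0, 5600.0, 1500.0, 300.0]
    pct = percent_bound(counts, counts_without_competitor=10000.0)
    print("Percent bound:", pct)
    print("Estimated IC50 (M):", ic50_by_interpolation(concs, pct))
```

A four-parameter logistic fit would normally be preferred where enough concentrations are available; interpolation is shown only because it needs no fitting library.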
[0227] Examples of other suitable assays which may be employed to assess the binding and biological activity of compounds to and on IR are well known in the art. For example, suitable assays may be found in PCT International Publication Number WO 03/027246. Examples of suitable assays include the following:
(i) Receptor autophosphorylation (as described by Denley et al., 2004). R⁻IR-A, R⁻IR-B or P6 cells are plated in a Falcon 96-well flat-bottom plate at 2.5 × 10⁴ cells/well and grown overnight at 37°C, 5% CO2. Cells are washed for 4 hours in serum-free medium before treating with one of either insulin, IGF-I or IGF-II in 100 μl DMEM with 1% BSA for 10 minutes at 37°C, 5% CO2. Lysis buffer containing 2 mM Na3VO4 and 1 mg/ml NaF is added to cells and receptors from lysates are captured on 96-well plates precoated with antibody 83-7 or 24-31 and blocked with 1 × TBST/0.5% BSA. After overnight incubation at 4°C, the plates are washed with 1 × TBST. Phosphorylated receptor is detected with europium-labelled antiphosphotyrosine antibody PY20 (130 ng/well, room temperature, 2 hours). DELFIA enhancement solution (100 μl/well) is added and time-resolved fluorescence detected as described above.
(ii) Glucose uptake using 2-deoxy-[U-14C] glucose (as described by Olefsky, 1978). Adipocytes between days 8-12 post-differentiation in 24-well plates are washed twice in Krebs-Ringer Bicarbonate Buffer (25 mM Hepes, pH 7.4 containing 130 mM NaCl, 5 mM KCl, KH2PO4, 1.3 mM MgSO4·7H2O, 25 mM NaHCO3 and 1.15 mM CaCl2) supplemented with 1% (w/v) RIA-grade BSA and 2 mM sodium pyruvate. Adipocytes are equilibrated for 90 min at 37°C prior to insulin addition, or for 30 min prior to agonist or antagonist addition. Insulin (Actrapid, Novogen) is added over a concentration range of 0.7 to 70 nM for 30 min at 37°C. Agonist or antagonist (0 to 500 μM) is added to adipocytes for 90 min followed by the addition of insulin in the case of antagonists. Uptake of 50 μM 2-deoxy glucose and 0.5 μCi 2-deoxy-[U-14C] glucose (NEN, PerkinElmer Life Sciences) per well is measured over the final 10 min of agonist stimulation by scintillation counting.
(iii) Glucose transporter GLUT4 translocation using plasma membrane lawns (as described by Robinson & James (1992) and Marsh et al. (1995)).
(iv) GLUT4 translocation using plasma membrane lawns (as described by Marsh et al., 1995). 3T3-L1 fibroblasts are grown on glass coverslips in 6-well plates and differentiated into adipocytes. After 8-12 days post-differentiation, adipocytes are serum-starved for 18 hrs in DMEM containing 0.5% FBS. Cells are washed twice in Krebs-Ringer Bicarbonate Buffer, pH 7.4 and equilibrated for 90 min at 37°C prior to insulin (100 nM) addition, or for 30 min prior to compound (100 μM) addition. After treatments, adipocytes are washed in 0.5 mg/ml poly-L-lysine in PBS, then shocked hypotonically by three washes in 1:3 (v/v) membrane buffer (30 mM Hepes, pH 7.2 containing 70 mM KCl, 5 mM MgCl2, 3 mM EGTA and freshly added 1 mM DTT and 2 mM PMSF) on ice. The washed cells are then sonicated using a probe sonicator (Microson) at setting 0 in 1:1 (v/v) membrane buffer on ice, to generate a lawn of plasma membrane fragments that remain attached to the coverslip. The fragments are fixed in 2% (w/v) paraformaldehyde in membrane buffer for 20 min at 22°C and the fixative quenched by 100 mM glycine in PBS. The plasma membrane fragments are then blocked in 1% (w/v) Blotto in membrane buffer for 60 min at 22°C and immunolabelled with an in-house rabbit affinity purified anti-GLUT4 polyclonal antibody (clone R10, generated against a peptide encompassing the C-terminal 19 amino acids of GLUT4) and Alexa 488 goat anti-rabbit secondary antibody (Molecular Probes; 1:200). Coverslips are mounted onto slides using FluoroSave reagent (Calbiochem), and imaged using an OptiScan confocal laser scanning immunofluorescence microscope (Optiscan, VIC., Australia). Data are analysed using ImageJ (NIH) imaging software. At least six fields are examined within each experiment for each condition, and the confocal microscope gain settings are maintained over the period of experiments to minimise between-experiment variability.
[0228] Insulin agonist activity may be determined using an adipocyte assay. Insulin increases uptake of 3H glucose into adipocytes and its conversion into lipid. Incorporation of 3H into a lipid phase is determined by partitioning of the lipid phase into a scintillant mixture, which excludes water-soluble 3H products. The effect of compounds on the incorporation of 3H glucose at a sub-maximal insulin dose is determined, and the results expressed as increase relative to full insulin response. The method is adapted from Moody et al. (1974). Mouse epididymal fat pads are dissected out, minced into digestion buffer (Krebs-Ringer 25 mM HEPES, 4% HSA, 1.1 mM glucose, 0.4 mg/ml Collagenase Type 1, pH 7.4), and digested for up to 1.5 hours at 36.5 °C. After filtration, washing (Krebs-Ringer HEPES, 1% HSA) and resuspension in assay buffer (Krebs-Ringer HEPES, 1% HSA), free fat cells are pipetted into 96-well Picoplates containing test solution and approximately an ED20 insulin.
[0229] The assay is started by addition of 3H glucose (e.g. ex. Amersham TRK 239), in a final concentration of 0.45 mM glucose. The assay is incubated for 2 hours at 36.5 °C, in a Labshaker incubation tower, 400 rpm, then terminated by the addition of Permablend/Toluene scintillant (or equivalent), and the plates sealed before standing for at least 1 hour and detection in a Packard Top Counter or equivalent. A full insulin standard curve (8 dose) is run as control on each plate.
[0230] Data are presented graphically, as the effect of the compound on an (approximate) ED20 insulin response, with data normalized to a full insulin response. The assay can also be run at basal or maximal insulin concentration.
[0231] To test the in vivo activity of a compound, an intravenous blood glucose test may be carried out on Wistar rats as follows. Male Mol:Wistar rats, weighing about 300 g, are divided into two groups. A 10 μl sample of blood is taken from the tail vein for determination of blood glucose concentration. The rats are then anaesthetized (e.g. with Hypnorm/Dormicum) at t = -30 min and blood glucose measured again at t = -20 min and at t = 0 min. After the t = 0 sample is taken, the rats are injected into the tail vein with vehicle or test substance in an isotonic aqueous buffer at a concentration corresponding to a 1 ml/kg volume of injection. Blood glucose is measured at times 10, 20, 30, 40, 60, 80, 120 and 180 min. The anaesthetic administration is repeated at 20 min intervals.
[0232] Additional assays to determine the effect of binding molecules on IGF-1R activity are as follows:
(i) Cell Viability Assay on HT29 cells with induction of Apoptosis: The ability of compounds to inhibit IGF-mediated rescue from apoptosis is measured using the colorectal cell line HT29 (ATCC: HTB 38) after induction with Na Butyrate. The HT29 cells are plated out onto white Fluoronunc 96-well plates (Nunc) at 12,000 cells/ml and incubated at 37°C, 5% CO2 for 48 hours. Media is aspirated and 100 μl/well of serum-free DMEM/F12 is added for 2 hours to serum starve cells. IGF (100 μl/well of 0.05-50 nM dilutions) in the presence and the absence of inhibitory compound is added in 0.1% BSA solution (Sigma) in DMEM/F12 (Gibco) in triplicate. A final concentration of 5 mM Butyrate (Sigma) is added to each well. Plates are incubated at 37°C, 5% CO2 for a further 48 hours. Plates are brought to room temperature and developed (as per instructions for CTG Assay (Promega)). Luminescence signal is measured on the Polarstar plate reader and data is evaluated using table curve to obtain the specific ED50.
(ii) Cell Migration Assay: The migration assays are performed in the modified 96-well Boyden chamber (Neuroprobe, Bethesda, MA). An 8 μm polycarbonate filter, which is pre-soaked in 25 μg/ml of collagen in 10 mM acetic acid overnight at 4 °C, is placed so as to divide the chamber into an upper and a lower compartment. Varying concentrations of the IGF-I analogues (25 μl of 0-100 nM) diluted in RPMI (Gibco) with 0.5% BSA (Sigma), to be tested for their migration-inducing ability, are placed in the lower compartment in quadruplicate. The wells of the upper chamber are seeded with 50 μl/well of 2 × 10⁵ SW480 (ATCC: CCL 228) pre-incubated for 30 min at 37°C with 1.1 μl of 2 μM Calcein (Molecular Probes). Cells migrate for 8 hours at 37°C, 5% CO2. Unmigrated cells are removed by wiping the filter. The filter is then analysed in the Polarstar for fluorescence at an excitation wavelength of 485 nm and an emission wavelength of 520 nm. Data is evaluated using table curve to obtain the specific ED50 value.
(iii) Mouse Xenograft studies for anti-IGF-1R antibodies: In vivo studies are performed in 5-6-week-old female athymic BALB/c nude mice, homozygous for the nu/nu allele. Mice are maintained in autoclaved micro-isolator cages housed in a positive pressure containment rack (Thoren Caging Systems Inc., Hazelton, PA, USA). To establish xenografts, mice are injected subcutaneously into the left inguinal mammary line with 3 × 10⁶ or 5 × 10⁶ cells in 100 μl of PBS. Tumour volume (TV) is calculated by the formula (length × width²)/2 (Clarke et al., 2000), where length is the longest axis and width the measurement at right angles to length.
Initial biodistribution of potential binding molecules is ascertained by injecting 40 BALB/c nude mice with established xenografts with radiolabelled 111In- or 125I-anti-IGFR antibody (3 μg, 10 μCi) intravenously via the tail vein (total volume = 0.1 ml). At designated time points after injection of the radioconjugates (t = 4 h, days 1, 2, 3, 5 and 7), groups of mice (n = 3-5) are killed by Ethrane anaesthesia. Mice are then exsanguinated by cardiac puncture, and tumours and organs (liver, spleen, kidney, muscle, skin, bone (femur), lungs, heart, stomach, brain, small bowel, tail and colon) are resected immediately. All samples are counted in a dual gamma scintillation counter (Packard Instruments). Triplicate standards prepared from the injected material are counted at each time point with tissue and tumour samples, enabling calculations to be corrected for the physical decay of the isotopes. The tissue distribution data are calculated as the mean ± s.d. percent injected dose per gram tissue (%ID g⁻¹) for the candidate molecule per time point.
Pharmacokinetics for the candidate compounds are ascertained as follows: Serum obtained from mice bearing xenografts, following infusion of radiolabelled binding molecule as described above, is aliquoted in duplicate and counted in a gamma scintillation counter (Packard Instruments, Melbourne, Australia). Triplicate standards prepared from the injected material are counted at each time point with serum samples to enable calculations to be corrected for the physical decay of the isotope. The serum results are expressed as % injected dose per litre (% ID l⁻¹). Pharmacokinetic calculations are performed on serum data using a curve fitting program (WinNonlin, Pharsight Co., Mountain View, CA, USA). A two-compartment model is used to calculate the serum pharmacokinetic parameters AUC (area under the serum concentration curve extrapolated to infinite time), CL (total serum clearance), T½α and T½β (half-lives of the initial and terminal phases of disposition) for 125I- and 111In-labelled molecule.
(iv) Therapeutic in vivo studies: Tumour cells (3 × 10⁶) in 100 μl of media are inoculated subcutaneously into both flanks of 4-6-week-old female nude mice (n = 5 group⁻¹). Candidate molecule treatment commences on day 7 post-tumour cell inoculation (mean ± s.e. tumour volume = 60 ± 15 mm³) and consists of six intraperitoneal injections over 2 weeks of appropriate amounts of the candidate molecule or vehicle control. Tumour volume in mm³ is determined as described previously. Data is expressed as mean tumour volume for each treatment group. Differences in tumour size between control and test groups are tested for statistical significance (P<0.05) by t-test.
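The two simplest calculations in the xenograft protocol above, the tumour volume formula and the percent injected dose per gram, can be sketched as follows. This is only an illustrative sketch: the function names are hypothetical, and the `standard_fraction` parameter (the fraction of the injected dose contained in each counting standard) is an assumption, since the dilution of the standards is not stated. Because the standards are counted in the same session as the tissue samples, both have decayed by the same factor and their ratio needs no separate decay correction.

```python
from statistics import mean


def tumour_volume(length_mm, width_mm):
    """Tumour volume (mm^3) computed as (length x width^2) / 2."""
    return length_mm * width_mm ** 2 / 2.0


def percent_id_per_gram(tissue_counts, tissue_weight_g,
                        standard_counts, standard_fraction):
    """Percent injected dose per gram of tissue (%ID/g).

    standard_counts: counts of the replicate standards, measured in the same
    counting session as the tissue sample.
    standard_fraction: fraction of the injected dose in each standard
    (hypothetical parameter; not specified in the text).
    """
    injected_counts = mean(standard_counts) / standard_fraction
    return 100.0 * tissue_counts / injected_counts / tissue_weight_g


# Made-up numbers, for illustration only.
print(tumour_volume(10.0, 6.0))
print(percent_id_per_gram(5000, 0.5, [9900, 10000, 10100], 0.01))
```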
Uses of compounds
[0233] Compounds/chemical entities designed or selected by the methods of the invention described above may be used to modulate IR and/or IGF-1R activity in cells, i.e. activate or inhibit IR and/or IGF-1R activity. Such compounds may interact with the low affinity binding sites of IR and/or IGF-1R as defined herein, or mimic the C-terminal region of the α-chain of IR and/or IGF-1R as defined herein. They may also be used to modulate homodimerisation of IR and/or IGF-1R.
[0234] Modulation of homodimerisation of IR and/or IGF-1R may be achieved by direct binding of the chemical entity to a homodimerisation surface of IR and/or IGF-1R, and/or by an allosteric interaction elsewhere in the IR and/or IGF-1R extracellular domain.
[0235] Given that aberrant IR and/or IGF-1R activity is implicated in a range of disorders, the compounds described above may also be used to treat, ameliorate or prevent disorders characterised by abnormal IR and/or IGF-1R signalling. Examples of such disorders include malignant conditions including tumours of the brain, head and neck, prostate, ovary, breast, cervix, lung, pancreas and colon; and melanoma, rhabdomyosarcoma, mesothelioma, squamous carcinomas of the skin and glioblastoma.
[0236] The compounds designed to interact or identified as interacting with the extracellular domain of IR and/or IGF-1R, and in particular to interact with the target binding sites, are useful as agonists or antagonists against the action of insulin on IR and/or IGF on IGF-1R. The compounds are useful as assay reagents for identifying other useful ligands by, for example, competition assays, as research tools for further analysis of IR and/or IGF-1R and as potential therapeutics in pharmaceutical compositions.
[0237] Compounds described herein are also useful as lead compounds for identifying other more potent or selective compounds. The mimetic compounds described herein are also potentially useful as inhibitors of the action of insulin and in the design of assay kits directed at identifying compounds capable of binding to the low affinity binding site for insulin on IR. The mimetic compounds are also potentially useful as inhibitors of the action of IGF and in the design of assay kits directed at identifying compounds capable of binding to the low affinity binding site for IGF on IGF-1R. In particular, it is envisaged that compounds described herein will prove particularly useful in selecting/designing ligands which are specific for IR or IGF-1R.
[0238] In one example, one or more of the compounds can be provided as components in a kit for identifying other ligands (e.g., small, organic molecules) that bind to IR or IGF-1R. Such kits may also comprise IR or IGF-1R, or functional fragments thereof. The compound and receptor components of the kit may be labeled (e.g. by radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes or other labels), or may be unlabeled and labelling reagents may be provided. The kits may also contain peripheral reagents such as buffers, stabilizers, etc. Instructions for use can also be provided.
[0239] IR and IGF-1R agonists and antagonists, and in particular antagonists, provided by this invention are potentially useful as therapeutics. For example, compounds are potentially useful as treatments for cancers, including, but not limited to, breast, prostate, colorectal, and ovarian cancers. Human and breast cancers are responsible for over 40,000 deaths per year, as present treatments such as surgery, chemotherapy, radiation therapy, and immunotherapy show limited success. Recent reports have shown that a previously identified IGF-1R antagonist can suppress retinal neovascularization, which causes diabetic retinopathy (Smith et al., 1999). IGF-1R agonist compounds (i.e. existing IGF-1R compounds which have been modified employing methods of the present invention) are useful for development as treatments for neurological disorders, including stroke and diabetic neuropathy. Reports of several different groups implicate IGF-1R in the reduction of global brain ischemia, and support the use of IGF-I for the treatment of diabetic neuropathy (reviewed in Auer et al., 1998; Apfel, 1999). A number of therapeutics directed against IGF-1R are currently undergoing clinical trial as anti-cancer agents (Hewish et al., 2009).
[0240] The IGF-1R agonist peptides described herein may be useful for enhancing the survival of cells and/or blocking apoptosis in cells.
Administration
[0241] Compounds described herein, i.e. ligands or modulators of IR and/or IGF-1R identified or identifiable by the screening methods of the invention, may preferably be combined with various components to produce the compositions described herein. Preferably the compositions are combined with a pharmaceutically acceptable carrier or diluent to produce a pharmaceutical composition (which may be for human or animal use).
[0242] The formulation will depend upon the nature of the compound and the route of administration but typically they can be formulated for topical, parenteral, intramuscular, oral, intravenous, intra-peritoneal, intranasal inhalation, lung inhalation, intradermal or intra-articular administration. The compound may be used in an injectable form. It may therefore be mixed with any vehicle which is pharmaceutically acceptable for an injectable formulation, preferably for a direct injection at the site to be treated, although it may be administered systemically.
[0243] The pharmaceutically acceptable carrier or diluent may be, for example, sterile isotonic saline solutions, or other isotonic solutions such as phosphate-buffered saline. The compounds may be admixed with any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). It is also preferred to formulate the compound in an orally active form.
[0244] In general, a therapeutically effective daily oral or intravenous dose of the compounds, including compounds and their salts, is likely to range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The compounds and their salts may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr.
[0245] Tablets or capsules of the compounds may be administered singly or two or more at a time, as appropriate. It is also possible to administer the compounds in sustained release formulations.
[0246] Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient. The above dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited.
[0247] For some applications, the compositions are administered orally in the form of tablets containing excipients such as starch or lactose, or in capsules or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents.
[0248] The compositions (as well as the compounds alone) can also be injected parenterally, for example intravenously, intramuscularly or subcutaneously. In this case, the compositions will comprise a suitable carrier or diluent.
[0249] For parenteral administration, the compositions are best used in the form of a sterile aqueous solution which may contain other substances, for example enough salts or monosaccharides to make the solution isotonic with blood.
[0250] For buccal or sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated in a conventional manner.
[0251] For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the daily dosage level of the compounds and their pharmaceutically acceptable salts and solvates may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of example, tablets or capsules may contain from 5 to 100 mg of active compound for administration singly, or two or more at a time, as appropriate. As indicated above, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight and response of the particular patient.
[0252] The routes of administration and dosages described are intended only as a guide since a skilled practitioner will be able to determine readily the optimum route of administration and dosage for any particular patient depending on, for example, the age, weight and condition of the patient.
Examples
Example 1: A thermodynamic study of ligand binding to the first three domains of the human insulin receptor (IR): relationship between the IR α-chain C-terminal peptide (αCT) and the Site 1 insulin mimetic peptides
1.1 Introduction
[0253] In order to study ligand binding to the first three domains of the IR ectodomain, isothermal titration calorimetry (ITC) was used. ITC allowed a direct assay of the interactions between the IR ectodomain classical αCT peptide (residues 704-719; SEQ ID NO: 11) and IR485 (a construct consisting of the first three N-terminal domains of the IR; SEQ ID NO: 10), as well as the interaction between the analogous peptide of human IGF-1R and IR485. At the same time the thermodynamics of binding of the N- and C-terminal segments of the insulin mimetic peptide S519 (Schaffer et al., 2003) to IR485, as well as the binding of S519 itself to IR485, were examined. S519 (SLEEEWAQVECEVYGRGCPSGSLDESFYDWFERQLG; SEQ ID NO: 16) is a 36-residue peptide resulting from the affinity-optimization of two covalently-linked peptides, S371, a so-called "Site 1" peptide, and S446, a so-called "Site 2" peptide (Pillutla et al., 2002). S519 binds human IR with Kd = 2.0 × 10⁻¹¹ (Schaffer et al., 2003). Taken together, the results revealed a remarkable relationship between the IR αCT peptide and the Site 1 mimetic peptide, and suggested a previously undetected structural similarity between the mimetic peptides and insulin itself (Ward and Lawrence, 2009). Finally, the binding to IR485 of the IR αCT peptide carrying a F714A mutation (residues 704-719, TFEDYLHNVVAVPRPS; SEQ ID NO: 12) was examined.
1.2 Materials and methods for thermodynamic experiments
Reagents
[0254] The IR485 construct of human IR was expressed in Lec8 mutant CHO cells and purified by gel filtration chromatography as described previously (Lou et al., 2006). IR485 consists of the first 485 residues of the mature human insulin receptor followed by the 16-residue sequence SDDDDKEQKLISEEDLN (SEQ ID NO: 10), which comprises a serine residue followed by an enterokinase cleavage site and a c-myc epitope tag. The c-myc tag was not removed for the ITC studies described here. IR485 was concentrated to 30 mg/ml in Tris-buffered saline (TBSA; 24.8 mM Tris-HCl (pH 8.0), 137 mM NaCl, 2.7 mM KCl and 0.02% sodium azide) using an Ultrafree centrifugal concentrator (Millipore, USA). Porcine insulin (Sigma-Aldrich, USA) was prepared in a zinc-free form (termed ZFP-insulin) by extensive dialysis against 0.1% (v/v) acetic acid followed by lyophilization. Human IGF-I was obtained from Novozymes GroPep (Australia) as "receptor grade" material.
[0255] The IR classical αCT peptide (residues 704-719, TFEDYLHNVVFVPRPS; SEQ ID NO: 11), the IGF-1R classical αCT peptide (residues 691-706, VFENFLHNSIFVPRPE; SEQ ID NO: 14) as well as the F714A mutant of IR αCT (denoted IR αCT.714A; SEQ ID NO: 12) were obtained from Genscript Corporation (USA) at the >98% level of purity. The S519N20 peptide (SLEEEWAQVECEVYGRGCPS; SEQ ID NO: 17) and the S519C16 peptide (GSLDESFYDWFERQLG; SEQ ID NO: 18) were obtained from AusPep (Australia) at the >90% level of purity. The S519 peptide (SLEEEWAQVECEVYGRGCPSGSLDESFYDWFERQLG; SEQ ID NO: 16) was obtained from Activotec (UK) at the >85% level of purity. Peptides were prepared by dissolving in 10 mM HCl at ~4 mg/ml concentration and then diluting with TBSA. Oxidation of peptides S519N20 and S519 (which contain two cysteine residues) was carried out by incubating the respective peptide (prepared at 1 mg/ml in 50 mM ammonium bicarbonate adjusted to pH 8.5 with ammonia solution) in the dark at room temperature for 2 days and then lyophilizing before use. Oxidation was complete as determined by analysis with Ellman's reagent (5,5'-dithiobis-2-nitrobenzoate). All protein concentrations were determined by absorbance measurement at 280 nm using a NanoDrop 1000 spectrophotometer (Thermo Scientific, USA).
Isothermal Titration Calorimetry
[0256] ITC experiments were performed using a VP-ITC isothermal titration calorimeter (MicroCal Inc., USA) with the calorimeter cell held at 25 °C. All samples were degassed prior to injection or placement into the cell, and the instrument was temperature equilibrated prior to the start of the injections. In all experiments the volume of sample placed in the cell was 1.4 ml and the titrant was injected in 7 μl volumes over 14 s at 3 min intervals, with the total number of injections being 40. The sample contents were stirred at a speed of 310 rpm over the duration of the titration. All titrants (i.e. ZFP-insulin, IGF-I, IR αCT peptide, IGF-1R αCT peptide, IR αCT.714A peptide, S519, S519N20, and S519C16) were first injected into a solution of TBSA alone in order to ascertain the heat of dilution, which was then subtracted from the data of interest as appropriate. Data were analyzed using the instrument's software incorporated within the Origin 7 software (OriginLab, USA) and in all cases fitted as a single-site interaction using the methodologies outlined in the instrument's manual. All measurements showing quantifiable interaction were done in triplicate (in some cases at varying concentrations) and the resultant ITC-derived thermodynamic parameters averaged.
Dynamic light scattering (DLS)
[0257] DLS measurements were carried out using a Zetasizer NanoZS (Malvern Instruments Ltd, England). Samples of IR485 and IR485 in combination with IR αCT and/or ZFP-insulin at various concentrations were prepared in TBSA and equilibrated overnight at 4 °C. Samples were spun at 13,000 g using a benchtop centrifuge to pellet any macroscopic particulates and then pipetted into a 45 μl glass cuvette held at 20 °C. Data were analysed using the instrument's Dispersion Technology Software version 5.0.2 to yield a volume distribution for the scattering particles present in the solution. The results presented were representative of several sets of experiments.
1.3 Results from thermodynamic experiments
The pairwise interactions of IR485, hormones and αCT peptides
[0258] ITC experiments investigated the pairwise interaction between selected combinations of IR485, ZFP-insulin, IGF-I, IR αCT peptide, IGF-1R αCT peptide and IR αCT.714A peptide. The results of the analyses are presented in Table 2. The mean dissociation constants (Kd) of the interaction between (i) IR αCT and IR485, (ii) IGF-1R αCT and IR485, and (iii) IR αCT.714A and IR485 were determined to be 3.9 ± 1.1 μM, 17.6 to μM and 6.5 ± 1.7 μM respectively. Representative ITC profiles for these measurements are presented in Figure 2. The dissociation constant (Kd) for the interactions between (i) ZFP-insulin and the IR αCT peptide, (ii) IGF-I and the IGF-1R αCT peptide, (iii) ZFP-insulin and IR485 and (iv) IGF-I and IR485 was in all cases estimated to be weaker than 1 mM.
Table 2 - ITC analysis of pairwise interactions of IR485, hormone and αCT peptides.
The interaction of hormone with αCT peptide pre-complexed IR485
[0259] ITC experiments investigated the interaction of ZFP-insulin and IGF-I with IR485 pre-complexed with a ten-fold molar ratio of either IR αCT or IGF-1R αCT peptide. Given that the αCT peptides displayed micromolar affinity for IR485 (see above), it was calculated that, at the concentrations of IR485 and αCT peptide employed in this set of experiments, >90% of the IR485 molecules within the ITC cell were in complex with αCT peptide prior to injection of hormone. The results of the ITC investigations are presented in Table 3 with representative ITC curves being presented in Figure 3. The dissociation constant of ZFP-insulin with respect to IR485 in the presence of a 10-fold molar ratio of IR αCT peptide was 17 ± 4 nM and in the presence of a 10-fold molar ratio of IGF-1R αCT peptide was 5.7 ± 1.1 nM. In contrast, the dissociation constant of IGF-I with respect to IR485 in the presence of a 10-fold excess of IR αCT peptide was 490 ± 75 nM and in the presence of a 10-fold excess of IGF-1R αCT peptide was 22 ± 3 nM, with no correction being made for the incomplete saturation of IR485 with peptide prior to injection. The thermodynamic parameters presented in Table 3 show that in all four cases the binding of hormone to αCT peptide pre-complexed IR485 is enthalpically driven.
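The >90% saturation estimate can be reproduced in outline with the standard expression for 1:1 binding, in which the complex concentration is the smaller root of a quadratic in the total receptor and peptide concentrations. The sketch below is illustrative only: the function name is hypothetical, and the 10 μM cell concentration of IR485 is an assumed value, since the concentrations actually used are not given in this passage. The Kd of 3.9 μM is the value reported for IR αCT in the pairwise experiments.

```python
import math


def fraction_receptor_bound(receptor_total, peptide_total, kd):
    """Fraction of receptor in complex for simple 1:1 binding.

    All three arguments must be in the same concentration units.
    """
    s = receptor_total + peptide_total + kd
    complex_conc = (s - math.sqrt(s * s - 4.0 * receptor_total * peptide_total)) / 2.0
    return complex_conc / receptor_total


# Assumed 10 uM IR485 with a ten-fold molar ratio of peptide (100 uM);
# Kd = 3.9 uM as reported for IR aCT binding to IR485.
print(fraction_receptor_bound(10.0, 100.0, 3.9))
```

Under these assumed concentrations the bound fraction exceeds 0.9; at lower receptor concentrations the same ten-fold ratio would give a lower fraction, which is why the actual cell concentrations matter for the estimate.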
Table 3 - Isothermal titration of ZFP-insulin and IGF-I against IR485 pre-mixed with a 10-fold molar ratio of either IR or IGF-1R αCT peptide.
The interaction of insulin mimetic peptides with IR485
[0260] ITC experiments investigated the interaction of the insulin mimetic peptides S519 (SEQ ID NO: 16), S519N20 (a Site 2 peptide, corresponding to the 20 N-terminal residues of S519; SEQ ID NO: 17) and S519C16 (a Site 1 peptide, corresponding to the 16 C-terminal residues of S519; SEQ ID NO: 18) with IR485. The results are presented in Table 4, with representative ITC curves being presented in Figure 4. S519 was found to bind IR485 with a dissociation constant of 11 ± 3 nM, whilst S519C16 bound IR485 with somewhat higher affinity (Kd = 2.6 ± 0.7 nM). The binding of both S519 and S519C16 to IR485 appeared to be enthalpically driven. When IR485 was in the presence of a 10-fold molar ratio of IR αCT peptide, the dissociation constant of S519C16 increased to 16 ± 6 nM. The dissociation constant for the interaction between S519N20 and IR485 was estimated to be weaker than 1 mM, with accurate determination of Kd precluded at the concentrations of reactants employed.
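For orientation, a dissociation constant can be converted to a standard free energy of binding through the general relation ΔG° = RT ln(Kd), with Kd expressed relative to a 1 M standard state. The sketch below applies this relation to the two Kd values quoted above at the 25 °C cell temperature used for ITC. It is a generic thermodynamic conversion, not the instrument software's analysis, and the function name is hypothetical.

```python
import math

R_KCAL = 1.98720e-3  # gas constant in kcal/(mol*K)


def binding_free_energy(kd_molar, temp_c=25.0):
    """Standard free energy of binding (kcal/mol) from Kd (M), 1 M standard state."""
    return R_KCAL * (temp_c + 273.15) * math.log(kd_molar)


# Kd values quoted in the text for binding to IR485.
for name, kd in (("S519", 11e-9), ("S519C16", 2.6e-9)):
    print(name, round(binding_free_energy(kd), 2))
```

A tighter interaction (smaller Kd) gives a more negative ΔG°, so the roughly four-fold difference in Kd between S519 and S519C16 corresponds to a difference of less than 1 kcal/mol.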
Table 4 - Isothermal titration of insulin mimetic peptides S519, S519N20 and S519C16 against IR485.
The multimeric state oflR485 in the presence of IR aCTpeptide [0261] The DLS-determined volume distribution of IR485 at 6 mg/ml (a concentration at which IR485 is overwhelmingly dimeric (Lou et al., 2006) showed a single broad peak centred at a particle diameter of 9.7 nm (Figure 5a). The half-width of the peak was 3.1 nm and the assumption of a spherical particle lead to a calculated scattering particle molecular weight of 136 kDa, closely similar to the molecular weight of -140 kDa estimated by size-exclusion chromatography. Instrumental limitations precluded DLS measurement of IR485 at concentrations at which IR485 was known to be overwhelmingly monomeric (i.e. at < 0.025 mg/ml, (Lou et al., 2006). However, at 0.5 mg/ml, the DLS-determined volume distribution of IR485 showed a single broad peak at a particle diameter of 8.2 nm (Figure 5b). The half-width of the peak was 2.2 nm and the assumption that the scattering arose from a single species of spherical scattering particle lead to a calculated molecular weight of 91 kDa, consistent with the solution being predominantly monomeric. Using this pair of observations as reference values, it was then observed that (i) an addition of a three-food molar ratio of IR aCT peptide to a 6 mg/ml solution of IR485 resulted in a DLS-determined volume distribution which had a single peak centred at 8.6 nm (Figure 5c), and (ii) an addition of a three-fold molar ratio of IR aCT peptide together with a 2-fold molar ratio of ZFP-insulin to a 6 mg/ml solution of IR485 resulted in a volume distribution which had a single peak centred at 8.2 nm (Figure 5d). The half-width of these latter two peaks was 2.4 nm and the calculated molecular weights of the scattering particles (again assuming a single species of spherical scatterers) was 102 kDa and 92 kDa respectively. 
These data implied that addition of either (i) IR αCT peptide or (ii) IR αCT peptide plus ZFP-insulin to a 6 mg/ml solution of IR485 resulted in a change in its hydrodynamic diameter consistent with the construct undergoing a transition from being overwhelmingly dimeric in solution to predominantly monomeric in solution.
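The use of the two reference measurements can be sketched numerically. The text does not state how the instrument converts a diameter into a molecular weight, so the example below simply assumes a power law, MW ∝ d^k, fitted through the two reference points (9.7 nm, 136 kDa) and (8.2 nm, 91 kDa). The power-law form and all names are assumptions made for illustration.

```python
import math

# Reference observations taken from the text: (diameter in nm, molecular weight in kDa)
DIMER_REF = (9.7, 136.0)     # IR485 at 6 mg/ml
MONOMER_REF = (8.2, 91.0)    # IR485 at 0.5 mg/ml

# Exponent of the assumed power law MW = c * d**k through both reference points
EXPONENT = math.log(DIMER_REF[1] / MONOMER_REF[1]) / math.log(DIMER_REF[0] / MONOMER_REF[0])

def estimate_mw(diameter_nm):
    """Interpolated molecular weight (kDa) for a measured DLS peak diameter (nm)."""
    d_ref, mw_ref = MONOMER_REF
    return mw_ref * (diameter_nm / d_ref) ** EXPONENT

for label, d in [("IR485 + aCT peptide", 8.6), ("IR485 + aCT peptide + ZFP-insulin", 8.2)]:
    print(f"{label}: d = {d} nm, estimated MW = {estimate_mw(d):.0f} kDa")
```

Under this assumption the 8.6 nm peak interpolates to a value close to the 102 kDa reported, which is nearer the monomer reference than the dimer reference.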
Example 2: Solving the crystal structure of the C-terminal region of the α-chain of IR
2.1 Introduction: ambiguous electron density
[0262] As described previously (WO 07/147213), an area of strand-like ambiguous electron density was present near the L1-β2 face of IR. However, despite numerous different processing and refinement protocols, the density was impossible to interpret in terms of any peptide sequence. The data described in Example 1 above implied a surprising and previously unexpected sequence relationship between the S519C16 peptide (SEQ ID NO: 18) and the 'classical' αCT peptide region (residues 704-719; SEQ ID NO: 11) described previously in the literature (Kurose et al., 1994). X-ray data obtained in WO 07/147213 were therefore revisited and reviewed with the possibility in mind that the ambiguous electron density near the L1-β2 face of IR could be due to the 'classical' αCT peptide region of the IR α-chain.
2.2 Protein production, crystallisation and data collection from IR ectodomain crystals
[0263] Production of IR ectodomain protein, subsequent crystallisation and data collection were performed as described previously (WO 07/147213).
2.3 Diffraction data processing and crystallographic refinement
[0264] The diffraction images used were those used in the native structure determination of the human insulin receptor ectodomain homodimers ("Native 2" data set, see Table 3 of WO 07/147213). The diffraction images were re-processed using XDS (version 10-September-2008; Kabsch, 1993), including reflections to a maximum resolution of 3.8 Å. Diffraction data processing statistics from XDS are shown in Table 5. A difference of about 1 Å in the longest cell dimension was observed compared with the earlier processing using the d*TREK program (Pflugrath, 1999).
[0265] The receptor ectodomain monomer complexed with Fab 83-7 and Fab 83-14 (Protein Data Bank entry 2DTG, with minor prior in-house improvement) was subjected to individual domain rigid body crystallographic refinement against the XDS-processed diffraction data set using PHENIX v1.3b (Adams et al., 2002). A total of fourteen rigid body domains were defined for this purpose: (i) chain E, residues 4-191, (ii) chain E, residues 192-311, (iii) chain E, residues 312-464, (iv) chain E, residues 465-594, (v) chain E, residues 595-817, (vi) chain E, residues 818-909, (vii) chain A, residues 1-112, (viii) chain B, residues 1-109, (ix) chain A, residues 113-220, (x) chain B, residues 110-219, (xi) chain C, residues 1-109, (xii) chain C, residues 110-207, (xiii) chain D, residues 1-106, and (xiv) chain D, residues 107-214. Rigid body refinement was followed by atomic coordinate refinement and finally by TLS (translation, libration and screw-rotation displacement) refinement, using default protocols within PHENIX. A σA-weighted 2Fo-Fc difference electron density map was then calculated using structure factors whose amplitudes had been artificially inflated by the application of a B value (-153 Å²) equal to the negative of that determined by a Wilson plot (Brunger et al., 2009). The map was then visually inspected using O v12.0 (Jones, 2004). At this stage the segment of electron density previously discerned on the L1-β2 face of IR (WO 07/147213) had a surprisingly clear helical conformation with sufficient variation in side chain electron density to suggest that sequence assignment was possible (see below). O was then used for all subsequent model building of the IR segment 693-710, with model building being iterated with crystallographic refinement within PHENIX. Final crystallographic refinement statistics are shown in Table 5.
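The effect of applying a negative B value to the amplitudes can be illustrated with the usual isotropic temperature-factor expression, scale = exp(-B (sin θ/λ)²), where sin θ/λ = 1/(2d) for a reflection of spacing d. This is a minimal sketch assuming that standard form; the exact procedure and weighting used in the refinement software are not given in the text, and the function name is hypothetical.

```python
import math

B_SHARPEN = -153.0   # Å^2, the B value quoted in the text (negative of the Wilson B)

def amplitude_scale(d_spacing, b_value=B_SHARPEN):
    """Scale factor applied to a structure factor amplitude at resolution d (Å)."""
    sin_theta_over_lambda = 1.0 / (2.0 * d_spacing)
    return math.exp(-b_value * sin_theta_over_lambda ** 2)

# Low-resolution terms are barely changed; terms near the 3.8 Å limit are strongly up-weighted.
for d in (10.0, 6.0, 3.8):
    print(f"d = {d:4.1f} Å: amplitude scale = {amplitude_scale(d):.2f}")
```

Because the exponent grows as 1/d², the inflation is concentrated in the highest-resolution shells, which is why such sharpening can make side-chain features easier to discern in a low-resolution map.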
2.4 Assigning structure to the C-terminal region of the IR α-chain
[0266] Following the refinement protocol detailed above, sequence assignment to the C-terminal region of the IR α-chain was possible and based on the following observations: (i) the segment Glu698 to Tyr708 was the only region of the insert domain predicted to have helix-forming propensity (Ward and Lawrence, 2009), (ii) inspection of the density at the side-chain positions i, i+4 and i+7 showed that they likely arose from three large (most likely aromatic) residues, that these could in all likelihood be used to define the sequence register, and that the discerned helix would then span residues i-8 to i+9, (iii) the direction of the polypeptide chain within the helix was readily apparent from helical "tree" averaging of the difference electron density (Jones, 2004), and (iv) the only possible assignment of the sequence to the i, i+4, i+7 all-aromatic motif of (ii) above was to associate these residues with Phe701, Phe705 and Tyr708, respectively.
Table 5 - Data processing and crystallographic refinement statistics for the IR ectodomain (IRΔβ) crystal data processed using XDS.
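The register argument in section 2.4 can be checked against the 18-residue sequence given for IR residues 693-710 (SEQ ID NO: 13). The sketch below scans that sequence for positions i, i+4 and i+7 that are all aromatic. Treating only Phe, Tyr and Trp as aromatic is an assumption, and the scan covers only this segment rather than the whole insert domain considered in the text.

```python
AROMATIC = set("FYW")                  # assumption: His is not counted as aromatic
SEGMENT = "LKELEESSFRKTFEDYLH"         # IR residues 693-710 (SEQ ID NO: 13)
FIRST_RESIDUE = 693

def aromatic_registers(seq, first_residue, offsets=(0, 4, 7)):
    """Return every (i, i+4, i+7) triple in which all three residues are aromatic."""
    hits = []
    for i in range(len(seq) - max(offsets)):
        if all(seq[i + k] in AROMATIC for k in offsets):
            hits.append(tuple(f"{seq[i + k]}{first_residue + i + k}" for k in offsets))
    return hits

print(aromatic_registers(SEGMENT, FIRST_RESIDUE))
```

Within this segment the scan finds a single triple, corresponding to Phe701, Phe705 and Tyr708, in line with observation (iv).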
[0267] The observed helical region of density spanned IR residues 693 to 710 (SEQ ID NO: 13; Figure 6). The direction of the polypeptide was consistent with having the inter-monomer disulphide bond-forming triplet Cys682 / Cys683 / Cys685 (Sparrow et al., 1997) in close proximity to the ectodomain two-fold axis (McKern et al., 2006). Subsequent model-building of insulin receptor residues 693-710 (SEQ ID NO: 13) into the difference electron density further supported the correctness of the sequence register assignment by virtue of the shape and electrostatic complementarity of the packing at the interface between L1 and the helical segment.
[0268] The side-chains of Phe701 and Phe705 pack adjacent to each other into a hydrophobic pocket formed by the side-chains of the L1 residues Phe64, Phe88, Phe96, Tyr91 and Arg118 (Figure 6b). The side-chain of Tyr708 is packed approximately parallel to the surface and interacts with the L1 residues Leu62, Gln34 and Phe64. The side-chains of the residue pair Glu698 and Arg702 lie in close proximity and are juxtaposed against the side chains of the L1 residue pair Arg118 and Asp120, respectively, the four side chains forming a symmetric charge-compensating cluster. The final interaction between the helix and the surface of the central β-sheet of L1 arises from an interaction between the side chain of Leu709 and the side chains of the L1 residues Leu37 and Phe64. On the opposite surface of the helix, the side chains of the residue pair Lys703 and Asp707 are in proximity to each other and also likely charge compensate. Further crystallographic refinement of the (IR + Fab 83-7 + Fab 83-14) structure, now inclusive of IR residues 693-710, led to a reduction of 0.5% in the free R-factor. The shape complementarity (Lawrence and Colman, 1993) of the interface between the insulin receptor residues 693-710 and the L1 domain of the receptor, computed after crystallographic refinement, is high (0.72). Taken together, these results gave overwhelming support to the correctness of the assignment of sequence to the helical density segment.
[0269] The structure provided herein (Appendix I) enables, for the first time, a view of the intact low affinity insulin receptor binding site that includes the critically-important C-terminal region of the receptor α-chain (SEQ ID NO: 13). The atomic coordinates of IR + Fab 83-7 + Fab 83-14 inclusive of the helical segment are now included in Appendix I and are depicted in Figure 6. The modelled helical segment of the IR α-chain (residues 693-710, herein termed 'the C-terminal region of the α-chain of IR'; LKELEESSFRKTFEDYLH; SEQ ID NO: 13) surprisingly encompasses residues N-terminal of the 'classical' αCT peptide of IR (residues 704-719; TFEDYLHNVVFVPRPS; SEQ ID NO: 11) described previously in the literature (Kurose et al., 1994; Figure 6c). The original demarcation of this segment arose from a tryptic digest aimed at isolating receptor segments that were experimentally cross-linked to bound insulin. Tryptic cleavage at Lys703 resulted in the isolation of the segment 704-719 cross-linked to insulin. The involvement of residues immediately N-terminal of 704 in both the formation of the low affinity insulin binding site and in attachment of the α-chain C-terminus to the first three domains of the receptor had not been contemplated previously.
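Why a tryptic digest yields a fragment beginning at residue 704 can be sketched with the common simplified rule that trypsin cleaves after Lys or Arg unless the next residue is Pro. The example is applied only to the 693-710 segment (SEQ ID NO: 13), so the last fragment stops at residue 710 rather than continuing to 719. The cleavage rule is a simplification and the function name is hypothetical.

```python
SEGMENT = "LKELEESSFRKTFEDYLH"   # IR residues 693-710 (SEQ ID NO: 13)
FIRST_RESIDUE = 693

def tryptic_fragments(seq, first_residue):
    """Split seq after K or R (not before P); return (start, end, sequence) tuples."""
    fragments, start = [], 0
    for i, residue in enumerate(seq):
        is_last = i == len(seq) - 1
        if is_last or (residue in "KR" and seq[i + 1] != "P"):
            fragments.append((first_residue + start, first_residue + i, seq[start:i + 1]))
            start = i + 1
    return fragments

for start, end, peptide in tryptic_fragments(SEGMENT, FIRST_RESIDUE):
    print(f"{start}-{end}: {peptide}")
```

Under this rule, cleavage after Lys703 separates Phe701 and Arg702 from the fragment that starts at Thr704, which is consistent with the 'classical' peptide lacking the residues immediately N-terminal of 704.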
Example 3: Molecular modelling
3.1 Introduction
[0270] There is a high level of sequence identity between, on the one hand, the L1 domains of IGF-1R and IR, and, on the other hand, between the C-terminal regions of the α-chain of IGF-1R and IR (Figure 1 and Figure 6c). Accordingly, models of IR in complex with S519C16 and the IGF-1R α-chain residues 681-697, respectively, were constructed using the MODELLER program (Sali and Blundell, 1993) with the crystallographic structure of IR ectodomain presented in the main text as a template. Models of the ectodomain of IGF-1R in complex, respectively, with the IGF-1R α-chain residues 681-697 (SEQ ID NO: 15), S519C16 (SEQ ID NO: 18) and the IR α-chain residues 693-710 (SEQ ID NO: 13) were constructed employing the crystal structure of the first three domains of IGF-1R (Garrett et al., 1998), the structure of the IR ectodomain presented here and the known sequence relationship between IR and IGF-1R (Adams et al., 2000).
3.2 Materials and methods for modelling
[0271] Twenty-five instances of each model were prepared and the structure with the lowest DOPE score (Eramian et al., 2006) was selected for further molecular dynamics (MD) simulation. The L1 domains of IR and IGF-1R, residues Pro4-Gln189 and residues Glu1-Gln189, respectively, in complex with the various peptides were excised from the full-length models prepared by MODELLER for use in the MD calculations. MD simulations were performed using the GROMACS v4.0 suite (van der Spoel et al., 2005) with the OPLS-aa force field (Jorgensen and Tirado-Rives, 1988). The proteins were solvated in a box of water and the total charge of the system neutralized by replacing water molecules with sodium ions. The LINCS algorithm was used to constrain bond lengths (Hess et al., 1977). Protein and solvent (including ions) were coupled separately to a thermal bath at 300 K using velocity rescaling (Bussi et al., 2007) applied with a coupling time of 0.1 ps.
All simulations were performed with a single non-bonded cutoff of 10 Å, applying a neighbour-list update frequency of 10 steps (20 fs). The particle mesh Ewald method was applied to deal with long-range electrostatics with a grid width of 1.2 Å and fourth-order spline interpolation. All simulations consisted of an initial minimization to remove close contacts, followed by 100 ps of positional restrained MD to equilibrate the water molecules with the protein fixed. The time step used in all the simulations was 2 fs. MD simulations for each system were run for a total length of 2.0 ns.
3.3 Molecular models
[0272] The following models were created using MODELLER and subjected to MD simulations as described above: the C-terminal region of IR α-chain (SEQ ID NO: 13) bound to the L1 domain of IGF-1R (Appendix II; Figure 7), the C-terminal region of IGF-1R α-chain (SEQ ID NO: 15) bound to the L1 domain of IR (Appendix III; Figure 8), the C-terminal region of IGF-1R α-chain (SEQ ID NO: 15) bound to the L1 domain of IGF-1R (Appendix IV; Figure 9), S519C16 mimetic peptide (SEQ ID NO: 18) bound to the L1 domain of IR (Appendix V; Figure 10), and S519C16 mimetic peptide (SEQ ID NO: 18) bound to the L1 domain of IGF-1R (Appendix VI; Figure 11).
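The run lengths quoted in section 3.2 translate into integration step counts in a straightforward way. The short sketch below does that arithmetic; the helper name is hypothetical, and whether the 2.0 ns total includes the 100 ps restrained phase is not stated in the text, so the two phases are simply reported separately.

```python
TIME_STEP_FS = 2.0            # integration time step from the text
NEIGHBOUR_LIST_STEPS = 10     # neighbour-list update frequency from the text

def steps_for(duration_ps, time_step_fs=TIME_STEP_FS):
    """Number of MD steps needed to cover duration_ps at the given time step."""
    return round(duration_ps * 1000.0 / time_step_fs)

restrained_steps = steps_for(100.0)      # 100 ps position-restrained equilibration
production_steps = steps_for(2000.0)     # 2.0 ns simulation length
neighbour_update_fs = NEIGHBOUR_LIST_STEPS * TIME_STEP_FS

print(f"restrained MD: {restrained_steps} steps")
print(f"2.0 ns run:    {production_steps} steps")
print(f"neighbour list rebuilt every {neighbour_update_fs:.0f} fs")
```

The 20 fs neighbour-list interval computed here agrees with the figure given in parentheses in the text.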
[0273] The above models are presented schematically in Figures 7 to 11, with coordinates in Appendixes II to VI, respectively. These models define a common binding surface on IR and IGF-1R capable of binding the C-terminal region of the α-chain of the receptors. The following interactions were observed:
(i) IGF-1R/IR α-chain residues 693-710 (Figure 7): This interaction is characterized by several polar and ionic interactions: Y83 of the receptor hydrogen bonds to the hydroxyl side-chains of S700 and T704 of the ligand, and E698 and R702 of the ligand form salt bridges with R112 and E114 of the receptor, respectively. Hydrophobic residues on the ligand pack into a hydrophobic pocket on the receptor formed in part by L32, L33, L56, F58, F82, Y83, Y85, V88 and F90.
(ii) IR/IGF-1R α-chain residues 681-697 (Figure 8): The complex between IR and the IGF-1R α-chain (residues 681-697) consists of charged interactions between R118 and E120 of IR, and E685 and Y688 of the peptide, respectively; the latter of these interactions can be mediated by a water molecule. There are a large number of hydrophobic residues on IR that contact the peptide, formed in part by L36, L37, L62, F64, F88, F89, Y91, V94 and F96.
(iii) IGF-1R/IGF-1R α-chain residues 681-697 (Figure 9): This interaction is characterized by a salt bridge between R112 of the receptor and E685 of the peptide. Additionally, the hydroxyl group on the side-chain of Y688 is hydrogen bonded to a water molecule (not shown) that is itself hydrogen bonded to E114 of the receptor. The hydrophobic residues Y688, F692, F695 and L696 pack into the hydrophobic pocket on the surface of the receptor formed in part by the side-chains of L32, L56, F58, F82, Y83, Y85, V88 and F90.
(iv) IR/S519C16 (Figure 10): Throughout the MD simulation of the complex between IR (L1 domain) and the S519C16 peptide several hydrogen-bond interactions are observed: between R119 of IR and D4 of the peptide, between E120 of IR and Y8 of the peptide, and between Q14 of the peptide and both R14 and Q34 of IR. The interaction between receptor and peptide involves the aromatic side-chains of several residues: F7 and F11 of the peptide bind a pocket on the surface of IR flanked by the residues L62, F64, F88, F89, Y91 and F96. Additionally, L15 of the peptide packs against the hydrophobic side-chains of L37 and F64.
(v) IGF-1R/S519C16 (Figure 11): D4 and Y8 of the peptide hydrogen bond to R112 and E114 of the receptor, respectively. The aromatic side-chains of F7 and F11 of the peptide bind the hydrophobic pocket on the receptor formed by L56, F58, F82, Y83, V88 and F90. W10 of the peptide can conceivably contact the hydrophobic side-chains of L32 and F82, increasing its affinity.
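The pocket residues listed for IGF-1R in item (i) and for IR in item (ii) can be lined up to show how the common binding surface corresponds between the two receptors. The sketch below pairs the two lists in the order in which they are given; that positional pairing is an assumption made for illustration and is not an alignment taken from the text.

```python
# Pocket residues as listed in items (ii) and (i) above
IR_POCKET = ["L36", "L37", "L62", "F64", "F88", "F89", "Y91", "V94", "F96"]
IGF1R_POCKET = ["L32", "L33", "L56", "F58", "F82", "Y83", "Y85", "V88", "F90"]

def compare_pockets(ir_residues, igf1r_residues):
    """Pair residues by list position; report numbering offset and residue identity."""
    rows = []
    for ir_res, igf_res in zip(ir_residues, igf1r_residues):
        offset = int(ir_res[1:]) - int(igf_res[1:])
        rows.append((ir_res, igf_res, offset, ir_res[0] == igf_res[0]))
    return rows

for ir_res, igf_res, offset, conserved in compare_pockets(IR_POCKET, IGF1R_POCKET):
    status = "conserved" if conserved else "differs"
    print(f"IR {ir_res:>4}  IGF-1R {igf_res:>4}  offset {offset}  {status}")
```

Paired this way, the numbering differs by 4 or 6 between the receptors and only one position changes residue type (Phe in IR, Tyr in IGF-1R), consistent with the statement that the models define a common binding surface.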
[0274] The models presented herein can now be used to improve the insulin mimetic peptides developed by Schaffer and coworkers (Schaffer et al., 2003). Isothermal titration calorimetry experiments (see Example 1 above) indicated that a prototypical Site 1 mimetic peptide S519C16 competes with the IR C-terminal peptide 704-719 in binding to a construct consisting of the first three domains of the insulin receptor. The two major residues involved in the interaction between the C-terminal segment of the insulin receptor α-chain and the L1 domain (viz Phe701 and Phe705) are conserved in S519C16 (Figure 3). Of the remaining residues involved in the interaction between the C-terminus of the α-chain and L1, Tyr708 is replaced with an asparagine residue in S519C16, Glu698 is either conserved or replaced by an aspartate residue, Arg702 is replaced by a tyrosine residue and Leu709 is conserved. The two phenylalanine residues form part of the motif FYXWF (SEQ ID NO: 19) that characterizes the Site 1 mimetic peptides (Schaffer et al., 2003). Modelling undertaken with MODELLER and subsequent MD showed that it is possible to dock the S519C16 peptide onto the L1 surfaces of IR and IGF-1R in a way analogous to the docking of cognate peptides (see Figures 6-11).
Example 4: Binding of mutant αCT peptides to an insulin mini-receptor (IR485 construct)
4.1 Introduction
[0275] Insulin-mimetic peptides have been discovered by phage display technology and classified as "Site" 1, 2 or 3 on the basis of competition of binding to insulin receptor (Pillutla et al., 2002). The affinity-matured Site 1 peptides are characterized by a FYXWF motif (SEQ ID NO: 19) (Pillutla et al., 2002); selected Site 1 and 2 peptides have been covalently tethered to yield agonists with up to picomolar affinity for insulin receptor (Schaffer et al., 2003). A sequence relationship has been shown (see Figure 6c) between the αCT region and the prototypic Site 1 mimetic peptide that places the αCT region residues Phe701 and Phe705 in respective alignment with the two flanking phenylalanine residues in the FYXWF motif. This relationship was used to explain the competitive binding of the αCT peptide and the prototypic Site 1 mimetic peptide to the insulin mini-receptor IR485, a construct which consists of the receptor L1-CR-L2 domains only. If this relationship were correct, then mutation of Arg702 and/or Thr704 within the αCT segment to either tyrosine or tryptophan would lead to significantly higher affinity of the segment for the insulin mini-receptor. It would also then be possible to model these substitutions directly onto the structure reported here. These hypotheses were tested as follows.
4.2 Materials and methods for thermodynamic experiments
[0276] Biotinylated αCT peptides at >75% purity spanning residues 698-719 of the insulin receptor were obtained from Genscript Inc. (USA). The insulin mini-receptor IR485 construct was produced and purified as previously described (Lou et al., 2006), omitting the final ion-exchange chromatography step. Isothermal titration calorimetry (ITC) experiments were performed using a VP-ITC isothermal titration calorimeter (MicroCal Inc., USA) with the calorimeter cell held at about 25 °C. The ITC cell contained insulin mini-receptor IR485 prepared at about 10 μM concentration in Tris-buffered saline plus azide (TBSA; about 24.8 mM Tris-HCl (pH 8.0), 137 mM NaCl, 2.7 mM KCl, and 0.02% sodium azide), and the syringe contained the peptide prepared at about 60 μM concentration in TBSA. All samples were degassed prior to injection or placement into the cell, and the instrument was temperature equilibrated prior to the start of the injections.
[0277] In all experiments the volume of the insulin mini-receptor IR485 sample placed in the cell was about 1.4 ml and the titrant was injected in about 7 μl volumes over 14 s at 3 min intervals, with the total number of injections being 40. The sample contents were stirred at a speed of about 310 rpm over the duration of the titration. All titrants were first injected into a solution of TBSA alone in order to ascertain the heat of dilution, which was then subtracted from the data of interest as appropriate. Each experiment was performed three times, except for that employing the native peptide, which was performed twice. Data were analyzed using the instrument's software incorporated within the Origin 7 software (OriginLab, USA) and in all cases fitted as a single-site interaction using the methodologies outlined in the instrument's manual.
4.3 Results from thermodynamic experiments
[0278] Sample individual titration curves are provided in Figure 12. Table 6 below presents ITC-derived dissociation constants and thermodynamic parameters for a set of mutant αCT peptides titrated against the insulin mini-receptor IR485.
Table 6 - Derived thermodynamic parameters for the titration against IR485 of the N-terminally biotinylated IR peptide 698-719 containing the following respective mutations: wild-type, T704Y, R702W, R702Y, T704W and R702Y / T704W.
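The titration set-up in section 4.2 fixes how far each experiment is taken past equimolarity. The sketch below works through the amounts using the approximate values from the text (about 1.4 ml of 10 μM receptor in the cell; 40 injections of about 7 μl of 60 μM peptide at 3 min intervals). It ignores the volume displaced from the cell during the titration, which is a simplifying assumption, and the variable names are illustrative.

```python
CELL_VOLUME_ML = 1.4        # receptor solution in the cell
CELL_CONC_UM = 10.0         # IR485 concentration
SYRINGE_CONC_UM = 60.0      # peptide concentration
INJECTION_UL = 7.0          # volume per injection
N_INJECTIONS = 40
INTERVAL_MIN = 3.0

receptor_nmol = CELL_CONC_UM * CELL_VOLUME_ML                      # uM x ml = nmol
injected_volume_ul = INJECTION_UL * N_INJECTIONS
peptide_nmol = SYRINGE_CONC_UM * injected_volume_ul / 1000.0        # uM x ul = pmol, then to nmol
final_molar_ratio = peptide_nmol / receptor_nmol
duration_min = N_INJECTIONS * INTERVAL_MIN

print(f"receptor in cell:   {receptor_nmol:.1f} nmol")
print(f"peptide injected:   {peptide_nmol:.1f} nmol in {injected_volume_ul:.0f} ul")
print(f"final molar ratio:  {final_molar_ratio:.2f} (peptide : receptor)")
print(f"titration duration: {duration_min:.0f} min")
```

With these nominal values the titration ends slightly beyond a 1:1 peptide-to-receptor ratio, which is the regime in which a single-site fit of the kind described can locate the equivalence point.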
[0279] Progressive inclusion of aromatic residues at positions 702 and 704 is seen to result in an up to 100-fold increase in affinity, supporting the view that there is a structural relationship between the Site 1 mimetic peptides and the native αCT segment. The docking of the single- and double-mutant αCT peptides to the surface of the L1 domain was investigated by molecular dynamics simulation using the GROMACS v4.0 suite (van der Spoel et al., 2005) with the OPLS-aa force field (Jorgensen and Tirado-Rives, 1988) and revealed that the aromatic side chains of mutant αCT residues Trp702 and Tyr704 are likely to interact with the surface of the L1 domain and enhance the affinity of the interaction. At position 702 the aromatic side chain is docked into a pocket formed by Phe96 and the alkyl portion of the L1 side-chain Lys121; the hydroxyl group of a variant tyrosine residue can in addition form a hydrogen bond with the Lys121 ε-amino group. At position 704 the aromatic side chain of the variants is docked against L1 side chains Phe88 and Phe89.
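A fold change in affinity corresponds to a change in binding free energy of ΔΔG = -RT ln(fold). The sketch below evaluates this for the up to 100-fold increase mentioned above, using the cell temperature of about 25 °C from section 4.2. It is an illustrative calculation rather than a value reported in Table 6, and the function name is hypothetical.

```python
import math

R = 8.314462618        # gas constant, J mol^-1 K^-1
CELL_TEMP_K = 298.15   # about 25 degrees C, as stated for the calorimeter cell

def ddg_from_fold_increase(fold, temperature=CELL_TEMP_K):
    """Change in binding free energy (kJ/mol) for a fold increase in affinity."""
    return -R * temperature * math.log(fold) / 1000.0

for fold in (10, 100):
    print(f"{fold}-fold tighter binding: ddG = {ddg_from_fold_increase(fold):.1f} kJ/mol")
```

Each 10-fold gain in affinity is worth the same free-energy increment, so a 100-fold gain corresponds to twice the 10-fold value.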
[0280] Any discussion of documents, acts, materials, devices, articles or the like which has been included in the present specification is solely for the purpose of providing a context for the present invention. It is not to be taken as an admission that any or all of these matters form part of the prior art base or were common general knowledge in the field relevant to the present invention as it existed before the priority date of each claim of this application.
References [0281]
Adams et al. (2000) Cell. Mol. Life Sci., 57, 1050-1093.
Adams et al. (2002) Acta Cryst. D58, 1948-54.
Apfel (1999) Am. J. Med., 107, 34S-42S.
Auer (1998) Neurology, 51, S39-S43.
Ausubel et al. (1999) Short Protocols in Molecular Biology, 4th Ed, John Wiley & Sons, Inc.; and the full version entitled Current Protocols in Molecular Biology.
Bailyes et al. (1997) Biochem. J., 327, 209-215.
Bartlett et al. (1989) Royal Chem. Soc., 78, 182-196.
Bentley (1997) Methods Enzymol., 276, 611-619.
Binz et al. (2005) Nature Biotech., 23, 1257-1268.
Blondelle and Houghten (1996) Trends Biotechnol., 14, 60-65.
Bohm and Stahl (1999) Med. Chem. Res., 9, 445.
Brooks et al. (1983) J. Comp. Chem., 4, 187-217.
Brunger et al. (1998) Acta Cryst. D54, 905-921.
Brunger (1997) Methods Enzymol., 276, 558-580.
Brunger et al. (2009) Acta Cryst., D65, 128-33.
Bruns et al. (1999) J Mol Biol., 288, 427-439.
Bussi et al. (2007) J. Chem. Phys., 126, 14101.
Buttel et al. (1999) Immunol. Cell Biol., 77, 256-262.
Carell et al. (1994a) Angew. Chem. Int. Ed. Engl., 33, 2059.
Carell et al. (1994b) Angew. Chem. Int. Ed. Engl., 33, 2061.
Cho et al. (1993) Science, 261, 1303.
Chow et al. (1998) J. Biol. Chem., 273, 4672-4680.
Clarke et al. (2000) Cancer Res., 60, 4804-4811.
Cohen et al. (1990) J. Med. Chem., 33, 883-894.
Cole et al. (2005) in "Virtual Screening in Drug Discovery (Eds. B. Shoichet, J. Alvarez)", Taylor & Francis CRC Press, Florida, USA.
Cull et al. (1992) Proc. Natl. Acad. Sci. USA, 89, 1865-1869.
Cwirla et al. (1990) Proc. Natl. Acad. Sci. USA, 97, 6378-6382.
Danial et al. (2008) Nat. Med. 14, 144-153.
Davis et al. (2006) Chem. Soc. Rev. 36, 326-334.
Day and Caflisch (2008) J. Chem. Inform. Model. 48, 679-90.
Degterev et al. (2001) Nat. Cell. Biol. 3, 173-182.
De Meyts (1994) Diabetologia, 37, S135-S148.
De Meyts and Whittaker (2002) Nat. Rev. Drug Discov., 1, 769-783.
De Meyts (2004) Bioessays, 26, 1351-1362.
Denley et al. (2003) Horm. Metab. Res., 35, 778-785.
Denley et al. (2004) Mol. Endocrinol., 18, 2502-2512.
Devlin (1990) Science, 249, 404-406.
DeWitt et al. (1993) Proc. Natl. Acad. Sci. USA, 90, 6909.
Eramian et al. (2006) Protein Sci. 15, 1653-66.
Erb et al. (1994) Proc. Natl. Acad. Sci. USA, 91, 11422.
Ernst et al. (2000) J. Magn. Reson. Imaging, 12, 859-865.
Ewing et al. (2001) J. Comput-Aid. Mol. Design, 15, 411.
Felici (1991) J. Mol. Biol., 222, 301-310.
Fodor (1993) Nature, 364, 555-556.
Friesner et al. (2004) J. Med. Chem. 47, 1739-1749.
Garrett et al. (1998) Nature, 394, 395-399.
Gallop et al. (1994) J. Med. Chem., 37, 1233.
Goodford (1985) J. Med. Chem., 28, 849-857.
Goodsell and Olsen (1990) Proteins: Struct. Funct. Genet., 8, 195-202.
Guida (1994) Curr. Opin. Struct. Biol., 4, 777-781.
Hess et al. (1977) J. Comp. Chem., 18, 1463-1472.
Hewish et al. (2009) Recent Patents Anticancer Drug Discov, 4, 54-72.
Houghten et al. (1991) Nature, 354, 84-86.
Houghten (1992) Biotechniques, 13, 412-421.
Jones et al. (1991) Acta Cryst. A47, 110-119.
Jones (2004) Acta Cryst. D60, 2115-25.
Jorgensen and Tirado-Rives (1988) J. Am. Chem. Soc., 110, 1657-1666.
Kabsch (1993) J. Appl. Cryst., 26, 795-800.
Kiselyov et al. (2009) Mol. Sys. Biol., 5, 243.
Kitamura et al. (2003) Annu. Rev. Physiol., 65, 313-332.
Kristensen et al. (2002) J. Biol. Chem., 277, 18340-18345.
Kuntz et al. (1982) J. Mol. Biol., 161, 269-288.
Kurose et al. (1994) J. Biol. Chem., 269, 29190-29197.
Lam et al. (1991) Nature, 354, 82-84.
Lam (1997) Anticancer Drug Des., 12, 145.
Lawrence and Colman (1993) J. Mol. Biol., 234, 946-950.
Lawrence et al. (2007) Curr. Opin. Struct. Biol., 17, 699-705.
Liu et al. (1993) Cell, 75, 59-72.
Lou et al. (2006) Proc. Natl. Acad. Sci. USA 103, 12429-12434.
Luo et al. (1999) Science, 285, 1077-1080.
Marsh et al. (1995) J. Cell Biol., 130, 1081-1091.
Martin (1992) J. Med. Chem., 35, 2145-2154.
McCoy (2007) Acta Cryst. D63, 32-41.
McKern et al. (2006) Nature, 443, 218-221.
Menting et al. (2009) Biochemistry (submitted).
Miranker and Karplus (1991) Proteins: Struct. Funct. Genet., 11, 29-34.
Moody et al. (1974) Horm. Metab. Res., 6, 12-16.
Morton and Myszka (1998) Methods Enzymol., 295, 268-294.
Navaza and Saludjian (1997) Methods Enzymol., 276, 581-594.
Navia and Murcko (1992) Curr. Opin. Struct. Biol., 2, 202-210.
Nice and Catimel (1999) Bioessays, 21, 339-352.
Olefsky (1978) Biochem. J., 172, 137-145.
Ottensmeyer et al. (2000) Biochemistry, 39, 12103-12112.
Ottensmeyer et al. (2001) Biochemistry, 40, 6988-6988.
Pflugrath (1999) Acta Cryst. D55, 1718-1725.
Pillutla et al. (2002) J. Biol. Chem., 277, 22590-22594.
Rarey et al. (1996) J. Mol. Biol., 261, 470.
Robinson and James (1992) Am. J. Physiol., 263, E383-E393.
Rossmann, ed. (1972) The Molecular Replacement Method, Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York.
Sali and Blundell (1993) J. Mol. Biol., 234, 779-815.
Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual, 3rd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
Schaffer (1994) Eur. J. Biochem., 221, 1127-1132.
Schaffer et al. (2003) Proc. Natl. Acad. Sci. USA, 100, 4435-4439.
Scott and Smith (1990) Science, 249, 386-390.
Silverman et al. (2005) Nature Biotech., 23, 1556-1561.
Smith et al. (1999) Nat. Med., 5, 1390-1395.
Songyang et al. (1993) Cell, 72, 767-778.
Sparrow et al. (1997) J. Biol. Chem., 272, 29460-29467.
Stumpp et al. (2007) Curr. Opin. Drug Discov. Develop., 10, 153-159.
Surinya et al. (2008) J. Biol. Chem. 283, 5355-5363.
Svenson et al. (2009) Mol. Pharm., published online 2 April 2009 (DOI: 10.1021/mp900057k).
Tong and Rossmann, (1997) Methods Enzymol., 276, 594-611.
Totrov and Abagyan (2008) Curr. Opin. Struct. Biol., 18, 178-184.
Tulloch et al. (1999) J. Struct. Biol., 125, 11-18.
Ullrich et al. (1985) Nature, 313, 756-761.
Ullrich et al. (1986) EMBO J., 5, 2503-2512.
Ulrich (2006) Handb Exp Pharmacol., 173, 305-326.
van der Spoel et al. (2005) J. Comp. Chem., 26, 1701-1718.
Wada et al. (2005) J Pharmacol Sci., 99, 128-143.
Ward et al. (2003) Insulin-like growth factors (LeRoith D., Zumkeller W. & Baxter R., eds), Eurekah.com and Kluwer Academic/Plenum Publishers, 1-21.
Ward and Lawrence (2009) BioEssays, 31, 422-434.
Weiner et al. (1984) J. Am. Chem. Soc., 106, 765-784.
Yin et al. (2005) Angew. Chem. Int. Ed. Engl., 44, 2704-2707.
Yip et al. (1988) Biochem. Biophys. Res. Commun., 157, 321-329.
Yip and Ottensmeyer (2003) J. Biol. Chem., 278, 27329-27332.
Zuckermann et al. (1994) J. Med. Chem., 37, 2678.
APPENDIX I: ATOMIC COORDINATES FOR IRΔβ MONOMER (CHAIN CONTAINING THE C-TERMINAL REGION OF THE IR α-CHAIN (CHAIN) WITH ATTACHED Fab 83-7 (CHAINS A AND B) AND Fab 83-14 (CHAINS C AND D)
[0282] Note: The coordinates in this Appendix describe the asymmetric unit of the crystallographic unit cell. The coordinates of the dimeric form of the above are generated by the application of the appropriate crystallographic two-fold operation, and these coordinates are hence included implicitly in this Appendix.
APPENDIX II: ATOMIC COORDINATES FOR THE MODEL OF THE C-TERMINAL REGION OF IR α-CHAIN BOUND TO IGF-1R ECTODOMAIN
[0283] The structural coordinates for the IR ectodomain containing the C-terminal region of the IR α-chain (Appendix I) were used to place a model of the C-terminal region of the IR α-chain in the IGF-1R low affinity binding site for IGF. This model is oriented relative to atomic coordinates found in Appendix I and may be used in conjunction with atomic coordinates of Appendixes I and III to VI to design compounds which bind to IR and/or IGF-1R.
APPENDIX III: ATOMIC COORDINATES FOR THE MODEL OF THE C-TERMINAL REGION OF IGF-1R α-CHAIN BOUND TO IR ECTODOMAIN
[0284] The structural coordinates in Appendixes I and II were used to model the C-terminal region of the IGF-1R α-chain in the low affinity binding site of IR. This model is oriented relative to atomic coordinates found in Appendixes I and II, and may be used in conjunction with atomic coordinates of Appendixes I to II, and IV to VI to design compounds which bind to IR and/or IGF-1R.
APPENDIX IV: ATOMIC COORDINATES FOR THE MODEL OF THE C-TERMINAL REGION OF IGF-1R α-CHAIN BOUND TO IGF-1R ECTODOMAIN
[0285] The structural coordinates in Appendixes I to III were used to place a model of the C-terminal region of the IGF-1R α-chain in the IGF-1R low affinity binding site for IGF. This model is oriented relative to atomic coordinates found in Appendixes I to III, and may be used in conjunction with atomic coordinates of Appendixes I to III and V to VI to design compounds which bind to IR and/or IGF-1R.
APPENDIX V: ATOMIC COORDINATES FOR THE MODEL OF THE INSULIN MIMETIC PEPTIDE, S519C16, BOUND TO IR ECTODOMAIN
[0286] The structural coordinates in Appendixes I and III were used to model the insulin mimetic peptide, S519C16, in the low affinity insulin binding site of IR. This model is oriented relative to atomic coordinates found in Appendixes I and IV, and may be used in conjunction with atomic coordinates of Appendixes I to IV and VI to design compounds which bind to IR and/or IGF-1R.
APPENDIX VI: ATOMIC COORDINATES FOR THE MODEL OF THE INSULIN MIMETIC PEPTIDE, S519C16, BOUND TO IGF-1R ECTODOMAIN
[0287] The structural coordinates in Appendixes II, IV and V were used to model the insulin mimetic peptide, S519C16, in the low affinity IGF binding site of IGF-1R. This model is oriented relative to atomic coordinates found in Appendixes II and IV, and may be used in conjunction with atomic coordinates of Appendixes I to V to design compounds which bind to IR and/or IGF-1R.
SEQUENCE LISTING
[0288]
<110> Walter and Eliza Hall Institute of Medical Research
<120> Structure of the C-terminal region of the insulin receptor alpha-chain and of the insulin-like growth factor receptor alpha-chain
<130> A/16/231
<150> 61/214.472
<151> 2009-04-22
<160> 19
<170> PatentIn version 3.5
<210> 1
<211> 917
<212> PRT
<213> Homo sapiens
<400> 1
His Leu Tyr Pro Gly Glu Val Cys Pro Gly Met Asp Ile Arg Asn Asn 1 5 10 15
Leu Thr Arg Leu His Glu Leu Glu Asn Cys Ser Val Ile Glu Gly His 20 25 30
Leu Gln Ile Leu Leu Met Phe Lys Thr Arg Pro Glu Asp Phe Arg Asp 35 40 45
Leu Ser Phe Pro Lys Leu Ile Met Ile Thr Asp Tyr Leu Leu Leu Phe 50 55 60
Arg Val Tyr Gly Leu Glu Ser Leu Lys Asp Leu Phe Pro Asn Leu Thr 65 70 75 80
Val Ile Arg Gly Ser Arg Leu Phe Phe Asn Tyr Ala Leu Val Ile Phe 85 90 95
Glu Met Val His Leu Lys Glu Leu Gly Leu Tyr Asn Leu Met Asn Ile 100 105 110
Thr Arg Gly Ser Val Arg Ile Glu Lys Asn Asn Glu Leu Cys Tyr Leu 115 120 125
Ala Thr Ile Asp Trp Ser Arg Ile Leu Asp Ser Val Glu Asp Asn His 130 135 140
Ile Val Leu Asn Lys Asp Asp Asn Glu Glu Cys Gly Asp Ile Cys Pro 145 150 155 160
Gly Thr Ala Lys Gly Lys Thr Asn Cys Pro Ala Thr Val Ile Asn Gly 165 170 175
Gln Phe Val Glu Arg Cys Trp Thr His Ser His Cys Gln Lys Val Cys 180 185 190
Pro Thr Ile Cys Lys Ser His Gly Cys Thr Ala Glu Gly Leu Cys Cys 195 200 205
His Ser Glu Cys Leu Gly Asn Cys Ser Gln Pro Asp Asp Pro Thr Lys 210 215 220
Cys Val Ala Cys Arg Asn Phe Tyr Leu Asp Gly Arg Cys Val Glu Thr 225 230 235 240
Cys Pro Pro Pro Tyr Tyr His Phe Gin Asp Trp Arg Cys Val Asn Phe 245 250 255
Ser Phe Cys Gin Asp Leu His His Lys Cys Lys Asn Ser Arg Arg Gin 260 265 270
Gly Cys His Gin Tyr Val Ile His Asn Asn Lys Cys Ile Pro Glu Cys 275 280 285
Pro Ser Gly Tyr Thr Met Asn Ser Ser Asn Leu Leu Cys Thr Pro Cys 290 295 300
Leu Gly Pro Cys Pro Lys Val Cys His Leu Leu Glu Gly Glu Lys Thr 305 310 315 320
Ile Asp Ser Val Thr Ser Ala Gln Glu Leu Arg Gly Cys Thr Val Ile 325 330 335
Asn Gly Ser Leu Ile Ile Asn Ile Arg Gly Gly Asn Asn Leu Ala Ala 340 345 350
Glu Leu Glu Ala Asn Leu Gly Leu Ile Glu Glu Ile Ser Gly Tyr Leu 355 360 365
Lys Ile Arg Arg Ser Tyr Ala Leu Val Ser Leu Ser Phe Phe Arg Lys 370 375 380
Leu Arg Leu Ile Arg Gly Glu Thr Leu Glu Ile Gly Asn Tyr Ser Phe 385 390 395 400
Tyr Ala Leu Asp Asn Gin Asn Leu Arg Gin Leu Trp Asp Trp Ser Lys 405 410 415
His Asn Leu Thr Ile Thr Gin Gly Lys Leu Phe Phe His Tyr Asn Pro 420 425 430
Lys Leu Cys Leu Ser Glu Ile His Lys Met Glu Glu Val Ser Gly Thr 435 440 445
Lys Gly Arg Gin Glu Arg Asn Asp Ile Ala Leu Lys Thr Asn Gly Asp 450 455 460
Gin Ala Ser Cys Glu Asn Glu Leu Leu Lys Phe Ser Tyr Ile Arg Thr 465 470 475 480
Ser Phe Asp Lys Ile Leu Leu Arg Trp Glu Pro Tyr Trp Pro Pro Asp 485 490 495
Phe Arg Asp Leu Leu Gly Phe Met Leu Phe Tyr Lys Glu Ala Pro Tyr 500 505 510
Gin Asn Val Thr Glu Phe Asp Gly Gin Asp Ala Cys Gly Ser Asn Ser 515 520 525
Trp Thr Val Val Asp Ile Asp Pro Pro Leu Arg Ser Asn Asp Pro Lys 530 535 540
Ser Gin Asn His Pro Gly Trp Leu Met Arg Gly Leu Lys Pro Trp Thr 545 550 555 560
Gin Tyr Ala Ile Phe Val Lys Thr Leu Val Thr Phe Ser Asp Glu Arg 565 570 575
Arg Thr Tyr Gly Ala Lys Ser Asp Ile Ile Tyr Val Gin Thr Asp Ala 580 585 590
Thr Asn Pro Ser Val Pro Leu Asp Pro Ile Ser Val Ser Asn Ser Ser 595 600 605
Ser Gin Ile Ile Leu Lys Trp Lys Pro Pro Ser Asp Pro Asn Gly Asn 610 615 620
Ile Thr His Tyr Leu Val Phe Trp Glu Arg Gin Ala Glu Asp Ser Glu 625 630 635 640
Leu Phe Glu Leu Asp Tyr Cys Leu Lys Gly Leu Lys Leu Pro Ser Arg 645 650 655
Thr Trp Ser Pro Pro Phe Glu Ser Glu Asp Ser Gin Lys His Asn Gin 660 665 670
Ser Glu Tyr Glu Asp Ser Ala Gly Glu Cys Cys Ser Cys Pro Lys Thr 675 680 685
Asp Ser Gin Ile Leu Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr 690 695 700
Phe Glu Asp Tyr Leu His Asn Val Val Phe Val Pro Arg Pro Ser Arg 705 710 715 720
Lys Arg Arg Ser Leu Gly Asp Val Gly Asn Val Thr Val Ala Val Pro 725 730 735
Thr Val Ala Ala Phe Pro Asn Thr Ser Ser Thr Ser Val Pro Thr Ser 740 745 750
Pro Glu Glu His Arg Pro Phe Glu Lys Val Val Asn Lys Glu Ser Leu 755 760 765
Val Ile Ser Gly Leu Arg His Phe Thr Gly Tyr Arg Ile Glu Leu Gin 770 775 780
Ala Cys Asn Gin Asp Thr Pro Glu Glu Arg Cys Ser Val Ala Ala Tyr 785 790 795 800
Val Ser Ala Arg Thr Met Pro Glu Ala Lys Ala Asp Asp Ile Val Gly 805 810 815
Pro Val Thr His Glu Ile Phe Glu Asn Asn Val Val His Leu Met Trp 820 825 830
Gin Glu Pro Lys Glu Pro Asn Gly Leu Ile Val Leu Tyr Glu Val Ser 835 840 845
Tyr Arg Arg Tyr Gly Asp Glu Glu Leu His Leu Cys Asp Thr Arg Lys 850 855 860
His Phe Ala Leu Glu Arg Gly Cys Arg Leu Arg Gly Leu Ser Pro Gly 865 870 875 880
Asn Tyr Ser Val Arg Ile Arg Ala Thr Ser Leu Ala Gly Asn Gly Ser 885 890 895
Trp Thr Glu Pro Thr Tyr Phe Tyr Val Thr Asp Tyr Leu Asp Val Pro 900 905 910
Ser Asn Ile Ala Lys 915 <210> 2 <211> 929
<212> PRT <213> Homo sapiens <400>2
His Leu Tyr Pro Gly Glu Val Cys Pro Gly Met Asp Ile Arg Asn Asn 1 5 10 15
Leu Thr Arg Leu His Glu Leu Glu Asn Cys Ser Val Ile Glu Gly His 20 25 30
Leu Gin Ile Leu Leu Met Phe Lys Thr Arg Pro Glu Asp Phe Arg Asp 35 40 45
Leu Ser Phe Pro Lys Leu Ile Met Ile Thr Asp Tyr Leu Leu Leu Phe 50 55 60
Arg Val Tyr Gly Leu Glu Ser Leu Lys Asp Leu Phe Pro Asn Leu Thr 65 70 75 80
Val Ile Arg Gly Ser Arg Leu Phe Phe Asn Tyr Ala Leu Val Ile Phe 85 90 95
Glu Met Val His Leu Lys Glu Leu Gly Leu Tyr Asn Leu Met Asn Ile 100 105 110
Thr Arg Gly Ser Val Arg Ile Glu Lys Asn Asn Glu Leu Cys Tyr Leu 115 120 125
Ala Thr Ile Asp Trp Ser Arg Ile Leu Asp Ser Val Glu Asp Asn His 130 135 140
Ile Val Leu Asn Lys Asp Asp Asn Glu Glu Cys Gly Asp Ile Cys Pro 145 150 155 160
Gly Thr Ala Lys Gly Lys Thr Asn Cys Pro Ala Thr Val Ile Asn Gly 165 170 175
Gin Phe Val Glu Arg Cys Trp Thr His Ser His Cys Gin Lys Val Cys 180 185 190
Pro Thr Ile Cys Lys Ser His Gly Cys Thr Ala Glu Gly Leu Cys Cys 195 200 205
His Ser Glu Cys Leu Gly Asn Cys Ser Gin Pro Asp Asp Pro Thr Lys 210 215 220
Cys Val Ala Cys Arg Asn Phe Tyr Leu Asp Gly Arg Cys Val Glu Thr 225 230 235 240
Cys Pro Pro Pro Tyr Tyr His Phe Gin Asp Trp Arg Cys Val Asn Phe 245 250 255
Ser Phe Cys Gin Asp Leu His His Lys Cys Lys Asn Ser Arg Arg Gin 260 265 270
Gly Cys His Gin Tyr Val Ile His Asn Asn Lys Cys Ile Pro Glu Cys 275 280 285
Pro Ser Gly Tyr Thr Met Asn Ser Ser Asn Leu Leu Cys Thr Pro Cys 290 295 300
Leu Gly Pro Cys Pro Lys Val Cys His Leu Leu Glu Gly Glu Lys Thr 305 310 315 320
Ile Asp Ser Val Thr Ser Ala Gln Glu Leu Arg Gly Cys Thr Val Ile 325 330 335
Asn Gly Ser Leu Ile Ile Asn Ile Arg Gly Gly Asn Asn Leu Ala Ala 340 345 350
Glu Leu Glu Ala Asn Leu Gly Leu Ile Glu Glu Ile Ser Gly Tyr Leu 355 360 365
Lys Ile Arg Arg Ser Tyr Ala Leu Val Ser Leu Ser Phe Phe Arg Lys 370 375 380
Leu Arg Leu Ile Arg Gly Glu Thr Leu Glu Ile Gly Asn Tyr Ser Phe 385 390 395 400
Tyr Ala Leu Asp Asn Gln Asn Leu Arg Gln Leu Trp Asp Trp Ser Lys 405 410 415
His Asn Leu Thr Ile Thr Gln Gly Lys Leu Phe Phe His Tyr Asn Pro 420 425 430
Lys Leu Cys Leu Ser Glu Ile His Lys Met Glu Glu Val Ser Gly Thr 435 440 445
Lys Gly Arg Gln Glu Arg Asn Asp Ile Ala Leu Lys Thr Asn Gly Asp 450 455 460
Gln Ala Ser Cys Glu Asn Glu Leu Leu Lys Phe Ser Tyr Ile Arg Thr 465 470 475 480
Ser Phe Asp Lys Ile Leu Leu Arg Trp Glu Pro Tyr Trp Pro Pro Asp 485 490 495
Phe Arg Asp Leu Leu Gly Phe Met Leu Phe Tyr Lys Glu Ala Pro Tyr 500 505 510
Gln Asn Val Thr Glu Phe Asp Gly Gln Asp Ala Cys Gly Ser Asn Ser 515 520 525
Trp Thr Val Val Asp Ile Asp Pro Pro Leu Arg Ser Asn Asp Pro Lys 530 535 540
Ser Gln Asn His Pro Gly Trp Leu Met Arg Gly Leu Lys Pro Trp Thr 545 550 555 560
Gln Tyr Ala Ile Phe Val Lys Thr Leu Val Thr Phe Ser Asp Glu Arg 565 570 575
Arg Thr Tyr Gly Ala Lys Ser Asp Ile Ile Tyr Val Gln Thr Asp Ala 580 585 590
Thr Asn Pro Ser Val Pro Leu Asp Pro Ile Ser Val Ser Asn Ser Ser 595 600 605
Ser Gln Ile Ile Leu Lys Trp Lys Pro Pro Ser Asp Pro Asn Gly Asn 610 615 620
Ile Thr His Tyr Leu Val Phe Trp Glu Arg Gln Ala Glu Asp Ser Glu 625 630 635 640
Leu Phe Glu Leu Asp Tyr Cys Leu Lys Gly Leu Lys Leu Pro Ser Arg 645 650 655
Thr Trp Ser Pro Pro Phe Glu Ser Glu Asp Ser Gin Lys His Asn Gin 660 665 670
Ser Glu Tyr Glu Asp Ser Ala Gly Glu Cys Cys Ser Cys Pro Lys Thr 675 680 685
Asp Ser Gin lie Leu Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr 690 695 700
Phe Glu Asp Tyr Leu His Asn Val Val Phe Val Pro Arg Lys Thr Ser 705 710 715 720
Ser Gly Thr Gly Ala Glu Asp Pro Arg Pro Ser Arg Lys Arg Arg Ser 725 730 735
Leu Gly Asp Val Gly Asn Val Thr Val Ala Val Pro Thr Val Ala Ala 740 745 750
Phe Pro Asn Thr Ser Ser Thr Ser Val Pro Thr Ser Pro Glu Glu His 755 760 765
Arg Pro Phe Glu Lys Val Val Asn Lys Glu Ser Leu Val lie Ser Gly 770 775 780
Leu Arg His Phe Thr Gly Tyr Arg lie Glu Leu Gin Ala Cys Asn Gin 785 790 795 800
Asp Thr Pro Glu Glu Arg Cys Ser Val Ala Ala Tyr Val Ser Ala Arg 805 810 815
Thr Met Pro Glu Ala Lys Ala Asp Asp lie Val Gly Pro Val Thr His 820 825 830
Glu lie Phe Glu Asn Asn Val Val His Leu Met Trp Gin Glu Pro Lys 835 840 845
Glu Pro Asn Gly Leu lie Val Leu Tyr Glu Val Ser Tyr Arg Arg Tyr 850 855 860
Gly Asp Glu Glu Leu His Leu Cys Asp Thr Arg Lys His Phe Ala Leu 865 870 875 880
Glu Arg Gly Cys Arg Leu Arg Gly Leu Ser Pro Gly Asn Tyr Ser Val 885 890 895
Arg lie Arg Ala Thr Ser Leu Ala Gly Asn Gly Ser Trp Thr Glu Pro 900 905 910
Thr Tyr Phe Tyr Val Thr Asp Tyr Leu Asp Val Pro Ser Asn lie Ala 915 920 925
Lys <210> 3 <211> 1372
<212> PRT <213> Mus musculus <400>3
Met Gly Phe Gly Arg Gly Cys Glu Thr Thr Ala Val Pro Leu Leu Val 1 5 10 15
Ala Val Ala Ala Leu Leu Val Gly Thr Ala Gly His Leu Tyr Pro Gly 20 25 30
Glu Val Cys Pro Gly Met Asp Ile Arg Asn Asn Leu Thr Arg Leu His 35 40 45
Glu Leu Glu Asn Cys Ser Val Ile Glu Gly His Leu Gln Ile Leu Leu 50 55 60
Met Phe Lys Thr Arg Pro Glu Asp Phe Arg Asp Leu Ser Phe Pro Lys 65 70 75 80
Leu Ile Met Ile Thr Asp Tyr Leu Leu Leu Phe Arg Val Tyr Gly Leu 85 90 95
Glu Ser Leu Lys Asp Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Ser 100 105 110
Arg Leu Phe Phe Asn Tyr Ala Leu Val Ile Phe Glu Met Val His Leu 115 120 125
Lys Glu Leu Gly Leu Tyr Asn Leu Met Asn lie Thr Arg Gly Ser Val 130 135 140
Arg lie Glu Lys Asn Asn Glu Leu Cys Tyr Leu Ala Thr lie Asp Trp 145 150 155 160
Ser Arg lie Leu Asp Ser Val Glu Asp Asn Tyr lie Val Leu Asn Lys 165 170 175
Asp Asp Asn Glu Glu Cys Gly Asp Val Cys Pro Gly Thr Ala Lys Gly 180 185 190
Lys Thr Asn Cys Pro Ala Thr Val lie Asn Gly Gin Phe Val Glu Arg 195 200 205
Cys Trp Thr His Ser His Cys Gin Lys Val Cys Pro Thr lie Cys Lys 210 215 220
Ser His Gly Cys Thr Ala Glu Gly Leu Cys Cys His Lys Glu Cys Leu 225 230 235 240
Gly Asn Cys Ser Glu Pro Asp Asp Pro Thr Lys Cys Val Ala Cys Arg 245 250 255
Asn Phe Tyr Leu Asp Gly Gin Cys Val Glu Thr Cys Pro Pro Pro Tyr 260 265 270
Tyr His Phe Gin Asp Trp Arg Cys Val Asn Phe Ser Phe Cys Gin Asp 275 280 285
Leu His Phe Lys Cys Arg Asn Ser Arg Lys Pro Gly Cys His Gin Tyr 290 295 300
Val lie His Asn Asn Lys Cys lie Pro Glu Cys Pro Ser Gly Tyr Thr 305 310 315 320
Met Asn Ser Ser Asn Leu Met Cys Thr Pro Cys Leu Gly Pro Cys Pro 325 330 335
Lys Val Cys Gin lie Leu Glu Gly Glu Lys Thr lie Asp Ser Val Thr 340 345 350
Ser Ala Gin Glu Leu Arg Gly Cys Thr Val Ile Asn Gly Ser Leu Ile 355 360 365
Ile Asn Ile Arg Gly Gly Asn Asn Leu Ala Ala Glu Leu Glu Ala Asn 370 375 380
Leu Gly Leu Ile Glu Glu Ile Ser Gly Phe Leu Lys Ile Arg Arg Ser 385 390 395 400
Tyr Ala Leu Val Ser Leu Ser Phe Phe Arg Lys Leu His Leu Ile Arg 405 410 415
Gly Glu Thr Leu Glu Ile Gly Asn Tyr Ser Phe Tyr Ala Leu Asp Asn 420 425 430
Gin Asn Leu Arg Gin Leu Trp Asp Trp Ser Lys His Asn Leu Thr Ile 435 440 445
Thr Gin Gly Lys Leu Phe Phe His Tyr Asn Pro Lys Leu Cys Leu Ser 450 455 460
Glu Ile His Lys Met Glu Glu Val Ser Gly Thr Lys Gly Arg Gin Glu 465 470 475 480
Arg Asn Asp Ile Ala Leu Lys Thr Asn Gly Asp Gin Ala Ser Cys Glu 485 490 495
Asn Glu Leu Leu Lys Phe Ser Phe Ile Arg Thr Ser Phe Asp Lys Ile 500 505 510
Leu Leu Arg Trp Glu Pro Tyr Trp Pro Pro Asp Phe Arg Asp Leu Leu 515 520 525
Gly Phe Met Leu Phe Tyr Lys Glu Ala Pro Tyr Gin Asn Val Thr Glu 530 535 540
Phe Asp Gly Gin Asp Ala Cys Gly Ser Asn Ser Trp Thr Val Val Asp 545 550 555 560
Ile Asp Pro Pro Gin Arg Ser Asn Asp Pro Lys Ser Gin Thr Pro Ser 565 570 575
His Pro Gly Trp Leu Met Arg Gly Leu Lys Pro Trp Thr Gin Tyr Ala 580 585 590
Ile Phe Val Lys Thr Leu Val Thr Phe Ser Asp Glu Arg Arg Thr Tyr 595 600 605
Gly Ala Lys Ser Asp Ile Ile Tyr Val Gin Thr Asp Ala Thr Asn Pro 610 615 620
Ser Val Pro Leu Asp Pro Ile Ser Val Ser Asn Ser Ser Ser Gin Ile 625 630 635 640
Ile Leu Lys Trp Lys Pro Pro Ser Asp Pro Asn Gly Asn Ile Thr His 645 650 655
Tyr Leu Val Tyr Trp Glu Arg Gin Ala Glu Asp Ser Glu Leu Phe Glu 660 665 670
Leu Asp Tyr Cys Leu Lys Gly Leu Lys Leu Pro Ser Arg Thr Trp Ser 675 680 685
Pro Pro Phe Glu Ser Asp Asp Ser Gin Lys His Asn Gin Ser Glu Tyr 690 695 700
Asp Asp Ser Ala Ser Glu Cys Cys Ser Cys Pro Lys Thr Asp Ser Gln 705 710 715 720
Ile Leu Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr Phe Glu Asp 725 730 735
Tyr Leu His Asn Val Val Phe Val Pro Arg Pro Ser Arg Lys Arg Arg 740 745 750
Ser Leu Glu Glu Val Gly Asn Val Thr Ala Thr Thr Leu Thr Leu Pro 755 760 765
Asp Phe Pro Asn Val Ser Ser Thr lie Val Pro Thr Ser Gin Glu Glu 770 775 780
His Arg Pro Phe Glu Lys Val Val Asn Lys Glu Ser Leu Val lie Ser 785 790 795 800
Gly Leu Arg His Phe Thr Gly Tyr Arg lie Glu Leu Gin Ala Cys Asn 805 810 815
Gin Asp Ser Pro Asp Glu Arg Cys Ser Val Ala Ala Tyr Val Ser Ala 820 825 830
Arg Thr Met Pro Glu Ala Lys Ala Asp Asp lie Val Gly Pro Val Thr 835 840 845
His Glu lie Phe Glu Asn Asn Val Val His Leu Met Trp Gin Glu Pro 850 855 860
Lys Glu Pro Asn Gly Leu lie Val Leu Tyr Glu Val Ser Tyr Arg Arg 865 870 875 880
Tyr Gly Asp Glu Glu Leu His Leu Cys Val Ser Arg Lys His Phe Ala 885 890 895
Leu Glu Arg Gly Cys Arg Leu Arg Gly Leu Ser Pro Gly Asn Tyr Ser 900 905 910
Val Arg Val Arg Ala Thr Ser Leu Ala Gly Asn Gly Ser Trp Thr Glu 915 920 925
Pro Thr Tyr Phe Tyr Val Thr Asp Tyr Leu Asp Val Pro Ser Asn lie 930 935 940
Ala Lys lie lie lie Gly Pro Leu lie Phe Val Phe Leu Phe Ser Val 945 950 955 960
Val lie Gly Ser lie Tyr Leu Phe Leu Arg Lys Arg Gin Pro Asp Gly 965 970 975
Pro Met Gly Pro Leu Tyr Ala Ser Ser Asn Pro Glu Tyr Leu Ser Ala 980 985 990
Ser Asp Val Phe Pro Ser Ser Val Tyr Val Pro Asp Glu Trp Glu Val 995 1000 1005
Pro Arg Glu Lys lie Thr Leu Leu Arg Glu Leu Gly Gin Gly Ser 1010 1015 1020
Phe Gly Met Val Tyr Glu Gly Asn Ala Lys Asp lie lie Lys Gly 1025 1030 1035
Glu Ala Glu Thr Arg Val Ala Val Lys Thr Val Asn Glu Ser Ala 1040 1045 1050
Ser Leu Arg Glu Arg Ile Glu Phe Leu Asn Glu Ala Ser Val Met 1055 1060 1065
Lys Gly Phe Thr Cys His His Val Val Arg Leu Leu Gly Val Val 1070 1075 1080
Ser Lys Gly Gin Pro Thr Leu Val Val Met Glu Leu Met Ala His 1085 1090 1095
Gly Asp Leu Lys Ser His Leu Arg Ser Leu Arg Pro Asp Ala Glu 1100 1105 1110
Asn Asn Pro Gly Arg Pro Pro Pro Thr Leu Gin Glu Met Ile Gin 1115 1120 1125
Met Thr Ala Glu Ile Ala Asp Gly Met Ala Tyr Leu Asn Ala Lys 1130 1135 1140
Lys Phe Val His Arg Asp Leu Ala Ala Arg Asn Cys Met Val Ala 1145 1150 1155
His Asp Phe Thr Val Lys Ile Gly Asp Phe Gly Met Thr Arg Asp 1160 1165 1170
Ile Tyr Glu Thr Asp Tyr Tyr Arg Lys Gly Gly Lys Gly Leu Leu 1175 1180 1185
Pro Val Arg Trp Met Ser Pro Glu Ser Leu Lys Asp Gly Val Phe 1190 1195 1200
Thr Ala Ser Ser Asp Met Trp Ser Phe Gly Val Val Leu Trp Glu 1205 1210 1215
Ile Thr Ser Leu Ala Glu Gin Pro Tyr Gin Gly Leu Ser Asn Glu 1220 1225 1230
Gin Val Leu Lys Phe Val Met Asp Gly Gly Tyr Leu Asp Pro Pro 1235 1240 1245
Asp Asn Cys Pro Glu Arg Leu Thr Asp Leu Met Arg Met Cys Trp 1250 1255 1260
Gln Phe Asn Pro Lys Met Arg Pro Thr Phe Leu Glu Ile Val Asn 1265 1270 1275
Leu Leu Lys Asp Asp Leu His Pro Ser Phe Pro Glu Val Ser Phe 1280 1285 1290
Phe Tyr Ser Glu Glu Asn Lys Ala Pro Glu Ser Glu Glu Leu Glu 1295 1300 1305
Met Glu Phe Glu Asp Met Glu Asn Val Pro Leu Asp Arg Ser Ser 1310 1315 1320
His Cys Gin Arg Glu Glu Ala Gly Gly Arg Glu Gly Gly Ser Ser 1325 1330 1335
Leu Ser Ile Lys Arg Thr Tyr Asp Glu His Ile Pro Tyr Thr His 1340 1345 1350
Met Asn Gly Gly Lys Lys Asn Gly Arg Val Leu Thr Leu Pro Arg 1355 1360 1365
Ser Asn Pro Ser 1370 <210> 4 <211 > 443
<212> PRT <213> Macaca mulatta <400>4
Met Met Ser Phe Glu Leu Asp Asn Leu Ala Ala Glu Leu Glu Ala Asn 1 5 10 15
Leu Gly Leu Ile Glu Glu Ile Ser Gly Tyr Leu Lys Ile Arg Arg Ser 20 25 30
Tyr Ala Leu Val Ser Leu Ser Phe Phe Arg Lys Leu Arg Leu Ile Arg 35 40 45
Gly Glu Thr Leu Glu lie Gly Asn Tyr Ser Phe Tyr Ala Leu Asp Asn 50 55 60
Gin Asn Leu Arg Gin Leu Trp Asp Trp Ser Lys His Asn Leu Thr lie 65 70 75 80
Thr Gin Gly Lys Leu Phe Phe His Tyr Asn Pro Lys Leu Cys Leu Ser 85 90 95
Glu lie His Lys Met Glu Glu Val Ser Gly Thr Lys Gly Arg Gin Glu 100 105 110
Arg Asn Asp lie Ala Leu Lys Thr Asn Gly Asp Gin Ala Ser Cys Glu 115 120 125
Asn Glu Leu Leu Lys Phe Ser Tyr lie Arg Thr Ser Phe Asp Lys lie 130 135 140
Leu Leu Arg Trp Glu Pro Tyr Trp Pro Pro Asp Phe Arg Asp Leu Leu 145 150 155 160
Gly Phe Met Leu Phe Tyr Lys Glu Ala Pro Tyr Gin Asn Val Thr Glu 165 170 175
Phe Asp Gly Gln Asp Ala Cys Gly Ser Asn Ser Trp Thr Val Val Asp 180 185 190
Ile Asp Pro Pro Leu Arg Ser Asn Asp Pro Lys Ser Gln Asn His Pro 195 200 205
Gly Trp Leu Met Arg Gly Leu Lys Pro Trp Thr Gin Tyr Ala lie Phe 210 215 220
Val Lys Thr Leu Val Thr Phe Ser Asp Glu Arg Arg Thr Tyr Gly Ala 225 230 235 240
Lys Ser Asp lie lie Tyr Val Gin Thr Asp Ala Thr Asn Pro Ser Val 245 250 255
Pro Leu Asp Pro lie Ser Val Ser Asn Ser Ser Ser Gin lie lie Leu 260 265 270
Lys Trp Lys Pro Pro Ser Asp Pro Asn Gly Asn lie Thr His Tyr Leu 275 280 285
Val Phe Trp Glu Arg Gin Ala Glu Asp Ser Glu Leu Phe Glu Leu Asp 290 295 300
Tyr Cys Leu Lys Gly Leu Lys Leu Pro Ser Arg Thr Trp Ser Pro Pro 305 310 315 320
Phe Glu Ser Glu Asp Ser Gin Lys His Asn Gin Ser Glu Tyr Glu Asp 325 330 335
Ser Ala Gly Glu Cys Cys Ser Cys Pro Lys Thr Asp Ser Gin lie Leu 340 345 350
Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr Phe Glu Asp Tyr Leu 355 360 365
His Asn Val Val Phe Val Pro Arg Lys Thr Ser Ser Gly Thr Gly Ala 370 375 380
Glu Asp Pro Arg Tyr Asp Ser Pro Val Arg Pro Leu Val Pro Ala Pro 385 390 395 400
Cys Arg Ala Gly Gly Val Pro Gly Arg Arg Leu Gly Glu Arg Arg Gly 405 410 415
Phe Cys Gly Phe Leu His Ala Ala Gly Cys Cys Ala Gly Asp Glu Met 420 425 430
Leu His Gin Phe Arg Asn Pro Met Pro Ser Leu 435 440 <210>5 <211 > 1279 <212> PRT <213> Bos taurus <400>5
Met Asp Ile Arg Asn Asn Leu Thr Arg Leu His Glu Leu Ala Asn Cys 1 5 10 15
Ser Val Ile Glu Gly His Leu Gin Ile Leu Leu Met Phe Lys Thr Arg 20 25 30
Pro Glu Asp Phe Arg Asp Leu Ser Phe Pro Lys Leu Ile Met Ile Thr 35 40 45
Asp Tyr Leu Leu Leu Phe Arg Val Tyr Gly Leu Glu Ser Leu Lys Asp 50 55 60
Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Ser Arg Leu Phe Phe Asn 65 70 75 80
Tyr Ala Leu Val Ile Phe Glu Met Val His Leu Lys Glu Leu Gly Leu 85 90 95
Tyr Asn Leu Met Asn Ile Thr Arg Gly Ser Val Arg Ile Glu Lys Asn 100 105 110
Asn Glu Leu Cys Tyr Leu Ala Thr Ile Asp Trp Ser Arg Ile Leu Asp 115 120 125
Ser Val Glu Asp Asn Tyr Ile Val Leu Asn Lys Asp Asp Asn Glu Glu 130 135 140
Cys Gly Asp Ile Cys Pro Gly Thr Ala Lys Gly Lys Thr Asn Cys Pro 145 150 155 160
Ala Thr Val Ile Asn Gly Gin Phe Val Glu Arg Cys Trp Thr His Ser 165 170 175
His Cys Gin Lys Gly Pro Pro Ser Ala Ile Pro Gly Ala Ala Cys His 180 185 190
Ala Val Thr Arg Ser Pro Pro Gly His Thr Pro Ser Ser Val Arg Gly 195 200 205
Pro Ser His Thr Ala Ala Ala Arg Gly Gly Pro His Thr Arg Phe Leu 210 215 220
Leu Phe Phe Asn Phe Phe Gin Thr Pro Ile Leu Cys Gly Pro Ala Leu 225 230 235 240
Gin Gly Leu Asn Pro Arg Lys Gly Pro Pro Pro Gly Ala Pro Gly Ala 245 250 255
Asp Arg Pro Ala Ala Val Thr Ala Arg Ala Pro Val Gly Arg Ala Glu 260 265 270
Pro Arg Ala Pro Glu Gly Arg Gly Gin Ser Pro Ser Ser Thr Pro Ala 275 280 285
His Trp Leu Ser Ala Arg Ala Ala Leu Arg Leu Pro Pro Pro Pro Gly 290 295 300
Pro Asp Ser Thr Glu Arg Ser Ala Pro Arg Ala Leu Cys Phe Ser Ala 305 310 315 320
Ala Ala Gly Leu Arg Gly Ala Gly Leu Leu Pro Pro Asn Tyr Ser Phe 325 330 335
Tyr Ala Leu Asp Asn Gin Asn Leu Arg Gin Leu Trp Asp Trp Ser Lys 340 345 350
His Asn Leu Thr Ile Thr Gln Gly Lys Leu Phe Phe His Tyr Asn Pro 355 360 365
Lys Leu Cys Leu Ser Glu Ile His Lys Met Glu Glu Val Ser Gly Thr 370 375 380
Lys Gly Arg Gin Glu Arg Asn Asp Ile Ala Leu Lys Thr Asn Gly Asp 385 390 395 400
Gin Ala Ser Cys Glu Asn Glu Leu Leu Lys Phe Ser Tyr Ile Arg Thr 405 410 415
Ser Tyr Asp Lys Ile Leu Leu Lys Trp Glu Pro Tyr Trp Pro Pro Asp 420 425 430
Phe Arg Asp Leu Leu Gly Phe Met Leu Phe Tyr Lys Glu Ala Pro Tyr 435 440 445
Gin Asn Val Thr Glu Phe Asp Gly Gin Asp Ala Cys Gly Ser Asn Ser 450 455 460
Trp Thr Val Val Asp Ile Asp Pro Pro Thr Arg Ser Asn Asp Pro Lys 465 470 475 480
Ser Gin Asn His Pro Gly Trp Leu Met Arg Gly Leu Lys Pro Trp Thr 485 490 495
Gin Tyr Ala Ile Phe Val Lys Thr Leu Val Thr Phe Ser Asp Glu Arg 500 505 510
Arg Thr Tyr Gly Ala Lys Ser Asp Ile Ile Tyr Val Gin Thr Asp Ala 515 520 525
Thr Asn Pro Ser Val Pro Leu Asp Pro Ile Ser Val Ser Asn Ser Ser 530 535 540
Ser Gin Ile Ile Leu Lys Trp Lys Pro Pro Ser Asp Pro Asn Gly Asn 545 550 555 560
Ile Thr His Tyr Leu Val Phe Trp Glu Arg Gin Ala Glu Asp Ser Glu 565 570 575
Leu Tyr Glu Leu Asp Tyr Cys Leu Lys Gly Leu Lys Leu Pro Ser Arg 580 585 590
Thr Trp Ser Pro Pro Phe Glu Ser Glu Gly Ser Gin Lys His Asn Gin 595 600 605
Ser Glu Tyr Glu Glu Ser Ala Gly Glu Cys Cys Ser Cys Pro Lys Thr 610 615 620
Asp Ser Gin Ile Leu Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr 625 630 635 640
Phe Glu Asp Tyr Leu His Asn Val Val Phe Ile Pro Arg Pro Ser Arg 645 650 655
Lys Arg Arg Ala Leu Gly Asp Val Gly Asn Val Thr Ala Ala Val Pro 660 665 670
Thr Ala Leu Gly Leu Pro Asn Thr Ser Ser Thr Ser Thr Pro Met Ser 675 680 685
Ser Glu Glu His Arg Pro Phe Glu Lys Val Val Asn Lys Glu Ser Leu 690 695 700
Val Ile Ser Gly Leu Arg His Phe Thr Gly Tyr Arg Ile Glu Leu Gin 705 710 715 720
Ala Cys Asn Gin Asp Ser Pro Glu Glu Arg Cys Ser Val Ala Ala Tyr 725 730 735
Val Ser Ala Arg Thr Met Pro Glu Ala Lys Ala Asp Asp Ile Val Gly 740 745 750
Pro Val Thr His Glu Ile Phe Glu Asn Asn Val Val His Leu Met Trp 755 760 765
Gin Glu Pro Lys Glu Pro Asn Gly Leu Ile Val Leu Tyr Glu Val Ser 770 775 780
Tyr Arg Arg Tyr Gly Glu Glu Glu Leu His Leu Cys Val Ser Arg Arg 785 790 795 800
His Tyr Ala Leu Glu Arg Gly Cys Arg Leu Arg Gly Leu Leu Pro Gly 805 810 815
Asn Tyr Ser Val Arg Val Arg Ala Thr Ser Leu Ala Gly Asn Gly Ser 820 825 830
Trp Thr Glu Ala Thr Tyr Phe Tyr Val Thr Asp Tyr Leu Asp Val Pro 835 840 845
Ser Asn Ile Ala Lys Ile Ile Ile Gly Pro Leu Ile Phe Val Phe Leu 850 855 860
Phe Ser Val Val Ile Gly Ser Ile Gys Leu Phe Leu Arg Lys Arg Gin 865 870 875 880
Pro Asp Gly Pro Leu Gly Pro Leu Tyr Ala Ser Ser Asn Pro Glu Tyr 885 890 895
Leu Ser Ala Ser Asp Val Phe Pro Cys Ser Val Tyr Val Pro Asp Glu 900 905 910
Trp Glu Val Pro Arg Glu Lys Ile Thr Leu Leu Arg Glu Leu Gly Gin 915 920 925
Gly Ser Phe Gly Met Val Tyr Glu Gly Asn Ala Arg Asp Ile Val Lys 930 935 940
Gly Glu Ala Glu Thr Arg Val Ala Val Lys Thr Val Asn Glu Ser Ala 945 950 955 960
Ser Leu Arg Glu Arg Ile Glu Phe Leu Asn Glu Ala Ser Val Met Lys 965 970 975
Gly Phe Thr Cys His His Val Val Arg Leu Leu Gly Val Val Ser Lys 980 985 990
Gly Gin Pro Thr Leu Val Val Met Glu Leu Met Ala His Gly Asp Leu 995 1000 1005
Lys Ser Tyr Leu Arg Ser Leu Arg Pro Glu Ala Glu Asn Asn Pro 1010 1015 1020
Gly Arg Pro Pro Pro Thr Leu Gin Glu Met Ile Gin Met Ala Ala 1025 1030 1035
Glu Ile Ala Asp Gly Met Ala Tyr Leu Asn Ala Lys Lys Phe Val 1040 1045 1050
His Arg Asp Leu Ala Ala Arg Asn Cys Met Val Ala His Asp Phe 1055 1060 1065
Thr Val Lys Ile Gly Asp Phe Gly Met Thr Arg Asp Ile Tyr Glu 1070 1075 1080
Thr Asp Tyr Tyr Arg Lys Gly Gly Lys Gly Leu Leu Pro Val Arg 1085 1090 1095
Trp Met Ala Pro Glu Ser Leu Lys Asp Gly Val Phe Thr Thr Ser 1100 1105 1110
Ser Asp Met Trp Ser Phe Gly Val Val Leu Trp Glu Ile Thr Ser 1115 1120 1125
Leu Ala Glu Gin Pro Tyr Gin Gly Leu Ser Asn Glu Gin Val Leu 1130 1135 1140
Lys Phe Val Met Asp Gly Gly Tyr Leu Asp Gin Pro Asp Asn Cys 1145 1150 1155
Pro Glu Arg Val Thr Asp Leu Met Arg Met Cys Trp Gin Phe Asn 1160 1165 1170
Pro Lys Met Arg Pro Thr Phe Leu Glu Ile Val Asp Leu Leu Lys 1175 1180 1185
Asp Asp Leu His Pro Ser Phe Pro Glu Val Ser Phe Phe His Ser 1190 1195 1200
Glu Glu Asn Lys Ala Pro Glu Ser Glu Glu Leu Glu Met Glu Phe 1205 1210 1215
Glu Asp Met Glu Ser Val Pro Leu Asp Arg Ala Ser His Ala Gin 1220 1225 1230
Arg Glu Glu Ala Gly Gly Arg Asp Gly Gly Ser Ala Leu Gly Leu 1235 1240 1245
Lys Arg Asn Tyr Asp Glu His Ile Pro Tyr Thr His Met Asn Gly 1250 1255 1260
Gly Lys Lys Asn Gly Arg Ile Leu Thr Leu Pro Arg Ser Asn Pro 1265 1270 1275
Ser <210> 6 <211> 906
<212> PRT <213> Homo sapiens <400>6
Glu Ile Cys Gly Pro Gly Ile Asp Ile Arg Asn Asp Tyr Gln Gln Leu 1 5 10 15
Lys Arg Leu Glu Asn Cys Thr Val Ile Glu Gly Tyr Leu His Ile Leu 20 25 30
Leu Ile Ser Lys Ala Glu Asp Tyr Arg Ser Tyr Arg Phe Pro Lys Leu 35 40 45
Thr Val Ile Thr Glu Tyr Leu Leu Leu Phe Arg Val Ala Gly Leu Glu 50 55 60
Ser Leu Gly Asp Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Trp Lys 65 70 75 80
Leu Phe Tyr Asn Tyr Ala Leu Val Ile Phe Glu Met Thr Asn Leu Lys 85 90 95
Asp Ile Gly Leu Tyr Asn Leu Arg Asn Ile Thr Arg Gly Ala Ile Arg 100 105 110
Ile Glu Lys Asn Ala Asp Leu Cys Tyr Leu Ser Thr Val Asp Trp Ser 115 120 125
Leu Ile Leu Asp Ala Val Ser Asn Asn Tyr Ile Val Gly Asn Lys Pro 130 135 140
Pro Lys Glu Cys Gly Asp Leu Cys Pro Gly Thr Met Glu Glu Lys Pro 145 150 155 160
Met Cys Glu Lys Thr Thr Ile Asn Asn Glu Tyr Asn Tyr Arg Cys Trp 165 170 175
Thr Thr Asn Arg Cys Gin Lys Met Cys Pro Ser Thr Cys Gly Lys Arg 180 185 190
Ala Cys Thr Glu Asn Asn Glu Cys Cys His Pro Glu Cys Leu Gly Ser 195 200 205
Cys Ser Ala Pro Asp Asn Asp Thr Ala Cys Val Ala Cys Arg His Tyr 210 215 220
Tyr Tyr Ala Gly Val Cys Val Pro Ala Cys Pro Pro Asn Thr Tyr Arg 225 230 235 240
Phe Glu Gly Trp Arg Cys Val Asp Arg Asp Phe Cys Ala Asn Ile Leu 245 250 255
Ser Ala Glu Ser Ser Asp Ser Glu Gly Phe Val Ile His Asp Gly Glu 260 265 270
Cys Met Gln Glu Cys Pro Ser Gly Phe Ile Arg Asn Gly Ser Gln Ser 275 280 285
Met Tyr Cys Ile Pro Cys Glu Gly Pro Cys Pro Lys Val Cys Glu Glu 290 295 300
Glu Lys Lys Thr Lys Thr Ile Asp Ser Val Thr Ser Ala Gln Met Leu 305 310 315 320
Gln Gly Cys Thr Ile Phe Lys Gly Asn Leu Leu Ile Asn Ile Arg Arg 325 330 335
Gly Asn Asn Ile Ala Ser Glu Leu Glu Asn Phe Met Gly Leu Ile Glu 340 345 350
Val Val Thr Gly Tyr Val Lys Ile Arg His Ser His Ala Leu Val Ser 355 360 365
Leu Ser Phe Leu Lys Asn Leu Arg Leu Ile Leu Gly Glu Glu Gln Leu 370 375 380
Glu Gly Asn Tyr Ser Phe Tyr Val Leu Asp Asn Gln Asn Leu Gln Gln 385 390 395 400
Leu Trp Asp Trp Asp His Arg Asn Leu Thr Ile Lys Ala Gly Lys Met 405 410 415
Tyr Phe Ala Phe Asn Pro Lys Leu Cys Val Ser Glu Ile Tyr Arg Met 420 425 430
Glu Glu Val Thr Gly Thr Lys Gly Arg Gln Ser Lys Gly Asp Ile Asn 435 440 445
Thr Arg Asn Asn Gly Glu Arg Ala Ser Cys Glu Ser Asp Val Leu His 450 455 460
Phe Thr Ser Thr Thr Thr Ser Lys Asn Arg Ile Ile Ile Thr Trp His 465 470 475 480
Arg Tyr Arg Pro Pro Asp Tyr Arg Asp Leu Ile Ser Phe Thr Val Tyr 485 490 495
Tyr Lys Glu Ala Pro Phe Lys Asn Val Thr Glu Tyr Asp Gly Gln Asp 500 505 510
Ala Cys Gly Ser Asn Ser Trp Asn Met Val Asp Val Asp Leu Pro Pro 515 520 525
Asn Lys Asp Val Glu Pro Gly Ile Leu Leu His Gly Leu Lys Pro Trp 530 535 540
Thr Gin Tyr Ala Val Tyr Val Lys Ala Val Thr Leu Thr Met Val Glu 545 550 555 560
Asn Asp His Ile Arg Gly Ala Lys Ser Glu Ile Leu Tyr Ile Arg Thr 565 570 575
Asn Ala Ser Val Pro Ser Ile Pro Leu Asp Val Leu Ser Ala Ser Asn 580 585 590
Ser Ser Ser Gin Leu Ile Val Lys Trp Asn Pro Pro Ser Leu Pro Asn 595 600 605
Gly Asn Leu Ser Tyr Tyr Ile Val Arg Trp Gin Arg Gin Pro Gin Asp 610 615 620
Gly Tyr Leu Tyr Arg His Asn Tyr Cys Ser Lys Asp Lys Ile Pro Ile 625 630 635 640
Arg Lys Tyr Ala Asp Gly Thr Ile Asp Ile Glu Glu Val Thr Glu Asn 645 650 655
Pro Lys Thr Glu Val Cys Gly Gly Glu Lys Gly Pro Cys Cys Ala Cys 660 665 670
Pro Lys Thr Glu Ala Glu Lys Gin Ala Glu Lys Glu Glu Ala Glu Tyr 675 680 685
Arg Lys Val Phe Glu Asn Phe Leu His Asn Ser Ile Phe Val Pro Arg 690 695 700
Pro Glu Arg Lys Arg Arg Asp Val Met Gin Val Ala Asn Thr Thr Met 705 710 715 720
Ser Ser Arg Ser Arg Asn Thr Thr Ala Ala Asp Thr Tyr Asn Ile Thr 725 730 735
Asp Pro Glu Glu Leu Glu Thr Glu Tyr Pro Phe Phe Glu Ser Arg Val 740 745 750
Asp Asn Lys Glu Arg Thr Val Ile Ser Asn Leu Arg Pro Phe Thr Leu 755 760 765
Tyr Arg Ile Asp Ile His Ser Cys Asn His Glu Ala Glu Lys Leu Gly 770 775 780
Cys Ser Ala Ser Asn Phe Val Phe Ala Arg Thr Met Pro Ala Glu Gly 785 790 795 800
Ala Asp Asp Ile Pro Gly Pro Val Thr Trp Glu Pro Arg Pro Glu Asn 805 810 815
Ser Ile Phe Leu Lys Trp Pro Glu Pro Glu Asn Pro Asn Gly Leu Ile 820 825 830
Leu Met Tyr Glu Ile Lys Tyr Gly Ser Gin Val Glu Asp Gin Arg Glu 835 840 845
Cys Val Ser Arg Gin Glu Tyr Arg Lys Tyr Gly Gly Ala Lys Leu Asn 850 855 860
Arg Leu Asn Pro Gly Asn Tyr Thr Ala Arg Ile Gln Ala Thr Ser Leu 865 870 875 880
Ser Gly Asn Gly Ser Trp Thr Asp Pro Val Phe Phe Tyr Val Gln Ala 885 890 895
Lys Thr Gly Tyr Glu Asn Phe Ile His Leu 900 905 <210>7 <211> 1369
<212> PRT <213> Mus musculus <400>7
Met Lys Ser Gly Ser Gly Gly Gly Ser Pro Thr Ser Leu Trp Gly Leu 1 5 10 15
Val Phe Leu Ser Ala Ala Leu Ser Leu Trp Pro Thr Ser Gly Glu Ile 20 25 30
Cys Gly Pro Gly Ile Asp Ile Arg Asn Asp Tyr Gin Gin Leu Lys Arg 35 40 45
Leu Glu Asn Cys Thr Val Ile Glu Gly Phe Leu His Ile Leu Leu Ile 50 55 60
Ser Lys Ala Glu Asp Tyr Arg Ser Tyr Arg Phe Pro Lys Leu Thr Val 65 70 75 80
Ile Thr Glu Tyr Leu Leu Leu Phe Arg Val Ala Gly Leu Glu Ser Leu 85 90 95
Gly Asp Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Trp Lys Leu Phe 100 105 110
Tyr Asn Tyr Ala Leu Val Ile Phe Glu Met Thr Asn Leu Lys Asp Ile 115 120 125
Gly Leu Tyr Asn Leu Arg Asn Ile Thr Arg Gly Ala Ile Arg Ile Glu 130 135 140
Lys Asn Ala Asp Leu Cys Tyr Leu Ser Thr Ile Asp Trp Ser Leu Ile 145 150 155 160
Leu Asp Ala Val Ser Asn Asn Tyr Ile Val Gly Asn Lys Pro Pro Lys 165 170 175
Glu Cys Gly Asp Leu Cys Pro Gly Thr Leu Glu Glu Lys Pro Met Cys 180 185 190
Glu Lys Thr Thr Ile Asn Asn Glu Tyr Asn Tyr Arg Cys Trp Thr Thr 195 200 205
Asn Arg Cys Gin Lys Met Cys Pro Ser Val Cys Gly Lys Arg Ala Cys 210 215 220
Thr Glu Asn Asn Glu Cys Cys His Pro Glu Cys Leu Gly Ser Cys His 225 230 235 240
Thr Pro Asp Asp Asn Thr Thr Cys Val Ala Cys Arg His Tyr Tyr Tyr 245 250 255
Lys Gly Val Cys Val Pro Ala Cys Pro Pro Gly Thr Tyr Arg Phe Glu 260 265 270
Gly Trp Arg Cys Val Asp Arg Asp Phe Cys Ala Asn Ile Pro Asn Ala 275 280 285
Glu Ser Ser Asp Ser Asp Gly Phe Val Ile His Asp Asp Glu Cys Met 290 295 300
Gin Glu Cys Pro Ser Gly Phe Ile Arg Asn Ser Thr Gin Ser Met Tyr 305 310 315 320
Cys Ile Pro Cys Glu Gly Pro Cys Pro Lys Val Cys Gly Asp Glu Glu 325 330 335
Lys Lys Thr Lys Thr Ile Asp Ser Val Thr Ser Ala Gin Met Leu Gin 340 345 350
Gly Cys Thr Ile Leu Lys Gly Asn Leu Leu Ile Asn Ile Arg Arg Gly 355 360 365
Asn Asn Ile Ala Ser Glu Leu Glu Asn Phe Met Gly Leu Ile Glu Val 370 375 380
Val Thr Gly Tyr Val Lys Ile Arg His Ser His Ala Leu Val Ser Leu 385 390 395 400
Ser Phe Leu Lys Asn Leu Arg Leu Ile Leu Gly Glu Glu Gin Leu Glu 405 410 415
Gly Asn Tyr Ser Phe Tyr Val Leu Asp Asn Gin Asn Leu Gin Gin Leu 420 425 430
Trp Asp Trp Asn His Arg Asn Leu Thr Val Arg Ser Gly Lys Met Tyr 435 440 445
Phe Ala Phe Asn Pro Lys Leu Cys Val Ser Glu Ile Tyr Arg Met Glu 450 455 460
Glu Val Thr Gly Thr Lys Gly Arg Gin Ser Lys Gly Asp Ile Asn Thr 465 470 475 480
Arg Asn Asn Gly Glu Arg Ala Ser Cys Glu Ser Asp Val Leu Arg Phe 485 490 495
Thr Ser Thr Thr Thr Trp Lys Asn Arg Ile Ile Ile Thr Trp His Arg 500 505 510
Tyr Arg Pro Pro Asp Tyr Arg Asp Leu Ile Ser Phe Thr Val Tyr Tyr 515 520 525
Lys Glu Ala Pro Phe Lys Asn Val Thr Glu Tyr Asp Gly Gin Asp Ala 530 535 540
Cys Gly Ser Asn Ser Trp Asn Met Val Asp Val Asp Leu Pro Pro Asn 545 550 555 560
Lys Glu Gly Glu Pro Gly Ile Leu Leu His Gly Leu Lys Pro Trp Thr 565 570 575
Gin Tyr Ala Val Tyr Val Lys Ala Val Thr Leu Thr Met Val Glu Asn 580 585 590
Asp His Ile Arg Gly Ala Lys Ser Glu Ile Leu Tyr Ile Arg Thr Asn 595 600 605
Ala Ser Val Pro Ser Ile Pro Leu Asp Val Leu Ser Ala Ser Asn Ser 610 615 620
Ser Ser Gin Leu Ile Val Lys Trp Asn Pro Pro Thr Leu Pro Asn Gly 625 630 635 640
Asn Leu Ser Tyr Tyr Ile Val Arg Trp Gin Arg Gin Pro Gin Asp Gly 645 650 655
Tyr Leu Tyr Arg His Asn Tyr Cys Ser Lys Asp Lys Ile Pro Ile Arg 660 665 670
Lys Tyr Ala Asp Gly Thr Ile Asp Val Glu Glu Val Thr Glu Asn Pro 675 680 685
Lys Thr Glu Val Cys Gly Gly Asp Lys Gly Pro Cys Cys Ala Cys Pro 690 695 700
Lys Thr Glu Ala Glu Lys Gin Ala Glu Lys Glu Glu Ala Glu Tyr Arg 705 710 715 720
Lys Val Phe Glu Asn Phe Leu His Asn Ser Ile Phe Val Pro Arg Pro 725 730 735
Glu Arg Arg Arg Arg Asp Val Met Gin Val Ala Asn Thr Thr Met Ser 740 745 750
Ser Arg Ser Arg Asn Thr Thr Val Ala Asp Thr Tyr Asn Ile Thr Asp 755 760 765
Pro Glu Glu Phe Glu Thr Glu Tyr Pro Phe Phe Glu Ser Arg Val Asp 770 775 780
Asn Lys Glu Arg Thr Val Ile Ser Asn Leu Arg Pro Phe Thr Leu Tyr 785 790 795 800
Arg Ile Asp Ile His Ser Cys Asn His Glu Ala Glu Lys Leu Gly Cys 805 810 815
Ser Ala Ser Asn Phe Val Phe Ala Arg Thr Met Pro Ala Glu Gly Ala 820 825 830
Asp Asp Ile Pro Gly Pro Val Thr Trp Glu Pro Arg Pro Glu Asn Ser 835 840 845
Ile Phe Leu Lys Trp Pro Glu Pro Glu Asn Pro Asn Gly Leu Ile Leu 850 855 860
Met Tyr Glu Ile Lys Tyr Gly Ser Gin Val Glu Asp Gin Arg Glu Cys 865 870 875 880
Val Ser Arg Gin Glu Tyr Arg Lys Tyr Gly Gly Ala Lys Leu Asn Arg 885 890 895
Leu Asn Pro Gly Asn Tyr Thr Ala Arg Ile Gin Ala Thr Ser Leu Ser 900 905 910
Gly Asn Gly Ser Trp Thr Asp Pro Val Phe Phe Tyr Val Pro Ala Lys 915 920 925
Thr Thr Tyr Glu Asn Phe Met His Leu Ile Ile Ala Leu Pro Val Ala 930 935 940
Ile Leu Leu Ile Val Gly Gly Leu Val Ile Met Leu Tyr Val Phe His 945 950 955 960
Arg Lys Arg Asn Asn Ser Arg Leu Gly Asn Gly Val Leu Tyr Ala Ser 965 970 975
Val Asn Pro Glu Tyr Phe Ser Ala Ala Asp Val Tyr Val Pro Asp Glu 980 985 990
Trp Glu Val Ala Arg Glu Lys Ile Thr Met Asn Arg Glu Leu Gly Gin 995 1000 1005
Gly Ser Phe Gly Met Val Tyr Glu Gly Val Ala Lys Gly Val Val 1010 1015 1020
Lys Asp Glu Pro Glu Thr Arg Val Ala Ile Lys Thr Val Asn Glu 1025 1030 1035
Ala Ala Ser Met Arg Glu Arg Ile Glu Phe Leu Asn Glu Ala Ser 1040 1045 1050
Val Met Lys Glu Phe Asn Cys His His Val Val Arg Leu Leu Gly 1055 1060 1065
Val Val Ser Gln Gly Gln Pro Thr Leu Val Ile Met Glu Leu Met 1070 1075 1080
Thr Arg Gly Asp Leu Lys Ser Tyr Leu Arg Ser Leu Arg Pro Glu 1085 1090 1095
Val Glu Gln Asn Asn Leu Val Leu Ile Pro Pro Ser Leu Ser Lys 1100 1105 1110
Met Ile Gln Met Ala Gly Glu Ile Ala Asp Gly Met Ala Tyr Leu 1115 1120 1125
Asn Ala Asn Lys Phe Val His Arg Asp Leu Ala Ala Arg Asn Cys 1130 1135 1140
Met Val Ala Glu Asp Phe Thr Val Lys Ile Gly Asp Phe Gly Met 1145 1150 1155
Thr Arg Asp Ile Tyr Glu Thr Asp Tyr Tyr Arg Lys Gly Gly Lys 1160 1165 1170
Gly Leu Leu Pro Val Arg Trp Met Ser Pro Glu Ser Leu Lys Asp 1175 1180 1185
Gly Val Phe Thr Thr His Ser Asp Val Trp Ser Phe Gly Val Val 1190 1195 1200
Leu Trp Glu Ile Ala Thr Leu Ala Glu Gln Pro Tyr Gln Gly Leu 1205 1210 1215
Ser Asn Glu Gln Val Leu Arg Phe Val Met Glu Gly Gly Leu Leu 1220 1225 1230
Asp Lys Pro Asp Asn Cys Pro Asp Met Leu Phe Glu Leu Met Arg 1235 1240 1245
Met Cys Trp Gln Tyr Asn Pro Lys Met Arg Pro Ser Phe Leu Glu 1250 1255 1260
Ile Ile Gly Ser Ile Lys Asp Glu Met Glu Pro Ser Phe Gln Glu 1265 1270 1275
Val Ser Phe Tyr Tyr Ser Glu Glu Asn Lys Pro Pro Glu Pro Glu 1280 1285 1290
Glu Leu Glu Met Glu Pro Glu Asn Met Glu Ser Val Pro Leu Asp 1295 1300 1305
Pro Ser Ala Ser Ser Ala Ser Leu Pro Leu Pro Glu Arg His Ser 1310 1315 1320
Gly His Lys Ala Glu Asn Gly Pro Gly Pro Gly Val Leu Val Leu 1325 1330 1335
Arg Ala Ser Phe Asp Glu Arg Gln Pro Tyr Ala His Met Asn Gly 1340 1345 1350
Gly Arg Ala Asn Glu Arg Ala Leu Pro Leu Pro Gln Ser Ser Thr 1355 1360 1365
Cys
<210> 8 <211> 1367 <212> PRT <213> Macaca mulatta <400> 8
Met Lys Ser Gly Ser Gly Glu Gly Ser Pro Thr Ser Leu Trp Gly Leu 1 5 10 15
Leu Phe Leu Ser Ala Ala Leu Ser Leu Trp Pro Thr Ser Gly Glu Ile 20 25 30
Cys Gly Pro Gly Ile Asp Ile Arg Asn Asp Tyr Gln Gln Leu Lys Arg 35 40 45
Leu Glu Asn Cys Thr Val Ile Glu Gly Tyr Leu His Ile Leu Leu Ile 50 55 60
Ser Lys Ala Glu Asp Tyr Arg Ser Tyr Arg Phe Pro Lys Leu Thr Val 65 70 75 80
Ile Thr Glu Tyr Leu Leu Leu Phe Arg Val Ala Gly Leu Glu Ser Leu 85 90 95
Gly Asp Leu Phe Pro Asn Leu Thr Val Ile Arg Gly Trp Lys Leu Phe 100 105 110
Tyr Asn Tyr Ala Leu Val Ile Phe Glu Met Thr Asn Leu Lys Asp Ile 115 120 125
Gly Leu Tyr Asn Leu Arg Asn Ile Thr Arg Gly Ala Ile Arg Ile Glu 130 135 140
Lys Asn Ala Asp Leu Cys Tyr Leu Ser Thr Val Asp Trp Ser Leu Ile 145 150 155 160
Leu Asp Ala Val Ser Asn Asn Tyr Ile Val Gly Asn Lys Pro Pro Lys 165 170 175
Glu Cys Gly Asp Leu Cys Pro Gly Thr Met Glu Glu Lys Pro Met Cys 180 185 190
Glu Lys Thr Thr Ile Asn Asn Glu Tyr Asn Tyr Arg Cys Trp Thr Thr 195 200 205
Asn Arg Cys Gln Lys Met Cys Pro Ser Ala Cys Gly Lys Arg Ala Cys 210 215 220
Thr Glu Asn Asn Glu Cys Cys His Pro Glu Cys Leu Gly Ser Cys Ser 225 230 235 240
Ala Pro Asp Asn Asp Thr Ala Cys Val Ala Cys Arg His Tyr Tyr Tyr 245 250 255
Ala Gly Val Cys Val Pro Ala Cys Pro Pro Asn Thr Tyr Arg Phe Glu 260 265 270
Gly Trp Arg Cys Val Asp Arg Asp Phe Cys Ala Asn Ile Leu Ser Ala 275 280 285
Glu Ser Ser Asp Ser Glu Gly Phe Val Ile His Asp Gly Glu Cys Met 290 295 300
Gln Glu Cys Pro Ser Gly Phe Ile Arg Asn Gly Ser Gln Ser Met Tyr 305 310 315 320
Cys Ile Pro Cys Glu Gly Pro Cys Pro Lys Val Cys Glu Glu Glu Lys 325 330 335
Lys Thr Lys Thr Ile Asp Ser Val Thr Ser Ala Gln Met Leu Gln Gly 340 345 350
Cys Thr Ile Phe Lys Gly Asn Leu Leu Ile Asn Ile Arg Arg Gly Asn 355 360 365
Asn Ile Ala Ser Glu Leu Glu Asn Phe Met Gly Leu Ile Glu Val Val 370 375 380
Thr Gly Tyr Val Lys Ile Arg His Ser His Ala Leu Val Ser Leu Ser 385 390 395 400
Phe Leu Lys Asn Leu Arg Leu Ile Leu Gly Glu Glu Gln Leu Glu Gly 405 410 415
Asn Tyr Ser Phe Tyr Val Leu Asp Asn Gln Asn Leu Gln Gln Leu Trp 420 425 430
Asp Trp Asp His Arg Asn Leu Thr Ile Lys Ala Gly Lys Met Tyr Phe 435 440 445
Ala Phe Asn Pro Lys Leu Cys Val Ser Glu Ile Tyr Arg Met Glu Glu 450 455 460
Val Thr Gly Thr Lys Gly Arg Gln Ser Lys Gly Asp Ile Asn Thr Arg 465 470 475 480
Asn Asn Gly Glu Arg Ala Ser Cys Glu Ser Asp Val Leu His Phe Thr 485 490 495
Ser Thr Thr Thr Trp Lys Asn Arg Ile Ile Ile Thr Trp His Arg Tyr 500 505 510
Arg Pro Pro Asp Tyr Arg Asp Leu Ile Ser Phe Thr Val Tyr Tyr Lys 515 520 525
Glu Ala Pro Phe Lys Asn Val Thr Glu Tyr Asp Gly Gln Asp Ala Cys 530 535 540
Gly Ser Asn Ser Trp Asn Met Val Asp Val Asp Leu Pro Pro Asn Lys 545 550 555 560
Asp Val Glu Pro Gly Ile Leu Leu His Gly Leu Lys Pro Trp Thr Gln 565 570 575
Tyr Ala Val Tyr Val Lys Ala Val Thr Leu Thr Met Val Glu Asn Asp 580 585 590
His Ile Arg Gly Ala Lys Ser Glu Ile Leu Tyr Ile Arg Thr Asn Ala 595 600 605
Ser Val Pro Ser Ile Pro Leu Asp Val Leu Ser Ala Ser Asn Ser Ser 610 615 620
Ser Gln Leu Ile Val Lys Trp Asn Pro Pro Ser Leu Pro Asn Gly Asn 625 630 635 640
Leu Ser Tyr Tyr Ile Val Arg Trp Gln Arg Gln Pro Gln Asp Gly Tyr 645 650 655
Leu Tyr Arg His Asn Tyr Cys Ser Lys Asp Lys Ile Pro Ile Arg Lys 660 665 670
Tyr Ala Asp Gly Thr Ile Asp Ile Glu Glu Val Thr Glu Asn Pro Lys 675 680 685
Thr Glu Val Cys Gly Gly Glu Lys Gly Pro Cys Cys Ala Cys Pro Lys 690 695 700
Thr Glu Ala Glu Lys Gln Ala Glu Lys Glu Glu Ala Glu Tyr Arg Lys 705 710 715 720
Val Phe Glu Asn Phe Leu His Asn Ser Ile Phe Val Pro Arg Pro Glu 725 730 735
Arg Lys Arg Arg Asp Val Met Gln Val Ala Asn Thr Thr Met Ser Ser 740 745 750
Arg Ser Arg Asn Thr Thr Val Ala Asp Thr Tyr Asn Ile Thr Asp Leu 755 760 765
Glu Glu Leu Glu Thr Glu Tyr Pro Phe Phe Glu Ser Arg Val Asp Asn 770 775 780
Lys Glu Arg Thr Val Ile Ser Asn Leu Arg Pro Phe Thr Leu Tyr Arg 785 790 795 800
Ile Asp Ile His Ser Cys Asn His Glu Ala Glu Lys Leu Gly Cys Ser 805 810 815
Ala Ser Asn Phe Val Phe Ala Arg Thr Met Pro Ala Glu Gly Ala Asp 820 825 830
Asp Ile Pro Gly Pro Val Thr Trp Glu Pro Arg Pro Glu Asn Ser Ile 835 840 845
Phe Leu Lys Trp Pro Glu Pro Glu Asn Pro Asn Gly Leu Ile Leu Met 850 855 860
Tyr Glu Ile Lys Tyr Gly Ser Gln Val Glu Asp Gln Arg Glu Cys Val 865 870 875 880
Ser Arg Gln Glu Tyr Arg Lys Tyr Gly Gly Ala Lys Leu Asn Arg Leu 885 890 895
Asn Pro Gly Asn Tyr Thr Ala Arg Ile Gln Ala Thr Ser Leu Ser Gly 900 905 910
Asn Gly Ser Trp Thr Asp Pro Val Phe Phe Tyr Val Gln Ala Lys Thr 915 920 925
Gly Tyr Glu Asn Phe Ile His Leu Ile Ile Ala Leu Pro Val Ala Val 930 935 940
Leu Leu Ile Val Gly Gly Leu Val Ile Met Leu Tyr Val Phe His Arg 945 950 955 960
Lys Arg Asn Asn Ser Arg Leu Gly Asn Gly Val Leu Tyr Ala Ser Val 965 970 975
Asn Pro Glu Tyr Phe Ser Ala Ala Asp Val Tyr Val Pro Asp Glu Trp 980 985 990
Glu Val Ala Arg Glu Lys Ile Thr Met Ser Arg Glu Leu Gly Gln Gly 995 1000 1005
Ser Phe Gly Met Val Tyr Glu Gly Val Ala Lys Gly Val Val Lys 1010 1015 1020
Asp Glu Pro Glu Thr Arg Val Ala Ile Lys Thr Val Asn Glu Ala 1025 1030 1035
Ala Ser Met Arg Glu Arg Ile Glu Phe Leu Asn Glu Ala Ser Val 1040 1045 1050
Met Lys Glu Phe Asn Cys His His Val Val Arg Leu Leu Gly Val 1055 1060 1065
Val Ser Gln Gly Gln Pro Thr Leu Val Ile Met Glu Leu Met Thr 1070 1075 1080
Arg Gly Asp Leu Lys Ser Tyr Leu Arg Ser Leu Arg Pro Glu Met 1085 1090 1095
Glu Asn Asn Pro Val Leu Ala Pro Pro Ser Leu Ser Lys Met Ile 1100 1105 1110
Gln Met Ala Gly Glu Ile Ala Asp Gly Met Ala Tyr Leu Asn Ala 1115 1120 1125
Asn Lys Phe Val His Arg Asp Leu Ala Ala Arg Asn Cys Met Val 1130 1135 1140
Ala Glu Asp Phe Thr Val Lys Ile Gly Asp Phe Gly Met Thr Arg 1145 1150 1155
Asp Ile Tyr Glu Thr Asp Tyr Tyr Arg Lys Gly Gly Lys Gly Leu 1160 1165 1170
Leu Pro Val Arg Trp Met Ser Pro Glu Ser Leu Lys Asp Gly Val 1175 1180 1185
Phe Thr Thr Tyr Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp 1190 1195 1200
Glu Ile Ala Thr Leu Ala Glu Gln Pro Tyr Gln Gly Leu Ser Asn 1205 1210 1215
Glu Gln Val Leu Arg Phe Val Met Glu Gly Gly Leu Leu Asp Lys 1220 1225 1230
Pro Asp Asn Cys Pro Asp Met Leu Phe Glu Leu Met Arg Met Cys 1235 1240 1245
Trp Gln Tyr Asn Pro Lys Met Arg Pro Ser Phe Leu Glu Ile Ile 1250 1255 1260
Ser Ser Ile Lys Asp Glu Met Glu Pro Gly Phe Arg Glu Val Ser 1265 1270 1275
Phe Tyr Tyr Ser Glu Glu Asn Lys Leu Pro Glu Pro Glu Glu Leu 1280 1285 1290
Asp Leu Glu Pro Glu Asn Met Glu Ser Val Pro Leu Asp Pro Ser 1295 1300 1305
Ala Ser Ser Ser Ser Leu Pro Leu Pro Asp Arg His Ser Gly His 1310 1315 1320
Lys Ala Glu Asn Gly Pro Gly Pro Gly Val Leu Val Leu Arg Ala 1325 1330 1335
Ser Phe Asp Glu Arg Gln Pro Tyr Ala His Met Asn Gly Gly Arg 1340 1345 1350
Lys Asn Glu Arg Ala Leu Pro Leu Pro Gln Ser Ser Thr Cys 1355 1360 1365
<210> 9 <211> 1204 <212> PRT <213> Bos taurus <400> 9
Met Gln Ser Thr Cys Ser Leu Pro Gln Arg Asn Ser Gln His Val Thr 1 5 10 15
Leu Val Ile Gln Ala Leu Gly Pro Arg Arg Val Ala Gly Gly Leu Gly 20 25 30
Val Pro Gly Gly Gly Pro Ser Ala Gln Arg Pro His Thr Leu Pro Val 35 40 45
Pro Thr Val Cys Pro Ser Ala Cys Gly Lys Arg Ala Cys Thr Glu Thr 50 55 60
His Glu Cys Cys His Pro Glu Cys Leu Gly Ser Cys Ser Ala Pro Asp 65 70 75 80
Asn Ala Thr Ala Cys Val Ala Cys Arg His Tyr Tyr Tyr Ala Gly Val 85 90 95
Cys Val Pro Ser Cys Pro Pro Asn Thr Tyr Arg Phe Glu Gly Trp Arg 100 105 110
Cys Val Asp Arg Asp Phe Cys Ala Asn Ile Pro Asn Ala Glu Ser Ser 115 120 125
Asp Ser Glu Gly Phe Val Ile His Asp Gly Glu Cys Met Gln Glu Cys 130 135 140
Pro Ser Gly Phe Ile Arg Asn Gly Ser Gln Ser Met Tyr Cys Ile Pro 145 150 155 160
Cys Glu Gly Pro Cys Pro Lys Val Cys Glu Glu Glu Lys Lys Thr Lys 165 170 175
Thr Ile Asp Ser Val Thr Ser Ala Gln Met Leu Gln Gly Cys Thr Ile 180 185 190
Phe Lys Gly Asn Leu Leu Ile Asn Ile Arg Arg Gly Asn Asn Ile Ala 195 200 205
Ser Glu Leu Glu Asn Phe Met Gly Leu Ile Glu Val Val Thr Gly Tyr 210 215 220
Val Lys Ile Arg His Ser His Ala Leu Val Ser Leu Ser Phe Leu Lys 225 230 235 240
Asn Leu Arg Gln Ile Leu Gly Glu Glu Gln Leu Glu Gly Asn Tyr Ser 245 250 255
Phe Tyr Val Leu Asp Asn Gln Asn Leu Gln Gln Leu Trp Asp Trp Asp 260 265 270
His Arg Asn Leu Thr Ile Lys Ala Gly Lys Met Tyr Phe Ala Phe Asn 275 280 285
Pro Lys Leu Cys Val Ser Glu Ile Tyr Arg Met Glu Glu Val Thr Gly 290 295 300
Thr Lys Gly Arg Gln Ser Lys Gly Asp Ile Asn Thr Arg Asn Asn Gly 305 310 315 320
Glu Arg Ala Ser Cys Glu Ser Asp Val Leu His Phe Thr Ser Thr Thr 325 330 335
Thr Ser Lys Asn Arg Ile Ile Ile Thr Trp His Arg Tyr Arg Pro Pro 340 345 350
Asp Tyr Arg Asp Leu Ile Ser Phe Thr Val Tyr Tyr Lys Glu Ala Pro 355 360 365
Phe Lys Asn Val Thr Glu Tyr Asp Gly Gln Asp Ala Cys Gly Ser Asn 370 375 380
Ser Trp Asn Met Val Asp Val Asp Leu Pro Pro Asn Lys Asp Val Glu 385 390 395 400
Pro Gly Ile Leu Leu His Gly Leu Lys Pro Trp Thr Gln Tyr Ala Val 405 410 415
Tyr Val Lys Ala Val Thr Leu Thr Met Val Glu Asn Asp His Ile Arg 420 425 430
Gly Ala Lys Ser Glu Ile Leu Tyr Ile Arg Thr Asn Ala Ser Val Pro 435 440 445
Ser Ile Pro Leu Asp Val Leu Ser Ala Ser Asn Ser Ser Ser Gln Leu 450 455 460
Ile Val Lys Trp Asn Pro Pro Ser Leu Pro Asn Gly Asn Leu Ser Tyr 465 470 475 480
Tyr Ile Val Arg Trp Gln Arg Gln Pro Gln Asp Ser Tyr Leu Tyr Arg 485 490 495
His Asn Tyr Cys Ser Lys Asp Lys Ile Pro Ile Arg Lys Tyr Ala Asp 500 505 510
Gly Thr Ile Asp Val Glu Glu Val Thr Glu Asn Pro Lys Thr Glu Val 515 520 525
Cys Gly Gly Glu Lys Gly Pro Cys Cys Ala Cys Pro Lys Thr Glu Ala 530 535 540
Glu Lys Gln Ala Glu Lys Glu Glu Ala Glu Tyr Arg Lys Val Phe Glu 545 550 555 560
Asn Phe Leu His Asn Ala Ile Phe Val Pro Arg Pro Glu Arg Lys Arg 565 570 575
Arg Glu Val Met Gln Ile Ala Asn Thr Thr Met Ser Ser Arg Ser Arg 580 585 590
Asn Thr Thr Val Leu Asp Thr Tyr Asn Ile Thr Asp Pro Glu Glu Leu 595 600 605
Glu Thr Glu Tyr Pro Phe Phe Glu Ser Arg Val Asp Asn Lys Glu Arg 610 615 620
Thr Val Ile Ser Asn Leu Arg Pro Phe Thr Leu Tyr Arg Ile Asp Ile 625 630 635 640
His Ser Cys Asn His Glu Ala Glu Lys Leu Gly Cys Ser Ala Ser Asn 645 650 655
Phe Val Phe Ala Arg Thr Met Pro Ala Glu Gly Ala Asp Asp Ile Pro 660 665 670
Gly Pro Val Thr Trp Glu Pro Arg Pro Glu Asn Ser Ile Phe Leu Lys 675 680 685
Trp Pro Glu Pro Glu Asn Pro Asn Gly Leu Ile Leu Met Tyr Glu Ile 690 695 700
Lys Tyr Gly Ser Gln Val Glu Asp Gln Arg Glu Cys Val Ser Arg Gln 705 710 715 720
Glu Tyr Arg Lys Tyr Gly Gly Ala Lys Leu Asn Arg Leu Asn Pro Gly 725 730 735
Asn Tyr Thr Ala Arg Ile Gln Ala Thr Ser Leu Ser Gly Asn Gly Ser 740 745 750
Trp Thr Asp Pro Val Phe Phe Tyr Val Gln Ala Lys Thr Thr Tyr Glu 755 760 765
Asn Phe Ile His Leu Met Ile Ala Leu Pro Ile Ala Val Leu Leu Ile 770 775 780
Val Gly Gly Leu Val Ile Met Leu Tyr Val Phe His Arg Lys Arg Asn 785 790 795 800
Ser Ser Arg Leu Gly Asn Gly Val Leu Tyr Ala Ser Val Asn Pro Glu 805 810 815
Tyr Phe Ser Ala Ala Asp Val Tyr Val Pro Asp Glu Trp Glu Val Ala 820 825 830
Arg Glu Lys Ile Thr Met Ser Arg Glu Leu Gly Gln Gly Ser Phe Gly 835 840 845
Met Val Tyr Glu Gly Val Ala Lys Gly Val Val Lys Asp Glu Pro Glu 850 855 860
Thr Arg Val Ala Ile Lys Thr Val Asn Glu Ala Ala Ser Met Arg Glu 865 870 875 880
Arg Ile Glu Phe Leu Asn Glu Ala Ser Val Met Lys Glu Phe Asn Cys 885 890 895
His His Val Val Arg Leu Leu Gly Val Val Ser Gln Gly Gln Pro Thr 900 905 910
Leu Val Ile Met Glu Leu Met Thr Arg Gly Asp Leu Lys Ser Tyr Leu 915 920 925
Arg Ser Leu Arg Pro Glu Met Glu Asn Asn Pro Val Leu Ala Pro Pro 930 935 940
Ser Leu Ser Lys Met Ile Gln Met Ala Gly Glu Ile Ala Asp Gly Met 945 950 955 960
Ala Tyr Leu Asn Ala Asn Lys Phe Val His Arg Asp Leu Ala Ala Arg 965 970 975
Asn Cys Met Val Ala Glu Asp Phe Thr Val Lys Ile Gly Asp Phe Gly 980 985 990
Met Thr Arg Asp Ile Tyr Glu Thr Asp Tyr Tyr Arg Lys Gly Gly Lys 995 1000 1005
Gly Leu Leu Pro Val Arg Trp Met Ser Pro Glu Ser Leu Lys Asp 1010 1015 1020
Gly Val Phe Thr Thr His Ser Asp Val Trp Ser Phe Gly Val Val 1025 1030 1035
Leu Trp Glu Ile Ala Thr Leu Ala Glu Gln Pro Tyr Gln Gly Leu 1040 1045 1050
Ser Asn Glu Gln Val Leu Arg Phe Val Met Glu Gly Gly Leu Leu 1055 1060 1065
Asp Lys Pro Asp Asn Cys Pro Asp Met Leu Phe Glu Leu Met Arg 1070 1075 1080
Met Cys Trp Gln Tyr Asn Pro Lys Met Arg Pro Ser Phe Leu Glu 1085 1090 1095
Ile Ile Ser Ser Val Lys Asp Glu Met Glu Ala Gly Phe Arg Glu 1100 1105 1110
Val Ser Phe Tyr Tyr Ser Glu Glu Asn Lys Pro Pro Glu Pro Glu 1115 1120 1125
Glu Leu Asp Leu Glu Pro Glu Asn Met Glu Ser Val Pro Leu Asp 1130 1135 1140
Pro Ser Ala Ser Ser Ala Ser Leu Pro Leu Pro Asp Arg His Ser 1145 1150 1155
Gly His Lys Ala Glu Asn Gly Pro Gly Pro Gly Val Leu Val Leu 1160 1165 1170
Arg Ala Ser Phe Asp Glu Arg Gln Pro Tyr Ala His Met Asn Gly 1175 1180 1185
Gly Arg Lys Asn Glu Arg Ala Leu Pro Leu Pro Gln Ser Ser Thr 1190 1195 1200
Cys
<210> 10 <211> 502 <212> PRT <213> Homo sapiens <400> 10
His Leu Tyr Pro Gly Glu Val Cys Pro Gly Met Asp Ile Arg Asn Asn 1 5 10 15
Leu Thr Arg Leu His Glu Leu Glu Asn Cys Ser Val Ile Glu Gly His 20 25 30
Leu Gln Ile Leu Leu Met Phe Lys Thr Arg Pro Glu Asp Phe Arg Asp 35 40 45
Leu Ser Phe Pro Lys Leu Ile Met Ile Thr Asp Tyr Leu Leu Leu Phe 50 55 60
Arg Val Tyr Gly Leu Glu Ser Leu Lys Asp Leu Phe Pro Asn Leu Thr 65 70 75 80
Val Ile Arg Gly Ser Arg Leu Phe Phe Asn Tyr Ala Leu Val Ile Phe 85 90 95
Glu Met Val His Leu Lys Glu Leu Gly Leu Tyr Asn Leu Met Asn Ile 100 105 110
Thr Arg Gly Ser Val Arg Ile Glu Lys Asn Asn Glu Leu Cys Tyr Leu 115 120 125
Ala Thr Ile Asp Trp Ser Arg Ile Leu Asp Ser Val Glu Asp Asn His 130 135 140
Ile Val Leu Asn Lys Asp Asp Asn Glu Glu Cys Gly Asp Ile Cys Pro 145 150 155 160
Gly Thr Ala Lys Gly Lys Thr Asn Cys Pro Ala Thr Val Ile Asn Gly 165 170 175
Gln Phe Val Glu Arg Cys Trp Thr His Ser His Cys Gln Lys Val Cys 180 185 190
Pro Thr Ile Cys Lys Ser His Gly Cys Thr Ala Glu Gly Leu Cys Cys 195 200 205
His Ser Glu Cys Leu Gly Asn Cys Ser Gln Pro Asp Asp Pro Thr Lys 210 215 220
Cys Val Ala Cys Arg Asn Phe Tyr Leu Asp Gly Arg Cys Val Glu Thr 225 230 235 240
Cys Pro Pro Pro Tyr Tyr His Phe Gln Asp Trp Arg Cys Val Asn Phe 245 250 255
Ser Phe Cys Gln Asp Leu His His Lys Cys Lys Asn Ser Arg Arg Gln 260 265 270
Gly Cys His Gln Tyr Val Ile His Asn Asn Lys Cys Ile Pro Glu Cys 275 280 285
Pro Ser Gly Tyr Thr Met Asn Ser Ser Asn Leu Leu Cys Thr Pro Cys 290 295 300
Leu Gly Pro Cys Pro Lys Val Cys His Leu Leu Glu Gly Glu Lys Thr 305 310 315 320
Ile Asp Ser Val Thr Ser Ala Gln Glu Leu Arg Gly Cys Thr Val Ile 325 330 335
Asn Gly Ser Leu Ile Ile Asn Ile Arg Gly Gly Asn Asn Leu Ala Ala 340 345 350
Glu Leu Glu Ala Asn Leu Gly Leu Ile Glu Glu Ile Ser Gly Tyr Leu 355 360 365
Lys Ile Arg Arg Ser Tyr Ala Leu Val Ser Leu Ser Phe Phe Arg Lys 370 375 380
Leu Arg Leu Ile Arg Gly Glu Thr Leu Glu Ile Gly Asn Tyr Ser Phe 385 390 395 400
Tyr Ala Leu Asp Asn Gln Asn Leu Arg Gln Leu Trp Asp Trp Ser Lys 405 410 415
His Asn Leu Thr Thr Thr Gln Gly Lys Leu Phe Phe His Tyr Asn Pro 420 425 430
Lys Leu Cys Leu Ser Glu Ile His Lys Met Glu Glu Val Ser Gly Thr 435 440 445
Lys Gly Arg Gln Glu Arg Asn Asp Ile Ala Leu Lys Thr Asn Gly Asp 450 455 460
Lys Ala Ser Cys Glu Asn Glu Leu Leu Lys Phe Ser Tyr Ile Arg Thr 465 470 475 480
Ser Phe Asp Lys Ile Ser Asp Asp Asp Asp Lys Glu Gln Lys Leu Ile 485 490 495
Ser Glu Glu Asp Leu Asn 500
<210> 11 <211> 16 <212> PRT <213> Homo sapiens <400> 11
Thr Phe Glu Asp Tyr Leu His Asn Val Val Phe Val Pro Arg Pro Ser 1 5 10 15
<210> 12 <211> 16 <212> PRT <213> Homo sapiens <400> 12
Thr Phe Glu Asp Tyr Leu His Asn Val Val Ala Val Pro Arg Pro Ser 1 5 10 15
<210> 13 <211> 18 <212> PRT <213> Homo sapiens <400> 13
Leu Lys Glu Leu Glu Glu Ser Ser Phe Arg Lys Thr Phe Glu Asp Tyr 1 5 10 15
Leu His
<210> 14 <211> 16 <212> PRT <213> Homo sapiens <400> 14
Val Phe Glu Asn Phe Leu His Asn Ser Ile Phe Val Pro Arg Pro Glu 1 5 10 15
<210> 15 <211> 17 <212> PRT <213> Homo sapiens <400> 15
Ala Glu Lys Glu Glu Ala Glu Tyr Arg Lys Val Phe Glu Asn Phe Leu 1 5 10 15
His
<210> 16 <211> 36 <212> PRT <213> Homo sapiens <400> 16
Ser Leu Glu Glu Glu Trp Ala Gln Val Glu Cys Glu Val Tyr Gly Arg 1 5 10 15
Gly Cys Pro Ser Gly Ser Leu Asp Glu Ser Phe Tyr Asp Trp Phe Glu 20 25 30
Arg Gln Leu Gly 35
<210> 17 <211> 20 <212> PRT <213> Homo sapiens <400> 17
Ser Leu Glu Glu Glu Trp Ala Gln Val Glu Cys Glu Val Tyr Gly Arg 1 5 10 15
Gly Cys Pro Ser 20
<210> 18 <211> 16 <212> PRT <213> Homo sapiens <400> 18
Gly Ser Leu Asp Glu Ser Phe Tyr Asp Trp Phe Glu Arg Gln Leu Gly 1 5 10 15
<210> 19 <211> 5 <212> PRT <213> Artificial Sequence <220> <223> Conserved motif of insulin-mimetic peptides <220> <221> MISC_FEATURE <222> (3)..(3) <223> Xaa = any amino acid <400> 19
Phe Tyr Xaa Trp Phe 1 5
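The conserved motif of SEQ ID NO: 19 can be checked against the S519C16 peptide of SEQ ID NO: 18 with a short script. The following is a minimal sketch only, not part of the patent's disclosed method: the helper names `to_one_letter` and `motif_to_regex` are hypothetical, and the residue strings are transcribed from the listing above with the position numbers left in place.

```python
import re

# Standard three-letter to one-letter amino acid codes; Xaa denotes any residue.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(row_text):
    """Convert listing rows to a one-letter string, skipping position numbers."""
    tokens = [t for t in row_text.split() if not t.isdigit()]
    return "".join(THREE_TO_ONE[t] for t in tokens)


def motif_to_regex(motif):
    """Compile a one-letter motif into a regex in which X matches any residue."""
    return re.compile(motif.replace("X", "."))


# Rows as they appear in the listing (hypothetical variable names).
SEQ_ID_18 = "Gly Ser Leu Asp Glu Ser Phe Tyr Asp Trp Phe Glu Arg Gln Leu Gly 1 5 10 15"
SEQ_ID_19 = "Phe Tyr Xaa Trp Phe 1 5"

peptide = to_one_letter(SEQ_ID_18)   # one-letter form of S519C16
motif = to_one_letter(SEQ_ID_19)     # one-letter form of the motif
match = motif_to_regex(motif).search(peptide)
start_1_based = match.start() + 1    # listing positions are 1-based

print(peptide, motif, match.group(), start_1_based)
```

Under these assumptions the motif is found once in SEQ ID NO: 18, with Asp occupying the Xaa position. The same helpers could be applied to SEQ ID NO: 16, which contains SEQ ID NO: 18 as its C-terminal 16 residues.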

Claims (16)

1. A computer-assisted method of identifying, designing or screening for a compound that can potentially interact with insulin receptor (IR) and/or insulin-like growth factor-1 receptor (IGF-1R), comprising performing structure-based identification, design or screening of a compound based on the compound's interactions with a structure defined by the atomic coordinates of one or more of Appendices I to VI, or a subset of the atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, wherein the atomic coordinates define one or more regions of the low-affinity binding site of IR for insulin, and/or the low-affinity binding site of IGF-1R for IGF, comprising the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R, wherein the C-terminal region of the α-chain of IR comprises amino acids 693 to 710 of the IR α-chain (SEQ ID NO: 13), wherein the C-terminal region of the α-chain of IGF-1R comprises amino acids 681 to 697 of the IGF-1R α-chain (SEQ ID NO: 15), and wherein the mimetic of the C-terminal region of the α-chain of IR and/or IGF-1R is S519C16 (SEQ ID NO: 18).

2. The method of claim 1, comprising identifying, designing or screening for a compound that interacts with the three-dimensional structure of: (i) the low-affinity insulin binding site of IR, the structure being defined by the atomic coordinates shown in one or more of Appendices I, III and V; and/or (ii) the low-affinity insulin-like growth factor (IGF) binding site of IGF-1R, the structure being defined by the atomic coordinates shown in one or more of Appendices II, IV and VI, wherein interaction of the compound with the structure is energetically favored, and/or further comprising synthesizing or obtaining an identified or designed candidate compound and determining the ability of the candidate compound to interact with IR and/or IGF-1R.

3. The method of claim 1, wherein the atomic coordinates define the low-affinity insulin binding site of IR further comprising the leucine-rich repeat 1 (L1) domain and/or the cysteine-rich (CR) domain of the IR ectodomain, optionally wherein the atomic coordinates define parts of the molecular surface of the central β-sheet of the L1 domain and parts of the molecular surface of the second leucine-rich repeat (LRR) containing Phe39 and/or the helix in the fourth LRR cross-link of the L1 domain, and/or wherein the atomic coordinates define module 6 of the CR domain of IR.

4. The method of any one of the preceding claims, wherein the atomic coordinates further define one or more amino acid sequences selected from IR amino acid residues 1-156, 157-310, 594 and 794, optionally wherein the one or more amino acids selected from IR amino acid residues 1-156 comprise at least one amino acid selected from Arg14, Asn15, Gln34, Leu36, Leu37, Phe39, Pro43, Phe46, Leu62, Phe64, Leu87, Phe88, Phe89, Asn90, Phe96, Glu97, Arg118, Glu120 and His144, and/or wherein the one or more amino acids selected from IR amino acid residues 157-310 comprise at least one of the amino acid sequences selected from 192-310, 227-303 and 259-284.

5. The method of claim 1, wherein the atomic coordinates define the low-affinity IGF binding site of IGF-1R further comprising the L1 domain and/or the CR domain of the IGF-1R ectodomain, optionally wherein the atomic coordinates define the central β-sheet of the L1 domain, and/or the part of the second LRR containing Ser35, and/or the helix in the fourth LRR cross-link of the L1 domain, and/or wherein the atomic coordinates define module 6 of the CR domain of IGF-1R.

6. The method of claim 2, wherein the compound substitutes for the C-terminal region of the α-chain of IR and/or the C-terminal region of the α-chain of IGF-1R in the formation of the low-affinity binding site of IR or IGF-1R.

7. The method of any one of the preceding claims, wherein a candidate compound for interacting with IR and/or IGF-1R is chemically modified as a result of structure-based evaluation, optionally wherein the chemical modification is designed to either: (i) reduce the potential for the candidate compound to bind to IR while maintaining binding to IGF-1R; or (ii) reduce the potential for the candidate compound to bind to IGF-1R while maintaining binding to IR.

8. Computer-assisteret fremgangsmåde til at re-designe en forbindelse der er kendt for at binde til IR og/eller IGF-1R der omfatter udførelse af struktur-baseret evaluering af forbindelsen baseret på forbindelsens interaktioner med en struktur defineret af de atomiske koordinater fra én eller flere af Appendix I til VI, eller et delsæt af atomiske koordinater af én eller flere deraf mindst repræsenterer den C-terminale region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, og re-designe eller kemisk modificere forbindelsen som et resultat af evalueringen, hvor de atomiske koordinater udviser én eller flere regioner af lav-affinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den
C-terminale region af α-kæden af IR omfatter aminosyrerne 693 til 710 af IR a-kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter aminosyrerne 681 til 697 af IGF-1R α-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R is S519C16 (SEQ ID NO: 18).A computer-assisted method of redesigning a compound known to bind to IR and / or IGF-1R comprising performing structure-based evaluation of the compound based on the compound's interactions with a structure defined by the atomic coordinates of one or more of Appendices I to VI, or a subset of atomic coordinates of one or more thereof at least representing the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α chain of IR and / or IGF-1R, and redesign or chemically modify the compound as a result of the evaluation, where the atomic coordinates exhibit one or more regions of the low affinity binding site of IR for insulin, and / or the low-affinity binding site of IGF-1R for IGF comprising the C-terminal region of the α chain of IR, the C-terminal region of the α chain of IGF-1R, or a mimetic of the C terminal region of the α chain a f IR and / or IGF-1R, wherein the C-terminal region of the α-chain of IR comprises amino acids 693 to 710 of the IR α-chain (SEQ ID NO: 13), wherein the C-terminal region of the α-chain of IGF-1R comprises amino acids 681 to 697 of the IGF-1R α chain (SEQ ID NO: 15), and wherein the mimetic of the C-terminal region of the α chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 15). : 18). 9. 
Fremgangsmåden ifølge krav 8, hvor forbindelsen er re-designet eller kemisk modificeret til (i) at forbedre bindingsaffinitet til IR, og/eller (ii) reducere bindingsaffinitet til IGF-1R, eller hvor forbindelsen er re-designet eller kemisk modificeret til (i) at forbedre bindingsaffinitet til IGF-1R, og/eller (ii) reducere bindingsaffinitet til IR, eventuelt hvor forbindelsen er re-designet eller modificeret til at reducere affiniteten til IR eller IGF-1R i kraft af de strukturelle forskelle mellem IR og IGF-1R ved eller i nærheden af den C-terminale region af α-kæden af IR og den C-terminale region af α-kæden af IGF-1R.The method of claim 8, wherein the compound is redesigned or chemically modified to (i) improve binding affinity for IR, and / or (ii) reduce binding affinity to IGF-1R, or wherein the compound is redesigned or chemically modified to (i) improving binding affinity for IGF-1R, and / or (ii) reducing binding affinity for IR, optionally where the compound is redesigned or modified to reduce affinity for IR or IGF-1R due to the structural differences between IR and IGF-1R at or near the C-terminal region of the α-chain of IR and the C-terminal region of the α-chain of IGF-1R. 10. 
Computer-assisteret fremgangsmåde til at identificere en forbindelse der potentielt interagerer med IR og/eller IGF-1R, hvilken fremgangsmåde omfatter tilpasning af strukturen af: (i) lav-affinitets-insulinbindingsstedet af IR, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix I, III og V; (ii) lav-affinitets-IGF-bindingsstedet af IGF-1R, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix II, IV og VI og/eller (iii) den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, strukturen er defineret af et delsæt af atomiske koordinater som vist i én eller flere af Appendix I til VI, til strukturen af en kandidatforbindelse, hvor de atomiske koordinater definerer én eller flere regioner af lavaffinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den C-terminale region af α-kæden af IR omfatter aminosyrerne 693 til 710 af IR a-kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter aminosyrerne 681 til 697 af IGF-1R α-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R er S519C16 (SEQ ID NO: 18).A computer-assisted method for identifying a compound potentially interacting with IR and / or IGF-1R, which comprises adapting the structure of: (i) the low-affinity insulin binding site of IR, the structure defined by the atomic coordinates as shown in one or more of Appendices I, III and V; (ii) the low-affinity IGF binding site of IGF-1R, the structure is defined by the atomic coordinates as shown in one or more of Appendix II, IV and VI and / or (iii) the C-terminal region of the α chain of IR, the 
C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-1R, the structure is defined by a subset of atomic coordinates as shown in one or more of Appendices I to VI, for the structure of a candidate compound wherein the atomic coordinates define one or more regions of the low affinity binding site of IR for insulin, and / or the low affinity binding site of IGF-1R for IGF comprising the C terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-1R, wherein the C-terminal region of the α chain of IR comprises amino acids 693 to 710 of the IR α chain (SEQ ID NO: 13), wherein the C-terminal region of the α chain of IGF-1R comprises amino acids 681 to 697 of the IGF-1R α chain (SEQ ID NO: 15), and wherein the mimetic of the C-terminal region of the α chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 15). : 18). 11. Computer-assisteret fremgangsmåde til at identificere en forbindelse der er i stand til at interagere med IR og/eller IGF-1R under anvendelse af en programmeret computer der omfatter en processor, hvilken fremgangsmåde omfatter trinnene at: (a) generere, under anvendelse af computerfremgangsmåder, et sæt af atomiske koordinater af en struktur der har energetisk favorable interaktioner med de atomiske koordinater af: (i) lav-affinitets-insulinbindingsstedet af IR, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix I, III og V; (ii) lav-affinitets-IGF-bindingsstedet af IGF-1R, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix II, IV og VI og/eller (iii) den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, strukturen er defineret af et delsæt af atomiske koordinater som vist i én eller flere af Appendix I til VI, 
hvilke koordinater er indtastet i computeren og derved genererer et kriteriedatasæt; (b) sammenligne, under anvendelse af processoren, kriteriedatasættet til en computerdatabase af kemiske strukturer; (c) vælge fra databasen, under anvendelse af computerfremgangsmåder, kemiske strukturer der er komplementære eller svarende til en region af kriteriedatasættet og eventuelt, (d) udskrive, til en udskrivningsindretning, de valgte kemiske strukturer der er komplementære med eller svarende til en region af kriteriedatasættet, hvor de atomiske koordinater definerer én eller flere regioner af lavaffinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den C-terminale region af α-kæden af IR omfatter aminosyrerne 693 til 710 af IR a- kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter aminosyrerne 681 til 697 af IGF-1R α-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R er S519C16 (SEQ ID NO: 18).A computer-assisted method for identifying a compound capable of interacting with IR and / or IGF-1R using a programmed computer comprising a processor, the method comprising the steps of: (a) generating, using of computer methods, a set of atomic coordinates of a structure having energetically favorable interactions with the atomic coordinates of: (i) the low-affinity insulin binding site of IR, the structure is defined by the atomic coordinates as shown in one or more of Appendix I, III and V; (ii) the low-affinity IGF binding site of IGF-1R, the structure is defined by the atomic coordinates as shown in one or more of Appendix II, IV and VI and / or (iii) the C-terminal region of the α chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal 
region of the α-chain of IR and / or IGF-1R, the structure is defined by a subset of atomic coordinates as shown in one or more of Appendices I to VI, which coordinates are entered into the computer thereby generating a criteria dataset; (b) comparing, using the processor, the criteria dataset to a computer database of chemical structures; (c) selecting from the database, using computer methods, chemical structures that are complementary or corresponding to a region of the criteria dataset and, optionally, (d) printing, to a printing device, the selected chemical structures that are complementary to or corresponding to a region of the criterion dataset, wherein the atomic coordinates define one or more regions of the low affinity binding site of IR for insulin, and / or the low affinity binding site of IGF-1R for IGF comprising the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-1R, wherein the C-terminal region of the α-chain of IR comprises amino acids 693 to 710 of IR α chain (SEQ ID NO: 13), wherein the C-terminal region of the α chain of IGF-1R comprises amino acids 681 to 697 of the IGF-1R α chain (SEQ ID NO: 15), and wherein the mimetic of the The C-terminal region of the α-chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 18). 12. 
Computer-assisteret fremgangsmåde til at identificere potentielle mimetika af IR og/eller IGF-1R under anvendelse af en programmeret computer der omfatter en processor, hvilken fremgangsmåde omfatter trinnene at: (a) generere et kriteriedatasæt fra et sæt af atomiske koordinater af: (i) lav-affinitets-insulinbindingsstedet af IR, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix I, III og V; (ii) lav-affinitets-IGF-bindingsstedet af IGF-1R, strukturen er defineret af de atomiske koordinater som vist i én eller flere af Appendix II, IV og VI og/eller (iii) den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-IR, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-IR, strukturen er defineret af et delsæt af atomiske koordinater som vist i én eller flere af Appendix I til VI, hvilke koordinater er indtastet i computeren; (b) (i) sammenligne, under anvendelse af processoren, kriteriedatasættet til en computerdatabase af kemiske strukturer lagret i et computerdatalagringssystem og vælge fra databasen, under anvendelse af computerfremgangsmåder, kemiske strukturer med en region der strukturelt svarer til kriteriedatasættet; eller (ii) konstruere, under anvendelse af computerfremgangsmåder, en model af kemisk struktur med en region der strukturelt svarer til kriteriedatasættet og, eventuelt, (c) udskrive til en udskrivningsindretning: (i) de valgte kemiske strukturer fra trin (b)(1) med en region der svarer til kriteriedatasættet; eller (ii) den konstruerede model fra trin (b)(ii), hvor de atomiske koordinater definerer én eller flere regioner af lavaffinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af a-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den C-terminale region af α-kæden af IR omfatter 
aminosyrerne 693 til 710 af IR a-kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter aminosyrerne 681 til 697 af IGF-1R α-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R er S519C16 (SEQ ID NO: 18).A computer-assisted method for identifying potential mimetics of IR and / or IGF-1R using a programmed computer comprising a processor, the method comprising the steps of: (a) generating a criteria dataset from a set of atomic coordinates of: (i) the low affinity insulin binding site of IR, the structure being defined by the atomic coordinates as shown in one or more of Appendices I, III and V; (ii) the low-affinity IGF binding site of IGF-1R, the structure is defined by the atomic coordinates as shown in one or more of Appendix II, IV and VI and / or (iii) the C-terminal region of the α chain of IR, the C-terminal region of the α-chain of IGF-IR, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-IR, the structure is defined by a subset of atomic coordinates as shown in one or more of Appendices I to VI, which coordinates are entered in the computer; (b) (i) comparing, using the processor, the criteria dataset to a computer database of chemical structures stored in a computer data storage system and selecting from the database, using computer methods, chemical structures with a region structurally similar to the criteria dataset; or (ii) construct, using computer methods, a model of chemical structure with a region structurally similar to the criteria dataset and, optionally, (c) print to a printing device: (i) the selected chemical structures of step (b) (1) ) with a region corresponding to the criteria dataset; or (ii) the constructed model of step (b) (ii), wherein the atomic coordinates define one or more regions of the low-affinity binding site of IR for insulin, and / or the low-affinity binding site of IGF-1R for IGF comprising the C terminal region of the α-chain 
of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-1R, wherein the C-terminal region of the α chain of IR comprises amino acids 693 to 710 of IR α chain (SEQ ID NO: 13), wherein the C-terminal region of the α chain of IGF-1R comprises amino acids 681 to 697 of IGF-1R α-chain. chain (SEQ ID NO: 15) and wherein the mimetic of the C-terminal region of the α chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 18). 13. Fremgangsmåde til at evaluere en forbindelses evne til at interagere med IR og/eller IGF-1R, hvilken fremgangsmåde omfatter trinnene at: (a) anvende computer-organer til at udøve en tilpasningsoperation mellem forbindelsen og bindingsfladen af en computermodel af lav-affinitetsbindingsstedet for insulin på IR-ectodomænet, og/eller lav-affinitetsbindingsstedet for IGF på IGF-lR-ectodomænet, under anvendelse af atom-koordinater hvor kvadratroden af den gennemsnitlige kvadratafvigelse mellem atom-koordinaterne og atom-koordinater fra én eller flere af Appendix I til VI eller et delsæt af atomiske koordinater af én eller flere deraf mindst repræsenterer den C-terminale region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, ikke er mere end 1,5 Å og (b) analysere resultaterne af tilpasningsoperationen til at kvantificere associationen mellem forbindelsen og bindingsflademodellen, hvor de atomiske koordinater udviser én eller flere regioner af lav-affinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den C-terminale region af α-kæden af IR omfatter aminosyrerne 693 til 710 af IR a-kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter 
aminosyrerne 681 til 697 af IGF-1R α-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R er S519C16 (SEQ ID NO: 18).A method of evaluating a compound's ability to interact with IR and / or IGF-1R, which method comprises the steps of: (a) using computer means to perform a matching operation between the connection and the binding surface of a low-affinity binding site computer model for insulin on the IR ectodomain, and / or low-affinity binding site for IGF on the IGF-1R ectodomain, using atomic coordinates where the square root of the mean square deviation between the atomic coordinates and atomic coordinates from one or more of Appendix I to VI or a subset of atomic coordinates of one or more of them at least represents the C-terminal region of the α-chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α chain of IR and / or IGF-1R, is not more than 1.5 Å and (b) analyze the results of the fitting operation to quantify the association between the connection and bonding model one wherein the atomic coordinates exhibit one or more regions of the low-affinity binding site of IR for insulin, and / or the low-affinity binding site of IGF-1R for IGF comprising the C-terminal region of the α chain of IR, the C terminal region of the α chain of IGF-1R, or a mimetic of the C-terminal region of the α chain of IR and / or IGF-1R, the C-terminal region of the α chain of IR comprising amino acids 693 to 710 of IR α chain (SEQ ID NO: 13), wherein the C-terminal region of the α chain of IGF-1R comprises amino acids 681 to 697 of IGF-1R α chain (SEQ ID NO: 15), and wherein mimetic of the C-terminal region of the α chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 18). 
14.14th FremgangsmådeCourse of action til anvendelse af molekylær udskiftning for at opnå strukturel information om et molekyle eller et molekylkompleks af ukendt struktur, der omfatter trinnene at: (i) generere et røntgendiffraktionsmønster af det krystalliserede molekyle eller molekylkomplekset og (ii) påføre de atomiske koordinater fra én eller flere af Appendix I til VI, eller et delsæt af atomiske koordinater af én eller flere deraf mindst repræsenterer region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, til røntgendiffraktionsmønsteret til at generere en tre-dimensionel elektrondensitetsafbildning af mindst én region af molekylet eller molekylkomplekset hvis struktur er ukendt, hvor de atomiske koordinater udviser én eller flere regioner af lav-affinitetsbindingsstedet af IR for insulin, og/eller lav-affinitetsbindingsstedet af IGF-1R for IGF, der omfatter den C-terminale region af α-kæden af IR, den C-terminale region af α-kæden af IGF-1R, eller et mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R, hvor den C-terminale region af α-kæden af IR omfatter aminosyrerne 693 til 710 af IR a- kæde (SEQ ID NO: 13), hvor den C-terminale region af α-kæden af IGF-1R omfatter aminosyrerne 681 til 697 af IGF-1R o-kæde (SEQ ID NO: 15), og hvor mimetikum af den C-terminale region af α-kæden af IR og/eller IGF-1R er S519C16 (SEQ ID NO: 18).for using molecular replacement to obtain structural information about a molecule or molecule complex of unknown structure comprising the steps of: (i) generating an X-ray diffraction pattern of the crystallized molecule or molecule complex and (ii) applying the atomic coordinates of one or more of Appendices I to VI, or a subset of atomic coordinates of one or more thereof, at least represent region of the α chain of IR, the C-terminal region of the α chain of IGF-1R, or a mimetic of the C-terminal region of the α chain of 
IR and / or IGF-1R, to the X-ray diffraction pattern to generate a three-dimensional electron density image of at least one region of the molecule or molecule complex whose structure is unknown, with the atomic coordinates exhibiting one or more regions of the low affinity binding site of IR for insulin, and / or the low-affinity binding site of IGF-1R for IGF comprising the C-terminal region of the α chain of IR, the C-terminal region of the α-chain of IGF-1R, or a mimetic of the C-terminal region of the α-chain of IR and / or IGF-1R, wherein the C-terminal region of the α-chain of IR comprises the amino acids 693 to 710 of the IR α chain (SEQ ID NO: 13), wherein the C-terminal region of the α chain of IGF-1R comprises amino acids 681 to 697 of the IGF-1R o chain (SEQ ID NO: 15), and wherein the mimetic of the C-terminal region of the α chain of IR and / or IGF-1R is S519C16 (SEQ ID NO: 18).
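
Illustrative note (not part of the claims): claim 13 requires that the model coordinates deviate from the Appendix coordinates by a root mean square deviation of not more than 1.5 Å. The minimal Python sketch below shows how such a check could be computed. It assumes the two coordinate sets are already superposed and listed atom for atom in the same order; the function names and the toy coordinates are hypothetical and do not come from the patent or its appendices.

```python
import math

# Threshold taken from claim 13 (in angstroms).
RMSD_CUTOFF_ANGSTROM = 1.5


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two paired lists of (x, y, z) tuples.

    Assumption: both sets are already superposed and list equivalent atoms
    in the same order; no optimal superposition is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def within_cutoff(coords_a, coords_b, cutoff=RMSD_CUTOFF_ANGSTROM):
    """True when the RMSD does not exceed the cutoff."""
    return rmsd(coords_a, coords_b) <= cutoff


# Toy data: every atom of the model is displaced by exactly 1 Å along z.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
model = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (3.0, 0.0, 1.0)]

print(rmsd(reference, model))           # 1.0
print(within_cutoff(reference, model))  # True
```

In practice the comparison would normally follow an optimal superposition of the two structures, which this sketch deliberately omits.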
DK10766481.5T 2009-04-22 2010-03-09 STRUCTURE OF THE C-TERMINAL REGION OF THE INSULIN RECEPTOR ALPHA CHAIN AND THE INSULIN-LIKE GROWTH FACTOR RECEPTOR ALPHA CHAIN DK2422201T3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US21447209P 2009-04-22 2009-04-22
PCT/AU2010/000271 WO2010121288A1 (en) 2009-04-22 2010-03-09 STRUCTURE OF THE C-TERMINAL REGION OF THE INSULIN RECEPTOR α-CHAIN AND OF THE INSULIN-LIKE GROWTH FACTOR RECEPTOR α-CHAIN

Publications (1)

Publication Number Publication Date
DK2422201T3 (en) 2015-05-18

Family

ID=43010574

Family Applications (1)

Application Number Title Priority Date Filing Date
DK10766481.5T DK2422201T3 (en) STRUCTURE OF THE C-TERMINAL REGION OF THE INSULIN RECEPTOR ALPHA CHAIN AND THE INSULIN-LIKE GROWTH FACTOR RECEPTOR ALPHA CHAIN 2009-04-22 2010-03-09

Country Status (6)

Country Link
US (2) US8666718B2 (en)
EP (1) EP2422201B1 (en)
CN (1) CN102460175B (en)
AU (1) AU2010239125B2 (en)
DK (1) DK2422201T3 (en)
WO (1) WO2010121288A1 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7192713B1 (en) 1999-05-18 2007-03-20 President And Fellows Of Harvard College Stabilized compounds having secondary structure motifs
JP5649825B2 (en) 2007-01-31 2015-01-07 デイナ ファーバー キャンサー インスティチュート,インコーポレイテッド Stabilized p53 peptides and methods of use thereof
ES2430067T3 (en) 2007-03-28 2013-11-18 President And Fellows Of Harvard College Sewn polypeptides
RU2639523C2 (en) 2011-10-18 2017-12-21 Эйлерон Терапьютикс, Инк. Peptidomimetic macrocycles and their application
CA2864120A1 (en) 2012-02-15 2013-08-22 Aileron Therapeutics, Inc. Triazole-crosslinked and thioether-crosslinked peptidomimetic macrocycles
CA2862038C (en) 2012-02-15 2021-05-25 Aileron Therapeutics, Inc. Peptidomimetic macrocycles
US20150198619A1 (en) * 2012-09-25 2015-07-16 The Walter And Eliza Hall Institute Of Medical Research Structure of insulin in complex with n- and c-terminal regions of the insulin receptor alpha-chain
WO2014055564A1 (en) * 2012-10-01 2014-04-10 President And Fellows Of Harvard College Stabilized polypeptide insulin receptor modulators
WO2014138429A2 (en) 2013-03-06 2014-09-12 Aileron Therapeutics, Inc. Peptidomimetic macrocycles and use thereof in regulating hif1alpha
US10227390B2 (en) * 2013-06-14 2019-03-12 President And Fellows Of Harvard College Stabilized polypeptide insulin receptor modulators
US10533039B2 (en) 2014-05-21 2020-01-14 President And Fellows Of Harvard College Ras inhibitory peptides and uses thereof
MX2017003797A (en) 2014-09-24 2017-06-15 Aileron Therapeutics Inc Peptidomimetic macrocycles and uses thereof.
CN107614003A (en) 2015-03-20 2018-01-19 艾瑞朗医疗公司 Peptidomimetic macrocyclic compound and application thereof
WO2017004548A1 (en) 2015-07-01 2017-01-05 Aileron Therapeutics, Inc. Peptidomimetic macrocycles
AU2018329956A1 (en) 2017-09-07 2020-08-20 Fog Pharmaceuticals, Inc. Agents modulating beta-catenin functions and methods thereof

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5223409A (en) 1988-09-02 1993-06-29 Protein Engineering Corp. Directed evolution of novel binding proteins
DE69834828T2 (en) * 1997-11-27 2007-01-04 Commonwealth Scientific And Industrial Research Organisation PROCESS FOR THE CONSTRUCTION OF AGONISTS AND ANTAGONISTS OF THE IGF RECEPTOR (1-462)
AU2783099A (en) * 1998-02-24 1999-09-06 Receptron, Inc. Receptor derived peptides as modulators of receptor activity
EP0957164B1 (en) * 1998-05-15 2001-03-21 Novo Nordisk A/S Insulin binding polypeptide
US7173005B2 (en) 1998-09-02 2007-02-06 Antyra Inc. Insulin and IGF-1 receptor agonists and antagonists
CN1791671A (en) * 2003-03-21 2006-06-21 先灵公司 Method of screening for target ligands
MY146381A (en) * 2004-12-22 2012-08-15 Amgen Inc Compositions and methods relating relating to anti-igf-1 receptor antibodies
CN100371346C (en) * 2005-12-14 2008-02-27 浙江大学 Artificial synthesized insulin-simulated peptide and its application
AU2007262666B2 (en) * 2006-06-22 2013-09-12 Walter And Eliza Hall Institute Of Medical Research Structure of the insulin receptor ectodomain
US7956216B2 (en) 2006-12-21 2011-06-07 The Walter And Eliza Hall Institute Of Medical Research Alpha-helical mimetics
CN201194106Y (en) * 2008-03-04 2009-02-11 劲永国际股份有限公司 Computer system apparatus and mobile storage apparatus

Also Published As

Publication number Publication date
AU2010239125A1 (en) 2011-11-03
EP2422201A4 (en) 2013-01-30
EP2422201A1 (en) 2012-02-29
EP2422201B1 (en) 2015-04-22
WO2010121288A1 (en) 2010-10-28
US8666718B2 (en) 2014-03-04
CN102460175B (en) 2014-09-10
US20120122771A1 (en) 2012-05-17
CN102460175A (en) 2012-05-16
US20140154817A1 (en) 2014-06-05
AU2010239125B2 (en) 2015-05-14
