AU2015203005A1 - A method of diagnosing neoplasms - Google Patents

A method of diagnosing neoplasms Download PDF

Info

Publication number
AU2015203005A1
AU2015203005A1 AU2015203005A AU2015203005A AU2015203005A1 AU 2015203005 A1 AU2015203005 A1 AU 2015203005A1 AU 2015203005 A AU2015203005 A AU 2015203005A AU 2015203005 A AU2015203005 A AU 2015203005A AU 2015203005 A1 AU2015203005 A1 AU 2015203005A1
Authority
AU
Australia
Prior art keywords
hugenest
genes
hugene
transcripts
gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
AU2015203005A
Other versions
AU2015203005B2 (en
Inventor
Robert Dunne
Lawrence Clarence Lapointe
Trevor John Lockett
William J. Wilson
Graeme P. Young
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Commonwealth Scientific and Industrial Research Organization CSIRO
Clinical Genomics Pty Ltd
Original Assignee
Commonwealth Scientific and Industrial Research Organization CSIRO
Clinical Genomics Pty Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2008316319A external-priority patent/AU2008316319A1/en
Application filed by Commonwealth Scientific and Industrial Research Organization CSIRO, Clinical Genomics Pty Ltd filed Critical Commonwealth Scientific and Industrial Research Organization CSIRO
Priority to AU2015203005A priority Critical patent/AU2015203005B2/en
Publication of AU2015203005A1 publication Critical patent/AU2015203005A1/en
Application granted granted Critical
Publication of AU2015203005B2 publication Critical patent/AU2015203005B2/en
Priority to AU2018200024A priority patent/AU2018200024A1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)

Description

A METHOD OF DIAGNOSING NEOPLASMS FIELD OF THE INVENTION 5 The present invention relates generally to nucleic acid molecules, the RNA and protein expression profiles of which are indicative of the onset, predisposition to the onset and/or progression of a neoplasm. More particularly, the present invention is directed to nucleic acid molecules, the expression profiles of which are indicative of the onset and/or progression of a large intestine neoplasm, such as an adenoma or an adenocarcinoma. The expression profiles 10 of the present invention are useful in a range of applications including, but not limited to, those relating to the diagnosis and/or monitoring of colorectal neoplasms, such as colorectal adenocarcinomas. Accordingly, in a related aspect the present invention is directed to a method of screening a subject for the onset, predisposition to the onset and/or progression of a neoplasm by screening for modulation in the expression profile of one or more nucleic acid 15 molecule markers. BACKGROUND OF THE INVENTION Bibliographic details of the publications referred to by author in this specification are 20 collected alphabetically at the end of the description. The reference in this specification to any prior publication (or information derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that that prior publication (or information derived from 25 it) or known matter forms part of the common general knowledge in the field of endeavour to which this specification relates. Adenomas are benign tumours, or neoplasms, of epithelial origin which are derived from glandular tissue or exhibit clearly defined glandular structures. Some adenomas show 30 recognisable tissue elements, such as fibrous tissue (fibroadenomas) and epithelial structure, while others, such as bronchial adenomas, produce active compounds that might give rise to clinical syndromes. Adenomas may progress to become an invasive neoplasm and are then termed 35 adenocarcinomas. Accordingly, adenocarcinomas are defined as malignant epithelial tumours arising from glandular structures, which are constituent parts of many organs of the body. The term adenocarcinoma is also applied to tumours showing a glandular growth pattern. These tumours may be sub-classified according to the substances that they produce, for example mucus secreting and serous adenocarcinomas, or to the microscopic arrangement of -2 their cells into patterns, for example papillary and follicular adenocarcinomas. These carcinomas may be solid or cystic (cystadenocarcinomas). Each organ may produce tumours showing a variety of histological types, for example the ovary may produce both mucinous and cystadenocarcinoma. 5 Adenomas in different organs behave differently. In general, the overall chance of carcinoma being present within an adenoma (i.e. a focus of cancer having developed within a benign lesion) is approximately 5%. However, this is related to size of an adenoma. For instance, in the large bowel (colon and rectum specifically) occurrence of a cancer within an adenoma is 10 rare in adenomas of less than 1 centimetre. Such a development is estimated at 40 to 50% in adenomas which are greater than 4 centimetres and show certain histopathological change such as villous change, or high grade dysplasia. Adenomas with higher degrees of dysplasia have a higher incidence of carcinoma. In any given colorectal adenoma, the predictors of the presence of cancer now or the future occurrence of cancer in the organ include size (especially 15 greater than 9mm) degree of change from tubular to villous morphology, presence of high grade dysplasia and the morphological change described as "serrated adenoma". In any given individual, the additional features of increasing age, familial occurrence of colorectal adenoma or cancer, male gender or multiplicity of adenomas, predict a future increased risk for cancer in the organ - so-called risk factors for cancer. Except for the presence of 20 adenomas and its size, none of these is objectively defined and all those other than number and size are subject to observer error and to confusion as to precise definition of the feature in question. Because such factors can be difficult to assess and define, their value as predictors of current or future risk for cancer is imprecise. 25 Once a sporadic adenoma has developed, the chance of a new adenoma occurring is approximately 30% within 26 months. Colorectal adenomas represent a class of adenomas which are exhibiting an increasing incidence, particularly in more affluent countries. The causes of adenoma, and of progression 30 to adenocarcinoma, are still the subject of intensive research. To date it has been speculated that in addition to genetic predisposition, environmental factors (such as diet) play a role in the development of this condition. Most studies indicate that the relevant environmental factors relate to high dietary fat, low fibre, low vegetable intake, smoking, obesity, physical inactivity and high refined carbohydrates. 35 Colonic adenomas are localised areas of dysplastic epithelium which initially involve just one or several crypts and may not protrude from the surface, but with increased growth in size, usually resulting from an imbalance in proliferation and/or apoptosis, they may protrude. Adenomas can be classified in several ways. One is by their gross appearance and the major -3 descriptors include degrees of protrusion: flat sessile (i.e. protruding but without a distinct stalk) or pedunculated (i.e. having a stalk). Other gross descriptors include actual size in the largest dimension and actual number in the colon/rectum. While small adenomas (less than say 5 or 10 millimetres) exhibit a smooth tan surface, pedunculated and especially larger 5 adenomas tend to have a cobblestone or lobulated red-brown surface. Larger sessile adenomas may exhibit a more delicate villous surface. Another set of descriptors include the histopathological classification; the prime descriptors of clinical value include degree of dysplasia (low or high), whether or not a focus of invasive cancer is present, degree of change from tubular gland formation to villous gland formation (hence classification is tubular, 10 villous or tubulovillous), presence of admixed hyperplastic change and of so-called "serrated" adenomas and its subgroups. Adenomas can be situated at any site in the colon and/or rectum although they tend to be more common in the rectum and distal colon. All of these descriptors, with the exception of number and size, are relatively subjective and subject to interobserver disagreement. 15 The various descriptive features of adenomas are of value not just to ascertain the neoplastic status of any given adenomas when detected, but also to predict a person's future risk of developing colorectal adenomas or cancer. Those features of an adenoma or number of adenomas in an individual that point to an increased future risk for cancer or recurrence of 20 new adenomas include: size of the largest adenoma (especially 10mm or larger), degree of villous change (especially at least 25% such change and particularly 100% such change), high grade dysplasia, number (3 or more of any size or histological status) or presence of serrated adenoma features. None except size or number is objective and all are relatively subjective and subject to interobserver disagreement. These predictors of risk for future neoplasia 25 (hence "risk") are vital in practice because they are used to determine the rate and need for and frequency of future colonoscopic surveillance. More accurate risk classification might thus reduce workload of colonoscopy, make it more cost-effective and reduce the risk of complications from unnecessary procedures. 30 Adenomas are generally asymptomatic, therefore rendering difficult their diagnosis and treatment at a stage prior to when they might develop invasive characteristics and so became cancer. It is technically impossible to predict the presence or absence of carcinoma based on the gross appearance of adenomas, although larger adenomas are more likely to show a region of malignant change than are smaller adenomas. Sessile adenomas exhibit a higher incidence 35 of malignancy than pedunculated adenomas of the same size. Some adenomas result in blood loss which might be observed or detectable in the stools; while sometimes visible by eye, it is often, when it occurs, microscopic or "occult". Larger adenomas tend to bleed more than smaller adenomas. However, since blood in the stool, whether overt or occult, can also be -4 indicative of non-adenomatous conditions, the accurate diagnosis of adenoma is rendered difficult without the application of highly invasive procedures such as colonoscopy combined with tissue acquisition by either removal (i.e. polypectomy) or biopsy and subsequent histopathological analysis. 5 Accordingly, there is an on-going need to elucidate the causes of adenoma and to develop more informative diagnostic protocols or aids to diagnosis that enable one to direct colonoscopy at people more likely to have adenomas. These adenomas may be high risk, advanced or neither of these. Furthermore, it can be difficult after colonoscopy to be certain 10 that all adenomas have been removed, especially in a person who has had multiple adenomas. An accurate screening test may minimise the need to undertake an early second colonoscopy to ensure that the colon has been cleared of neoplasms. Accordingly, the identification of molecular markers for adenomas would provide means for understanding the cause of adenomas and cancer, improving diagnosis of adenomas including development of useful 15 screening tests, elucidating the histological stage of an adenoma, characterising a patient's future risk for colorectal neoplasia on the basis of the molecular state of an adenoma and facilitating treatment of adenomas. To date, research has focused on the identification of gene mutations which lead to the 20 development of colorectal neoplasms. In work leading up to the present invention, however, it has been determined that changes in expression profiles of genes which are also expressed in healthy individuals are indicative of the development of neoplasms of the large intestine, such as adenomas and adenocarcinomas. It has been further determined that in relation to neoplasms of the large intestine, diagnosis can be made based on screening for one or more of 25 a panel of these differentially expressed genes. In a related aspect, it has still further been determined that to the extent that neoplastic tissue has been identified either by the method of the invention or by some other method, the present invention provides still further means of characterising that tissue as an adenoma or a cancer. In yet another aspect, it has been determined that a proportion of these genes are characterised by gene expression which occurs 30 in the context of a neoplastic state but not in the context of a non-neoplastic state, thereby facilitating the development of qualitative analyses which do not require a relative analysis to be performed against a non-neoplastic or normal control reference level. Accordingly, the inventors have identified a panel of genes which facilitate the diagnosis of adenocarcinoma and adenoma development and/or the monitoring of conditions characterised by the 35 development of these types of neoplasms. SUMMARY OF THE INVENTION Throughout this specification and the claims which follow, unless the context requires -5 otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps. 5 As used herein, the term "derived from" shall be taken to indicate that a particular integer or group of integers has originated from the species specified, but has not necessarily been obtained directly from the specified source. Further, as used herein the singular forms of "a", "and" and "the" include plural referents unless the context clearly dictates otherwise. 10 Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The subject specification contains amino acid and nucleotide sequence information prepared 15 using the programme Patentln Version 3.4, presented herein after the bibliography. Each amino acid and nucleotide sequence is identified in the sequence listing by the numeric indicator <210> followed by the sequence identifier (eg. <210>1, <210>2, etc). The length, type of sequence (amino acid, DNA, etc.) and source organism for each sequence is indicated by information provided in the numeric indicator fields <21 1>m <212> and <213>, 20 respectively. Amino acid and nucleotide sequences referred to in the specification are identified by the indicator SEQ ID NO: followed by the sequence identifier (eg. SEQ ID NO: 1, SEQ ID NO: 2, etc). The sequence identifier referred to in the specification correlates to the information provided in numeric indicator field <400> in the sequence listing, which is followed by the sequence identifier (eg. <400>1, <400>2, etc). That is SEQ ID NO: 1 as 25 detailed in the specification correlates to the sequence indicated as <400>1 in the sequence listing. One aspect of the present invention is directed to a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method 30 comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 221577_x_at 205828_at 201341_at 221922_at 205886_at 201416_at 60474_at 205890_s_at 201417_at 222696_at 205910_s_at 201468_s_at 223447_at 205941 s at 201506_at 223970_at 206224_at -6 201563_at 225541_at 206976_s at 201656_at 225835_at 207173_x_at 201925_s_at 226360_at 207457_s_at 201926_s_at 227174_at 208079_s_at 202286_s_at 227475_at 208712_at 202718_at 228303_at 209218_at 202831_at 228653_at 209309_at 202833_s_at 228754_at 209752_at 202935_s_at 228915_at 209773_s_at 202936_s_at 229215_at 209774_x_at 203124 s at 231832 at 209792 s at 203256_at 231941_s_at 209875_s_at 203313_s_at 232176_at 209955_s_at 203510_at 232252_at 210052_s_at 203860_at 232481_s_at 210511_s_at 203895 at 234331 s at 210559 s at 203896_s_at 235210_s_at 210766_s_at 203961_at 235976_at 211506_s_at 203962_s_at 236894_at 212281_s_at 204259_at 238017_at 212344_at 204351_at 238021_s_at 212353_at 204401_at 238984_at 212354_at 204404_at 241031_at 213905_x_at 204855_at 200660_at 214022_s_at 204885_s_at 200832_s_at 214974_x_at 205174_s at 200903_s at 215091_s at 205366_s_at 201014_s_at 217430_x_at 205470_s_at 201112_s_at 217996_at 205513_at 201195_s_at 218507_at 205765_at 201261_x_at 218963s at 205825_at .201292_at 218984_at 205927_s_at 201338_x_at 219787_s_at 205983_at 201479_at 219911_s_at 206239_s_at 201577_at 221729_at 206286_s_at 201601_x_at 221730_at 207158_at 201666_at 221731 x at 207850_at 202310_s_at 221923_s_at 209369_at 202311_s_at 37892_at -7 210445_at 202404_s at 222449_at 210519_s_at 202431_s_at 222450_at 211429_s_at 202504_at 222549_at 212063_at 202779_s_at 222608_s_at 212070_at 202859_x at 223062_s_at 212190_at 202954_at 224428_s at 212531_at 202998_s_at 224646_x_at 212942_s_at 203083_at 224915_x_at 213880_at 203213_at 225295_at 213975_ s_at 203878_s_at 225520_at 214235_at 204051_s_at 225664 at 214651_s_at 204127_at 225681_at 217523_at 204170_ s_at 225767_at 217867_x_at 204320 at 225799_at 218086_at 204470_at 225806_at 218211_s_at 204475_at 226227_x_at 218704_at 204580_at 226237_at 218796_at 204620_s_at 226311_at 218872_at 204702_s_at 226777_at 219630_at 205361_s_at 226835_s_at 219682_s at 205476 at 227140 at 219727_at 205479_s_at 229802_at 219955_at 205713_s_at 231766_s_at 219956_at 205815_at 232151_at; and/or 200665_s_at (ii) FOXQ1 RNF43 CDCA7 LOXL2 MMP1 CCL20 TDGF1 AZGP1 TCN1 CTSE MTHFD1L MYC MMP7 MSLN ANLN COMP WDR72 TIMPI H19 AURKA INHBA PCSK1 FAP PAICS COL1IA1 CST1 DACH1 PUS7 GDF15 BGN VCAN ZNRF3 CTHRC1 AXIN2 SQLE CCND1 COL1A1 MET REG3A CSE1L LGR5 SOX9 TESC PFDN4 DUSP27 TMEPAI UBE2C C20orf42 -8 SERPINB5 CDH11 TMEM97 SLC11A2 ASCL2 MMP11 TRIM29 NFE2L3 SULF1 CXCL1 KLKlI CEL SLC6A6 QPCT LY6G6D NLF1 TACSTD2 PDZKlIP1 SLC7A5 NPDC1 NEBL CD55 FABP6 ENCI PCCA ECT2 SLITRK6 SERPINE2 FLJ37644 COL12A1 MLPH UBE2S KRT23 LlTD1 HOXA9 TOP2A CXCL3 WDR51B TBX3 CDC2 MMP3 HOXB6 BACE2 RFC3 SFRP4 FAM84A GPX2 LILRB1 UBD COL5A2 TPX2 NPM1 SCD MMP12 KCNN4 RDHE2 DPEP1 SORD LOC541471 PLAU LCN2 PSATI CYP3A5 HSPH1 PLCB4 CXCL5 ANXA3 GTF3A SPINKI IGFBP2 CYP3A5P2 KLK1O DUOX2 TGIFl C14orf94 GPSM2 SOX4 CXCL2 RP5-875H10.1 NME1 THBS2 SLCO4A1 RPL22L1 MUC20 CLDN1 CKS2 APOBECI RPESP REGIB PHLDAl HIG2 RRM2 CDH3 SERPINAl COL1OA1 DKC1 SPARC COL1A2 AHCY CD44 IL8 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 5 In another aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 10 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 225681_at; 227140_at; and/or (ii) CTHRC1 -9 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 5 In yet another aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 10 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 227475_at 204475_at 202859_x_at 202404_s_at; and/or 15 (ii) FOXQl, MMPl, IL8, COLIA2 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a 20 neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. In still another aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 25 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 205513_at, 204259_at, 227174_at, 210511_sat, 37892_at; and/or (ii) TCN1, MMP7, WDR72, INHBA, COL 11A1 30 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 35 Preferably, said control level is a non-neoplastic level. In still yet another aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected 40 from: - 10 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 221577_x_at 212942_s_at 202310_s_at 213880_at 232252_at 204855_at 229215_at 212354_at 228754_at 202286_s_at 203962_s_at 203860_at 202935_s at 218963_s_at 207850_at 205828_at 204051_s_at 205890_s_at 200832_s_at 205983_at 212531_at; and/or (ii) LGR5 GDF15 COLIAl ASCL2 DUSP27 SERPINB5 TACSTD2 SULFI SLC6A6 FLJ37644 NEBL PCCA MMP3 KRT23 CXCL3 SCD SFRP4 UBD DPEP1 LCN2 5 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. In a further aspect the present invention is directed to a method of screening for the onset or 10 predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 205825_at 219956_at 201468_s_at 205927_s_at 221922_at 201656_at 206239_s_at 223447_at 201925_s_at 207158_at 223970_at 201926 s at 210445_at 225835_at 202718_at 210519_s_at 226360_at 202831_at 211429_s_at 228303_at 202833_s_at 212063_at 228653_at 203124_s_at 213975_s_at 228915_at 203313_s_at 214235_at 231832_at 203860_at 214651_s_at 231941_s_at -11 203895_at 217523_at 232176_at 203896_s_at 217867_x_at 232481_s_at 204401_at 218086_at 234331_s_at 204885_s_at 218211_s_at 235210_s_at 205174_s_at 218796_at 235976_at 205366_s_at 219630_at 236894_at 205470_s_at 219682_s_at 238017_at 205513_at 219727_at 238984_at 205765_at 219955_at; 241031_at; and/or (ii) APOBECI HOXB6 QPCT BACE2 IGFBP2 RDHE2 C20orf42 ITGA6 REG4 CD44 KCNN4 RETNLB CD55 KLK11 RP5-875H10.1 CTSE LlTD1 RPESP CYP3A5 LILRB1 SERPINAl CYP3A5P2 MLPH SLC11A2 DACHI MSLN SLC12A2 DUOX2 MUC20 SLITRK6 ETS2 NLF1 SPINKI FABP6 NPDC1 TBX3 FAM84A NQO1 TCN1 GALNT6 PCCA TGIF1 GPSM2 PCSK1 WDR51B GPX2 PDZK1IP1 ZNRF3 HOXA9 PLCB4 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of an 5 adenoma cell or a cell predisposed to the onset of an adenoma state. In another further aspect of the present invention there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts 10 selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: -12 200660_at 205476_at 217430_x_at 200665_s_at 205479_s_at 217996_at 200832_s_at 205713_s_at 218507_at 200903_s_at 205815_at 218963_s_at 201014_s_at 205828_at 218984_at 201112_s_at 205886_at 219787_s_at 201195_s_at 205890_s_at 219911_s_at 201261_x_at 205910_s_at 221729_at 201292_at 205941_s_at 221730_at 201338_x_at 206224_at 221731_ x_at 201479_at 206976_s_at 221923_s_at 201577_at 207173_x_at 37892_at 201601_x_at 207457_s_at 222449_at 201666_at 208079_s_at 222450_at 202310_s_at 208712_at 222549_at 202311_s_at 209218_at 222608_s_at 202403_s_at 209309_at 223062_s_at 202404_s_at 209752_at 224428_s_at 202431_s at 209773_s at 224646_x at 202504_at 209774_x_at 224915_x_at 202779_sat 209792_s_at 225295_at 202859_x_at 209875_s_at 225520_at 202954_at 209955_s_at 225664_at 202998_s_at 210052_s_at 225681_at 203083_at 210511_s_at 225767_at 203213_at 210559_s_at 225799_at 203878_s_at 210766s_at 225806_at 204051_s_at 211506s at 226227_x_at 204127_at 212281 s at 226237_at 204170_s_at 212344_at 226311_at 204320_at 212353_at 226777_at 204470_at 212354_at 226835_s_at 204475_at 213905_x_at 227140_at 204580_at 214022_s_at 229802_at 204620_s_at 214974_x_at 231766_s_at 204702_s_at 215091_s at; 232151 at; and/or 205361_s_at - 13 (ii) AHCY DKC1 PSATI ANLN ECT2 PUS7 AURKA FAP REGlA AZGP1 GTF3A REGIB BGN H19 REG3A C14orf94 HIG2 RFC3 C20orfl99 HSPH1 RRM2 CCL20 IFITMI S1O0Al1 CCND1 IL8 SCD CDC2 INHBA SFRP4 CDCA7 KLK1O SLC39A1O CDH1 1 KRT23 SLC7A5 CEL LOC541471 SLCO4A1 CKS2 LOXL2 SPARC CLDN1 LY6G6D SPP1 COL1OA1 MMP1 SQLE COLI1A1 MMP11 SULF1 COL12A1 MMP12 THBS2 COL1A1 MMP3 TIMPI COL1A2 MTHFD1L TMEM97 COL5A2 MYC TMEPAI COL8A1 NFE2L3 TOP2A COMP NME1 TPX2 CSE1L NPM1 TRIM29 CST1 PAICS UBD CTHRCl PFDN4 UBE2C CXCL1 PHLDA1 UBE2S CXCL2 PLAU VCAN CXCL5 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a cancer cell or a cell predisposed to the onset of a cancerous state. 5 In yet another further aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: - 14 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202286_s_at 235976_at 209309_at 204259_at 236894_at 211506_s_at 204885_s_at 214974_x_at 205174_s_at 238984_at 219787_s_at 205825_at 241031 at 37892_at 207850_at 202311_s_at 222608_s_at 213880_at 204320_at 223062_s_at 217523_at 204475_at 225806_at 227174_at 204702_s_at 226237_at 228915_at 205910_s at 227140_at 232252_at 206224_at 229802_at; and/or (ii) MMP1 PCSK1 ANLN MMP7 CST1 DACHI LGR5 QPCT COLIlA1 WDR72 ECT2 C14orf94 COL1IA1 SLITRK6 AZGP1 COL1Al L1TD1 REG4 DUSP27 KIAA 1199 NFE2L3 NLF1 PSATl CEL IL8 CXCL5 CD44 TACSTD2 CXCL3 COL8A1 MSLN 5 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of a neoplastic cell or a cell predisposed to the onset of a neoplastic state. Yet another aspect of the present invention provides a method of screening for the onset or 10 predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 204885_s_at 217523_at 236894_at 205174_s_at .228915_at 238984_at 205825_at 235976_at 241031_at; and/or -15 (ii) CD44 MSLN QPCT DACH1 NLF1 REG4 L1TD1 PCSK1 SLITRK6 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of an 5 adenoma cell or a cell predisposed to the onset of an adenoma state. In yet still another aspect there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected 10 from: (i) the gene or genes detected by Affymetrix probeset IDs: 202311_s_at 209309_at 223062_s_at 204320 at 211506 s at 225806 at 204475_at 214974_x_at 226237_at 204702_s_at 219787_s_at 227140_at 205910_s_at 37892_at 229802_at; and/or 206224_at 222608_s_at (ii) ANLN COLIA1 IL8 AZGP1 COL8A1 MMP1 C14orf94 CSTl NFE2L3 CEL CXCL5 PSATI COLllAl ECT2 15 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of a cancer cell or a cell predisposed to the onset of a cancerous state. In still yet another aspect of the present invention, there is provided a method of 20 characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200884_at 214234_s_at 226248_s_at -16 203240_at 214235_at 226302_at 203963_at 214433_s_at 227676_at 204508_s_at 215125_s_at 227719_at 204607_at 215867_x_at 227725_at 204811_s_at 217109_at 228232_s_at 204895_x_at 217110_s_at 229070_at 204897_at 218211_s_at 231832_at 205259_at 219543_at 232176_at 205765_at 219955_at 23248 1_s_at 205927_s_at 221841_s_at 235976_at 208063_s at 221874_at 236894_at 208937_s_at 223969_s_at 237521_x_at 210107_at 223970_at 242601_at 213106_at; and/or (ii) CLCA1 CTSE ATP8B1 FCGBP C6orfl05 CACNA2D2 HMGCS2 CKB KLF4 RETNLB ATP8A1 CYP3A5P2 L1TD1 MUC4 CAPN9 SLITRK6 UGT1A1 NR3C2 VSIG2 SELENBP1 PBLD LOC253012 PTGER4 CA12 ST6GALNAC1 MLPH WDR51B IDI KIAA1324 FAM3D CYP3A5 in said cell or cellular population wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer cell level is 5 indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. In another aspect there is provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or 10 transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200600_at 204006_s at 213428_s at - 17 200665_s at 204051 s at 213524_s at 200832_s_at 204122_at 213869_x_at 200974_at 204320_at 213905_x_at 200986_at 204475_at 214247_s_at 201058_s at 204620_s_at 215049_x at 201069 at 205479_s at 215076_s at 201105_at 205547_s_at 215646_s_at 201141_at 205828_at 216442_x_at 201147_s_at 207173_x_at 217430_x_at 201150_s_at 207191_s_at 217762_s_at 201162_at 208747_s at 217763_s at 201163_s_at 208782_at 217764_s_at 201185_at 208788_at 218468_s_at 201261_x_at 208850_s_at 218469_at 201289_at 208851_s_at 218559_s_at 201426_s at 209101_at 218638_s_at 201438_at 209156_s_at 219087_at 201616_s_at 209218_at 221011_s_at 201645_at 209395_at 221729_at 201667_at 209396_s_at 221730_at 201744_s at 209596_at 221731 x at 201792_at 209875_s_at 37892_at 201842_s_at 209955_s_at 223122_s_at 201852_x_at 210095_s-at 223235_s_at 201859_at 210495_x_at 224560_at 201893_x at 210511 s at 224694_at 202237_at 210764_s_at 224724_at 202238_s_at 210809_s_at 225664_at 202283_at 211161_s_at 225681_at 202291_s_at 211571_s_at 225710_at 202310 s at 211719_x at 225799_at 202311_s_at 211813_x_at 226237_at 202403_s_at 211896_s at 226311_at 202404_s_at 211959_at 226694_at 202450_s_at 211964_at 226777_at 202620_s at 211966_at 226930_at 202766_s_at 211980_at 227099_s_at 202859_x_at 211981_at 227140_at - 18 202878_s_at 212077_at 227566_at 202917_s_at 212344_at 229218_at 202998_s_at 212353_at 229802_at 203083_at 212354_at 231579_s_at 203325_s_at 212464_s_at 231766_s_at 203382_s_at 212488_at 231879_at 203477_at 212489_at 232458_at 203570_at 212667_at 233555_s_at 203645_s_at 213125_at 234994_at 203878_sat; and/or (ii) COL1A2 LGALS1 SRGN CTHRC1 ELOVL5 LBH FN1 MGP CTGF POSTN MMP2 TNC SPP1 LOXL2 GOS2 MMP1 MYL9 SQLE SPARC DCN EFEMPI LUM CALD1 APOE GREM1 FBN1 MSN IL8 MMP3 IGFBP3 IGFBP5 IGFBP7 SERPINFI SFRP2 FSTL1 ISLR SULFI COL4A2 HINT ASPN VCAN COL5Al COL6A3 SMOC2 OLFML2B COL8A1 HTRA1 KIAA1913 COL12A1 CYR61 PALM2-AKAP2 COL5A2 FAP SERPING1 CDHll VIM TYROBP THBS2 TIMP2 ACTA2 COL15A1 SCD COL3A1 COLllAl TIMP3 PLOD2 S100A8 AEBP1 MMP11 FNDC1 GJA1 CD163 SFRP4 NNMT FCGR3B INHBA COLlA1 PLAU COL6A2 SULF2 MAFB - 19 ANTXR1 COL6A1 LOC541471 GPNMB SPON2 LOC387763 BGN CTSK CHI3L1 TAGLN MXRA5 THYl COL4A1 ciS LOXL1 RAB31 DKK3 CD93 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal adenoma cell level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 5 In a further aspect there is provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 10 (I) the gene or genes detected by Affymetrix probeset IDs: 210107_at; and/or (ii) CLCA1 in a biological sample from said individual wherein a higher level of expression of the genes 15 or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer control level is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. In another aspect there is provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an 20 individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 203240_at 219955_at 242601_at 204607_at 232481_s_at 227725_at 223969_sat 228232_s_at; and/or (ii) FCGBP L1TD1 LOC253012 HMGCS2 SLITRK6 ST6GALNAC1 RETNLB VSIG2 25 - 20 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer control level is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. 5 A further aspect of the present invention is directed to a method of characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 10 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202404_s_at, 210809_sat, 227140_at, 225681_at, 209875_s_at, 204475_at, 212464_s_at; and/or, 15 (ii) COL1A2, FN1, SPP1, CTHRC1, POSTN, MMP1 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 20 In yet another further aspect the present invention is directed to a method of characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 25 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200665_s_at, 218468_sat, 211959_at, 201744_s_at, 202859_xat; and/or, (ii) SPARC, GREM1, IGFBP5, LUM, IL8, 30 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 35 In still yet another further aspect the present invention is directed to a method of characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: - 21 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 223122_s_at 207173_x_at 210511_s_at 212353_at 203083_at 209156_s_at 219087_at 203477_at 224694_at 201438_at 37892_at 201141_at 226237_at 202917_s_at 213905_x_at 225664_at 226930_at 205547_s_at 221730_at 204051_s_at; and/or (ii) SFRP2 CDH1 1 INHBA SULF1 THBS2 COL6A2 ASPN COL15A1 ANTXR1 COL6A3 COL1 lA1 GPNMB COL8A1 S100A8 BGN COL12AI FNDC1 TAGLN COL5A2 SFRP4 in a biological sample from said individual wherein a higher level of expression of the genes 5 or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. Yet another aspect of the present invention provides a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine 10 of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 235976_at 236894_at; and/or (ii) SLITRK6 L1TD1 15 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to neoplastic tissue background levels is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. 20 In still another aspect the present invention provides a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: - 22 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202311_s_at, 209396_s _at, 226237_at, 204320_at, 215646_s_at, 227140_at, 204475_at, 37892_at, 229802_at, 209395_at; and/or 5 (ii) COL1A1, VCAN, CHI3L1, MMP1, COL8A1, COL1 1A1 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to neoplastic tissue background neoplastic cell 10 levels is indicative of a cancer or a cell predisposed to the onset of a cancerous state. A related aspect of the present invention provides a molecular array, which array comprises a plurality of: 15 (i) nucleic acid molecules comprising a nucleotide sequence corresponding to any one or more of the neoplastic marker genes hereinbefore described or a sequence exhibiting at least 80% identity thereto or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 20 (ii) nucleic acid molecules comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 25 (iii) nucleic acid probes or oligonucleotides comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 30 (iv) probes capable of binding to any one or more of the proteins encoded by the nucleic acid molecules of (i) or a derivative, fragment or, homologue thereof wherein the level of expression of said marker genes of (i) or proteins of (iv) is indicative of the neoplastic state of a cell or cellular subpopulation derived from the large intestine. 35 BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 is a graphical representation of nuclear factor (erythroid-derived-2)-like 2. Blk=normal, gm=inflamed, red=adenoma,blue=cancer 40 - 23 Figure 2 is a graphical representation of SLIT and NTRK-like family, member 6 (left) and LINE 1 type transposase domain containing 1 (right). Figure 3 is a graphical representation of collagen type XI, alpha 1. 5 Figure 4 is an image of the immunohistochemical staining for MSLN in a normal tissue shows mild staining in the cytoplasm of the colonocytes. Figure 5 is an image of the immunohistochemical staining for MSLN in a representative 10 cancer tissue shows moderate staining in the cytoplasm of the colonic epithelia. Figure 6 is an image of the immunohistochemical staining of MSLN in a moderately differentiated adenoma shows strong staining in the multilayered colonic epithelium. 15 Figure 7 is a graphical representation depicting Probeset 205828 at Expression profile across 68 clinical specimens comprising 30 non-disease controls (black), 19 adenomas (red) and 19 adenocarcinoma (green). Figure 8 is a graphical representation of the measurement of MMP3 expression in protein 20 extract from 27 clinical stool specimens. Protein extraction was performed on 27 clinical stool specimens comprising 6 non-disease controls (circle), 10 adenoma (triangle) and 11 adenocarcinoma (cross) emulsified using a PBS pH 7.4 solution containing 0.05% Tween-20 and lx Protease Inhibitor Cocktail (Roche). Proteins were further solubilised by 30 minutes incubation in an ultrasonic water bath. Solubilised proteins were isolated by centrifugation 25 and endogenous levels of MMP3 in the resulting protein extracts were using measured using a commercially available Luminex bead-based suspension immunoassay as recommended by manufacturer (R&D Systems). Figure 9 is a graphical representation of Affymetrix probeset ID 205828 at which is 30 known to hybrising to transcripts of the MMP3 gene depicting a gene expression profile in 68 clinical specimens comprising 30 non-disease controls (black), 19 adenomas (red) and 19 cancers (green). Figure 10 is a schematic representation of predicted RNA variants derived from 35 hCG_1815491. cDNA clones derived from map region 8579310 to 8562303 on human chromosome 16 were used to locate exon sequences. Arrows: Oligo nucleotide primer sets were designed to allow measurement of individual RNA variants by PCR. Primers covering splice junctions are shown as spanning intron sequences which is not included in - 24 the actual oligonucleotide primer sequence. 5 DETAILED DESCRIPTION OF THE INVENTION The present invention is predicated, in part, on the elucidation of gene expression profiles which characterise large intestine cellular populations in terms of their neoplastic state and, 10 more particularly, whether they are malignant or pre-malignant. This finding has now facilitated the development of routine means of screening for the onset or predisposition to the onset of a large intestine neoplasm or characterising cellular populations derived from the large intestine based on screening for upregulation of the expression of these molecules, relative to control expression patterns and levels. To this end, in addition to assessing 15 expression levels of the subject genes relative to normal or non-neoplastic levels, it has been determined that a proportion of these genes are expressed only in the diseased state, thereby facilitating the development of a simple qualitative test based on requiring assessment only relative to test background levels. 20 In accordance with the present invention, it has been determined that the genes detailed above are modulated, in terms of differential changes to their levels of expression, depending on whether the cell expressing that gene is neoplastic or not. It should be understood that reference to a gene "expression product" or "expression of a gene" is a reference to either a transcription product (such as primary RNA or mRNA) or a translation product such as 25 protein. These genes and their expression products, whether they be RNA transcripts or encoded proteins, are collectively referred to as "neoplastic markers". Accordingly, one aspect of the present invention is directed to a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method 30 comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 221577_x_at 205828_at 201341_at 221922_at 205886 at 201416_at 60474_at 205890_s_at 201417_at 222696_at 205910_s_at 201468_s_at 223447_at 205941_s_at - 25 201506_at 223970_at 206224_at 201563_at 225541_at 206976_s_at 201656_at 225835_at 207173_x_at 201925_s_at 226360_at 207457_s_at 201926_s_at 227174_at 208079_s_at 202286_s_at 227475_at 208712_at 202718_at 228303_at 209218_at 202831_at 228653_at 209309_at 202833_s_at 228754_at 209752_at 202935_s_at 228915_at 209773_s_at 202936_s_at 229215_at 209774_x at 203124_s_at 231832_at 209792_s_at 203256_at 231941_s_at 209875_s_at 203313_s_at 232176_at 209955_s_at 203510_at 232252_at 210052_s_at 203860_at 232481_s at 210511_s at 203895_at 234331_s_at 210559_s_at 203896_s_at 235210_s_at 210766_s_at 203961_at 235976_at 211506_s_at 203962_s_at 236894_at 212281_s_at 204259_at 238017_at 212344_at 204351_at 238021_s_at 212353_at 204401_at 238984_at 212354_at 204404 at 241031_at 213905_x_at 204855_at 200660_at 214022_s_at 204885 s at 200832 s at 214974 x at 205174_s_at 200903_s_at 215091_s_at 205366_s_at 201014_s_at 217430_x_at 205470_s_at 201112_s_at 217996_at 205513_at 201195_s_at 218507_at 205765_at 201261 x at 218963 s at 205825_at 201292_at 218984_at 205927_s_at 201338_x_at 219787_s_at 205983_at 201479_at 219911 _s_at 206239_s_at 201577_at 221729_at 206286_s_at 201601_x_at 221730 at 207158_at 201666_at 221731_x_at 207850_at 202310_s_at 221923_s_at -26 209369_at 202311_s_at 37892_at 210445_at 202404_s_at 222449_at 210519_s_at 202431_s_at 222450_at 211429_s_at 202504_at 222549_at 212063_at 202779_s_at 222608_s_at 212070_at 202859_x at 223062 s at 212190_at 202954_at 224428_s_at 212531_at 202998_s_at 224646_x_at 212942_s_at 203083_at 224915_x_at 213880_at 203213_at 225295_at 213975_s_at 203878_s_at 225520_at 214235_at 204051_s_at 225664_at 214651_s_at 204127_at 225681_at 217523_at 204170_s_at 225767_at 217867_x_at 204320_at 225799_at 218086_at 204470 at 225806 at 218211_s_at 204475_at 226227_x_at 218704_at 204580_at 226237_at 218796_at 204620_s_at 226311_at 218872_at 204702_s_at 226777_at 219630_at 205361_s_at 226835 s at 219682_s_at 205476_at 227140_at 219727_at 205479_s_at 229802_at 219955_at 205713_s_at 231766_s_at 219956_at 205815_at 232151_at; and/or 200665_s at (ii) CTHRCl CDCA7 LOXL2 FOXQ1 RNF43 TDGF1 AZGP1 MMP1 CCL20 MTHFD1L MYC TCN1 CTSE ANLN COMP MMP7 MSLN H19 AURKA WDR72 TIMPI FAP PAICS INHBA PCSK1 DACH1 PUS7 COL1IA1 CSTl VCAN ZNRF3 GDF15 BGN SQLE CCND1 COL1A1 AXIN2 REG3A CSE1L LGR5 MET TESC PFDN4 -27 DUSP27 SOX9 UBE2C C20orf42 SERPINB5 TMEPAI TMEM97 SLCll A2 ASCL2 CDH1 1 TRIM29 NFE2L3 SULFI MMP11 KLK11 CEL SLC6A6 CXCL1 LY6G6D NLF1 TACSTD2 QPCT SLC7A5 NPDCl NEBL PDZK1IP1 FABP6 ENCI PCCA CD55 SLITRK6 SERPINE2 FLJ37644 ECT2 MLPH UBE2S KRT23 COL12AI HOXA9 TOP2A CXCL3 L1TD1 TBX3 CDC2 MMP3 WDR51B BACE2 RFC3 SFRP4 HOXB6 GPX2 LILRB1 UBD FAM84A TPX2 NPMl SCD COL5A2 KCNN4 RDHE2 DPEP1 MMP12 LOC541471 PLAU LCN2 SORD CYP3A5 HSPHI PLCB4 PSATI ANXA3 GTF3A SPINKI CXCL5 CYP3A5P2 KLK1O DUOX2 IGFBP2 C14orf94 GPSM2 SOX4 TGIFI RP5-875H10.1 NMEI THBS2 CXCL2 RPL22L1 MUC20 CLDN1 SLCO4A1 APOBECI RPESP REGlB CKS2 HIG2 RRM2 CDH3 PHLDA1 COL1OA1 DKC1 SPARC SERPINAl AHCY CD44 COLlA2 IL8 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 5 Reference to "large intestine" should be understood as a reference to a cell derived from one of the six anatomical regions of the large intestine, which regions commence after the terminal region of the ileum, these being: 10 (i) the cecum; (ii) the ascending colon; (iii) the transverse colon; -28 (iv) the descending colon; (v) the sigmoid colon; and (vi) the rectum. 5 Reference to "neoplasm" should be understood as a reference to a lesion, tumour or other encapsulated or unencapsulated mass or other form of growth which comprises neoplastic cells. A "neoplastic cell" should be understood as a reference to a cell exhibiting abnormal growth. The term "growth" should be understood in its broadest sense and includes reference to proliferation. In this regard, an example of abnormal cell growth is the uncontrolled 10 proliferation of a cell. Another example is failed apoptosis in a cell, thus prolonging its usual life span. The neoplastic cell may be a benign cell or a malignant cell. In a preferred embodiment, the subject neoplasm is an adenoma or an adenocarcinoma. Without limiting the present invention to any one theory or mode of action, an adenoma is generally a benign tumour of epithelial origin which is either derived from epithelial tissue or exhibits clearly 15 defined epithelial structures. These structures may take on a glandular appearance. It can comprise a malignant cell population within the adenoma, such as occurs with the progression of a benign adenoma to a malignant adenocarcinoma. Preferably, said neoplastic cell is an adenoma or adenocarcinoma and even more preferably a 20 colorectal adenoma or adenocarcinoma. Each of the genes and transcripts detailed in sub-paragraphs (i) and (ii), above, would be well known to the person of skill in the art, as would their encoded proteins. The identification of the expression products of these genes and transcripts as markers of neoplasia occurred by 25 virtue of differential expression analysis using Affymetrix HGU133Aor HGUl33B gene chips. To this end, each gene chip is characterised by approximately 45,000 probe sets which detect the RNA transcribed from the genome. On average, approximately 11 probe pairs detect overlapping or consecutive regions of the RNA transcript. In general, the genes from which the RNA transcripts described herein are identified by the Affymetrix probesets are 30 well known and characterised genes. However, to the extent that some of the probesets detect RNA transcripts which are not yet defined, these transcripts are indicated as "the gene, genes or transcripts detected by Affymetrix probe x". In some cases a number of genes and/or transcripts may be detectable by a single probeset. It should be understood, however, that this is not intended as a limitation as to how the expression level of the subject gene or transcript 35 can be detected. In the first instance, it would be understood that the subject gene transcript is also detectable by other probesets which would be present on the Affymetrix gene chip. The reference to a single probesets is merely included as an identifier of the gene transcript of interest. In terms of actually screening for the transcript, however, one may utilise a probe or probeset directed to any region of the transcript and not just to the 3' terminal 600bp - 29 transcript region to which the Affymetrix probesets are often directed. Reference to each of the genes and transcripts detailed above and their transcribed and translated expression products should therefore be understood as a reference to all forms of 5 these molecules and to fragments or variants thereof. As would be appreciated by the person of skill in the art, some genes are known to exhibit allelic variation between individuals. Accordingly, the present invention should be understood to extend to such variants which, in terms of the present diagnostic applications, achieve the same outcome despite the fact that minor genetic variants between the actual nucleic acid sequences may exist between 10 individuals or that within one individual there may exist two or more splice variants of one subject gene. The present invention should therefore be understood to extend to all forms of RNA (eg mRNA, primary RNA transcript, miRNA, etc), cDNA and peptide isoforms which arise from alternative splicing or any other mutation, polymorphic or allelic variation. It should also be understood to include reference to any subunit polypeptides such as precursor 15 forms which may be generated, whether existing as a monomer, multimer, fusion protein or other complex. For example, in one embodiment of the invention, the subject gene is CDH3. Analysis of the AceView Database reveals that there exist 12 CDH3 alternative mRNA transcripts. Nine are 20 generated by alternative splicing while three are unspliced forms. In terms of the genes encompassed by the present invention, means for determining the existence of such variants and characterising same, are described in Example 6. To the extent that the genes of the present invention are described by reference to an Affymetrix probeset, Table 9 provides details of the nucleic acid sequence to which each probeset is directed. Based on this 25 information, the skilled person could, as a matter of routine procedure, identify the gene in respect of which that sequence forms part. A typical protocol for doing this is also outlined in Example 6. It should be understood that the "individual" who is the subject of testing may be any human 30 or non-human mammal. Examples of non-human mammals includes primates, livestock animals (e.g. horses, cattle, sheep, pigs, donkeys), laboratory test animals (e.g. mice, rats, rabbits, guinea pigs), companion animals (e.g. dogs, cats) and captive wild animals (e.g. deer, foxes). Preferably the mammal is a human. 35 The method of the present invention is predicated on the comparison of the level of the neoplastic markers of a biological sample with the control levels of these markers. The "control level" may be either a "normal level", which is the level of marker expressed by a corresponding large intestine cell or cellular population which is not neoplastic, or the background level which is detectable in a negative control sample.
-30 The normal (or "non-neoplastic") level may be determined using tissues derived from the same individual who is the subject of testing. However, it would be appreciated.that this may be quite invasive for the individual concerned and it is therefore likely to be more convenient 5 to analyse the test results relative to a standard result which reflects individual or collective results obtained from individuals other than the patient in issue. This latter form of analysis is in fact the preferred method of analysis since it enables the design of kits which require the collection and analysis of a single biological sample, being a test sample of interest. The standard results which provide the normal level may be calculated by any suitable means 10 which would be well known to the person of skill in the art. For example, a population of normal tissues can be assessed in terms of the level of the neoplastic markers of the present invention, thereby providing a standard value or range of values against which all future test samples are analysed. It should also be understood that the normal level may be determined from the subjects of a specific cohort and for use with respect to test samples derived from 15 that cohort. Accordingly, there may be determined a number of standard values or ranges which correspond to cohorts which differ in respect of characteristics such as age, gender, ethnicity or health status. Said "normal level" may be a discrete level or a range of levels. An increase in the expression level of the subject genes relative to normal levels is indicative of the tissue being neoplastic. 20 Without limiting the present invention to any one theory or mode of action, although each of the genes hereinbefore described is differentially expressed, either singly or in combination, as between neoplastic versus non-neoplastic cells of the large intestine, and is therefore diagnostic of the existence of a large intestine neoplasm, the expression of some of these 25 genes was found to exhibit particularly significant levels of sensitivity, specificity and positive and negative predictive value. Accordingly, in a preferred embodiment one would screen for and assess the expression level of one or more of these genes. To this end, and without limiting the present invention to any one theory or mode of action, the following markers were determined to be expressed in neoplastic tissue at a level of 3, 4, 5 or 7 fold 30 greater than non-neoplastic tissue when assessed by virtue of the method exemplified herein. Gene, genes or transcripts detected Fold Increase Gene by Affymetrix Probe No: 225681_at - CTHRCl 7 227140_at 227475_at FOXQl 5 204475_at MMP1 202859_x at IL8 -31 Fold Increase Gene, genes or transcripts detected Gene by Affymetrix Probe No: 202404_s at COLlA2 205513_at TCN1 4 204259_at MMP7 227174_at WDR72 210511_s_at INHBA 37892_at COLI1A1 221577_x_at GDF15 3 202310_s_at COLlA1 213880_at LGR5 232252_at DUSP27 204855_at SERPINB5 229215_at ASCL2 212354_at SULFI 228754_at SLC6A6 202286_s_at TACSTD2 203962_s_at NEBL 203860_at PCCA 202935_s_at FLJ37644 218963_s at KRT23 207850_at CXCL3 205828_at MMP3 204051_s_at SFRP4 205890_s_at UBD 200832_s_at SCD 205983_at DPEP1 212531 at LCN2 There is therefore more particularly provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected 5 from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 225681_at; 227140_at; and/or 10 (ii) CTHRC1 - 32 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. _ 5 Preferably, said control level is a non-neoplastic level. In another embodiment, there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected 10 from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 227475_at, 204475_at, 202859_x_at, 202404_s_at; and/or 15 (ii) FOXQ1, MMP1, IL8, COL1A2 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 20 Preferably, said control level is a non-neoplastic level. In yet another embodiment there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method 25 comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 205513_at, 204259_at, 227174_at, 210511_s_at, 37892_at; and/or 30 (ii) TCN1, MMP7, WDR72, INHBA, COLl 1A1 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a 35 neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. Preferably, said control level is a non-neoplastic level. In still yet another preferred embodiment, there is provided a method of screening for the 40 onset or predisposition to the onset of a large intestine neoplasm in an individual, said method - 33 comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 221577_x_at 232252_at 202310 s_at 213880_at 212354_at 204855_at 229215_at 203962_s_at 228754_at 202286_s_at 218963_s_at 203860_at 202935_s_at 204051_s_at 207850_at 205828_at 205983_at 205890_s_at 200832_s at 212531 _at; and/or 5 (ii) LGR5 GDF15 COL1A1 ASCL2 DUSP27 SERPINB5 TACSTD2 SULF SLC6A6 FLJ37644 NEBL PCCA MMP3 KRT23 CXCL3 SCD SFRP4 UBD DPEP1 LCN2 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state. 10 Preferably, said control level is a non-neoplastic level. According to these aspects of the present invention, said large intestine tissue is preferably colorectal tissue. 15 The detection method of the present invention can be performed on any suitable biological sample. To this end, reference to a "biological sample" should be understood as a reference to any sample of biological material derived from an animal such as, but not limited to, cellular material, biofluids (eg. blood), faeces, tissue biopsy specimens, surgical specimens or 20 fluid which has been introduced into the body of an animal and subsequently removed (such as, for example, the solution retrieved from an enema wash). The biological sample which is tested according to the method of the present invention may be tested directly or may require some form of treatment prior to testing. For example, a biopsy or surgical sample may require homogenisation prior to testing or it may require sectioning for in situ testing of the 25 qualitative expression levels of individual genes. Alternatively, a cell sample may require - 34 permeabilisation prior to testing. Further, to the extent that the biological sample is not in liquid form, (if such form is required for testing) it may require the addition of a reagent, such as a buffer, to mobilise the sample. 5 To the extent that the neoplastic marker gene expression product is present in a biological sample, the biological sample may be directly tested or else all or some of the nucleic acid or protein material present in the biological sample may be isolated prior to testing. In yet another example, the sample may be partially purified or otherwise enriched prior to analysis. For example, to the extent that a biological sample comprises a very diverse cell population, it 10 may be desirable to enrich for a sub-population of particular interest. It is within the scope of the present invention for the target cell population or molecules derived therefrom to be treated prior to testing, for example, inactivation of live virus or being run on a gel. It should also be understood that the biological sample may be freshly harvested or it may have been stored (for example by freezing) prior to testing or otherwise treated prior to testing (such as 15 by undergoing culturing). The choice of what type of sample is most suitable for testing in accordance with the method disclosed herein will be dependent on the nature of the situation. Preferably, said sample is a faecal (stool) sample, enema wash, surgical resection, tissue biopsy or blood sample. 20 In a related aspect, it has been determined that certain of the markers hereinbefore defined are more indicative of adenoma development versus cancer development or vice versa. This is an extremely valuable finding since it enables one to more specifically characterise the likely nature of a neoplasm which is detected by virtue of the method of the present invention. 25 Accordingly, in a related aspect the present invention is directed to a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 30 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 205825_at 219956_at 201468_s_at 205927_s_at 221922_at 201656_at 206239_s_at 223447_at 201925_s_at 207158_at 223970_at 201926_s_at 210445 _at 225835_at 202718_at 210519 _s_at 226360_at 202831_at 211429_s_at 228303_at 202833_s_at 212063_at 228653_at -35 203124_s_at 213975_s_at 228915_at 203313_s_at 214235_at 231832_at 203860 _at 214651_s_at 231941_s_at 203895_at 217523_at 232176_at 203896_s_at 217867_x_at 232481_s_at 204401_at 218086_at 234331_s_at 204885_s_at 218211_s_at 235210_s_at 205174_s_at 218796_at 235976_at 205366_s_at 219630_at 236894_at 205470_s_at 219682_s_at 238017_at 205513 at 219727_at 238984_at 205765_at 219955_at; 241031_at; and/or (ii) APOBECI HOXB6 QPCT BACE2 IGFBP2 RDHE2 C20orf42 ITGA6 REG4 CD44 KCNN4 RETNLB CD55 KLKll RP5-875H10.1 CTSE L1TD1 RPESP CYP3A5 LILRB1 SERPINAl CYP3A5P2 MLPH SLC11A2 DACHI MSLN SLC12A2 DUOX2 MUC20 SLITRK6 ETS2 NLFI SPINKI FABP6 NPDCl TBX3 FAM84A NQO1 TCN1 GALNT6 PCCA TGIF1 GPSM2 PCSK1 WDR51B GPX2 PDZK1IP1 ZNRF3 HOXA9 PLCB4 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of an 5 adenoma cell or a cell predisposed to the onset of an adenoma state. In another preferred embodiment of this aspect of the present invention there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more 10 genes or transcripts selected from: -36 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200660_at 205476_at 217430_x_at 200665_s_at 205479_s_at 217996_at 200832_s_at 205713_s_at 218507_at 200903_s_at 205815_at 218963_s_at 201014_s_at 205828_at 218984_at 201112_s_at 205886_at 219787_s_at 201195_s_at 205890_s_at 219911_s_at 201261_x_at 205910_s_at 221729_at 201292_at 205941_s_at 221730_at 201338_x_at 206224_at 221731_x_at 201479_at 206976_s_at 221923_s_at 201577_at 207173_x_at 37892_at 201601_x_at 207457_s_at 222449_at 201666_at 208079_s_at 222450_at 202310 s_at 208712_at 222549_at 202311 _s_at 209218_at 222608_s_at 202403_s_at 209309_at 223062_s_at 202404_s_at 209752_at 224428_s_at 202431_s_at 209773_s_at 224646_x_at 202504_at 209774_x_at 224915_x_at 202779_s_at 209792_s_at 225295_at 202859_x_at 209875_s_at 225520_at 202954_at 209955_s_at 225664_at 202998_s_at 210052_s_at 225681_at 203083_at 210511 s_at 225767_at 203213_at 210559_s_at 225799_at 203878_s_at 210766_s_at 225806_at 204051_s_at 211506_s_at 226227_x_at 204127_at 212281_s_at 226237_at 204170_s at 212344_at 226311_at 204320_at 212353_at 226777_at 204470_at 212354_at 226835_s at 204475_at 213905_x_at 227140_at 204580_at 214022_s_at 229802_at 204620_s_at 214974_x_at 231766_s_at 204702_s_at 215091_s_at; 232151_at; and/or -37 205361_s_at (ii) AHCY DKC1 PSATI ANLN ECT2 PUS7 AURKA FAP REGlA AZGP1 GTF3A REGIB BGN H19 REG3A C14orf94 HIG2 RFC3 C20orfl99 HSPH1 RRM2 CCL20 IFITMI S10OAll CCND1 IL8 SCD CDC2 INHBA SFRP4 CDCA7 KLK1O SLC39A10 CDH11 KRT23 SLC7A5 CEL LOC541471 SLCO4A1 CKS2 LOXL2 SPARC CLDN1 LY6G6D SPP1 COL1OA1 MMP1 SQLE COL1IA1 MMP11 SULF1 COL12A1 MMP12 THBS2 COL1A1 MMP3 TIMPI COL1A2 MTHFD1L TMEM97 COL5A2 MYC TMEPAI COL8A1 NFE2L3 TOP2A COMP NME1 TPX2 CSE1L NPM1 TRIM29 CSTl PAICS UBD CTHRC1 PFDN4 UBE2C CXCL1 PHLDA1 UBE2S CXCL2 PLAU VCAN CXCL5 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a cancer 5 cell or a cell predisposed to the onset of a cancerous state. According to these aspects, said control levels are preferably non-neoplastic levels and said large intestine tissue is colorectal tissue. Even more preferably, said biological sample is a stool sample or blood sample.
- 38 In a related aspect, it has been determined that a subpopulation of the markers of the present invention are not only expressed at levels higher than normal levels, their expression pattern is uniquely characterised by the fact that expression levels above that of background control 5 levels are not detectable in non-neoplastic tissue. This determination has therefore enabled the development of qualitative screening systems which are simply designed to detect marker expression relative to a control background level. In accordance with this aspect of the present invention, said "control level" is therefore the "background level". Preferably, said background level is of the chosen testing methodology. 10 According to this aspect, there is therefore provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 15 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202286 s at 235976_at 209309_at 204259_at 236894_at 211506_s_at 204885_s_at 214974_x_at 205174_s_at 238984_at 219787_s_at 205825_at 241031_at 37892_at 207850_at 202311_s at 222608_s_at 213880_at 204320_at 223062_s_at 217523_at 204475_at 225806_at 227174_at 204702_s_at 226237_at 228915_at 205910_s_at 227140_at -232252_at 206224_at 229802_at; and/or (ii) MMP1 PCSK1 ANLN MMP7 CST1 DACHI LGR5 QPCT COL1 1A1 WDR72 ECT2 C14orf94 COLIlA1 SLITRK6 AZGP1 COL1A1 L1TDl REG4 DUSP27 KIAA 1199 NFE2L3 NLF1 PSATI CEL IL8 CXCL5 CD44 TACSTD2 CXCL3 COL8A1 - 39 MSLN in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of a neoplastic cell or a cell predisposed to the onset of a neoplastic state. 5 In a most preferred embodiment, said genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 227140_at 227174_at 241031_at 204475_at 37892_at 211506_s_at 204259_at 202311_s_at 202286_s_at 213880_at 232252_at 226237_at; and/or (ii) MMP1 COLI1A1 IL8 MMP7 COL1A1 TACSTD2 LGR5 DUSP27 COL8A1 WDR72 NLF1 10 Preferably, said neoplasm is an adenoma or an adenocarcinoma and said gastrointestinal tissue is colorectal tissue. In yet another embodiment, it has been determined that a further subpopulation of these 15 markers are more characteristic of adenoma development, while others are more characteristic of cancer development. Accordingly, there is provided a convenient means of qualitatively obtaining indicative information in relation to the characteristics of the subject neoplasm. According to this embodiment there is provided a method of screening for the onset or 20 predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 204885_s_at 217523_at 236894_at 205174_s_at 228915_at 238984_at 205825_at 235976_at 241031_at; and/or 25 (ii) CD44 MSLN QPCT DACH1 NLF1 REG4 - 40 L1TD1 PCSK1 SLITRK6 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. 5 In yet still another preferred embodiment there is provided a method of screening for the onset or predisposition to the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 10 (i) the gene or genes detected by Affymetrix probeset IDs: 202311_s_at 209309_at 223062_s_at 204320_at 211506_s_at 225806_at 204475_at 214974_x_at 226237_at 204702_s _at 219787_s_at 227140_at 205910 s at 37892_at 229802_at; and/or 206224_at 222608_s_at (ii) ANLN COL1A1 IL8 AZGP1 COL8A1 MMP1 C14orf94 CST1 NFE2L3 CEL CXCL5 PSATI COLI1A1 ECT2 in a biological sample from said individual wherein a higher level of expression of the genes 15 or transcripts of group (i) and/or group (ii) relative to background levels is indicative of a cancer cell or a cell predisposed to the onset of a cancerous state. Preferably, said large intestine tissue is colorectal tissue. 20 More preferably, said biological sample is a blood sample or stool sample. As detailed hereinbefore, the present invention is designed to screen for a neoplastic cell or cellular population, which is located in the large intestine. Accordingly, reference to "cell or cellular population" should be understood as a reference to an individual cell or a group of 25 cells. Said group of cells may be a diffuse population of cells, a cell suspension, an encapsulated population of cells or a population of cells which take the form of tissue. Reference to "expression" should be understood as a reference to the transcription and/or - 41 translation of a nucleic acid molecule. In this regard, the present invention is exemplified with respect to screening for neoplastic marker expression products taking the form of RNA transcripts (eg primary RNA or mRNA). Reference to "RNA" should be understood to encompass reference to any form of RNA, such as primary RNA or mRNA. Without limiting 5 the present invention in any way, the modulation of gene transcription leading to increased or decreased RNA synthesis will also correlate with the translation of some of these RNA transcripts (such as mRNA) to produce a protein product. Accordingly, the present invention also extends to detection methodology which is directed to screening for modulated levels or patterns of the neoplastic marker protein products as an indicator of the neoplastic state of a 10 cell or cellular population. Although one method is to screen for mRNA transcripts and/or the corresponding protein product, it should be understood that the present invention is not limited in this regard and extends to screening for any other form of neoplastic marker expression product such as, for example, a primary RNA transcript. It is well within the skill of the person of skill in the art to determine the most appropriate screening target for any 15 given situation. To this end, the genes which are known to encode an expression product which is either secreted by the cell or membrane bound is detailed in the table, below. It would be appreciated that screening for neoplastic markers which are secreted or membrane bound may provide particular advantages in terms of the design of a diagnostic screening product. 20 The gene or genes detected by Affymetrix probe Nos: 200600_at 203961_at 212077_at 224724_at 200832_s at 203962_s at 212281_s at 224915_x at 200903_s_at 204006_s_at 212344_at 225295_at 200974_at 204122_at 212353_at 225520_at 201058_s_at 204127_at 212354_at 225541_at 201069_at 204170_s_at 212942_s_at 225664_at 201105_at 204320_at 213125_at 225710_at 201112_s_at 204351_at 213524_s_at 225767_at 201195_sat 204401_at 213869_x_at 225799_at 201328_at 204404_at 213880_at 225806_at 201341_at 204702_s_at 214022_s_at 225835_at 201416_at 204885_s_at 214235_at 226227_x_at 201417_at 205361_s_at 214651_s_at 226311_at 201426_s_at 205366_s_at 215049_x_at 226360_at 201468_s_at 205547_s_at 217523_at 226694_at 201479_at 205765_at 217762_s_at 226777_at -42 201563_at 205825_at 217763_s_at 226835_s_at 201601_x_at 205890_s_at 217764_s_at 227099_s_at 201616_s_at 205927_s at 217867_x_at 227140_at 201656_at 205941_s_at 217996_at 227174_at 201667_at 205983_at 218086_at 227475_at 201925_s_at 206286_s_at 218211_s_at 227566_at 201926_s_at 206976_s_at 218559_s_at 228303_at 202237_at 207158_at 218704_at 228653_at 202238_s_at 207173_x_at 218796_at 228754_at 202286_s_at 207191_s_at 218872_at 228915_at 202431_s_at 208079_s_at 218963_s_at 229215_at 202450_s_at 208712_at 218984_at 229802_at 202504_at 208782_at 219630_at 231766_s_at 202620_s_at 208788_at 219682_s_at 231832_at 202779_s_at 208850_s_at 219727_at 231879_at 202831_at 208851_s_at 219787_s_at 232151_at 202878_s_at 209156_s_at 219911_s_ at 232176_at 202917_s_at 209218_at 219955_at 232252_at 202935_s_at 209369_at 219956_at 232481_s_at 202936_s_at 209596_at 221011_s_at 233555_s_at 202954_at 209773_s_at 221922_at 234331_s_at 203124_s_at 209955_s_at 221923_s_at 234994_at 203213_at 210052_s_at 222449_at 235210_s_at 203256_at 210445_at 222450_at 235976_at 203313_s_at 210519_s_at 222549_at 236894_at 203382_s_at 210559_s_at 222608_s_at 238017_at 203645_s_at 210766_s_at 223062_s_at 203860_at 211964_at 223235_s_at 241031_at 203878_s_at 211966_at 224428_s_at 37892_at 203895_at 212063_at 224646_x_at 60474_at 203896_s at 212070_at 224694_at ACTA2 DUOX2 LOC541471 SlOAll AHCY DUSP27 MAFB S100A8 ANLN ECT2 MLPH SlooP ANTXR1 ELOVL5 MMP1 1 SCD ANXA3 ENC1 MMP2 SLC11A2 APOBECI ETS2 MSLN SLC12A2 -43 APOE FABP6 MSN SLC39A10 ASCL2 FAM84A MTHFD1L SLC6A6 AURKA FAP MXRA5 SLC7A5 BACE2 FCGR3B MYC SLCO4A1 Cl4orf94 FLJ37644 MYL9 SLITRK6 C20orfl99 FOXQ1 NEBL SMOC2 C20orf42 FSTL1 NFE2L3 SORD CALDI GOS2 NLF1 SOX4 CCND1 GALNT6 NNMT SOX9 CD163 GJAl NPDCl SQLE CD44 GPR56 NPM1 SULF1 CD55 GPSM2 NQO1 SULF2 CD93 GPX2 OLFML2B TACSTD2 CDC2 H19 PALM2-AKAP2 TAGLN CDCA7 PCCA TBX3 CDH11 INT PCSK1 TDGF1 CDH3 HOXA9 PDZK1IP1 TESC CKS2 HOXB6 PFDN4 TGIFl CLDN1 HSPH1 PHLDA1 THY1 COL1OA1 IFITMI PLCB4 TMEM97 COL1 1A1 ISLR PLOD2 TMEPAI COL12A1 ITGA6 PSATl TPX2 COL4A2 KCNN4 PUS7 TRIM29 COL6A2 KIAA 1199 RAB31 TYROBP CSE1L KIAA1913 RDHE2 UBD CTSE KRT23 RFC3 UBE2C CTSK L1TD1 RNF43 UBE2S CYP3A5 LBH RP5-875H10.1 VIM CYP3A5P2 LGALS1 RPESP WDR51B DACHI LGR5 RPL22L1 WDR72 DKC1 LOC387763 RRM2 ZNRF3 DPEP1 Reference to "nucleic acid molecule" should be understood as a reference to both deoxyribonucleic acid molecules and ribonucleic acid molecules and fragments thereof. The present invention therefore extends to both directly screening for mRNA levels in a biological 5 sample or screening for the complementary cDNA which has been reverse-transcribed from an mRNA population of interest. It is well within the skill of the person of skill in the art to - 44 design methodology directed to screening for either DNA or RNA. As detailed above, the method of the present invention also extends to screening for the protein product translated from the subject mRNA. 5 Preferably, the level of gene expression is measured by reference to genes which encode a protein product and, more particularly, said level of expression is measured at the protein level. Accordingly, to the extent that the present invention is directed to screening for markers which are detailed in the preceding table, said screening is preferably directed to the encoded protein. 10 As detailed hereinbefore, it should be understood that although the present invention is exemplified with respect to the detection of expressed nucleic acid molecules (e.g. mRNA), it also encompasses methods of detection based on screening for the protein product of the subject genes. The present invention should also be understood to encompass methods of 15 detection based on identifying both proteins and/or nucleic acid molecules in one or more biological samples. This may be of particular significance to the extent that some of the neoplastic markers of interest may correspond to genes or gene fragments which do not encode a protein product. Accordingly, to the extent that this occurs it would not be possible to test for a protein and the subject marker would have to be assessed on the basis of 20 transcription expression profiles. In terms of screening for the upregulation of expression of a marker it would also be well known to the person of skill in the art that changes which are detectable at the DNA level are indicative of changes to gene expression activity and therefore changes to expression product 25 levels. Such changes include but are not limited to, changes to DNA methylation. Accordingly, reference herein to "screening the level of expression" and comparison of these "levels of expression" to control "levels of expression" should be understood as a reference to assessing DNA factors which are related to transcription, such as gene/DNA methylation patterns. 30 The term "protein" should be understood to encompass peptides, polypeptides and proteins (including protein fragments). The protein may be glycosylated or unglycosylated and/or may contain a range of other molecules fused, linked, bound or otherwise associated to the protein such as amino acids, lipids, carbohydrates or other peptides, polypeptides or proteins. 35 Reference herein to a "protein" includes a protein comprising a sequence of amino acids as well as a protein associated with other molecules such as amino acids, lipids, carbohydrates or other peptides, polypeptides or proteins. The proteins encoded by the neoplastic markers of the present invention may be in multimeric - 45 form meaning that two or more molecules are associated together. Where the same protein molecules are associated together, the complex is a homomultimer. An example of a homomultimer is a homodimer. Where at least one marker protein is associated with at least one non-marker protein, then the complex is a heteromultimer such as a heterodimer. 5 Reference to a "fragment" should be understood as a reference to a portion of the subject nucleic acid molecule or protein. This is particularly relevant with respect to screening for modulated RNA levels in stool samples since the subject RNA is likely to have been degraded or otherwise fragmented due to the environment of the gut. One may therefore actually be 10 detecting fragments of the subject RNA molecule, which fragments are identified by virtue of the use of a suitably specific probe. Reference to the "onset" of a neoplasm, such as adenoma or adenocarcinoma, should be understood as a reference to one or more cells of that individual exhibiting dysplasia. In this 15 regard, the adenoma or adenocarcinoma may be well developed in that a mass of dysplastic cells has developed. Alternatively, the adenoma or adenocarcinoma may be at a very early stage in that only relatively few abnormal cell divisions have occurred at the time of diagnosis. The present invention also extends to the assessment of an individual's predisposition to the development of a neoplasm, such as an adenoma or adenocarcinoma. 20 Without limiting the present invention in any way, changed levels of the neoplastic markers may be indicative of that individual's predisposition to developing a neoplasia, such as the future development of an adenoma or adenocarcinoma or another adenoma or adenocarcinoma. 25 In yet another related aspect of the present invention, markers have been identified which enable the characterisation of neoplastic tissue of the large intestine in terms of whether it is an adenoma or a cancer. This development now provides a simple yet accurate means of characterising tissue using means other than the traditional methods which are currently utilised. 30 According to this aspect of the present invention, there is provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 35 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200884_at 214234_s_at 226248_s_at 203240_at 214235_at 226302_at 203963_at 214433_s_at 227676_at -46 204508_s_at 215125_s_at 227719_at 204607_at 215867_x at 227725_at 204811_s_at 217109_at 228232_s_at 204895_x_at 217110_s_at 229070_at 204897_at 218211_s_at 231832_at 205259_at 219543_at 232176_at 205765_at 219955_at 232481_s_at 205927_s_at 221841_s_at 235976_at 208063_s_at 221874_at 236894_at 208937_s_at 223969_s_at 237521_x_at 210107_at 223970_at 242601_at 213106_at; and/or (ii) CLCA1 CTSE ATP8B1 FCGBP C6orfl05 CACNA2D2 HMGCS2 CKB KLF4 RETNLB ATP8A1 CYP3A5P2 L1TD1 MUC4 CAPN9 SLITRK6 UGT1A1 NR3C2 VSIG2 SELENBP1 PBLD LOC253012 PTGER4 CA12 ST6GALNAC1 MLPH WDR51B IDI KIAA1324 FAM3D CYP3A5 in said cell or cellular population wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer cell level is 5 indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. In another aspect there is provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or 10 transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200600 at 204006_s at 213428_s at 200665_s_at 204051_s_at 213524_s_at 200832_s_at 204122_at 213869_x_at - 47 200974_at 204320_at 213905_x at 200986_at 204475_at 214247_s_at 201058_s_at 204620_s_at 215049_x_at 201069_at 205479_s_at 215076_s_at 201105_at 205547_s_at 215646_s_at 201141 at 205828 at 216442 x at 201147_s_at 207173_x_at 217430_x_at 201150 _s_at 207191_s_at 217762_s_at 201162_at 208747_s_at 217763_s_at 201163_s_at 208782_at 217764_s_at 201185_at 208788 at 218468 s at 201261_x_at 208850_s_at 218469_at 201289_at 208851_s_at 218559_s_at 201426_s_at 209101_at 218638_s_at 201438_at 209156_s_at 219087_at 201616 s at 209218 at 221011 s at 201645_at 209395_at 221729_at 201667_at 209396_s_at 221730_at 201744_s_at 209596_at 221731_x_at 201792_at 209875_s_at 37892_at 201842_s_at 209955_s_at 223122_s_at 201852_x_at 210095_s_at 223235_s_at 201859_at 210495_x_at 224560_at 201893_x_at 210511 s_at 224694_at 202237_at 210764_s_at 224724_at 202238_s at 210809_s at 225664_at 202283_at 211161_s_at 225681_at 202291_s_at 211571_s_at 225710_at 202310_s_at 211719_x_at 225799_at 202311_s_at 211813_x_at 226237_at 202403_s_at 211896_s_at 226311 at 202404_s_at 211959_at 226694_at 202450_s_at 211964_at 226777_at 202620_s_at 211966_at 226930_at 202766_s_at 211980_at 227099_s_at 202859_x at 211981 at 227140_at 202878_s_at 212077_at 227566_at 202917_s_at 212344_at 229218_at - 48 202998_s at 212353_at 229802_at 203083_at 212354_at 231579_s_at 203325_s_at 212464_s_at 231766_s_at 203382_s_at 212488_at 231879_at 203477_at 212489_at 232458_at 203570_at 212667_at 233555_s at 203645_s_at 213125_at 234994_at 203878_s_at; and/or (ii) COL1A2 LGALS1 SRGN CTHRCl ELOVL5 LBH FN1 MGP CTGF POSTN MMP2 TNC SPP1 LOXL2 GOS2 MMP1 MYL9 SQLE SPARC DCN EFEMPI LUM CALDI APOE GREMI FBN1 MSN IL8 MMP3 IGFBP3 IGFBP5 IGFBP7 SERPINFI SFRP2 FSTL1 ISLR SULFI COL4A2 HNT ASPN VCAN COL5A1 COL6A3 SMOC2 OLFML2B COL8A1 HTRAl KIAA1913 COL12A1 CYR61 PALM2-AKAP2 COL5A2 FAP SERPINGI CDH11 VIM TYROBP THBS2 TIMP2 ACTA2 COL15Al SCD COL3A1 COL1 1A1 TIMP3 PLOD2 S100A8 AEBP1 MMP1l FNDC1 GJA1 CD163 SFRP4 NNMT FCGR3B INHBA COLlAl PLAU COL6A2 SULF2 MAFB ANTXR1 COL6A1 LOC541471 GPNMB SPON2 LOC387763 -49 BGN CTSK CHI3L1 TAGLN MXRA5 THY1 COL4A1 CiS LOXLl RAB31 DKK3 CD93 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal adenoma cell level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 5 Preferably, said gastrointestinal tissue is colorectal tissue. Reference to an "adenoma control level" or "cancer control level" should be understood as a reference to the level of said gene expression in a population of adenoma or cancer 10 gastrointestinal cells, respectively. As discussed hereinbefore in relation to "normal levels", the subject level may be a discrete level or a range of levels. Accordingly, the definition of "adenoma control level" or "cancer control level" should be understood to have a corresponding definition to "normal level", albeit in the context of the expression of genes by a neoplastic population of large intestine cells. 15 In terms of this aspect of the present invention, the subject analysis is performed on a population of neoplastic cells. These cells may be derived in any manner, such as sloughed of neoplastic cells which have been collected via an enema wash or from a gastrointestinal sample, such as a stool sample. Alternatively, the subject cells may have been obtained via a 20 biopsy or other surgical technique. Without limiting this aspect of the invention in any way, several of the markers of this aspect of the present invention have been determined to be expressed at particularly significant levels above those of neoplastic cells. For example, increased expression levels of 3- and 5-fold 25 have been observed in respect of the following markers, when assessed by the method exemplified herein, which are indicative of gastrointestinal adenomas. Fold Increase Gene or genes detected by Gene Affymetrix Probe No: 5 210107_at CLCA1 203240_at FCGBP 204607_at HMGCS2 3 223969_s at RETNLB 219955_at L1TDI 232481_s_at SLITRK6 - 50 Fold Increase Gene or genes detected by Gene Affymetrix Probe No: 228232_s_at VSIG2 242601_at LOC253012 227725_at ST6GALNAC1 In another example, increased expression levels of between 3- and 9-fold have been observed in respect of the following markers which are indicative of gastrointestinal cancers, when assessed by the method herein exemplified: 5 Gene or genes detected by Gene Fold Increase Gn Affymetrix Probe No: 9 202404_s at COL1A2 8 225681_at CTHRC1 212464_s at FN1 7 - -_ 210809_s_at POSTN 6 209875_s at SPP1 227140_st 5 -MMP 1 204475_at 200665_s_at SPARC 201744_s at LUM 4 218468 s at GREMI 202859_x_at IL8 211959_at IGFBP5 223122_s_at SFRP2 212353_at SULF1 3 219087_at ASPN 201438_at COL6A3 226237_at COL8A1 225664_at COL12A1 221730_at COL5A2 207173_x_at CDH11 203083_at THBS2 203477_at COL15A1 37892_at COL1lAl 202917_s_at SlOOA8 226930_at FNDCl 204051_s at SFRP4 -51 Fold Increase Gene or genes detected by Gene Affymetrix Probe No: 210511_s_at INHBA 209156_s_at COL6A2 224694_at ANTXR1 201141_at GPNMB 213905_x_at BGN 205547_s_at TAGLN According to this embodiment, there is therefore provided a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of 5 one or more genes or transcripts selected from: (i) the gene or genes detected by Affymetrix probeset IDs: 210107_at; and/or (ii) CLCA1 10 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer control level is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. In another embodiment, there is provided a method of characterising a neoplastic cell or 15 cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 203240_at 219955_at 242601_at 204607_at 232481_s_at 227725_at 223969_s_at 228232_s_at; and/or 20 (ii) FCGBP LITD1 LOC253012 HMGCS2 SLITRK6 ST6GALNACl RETNLB VSIG2 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer control level is - 52 indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. Preferably, said gastrointestinal tissue is colorectal tissue. 5 Still more preferably, said biological sample is a tissue sample. In another preferred embodiment the present invention is directed to a method of characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression 10 of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202404_s_at 210809_s_at 227140_at 225681_at 209875_s_at 204475_at 212464_s_at; and/or (ii) COL1A2 FN1 SPP1 CTHRCl POSTN MMP1 15 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. In yet another preferred embodiment the present invention is directed to a method of 20 characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200665_s_at 218468_s_at 211959_at 201744_s_at 202859_x_at; and/or 25 (ii) SPARC GREMI IGFBP5 LUM IL8 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 30 In still yet another preferred embodiment the present invention is directed to a method of - 53 characterising a cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 5 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 223122_s_at 207173_x_at 210511_s_at 212353_at 203083_at 209156_s_at 219087_at 203477_at 224694_at 201438_at 37892_at 201141_at 226237_at 202917_s_at 213905_x_at 225664_at 226930_at 205547_s at 221730_at 204051_s_at; and/or (ii) SFRP2 CDH 11 INHBA SULF1 THBS2 COL6A2 ASPN COL15Al ANTXR1 COL6A3 COL1 1A1 GPNMB COL8Al S100A8 BGN COL12A1 FNDC1 TAGLN COL5A2 SFRP4 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to a gastrointestinal adenoma control level is 10 indicative of a cancer or a cell predisposed to the onset of a cancerous state. Preferably, said gastrointestinal tissue is colorectal tissue. Even more preferably, said biological sample is a tissue sample. 15 In still another related aspect it has been determined that a subset of the markers of this aspect of the present invention are useful as qualitative markers of neoplastic tissue characterisation in that these markers, if detectable above background levels in neoplastic tissue are indicative of either adenoma or cancerous tissue. 20 According to this aspect, the present invention provides a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: 25 - 54 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 235976_at 236894_at; and/or (ii) SLITRK6 LITD1 in a biological sample from said individual wherein a higher level of expression of the genes 5 or transcripts of group (i) and/or (ii) relative to neoplastic tissue background levels is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state. In another aspect the present invention provides a method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of 10 an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202311_s_at 209396_s_at 226237_at 204320_at 215646 s_at 227140_at 204475_at 37892_at 229802_at 209395_at; and/or (ii) COLlAl VCAN CHI3L1 MMP1 COL8Al COLIlAl 15 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or (ii) relative to neoplastic tissue background neoplastic cell levels is indicative of a cancer or a cell predisposed to the onset of a cancerous state. 20 Preferably, said gastrointestinal tissue is colorectal tissue. Still more preferably, said biological sample is a tissue sample. In a most preferred embodiment, the methods of the present invention are preferably directed 25 to screening for proteins encoded by the markers of the present invention. Although the preferred method is to detect the expression products of the neoplastic markers for the purpose of diagnosing neoplasia development or predisposition thereto, the detection of converse changes in the levels of said markers may be desired under certain circumstances, 30 for example, to monitor the effectiveness of therapeutic or prophylactic treatment directed to modulating a neoplastic condition, such as adenoma or adenocarcinoma development. For example, where elevated levels of the subject markers indicate that an individual has - 55 developed a condition characterised by adenoma or adenocarcinoma development, for example, screening for a decrease in the levels of these markers subsequently to the onset of a therapeutic regime may be utilised to indicate reversal or other form of improvement of the subject individual's condition. 5 The method of the present invention is therefore useful as a one-time test or as an on-going monitor of those individuals thought to be at risk of neoplasia development or as a monitor of the effectiveness of therapeutic or prophylactic treatment regimes directed to inhibiting or otherwise slowing neoplasia development. In these situations, mapping the modulation of 10 neoplastic marker expression levels in any one or more classes of biological samples is a valuable indicator of the status of an individual or the effectiveness of a therapeutic or prophylactic regime which is currently in use. Accordingly, the method of the present invention should be understood to extend to monitoring for increases or decreases in marker expression levels in an individual relative to their normal level (as hereinbefore defined), 15 background control levels, cancer levels, adenoma levels or relative to one or more earlier marker expression levels determined from a biological sample of said individual. Means of testing for the subject expressed neoplasm markers in a biological sample can be achieved by any suitable method, which would be well known to the person of skill in the art, 20 such as but not limited to: (i) In vivo detection. Molecular Imaging may be used following administration of imaging probes or reagents 25 capable of disclosing altered expression of the markers in the intestinal tissues. Molecular imaging (Moore et al., BBA, 1402:239-249, 1988; Weissleder et al., Nature Medicine 6:351-355, 2000) is the in vivo imaging of molecular expression that correlates with the macro-features currently visualized using "classical" diagnostic 30 imaging techniques such as X-Ray, computed tomography (CT), MRI, Positron Emission Tomography (PET) or endoscopy. (ii) Detection of up-regulation of RNA expression in the cells by Fluorescent In Situ Hybridization (FISH), or in extracts from the cells by technologies such as Quantitative 35 Reverse Transcriptase Polymerase Chain Reaction (QRTPCR) or Flow cytometric qualification of competitive RT-PCR products (Wedemeyer et al., Clinical Chemistry 48:9 1398-1405, 2002). (iii) Assessment of expression profiles of RNA, for example by array technologies (Alon et - 56 al., Proc. Natl. A cad. Sci. USA: 96, 6745-6750, June 1999). A "microarray" is a linear or multi-dimensional array of preferably discrete regions, each having a defined area, formed on the surface of a solid support. The density of the 5 discrete regions on a microarray is determined by the total numbers of target polynucleotides to be detected on the surface of a single solid phase support. As used herein, a DNA microarray is an array of oligonucleotide probes placed onto a chip or other surfaces used to detect complementary oligonucleotides from a complex nucleic acid mixture. Since the position of each particular group of probes in the array is 10 known, the identities of the target polynucleotides can be determined based on their binding to a particular position in the microarray. Recent developments in DNA microarray technology make it possible to conduct a large scale assay of a plurality of target nucleic acid molecules on a single solid phase 15 support. U.S. Pat. No. 5,837,832 (Chee et al.) and related patent applications describe immobilizing an array of oligonucleotide probes for hybridization and detection of specific nucleic acid sequences in a sample. Target polynucleotides of interest isolated from a tissue of interest are hybridized to the DNA chip and the specific sequences detected based on the target polynucleotides' preference and degree of hybridization at 20 discrete probe locations. One important use of arrays is in the analysis of differential gene expression, where the profile of expression of genes in different cells or tissues, often a tissue of interest and a control tissue, is compared and any differences in gene expression among the respective tissues are identified. Such information is useful for the identification of the types of genes expressed in a particular tissue type and 25 diagnosis of conditions based on the expression profile. In one example, RNA from the sample of interest is subjected to reverse transcription to obtain labelled cDNA. See U.S. Pat. No. 6,410,229 (Lockhart et al.) The cDNA is then hybridized to oligonucleotides or cDNAs of known sequence arrayed on a chip or other 30 surface in a known order. In another example, the RNA is isolated from a biological sample and hybridised to a chip on which are anchored cDNA probes. The location of the oligonucleotide to which the labelled cDNA hybridizes provides sequence information on the cDNA, while the amount of labelled hybridized RNA or cDNA provides an estimate of the relative representation of the RNA or cDNA of interest. See 35 Schena, et al. Science 270:467-470 (1995). For example, use of a cDNA microarray to analyze gene expression patterns in human cancer is described by DeRisi, et al. (Nature Genetics 14:457-460 (1996)). In a preferred embodiment, nucleic acid probes corresponding to the subject nucleic - 57 acids are made. The nucleic acid probes attached to the microarray are designed to be substantially complementary to the nucleic acids of the biological sample such that specific hybridization of the target sequence and the probes of the present invention occurs. This complementarity need not be perfect, in that there may be any number of 5 base pair mismatches that will interfere with hybridization between the target sequence and the single stranded nucleic acids of the present invention. It is expected that the overall homology of the genes at the nucleotide level probably will be about 40% or greater, probably about 60% or greater, and even more probably about 80% or greater; and in addition that there will be corresponding contiguous sequences of about 8-12 10 nucleotides or longer. However, if the number of mutations is so great that no hybridization can occur under even the least stringent of hybridization conditions, the sequence is not a complementary target sequence. Thus, by "substantially complementary" herein is meant that the probes are sufficiently complementary to the target sequences to hybridize under normal reaction conditions, particularly high 15 stringency conditions. A nucleic acid probe is generally single stranded but can be partly single and partly double stranded. The strandedness of the probe is dictated by the structure, composition, and properties of the target sequence. In general, the oligonucleotide 20 probes range from about 6, 8, 10, 12, 15, 20, 30 to about 100 bases long, with from about 10 to about 80 bases being preferred, and from about 15 to about 40 bases being particularly preferred. That is, generally entire genes are rarely used as probes. In some embodiments, much longer nucleic acids can be used, up to hundreds of bases. The probes are sufficiently specific to hybridize to a complementary template sequence 25 under conditions known by those of skill in the art. The number of mismatches between the probe's sequences and their complementary template (target) sequences to which they hybridize during hybridization generally do not exceed 15%, usually do not exceed 10% and preferably do not exceed 5%, as-determined by BLAST (default settings). 30 Oligonucleotide probes can include the naturally-occurring heterocyclic bases normally found in nucleic acids (uracil, cytosine, thymine, adenine and guanine), as well as modified bases and base analogues. Any modified base or base analogue compatible with hybridization of the probe to a target sequence is useful in the practice of the 35 invention. The sugar or glycoside portion of the probe can comprise deoxyribose, ribose, and/or modified forms of these sugars, such as, for example, 2'-O-alkyl ribose. In a preferred embodiment, the sugar moiety is 2'-deoxyribose; however, any sugar moiety that is compatible with the ability of the probe to hybridize to a target sequence - 58 can be used. In one embodiment, the nucleoside units of the probe are linked by a phosphodiester backbone, as is well known in the art. In additional embodiments, internucleotide 5 linkages can include any linkage known to one of skill in the art that is compatible with specific hybridization of the probe including, but not limited to phosphorothioate, methylphosphonate, sulfamate (e.g., U.S. Pat. No. 5,470,967) and polyamide (i.e., peptide nucleic acids). Peptide nucleic acids are described in Nielsen et al. (1991) Science 254: 1497-1500, U.S. Pat. No. 5,714,331, and Nielsen (1999) Curr. Opin. 10 Biotechnol. 10:71-75. In certain embodiments, the probe can be a chimeric molecule; i.e., can comprise more than one type of base or sugar subunit, and/or the linkages can be of more than one type within the same primer. The probe can comprise a moiety to facilitate hybridization to 15 its target sequence, as are known in the art, for example, intercalators and/or minor groove binders. Variations of the bases, sugars, and internucleoside backbone, as well as the presence of any pendant group on the probe, will be compatible with the ability of the probe to bind, in a sequence-specific fashion, with its target sequence. A large number of structural modifications, are possible within these bounds. Advantageously, 20 the probes according to the present invention may have structural characteristics such that they allow the signal amplification, such structural characteristics being, for example, branched DNA probes as those described by Urdea et al. (Nucleic Acids Symp. Ser., 24:197-200 (1991)) or in the European Patent No. EP-0225,807. Moreover, synthetic methods for preparing the various heterocyclic bases, sugars, nucleosides and 25 nucleotides that form the probe, and preparation of oligonucleotides of specific predetermined sequence, are well-developed and known in the art. A preferred method for oligonucleotide synthesis incorporates the teaching of U.S. Pat. No. 5,419,966. Multiple probes may be designed for a particular target nucleic acid to account for 30 polymorphism and/or secondary structure in the target nucleic acid, redundancy of data and the like. In some embodiments, where more than one probe per sequence is used, either overlapping probes or probes to different sections of a single target gene are used. That is, two, three, four or more probes, are used to build in a redundancy for a particular target. The probes can be overlapping (i.e. have some sequence in common), 35 or are specific for distinct sequences of a gene. When multiple target polynucleotides are to be detected according to the present invention, each probe or probe group corresponding to a particular target polynucleotide is situated in a discrete area of the microarray.
- 59 Probes may be in solution, such as in wells or on the surface of a micro-array, or attached to a solid support. Examples of solid support materials that can be used include a plastic, a ceramic, a metal, a resin, a gel and a membrane. Useful types of solid supports include plates, beads, magnetic material, microbeads, hybridization chips, 5 membranes, crystals, ceramics and self-assembling monolayers. One example comprises a two-dimensional or three-dimensional matrix, such as a gel or hybridization chip with multiple probe binding sites (Pevzner et al., J. Biomol. Struc. & Dyn. 9:399-410, 1991; Maskos and Southern, Nuc. Acids Res. 20:1679-84, 1992). Hybridization chips can be used to construct very large probe arrays that are 10 subsequently hybridized with a target nucleic acid. Analysis of the hybridization pattern of the chip can assist in the identification of the target nucleotide sequence. Patterns can be manually or computer analyzed, but it is clear that positional sequencing by hybridization lends itself to computer analysis and automation. In another example, one may use an Affymetrix chip on a solid phase structural support in 15 combination with a fluorescent bead based approach. In yet another example, one may utilise a cDNA microarray. In this regard, the oligonucleotides described by Lockkart et al. (i.e. Affymetrix synthesis probes in situ on the solid phase) are particularly preferred, that is, photolithography. 20 As will be appreciated by those in the art, nucleic acids can be attached or immobilized to a solid support in a wide variety of ways. By "immobilized" herein is meant the association or binding between the nucleic acid probe and the solid support is sufficient to be stable under the conditions of binding, washing, analysis, and removal. The binding can be covalent or non-covalent. By "non-covalent binding" and grammatical 25 equivalents herein is meant one or more of either electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and grammatical equivalents herein is meant that the two moieties, the solid support and the probe, are 30 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. Covalent bonds can be formed directly between the probe and the solid support or can be formed by a cross linker or by inclusion of a specific reactive group on either the solid support or the probe or both molecules. Immobilization may also involve a combination of covalent and non-covalent interactions. 35 Nucleic acid probes may be attached to the solid support by covalent binding such as by conjugation with a coupling agent or by covalent or non-covalent binding such as electrostatic interactions, hydrogen bonds or antibody-antigen coupling, or by - 60 combinations thereof. Typical coupling agents include biotin/avidin, biotin/streptavidin, Staphylococcus aureus protein A/IgG antibody F, fragment, and streptavidin/protein A chimeras (T. Sano and C. R. Cantor, Bio/Technology 9:1378-81 (1991)), or derivatives or combinations of these agents. Nucleic acids may be attached 5 to the solid support by a photocleavable bond, an electrostatic bond, a disulfide bond, a peptide bond, a diester bond or a combination of these sorts of bonds. The array may also be attached to the solid support by a selectively releasable bond such as 4,4' dimethoxytrityl or its derivative. Derivatives which have been found to be useful include 3 or 4 [bis-(4-methoxyphenyl)]-methyl-benzoic acid, N-succinimidyl-3 or 4 10 [bis-(4-methoxyphenyl)]-methyl-benzoic acid, N-succinimidyl-3 or 4 [bis-(4 methoxyphenyl)]-hydroxymethyl-benzoic acid, N-succinimidyl-3 or 4 [bis-(4 methoxyphenyl)]-chloromethyl-benzoic acid, and salts of these acids. In general, the probes are attached to the microarray in a wide variety of ways, as will 15 be appreciated by those in the art. As described herein, the nucleic acids can either be synthesized first, with subsequent attachment to the microarray, or can be directly synthesized on the microarray. The microarray comprises a suitable solid substrate. By "substrate" or "solid support" 20 or other grammatical equivalents herein is meant any material that can be modified to contain discrete individual sites appropriate for the attachment or association of the nucleic acid probes and is amenable to at least one detection method. The solid phase support of the present invention can be of any solid materials and structures suitable for supporting nucleotide hybridization and synthesis. Preferably, the solid phase support 25 comprises at least one substantially rigid surface on which the primers can be immobilized and the reverse transcriptase reaction performed. The substrates with which the polynucleotide microarray elements are stably associated and may be fabricated from a variety of materials, including plastics, ceramics, metals, acrylamide, cellulose, nitrocellulose, glass, polystyrene, polyethylene vinyl acetate, polypropylene, 30 polymethacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, Teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polylactic acid, polyorthoesters, polypropylfumerate, collagen, glycosaminoglycans, and polyamino acids. Substrates may be two-dimensional or three-dimensional in form, such as gels, membranes, thin films, glasses, plates, cylinders, beads, magnetic 35 beads, optical fibers, woven fibers, etc. A preferred form of array is a three dimensional array. A preferred three-dimensional array is a collection of tagged beads. Each tagged bead has different primers attached to it. Tags are detectable by signalling means such as color (Luminex, Illumina) and electromagnetic field (Pharmaseq) and -61 signals on tagged beads can even be remotely detected (e.g., using optical fibers). The size of the solid support can be any of the standard microarray sizes, useful for DNA microarray technology, and the size may be tailored to fit the particular machine being used to conduct a reaction of the invention. In general, the substrates allow optical 5 detection and do not appreciably fluoresce. In one embodiment, the surface of the microarray and the probe may be derivatized with chemical functional groups for subsequent attachment of the two. Thus, for example, the microarray is derivatized with a chemical functional group including, but 10 not limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being particularly preferred. Using these functional groups, the probes can be attached using functional groups on the probes. For example, nucleic acids containing amino groups can be attached to surfaces comprising amino groups, for example using linkers as are known in the art; for example, homo-or hetero-bifunctional linkers as are 15 well known. In addition, in some cases, additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be used. In this embodiment, the oligonucleotides are synthesized as is known in the art, and then attached to the surface of the solid support. As will be appreciated by those skilled 20 in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may be via an internal nucleoside. In an additional embodiment, the immobilization to the solid support may be very strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to surfaces covalently coated with streptavidin, resulting in attachment. 25 The arrays may be produced according to any convenient methodology, such as preforming the polynucleotide microarray elements and then stably associating them with the surface. Alternatively, the oligonucleotides may be synthesized on the surface, as is known in the art. A number of different array configurations and methods for their 30 production are known to those of skill in the art and disclosed in WO 95/25116 and WO 95/35505 (photolithographic techniques), U.S. Pat. No. 5,445,934 (in situ synthesis by photolithography), U.S. Pat. No. 5,384,261 (in situ synthesis by mechanically directed flow paths); and U.S. Pat. No. 5,700,637 (synthesis by spotting, printing or coupling); the disclosure of which are herein incorporated in their entirety by reference. Another 35 method for coupling DNA to beads uses specific ligands attached to the end of the DNA to link to ligand-binding molecules attached to a bead. Possible ligand-binding partner pairs include biotin-avidin/streptavidin, or various antibody/antigen pairs such as digoxygenin-antidigoxygenin antibody (Smith et al., Science 258:1122-1126 (1992)). Covalent chemical attachment of DNA to the support can be accomplished by - 62 using standard coupling agents to link the 5'-phosphate on the DNA to coated microspheres through a phosphoamidate bond. Methods for immobilization of oligonucleotides to solid-state substrates are well established. See Pease et al., Proc. Natd. Acad. Sci. USA 91( 11):5022-5026 (1994). A preferred method of attaching 5 oligonucleotides to solid-state substrates is described by Guo et al., Nucleic Acids Res. 22:5456-5465 (1994). Immobilization can be accomplished either by in situ DNA synthesis (Maskos and Southern, supra) or by covalent attachment of chemically synthesized oligonucleotides (Guo et al., supra) in combination with robotic arraying technologies. 10 In addition to the solid-phase technology represented by microarray arrays, gene expression can also be quantified using liquid-phase assays. One such system is kinetic polymerase chain reaction (PCR). Kinetic PCR allows for the simultaneous amplification and quantification of specific nucleic acid sequences. The specificity is 15 derived from synthetic oligonucleotide primers designed to preferentially adhere to single-stranded nucleic acid sequences bracketing the target site. This pair of oligonucleotide primers form specific, non-covalently bound complexes on each strand of the target sequence. These complexes facilitate in vitro transcription of double stranded DNA in opposite orientations. Temperature cycling of the reaction mixture 20 creates a continuous cycle of primer binding, transcription, and re-melting of the nucleic acid to individual strands. The result is an exponential increase of the target dsDNA product. This product can be quantified in real time either through the use of an intercalating dye or a sequence specific probe. SYBR(r) Green 1, is an example of an intercalating dye, that preferentially binds to dsDNA resulting in a concomitant 25 increase in the fluorescent signal. Sequence specific probes, such as used with TaqMan technology, consist of a fluorochrome and a quenching molecule covalently bound to opposite ends of an oligonucleotide. The probe is designed to selectively bind the target DNA sequence between the two primers. When the DNA strands are synthesized during the PCR reaction, the fluorochrome is cleaved from the probe by the 30 exonuclease activity of the polymerase resulting in signal dequenching. The probe signalling method can be more specific than the intercalating dye method, but in each case, signal strength is proportional to the dsDNA product produced. Each type of quantification method can be used in multi-well liquid phase arrays with each well representing primers and/or probes specific to nucleic acid sequences of interest. When 35 used with messenger RNA preparations of tissues or cell lines, an array of probe/primer reactions can simultaneously quantify the expression of multiple gene products of interest. See Germer et al., Genome Res. 10:258-266 (2000); Heid et al., Genome Res. 6:986-994 (1996).
- 63 (iv) Measurement of altered neoplastic marker protein levels in cell extracts, for example by immunoassay. 5 Testing for proteinaceous neoplastic marker expression product in a biological sample can be performed by any one of a number of suitable methods which are well known to those skilled in the art. Examples of suitable methods include, but are not limited to, antibody screening of tissue sections, biopsy specimens or bodily fluid samples. 10 To the extent that antibody based methods of diagnosis are used, the presence of the marker protein may be determined in a number of ways such as by Western blotting, ELISA or flow cytometry procedures. These, of course, include both single-site and two-site or "sandwich" assays of the non-competitive types, as well as in the traditional competitive binding assays. These assays also include direct binding of a labelled 15 antibody to a target. Sandwich assays are among the most useful and commonly used assays. A number of variations of the sandwich assay technique exist, and all are intended to be encompassed by the present invention. Briefly, in a typical forward assay, an 20 unlabelled antibody is immobilized on a solid substrate and the sample to be tested brought into contact with the bound molecule. After a suitable period of incubation, for a period of time sufficient to allow formation of an antibody-antigen complex, a second antibody specific to the antigen, labelled with a reporter molecule capable of producing a detectable signal is then added and incubated, allowing time sufficient for the 25 formation of another complex of antibody-antigen-labelled antibody. Any unreacted material is washed away, and the presence of the antigen is determined by observation of a signal produced by the reporter molecule. The results may either be qualitative, by simple observation of the visible signal, or may be quantitated by comparing with a control sample. Variations on the forward assay include a simultaneous assay, in which 30 both sample and labelled antibody are added simultaneously to the bound antibody. These techniques are well known to those skilled in the art, including any minor variations as will be readily apparent. In the typical forward sandwich assay, a first antibody having specificity for the marker 35 or antigenic parts thereof, is either covalently or passively bound to a solid surface. The solid surface is typically glass or a polymer, the most commonly used polymers being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The solid supports may be in the form of tubes, beads, discs of microplates, or any other surface suitable for conducting an immunoassay. The binding processes are well- - 64 known in the art and generally consist of cross-linking, covalently binding or physically adsorbing, the polymer-antibody complex is washed in preparation for the test sample. An aliquot of the sample to be tested is then added to the solid phase complex and incubated for a period of time sufficient (e.g. 2-40 minutes) and under suitable 5 conditions (e.g. 25*C) to allow binding of any subunit present in the antibody. Following the incubation period, the antibody subunit solid phase is washed and dried and incubated with a second antibody specific for a portion of the antigen. The second antibody is linked to a reporter molecule which is used to indicate the binding of the second antibody to the antigen. 10 An alternative method involves immobilizing the target molecules in the biological sample and then exposing the immobilized target to specific antibody which may or may not be labelled with a reporter molecule. Depending on the amount of target and the strength of the reporter molecule signal, a bound target may be detectable by direct 15 labelling with the antibody. Alternatively, a second labelled antibody, specific to the first antibody is exposed to the target-first antibody complex to form a target-first antibody-second antibody tertiary complex. The complex is detected by the signal emitted by the reporter molecule. 20 By "reporter molecule" as used in the present specification, is meant a molecule which, by its chemical nature, provides an analytically identifiable signal which allows the detection of antigen-bound antibody. Detection may be either qualitative or quantitative. The most commonly used reporter molecules in this type of assay are either enzymes, fluorophores or radionuclide containing molecules (i.e. radioisotopes) 25 and chemiluminescent molecules. In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, generally by means of glutaraldehyde or periodate. As will be readily recognized, however, a wide variety of different conjugation techniques exist, which are 30 readily available to the skilled artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, beta-galactosidase and alkaline phosphatase, amongst others. The substrates to be used with the specific enzymes are generally chosen for the production, upon hydrolysis by the corresponding enzyme, of a detectable color change. Examples of suitable enzymes include alkaline phosphatase and peroxidase. It is also 35 possible to employ fluorogenic substrates, which yield a fluorescent product rather than the chromogenic substrates noted above. In all cases, the enzyme-labelled antibody is added to the first antibody hapten complex, allowed to bind, and then the excess reagent is washed away. A solution containing the appropriate substrate is then added to the complex of antibody-antigen-antibody. The substrate will react with the enzyme linked - 65 to the second antibody, giving a qualitative visual signal, which may be further quantitated, usually spectrophotometrically, to give an indication of the amount of antigen which was present in the sample. "Reporter molecule" also extends to use of cell agglutination or inhibition of agglutination such as red blood cells on latex beads, 5 and the like. Alternately, fluorescent compounds, such as fluorecein and rhodamine, may be chemically coupled to antibodies without altering their binding capacity. When activated by illumination with light of a particular wavelength, the fluorochrome 10 labelled antibody adsorbs the light energy, inducing a state to excitability in the molecule, followed by emission of the light at a characteristic color visually detectable with a light microscope. As in the EIA, the fluorescent labelled antibody is allowed to bind to the first antibody-hapten complex. After washing off the unbound reagent, the remaining tertiary complex is then exposed to the light of the appropriate wavelength 15 the fluorescence observed indicates the presence of the hapten of interest. Immunofluorescence and EIA techniques are both very well established in the art and are particularly preferred for the present method. However, other reporter molecules, such as radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 20 (v) Without limiting the present invention to any one theory or mode of action, during development gene expression is regulated by processes that alter the availability of genes for expression in different cell lineages without any alteration in gene sequence, and these states can be inherited through a cell division - a process called epigenetic 25 inheritance. Epigenetic inheritance is determined by a combination of DNA methylation (modification of cytosine to give 5-methyl cytosine, 5meC) and by modifications of the histone chromosomal proteins that package DNA. Thus methylation of DNA at CpG sites and modifications such as deacetylation of histone H3 on lysine 9, and methylation on lysine 9 or 27 are associated with inactive chromatin, 30 while the converse state of a lack of DNA methylation, acetylation of lysine 9 of histone H3 is associated with open chromatin and active gene expression. In cancer, this epigenetic regulation of gene expression is frequently found to be disrupted (Esteller & Herman, 2000; Jones & Baylin, 2002). Genes such as tumour suppressor or metastasis suppressor genes are often found to be silenced by DNA methylation, while other genes 35 may be hypomethylated and inappropriately expressed. Thus, among genes that elevated or inappropriate expression in cancer, this in some instances is characterised by a loss of methylation of the promoter or regulatory region of the gene. A variety of methods are available for detection of aberrantly methylated DNA of a - 66 specific gene, even in the presence of a large excess of normal DNA (Clark 2007). Thus, elevated expression of certain genes may be detected through detection of the presence of hypomethylated sequences in tissue, bodily fluid or other patient samples. 5 Epigenetic alterations and chromatin changes in cancer are also evident in the altered association of modified histones with specific genes(Esteller, 2007); for example activated genes are often found associated with histone H3 that is acetylated on lysine 9 and methylated on lysine 4. The use of antibodies targeted to altered histones allows for the isolation of DNA associated with particular chromatin states and has potential use in 10 cancer diagnosis. (vi) Determining altered expression of protein neoplastic markers on the cell surface, for example by immunohistochemistry. 15 (vii) Determining altered protein expression based on any suitable functional test, enzymatic test or immunological test in addition to those detailed in points (iv) and (v) above. A person of ordinary skill in the art could determine, as a matter of routine procedure, the appropriateness of applying a given method to a particular type of biological sample. 20 Without limiting the present invention in any way, and as detailed above, gene expression levels can be measured by a variety of methods known in the art. For example, gene transcription or translation products can be measured. Gene transcription products, i.e., RNA, can be measured, for example, by hybridization assays, run-off assays., Northern blots, or 25 other methods known in the art. Hybridization assays generally involve the use of oligonucleotide probes that hybridize to the single-stranded RNA transcription products. Thus, the oligonucleotide probes are complementary to the transcribed RNA expression product. Typically, a sequence-specific 30 probe can be directed to hybridize to RNA or cDNA. A "nucleic acid probe", as used herein, can be a DNA probe or an RNA probe that hybridizes to a complementary sequence. One of skill in the art would know how to design such a probe such that sequence specific hybridization will occur. One of skill in the art will further know how to quantify the amount of sequence specific hybridization as a measure of the amount of gene expression for the gene 35 was transcribed to produce the specific RNA. The hybridization sample is maintained under conditions that are sufficient to allow specific hybridization of the nucleic acid probe to a specific gene expression product. "Specific hybridization", as used herein, indicates near exact hybridization (e.g., with few if any - 67 mismatches). Specific hybridization can be performed under high stringency conditions or moderate stringency conditions. In one embodiment, the hybridization conditions for specific hybridization are high stringency. For example, certain high stringency conditions can be used to distinguish perfectly complementary nucleic acids from those of less 5 complementarity. "High stringency conditions", "moderate stringency conditions" and "low stringency conditions" for nucleic acid hybridizations are explained on pages 2.10.1-2.10.16 and pages 6.3.1-6.3.6 in Current Protocols in Molecular Biology (Ausubel, F. et al., "Current Protocols in Molecular Biology", John Wiley & Sons, (1998), the entire teachings of which are incorporated by reference herein). The exact conditions that determine the stringency of 10 hybridization depend not only on ionic strength (e.g., 0.2.times.SSC, 0.1 .times.SSC), temperature (e.g., room temperature, 42 0 C., 68'C.) and the concentration of destabilizing agents such as formamide or denaturing agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non 15 identical sequences. Thus, equivalent conditions can be determined by varying one or more of these parameters while maintaining a similar degree of identity or similarity between the two nucleic acid molecules. Typically, conditions are used such that sequences at least about 60%, at least about 70%, at least about 80%, at least about 90% or at least about 95% or more identical to each other remain hybridized to one another. By varying hybridization conditions 20 from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions that will allow a given sequence to hybridize (e.g., selectively) with the most complementary sequences in the sample can be determined. Exemplary conditions that describe the determination of wash conditions for moderate or low 25 stringency conditions are described in Kraus, M. and Aaronson, S., 1991. Methods Enzymol., 200:546-556; and in, Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, (1998). Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each *C. by which the final wash 30 temperature is reduced (holding SSC concentration constant) allows an increase by 1% in the maximum mismatch percentage among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in Tm of about 17 0 C. Using these guidelines, the wash temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought. For example, a low stringency wash can 35 comprise washing in a solution containing 0.2.times.SSC/0. 1% SDS for 10 minutes at room temperature; a moderate stringency wash can comprise washing in a pre-warmed solution (42'C) solution containing 0.2.times.SSC/0. 1% SDS for 15 minutes at 42 0 C.; and a high stringency wash can comprise washing in pre-warmed (68*C.) solution containing - 68 O.l.times.SSC/0.1% SDS for 15 minutes at 68 0 C. Furthermore, washes can be performed repeatedly or sequentially to obtain a desired result as known in the art. Equivalent conditions can be determined by varying one or more of the parameters given as an example, as known in the art, while maintaining a similar degree of complementarity between the target nucleic 5 acid molecule and the primer or probe used (e.g., the sequence to be hybridized). A related aspect of the present invention provides a molecular array, which array comprises a plurality of: 10 (i) nucleic acid molecules comprising a nucleotide sequence corresponding to any one or more of the neoplastic marker genes hereinbefore described or a sequence exhibiting at least 80% identity thereto or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 15 (ii) nucleic acid molecules comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 20 (iii) nucleic acid probes or oligonucleotides comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or 25 (iv) probes capable of binding to any one or more of the proteins encoded by the nucleic acid molecules of (i) or a derivative, fragment or, homologue thereof wherein the level of expression of said marker genes of (i) or proteins of (iv) is indicative of the neoplastic state of a cell or cellular subpopulation derived from the large intestine. 30 Preferably, said percent identity is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%. Low stringency includes and encompasses from at least about 1% v/v to at least about 15% 35 v/v- formamide and from at least about IM to at least about 2M salt for hybridisation, and at least about IM to at least about 2M salt for washing conditions. Alternative stringency conditions may be applied where necessary, such as medium stringency, which includes and encompasses from at least about 16% v/v at least about 30% v/v formamide and from at least about 0.5M to at least about 0.9M salt for hybridisation, and at least about 0.5M to at least - 69 about 0.9M salt for washing conditions, or high stringency, which includes and encompasses from at least about 31% v/v to at least about 50% v/v formamide and from at least about 0.01M to at least about 0.15M salt for hybridisation, and at least about 0.01M to at least about 0. 15M salt for washing conditions. In general, washing is carried out at Tm = 69.3 + 0.41 (G 5 + C) % [19] = -1 2-C. However, the Tm of a duplex DNA decreases by 1 C with every increase of 1% in the number of mismatched based pairs (Bonner et al (1973) J. Mol. Biol. 81:123). Preferably, the subject probes are designed to bind to the nucleic acid or protein to which they 10 are directed with a level of specificity which minimises the incidence of non-specific reactivity. However, it would be appreciated that it may not be possible to eliminate all potential cross-reactivity or non-specific reactivity, this being an inherent limitation of any probe based system. 15 In terms of the probes which are used to detect the subject proteins, they may take any suitable form including antibodies and aptamers. A library or array of nucleic acid or protein probes provides rich and highly valuable information. Further, two or more arrays or profiles (information obtained from use of an 20 array) of such sequences are useful tools for comparing a test set of results with a reference, such as another sample or stored calibrator. In using an array, individual probes typically are immobilized at separate locations and allowed to react for binding reactions. Primers associated with assembled sets of markers are useful for either preparing libraries of sequences or directly detecting markers from other biological samples. 25 A library (or array, when referring to physically separated nucleic acids corresponding to at least some sequences in a library) of gene markers exhibits highly desirable properties. These properties are associated with specific conditions, and may be characterized as regulatory profiles. A profile, as termed here refers to a set of members that provides diagnostic 30 information of the tissue from which the markers were originally derived. A profile in many instances comprises a series of spots on an array made from deposited sequences. A characteristic patient profile is generally prepared by use of an array. An array profile may be compared with one or more other array profiles or other reference profiles. The 35 comparative results can provide rich information pertaining to disease states, developmental state, receptiveness to therapy and other information about the patient. Another aspect of the present invention provides a diagnostic kit for assaying biological samples comprising an agent for detecting one or more neoplastic markers and reagents useful - 70 for facilitating the detection by said agent. Further means may also be included, for example, to receive a biological sample. The agent may be any suitable detecting molecule. The present invention is further described by the following non-limiting examples: 5 EXAMPLE 1 Methods and Materials 10 Affymetrix GeneChip data Gene expression profiling data and accompanying clinical data was purchased from GeneLogic Inc (Gaithersburg, MD USA). For each tissue analyzed, oligonucleotide microarray data for 44,928 probesets (Affymetrix HGU133A & HGU133B, combined), experimental and clinical descriptors, and digitally archived microscopy images of 15 histological preparations were received. A quality control analysis was performed to remove arrays not meeting essential quality control measures as defined by the manufacturer. Transcript expression levels were calculated by both Microarray Suite (MAS) 5.0 (Affymetrix) and the Robust Multichip Average (RMA) normalization techniques 20 (Affymetrix. GeneChip expression data analysis fundamentals. Affymetrix, Santa Clara, CA USA, 2001; Hubbell et al. Bioinformatics, 18:1585-1592, 2002; Irizarry et al. Nucleic Acid Research, 31, 2003)MAS normalized data was used for performing standard quality control routines and the final data set was normalized with RMA for all subsequent analyses. 25 Univariate differential expression Differentially expressed gene transcripts were identified using a moderated t-test implemented in the limma library downloaded from the Bioconductor repository for R. (G. K. Smyth. Statistical Applications in Genetics and Molecular Biology, 3(l):Article 3, 2004; G K Smyth. Bioinformatics and Computational Biology Solutions using R and Bioconductor. Springer, 30 New York, 2005). Significance estimates (p-values) were corrected to adjust for multiple hypothesis testing using the Bonferonni correction. Tissue specific expression patterns To construct a filter for hypothetically 'turned on' gene expression the mean expression level 35 for all 44,928 probesets across the full range of 454 tissues was first estimated. To estimate an expression on/off threshold, the 44,928 mean values were ranked and the expression value equivalent to the 30th percentile across the dataset calculated. This arbitrary threshold was chosen because it was theorized that the majority of transcripts (and presumably more than 30%) in a given specimen should be transcriptionally silenced. Thus this threshold represents - 71 a conservative upper bound for what is estimated as non-specific, or background, signal. Gene symbol annotations To map Affymetrix probeset names to official gene symbols the annotation metadata 5 available from Bioconductor was used. hgul33plus2 library version 1.16.0, which was assembled using Entrez Gene data downloaded on 15 March 2007, was used. Estimates ofperformance characteristics Diagnostic utility for each table of markers shown herein was estimated including: sensitivity, 10 specificity, positive predictive value, negative predictive value, likelihood ratio positive, likelihood ratio negative. These estimates were calculated in the same data used to discover the markers and will therefore potentially overestimate the performance characteristics in future tissue samples. To improve the generalisabilty of the estimates a modified jackknife resampling technique was used to calculate a less biased value for each characteristic. 15 Results A range of univariate statistical tests were applied on Affymetrix oligonucleotide microarray data to reveal human genes that could be used to discriminate colorectal neoplastic tissues from non-neoplastic tissues. There were further identified a number of gene transcripts that 20 appear to be useful for differentiating colorectal adenomas from colorectal carcinoma. Also identified were a subset of these transcripts that may have particular diagnostic utility because due to the protein products being either secreted or displayed on the cell surface of epithelial cells. Finally, there were identified a further subset of transcripts expressed specifically in neoplastic tissues and at low- or near-background levels in non-neoplastic tissues. 25 Genes differentially expressed in neoplastic tissues From a total GeneChip set of 44,928 probesets it was determined that over 11,000 probesets were differentially expressed by moderated t-test using the limma package in BioConductor (G. K. Smyth, 2004 supra) employing conservative (Bonferroni) multiple test correction. 30 When this list was further filtered to include only those probesets demonstrating a 2-fold or greater mean expression change between the neoplastic and non-neoplastic tissues, 206 probesets were found to be expressed higher in neoplasias relative to normals. These 205 probesets were annotated using the most recent metadata and annotation packages 35 available for the chips. The 205 overexpressed probesets were mapped to 174 gene symbols. A-expression ProbeSet ID Gene Symbol Maps UP 205 157 - 72 Hypothetical markers specific for colorectal neoplasia While differential gene expression patterns are useful for diagnostic purposes, this project also seeks to identify diagnostic proteins shed into the lumen of the gut by neoplastic colorectal 5 epithelia. To discover candidate proteins the list of differentially expressed transcripts were filtered with a selection criteria aimed at identifying markers specifically expressed in colorectal neoplasia tissues. This filter criteria is based on a theoretical assumption that most genes on the GeneChip will be turned 'off and that any microarray signals for such 'off transcripts will reflect technical assay background and non-specific oligonucleotide binding. 10 Accordingly, to select genes specifically expressed in neoplastic tumours (i.e. 'on') the non neoplastic signals were compared with a hypothetical background signal threshold from across all genes on the chip. By design, all transcripts in the candidate pool from which the 'on' transcripts are chosen are at least two fold overexpressed in the diseased tissues. Combined, it is hypothesized that these criteria yield the subset of differentially expressed 15 genes that are specifically expressed in neoplasia. The expression profile for a representative 'on' transcript is shown in Figure 1. Genes differentially expressed between adenomas and cancer tissues There were 33 transcripts observed that were differentially expressed at least two-fold higher 20 in adenoma tissues relative to cancer tissues. In particular, there were identified several transcripts that exhibit an expression pattern specific for adenomas, including SLITRK6 and LlTDI, shown in Figure 2. Further, there were also identified cancer specific transcripts. The expression profile of one 25 such transcript, COL1 lA1 is shown in Figure 3. EXAMPLE 2 Probesets Elevated In Neoplasia 30 Differential expression analysis was applied to Affymetrix gene chip data measuring RNA concentration in 454 colorectal tissues including 161 adenocarcinoma specimens, 29 adenoma specimens, 42 colitis specimens and 222 non-diseased tissues. Using conservative corrections for multiple hypothesis testing, it was determined that over 25% of the 44,928 probesets 35 measured in each tissue experiment were differentially expressed between the 190 neoplasia specimens and 264 non-neoplasia controls. To identify robust biomarkers for colorectal neoplasia the list of putative probeset biomarkers were further filtered to include only those probesets shown to be expressed at least 2-fold higher in neoplastic vs. non-neoplastic tissues.
- 73 205 probesets hybridising to approximately 157 putative genes were observed to be expressed at a statistically significant higher level in neoplastic tissues relative to non-neoplastic controls. 5 Validation/Hypothesis testing To validate these discovery results the hybridisation of 199 candidate probesets were measured against RNA extracts from 68 clinical specimens comprising 19 adenomas, 19 adenocarcinomas, and 30 non-diseased controls using a custom-designed 'Adenoma Gene Chip'. Six (6) probesets were not tested as they were not included on the custom design. It 10 was confirmed that 186/199 (88%) of the target probesets or probesets which also hybridise to the target locus were likewise differentially expressed (P < 0.05) in these independently derived tissues. The results of testing these probesets in 68 independently collected clinical specimens is shown in Table 1. 15 We further tested the 142 of the 157 unique gene loci to which the 205 probesets are understood to hybridise. We note the remaining 15 gene symbols were not represented in the validation data. We observed that 133 of 142 gene symbols were represented in the validation data by at least one differentially expressed probeset and many symbols included multiple probesets against regions across the putative locus. A complete list of probesets that bind to 20 target loci is shown in Table 2. Conclusion The candidate probesets shown in Tables 1 and 2 are differentially expressed in neoplastic colorectal tissues compared to non-neoplastic controls. 25 EXAMPLE 3 Probesets Demonstrating A Neoplasia-Specific Profile 30 During analysis of the data, a novel expression profile was observed between neoplastic and non-neoplastic phenotypes. It was hypothesized that a subset of quantitatively differentially expressed probesets are furthermore qualitatively differentially expressed. Such probesets show evidence of a neoplasia-specific gene expression profile, i.e. these probesets appear to be expressed above background levels in neoplastic tissues only. This observation and the 35 resulting hypothesis are based on two principles: 1. That the majority of human transcripts that are present on a genome-wide GeneChip (e.g. U 133) will not likely be expressed in the colorectal mucosa; and - 74 2. That microarray binding intensity for such 'off probesets to labelled cRNA will reflect technical assay background, i.e. non-specific oligonucleotide binding. To generate a list of neoplasia specific probesets the non-neoplastic intensity of differentially 5 expressed probesets was compared with a hypothetical background signal threshold from across all probesets on the chip. Bydesign, all probesets in the candidate pool from which the 'on' transcripts are chosen are at least two fold over-expressed in the diseased tissues. Combined, these criteria yield the subset of differentially expressed transcript species that are specifically expressed in neoplasia. 10 Validation/Hypothesis Testing' The custom gene chip design precludes testing the hypothetically neoplasia-specific probesets using the same principles as used for discovery. In particular, the custom gene chip (by design) does not contain a large pool of probesets anticipated to hybridise to hypothetically 15 'off /'non-transcribed' gene transcripts. This is because the custom gene chip design is biased toward differentially expressed transcripts in colorectal neoplastic tissues. The usual differential expression testing (limma) was therefore to these candidate probesets for neoplasia-specific transcripts. Of the 33 probesets on the custom gene chip, 32 probesets 20 (or probesets which bind to the same locus) were differentially expressed between the 38 neoplastic tissues (adenoma & cancer) and non-neoplastic controls. The results of these validation experiments is shown in Table 3. All probesets which are known to hybridise to the gene loci to which the 33 probesets claimed 25 herein were tested. Of the 32 putative gene loci targeted by the probesets, 29 were present in the validation data. Twenty-eight (28) of these 29 gene symbols demonstrated at least one hybridising probeset which was differentially expressed in the neoplastic tissues. Results for these experiments, including all probesets that bind to each target locus in a differentially expressed manner are shown in Table 4. 30 EXAMPLE 4 Probesets Useful For Characterizing Neoplastic Tissues 35 Differential expression analysis was applied to Affymetrix gene chip data measuring RNA concentration in neoplastic tissues including 161 adenocarcinoma specimens and 29 adenoma specimens. It was observed that 43 probesets hybridizing to approximately 33 putative gene symbols were expressed higher (P < 0.05) in adenoma tissues relative to cancer tissues.
- 75 Conversely, 145 probesets (104 gene symbols) were identified to be expressed higher in cancer relative to adenomas. Validation/Hypothesis testing 5 188 (43 + 145) of these probesets were then measured in a set of independent clinical specimens including 19 adenoma tissues and 19 cancer tissues. It was confirmed that 158 (30 + 128) of the target probesets (or probesets against the same gene locus) were likewise differentially expressed (P < 0.05) in these independently-derived tissues. Probesets elevated in adenoma and cancers relative to each other are shown in Table 5 and Table 6 respectively. 10 It was further observed that 137 (33+104) gene loci are diagnostically useful for discriminating colorectal adenomas and cancers relative to each phenotype. The validation data included probesets designed to hybridise to 128 of these candidate gene symbols. It was observed that 21 of the 31 genes elevated in adenomas relative to cancers were likewise 15 differentially expressed by at least one probeset. Of the 97 gene symbols elevated in cancer relative to adenoma it was confirmed that 89 gene symbols demonstrated at least one probeset in the validation data to be likewise differentially expressed. The validation testing of the adenoma and cancer elevated gene loci is shown in Table 7 and Table 8, respectively. 20 Conclusion It was concluded that the candidate probesets shown in FIXME are differentially expressed between adenomatous and adenocarcinoma tissues and thus useful for distinguishing these tissues. Gene transcripts that hybridise to these probesets are thus diagnostically informative in a clinical setting to classify such neoplastic tissues. 25 EXAMPLE 5 Materials And Methods For Examples 2 To 3 30 Gene expression profiling data measured in 454 colorectal tissue specimens including neoplastic, normal and non-neoplastic disease controls was purchased from GeneLogic Inc (Gaithersburg, MD USA). For each tissue specimen an Affymetrix (Santa Clara, CA USA) oligonucleotide microarray data totalling 44,928 probesets (HGUl33A & HGUl33B, combined), experimental and clinical descriptors, and digitally archived microscopy images 35 of histological preparations was received. Prior to applying discovery methods to these data extensive quality control methods were carried out, including statistical exploration, review of clinical records for consistency and histopathology audit of a random sample of arrays. Microarrays that did not meet acceptable quality criteria were removed from the analysis.
- 76 Hypothesis testing Candidate transcription biomarkers were tested using a custom oligonucleotide microarray of 25-mer oligonucleotide probesets designed to hybridise to candidate RNA transcripts 5 identified during discovery. Differential expression hypotheses were tested using RNA extracts derived from independently collected clinical samples comprising 30 normal colorectal tissues, 19 colorectal adenoma tissues, and 19 colorectal adenocarcinoma tissues. Each RNA extract was confirmed to meet strict quality control criteria. 10 Colorectal tissue specimens All tissues used for hypothesis testing were obtained from a tertiary referral hospital tissue bank in metropolitan Adelaide, Australia (Repatriation General Hospital and Flinders Medical Centre). Access to the tissue bank for this research was approved by the Research and Ethics Committee of the Repatriation General Hospital and the Ethics Committee of Flinders 15 Medical Centre. Informed patientconsent was received for each tissue studied. Following surgical resection, specimens were placed in a sterile receptacle and collected from theatre. The time from operative resection to collection from theatre was variable but not more than 30 minutes. Samples, approximately 125mm3 (5x5x5mm) in size, were taken from 20 the macroscopically normal tissue as far from pathology as possible, defined both by colonic region as well as by distance either proximal or distal to the pathology. Tissues were placed in cryovials, then immediately immersed in liquid nitrogen and stored at -1 50C until processing. 25 RNA extraction RNA extractions were performed using Trizol(R)reagent (Invitrogen, Carlsbad, CA, USA) as per manufacturer's instructions. Each sample was homogenised in 300pL of Trizol reagent using a modified Dremel drill and sterilised disposable pestles. An additional 200pL of Trizol reagent was added to the homogenate and samples were incubated at RT for 10 30 minutes. lOO1L of chloroform was then added, samples were shaken vortexed for 15 seconds, and incubated at RT for 3 further minutes. The aqueous phase containing target RNA was obtained by centrifugation at 12,000 rpm for 15 min, 40C. RNA was then precipitated by incubating samples at RT for 10 min with 250pL of isopropanol. Purified RNA precipitate was collected by centrifugation at 12,000 rpm for 10 minutes, 40C and supernatants were 35 discarded. Pellets were then washed with lmL 75% ethanol, followed by vortexing and centrifugation at 7,500g for 8 min, 40C. Finally, pellets were air-dried for 5 min and resuspended in 80pL of RNase free water. To improve subsequent solubility samples were incubated at 55*C for 10 min. RNA was quantified by measuring the optical density at - 77 A260/280 nm. RNA quality was assessed by electrophoresis on a 1.2% agarose formaldehyde gel. Gene Chip processing 5 To test hypotheses related to biomarker candidates for colorectal neoplasia RNA extracts were assayed using a custom GeneChip designed in collaboration with Affymetrix (Santa Clara, CA USA). These custom GeneChips were processed using the standard Affymetrix protocol developed for the HU Gene ST 1.0 array described in (Affy:WTAssay). 10 Statistical software and data processing The R statistics environment R and BioConductor libraries (BioConductor, www.bioconductor.org) (BIOC) were used for most analyses. To map probeset IDs to gene symbol on the Custom GeneChip, hgul33plus2 library version 2.2.0, which was assembled using Entrez Gene data downloaded on Apr 18 12:30:55 2008 (BIOC) was used. 15 Hypothesis testing of differentially expressed biomarkers To assess differential expression between tissue classes, the Student's t test for equal means between two samples or the robust variant provided by the limma library (Smyth)(limma) was used. The impact of false discovery due to multiple hypothesis testing was mitigated by 20 applying a Bonferroni adjustment to P values in the discovery process (MHT:Bonf). For hypotheses testing the slightly less conservative multiple hypothesis testing correction of Benjamini & Hochberg, which aims to control the false discovery rate of solutions(MHT:BH) was applied. 25 Discovery of tissue-specific gene expression patterns Discovery methods using gene expression data often yield numerous candidates, many of which are not suitable for commercial products because they involve subtle gene expression differences that would be difficult to detect in laboratory practice. Pepe et al. note that the 'ideal' biomarker is detectable in tumor tissue but not detectable (at all) in non-tumour tissue 30 (Pepe:biomarker:development.) To bias toward candidates that meet this criterion, an analysis method was developed that aims to enrich the candidates for biomarkers whose qualitative- absence or presence measurement is diagnostic for the phenotype of interest. This method attempts to select candidates that show a prototypical 'turned-on' or 'turned-off pattern relative to an estimate of the background/noise expression across the chip. Such RNA 35 transcripts are more likely to correlate with downstream translated proteins with diagnostic potential or to predict upstream genomic changes (e.g. methylation status) that can be used diagnostically. This focus on qualitative rather than quantitative outcomes may simplify the product development process for such biomarkers.
- 78 The method is based on the assumption that the pool of extracted RNA species in any given tissue (e.g. colorectal mucosae) will specifically bind to a relatively small subset of the full set of probesets on a GeneChip designed to measure the whole genome. On this assumption, it is 5 estimated that most probesets on a full human gene chip will not exhibit specific, high intensity signals. This observation is utilised to approximate the background or 'non-specific binding' across the chip by choosing a theoretical level equal to the value of e.g. lowest 30% quantile of the 10 ranked mean values. This quantile can be arbitrarily set to some level below which a reasonable assumption is made that the signals do not represent above-background RNA binding. Finally, this background estimate is used as a threshold to estimate the 'OFF' probesets in an experiment for, say, the non-neoplastic tissue specimens. 15 Conversely, probesets which are 1) expressed above this theoretical threshold level and 2) at differentially higher levels in the tumour specimens may be a tumour specific candidate biomarker. In this case the concept of 'fold-change' thresholds can also be conveniently applied to further emphasize the concept of absolute expression increases in a putatively 'ON' probeset. 20 Given the assumption of low background binding for a sizeable fraction of the measured probesets, this method was only used in the large GeneLogic data and discovery. To construct a filter for hypothetically 'turned on' biomarkers in the GeneLogic discovery data, the mean expression level for all 44,928 probesets was first estimated across the full range of 25 454 tissues. The 44,928 mean values were then ranked and the expression value equivalent to the 25th percentile across the dataset calculated. This arbitrary threshold was chosen because it was theorized that the majority of transcripts (and presumably more than 25%) in a given specimen should exhibit low concentration which effectively transcriptional silence. Thus this threshold represents a conservative upper bound for what is estimated as non-specific, or 30 background, expression. EXAMPLE 6 Determine Gene Identity of a Nucleic Acid Sequence of Interest which is Define by an 35 Affymetrix Probeset BLAST the sequence of interest using online available Basic Local Alignment Search Tools [BLAST]. e.g. NCBI/BLAST - 79 (http://blast.ncbi.nlm.nih.gov/Blast.cgi) (a) Select "Human" in BLAST ASSEMBLED GENOMES on the web page http://blast.ncbi.nlm.nih.gov/Blast.cgi (b) Leave the default settings, i.e.: 5 e Database: Genome (all assemblies) * Program: megaBLAST: compare highly related nucleotide sequences * Optional parameters: Expect: 0.01, Filter: default, Descriptions: 100, Alignments: 100 (c) Copy/Paste Sequence into the "BLAST" window 10 (d) Click "Begin Search" (e) Click "View Report" Assessment of the Open BLAST Search Results Multiple significant sequence alignments may be identified when "blasting" the sequence. 15 Identify gene nomenclature of the identiied sequence match (a) Click the link to one of the identified hits (b) The new page will schematically depict the position of the hit on one chromosome. It will be apparent which gene is hit. 20 (c) Retrieve the "hit" sequence clicking on the link (d) Do a search for the gene in the provided "search" window. This provides the gene nucleotide coordinates for the gene. Determine promiscuity of Sequence 25 (a) Open the NCBI/BLAST tool, (http://blast.ncbi.nim.nih.gov/Blast.cgi) (b) Click on "nucleotide Blast" under "basic BLAST" (c) Copy/paste the sequence of interest into the "Query Sequence" window (d) Click "Blast". 30 Assessment of the nBLAST Search Results of the Sequence (a) The nBLAST exercise with the Sequence may result in multiple Blast hits of which some accession entry numbers are listed in "Description". (b) These hits should be reviewed. 35 Determine location of the Sequence in the Gene The Ensembl database is an online database, which produces and maintains automatic annotation selected eukaryotic genomes (www.ensembl.org/index.html) -80 Identify location of the Sequence in the Gene (a) Set "Search" to Homo Sapiens, Type "the gene name" in the provided Search Field Ensemble.org/index.html) (b) Click "Go" 5 (c) Click the "Vega protein-coding Gene: OTTHUMGOOOOOO 144184" link to get an annotation report (d) Click on "Gene DAS Report" to retrieve information regarding Alternative splice site database: Type "the gene name" in search field * Click on "the gene entry" 10 0 Scroll down to "evidence" * Review alternative splice sites " Click "Confirmed intron/exons" to get a list of coordinates for the exons & introns. 15 Alternative splicing and/or transcription The AceView Database provides curated and non-redundant sequence representation of all public mRNA sequences. The database is available through NCBI: http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/ 20 Further investigation of the Gene mRNA transcripts (a) Type "the gene name" into the provided "search" field (b) Click "Go" (c) The following information is available from the resulting entry in AceView: " The number of cDNA clones from which the gene is constructed (ie originated 25 from experimental work involving isolation of mRNA) e The mRNAs predicted to be produced by the gene * The existence of non-overlapping alternative exons and validated alternative polyadenylation sites * The existence of truncations 30 0 The possibility of regulated alternate expression " Introns recorded as participating in alternatively splicing of the gene (d) Classic splice site motives -81 Application of Method to LOC643911/hCG_1815491 Materials and Methods 5 Extraction of RNA RNA extractions were performed using Trizol(R) reagent (Invitrogen, Carlsbad, CA, USA) as per manufacturer's instructions. Each sample was homogenised in 300pL of Trizol reagent using a modified dremel drill and sterilised disposable pestles. Additional 200pL of Trizol 10 reagent was added to the homogenate and samples were incubated at RT for 10 minutes. 10pL of chloroform was then added, samples were shaken vortexed for 15 seconds, and incubated at RT for 3 further minutes. The aqueous phase containg target RNA was obtained by centrifugation at 12,000 rpm for 15 min, 400 C. RNA was then precipitated by incubating samples at RT for 10 min with 250pL of isopropanol. Purified RNA precipitate was collected 15 by centrifugation at 12,000 rpm for 10 minutes, 400 C and supernatants were discarded. Pellets were then washed with 1mL 75% ethanol, followed by vortexing and centrifugation at 7,500g for 8 min, 40- C . Finally, pellets were air-dried for 5 min and resuspended in 8OpL of RNase free water. To improve subsequent solubility samples were incubated at 55o C for 10 min. RNA was quantified by measuring the optical density at A260/280 nm. RNA quality was 20 assessed by electrophoresis on a 1.2% agarose formaldehyde gel. Gene Chip processing RNA samples to analyze on Human Exon 1.0 ST GeneChips were processed using the 25 Affymetrix WT target labeling and control kit (part# 900652) following the protocol described in (Affymetrix 2007 P/N 701880 Rev.4). Briefly: First cycle cDNA was synthesized from 1Ong ribosomal reduced RNA using random hexamer primers tagged with T7 promoter sequence and SuperScript II (Invitrogen, Carlsbad CA), this was followed by DNA Polymerase I synthesis of the second strand cDNA. Anti-sense cRNA was then synthesized 30 using T7 polymerase. Second cycle sense cDNA was then synthesised using SuperScript II, dNTP+ dUTP, and random hexamers to produce sense strand cDNA incorporating uracil. This single stranded uracil containing cDNA was then fragmented using a combination of uracil DNA glycosylase (UDG) and apurinic/ apyrimidinic endonucleasel (APE 1). Finally the DNA was biotin labelled using terminal deoxynucleotidyl transferase (TdT) and the 35 Affymetrix proprietary DNA Labeling reagent. Hybridization to the arrays was carried out at 45 0 C for 16-18hours. Washing and staining of the hybridized GeneChips was carried out using the Affymetrix - 82 Fluidics Station 450 and scanned with the Affymetrix Scanner 3000 following recommended protocols. SYBR green basedQuantitative Real Time-PCR 5 Quantitative real time polymerase chain reaction was performed on RNA isolated from clinical samples for the amplification and detection of the various hCG_ 1815491 transcripts. Firstly cDNA was synthesized from 2ug of total RNA using the Applied Biosystems High 10 Capacity Reverse transcription Kit (P/N 4368814). After synthesis the reaction was diluted 1:2 with water to obtain a final volume of 40ul and lul of this diluted cDNA used in subsequent PCR reactions. PCR was performed in a 25ul volume using 12.5ul Promega 2x PCR master mix (P/N 15 M7502), 1.5ul 5uM forward primer, 1.5ul 5uM reverse primer, 7.875ul water, 0.625ul of a 1:3000 dilution of 10,000x stock of SYBR green 1 pure dye (Invitrogen P/N S7567), and lul of cDNA. Cycling conditions for amplification were 950 for 2minutes xl cycle, 95' for 15 seconds and 20 600 for 1 minute x40 cycles. The amplification reactions were performed in a Corbett Research Rotor-Gene RG3000 or a Roche LightCycler480 real-time PCR machine. When the Roche LightCycler480 real-time PCR machine was used for amplification the reaction volume was reduced to 1Oul and performed in a 384 well plate but the relative ratios between all the components remained the same. Final results were calculated using the AACt method 25 with the expression levels of the various hCG_1815491 transcripts being calculated relative to the expression level of the endogenous house keeping gene HPRT. End-point PCR 30 End point PCR was performed on RNA isolated from clinical samples for the various hCG_1815491 transcripts. Conditions were identical to those described for the SYBR green assay above but with the SYBR green dye being replaced with water. The amplification reactions were performed in a MJ Research PTC-200 thermal cycler. 2.5pl of the amplified products were analysed on 2% agarose E-gel (Invitrogen) along with a 100-base pair DNA 35 Ladder Marker.
- 83 Results The nucleotide structure and expression levels of transcripts related to hCG_1815491 was analysed based on the identification of diagnostic utility of Affymetrix probesets 238021_s_at 5 and 238022_at from the gene chip analysis. The gene hCG_1815491 is currently represented in NCBI as a single RefSeq sequence, XM_93911. The RefSeq sequence of hCG_1815491 is based on 89 GenBank accessions from 83 cDNA clones. Prior to March 2006, these clones were predicted to represent two 10 overlapping genes, LOC388279 and LOC650242 (the latter also known as LOC64391 1). In March 2006, the human genome database was filtered against clone rearrangements, co aligned with the genome and clustered in a minimal non-redundant way. As a result, LOC388272 and LOC650242 were merged into one gene named hCG_1815491 (earlier references to hCG_1815491 are: LOC388279, LOC64391 1, LOC650242, XM_944116, 15 AF275804, XM_373688). It has been determined that the Ref Sequence, which is defined by the genomic coordinates 8579310 to 8562303 on human chromosome 16 as defined by the NCBI contig reference NT_010498.15| Hsl6_10655, NCBI 36 March 2006 genome encompasses hCG_1815491. 20 The 10 predicted RNA variants derived from this gene have been aligned with the genomic nucleotide sequence residing in the map region 8579310 to 8562303. This alignment analysis revealed the existence of at least 6 exons of which several are alternatively spliced. The identified exons are in contrast to the just 4 exons specified in the NCBI hCG_1815491 RefSeq XM 93911. Two additional putative exons were also identified in the Ref Sequence 25 by examination of included probesets on Affymetrix Genechip HuGene Exon 1.0 that target nucleotide sequences embedded in the Ref Sequence. The identified and expanded exon intron structure of hCG_1815491 have been used to design specific oligonucleotide primers, which allowed measurement of the expression of RNA variants generated from the Ref Sequence by using PCR-based methodology (Figure 4) 30 EXAMPLE 7 Immunohistochemistry is a useful method for evaluating changes in local expression of up or down-regulated markers in human tissue. 35 Materials and Methods: Four micrometre sections were incubated in a universal decloaking buffer for 75 minutes at 80ptL to expose masked epitopes. Protein expression was determined using an antibody - 84 targeting the C-terminal domain of Mesothelin (MSLN) on colonic biopsies from 30 patients (10 normals, 10 cancers, 10 adenomas). Antibodies were applied for one hour at room temperature. After washing, sections were incubated with polymeric horse- radish peroxidase. Antibody localization was visualized using 3'3' diaminobenzidine. 5 Result: There was a marked upregulation of MSLN in the adenoma and cancer tissues compared to the normal controls. The normal tissues showed mild staining for MSLN in the cytoplasm of the colonic epithelium but the cancer and particularly the adenomas tissues shows significant 10 upregulation of MSLN in their multilayered epithelium. This upregulation was observed in all 10 adenomas tissue and in 9 out of the 10 cancer tissues. These patterns of staining are illustrated in Figures 4, 5 and 6. Conclusion: 15 Elevated expression of MSLN has been detected in colon neoplasia, confirming the upregulation observed in the mRNA expression data and verifying the diagnostic utility of both the MSLN mRNA and protein for detection of colorectal neoplasia. EXAMPLE 8 20 EVIDENCE OF MMP3 PROTEIN EXPRESSION IN STOOLS OF PATIENTS WITH COLORECTAL NEOPLASIA. Affymetrix probeset designated 205828_at was identified to be expressed higher in 190 25 neoplastic tissue specimens relative to 264 non-neoplastic specimens. The probeset 205828_at hybridizes to RNA transcribed from the gene encoding Matrix Metalloproteinase 3 (MMP3) NM_002422. The differential expression profile of probeset 205828_at was further demonstrated by profiling RNA collected from 68 independent clinical specimens comprising 19 adenomas, 19 adenocarcinomas and 30 non-disease controls, Figure 7. 30 Materials and Methods A commercially available bead-suspension immunoassay targeting the protein MMP3 was purchased from R&D Systems (MMP Kit reagents LMPOOO and LMP513) to measure MMP3 concentration in stools of human patients diagnosed with colorectal neoplasia. Proteins were 35 extracted from stool specimens using a phosphate buffered saline wash from 6 non-disease controls, 10 adenoma and 11 adenocarcinoma subjects. The resulting protein extracts were analyzed using the Luminex bead-based suspension MMP3 assay as recommended by manufacturer.
- 85 Results An elevated endogenous expression of MMP3 was observed in stool specimens from patients diagnosed with colon adenomas or adenocarcinomas relative to non-neoplastic controls (Figure 8). 5 Conclusion Measurement of MMP3 protein in bodily fluids such as stool samples is useful for diagnosing colorectal neoplasia. 10 TABLES Probeset designations include both HG-133plus2 probeset IDs and Human Gene 1.OST array probe ids. The latter can be conveniently mapped to Transcript Cluster ID using the Human Gene 1.OST probe tab file provided by Affymetrix 15 (http://www.affymetrix.com/Auth/analysis/downloads/na22/wtgene/HuGene-I O-st v1.probe.tab.zip). Using publicly available software such as NetAffx (provided by Affymetrix), the Transcript Cluster ID may be further mapped to gene symbol, chromosomal location, etc. 20 Table 1. Probesets demonstrated to be expressed higher in neoplastic tissues relative to non-neoplastic controls. TargetPS: Affymetrix HG-U133plus2 probeset id; Symbol: putative gene symbol corresponding to target probeset id - multiple symbol names indicate the possibility of 25 probeset hybridisation to multiple gene targets; Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a 30 convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of neoplasia vs. non-neoplasia; Sens-Spec: Estimate of diagnostic performance corresponding to the ROC curve point demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. 35 Table 2. Evidence of multiple probesets which correspond to gene symbols claimed herein exhibiting RNA concentration differences between neoplasia and non-neoplastic controls. Symbol: gene - 86 symbol; ValidPSUP: Affymetrix probeset IDs demonstrating statistically significant overexpression in neoplastic RNA extracts relative to non-neoplastic controls. Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple 5 hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of neoplasia vs. non-neoplasia; Sens Spec: Estimate of diagnostic performance corresponding to the ROC curve point 10 demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. Example 2 Table 3. 15 Probesets which demonstrate a qualitatively (in addition to quantitative) elevated profile in neoplastic tissues relative to non-neoplastic controls. TargetPS: Affymetrix HG-Ul33plus2 probeset id; Symbol: putative gene symbol corresponding to target probeset id - multiple symbol names indicate the possibility of probeset hybridisation to multiple gene targets; 20 Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is 25 described in (Saunders, 2006); FC: fold change between mean expression level of neoplasia vs. non-neoplasia; Sens-Spec: Estimate of diagnostic performance corresponding to the ROC curve point demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. 30 Table 4. Evidence of multiple probesets which correspond to gene symbols claimed herein exhibiting qualitative changes in RNA concentration in neoplastic tissues. Symbol: gene symbol; ValidPS._UP: Affymetrix probeset IDs demonstrating statistically significant overexpression 35 in neoplastic RNA extracts relative to non-neoplastic controls. Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter -87 estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of neoplasia vs. non-neoplasia; Sens-Spec: Estimate of diagnostic performance corresponding to the ROC curve point demonstrating 5 equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. Table 5 10 Probesets demonstrated to be expressed higher in adenoma tissues relative to cancer tissues. TargetPS: Affymetrix HG-Ul33plus2 probeset id; Symbol: putative gene symbol corresponding to target probeset id - multiple symbol names indicate the possibility of probeset hybridisation to multiple gene targets; Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. 15 Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of adenomas vs. cancers; Sens-Spec: Estimate of 20 diagnostic performance corresponding to the ROC curve point demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. Table 6 25 Probesets demonstrated to be expressed higher in cancer tissues relative to adenoma tissues. TargetPS: Affymetrix HG-Ul33plus2 probeset id; Symbol: putative gene symbol corresponding to target probeset id - multiple symbol names indicate the possibility of probeset hybridisation to multiple gene targets; Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. 30 Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of cancer tissues vs. adenoma tissues; Sens-Spec: 35 Estimate of diagnostic performance corresponding to the ROC curve point demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates.
- 88 Table 7 Evidence of multiple probesets which correspond to gene symbols claimed herein exhibiting RNA concentration differences between adenoma and cancer tissues. Symbol: gene symbol ; 5 ValidPSUP: Affymetrix probeset IDs demonstrating statistically significant overexpression in neoplastic RNA extracts relative to non-neoplastic controls. Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter 10 estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of adenoma tissues vs. cancer tissues; Sens-Spec: Estimate of diagnostic performance corresponding to the ROC curve point demonstrating equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and 15 specificity estimates. Table 8 Evidence of multiple probesets which correspond to gene symbols claimed herein exhibiting 20 RNA concentration differences between cancer and adenoma tissues. Symbol: gene symbol ; ValidPSUP: Affymetrix probeset IDs demonstrating statistically significant overexpression in neoplastic RNA extracts relative to non-neoplastic controls. Signif. FDR: Adjusted p-value for mean difference testing between RNA extracted from neoplasia and non-neoplastic tissues. Adjustment is made using Benjamini & Hochberg correction for multiple hypothesis 25 testing (Benjamini and Hochberg, 1995); D.value50: Diagnostic effectiveness parameter estimate corresponding to the area of a receiver operator characteristic ROC. This parameter provides a convenient estimate of diagnostic utility and is described in (Saunders, 2006); FC: fold change between mean expression level of cancer tissues vs. adenoma tissues; Sens-Spec: Estimate of diagnostic performance corresponding to the ROC curve point demonstrating 30 equal sensitivity and specificity; CI (95): 95% confidence interval of sensitivity and specificity estimates. Those skilled in the art will appreciate that the invention described herein is susceptible to variations and modifications other than those specifically described. It is to be understood 35 that the invention includes all such variations and modifications. The invention also includes all of the steps, features, compositions and compounds referred to or indicated in this specification, individually or collectively, and any and all combinations of any two or more of said steps or features.
- 89 TABLE 1 TargetPS Symbol Signif. D.val5 FC Sens- CI (95) FDR Spec 203256 at CDH3 3.75E-31 4.2476 37.54 98.3 95.5-99.5 200660_at S100A1I:LOC730558:LOC730278: 2.68E-27 3.8969 3.83 97.4 93.9-99.1 LOC729659 201341_at ENCI 1.44E-26 3.667 3.93 96.7 92.6-98.7 212063_at MAPK10:CD44 3.35E-26 3.7161 7.06 96.8 92.9-98.8 217523_at MAPK10:CD44 3.35E-26 3.7151 7.06 96.8 92.9-98.8 201563 at SORD 3.44E-25 3.4659 4.36 95.8 91.3-98.3 202431 s at LOC731404:LOC729194:MYC 5.19E-25 3.6014 4.11 96.4 92.2-98.6 221577_x_at GDF15 5.51E-25 3.5554 5.84 96.2 91.8-98.5 204702_s_at LOC650331:NFE2L3:LOC642996 1.51E-24 3.52 4.57 96.1 91.6-98.4 203961_at NEBL 5.41E-24 3.5292 5.52 96.1 91.7-98.4 203962_s at NEBL 5.41E-24 3.5295 5.52 96.1 91.7-98.4 201338_x at GTF3A 1.74E-22 3.2283 2.92 94.7 89.6-97.6 215091_s at GTF3A 1,74E-22 3.2283 2.92 94.7 89.5-97.6 207850_at CXCL2:CXCL3 5.76E-22 3.2874 7.85 95 90-97.8 222549 at CLDN1 6.1OE-22 3.2561 13.09 94.8 89.7-97.7 204259_at MMP7 7.61E-22 3.2624 69.29 94.9 89.8-97.7 228754_at SLC6A6:LOC728721 1.15E-21 3.2282 5.02 94.7 89.5-97.6 209774_x at CXCL2:CXCL3 1.88E-21 3.2884 7.85 95 90-97.8 218872_at TESC 3.26E-21 3.2134 16.84 94.6 89.5-97.6 212942_s_at KIAA1199 8.12E-21 3.0489 25.16 93.6 88-97 209369_at ANXA3 1.15E-20 3.0417 3.34 93.6 87.9-97 204404_at SLC12A2 1.40E-20 3.0281 3.42 93.5 87.9-96.9 225835_at SLC12A2 1.40E-20 3.0261 3.42 93.5 87.8-96.9 219911 s at SLCO4A1 1.46E-20 3.0968 4.84 93.9 88.4-97.2 203510_at MET 1.50E-20 3.0806 3.35 93.8 88.3-97.1 202936_s_at SOX9 1.63E-20 3.0982 3.85 93.9 88.4-97.2 201416_at SOX4 2.30E-20 3.0179 2.89 93.4 87.8-96.9 201417_at SOX4 2.30E-20 3.0146 2.89 93.4 87.7-96.9 208712_at CCND1 7.06E-20 2.9082 2.85 92.7 86.7-96.4 204351_at SlooP 4.56E-19 2.9463 4.37 93 87.1-96.6 227475_at FOXQ1 4.85E-19 2.9494 10.97 93 87.1-96.6 205983_at DPEP1 5.27E-19 2.8878 23.9 92.6 86.6-96.3 204470 at CXCL2:CXCL1 5.53E-19 2.9257 10.4 92.8 86.8-96.5 212531_at LCN2 5.68E-19 2.9407 14.6 92.9 87-96.5 217867_x at BACE2 5.87E-19 2.9042 3.8 92.7 86.7-96.4 228915_at DACHI 1.05E-18 2.9218 5.81 92.8 86.9-96.5 218704_at RNF43 2.12E-18 2.9218 3.24 92.8 86.9-96.5 202504_at TRIM29 2.36E-18 2.8231 11.01 92.1 85.9-96 241031_at NLF1 3.12E-18 2.7999 8.12 91.9 85.6-95.9 201195_s at SLC7A5:LAT1-3TM 5.11E-18 2.7562 6.31 91.6 85.2-95.7 201656_at ITGA6 7.38E-18 2.6842 2.62 91 84.5-95.3 -90 229215 at ASCL2 7.76E-18 2.7573 7.73 91.6 85.3-95.7 217996_at PHLDA1 1.34E-17 2.7007 5.27 91.2 84.6-95.4 205476_at CCL20 2.03E-17 2.6222 10.14 90.5 83.8-94.9 226360_at ZNRF3 2.03E-17 2.7245 4.91 91.3 84.9-95.6 219956_at GALNT6:ELAI 2.90E-17 2.682 5.27 91 84.4-95.3 201506_at TGFBI 4.14E-17 2.6863 4.31 91 84.5-95.3 212070_at GPR56 7.45E-17 2.5866 2.6 90.2 83.5-94.7 212281_s at LOC731966:LOC729599:TMEM97 1.88E-16 2.583 2.77 90.2 83.4-94.7 202831 at GPX2 4.OOE-16 2.397 3.56 88.5 81.2-93.5 225541 at LOC442108:RPL22L1 4.37E-16 2.5983 4.19 90.3 83.5-94.8 218984_at PUS7:LOC730279 5.26E-16 2.6006 3.06 90.3 83.6-94.8 219630_at PDZK1IP1 5.38E-16 2.506 4.22 89.5 82.5-94.2 202935_s at FLJ37644 1.04E-15 2.4727 3.64 89.2 82.1-94 204401_at KCNN4 1.85E-15 2.4677 3.07 89.1 82-94 222696_at AXIN2 1.90E-15 2.4301 4.44 88.8 81.6-93.7 221923_s_at NPM1 2.22E-15 2.6018 2.18 90.3 83.5-94.8 201666_at TIMP1 2.44E-15 2.4295 3.01 88.8 81.6-93.7 210511 s at INHBA 2.80E-15 2.5137 3.76 89.6 82.6-94.3 223062_s at LOC651255:PSAT1:LOC389173:LO 3.32E-15 2.4895 5.52 89.3 82.3-94.1 C729779:C8orf62 225520_at MTHFD1L 4.46E-15 2.4846 4.04 89.3 82.3-94.1 206224 at CST1 7.04E-15 2.4174 13.01 88.7 81.5-93.6 201014 s at PAICS 1.19E-14 2.3935 2.49 88.4 81.3-93.5 209309at AZGP1:LOC401393 1.31E-14 2.3949 6.19 88.4 81.2-93.5 218796_at C20orf42 1.38E-14 2.4561 2.78 89 82-93.9 60474_at C20orf42 1.38E-14 2.4547 2.78 89 81.9-93.9 219787_s at ECT2 1.42E-14 2.4283 2.82 88.8 81.6-93.7 231832_at WDR51B:GALNT4 1.79E-14 2.4259 2.12 88.7 81.6-93.7 218507_at HIG2 2.50E-14 2.3654 4.25 88.2 80.9-93.3 202286_s at TACSTD2 3.13E-14 2.3701 15.71 88.2 80.9-93.3 205513_at TCNI 4.81E-14 2.3019 15.22 87.5 80.1-92.8 224428_s_at CDCA7:LOC442172 6.41E-14 2.2662 4.27 87.1 79.6-92.5 203124_s at SLC1lA2 7.07E-14 2.3616 2.18 88.1 80.8-93.3 224915_x_at TALDO1:C20orfl99 8.45E-14 2.2732 2.87 87.2 79.7-92.6 226227_x_at TALDO1:C20orfl99 8.45E-14 2.2707 2.87 87.2 79.7-92.6 226835_s at TALDOI:C20orfl99 8.45E-14 2.2746 2.87 87.2 79.7-92.6 200832 s at SCD:LOC651109:LOC645313 1.23E-13 2.2759 4.28 87.2 79.8-92.6 204170_s_at CKS2 2.13E-13 2.2413 4.15 86.9 79.3-92.3 219682_s at TBX3 3.27E-13 2.2498 4.94 87 79.4-92.4 203313_s_at TGIF 3.35E-13 2.175 2.12 86.2 78.4-91.8 201601 x at IFITM1 4.78E-13 2.0659 3.87 84.9 77-90.8 214022 s at IFITMI 4.78E-13 2.065 3.87 84.9 77-90.8 201328_at ETS2 5.23E-13 2.2182 2.36 86.6 79-92.2 201112_s at CSE1L 7.98E-13 2.2336 2.28 86.8 79.2-92.2 210766_s at CSE1L 7.98E-13 2.232 2.28 86.8 79.2-92.2 205361_s at TMEM23:PFDN4:HDAC9 1.69E-12 2.2328 2.24 86.8 79.2-92.3 -91 218086_at NPDC1 2.66E-12 2.169 2.53 86.1 78.4-91.8 206286_s at TDGF1:TDGF3 3.07E-12 2.1205 6.33 85.5 77.7-91.4 204855_at SERPINB5 4.80E-12 2.1265 10.36 85.6 77.8-91.4 203878_s at MMP11 6.30E-12 2.0959 4.55 85.3 77.4-91.1 202833_s at SERPINAl 7.12E-12 2.0804 3.43 85.1 77.1-91 211429_s at SERPINAl 7.12E-12 2.0818 3.43 85.1 77.2-91 228303_at No Symbol 9.19E-12 2.1527 1.61 85.9 78.2-91.6 225767_at No Symbol 9.19E-12 2.1514 1.61 85.9 78.1-91.6 226311_at No Symbol 9.19E-12 2.1527 1.61 85.9 78.1-91.6 226777_at No Symbol 9.19E-12 2.152 1.61 85.9 78.1-91.6 227140_at No Symbol 9.19E-12 2.152 1.61 85.9 78.1-91.6 229802_at No Symbol 9.19E-12 2.1519 1.61 85.9 78.2-91.6 232151_at No Symbol 9.19E-12 2.1534 1.61 85.9 78.1-91.6 213880 at LGR5 1.10E-11 2.0619 7.89 84.9 76.9-90.8 225295_at SLC39A1O 2.32E-11 2.1227 1.93 85.6 77.7-91.3 205470_s_at KLK11 3.11E-11 2.0097 6.04 84.3 76.2-90.3 205174sat QPCT 3.46E-11 2.0435 3.03 84.7 76.6-90.7 222449_at TMEPAI 3.73E- 11 2.0605 2.2 84.9 77-90.8 222450 at TMEPAI 3.73E-11 2.0616 2.2 84.9 76.9-90.9 238021 s at LOC643911 3.73E-11 1.647 1.33 79.5 70.8-86.5 227174 at WDR72 5.22E-11 1.9844 14.11 83.9 75.8-90.1 202779_s_at UBE2S:LOC731049,UBE2S:LOC73 5.62E-11 1.9821 2.4 83.9 75.8-90 1049 219727_at DUOX2 9.34E-11 1.9355 9.27 83.3 75.1-89.6 210445 at FABP6 1.28E-10 1.9366 3.01 83.4 75.1-89.6 205828_at MMP3 1.44E-10 1.8964 15.63 82.8 74.6-89.2 218963_s at KRT23 3.58E-10 1.8856 6.81 82.7 74.5-89.1 223447_at REG4 5.97E-10 1.8052 7.75 81.7 73.2-88.3 238984_at REG4 5.97E-10 1.8036 7.75 81.6 73.3-88.3 204475_at MMPI 1.22E-09 1.8132 10.59 81.8 73.4-88.3 228653_at RP5-875H10.1 1.33E-09 1.8788 1.99 82.6 74.3-89.1 204580_at MMP12 1.99E-09 1.7881 6.24 81.4 73-88.1 203895_at PLCB4 2.29E-09 1.8389 2.37 82.1 73.7-88.6 203896_s_at PLCB4 2.29E-09 1.8414 2.37 82.1 73.8-88.6 235210_s at RPESP 2.38E-09 1.8005 9.07 81.6 73.2-88.2 201468_sat NQO1 2.68E-09 1.6978 2.62 80.2 71.6-87.1 210519 sat NQO1 2.68E-09 1.6973 2.62 80.2 71.6-87 222608_s at ANLN 4.94E-09 1.7543 2.63 81 72.5-87.7 212344_at SULF1 9.70E-09 1.7754 1.74 81.3 72.9-88 212353_at SULFI 9.70E-09 1.7777 1.74 81.3 72.9-88 212354_at SULF1 9.70E-09 1.7751 1.74 81.3 72.8-87.9 201925_s at CD55 1.45E-08 1.7612 2.22 81.1 72.6-87.8 201926_s at CD55 1.45E-08 1.7603 2.22 81.1 72.5-87.8 202954_at PAK3:UBE2C 1.74E-08 1.6455 2.82 79.5 70.9-86.4 209792_s at KLK1O 2.64E-08 1.6142 4.52 79 70.2-86.1 205890 s_at UBD:GABBR1,UBD 4.07E-08 1.6362 6.59 79.3 70.6-86.3 -92 209773_s_at RRM2 4.25E-08 1.6077 2.25 78.9 70.2-86 234331 _s_at FAM84A:LOC653602 5.97E-08 1.6031 1.73 78.9 70.1-86 206976_s_at HSPH1 6.17E-08 1.6476 1.85 79.5 70.8-86.5 202718_at IGFBP2 8.07E-08 1.6348 1.93 79.3 70.6-86.3 225664_at TMEM30A:COL12A1 9.48E-08 1.6379 3.16 79.4 70.7-86.3 231766_s at TMEM30A:COL12A1 9.48E-08 1.6394 3.16 79.4 70.7-86.3 201261_x at BGN 1.01E-07 1.5939 2.69 78.7 69.9-85.9 213905_x at BGN 1.01E-07 1.596 2.69 78.8 69.9-85.8 204127_at RFC3 1.25E-07 1.5497 2.25 78.1 69.2-85.3 207457_s at C6orf21:LY6G6D,C6orf21:LY6G6D 1.34E-07 1.5549 6.64 78.2 69.3-85.4 ,C6orf21:LY6G6D 210052_s_at TPX2 1.56E-07 1.6655 2.03 79.8 71.1-86.7 202859_x at LOC652128:IGHG1:IGHM:IGHV4- 1.99E-07 1.6331 3.1 79.3 70.6-86.3 31:LOC647189:IGHV1 69:IGHA1:IL8:EXOC7:SIX6:IGHD: IGH@:IGHG3:Cl2orf32:ZCWPW2: IF16:IGHG4:IGHA2:IGHG2:RAC1 211506_s_at LOC652128:IGHG1:IGHM:IGHV4- 1.99E-07 1.6324 3.1 79.3 70.6-86.4 31:LOC647189:IGHV1 69:IGHA1:IL8:EXOC7:SIX6:IGHD: IGH@:IGHG3:C12orf32:ZCWPW2: IFI6:IGHG4:IGHA2:IGHG2:RAC1 205479_s_at PLAU 2.25E-07 1.5392 3.5 77.9 69.1-85.1 238017 at RDHE2 2.71E-07 1.5511 2.52 78.1 69.3-85.3 204320_at COLIlAl 2.93E-07 1.5718 2.6 78.4 69.6-85.6 37892_at COL1IA1 2.93E-07 1.57 2.6 78.4 69.5-85.6 203213_at CDC2 3.58E-07 1.5342 2.72 77.8 69-85.1 210559_s at CDC2 3.58E-07 1.5348 2.72 77.9 69-85.1 232252_at DUSP27 4.68E-07 1.5123 4.81 77.5 68.7-84.8 225799_at MGC4677,MGC4677:LOC541471 7.40E-07 1.4933 2.04 77.2. 68.3-84.6 206239_s at SPINKI 1.70E-06 1.4816 2.74 77.1 68.1-84.5 225806_at C14orf94 2.40E-06 1.6542 1.19 79.6 70.9-86.5 204885_s at MSLN 3.78E-06 1.4263 1.93 76.2 67.2-83.7 202998_s at ENTPD4:LOXL2 4.81E-06 1.3717 2.19 75.4 66.3-83 207158_at APOBECI 9.49E-06 1.3811 1.63 75.5 66.4-83.1 218211 s at MLPH 1.16E-05 1.4176 1.47 76.1 67.1-83.6 205366_s at HOXB6 1.33E-05 1.3346 1.87 74.8 65.7-82.5 225681_at CTHRCl 1.83E-05 1.2913 2.57 74.1 64.9-81.8 205815_at REG3A 2.22E-05 1.2228 12.09 73 63.8-80.9 214974_x_at CXCL5 2.79E-05 1.281 4.48 73.9 64.8-81.7 207173_x at CDH11 3.44E-05 1.2644 2.25 73.6 64.4-81.5 209955_s at IFIH1:FAP 6.66E-05 1.2544 2.33 73.5 64.3-81.3 205886_at REGIB 8.08E-05 1.1731 13.15 72.1 62.8-80.1 205713_s at COMP 0.0001 1.1543 1.69 71.8 62.6-79.9 208079_s at AURKA:STK6P 0.0001 1.1173 1.81 71.2 61.8-79.3 209218_at SQLE 0.0001 1.1362 2.69 71.5 62.2-79.6 -93 212190_at SERPINE2 0.0002 1.1279 1.9 71.4 62.1-79.5 219955 at L1TD1 0.0002 1.1579 2.36 71.9 62.6-79.9 236894_at LITD1 0.0002 1.1598 2.36 71.9 62.6-79.9 209875_s at SPP1 0.0004 1.0666 3.32 70.3 60.9-78.5 205910_s at CEL 0.0006 1.0877 1.46 70.7 61.2-78.8 209752_at REGlA 0.0006 1.0018 10.93 69.2 59.8-77.5 213975_s_at LILRA1:LILRB1 0.0013 1.0772 1.55 70.5 61.1-78.7 202310_s at COL1A1 0.0018 1.007 2.5 69.3 59.9-77.5 202311 s at COL1A1 0.0018 1.0058 2.5 69.2 59.8-77.5 217430_x at COLlAl 0.0018 1.006 2.5 69.3 59.9-77.5 221729_at COL5A2 0.0026 1.054 1.36 70.1 60.7-78.4 221730_at COL5A2 0.0026 1.0559 1.36 70.1 60.8-78.4 205825_at PCSK1 0.0029 1.107 1.48 71 61.6-79.2 203860_at PCCA 0.0038 0.9396 1.42 68.1 58.6-76.5 224646 x at RPS12:H19 0.0051 1.0213 1.47 69.5 60.1-77.8 205941 s at COL1OAl 0.0143 0.8495 2.05 66.4 57-75.1 226237 at COL8A1 0.0254 0.8307 1.43 66.1 56.6-74.8 223970 at RETNLB 0.0304 0.8148 1.47 65.8 56.3-74.4 205765 at CYP3A5:CYP3A7 0.0545 0.8252 1.58 66 56.4-74.6 232176 at SLITRK6 0.0576 0.8693 1.31 66.8 57.3-75.4 232481 s at SLITRK6 0.0576 0.8701 1.31 66,8 57.4-75.3 235976_at SLITRK6 0.0576 0.868 1.31 66.8 57.4-75.3 200665_s_at SPARC 0.0684 0.6727 1.45 63.2 53.6-72 205927_s_at CTSE 0.0694 0.7488 1.83 64.6 55.1-73.3 214651 s at HOXA9 0.0759 0.7491 1.41 64.6 55-73.3 204051_s at SFRP4 0.0829 0.6983 1.29 63.7 54.1-72.4 202404_s at COL1A2:LOC728628 0.1891 0.6492 1.9 62.7 53.1-71.6 204620_s at CSPG2 0.9788 0.1834 1.05 53.7 44-63 221731 x at CSPG2 0.9788 0.183 1.05 53.6 44.1-63 203083_at THBS2 0.9844 0.2784 1.26 55.5 46-64.8 214235 at CYP3A5:CYP3A43:CYP3A5P2 0.9932 0.3165 1.16 56.3 46.7-65.6 94 Nl 6C 00 00 N 0 1 ~00 Co a'0 09 tl r- 'IT - '0 cc '.0 WN If W) ~ Wf) a' 'IT -- I/n 'I rjrn 6C o, C CI) \ 00 00;w (N '. I a II (N a' Zt '4 UN uN0 ' : ,0 C.0 hi- (N) u'.0 D n, - m = c - mI 1.0~ ~ ~ C> 00 \CC ,C -r ' D ; o ) 0 In \.U ON 'IT a' tn 00 ) x 00 C, ;j n C I~ u 2 . ICl 00 -O on ca c7N 00 V)~0 In 1'.0 c) L; mCJa Z. 00OJ 0 If.0a 0' 0O U N I 0N I ;; I.IC U. W)U ,oU( 00 00 u p00 '.o V- r.'N 0 al Ua 00 (D C)vt 0 u '.D \0 U' ?~a 4.I00 0. ' co u Lfl( - r' - - U00 00 C-4 \ o r- C- "T NT c) m da 0 - n V - (7 0 r 0m U)IrC TqT"- r 5t '.0 N\ (N v It -~ r- . C NN 0 ~ 00 U(N UT m-- ) W C:, C -4 C)= ur > uCDW : 0C4\ 0 U'0 \C m N- 0 00 U" M-n; a r-r4O ( In -; -;00 U0a' 0 u~a' 0 C4Ir~ 'I w ' OZ U'0 a -- 1 0 W C, - U .4~N( ~ 2 ~ U r r W)U - = U = =Z U. 0 1 !N r - bC I = rUU bU~~ o 00Z~I' 0~ C1 '.0 Lf5 mf z '.0 N a 0 .. N - - O (N0 1 (N4 a' 4 (N Z 0 N 4- N~ ;.j a W) , u .0a l'O(N 1cO 0.0~ r.Nr r- 0e n r C r- '. C0 0r) - 'IT 0 M. 0 U T 0. \0 ~ - r. (T 0 N-0- c N 1. (N a' (N- 0 U U N '. . 0 ' '( 0 0 r rn e0 r- wzt c4 L;~ U - 00 0 0 0 UT 0 N m '( '( C) 00; C' o I U U 'U7NU~'0 ~ C,4 C' - (N. X U)~~~ L. - q\ N- - ;r 6 0Ak 6 A\ ,cwMr 0 ,' \6 U c )0 . - c4m N\omr ,r 95 A~ Nr-: 00 r '.0'.0 00 00cl -00 00N ON 00C 0 N ONC ON C 0% O%% 000 00 000 00% %0 -q NN 000qN 00 0% In 0% 0. C-I 0C0 In CI en% 0% In Nq '-00 C'4 00 c en C- - CnIC -N 0 u n0 N- 10 00I Mn - '.0 r N V. C C)r; c 00 ;_; t I -t C .0 .; 'q- 0% Cs r -c T :z\ on '.0 . ; z-; 00 v- '.p N \IC - M - - , N 1 0 N '.U v-Q o - N '0Z. 0% ' . 0 rI 'n C ON N 0 .'. C%'.0 - - Nf 2 . N Nq Nn rn. U) - C.0 O ) 00 (-i 0) N O0 0 , . D . 00 ' O I'0 -4 t CIT o 00 0 (4\ rL"T 0 C. N00 HbI 10 -0 . 00 In 0)U 10000a m qC*C r T- T e N 0 ,.\.0 C-CIn)I 0
OC
)
N '00 oo r 10 '. 00q ). C=) N C C) U 0 - 0 0- '. C 0 r- Or)r 00 CD( \0 mW N -T C4V' -I Z 't oo IC 5t N 0~ v 000 r) ': c.5 z C\ 'tr '- 00 r C .C4 0 0 0%-7 C 00 0) 0%c '~0 m c C 0)- 10N0 C- - =.M.D 00 C1 -I Or' .) 0 q -I' C ) 00%000 L;00 ) '.oD0-oZn M0 ) 0 1 04.(D )00 z 0C:) u 00 00% -u C.-'- z0 0 00 0C'0'O CI 0 - - 000II-0 0) t (D 0 Nt "1- 0 0CIC-J 00 .4 4 ~Q C-I)( = q .0)0w0)0) N-C-I I' 0 I0 C-4 r- N'.n r \p en (D \' r ) -,T r 0 ~ 00 C4n w'0'00 - OC 00 v M (D U0)0-I0 C N (U Mc-I 0 In ) V ' %0 V ~ I~ m e-0% 0 0 ON 00% '-. -Tj 00rIN- m In0 0)-NN 0)NN 0 0 ~ '. ( %00 0 - 0N ,- I- u 00 W) nN 0) CD .- ' 0q 'I IV) 000 0CIin -N 0 0 c W)It-D OCEn InI% -~' 0 ( W 0 In 'A cn~~ ~~~ IU -WI 10 uo cu u 00)0) N0 ) CN1 I fo n u~ \00uv 0 0 u CO u - 0 0 %0 i0cl0 I0000% 50'3~n 10 ~0Cc% ~ e I 00 en T~ T r- N - N\.o cn C-I '\0 In t~ n 0 N 0 00 00 C-4 0 N~ - - 11 0% z N- 00 'I m) C- - w 00 00 In In ) 0 \.0 N- Nl 0 0) - 0 0%\ - In 0 00 00 a, 0 - 0) m4 I 0 N-u 0) 0 N C 0 00 0) '. T 11 IT k~n - 0It 0\ N- \0 mn m' 0 00r c W pc C pC 00 CIN-IN- CQQ~ N 0 n IT00C-IN 0)N r- ) ON NI U) U5 En cn cn En U) m ) w) w) U) ) En U) I 0)0)0)\ =I 0)0=00)z 0)0)0))0) 0)0000 = 0)000 = 0)00 0) 0)z0z0z0=0 z0) NN 1I r '.0 96 00 -004 o 0 N 00 00 0 u N u X0 M. 000nC C,4 M i r W. . C cq Il Csl: N 1 W . l j IV n I N 00'0C C,; C : , 00w wa)wZ l ) ;jr ;C 00 T.L; -\ ND 0 7 D W r C)C Cl 00 ~ Z~ I000t ( z ~ )l0O. C'J N 00l0 a\ r-~. 00 1r- Cl.4 00- CDCq u C1 00 1: tn 0 .- C\ ,0 = 0C no oC , % N0 1V : 0 U b C: ;5 r- CD \.o Iu - D" ) - .. Z ,-t o \ -46 . )c r ), o cN 5N 1\ C 1 W I 0 a. 1l 00. ia C 0 1~) 0 l Enw I ww I 00 C) u (D I U - Q . 00 O a.> U, ~ a~~ v. u) l00~n II)00N C ) u U U= 0 00 vll0 v)ln 'a-i~ vC~.i a) ln ~ i Ca- u z l =l 00 g 00) =) u -- )a Ua..)o Q)oa a) 0aa IaQ)~ 0
I)
0 C14 Z) 1 ClCl 4 . aa. =0 ~~~jr Nn in cf a. a. 00 Z) ..
-
I '. na 0C0'00 - C a . . na 0i i .a 0 0 C\ u \ \0 I MC 4 00 \ S )CN.c 11 14C, ' C14 - 00 ;.. j4; 0 a)0 ;j~ 0 1 0 0 )0 - 0 0 -. V 0- ;. ;.. 01 0 w 0;. .; W W 1 -00 11N 1u I W I I I .0 I IC-C W W.i e N CJ. 100 1 10 1 W -I a.N ~ a. N 02 Cl . u 0a C> ~ l0l v . na f ~t . N'o ~ ~ 0 ~ 0 - 0 000 r. r. (ON a. N. . r. 0. 0 0 n a(u n a '0a . ~ Q) Ul -000 u. u) u N un a. Vl ul a.rll . 8 a) 0t N un a. =l rl a. = u 4 =l -0 0 a) ~ ~ c Q) a ) a ) a ~ a ) a ) a ) a ) a ) a ) ) a)A v) v) a ) a ) a ) a ) ) a ) a ) a ) C',Cl ,; N~ 4 UAr AmA w c o8 URr-AAr- 6 oNC cN~~~ ~ ~ ~ ~ N NH -- tr n0C -0 C4C t Tr ' DC ) C 1 - C40 1 97 00 00 00N rO: 0ON r- 6N . 1 00 00 00 C 00 0\ 00 C% 00 C 00 (O 00 N*1 N-r 00 C\ 0> 0DCD C 00 -q - - C F 00 C W) 0q 0' NN~ 0 ~ C 0) - ae l . O\ N00 C0D '.0C; W~ L ) -00 't '-* -- N N '4 m C C' 7) '.c. C- I.- - ..a1 inr kn W) r- '.0 a)O'..u - I '~ I14 a.)CNla W ) r-l a.D0N00~ C;\ 00 r 00 r-N 0.D v ''T = ' m -zr-'t"Tr- r0 u -v e L;0- 00~ . 0- (0 ~ Co u- 4ev C - -,- - ) CA(iCl (D' 00~ .~ 00 V -n Ca aC, 0 ~ 00 1 1 I v (UI an - I.L a))( u- O I C) u ao.) - \Z 'T 00 (. 00 Ln C\l.. Is Nn _N0 \o C CO~0 0 0 a0'. 0a.)'0 M Va~ C> C)Ov u~C. d) 00 w0 00 0qlorC 00 00 \U) -- 00 m 1 -0 Q0 W)C r ; ) .a "- o -\. Mo a.) c*e C14 0 a..)a.0 M00 0 W) 0-Cl \ 0 "T~ . r- C'4 OC(~N 0 c C0Q C-4 r.0 - W) uN0- (c lo C' )0 M ; 00 4; r n 4- )v) k 0 . ) 0 0 l~ a)) r u II I I r.0 u u 0 I -*= ( d W0; 'j , Q 0~ C:, '. C ~C~ '0 ~ ~ C ~000 0 0-. 0~ (DPu S-c q 0 C14 M En 00 00 U 98 00 0N 00 CN 00 C, 000'7% 000', 000a\ M. 0', r- 00 00 Cl C ClCl C\ 0', 00 C 6l C4 I N Nl .0 rN 00 0 C14C ClClClC Cl Cl 0', ON 0\ 0N 00 00 W) 00 N- I,' Cl4 00Cl'. 09 0 CY C4 w R .I0 'I C l N . m rq'I Cl -16, 4 ! o- '0 -0',2 C> (. c 0 0 ~ -FlC 00 CN 'I 11=0 - La O 'IT,'. r- V22 00 4\ e sC la2'T - o-' 'I C LoI ~ ' 1L o0a0 ', cq C4 CqCD o V W) 0 0: W~))Cl I 5 C N0~0 2 v \ r tC q Ur 00 n C IIr La "ol\~0 Q) kn cqj u en C3,n~ N I~~ 00', It UcN; In M. (D ' - m. In . Nq L; -=lC ~ a ' Cl - w nI N Ia =o Ma 05 ~ N. La 11 10 Wn Lau - '0 N' I i - pn N In CN wa U~ u N rIn I)C g .z'V C4 UCl0 U ~I'0 ,0 V.l~ = .C Za U U~ 100 ~0 0 0) U V pI 0'.0 10 u- 00In0 .- N r.r.rqr. r.v - . 0 I '0~ r. r. u u C004 0)C 0 ) r-Lt ClC. 0', -c 1 I 0) NjN- 0)'0'0.0
)
~ 0)~ 2-0'Z 0)l-00 ~ 0 0~000~ 000 ~0 0 000 ,Z u0N0 loo-- ~ l0~ r- C 04 00 rC', 0.0' rl',Ir, rl000 .' -. ff a - . u r-. w) V r- W)k )-n t L cq M a a La La La La L u). N ))0 0)0)0)- \o ') 0 ) )100 \ W I 0 )0)0)C )0)0 00)t00 'It C40)0)0 00 00- , 0 -40W )10 0 W r C4 vt 0 0 w 00C 00000 ) 7Na - - 10 L CD0c U 0 IT V o' U 4r vM0zzzC 99 0 0C\ 00 QOO 0 0000 OCO 00O OCN OCNO0C -s ONC \N II% Ct( t- 00 If0 ON N r- ON 00 000 0 N (N N r'. (N 0 LqrnL rq c (N - O i0 -- N N -J u N 0 .0 -~- - OCD'I I N . 00 CD V) 'I 'T eq mO -\ (N 4.0 en~ 00 "'' 1.0 000 'IN W.IIII 0;N rn 0 0 -0 0~I' - ' 1. .0YP zi0( a) cp \cN 0~0L N u~ N-; r- 4 -W m 5 C)a) c1I 00 ~ C 10 It C) c0 CD ON 0120 O (N . ) r- \. C'4) 0 (N 0 G~ rc \C 0~ It V'd Q)u ) N 00 k~rna ON) 00 C 00 0'I V) I 'IN W) ~ W) 0 C \-C0 \- 0 00 r =0 t 00u N t u. v) C14 00 I) r. I) v.\ CD QI N rq "t C:, 7,I ON I cqN~~0 ~ r ( , a)00OtN 0' O U0 M-.~~0 a~) 0 0 -e - N. U Uo('\0 0a~(( 6~'JO C-4Ie~N C)- 04N U \.0* a00 0 O = r- 000 m 0 0 00 S rq r- 0ok zs oo 'N0 c, N a) 0 Z =sO~ 00c)0 00 rq 0 W000 000N0C40 = Z0 0 Q) tn V. IN 00 t ' w In uI I -r - M N uN 00 IOd m f : -)-IT-c'NN oIN 0 (-N w. r-'( 5 na r o ) r u m0W)O0 M~~r 0e'I N Q L 00 CN r'N v u OON('N 0N0CO0ON O zN0 0 cLN 0 L Wcn-cn Enr~ w)O- N 0)( U) CN'0 I)' N W, I c nr O w w 0~~QIS 0\0~ 0 I v I 0 00 =Nr r- u) r. r. 0 .) U~I0) a)Ia)Iar.ar.aoara) 0 A) Cn ON V. 0, m 00 00 u) v)a) a) u) u) CD~ Q) u) u) ON v 0, I,( Z. ON I/N (U C) 0n 'n o 00 II - n C1 00 00 ,-r - 0D V NI)( I N)ON0 IN0 . 0 0'00 0 0t ON , r-N Nu Irn kn N 0 00 'T0 O ON1a)-( O'00 't O0 u 0 ',T ONT ON a) ON Nf N U 00 t- - Q ' r) (Nt Q a) V) o/ (N \0 (Ni/ '0'0 0 NT :N N m . ONi 0l c / N :T0 r) (N = N ON ' 4 N 0 ON (N N, IN r. 0, N j N 0 q-N . W) zt 0 0 0 m N ON C14 A) a) t3 I/r-N00 ON C(N v' 00Or)-IN0- - a) 0 M j ('N 0- W) u I '.0 (N - v a 00 00 0 kn Z 000 CP 0 0 00n Z ON "P 0 c CN 0000 'P P (D 7 00Z 00 CD -, - Z ;.; Z - - :3 - I ZN CD -a)(Na ; = -a)O Za aa-a O a , a -aa) 0 W) M0) 1 -I- 100 N 60 ~ 00 InrN 0%00 C\ 00 ON 00 0% 00 C\000%l 00 0%00 00 C\ CN N 00 00 0% 0 00 00 00 00 ON N' 'zt No NN W% ) 000%N % ON ~0 0 C) 0 000 1-O Cl Cl Cl W) C C~ CI N 0 00 0j~ 6- -- D - CN l0l 00) \00% 1 00 O - r C; N C oq 0% . tn Ul 00 NI r- M .V4 r NC 1 0C . L 0" dC - . . =~ ~ Nq 00) CN - u0 t10 m n0 .I 1 ') Q) % 1 t 00Nr oD 0 n0 - e 7 ' T uMcq10 Z 0 0 0 00)1 e D e TNMW 00 v ,,l 4- N4- . t) It ;..;' ;.0 ;..;r -0 'L;0 ~ 0 'f 0 0 0 0 0 V 10 0 0) d /c I I "C' I I IV) I I 00v , Q 10 (D 10 0 p0u)u00 000 0\ U C1 U% IN v 7 a r 00 kn 0 u Z'I N 00 -~N~5 ~ 0I C) C 0 Cl 0 0 00 CN0O M 00 0 0 -c "t0 : - ow)i- r4 N o 00C l) 0V W D v 0T en0 rn 0 -~ v N CT) 100 000 m INC 000 0(N 0% l r-.VN0 0 u I'll~ u) INW)c 00 4 U* C4~ U, UU ,U ,U ,UV m 00 ZZ z V)) z r 0 I0C (: ( t\ t 00 0 I z0 0 - -C I z I m en N00 ' V ~ - C4No 0.) ON%~ 00 U 0C) = 7 I Q) C- r4 NeN0 00 \0 ID WI 00 It m .. LnC It cI CI u . iN %0 r) IN t C :T 0% c - 0 qZV - cmo\or 0' -:t0 - t r~l-~ 0 nk tc C It0l00 i l CN In~ - 00lr0.0 0ersJN00N0%Irc C 00rp- 0)0C0-- 0) 0r C-4U I: 0%0- 0 -; r4 : 3,C - - (0%0 u~0 C>N, U0000005 0 CD00C~ 000 CS ,I 4%rnc; V - ;;4 ; ONl0 0~ m 5 r) r) "T 0%% 0 Cl m Io k 0r r C)00 C0'It0 ) C14 \0 r- N N 0 W) C O D t0 00 ' -'Tr 0 ~ 0 000cqN r 0 r- ) V * Q)(N 0' l 0 CD U C4 rN )u oo N. c- 5 ,g- r. r- oo w) ;; c,4 w-- 14-um-\4-Jm I o 5 46oo r lqo c T ) o I~ ON 11N1 N rl Io IN ! IN II '1 IN 0 nI~ nm 0C 0 r 0)0ww)ww)w0.w) w00 0) . w 'n)))0)). Ln 0) ) )0n) 0 V0) )1 0 ) 0 ) ) 0 ) ) 0 0 ) ) 0 ) ) 0 0 0 ) 0 ) ) 0 ) ) 0 ) 0 0 ) 0 N 0 ) ) 0 ) 0 0 ) I~ 0 ) 0 0 ) 0 0 ) 000 0~0 000 000 000 00 ~ 000 ~ 00 00 ~~oCo ~I 00 101 N- O00 '.O r- C(N V)~N Nf CN00 C% 00 ON 00 C% 000,l 00 C,00 C\ 000C7 00 ON O00 C\ C 00 00 00 ON00 00 00 00 00 00 00 00 ON (N'C -q 0- rN NT~ 0 n 0 (N ON ~ (Nr-C o 0 T 00 en00 en- 00 00 CO (N 0 (N, 0N 00 00 0 r- C-4 (N N~ (N, 00 Lfl 'CIT 0 (N q N) 0,C N- 00 0) 00 tn -u 0 0 - 00 (N ON . .; . IC I cC N 4N ,I-C r -0t - rq Or -T en C - I N Ul 0,T "U, loC \D r u u, C, , 0 0 - 00 C en 0 CN 0 W)C4 mOI l rn \.o~~ eO.q N 0 N0 00 000 00 r- . .. CO U O N0 N u 3 a* 0 o uO(N 00 0uu- f COu * =~~ ~ ~ ~ WOr' CO In CCO-)CD 0 Dr C 4 I1 .- . I N 0-T **(" - u Z ' 00 b ~ CC r- 0 0;i cn 00 CO V r. O C0 C ~ C ~ uNr~( = [-flCi W) 0) N0 v 0. Qf)~ r-)C( V ~Ce~C ~ OUC%~ uO m0~N CO e- (z CO(-O C> ~0 COON,0 O r ~ - 0 0 C4 U, Uw m Zno 00 c, CC0CO0lN)=0 00 0 .1. CO ICOC vNC C CO O'4~0 N CO NO .OU) C) 0 C) O -- zI - 0r 0 U0 00 CO. en 6N en (= z I f CO CO. W) (N, 10. 0 m 1 W) C14 tM 4o-T rl-, C) Q)~-0 C ICO( v M~ 0 0 N~ ( vC \0 m U IC 0 0 MO CO-C1 r =)~NI- "O -0CC C,\ 5OC00I Z) C /fN~ C(N'C0 .r oo 0 Q.)( k n: ( DN CO0.O :3 V ~00 0) C -4 Q) )O'tn ) C O a)00 O 0 a) cN 0 O 00\ ;- cO 0 W - =s - C - - In . ON r) - C -'--
U
1 5, inN U, 4, wO w, U, U), U, U, CD z, ON 1 in V) 'n0 r4O ~ 0 I loO I~ I U, 00 , 1 U Q.~)) , u0 =00u COOC (U \,COOCCOO CO Q) C CD MCCCCCC~~.~CCC vO r. CO CO5 ~ ~ N ~ N ~ O ~CO 0 ~ ~ Z C .-- NN 0) 0~00 N Q)(0 N)NN~- W)(N~I( COO ) O4O. 0)~N N (N CO WN en r4 ~ ~ ~ r ITO CO ~ C w- 6 CO - - r- oj C - I U IA kA c Iz I U, U, ,U U I 000qCD00Ou0- a0 7 0D (DC\ m00 C) v ) ve 40 00 Ln w w w 0 oj v(u uv v (mu 0 u 0 00 N 0u u v CN 0 C 102 r14en - l - coe5nl C l Cen OC1 OCl 00 ON 00 ON 00OC 00 C 000\ 00 C\ r- C CN00 1- Cl l 4 00 00 00 00 00NN 00 00 00 00 00 00 0N 004 W)l C14 Cl Cl Cl C4 kfj Wr en -'0'0'0' 00 N '0 . 00 en en knC1 ml \n'0 0) eq Cl C4 Cl4 Cl4 Cl 00 cq C\ 0mn - l en \.D Cl aH H Cl U U ~ ~~~ f.0~ ~lH U~ Cl en 0 \.0 1T00 \.O 00 I/n N- \0 N C 00 Ir) 0I '0 00 Cli 00 ,zr mn en 22 U' C\U'(D r Cl ;.;N 00 00 0 n 00 N) I'e N .L n Nl WN '.o 22Ck Cl . c 0, ocq en ll e 'qtC 'e1 v Cl 0 0000 r- 0N I Cl 00 C 00N0 0 len0 W I(~~e 0)0 en ON M-4'. '0 ) - 0) rcl 0)0 Tf O. uN Zi Zi Nz W - 0)0 00 0)'. ~ 00 ( 0O ;~ - N.' N. ON V)W) "2242: i r-- z)0 00 -? '"Te I'. I. 't n n "~ N o' 22t 'j, U~ U' I C' (D p 0): 0I N00 r-'0)~ C1 10 cq fI~C' C ) 0 Cl 0 ) -C cl 0'.. Q 0) U (D u C, V) 0 4-; 0 U V~ ~~~ -n 0 -l~ ~ 0 C - ttD 5'' ''0 I I v) . w 2 00 '0,.. =r .e 0 )0 - en 00 V) M' r- N(D C 14000 ~ l N *0C '.0 0 ) E- u0Ne0 I-7,0 0)Cl 00l~ C ~eC I' ' - zTn- 0C C)Nn~~- r- rn~ oo "'t0 v . NNt >00 V) 0 00 0) CO'4 '\Q0 N C CNC1 -Cl en U'0~,e me 'W0- C> r) 00 - , 0l- uC (C14 0 - 0mcN C)00 Q)IrN. NN r C u W "Tf 2 2 C w) vl'-~~. N. U'0C'J -220 M C4lN0r--00 0220 U2 rJ4 2 0 e 2~ -k Q) Cl cp 222 ,j.. .~j~ 'U 2~.~-2202 'U 0)1 ;-;U 0N0 222 r- I.-IDU U *o U' 0' W)) NUUUe~ C,' 'N ' ' - In1 4 0) 2 N 0 0)0 (, (D 00 0 n - ND .. lll00 I 0 '~l~~0I~0 ~ ~ ~ ~000 -00 00 Cen- 0r i 0
W)'
0 00 0)N0en.. 0)On.0 C-,0flf 0)Cl00If 0)0N 0\ 0) If n N n Cl en CN\0 e -t IfI C\~0 m0~e 22 0)Nn C\O00 nt n0On 0' 00 k 0 .00 "t '-T0220 0)C r- Cl02M22VM2nC\0 q '0) )0 I 00 Cl ~o 0 ~ ~'.0Cl ooooo- 000 C-4o ~ 0 0 0 0 2 ' 0 C, 00 zt o W' z)l 0 z z z IC In - - 4 6 ; *. a\ 0 . I I I .C f 00 wl~ t0 00N00 - m 0Cl-4-.0- mO0 W) Zn~nC 0 M ~eCl0C C,4 rI) 00 00 C400~ CN -n'~ INe--'0~- 0 e O- r-nr~l~I m)0tC-- 0)r,70e v r' lO O~ne 0)NC l'd-I-0 m 0)00 ' rnV, e 0W V00WC- - -,in -- r- eq -n 0 * n .. J v - - -4 CN -7! U ' R <?N U 'C.'-A U ' ' c ! TA m ' U '5' .U n I, go Th 7: n ' 7! 0 0)0) ' 0)0)0)0)0 0 ))))) '00000 )))0 ) 0000 ))))) I00 cu)00)U 0 )0V )000 ))))n))))).00000'0)))). ))))
)
C 00)) Hq N 0Cl N o H C rJ~u u0U U w F U 103 00Cr O r -1r ON ~ r- r C -~ ON N- 00 00 00 .0 00 \o\o 00 00 00 00 00 00 00 00 IN00 M) C\ r C\J 00 D M O N- N 00 ON Nq M (N N 00 m u 1,- ON CO (N (N 00 ~t(N 0 00 00 0N 00 110- ' 00 ON - M 1- 00 N-e N 00- ND00r N I m) .. - 0 0 In0 1;7; ~ ~NL 00q C C ( w o rn ON NO Q. r-\ 00N 0(N L . r U z~ -. 0 ON C-4 icN4 C)0 vk r 0 C7\ NO r- Z IM Z. wi C) Q . r- 0 N j v C)~'j3 U\ U) r. (= O (=O r.N It C) w13 ) C) C) v u v C\ No~ CD ON tu z 0 VN)) uN rn~N kn Lnloo 0 4~~ ~ r4j C 03 ON Iq Cs. ~ ~ ~ ~N 'N 00'0 0 N .. 0It C- W) v~ rq r-0 cq LC) 0 C *.(N( . *ON ON 0 00 C- cq 00= W0 4 W Cl '0 N q 'IT C) U) t- I * U 3 Z 0N U U) I en re) \C N'NIT~O ) ) C ~~~~~~C U\- .. ( , N O ) C) kn C CD c0 Z Ien A) 1 O Nt0
=
1 ~ N r-- - w N( 4 m C) W) C) 0 C - W) 00 N q C) =400N ON )Cr (O\ 00 00 Z I CZZ00 ,_ =)0 VCw)C4 N cu r 10C14 0 Z)0,, thN~ Ln/'13 rN- ~ C C ONI/Nw C N u r.'O v 00 CU)U cq C CN C, u 0 % V) O N ZI( N ~ C) I. - . ) 1- r- Q) NN cu .. U (N /N~ 0 C)7,'C C) W1
-
C C) C)C)C C)ONN C)C C) N " C)-INO I-N~ I'D~/ON( C)OCDN' I '( r- 0 ISO 000I.01f m - 4-W oo - - (-) v Q)) U) U) U),0 C-4 rq r) C-4U) U M M m) C C C) M C) C) CD C) In C) C ) C ) C ) C ) C ) ) C C) C) C)C44C ) C C ) C "t kAr - -. n u 00 C In .. 0 - - . 0 L 104 r-a)0 C - C r - oCo cN-4 00 00 00 00 00 00 C%- 00 Ml N00 -) C14 -~ ClCD0 Cl el l l Cl C Cl Cl C C14C O Cl 000 z 0 6 4 N D 0 C 00 C) 00 Irl W) \n N c IN m W) C 0 C) N In~ (6C 1 V O\0 t -O IDCl14 \C r- V IT M T00 0* 1,6 w C00 \0 0 00 -'OND Cl .1 V~00 - 0 r'0 - Cln \o 'IT r o. V 0 m W - W C4 z ~00 csC knV V ONl ON W) w CDN~ \c~IP W I) C ) w W)0r > 0 I fl 071~ 0 0 ( 0 (-' tN' .- :l 00 00 ( l-40004 Cl 'IT 0 I- 0 ,n~ O Z , 00 r- .0 "T C,)If 0 ClO~l~I I I 'T OfN W C ) l~) ~ 0 V u 14 I- u N 00a, 00100m V ou \C u 0 -; 0D ;-; 00 00 0 'I .. c Z I ZI I v 00 ~ 0 0) C 0 0\00 00 N U) 100N- CD V 0.) N O\ Cl ClW l )c)(= 1 00 0 "t 0 0 0C (0C\Nrl C5 C 0 C l I cn~~~ ~ ~ ~ ~ = -. Cl o -o, w C 7 C N L Cl zi 0n zO 0 la \ r-4 4j Ln 0) \o 00 \- 0 N 0f CN 0 In 7i c~\O -,T N- 00r 0 , 0- 0. l \. In u I I Io C) 00 V0 -U NuV \ V D ,t 00 (U r" -so 105 I~C I I* r-\ 000 00- 00 r-I00 00 r 1 00 00 00 00 00 00 00 r- - N o00 NnrnC o o 0 00O * ~0 'I- In oR -n W CDU 0 0t 0 7, mI Cl - CD m N - C' ) O 0 ( O~r In ClCl ' 0 W)C W) 000-0 .; N 0' r 0** LnO <D I ' 00 r00 m I'D' r=\ C' kn f~ - nC ' (c0 0 a001n C> en) C\ C Nl) n v W)-0 ~- Cl )0'q\ eCl \D = Cl00- a)If tn oI ' >W n0 - Z -UC4 -C C14 0D u 1. t- D \ r4 r-l (CN C>C\ v aO( r- o Z m a); U' 0; 00- .;C -r - W - - r )\ W U00 W u n I- ;j 4i \0 Z.N;f) I~a I 2U 0 00Z 0 v 0 -C'U II.10r.v C uU0 0 Ca UO' \C ~ a 00la u 0 ~a uf v. u) 00Cl 00 00T I00C OkN ( C N c) N)~ 0 ~ Z 0, t ~ U'~ ~.~. ~ U ~ ~ 4.. *n00 e' - ) z 0 CN 0C Zl, CNV .z 0 A 0 t M c.00 to c I c r-~ r- U)q* r- oot 0 I rnV - ) 0" C N r 0O*f) Q.~ oo r- c-4)0- C') a) \D0 U~00 I ) r.0 ) I0 uC i0 w 5 5) 00 ~ 0 ' O0N ' 0 00\00 t C 0 r I ~C14 0~ - r 00 -cN C tn 0 14 \N C\Cl l 'I -- I-N~ ( O ''~ r- a) CN 000 Z) 0 \O C4 "tl ~ - ~ C -; ;.j "o W\C C\00 w 4- ' ' N 0 ~ C J l 0 .; (u WUU rUU A~U I u W) ) W - 0 3 I V) L;U =) W- EnE 0N 1 En V V I I V)II U Z 00 u m I-'--vInLauu c (U v C ~a CD arara)0ua)u-u 0 ZCD IN(~g~ l =~ 0 0 0 t 0 ~ 1 CD uC -'I-~~Z N I~ 00 I q 0 0 0N g ~ 0 0 0 N N OO . N N 0 l.QqN 0 N 0I 5 co ;- I- . I~ t- . . = M - '* -- * I 7 C oc ' a 00a~~~) 'a a a a a a a a a ~ ~ ~ ) aD k) 00~~)~~~~a a 0 a) tA)O a .t C" a) 0 Qn C)r-- - D - ) - r Q ' e C D u NrC 4ur J 0o (4*A C C "t'I U : mC 0 U -OC14 Clr q 0 \cC 0 uC T I )V rn~~~~C = Dk Tt Tc )0 W o r N ) -" c c' 0 nC -~ va. CDc )r )0 ' -M \ Dr ) C n' r I0 0uC p cq'I 9 N u T Inu 0 L;e c Z0 'i' 0"H- 106 I I I 00 II r-o 00C N-0 -00 N- NO 0 N-00 1'-00 - 0 cq- Cl - 00 00 00 00 00 00 00 00 V- N 00 N00 cq 00 %10 0) 0) -> Vd) ' 00 00 00 0I N r 00 00 N 0000N\ 00 cq a' a 00 'I 0D Vi a' Cl O\ N 6D C14 rj Z r 0 a' C4 1:T Cl C 0. VN Ci 'IT '.0 N0 0n r- 00ICq C) Clll 0)00 r 0) 4) CD .0 CDO ON 00 0 )-CN. 00 =U C nr o 7% t 00 00 oo~* r-e~ oc V q2U Lc:jnI I r1* U en cn C- -TC) I. =1IIV ' 11 0 ' 0 0enMUr) 0 v U 0 IT r UC U m~ rn - 00 U u 0 0Cl ct 0 00 U In 0 0) *. C II 4 -~ ='. jr j z I 00 00 :3 En1 N 00~ U~~~~a N ' ( U)~'.0-00e
)
t 00U v U0'~0)I) 00 07 0 Ue-C z jlN I0N'0 00 C )C 10) UI,1 IC) 0 ) Z z00 z~ z'/0 C14 ~ ~ h 0) \0 00" W Q) C 4 00 W- 'I_-\D m e - o 00'~' Ic) C> r- rn =Il0 - C1 00 .. tQ CD 00'0 0 0 - re) 'IT0 C) Cl' N CD 10 0 0 CNIC C)0'r~ C0 00 ~00 0a'r-' tna'. 0 r-a' z 0) q0 ~0~ " 1 0 N ~ 0Caem a Cl ~ . 0)'0 . 0 9.0' C' cna Z =s~a ~\ 'n w n If r) I lnI 0 U 1 h z 1 1 1 Cl 0) U 00 U v U L 0 a) UN r U 'n 5O l z z rW .00 w as~'C 'n 00 rn0 ~I I I 4'. \D0 'I U))0 0)z 0 00)0 U %C 0 \.o000 U ))) 0)00 0 00)W0 C 0 00 )) 0) 6i 00 -n \ nw - c4 C T5 -i " c 00 -qr (a-nCNC400 U 107 'ZO0 00 1f 00 Intr 'Ac C ) r-o 00O r-0 -- 0 - 0NO N-00 N- r-00 r-00 - N\ 00 00 rJ -- r- r N1 Cfl ON V0 NnV) M 00 00 00 00 00 00 00 00 00 Nr '. 0 rn 0 kn00'. 000 - NIN00 00 00) 00 A c. C, 0 I'D'~ 00 C .0 Ic- m - 00'. \0~ IC 00 000 c~~ 00N 'IT. a'0 OO - 0 ~ 00 00 '.c ~C1 '.00'. "D ~ " 0N \0 :10 '.ON C\vt C 0C '.0 a) 0 7 o C) It 00 00 N- r~-I r- re)I C) W u 00N r- 00 -; = cIi c, 'o (C\' cl, 0 -- C-i " cIN'O zo oa= 1 00 01) -M 4 '1 O\ -5 'S i.. .)a IT~2h~ 5 0u qv h r. a) W) r- 0 c-0))'0 ) NI 0 0 r cq rn w- r-'. 0 00 00 C14a Ia a) m~)~~) ) 1 ~ )a C. ' 0\c C 7 m) Q) a N) n ) 00 Ul M 0 ~ 00 C* 10 -)za) Io 0jI04 rn Ii 'T rI IC - o t t' I'r'Q0 Ic-. N-: N0 0)0 I01N- IN-c \'oiN M0 a ;. Lt ';0-r 0 I~ - '. a) ;_-; -WI a)rrn -) c- -) N 0 -~~'0 ' I ~ ~ U, ~V)* VII w ~I IT h " h C ) - 0 N u 4) 0 0 u . )~ - a ~ I '0 00r Ca a g a0~a N ) 0)r a) a-)INNj aCa IC\ N r40 IN IN v 0 t- 00 uf 00 u r- u N 0u r .0 0 0N - ' 0 v v~ 00 v u r~ 0 4 g . a o 1 -3 IN -, 0 a)l 00 0n 00 I 000 1 0 a)0 ~ '0'0 a I -N a ) 0 0 0 e0(J '~ r 0 0 - '.) 5f N IN IN z0 \'o w N =i 0IN0. z 0 00 'I N =)I 0N- 0 z :: 0000 0000 N 000 cq00 Uo 0 \o \- IN5 r e o0 Inr 00 NN \-00T - , eI r )r-( n\0 r o0 ):3 C 4- U0 j'4 j )4 - f * - - -* 0' 4- 108 NOO r-00 \00 00r- 0 0O o 00 ~.00 00 00 00 r- 00 -n ci 0 n 00NNNNN N 00 V) V.O 0 ci q 000 0o -0 o0 m 00 .O0 mi - q cn 00 C) ci, eni ~c n 000~ rnC or nr 00~ **00 Vci) ONen* rn 00 ciq C, r-CIt \c 0 0 tnr n 0 0 v - L "T cin 0' oC c)v -i400 mC>~ci .
0 II-U k C en , 0 C, v 00 en eri- CN tn -rn 00 ~C-4 W) \)00 CN N4 W I, r. - V 00.- e \C ;.. It r .. Z N z - cO 0)0- -- Z rn 0 0 0 0 CO 0), W.I)~ 1 =~~\ Q - n m( ~~~W (Do ) ;-0 40I~i~.1 ci r0 ) ~ ~ 0 .. a) -rO CO\j-\ rFO 3 \' C In- r00 -~ C - U 0IT0 'It Cr0 r M c i- =r 0 . C tI o I-'0p . .0r0 7Q N "T00f r-C.- 0 U 0)=QWl'T Il-- 0m u M %cN '- -- I-r o r ,,pw m m C - - m 0 000 0 00-000I I0 C- w u 0 InLnv) q q UnZ C4 0 109 'SW),600 (C;) 00 tn ('J00 ICo 00\000 \.c00 \0 00 \0 00 r-00 o 00 00 r- N-C lt N 00 N n D N 0W rr NN 000 MN 000 00I~ 00 00 00 0 0 0. N: N Nl N C ) 0- C)C\ n -4c 0 C\ '.o Cl -\ 0 -1tW) W) N) Nl 00 r) 00 C -r \. 00 0wC4 r . eN -N 00 cuo 0.r - N Q 00 * 00 W) U, ;-; rI Z. I- ; - I N O I .. I. Nn u. w V) C.) 0) u' 00 0 0 000 COC v vl W) - 0'l r 0 00 1 N 0 0N'. 0N' tO 9 z) =r00 '00 r ;j 0 'D 0 Z zN7! 00O00 m 5 5 a'a0 I Cl 1000V) 00 v 00 *.) CU 0 C> cn 0 a I ~ Cl C InN 0 n C,4. lo 00 -. 1 r. I) - u - N '0 00 00 MO *. 0 ~ N'O O O~-' 0- IrO m :CN)CO u 0-00N vJ~ 'D a'N 00 f r- C7 m f -q m ' W) 0 \ Z; -t 0f " 0n r-~ = w) 0\ F- Q 0 c, 0 00 W) Cl \0 W) n 0 n C, C) 00I r - w) 00 r- C3r-\ V 00 C) W) Cl 'IT a' C'4 a' 00W~ \p r- m C00CO W) ClC 0) pC l: ~0 00 MN NC\ - m'.0COmCW C :4l a' CO 0 U 0 0 wQ
IU
0 ) 0
)
~ u- 10 V 0 r, N ) 0)0 q )))l)))) p*p 0U)v) C = 0 0 )
=
) C: W N'.0)..0ur0O0)0 r. r. z .z0z) 0zur .4 C ~ C NC )~ :S z \ s; 000- '0l OI1~ 0)COOn ~ wl u 0 '.0en0\CO ooa'CO N'' d,'~ rj- (u 'N Q)o - -loc vC W Ii) Itl N ) It- t~~ \- C- m -- -- -*o Z kn Wt M) 't C- It ? l .( n k -r r T - C D M I I :.) 1~0)0 ) 1 ) 0) 0 ) I 0 ) 0 I 0 ) 01)0 I I M0 o 00(-) 0000 CIS) 0) ) . 00 . = ' ) 5 5 5 5 - = = C 3 en 0 z ;3 )0~0 00 a0 0 0 0 0 cp 50 00C0)5)0 Ut z0 0U z z z0 ': 0ZZ .Z z \C -0 W)C 0V 0C4e -1 o r 0 - nF C \4 W )C 0 lo' 0\D 00~ ~ ~ U. C) C-4 mU;t t u0 Mc 1 o\ n C r - C - C \W)W -TN 110 C ; 0\ In -6 -'.T o 00o0 oc r- ~ Ic \.o000 '.000 '.000 '.0 00 '.000 '.0 00 '.000 C 10. In n en N N 00 V n - '.0 00 '4 - r- It- 2C 1 ; 00 ff 00 (N 00 '.0In V)0 '.0 W) kn V)InI I o~C C? 000 '.0 I ZO 0 Hx 0'. -f Nl 0 m \C0 00 Nll \.0 r 0'. C) 00 'T C1 '.0 rN N 0 00' n c.'. N\N - 0 In '0 00 , 'IT C'0, U nC I (Ne r- , C 0 Z DNW - rJn . . r)'IT00 r4 -'C UW I~e C\ v fI). \ CD (nr-C4 n ' '. w 0 ~00 - urn r 0 Z.n0 a.- =-.( - m 'o C) CN OCP 05~ r \D~~( N r- N03 en~0a r-n00 V U) m n w w I [- .. ~ Wn W M '.0 - 0 0 t; C1 Z~~~ I O CU. j Z) N ' U 00 u '~o r. 0.N~ eq'.N "T l 0~ r.n a)00N r.)I C0' =.'0 ~ a.)c \000 -- 0 e C,'. 0 - - 64 CD L2 , N cq 00t0 0 Inu -~~C ON ~I "~ " , 'Tq r . .~. Ln 1S . '.0 000 ;_;~O O O 0 v- w) r- -D T 0~~0 V r) r- r-- ' c C" 00 W)C 0 C M . 10/'I a.)Tr ZCC) -r N DC 0 Z 0 ' O - 0 aUIq)IDU-) r c)C)0 ON - Ca.
1 0 W))a ' = \o U00r I ~00 W W0 W0 v0( . = cq C,4 r In 0 0C'a(u03N ( u00N k 0 v v '0 W) 0~ N 0 - v'0 v~'~ cu ) vr- -v0 0a' vN ) ff 00Z 0V, C, k'r,- q-u t 0 000)0'.5 1.0 '0 Q) In0 C:, C-4 Q 4 r.- ) nl 0t 06 U ZQ o 0Z H Ck0 - n U. I )10 OOqO 00C- ON 0-N ONNr- o06 00\ 0 0r r' cO' - '4O rON r-io - \C - "I 1'r- 1 -1 or 0 00 IC~ 0 NO in~ en W) -q 0- -- 6> W -1 NO 00 NO %DI/ 00 C> ONC - E5N NO - D CD - C> C 0 0 0 0 D 0) 0D 0D C) 0) 0 C0 00 ON I -0 0 ON W)N rq C 1 rn~k tn -0 u~ 00 Q l% 00 C1 D ;) 00 r- *n~ -q I ( (~ rnN0 ocnrn %ben v n 0 0 0 C 5 CN 00N 00 cn ON, b '~ 00t-t 1 o 0 CDul3qW (D~*~ (C " 10 'I .C'1r INON r.O =I -cN 00 u)N N I' / 1~0 ) C) NINN 0I/ (D ~ ~ (D I/N'3 r-ON U~0o - 'rri 0 . C \r C) v% 0 rx 0 r- 00Q0 A ( - ~ 00 i/N 10 5 N 0 ' ZU Z (37( 00 0cn-0 1 0n tn C- u0 00 0 O In 0 Q) " n 0- D l 0N -0 0 %D riO N q t\ 000 C> W) v (D r 'oC\00 IN00~ 0 - N5ON0OCD MN CJ4fl- O0 ~ N I/NNO J, en/N N CD NNmO.I r%
Z
0 r~ O Nr W) I n.0 r-O -q0 00 4r CN~~~~O I -. - \ C --; 0 0 o0 0 )1001 oo W 00 V (U r- ON _ _ _ 00 u_ _vz_ __V_ _ _ _ U ~~ ~ ~ ~ 00 U CNu5u'I T v 5 I 112 NQ~~~fr W)0 r ,~ ~ O N 00 ON \%6 t;m C4 W \C \0 '0 \.o ~ 0 ' 0 00 C14 W 'f 00 00 - V) m ~ O ~ N ~ 00 en 0% '0 rN W1 I rn 00 N. rN en q 0\ I- 'I 00 - o. 0 00 0 00 o1 'IT cq Nl Cl 0 rN 'I W) 0\ Wn N C) C> 0 0 0 0 0 'D N N '0 '0 Cl 0% 00 m Nl Nl '0 I- I 0\ 0, It cq m~ 'IT 1 0 1.0 N- 00 ON In cq N0 I D 0) 0 - ) In In In .0 %0 N- 00 00 00 o C> 0 0 0 0 D C0 0: 0 0 - 0% -~t r.< - V) U Cl C uu 0 0 ~000 00 CD tA00 NN 1%0~ 0 nC m- N ;j 00 Q) '.U0 00n 00 ) 0 ff000ON CN m I 600 r-4 M6)It u) ~ 0 6)C ) .r 6)o 6 c7 c0 i- u Q ~ . V 10010 u 00 I N' 'T Cl kn 004 W, ~C 0 'IT L) 6 r- W')0 6) I 6) L) cl C1 - tn6 0- 0 0 0 L 6)6)~~ 00 CL In) 00 CIwn. u~ ~ ~ 0) '.0T r, N a, 00 W- -'t Q u 00 00 C 6)) r. 6f ) 0 0 0 00 IT . 6)Qt 6) 6) 6)<0%6 6)0 0enI 04 U) 6 M- P. UU U n c p - 113 TABLE 3 TargetPS Symbol Signif. D.val5 FC Sens- CI (95) FDR Spec 217523_at MAPK1O:CD44 3.35E-26 3.7166 7.06 96.8 92.9-98.8 204702_s_at LOC650331:NFE2L3:LOC642996 1.51E-24 3.521 4.57 96.1 91.7-98.4 207850_at CXCL2:CXCL3 5.76E-22 3.2893 7.85 95 90-97.8 204259_at MMP7 7.61E-22 3.2634 69.29 94.9 89.8-97.7 228915 _at DACH1 1.05E-18 2.9232 5.81 92.8 86.9-96.5 241031 at NLF1 3.12E-18 2.8001 8.12 91.9 85.7-95.9 223062_s_at LOC651255:PSAT1:LOC389173:LOC7297 3.32E-15 2.4882 5.52 89.3 82.3-94.2 79:C8orf62 206224_at CST1 7.04E-15 2.4165 13.01 88.7 81.5-93.6 209309_at AZGP1:LOC401393 1.31E-14 2.3986 6.19 88.5 81.3-93.5 219787_s_at ECT2 1.42E-14 2.4245 2.82 88.7 81.6-93.7 202286_s_at TACSTD2 3.13E-14 2.3676 15.71 88.2 80.9-93.3 227140 at No Symbol 9.19E-12 2.1522 1.61 85.9 78.2-91.6 229802_at No Symbol 9.19E-12 2.1536 1.61 85.9 78.2-91.6 213880 at LGR5 1.1OE-11 2.0621 7.89 84.9 76.9-90.8 205174sat QPCT 3.46E-11 2.0422 3.03 84.6 76.7-90.6 238021_s_at LOC643911 3.73E-11 1.6459 1.33 79.5 70.8-86.5 227174_at WDR72 5.22E-11 1.9854 14.11 84 75.8-90.1 238984_at REG4 5.97E-10 1.8033 7.75 81.6 73.2-88.3 204475_at MMPI 1.22E-09 1.8138 10.59 81.8 73.4-88.4 222608 s at ANLN 4.94E-09 1.7541 2.63 81 72.4-87.7 211506_s_at LOC652128:IGHG1:IGHM:IGHV4- 1.99E-07 1.6343 3.1 79.3 70.6-86.3 31:LOC647189:IGHV1 69:IGHA1:IL8:EXOC7:SIX6:IGHD:IGH@: IGHG3:Cl2orf32:ZCWPW2:IF16:IGHG4:I GHA2:IGHG2:RAC 204320_at COLI1A1 2.93E-07 1.5718 2.6 78.4 69.6-85.6 37892_at COL11AI 2.93E-07 1.5707 2.6 78.4 69.6-85.6 232252_at DUSP27 4.68E-07 1.5112 4.81 77.5 68.6-84.8 225806_at C14orf94 2.40E-06 1.6519 1.19 79.6 70.9-86.6 204885_s at MSLN 3.78E-06 1.4266 1.93 76.2 67.2-83.7 214974_x at CXCL5 2.79E-05 1.2818 4.48 73.9 64.7-81.6 236894_at L1TD1 0.0002 1.1596 2.36 71.9 62.6-79.9 205910_s at CEL 0.0006 1.0869 1.46 70.7 61.3-78.8 202311_s_at COLlAl 0.0018 1.0054 2.5 69.2 59.9-77.6 205825_at PCSK1 0.0029 1.1086 1.48 71 61.7-79.1 226237 at COL8A1 0.0254 0.8303 1.43 66.1 56.6-74.7 235976 at SLITRK6 0.0576 0.8689 1.31 66.8 57.2-75.3 -114 TABLE 4 Gene Symbol ValidPSUP Symbol Signif. D.val5 FC Sens- CI FDR __ Spec (95) CD44 366106-HuGene st:314808-HuGene st:59730- MAPK10 3.35E- 3.7136 7.06 96.892.9 HuGene_ st:599371-HuGenest:391296- :CD44 26 98.8 HuGenest:1031797-HuGenest:314340 HuGenest: 10723-HuGenest:950067 HuGenest:282016-HuGenest:480680 HuGenest:69560-HuGenest:388781 HuGenest:243049-HuGenest:374652 HuGenest: 194553-HuGenest: 1075454 uGenest:204489_s_at:619139 HuGene st:210916 s at:542762 HuGene st:1557905_s_at:212014_x_at:229221 at:20449 0 s at:234418_x at:209835_x at:212063_at:216062at: 777408 HuGenest:234411_x_at:217523 at:216056_at:1565868 at:83114-HuGene st NFE2L3 95873-HuGenest:1058969-HuGenest:86088- LOC6503 1.51E- 3.5213 4.57 96.191.6 HuGene st:1093456-HuGene st:347351- 31:NFE2 24 98.4 HuGene st:878719-HuGenest:880179- L3:LOC6 HuGene st:240089_at:822197-HuGenest:873612- 42996 HuGenest:517821-HuGene st:489354 HuGene st:20603-HuGene st:204702 s at MMP7 267137-HuGenest:1068392-HuGene st:794865- MMP7 7.61E- 3.265269.29 94,989.8 HuGenest:876922-HuGene st:669567- 22 97.8 HuGenest:1039935-HuGenest:514120 uGenest:1092830-HuGenest:745768 HuGenest:272260-HuGenest:30733 HuGene st:401681-HuGene st:854024 HuGene st:221616-HuGene st:805045 HuGenest:78383 8-HuGenest:204259_at:934756 HuGene st:446688-HuGene st:10259 HuGene st:267021-HuGene st:889042-HuGene st CXCL3 11327-HuGene st:626337-HuGene st:322604- CXCL2:C 1.95E- 3.2907 7.85 9590 HuGene st:256806-HuGene st:261067- XCL3 21 97.8 HuGene st:448247-HuGene st:666569 HuGenest:222646-HuGenest:262117 HuGenest:798675-HuGenest:391369 HuGenest:865299-HuGenest:877897 HuGenest:575433-HuGene st:43120-HuGenest:42070 HuGene st:33632-HuGene st:965957 HuGene st:556939-HuGene st:26460 HuGene st:796450-HuGene st:230101_at:541868 uGene st:490134-HuGene st: 1064107 HuGene st:230192-HuGene st:645829 HuGene st:455502-HuGene st:249140 HuGene st:906606-HuGene st: 1098767 HuGenest:721989-HuGenest:500578 HuGenest:33301-HuGenest:207850_at:622384 uGenest:209774_x_at:432036-HuGenest:30559 uGene st:187557-HuGene st:1101027 HuGenest: 1038334-HuGene st:260925 HuGene st:515662-HuGene st:258081 HuGene st: 1569203_at:280828-HuGenest: 1000069 HuGene st:509822-HuGene st:890213- - 115 HuGene st:739461-HuGene st:342746 HuGenest:98929-HuGenest:818131 HuGenest: 1091592-HuGenest:420504 HuGenest:41644-HuGene st:655278 HuGene st:188562-HuGene st:458602-HuGene st DACHI 686187-HuGenest:985190-HuGenest:1011822- DACHI 1.05E- 2.9206 5.81 92.886.9 HuGenest:1092705-HuGenest:693255- 18 96.5 HuGenest:722787-HuGenest:465556 HuGenest: I13230-HuGene_ st:49101 HuGenest:290879-HuGenest:646013-HuGenest:3984 HuGenest:37863 1-HuGenest: 1002554 HuGene st:620749-HuGene st:802309 HuGene st:82679-HuGenest:205471_s_at:303920 HuGene st:338273-HuGene st:20454 HuGene st:1567100_at:205472 s at:228915_at:1562342 at:169641-HuGene st:984254-HuGene st:1567101 at NLF1 269939-HuGenest:314919-HuGenest:552234- NLF1 3.12E- 2.8001 8.12 91.985.7 HuGenest:622058-HuGenest:68900- 18 95.9 HuGenest:376228-HuGenest:194806 HuGenest:923707-HuGenest:745661 HuGenest:996995-HuGenest:534469 HuGenest:560627-HuGenest:711203 HuGenest:241031_at:93378-HuGene st:611899 HuGenest:57671-HuGene st:443570 HuGene st:231416-HuGene st PSAT1 139416-HuGenest:795030-HuGenest:160439- LOC6512 3.32E- 2.4883 5.52 89.382.3 HuGenest:322148-HuGenest:85375- 55:PSAT 15 94.1 HuGenest:223335-HuGenest:2658-HuGenest:399855- 1:LOC38 uGenest:220892_s_at:941491-HuGenest:1017348- 9173:LO HuGene st:856207-HuGene st: 194347- C729779: HuGene st:553133-HuGene st:223062_s_at:987517- C8orf62 HuGene st:336174-HuGene st CST1 95123-HuGenest:125297-HuGenest:235257- CST1 7.04E- 2.4165 13.01 88.781.5 HuGenest:102028-HuGenest:291462- 15 93.7 HuGenest:206224_at:906914-HuGene st:936009 HuGene st:1055285-HuGene st AZGP1 363333-HuGenest:620891-HuGenest:619378- AZGP1:L 1.31E- 2.3984 6.19 88.581.3 HuGenest:741747-HuGenest:28018- OC40139 14 93.5 HuGene st:594026-HuGene st:222884- 3 HuGene st: 1006522-HuGenest:974237 HuGenest:47389 1-HuGenest:784628 HuGenest:488818 HuGenest:209309_at:217014_s_at:63620 HuGenest:366096-HuGenest:950952 HuGenest:601798-HuGene st:399823 HuGene st:734365-HuGene st:1054227-HuGene st ECT2 770888-HuGenest:82454-HuGenest:171418- ECT2 1.42E- 2.4269 2.82 88.881.6 HuGene st:52318-HuGene st:530079- 14 93.7 HuGene st:288145-HuGene st:234992_x_at:64803 HuGenest:705846-HuGenest: 158755 HuGenest:554622-HuGene_ st:237241_at:629949 HuGenest:222681-HuGenest:241642 HuGenest:143403-HuGenest:770445 HuGenest:218688-HuGenest:700408 HuGene st:165018-HuGene st:917458 uGene st:219787_s at:609829-HuGenest:110350 uGene st -116 TACSTD2 1004611-HuGenest:755233-HuGene_-st:815649- TACSTD 3.13E- 2.3693 15.71 88.280.9 HuGenest:567105-HuGenest:8281- 2 14 93.3 HuGene st:1041491-HuGene st:339450 HuGenest:227128 s_at:653539-HuGenest:1079680 HuGenest:958592-HuGenest:202286_s_at:620831 HuGene st:53075-HuGene st:1009541 uGene st:861600-HuGene st:181597 HuGene st:130895-HuGene st:849592-HuGene st L8 1044664-HuGene _st:714746-HuGenest:442029- AHNAK: 3.28E- 2.3273 1.73 87.880.3 HuGenest:1 19200-HuGenest:550444- IGHG1 12 93 HuGene st:493978-HuGenest:713906 HuGene st:943156-HuGene st:504843 uGene st:501902-HuGene st:497219 HuGenest:23149-HuGenest:387164 HuGenest:562654-HuGenest: 1047492 HuGene st:538558-HuGenest:239300 HuGenest:211430_s_at:390695-HuGene st:1081759 uGenest: 144630-HuGenest:661013 HuGenest:393191-HuGenest:55017 HuGenest:1005159-HuGenest:359116 HuGenest:373132-HuGenest:689050 HuGenest:611041-HuGenest:715689 uGenest:289354-HuGenest:820633 HuGenest:555788-HuGenest:988945 uGenest:307991-HuGenest:211506_s_at:924368 HuGenest:4791 1-HuGene st:929972 HuGenest:561 100-HuGenest:68522 HuGenest:217039_x_at:231668_x_at:1080400 HuGenest:141662-HuGenest:765200 HuGene st:648539-HuGene st:274868 HuGene st:233969 at LGR5 784585-HuGene st:777519-HuGene st:937559- LGR5 1.1OE- 2.0645 7.89 84.976.9 HuGene _st:297045-HuGenest:796093- 11 90.8 HuGenest:783411-HuGenest:876648 HuGenest:842003-HuGenest:102017 HuGene st:56225 1 -HuGene st:295997 HuGene st:747520-HuGenest:811121 HuGenest:677407-HuGenest:522834 HuGenest:475414-HuGenest:802999 HuGene_ st: 1066925-HuGenest:216864 HuGenest:890928-HuGenest:179502 HuGene_ st:79633 HuGene st:210393 at:241266 at:213880 at QPCT 261837-HuGenest:108729-HuGenest:942514- QPCT 3.46E- 2.0415 3.03 84.676.7 HuGenest:184578-HuGenest:454020- 11 90.6 HuGenest:202482-HuGenest:1012784 HuGenest:854999-HuGenest:62841 HuGene_ st:271540-HuGenest:524919 HuGenest:78507-HuGenest:89785-HuGenest:728309 HuGenest:625205-HuGenest:170279 HuGenest:561083-HuGenest:322236 HuGene st:205174 s_at:303448-HuGene st WDR72 829740-HuGenest:1063796-HuGenest:725768- WDR72 5.22E- 1.9856 14.11 8475.9 HuGenest:227174_at:36564- 11 90 HuGenest:236741_at:604794-HuGenest:1040675 HuGene st:527199-HuGene st: 1052168 HuGene st: 1563874 at: 177868-HuGene st:667158 HuGene st:542462-HuGene st:136243- - 117 HuGenest:551835-HuGenest REG4 186424-HuGenest:382639-HuGenest:93 1162- REG4 5.97E- 1.8057 7.75 81.773.3 HuGenest:400261-HuGenest:638045- 10 88.3 HuGenest:852602-HuGenest:628254 uGenest:849775-HuGenest:274371 HuGenest:1554436 a_at:1092015 HuGenest:223447_at:29614-HuGenest:661518 HuGenest:580789-HuGenest:929866 HuGenest:160171-HuGenest:954850 HuGene_ st:758344-HuGenest:254891 HuGenest:238984_at:421464-HuGenest:758547 HuGene st:364821-HuGene st:701510-HuGene st MMP1 61706-HuGenest:300572-HuGenest:1020786- MMPI 1.22E- 1.8141 10.59 81.873.4 HuGenest:437171-HuGenest:671620- 09 88.4 HuGenest:689073-HuGene_ st:622653 HuGenest:958445-HuGenest:445730 HuGenest:914223-HuGenest:693724 HuGenest:673683-HuGenest:524115 HuGenest:422476-HuGenest:361198 HuGene st:710307-HuGene st:468477 HuGene st:840324-HuGenest: 1070117 HuGenest:473664-HuGene st:732367 HuGene st:204475 at:353235-HuGene st ANLN 522941-HuGenest:550252-HuGenest:477118- ANLN 4.94E 1.7566 2.63 8172.5 HuGenest:858635-HuGenest:34728- 09 87.7 HuGenest:165560-HuGenest:226318 HuGene st:207100-HuGene st:842538 HuGenest:343961-HuGene st:899690 HuGene st:705871-HuGene st:42619 HuGene st:984996-HuGenest:733290 HuGene st:1008901-HuGenest:752472 HuGenest:619756-HuGenest:674545 HuGenest:558324-HuGenest:261208 HuGene st:222608 s at:1552619 a at COL1IA1 254341-HuGenest:485975-HuGenest:869898- COL1IA 2.93E 1.5724 2.6 78.469.6 HuGenest:755872-HuGenest:2674-HuGenest:230890-1 07 85.6 HuGenest:6042-HuGenest: 1100922 HuGene st:800510-HuGene st:64257 HuGene st:301971-HuGenest:360139 HuGenest:549979-HuGenest:603002 HuGenest:756718-HuGenest: 1043994 HuGenest:198973-HuGenest:550144 HuGenest:986684-HuGene st:743059 HuGenest:708360-HuGenest:575064 HuGenest:633985-HuGenest:392228 uGenest:204320_at:121425-HuGenest:468885 HuGene st:20543-HuGene st:170503-HuGene st -118 DUSP27 612025-HuGenest:737703-HuGenest:124098- DUSP27 4.68E- 1.5098 4.81 77.568.6 HuGene st:102948-HuGene st:35826- 07 84.8 HuGene st:226732-HuGene st:513585 HuGene st: 135885-HuGene st:737040 HuGenest:446627-HuGenest:724445 HuGenest:289189-HuGenest:5396-HuGenest:520095 HuGenest:172210-HuGenest:277848 HuGenest:312172-HuGenest:903613 HuGenest:232252_at:303717-HuGenest:846174 HuGene st C14orf94 225806_at:243446_at:88782-HuGenest:537586- Cl4orf94 2.40E- 1.6524 1.19 79.670.9 uGenest:973643-HuGenest:646464- 06 86.5 HuGenest:729584-HuGenest:973338 HuGenest:95576-HuGenest:301260 HuGenest:559586-HuGenest:942952 HuGene st:82798-HuGene st:218383_at:165751 HuGene st:965936-HuGenest:31043 HuGenest:650638-HuGenest:984201 HuGene st:925802-HuGene st:306659 HuGene st:345230-HuGene st MSLN 63302-HuGenest:821643-HuGene st:99135- MSLN 3.78E- 1.427 1.93 76.267.2 uGenest:192667-HuGenest:765000- 06 83.7 HuGenest:44188-HuGenest:5962 1-HuGenest:465236 HuGenest:559922-HuGenest:82252 HuGenest: 1039687-HuGenest: 1053735 HuGene st:204885_s_at:870302-HuGenest:246473 uGenest:58043-HuGenest: 122629 HuGene st:826602-HuGene st:68974 1 HuGene st:931732-HuGene st:227498-HuGene st CXCL5 264137-HuGene st:272626-HuGene st:745715- CXCL5 2.79E 1.2814 4.48 73.964.7 HuGenest:968572-HuGenest:1054472- 05 81.7 HuGene _st:394185-HuGenest:181128 HuGenest:456756-HuGenest:415578 HuGenest:215101_s_at:214974 x_at:935926 HuGenest:781518-HuGenest:235326 HuGenest:941963-HuGenest:242589 HuGenest:798477-HuGene st:257906 HuGene st:350119-HuGene st:222708-HuGene st L1TD1 859419-HuGenest:968483-HuGenest:1010927- L1TD1 2.OOE- 1.1595 2.36 71.962.6 HuGenest:236894_at:715579-HuGenest:540230- 04 79.9 HuGenest:414266-HuGenest:572495 HuGenest:327643-HuGenest: 1058922 HuGenest:18534-HuGenest:1063550 HuGene st: 1030986-HuGenest:842123 HuGene st:709222-HuGene st:887612 HuGenest:219955_at:496121-HuGenest:942986 HuGene st:799837-HuGene st CEL 1553970 s at:898561-HuGenest:857424- CEL 6.OOE- 1.0872 1.46 70.761.3 HuGenest:456577-HuGenest:1035054- 04 78.8 HuGene st:539061-HuGene st:788693 HuGene st:314198-HuGene st:786044 HuGene st:205910_s_at:958725-HuGenest:473620 HuGene st:74757-HuGenest:169727 HuGene st:711091-HuGene st:461956 uGene st:897116-HuGene st:911062 HuGene st:796522-HuGene st -119 COL1A1 487433-HuGene st:719132- COLlA1 1.80E- 1.0053 2.5 69.259.9 HuGene st:1556499 s at:202311 s at:202310 s at:100 03 77.6 3153-HuGene st: 1029566-HuGenest PCSK1 981596-HuGenest:31911-HuGenest:701212- PCSK1 2.90E- 1.1082 1.48 7161.6 uGenest:673651-HuGenest:166044- 03 79.1 HuGenest:989024-HuGenest: 1039153 HuGenest:68881 1-HuGenest:1068709 HuGenest: 10839 10-HuGenest:527047 HuGenest:994213-HuGenest:190818 HuGenest:785459-HuGenest:127562 HuGene st: 1012339-HuGene st:343010 HuGene st:109185-HuGenest:115345 HuGenest:470044-HuGene st:856975 HuGene st:454100-HuGene st COL8A1 713455-HuGenest COL8A1 2.54E 0.8325 1.43 66.156.6 02 74.7 TABLE 5 TargetPS Symbol Signif. D.val5 FC Sens- CI (95) FDR Spec 213106_at ATP8A1 6.60E-08 2.2884 3.11 87.4 76.9-94.1 210107 at CLCA1 3.19E-07 2.1573 9.98 86 75.2-93.1 204811 s at CACNA2D2 3.61E-05 1.659 4.41 79.7 67.7-88.5 223969_s_at RETNLB 4.3968E- 1.6581 11.64 79.6 67.7-88.5 05 223970_at RETNLB 4.3968E- 1.6576 11.64 79.6 67.8-88.5 05 228232 s at VSIG2 0.0001 1.5718 2.4 78.4 66.3-87.6 242601 at LOC253012 0.0001 1.5777 5.28 78.5 66.5-87.6 227719_at No Symbol 0.0002 1.642 1.98 79.4 67.4-88.3 237521_x_at No Symbol 0.0002 1.6433 .1.98 79.4 67.5-88.3 203240 at FCGBP 3.OOE-04 1.5005 3.07 77.3 65.2-86.7 204897_at PTGER4 6.OOE-04 1.4732 1.55 76.9 64.7-86.4 227676_at FAM3D 0.001 1.2706 2.15 73.7 61.3-83.8 205765_at CYP3A5 1.20E-03 1.5279 1.8 77.8 65.6-87.1 232176_at SLITRK6 0.0016 1.3618 5.96 75.2 62.8-85 232481_s at SLITRK6 0.0016 1.3607 5.96 75.2 62.9-85 235976_at SLITRK6 0.0016 1.3648 5.96 75.3 62.9-84.9 221874_at KIAA1324 3.80E-03 1.325 2.1 74.6 62.2-84.5 226248_s at KIAA1324 0.0038 1.3246 2.1 74.6 62.2-84.6 204607_at HMGCS2 5.20E-03 1.2354 3.59 73.2 60.7-83.3 203963_at CA12 6.50E-03 1.1828 1.86 72.3 59.7-82.6 204508 s at CA12 6.50E-03 1.1861 1.86 72.3 59.8-82.6 215867_x_at CA12 6.50E-03 1.1858 1.86 72.3 59.7-82.6 227725_at ST6GALNAC1 0.0068 1.1419 1.76 71.6 59-82 200884_at CKB 7.30E-03 1.1481 2.25 71.7 59.1-82.1 219955_at L1TD1 7.70E-03 1.2502 3.27 73.4 60.9-83.5 236894 at L1TD1 0.0077 1.2465 3.27 73.3 60.9-83.5 205259 at NR3C2 1.29E-02 1.1933 1.63 72.5 60.1-82.8 218211 s at MLPH 1.38E-02 1.2269 1.59 73 60.5-83.2 -120 214234_s_at CYP3A5P2 1.94E-02 1.098 1.7 70.8 58.1-81.3 214235 at CYP3A5P2 1.94E-02 1.0976 1.7 70.8 58.2-81.3 204895_x_at MUC4:TAF5L:LOC650855:L 4.53E-02 1.1998 1.31 72.6 60.1-82.8 OC645744 217109_at MUC4:TAF5L:LOC650855:L 4.53E-02 1.2028 1.31 72.6 60.2-82.8 OC645744 217110_s_at MUC4:TAF5L:LOC650855:L 4.53E-02 1.198 1.31 72.5 60-82.8 OC645744 221841_s_at KLF4 7.31E-02 1.0009 1.58 69.2 56.5-79.9 215125 s at UGT1A9 7.37E-02 1.0664 1.28 70.3 57.6-80.9 208063_s at CAPN9 7.40E-02 0.998 1.84 69.1 56.4-79.9 214433 s at SELENBP1 9.02E-02 0.9783 2 68.8 56-79.6 226302_at ATP8B1 0.093 1.039 1.4 69.8 57.2-80.5 231832_at WDR51B 0.1284 1.0008 1.28 69.2 56.5-79.9 219543_at PBLD 1.66E-01 0.8734 1.89 66.9 54.2-77.9 208937_s_at ID1 2.34E-01 0.7639 1.66 64.9 52.1-76.2 205927 s at CTSE 2.81E-01 0.7393 3.23 64.4 51.6-75.8 229070_at C6orflO5 0.7794 0.7027 1.37 63.7 50.9-75.2 TABLE 6 TargetPS Symbol Signif. D.val5 FC Sens- CI (95) FDR Spec 200665_s at SPARC 3.05E-09 2.3771 3.24 88.3 78-94.6 212667_at SPARC 3.0479E-09 2.3752 3.24 88.3 78.1-94.6 211964 at COL4A2 5.5315E-09 2.4316 2.87 88.8 78.8-95 211966_at COL4A2 5.5315E-09 2.4321 2.87 88.8 78.8-95 211980_at COL4A1 1.243E-08 2.3031 3.03 87.5 77.2-94.2 211981_at COL4A1 1.243E-08 2.3045 3.03 87.5 77.2-94.2 218638_s_at SPON2 1.9471E-08 2.3321 3.38 87.8 77.5-94.3 221729_at COL5A2 5.7658E-08 2.2145 3.31 86.6 76-93.5 221730_at COL5A2 5.7658E-08 2.2138 3.31 86.6 75.9-93.6 208782 at FSTL1 7.8434E-08 2.228 2.64 86.7 76.1-93.7 201261_x_at BGN 9.6881E-08 2.2253 2.99 86.7 76-93.6 213905_x at BGN 9.6881E-08 2.2232 2.99 86.7 76.1-93.6 209955_s_at IFIH1:FAP 1.358E-07 2.2261 4.88 86.7 76.2-93.6 226237_at COL8A1 3.6053E-07 2.1456 3.09 85.8 75-93 202310 s at COL1A1 5.44E-07 2.0121 3.29 84.3 73-92 202311_s_at COL1A1 5.44E-07 2.0131 3.29 84.3 73.1-92 217430_x_at COL1A1 5.4426E-07 2.015 3.29 84.3 73.2-92 201438_at COL6A3 5.7026E-07 2.0286 2.73 84.5 73.5-92.1 202403_s_at COL1A2:LOC728628 8.85E-07 2.0609 2.62 84.9 73.9-92.4 202404_s_at COL1A2:LOC728628 8.85E-07 2.0621 2.62 84.9 73.7-92.4 229218_at COL1A2:LOC728628 8.8466E-07 2.0618 2.62 84.9 73.9-92.4 208850_s at THY1 1.0326E-06 1.9159 3.34 83.1 71.8-91.1 208851_s_at THY1 1.0326E-06 1.915 3.34 83.1 71.7-91.1 213869_x at THYl 1.0326E-06 1.9132 3.34 83.1 71.7-91.1 203477_at COL15A1 1.0611E-06 2.0121 3.37 84.3 73.2-91.9 201616_s at CALDI 1.11E-06 2.1136 1.7 85.5 74.6-92.8 212077_at CALDI 1.1059E-06 2.1131 1.7 85.5 74.5-92.8 - 121 201162 at IGFBP7 1.1574E-06 2.0016 1.91 84.2 72.9-91.8 201163_s at IGFBP7 1.16E-06 1.999 1.91 84.1 73-91.9 210511_s_at INHBA 1.1943E-06 2.0027 3.38 84.2 73-91.9 225664_at TMEM30A:COL12A1 1.2492E-06 2.0313 4.16 84.5 73.4-92.1 231766_s at TMEM30A:COL12A1 1.2492E-06 2.0303 4.16 84.5 73.4-92.1 231879_at TMEM30A:COL12A1 1.2492E-06 2.0325 4.16 84.5 73.5-92.1 203325_sat COL5A1 1.6076E-06 1.934 2.75 83.3 72.1-91.3 212488_at COL5A1 1.6076E-06 1.9342 2.75 83.3 72.1-91.3 212489_at COL5A1 1.6076E-06 1.9339 2.75 83.3 72-91.2 203083_at THBS2 2.3382E-06 1.9081 6.5 83 71.6-91 202291_s_at MGP:Cl2orf46 2.50E-06 1.9993 2.62 84.1 73-91.9 201645_at TNC 3.74E-06 1.9608 2.14 83.7 72.5-91.5 201852_x at COL3A1 9.29E-06 1.8472 2.32 82.2 70.7-90.5 211161_s_at COL3A1 9.2906E-06 1.8501 2.32 82.3 70.7-90.5 215076_s at COL3A1 9.2906E-06 1.8495 2.32 82.2 70.7-90.5 232458 at COL3A1 9.2906E-06 1.8473 2.32 82.2 70.7-90.5 212344_at SULFI 1.4436E-05 1.7528 2.84 81 69.3-89.5 212353_at SULF1 1.4436E-05 1.7555 2.84 81 69.2-89.5 212354_at SULF1 1.4436E-05 1.7521 2.84 80.9 69.3-89.5 201185_at HTRA1 1.7178E-05 1.7346 2.29 80.7 68.9-89.3 202998_s at ENTPD4:LOXL2 1.99E-05 1.7754 2.45 81.3 69.6-89.7 224724_at SULF2 2.0766E-05 1.6837 2.25 80 68.2-88.7 233555_s_at SULF2 2.0766E-05 1.6795 2.25 79.9 68-88.7 201069_at MMP2 2.5409E-05 1.7345 3.02 80.7 69-89.3 201147_s_at TIMP3 2.62E-05 1.7776 2.56 81.3 69.6-89.7 201150 s at TIMP3 2.62E-05 1.7774 2.56 81.3 69.6-89.7 209156 s at COL6A2 2.9551E-05 1.7373 1.67 80.7 69-89.4 227099_s at LOC387763 0.00003052 1.8968 1.43 82.9 71.4-90.9 214247 s at DKK3 3.1838E-05 1.7434 1.86 80.8 69.1-89.5 202450_s at CTSK 3.98E-05 1.7153 2.16 80.4 68.7-89.1 225799_at MGC4677,MGC4677:LOC5414 4.1139E-05 1.8003 1.45 81.6 70-90 71 209395 at CHI3L1:MYBPH 5.1515E-05 1.6358 3.06 79.3 67.4-88.2 209396_s at CHI3L1:MYBPH 5.1515E-05 1.6352 3.06 79.3 67.4-88.2 202878_s at CD93 5.19E-05 1.6674 2.36 79.8 67.9-88.6 204320_at COL11A1 5.3391E-05 1.7594 2.55 81 69.3-89.6 37892_at COL1IA1 5.3391E-05 1.7556 2.55 81 69.3-89.6 221011 s at LBH 6.3299E-05 1.6771 1.9 79.9 68.1-88.7 213125_at OLFML2B 6.6557E-05 1.7057 1.97 80.3 68.5-89 204475 at MMP1 7.0616E-05 1.6759 3.91 79.9 68-88.8 226694_at AKAP2:PALM2:PALM2- 7.4206E-05 1.7231 1.58 80.6 68.8-89.2 AKAP2 202766_s at FBN1 0.0001 1.5984 1.48 78.8 66.7-87.9 205828 at MMP3 0.0001 1.6223 6.68 79.1 67.3-88.1 207191_s at ISLR 0.0001 1.6448 1.87 79.5 67.5-88.4 213428 s at COL6A1 0.0001 1.7301 1.58 80.6 68.8-89.3 226930_at FNDC1 0.0001 1.521 3.14 77.7 65.6-87 201792 at AEBP1 0.0002 1.4772 2.11 77 64.8-86.5 203878 s at MMP11 0.0002 1.6971 1.28 80.2 68.4-88.9 211959 at IGFBP5 0.0002 1.5662 3.96 78.3 66.3-87.5 225710 at No Symbol 0.0002 1.6426 1.98 79.4 67.5-88.4 - 122 226311_at No Symbol 0.0002 1.6436 1.98 79.4 67.5-88.3 226777_at No Symbol 0.0002 1.645 1.98 79.5 67.5-88.4 227140 at No Symbol 0.0002 1.6427 1.98 79.4 67.6-88.3 229802_at No Symbol 0.0002 1.6421 1.98 79.4 67.5-88.3 205479_s_at PLAU 0.0003 1.484 2.11 77.1 64.9-86.5 210495_x at FN1 0.0003 1.494 2.87 77.2 65.1-86.6 211719 x_at FN1 0.0003 1.4894 2.87 77.2 65.1-86.7 212464_s_at FN1 0.0003 1.493 2.87 77.2 65-86.7 216442_x at FNI 0.0003 1.4935 2.87 77.2 65-86.6 217762_s at RAB31 0.0003 1.4864 2.08 77,1 65-86.6 217763_s_at RAB31 0.0003 1.4848 2.08 77.1 64.9-86.6 217764_s_at RAB31 0.0003 1.4882 2.08 77.2 65-86.5 225681 at CTHRC1 0.0003 1.4588 3.41 76.7 64.5-86.3 201105_at LGALS1 0.0004 1.4844 1.61 77.1 64.9-86.6 208788_at ELOVL5 0.0004 1.5418 1.86 78 65.8-87.2 224694_at ANTXR1 0.0005 1.52 2 77.6 65.5-86.9 200974_ at ACTA2 0.0006 1.4251 1.75 76.2 64-85.8 210095_s_at IGFBP3 0.0006 1.5538 1.38 78.1 66-87.4 201426_s_at VIM 8.OOE-04 1.4819 1.48 77.1 64.9-86.4 219087_at ASPN 0.0009 1.4046 2.75 75.9 63.6-85.5 227566_at HNT 0.001 1.4215 2.26 76.1 64-85.7 201667_at GJA1 1.30E-03 1.4084 1.63 75.9 63.7-85.6 200600_at MSN 1.40E-03 1.4942 1.36 77.2 65-86.7 204051 s at SFRP4 0.0015 1.4056 2.2 75.9 63.6-85.6 209101_at CTGF 0.0015 1.3331 1.78 74.7 62.4-84.6 204620_s at CSPG2 0.0016 1.21 1.38 72.7 60.3-83 211571_s at CSPG2 0.0016 1.2123 1.38 72.8 60.1-82.9 215646_s at CSPG2 0.0016 1.2095 1.38 72.7 60.3-83 221731_x_at CSPG2 0.0016 1.2115 1.38 72.8 60.3-83 204006_s at FCGR3B 0.0026 1.2308 2.14 73.1 60.6-83.2 203570_at LOXL1 0.0027 1.4048 1.33 75.9 63.7-85.6 201744_s_at LUM 2.90E-03 1.3418 2.18 74.9 62.5-84.7 202283_at SERPINF1 3.1OE-03 1.2121 1.76 72.8 60.2-83 209596_at MXRA5 0.0034 1.2974 1.62 74.2 61.7-84.1 210809 s at POSTN 0.005 1.3011 2.58 74.2 61.8-84.2 205547 s at TAGLN 0.0051 1.1829 2.19 72.3 59.7-82.6 202237_at NNMT 5.40E-03 1.223 1.44 73 60.4-83.1 202238_s_at NNMT 5.40E-03 1.2259 1.44 73 60.5-83.1 218468_s at GREM1 0.007 1.3017 1.31 74.2 61.8-84.2 218469_at GREMI 0.007 1.3027 1.31 74.3 61.9-84.2 208747_s_at CIS 0.0088 1.3575 1.77 75.1 62.9-85 224560 at TIMP2 0.0097 1.1629 1.72 72 59.4-82.3 231579_s_at TIMP2 0.0097 1.1641 1.72 72 59.5-82.4 209875_s at SPP1 0.0109 1.0673 3.91 70.3 57.6-80.9 202859_x at IL8 0.0112 1.3238 4.54 74.6 62.2-84.5 200832_s at SCD:LOC651109:LOC645313 1.27E-02 1.0749 2.33 70.5 57.9-81.1 201058_s at MYL9 0.0131 1.1765 1.37 72.2 59.6-82.5 203645 s at CD163 0.0146 1.1655 2.19 72 59.4-82.3 215049_x at CD163 0.0146 1.1664 2.19 72 59.4-82.3 202917_s at S100A8 1.48E-02 1.0979 3.17 70.8 58.3-81.4 - 123 201289_at CYR61 0.015 1.0919 1.69 70.7 58.1-81.2 210764_s at CYR61 0.015 1.0938 1.69 70.8 58.2-81.2 218559_s at MAFB 0.0182 1.162 1.48 71.9 59.3-82.3 203382_s at APOE 0.0285 1.1124 1.43 71.1 58.5-81.6 201893_x_at DCN 3.64E-02 1.0821 1.29 70.6 58-81.1 211813_x_at DCN 0.0364 1.0838 1.29 70.6 57.8-81.1 211896_s at DCN 0.0364 1.0814 1.29 70.6 58-81.1 213524_s at GOS2 0.0383 0.994 1.53 69 56.3-79.8 207173_x at CDH1I 0.0384 1.0121 1.98 69.4 56.6-80.1 209218 at SQLE 0.041 1.0162 2.15 69.4 56.6-80.1 201141 at GPNMB 0.0623 1.0165 1.52 69.4 56.7-80.1 201859_at PRG1 0.0634 0.798 1.34 65.5 52.6-76.7 234994_at KIAA1913 0.1061 0.9579 1.2 68.4 55.7-79.3 223122 s at SFRP2 0.1336 0.968 1.24 68.6 55.9-79.4 223235_s_at SMOC2 0.2175 0.6547 2.06 62.8 50-74.4 200986_at SERPING1 0.2636 0.7831 1.26 65.2 52.4-76.4 201842_s_at EFEMPI 4.91E-01 0.6821 1.27 63.3 50.5-74.8 204122 at TYROBP 0.4923 0.7066 1.23 63.8 51.1-75.3 202620_s at PLOD2 5.15E-01 0.7784 1.51 65.1 52.3-76.4 TABLE 7 Gene Symbol ValidPSUP Symbol Signif. D.val5 FC Sens- CI FDR Spec (95) ATP8A1 231484_at:792569-HuGenest:210192_at:393806- ATP8A1 6.60E- 2.2854 3.11 87,376.9 HuGenest:743026-HuGenest:200684- 08 94 HuGenest: 1570592_a_at:645563-HuGenest:245580 uGenest:175799-HuGenest:20566 uGenest:636204-HuGenest:376757 uGenest:273140-HuGenest:873028 HuGenest:864954-HuGenest:349056 HuGene st:538894-HuGenest:167620 HuGene st:807498-HuGene st CLCA1 210107_at:426361-HuGenest:389155- CLCA1 3.19E- 2.1536 9.98 85.975.1 HuGenest:622359-HuGenest:523802- 07 93.1 HuGenest: 196470-HuGenest: 1078884 uGenest:515258-HuGenest:638133 uGenest:764167-HuGenest:921948 uGenest:372283-HuGenest:252084 HuGenest:98798-HuGenest:1059095 HuGenest:441678-HuGenest:490107 HuGenest:284008-HuGenest:258568 HuGene__st:602588-HuGene st:472452 HuGene st:640066-HuGene st - 124 CACNA2D2 805306-HuGene st:1022165-HuGenest:356782- CACNA 3.61E- 1.6587 4.41 79.767.8 HuGenest:70395-HuGene st:525161- 2D2 05 88.5 HuGenest:729314-HuGenest:866616 HuGenest: 13978-HuGenest:715678 HuGene st:299289-HuGene st:592015 uGenest:756393-HuGene st:607464 uGenest:862691-HuGene st:229636_at:885048 uGenest:680228-HuGene st:393608 HuGene st:886897-HuGene st:413699-HuGene st RETNLB 173654-HuGenest:223970_at:1094109- RETNL 4.40E- 1.6599 11.64 79.767.8 HuGene st:708431-HuGene st:231770- B 05 88.5 HuGene st: 1065075-HuGenest:318917 HuGenest:719419-HuGenest:380366 uGenest:221119-HuGenest:774168 uGenest:223969_s_at:523432-HuGene st:45260 uGenest:963420-HuGenest:329703 HuGenest:223761-HuGenest:1036667 HuGenest:143113-HuGene st:414008 HuGene st:232543-HuGene st VSIG2 391438-HuGene _st:228232_ s_at:265221- VSIG2 0.0001 1.5745 2.4 78.466.3 HuGenest:343187-HuGenest:223925- 87.6 HuGenest:652010-HuGenest: 1069591 uGenest:265186-HuGenest: 1095607 uGenest:788466-HuGenest:675062 uGenest:826720-HuGenest: 1093892 uGenest:985113-HuGene st:637781 HuGene st:997596-HuGene st LOC253012 1014415-HuGenest:912640-HuGenest:463072- LOC253 0.0001 1.5764 5.28 78.566.5 HuGene st:749900-HuGenest:381523- 012 87.7 HuGenest:896826-HuGenest:804632 HuGenest:304834-HuGenest:39629 HuGene st:568837-HuGene st: 1070854 HuGene st: 162938-HuGene st: 1026726 HuGenest:441915-HuGene st:242601_at:51876 HuGenest: 199784-HuGenest:363568 HuGene st:50979-HuGene st:736612-HuGene st FCGBP 203240_at:88809-HuGenest:1085208- FCGBP 0.0003 1.4997 3.07 77.365.2 HuGenest:338164-HuGenest:71797- 86.7 HuGenest:1079875-HuGenest:63672 HuGene st:997435-HuGene st:226428 HuGene st:948120-HuGene st:22847 HuGenest:36995-HuGenest:508230 HuGene st: 16108-HuGenest:311904 HuGenest:426005-HuGenest:27264 HuGenest:516715-HuGenest:841000 HuGenest:70870 1 -HuGene st:781083 HuGene st:634071-HuGene st:948682-HuGene st PTGER4 204896_s_at:109314-HuGenest:986807- PTGER 0.0006 1.4713 1.55 76.964.7 HuGenest:446764-HuGene st:548266- 4 86.4 HuGenest:919403-HuGenest:965220 HuGene st: 192344-HuGene st: 1943 HuGene st:252442-HuGene st:392964 HuGenest:23686-HuGene st:204897_at:812085 HuGene st - 125 FAM3D 361373-HuGenest:963526- FAM3D 0.001 1.2699 2.15 73.761.3 uGenest:227676_at:741172-HuGenest:608385- 83.8 uGenest: 1074915-HuGenest: 182670 uGenest:24363-HuGenest:269456 uGenest:833883-HuGenest:280625 HuGenest:187246-HuGene st:964986 HuGene st:66738-HuGene st:80884-HuGene st CYP3A5 205765_at:67735-HuGenest:214234_s_at:941416- CYP3A 0.0012 1.5288 1.8 77.865.7 HuGene st:238807-HuGene st:611620-HuGene st 5 87 CYP3A5P2 205765_at:67735-HuGenest:214234_s_at:941416- CYP3A 0.0012 1.5297 1.8 77.865.6 HuGene st:238807-HuGene st:611620-HuGene st 5 87 SLITRK6 936650-HuGenest:921047-HuGenest:371185- SLITRK 0.0016 1.36 5.96 75.262.8 HuGene st:510939-HuGene st:356154- 6 85 HuGenest:205559-HuGene st:386297 HuGenest:483895-HuGenest:227644 HuGenest:458058-HuGene _st:23248 1_s_at:907326 HuGene st:255645-HuGenest:830582 HuGenest:615220-HuGenest:369624 HuGenest:78770-HuGenest:573187 HuGene st:1025122-HuGene st:584646-HuGene st KIAA1324 243349_at:226248_s_at:1052694-HuGene st:893707- KIAA13 0.0038 1.3258 2.1 74.662.2 HuGene_st:597814-HuGenest:585604- 24 84.5 HuGene st:240530-HuGene st HMGCS2 616916-HuGenest:283650- MGCS 0.0052 1.234 3.59 73.160.6 HuGenest:204607 at:43162-HuGenest:371076- 83.3 HuGenest:407398-HuGenest:900260 HuGene st:619083-HuGenest:729816 HuGenest:171889-HuGene st:253113 HuGene st:448587-HuGene st:593619 HuGenest:624250-HuGenest:865416 HuGenest:888308-HuGenest:397691 HuGene st:789982-HuGene st:658722-HuGene st CA12 215867_x_at:210735_s_at:204508_s._at:203963_at:638 CA12 0.0065 1.1823 1.86 72.359.7 145-HuGenest: 1013062-HuGenest:280470- 82.6 uGenest:214164_x_at: 875965-HuGene st:749017 HuGenest: 1049334-HuGenest:439438 HuGenest:65565-HuGenest:1017708 HuGenest:226479-HuGenest: 125567 HuGenest:698714-HuGene st:963408 HuGene st:189118-HuGene st:1049350-HuGene st ST6GALNAC 524915-HuGene st:694442-HuGenest:372293- ST6GA 0.0068 1.1407 1.76 71.658.9 1 HuGenest:247017-HuGenest:186959- LNACI 82 HuGene_ st:149798-HuGenest:611157 HuGenest:887450-HuGenest:227725_at:608767 uGene st:891399-HuGene st:95718 uGene st:780611-HuGenest:1006431 uGenest:266448-HuGenest:768462 uGenest:616368-HuGene st:66760 uGene st:1067993-HuGene-st:69927 HuGene st:104705-HuGene st - 126 CKB 200884_at:43 5665 -HuGene-st: 888623- CKB 0.0073 1.1468 2.25 71.759.2 HuGene st:480007-HuGene st:396007- 82.1 HuGene st:963331-HuGene st:40899 HuGene-st:373774-HuGene-st:25470 HuGene st: 16603-HuGene st:766369 HuGene-st:405417-HuGene-st:718720 HuGene st:1041238-HuGene st:23457 HuGene st:24890-HuGene st:769422 HuGene-st: 896708-HuGene st:58 1051I _________uGene st:549987-HuGene st:384 128-HuGene st LITD1 219955_at:842123-HuGene-st: 1095330- 1ITD1 0.0077 1.2497 3.27 73.460.9 HuGene st:1058922-HuGene st:572495- 83.5 HuGene-st:414266-HuGene-st:887612 HuGene st:799837-HuGene st: 177635 HuGene-st:715579-HuGene-st:709222 HuGene-st:968483-HuGene st: 1030986 _________uGene st:327643-HuGene st:540230-HuGene st ___ NR3C2 100953-HuGene-st:239673 at:981978-HuGene-st NRC2 0.012911.1951 1.631 72.559.9 _________ __________________________________ _________ ____82.7 MLPH 218211_-s-at MLPH 0.0138 1.2275 1.591 7360.5 ________________________________________________ ____ ____ 83.2 UGT1A1 208596_sat:22 1305_sat:204532_xat:232654_sat UGT1A 0.0376 1.0578 1.68 70.257.5 __________________________________3_ __ 80.8 MC4 NA MC4: 0.0453 1.1986 1.31 72.660.1 TAF5L: 82.8 L0C650 855 :LO C64574 KLF4 NA KLF4 0.0731 1.0018 1.58 69.256.5 CAPN9 NA CAPN9 0.074 0.9979 1.84 69.156.4 _________ __________________________________ _____79.9 SELENBP1 NA SELEN 0.0902 0.9777 2 68.856.1 __________~~BP 1_____________________ P1 7_9.6 ,TP8B1 NA ATP8B1 0.093 1.0347 1.4 69.857.2 _________ __________________________________ _________ ____80.4 WDR51B NA WDR51 0.1284 1.0004 1.28 69.256.5 ______ ________________________ _B_ __ __ 80 PBLD NA PBLD 0.1655 0.873 1 1.89 66.954.1 ____________~~~~ 1__________________________ ___ ________ 78 IDl A IDI 0.2342 0.7662 1.66 64.952.2 r_____ ______________________________ __ __ 76.2 CTSE NA CTSE 0.2809 0.7409 3.23 64.45 1.7 ____________~~~~~~~~~~~ _________________________.8 C6orflO 0 NA C6orflO0 0.7794 073 1.7 63.750.8 _________________________________5 ________ 75.1 - 127 TABLE 8 Gene ValidPSUP Symbol Signif. D.val5 FC Sens- CI (95) Symbol FDR Spec SPARC 677215-HuGenest:456544-HuGenest:1041514- SPARC 3.0479E- 2.3799 3.24 88.378.1 HuGenest:868223-HuGenest:1053181- 09 94.6 HuGenest:196416-HuGenest:1078055 HuGene st:213861-HuGene st:452422 HuGenest: 1078242-HuGenest:497056 HuGenest: 1285-HuGenest:793848 HuGenest:455765-HuGenest:72235 HuGenest:568912-HuGenest:887159 HuGene _st:255885-HuGenest:225477 HuGenest: 162094-HuGene st: 1026630 HuGene st:200665 s at COL4A2 540082-HuGenest:295949-HuGenest:1009411- COL4A2 5.5315E- 2.4297 2.87 88.878.6-95 HuGene st:245530-HuGene st:834880- 09 HuGene st:801333-HuGene st:500428 HuGene st:990355-HuGene st:469754 HuGene st:980150-HuGene st: 184258 HuGene st:580118-HuGenest:281233 HuGenest:732269-HuGene st: 1075712 HuGenest:643598-HuGenest:772366 HuGenest:211966_at:512520-HuGenest:973004 HuGenest:83963 1-HuGene st:211964_at:316644 HuGene st:1015712-HuGene st COL4A1 831741-HuGenest:634068-HuGenest:816476- COL4A1 1.243E- 2.3004 3.03 87.577-94.2 HuGenest: 1030276-HuGenest: 144637- 08 HuGenest:24196-HuGenest:272762 HuGenest:604042-HuGenest:392785 uGenest:782962-HuGene st:381555 HuGenest:576333-HuGenest:310901 HuGenest:237606-HuGenest:719873 HuGenest:914012-HuGenest:30564 HuGenest:828024-HuGenest:687375 HuGene st:211981 at:211980 at:812556 HuGene st:681655-HuGene st:233652 at SPON2 173417-HuGene_ st:845288-HuGenest:170898- SPON2 1.9471E- 2.3295 3.38 87.877.4 HuGenest:6411-HuGenest:981126- 08 94.3 HuGenest: 1098117-HuGenest: 1049111 HuGenest:1018834-HuGenest:300655 HuGenest:45524-HuGenest:256565 HuGenest:533681-HuGenest:346131 HuGenest:251693-HuGenest:937 110 HuGene_ st:347749-HuGene st:645379 HuGenest: 107844-HuGene st:396405 HuGene st:218638 s at:1569496 s at COL5A2 631247-HuGenest:59739-HuGenest:260654- COL5A2 5.7658E- 2.2164 3.31 86.676-93.5 HuGenest:125943-HuGenest:292068- 08 HuGenest:788391-HuGenest:408375 HuGenest:561511-HuGenest:838505 HuGenest:915389-HuGenest:684636 HuGene st:817279-HuGene st:514415 HuGene st:583557-HuGene st: 1009654 HuGenest:416341-HuGenest:260479 HuGene st:292241-HuGene st: 1034540 HuGene st:101178-HuGenest:221730 at:221729 at_ - 128 FSTL1 1020674-HuGenest:291086-HuGenest:964097- FSTL1 7.8434E- 2.2277 2.64 86.776.1 HuGene st:874354-HuGene st:745908- 08 93.6 HuGene st: 129330-HuGene st: 1039443 HuGenest:855602-HuGene st:494940 HuGenest:666633-HuGenest:10001 HuGenest:654906-HuGenest:322189 HuGenest:891629-HuGene st:225280 HuGene st:67871-HuGene st:965246-HuGene st BGN 722197-HuGenest:1090822-HuGenest:45294- BGN 9.6881E- 2.2246 2.99 86.776.1 HuGenest:381650-HuGenest:987820- 08 93.6 HuGene st:978124-HuGene st:171694 HuGene st:643594-HuGenest:429977 HuGene st: 1096605-HuGene st:861988 HuGenest:164623-HuGene st:201262_s_at:497499 HuGene st:422042-HuGene st: 1051954 HuGene st:262594-HuGene st:279145 HuGenest:838396-HuGenest:298959 HuGene st:1 1448-HuGene st: 111675 HuGene st:213905 x at:201261 x at:973749 HuGene st FAP 58645-HuGene st:581008-HuGene st:284881- IFIH1:FA 1.358E- 2.2253 4.88 86.776.1 HuGenest:1022038-HuGenest:916010- P 07 93.6 HuGenest:947725-HuGenest:477772 HuGenest:22772-HuGenest:636840 HuGenest:40379-HuGenest:993707 HuGenest:322964-HuGenest:759386 HuGenest:132558-HuGenest:600137 HuGenest:915847-HuGenest:246189 HuGenest:445095-HuGenest:389696 HuGenest:438119-HuGene st:748913 HuGene st:209955 s at COL8AI 333339-HuGenest:214587_at:586420- COL8AI 3.6053E- 2.1439 3.09 85.874.9-93 HuGenest:103513-HuGenest:651142- 07 HuGenest:617293-HuGenest:107398 HuGenest:79621-HuGenest:806363 HuGene st:327074-HuGene st:41199-HuGene st COL1Al 767019-HuGenest:654019-HuGenest:52480- COL1Al 5.4426E- 2.0142 3.29 84.373.1-92 HuGenest:975188-HuGene _st:731985- 07 HuGenest:849620-HuGenest:652845 HuGenest: 1077639-HuGenest: 149590 HuGene st:788754-HuGenest: 1033327 HuGenest:258815-HuGenest:719132 HuGenest:971026-HuGenest: 1061961 HuGenest:435403-HuGenest:498576 HuGene st: 1003153-HuGenest: 1029566 HuGenest:487433-HuGenest:165153 HuGenest:125258 HuGenest:217430_x_at:202310_s at:1556499_s_at: 202311 s at - 129 COL6A3 312177-HuGenest:24239-HuGenest:337032- COL6A3 5.7026E- 2.0313 2.73 84.573.5 HuGene st:31627-HuGene st:72819- 07 92.1 HuGenest:85665-HuGene st:51265 HuGenest:315258-HuGenest:423421 HuGene st:976272-HuGene st:627131 uGene st:547019-HuGene st:482866 HuGene st:871788-HuGenest:273133 HuGene st:945341-HuGenest:223275 HuGenest:768482-HuGenest:618342 HuGene st:993598-HuGene st:716493 HuGene st:153821-HuGene st:201438 at COL1A2 918394-HuGenest:75788-HuGenest:240155- COL1A2: 8.8466E- 2.0613 2.62 84.973.9 HuGene st:1079758-HuGenest:174439- LOC7286 07 92.3 HuGenest:133455-HuGenest:720682- 28 HuGenest:723785-HuGenest: 1076123 HuGene_ st:252002-HuGenest:733200 HuGene _st:890123-HuGenest:1070579 HuGenest:52384-HuGenest:928914 HuGenest:257376-HuGenest:355867 HuGenest:472073-HuGenest:349798 HuGenest:153814-HuGenest:825347 HuGene st:202404 s at:202403 s at THY1 149448-HuGenest:804495-HuGenest:287669- THYI 1.0326E- 1.9155 3.34 83.171.7 HuGenest:329513-HuGenest:432229- 06 91.1 HuGenest:834028-HuGenest:462769 HuGenest:408326-HuGenest:603473 HuGenest:497809-HuGenest:301090 HuGene st:606465-HuGene st:208850_s at:278686 HuGene st:804789-HuGenest:236645 HuGenest:257047-HuGenest:554409 HuGene st:108391 HuGenest:208851_s_at:213869_x at:908785 HuGene st:516398-HuGene st COL15AI 330621-HuGenest:910395-HuGenest:811192- COL15A 1.0611E- 2.0128 3.37 84.373.2-92 HuGene st:294092-HuGene st:958891- 1 06 HuGene st:407270-HuGene st:423090 HuGene st: 1079597-HuGenest:95084 HuGene st:775288-HuGene st:813875 HuGene st:1001637-HuGene st:49304 HuGenest:970691-HuGenest:346838 HuGenest:91483-HuGenest:528528 HuGenest:453998-HuGene st:254490 HuGene st:203477 at:201064-HuGene st CALDI 816439-HuGenest:318906-HuGenest:558226- CALDI 1.1059E- 2.1159 1.7 85.574.6 HuGene st:755661-HuGene st:519079- 06 92.8 HuGene st: 1094139-HuGenest:975649 HuGene st:407536-HuGenest:908350 HuGene st:576686-HuGene st:688034 HuGene st:792918-HuGene st:596642 HuGenest:797317-HuGenest: 1023569 HuGenest:450656-HuGene st:231881_at: 165931 HuGene st - 130 IGFBP7 363517-HuGenest:787689-HuGenest:86333- IGFBP7 1.1574E 1.9989 1.91 84.172.9 HuGene st:882867-HuGene st:620969- 06 91.8 HuGene st:167271-HuGene st:192752 HuGene st:521467-HuGene st:699448 HuGenest:882712-HuGene st:211325 uGenest:329940-HuGenest:980809 HuGenest:22805-HuGenest:288233 HuGene st:230028-HuGenest:551901 HuGene st:921418-HuGene st:232544_at:969620 HuGene st:985682-HuGene st INHBA 1088123-HuGenest:151327-HuGenest:363337- INHBA 1.1943E- 2.0051 3.38 84.273-91.9 HuGenest:874496-HuGenest:402595- 06 HuGenest:289422-HuGenest:838338 HuGenest:746887-HuGenest:623455 HuGenest:395552-HuGenest:865665 HuGene st:328796-HuGenest:1088331 HuGenest:751719-HuGenest:36929 HuGenest:548889-HuGenest:909672 HuGene_ st:210511_sat:344276 HuGene st:204926 at:112496-HuGene st COL12A1 905983-HuGenest:794749-HuGenest:1063560- TMEM30 1.2492E- 2.03 4.16 84.573.4 HuGenest:567963-HuGenest:140178- A:COL12 06 92.1 HuGenest:289954-HuGene_ st:620694- Al HuGenest:31736-HuGenest: 1009559 HuGene st:435472-HuGene st:415220 HuGene st:447689-HuGene st:405565 uGene st:180263-HuGene st:595404 HuGene st:759154-HuGene st:419264 HuGene st:319853 HuGenest:225664_at:23 1879_at:874183 HuGenest:913204 HuGenest:231766_s_at:234951_s at:1057284 HuGene st COL5A1 64984-HuGene st:815841-HuGene st:299983- COLSA1 1.6076E- 1.9335 2.75 83.3 72-91.3 HuGene st:751853-HuGene st:33784- 06 HuGene st:366345-HuGenest:63200 HuGene st:293998-HuGene st:544275 HuGenest:940629-HuGenest:383962 HuGenest:995286-HuGenest:640471 HuGenest:77586-HuGenest:212489_at:633141 HuGenest:203325_s_at:777727-HuGenest:400357 HuGenest:212488_ at:291267-HuGenest:285959 HuGene st:886716-HuGene st THBS2 789762-HuGenest:451028-HuGenest:807449- THBS2 2.3382E- 1.9035 6.5 82.971.6-91 HuGenest:895973-HuGenest:360384- 06 HuGenest:885374-HuGenest:178235 HuGene st:241784-HuGenest:774957 HuGenest:565405-HuGenest:684979 HuGenest:890746-HuGene st:588267 HuGenest: 1033680-HuGenest:387904 HuGenest:1096131-HuGenest:739113 uGenest:614690-HuGene st:270369 HuGene st:892858-HuGene st - 131 rNC 441837-HuGenest:278723-HuGenest:1030770- TNC 3.7414E- 1.9636 2.14 83.772.4 HuGenest:767997-HuGenest:514659- 06 91.6 HuGenest:873244-HuGenest:837401 HuGenest:621609-HuGenest:888816 HuGene st:660419-HuGene st:889974 HuGene st:780538-HuGene st:157747 HuGenest:908522-HuGene st:263796 HuGenest:280913-HuGene st:201645_at: 1016643 HuGenest:243540_at:537682-HuGenest:859608 HuGenest: 1079032-HuGene st:458991 HuGene st:241272 at VGP 148839-HuGenest MGP:C12 5.8225E- 1.9994 2.62 84.173-91.9 orf46 06 OL3A1 410491-HuGenest:718613-HuGenest:1083201- COL3A1 9.2906E- 1.8484 2.32 82.270.8 HuGenest:364429-HuGenest:857967- 06 90.5 HuGenest:714343-HuGenest:977793 HuGenest:295757-HuGenest:22453 HuGenest:804687-HuGenest:682411 HuGenest:490951-HuGenest:622631 HuGene st:782663-HuGenest:425013 HuGenest:214723-HuGenest:86407 HuGenest:651938-HuGenest:403765 HuGenest:376736-HuGene st:376290 HuGene st:248520-HuGene st SULF1 740854-HuGene st:517026-HuGene st:933799- SULF1 0.000014 1.7533 2.84 8169.2 HuGenest:72604-HuGenest:1045684- 436 89.6 HuGenest:492122-HuGenest:20084 HuGenest:512275-HuGenest:754502 HuGenest:730003-HuGenest:532406 HuGene st: 187508-HuGenest:671986 HuGenest:34752-HuGenest:381424 HuGenest:166879-HuGenest:530745 HuGene st:1018600 HuGene st:212344_at:212354_at:482916 HuGene st:212353 at TRA1 660603-HuGenest:252436-HuGene st:299850- HTRAl 0.000017 1.7312 2.29 80.768.9 HuGenest:317595-HuGenest:977390- 178 89.3 HuGenest:311841-HuGenest:946172 HuGenest:761215-HuGenest:577479 HuGenest:332527-HuGene st: 1082067 HuGene st:981054-HuGene st:5171 HuGene st: 1019008-HuGene st:757466 HuGene st:723015-HuGene st LOXL2 937941-HuGenest:827532-HuGenest:346541- ENTPD4: 0.000019 1.7774 2.45 81.369.6 HuGene _st:271051-HuGenest:228808_s at:254550- LOXL2 895 89.7 HuGenest:260752-HuGenest:46172 HuGenest:950166-HuGenest:281266 HuGenest:818138-HuGenest:626390 HuGenest:291104-HuGenest:84360 HuGenest:228823-HuGenest:966956 HuGenest:202997_s_at:898787 HuGene_st:202999_s_at:1074111 HuGenest:1064065 uGenest:202998_s_at:328673 HuGene st:1562263 at:34062-HuGene st - 132 SULF2 367910-HuGenest:661661-HuGenest:43751- SULF2 0.000020 1.6803 2.25 8068.1 HuGenest:395846-HuGenest:771921- 766 88.8 HuGenest:550105-HuGenest:195771 HuGenest:43 5480-HuGenest:229718 HuGenest:826084-HuGenest:262732 uGenest: 848382-HuGenest: 861858 HuGene st:907457-HuGene st:801956 HuGene st:54769-HuGenest:214147 HuGenest: 1034817-HuGene st:929065 HuGene st:233555 s at:430325-HuGene st MMP2 650123-HuGenest:78720-HuGenest:168982- MMP2 0.000025 1.7328 3.02 80.768.8 HuGenest:598990-HuGenest:346729- 409 89.3 HuGenest:356008-HuGenest:821467 HuGenest:905494-HuGenest:568028 HuGenest:711286-HuGenest:666575 HuGenest:208210-HuGene st:20913 HuGenest:382005-HuGene st:186287 HuGenest:355238-HuGenest:60728 HuGenest:1566677_at: 1027926-HuGene st:38019 HuGene st:39088-HuGene st TIMP3 686923-HuGenest:767021- TIMP3 0.000026 1.7785 2.56 81.369.6 HuGenest:231888_at:201148_s_at:326731- 195 89.8 HuGenest:72249-HuGenest:369765 HuGene st:711121-HuGene st:844721 HuGene st:856619-HuGene st:241221 HuGene st:515184-HuGene st:948668 HuGene st:410881-HuGene st COL6A2 404866-HuGenest:420794-HuGene_ st:536399- COL6A2 0.000029 1.7418 1.67 80.869-89.4 HuGenest:813295-HuGenest:160736- 551 HuGene st:501950-HuGene st: 148827 HuGene st:501480-HuGene st:375216 HuGenest:143566-HuGene st:1086501 HuGene st:475436-HuGene st:794188 HuGene st:704415-HuGene st:213290 at OC387763 724781-HuGenest:816467-HuGene st LOC3877 0.000032 1.8978 1.43 82.971.4 63 52 90.9 DKK3 573143-HuGene st:509897-HuGene st:829432- DKK3 0.000031 1.7435 1.86 80.869.1 HuGene_ st:425906-HuGenest: 1076690- 838 89.5 HuGenest:480668-HuGene st:302214 HuGenest:673860-HuGenest:584666 HuGene st:966441-HuGene st: 138862 HuGene st:19243-HuGenest:600102 HuGenest:778172-HuGenest:387542 HuGene st:202196_s at:230508_at:86457 HuGene st CTSK 592586-HuGenest:682010-HuGenest:30909- CTSK 0.000039 1.7156 2.16 80.468.8 HuGenest:235976-HuGenest:913861- 778 89.1 HuGenest:25198-HuGene_ st:96515 HuGene st:221416-HuGene st:341005 HuGene st:371949-HuGene st:796154 HuGenest:555578-HuGenest:416489 HuGenest:123013-HuGenest:50644 HuGenest:636682-HuGene st:63600 HuGene st:356075-HuGene st:321593-HuGene st - 133 LOC541471 19600-HuGenest:149287-HuGenest:916693- MGC467 0.000041 1.804 1.45 81.669.9-90 HuGenest:569637-HuGenest:868769- 7,MGC46 139 HuGenest:469295-HuGenest:277416- 77:LOC5 HuGenest:71551-HuGenest:1558836_at:129947- 41471 HuGenest:644672-HuGenest:276744 HuGenest:873188-HuGenest:830227 HuGenest:101170-HuGenest:624891 HuGene _st:85671-HuGene _st:236489 HuGenest:709116-HuGene st:279980 HuGene st:812194-HuGene st CHI3L1 561691-HuGenest:116044-HuGenest:261087- CHI3L1: 0.000051 1.6351 3.06 79.367.4 HuGene _st:57128-HuGenest:970931- MYBPH 515 88.3 HuGenest:518568-HuGenest:362438 HuGenest:388847-HuGenest:738096 HuGenest:209396_s_at:171378-HuGene st:968794 HuGenest:477315-HuGenest:852388 HuGenest:9037 1 -HuGenest:209395 at:657732 HuGenest: 1050574-HuGene st:1074 100 HuGene st:219902-HuGene st:744799-HuGene st CD93 392816-HuGenest:131787-HuGenest:900048- CD93 0.000051 1.6662 2.36 79.867.8 HuGenest:1024597-HuGenest:449159- 891 88.6 HuGenest:937408-HuGenest:835153 HuGene_ st:231835-HuGenest:1084191 HuGenest:970338-HuGenest:835439 HuGenest:345929-HuGene_ st:310915 HuGenest:381822-HuGene st:627382 HuGene st:731932-HuGene st COL11A1 372742-HuGenest:523307-HuGene _st:360139- COL11A 0.000053 1.7583 2.55 8169.3 HuGenest:64257-HuGenest:633985- 1 391 89.5 HuGenest:986684-HuGenest:2674 HuGenest:575064-HuGenest: 1023506 HuGenest: 198973-HuGenest:20543 HuGenest:254341-HuGenest:81512 HuGene st:549979-HuGene st:347332-HuGene st LBH 764120-HuGenest:189486-HuGenest:766856- LBH 0.000063 1.6753 1.9 79.967.9 HuGenest:1083711-HuGenest:541408- 299 88.7 uGenest:375190-HuGenest:938798 uGenest:133862-HuGene st:956722 HuGene st:97472-HuGene st:19277-HuGene st OLFML2B 1085957-HuGenest:354052-HuGenest:682043- OLFML2 0.000066 1.705 1.97 80.368.5-89 HuGenest:249054-HuGenest:867838- B 557 HuGenest:430882-HuGenest:704400 HuGenest: 1065400-HuGenest:588559 HuGenest:827842-HuGenest:272527 HuGene__st:134966-HuGenest:523921 HuGenest:895516-HuGenest: 1022747 HuGenest:529564-HuGene st:539568 HuGene st:427677-HuGene st:122501-HuGene st MMP1 524115-HuGenest:689073-HuGenest:300572- MMP1 0.000070 1.6771 3.91 79.968.1 HuGene st:914223-HuGenest:958445- 616 88.7 HuGenest:61706-HuGenest:437171 HuGenest:422476-HuGenest:361198 HuGenest:693724-HuGenest:671620 HuGenest:673683-HuGenest: 1020786 HuGenest:732367-HuGene st:710307 HuGene st:445730-HuGene st:622653 HuGene st:468477-HuGenest: 1070117 HuGene st:840324-HuGene st:473664-HuGene st -134 PALM2- 945487-HuGenest:454513-HuGenest:799637- AKAP2:P 0.000074 1.724 1.58 80.668.9 AKAP2 HuGene st:91787-HuGene st:448601- ALM2:P 206 89.2 HuGene st:839008-HuGenest: 1044350- ALM2 HuGenest:327679-HuGenest:665040- AKAP2 HuGene st:79769-HuGene st:216125 HuGene st:205232-HuGenest:818531 HuGenest: 169199-HuGene st:537891 HuGene st:667958-HuGene st:202760 s at FBN1 288955-HuGene st:112593-HuGene st:427611- FBN1 0.0001 1.5949 1.48 78.766.7 HuGenest:921008-HuGenest:82155- 87.8 HuGene st:828273-HuGenest:999573 HuGenest: 192029-HuGenest: 162818 HuGenest:79456 1 -HuGenest:316216 HuGenest:971683-HuGenest:394963 HuGenest:600623-HuGenest:169932 HuGene st:941525 HuGene st:202765_s at: 1094040 HuGene st:192231-HuGene st:235318 at MMP3 7912-HuGenest:438906-HuGenest:483424- MMP3 0.0001 1.6224 6.68 79.167.2 HuGenest:634624-HuGene st:191436- 88.1 HuGene _st:542596-HuGenest:664043 HuGene st:766524-HuGene st:134417 HuGene st:945713-HuGene st:149350 HuGenest:357556-HuGenest:206224 HuGenest:560206-HuGenest:822598 HuGenest:200029-HuGene st:701539 HuGene st:526573-HuGene st:35444-HuGene st ISLR 1034362-HuGene st:755902-HuGene st:864958- ISLR 0.0001 1.6424 1.87 79.467.4 HuGene _st:599962-HuGene st:207191_s_at:256349- 88.4 HuGenest:223953-HuGenest:172743 uGenest:46312-HuGenest:11475 HuGene st:500169-HuGene st COL6A1 138884-HuGenest:100156-HuGenest:694215- COL6A1 0.0001 1.7314 1.58 80.768.9 HuGenest: 133534-HuGenest:354247- 89.3 HuGene st:737256 HuGene st:214200_s_at:212939_at:216904_at:88421 0-HuGenest:57954-HuGenest: 138366 uGenest:175316-HuGenest:459478 HuGenest:190064-HuGenest:914102 HuGenest:392220-HuGenest:212937_s_at:125972 HuGenest:214513-HuGene st:528191 HuGenest:858575 HuGenest:212938_at:212091_ s at:212940_at:87147 2-HuGene st FNDC1 194684-HuGenest:358155-HuGenest:959100- FNDC1 0.0001 1.5241 3.14 77.765.6-87 HuGenest:187071-HuGenest:449250 HuGenest:879569-HuGenest:588958 HuGenest:162623-HuGenest:711103 uGenest:30750-HuGenest:34787 HuGene st:260399-HuGene st:418036-HuGene st - 135 AEBP1 479622-HuGenest:192126-HuGenest:209682- AEBP1 0.0002 1.4786 2.11 7764.8 HuGenest:285969-HuGene_ st: 1041745- 86.4 HuGenest:668627-HuGene st:913865 HuGenest:944753-HuGenest:446850 HuGenest: 11831 0-HuGene _st:342980 HuGenest:845095-HuGenest: 185307 HuGene st:901268-HuGene st:494284 HuGene st:201792_at:51739-HuGenest:262459 HuGene st MMP11 203878_s_at:213602_s_at:203877_at:324921- MMP11 0.0002 1.6993 1.28 80.268.3-89 uGenest:718792-HuGenest:514533 uGenest:720030-HuGenest:935371 uGenest:844472-HuGenest:548687 uGenest:519881-HuGenest:858774 HuGenest:943793-HuGenest:671549 HuGene st:127919-HuGene st:670295 HuGene st:235908_at:203876_s at:986917 HuGene st:155442-HuGene st:1036101-HuGene st IGFBP5 532497-HuGenest:823295-HuGenest:323206- IGFBP5 0.0002 1.5676 3.96 78.366.4 HuGene st:941332-HuGene st:723182- 87.5 HuGene st:301318-HuGenest:1007784 HuGene st:503864-HuGene st:492730 HuGene st:370183-HuGenest:867094 HuGene st:965814-HuGenest:778210 HuGene st:203425_s at:692986 HuGene st: 1006982-HuGenest:807775 HuGene st:1035577-HuGene st:158302 HuGenest:73598-HuGenest:1555997_s_at:231985 HuGene st:203424 s at:211958 at:203426_s at:334 486-HuGene st:383523-HuGene st PLAU 611756-HuGenest:246040-HuGenest:783863- PLAU 0.0003 1.4846 2.11 77.164.8 HuGene st: 1027344-HuGenest: 186016- 86.5 HuGenest:31176-HuGenest:692506 HuGenest:52541-HuGenest:205479 s_at:471777 HuGenest: 156254-HuGene_st:1085903 HuGenest:211668_s_at:131855-HuGenest:585614 HuGenest: 1076162-HuGene st:323438 HuGene st:410729-HuGene st CTHRC1 647908-HuGenest:235232-HuGenest:86783- CTHRC1 0.0003 1.4612 3.41 76.764.5 HuGenest:756615-HuGene st:779928- 86.2 HuGenest:385407-HuGene st:978256 HuGene st:607929-HuGene st:426340-HuGene st FNi 1002635-HuGene st:585884-HuGene st:338536- FNi 0.0003 1.4907 2.87 77.265-86.6 HuGene st:586400-HuGene st:445068 HuGenest:395670-HuGene st:19848 HuGenest:443264-HuGenest:662053 HuGenest: 144815-HuGenest: 103145 HuGenest:372804-HuGenest:652840 uGene _st:935697-HuGenest:484350 HuGenest:422572-HuGenest:814551 HuGenest:235724-HuGenest:214701_s_at:692191 HuGenest:132330-HuGenest:210495 x_at:639087 HuGenest:216442_x_at:211719_x at:214702_at:21 2464 s at:1558199 at - 136 RAB31 784521-HuGenest:805481-HuGenest:936297- RAB31 0.0003 1.4857 2.08 77.165-86.6 HuGenest:576193-HuGenest:321915 HuGenest:282618-HuGenest:930765 HuGene st:397177-HuGene st:861778 HuGene st:411822-HuGenest:240571_at:816751 HuGenest: 1028297-HuGene_st:291722 HuGenest:722575-HuGenest:904649 HuGenest:991710-HuGene st: 103228 1 HuGene st:217764 s at:252384-HuGene st LGALS1 540889-HuGenest:882911-HuGenest:289133- LGALSI 0.0004 1.4863 1.61 77.164.8 uGenest:775286-HuGenest: 179375- 86.5 uGenest:877452-HuGenest:697650 uGene _st: 1720-HuGenest: 107624 uGene st:810235-HuGene st: 190535 uGenest:529287-HuGene st:725969 HuGene _st:976579-HuGenest: 1075722 HuGenest:453861-HuGene st:216405_at:624673 HuGene st:724499-HuGene st ELOVL5 559433-HuGenest:422981-HuGenest:211457- ELOVL5 0.0004 1.542 1.86 7865.8 HuGenest:551828-HuGenest:412588- 87.2 HuGenest:912245-HuGenest:287185 HuGenest:103066-HuGenest:399004 HuGenest:528360-HuGenest:938564 HuGene st: 1000247-HuGene st:620190 HuGene st:336885-HuGene_ st:165821 HuGenest:214153_at:731356-HuGenest:601072 HuGene st:755480-HuGene st:8873 HuGene st:1567222_x_at:208788_at:1093316 HuGene st ANTXR1 203217-HuGenest:501276-HuGenest:799524- ANTXR1 0.0005 1.5167 2 77.665.4 HuGenest:85875-HuGenest:12099- 86.9 HuGenest:230429-HuGenest:368173 HuGenest:369297-HuGene st:227660_at:86683 HuGenest:682518-HuGenest:129277 HuGenest:827509-HuGenest:901442 HuGenest:408237-HuGenest:414321 HuGenest:330209 uGene st:234430 at:241549 at:755570 HuGene st:220093 at ACTA2 479447-HuGenest:183075-HuGenest:1070983- ACTA2 6.OOE-04 1.4242 1.75 76.263.8 HuGenest:548186-HuGenest:529704- 85.8 HuGene st:530590-HuGene st:630797 HuGenest:404109-HuGenest:237159 HuGene _st:90357-HuGenest:293590 HuGenest: 1078299-HuGenest: 120087 HuGenest:855050-HuGenest:757648 HuGenest:980364-HuGene st:785232 HuGene st:313690-HuGene st:670805-HuGene st IGFBP3 242422-HuGenest:1046998-HuGenest:585931- IGFBP3 0.0006 1.552 1.38 78.166-87.3 HuGenest:145671-HuGene st:71362 HuGenest:719466-HuGene st:412306 HuGenest:922332-HuGenest:491408 HuGenest:929740-HuGene st:276131 HuGene st:201952-HuGene st - 137 VIM 1093618-HuGenest:398387-HuGenest:837477- vIM 0.0008 1.4804 1.48 7764.9 HuGenest: 1079757-HuGenest:339807- 86.5 HuGene st:302722-HuGene st: 139661 HuGene st:436158-HuGene st:564251 HuGene st:234475-HuGene st:770617 HuGenest:155888-HuGenest:319318 HuGene st:5271 10-HuGene st:994975 HuGene st:192324-HuGene st ASPN 720939-HuGenest:96944- ASPN 0.0009 1.4011 2.75 75.863.5 HuGene st:224396_s_at:640448- 85.6 HuGenest: 1055545-HuGenest:447994 HuGenest:835521 -HuGenest:567513 HuGenest:546047-HuGene st:219087_at:673620 HuGene st HNT 291569-HuGenest:1099035-HuGenest:237480- HNT 0.001 1.425 2.26 76.263.8 HuGenest:222020_s_at:959054-HuGenest:941200- 85.8 HuGene st:715332-HuGenest:761810 HuGenest:673027-HuGenest:653559 HuGene_ st:241934_at:403764-HuGenest:719722 HuGene st:227566 at GJA1 210697-HuGenest:348617-HuGenest:100706- GJA1 0.0013 1.4083 1.63 75.963.7 HuGenest:202715-HuGenest:817684- 85.6 HuGene st:851453-HuGenest:317083 HuGene st:852817-HuGenest:150947 HuGene st:491742-HuGene st:612542 HuGene st:352375-HuGene st:684816-HuGene st MSN 77554-HuGenest:103225-HuGenest:1038630- MSN 1.40E-03 1.4927 1.36 77.265.1 HuGenest:555838-HuGenest:250445- 86.7 HuGenest:770842-HuGenest:240960_at:905037 HuGenest:693942-HuGene st:788076 HuGene st:102934-HuGene st:686613-HuGene st SFRP4 965138-HuGenest:512201-HuGenest:321583- SFRP4 0.0015 1.407 2.2 75.963.5 HuGenest:784119-HuGenest:4568- 85.5 HuGenest: 1073348-HuGenest:606783 HuGene st:203475-HuGene st:779727 HuGene st:236875-HuGenest:674172 HuGene st:204051 s at:1084814-HuGene st CTGF 418229-HuGene st:547481-HuGene st:661089- CTGF 0.0015 1.334 1.78 74.862.4 HuGenest:338622-HuGenest:353624- 84.6 HuGenest:466254-HuGenest:691815 HuGene st: 156933-HuGene st: 1031177 HuGene st:265466-HuGene st: 1079612 HuGene st: 1073900-HuGene st: 1080114-HuGene st FCGR3B 775320-HuGenest:483078-HuGene st:897088- FCGR3B 0.0026 1.232 2,14 73.160.5 HuGenest:891504-HuGenest:288407- 83.3 HuGenest:367930-HuGenest:512343 HuGenest:524627-HuGenest:290249 HuGenest:204007_at:993144-HuGenest: 1084295 HuGenest:606544-HuGenest: 160442 HuGenest:974942-HuGene st:582410 HuGene st: 117174-HuGene st LOXLI 244262-HuGenest:240421-HuGenest:999775- LOXL1 0.0027 1.4045 1.33 75.963.7 HuGene st 85.5 - 138 LUM 253522-HuGenest:646969-HuGenest:566007- LUM 0.0029 1.3426 2.18 74.962.5 HuGenest:629878-HuGenest:346266- 84.7 HuGenest:749731-HuGenest:848594 HuGenest:781980-HuGenest:838285 HuGenest:594339-HuGenest: 1095278 HuGenest:450762-HuGene st:205322 HuGenest:130751-HuGene st:183047 HuGene st:15672-HuGene st SERPINFI 837342-HuGenest SERPINF 0.0031 1.2108 1.76 72.860.2-83 1 MXRA5 319799-HuGene st:432137-HuGene st:408065- MXRA5 0.0034 1.2959 1.62 74.161.6 HuGenest:291182-HuGenest:549236- 84.1 HuGene st:217706-HuGenest:260019 HuGene st: 1079370-HuGenest:216837 HuGenest:607674-HuGenest:829671 HuGenest:348569-HuGenest:357055 HuGene st:394366-HuGene st:192313 HuGene st:124763-HuGene st POSTN 954838-HuGenest:153510-HuGene_ st:538299- POSTN 0.005 1.301 2.58 74.261.7 HuGenest:649547-HuGenest:779754- 84.3 HuGenest:106590-HuGenest:636479 HuGenest:874100-HuGenest:13993 HuGenest:417058-HuGenest:724846 HuGene st:588010-HuGene st:608772 HuGene st:210809_s_at:743877-HuGenest:713459 HuGenest:713707-HuGenest:753556 HuGene st:776445-HuGene st:388659-HuGene st TAGLN 613243-HuGenest:823959-HuGene _st:898711- TAGLN 0.0051 1.1811 2.19 72.359.6 HuGene _st:505391-HuGenest:26929- 82.6 HuGene st:458482-HuGene st:66114 HuGene st:304704-HuGene st:543865 HuGene st:226523 at:270642-HuGenest: 179049 HuGene st:279405-HuGene st:902958 HuGene st:931316-HuGene st NNMT 231559_at:588888-HuGenest:160284- NNMT 0.0054 1.2225 1.44 72.960.5 HuGenest:591616-HuGenest:9689 10- 83.2 uGenest:789017-HuGenest:218836 uGenest:775762-HuGene st:202238_s_at:95357 uGene st:1098599-HuGene st:793701-HuGene st IL8 471213-HuGenest:373132-HuGenest LOC7314 0.0062 1.6808 1.33 8068.1 67:C6orfl 88.7 42:CCDC 42:IF16:T RBV21 1:H2AFZ :RNF20:T RBV5 4:TRBC1 :TRBV3 1:TRBVI 9:TRBV7 -2:IL23A - 139 GREMI 651822-HuGenest:546941-HuGenest:639839- GREM1 0.007 1.3044 1.31 74.361.8 HuGenest:326978-HuGenest:593695- 84.2 HuGenest:127360-HuGenest:407017 HuGenest: 1081939-HuGenest:345681 HuGenest:292333-HuGenest:864620 HuGenest:2445-HuGenest:556201 HuGenest:569365-HuGene st:411208 HuGene st:330742-HuGene st CiS 947234-HuGenest:962144-HuGenest:567397- ciS 0.0088 1.3551 1.77 75.162.8 HuGenest:490300-HuGenest:459367- 84.9 HuGene st:1059233-HuGene st:915378 HuGene st:68256 1-HuGenest:881984 HuGenest:332123-HuGenest:37033 HuGenest:433793-HuGenest:564959 HuGenest:240802-HuGenest:245571 uGene st: 156493-HuGenest: 186313 HuGenest:866948-HuGenest:661278 HuGene st:665064-HuGene st:615340 HuGene st:981405-HuGene st: 127169 HuGenest:224321-HuGenest:426716 HuGenest:905688-HuGene st: 1009344 HuGene st: 184395-HuGene st TIMP2 310783-HuGene st:526357-HuGene st:965142- TIMP2 0.0097 1.1623 1.72 71.959.4 HuGenest:463158-HuGenest:563754- 82.3 HuGene st:914626-HuGene st: 106688 HuGene st: 1087828-HuGenest:740790 HuGene st:799495-HuGene_ st SPP1 87046-HuGenest:904296-HuGenest:809583- SPP1 0.0109 1.0677 3.91 70.357.6 HuGene st:44881-HuGene st:1568574 x at:459132- 80.9 HuGene st:963342-HuGene_st:909306 HuGenest:1070547-HuGenest:31302 HuGene st:975259-HuGenest:743585 HuGene st:530424-HuGenest:709614 HuGenest: 1022539-HuGene st:575437 HuGene st:809262-HuGene st:259921-HuGene st SCD 214388-HuGene st:1054693-HuGene st:433956- SCD:LO 1.27E-02 1.073 2.33 70.457.7-81 HuGene st:1088784-HuGene st:1019041- C651109: HuGene st:698268-HuGenest:64902- LOC6453 HuGenest:749928-HuGene st:713000- 13 HuGene st:562893-HuGene st MYL9 206106-HuGenest:608837-HuGenest:706186- MYL9 0.0131 1.1754 1.37 72.259.6 HuGenest:874422-HuGenest:543240- 82.4 HuGenest:676360-HuGenest: 117460 HuGenest:404703-HuGenest:979054 HuGenest:244149_at:467647-HuGenest:925187 HuGene st:173220-HuGene st:923797-HuGene st CD163 967100-HuGenest:180883-HuGenest:900552- CD163 0.0146 1.1667 2.19 7259.4 HuGenest:662901-HuGenest:217692- 82.3 HuGenest:74082-HuGene st:620892 HuGene st:899841-HuGene st:97726-HuGene st S100A8 508168-HuGenest:306748-HuGenest:473960- S100A8 0.0148 1.1001 3.17 70.958.3 HuGenest:98947-HuGenest:777038- 81.3 HuGenest:233365-HuGenest:671798 HuGenest:214328-HuGenest:214370_at:368299 HuGenest: 16960-HuGenest:627642 HuGene st:669207-HuGene st:75346 HuGene st:521635-HuGene st:215650 HuGene st:618819-HuGene st - 140 CYR61 709380-HuGenest:788428-HuGenest:385534- CYR61 0.015 1.093 1.69 70.858.2 HuGenest:924057-HuGenest:951763- 81.3 HuGenest:48239-HuGenest:462225 HuGenest: 789527-HuGene st: 673588 HuGene st:306375-HuGene st MAFB 883249-HuGenest MAFB 0.0182 1.1589 1.48 71.959.3 82.3 APOE 555956-HuGenest:99757-HuGene st APOE 0.0285 1.1117 1.43 71.158.6 81.5 DCN 570449-HuGene st:1067446-HuGene st DCN 0.0364 1.0819 1.29 70.658-81.1 GOS2 888942-HuGenest:346957-HuGene st:62728- GOS2 0.0383 0.9923 1.53 6956.3 HuGene st:334340-HuGene st:897401-HuGene st 79.9 CDHIl 60296-HuGenest:609892-HuGenest:167614- CDH1l 0.0384 1.01 1.98 69.356.7-80 HuGene st:184349-HuGenest:546348 HuGene st:960949-HuGenest:583244 HuGenest:599824-HuGenest:626011 HuGenest:96018-HuGene st:217538 HuGene st:180379-HuGene st SQLE 861626-HuGenest:198540-HuGene_ st:1010566- SQLE 0.041 1.0133 2.15 69.456.7 HuGenest:895693-HuGene st:967759- 80.1 HuGene st:894379-HuGene st:827097-HuGene st GPNMB NA GPNMB 0.0623 1.0155 1.52 69.456.8 80.1 KIAA1913 NA KIAA191 0.1061 0.9593 1.2 68.455.7 3 79.3 SFRP2 NA SFRP2 0.1336 0.9696 1.24 68.656-79.4 SMOC2 NA SMOC2 0.2175 0.6581 2.06 62.950-74.4 SERPINGI NA SERPIN 2.64E-01 0.7805 1.26 65.252.5 G 1 76.5 EFEMPI NA EFEMPI 0.4911 0.6831 1.27 63.450.5 74.9 TYROBP NA TYROBP 0.4923 0.7065 1.23 63.851.1 75.2 PLOD2 NA PLOD2 0.5151 0.7775 1.51 65.152.3 76.3 - 141 TABLE 9 SEQ Probe ID Set ID NO: Target Sequence caaggctgggccgggaagggcgtgggttgaggagaggctccagacccgcacgccgcgcgcacagagctctcagcgccgctccagccaca 1 200660 gcctcccgcgcctcgctcagctccaacatggcaaaaatctccagccctacagagactgagcggtgcatcgagtccctgattgctgtttccagaa at gtatgctggaaaggatggttataactacactctctccaagacagagttcctaagcttcatgaatacagaactagctgccttcacaaagaaccagaag gaccctggtgtccttgaccgcatgatgaagaaactggacaccaacagtgatggtcagctagatttctcagaatttcttaatctgattggtggcctag tatggettgccatgactccttcctcaaggctgtcccttcccagaagcggacctgaggaccc gttggttcaaacttttgggagcacggactgtcagttctctgggaagtggtcagcgcatcctgcagggcttctcctcctctgtcttttggagaaccagg 2 200665 gctcttctcaggggctctagggactgccaggctgtttcagccaggaaggccaaaatcaagagtgagatgtagaaagttgtaaaatagaaaaagtg s at gagttggtgaatcggttgttetttcctcacatttggatgattgtcataaggtttttagcatgttecteettttettcaccetccectttgttcttetattaatcaa - gagaaacttcaaagttaatgggatggtcggatctcacaggctgagaactcgttcacctccaagcatttcatgaaaaagctgcttcttattaatcataca aactctcaccatgatgtg aaaagcgaggtggccatgttatgctggtggttgccagggcctccaaccactgtgccactgacttgctgtgtgaccctgggcaagtcacttaactata aggtgcctcagttttecttctgttaaaatggggataataatactgacetacetcaaagggcagttttgaggcatgactaatgetttttagaaagcattttg 3 200832_ ggatccttcagcacaggaattcaagacetgagtattttttataataggaatgtecaccatgaacttgatacgtecgtgtgteccagatgetgtcatta s-at gtctatatggttctccaagaaactgaatgaatccattggagaagcggtggataactagccagacaaaatttgagaatacataaacaacgcattgcca cggaaacatacagaggatgcettttetgtgattgggtgggattttttccetttttatgtgggatatagtagttacttgtgacaagaataattttggaataatt tctattaatatcaactctgaagctaattgtactaatctgagat gcagcgggaacagagtaccctcttcaagccccggtcatgatggaggtcccagccacagggaaccatgagctcagtggtcftggaacagctcact 4 200903 aagtcagtccttccttagcctggaagccagtagtggagtcacaaagcccatgtgttttgccatctaggccttcacctggtctgtggacttatacctgtg s_at tgcttggtttacaggtccagtggttcttcagcccatgacagatgagaaggggctatattgaagggcaaagaggaactgttgtttgaattttcctgaga gcctggettagtgctgggccttctcttaaacctcattacaatgaggttagtacttttagtccctgt 5 201014 agtgttgatgggctctacttctgatcttggtcactgtgaaaaaatcaagaaggcctgtggaaattttggcattccatgtgaacttgagtaacatctg s at - gcataaaggaccagatgaaactctgaggattaaagctgagtatgaaggggatggcattcctactgtatttgtggcagtggcaggcagaagtaatg - gtttgggaccagtgatgtctgggaacactgcatatccagttatcagc agatctgtgcggttggcataaccaacttactaacagaatgtcccccaatgatggacactgagtataccaaactgtggactccattattacagttttga 6 201112 ttggtctttttgagttacccgaagatgataccattcctgatgaggaacattttattgacatagaagatacaccaggatatcagactgcctttcacagtt s at ggcatttgctgggaaaaaagagcatgatcctgtaggtcaaatggtgaataaccccaaaattcacctggcacagtcacttcacatgttgttaccgcc tgtccaggaagggttc tcagaaggtaggggccgtgtcccgcggtgctgactgaggcctgcttccccctccccctcctgctgtgctggaattcacagggaccagggccac cgcaggggactgtctcagaagacttgatttttccgtccctttttctccacactccactgacaaacgtccccagcggtttccacttgtgggttcaggtg 7 201195_ tttteaagcacaacecaccacaacaagcaagtgcattttcagtegttgtgettttttgttttgtgetaacgtettactaatttaaagatgetgteggcacca s-at tgtttatttatttccagtggtcatgctcagccttgctgctctgcgtggcgcaggtgccatgcctgctccctgtctgtgtccagccacgcagggcat cactgtgacgtcggccgaccaggctggacaccctctgccgagtaatgacgtgtgtggctgggaccttctttatttgtgttaatggtaacctgttac actgggetgggttgggtagggtgttctggc tctetctttctgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtettgtgettectcagacetttetegettetgagettggtggectgttecctccate 8 201261 tctccgaacctggcttcgcctgtccctttcactccacaccctctggccttctgccttgagctgggactgctttctgtctgtcggcctgcacccagcc x at ctgcccacaaaaccccagggacagcggtctccccagcctgccctgctcaggccttgcccccaaacctgtactgtcccggaggaggttgggagg - tggaggcccagcatcccgcgcagatgacaccatcaaccgccagagtcccagacaccggttttcctagaagcccctcacccccactggcccact ggtggctaggtctccccttatccttctggtccagcgcaaggaggggctgcttctgaggtcggtggctgttttcattaaagaaacac tacagatactctactacactcagcctettatgtgccaagtttttetttaagcaatgagaaattgetcatgttettcatettetcaaateatcagaggcegaa 9 201292 gaaaaacactttggctgtgtctataacttgacacagtcaatagaatgaagaaaattagagtagttatgtgattatttcagctcttgacctgtccctctg at -gctgcctctgagtctgaatctcccaaagagagaaaccaatttctaagaggactggattgcagaagactcggggacaacatttgatccaagattta aatgttatattgataaccatgetcagcaatgagetattagattcattgggaaatetccataatttcaatttgtaaaetttgttaagactgtetacattgtt atatg agcatcggagccattcattggagaaaacgttttgatcaaaatggagacttttgtagtcgtttcaaaagagcacctgagtcatgtgtattcccggnn nctttataaatgacccggtcaagttggtttcaaagtncgacaggcttgtctgtttactagctgcgtggccttggacgggtggctgacatctgtaaaga 10 201328 atcctcctgtgatgaaactgaggaatcgggtggccgggcaagctgggaagagcaaagccagnagctgcgctgcctcaatacccacaaaagac at catteccagtatacataagcacaggatgtttttetcaagagggatgtatttatcacttggacatetgtttataatataaacagacatgtgactgggaacat cttgctgccaaaagaatcctaggcagtggctcattgtatgtgaggttgaaccacgtgaaattgccaatattaggctggcttttattacaaagaagga gtttcatggggttcagcctaacagttatggaaactacagtcttataaaccattggcatg 201338 gatgtatgtcgctgtccaagagaaggctgtggaagaactatacaactgtgtttaatctccaaagccatatcttccttccatgaggaaagcegcc x at ttttgtgtgtgaacatgctggctgtggcaaaacatttgcaatgaaacaaagtctcactaggcatgctgttgtacatgatctgacaagaagaaaatg - aagctcaaagtcaaaaaatctcgtgaaaaacgggagtttggcctctcatctcagtggatatatcctcccaaaaggaaacaagggcaaggcttattt -142 tgtgtcaaaacggagagtcacccaactgtgtggaagacaagatgctctcgacagttgcagtacttacccttggctaagaactgcactgtttgtttaa aggactgcagaccaaggagtcgagctttctctca ggtctgttcttgaatcctctattaattactgtgtgtgagccagagggagctgtggtaagggttgggcccccagcctgtagggaactttctggactccc 12 201341_ actetttgaategatataggcatttggtctcactacttgaccattetcacctgtgaaacgteccacactttgaagcaaatacaattcacagcacagtac at acacaaaaaccttggcataagacagagaaggtcttcttattttgtgggctggttgctgtagaaacatataacaaagggcagccctccactttggta taattgtgtagcccettttctttgggcttgacacctgtcttgaataagagtgattagagctgcataatgtccctctctt gaggtcagatttggagcttctcattgcacgcggagattattattgcatcgggttccaagccaatgggaagcccgggggaggggtttggcatgagg aagcgttggttacagcagctgattggctgcagccaagactgtgaaaggataaagaggcgcgaggcggaattggggtctgctctaagctgcagca 13 201416_ agagaaactgtgtgtgaggggaagaggcctgtttcgctgtcgggtctctagttcttgcacgctctttaagagtctgcactggaggaactcctgccatt at accagctcccttcttgcagaagggagggggaaacatacatttattcatgccagtctgttgcatgcaggctttnnggcttcctaccttgcaacaaaata attgcaccaactccttagtgccgattccgcccacagagagtcetggagccacagtcttttttgctttgcattgtaggagagggactaagtgtagaga ctatgtegcttteetgagctaccgagagcgctcgtgaactggaatcaact gtaaaccacatettttttgcactttttataageaaaaacgtgcegtttaaaccactggatetatetaaatgcegatttgagttcgegacactatgtactg 14 201417 egtttttcattettgtatttgactatttaatectttctacttgtcgctaaatataattgttttagtettatggcatgatgatagcatatgtgtteaggtttatagetg at ttgtgtttaaaaattgaaaaaagtggaaaacatetttgtacatttaagtetgtattataataagcaaaaagattgtgtgtatgtatgtttaatataacatgae aggcactaggacgtctgcctttttaaggcagttccgttaagggtttttgtttttaaacttttttttgccatccatcctgtgcaatatgccgtgta gatgacttaccatgggatggggtccagtcccatgaccttggggtacaattgtaaacctagagttttatcaactttggtgaacagttttggcataatagt 15 201468 caatttctacttctggaagtcatctcattccactgttggtattatataattcaaggagaatatgataaaacactgccctcttgtggtgcattgaaagaaga s at gatgagaaatgatgaaaaggttgcctgaaaaatgggagacagcctcttacttgccaagaaaatgaagggattggaccgagctggaaaacctcctt - taccagatgetgactggcactggtggtttttgetetegacatatccacaatagetgacggetgggtgtttcagtttgcaaaatattttgttgeettcatett cactgcaattttgtgta ttcagacccagtaaetgtcegcagetgtetgetagtggttgtettaacatcgtagtectagtttgcatttttaaateccetetgtttaaaaggtttgtaaaa 16 201479 caaaaacaaaaaactaagtctgctcagtgaaatgctgtagaaccctaaataagtggtagaagagtgtcactgaattttgtctctgaattcagtataact at gagttttgtecatgetggtgtetgggttataggcetgatgggectggtagttttccatettgttetggcetagaggtcagtectttgeactteetcaaage ttgtgtacagtgetcacetaaatecatetgactacttgttectgtgccetettgtttaggctegtttacttttaaaaaatgaaattgttcattgetgggaga agaatgttgtaatttttacttattaaagtcaacttgttaagttttttatgtattcctgttgggttttcttgttg acaggaggaatgcaccacggcagctctccgccaatttetctcagatttccacagagactgtttgaatgttttcaaaaccaagtatcacactttaatgta catgggccgcaccataatgagatgtgagccttgtgcatgtgggggaggagggagagagatgtactttttaaatcatgttccccctaaacatggctgt 17 201506 taacccactgcatgcagaaacttggatgtcactgcctgacattcacttccagagaggacctatcccaaatgtggaattgactgcctatgccaagtcc at ctggaaaaggagcttcagtattgtggggctcataaaacatgaatcaagcaatecagcctcatgggaagtcctggcacagtttttgtaaagccttg acagctggagaaatggcatcattataagctatgagttgaaatgttctgtcaaatgtgtctcacatctacacgtggcttggaggttttatggggcctg tecaggtagaaaagaaatggtatgtagagcttagatttccctattgtgacagagcc aggaaactgctacttgtggacctcaccagagaccaggagggtttggttagctcacaggacttcccccaccccagaagattagcatccatactag 18 201563 actcatactcaactcaactaggctcatactcaattgatggttattagacaattccatttctttctggttattataaacagaaaatctttcctttctcattacc at agtaaaggetettggtatetttetgttggaatgatttetatgaacttgtettattttaatggtgggttttttttggtaagatttagacetaaategcateatg ccaacttgtgactttgagactattcatcaagaatgaggatatagtagccatgacatagcttgagctatagcctttaattccttactttggtatgggtgga gggtgagtttgaagaggttctgattttcttgtaacctggga gattccgccttgttggtctgaaattcatgcaagcttccgaagatcttctcaaggaacactacgttgacctgaaggaccgtccattctttgccggctgg tgaaatacatgcactcagggccggtagttgccatggtctgggaggggctgaatgtggtgaagacgggccgagtcatgctcggggagaccaac 19 201577 ctgcagactccaagcctgggaccatccgtggagacttctgcatacaagttggcaggaacattatacatggcagtgattctgtggagagtgcagag at aaggagatcggcttgtggtttcaccctgaggaactggtagattacacgagctgtgctcagaactggatctatgaatgacaggagggcagaccaca ttgcttttcacatccatttcccctccttcccatgggcagaggaccaggctgtaggaaatctagttatttacaggaacttcatcataatttggagggaag tettggagetgtgagttctccctgtacagtgttaccatc agaaaaccacacttctcataccttcactcaacacttccttccccaaagccagaagatgcacaaggaggaacatgaggtggctgtgctgggggcac 20 201601 cccccagcaccatccftccaaggtccaccgtgattaacatccacagcgagacctccgtgcccgaccatgtcgtctggtccctgttcaacaccctctt x at cttgaactggtgctgtctgggcttcatagcattcgcctactccgtaaagtctagggacaggaagatggttggcgacgtgaccggggccaggcct - atgcctccaccgccaagtgcctgaacatctgggccctgattctgggcatcctcatgaccattggattcatcctgttactggtattcggctctgtaacag tctaccatattatgttacagataatacaggaaaaacggggttactagtagccgcccatagcctgcaacctttgcactccactgtgcaatgctggcc gtagtgccactgttgttttggggggggetttttttettttteeggaaaateettaaacettaagatactaaggacgttgttttggttgtacttggaattcttag teacaaaatatatttgtttacaaaaatttctgtaaaacaggttataacagtgttaaagtetcagtttettgettggggaacttgtgtecctaatgtgttaga 21 201656 ttgctagattgctaaggagctgatacttgacagttttttagacctgtgttactaaaaaaaagatgaatgtcggaaaagggtgttgggagggtggtcaa at caaagaaacaaagatgttatggtgtttagacttatggttgttaaaaatgtcatctcaagtcaagtcactggtctgtttgcatttgatacatttttgtactaac tagcattgtaaaattatttcatgattagaaattacctgtggatatttgtataaaagtgtgaaataaattttttataaaagtgttcattgtttcgtaacacagcat t 22 201666 tcagggecaagttcgtggggacaccagaagtcaaccagaccacettataccagcgttatgagatcaagatgaccaagatgtataaagggttccaa at gccttaggggatgccgctgacatccggttcgtctacacccccgccatggagagtgtctgcggatacttccacaggtcccacaaccgcagcgagg agtttctcattgetggaaaactgcaggatggactcttgcacatcactacctgcagtttcgtggctccctggaacagcctgagcttagctcagcgccg - 143 gggcttcaccaagacctacactgttggctgtgaggaatgcacagtgtttccctgtttatccatcccctgcaaactgcagagtggcactcattgttgt ggacggacc gtgattttttccacaagatetgtaatgttatttccacttataaaggaaataaaaaatgaaaaacattatttggatatcaaaagcaaataaaaacccaatte 23 201925 agtctettctaagcaaaattgctaaagagagatgaaccacattataaagtaatctttggctgtaaggcattttcatctttccttcgggttggcaaaatattt s at taaaggtaaaacatgctggtgaaccaggggtgttgatggtgataagggaggaatatagaatgaaagactgaatcttcctttgttgcacaaatagagt - ttggaaaaagcctgtgaaaggtgtcttctttgacttaatgtctttaaaagtatccagagatactacaatattaacataagaaaagattatatattattttg aatcgagatgtccatagtcaaatttgtaa gagagcactctatttattgtactgtgaataatgatgaaggagagtggagtggcccaccacctgaatgcagaggaaaatctctaacttccaaggtcc 24 201926 accaacagttcagaaacctaccacagtaaatgttccaactacagaagtctcaccaacttctcagaaaaccaccacaaaaaccaccacaccaaatg s at ctcaagcaacacggagtacacctgtttccaggacaaccaagcattttcatgaaacaaccccaaataaaggaagtggaaccacttcaggtactacc - cgtcttctatctgggcacacgtgtttcacgttgacaggtttgcttgggacgctagtaaccatgggcttgctgacttagccaaagaagagttaagaaga aaatacacacaagtatacagactgttcctagtttcttagactta gtatgacaacccgggatcgtttgcaagtaactgaatccattgcgacattgtgaaggcttaaatgagtttagatgggaaatagcgttgttatcgccttg 25 202286 ggtttaaattatttgatgagttccacttgtatcatggcctacccgaggagaagaggagtttgttaactgggcctatgtagtagcctcatttaccatcgttt s_at gtattactgaccacatatgcttgtcactgggaaagaagcctgtttcagctgcctgaacgcagtttggatgtctttgaggacagacattgcccggaaac teagtetatttacttcagcttgccc tggcctacatggaccagcagactggcaacctcaagaaggccctgctcctccagggctccaacgagatcgagatccgcgccgagggcaacagc 26 202310 cgcttcacctacagcgtcactgtcgatggctgcacgagtcacaccggagcctggggcaagacagtgattgaatacaaaaccaccaagacctccc s at - gcctgcccatcatcgatgtggcccccttggacgftggtgccccagaccaggaattcggcttcgacgttggccctgtctgcttcctgtaaactccctc - catcccaacctggctccctcccacccaaccaactttccccccaacccggaaacagacaagcaacccaaactgaaccccctcaaaagccaaaaa atgggagacaatttcacatggactttggaaaatatttttttcctttgcattcatctctcaaacttagtttttatctttgaccaaccgaacatgacca gctccccatttttataccaaaggtgctacatctatgtgatgggtggggtggggagggaatcactggtgctatagaaattgagatgcccccccaggc 27 202311 cgaagtcttgtaattttttctgttttntnttttt tggaggatggattcaaggttt s at acatgggaggagagcgtgtgcggctccagcccagcccgctgctcactttccaccctctctccacctgcctctggcttctcaggcctctgctctccg - acctctctcctctgaaaccctcctccacagctgcagcccatcctcccggctccctcctagtctgtcctgcgtcctctgtccccgggtttcagagacaa cttcccaaagcacaaagcagtttttncccctaggggtgggaggaagcaaaagactctgtacctattttgt aacctgaaaacatcccagccaagaactggtataggagctccaaggacaagaaacacgtctggctaggagaaactatcaatgctggcagccagtt 28 202403 tgaatataatgtagaaggagtgacttccaaggaaatggetacccaacttgccttcatgcgcctgctggccaactatgcctctcagaacatcaccta s at cactgcaagaacagcattgcatacatggatgaggagactggcaacctgaaaaaggctgtcattctacagggctctaatgatgttgaacttgttgtg - agggcaacagcaggttcacttacactgttcttgtagatggctgctctaaaaagacaaatgaatggggaaagacaatcattgaatacaaaacaaata agccatcacgcctgcccttccttgatattgcacctttggacatcggtggtgctgaccaggaattctttgtggacattggcccagtctgtt actttcccatgagtgtgatecacattgttaggtgetgacetagacagagatgaactgaggtecttgttttgttttgttcataatacaaaggtgetaattaat 29 202404 agtatttcagatacttgaagaatgttgatggtgetagaagaatttgagaagaaatactcctgtattgagttgtategtgtggtgtattttttaaaaaatttga s-at tttagcattcatattttecatettattcccaattaaaagtatgcagattatttgcccaaagttgtectettettcagattcagcatttgttettgccagtetcatt ttcatcttcttccatggttccacagaagctttgtttcttgggca gcaacaaccgaaaatgcaccagccccaggtcctcggacaccgaggagaatgtcaagaggcgaacacacaacgtcttggagcgccagaggag 30 202431 gaacgagctaaaacggagcttttttgccctgcgtgaccagatcccggagttggaaaacaatgaaaaggcccccaaggtagttatccttaaaaaag s at ccacagcatacatcctgtccgtccaagcagaggagcaaaagctcatttctgaagaggacttgttgcggaaacgacgagaacagttgaaacacaa - acttgaacagctacggaactcttgtgcgtaaggaaaagtaaggaaaacgattccttctaacagaaatgtcctgagcaatcacctatgaacttgtttca aatgcatgatcaaatgcaacctcacaaccttggctgagtc ggaaacctctcagtgtcttgacatcaccctacccaggcggtgggtctccaccacagccactttgagtctgtggtccctggagggtggtttctga 31 202504_ ctggcaggatgaccttagccaagatattcctctgttccctctgctgagataaagaattcccttaacatgatataatccacccatgcaaatagctactgg at cccagctaccatttaccatttgcctacagaatttcattcagtctacactttggcattctctctggcgatggagtgtggctgggctgaccgcaaaaggtg ccttacacactgcccccaccctcagccgttgccccatcagaggctgcctcctccttctgattaccccccatgttgcatatcaggg atccccaactgtgacaagcatggcctgtacaacctcaaacagtgcaagatgtctctgaacgggcagcgtggggagtgctggtgtgtgaacccca 32 202718 acaccgggaagctgatccagggagcccccaccatccggggggaccccgagtgtcatctcttctacaatgagcagcaggaggcttgcggggtg at cacacccagcggatgcagtagaccgcagccagccggtgcctggcgcccctgccccccgcccctctccaaacaccggcagaaaacggagagt ______gettgggtggtg 33 202779 ccgaacgtgggcgccaatggcgagatctgcgtcaacgtgctcaagagggactggacggctgagctgggcatccgacacgtactgctgaccat s at aagtgcctgctgatccaccctaaccccgagtctgcactcaacgaggaggcgggccgcctgctcttggagaactacgaggagtatgcggctcgg - gcccgtetgtcacagagatccacgg ctacccttatgatgacccattttccctcatgaccgatcccaagctcatcatttggagccctgtgcgccgctcagatgtggcctggaactttgagagtt 34 202831_ cctcatagggccggagggagagcccttccgacgctacagccgcaccttcccaaccatcaacattgagcctgacatcaagcgcctccttaaagttg at ccatatagatgtgaactgctcaacacacagatctcctactccatccagtcctgaggagccttaggatgcagcatgccttcaggagacactgctgga cctcagcattccttgatatcagtcctcttagcagagcettgcctttecctetgcetgtttccttt ctggttggtgattcaact I I tgggetccaagacttgggtaagetctgggcetteacagaatgatggcacettectaaaccetcatgggtgg - 144 35 202833 gaagcgtttaggcatgtttaacatccagcactgtaagaagtgtcagtgggtactgtaatgaaatactgggcaatgccaccgccatttttcc s at tacctgatgaggggaaactacagcacctggaaaatgaactcacccacgatatcatcaccaagttcctggaaaatgaagacagaaggttgccag - cttacatttacccaaactgtccattactggaa gtacccagttaaattttcatttcagataaacaacaaataatttttagtataagtacattattgtttatetgaaagttttaattgaactaacaatcctagtttgat 36 202859 actcccagtcttgtcattgccagctgtgttggtagtgctgtgttgaattacggaataatgagttagaactattaaaacagccaaaatccacagtcaat x at attagtaatttcttgetggttgaaaettgtattatgtacaaatagattcttataatattatttaaatgactgcatttttaaatacaaggetttatatttaacttt aagatgtttttatgtgctctccaaattttttttactgtttctgattgtat gagaggaccaaccagaattccetttggacatttgtgtttttttgtttttttattttgtttgttttttettettettettetteettaaagacattaagctaaaggca actcgtacccaaatttccaagacacaaacatgacctatccaagcgcattacccacttgtggccaatcagtggccaggccaaccttggctaaatgga 37 202935_ gcagcgaaatcaacgagaaactggactttttaaaccctcttcagagcaagcgtggaggatgatggagaatcgtgtgatcagtgtgctaaatctctct s-at gectgtttggactttgtaattatttttttagcagtaattaaagaaaaaagtcetetgtgaggaatattetetattttaaatatttttagtatgtactgtgtatgatt cattaccattttgaggggatttatacatatttagataaaattaaatgetettatttccaacagctaaactactettagttgaacagtgtgccctagetttt sata ettgcaaccagagta aatgcaagcatgtgtcatccatatttctetacatcttctcttggagtgagggaggtacctggaggggatcagcccactgacagaccttaatcttaatt 38 202936_ actgetgtggetagagagtttgaggattgetttaaaaaagacagcaaaettttttataaaaaaagatatattaacagttttagaagtcagtaga s at ataaaatettaaagcactcataatatggcatecttcaatttgtataaaageagatetttttaaaaaagatacttctgtaacttaagaaacetggcata aatcatattttgtctttaggtaaaagctttggtttgtgttcgtgttttgtttgtttcacttgtttccctcccagccccaaaccttttgttctctccgtgaaattac cttt gccttccctgaatcagacaaccttttcaaatgggtagggaccatcatggagcagctggaacagtatatgaagacctgaggtataagttcgcta 39 202954 gagttccccagtggctacccttacaatgcgcccacagtgaagttcctcacgccctgctatcaccccaacgtggacacccagggtaacatatgcctg at gacatcctgaaggaaaagtggtctgccctgtatgatgtcaggaccattctgctctccatccagagccttctaggagaacccaacattgatagtcctt gaacacacatgctgccgagctctggaaaaaccccacagcttttaagaagtacctgcaagaaacctactcaaagcaggtcaccagccaggagcc ctgacccaggctgcccagcctgtccttgtgtcgtctttttaatttttccttagatggtctgtcctttttgtgattt gccagtcttgaccgggatgaggcccacagacaggttgtcatcagcttgtccattcaagccaccgagctcaccacagacacagtggagccgcg ctcttctccagtgacacgtggacaaatgcgggctcatcagcccccccagagagggtcaggccgaaccccatttetcctcttaggtcattttcag 40 202998 caaaettgaatatetagacetetettocaatgaaaccetecagtetattatagtcacatagataatggtgecacgtgttgatttggtgagetcagac s-at ttggtgettccetectccacaacccccacccettgtttttcaagatactattattatattttcacagacttttgaagcacaaatttattggcatttaatattgga catetgggccettggaagtacaaatetaaggaaaaaccaacccactgtgtaagtgacteatetteetgttgttecaattetgtgggttttgatteaacg gtgetataaccagggtectgggtgacagggegetcactgagcaccatgtgtcatcacagaca caggaaatagtcacteatcccactccacataaggggtttagtaagagaagtetgtetgtetgatgatggatagggggcaaatettttccetttetgtt aatagtcatcacatttctatgccaaacaggaacgatecataaettagtettaatgtacacattgcatttgataaaattaattttgttgtttcctttgaggtt 41 203083_ gategttgtgttgttttgetgcactttttacttttgegtgtggagetgtattcccgagacaacgaagegttgggatacttcattaaatgtagegactgte at aacagegtgcaggttttetgtttctgtgttgtggggtcaaccgtacaatggtgtgggaatgacgatgatgtgaatatttagaatgtaccatatttttgta aattaatatgtttctaaacaaatttategtataggttgatgaaacgtcatgtgttttgccaaagactgtaaatatttatttatgtgttcacatggtcaaaatt tcaccactgaaaccctgcacttagctagaacct atggtaacatgtatctttgccctgggtctgggtgggtccagtcagtctcagatttacaagcatttaggagcctaggtaaaagctgtagtattcttttaa aagttatatttatgacttgcaatgatagaaaactccttccaattaaatggcattttataatattatgtgtgtacttcacagtgttaaaaataccctcatacgtt 42 203124 attgcatttgatetteacagaaagtgcattttaaccagtactetgggtgcaataaataatatgtagaaatttaagtectccaattecagcatatccagtga s at gttttgacagtgtgtttatgtggaatgtttaaggatatacaattgtactatataaattggttcttgttcttcttaaatgtgacatgaaataattgtgctgetac attatactggaaattaacaggggaaaagggaagagctcttggctcccttgaggttctgctagtggtgttaggagtggttacaactgagcttttagtaa ccatttaaccg tgotaagtteaagttcgtaatgetttgaagtatttttatgetetgaatgtttaaatgttetcatcagtttettgccatgttgttaactatacaacctggetaaa 43 203213 gatgaatatttttetactggtattttaatttgacetaaatgtttaagcatteggaatgagaaaactatacagatttgagaaatgatgetaaatttatagga at gttttcagtaacttaaaaagctaacatgagagcatgccaaaatttgctaagtttacaaagatcaagggctgtccgcaacagggaagaacagttttg aaaaatgaactatettatttttaggtaggttttgaaagetttttgtetaagtgaattettatgcettggtcagagtaataaetgaaggagntgttatett I ggctttcgagtctgagtttaaaactacacattttgacatagtgtttattagcagccatc gtggccgtagcaacttggcggagacaggctatgagtctgacgttagagtggttgcttccttagcctttcaggatggaggaatgtgggcagtttgact 4 203256 tcagcactgaaaacctctccacctgggccagggttgcctcagaggccaagtttccagaagcctcttacctgccgtaaaatgtcaacctgtgtct S at - gggcetgggcetgetgtgactgacetacagtggactttetetetggaatggaacettettaggectcetggtgcaaettaattttttttttaatgetatett caaaacgttagagaaagttcttcaaaagtgcagcccagagtgtgggcccactggcgtctgatttctggtttccagaccccaatgcctcccat tcggatggatctctgcgtttttatactgagtgtgctaggttgccccttattttttattttccctgttgcgttgctatagatg agagtggtcttttcaacactcctccccctactccaccggacctcaaccaggacttcagtggatttcagcttctagtggatgttgcactcaaacgggct 45 203313 gcagagatggagcttcaggcaaaacttacagcttaacccattttcaagcaaaacagttctcagaaatgtcatgattgcggggtgaaggcaagag s at atgaattgattattttatatattttattaatatttgacatgggattgtaaaacagttctgttactgagatgtttaatggaatacagtcattcaag s-a aactataaacttaaagctactgtagaaacaaagggttttettttaaatgtttcttggtagattattcataatgtgagatggttcccaatatcatgtgatttt tttt~cetccecttccctttttttgttattttttcagactgtgcaatacttagagaacetatagcatetetcattcccatgtggaacaggatgcccacatact .145 gtctaatta gaagccaagggttaacccagcaagctacaaagagggtgtgtcacactgaaactcaatagttgagtttggctgtgttgcaggaaaatgattataact aaaagctctctgatagtgcagagacttaccagaagacacaaggaattgtactgaagagctattacaatccaaatattgccgtttcataaatgtaataa 46 203510 gtaatactaattcacagagtattgtaaatggtggatgacaaaagaaaatctgetctgtggaaagaaagaactgtctctaccagggtcaagagcatga at acgcatcaatagaaagaactcggggaaacatcccatcaacaggactacacacttgtatatacattcttgagaacactgcaatgtgaaaatcacgttt getatttataaaettgtecttagattaatgtgtetggacagattgtgggagtaagtgattettetaagaattagatacttgtcactgcctatacctgcagct gaaetgaatggtacttcgtatg gaagcaggtggaaacatgagcattcagtttcttggtacagtgtacaaggtgaatatcttaaccagacttgccgcagaattgaacaaatttatgtgga aaaagtgactgaggacacaagcagtgttctgcgttccccgatgcccggagtggtggtggcgtctctgtcaagcctggagacgcggtagcagaa 47 203860 ggtcaagaaatttgtgtgattgaagccatgaaaatgcagaatagtatgacagctgggaaaactggcacggtgaaatctgtgcactgtcaagtgga at gacacagttggagaaggggatctgctcgtggagctggaatgaaggatttataacctttcagtcatcacccaatttaattagccatttgcatgatgttt cacacacaattgattcaagcattatacaggaacacccctgtgcagctacgtttacgtcgtcatttattccacagagtcaagaccaatattctgccaaaa aatcaccaatggaaattttcatgatataaatacttgtactatagatgtacttctgctgtg tgccagcgactgtctcagactgggcagggaggtttggcatgacttaagaggaagggcagtettgggaccgctatgcaggtcctggcaaacct ggctgccctgtctcatccctgtccctcagggtagcaccatggcaggactgggggaactggagtgtccttgctgtatccctgttgtgaggttcettcc 48 203878 aggggctggcactgaagcaagggtgctggggccccatggccttcagccctggctgagcaactgggctgtagggcagggccacttctgaggt s at caggtcttggtaggtgcctgcatctgtctgcettctggctgacaatcctggaaatctgttctccagaatccaggccaaaaagttcacagtcaaatggg gaggggtattcttcatgcaggagacccaggcctggaggtgcaacatactcaatctgtcccaggcggatctctgaagccttttgca gcactgetatcctccaaagccattgtaaatgtgtgtacagtgtgtataaaccttcttttctt gactaatecatacacagttaacetaatgccaaataaatactggttaaataaatgtatggcacagaatataatttgactatcaagacttttagcataatga aaaaccctctctctatatatatatgtgtatatgaattatgtgggcattcttgatacttcaagttctagtttgaaaaaaatacataactaatttaattttacacaa 49 203895 aaatatttatgcagattttcagaatttcatatcaggaaatgacctttttatgtctgttaaatatcaaaacaatttgctacagtgttaattgcatggttttaa at gcctgctgtagttgagttgcagacagtgcatgaaaaagtattccgctgggaattgagccatgccaccaaagccaagaggagcgcatggaaaccc ggtagtctagaactaatcagattactgattttagggcacagcaccagatgaattgttgtatatgcttgtaaaaattgattctgtgtgttcctctgaacaaa gcgg gtccagcttgaacatctagaattcctagagaaacagaatgagcaggcgaaggagatgcagcagatggtgaaattggaagccgagatggaccgc 50 203896 agaccagcaacagtagtatgaaactccaaaatgcaaactgaagcagcaaacccacaaagcatcaaaagactcactcacaaaacttctgaacaca s at - aactccatggatgaaagctgtttattttgtttcctttatgtgtaaacaagatgatatctgaaaccagagagacttggaatgtctgactgactttatttaa - agcttgagtattgcatttccttggccaaacaaaaatagetacaaatccacaaaaatttactattccagtaaggcagagtecaaccattgataatacaac ttaaacatgtttgctataaaataccatcacaagtaaatgagcttggtgtgaacaactcttcctttgtgatgccttagg ggactagagcaacatcgtgctgcccaaaggactaacctatgcaaactagttcacattttagtggatgtcgcagttaatgtgtaataagacattatttcc 51 203961 cctgcataatgtacaacagcattgaaatgacacattaagcctagcatcacattgtatagtacagtcactcacaaacccttcaaggctaccctaatcatt at aacattaatatttgtttaaaagcaaatcaccgatttatctattgaaactacttaaatgacggcaaaccaggaatgacagatggctgtgtcagcaatgg tttaatgtgttccetgcaagtggtetectatgantagaaetgegttetcaaatgeactetetteagggtettaatattetgtgttttetetetgtatttgtaaaa cattataacacattaatttcctatctctacacatttgg gtcaaggcattgtatgttgcttctgtggttattattctgtgatgcttagactacttgaacccataaacttggaagaatctttgagcaaattttctcagttgtct 52 203962 gtatgactteagtatatteetgggaatgccataggattttttgtgettgataeatggtatecagtttgeatagtatcactietttgtaatecagttgetgttaa s at gaatgatgtactttaaaggaaaagagaaaactgcatcacagtcccattctccagtgtccatgcaatgaattgctgagcatttaggaagcagcaccaa - gtctattacaggcatggtgtgaaacttgatgtttgacctgtgatcaaaattgaaccattgtacagtttggcttctgtttgcttcaaaatatgtagaattgtg gttgatgattaatttgcgagactaactttgagagtgtaacagttttgaagaaaacattgaatgttttacaaatgaaggggcttcacggaatgttacaa 204051_ aaccagccagtcccaagaagaacattaaaactaggagtgcccagaagagaacaaacccgaaaagagtgtgagctaactagtttccaaagcgga 53 5s at - gacttccgacttccttacaggatgaggctgggcattgcctgggacagcctatgtaaggccatgtgccccttgccctaacaactcactgcagtgctct - tcatagacacatettgeagcattttettaaggetatgettcagtttttetttgtaagccatcacaagccatagtggtaggtttgccetttggtacagaag ttgtgttttgtcattagetetgccetttttaattaaatattttggttcatggaccaaagggtttacttgacaaatttgtgtgacagacteegaacaatteettta 54 204127_ etaegaagtataatttataaaataaaatatacecattttaagggtacagtttgatttttgaccagtgaaactatgateccaatcaaggtatagatgeogte at accccaaaaagttccctccatatccctttgcagtcagttcatccctaccctggcccagatgatcactgatcttgtcattatagatgagttttgccagttca agaatttaatggaatcagatattgtaagcattettgtgtaatacttcattetetetcattattgagattcatecatattgttgaatgttteactagttaatgtttat tgttcaatatttttgtatatacttttaaagcctatteacttgctgatggatett cgctctcgtttcattttctgcagcgcgccacgaggatggcccacaagcagatctactactcggacaagtacttcgacgaacactacgagtaccggc 204170 atgttatgttacccagagaactttccaaacaagtacctaaaactcatctgatgtctgaagaggagtggaggagacttggtgtccaacagagttagg 55 at - ctgggttcattacatgattcatgagccagaaccacatattcttctctttagacgacctcttccaaaagatcaacaaaaatgaagtttatctggggatgt - caaatetttttcaaatttaatgtatatgtgtatataaggtagtattcagtgaatacttgagaaatgtacaaatetttcatscatacetgtgcatgagetgtatt cttcacagcaacagagctcagttaaatgcaactgcaagtaggttactgtaagatgtttaagataaaagttcttccagtcagtttttctcttaagtgct ctcatggggactcctacccatttgatgggccaggaaacacgtggtcatgctttggcctgggacaggtctcggaggagatgctcacttcgatg 56 204259_ aggatgaacgctggacggatggtagcagtctagggattaacttcctgtatgctgcaactcatgaacttggccattctttgggtatgggacattcctct at gatcctaatgcagtgatgtatccaacctatggaaatggagatccccaaaattttaaatttcccaggatgatattaaaggcattcagaaactatatgga aagagaagtaattcaagaaagaaatagaaacttcaggagaacatccattcattcattcattggattgtatatcattgttgcacaatcagaattgataa -146 gcactgtteetccactccatttagcaattatgtcaccettttattgcagttggtttttgaatgtettteacteettttattggttaaactcctttatggtgtgac tgtgtettattccatctatgagctttgtcagtgcgcgtagatgt gaaaatgtaccttggtgccaccaacccattttgtgccacatgcaagttttgaataaggatgtatggaaaacaacgctgcatatacaggtaccatttag gaaataccgatgcctttgtgggggcagaatcacagacaaaagctttgaaaatcataaagatataagttggtgtggctaagatggaaacagggtg 57 204320 attcttgattcccaattctcaactctccttttcctatttgaatttctttggtgctgtagaaaacaaaaaaagaaaaatatatattataaaaaatatggtgtc at attetcatccatccaggatgtactaaaacagtgtgtttaataaattgtaattattttgtgtacagttetatactgttatctgtgtccatttccaaaaettgeac gtgtecetgaatteegetgactetaatttatgaggatgecgaaetctgatggcaataatatatgtattatgaaaatgaagttatgattteegatgacecta agtc gggtctgaatctagcaccatgacggaactagagacagccatgggcatgatcatagacgtcttttcccgatattcgggcaggagggcagcacg 58 204351 agaccctgaccaagggggagctcaaggtgctgatggagaaggagctaccaggcttcctgcagagtggaaaagacaaggatgccgtggataaa at ttgctcaaggacctggacgccaatggagatgcccaggtggacttcagtgagttcatcgtgttcgtggctgcaatcacgttgctgtcacaagtact ttgagaaggcaggactcaaatgatgccctggagatgtcacagattcctgcagagccatggtcccaggcttcccaaaagtgtttgttggcaattatt ccctaggctgagcctgctcatgtacc gtagctggacccacgaggaggaaccaggctactttccccagtactgaggtggtggacatcgtctctgccactcctgacccagccctgaacaaag cacctcaagtgcaaggaccaaagggggccctggcttggagtgggttggcttgctgatggctgctggaggggacgctggtaaagtgggtagg 59 204401_ cttggcccacctgaggccccaggtgggaacatggtcacccccactctgcataccctcatcaaaaacactctcactatgctgctatggacgacctcc at agctctcagttacaagtgcaggcgactggaggcaggactcctgggtccctgggaaagagggtactaggggcccggatccaggatttgggag gcttcagttaccgctggccgagctgaagaactgggtatgaggctggggcggggctggaggtggcgccccctggtgggacaacaaagagga accatttttcagctgc agaccaagacataccggcagatcaggttaaatgagttattaaaggaacattcaagcacagctaatattattgtcatgagtctcccagttgcacgaaa 60 204404 aggtgctgtgtctagtgctctctacatggcatggttagaagctctatctaaggacctaccaccaatcctcctagttcgtgggaatcatcagagtgtct at taccttctattcataaatgttetatacagtggacagccetccagaatggtacttcagtgectagtgtagtaacetgaaatettcaatgacacattaacate acaatggcgaatggtgacttttctttcacgatttcattaatttgaaagcacacaggaaagcttgctccattgataacgtgtatggagacttcggttttagt caattccatatc tatgattaaetetacctgcacactgtectattatattcattetttttgaaatgtcaaccccaagttagttcaatetggattcatatttaatttgaaggtagaatg 61 204470 ttttcaaatgttctccagtcattatgttaatatttctgaggagcctgcaacatgccagccactgtgatagaggctggcggatcaagaaatggcaat at - gagatcattgtgaaggcaggggaatgtatgtgcacatctgttttgtaactgtttagatgaatgtcagttgttatttattgaaatgatttcacagtgtgtggt eaacatttetcatgttgaaaetttaagaactaaaatgttetaaatatccettggacattttatgtetttettgtaaggcatactgcettgtttaatggtagtttta cagtgtttctggcttagaaca gaagaaetgtctattttctcagtcatttttaacctetagagtcactgatacacagaatataatettatttatacctcagtttgcatatttttttactatttagaat 62 204475 gtagccetttgtactgatataatttagttecacaaatggtgggtacaaaaagtcaagtttgtggettatggatatataggecagagttgeaaagat at cttttccagagtatgcaactctgacgttgatcccagagagcagcttcagtgacaaacatatcctttcaagacagaaagagacaggagacatgagtct ttgecggaggaaaagcagctcaagaacacatgtgcagtcactggtgtcaccctggataggcaagggataactctttaacaca agatgatggaccctggttatcccaaactgattaccaagaacttccaaggaatcgggcctaaaattgatgcagtcttctattctaaaaacaaatactact 63 204580 atttcttccaaggatctaaccaatttgaatatgacttcctactccaacgtatcaccaaaacactgaaaagcaatagctggtttggttgttagaaatggtg at taattaatggtttttgttagttcacttcagettaataagtatttattgcatatttgetatgtecteagtgtaccactacttagagatatgtatcataaaaataaa atetgtaaaccataggtaatgattatataaaatacataatatttttcaattttgaaaactetaattgtecattcttgettgactctactattaagtttgaaaata gttaccttcaaagcaagataattctatttgaagcatgctctgtaagttgcttcctaacatccttggactgagaaattatacttactttggcataactaaa tgccgtgctcccaaaacattttaaatgaaagtattggcattcaaaaagacagcagacaaaatgaaagaaaatgagagcagaaagtaagcatttcca gcctatetaatttetttagttttetatttgeetccagtgeagtecatttectaatgtataccagcetactgtactatttaaaatgetcaatttcagcaccgatg 64 204620 gecatgtaaataagatgatttaatgttgattttaatectgtatataaaataaaaagtcacaatgagtttgggcatatttaatgatgattatggagcettaga s-at ggtetttaatcattggtteggetgettttatgtagtttaggetggaaatggtttcacttgetetttgactgtcagcaagactgaagatggottttcetggac agctagaaaacacaaaatcttgtaggtcattgcacctatctcagccataggtgcagtttgcttctacatgatgctaaaggctgcgaatgggatctga tggaactaaggactccaatgtcgaactcttctttgctgc tgctctccagtgtacccatgatggaagtatcttgatagtacccaaagaactggtggcctcaggccacaaaaaggaaacccaaaagggaaagaga aagtgagaagaaactgaagatggactctattatgtgaagtagtaatgttcagaaactgattatttggatcagaaaccattgaaactgttcaagaattg 65 204702 tatetttaagtactgetacttgaataaetcagttaacgetgttttgaagettacatggacaaatgtttaggactaagatcacacttgtgggcaatetgg s-at gggagccacaacttttcatgaagtgcattgtatacaaaattcatagttatgtccaaagaataggttaacatgaaaacccagtaagactttccatttgg cagccatcetttttaagagtaagttggttacttcaaaaagagcaaacactggggatcaaattattttaagaggtatttcagttttaaatgcaaaatagcct tattttcatttagtttgttagcactatagtgagcttttcaaacactattttaatc cttcgttcgcagagcttttcagattgtggaatgttggataaggaattatagacctctagtagctgaaatgcaagaccccaagaggaagttcagatctt aatataaattcactttcatttttgatagctgtcccatctggtcatgtggttggcactagactggtggcaggggcttctagctgactcgcacagggattct 66 204855 cacaatagccgatatcagaatttgtgttgaaggaacttgtctcttcatctaatatgatagcgggaaaaggagaggaaactactgcctttagaaaatat at aagtaaagtgattaaagtgctcacgttaccttgacacatagtttttcagtctatgggtttagttactttagatggcaagcatgtaacttatattaatagtaat ttgtaaagttgggtggataagetatecetgttgecggttcatggattacttetetataaaaaatatatatttaccaaaaaattttgtgacatteetteteccat ctcttccttgacatgcattgtaaataggttcttcttgttctgag 67 204885- tttccagaacatgaacgggtcgaatacttcgtgaagatccagtcttctgggtggggcccccacggaggatttgaaggcgctcagtcagcaga -147 s at atgtgagcatggacttggccacgttcatgaagctgcggacggatgcggtgctgccgttgactgtggctgaggt gtctggcacaccatggatgacaatgaagaaaatttggatgaatcaaccattgacaatctaaacaaaatcctacaagtctttgtgttggaatatttcatt tgtaatactctgatttagtttaggataattggttctagaattgaattcaaaagtcaaggcatcatttaaaataattgatttcagacaaatgctgtgtggaa 68 205174 acatctatectatagateatectattettatgtgtetttggttatcagatcaattacagaataattgtgttgtgatattgtgtectaaattgetcattaatttttatt s at tacagattgaaaaagaggcaccgtgtaaagaaaatggcaaaataaatatctttccaaggatcatcatcacgatagctaaacagtacttaaatagcgg ttggaactaggtagcetttegaattttatgattttcatatgtggaaatetattacatgtaatacaaaacaaacatgtagtttgaaggcggtcagatttcttt gag aaggcggctgcagaagatgtcaatgttactttcgaagatcaacaaaagataaacaaatttgcacggaatacaagtagaatcacagagtgaagga 69 205361 agaaatagaagtaaaaaagaaacaactccaaaacctagaagatgcttgtgatgacatcatgcttgcagatgatgattgcttaatgataccttatcaaa s_at ttggtgatgtcttcattagccattctcaagaagaaacgcaagaaatgttagaagaagcaaagaaaaatttgcaagaagaaattgacgccttagaat cagagtggaatcaattcagcgagtgttagcagatttgaaagttcagttgtatgcaaaattcgggagcaacataaaccttgaagctg cctcagcctgatcaggcttcctggtgagaactgaggagcggactcacttgatgtttcctggaagcagagcaaaagttctcttgtccctgtcgcgtct 70 205366 cattttgtccatgtcccccgtgcacggttcaatggtagattcgctgtcctcagcgggggccttgaagactccctgatccagacctggtgtctctcc s_at caccccctccccaaagccactggaaggagcacatactacctagaagtaagaagaggagcctcagaagaaaacaaagttctattttattaattttcta tgtgttgtgtttgtagtcttgtcttagctctggacg gccaacatcaccatcattgagcaccagaagtgtgagaacgcctaccccggcaacatcacagacaccatggtgtgtgccagcgtgcaggaaggg ggcaaggactcctgccagggtgactccgggggccctctggtctgtaaccagtctcttcaaggcattatctcctggggccaggatccgtgtgcgat 71 205470 cacccgaaagcctggtgtctacacgaaagtctgcaaatatgtggactggatccaggagacgatgaagaacaattagactggacccacccaccac s-at agcccatcaccctccatttccacttggtgtttggttcctgttcactctgttaataagaaaccctaagccaagaccctctacgaacattctttgggcctct ggactacaggagatgctgtcacttaataatcaacctggggttcgaaatcagtgagacctggattcaaattctgccttgaaatattgtgacttgggaa tgacaacacctggtttgttctctgttgtatccccagccccaaagacagctcctggccatatatca ttcacacggcagctggccaatgaaggctgtgacatcaatgctatcatctttcacacaaagaaaaagttgtctgtgtgcgcaaatccaaaacagactt 72 205476 gggtgaaatatattgtgcgtctcctcagtaaaaaagtcaagaacatgtaaaaactgtggcttttctggaatggaattggacatagcccaagaacaga at - aagaaccttgctggggttggaggtttcacttgcacatcatggagggtttagtgcttatctaatttgtgcctcactggacttgtccaattaatgaagttgat teatattgcatcatagtttgetttgtttaageatcacattaaagttaaactgtattttatgttatttatagetgtaggttttetgtgtttagctatttaatactaattt tccataagctattttggtttagtgcaaagtataaaattatatttgggggggaataagattatatggactttcttgcaagcaacaa cccgaccggtgggcatttgtgaggcccatggttgagaaatgaataatttcccaattaggaagtgtaagcagctgaggtctcttgagggagttag caatgtgggagcagcggtttggggagcagagacactaacgacttcagggcagggctctgatattccatgaatgtatcaggaaatatatatgtgtgt 73 205479 gtatgtttgcacacttgttgtgtgggctgtgagtgtaagtgtgagtaagagctggtgtctgattgttaagtctaaatatttccttaaactgtgtggactgtg s-at atgccacacagagtggtctttctggagaggttataggtcactcctggggcctcttgggtcccccacgtgacagtgcctgggaatgtacttattctgca gcatgacetgtgaccagcactgtetcagtttcactttcacatagatgtccctttettggecagttatccetteetttagcetagttcatecaatectcactg ggtggg9 aacaaagactcttettgegtetetgettcaggtaacttcaacateteegetgatgagcctataactgtgacacctcctgactcacaatcatatatetecg 74 205513_ teaattactetgtgagaatcaatgaaacatatttcaccaatgtcactgtgetaaatggttetgtetteetcagtgtgatggagaaagcccagaaaatga at atgatactatatttggtttcacaatggaggagcgctcatgggggccctatatcacctgtattcagggcctatgtgccaacaataatgacagaacctac tgggaacttctgagtggaggcgaaccactgagccaaggagctggtagttacgttgtccgcaatggagaaaacttggaggttcgctggagcaaat actaataagcccaaactttcctcagctgcataaaatccatttgcagtggagttccatgtttattgtccttatgccttct caaacgtattggcaggcgaaccccttccgtgctgtggccgagcctggcatccaactcaaggctgtgaagtttccacaggccccggggaacag 75 205713 tgcggaacgctctgtggcatacaggagacacagagtcccaggtgcggctgctgtggaaggacccgcgaaacgtgggttggaaggacaagaa s at gtcctatcgttggttcctgcagcaccggccccaagtgggctacatcagggtgcgattctatgagggccctgagctggtggccgacagcaacgtg - gtcttggacacaaccatgcggggtggccgcctgggggtcttctgcttctcccaggagaacatcatctgggccaacctgcgttaccgtgcaatga caccatcccagaggactatgagacccatcagctgcggcaagcctagggacc caccacctacctatgatgccgtggtacagatggagtaccttgacatggtggtgaatgaaacacfcagattattcccagttgtattagacttgagagg acttgcaagaaagatgttgaaatcaatggggtattcattcccaaagggtcaatggtggtgattccaacttatgctcttcaccatgacccaaagtactg 76 205765_ gacagagcctgaggagttccgccctgaaaggttcagtaagaagaaggacagcatagatccttacatatacacaccctttggaactggacccaga at aactgcattggcatgaggtttgctctcatgaacatgaaacttgctctaatcagagtccttcagaacttctccttcaaaccttgtaaagaaacacagat cccttgaaattagacacgcaaggacttcttcaaccagaaaaacccattgttctaaaggtggattcaagagatggaaccctaagtggagaatgagtta ttctaaggacttctactttggtcttcaagaaagctgtgccccagaacaccagagatttcaacttagtca tgctatgccttgtttttgtcaccaaaatcctggacagatgcagatctggcctgccagaagcggccctctggaaacctggtgtctgtgctcagtgggg ctgagggatccttcgtgtcctccctggtgaagagcattggtaacagctactcatacgtctggattgggctccatgaccccacacagggcaccgag 77 205815_ ccaatggagaaggttgggagtggagtagcagtgatgtgatgaattactttgcatgggagagaaatccctccaccattcaagccccggcactgt at gcgagcctgtcgagaagcacagcatttctgaggtggaaagattataactgtaatgtgaggttaccctatgttgcaagttcactgactagtgcagga gggaagtcagcagcctgtgtttggtgtgcaactcatcatgggcatgagaccagtgtgaggactcaccctggaagagaatattcgcttaattccccc aacctgaccacctcattcttatctttcttctgtttcttcctccccgctgtcatttcagtctcttcattttgtc 205825 tttccattcccaatctagtgctagatgtataaatctttcttttgattcttcctaacaaaatattttctgggttaaaaccccagccaatcattgggttgtagcc 78 at -aaaggtteacteteaagaagettaatatttaaataaaatcatattgaatgtttecaacetggagtataatatteagatataaaacagttttgtcagtetttet I tagtgeetgtgtggatttttgtgaaaatgtcaaagagaaaaettatatactatttccttgaaattttaaactatattttettacaggtatttataatatacca . 148 atgettttatcaaacagaattttaaagagcataataaattatattaaagaaccaaaagttttcctgagaataagaaagtttcacccaataaaatatttttga aaggcatgttcctetgtcaatgaaaaaaagtacatgtatgtgttgtgatattaaaagtgacatttgtetaatagcetaatacaacatgtagetgagtttaa catgtgtggtcttg gaaaatcgatgcagccatttctgataaggaaaagaacaaaacatatttctttgtagaggacaaatactggagatttgatgagaagagaaattccatg gagccaggctttcccaagcaaatagctgaagactttccagggattgactcaaagattgatgctgtttttgaagaatttgggttcttttatttctttactgga 79 205828 tcttcacagttggagtttgacccaaatgcaaagaaagtgacacacactttgaagagtaacagctggettaattgttgaaagagatatgtagaaggca at caatatgggcactttaaatgaagctaataattcttcacctaagtctctgtgaattgaaatgttcgttttctcctgcctgtgctgtgactcgagtcacactca agggaaettgagegtgaatctgtatettgccggtcatttttatgttattacagggcatteaaatgggetgetgettagettgcacettgtcacatagagt gatctttcccaagagaaggggaagcactcgtgtgcaacagac cacccaggcggagggtgccttcgtggcctcactgattaaggagagtagcactgatgacagcaatgtctggattggctccatgacccaaaaaag 80 205886 aaccgccgctggcactggagtagtgggtccctggtctcctacaagtcctgggacactggatccccgagcagtgctaatgctggtactgtgaag at - cctgacttcatgctcaggattcaagaaatggaaggatgaatcttgtgagaagaagttctcctttgtttgcaagttcaaaaactagaggaagtgaaaa atggatgtetagaactggtectgcaattactatgaagtcaaaaattaaactagactatgtetccaactcagttcagaccatetectccctaatgagtttg catcgctgatcttcagtaccttc gatcttaaagccacggagaagcctctcatcttatggcattgacaaagagaagaccatccaccttaccctgaaagtggtgaagcccagtgatgagg agctgcccttgtttcttgtggagtcaggtgatgaggcaaagaggcacctcctccaggtgcgaaggtccagctcagtggcacaagtgaaagcaatg 81 205890_ atcgagactaagacgggtataatccctgagacccagattgtgacttgcaatggaaagagactggaagatgggaagatgatggcagattacggca s at tcagaaagggcaacttactcttcctggcatcttattgtattggagggtgaccaccctggggatggggtgttggcaggggtcaaaaagcttatttctttt aatctcttactcaacgaacacatcttctgatgatttcccaaaattaatgagaatgagatgagtagagtaagatttgggtgggatgggtaggatgaagt atattgcccaactctatgtttctttga 82 205910 s at gactccaaggaagctcagatgcctgcagtcattaggttttagcgtcccatgagccttggtatcaagaggccacaagagtgggaccccaggggtc tccacacacggccaggcctgtttatctacactgctgcccactcctctctccagctccacatgctgtacctggatcattctgaagcaaattccgagcatt acatcatttgtecataaatatttetaacateettaaatatacaateggaatteaagcateteccattgteccacaaatgtttggetgttttgtagttggatt 83 205927_ gtttgtattaggattcaagcaaggeccatatattgcatttatttgaaatgttgtaagtctctttcattacagagtttagcacatttgaacgttgtggtt s-at gaaatcccgaggtgtcatttgacatggttctctgaacttatctttcctataaaatggtagttagatctggaggttgattttgtggcaaaaatacttctag gtggtgctgggtacttcttgttgcatcctgtcaggaggcagataatgctggtgcctctctattggtaatgttaagactgctgggtgggtttggagtttt ggc atactatttttcataccacgtgcatgtgaaagggactcatgtttgggtaggcctgtataagaatggcacccctgtaatgtacacctatgatgaatacac 84 205941 caaaggctacctggatcaggcttcagggagtgccatcatcgatctcacagaaaatgaccaggtgtggctccagcttcccaatgccgagtcaaatg s at gcetatactcetetgagtatgtecactcctetttetcaggattcctagtggetccaatgtgagtacaccccacagagetaatetaaatettgtgetagaa - aaagcattttaaetctaccccaccctacaaaatgcatatggaggtaggetgaaaagaatgtaatttttatttctgaaatacagatttgagetatcag accaacaaaccttccccctgaaaagtgagcagcaacgtaaaaacgtatgtgaagcctctcttgaa gccgaccatctggatcacatcaaggaggtggcaggagccagagccgtgggttttggtggggactttgatggtgttccaagggtccctgaggggc tggaggacgtctccaagtatccagacctgatcgctgagctgctcaggaggaactggacggaggcggaggtcaagggcgcactggtgacaac 85 205983 ctgctgagggtcttcgaggctgtggaacaggccagcaacctcacacaggctcccgaggaggagcccatcccgctggaccagctgggtggctc at ctgcaggacccattacggctactcctctggggcttccagcctccatcgccactgggggctcctgctggcctccctcgctcccctggtcttgttg tctctcctgtgaaacctgggagaccagagtcccctttagggttcccggagctccgggaagacccgcccatcccaggactccagatgccaggag cctgctgcccacatgcaaggaccagcatctcctgagaggacgcctgggcttacctggggggcaggatgcetggggacagttag ggaggataggataatcccgggtggcatctataacgcagacctcaatgatgagtgggtacagcgtgcccttcacttcgccatcaggagtataaca 86 206224_ aggccaccaaagatgactactacagacgtccgctgcgggtactaagagccaggcaacagaccgttgggggggtgaattacttcttcgacgtaga at ggtgggccgaaccatatgtaccaagtcccagcccaacttggacacctgtgccttccatgaacagccagaactgcagaagaaacagttgtgctcttt cgagatctacgaagttccctgggagaacagaaggtccctggtgaaatccaggtgtcaagaatcctagggatctgtgccag gagacgtggtaagtgcggtgcagttttcaactgacctctggacgcagaacttcagccatgaaggtaacaggcatttttttcagtgcttggccc 87 206239_ tgttgagtctatctggtaacactggagctgactccctgggaagagaggccaaatgttacaatgaacttaatggatgcaccaagatatatgacctgt s at ctgtgggactgatggaaatacttatcccaatgaatgcgtgttatgttttgaaggtcggaaacgccagactttatcctcattaaaaattgggcttg ctgagaaccaaggttttgaaatcccatcaggtcaccgc gtggaccttagaatacagttttgagtagagttgatcaaaatcaattaaaatagtctctttaaaaggaaagaaaacatctttaaggggaggaaccagag tgctgaaggaatggaagtccatctgcgtgtgtgcagggagactgggtaggaaagaggaagcaaatagaagagagaggttgaaaaacaaaatg 88 206286 ggttacttgattggtgattaggtggtggtagagaagcaagtaaaaaggctaaatggaagggcaagtttccatcattatagaaagtatataagaca s-at agaaetcccetttttttcccaaaggcattataaaaagaatgaagcctccttagaaaaaaaattatacctcaatgteccccaacaagattgettaataaatt gtgtttcetccaagetattcaattcttttaactgttgtagaagacaaaatgttcacaatatatttagttgtaaaccaagtgatcaaactacatattgtaaage ccattttt aagtetgtagtetttatgatcetaaaagggaaaattgccttggtaaetttcagatteetgtggaattgtgaattcatactaagetttctgtgcagtetcacc 89 206976 atttgcatcactgaggatgaaactgacttttgtettttggagaaaaaaaactgtactgttgttcaagagggctgtgattaaaattttaagcatttgttct s-at gccaaggtagttttcttgcattttgetctccattcagcatgtgtgtgggtgtggatgtttataaacaagactaagtetgacttcataagggetttetaaaac catttctgtccaagagaaaatgactttttgctttgatattaaaaattcaatgagtaaaacaaaagctagtcaaatgtgttagcagcatgcagaacaaaa -149 actttaaactttctctctcactatacagtatattgtcaatgtgaaagtgtggaatggaagaaatgtcgatcctgttgtaactga gcaccctggtgtgactctagtgatctacgtagctcggcttttttggcacatggatcaacaaaatcggcaaggtctcagggaccttgttaacagtgga gtaactattcagattatgagagcatcagagtattatcactgctggaggaattttgtcaactacccacctggggatgaagctcactggccacaataccc 90 207158 acctctgtggatgatgttgtacgcactggagctgcactgcataattctaagtettccaccctgtttaaagatttcaagaagatggcaaaatatttaca at tttttcagacttcatcttcaaaactgccattaccaaacgattcecacacatecttttagetacagggetgatacatcettetgtggettggagatgaat aggatgattccgtgtgtgtactgattcaagaacaagcaatgatgacccactaaagagtgaatgccatttagaattagaaatgttcacaaggtacc caaaactctgtagct gaacatccaagtetttettcttttttaagttgtcaaagaagettccacaaaattagaaaggacaacagtetgagctgtaatttegccttaaactetggae 91 207173 actetatatgtagtgcatttttaaaettgaaatatataatatteagecagettaaacecatacaatgtatgtacaatacaatgtacaattatgtetettgage x at atcaatettgttactgctgattettgtaaatetttttgettctactttcatettaaactaatacgtgccagatataaetgtettgtttcagtgagagacgcccta - tttctatgtcatttaatgtatetatttgtacaattttaaagttettattttagtatacatataaatatcagtattetgacatgtaagaaaatgttacggcatcac acttatatttta aaccgaatgcggtgctacaactgtggtggaagccccagcagttcttgcaaagaggccgtgaccacctgtggcgagggcagaccccagccagg 92 207457 cctggaacagatcaagctacctggaaaccccccagtgaccttgattcaccaacatccagcctgcgtcgcagcccatcattgcaatcaagtggaga s at cagagtcggtgggagacgtgacttatccagcccacagggactgctacctgggagacctgtgcaacagcgccgtggcaagccatgtggcccctg - caggcattttggctgcagcagctaccgccctgacctgtctcttgccaggactgtggagcggatagggggagtaggagtagagaagggaacaag ggagcaagggaacaagggacatctgaacatct agaacagcagetttetagggacagetggaaagggacttaatgtgtttgactatttettacgagggttetacttatttatgtatttatttttgaaagettgtat 93 207850 tttaatattttacatgctgttatttaaagatgtgagtgtgtttcatcaaacatagctcagtcctgattatttaattggaatatgatgggttttaaatgtgtcatta at aactaatatttagtgggagaccataatgtgtcagccaccttgataaatgacagggtggggaactggagggtngggggattgaaatgcaagcaatt agtggatcactgttagggtaagggaatgtatgtacacatctattttttatacttttttttttaaaaaagaatgtcagttgttatttatteaaattatetcacattat gtgttcaacatttttatgctgaagtttcccttagacattttatgtcttgcttgtagggcataatgccttgtttaatgtcatttgcaggttt ccetcaatetagaacgetacacaagaaatattttgtttttactcagcaggtgtgeettaacetecctattcagaaagetecacatcaataaacatgacac tctgaagtgaaagtagccacgagaattgtgctacttatactggaacataatctggaggcaaggttcgactgcagtgaaccttgcctccagattatg 94 208079 aaccagtataagtagcacaattctcgtggctactttcacttcagagtgtcatgtttattgatgtggagctttctgaatagggaggttaaggcacacctg s-at tgagtaaaacaaatatttcttgtgtagcgttcttaggaatctggtgtctgtccggccccggtaggcctgttgggtttctagtcctccttaccatcattcc atatgagagtgtgaaaataggaacacgtgctctacctccatttagggatttgcttgggatacagaagaggccatgtgtctcagagctgttaagggctt atttttttaaaacattggagtcatagcatgtgtgtaa gttttgggtatgtttaatetgttatgtactagtgttetgtttgttattgttttgttaattacaccataatgetaatttaaagagactccaaatetcaatgaagcca 95 208712 gctcacagtgctgtgtgccccggtcatctagcaagctgccgaaccaaaagaatttgcaccccgctgcgggcccacgtggttggggccctgccct at ggcagggtcatcctgtgctcggaggccatctcgggcacaggcccaccccgccccacccctccagaacacggctcacgcttacctcaaccatcct ggctgcggcgtctgtctgaaccacgcgggggccttgagggacgctttgtctgtcgtgatggggcaagggcacaagtcctggatgttgtgtgtatc gagaggccaaaggctggtggcaagtgcacggggcacagcggagtctgtcctgtgacgcgcaagttgagggttgggcggcg gattccctgcatcaactaagaaaagcctgttttctttatttcaaacttggtggcgaatgtgttgcgggtcctgttgggctgctttctgtattgtctcctaac 96 209218 cetetagttttaattggacacttetttgetgttgeaatetatgeegtgtatgetttaagteagaacettggattacaaaacetegagcettetcagta at gtggtgctgtattgtacaaagcgtgttctgtaatatttcctctaatttactcagaaatgaagtatatggttattaagcttaaaggggaaccatttgtgaat gaatatttggaacttaccaagtcctaagagacttttggaagaggatatatatagcatagtaccataccacttata tgcggaaatacctgaaatacagcaaaaatatcctggaccggcaagatcctccctctgtggtggtcaccagccaccaggccccaggagaaaaga agaaactgaagtgcctggcctacgacttctacccagggaaaattgatgtgcactggactcgggccggcgaggtgcaggagcctgagttacggg 97 209309_ gagatgttcttcacaatggaaatggcacttaccagtcctgggtggtggtggcagtgcccccgcaggacacagccccctactcctgccacgtgcag at cacagcagcctggcccagcccctcgtggtgccctgggaggccagctaggaagcaagggttggaggcaatgtgggatctcagacccagtagct gcccttcctgcctgatgtgggagctgaaccacagaaatcacagtcaatggatccacaaggcctgaggagcagtgtggggggacagacaggag gtggatttggagaccgaagactgggatgcctgtcttgagtagacttggacccaaaaaatcatctcaccttgagccca gaagacttactgttggccatagttaattgtgtgaggaacacgccggcctttttagccgaaagactgcatgagccttgaagggtattggaatgatg 98 209369 agtttactetgaacegaataatggtgtecagatcagaaattgacettttggacattegaacagagttcaagaagcattatggetattccctatattcage at aattaaatcggatacttctggagactatgaaatcacactcttaaaaatctgtggtggagatgactgaaccaagaagataatctccaaaggtccacgat gggettttccaacagctccaccttacttcttctcatactatttaagagaacaagcaaatataaacagcaacttgtgttcctaacagg agagattcattgcagctcagcatggctcagaccagctcatacttcatgctgatctcctgcctgatgtttctgttcagagccaaggccaagaggccc agacagagttgccccaggcccggatcagctgcccagaaggcaccaatgcctatcgctcctactgctactactttaatgaagaccgcgagacctg 99 209752_ ggttgatgcagatctctattgccagaacatgaattcgggcaacctggtgtctgtgctcacccaggccgagggtgcctttgtggcctactgattaag at gagagtggcactgatgacttcaatgtctggattggcctccatgaccccaaaaagaaccgccgctggcactggagcagtgggtccctggtctccta caagtcctggggcattggagccccaagcagtgttaatcctggctactgtgtgagcctgacctcaagcacaggattccagaaatggaaggatgtg cttgtgaagacaagttctcct ttttaccttggatgetgacttetaaatgaaetgaagatgtgccettacttggetgatttttt tttecatetcataagaaaaatcagetgaagtgttaccaac 100 209773 tagecacaccatgaattgtcegtaatgtteattaacageatetttaaaactgtgtagetaceteacaaccagtectgtetgtttatagtgetggtagtate s at accttttgccagaaggcctggctggctgtgacttaccatagcagtgacaatggcagtcttggctttaaagtgaggggtgaccctttagtgagcttag cacagcgggattaaacagtcctttaaccagcacagccagttaaaagatgcagcctcactgcttcaacgcagatt . 150 agagagacacagctgcagaggccacctggattgcgcctaatgtgtttgagcatcacttaggagaagtcttctatttatatttatttatttatttatttgttt gttttagaagattetatgttaatatttatgtgtaaaataaggttatgattgaatctacttgcacacteteccattatatttattgtttatttaggtcaaaceca 101 209774 agttagttcaatcctgattcatatttaatttgaagatagaaggtttgcagatattctctagtcatttgttaatatttttgtgatgacatatcacatgtcagcc x at actgtgatagaggctgaggaatccaagaaaatggccagtaagatcaatgtgacggcagggaaatgtatgtgtgtctattttgtaactgtaaagatga atgtcagttgttatttattgaaatgatttcacagtgtgtggtcaacatttetcatgttgaagetttaagaactaaaatgttctaaatateccttggacattttat gtetttettgtaagatactgccttgtttaatgttaattatgcagtgtttccctc tcctctcgtggggtgtttacccctgtggctctgcccagcatccagctgtctacacccagatctgcaaatacatgtcctggatcaataaagtcatacgct ccaactgatccagatgctacgctccagctgatcagatgttatgtctgtgatccagatgcccagaggtccatgtcatctcttcctccccag 102 209792 teggetgaaetetccettgtetgeactgttoaaacetetgecgcetecacacetetaaacatetecceteteaccteattceccacetatecccatt s at ctctgcctgtactgaagctgaaatgcaggaagtggtggcaaaggtttattccagagaagccaggaagccggtcatcacccagcctctgagagca gttactggggtcacccaacctgacttcctctgccactccccgctgtgtgactttgggcaagccaagtgccctctctgaacctcagtttcctcattgca aaatgggaacaatgacgtgcctacctcttagacatgttgtg gaatggtgcatacaaggccatccccgttgcccaggacctgaacgcgccttctgattgggacagccgtgggaaggacagttatgaaacgagtcag ctggatgaccagagtgctgaaacccacagccacaagcagtecagattatataagcggaaagctaatgatgagagcaatgagcattccgatgtgat 103 209875 tgatagtcaggaactttccaaagtcagccgtgaattccacagccatgaatttcacagccatgaagatatgctggttgtagaccccaaaagtaaggaa s-at gaagataaacacctgaaatttcgtatttctcatgaattagatagtgcatcttctgaggtcaattaaaaggagaaaaaatacaatttctcactttgattta gtcaaaagaaaaaatgctttatagcaaaatgaaagagaacatgaaatgcttctttctcagtttattggttgaatgtgtatctatttgagttggaaataac tgatgtgtttgataattagtttagtttgtggcttcatggaa acagctttccaaggtgacaaactcctctatgcagtgtatcgaaagctgggtgtttatgaagttgaagaccagattacagctgtcagaaaattcataga aatgggtttcattgatgaaaaaagaatagccatatggggctggtcctatggaggatacgtttcatcactggcccttgcattggaatggtttttaa 104 209955_ atgtggtatagcagtggctccagtctecagctgggaatattacgcgtctgtctacacagagagattcatgggtctcccaacaaaggatgataatttg s_at agcactataagaattcaactgtgatggcaagagcagaatatttcagaaatgtagactatcttctcatccacggaacagcagatgataatgtgcacttt cagaactcagcacagattgctaaagetctggttaatgcacaagtggatttccaggcaatgtggtactctgaccagaaccacggcttatccggcctgt ccacgaaccacttatacacccacatgacccacttcctaaagcagtg agtcaagtgaccagcctctgactgtgcctgtatctcccaaattctccactcgattccactgctaaactcagctgtgagctgcggataccgcccggca 105 210052 atgggacctgctcttaacctcaaacctaggaccgtcttgctttgtcattgggcatggagagaacccatttctccagacttttacctacccgtgcctgag s at -aaagcatacttgacaaetgtggactcagttttgttgagaattgttttacattactaaggetaataatgagatgtaaetcatgaatgtetegattagac - tccatgtagttacttcctttaaaccatcagccggccttttatatgggtcttcactctgactagaatttagtctctgtgtcagcacagtgtaatctctattgct attgcccc gctttcaccggcaagttcgagatggagagtgagaagaattatgatgagttcatgaagctccttgggatctccagcgatgtaatgaaaaggccg 106 210445 aacttcaagatcgtcacggaggtgcagcaggatgggcaggacttcacttggtcccageactactccgggggccacaccatgaccaacaagttca at ctgttggcaaggaaagcaacatacagacaatggggggcaagacgttcaaggccactgtgcagatggagggegggaagctggtggtgaatttcc ccaactatcaccagacctcagagatcgtgggtgacaagctggtggaggtctccaccateggaggcgtgacctatgagegcgtgagca aaaggagcagtcgcacagacctttcctcatgctgcaggcccggcagtctgaagaccaccctcatcgccggcgtcggcggggcttggagtgtgat 107 210511 ggcaaggtcaacatctgctgtaagaaacagttctttgtcagtttcaaggacatcggctggaatgactggatcattgctccctctggctatatgcaa 7 a5 1 ctactgcgagggtgagtgcccgagccatatagcaggcacgtccgggtcctcactgtccttccactcaacagtcatcaaccactaccgcatgcggg s-at gccatagcccetttgccaacctcaaatcgtgctgtgtgcccaccaagctgagacccatgtccatgttgtactatgatgatggtcaaaacatcatcaaa aaggacattcagaacatgatcgtggaggagtgtgggtgctcatagagttgcccagc cagaccttgtgatattecagttccccctgcagtggtttggagtccctgccattctgaaaggctggtttgagccttcataggagagtttgcttacacttac 108 210519 gctgccatgtatgacaaaggacccttccggagtggcattctgcatttctgtggcttccaagtcttagaacctcaactgacatatagcattgggcacac s at - tccagcagacgcccgaattcaaatcctggaaggatggaagaaacgcctggagaatatttgggatgagacaccactgtattttgctccaagcagcc - tctttgacctaaacttccaggcaggattcttaatgaaaaaagaggtacaggatgaggagaaaaacaagaaatttggcctttctgtgggccatcactt gggcaagtccatcccaactgac gtaacactctggtacagatctccagaagtattgctggggtcagctcgttactcaactccagttgacatttggagtataggcaccatatttgtgaata 109 210559 gcaactaagaaaccacttttccatggggattcagaaattgatcaactcttcaggattttcagagctttgggcactcccaataatgaagtgtggccaga sat agtggaatctttacaggactataagaatacatttcccaaatggaaaccaggaagcctagcatcccatgtcaaaaacttggatgaaaatggcttggat ttgctctcgaaaatgttaatctatgatccagccaaacgaatttctggcaaaatggcactgaatcatccatattttaat 110 210766 ggttccatcaatggtgagcaccagcctgaatgcagaagcgctccagtatctccaagggtaccttcaggcagccagtgtgacactgctttaaactgc s at atttttctnaatgggctaaacccagatggtttcctaggaaatcacaggcttctgagcacagctgcatt tactggaacctatgatctgaagagcgtcctgggtcaactgggcatcactaaggtcttcagcaatggggctgacctctccggggtcacagaggagg 11 211429 cacccctgaagctctccaaggccgtgcataaggctgtgctgaccatcgacgagaaagggactgaagctgctggggccatgtttttagaggcata s_at cccatgtctatcccccccgaggtcaagttcaacaaaccctttgtcttcttaatgattgaacaaaataccaagtctcccctcttcatgggaaaagtggtg aatcccacccaaaaataactgcctctcgctcctcaacccctcccctccatccctggccccctccctggatgacattaaaga 112 211506 gtgtgaaggtgcagttttgccaaggagtgctaaagaattagatgtcagtgcataaagacatactccaaacctttccaccccaaatttatcaaagaac 2 s at tgagagtgattgagagtggaccacactgcgccaacacagaaattattgtaaagctttctgatggaagagagctctgtctggaccccaaggaaaact S gtgcagagggttgtggagaa 113 212063- attgtaaatettgtgtetectgaagacttccettaaaattagetctgagtgaaaaatcaaaagagacaaaagacatettegaatccatatttcaagcet . 151 at ggtagaattggcttttctagcagaacctttccaaaagttttatattgagattcataacaacaccaagaattgattttgtagccaacattcattcaatactgtt atatcagaggagtaggagagaggaaacatttgacttatctggaaaagcaaaatgtacttaagaataagaataacatggtcattcacctttatgttata gatatgtctttgtgtaaatcatttgttttgagttttcaaagaatagcccattgtteattettgtgctgtacaatgaccactgttattgttactttgacttttcaga gcacaccc tccaaggactgagactgacctcctctggtgacactggcctagngcctgacactctcctaagaggttctctccaagcccccaaatagctccaggcg 114 212070 ccctcggccgcccatcatggttaattctgtccaacaaacacacacgggtagattgctggcctgttgtaggtggtagggacacagatgaccgacctg at gtcactcctcctgccaacattcagtctggtatgtgaggcgtgcgtgaagcaagaactcctggagctacagggacagggagccatcattcctgcct gggaatcctggaagacttcctgcaggagtcagcgttcaatcttgaccttgaagatgggaaggatgttctttttacgtaccaattet cgatgcaagtgtttctgttctgggaggtattggagggaaaaaancaagcaggatggctggaacactgtactgaggaatgaatagaaaggttca gatgtetaaaagattetttaaactactgaactgttacctaggttaacaaccetgttgagtatttgetgtttgtccagttcaggaatttttgttttgttttgte tat 115 212190 atgtgeggettttcagaagaaatttaatcagtgtgacagaaaaaaaaatgttttatggtagettactttatgaaaaaaaaattatttgeettttaaatte at ttttcccccatccccetccaaagtettgatagcaagegttattttgggggtagaaacggtgaaatetetagcetetttgtgtttttgttgttgttgttgttgtt gttttatataatgcatgtattcactaaaataaaatttaaaaaatctgtcttgctagacaaggttgetgttgtgcagtgtgcctgtcactactggtetgta ctccttggatttgc 116 212281 tacagccaggcataacatatccactgtgtgcatagagggtctcttcacgttgatgcttggcattccatcagctttctctaagtctttgctcaagttcaac s at ttaaaatgatgttag ggaaaacacctcatttgacettgccagctgacettcaaaccetgcatttgaaccgaccaacattaagtecagagagtaaacttgaatggaataacga cattccagaagttaatcatttgaattetgaacactggagaaaaacegaaaaatggacggggcatgaagagactaatcatetggnaaaccgatttca 117 212344 gtggcgatggcatgacagagctagagctcgggcccagccccaggctgcagcccattcgcaggcacccgaaagaacttccccagtatggtggt at ctggaaaggacatttttgaagatcaactatatcttcctgtgcattccgatggaatttcagttcatcagatgttcaccatggccaccgcagaacaccgaa gtaattccagcatagcggggaagatgttgaccaaggtggagaagaatcacgaaaaggagaagtcacagcacctagaaggcagcgcctccttt cactctcctctgattagatgaaactgttaccttacccta aatatcettgttgtgtattaggtttttaaataccagetaaaggattacctcactgagtcatcagtaccetectattcagctccccaagatgatgtgttttge 118 212353 ttacectaagagaggttttettettatttttagataataagtgettagataaattatgttttetttaagtgtttatggtaaaetettttaaagaaaatttaatat at gttatagetgaatctttggtaactttaaatetttatcatagactctgtacatatgttcaaattagetgettgcetgatgtgtgtatcateggtgggatgaca gaacaaacatatttatgatcatgaataatgtgctttgtaaaaagatttcaagttattaggaagcatactctgttttttaatca gtgtgcacacggagactcategttataatttactatetgccaagagtagaaagaaaggetggggatatttgggttggettggttttgattttttgettgttt gtttgttttgtactaaaacagtattatettttgaatategtagggacataagtatatacatgttatecaatcaagatggetagaatggtgcetttetgagtgt 119 212354 etaaaaettgacacccetggtaaatettteaacacacttccactgeetgegtaatgaagttttgatteataaccactggaattttaatgcegteatt at ttcagttagatgattgcactttgagattaaaatgccatgtetatttgattagtettatttttttatttttacaggettatcagtetcactgttggetgtcattgtg acaaagtcaaataaacccccaaggacgacacacagtatggatcacatattgtttgacattaagcttttgccagaaaatgttgcatgtgttttacctcga ctt caagagctacaatgtcacctccgtcctgtttaggaaaaagaagtgtgactactggatcaggacttttgttccaggttgccagcccggcgagttacg 120 212531_ ctgggcaacattaagagttaccctggattaacgagttacctcgtccgagtggtgageaccaactacaaccagcatgctatggtgttcttaagaaag at - tttctcaaaacagggagtacttcaagatcaccctctacgggagaaccaaggagctgacttcggaactaaaggagaacttcatccgcttctccaaat ctctgggcctccctgaaaaccacatcgtcttccctgtcccaatcgaccagtgtatcgacggctgagtgcacaggtgccgccagntgccgcaccag cccgaacaccattgaggga ccttettgtccacggttttgttgagttttcactetetaatgcaagggtetcacactgtgaaccacttaggatgtgatcactttcaggtggccaggaatgtt 121 212942_ gaatgtctttggctcagttcatttaaaaaagatatctatttgaaagttctcagagttgtacatatgtttcacagtacaggatctgtacataaaagtttcttt s at ctaaaccattcaccaagagccaatatetaggcattttcttggtagcacaaattttettattgettagaaaattgtecteettgttatttetgtttgtaagactta - agtgagttaggtetttaaggaaagcaacgetectetgaaatgcttgtettttttctgttgccgaaatagctggtcetttttegggagttagatgtatagagt gtttgtatgtaaacatttcttgtaggcatcaccatg caatatetgacaccactttggacteaagagacteagtaacgtattatectgtttatttagettggttttagetgtgttetctetggataacc 'cacttgatgtt 122 213880 aggaacattacttetctgettattecatattaatactgtgttaggtattttaagaagcaagttattaaataagaaaagtcaaagtattaattettaccttetatt at atcctatattagcttcaatacatccaaaccaaatggctgttaggtagatttatttttatataagcatgtttattttgatcagatgttttaacttggatttgaaaa aatacatttatgagatgttttataagatgtgtaaatatagaaetgtatttattactatagtaaaggttcagtaacattaaggaccatgataatgataataaa ccttgtacagtggcatattctttgatttatattgtgtttctctgcccatt 213905 cacaaaaccccagggacagcggtctccccagcctgccctgctcangccttgcccccaaacctgtactgtcccggaggaggttgggaggtggag 123 x at gcccagcatcccgcgcagatgacaccggttttcctagaagcccctcacccccactggcccactggtggctaggtctccccttatccttctggtcca Sa gcgcaaggaggggctgcttctgaggtcggtggctgtctttccattaaagaaacaccgtg gaaataacccagacttaatcttgaatgatncgattatgcccaatattaagtananaatataagaaaaggttatcttaaatagatcttaggcaaaatace 124 213975 agetgatgaaggcatetgatgccttcatetgtteagtcatetccaaaaacagtaaaaataaccactttttgttgggcaatatgaaatttttaaaggagta 14 s at gaataccaaatgatagaaacagactgcetgaattgagaattttgatttnttaaagtgtgtttetttetaaattgetgttcettaatttgattaatttaatteatg - tattatgattaaatetgaggcagatgagcttacaagtattgaaataattactaattaatcacaaatgtgaagttatgcatgatgtaaaaaatacaaacatt ctaattaaaggctt 125 214022 tcaacaccctcttcttgaactggtgctgtctgggcttcatagcattcgcctactccgtgaagtctagggacaggaagatggttggcgacgtgaccgg s at ggcccaggcctatgcctccaccgccaagtgcctgaacattgggcctgattctgggcatcctcatgaccattggattcatcctgttactggtattcg - 152 gctctgtgacagtctaccatattatgttacagataatacaggaaaaacggggttactagtagccgcccatagcctgcaacctttgcactccactgtgc aatgctggccctgcacngctngggctgttgcccctgcccccttggtectgcccctagatacagcagtttatacccacacacctgtctacagtgtcatt caata 126 214235 ggtgaggggatgacccctggagatgaagggaagaggtgaagccttagcaaaaatgcctcctcaccactccccaggagaatttttataaaaagca at -taatcactgattccttcactgacataatgtaggaagcctctgaggagaaaaacaaagggagaaacatagagaacggttgctactggcagaagcat aagatctttgtacaatattgctggccctggttcacctgtttactgttatcacaata gtgattcaaacttctgtgtactgggtgatgcacccattgtgattgtggaagatagaattcaatttgaactcaggtgtttatgaggggaaaaaaacagtt 127 214651 gcatagagtatagetetgtagtggaatatgtettetgtataaetaggetgttaacetatgattgtaaagtagetgtaagaattteccagtgaaataaaaa s at - aaaattttaagtgtteteggggatgcatagattcatcattttetecacettaaaaatgegggcatttaagtetgtecattatetatatagtectgtettgteta - ttgtatatataatctatatgattaaagaaaatatgcataatcagacaagegtaatgcataatcagacaagettgaatattgtttttgeaccagacgaaca gtgaggaaattcggagctatacatatgtgcag agtcagtgttgtettaatatcettgataatgetgtaaagtttatttttacaaatatttetgtttaagetatttcacetttgtttggaaatecttccettttaaagag aaaatgtgacacttgtgaaaaggttgtaggaaagcttccctttttttntttaaacctttaaatgacaaacetaggtaataggttgtaaatttctat 128 214974 ttttgetttgtttaatgaacatttgtetttcagaataggattetgtgataatatttaaatggcaaaaacaaacatattttgtgcaattaacaaagctact x at gcaagaaaaataaaacatttttggtaaaaacgtatgtatttatatattatatatttatatatctaaaagaatttagcattgetaggctttttagatgcc tattgtgtatettttaaaggttttgaccattttgttatgagtaattacatatatattacattcactatattaaaattgtacttttttactatgtgnetcattggttcat agtetttattttgtectttgaa tatgtegetgtecaagagaaggetgtggaagaacetatacaactgtgtttaatctccaaagccatatectetecttccatgaggaaagcegcccttttg 129 215091_ tgtgtgaacatgetggetgtggcaaaacatttgcaatgaaacaaagtetcactaggeatgetgttgtacatgatectgacaagaagaaaatgaaget s-at caaagtcaaaaaatetegtgaaaaacggagntnggcctetcatetcagtggatatatcceteccaaaaggaaacaagggcaaggettatenttgtg teaaaacggagagtcacccaaetgtgtggaagacaagatgetetenacagttgcagtacttacccttggetaagaaet 130 217430 agggcctaagggtgacagaggtgatgctggtcccaaaggtgctgatggctctcctggcaaagatggcgtccgtggtctgaccggccccattggt x at cctcctggccctgctggtgcccctggtgacaagggggaccccattcccgaggagctttatgag gagftattattatctcatagcgtatgttttcttgnacagccttgaataatttgattgacctctgggatgttattaaagctaacgttccttctatttcacaaa 131 217523 ttttegttatgacttcagaaggateattaaetetggtatetgtttgtttgettgtatggcaccaataagcagatttettetttetaatetatggattagtataga at - ccaggagaaggctaatacagagactatgaaacgggaataagtttttttaacgatatggcaaaattgtgactctgaaagatcattcatgtatatttaaa attaccacagtcataaaaagtcttggactttcatgaggaaatagcatagctagatatgaaaaaatatagaaaatcttcatcaatggaactattttggg gtagacactaatcatatgaaaagacaaatgctcattccctaagatagcctga gcttetacgtcatcttcgacagagcccagaagagggtgggcttcgcagcgagcccctgtgcagaaattgcaggtgctgcagtgttgaaatttcg 132 217867 ggcctttctcaacagaggatgtagccagcaactgtgtccccgctcagtctttgagcgagcccattttgtggattgtgtctatgcgctcatgaggtct x at gtggagccatcctccttgtcttaatcgtcctgctgctgctgccgttccggtgtcagcgtcgcccccgtgaccctgaggtgtcaatgatgagtctct - ctggtcagacatcgctggaaatgaatagccaggcctgacctcaagcaaccatgaactcagctattaagaaaatcacatttccagggcagcagccg ggatcgatggtggcgctttctcctgtgcccacccgtcttcaatctctgttctgctcccagatgccttctagattcactgttt gaagtgggacgagcacatttctattgtcttcacttggatcaaaagcaaaacagtctctccgccccgcaccagatcaagtagtttggacatcacccta 133 217996 ctgaaaaettgegattettettagttttctgcatacttttcatcacgatgcaggaaacgatttegagtcaagaagacttttatttatgaacctttgaaaggat at cgtettgtatggtgaattttetaggagegatgatgtactgtaattttattttaatgtattttgatttatgattatttattagttttttttaaatgettgttetaagacat ttctgaatgtagaccattttccaaaaaggaaactttattttcaaaaacctaatccgtagtaattcctaatcttggagaataaaaaagggggtggaggg gaaaacattaagaatttattcattatttctcgagtactttcagaaagtctgacactttcattgttgtgccagctggtt ggcacagagcgcggagatgtaccactaccagcaccaacggcaacagatgctgtgcctggagcggcataaagagccacccaaggagctggac acggcctcctcggatgaggagaatgaggacggagacttcacggtgtacgagtgcccgggcctggccccgaccggggaaatggaggtggca 134 218086 accctctgttcgaccacgccgcactgtccgcgcccctgccggcccccagctcaccgcctgcactgccatgacctggaggcagacagacgccca at cctgctccccgacctcgaggcccccggggaggggcagggcctggagcttcccactaaaaacatgttttgatgctgtgtgcttttggtgggct gggctccaggccctgggaccccttgccagggagacccccgaacctttgtgccaggacacctcctggtcccctgcacctctcctgttggtttaga cccccaaactggagggggcatggagaaccgtagagcgcaggaacgggtgggtaatt gccacaccttcgcgaaacctgtggtggcccaccagtcctaacgggacaggacagagagacagagcagccctgcactgttttccctccaccaca 135 218211 gccatcctgtccctcattggctctgtgctttccactatacacagtcaccgtcccaatgagaaacaagaaggagcaccctccacatggactcccacct s at - gcaagtggacagcgacattcagtcctgcactgctcacctgggtttactgatgactcctggctgccccaccatcctctctgatctgtgagaaacagct - aagctgctgtgacttccctttaggacaatgttgtgtaaatctttgaaggacacaccgaagacctttatactgtgatcttttacccctttcactcttggtttc ttatgttgc tggtgtatgctgtgctttcctcagcagtatggctctgacatctcttagatgtcccaacttcagctgttgggagatggtgatattttcaaccctacttctaa 136 218507 acatetgtetggggtteetttagtettgaatgtettatgetcaattatttggtgttgagectetettecacaagagetectccatgtttggatagcagttgaa at gaggttgtgtgggtgggetgttgggagtgaggatggagtgttcagtgcccatttetcattttacattttaaagtegtteetccaacatagtgtgtattggt ctgaagggggtggtgggatgccaaagcctgctcaagttatggacattgtggccaccatgtggct gaatttctgctggactttatctgggcagaggaaggatggaatgaaggtagaaaaggcagaattacagctgagcggggacaacaaagagttcttct 137 218704_ ctgggaaaagttttgtcttagagcaaggatggaaaatggggacaacaaaggaaaagcaaagtgtgacccttgggtttggacagccagaggccc at agctccccagtataagccatacaggccagggacccacaggagagtggattagagcacaagtctggcctcactgagtggacaagagctgatggg cctcatcagggtgacattcaccccagggcagcctgaccactctt gcccctcaggcattatcccatttggaatgtgaatgtggtggcaaagtgggc - 153 agaggaccccacctgggaacctttttccctcagttagtggggagactagcacctaggtacccacatgggtatttatatctgaaccagacagacgtt gaatcaggcactat gagacagacttggcaagggaccccctggttctgagccagtagctgccatctggaaattcctcttttagcctctccttagaggtgaatgtgaatgaag cctcccaggcacccgctgaatttctgaggccttgcttaaagctcagaagtggtttaggcatttggaaaatctggttcacatcataaagaattgatttg 138 218796 aaatgttttetatagaaacaagtgetaagtgtacegtattatacttgatgttggtcatttetcagtectattteteagttetattattttagaacetagteagtt at ctttaagattataaetggtcetacattaaaataatgettetegatgtcagattttacctgtttgetgetgagaacatetetgcctaatttaccaaagccaga cettcagtteaacatgetteettagettttcatagttgtetgacatttccatgaaaacaaaggaaccaaetttgttttaaccaaaetttgtttggttacagtttt caggggagcgtttcttecatgaca catgtacgactcggacagcgacggccgcatcactctggaagaatatcgaaatgtaaagtggtcgaggagctgctgtgggaaaccctcacatct 139 218872 agaaggagtccgetcgctccatcgccgacggggccatgatggaggcggccagcgtgtgcatggggcagatggagcctgatcaggtgtacgag at - gggatcaccttcgaggacttcctgaagatctggcaggggatcgacattgagaccaagatgcacgtccgcttccttaacatggaaaccatggccct ctgccactgacccaccgccacctccgcggagagactgcactttgcaatggggccgcctccccgcgtagctggagcagcccaggcccggcgga cagcctcttcctgcagcgccggtacatagccaaggctcgtctgcgcaccttgtgtcttgtagggtatggtatgtgggacttcgct gaggaggaactgacgcagctacgccacgaactggagcggcagaacaatgaataccaagtgctgctgggcatcaaaacccacctggagaagg aaatcaccacgtaccgacggctcctggagggagagagtgaagggacacgggaagaatcaaagtcgagcatgaaagtgtctgcaactcaaag 140 218963 atcaaggccataacccaggagaccatcaacggaagattagttctttgtcaagtgaatgaaatccaaaagcacgcatgagaccaatgaaagtttccg s at cctgttgtaaagtctattttcccccaaggaaagtccttgcacagacaccagtgagtgagttctaaaagatacccttggaattatcagactagaaatt ttatttttttttttetgtaacagtetcaccagacttctcataatgetettaatatattgeacttttetaatcaaagtgegagtttatgagggtaaagetctacttt cctactg aatcctgcaattctcaatcttgcactgcagcctcgacctcccaggctccagtgactetcccacctcagcctcctaagtagctgggagtacaggcgc gcaccaccacgcetagctgatttttgtatttttttgtagagacgggggtttggccatgttgccgaggctaactectgggattacaggcatgagctgtgc 141 218984_ tggcegggtttttttcttgatgtaaacgtgtacagetgttttattagttaaggtetaatttttactctaggtgeettttatgtteagaaetetttecactggae at tggtatttgetcaaaaataaataatggtagagaagaaaactataaaaatggacaaggetttettctatcagtagegtttaccetttgtcaccagtggettt t ggtatttccatgtetggcattgcataaaettetetggtgtgaaaggataaatatgcctttetaaagttgtatatcaaaattgtatcaatttttattttctatgatt tctagaaacaaatgtaataaatatttttaaaatctcctttctactggttatgta ctcgttgcaatcgcctttgcagtcaaccacttctggtgccaggaggagccggagcctgcacacatgatcctgaccgtcggaaacaaggcagatg 142 219630 gagtcctggtgggaacagatggaaggtactcttcgatggcggccagtttcaggtccagtgagcatgagaatgcctatgagaatgtgccgagga at - ggaaggcaaggtccgcagcaccccgatgtaaccttctctgtggctccaaccccaagactcccaggcacatgggatggatgtccagtgctaccac ccaagccccctccttctttgtgtggaatctgcaatagtgggctgactccctccagccccatgccggccctacccgcccttgaagtatagccagcca aggttggagctcagaccgtgtctaggttggggctcg gagagaaaacaatatagccccctaccctttcccaatcctttgccctcaaatcagtgacccaagggagggggggatttaaagggaaggagtggg caaaacacataaaatgaatttattatatctaagetetgtagcaggatteatgtegttetttgacagttctttetetttectgtatatgcaataacaaggtttta 143 219682 aaaaaaaaaaaaaaaagtgagactattagacaaagtatttatgtaattatttgataactcttgtaaataggtggaatatgaatgcttggaaaattaaactt s at taatttattgacattgtacatagctctgtgtaaatagaattgcaactgtcaggttttgtgttcttgttttectttagttgggtttatttccaggtcacagaattgc tgttaacactagaaaacacacttectgeaccaacaccaataccetttcaaaagagttgtetgcaacatttttgtttttttttaatgtecaaaagtgggg gaaagtgctatttcctattttcaccaaaattggggaaggagtgccactttccagc ttagcactgaaagtctcttgccccaggaaaccccatcagtcccaggcagattgggacagctggtcaccttacgcaagagccaggctgaaacatcc 144 219727 cctccatactcagetetttaaettttcttettttcatogggetetttectaaaaagetgagetgtaaaatattacategaggtataataaataatcatg at -tacatgttttaccaccacccaggtcaagacatagaatgtttcaacatttccatcaccccagaaaeteccecttgtacccccttccacttegteteccetag etectagaagcaaccactgatgtgatttetaccaaatocagttttggtectactaaatatactettttgagactggcetettttactcaccataatgcetttg taattc tagetgtttcagagagagtacggtatatttatggtaattttatccactagcaaatettgatttagtttgatagtgtgtggaatttattttgaaggataagac 145 219787 catgggaaaattgtggtaaagactgtttgtaccettcatgaaataattetgaagttgecatcagttttactaatettetgtgaaatgeatagatatgegeat s at - gttcaaetttattgtggtettataattaaatgtaaaattgaaaattcatttgetgtttcaaagtgtgatatetttcacaatagcetttttatagtcagtaattca -gaataatcaagttcatatggataaatgcatttttatttctatttetttagggagtgetacaaatgtttgtcacttaaatttcaagtttctgttttaatagttaaet gactatagattgttttctatgccatgtatgtgccacttctgagagtagtaaatgactctttgctacatttta gtgtaccagaattcggccatgagccgctacatactcatcatggggctcctgtacaaggtgctgggcgtcctcttctttgccatagcctgcttttatac aagcccctgtcggagtcttcagatggcctggaaacttgtctgcccagccagtcctcagcccctgacagtgccacagatagccagctccagagca 146 219911 gcgtctgaccaccgcccgcgccacccggccacggcgggeactcagcatttcctgatgacagaacagtgccgttgggtgatgcaatcacacgg s_at gaacttctatttgacctgcaaccttctacttaacctgtggtttaaagtcggctgtgacctcctgtecccagagctgtacggccctgcagtgggtggga ggaaettgcataaatatatatttatggacacacagtttgcatcagaacgtgtttatagaatgtgtttatacccgategtgtgtggtgtgegtgaggaca aactccgcaggggctgtgaatcccactgggagggcggtgggcctgcagcccgaggaaggcttgtgtgtctcagttaa gaagttgcaacattcgtttgataggaattccagaaaaggagagttatgagaatagggcagaggacataattaaagaaataattgatgaaaatttg 147 219955 agaactaaagaaaggttcaagtcttgagattgtcagtgcttgtcgagtacctagtaaaattgatgaaaagagactgactcctagacacatcttggtga at -aattttggaattctagtgataaagagaaaataataagggcttctagagagagaagagaaattacctaccaaggaacaagaatcaggttgacagcag acttatcactggacacactggatgctagaagtaaatggagcaatgtcttcaaagttctgctggaaaaaggctttaatcctagaatctatatccagcc Iaaaatggcatttgatttaggggcaaaacaaaggtatttttagtattgaagaatttagagattatgttttgcatatgcccacettgagagaattactggg -154 gaataatataccttagcacgccagggtgactaca 148 219956 ttgcttgttccccggaggttgaagctacagtgagccttgattgtgtcactgcactccagcctgggcaacaggtaagacttgttcaaaaaaaaaac at aaaaaagaagaagaaaagtacttctacagccatgtcctattccttgatcatccaaagcacctgcagagtccagtgaaatgatatattctggctgggc gacggcgtcaaggtcgtgggacgtgacacgaccgctgcggcgtcagctcagccttgcaagaccccaggcgcccggctgcactggactgt cgccgccgccgtcgcagtcggaccaactgctggcagaatcttcgtccgcacggccccagctggagttgcacttgcggccgcaagcgccagg 149 221577 gggcgccgcagagcgcgtgcgcgcaacggggaccactgtccgctcgggcccgggcgttgctgccgttgcacacggtccgcgcgtcgctgg x_at aagacctgggctgggccgattgggtgctgtcgccacgggaggtgcaagtgaccatgtgcatcggcgcgtgcccgagccagttcgggggca aacatgcacgcgcagatcaagacgagcctgcaccgcctgaagcccgacacggtgccagcgccctgctgcgtgcccgccagctacaatccat ggtgctcattcaaaagaccgacaccggggtgtcgctccagacctatgatgacttgttagccaaagactgccactgca tggaattagaccatttggectttgaactttcataggaaaaatgacccaacatttcttagcatgagetacetcatetetagaagctgggatggacttacta 150 221729 tttgtttatattttagatactgaaaggtgetatgettetgttattatteeaagactggagataggcagggetaaaaaggtattattattttteetttaatgat at ggtgctaaaattcttcctataaaattccttaaaaataaagatggtttaatcactaccattgtgaaaacataactgttagacttcccgtttctgaaagaaag agcatcgttccaatgcttgttcactgttcctctgtcatactgtatctggaatgctttgtaatacttgcatgcttcttagaccagaacatgtaggtccccttgt gtctcaatactttttttttcttaattgcatttgttggctctattttaattt tagattccggtatatcgttcttcaagacacttgctctaagcggaatggaaatgtgggcaagactgtctttgaatatagaacacagaatgtggcacgct 151 221730 tgcccatcatagatcttgctcctgtggatgttggcggcacagaccaggaattcggcgttgaaattgggccagtttgttttgtgtaaagtaagccaaga at cacatcgacaatgagcaccaccatcaatgaccaccgccattcacaagaactttgactgtttgaagttgatcctgagactcttgaagtaatggctgat ctgcatcagcattgtatatatggtettaagtgectggecteettatcettcagaatatttattttacttacaatectcaagttttaattgattttaaatatttttca atacaacagtttaggtttaagatgaccaatgacaatgaccacctt tttcagcaccgatggccatgtaaataagatgatttaatgttgattttaatcctgtatataaantaaaaagtncncaatgagtttngggcatatttaatgat gattatggagccttagaggtetttaatcattggttenggetgettttatgtagttaggetggaaatggttteacttgetetttgactgtcagcaagactga 152 221731 agatggcttttcctggacagctagaaaacacaaaatcttgtaggtcattgcacctatctcagccataggtgcagtttgctttacatgatgtaaagg x-at tgegaatgggatectgatggaactaaggactccaatgtcgaaetettetttgetgcattccttttcttcacttacaagaaaggcetgaatggaggactt ttetgtaaccaggaacatttggggtcaaagtgetaataattaaetcaaccaggtctactttttaatggetttcataacactaaetcataaggttaccg atcaatgcatttcatacggatatagacctagggctctggagggtgggg 153 221922 gtaaatagttaacetagtagtetattaaggcattaatacttetetggacatgegegtttgagggtggaggggtectgtaaggtgetteategtetgtg at attactgcttgggatgtgttctttggcagcttgtgagattactttacctagtgtttataaagtaggaagttaagtgaatcatagattagaatttaatactetta tggaaataattttttaacatcttaattgacaatggcgtttttttataca 154 221923 tagtecatactgagtgtcatcaacaatecagactgaagtettetattttaatetcaatccettttetgatttgecacecatgeetetteaggetggaaaca s at atetettggttccctaaagcactttcttetgactgctgtgattcagtgaaccttgccetttgetttctattacttgtgcatttgcetcacctgacaatgttttaa - atcgcctttgtatctccttagctgctcaataa aatatgtcagtgcttgcttgatggaaacttctcttgtgtctgttgagactttaagggagaaatgtcggaatttcagagtcgctgacggcagagggtg 155 222449 agcccccgtggagtctgcagagaggccttggccaggagcggcgggctttcccgaggggccactgtcctgcagagtggatgtttgcctagt at gacaggttatcaccacgttatatattccctaccgaaggagacaccttttcccccctgacccagaacagcctttaaatcacaagcaaaataggaaagt taaccacggaggcaccgagttccag ggetgggggagagcegggttcattccctgtectcattggtcgtccctatgaattgtacgttcagagaaatttttttectatgtgcaacacgaagette 156 222450 cagaaccataaaatatcccgtcgataaggaaagaaaatgtcgttgttgttgtttttctggaaactgcttgaaatcttgctgtactatagagctcagaag at - gacacagcccgtcctcccctgcctgcctgattccatggctgttgtgctgattccaatgctttcacgttggttcctgggtgggaactgctctcttigca gccccatttcccaagctctgttcaagttaaacttatgtaagctttccgtggcatgcggggcgcgcacccacgtcccgctgcgtaagacttgtattt ggatgccaatccacaggcctgaagaaactgcttgttgtg gtgagtatggcccaatgctttctgtggctaaacagatgtaatgggaagaaataaaagcctacgtgttggtaaatcaacagcaagggagatttttga 157 222549 atcataataactcataaggtgctatctgttcagtgatgccctcagagctcttgctgttagctggcagctgacgctgctaggatagttagtttggaaatg at - gtacttcataataaactacacaaggaaagtcagccaccgtgtcttatgaggaattggacctaataaattttagtgtgccttccaaacctgagaatatat gattttggaagttaaaatttaaatggettttgccacatacatagatetteatgatgtgtgagtgtaattecatgtggatatcagttaccaaacattacaaaa aaattttatggcccaaaatgaccaacgaaattgttacaatagaatttatccaattttgatctttttatattcttctaccacacctggaaacagacc catggtttacatttactcagctactatatatgcagtgtggtgcacattttcacagaattctggcttcattaagatcattatttttgnctgcgtagttacaga 158 222608 ettagcatattagttttttctactcetacaagtgtaaattgaaaaatetttatattaaaaaagtaaactgttatgaagetgetatgtactaataatactttgett s at - gccaaagtgtttgggttttgttgttgtttgtttgtttgtttgtttttggttcatgaacaacagtgtctagaaacccattttgaaagtggaaaattattaagtcac - ctatcacetttaaacgcetttttaaaattataaaatattgtaaagcagggtetcaaettttaaatacactttgaacttettctetgaattattaaagttettat gacctcatttataaacactaaattctgtcacctcctg gecgetgtgetttegtggaaatgacagtteettgttttgtttetgttttgttttacattagtcattggaccacagccattcaggaactacccctgcce 159 222696 cacaaagaaatgaacagttgtagggagacccagcagcacettteetecacacacetteattttgangttegggtttttgtgttaagttaatetgtacatt at - ctgtttgccattgttacttgtactatacatctgtatatagtgtacggcaaaagagtattaatccactatctctagtgcttgactttaaatcagtacagtacct gtacctgcacggtcacccgctccgtgtgtcgccetatattgagggetcaagetttccettgttttttgaaaggggtttatgtataaatatattttatgccttt ttattacaagtcttgt 160 223062 ggagtggtttaagagtgccaggcgaagggcaaactgtagatcgatctttatgctgttattacaggagaagtgacatactttatatatgtttatattagca s at aggtetgtttttaataccatatactttatatttctatacatttatatttetaataatacagttatcactgatatatgtagacacttagaatttattaaatcetga . 155 cettgtgcattatagcattccattagcaagagttgtacccctecccagtettegcettcctetttttaagetgttttatgaaaaagacetagaagttettg attcattttaccattetttccataggtagaagagaaagttgattggttggttgtttttcaattatgccattaaactaaacatttetgttaaattaccctatectt tgttctctactgttttctttgtaatgtatgactacgagagtgatactttgtgaaaagttttcccctattgtttatctattgtca ttcctgtgcaagtaccgaccatagagcaagaatcaagattctgctaactcctgcacagccccgtcctcttcctttctgctagcctggctaaattgctc 161 223447 attatttcagaggggaaacctagcaaactaagagtgataagggccctactacactggcttttttaggcttagagacagaaactttagcattggcca at gtagtggettetagetctaaatgttgccccgccatccetttccacagtatecttcttccctcctcccetgtetetggetgtetcgagcagtetagaagag tgcatctccagcctatgaaacagctgggtctttggccataagaagtaaagatttgaagacagaaggaagaaactcaggagtaagcttctagacccc ttcagcttctacacccttctgccctctctccattgcctgcaccccaccccagccactcaactcctgcttgtttttcctttggccatagg ggagctcagagatctaagctgctttccatcttttctcccagccccaggacactgactctgtacaggatggggccgtctttgcctctttcatcta atcccccttctccagctgatcaacccggggagtactcagtgttccttagactccgttatggataagaagatcaaggatgtttcaacagttagagta 162 223970 cagtccctctcctataagcaagaagctctcgtgtgctagtgtcaaaagccaaggcagaccgtcctcctgccctgctgggatggctgtcactggtg at tgcttgtggctatggctgtggttcgtgggatgttcagctggaaaccacctgccactgccagtgcagtgtggtggactggaccactgcccgctgtg ccacctgacctgacagggaggaggctgagaactcagttttgtgaccatgacagtaatgaaaccagggtcccaaccaagaaatctaactcaaacgt cccact gcaaagtttattcagtteacatgtaaggtattgcaaataaattettggacaattttgtatggaaaettgatattaaaaactagtetgtggttettgcagtt 163 224428 ettgtaaatttataaaccaggcacaaggttcaagtttagattaagcacttttataacaatgataagtgcetttttggagatgtaaettttagcagtttgtta sat acctgacatctctgccagtctagtttctgggcaggtttcctgtgtcagtattccccctcctctttgcattaatcaaggtatttggtagaggtggaattaa gtgtttgtatgtc aatttacttgcatatgtaaaccattgctgtgccattcaa 164 224646 x at agacggcettgagtctcagtacgagtgtgcgtgagtgtgagccaccttggcaagtgcctg ggagtgtggtacttctcctagttgcagtcaggcttcatacgctnttgtcctgcccgttagagcagccagcgggtacagaatggattttggaagaggg agtcaccactggacctccaaggaagccacgtgcagacatctacaaccttcgatctcctgacgagtttattgtggccaaaaccaggctttgattgaa 165 224915 ccaggatgaatgcgggtgttggaagtagaatatatatatacatataaaattggttgggagccacgtgtaccagtgtgtgttgatcttggcttgattcag x at tctgccttgtaacagaaactggcgatggaatatgagaggagccctctggaaagaaaaggacagaccctgtgctttcatgaaagtgaagattgg tgaaccagttccacaaggttactgtatacatagcctgagtttaaaaggctgtgcccacttcaagaatgtcattgttagactttgaaatttctaactgccta cctgca agggetgtaacagttgetgetagtattagggttccacateattetaatgtatagttcaagtettaatagacaatetgaatcactacattttttggetc caacattecttttagettgaccagtetaatttaaaatgtgtttgttggaggtcattaacgttacttgtacaatgctgtcactgtgtgacatccatatgaat 166 225295 ggtatatatcaatcaatcaatcaatcannnneattgeatteaatcaatcagetgtgattgattnnatgettagaaatactatagtaaetagatgeagt at gtgaattttttecattaacaaacaaacaagtcagtggettaaatgtgattatggtcetgcaaggtgattettgetaaaatatetaaactttgtttgttttaa ctgaatcattttttaaettaaaaagetggaaaatatcaaatgetgttttttttnncattgtcaacagtggtgtgtcattatgtatgttectaatgettatgg aactcctcca ggaacgatgagcaccatgccaggactgcccacccggccctgcttttatgacatagatcttgataccgaaacagaacaagttaaaggcttgtttaa 167 225520 gtggacaaggctctcacaggacccgatgcagactcctgaaacagactactctttgcctttttgctgcagttggagaagaaactgaatttgaaaaatg at tctgttatgcaatgctggagacatggtgaaataggccaaagatttcttcttcgttcaagatgaattctgttcacagtggagtatggtgttcggcaaaag gacctccaccaagactgaaagaaactaatttatttetgtttetgtggagtttecattatttctactgettacactttagaatgtttattttatggggactaagg gattangagtgtgaactaaaaggtaacattttccactctcaagttttctactttgtctttgaactgaa gaaacacaaccaagacgcgaggatcnnnnnctntnnnnnnnnnnnnnctngcaagatggcgccgcagaaagacaggaagcccaagagg 168 225541 teaacctggaggttaatttggacettacteatecagtagaagatggaattttgattetggaaattttgageaatttetaegggagaaggttaaagtca at atggcaaaactggaaatctcgggaatgttgttcacattgaacgcttcaagaataaaatcacagttgtttctgagaaacagttctctaaaaggtatttga aataccttaccaagaaataccttaagaagaacaatcttcgtgattggcttcgagtggttgcatctgacaaggagacetacgaacttcgttacttcag attagtcaagatgaagatgaatcagagtcggaggactaggcaaaggctccccttacagggctttgcttatt ggaacccagagetgetgtgtatttgagegggcagttatettttgetatacttattttcaatteaattacaccacgattcaaataatteccetectaaaac 169 225664 caaaaaggagggaaacgtcaactccattgcaattacttatcttectcttctatctctgttatacgccggggcatagaatgctcgtatacatctctttaaca at accacaaaccttaagccatgtagatgaagttagtgcatcaacgggatacagttccatattgccttaaacctccttgttttagacacactaacatttatac caaattgcagattattctgcagagagggaattgcatgtttgtgttgta aattaatattcatcgcacttcttctgtggaaggacMgtgaaggaattggtgctggattagtggatgttgctatctgggttggcacttgttcagattacc 170 225681 caaaaggagatgettctactggatggaatteagtttetegeateattattgaagaactaccaaaataaatgetaatttteatttgetacetetttttatta at .- tgcettggaatggttcacttaaatgacattttaaataagtttatgtatacatctgaatgaaaagcaaagetaaatatgttacagaccaaagtgtgatttca cactgttttaaatetagcattattcattttgettcaatcaaaagtggtttcaatatttttttagttggttagaatactttttcatagtcacattetetcaaceta taatttggaatattgttgtggtcttttgtttttctcttagtatagcatttttaaaaaaatataaaagctaccaatctttgtacaatttg ccgtgtgagcgatcgcggtgggttcgggccggtgtgacgcgtgcgccggccggccgccgaggggctgccgttctgcctccgaccggtcgtgt 171 225767 gtgggttgacttcggaggcgctctgcctcggaaggaaggaggtgggtggacgggggggcctggtggggttgcgcgcacgcgcgcacggcc at -gggcccccngccctgaacgcgaacgctcgaggtggccgcgcgcaggtgtttcctcgtaccgcagggccccctcccttccccaggcgtcctcg gcgcctctgcgggcccgaggaggagcggctggcgggtggggggagtgtgacccaccctcggtgagaaaagccttcttaggattgagag I _ I gcgtgccttgggggtac 172 225799- aaatgactggatggtcgctgctttttaagtttcaaattgacattccagacaagcggtgcctgagcccgtgcctgtttagatttcacagcacagtt - 156 at ctgggaaggtggagccaccagcctctccntgaataactgggagatgaaacaggaagctctatgacacacttgatcgaatatgacagacacngaa aatcacgactcanccccctccagcacctctacctgttgcccgccgatcacagccggaatgcagctgaaagattccctggggcctggttccaaccg cccactgtgga tctgaggcctctgcatttgcgggtggtctgcctgtgatattttggtcatgggctggttg tcttctcaggtcacttgtacactggtttcctagtagaagctcacttgccaccttcaggggggtccggattgatcatcacaatcccaaaactnga 173 225806 gttggggggaactggagggagcaaaacactgatttgatactagtcagtttgcttgaaactagttcacctaaagctagatctcttaaaaccaatttactg 13 at - aaaaettgtttgettaaagttaatgacttaatgaetaatttgecaaaagetcaatteetattttggtgtgtttatatecatttaggtgtectattettttttgtcat gctttggatatttcaaggatttatatctattcatccaagagtacttctgagntattatcagcaacataaatttatcaaatttgcagcactttgtaaatgatga gattgcttcctacctttatggatgtcttt aatgcattacttteacttaacactagacaccaggtegaaaattttcaaggttatagtacttatttcaacaattcttagagatgctagctagtgttgaagcta aaaatagetttatttatgetgaattgtgatttttttatgccaaantttttttagttetaatcattgatgatagettggaaataaataattatgccatggcattga 174 225835 cagttcattattcctataagaattaaattgagtttagagagaatggtggtgttgagctgattattaacagttactgaaatcaaatatttatttgttacattatt at ccatttgtattttaggtttccttttacattetttttatatgcattetgacattacatatttttaagactatggaaataatttaaagatttaagetetggtggatgat tatetgetaagtaagtetgaaaatgtaatattttgataatactgtaatatacctgtcacacaaatgettttctaatgttttaacettgagtattgeagttgetg ctttgtacagaggtt gtacttctcctagttgcagtcaggcttcatacgctattgtcctgcccgttagagcagccagcgggtacagaatggattttggaagagggagtcacca 175 226227 ctggacctccaaggaagccacgtgcagacatctacaaccttcgatctcctgacgagtttattgttggccaaaaccaggctttgattgaaccaggatg 7 x at aatggggtgttggaagtagaatatatatatacatataaaattggttgggagecaegtgtaccagtgtgtgttgatettggettgatteagtetgecttgt - aacagaaactggcgatggaatatgagaggagccctctggaaagaaaaggacagaccctgtgctttcatgaaagtgaagatctggctgaaccagt tccacaaggttactgtatacatagcctgagtttaaaaggctgtgcccacttcaagaatgtcattgttagactttgaaatttctaactgcctacctgca 176 226237 agageaacetatgaaatactgtgcattetacaatggtgaatete acetaaatgatac catttaatttaaaaaagttaaata at tgtgetataatecetatttagttcaaaattaaccagaattettecatgtgaaatggaccaaaetcatattattgttatgtaaatacagagttttaatgcagta tgacatcccacaggggaaaagaatgtctgtagtgggtgactgttatcaaatattttatagaatacaatgaacggtgaacagactggtaattgtttga gttaccatgacagatttgagacttg aaacgcgtgaga at gaccacttggagaaccggttagtacaagtectgatettgcaagccagcttcttetgeatetgagggget 177 226311 cetggegcccagaggaggeagacagatgtettetagetgagtttetaacegcatgatgagactcagacetteegetgeactagaaaatetgcaaca at gtgtecctgagtcacttctccttagtgggcagactegtgttagatttgtggaacccagetetetgatttacteettttggaaaacecatggaatttcatgt ataaggetttcatttgtattttaaggttatatttgttttgagtatatacatggtgetcaatagcaacatettagcagatgaagcagtttatgattccactcc etcetgtatgacaggtagccactatactgaatcaaggtgctgaaetcaaatcacaaaattctggettaccgatacaacaaccaatac gteccactgctcacatacttatgtgetgetagtetetactcgaagttegtgcaggactaatgettttaaaatgaggtetaaaaaataattactagtcgag actattattetttaaacagaactgeetttttctactetttatgtaaactetttctattgtgttggtctaacnaggcactattttaaaatttttaattttteccatag 178 226360 cacttaaagagattttgtaaagacttgtgtaaagattttgtaataaaatggtetaagggcettttttecaacattcttaatgttaaaaaatgttttaaaa at getagaagacaaettatgtatattetntatatgtatagcagcacatttcatttatggaaatatgtetcagaatatttatttactaatatatttatcttaagcca tgtcttatgttgagagtgtgacattgttggaataatcattgaaaatgactaacacaagaccetgtaaatacatgataattgcacacagattttacatattt geagaccaaaaatgatttaaaacaagttgtagtettetatggttttg tataaggtaactettagtcetccatttagcacattttaaatcctccaaagaataagtateatgtgattattttagetttacaaaaaaaaagttgaatggegt 179 226777 tttattttcatggectataageaggtacettagtagggcagatataggaaaaacaaattagageaaaacaaatectetacaaatecaaggcaggaaa at - agtggtggcagagtgactcattctcetgtceteccatcaggtcaaatcaggaggetgcagtgaatgcetgttetttgaatgtgtagcagttgttncet gtaactetttaaaacttggetataggetgtttagcacagtacagattaaagatacagttacgtaaacagcaaagtaattttatagtgettcatccatttate atgetttggtttgetaattttttcacatacetttttetatcacagtetgttgettttgtacacattttcatattggggtegaca 180 226835 ggagtgtggtacttctcctagttgcagtcaggcttcatacgctattgtcctgcccgttagagcagccagcgggtacagaatggattttggaagaggg s at - agtcaccactggacctccaaggaagccacgtgcagacatctacaaccttcgatctcctgacgagtttattgttggccaaaaccaggctttgattgaa - ccaggatgaatgcgggtgttggaagtag ttaccetetatttaaatgetttgaaaaacagtgcattgacaatgggttgatatttttetttaaaagaaaaatataattatgaaagccaagataatetgaage 81 227140 ctgttttattttaaaattttatgttctgtggttgatgttgtttgtttgtttgtttctattttgttggttttttactttgttttttgttttgttttgttttgttttgcatactaca 11 at -tgeagttetttaaccaatgtetgtttggetaatgtaattaaagttgttaatttatatgagtgcattteaactatgtcaatggtttttaatatttattgtgtagaa gtactggtaatttttttatttacaatatgtttaaagagataacagtttgatatgttttcatgtgtttatagcagaagttatttatttctatggcattecageggat attttggtgtttgcgaggcatgcagtcaatattttgtacagttagtggacagtattcagcaacgcctgatagcttctttggcctt ctggcacaaccetgacattactaagtggaaatgttaggattttteggcategcatgttagaatctctaaaatttaaacattectgttaaatgactaaggtt 182 227174 tgettttatcaatatgaattgaaggccaatatcataccattaactatgaaagettttaattectaaaaatagttagagatatteaagcaatgctetect 12 at -aatatccatacgcaagtgtgtttatgacacaaattcactagtetgtttaaaaatgaattetttatattgactggtgttccacatatttcagtaatttetgttatg agaggacttgaaatagcaaattgccacacagttaactggatagaccangtacgtggtgatcataaccacttggtactacacccagaaactcaaaat tgtctttctcctgatgagatatgggtgtccttttgtacgtctaggcctaggtaaccagtggagtgattatattagcaaatgtgtttgtatccagagtcttcc ccaggcttcgtcttatttctactgtttttgtgcaattccattgatttatgtccttcctccccctaagtacatcagggaactttccacactataaatg 183 227475 atatgactactgtttggggtttctgggcccccatccgtgtacgtatgtggcatttccaggtatgactgagtgtgagagacatgtcagaggctcttagt at gatttcttgetattgaccgatgcttcactgtgccaaaagagaaaaaaaatgttgggttgtaattaaattatttatatatttttgaaacccgaattgaaaat gttgcaggcaacgggetacagetttattagtggttetetaactgtggtetecttgggccaagcaatttetttaaaggaaaagttgattatgtatgtgggg - 157 tgccaggaccactgccttgaaagca tagcccaaccctatcattttcatattatgaaactgagtccaggtaagtgaatctgtccaaggtcacccagcaaggtatcagtagccctgagggtaag 184 228303 gactctgataaggctcgggagggtcctggaaagcctgaggcggcaggaagagtgtgcagagttgagcgtgtctggaaggctgatccactgctg at -ggcccacatcaaagcccccatggggagcagacccgactgcacatggctcttttgctggaagaagagcatngctgcgcagaggactaaaatttca tctgggaaggcttcttttgactgtcagtagcaggatgtcaccagatgagggtgctatgggaccacagctgtctttgttcccattgcaactcaaccctg cnggaggecgcctgcatccctgagagccttctggagcctacagaggagacattggccagccaaaaggaaaggagtggccagggtacgacct aattatcccttatcattccaaaaatgaaatgctgtgttaaatatctccagggcaaagtggtatgttgactgggacaaacgttagaaattgtattgttcatt 185 228653 geacttgttgecetgttecccaagettgtcaatgtttagagatactattegggttgetaaagecattatteatagaaaatetgeccetacagaagtgtg at tgcatgggecttggaaaatetacatgtgtatatetgagtagcgaagcacagattcactetaattgaaagcagcagtttggttttgtaaatgtaattgcaa ttgacactttettttccettteagttattattttttttaaaggacgttatgagaaggcactatgaaaagcctaattggaatagcattatgaaccatgtaatge tgatgc tgcacactgtgatttgcaaacatatgtccgetcttcaat ggtgggtgtcactacagacatgttetggcgtgtteteegagggatggagcatectgttatatattgacttcaaattgagatgttggettcattttttt tttt acccaattaatctcccaatccctagcaactgtgactctgtatttagcacaagagaaagctgagaatgtgggtcttgcctccttccagaaatatgttgg 186 228754 cctagcttttaatcaaatta at ecataggcatttttaaaettcaaaattttagatattttaaaettttataaaaaaaaaatcaaccaacaagagacttttetgaggaggaacatt at tgtatttgaacaagatcttggtgtgtagttcagtettgcagtatacaagcttttgtgtataaatgttatgatatgattccetgtnttttgcaggggttttttt ctcttttgctttttagataaatatgtatatcaatattttaaattcatctttgcttttttagaggagtttgtaatcaccttataac gaaaaaagetatcagetgtatgttaagagagactettactaacatgttgtaaatattacaattcatgaaatgttattgtaagtetgtaaettaatttttccct 187 228915 gtagtatacaggttggtttggaaatgtgttttggcataaacaagtaaaatgtgeccattttatggtttecatgettttgtaatectaaaaatattaatg at tetagttgttetatattataaccacattgegetctatgcaagccettggaacagaacatactcatettcatgtaggacetatgaaaattgtetatttttatet atatataaagttttetaaaaatgataaaaggttattacgaattttgttgtacaaaatetgtacaaaaatetgtttacatcataatgcaagaattggaaat ttttctatggtagcctagttatttgagcctggtttcaatgtgaga gccgcggtggaaacgggcttggagctggccccataangggctngcggcttcctccgacgccgcccctccccacagcttctcgactgcagtggg 188 229215 gcggggggcaccaacacttggagatttttccggaggggagaggattttctaagggcacagagaatccattttctacacattaattgagtgtgg at - agggacactgctggcaaacggagacctatttttgtacaaagaacccttgacctggggcgtaataaagatgacctggacccctgcccccactattg nngnnnnnnntgctggccaagatctggacacgagcagtccctgaggggcggggtccctggcgtgaggccccgtgacagccacctggg gtgggtttgtgggcactgctgctctgctagggagaagcctgtgtggggcacacctcttcaagggagcgtga gacatgattgtetataatctcgetagcettgtactgtgtgtgcatagcaattacagggaagtaatetagetectgactattatgttgaactatgtegetge 189 229802 tttttacaaaettgtettgatocaaageagteacaatgataaccetgeatatetgggaatcataagtcaactatgtatetetgtgtgtgtatatatatgtatg at -tatgtatetattttcaaactgtgattaatatttaaatattcctactgccattttgtgactgaaaaactacacatgaggaaacgtettagaattttccaatag aggaaaaataacacttgggcaattgtcatgtttcacaacagtttcatttttctcatgatttgtgtagegtggaatgtgtttgetcaatgtgaagggtttt cattgctcaatttctctgtgtaa ggttceggetaacacattttaagtcgccagtgetgettacagtttgaatacatgaaaatectgttttnagatgtttgegcacgtgettattaggaaat 190 231766 gagtctgtatggaaatctcaccacagataatggttaacgaaccgggtcgacatcacaaaggagggtggagactctttttactaacttgaatgagaca s at aaagcagtggtgtcagtttataatcctgatgcatttcagtaataatgtagaaaaacattattttaaaaaagttccaacacacagccatgaggagccnn nnnnnnnntcagttttgaaagaggtgcataataaaactactaaccagaggagtctatgccatttt gagtttcaaetttaaatgttcactatgtcatttagtgtecanctttacggataggttgactatetaaataggcatttttagtcattaaaaaaaantetagtca 191 231832 ccaggaggatccctataactcaaaataacttgtttgtaaaagaaaatttgtttacttacccattagtaagttcctgcatattattataagatggaaatca at aaettttetaggatgaagacagettatttttaagttgtatagtettagttggttagggtetcaattttaattaataaaatacttggtttttattgettgtctttt gaattattgttttaataattaaaatgagcacaaagaangttgaagtteagattaatetettetgaatgatgttttttttgtgatgagttgtttetg caccaagttacgtcaaagtctcaggagcagctccggtctccatagaggctgggtcagcagtgggcaaaacaacttcctttgctgggagetctgctt 192 231941 cctcctacagcccctcggaanncncnctcaagaacttcaccccttcagagacaccgaccatggacatcncaaccaaggggnccttccccacca s_at gcanggaccctcttccttctgtccctccgactacaaccaacagcagccganngacgaacagcacntnnnnnaagatcacaacctcagcgaag accacgatgaagcccc gtacagcatgaaaggcttcctctacaagacactagtcaaagagttgagagctgcggtttctaatctttgtccattactcccttactccctatgagactgt ggacetgtcacttggectetetggtettcagttttetcaccagtaaaacaaggaaettgaaccaaatgacetetagtgttcccettgggtttaaatgteta 193 232151 taaatgtteaatgactagaannnantgegtttetattetttgetgagaaaagagaatgtgatttaagagtaataatttgaataccaattatecac at attaaaattgtgtoctctatgtgtaaggcatagcacatttagcacacatacataagcacactaagcaccttacaaatatectcatttatetttacataatc ttttgaaatngattatgtaatacacacngttttnnaacaatnggtgacttecagetgtttaaaacaaactacagtatggtgettgagtactgacttagga ggtcagcatnggtttcactaggagcttctcaaagcacgctgcc gggatcactgggagaagccatggcattatcttcaggcaatttagtctgtcccaaataaaataaatccttgcatgtaaatcattcaagggttatagtaat 194 232176 atttcatatactgaaaagtgtctcataggagtcctcttgcacatctaaaaaggctgaacatttaagtatcccgaattttttgaattgtttccctatagatt at aattacaattggatttcatcatttaaaaaccatacttgtatatgtagttataatatgtaaggaatacattgtttataaccagtatgtacttcaaaaatgtgtatt gtcaaacatacctaactttcttgcaataaatgcaaaagaaactggaacttgacaattataaatagtaatagtgaagaaaaaatagaaaggttgcaatt atataggccatgggtggctcaaaactttgaa 195 232252 ggacgatgaagccatcattgctgcttggagacgccggcaagaagaaaccaggaccaagctgcagaaaaggagggaggactgagctggggaa at aatctgagaacactgaaagaaaccactcacgttagcatagggctcagggcacacgttgccaccactcatcgcaggatgaggatacagagaggat cttccagaggggcagagccaaaatgagagntaccaagcatnngggcannngaggtggagtagggaggaggcaaggagggggagaaccat 158 caatacgaatacgaggtecgaatgeggaccaaetgataccatttetgttgetcagegccetetaagetttggtgtttcacttaatgtatttgacagtgtt catcacaggctagagaggtgagcttggaaaagcactgtagtttgtcagagactccagtttacatccagaaaggccatgaacataggacacgttct gtetgtagaggettcatatgagacccagaaagtctatcctatggcaagtctgacctctcctggcaatgctcagttctgatt gaagtccatcctttggtccaaagcatctggaagaggaagaagagaggaatgagaaagaaggaagtgatgcaaaacatctccaaagaagttttt ggaacaggaaaatcattcaccactcacagggtcaaatatgaaatacaaaaccacgaaccaatcaacagaatttttatccttccaagatgccagctc 196 232481 attgtacagaaacattttagaaaaagaaagggaacttcagcaactgggaatcacagaatacctaaggaaaaacattgctcagctccagcctgatat s_at ggaggcacattatcctggagcccacgaagagctgaagttaatggaaacattaatgtactcacgtccaaggaaggtattagtggaacagacaaaaa atgagtattttgaacttaaagctaatttacatgctgaacctgactatttagaagtcctggagcagcaaacatagatggagagtttgagggtttgcag aaatgctgtgattctgttttaagtccataccttgtaaataagtgccttacgtgagtgtgtcatcaatcagaacctaagc accaagatggtgcaagttecetttgcagatggegtgggcacacttgatttttattatgagtgaatgtaatetttetgtattttaccagagttacagcaatta 197 234331 cctgaaaagtttcctaacattttaataatgttagggatttcgttttggttttagttgtectaagagacaacaggttacagtaaMccatgatgttgggtg s at - tggetaagetggggattggttetgttccccetgetccgtgtagagaaaagetatatttatactgcattetttetcaaetttcaggtaaaacaaactatga - tttaaaaaaagaaaaaagaaaagacaggtacttttacttcaaagagtgctttgctacatttttatttaaaccaaaaatcaaataaaataaggagggggg ctgggtatactttaaacaaaaccagtcctgaaatgctgttatt ttcttgtecagetgttcacagttttatttttatatagatggtgatataaatatttccaaatgcatttgtaaacattetaaatattetcaagtcatgttcaatgttte 198 235210 ctaaaccttcaattttggccaaagtccccaaacacatcattgccacactctgaagtagagaaagaaaatttaggggccagttctcaaggaacacag s-at gtcetttatttttattttaactaagttgaagacecactcaaaaagetettgtggttttatgttettgacctttcaaetggagtectetcattcageaggtggcc cgtgagacacagaatac cagtgetgctgtgaactaaagtatgtcatttatgctcaaagtttaattettcttettgggatattttaaaaatgetactgagattctgetgtaaatatgactag 199 235976 agaatatattgggtttgetttatttcataggettaattetgtaaatetgaatgaccataatagaaatacatettgtggcaagtaattcacagttgtaaag at -taaataggaaaaattatttatttttattgatgtacattgatagatgccataaatcagtagcaaaaggcacttetaaaggtaagtggtttaagttgcctcaa nagagggacaatgtagetttattttacaagaaggcatagttagatttctatgaaatattattetgtacagttttatatanttttggtteacaaaagtaattat tcttgggtgcctttcaa aaagtatattgtgctagcttgtctaagaataaacttnnatactgttgggggagggctgcacctgtcaagataacetgtcaatgtagtaggaaaacag 200 236894_ gaggggacagtaacagaaaagcacgggaaaagatggcaaggttagttaaaatagaaaagtgctcagttcctcatacctgtaatcccagcagttta at gggggccaaggaaggtgggtcacttgagcccaagagttcaaggccatcctgggcaatgtggcgaaagtgtctacaaaaaaatacaaaaagag gaagaaatgatatttcacaagtttgtatcatttgtcat caaaaccacgattgtgtgccccttttttatnaaanctggnatgtttgaangttgtantacangctntncttctntgttgccaattctgnnacnnnnntn 201 238017 na gaaaaaatagtagaagetattctaeaagaaaaaatgtacttgtatatgecaaagttgttatacttcatgatgttttaaaagttgece at ctcaagacaggactgttatagtgactatttgggcatcttcatgcaatggatggtttgttgaccaaaagaagaagctetaaagaccaactctatg gctaaggtcatctgatacacagtgttacataatgcgtacttcaatgaagaaaagtatttttgtctgacagtggaatatatctggagaccacaagtacca ctcctattctgttatctgg 202 238021 cgttggtgaaatttctgtgatgtgtaattagatgaaagaacatggaaaaatcaaagtgetegagtggtaaatatgttttgggtatt s at cctgtttatagactataatacttttccaattaaaatcetcagttgtcacgcagaagaaggttaagetgtatttgattgccagttttactgaaaatgettagta -a ttttacagtatcaccaaatatattttgtttagccaaggtatagga acccaacaagagctgtgcggctccctgattcctcgccagtgttgctaccgcccttggctcttcttgcatggctggctcttgagacccctggaagtg atggaggcaacgtgagaagcacatggacatccgncentgagcttgagaggcagaggcctgagttctagttacagccccagcagtaccagttgt 203 238984_ gtggactgggagggaggenatcacgtacatactccaagcctccaagcctgtttccccttctgacacaggatcttttgtggctggtatanagtggg at actcaataaatgctgtctggtcgtctggctggcatgcctnatgggcctgagaattgaatagaattacagtgatagaagcatgctggtattgtaagtgg tttgtaagtgtgaggactaaattattattaaatagtaatcacatctaatcttggataaattagtaaaagcaagaatgggagcagtaaaaactaagcaa eccgaactaaaattttattgaattaattcaatttcttgtcatgtaacacaaccccaga gtttctgtttcagtcacaaattagggttattgtgatgtgtatttatgatgaccnttgaacaaatgtgaagaatactgtgaattctatgactttatcaaaatca 204 241031 gecacatecaggagettgeagttgttgaccaaatgaatgatgacatagagtagtteagatetateatgtgetettctatetaatcagtcaatattettg at gccetcaagceaacattcattttatgtataacetttcatgattttgaaattttgatagggtaaetgctaatgagtteacaaatgtagcactttaaaagg aaaataaatggagagtgaaaacaacttggctacgtataattgtgggt caacccattttgtgccacatgcaagttttgaataaggatggtatagaaaacaacgctgcatatacaggtaccatttaggnannancngatgcctttnt 205 37892_a gggggcagaatcacatggcaaaagctttgaaaatcataaagatataagttggtgtggctaagatggaaacagggctgattcttgattcccaattctc t aaetcteetttcctatttgaatttetttggtgetgtagaaaacaaaaaaagaaaaatatatattcataaaaaatatggtgetcatetcatecatecaggat gtactaaaacagtgtgtttaataaattgtaattattttgtgtacagttctatactgttatctgtgtccatttccaaaacttgcacgtgtcctgaattcc acagacttggcaagggaccccctggttctgagccagtagctgccatctggaaattcctctttnnnnnnnnnnnnnnnnn nnnnnnnnnn 206 60474_a nnnctcccaggnacccgctgaatttctgaggccttgcttaaagctcagaagtggtttaggcatttggaaaatctggttcacatcataaagaattgat t ttgaaatgttttctatagaaacaagtgtaagtgtnaccgtattatacttgatgttggteatttctcagtcetatetcagttetatattttagaacctagte agtettaagattataaetggtectacattaaaataatgettetegangtcagattttacctgttgetgetgagaacatetetgcetaannnnnnnnn Innnnnnnetteagtteaacatgettecttagettttcatagttgtetgacatttcatgaaa - 159 BIBLIOGRAPHY Affymetrix. GeneChip expression data analysis fundamentals. Affymetrix, Santa Clara, CA USA, 2001. Alon et al., Proc. Natl. A cad. Sci. USA: 96, 6745-6750, June 1999 Ausubel, F. et al., "Current Protocols in Molecular Biology", John Wiley & Sons, (1998) Bonner et al (1973) J. Mol. Biol. 81:123 DeRisi, et al., Nature Genetics 14:457-460 (1996) Germer et al., Genome Res. 10:258-266 (2000) Guo et al., Nucleic Acids Res. 22:5456-5465 (1994) Heid et al., Genome Res. 6:986-994 (1996) Hubbell E.W., W. M. Liu, and R. Mei. Robust estimators for expression analysis. Bioinformatics, 18:1585-1592, 2002. Irizarry R.W., B. M. Bolstad, F. Collin, L. M. Cope, B. Hobbs, and T. P. Speed. Summaries of affymetrix genechip probe level data. Nucleic Acid Research, 31, 2003. Kraus, M. and Aaronson, S., 1991. Methods Enzymol., 200:546-556 Maskos and Southern, Nuc. Acids Res. 20:1679-84, 1992 Moore et al., BBA, 1402:239-249, 1988 Nielsen (1999) Curr. Opin. Biotechnol. 10:71-75 Nielsen et al. (1991) Science 254: 1497-1500 Pease et al., Proc. Natl. Acad. Sci. USA 91(11):5022-5026 (1994) Pevzner et al., J. Biomol. Struc. & Dyn. 9:399-410, 1991 Schena, et al. Science 270:467-470 (1995) Smith et al., Science 258:1122-1126 (1992) -160 Smyth G.K. Bioinformatics and Computational Biology Solutions using R and Bioconductor. Springer, New York, 2005. Smyth G.K. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3(l):Article 3, 2004. T. Sano and C. R. Cantor, Bio/Technology 9:1378-81 (1991) Urdea et al., Nucleic Acids Symp. Ser., 24:197-200 (1991) Wedemeyer et al., Clinical Chemistry 48:9 1398-1405, 2002) Weissleder et al., Nature Medicine 6:351-355, 2000

Claims (30)

1. A method of screening for the onset or predisposition to the onset of a large intestine neoplasm or monitoring the progress of a neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 221577_x_at 205828_at 201341 at 221922 at 205886 at 201416 at 60474 at 205890 s at 201417_at 222696_at 205910_s_at 201468_s_at 223447_at 205941_s_at 201506_at 223970_at 206224_at 201563 at 225541 at 206976 s at 201656_at 225835_at 207173_x_at 201925_s_at 226360_at 207457_s_at 201926_s_at 227174_at 208079_s_at 202286_s_at 227475_at 208712_at 202718_at 228303_at 209218 at 202831_at 228653_at 209309_at 202833_s_at 228754_at 209752_at 202935_s at 228915_at 209773_s_at 202936_s_at 229215_at 209774_x_at 203124_s_at 231832_at 209792 s at 203256_at 231941 s_at 209875_s_at 203313_s_at 232176_at 209955_s-at 203510_at 232252_at 210052_s_at 203860_at 232481_s_at 210511_s_at 203895 at 234331 s at 210559 s at 203896_s_at 235210_s_at 210766_s_at 203961_at 235976_at 211506_s_at 203962_s_at 236894_at 212281_s_at 204259_at 238017_at 212344_at 204351 at 212353 at 204401_at 238984_ at 212354_at 204404_at 241031_at 213905_x at - 162 204855_at 200660_at 214022_s at 204885_s_at 200832_s_at 214974_x_at 205174_s_at 200903_s_at 215091_s_at 205366_s_at 201014_s_at 217430_x_at 205470_s_at 201112_s_at 217996_at 205513 at 201195 s at 218507 at 205765_at 201261_x_at 218963_s_at 205825_at 201292_at 218984_at 205927_s_at 201338_x_at 219787_s_at 205983_at 201479_at 219911_s_at 206239 s at 201577 at 221729 at 206286_s_at 201601_x_at 221730_at 207158_at 201666_at 221731_x_at 207850_at 202310_s_at 221923_s_at 209369_at 202311_s_at 37892_at 210445 at 202404 s at 222449 at 210519_s_at 202431_s_at 222450_at 211429_s_at 202504_at 222549_at 212063_at 202779_s_at 222608_s_at 212070_at 202859_x_at 223062_s_at 212190 at 202954 at 224428 s at 212531_at 202998_s_at 224646 x_at 212942_s_at 203083_at 224915_x_at 213880_at 203213_at 225295_at 213975_s_at 203878_s_at 225520_at 214235 at 204051 s at 225664 at 214651_s_at 204127_at 225681_at 217523_at 204170_s_at 225767_ at 217867_x_at 204320_at 225799_at 218086_at 204470_at 225806_at 218211 s at 204475 at 226227_x at 218704_at 204580_at 226237_at 218796_at 204620_s_at 226311_at 218872_at 204702_s_at 226777_at 219630_at 205361_s_at 226835_s_at 219682 s at 205476 at 227140 at 219727_at 205479_s_at 229802_at 219955_at 205713_s_at 231766_s_at - 163 219956_at 205815_at 232151_at; and/or 200665_s_at (ii) KIAA 1199 CDCA7 LOXL2 FOXQ1 RNF43 TDGF1 AZGP1 MMP1 CCL20 MTHFD1L MYC TCN1 CTSE ANLN COMP MMP7 MSLN H19 AURKA WDR72 TIMP1 FAP PAICS INHBA PCSK1 DACHI PUS7 COL1IAl CST1 VCAN ZNRF3 GDF15 BGN SQLE CCND1 CTHRC1 AXIN2 REG3A CSE1L COL1Al MET TESC PFDN4 LGR5 SOX9 UBE2C C20orf42 DUSP27 TMEPAI TMEM97 SLC11A2 SERPINB5 CDH11 TRIM29 NFE2L3 ASCL2 MMP11 KLK11 CEL SULFI CXCL1 LY6G6D NLF1 SLC6A6 QPCT SLC7A5 NPDC1 TACSTD2 PDZKlIP1 FABP6 ENCI NEBL CD55 SLITRK6 SERPINE2 PCCA ECT2 MLPH UBE2S FLJ37644 COL12A1 HOXA9 TOP2A KRT23 L1TD1 TBX3 CDC2 CXCL3 WDR51B BACE2 RFC3 MMP3 HOXB6 GPX2 LILRB1 SFRP4 FAM84A TPX2 NPM1 UBD COL5A2 KCNN4 RDHE2 SCD MMP12 LOC541471 PLAU DPEP1 SORD CYP3A5 HSPH1 LCN2 PSATI ANXA3 GTF3A PLCB4 CXCL5 CYP3A5P2 KLK1O SPINK1 IGFBP2 Cl4orf94 GPSM2 DUOX2 TGIF1 RP5-875H10.1 NME1 SOX4 CXCL2 RPL22L1 MUC20 THBS2 SLCO4A1 APOBECI RPESP -164 CLDN1 CKS2 HIG2 RRM2 REGIB PHLDA1 COL1OA1 DKC1 CDH3 SERPINA1 AHCY CD44 SPARC COL1A2 IL8 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to control levels is indicative of a neoplastic large intestine cell or a cell predisposed to the onset of a neoplastic state.
2. The method according to claim 1 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 225681_at; 227140_at; and/or (ii) CTHRC1.
3. The method according to claim 1 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 227475_at, 204475_at, 202859_x_at, 202404_sat; and/or (ii) FOXQ1, MMPI, IL8, COL1A2.
4. The method according to claim 1 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 205513_at, 204259_at, 227174_at, 210511_sat, 37892_at; and/or (ii) TCN1, MMP7, WDR72, INHBA, COL1 1A1.
5. The method according to claim 1, wherein said one or more genes or transcripts are selected from: - 165 (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 221577_x_at 212942_s_at 202310_s_at 213880_at 232252_at 204855_at 229215_at 212354_at 228754_at 202286_s_at 203962_s_at 203860_at 202935_s_at 218963_s_at 207850_at 205828_at 204051_s_at 205890_s_at 200832_s_at 205983_at 21253 1_at; and/or (ii) KIAA1199 GDF15 COL1A1 LGR5 DUSP27 SERPINB5 ASCL2 SULFI SLC6A6 TACSTD2 NEBL PCCA FLJ37644 KRT23 CXCL3 MMP3 SFRP4 UBD SCD DPEP1 LCN2.
6. The method according to any one of claims 1-5 wherein said control level is a non neoplastic level.
7. The method according to any one of claims 1-5 wherein said neoplastic cell is an adenoma or an adenocarcinoma.
8. The method according to any one of claims I to 3 wherein said neoplasm is an adenoma and said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 201328_at 205825_at 219956_at 201468_s_at 205927_s_at 221922 at 201656_at 206239_s_at 223447_at 201925_s_at 207158_at 223970_at 201926_s at 210445_at 225835_at 202718_at. 210519_s_at 226360_at 202831_at 211429_s_at 228303_at 202833_s_at 212063_at 228653_at 203124_s_at 213975_s_at 228915_at 203313_s_at 214235_at 231832_at 203860_at 214651_s at 231941_s_at - 166 203895_at 217523_at 232176_at 203896_s_at 217867_x_at 232481_s_at 204401_at 218086_at 234331_s_at 204885_s_at 218211_s_at 235210_s_at 205174_s_at 218796_at 235976_at 205366_s_at 219630_at 236894_at 205470_s_at 219682_s_at 238017_at 205513_at 219727_at 238984_at 205765_at 219955_at; 24103 1_at; and/or (ii) APOBECI HOXB6 QPCT BACE2 IGFBP2 RDHE2 C20orf42 ITGA6 REG4 CD44 KCNN4 RETNLB CD55 KLK11 RP5-875H10.1 CTSE L1TD1 RPESP CYP3A5 LILRB1 SERPINAl CYP3A5P2 MLPH SLC11A2 DACH1 MSLN SLC12A2 DUOX2 MUC20 SLITRK6 ETS2 NLF1 SPINK1 FABP6 NPDC1 TBX3 FAM84A NQO1 TCN1 GALNT6 PCCA TGIF1 GPSM2 PCSK1 WDR51B GPX2 PDZK1IP1 ZNRF3 HOXA9 PLCB4.
9. The method according to claim 1 wherein said neoplasm is a cancer and said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200660_at 205476_at 217430_x_at 200665_s_at 205479_s_at 217996_at 200832_s_at 205713_s_at 218507_at 200903_s_at 205815_at 218963_s_at 201014 s at 205828_at 218984_at 201112_s_at 205886_at 219787_s_at - 167 201195_s_at 205890_s_at 219911_s_at 201261_x_at 205910_s_at 221729_at 201292_at 205941_s_at 221730_at 201338_x_at 206224_at 221731_x_at 201479_at 206976_s_at 221923_s_at 201577_at 207173_x_at 37892_at 201601_x_at 207457_s_at 222449_at 201666_at 208079_s_at 222450_at 202310_s_at 208712_at 222549_at 202311_s_at 209218_at 222608_s_at 202403_s_at 209309_at 223062_s_at 202404_s_at 209752_at 224428_s_at 202431_s_at 209773_s_at 224646_x_at 202504_at 209774_x_at 224915_x_at 202779_s_at 209792_s_at 225295_at 202859_x_at 209875_s_at 225520_at 202954_at 209955_s_at 225664_at 202998_s_at 210052_s_at 225681_at 203083_at 210511_s_at 225767_at 203213_at 210559_s_at 225799_at 203878_s at 210766_s at 225806_at 204051_s_at 211506_s_at 226227_x_at 204127_at 212281_s_at 226237_at 204170_s_at 212344_at 226311_at 204320_at 212353_at 226777_at 204470_at 212354_at 226835_s_at 204475_at 213905_x at 227140_at 204580_at 214022_s_at 229802_at 204620_s_at 214974_x_at 231766_s_at 204702_s_at 215091_s_at; 232151_at; and/or 205361_s at (ii) AHCY DKC1 PSATI ANLN ECT2 PUS7 AURKA FAP REGlA AZGP1 GTF3A REGIB BGN H19 REG3A C 14orf94 HIG2 RFC3 - 168 C20orfl99 HSPH1 RRM2 CCL20 IFITM1 SlOAl CCND1 IL8 SCD CDC2 INHBA SFRP4 CDCA7 KLK10 SLC39A1O CDH11 KRT23 SLC7A5 CEL LOC541471 SLCO4A1 CKS2 LOXL2 SPARC CLDN1 LY6G6D SPP1 COL1OAl MMP1 SQLE COL1IA1 MMPI1 SULFI COL12A1 MMP12 THBS2 COLIAI MMP3 TIMPI COL1A2 MTHFD1L TMEM97 COL5A2 MYC TMEPAI COL8A1 NFE2L3 TOP2A COMP NME1 TPX2 CSE1L NPM1 TRIM29 CST1 PAICS UBD CTHRC1 PFDN4 UBE2C CXCL1 PHLDA1 UBE2S CXCL2 PLAU VCAN CXCL5.
10. A method of screening for the onset of a large intestine neoplasm in an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202286_s_at 235976_at 209309_at 204259_at 236894_at 211506_s_at 204885_s_at 214974_x_at 205174_s_at 238984_at 219787_s_at 205825_at 241031_at 37892_at 207850_at 202311_s_at 222608_s_at 213880_at 204320_at 223062_s at 217523_at 204475_at 225806_at 227174_at 204702_s_at 226237_at -169 228915 at 205910 s at 227140 at 232252_at 206224_at 229802_at; and/or (ii) MMP1 PCSK1 ANLN MMP7 CST1 DACHI LGR5 QPCT COL1 1A1 WDR72 ECT2 C14orf94 COL1IA1 SLITRK6 AZGP1 COL1A1 L1TD1 REG4 DUSP27 KIAA 1199 NFE2L3 NLF1 PSATI CEL IL8 CXCL5 CD44 TACSTD2 CXCL3 COL8A1 MSLN in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to background levels is indicative of a neoplastic cell or a cell predisposed to the onset of a neoplastic state.
11. The method according to claim 10 wherein said neoplastic cell is an adenoma or adenocarcinoma.
12. The method according to any one of claims 10 to 11 wherein said neoplasm is an adenoma and said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 204885_s_at, 217523_at, 236894_at, 205174_s_at, 228915_at, 238984_at, 205825_at, 235976_at, 241031_at; and/or (ii) CD44, MSLN, QPCT, DACHI, NLF1, REG4, L1TD1, PCSK1, SLITRK6.
13. The method according to any one of claims 13 to 15 wherein said neoplasm is a cancer and said one or more genes or transcripts are selected from: (i) the gene or genes detected by Affymetrix probeset IDs: 202311_s_at 209309_at 223062_s_at 204320_at 211506_s_at 225806_at 204475_at 214974_x_at 226237_at 204702_s_at 219787_s at 227140_at - 170 205910_s at 37892_at 229802_at; and/or 206224_at 222608_s_at (ii) ANLN COL1Al IL8 AZGP1 COL8A1 MMP1 C 14orf94 CSTl NFE2L3 CEL CXCL5 PSATI COL1lAl ECT2.
14. The method according to any one of claims 10-13 wherein said background level is the background of the testing methodology.
15. A method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200884_at 214234_s_at 226248_s_at 203240_at 214235_at 226302_at 203963_at 214433_s_at 227676_at 204508_s_at 215125_s_at 227719_at 204607_at 215867 x at 227725_at 204811 s at 217109 at 228232_s at 204895_x_at 217110_s_at 229070_at 204897_at 218211_s_at 231832_at 205259_at 219543_at 232176_at 205765_at 219955_at 232481_s_at 205927_s_at 221841_s_at 235976_at 208063_s_at 221874_at 236894_at 208937_s_at 223969_sat 237521_x_at 210107_at 223970_at 242601_at 213106_at; and/or (ii) CLCA1 CTSE ATP8Bl FCGBP C6orfl05 CACNA2D2 HMGCS2 CKB KLF4 RETNLB ATP8A1 CYP3A5P2 - 171 LlTD1 MUC4 CAPN9 SLITRK6 UGTlA1 NR3C2 VSIG2 SELENBP1 PBLD LOC253012 PTGER4 CA12 ST6GALNAC1 MLPH WDR51B ID1 KIAA1324 FAM3D CYP3A5. in said cell or cellular population wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal cancer cell level is indicative of an adenoma cell or a cell predisposed to the onset of an adenoma state.
16. A method of characterising a neoplastic cell or cellular population, which cell or cellular population is derived from the large intestine of an individual, said method comprising measuring the level of expression of one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 200600_at 204006_s_at 213428_s_at 200665 s at 204051 s at 213524 s at 200832_s_at 204122_at 213869_x_at 200974_at 204320_at 213905_x_at 200986_at 204475_at 214247_s_at 201058_s at 204620_s_at 215049_x_at 201069_at 205479_s_at 215076 s at 201105_at 205547_s_at 215646_s_at 201141_at 205828_at 216442_x_at 201147_s_at 207173_x_at 217430_x_at 201150_s_at 207191_s_at 217762_s_at 201162 at 208747 s at 217763 s at 201163_s_at 208782_at 217764_s_at 201185_at 208788 at 218468_s_at 201261_x_at 208850_s_at 218469_at 201289_at 208851 s_at 218559_s_at 201426_s_at 209101 at 218638 s at 201438_at 209156_s_at 219087_at 201616_s_at 209218_at 221011 s_at - 172 201645 at 209395 at 221729 at 201667_at 209396_s_at 221730_at 201744_s_at 209596_at 221731_x_at 201792_at 209875_s_at 37892_at 201842_s at 209955_s_at 223122_s_at 201852_x_at 210095_s_at 223235_s_at 201859_at 210495_x_at 224560_at 201893_x_at 210511_s_at 224694_at 202237_at 210764_s_at 224724_at 202238_s_at 210809_s_at 225664_at 202283 at 211161 s at 225681 at 202291_s_at 211571_s_at 225710_at 202310_s_at 211719_x_at 225799_at 202311_s_at 211813_x_at 226237 at 202403_s_at 211896_s_at 226311_at 202404 s at 211959 at 226694 at 202450_s_at 211964_at 226777_at 202620_s_at 211966_at 226930_at 202766_s_at 211980_at 227099_s_at 202859_x_at 211981_at 227140_at 202878 s at 212077 at 227566 at 202917_s_at 212344_at 229218 at 202998_s_at 212353_at 229802_at 203083_at 212354_at 231579_s_at 203325_s_at 212464_s_at 231766_s_at 203382 s at 212488 at 231879 at 203477_at 212489_at 232458 at 203570 at 212667_at 233555 s_at 203645_s_at 213125_at 234994_at 203878_s_at; and/or (ii) COL1A2 LGALS1 SRGN CTHRCl ELOVL5 LBH FN1 MGP CTGF POSTN MMP2 TNC SPPI LOXL2 GOS2 MMP1 MYL9 SQLE SPARC DCN EFEMP1 - 173 LUM CALDI APOE GREMI FBN1 MSN IL8 MMP3 IGFBP3 IGFBP5 IGFBP7 SERPINFI SFRP2 FSTL1 ISLR SULF1 COL4A2 HNT ASPN VCAN COL5A1 COL6A3 SMOC2 OLFML2B COL8Al HTRA1 KIAA1913 COL12A1 CYR61 PALM2-AKAP2 COL5A2 FAP SERPINGI CDH11 VIM TYROBP THBS2 TIMP2 ACTA2 COL15A1 SCD COL3A1 COL1 lAl TIMP3 PLOD2 SI0OA8 AEBP1 MMP11 FNDC1 GJA1 CD163 SFRP4 NNMT FCGR3B INHBA COL1A1 PLAU COL6A2 SULF2 MAFB ANTXR1 COL6A1 LOC541471 GPNMB SPON2 LOC387763 BGN CTSK CHI3L1 TAGLN MXRA5 THYl COL4A1 CiS LOXL1 RAB31 DKK3 CD93 in a biological sample from said individual wherein a higher level of expression of the genes or transcripts of group (i) and/or group (ii) relative to a gastrointestinal adenoma cell level is indicative of a cancer or a cell predisposed to the onset of a cancerous state.
17. The method according to claim 15 wherein said one or more genes or transcripts are selected from: (i) the gene or genes detected by Affymetrix probeset IDs: 210107_at; and/or - 174 (ii) CLCA1.
18. The method according to claim 15 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 203240_at, 219955_at, 242601_at, 204607_at, 232481_sat, 227725_at, 223969_sat, 228232_s_at; and/or (ii) FCGBP, LlTD1, LOC253012, HMGCS2, SLITRK6, ST6GALNAC1, RETNLB, VSIG2.
19. The method according to claim 16 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202404_s_ at, 210809_sat, 227140_at, 225681_at, 209875_sat, 204475_at, 212464_s_at; and/or (ii) COL1A2, FN1, SPP1, CTHRC1, POSTN, MMPl.
20. The method according to claim 16 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset .IDs: 200665_sat, 218468_sat, 211959_at, 201744_s_at, 202859_x_at; and/or (ii) SPARC, GREM1, IGFBP5, LUM, IL8.
21. The method according to claim 16 wherein said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 223122_s_at 207173_x_at 210511_s at 212353_at 203083_at 209156_s_at 219087_at 203477_at 224694_at 201438_at 37892_at 201141_at 226237_at 202917_s_at 213905_x_at 225664_at 226930_at 205547_s_at - 175 221730_at 204051_s at; and/or (ii) SFRP2 CDH1 1 INHBA SULFI THBS2 COL6A2 ASPN COL15A1 ANTXR1 COL6A3 COL1IA1 GPNMB COL8A1 S100A8 BGN COL12A1 FNDC1 TAGLN COL5A2 SFRP4.
22. The method according to claim 15 wherein said control level is the background level of neoplastic tissue and said one or more genes or transcripts are selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 235976_at 236894_at; and/or (ii) SLITRK6 L1TD1.
23. The method according to claim 16 wherein said control level is the background level of neoplastic tissue and said one or more genes or transcripts selected from: (i) the gene, genes or transcripts detected by Affymetrix probeset IDs: 202311_sat, 209396_sat, 226237_at, 204320_at, 215646_sat, 227140_at, 204475_at, 37892_at, 229802_at, 209395_at; and/or, (ii) COL1A1, VCAN, CHI3L1, MMP1, COL8A1, COL lAl.
24. The method according to any one of claims 1-23 wherein said cell is a colorectal cell.
25. The method according to any one of claims 1 to 24 wherein said biological sample is a faecal sample, enema wash, surgical resection, tissue biopsy or blood sample.
26. The method according to any one of claims 1 to 25 wherein said level of expression is mRNA expression or protein expression.
27. The method according to any one of claims 1-26 wherein said individual is a human.
28. A molecular array, which array comprises a plurality of: -176 (i) nucleic acid molecules comprising a nucleotide sequence corresponding to any one or more of the neoplastic marker genes listed in any one of claims 1-23 or a sequence exhibiting at least 80% identity thereto or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or (ii) nucleic acid molecules comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or (iii) nucleic acid probes or oligonucleotides comprising a nucleotide sequence capable of hybridising to any one or more of the sequences of (i) under medium stringency conditions or a functional derivative, fragment, variant or homologue of said nucleic acid molecule; or (iv) probes capable of binding to any one or more of the proteins encoded by the nucleic acid molecules of (i) or a derivative, fragment or, homologue thereof wherein the level of expression of said marker genes of (i) or proteins of (iv) is indicative of the neoplastic state of a cell or cellular subpopulation derived from the large intestine.
29. A diagnostic kit for assaying biological samples said kit comprising an agent for detecting one or more of the neoplastic markers listed in claims 1-23 and reagents useful for facilitating the detecting by said agent.
30. The method according to any one of claims 1-9 wherein said control level is a non neoplastic level.
AU2015203005A 2007-10-23 2015-06-03 A method of diagnosing neoplasms Active AU2015203005B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU2015203005A AU2015203005B2 (en) 2007-10-23 2015-06-03 A method of diagnosing neoplasms
AU2018200024A AU2018200024A1 (en) 2007-10-23 2018-01-02 A method of diagnosing neoplasms

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US98211407P 2007-10-23 2007-10-23
US60/982,114 2007-10-23
AU2008316319A AU2008316319A1 (en) 2007-10-23 2008-10-23 A method of diagnosing neoplasms
PCT/AU2008/001571 WO2009052573A1 (en) 2007-10-23 2008-10-23 A method of diagnosing neoplasms
AU2015203005A AU2015203005B2 (en) 2007-10-23 2015-06-03 A method of diagnosing neoplasms

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2008316319A Division AU2008316319A1 (en) 2007-10-23 2008-10-23 A method of diagnosing neoplasms

Related Child Applications (1)

Application Number Title Priority Date Filing Date
AU2018200024A Division AU2018200024A1 (en) 2007-10-23 2018-01-02 A method of diagnosing neoplasms

Publications (2)

Publication Number Publication Date
AU2015203005A1 true AU2015203005A1 (en) 2015-06-25
AU2015203005B2 AU2015203005B2 (en) 2017-10-26

Family

ID=54147361

Family Applications (2)

Application Number Title Priority Date Filing Date
AU2015203005A Active AU2015203005B2 (en) 2007-10-23 2015-06-03 A method of diagnosing neoplasms
AU2018200024A Abandoned AU2018200024A1 (en) 2007-10-23 2018-01-02 A method of diagnosing neoplasms

Family Applications After (1)

Application Number Title Priority Date Filing Date
AU2018200024A Abandoned AU2018200024A1 (en) 2007-10-23 2018-01-02 A method of diagnosing neoplasms

Country Status (1)

Country Link
AU (2) AU2015203005B2 (en)

Also Published As

Publication number Publication date
AU2015203005B2 (en) 2017-10-26
AU2018200024A1 (en) 2018-01-25

Similar Documents

Publication Publication Date Title
US20180100200A1 (en) Method of diagnosing neoplasms
US20190024188A1 (en) Method of diagnosing neoplasms - ii
US20110082188A1 (en) Gene expression profiling of inflammatory bowel disease
EP2767595A1 (en) Detection method for characterising the anatomical origin of a cell
AU2006210794A1 (en) Biomarkers for tissue status
EP2319939A2 (en) Gene expression profiling of inflammatory bowel disease
AU2019222816B2 (en) A method of diagnosing neoplasms - II
AU2015203005A1 (en) A method of diagnosing neoplasms

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)