Brief Description Of Drawings
Fig. 1 has shown the Ensembl human genome note of Septin 9 and Q9HC74 gene transcripts.Also shown the relative position of SEQ ID NO:2 and SEQ ID NO:3.
Fig. 2 provides three figure.Two figure on the left side have shown the susceptibility of measuring SEQ ID NO:1 (measuring 2) in embodiment 2 in colorectal carcinoma and blood sample.The ROC that the figure on the right provides colorectal carcinoma to detect.
Fig. 3 has shown the methylation level of measuring in other cancer of embodiment 4.
Fig. 4 shows the methylation level of measuring in other non-Cancerous disease of embodiment 4.
Fig. 5 to Figure 29 provides the matrix of the bisul-phite sequencing data of embodiment 5.Every row of this matrix represent the repetition sequencing data of a sample, and all of each sample repeat to be in a piece.Every row of matrix represents the single CpG site in fragment.The CpG digital display of amplified production is shown in the left side of matrix.The methylated amount of each CpG position measurement by from light grey (0% methylates), to ash (50% methylates), represent to grey black (100% methylates).Successfully do not checked order in some amplified productions, sample or CpG position, they are shown as white.
Fig. 5 to Figure 12 provided before 4 to be had in 10% to 20% methylated sample by quantitative (analyzing by HeavyMethyl), transforms the order-checking overview of amplified production according to the bisul-phite of the genome sequence of table 21.
Figure 13 to Figure 20 provided before 2 and quantitatively (has been passed through HeavyMethyl
tManalyze) have higher than in 20% methylated sample, transform the order-checking overview of amplified production according to the bisul-phite of the genome sequence of table 21.
Figure 21 to Figure 22 provides the order-checking overview that transforms amplified production in 3 healthy individuals blood samples according to the bisul-phite of the genome sequence of table 21.
Figure 23 to Figure 29 provided before 6 by quantitatively (analyzing by HeavyMethyl) to be had and methylates in the sample of (but higher than 0%) lower than 10%, transforms the order-checking overview of amplified production according to the bisul-phite of the genome sequence of table 21.
Each provides 3 figure Figure 30 to Figure 37.Two figure in left side have shown in embodiment 2 in colorectal carcinoma and blood sample according to the susceptibility of the mensuration of table 12.The figure of top has shown the bivariate distribution of two kinds of samples, and the figure of below provides multiclass distribution.The Y-axis of described figure shows the ratio with the sample by analysis that is greater than the methylation level that is presented at the quantitative values in X-axis.
Detailed description of the invention
definition:
Term " observation/expection ratio " (" O/E ratio ") refers to the frequency of CpG dinucleotides in specific dna sequence, corresponding to the belt length (band length) of [CpG number of sites/(C base is counted x G base number)]/each fragment.
Term " CpG island " refers to the successive zone of the genomic dna that meets following standard: (1) is corresponding to the CpG dinucleotide frequency > 0.6 of " observation/expection ratio ", and (2) " GC content " > 0.5.The length on CpG island conventionally but not always approximately 0.2 to about 1KB, or between 2kb.
Term " methylation state " or " methylation status " refer to that in DNA sequence dna, one or more CpG dinucleotides place exists or do not exist 5-methylcytosine (" 5-mCyt ").One or more specific CpG methylation state that site (every place has two CpG dinucleotides sequences) locates that methylates comprises " unmethylated ", " permethylated " and " hemimethylated " in DNA sequence dna.
Term " half-methylate " or " hemimethylation " refer to the methylation state of double-stranded DNA, wherein only have a chain to be methylated.
While being used in herein, term " AUC " is the abbreviation of area under a curve (area under curve).Particularly, it refers to experimenter's operating characteristic (ROC) area under a curve.ROC curve is the curve of the relative false positive rate of True Positive Rate, for the different possibility threshold values of diagnostic test.The compromise (any raising of susceptibility all can be attended by specific decline) between susceptibility and the specificity of selected threshold value is depended in its demonstration.ROC area under a curve (AUC) is that (area is the bigger the better, and optimum value is 1, and the ROC curve of random test is positioned at diagonal lines, and area is 0.5 for measurement to diagnostic test accuracy; Referring to J.P.Egan.Signal Detection Theory andROC Analysis (signal detection theory and ROC analyze), Academic Press, New York, 1975).
Term " supermethylation " refers to the amount of the 5-mCyt finding with respect to corresponding CpG dinucleotides place in normal control DNA sample, the average methylation state that in the DNA sequence dna corresponding to test dna sample, the occurrence rate of one or more CpG dinucleotides 5-mCyt of place increases.
Term " hypomethylation " refers to the amount of the 5-mCyt finding with respect to corresponding CpG dinucleotides place in normal control DNA sample, the average methylation state that in the DNA sequence dna corresponding to test dna sample, the occurrence rate of one or more CpG dinucleotides 5-mCyt of place reduces.
Term " microarray " in a broad sense, is accepted as this area, refers to " DNA microarray " and " DNA chip ", comprises all solid supports of having approved, and comprises for by all methods thereon attached nucleic acid molecule or nucleic acid thereon.
" genetic parameter " is sudden change and the polymorphism of gene and sequence, for their adjusting further required.Be considered to especially insertion, deletion, point mutation, inversion and the polymorphism of sudden change, and especially preferred SNP (single nucleotide polymorphism).
" epigenetic parameter (epigenetic parameter) " refers in particular to cytosine methylation.Other epigenetic parameter for example comprises the acetylize of histone, but it can not adopt described method direct analysis, but it is relevant to DNA methylation.
Term " hydrosulphite reagent " refers to comprise hydrosulphite (bisulfite), disulfite, bisul-phite (hydrogen sulfite) or its combination, as disclosed herein, for distinguishing methylated and unmethylated CpG dinucleotides sequence.
Term " methylation assay " refers to determine any mensuration of the methylation state of one or more CpG dinucleotides sequences in DNA sequence dna.
Term " MS.AP-PCR " (the arbitrarily primed polymerase chain reaction that methylates responsive) refers to adopt the primer that is rich in CG to scan genome to can concentrate on the technology known in the art in the region that most probable contains CpG dinucleotides comprehensively, as people such as Gonzalgo, Cancer Research57:594-599,1997 is described.
Term " MethyLight
tM" refer to known in the art by people such as Eads, Cancer Res.59:2302-2306,1999 describe the real time pcrs based on fluorescence.
In its embodiment used herein, term " HeavyMethyl
tM" assay method refers to such mensuration, wherein covers between amplimer or the blocking-up probe (herein also referred to as blocker) that is amplified the methylation specific of the CpG position that primer covers makes the selective amplification nucleic acid samples of methylation specific become possibility.
In its embodiment used herein, term " HeavyMethyl
tMmethyLight
tM" assay method refers to HeavyMethyl
tMmethyLight
tMmeasure, it is MethyLight
tMthe variant of measuring, wherein MethyLight
tMmeasuring and cover the methylation specific blocking-up probe of CpG position between amplimer combines.
Term " Ms-SNuPE " (methylate responsive mononucleotide primer extend) refers to known to Gonzalgo & Jones, Nucleic Acids Res.25:2529-2531,1997 mensuration of describing.
Term " MSP " (methylation specific PCR) refers to known to people such as Herman, Proc.Natl.Acad.Sci.USA 93:9821-9826,1996 and by United States Patent (USP) 5,786,146 methylation assays of describing.
Term " COBRA " (the hydrosulphite restriction analysis of associating) refers to known to Xiong & Laird, Nucleic Acids Res.25:2532-2534,1997 methylation assays of describing.
Term " MCA " (amplification of methylated CpG island) refers to by people such as Toyota, Cancer Res.59:2307-12,1999 and WO 00/26401A1 in the methylation assay described.
Term " hybridization " should be understood to that oligonucleotide and complementary sequence, along the bonding of Watson-Crick base pairing line in sample DNA, form duplex structure.
" the tight hybridization conditions " of definition is included at 68 ℃ and hybridizes in 5x SSC/5xDenhardt solution/1.0%SDS herein, and at room temperature in 0.2x SSC/0.1%SDS, wash, or comprise its known condition of equivalent (for example such condition: hybridization is carried out at 60 ℃ in 2.5xSSC damping fluid, be the several washing steps under low-buffer concentration at 37 ℃ subsequently, and keep stable).The medium stringent condition of definition is included at 42 ℃ and is washing in 3xSSC herein, or its known condition of equivalent.Can change salt concn and temperature parameter to obtain the identity of optimum level between probe and target nucleic acid.Can obtain in the prior art the guidance to these conditions, the people such as such as Sambrook, 1989, Molecular Cloning, ALaboratory Manual (molecular cloning experiment guide), Cold Spring Harbor Press, and the people such as Ausubel N.Y., Current Protocols in Molecular Biology (up-to-date molecular biology experiment), (John Wiley & Sons, N.Y.) unit 2.10.
Term " methylation specific restriction enzyme " or " responsive restriction enzyme methylates " should be understood to according to the methylation state of its recognition site and the enzyme of selectivity digesting nucleic acid.For not methylated when recognition site or the restriction enzyme of special shearing just when hemimethylation, in the time that recognition site is methylated, can not shear, or shear with significantly reduced efficiency.For the restriction enzyme of the special shearing of ability in the time that recognition site is methylated, in the time that recognition site is not methylated, can not shear, or shear with significantly reduced efficiency.The preferably restriction enzyme of methylation specific, its recognition sequence contains CG dinucleotides (for example cgcg or cccggg).Concerning some embodiments, the restriction enzyme of further preferably not cutting in the time that C5 carbon atom is methylated when the cytosine(Cyt) in this dinucleotides.
" restriction enzyme of non-methylation specific " or " the non-responsive restriction enzyme that methylates " is for haveing nothing to do with the restriction enzyme of essentially identical efficiency cutting nucleotide sequence with methylation state.They are also referred to as " non-specific restriction enzyme methylates ".
Term " gene " should be considered to comprise its all transcript variant (for example, term " Septin 9 " should comprise the transcript Q9HC74 of for example its brachymemma) with and all promotor and regulatory elements.In addition this term, in described gene, has multiple SNP due to known, so should be believed to comprise its all sequence variants.
Term " precancerous " or " knurl become before " or its are equal to term should be considered to make a comment or criticism any cell proliferation illness of the pernicious transformation of experience.With regard to colorectal cell proliferative disorders, the example of this class situation comprises highly dysplastic cell proliferation disorders, comprises the adenoma of following classification:
Grade 1: pernicious body of gland infiltrates through the submucosa in polyp head (polyp head) from muscularis mucosae;
Grade 2: identical submucosa is invaded, but is present in the joint of head to stem;
Grade 3: invade stem; And
Class 4: the base portion (this grade is corresponding to the Dukes A phase) of invading stem in the junction that is connected to colon wall.
general introduction
The invention provides the method for cell proliferation disorders in detection and/or classification individuality, comprise and determine and separate that at least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the expression level of genome sequence in the biological sample of described individuality, wherein owe to express and/or CpG methylates and shows existence or the classification of described illness.Described mark can be for diagnosis knurl sexual cell proliferative disorders (cancer), comprises the early detection during canceration early stage of disease, and also for distinguishing knurl and benign cell proliferative disorders.The open method of the present invention, wherein knurl sexual cell proliferative disease and benign cell proliferative disease are distinguished, described method is characterised in that owes expression and/or exists CpG to methylate to show to exist knurl sexual cell hyperplasia or the front illness of knurl, and it does not exist and shows to exist benign cell proliferative disease.
Mark of the present invention is especially effective aspect detection or differentiation hepatocyte growth venereal disease disease or detection or discriminating colorectal rectum cell proliferative disorders, and the means of the improvement of early detection, classification and the described illness for the treatment of are provided thus.
Except above analysis, at least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, outside the gene of FAT and SEQ ID NOS:160 to SEQ ID NO:165 or the methylated embodiment of genome sequence, the present invention also provides the gene in groups for detection of cancer especially liver cancer and/or colorectal carcinoma with new application, comprise that being selected from least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, gene or the genome sequence of FAT and SEQ ID NOS:160 to SEQ ID NO:165.
In first other embodiment, the present invention is the analysis based at least one being selected to Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the CpG methylation state of genome sequence.Further the sequence of preferred described gene is as shown in table 1.
The bisulphite modified of DNA is the known instrument for assessment of CpG methylation state.In eukaryotic DNA, 5-methylcytosine is modal covalency base modification.It is for example transcribed in adjusting, genetic imprinting and tumour work in occurring.Therefore confirm that 5-methylcytosine has sizable meaning as genetic information component.But 5-methylcytosine can not be identified by checking order, because 5-methylcytosine has identical base pairing behavior with cytosine(Cyt).In addition, for example, in pcr amplification process, the epigenetic information that 5-methylcytosine carries is lost completely.
Being most commonly used to the method that in analyzing DNA, 5-methylcytosine exists is the specific reaction based on hydrosulphite and cytosine(Cyt), and after alkaline hydrolysis subsequently, cytosine(Cyt) is converted into the uridylic of corresponding thymus pyrimidine on Pairing behavior thus.But importantly, 5-methylcystein keeps not modified under these conditions.As a result, original DNA is changed in this way, makes the original methylcystein that can not distinguish with cytosine(Cyt) in its hybridization behavior can be used as now only surplus cytosine(Cyt) and is detected by conventional known molecular biology techniques, for example, by increasing and hybridizing.All these technology are the base pairing characteristic based on different all, can be fully utilized now.
With regard to susceptibility, prior art is determined by method, the method comprises DNA to be analyzed is encapsulated in agarose matrix, prevent thus DNA diffusion and renaturation (hydrosulphite only reacts with single stranded DNA), and substitute all precipitations and the purification step (people such as OlekA with quick dialysis, A modified and improved method for bisulfite based cytosinemethylation analysis (methods of the changes and improvements of analyzing for the cytosine(Cyt) based on hydrosulphite), Nucleic Acids Res.24:5064-6, 1996)).Thereby likely analyze the methylation state of individual cells, practicality and the susceptibility of the method is described.Rein, the people such as T, NucleicAcids Res., 26:2255,1998 provide the summary of the currently known methods to detecting 5-methylcytosine.
For example, except extremely indivedual examples (, the people such as Zeschnigk M, Eur J Hum Genet.5:94-98,1997), this sulphite technology is at present only for research.In all cases, the short specific fragment of known increases after bisulf iotate-treated, and or (the Olek & Walter that checks order completely, Nat Genet.1997 17:275-6,1997), or carry out one or more primer extension reactions (Gonzalgo & Jones, Nucleic Acids Res., 25:2529-31,1997; WO 95/00669; United States Patent (USP) 6,251,594) to analyze each cytosine(Cyt) position, or by collagenase treatment (Xiong & Laird, Nucleic Acids Res., 25:2532-4,1997).Also there is in the prior art description (people such as Olek, WO99/28498) by the detection of hybridization.In addition, also described and used methylate detection (Grigg & Clark, Bioessays, 16:431-6,1994 of hydrosulphite technology for individual gene; The people such as Zeschnigk M, Hum Mol Genet., 6:387-95,1997; The people such as Feil R, Nucleic Acids Res., 22:695-, 1994; The people such as Martin V, Gene, 157:261-4,1995; WO 9746705 and WO 9515373).
The present invention also provides the combine use of this hydrosulphite technology with one or more methylation assays, for determining the methylation state of the CpG dinucleotides sequence at least one sequence that is selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.Genome CpG dinucleotides can be methylated or do not methylated (or being called upper and lower methylating (up-and down-methylated)).But method of the present invention is suitable for analyzing heterogeneous biological sample, for example, lower concentration tumour cell in blood or ight soil.Therefore, in the time analyzing the methylation state of CpG position in this sample, those skilled in the art can determine with quantitative determination process the methylation level (for example per-cent, umber, ratio, ratio or degree) of specific CpG position, rather than methylation state.Correspondingly, term methylation status or methylation state also should be considered to refer to the value of reflection CpG position methylation.Unless clearly stated, term " supermethylation " or " above methylating " should be considered to refer to methylate the specific threshold value of horizontal exceeding, wherein said threshold value can be the value that represents the average or intermediate value methylation level of given colony, or is preferably the critical level of optimization." critical " also can refer to " threshold value " in this article.In the context of the present invention, for for example, being selected from the gene of following sequence or genome sequence or (promotor or regulatory region in) all CpGs position relevant with it, term " methylated ", " supermethylation " or " upper methylated " should be considered to comprise that methylation level methylates higher than threshold value zero (0) % (or its value of being equal to), described sequence is Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165.
According to the present invention, determine SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, in SEQ ID NOS:159 to SEQ ID NO:167, the methylation state of CpG dinucleotides sequence is all useful aspect diagnosis and characterize cells proliferative disease.Methylation assay method.Multiple methylation assay method known in the state of the art, and can combine use with the present invention.These mensuration make it possible to determine the methylation state of one or more CpG dinucleotides (for example CpG island) in DNA sequence dna.Wherein, this class is measured and is comprised through DNA sequencing, PCR (for sequence-specific amplification), the Southern engram analysis of the DNA of bisulf iotate-treated, uses responsive restriction enzyme and other technology of methylating.
For example, by using bisulf iotate-treated, gene order-checking is simplified for the distribution of analyzing DNA methylation patterns and 5-methylcytosine people such as (, Proc.Natl.Acad.Sci.USA 89:1827-1831,1992) Frommer.In addition, use the PCR product of restriction enzyme digestion from the DNA cloning through hydrosulphite transformation, for example Sadri & Hornsby (Nucl.Acids Res.24:5058-5059,1996), or COBRA (Combined Bisulfite Restriction Analysis (the hydrosulphite analysis of associating)) (Xiong & Laird, NucleicAcids Res.25:2532-2534,1997) described method.
COBRA.COBRA
tMit is the quantitative methylation assay (Xiong & Laird, Nucleic AcidsRes.25:2532-2534,1997) that can be used for determining the DNA methylation level at specific gene seat place in a small amount of genomic dna.In brief, by restriction enzyme digestion for disclosing through the PCR product of the DNA of the sodium bisulfite processing sequence difference relying on that methylates.The sequence difference that first method (Proc.Natl.Acad.Sci.USA 89:1827-1831,1992) of describing according to people such as Frommer relies on methylating by standard bisulf iotate-treated is introduced genomic dna.Adopt subsequently the special primer in object CpG island is carried out to the pcr amplification of DNA changing through hydrosulphite, be then digestion with restriction enzyme, gel electrophoresis and adopt the special hybridization probe being labeled to detect.Methylation level in original DNA sample represents by the relative quantity of digested and not digested PCR product, and it is linear quantitative in DNA methylation horizontal extent on a large scale.In addition this technology DNA for obtaining from the paraffin-embedded tissue sample of micro-dissection reliably.
For COBRA
tMthe typical agents of analyzing (for example, can be typically based on COBRA
tMtest kit in find) can include, but are not limited to: for the PCR primer of the specific gene DNA sequence dna HuoCpG island of bisulf iotate-treated (or through); Restriction Enzyme and applicable damping fluid; Gene recombination oligonucleotide; Contrast hybridization oligonucleotide; For the kinases labelling kit of oligonucleotide probe; And the Nucleotide of mark.In addition, hydrosulphite transformation reagent can comprise: DNA sex change damping fluid; Sulfonation damping fluid; DNA reclaims reagent or test kit (for example, precipitation, ultrafiltration, affinity column); Desulfonation damping fluid; And DNA reclaims component.
Preferably, such as " MethyLight
tM" (based on the real time pcr of fluorescence) (people such as Eads, Cancer Res.59:2302-2306,1999), Ms-SNuPE
tM(the mononucleotide primer that methylates responsive extends) reaction (Gonzalgo & Jones, Nucleic Acids Res.25:2529-2531,1997), methylation status of PTEN promoter (" MSP "; The people such as Herman, Proc.Natl.Acad.Sci.USA 93:9821-9826,1996; United States Patent (USP) 5,786,146) and methylated CpG island amplification (" MCA "; The people such as Toyota, Cancer Res.59:2307-12,1999) mensuration by separately or combine use with other method in these methods.
" HeavyMethyl
tM" determination techniques is the quantivative approach for assessment of methylation differential, the methylation specific amplification of its DNA based on to through bisulf iotate-treated.The methylation specific blocking-up probe (being also referred to as in this article blocker) that covers between amplimer or be amplified the CpG position that primer covers makes methylation specific selective amplification nucleic acid samples be called possibility.In its embodiment of application herein, term " HeavyMethyl
tMmethyLight
tM" measure and refer to HeavyMethyl
tMmethyLight
tMmeasure wherein MethyLight
tMmeasuring and cover the methylation specific blocking-up probe of CpG position between amplimer combines.HeavyMethyl
tMmeasure and also can combine use with the amplimer of methylation specific.
Be generally used for HeavyMethyl
tMthe typical agents of analyzing (for example, can be typically based on MethyLight
tMtest kit in find) can include, but are not limited to: for the PCR primer of the specific gene DNA sequence dna HuoCpG island of bisulf iotate-treated (or through); Blocking-up oligonucleotide; PCR damping fluid and the deoxynucleotide optimized; And Taq polysaccharase.
MSP.MSP (PCR of methylation specific) makes to assess in CpG island the methylation state of any CpG site group substantially, and with the irrelevant (people such as Herman of the use of the responsive restriction enzyme that methylates, Proc.Natl.Acad.Sci.USA 93:9821-9826,1996; United States Patent (USP) 5,786,146).In brief, use sodium bisulfite modifying DNA, change all unmethylated rather than methylated cytosine(Cyt)s into uridylic, then use with respect to methylate DNA not and be specific to the primer amplification of methylate DNA.MSP only needs DNA in a small amount, 0.1% the allelotrope sensitivity that methylates to position, given CpG island, and can carry out at the DNA extracting from paraffin-embedded sample.The typical agents of analyzing for MSP (for example, may in the typical test kit based on MSP, find) include, but are not limited to: for the methylated and unmethylated PCR primer of the specific gene DNA sequence dna HuoCpG island of bisulf iotate-treated (or through), PCR damping fluid and deoxynucleotide and the specific probe of optimization.
MethyLight
tM.MethyLight
tMbe determined as high-throughput quantification methylation assay, PCR in real time (TaqMan_) technology that it uses based on fluorescence does not need further operation (people such as Eads, Cancer Res.59:2302-2306,1999) after PCR step.In brief, MethyLight
tMmethod starts with the biased sample of genomic dna, and this biased sample is converted into the mixing pit of the sequence difference of the dependence that methylates in sodium bisulfite reaction according to standard operation (unmethylated cytosine(Cyt) residue is transformed into uridylic by hydrosulphite process).In " (biased) of skew " reaction (adopting the PCR primer of overlapping known CpG dinucleotides), carry out the PCR based on fluorescence subsequently.Can produce sequence difference in amplification procedure level and in fluoroscopic examination process level.
MethyLight
tMmensuration can be as the quantitative test of methylation patterns in genome DNA sample, and wherein sequence area is divided and occurred in probe hybridization level.In this quantitative manner, under overlapping fluorescent probe of specifically inferring the site that methylates exists, PCR reaction provides the amplification of methylation specific.For being provided by following reaction without skew contrast of DNA amount is provided: wherein primer and probe do not cover any CpG dinucleotides.Or, by with the control oligonucleotide (HeavyMethyl in " covering " known site that methylates not
tMthe mode based on fluorescence with MSP technology), or with cover the potential site that methylates oligonucleotide survey skew PCR pond realize the quantitative test to genomic methylation.
MethyLight
tMmethod can be used together with any applicable probe, as " TaqMan_ ", " Lightcycler_ " etc.For example, process double stranded genomic dna with sodium bisulfite, and it is adopted to one of two cover PCR reactions of TaqMan_ probe; For example, adopt MSP primer and/or HeavyMethyl blocker oligonucleotide and TaqMan_ probe.This TaqMan_ probe is fluorescence " reporter " and " cancellation " molecule double-tagging, and is designed to be specific to relative high GC content district, to such an extent as to its in PCR circulation to melt than the temperature of high approximately 10 ℃ of primer forward or backwards.This makes TaqMan_ probe in PCR anneals/extend step, keep fully hybridization.When Taq polysaccharase is in PCR when the new chain of enzymic synthesis, it finally can run into the TaqMan_ probe of annealing.Taq polysaccharase 5 ' to 3 ' endonuclease activity will replace it by digestion TaqMan_ probe subsequently, thereby discharge fluorescence reporter molecule for adopting its signal not being quenched now of real-time fluorescence detection system detection by quantitative.
For MethyLight
tMthe typical agents of analyzing (for example, can be based on MethyLight
tMtest kit in find) can include, but are not limited to: for the PCR primer on specific gene (or DNA sequence dna HuoCpG island of bisulf iotate-treated); TaqMan_ or Lightcycler_ probe; PCR damping fluid and the deoxynucleotide optimized; And Taq polysaccharase.
QM
tM(quantitatively methylating) is determined as the another kind of quantitative test of methylation patterns in genome DNA sample, and wherein sequence area is divided and appeared in probe hybridization level.In this quantitative manner, PCR reaction provides the amplification without skew, the wherein overlapping site that methylates of specifically inferring of this fluorescent probe under the existence of fluorescent probe.Provided the contrast without skew of input DNA amount by such reaction: i.e. not overlapping any CpG dinucleotides of primer or probe wherein.Or, by with the control oligonucleotide (HeavyMethyl in " covering " known site that methylates not
tMthe mode based on fluorescence with MSP technology), or with cover the potential site that methylates oligonucleotide survey skew PCR pond realize the quantitative test to genomic methylation.
QM
tMmethod can be used in amplification procedure together with any applicable probe, as " TaqMan_ ", Lightcycler_ etc.For example, process double stranded genomic dna with sodium bisulfite, and it is used to primer and the TaqMan_ probe without skew.This TaqMan_ probe is fluorescence " reporter " and " cancellation " molecule double-tagging, and is designed to be specific to relative high GC content district, to such an extent as to its in PCR circulation to melt than the temperature of high approximately 10 ℃ of primer forward or backwards.This makes TaqMan_ probe in PCR anneals/extend step, keep fully hybridization.When Taq polysaccharase is in PCR when the new chain of enzymic synthesis, it finally can run into the TaqMan_ probe of annealing.Taq polysaccharase 5 ' to 3 ' endonuclease activity will replace it by digestion TaqMan_ probe subsequently, thereby discharge fluorescence reporter molecule for adopting its signal not being quenched now of real-time fluorescence detection system detection by quantitative.For QM
tMthe typical agents of analyzing (for example, can be based on QM
tMtest kit in find) can include, but are not limited to: for the PCR primer on specific gene (or DNA sequence dna HuoCpG island of bisulf iotate-treated); TaqMan_ or Lightcycler_ probe; PCR damping fluid and the deoxynucleotide optimized; And Taq polysaccharase.
Ms-SNuPE.Ms-SNuPE
tMtechnology is the quantivative approach for assessment of the methylation differential in specific CpG site, it is based on bisulf iotate-treated DNA, then be that mononucleotide primer extends (Gonzalgo & Jones, Nucleic Acids Res.25:2529-2531,1997).In brief, make genomic dna react to change unmethylated cytosine(Cyt) into uridylic with sodium bisulfite, and keep 5-methylcytosine constant.Adopt subsequently the required target sequence of PCR primer amplification that is specific to the DNA changing through hydrosulphite, the product of resulting separation is also used as the methylated template of analysis purposes CpG site.Can analyze DNA (pathological section of for example micro-dissection) in a small amount, it has avoided using restriction enzyme to determine the methylation state of CpG site.
For Ms-SNuPE
tMthe typical agents of analyzing (for example, can be typically based on COBRA
tMtest kit in find) can include, but are not limited to: for the PCR primer of the specific gene DNA sequence dna HuoCpG island of bisulf iotate-treated (or through); PCR damping fluid and the deoxynucleotide optimized; Gel extraction kit, positive control primer; For the Ms-SNuPE of specific gene
tMprimer; Reaction buffer (for Ms-SNuPE reaction); And the Nucleotide of mark.In addition, hydrosulphite transformation reagent can comprise: DNA sex change damping fluid; Sulfonation damping fluid; DNA reclaims reagent or test kit (for example, precipitation, ultrafiltration, affinity column); Desulfonation damping fluid; And DNA reclaims component.
sEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, the genome sequence of SEQ ID NOS:159 to SEQ ID NO:167, with and non-natural occur treated variant SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 is defined in the especially early detection of colorectum and/or hepatocyte growth venereal disease disease of cell proliferative disorders, classification and/or treatment aspect have new application.
In one embodiment, method of the present invention comprises the following steps: i) make the genomic dna (preferably separating from body fluid) obtaining from individuality contact with at least one reagent or a group reagent, described reagent is distinguished methylating and unmethylated CpG dinucleotides at least one gene that is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 (comprising its promotor and regulation domain) or genome sequence; And ii) to be more than or equal to 80% susceptibility and to be more than or equal to 80% specific detection or to detect and discriminating colorectal or hepatocyte growth venereal disease disease.
Preferably, described susceptibility is approximately 75% to approximately 96% or approximately 80% to approximately 90% or approximately 80% to approximately 85%.Preferably, described specificity is approximately 75% to approximately 96% or approximately 80% to approximately 90% or approximately 80% to approximately 85%.
Can be by any standard method isolation of genomic DNA of the prior art, comprise and use commercially available test kit.In brief, in the time that target DNA is wrapped in cytolemma in biological sample, it is cleaved that this biological sample must be broken and pass through enzyme, chemistry or mechanical means.For example remove albumen and other pollutent by the digestion of protein kinase K subsequently.Then from solution, reclaim genomic dna.This can realize by the whole bag of tricks, comprise saltout, organic extraction or DNA is attached to solid support.Can be subject to the impact of many factors on the selection of method, comprise the amount of time, expense and required DNA.All clinical sample kinds, comprise the front material of knurl material or knurl, all be suitable for use in the inventive method, the preferably hemocyte of clone, Histological section, biopsy, paraffin-embedded tissue, body fluid, ight soil, colon effluent, urine, blood plasma, serum, whole blood, separation, the cell that separates from blood, or its combination.Body fluid is preferred DNA source; Especially preferred is the hemocyte of blood plasma, serum, whole blood, separation and the cell from blood separation.
Subsequently, in genomic dna at least one target region, methylate and not at least one or agent treated genome DNA sample in groups of methylated CpG dinucleotides with distinguishing, the sequence that wherein said target region comprises or hybridization to the length of at least one sequence is at least 16 continuous nucleotides under stringent condition, described at least one sequence is selected from and is selected from respectively SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, wherein said continuous nucleotide comprises at least one CpG dinucleotides sequence.
Especially preferred, described reagent will be not 5 ' methylated cytosine(Cyt) base change into uridylic, thymus pyrimidine or other in hybridization behavior, be different from cytosine(Cyt) another base.But in another embodiment, described reagent can be for methylating responsive restriction enzyme.
When genomic dna is by this mode processing, so that make 5 ' unmethylated cytosine(Cyt) base change into uridylic, thymus pyrimidine or other in hybridization behavior, be different from cytosine(Cyt) other base time, preferred this processing is carried out (bisul-phite, hydrosulphite (disulfite)) and alkaline hydrolysis subsequently with hydrosulphite.This processing causes SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 (difference) is converted into SEQ ID NOs:10 to SEQ ID NO:15, SEQID NOS:30 to SEQ ID NO:31, SEQ ID NOS:38 to SEQ ID NO:39, SEQID NOS:168 to SEQ ID NO:185, wherein said CpG dinucleotides is methylated, or SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOs:42 to SEQ IDNO:43, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:186 to SEQ IDNO:203, wherein said CpG dinucleotides is unmethylated.
The DNA that subsequent analysis is treated, to determine the methylation state of target-gene sequence (before processing, at least one gene or genome sequence are selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQID NO:165).Especially preferredly be, this target region comprises or hybridization is at least 16 continuous nucleotides of at least one gene or genome sequence under stringent condition, and described at least one gene or genome sequence are selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165.The gene order of Optimization Analysis SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.Described analytical procedure can be selected from well known in the prior art those, comprise that those are listed in herein.Especially preferred is MethyLight
tM, MSP and use blocking-up oligonucleotide (HeavyMethyl described herein
tM).Further preferably, the any oligonucleotide being used in this analysis (comprises primer, blocking-up oligonucleotide and detection probes) should reverse complemental in, be equal to or hybridize SEQ ID NOS:10 to SEQ IDNO:15 under tight or height stringent condition, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ IDNO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ IDNO:39, SEQ ID NOS:50 to SEQ ID NO:51, the long fragment of at least 16 base pairs of one or more base sequences in SEQ ID NOS:168 to SEQ IDNO:203 and complementary sequence thereof.
Abnormal methylation, be selected from more specifically Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ IDNOS:160 to SEQ ID NO:165 (comprising their promotor and/or regulatory region) or the supermethylation of genome sequence relevant with the existence of knurl sexual cell proliferative disorders, especially general in colorectum and hepatoma.Therefore,, in the time that biological sample shows methylating of any degree, described sample should be confirmed as knurl.
Make it possible to first with the susceptibility greater than or equal to 80% and greater than or equal to 80% specific detection or detect and discriminating colorectal or hepatocyte growth venereal disease disease being selected from the analysis of one of the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or genome sequence.Being calculated as of susceptibility: (knurl/all knurls that detect); For example (the colon knurl/all colon knurls that detect); Specific being calculated as (undetected feminine gender/total feminine gender).
Preferably, described susceptibility is approximately 75% to approximately 96% or approximately 80% to approximately 90% or approximately 80% to approximately 85%.Preferably, described specificity is approximately 75% to approximately 96% or approximately 80% to approximately 90% or approximately 80% to approximately 85%.
Knurl defined herein is all 1cm malignant tumor of colon and adenoma or its hypotypes of being greater than.Feminine gender can be defined as healthy individuals.
In one embodiment, described method disclose be selected from least one gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 (or its promotor and/or regulatory region) or genome sequence as difference, detect and distinguish the mark of cell proliferative disorders (the especially colon of knurl or liver illness).
The expression of the RNA that described method can be transcribed from them by any analysis or the polypeptide of translating from described RNA or the expression of albumen realize, and preferably analyze by mrna expression or expression of polypeptides analysis.Therefore, the present invention also provides diagnostic assay and method, quantitatively and qualitatively detect that in individuality, at least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the expression of genome sequence, and determine thus in described individuality, whether have cancer.
From being selected from the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or the unconventionality expression of the mRNA that genome sequence is transcribed is relevant to the existence of cancer in individuality.According to the present invention, owe to express (and/or exist methylate) relevant to the existence of cancer, otherwise mistake expression (and/or do not exist and methylate) and do not exist cancer relevant.Especially preferably, determine that at least one is as the expression of transcribing variant of disclosed gene Septin 9 in SEQ ID NOS:16 to SEQ ID NO:19.
In order to detect the existence of mRNA of encoding gene or genome sequence, remove to obtain sample from patient.This sample can be the sample of any applicable cellular material that comprises tumour.Applicable sample type comprises the hemocyte of clone, Histological section, biopsy, paraffin-embedded tissue, body fluid, ight soil, colon effluent, urine, blood plasma, serum, whole blood, separation, the cell separating from blood, and all possible combination.Preferably, described sample type is ight soil or body fluid, the cell that is selected from the hemocyte of colon effluent, urine, blood plasma, serum, whole blood, separation, separates from blood.
Described sample can be processed to extract wherein contained RNA.Subsequent analysis is from the nucleic acid of this sample gained.The technology of a lot of absolute and relative level for definite genetic expression known in the state of the art; be suitable for use in the technology that common technology in the present invention comprises that in situ hybridization (for example FISH), Northern analyze, RNA enzyme protection is measured (RPA), microarray and PCR-based, for example quantitative PCR and difference show PCR or any other nucleic acid detection method.
Especially preferred is to use reverse transcription/polymerization chain type reaction technology (RT-PCR).(for example, referring to above Watson and Fleming) that RT-PCR method is known in the art.
RT-PCR method can be carried out as follows.By the total RNA of guanidinium isothiocyanate method isolated cell of for example standard, and this total RNA of reverse transcription.This reverse transcription method comprises employing reversed transcriptive enzyme and 3 ' end oligonucleotide dT primer and/or random hexamers synthetic DNA in RNA template.Consequent cDNA is subsequently by pcr amplification (people such as Belyavsky, Nucl AcidRes 17:2919-2932,1989; Krug and Berger, Methods in Enzymology (method in zymetology), Academic Press, N.Y., Vol.152, pp.316-325,1987, by reference to they are introduced).Further preferably " in real time " variant of RT-PCR, wherein said PCR product for example, by hybridization probe (TaqMan, Lightcycler, MolecularBeacons & Scorpion) or SYBR is green detects.Then, reference standard curve or by by the Ct value of Ct value and calibration criterion relatively and by quantitative from probe or the green signal detecting of SYBR.To the analysis of house-keeping gene through being commonly used to stdn result.
In Northern engram analysis, on sex change sepharose, separate total mRNA or poly (A)+mRNA, and in this dry gel self or hybridize the probe to mark on film.The hit amount of RNA of the signal of gained and RNA group is proportional.
To the relative different that relatively discloses gene expression dose of the signal from two or more cell masses or tissue.Can be by signal and the typical curve that adopts the cell-free transcription folder corresponding to target RNA of known quantity to produce be compared to carry out absolute quantitation.To the analysis of house-keeping gene through being usually used in stdn result, got rid of owing to being transferred to the different of RNA on film or being loaded to the caused any notable difference of difference of the RNA on gel, described house-keeping gene is that expression level is expected the relative constant gene of maintenance with conditional independence.
The first step during Northern analyzes separates pure, complete RNA from object cell or tissue.Because Northern trace is distinguished RNA by size, the integrity of sample affects the concentration degree of signal in wall scroll band.The RNA sample of Partial digestion will cause signal ambiguity or be distributed in several bands, cause the reduction generally of susceptibility and may cause the explanation of error to data.In Northern engram analysis, can use DNA, RNA and oligonucleotide probe, these probes are preferably labeled (for example, radioactively labelled substance, mass spectrum marker (mass label) or fluorescent marker).Target RNA, rather than the large young pathbreaker of probe determines the size of the band detecting, so be applicable to probe analysis such as the method for the random primer labelling that produces different lengths probe.The specific activity of probe will determine the level of susceptibility, so preferably use the probe with high specific activity.
In RNA enzyme protection is measured, RNA target is hybridized in solution with the rna probe with definite length.After hybridization, use RNA enzyme (RNase) the digestion RNA that is specific to single-chain nucleic acid to remove any not strand target RNA and the probe of hybridization.Make RNA enzyme deactivation, and for example carry out isolation of RNA by denaturing polyacrylamide gel electrophoresis.The amount of target RNA in amount and the RNA group of global RNA probe is proportional.RPA can be used for the relative and absolute quantitation of genetic expression, and also for drawing RNA structure, for example intron/exon border and transcription initiation site.RNA enzyme protection is measured and is better than Northern engram analysis, because it has lower detectability.
The DNA profiling that has clear and definite end points by in-vitro transcription for the antisense RNA probes of RPA generates, conventionally in the scope of 50-600 Nucleotide.Use comprises and extra with the rna probe of the sequence of target RNA homology, protected fragment and total length probe region is not separated.Rna probe conventionally substitutes DNA probe and uses, this is because be easy to produce single stranded RNA probe and with the circulation ratio of RNase digestion RNA:RNA duplex and the reliability (people such as Ausubel, 2003), especially preferred is the probe with high specific activity.
Especially preferred is to use microarray.Microarray method can be divided into two major portions.First is that known gene order is fixed on slide glass or other solid support, and being fluorescently-labeled cDNA (comprising sequence to be studied) is subsequently fixed to the hybridization of the known on slide glass (or other solid phase) with this.After hybridization, adopt fluorescence microarray scanner scanning array.Provide the measurement to gene expression difference to the analysis of different genes relative intensity of fluorescence.
Can produce DNA array by the slide glass or other solid surface that pre-synthesis oligonucleotide are fixed to preparation.In this case, adopt the synthetic and purification process of standard oligonucleotide to process and prepare representational gene order.These synthetic gene orders be complementary to goal gene rna transcription this (in this case, gene or genome sequence are selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165), and tend to the short sequence within the scope of 25-70 Nucleotide.In preferred embodiments, described oligonucleotide or polynucleotide comprise be selected from SEQ ID NOS:16 to SEQ ID NO:19 with and at least one sequence of complementary sequence is complementary or at least 9,18 or 25 bases of the sequence of hybridization.Or fixing oligomer can synthesize by in-situ chemical in slide surface.Suitable Nucleotide is added into continuously the point on microarray by synthetic relating to of original position oligonucleotide; The point of not accepting Nucleotide adopts physics or actual covert to protect in each stage of the method.Preferably, described synthetic nucleic acid is the nucleic acid of locking.
Analyzing in the Microarray Experiments of expressing, RNA template used represents the spectrum of transcribing of the cell or tissue studied.First isolation of RNA from cell mass to be compared or tissue.Then each RNA sample is produced to fluorescently-labeled cDNA as template by reverse transcription reaction.The fluorescent mark of this cDNA can be realized by direct mark or indirect labelling method.In direct mark, Nucleotide (for example, the Cy of fluorescent decoration
_3-or Cy
_5-dCTP) in reverse transcription reaction, be directly incorporated in cDNA.Or the Nucleotide that can modify by mix amino allyl group between cDNA synthesis phase is then modified cDNA by N-hydroxy-succinamide (NHS)-fat dye-coupling to this amino allyl group and is completed indirect labelling after reverse transcription reaction finishes.Or this probe can be unlabelled, but can be by being detected with the part specific combination of direct or indirect mark.Marker and method for tagged ligand (and probe) are known in this area, comprise the radioactively labelled substance that for example can for example, mix by currently known methods (nick translation or tyrosine phosphorylation (kinasing)).Other suitable marker includes but not limited to vitamin H, fluorophore, chemoluminescence group (for example dioxane, the dioxane especially causing, enzyme, antibody etc.
In order to carry out differential gene expression analysis, the cDNA producing from different RNA sample is by Cy
_3 marks.The cDNA of the mark obtaining is purified to remove uncorporated Nucleotide, free dye and residual RNA.After purifying, the cDNA sample of mark is hybridized to microarray.The stringency of this hybridization is determined by the many factors in crossover process and in washing process, comprises the concentration of temperature, ionic strength, duration and methane amide.For example in Sambrook et al. (MolecularCloning:A Laboratory Manual (molecular cloning: laboratory manual), 2nd ed., 1989), these factors are summarized.After hybridization, use fluorescence microarray scanner scanning microarray.The fluorescence intensity of each point represents the expression level of institute's analyzing gene; Bright spot is corresponding to the gene of strongly expressed, and dim spot represents weak expression.
Once obtain image, need to analyze raw data.First, must be from the fluorescence of each some subtracting background fluorescence.Then by the stdn of data relative comparison sequence, the control sequence nucleic acid that for example external source is added (preferably RNA or DNA), or house-keeping gene group, to make up the difference of any non-specific hybridization, array defect or determinator, cDNA mark, hybridization or washing.Data normalization makes to compare the result of multiple mensuration.
Another aspect of the present invention relates to for the test kit in the method according to this invention diagnosis individuality cancer, and described test kit comprises: measure and be selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ IDNOS:160 to SEQ ID NO:165 or the assembly of genome sequence transcriptional level.In preferred embodiments, can be under tight or medium stringent condition and the oligonucleotide or the polynucleotide that are selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQID NO:165 or the hybridization of the transcription product of genome sequence for measuring that the assembly of transcriptional level comprises.Preferably, described oligonucleotide or polynucleotide can and be selected from the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 under tight or medium stringent condition or at least one transcription product of genome sequence is hybridized, as provided in SEQ ID NOS:16 to SEQID NO:19.At least 9,18 or 25 bases of the sequence that in one embodiment, described oligonucleotide or polynucleotide comprise or hybridization complementary with at least one sequence that is selected from SEQ ID NOS:16 to SEQ ID NO:19 and complementary sequence thereof.
In the most preferred embodiment, determine transcriptional level by the technology that is selected from Northern engram analysis, reverse transcriptase PCR, PCR in real time, RNA enzyme protection and microarray.In another embodiment of the present invention, this test kit also comprises the device for obtain biological sample from patient.Preferably, test kit also comprises container, and it is most preferably suitable for splendid attire for measuring the assembly of transcriptional level and patient's biological sample, most preferably, also comprises and uses and explain the specification sheets of test kit result.
In preferred embodiments, this test kit comprises that (a) can be under tight or medium stringent condition and the multiple oligonucleotide or the polynucleotide that are selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and at least one gene of SEQ ID NOS:160 to SEQ ID NO:165 or the hybridization of the transcription product of genome sequence; (b) container, the patient's biological sample that is preferably suitable for oligonucleotide or polynucleotide described in splendid attire and comprises transcription product, wherein said oligonucleotide or polynucleotide can be under tight or medium stringent conditions and described transcription product hybridization; (c) for detection of the assembly of the hybridization of (b), and optionally, (d) use and explain the specification sheets of test kit result.Further preferably, each of the oligonucleotide of described (a) or polynucleotide all comprises and is selected from least 9,18 or 25 bases of the sequence of the complementary or hybridization of at least one sequence of SEQ ID NOS:16 to SEQ ID NO:19 and complementary sequence thereof.
Described test kit also can contain other component, such as being packaged in the hybridization buffer (wherein oligonucleotide will be used as probe) separating in container.Or in the time that described oligonucleotide will be used to amplified target region, described test kit can contain the polysaccharase that is packaged in container separately and the reaction buffer for polymerase-mediated primer extension of optimization, as PCR.Preferably, described polysaccharase is reversed transcriptive enzyme.Further preferably described test kit also contains RNA enzyme reagent.
The present invention is also provided for detecting the method that whether has the polypeptide of being encoded by described gene order the sample obtaining from patient.
Relevant to the existence of cancer by the expression of polypeptides horizontal abnormality that is selected from the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or the polypeptide of genome sequence coding.
According to the present invention, described polypeptide owe express relevant to the existence of cancer.Especially preferably, described polypeptide is at least one aminoacid sequence providing from the SEQ ID of Septin 9 genes NOS:20 to SEQ ID NO:23 polypeptide is provided.
Can use any method for detection of polypeptide well known in the prior art.These class methods comprise, but (be for example not limited to mass spectroscopy, immunodiffusion method, immunoelectrophoresis, immuno-chemical method, binding substances-part assay method, immunohistochemistry technique, aggegation and complement assay method, referring to Basic and Clinical Immunology (basis and clinical immunology), Sites and Terr, eds., Appleton & Lange, Norwalk, Conn, pp 217-262,1991, by it by reference to being incorporated to herein).Preferably binding substances-part method of immunity, comprises antibody is reacted with one or more epi-positions, and the polypeptide or derivatives thereof of alternative label competitively.
Certain embodiments of the present invention comprise that use is specific to by the antibody that is selected from the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or the polypeptide of genome sequence coding.At least one especially preferred, described polypeptide provides for SEQ ID NOS:20 to SEQ IDNO:23 aminoacid sequence.
This antibody-like can be used for cancer diagnosis.In certain embodiments, the generation of mono-clonal or polyclonal antibody can be induced as antigen by using by the epi-position of the peptide coding of SEQ ID NOS:20 to SEQ ID NO:23.This antibody-like can be used for again detecting the polypeptide as the expression of cancer diagnosis marker.Can be by the level that exists of quantitative these polypeptide of ordinary method.Can detect and quantitatively antibody-polypeptide combination by multiple means well known in the prior art, such as with fluorescence or radioligand mark.The present invention also comprises the test kit for carrying out aforesaid method, and wherein these test kits contain the antibody that is specific to studied polypeptide.
Multiple competitiveness known in this field and noncompetitive polypeptide binding immunoassay assay method.The antibody using in these are measured can not be labeled, is for example used in aggegation test, or be labeled, for many measure method.Spendable marker comprises radionuclide, enzyme, fluorescent agent, chemoluminescence agent, enzyme substrates or cofactor, enzyme inhibitors, particle, dyestuff etc.Preferred mensuration includes but not limited to radioimmunoassay (RIA), enzyme immunoassay, such as enzyme-linked immunosorbent assay (ELISA), fluorescence immunoassay etc.Can carry out polyclone or monoclonal antibody or its epi-position for the preparation of immunoassay by any method in several different methods known in the art.
In other embodiment of described method, described albumen can detect with western engram analysis.Described analysis is standard in the art.In brief, albumen is separated as SDS-PAGE by electrophoresis.Subsequently that the protein delivery separating is upper to applicable film (or paper), as nitrocellulose, keep the spatial isolation obtaining by electrophoresis simultaneously.Then film is hatched together with the remaining closed reagent that has associativity position on binding film, normally used reagent comprises general albumen (for example milk-protein).Then, add the antibody that is specific to target protein, described antibody can be detected ground mark, for example, for example, by dyestuff or Enzymology method (alkaline phosphatase or horseradish peroxidase).Detect subsequently the position of described antibody on film.
In other embodiment of the method, described albumen can detect by ImmunohistochemistryMethods Methods (surveying the specific antigen in sample with antibody).Described analysis is standard in the prior art, wherein the detection of antigen in tissue is called as to immunohistochemistry, and detection in culturing cell is commonly referred to immunocytochemistry.In brief, primary antibody is detected by being attached to its specific antigen.Subsequently, this Antibody-antigen complex is by the antibodies of secondary enzyme coupling.Under necessary substrate and chromophoric group existence, according to the enzyme that detects combination in coloured deposition at antibody-antigen binding site place.Applicable sample type, Ag-Ab affinity, antibody type and detection Enhancement Method have multiple.Therefore must be, that every each and every one example is determined separately by those skilled in the art for the optimal conditions of immunohistochemistry or Immuncytochemical detection.
Be a kind of preparation for the method for the antibody of polypeptide: all or part of aminoacid sequence of selecting and prepare this polypeptide, this aminoacid sequence of chemosynthesis is also injected into applicable animal, normally rabbit or mouse (Milstein and Kohler Nature 256:495-497,1975; Gulfreand Milstein, Methods in Enzymology:Immunochemical Techniques (method in zymetology: immunochemical technique) 73:1-46, Langone and Banatis eds., Academic Press, 1981, by its entirety by reference to being incorporated to herein).The method of preparing polypeptide or its epi-position includes, but are not limited to chemosynthesis, recombinant DNA technology or separates from biological sample.
In the final step of the method, determine patient's diagnostic result, wherein (be selected from least one gene of Septin9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or genome sequence) and owe to show to exist cancer.Term is owed to express and should be considered to refer to that the level detecting is less than predetermined threshold value, and this threshold value can be selected from the threshold value of average, intermediate value or optimization.
Another aspect of the present invention is provided for, according to the test kit of cancer in the inventive method diagnosis individuality, comprising: be selected from the assembly of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ IDNOS:160 to SEQ ID NO:165 or the polypeptide of genome sequence for detection of at least one.Preferably, the sequence of described polypeptide provides as SEQ ID NOS:20 to SEQ ID NO:23.Preferably include antibody, antibody derivatives or antibody fragment for detection of the assembly of described polypeptide.The described polypeptide most preferably Western trace of the antibody by utilizing mark detects.In another embodiment of the present invention, this test kit also comprises the assembly that obtains patient's biological sample.Preferably, test kit also comprises the container that is suitable for polypeptide in splendid attire detection patient biological sample, most preferably also comprises the specification sheets that uses and explain test kit result.In preferred embodiments, described test kit comprises: (a) be selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQID NOS:160 to SEQ ID NO:165 or the assembly of genome sequence polypeptide for detection of at least one; (b) container of the patient's biological sample that is suitable for assembly described in splendid attire and comprises described polypeptide, wherein said assembly can form mixture with described polypeptide; (c) assembly of the mixture of detection (b); And optionally (d) uses and explains the specification sheets of test kit result.Preferably, at least one assembly that is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the polypeptide of genome sequence of described detection is specific at least one and is selected from the peptide sequence of SEQ ID NOS:20 to SEQ ID NO:23.Described test kit can also contain other component being packaged in container separately, for example, for blocking, wash or coated damping fluid or solution.
Specific embodiment of the invention scheme provides the new application of the analysis to methylation level and/or pattern in described sequence, and it makes accurate detection, sign and/or treatment liver and/or colorectal cell proliferative disorders become possibility.The early detection of cancer is directly associated with disease prognosis, thereby method disclosed herein makes doctor and patient can make better more reasonably treatment decision.
further improve
The invention provides the new purposes of genome sequence SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.Other embodiment provides the modified variant of SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, and for analyzing oligonucleotide and/or the PNA-oligomer of cytosine methylation patterns having in SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.
Object of the present invention comprises the methylation state of analyzing the one or more CpG dinucleotides at least one sequence that is selected from SEQ ID NOS:1 to SEQ IDNO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 and complementary sequence thereof.
Disclosed invention provides the treated nucleic acid derived from genome SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:159 to SEQ ID NO:167, and wherein said processing is suitable for changing at least one unmethylated cytosine(Cyt) base of described genomic dna sequence into other base that uridylic or other can be different from cytosine(Cyt) in hybridization with detecting.The genome of discussing can comprise one or more continuous methylated CpG positions.Described processing preferably includes and uses the reagent that is selected from hydrosulphite, bisul-phite, disulfite and combination thereof.In the preferred embodiment of the invention, the invention provides the modified nucleic acid that non-natural produces, its length that comprises the sequence that is selected from SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS.50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 is the sequence of at least 16 continuous nucleotide base.In a further preferred embodiment, described nucleic acid is the fragment that is disclosed in the nucleotide sequence in SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 of at least 50,100,150,200,250 or 500 base pair length.Especially preferred be not with SEQ ID NOS:10 to SEQ IDNO:15, SEQ ID NO:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ IDNO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ IDNO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ IDNO:203 rather than SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQID NO:28, identical or the complementary nucleic acid molecule of all or part of sequence of the DNA of SEQ ID NOS:159 to SEQ ID NO:167 or other natural generation.
Preferably, described sequence comprises at least one in CpG, TpA or CpA dinucleotides and the sequence complementary with it.SEQ ID NOS:10 to SEQ ID NO:15, SEQ IDNOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ IDNOS.42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ IDNOS:50 to SEQ ID NO:51, the sequence of SEQ ID NOS:168 to SEQ ID NO:203 provides SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, the modified form that the non-natural of SEQ ID NOS:159 to SEQ ID NO:167 produces, wherein the modification of each genome sequence causes synthetic following have unique and being different from the nucleic acid of the sequence of described genome sequence.As SEQ ID NO:1,4 kinds of forms that quilt changes are disclosed for each sense strand genomic dna.The first form is that " C " is transformed into " T ", " but CpG " still keeps " CpG " (, situation corresponding to such: wherein for genome sequence, " C " residue in all " CpG " dinucleotides sequences is methylated, and is not therefore changed); The second form discloses the complementary sequence (being antisense strand) of disclosed genomic dna sequence, wherein " C " is transformed into " T ", " but CpG " still keeps " CpG " (, situation corresponding to such: wherein for genome sequence, " C " residue in all " CpG " dinucleotides sequences is methylated, and is not therefore changed).The sequence of " upper methylated " transformation of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 is corresponding to SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:168 to SEQ ID NO:185.The third chemical transformation form of each genome sequence is provided, wherein all be converted into " T " for all " C " residues " C ", comprise in " CpG " dinucleotides sequence those (, situation corresponding to such: wherein for genome sequence, all " C " residues in " CpG " dinucleotides sequence are not methylated); Last a kind of chemical transformation form of each sequence discloses the complementary sequence (being antisense strand) of disclosed genomic dna sequence, wherein all be converted into " T " for all " C " residues " C ", comprise in " CpG " dinucleotides sequence those (, situation corresponding to such: wherein, for the complementary sequence (antisense strand) of each genome sequence, all " C " residues in " CpG " dinucleotides sequence are not methylated).The sequence of " lower methylated " transformation of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 is corresponding to the sequence of SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:42 to SEQ IDNO:43, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:186 to SEQ IDNO:203.
Therefore, importantly, the nucleotide sequence of SEQ ID NOS:10 to SEQ ID NO:15, SEQ IDNOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ IDNOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 and molecule does not relate to or be associated with detection, classification or the treatment of cell proliferative disorders.
In other preferred embodiment, the present invention also provides the oligonucleotide or the oligomer that are suitable for in the methods of the invention, for detection of SEQ ID NOS:1 to SEQ ID NO:3, SEQID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, SEQID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQID NOS:30 to SEQ ID NO:31, SEQ ID NOD:42 to SEQ ID NO:43, SEQID NOD:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, cytosine methylation state in the genome of SEQID NOS:168 to SEQ ID NO:203 or treated (chemically modified) DNA.Described oligonucleotide or oligomer nucleic acid provide new diagnostic means.Described oligonucleotide or oligomer comprise the nucleotide sequence with at least nine (9) individual Nucleotide, it is same as or treated nucleic acid sequence SEQ ID NOS:10 to the SEQ ID NO:15 of (as hereinbefore defined) hybridization under medium tight or stringent condition, SEQ IDNOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ IDNOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ IDNOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 and/or its complementary sequence, or genome sequence SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNOS:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 and/or its complementary sequence.
Therefore, present invention resides in hybridization under medium tight and/or tight hybridization conditions and be selected from SEQID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ IDNOS:159 to SEQ ID NO:167, SEQ ID NOS:10 to SEQ ID NO:15, SEQID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQID NOS:50 to SEQ ID NO:51, the nucleic acid molecule (for example oligonucleotide and peptide nucleic acid(PNA) (PNA) molecule (PNA-oligomer)) of all or part of sequence of SEQ ID NOS:168 to SEQ ID NO:203 or its complementary sequence.Especially preferred is that hybridization is selected from SEQ ID NOS:10 to SEQ ID NO:15 under medium tight and/or tight hybridization conditions, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 rather than SEQ IDNOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, the nucleic acid molecule of SEQ ID NOS:159 to SEQ ID NO:167 or other human gene group DNA's all or part of sequence.
Identical or the hybridization portion of described hybrid nucleic acid conventionally length is at least 9,16,20,25,30 or 35 Nucleotide.But longer molecule has application of the present invention, be therefore also contained in scope of the present invention.
Preferably, the hybridization portion of hybrid nucleic acid molecule of the present invention be selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, sequence or its part of SEQ ID NOS:168 to SEQ ID NO:203 or its complementary sequence have at least 95% or at least 98% or 100% consistence.
Hybrid nucleic acid type described herein can for example be used as primer (for example, PCR primer) or diagnosis and/or prognosis probe or primer.Preferably, the hybridization of described oligonucleotide probe and nucleic acid samples is carried out under stringent condition, and this probe is identical with target sequence 100%.Nucleic acid duplex or hybridization stability are expressed as melting temperature (Tm) or Tm, and it is the temperature that probe and target DNA dissociate.This melting temperature (Tm) can be used for determining required stringent condition.
For example, for the target sequence (allelic variant and SNP) relevant or basic identical rather than identical with corresponding sequence SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, usefully first use the salt (for example SSC or SSPE) of certain concentration to determine the minimum temperature that homology hybridization only occurs.Then, suppose that 1% mispairing causes Tm to reduce by 1 ℃, the also corresponding reduction of temperature of last washing in hybridization (for example,, if detect the sequence that has > 95% identity with probe, final wash temperature reduces by 5 ℃).In fact, the variation of Tm can be between 0.5 ℃ to 1.5 ℃ of every 1% mispairing.
Length is the example of the oligonucleotide of the present invention of X (in Nucleotide), as shown as the polynucleotide position of SEQ ID NO:1 by reference example, comprise the continuous overlapping oligonucleotide collection (having justice collection and antonymous set) corresponding to those length X, wherein the oligonucleotide (corresponding to given X value) in each continuous overlapping collection is defined as from nucleotide position:
N is to (n+ (X-1))
The finite set of Z oligonucleotide;
Wherein n=1,2,3 ... (Y-(X-1));
Wherein Y equals the length (Nucleotide or base pair) (219909) of SEQ ID NO:1;
Wherein X equals the described common length (in Nucleotide) (for example, for continuous overlapping 20 aggressiveness (20-mer), X=20) of concentrating each oligonucleotide; And
The given SEQ ID NO that is wherein Y for length, the quantity (Z) of the continuous overlapping oligomer that length is X equals Y-(X-1).For example, in the time of X=20, Z=219909-19=219890 for the sense or antisense collection of SEQ ID NO:1.
Preferably, described collection is restricted to those oligomer that comprise at least one CpG, TpG or CpA dinucleotides.
The example of the present invention's 20 aggressiveness oligonucleotide comprises the collection (and antonymous set complementary with it) of following 219890 oligomer, by representing with reference to the polynucleotide position of SEQ ID NO:1:
1-20,2-21,3-22,4-23,5-24 ... ... and 219890-219909.
Preferably, described collection is limited in those oligomer that comprise at least one CpG, TpG or CpA dinucleotides.
Similarly, the example of 25 aggressiveness oligonucleotide of the present invention comprises the collection (and antonymous set complementary with it) of following 219885 oligomer, by representing with reference to the polynucleotide position of SEQ ID NO:1:
1-25,2-26,3-27,4-28,5-29............ and 219885-219909.
Preferably, described collection is limited in those oligomer that comprise at least one CpG, TpG or CpA dinucleotides.
For SEQ ID NOS:1 to SEQ ID NO3, SEQ ID NO.24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, each in SEQ ID NOS:168 to SEQ ID NO:203 (sense and antisense), the present invention includes length is the multiple continuous overlapping collection of the oligonucleotide of X or the oligonucleotide of modification, wherein, for example, X=9, 10, 17, 20, 22, 23, 25, 27, 30 or 35 Nucleotide.
Oligonucleotide of the present invention or oligomer formation can be used for determining the heredity of genome sequence and the effective tool of epigenetic parameter that are selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.The preferred collection of the oligonucleotide that this class length is X or modified oligonucleotide is for those are corresponding to SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQID NOS:159 to SEQ ID NO:167, SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, the continuous overlapping collection of the oligomer of SEQ ID NO:168 to SEQ ID NO:203 (and complementary sequence).Preferably, described oligomer comprises at least one CpG, TpG or CpA dinucleotides.
The especially preferred oligonucleotide of the present invention or oligomer are positioned at those of middle part 1/3rd of this oligonucleotide for the cytosine(Cyt) of CpG dinucleotides wherein (or TpG or CpA dinucleotides of corresponding transformation) sequence; Wherein this oligonucleotide is for example that 13 bases are long, and CpG, TpG or CpA dinucleotides are positioned at the 5th to the 9th amino acid from 5 ' end.
Oligonucleotide of the present invention also can be by being connected to one or more parts by this oligonucleotide chemistry or conjugate is modified, to improve activity, stability or the detection of this oligonucleotide.This class part or conjugate comprise chromophore, and fluorophore, such as the lipid of cholesterol, cholic acid, thioether, aliphatic chain, phosphatide, polyamines, polyoxyethylene glycol (PEG), palmityl part and other are for example disclosed in United States Patent (USP) 5,514,758,5,565,552,5,567,810,5,574,142,5,585,481,5,587,371,5,597, in 696 and 5,958,773.Described probe can be also the form of PNA (peptide nucleic acid(PNA)), and it has particularly preferred pairing performance.Therefore, described oligonucleotide can comprise group, for example peptide that other is additional, and can comprise the cutting agent (people such as Krol that hybridization triggers, BioTechniques 6:958-976,1988) or intercalating agent (Zon, Pharm.Res.5:539-549,1988).For this reason, described oligonucleotide can be coupled to another molecule, the cutting agent that linking agent, transport agents, the hybridization that such as chromophore, fluorophore, peptide, hybridization trigger triggers etc.
Described oligonucleotide also can comprise sugar and/or the base portion of at least one known modification, maybe can comprise key between the main chain of modification or non-natural nucleoside.
According to particular of the present invention, described oligonucleotide or oligomer are used in " collection " conventionally, it contains at least one oligomer, be selected from SEQ ID NOS:1 to SEQ ID NO:3 for analyzing, SEQ ID NO:24, SEQ ID NO:28, each CpG dinucleotides of the genome sequence of SEQ ID NOS:159 to SEQ ID NO:167 and complementary sequence, or treated nucleic acid SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, the CpG of correspondence in SEQ ID NOS:168 to SEQ ID NO:203 and complementary sequence thereof, TpG or CpA dinucleotides.But expection is for economy or other factors, can Optimization Analysis described in the CpG of limited selection in sequence, and correspondingly change the capacity of described oligonucleotide collection.
Therefore, in specific embodiments, the invention provides the collection that contains at least two (2) individual (oligonucleotide and/or PNA oligomer), can be used for detecting treated genomic dna (SEQ IDNOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ IDNOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ IDNOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ IDNOS:168 to SEQ ID NO:203) or genomic dna (SEQ ID NOS:1 to SEQ IDNO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOD:159 to SEQ IDNO:167 and complementary sequence thereof) in cytosine methylation state.These probes make heredity and the epigenetic parameter of diagnosis, classification and/or treatment liver and/or colorectal cell proliferative disorders become possibility.This cover oligomer also can be for detection of treated genomic dna (SEQID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQID NOS:30 to SEQ ID NO:31, SEQ ID NO:42 to SEQ ID NO:43, SEQ IDNO:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ IDNOS:168 to SEQ ID NO:203) in, or genomic dna (SEQ ID NOS:1 to SEQII NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:159 to SEQ IDNOS:167 and complementary sequence thereof) in single nucleotide polymorphism (SNPs).
In preferred embodiments, at least one, more preferably all members of oligonucleotide collection are incorporated in to solid phase.
In other embodiments, the invention provides the collection that contains at least two (2) individual Nucleotide, they are used as " primer " oligonucleotide SEQ ID NOS:1 to SEQ ID NO:3 that is used for increasing, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, the DNA sequence dna of SEQ ID NOS:168 to SEQ ID NO:203 and one of complementary sequence or its fragment.
Expect that described oligonucleotide can form all or in part " array " or " DNA chip " (, being attached to the arrangement of different oligonucleotide and/or the PNA-oligomer of solid phase).The feature of the array of this different IPs thuja acid and/or PNA-oligomer sequence can for example be to arrange with rectangle or hexagonal-lattice in solid phase.Described solid phase surface can be made up of silicon, glass, polystyrene, aluminium, steel, iron, copper, nickel, silver or gold.Also can use nitrocellulose and plastics as nylon, it can or exist as resinous substrates using sedimental form, also can be used.The summary that oligomer array is prepared aspect prior art can obtain from the special version (Nature Genetics Supplement, Volume 21, January 1999, and the document wherein quoted) of Nature Genetics.Fluorescently-labeled probe is generally used for scanning immobilized DNA array.Cy3 and Cy5 dyestuff are simply attached to 5 ' of particular probe-OH upper especially applicable for fluorescent marker.The detection of the fluorescence probe to hybridization can for example be undertaken by Laser Scanning Confocal Microscope.Cy3 and Cy5 dyestuff and much other dyestuff are all commercially available.
Also expect that described oligonucleotide or its particular sequence can form all or part of of " virtual array ", wherein said oligonucleotide or its particular sequence are as for example " specifying thing (specifier) ", as the part of various group that is labeled probe of uniqueness, or analyze the complex mixture of assay with its combination.This method is for example described in US 2003/0013091 (United States serial 09/898,743, on January 16th, 2003 is open).In these methods, produce abundant marker, so that every kind of nucleic acid in this complex mixture (being every kind of analyte) can be by the unique combination of unique tag thing, detected thereby (every kind of marker is direct census, obtains the digital readout of every kind of molecule in mixture).
Especially preferred, oligomer of the present invention is by least for one of following purposes: detect, detect and distinguish hypotype, diagnosis, prognosis, treatment, monitoring and treat and monitor liver and/or colorectal cell proliferative disorders.This realizes by detect or detect and distinguish one or more in following organization type with described collection: colorectal carcinoma, colorectal carcinoma, inflammatory colon, 2 grades of heteroplasia adenoma of colon that are less than 1cm, 3 grades of heteroplasia adenoma of colon that are greater than 1cm, normal colon, non-colon health tissue and non-colon cancer tissue.
Especially preferred is those oligomer collection in embodiment.
In the most preferred embodiment of described method, determine whether to exist cell proliferative disorders, most preferably determine knurl sexual cell propagation or itself and optimum illness are distinguished.This realizes by the methylation state of analyzing at least one target sequence that comprises at least one CpG position, at least 16 continuous nucleotides that wherein said sequence comprises or hybridization is selected from the sequence of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 and complementary sequence thereof under stringent condition.The present invention also provides by analyzing cytosine methylation and single nucleotide polymorphism and determines genome sequence SEQ IDNOS:1 to SEQ ID NO:3 in individuality, SEQ ID NO:24, SEQ ID NO:28, the heredity of SEQ ID NOS:159 to SEQ ID NO:167 and/or the method for epigenetic parameter.Described method comprises that the nucleic acid that comprises SEQ ID NOS:1 to SEQ IDNO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 the biological sample that makes to obtain from described individuality contacts with at least one reagent or one-tenth group reagent, and wherein said reagent or in groups reagent are distinguished methylating and non-methylated CpG dinucleotides in described target nucleic acid.
In preferred embodiments, said method comprising the steps of: in the first step, obtain tissue sample to be analyzed.This source can be any applicable source, for example the hemocyte of clone, Histological section, biopsy, paraffin-embedded tissue, body fluid, ight soil, colon effluent, urine, blood plasma, serum, whole blood, separation, from cell and all possible combination thereof of blood separation.Preferably, the described source of DNA is ight soil or body fluid, is selected from the hemocyte of colon effluent, urine, blood plasma, serum, whole blood, separation, the cell of separation autoblood.
Then from described sample separation genomic dna.Can separate by any standard approach of the prior art, comprise and use commercially available test kit.In brief, in the time that target DNA is wrapped in cytolemma, it is cleaved that this biological sample must be broken and pass through enzyme, chemistry or mechanical means.For example remove albumen and other pollutent by the digestion of protein kinase K subsequently.Then reclaim genomic dna from solution.This can realize by the whole bag of tricks, comprise saltout, organic extraction or DNA is attached to solid support.Can be subject to the impact of many factors on the selection of method, comprise the amount of time, expense and required DNA.
For example, in the time that described sample DNA is not wrapped in cytolemma the Circulating DNA of blood sample (from), can uses and in prior art, separate and/or the standard method of purify DNA.These methods comprise use proteolytic degradation reagent, for example chaotropic salt, example hydrochloric acid guanidine or urea; Or stain remover, as sodium laurylsulfonate (SDS), cyanogen bromide.Other method includes but not limited to ethanol precipitation or propyl alcohol precipitation, passes through centrifugal vacuum concentration etc.Those skilled in the art also can use device, for example, such as the filter of ultrafiltration, and silicon face or film, magnetic-particle, granules of polystyrene, polystyrene surface, positively charged surface and the film with positive electric charge, charged membrane, powered surfaces, charged conversion film, charged conversion surface.
Once nucleic acid is extracted, just genome double-stranded DNA is used for analyzing.
In the second step of described method, described genome DNA sample is processed to make to be converted into uridylic, thymus pyrimidine or in hybridization behavior, to be not used in another base of cytosine(Cyt) 5 ' unmethylated cytosine(Cyt) base.This should be understood to " pre-treatment " as herein described or " processing ".
This preferably realizes by hydrosulphite agent treated.Term " hydrosulphite reagent " refers to comprise the reagent of hydrosulphite, hydrosulphite (disulfite), bisul-phite or its combination, can be used for as disclosed herein distinguishing methylating and unmethylated CpG dinucleotides sequence.Described processing is well known in the art (for example PCT/EP2004/011715, by reference to its entirety being incorporated to herein).Preferably, this bisulf iotate-treated is carried out under sex change solvent exists, and described sex change solvent is such as, but not limited to alkyl glycol, and especially diethylene glycol dimethyl ether (DME), or diox Huo dioxane derivatives carries out under existing.In preferred embodiments, described sex change solvent uses with the concentration of 1% to 35% (v/v).Also preferably this bisulfite reaction carries out under scavenging agent exists, such as but not limited to chromanane derivative, as 6-hydroxyl-2,5,7,8 ,-tetramethyl-chromanane 2-carboxylic acid or trihydroxybenzoic acid and derivative thereof, for example gallic acid (referring to PCT/EP2004/011715, by its entirety by reference to being incorporated to herein).This hydrosulphite changes and preferably under the temperature of reaction of 30 ℃ to 70 ℃, carries out, and wherein during reaction temperature is increased in short time and exceedes 85 ℃ (referring to PCT/EP2004/011715, by its entirety by reference to being incorporated to this paper).DNA through bisulf iotate-treated preferably carried out purifying before quantitatively.This can be undertaken by any method well known in the prior art, such as but not limited to ultrafiltration, is preferably undertaken by Microcon^ (TM) post (being produced by Millipore^ (TM)).This purifying carries out (referring to PCT/EP2004/011715, by its entirety by reference to being incorporated to herein) according to the scheme of manufacturers of improvement.
In the 3rd step of described method, adopt the fragment of the treated DNA of primer set oligonucleotide of the present invention and amplification enzymatic amplification.Can in same reaction vessel, carry out the amplification of several DNA fragmentations simultaneously.Conventionally, this amplified reaction adopts polymerase chain reaction (PCR) to carry out.Preferably, the length of described amplified production is 100 to 2,000 base pairs.Described complete primer tasteless nucleotide comprises at least two kinds of oligonucleotide, the sequence of each all reverse complemental in, be same as, or hybridize SEQ ID NOS:10 to SEQID NO:15 under tight or height stringent condition, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOs:38 to SEQID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, the long fragment of at least 16 bases of the base sequence of one of SEQ ID NOS:168 to SEQID NO:203 and complementary sequence thereof.
In other embodiment of described method, at least one methylation state that is selected from the CpG position of preliminary election in the nucleotide sequence of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 can detect by the primer tasteless nucleotide with methylation specific.This technology (MSP) is described in the United States Patent (USP) 6,265,171 of authorizing Herman.Increase to make to distinguish through the DNA of bisulf iotate-treated with methylation state special primer and methylate and unmethylated nucleic acid.MSP primer pair contains the primer of at least one hybridization through the CpG of bisulf iotate-treated dinucleotides.Therefore, the sequence of described primer comprises at least one CpG dinucleotides.Be specific to the not MSP primer of methylate DNA and contain " T " in the C position of CpG.Preferably, thereby the base sequence of described primer need to comprise the sequence with at least 9 length of nucleotides, it hybridizes treated nucleic acid sequence SEQ ID NOS:10 to SEQ ID NO:15, SEQ IDNOS:28 to SEQ ID NO:33, SEQ ID NOS.30 to SEQ ID NO:31, SEQ IDNOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ IDNOS:50 to SEQ ID NO:51, one of SEQ ID NOS:168 to SEQ ID NO:203 and complementary sequence thereof, the base sequence of wherein said oligomer comprises at least one CpG dinucleotides.Further preferred embodiment of the present invention comprises using blocks oligonucleotide (HeavyMethyl
tMmeasure).To the use of this class blocking-up oligonucleotide by people such as Yu, BioTechniques 23:714-720,1997 describe.Blocking-up probe oligonucleotides and PCR primer are hybridized to the nucleic acid through bisulf iotate-treated simultaneously.The pcr amplification of this nucleic acid stops in 5 ' position of blocking-up probe, so that the amplification of nucleic acid is suppressed in the time that existence is complementary to the sequence of blocking-up probe.Described probe can be designed as in the special mode of methylation state hybridizes the nucleic acid through bisulf iotate-treated.For example, in order to detect the methylated nucleic acid in methylated nucleic acid group not, can block probe by use to the inhibition of the amplification at the unmethylated nucleic acid in discussed position carries out, this blocking-up probe comprises " CpA " or " TpA " in discussed position, this " CpG " when wishing to suppress the amplification of methylated nucleic acid is contrary.
For the PCR method that adopts blocking-up oligonucleotide, effectively destroying polymerase-mediated amplification needs blocker not to be aggregated enzyme extension.Preferably, this is by using 3 '-deoxy-oligonucleotide blocker or realizing at 3 ' derivative oligonucleotide blocker having except " freedom " oh group.The representative of the preferred classes that for example, 3 '-O-ethanoyl oligonucleotide is blocker molecule.
In addition, should get rid of polymerase-mediated blocking-up oligonucleotide degraded.Preferably, this eliminating comprises that use lacks the polysaccharase of 5 '-3 ' 5 prime excision enzyme activity, or uses the blocking-up oligonucleotide of modifying, and it for example has thioesters bridge at its 5 ' end, and this gives this blocker molecule nuclease resistance.Specific application can not need this 5 ' of blocker to modify.For example, for example, if blocking-up and primer binding site are overlapping thereby prevented the combination (, blocker is excessive) of primer, the degraded of blocking oligonucleotide will prevent substantially.This is because polysaccharase can not extend forward primer the process through (5 '-3 ' direction) blocker-a kind of blocking-up oligonucleotide degraded that conventionally causes hybridization.
For purposes of the present invention and as implemented here, especially preferred blocker/PCR embodiment comprises and uses peptide nucleic acid(PNA) (PNA oligomer is as blocking-up oligonucleotide.This PNA blocking-up oligomer is applicable to admirably, is not also aggregated enzyme extension because they are not degraded.
Preferably, therefore the base sequence of described blocking-up oligonucleotide requires to comprise the sequence with at least 9 length of nucleotides, it hybridizes one of treated nucleic acid sequence SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOs:168 to SEQ ID NO:203 and complementary sequence thereof, wherein.The base sequence of described oligonucleotide comprises at least one CpG, TpG or CpA dinucleotides.
The fragment portability obtaining by amplification has the marker that can detect directly or indirectly.Preferably, marker is the form of fluorescent marker, the radionuclide molecule fragment that maybe can adhere to, and this molecule fragment that can adhere to has the quality that can detect in mass spectrum conventionally.In the time that described marker is mass spectrum marker, preferably the amplified production of mark has single positive or negative net charge, and making can be detected better in mass spectrograph.Can or use electrospray ionization mass spectrum (ESI) to detect and observe by for example substance assistant laser desorpted/MALDI-MS (MALDI).
Substance assistant laser desorpted/MALDI-MS (MALDI-TOF) is the very effective progress of analysing biomolecules (Karas & Hillenkamp, Anal Chem., 60:2299-301,1988).Analyte is embedded in light absorbing matrix.This matrix is evaporated by short laser pulse, in the mode of non-fragmentation, analyte molecule is delivered into vapor phase thus.This analyte is by being ionized with the collision of substrate molecule.The voltage applying accelerates this ion and enters field-free flight pipe.Due to they different quality, ion is accelerated with different speed.Small ion arrives detector sooner than heavy ion.MALDI-TOF mass spectrum is suitable for analyzing peptide and albumen.To slightly a little difficulties (Gut & Beck, Current Innovations and Future Trends, 1:147-57,1995) of the molecule of nucleic acid.The susceptibility of foranalysis of nucleic acids is approximately little 100 times than peptide, and is inversely proportional to the clip size increasing.In addition, for the main chain with multiple negative charges, lower via the obvious efficiency of ionization process of matrix.In MALDI-TOF mass spectrum, extremely crucial to the selection of matrix.For the desorb of peptide, find several very effective matrix, it produces fabulous crystallization.Have now several matrix of replying for DNA, still, between peptide and nucleic acid, the difference of susceptibility is not eliminated.The difference of susceptibility is more similar to peptide and reduces but can become it by chemically modified DNA.For example, adopt simple alkylation chemistry, thiophosphatephosphorothioate (phosphorothioate) nucleic acid (wherein common phosphide main chain is replaced by thiophosphatephosphorothioate (thiophosphate)) can be changed (Gut & Beck in electroneutral DNA, NucleicAcids Res.23:1367-73,1995).Electric charge label is connected to this modified DNA causes MALDI-TOF susceptibility to be increased to the level of peptide.Other advantage of electric charge label is the analysis stability that overcomes the increase of impurity, and it is obviously more difficult that wherein impurity makes to detect the substrate of unmodified.
In the 4th step of described method, analyze the amplified production obtaining in the 3rd step of described method, process the methylation state of CpG dinucleotides before to determine.
In the embodiment of acquisition amplified production that increases by MSP, whether according to the base sequence of described primer, there is the methylation state that self has just shown the CpG position being covered by this primer in amplified production.
The amplified production obtaining by standard and methylation specific PCR all can further be analyzed by the method based on base, such as but not limited to array technique and the technology based on probe, and by the technology such as order-checking and template guided extension.
In an embodiment of described method, in the 3rd step, synthetic amplified production is hybridized subsequently to oligonucleotide and/or PNA probe array or oligonucleotide and/or PNA probe sets.In this case, hybridization is carried out as follows: the probe sets using in crossover process is preferably made up of at least two oligonucleotide or PNA oligomer; In this process, amplified production, as probe, is attached to the oligonucleotide of solid phase before its hybridization; Remove subsequently the not fragment of hybridization; Described oligonucleotide contains at least one and has the base sequence of at least 9 length of nucleotides, its reverse complementation or be same as the fragment of the base sequence providing in sequence table of the present invention; And described fragment comprises at least one CpG, TpG or CpA dinucleotides.The length of the hybridization portion of hybrid nucleic acid typically is at least 9,15,20,25,30 or 35 Nucleotide.But longer molecule has application of the present invention, therefore also falls within the scope of the present invention.
In preferred embodiments, described Nucleotide is present in the centre 1/3rd of described oligomer.For example, in the time that described oligomer comprises a CpG dinucleotides, described dinucleotides is preferably the 5th to the 9th Nucleotide from 5 ' end of 13 aggressiveness.For being selected from SEQ IDNOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, each CpG dinucleotides in the sequence of SEQ IDNOS:159 to SEQ ID NO:167 and SEQID NOS:10 to SEQ ID NO.15, SEQ ID NOS:28 to SEQ ID NO.33, SEQID NOs:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, equivalent site in SEQID NOS:168 to SEQ ID NO:203, all exist a kind of oligonucleotide for its analysis.
Described oligonucleotide also can exist with the form of peptide nucleic acid(PNA).Then remove the not amplified production of hybridization.Detect subsequently the amplified production of hybridization.In this case, preferably, the marker that is connected to amplified production all can be differentiated in the residing each position of oligonucleotide of solid phase.
In other embodiments, the genomic methylation state of CpG position can by with pcr amplification primer (wherein said primer can be methylation specific or standard) hybridize through the oligonucleotide probe (as above described in detail) of the DNA of bisulf iotate-treated simultaneously and determine.
In the especially preferred embodiment of the method, use the fluorescence oligonucleotide probe (TaqMan that adopts double-tagging
tMpCR, adopts ABI Prism 7700 Sequence DetectionSystem, Perkin Elmer Applied Biosystems, Foster City, California) the real-time quantitative PCR based on fluorescence (people such as Heid, Genome Res.6:986-994,1996; Also referring to United States Patent (USP) 6,331,393).This TaqMan
tMpCR reaction adopts inextensible detection oligonucleotide, is called TaqMan
tMprobe, in preferred embodiments, it is designed to and at forward and the reverse sequence hybridization that is rich in CpG between amplimer.This TaqMan
tMprobe also comprises fluorescence " reporter part " and " cancellation part ", and they are covalently bound to being attached to described TaqMan
tMthe shank (for example phosphoramidite) of the Nucleotide of oligonucleotide.For methylating in bisulf iotate-treated post analysis nucleic acid, needing probe is methylation specific, as United States Patent (USP) 6,331, described in 393 (by reference to its entirety being incorporated to herein), is also referred to as MethyLightTM
tMmeasure.Also be applicable to TaqMan of the present invention
tMthe variation of detection method comprises the two probe technique (Lightcycler of use
tM) or amplified fluorescence primer (Sunrise
tMtechnology).These two kinds of technology all can be changed to be applicable to the DNA through bisulf iotate-treated, and for the methylation analysis in CpG dinucleotides.
In the further preferred embodiment of described method, the 4th step of described method comprises that the template guided oligonucleotide of use extends, as Gonzalgo & Jones, Nucleic Acids Res.25:2529-2531,1997 MS-SNuPE that describe.
In other embodiment of described method, the 4th step of described method comprises the order-checking of amplified production to producing in described method the 3rd step and the sequential analysis subsequently (people such as Sanger F., Proc Natl Acad Sci USA 74:5463-5467,1977).
preferred plan
In the most preferred embodiment of described method, described genomic nucleic acids is separated and processing according to the first three steps of aforesaid method, that is:
A) obtain the biological sample with genes of individuals group DNA from individuality;
B) extract or otherwise separate described genomic dna;
C) by one or more agent treated genomic dna or its fragment b), changing uridylic into or in another base that can be different from cytosine(Cyt) aspect cross performance 5 ' unmethylated cytosine(Cyt) base with detecting; And wherein
D) c) middle processing amplification is afterwards carried out in the mode of methylation specific, by primer or the blocking-up oligonucleotide of methylation specific, and further, wherein
E) be to be undertaken by real-time detection probes to the detection of amplified production, as mentioned above.
Preferably, in the time that increasing subsequently d) undertaken by the mode of methylation specific primer as above, the primer of described methylation specific comprises the sequence with at least 9 Nucleotide length, nucleic acid sequence SEQ ID NOS:10 to the SEQ ID NO:15 that this sequence hybridization is treated, SEQID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQID NS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ IDNOS:50 to SEQ ID NO:51, one of SEQ ID NOs:168 to SEQ ID NO:203 and complementary sequence thereof, the base sequence of wherein said oligomer comprises at least one CpG dinucleotides.
The step of described method e), is undertaken by real-time detection method as above the detection that shows the specific amplified product of one or more CpG position methylation state of at least one sequence in SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167.
Other embodiment of the present invention provides the method for analysis genomic dna of the present invention (SEQ ID NOs:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ IDNO:28, SEQ ID NOS:159 to SEQ ID NO:167 and the complementary sequence thereof) methylation state changing without hydrosulphite.Such method known in the state of the art, include but not limited to DMH, the restriction enzyme reagent that wherein methylates responsive or a series of restriction enzyme reagent that comprise the responsive restriction enzyme reagent that methylates are used to determine and methylate, and this responsive restriction enzyme reagent that methylates can be distinguished and in target region, methylate and methylated CpG dinucleotides not.
In the first step of this other embodiment, from tissue or cell source isolation of genomic DNA.Genomic dna can separate by any standard approach in prior art, comprises and uses commercially available test kit.In brief, in the time that target DNA is wrapped in cytolemma, it is cleaved that this biological sample must be broken and pass through enzyme, chemistry or mechanical means.For example remove albumen and other pollutent by the digestion of protein kinase K subsequently.Then reclaim this genomic dna from solution.This can realize by the whole bag of tricks, comprise saltout, organic extraction or DNA is attached to solid support.Can be subject to the impact of many factors on the selection of method, comprise the amount of time, expense and required DNA.All clinical sample kinds, comprise knurl material or potential knurl material, all be suitable for use in the inventive method, the preferably hemocyte of clone, Histological section, biopsy, paraffin-embedded tissue, body fluid, ight soil, colon effluent, urine, blood plasma, serum, whole blood, separation, the cell that separates from blood, and combination.Body fluid is preferred DNA source; Especially preferred is the hemocyte of blood plasma, serum, whole blood, separation and the cell from blood separation.
Once after nucleic acid is extracted, genome double-stranded DNA is just used in analysis.
In preferred embodiments, described DNA can be cut before use methylates responsive restriction enzyme treatment.These class methods are known in the prior art, can comprise physics and chemistry means.Especially preferred is to use one or more non-responsive restriction enzymes that methylate, and their recognition site is rich in AT and does not comprise CG dinucleotides.The use of this fermentoid makes the region that can retain CpG island and be rich in CpG in the DNA of fragmentation.The restriction enzyme of described non-methylation specific is preferably selected from MseI, BfaI, Csp6I, Tru1I, Tvu1I, Tru9I, Tvu9I, MaeI and XspI.Especially preferred is to use two or three this fermentoid.Especially preferred is the combination that uses MseI, BfaI and Csp6I.
The DNA of fragmentation can be connected to joint oligonucleotide subsequently, to be conducive to enzyme process amplification subsequently.The DNA fragmentation that oligonucleotide is connected to flat end and sticky end is known in the prior art, for example, for example, by making end dephosphorylation (using ox or shrimp alkaline phosphotase) and using ligase enzyme (T4 DNA ligase) to connect under dATPs exists subsequently.It is long that described joint oligonucleotide is generally at least 18 base pairs.
In the 3rd step, digest described DNA (or its fragment) with one or more responsive restriction enzymes that methylate subsequently.Carry out described digestion to make DNA provide at least one to be selected from the methylation state information of Septin 9 (comprising its all transcript variants), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the specific CpG dinucleotides of genome sequence in the hydrolysis of restriction site.
Preferably, the restriction enzyme of methylation specific is selected from the mixture of Bsi EI, Hga I HinPI, Hpy99I, Ava I, Bce AI, Bsa HI, BisI, BstUI, Bshl236I, AccII, BstFNI, McrBC, GIaI, MvnI, HpaII (HapII), HhaI, AciI, SmaI, HinP1I, HpyCH4IV, EagI and above two or more enzymes.Preferably contain the mixture of restriction enzyme BstUI, HpaII, HpyCH4IV and HinP1I.
In the 4th step, it is optional but preferred embodiment, and described restriction fragment is amplified.This can be undertaken by polymerase chain reaction, and described amplified production can be with applicable certification mark thing, i.e. fluorescent marker, radionuclide and mass spectrum marker described above.Especially preferred is that each primer that all comprises the long continuous sequence of at least 16 Nucleotide increases by amplification enzyme and at least two kinds, and described continuous sequence is complementary to or hybridizes and be selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, the sequence of SEQ ID NOS:159 to SEQ ID NO:167 and the sequence of complementary sequence thereof under medium tight or stringent condition.Preferably, described continuous sequence at least 16,20 or 25 Nucleotide long.In other embodiments, described primer can be complementary to any joint that is connected to described fragment.
In the 5th step, detect described amplified production.This detection can make any standard approach of the prior art, such as but not limited to gel electrophoresis analysis, hybridization analysis, by detectable mix in PCR product, DNA array analysis, MALDI or ESI analyze.Preferably, described detection by hybridize at least one each comprise the long continuous sequence of at least 16 Nucleotide nucleic acid or peptide nucleic acid(PNA) carry out, described continuous sequence be complementary to or medium forbid or stringent condition under hybridization be selected from the sequence of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 and complementary sequence thereof.Preferably, described continuous sequence at least 16,20 or 25 Nucleotide long.
After determining the methylation state or level of described genomic nucleic acids, be selected from SEQ ID NOS:1 to SEQ ID NO:3 based at least one, SEQ ID NO:24, SEQ ID NO:28, methylation state or the level of at least one CpG dinucleotides sequence of the sequence of SEQ ID NOS:159 to SEQ ID NO:167, or reflect that at least one is selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, the average of the average methylation state of multiple CpG dinucleotides sequences of the sequence of SEQ ID NOS:159 to SEQ ID NO:167 or value infer whether cell proliferative disorders exists or its classification, wherein methylate relevant to cell proliferative disorders before knurl or knurl.Methylate while determining by quantitative means when described, be preferably zero (, when sample shows methylating of any degree, being defined as thering is methylated state in the CpG position of analysis) for the threshold value existing that methylates described in determining.But, can predict that those skilled in the art may wish to adjust described threshold value to provide particularly preferred susceptibility or specificity for measuring.Correspondingly, described threshold value can improve (therefore improving specificity), and described threshold value can be in the scope of 0%-5%, 5%-10%, 10%-15%, 15%-20%, 20%-30% or 30%-50%.Especially preferred is threshold value 10%, 15%, 25% and 30%.
In other embodiment of described method, the transcript Q9HC74 that wherein gene comprises Septin 9 or its brachymemma is in groups selected from FOXL2 with at least one, NGFR, TMEFF2, SIX6, SARM1, the gene of VTN and ZDHHC22, after determining the methylation state of described genomic nucleic acids, according to the methylation state of at least one CpG dinucleotides sequence of at least one CpG dinucleotides sequence of SEQ ID NO:1 and SEQ ID NO:24 to SEQ ID NO:29, or the average or the value that reflect the average methylation state of its multiple CpG dinucleotides infer whether have cell proliferative disorders or its hypotype, especially liver and/or colorectal cell proliferative disorders, wherein methylate and cancer, especially liver and/or colorectal carcinoma are relevant.
the diagnosis and prognosis of cell proliferative disorders is measured
The invention enables to diagnose to be unfavorable for patient or individual event, wherein at least one important heredity and/or epigenetic parameter being selected from gene or the genome sequence of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 can be used as mark.The described parameter obtaining by the inventive method can compare with another set of heredity and/or epigenetic parameter, and its difference is as being unfavorable for patient or the individual diagnosis of event and/or the basis of prognosis.
More specifically, the invention enables and can screen risk population with early detection cancer, most preferably liver cancer and/or colorectal carcinoma.In addition, the invention enables and can distinguish knurl (for example malignant tumour) and optimum (non-carcinous) cell proliferative disorders.For example, it makes the energy discriminating colorectal rectum cancer and minicell adenoma of colon or polyp.Knurl sexual cell proliferative disorders performance reduces at least one is selected from the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or genome sequence methylate (expression reducing) is contrary with the methylated described optimum illness that does not show reduction.
Particularly, the invention provides cancer diagnosis and classification and determination method, its measurement of differential expression of one or more CpG dinucleotides of the gene that is selected from SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 based at least one being comprised to CpG dinucleotides.Conventionally, this mensuration comprises from individuality and obtains sample, measure to weigh at least one and be selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, the gene of FAT and SEQ ID NOS:160 to SEQ ID NO:165 or the expression of genome sequence, preferably be selected from SEQ ID NOS:1 to SEQID NO:3 by determining derived from least one of described sample, SEQ ID NO:24, SEQ ID NO:28, the methylation state with respect to control sample or known standard substance of the sequence of SEQ ID NOS:159 to SEQ IDNO:167, and make thus diagnosis.
In particularly preferred embodiments, oligomer of the present invention is used to assess the methylation state of CpG dinucleotides, for example, based on SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167, SEQ IDNOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ IDNOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ IDNOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, those or its array of SEQ IDNOS:168 to SEQ ID NO:203, and be arranged in the test kit based on them and can be used for diagnosis and/or the classification of cell proliferative disorders.
test kit
In addition, another aspect of the present invention is test kit, and it comprises: for determining that at least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the methylated assembly of genome sequence.Described for determining that methylated assembly preferably includes the reagent containing hydrosulphite; One or more oligonucleotide, its each sequence is all same as, is complementary to or hybridization is selected from the sequence of SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 under tight or height stringent condition 9 or more preferably 18 fragments that base is long; And preferably, for carrying out and assess the specification sheets of described methylation analysis method.In one embodiment, the base sequence of described oligonucleotide comprises at least one CpG, CpA or TpG dinucleotides.
In other embodiments, described test kit can also comprise the standard reagent for carrying out the special methylation analysis in CpG position, and wherein said analysis comprises one or more following technology: MS-SNuPE, MSP, MethyLight
tM, HeavyMethyl, COBRA and nucleic acid sequencing.But, belong to the part that test kit of the present invention also can only contain aforementioned component.
In preferred embodiments, described test kit can comprise other hydrosulphite transformation reagent that is selected from following reagent: DNA sex change damping fluid; Sulfonation damping fluid; DNA reclaims reagent or test kit (for example, precipitation, ultrafiltration, affinity column); Desulfonation damping fluid; And DNA reclaims component.
In other embodiments, described test kit can contain the polysaccharase that is packaged in container separately and through optimizing the reaction buffer for the polymerase-mediated primer extension of for example PCR.In another embodiment of the present invention, described test kit also comprises the assembly for obtaining patient's biological sample.Preferably such test kit, it also comprises and is suitable for splendid attire at least one is selected from the container of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQID NO:165 or the methylated assembly of genome sequence for determining patient's biological sample, most preferably also comprises the specification sheets that uses and explain test kit result.In preferred embodiments, described test kit comprises: (a) hydrosulphite reagent, (b) be suitable for the container of hydrosulphite reagent described in splendid attire and patient's biological sample, (c) at least a set of primer tasteless nucleotide that comprises two kinds of oligonucleotide, the sequence of described each oligonucleotide is all same as, be complementary to or hybridize and be selected from SEQ ID NOS:10 to SEQ ID NO:15 under tight or height stringent condition, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, 9 or more preferably long fragments of 18 bases of the sequence of SEQ ID NOS:168 to SEQ ID NO:203, and preferably, (d) for using and explain the specification sheets of test kit result.In another preferred embodiment, described test kit comprises: (a) hydrosulphite reagent; (b) be suitable for the container of hydrosulphite reagent and patient's biological sample described in splendid attire; (c) have at least one oligonucleotide and/or the PNA-oligomer of at least 9 or 16 length of nucleotides, it is same as or hybridizes one of pretreated nucleic acid sequence SEQ ID NOS:10 to SEQID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQID NO:203 and complementary sequence thereof; And optionally, (d) about the specification sheets that uses and explain test kit result.
In another embodiment, described test kit comprises: (a) hydrosulphite reagent; (b) be suitable for the container of hydrosulphite reagent and patient's biological sample described in splendid attire; (c) at least a set of primer tasteless nucleotide that contains two kinds of oligonucleotide, its each sequence is same as, is complementary to or hybridization is selected from SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ IDNO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ IDNO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ IDNO:203 under tight or height stringent condition 9 or more preferably 18 fragments that base is long; (d) have at least one oligonucleotide and/or the PNA-oligomer of at least 9 or 16 length of nucleotides, it is same as or hybridizes one of pretreated nucleic acid sequence SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203 and complementary sequence thereof; And optionally (e) about the specification sheets that uses and explain test kit result.
Described test kit also can contain other the component being packaged in container separately, as for blocking, wash or coated damping fluid or solution.
For COBRA
tMthe typical agents of analyzing (for example may be typically based on COBRA
tMtest kit in find) can include, but are not limited to: be selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the PCR primer of genome sequence at least one; Restriction enzyme and suitable damping fluid; Gene recombination oligomer; Contrast hybridization oligomer; For the kinases labelling kit of oligomer probe; And the Nucleotide of mark.For MethyLight
tMthe typical agents of analyzing (for example may be typically based on MethyLight
tMtest kit in find) can include, but are not limited to: be selected from the PCR primer of the sequence transforming through hydrosulphite of the gene of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or genome sequence at least one; Probe (for example TaqMan that hydrosulphite is special
tMor Lightcycler
tM); PCR damping fluid and the deoxynucleotide optimized; And Taq polysaccharase.
For Ms-SNuPE
tMthe typical agents of analyzing (for example may be typically based on Ms-SNuPE
tMtest kit in find) can include, but are not limited to: for the PCR primer of the specific gene DNA sequence dna HuoCpG island of bisulf iotate-treated (or through); PCR damping fluid and the deoxynucleotide optimized; Gel extraction kit; Positive control primer; Be used for the Ms-SNuPE of the sequence transforming through hydrosulphite of at least one gene that is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165 or genome sequence
tMprimer; Reaction buffer (for Ms-SNuPE reaction); And the Nucleotide of mark.
The typical agents (for example may find at the typical test kit based on MSP) of analyzing for MSP can comprise, but be not limited to: for being selected from the sequence gene transforming through hydrosulphite of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ IDNOS:160 to SEQ ID NO:165 or methylating and unmethylated PCR primer of genome sequence, PCR damping fluid and the deoxynucleotide optimized, and special probe.
In addition, other side of the present invention is alternative test kit, it comprises that wherein said assembly preferably includes the restriction enzyme of at least one methylation specific for determining that at least one is selected from Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and the gene of SEQ ID NOS:160 to SEQ ID NO:165 or the methylated assembly of genome sequence; One or more are suitable for the primer tasteless nucleotide (preferred one or more primer pairs) of sequence of at least one CpG dinucleotides that amplification comprises the sequence that is selected from SEQ ID NO:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167; And optionally, for carrying out and assess the specification sheets of described methylation analysis method.In one embodiment, the fragment that the length that the base sequence of described oligonucleotide is same as, is complementary to or hybridization is selected from the sequence of SEQ IDNOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ IDNOS:159 to SEQ ID NO:167 under tight or height stringent condition is at least 18 bases.
In other embodiment, described test kit can comprise that one or more are for analyzing the oligonucleotide probe of described digestion fragment, the fragment that the length that preferred described oligonucleotide is same as, is complementary to or hybridization is selected from the sequence of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 under tight or height stringent condition is at least 16 bases.
In preferred embodiments, described test kit can comprise other reagent, and this other reagent is selected from: damping fluid (for example restriction enzyme, PCR, storage or lavation buffer solution); DNA reclaims reagent or test kit (for example precipitation, ultrafiltration, affinity column) and DNA and reclaims component.
In other other embodiment, described test kit can contain the polysaccharase and the reaction buffer that are packaged in container separately, and described reaction buffer is optimized for described polymerase-mediated primer extension, for example PCR.In another embodiment of the present invention, described test kit also comprises the assembly for obtaining patient's biological sample.In preferred embodiments, described test kit comprises: the restriction enzyme reagent that (a) methylates responsive; (b) be suitable for the container of reagent and described patient's biological sample described in splendid attire; (c) at least a set of oligonucleotide that contains one or more peptide nucleic acid(PNA)s, the fragment that the length that it is same as, is complementary to or hybridization is selected from the sequence of SEQID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ IDNOS:159 to SEQ ID NO:167 under tight or height stringent condition is at least 16 bases; And optionally (d) uses and explains the specification sheets of test kit result.
In other preferred embodiment, described test kit comprises: the restriction enzyme reagent that (a) methylates responsive; (b) for the container of reagent described in splendid attire and patient's biological sample; (c) at least a set of primer tasteless nucleotide of sequence that is suitable at least one CpG dinucleotides that amplification comprises the sequence that is selected from SEQ ID NOs:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167; And optionally, (d) use and explain the specification sheets of test kit result.
In another embodiment, described test kit comprises: responsive restriction enzyme (a) methylates; (b) be suitable for the container of reagent and patient's biological sample described in splendid attire; (c) at least a set of primer tasteless nucleotide of sequence that is suitable at least one CpG dinucleotides that amplification comprises the sequence that is selected from SEQ ID NOs:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ IDNO:28, SEQ ID NOS:159 to SEQ ID NO:167; (d) at least a set of oligonucleotide that comprises one or more nucleic acid or peptide nucleic acid(PNA), the fragment that the length of sequence that it is same as, is complementary to or hybridization is selected from SEQ ID NOs:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 under tight or height stringent condition is at least 9 bases and optionally, (e) is used and explains the specification sheets of test kit result.
Described test kit also can contain other component being packaged in container separately, for example damping fluid or solution, and it is suitable for blocking-up, washing or coated.
The invention still further relates to test kit for the purposes in the diagnosis whether individuality cell proliferative disorders is existed is provided, it is realized by the responsive restriction enzyme analysis that methylates.Described test kit comprises container and DNA microarray component.Described DNA microarray component is a surface, and the position of specifying is thereon fixed with multiple oligonucleotide, and wherein said oligonucleotide comprises at least one CpG site that methylates.Described at least one, oligonucleotide is specific at least one and is selected from gene or the genome sequence of Septin 9 (comprising the transcript variant that they are all), FOXL2, SARM1, VTN, PRDM6, NR2E1, FAT and SEQ ID NOS:160 to SEQ ID NO:165, and at least 15 base pairs that comprise one of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 are long but be no more than the sequence of 200 bp.Preferably, described sequence is that at least 15 base pairs of one of SEQ ID NOS:1 to SEQ ID NO:3, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 are long but be no more than the sequence of 80bp.Further preferably, described sequence is that at least 20 base pairs of one of SEQ ID NOS:1 to SEQ ID NO:3, SEQ IDNO:24, SEQ ID NO:28, SEQ ID NOS:159 to SEQ ID NO:167 are long but be no more than the sequence of 30 bp.
Described test kit preferably also comprises the restriction enzyme component that comprises one or more responsive restriction enzymes that methylate.
In another embodiment, the feature of described test kit is also the restriction enzyme that it comprises at least one methylation specific, and the restriction site of the wherein said oligonucleotide restriction enzyme that comprises described at least one methylation specific.
Described test kit can also comprise one or more following components for DNA enrichment known in the prior art: protein ingredient, the methylated DNA of described albumen selective binding; Optionally the triplex in being applicable to solution forms nucleic acid component, one or more joints; For example, for the material or the solution that connect, ligase enzyme or damping fluid; For carrying out material or the solution of column chromatography; Be used for carrying out for example, material or solution based on immunologic enrichment (immunoprecipitation); Be used for material or the solution of the nucleic acid amplification that carries out for example PCR; If if can use one or more dyestuffs that can use in solution together with coupling agent; For material or the solution of hybridizing; And/or for carrying out material or the solution of cleaning step.
The present invention also provides the composition that can be used for detection, distinguishes and distinguish colon cell proliferative disorders.Described composition comprises the long nucleic acid of at least one 18 base pair, and it is for being disclosed in SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS.42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, the fragment of the nucleotide sequence in SEQ ID NOS:168 to SEQ ID NO:203, and one or more take from the magnesium chloride of following material: 1-5mM, 100-500 μ M dNTP, the taq polysaccharase of 0.5-5 unit, bovine serum albumin, oligomer is oligonucleotide or peptide Nucleotide (PNA) oligomer especially, each of described oligomer comprises the base sequence that at least one length is at least 9 Nucleotide, and it is complementary to or hybridizes pretreated genomic dna SEQ ID NOS:10 to SEQ ID NO:15 under medium tight or stringent condition, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, one of SEQ ID NOS:168 to SEQ ID NO:203 and complementary sequence thereof.Preferably the composition of described material comprises such buffered soln: it is suitable in the aqueous solution, stablizing described nucleic acid and the reaction based on polysaccharase can be carried out in described solution.Applicable damping fluid is known and commercially available in the prior art.
In further preferred embodiment of the present invention, described at least one nucleic acid is the long segment of at least 50,100,150,200,250 or 500 base pairs that is disclosed in the nucleotide sequence in SEQ ID NOS:10 to SEQ ID NO:15, SEQ ID NOS:28 to SEQ ID NO:33, SEQ ID NOS:30 to SEQ ID NO:31, SEQ ID NOS:42 to SEQ ID NO:43, SEQ ID NOS:38 to SEQ ID NO:39, SEQ ID NOS:50 to SEQ ID NO:51, SEQ ID NOS:168 to SEQ ID NO:203.
The present invention is described particularly with reference to its some preferred embodiment, and following embodiment is only for explaining the present invention, is not intended to be limited in the scope of principle of the present invention and extensive interpretation and equivalent thereof.
Embodiment
embodiment 1
In following embodiment, the sequence of below listing is measured to analyze by MSP and/or HeavyMethyl.This mensuration is designed in the upper operation of LightCycler platform (Roche Diagnostics), but other in the prior art normally used this quasi-instrument be also applicable to.
MSP amplified production detects by the fluorescently-labeled detection probes of Taqman type, and HeavyMethyl amplified production detects by the two probes of Lightcycler type.
Goal gene group region:
SEQ ID NO:165
Type: HeavyMethyl
Primer:
SEQ ID NO:249
SEQ ID NO:250
Blocker:
SEQ ID NO:251
Probe:
SEQ ID NO:252
SEQ ID NO:253
Temperature cycle program:
activation:95 ℃ 10 minutes
55 circulations:95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s)
72 ℃ 10 seconds (20 ℃/s)
melt:
95 ℃ 10 seconds 20
35 ℃ of 20 detections in 20 seconds
95 ℃ 0 second 0,1
Goal gene group region
SEQ ID NO:24
Type: HeavyMethyl
Primer:
SEQ ID NO:254
SEQ ID NO:255
Blocker:
SEQ ID NO:256
Probe:
SEQ ID NO:257 (fluorescently-labeled)
SEQ ID NO:258 (Red640 mark)
Temperature cycle program:
95 ℃ of sex change
95 ℃ 10 minutes
55 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
56 ℃ annealing 30 seconds (20 ℃/s)
72 ℃ extend 10 seconds (20 ℃/s)
melt:
95 ℃ 10 seconds 20
35 ℃ 20 seconds 20
95 ℃ 0 second 0,1
Goal gene group region
SEQ ID NO:24
Type HeavyMethyl
Primer:
SEQ ID NO:264
SEQ ID NO:265
Blocker:
SEQ ID NO:266
Probe:
SEQ ID NO:267 (fluorescently-labeled)
SEQ ID NO:268 (Red640 mark)
Temperature cycle program:
95 ℃ of sex change
95 ℃ 10 minutes
55 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
56 ℃ annealing 30 seconds (20 ℃/s)
72 ℃ extend 10 seconds (20 ℃/s)
melt:
95 ℃ 10 seconds 20
35 ℃ 20 seconds 20
95 ℃ 0 second 0,1
Goal gene group region:
SEQ ID NO:28
Type: MSP
Primer:
SEQ ID NO:274
SEQ ID NO:275
Taqman probe:
SEQ ID NO:276
Temperature cycle program:
activation:95 ℃ 10 minutes
55 circulations:95 ℃ 15 seconds (20 ℃/s)
62 ℃ 45 seconds (20 ℃/s)
cooling:40 ℃ 5 seconds
Goal gene group region:
SEQ ID NO:1
Type: MSP
Primer:
SEQ ID NO:277
SEQ ID NO:278
Taqman probe:
SEQ ID NO:279
Temperature cycle program:
activation:95 ℃ 10 minutes
55 circulations:95 ℃ 15 seconds (20 ℃/s)
62 ℃ 45 seconds (20 ℃/s)
cooling:40 ℃ 5 seconds
Goal gene group region:
SEQ ID NO:28
Type: MSP
Primer:
SEQ ID NO:280
SEQ ID NO:281
Taqman probe:
SEQ ID NO:282
Temperature cycle situation:
activation:95 ℃ 10 minutes
55 circulations:95 ℃ 15 seconds (20 ℃/s)
62 ℃ 45 seconds (20 ℃/s)
Goal gene group region:
SEQ ID NO:1
Type: MSP
Primer:
SEQ ID NO:283
SEQ ID NO:284
Taqman probe:
SEQ ID NO:285
Temperature cycle situation:
activation:95 ℃ 10 minutes
55 circulations:95 ℃ 15 seconds (20 ℃/s)
62 ℃ 45 seconds (20 ℃/s)
Goal gene group region:
SEQ ID NO:28
Type: HeavyMethyl
Primer:
SEQ ID NO:286
SEQ ID NO:287
Blocker:
SEQ ID NO:288
Probe:
SEQ ID NO:289
SEQ ID NO:290
Temperature cycle situation:
95 ℃ of activation
95 ℃ 10 minutes
50 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
56 ℃ annealing 30 seconds (20 ℃/s)
72 ℃ extend 10 seconds (20 ℃/s)
melt
95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
cooling
40 ℃ 5 seconds
Goal gene group region
SEQ ID NO:1
Type: HeavyMethyl
Primer:
SEQ ID NO:291
SEQ ID NO:292
Blocker:
SEQ ID NO:293
Probe:
SEQ ID NO:294
SEQ ID NO:295
Temperature cycle situation:
95 ℃ of activation
95 ℃ 10 minutes
50 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
56 ℃ annealing 30 seconds (20 ℃/s)
72 ℃ extend 10 seconds (20 ℃/s)
melt
95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
cooling
40 ℃ 5 seconds
Goal gene group region
SEQ ID NO:1
Type: HeavyMethyl
Primer:
SEQ ID NO:296
SEQ ID NO:297
Blocker:
SEQ ID NO:289
Probe:
SEQ ID NO:299
SEQ ID NO:300
Temperature cycle situation:
95 ℃ of activation
95 ℃ 10 minutes
50 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
56 ℃ annealing 30 seconds (20 ℃/s)
72 ℃ extend 10 seconds (20 ℃/s)
melt
95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
cooling
40 ℃ 5 seconds
Goal gene group region
SEQ ID NO:166
Type: HeavyMethyl
Primer:
SEQ ID NO:259
SEQ ID NO:260
Blocker:
SEQ ID NO:261
Probe:
SEQ ID NO:262
SEQ ID NO:263
Temperature cycle situation:
activation:95 ℃ 10 minutes
55 circulations: 95 ℃ 10 seconds
58 ℃ 30 seconds
72 ℃ 10 seconds
melting curve:95 ℃ 10 seconds
35 ℃ 20 seconds
95 ℃ 0 second
cooling:40 ℃ 5 seconds
Goal gene group region:
SEQ ID NO:167
Type: HeavyMethyl
Primer:
SEQ ID NO:269
SEQ ID NO:270
Blocker:
SEQ ID NO:271
Probe:
SEQ ID NO:272
SEQ ID NO:273
Temperature cycle situation:
95 ℃ of sex change
95 ℃ 10 minutes
55 circulations:
95 ℃ of sex change 10 seconds
Anneal 30 seconds for 56 ℃
72 ℃ are extended 10 seconds
melt
95 ℃ 10 seconds
40 ℃ 10 seconds
embodiment 2
Carry out following analysis, to select to be suitable for the preferred group (panel) of colorectal cancer screening and/or diagnosis according to the analysis to DNA methylation in whole blood.
Adopt mensuration platform (Lightcycler) and the real time measure method (MSP and/or HeavyMethyl) to analyze the performance of every kind of mark, as be suitable for use in reference or clinical laboratory apparatus.In colorectum cancerous tissue and whole blood, test independently the performance of every kind of mark, to the indication of the tolerance range of every kind of mark is provided.
Described group is selected from following mark:
SEQ ID NO:376
SEQ ID NO:378
SEQ ID NO:27
SEQ ID NO:26
SEQ ID NO:24
SEQ ID NO:1
SEQ ID NO:165
SEQ ID NO:25
SEQ ID NO:28
SEQ ID NO:378
SEQ ID NO:163
Every kind of mark is by the assay method of at least one methylation specific, and MSP and/or HeavyMethyl, analyze, as shown in table 2.
Carry out the further mensuration (non-methylation specific) of measuring hereinafter referred to as C3, so that the total DNA in quantitative every kind of sample.Described C3 is determined as hydrosulphite DNA and measures, and it is independent of methylation state and detects total DNA.Use following primer and probe:
Primer: GGAGTGGAGGAAATTGAGAT SEQ ID NO:62
Primer: CCACACAACAAATACTCAAAAC SEQ ID NO:63
Probe: TGGGTGTTTGTAATTTTTGTTTTGTGTTAGGTT SEQ IDNO:64
Every kind is determined on colorectal carcinoma, normal adjacent tissue and/or whole blood sample and reruns twice, as shown in table 3.
Adopt commercially available test kit to carry out DNA extraction, carry out hydrosulphite conversion according to the method for describing in the Olek et al. (1996) slightly revising.
All mensuration (C3 and methylation specific) all adopt Lightcycler platform to carry out.
data interpretation
The calculating of DNA concentration
The Cp (intersection point value) that Lightcycler instrument software calculates and intensity curve are used to determine DNA concentration.Measure for methylation assay and C3, all calculate DNA concentration by the CP value reference standard curve that makes every hole.
Sample repeats
As a rule, every kind of mensuration all will, to every kind of sample operation twice, obtain multiple measuring results to every kind of sample.For every kind of sample, score value is calculated as follows:
1. calculate the right ratio v1/v2 of all samples
If 2. the two is all lower than threshold value 0.1ng, ratio be made as=, if one is=, and another is higher than threshold value, ratio is made as to 100
3. exceeding each working sample of 2.5 for ratio no longer further analyzes
4. for the sample inaccurately with twice repetition, get average, do not get any score value per-cent that methylates
Adopt the sample that is less than 1ng DNA through measuring that C3 measures no longer to further consider.For every kind of sample, the per-cent that methylates detecting is calculated as the DNA relative concentration DNA concentration in the sample of measuring quantitative measurment by C3 that adopts methylation assay quantitative measurment.
On three different threshold levels (referring to table) and (detect methylated any sample and be regarded as the positive) in all methylation level and determine methylated detection.
The sensitivity of every kind of mensuration determines from colorectal carcinoma sample positive rate, and wherein Sensitivity determination is detected the sample % of (being true positives) by the positive for methylating.
The specificity of every kind of mensuration is determined from the negative recall rate of whole blood sample (being true negative recall rate), wherein from analyzed total number of samples, is deducted false positive.
result
The ratio that methylating of measuring is positioned at the institute's analytic sample by the various threshold values of independent mensuration is presented at table 4 (colorectum cancerous tissue), 5 (normal adjacent tissue) and 6 (whole bloods).
Figure 30 to 37 demonstration bivariate distribution figure (upper left side of figure) and the methylation level measured are higher than the relevant polymorphic type distribution plan (the lower-left side of figure) of the ratio (Y-axis) of the colorectum cancerous tissue of certain threshold (X-axis) and whole blood (and in some situation normal adjacent tissue) sample.The right side of every figure is that sensitivity is schemed with respect to specific ROC.ROC curve is the figure for the relative false positive rate of True Positive Rate of the difference possibility threshold value of diagnostic test.Between its display sensitivity and specificity, depend on the compromise (any increase of sensitivity will be attended by specific reduction) of selected threshold value.ROC area under curve (AUC) is that (area is the bigger the better, and the best is 1, and random test meeting has along cornerwise ROC curve, area 0.5 for measurement to diagnostic test accuracy; Reference: J.P.Egan.Signal Detection Theory and ROC Analysis, Academic Press, New York, 1975).The AUC of each ROC figure and Wilcoxon p-value are presented in table 12.
stage
According to cancer staging, the further analysis of colorectal carcinoma result is presented in table 7.In described table, shown to all stages of CRC based on the methylate mark sensitivity of threshold value (> 10% and > 20%) of two differences.For most of marks, sensitivity is all consistent in all CRC stage, so these marks can be suitable for the detection in all stages of CRC in examination or monitoring test.Seem the trend that has sensitivity to raise in II phase cancer.Sensitivity is lower, more Specific marker is tending towards (for example identifying more early stage cancer, SEQ ID NO:25 (measuring 3)) also can increase the sensitivity of examination and/or monitoring test, but also can be used for other application (biopsy, stool test etc.).
group
Table has shown in 8-11 in colorectal carcinoma and whole blood by measuring the methylate ratio of the institute's analytic sample that is positioned at various threshold values of multiple measurement.In every kind of situation, form has shown the sample ratio in given threshold value, and adopts two kinds of marks compared to the only improvement of the first mark sample detection.
embodiment 3
Carry out following analysis, to confirm that gene 9 (comprising its transcript variant Q9HC74) of Septin and group thereof are as the applicable mark for colorectal cancer screening and/or diagnosis, its DNA methylation analysis based in whole blood, by the performance that checking is measured in a large amount of sample sets.
The performance of mark measures platform (Lightcycler) by employing and the real time measure method (MSP and/or HeavyMethyl) is analyzed, as is suitable for use in reference or clinical laboratory apparatus.In colorectum tissue (normal adjacent tissue), colorectum cancerous tissue and whole blood, test independently the performance of every kind of mark, to the indication to mark accuracy is provided.
Adopt following primer and probe:
Adopt the SEQ ID NO:1 (measuring 7) of the Lightcycler probe of table 2 to adopt following scheme to carry out:
Water adds to final volume 10 μ l
MgCl
2 3.5
Forward primer 0.3
Reverse primer 0.3
Blocker 4
Detection probes (fluorescence) 0.15
Detection probes (red) 0.15
1a+1b reagent FastStart mixture 1
DNA
LightCycler program:
Activation: 95 ℃ 10 minutes
55 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
Adopt the SEQ ID NO:1 (measuring 7) of table 2 Taqman probe to adopt following scheme to carry out:
Scheme:
Water adds to final volume 10 μ l
MgCl
2 3.5
Primer 1 0.3
Primer 2 0.3
Blocker 4
TaqMan probe 0.15
1a+1b reagent (FastStart) 1
DNA 10μl
Cycling condition
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
95 ℃ of melting curves 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
Carry out C3 mensuration with the total DNA in quantitative every kind of sample.This C3 measures as above embodiment 2 carries out.
Every kind is determined on colorectal carcinoma, normal adjacent tissue and/or whole blood sample and repeats twice.Analyzed two groups of samples, sample sets 1 is presented in table 13, and sample sets 2 is presented in table 14.
Sample sets 1 adopts following mensuration to analyze, as described in detail in table 2:
SEQ ID NO:1 (measuring 2)
SEQ ID NO:26 (measuring 6)
SEQ ID NO:24 (measuring 5)
SEQ ID NO:25 (measuring 3)
Sample sets 2 adopts following mensuration to analyze, as described in detail in table 2:
SEQ ID NO:1 (measuring 7) LightCycler (LC) and Taqman (Taq) variant and following mensuration
SEQ ID NO:28 (measuring 2)
SEQ ID NO:24 (measuring 5b)
SEQ ID NO:29 (measuring 2b)
As described in detail in table 7.
Only analyze and contain the sample that is greater than 4ng DNA.In sample sets 1,27 blood samples and 91 colorectal carcinoma samples are analyzed.In sample sets 2,26 blood samples are analyzed, 22 non-colorectum samples that close on and 81 colorectal carcinoma samples.
All mensuration (C3 and methylation specific) all adopt Lightcycler platform to carry out.
dNA extraction and bisulf iotate-treated
Pass through Magna Pure method (Roche) DNA isolation from all samples according to the explanation of manufacturers.Then transform according to following bisulfite reaction the effluent obtaining from purifying.Make the dioxane that contains free-radical scavengers ((6-hydroxyl-2 of 98.6mg of bisulfite solution (5.89mol/l) and the 146 μ l of effluent and 354 μ l, 5,7,8-tetramethyl-chromanane 2-carboxylic acid is in 2.5ml dioxane)) mix.At 99 ℃, make reaction mixture sex change 3 minutes, then under following temperature program(me), hatch altogether 7h minute 50 ℃; A thermal peak (99.9 ℃) 3 minutes; 50 ℃ of 1.5h; A thermal peak (99 ℃) 3 minutes; 50 ℃ of 3h.Adopt subsequently Millipore Microcon
tMpost is by ultrafiltration purification reaction mixture.Substantially carry out purifying according to the specification sheets of manufacturers.For this reason, make reaction mixture and the water of 300 μ l mix, be loaded to ultra-filtration membrane, centrifugal 15 minutes, then wash with 1xTE damping fluid.In this processing, DNA is still retained on film.Then carry out desulfonation.For this reason, add 0.2mol/lNaOH and hatch 10 minutes.Then order is carried out the washing step of centrifugal (10 minutes) and 1xTE damping fluid.After this, eluted dna.For this reason, film is mixed 10 minutes with the 1xTE damping fluid (50 ℃) of 75 μ l heating.Specification sheets according to manufacturers overturns film.Carry out subsequently the centrifugal of repetition, with this, DNA is removed from film.The effluent of 10 μ l is used to Lightcycler PCR in real time and measures.
reaction soln and thermal cycle conditions
sEQ ID NQ:26 measures 6 (HeavvMethyl mensuration)
Reaction soln:
Water
MgCl
23.50mM (damping fluid, comprise 1mM! )
Primer mixture 0.30 μ M (every kind)
Blocker 4.00 μ M
Detection probes mixture 0.15 μ M (every kind)
1a+1b reagent FastStart mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
55 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
sEQ ID NO:25 measures 3 (HeavvMethvl mensuration)
Reaction soln:
Water
MgCl2 3.50mM (damping fluid, comprise 1mM! )
Primer mixture 0.30 μ M (every kind)
Blocker 4.00 μ M
Detection probes mixture 0.15 μ M (every kind)
1a+1b reagent FastStart mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
55 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
SEQ ID NO:24 Assay 5B(HeavvMethyl Assay)
Reaction soln:
Water
MgCl
23.00mM (damping fluid, comprise mM! )
Forward primer 0.30 μ M
Reverse primer 0.30 μ M
Blocker 4.00 μ M
Detection probes fluorescence 0.15 μ M
The red 0.15 μ M of detection probes
1a+1b reagent mixture 1.00x
Thermal cycle conditions:
95 ℃ of sex change
95 ℃ 10 minutes
55 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
58 ℃ annealing 30 seconds (20 ℃/s) detect
72 ℃ extend 10 seconds (20 ℃/s)
Melt
95 ℃ 10 seconds 20
35 ℃ 20 seconds 20
95 ℃ 0 second 0,1
sEQ ID NO:24 measures 5 (HeavvMethyl mensuration)
Reaction soln:
Water
MgCl
23.00mM (damping fluid, comprise mM! )
Forward primer 0.30 μ M
Reverse primer 0.30 μ M
Blocker 4.00 μ M
LightCycler probe 0.15 μ M
LightCycler probe 0.15 μ M
1a+1b reagent mixture 1.00x
Thermal cycle conditions:
95 ℃ of sex change
95 ℃ 10 minutes
55 circulations:
95 ℃ of sex change 10 seconds (20 ℃/s)
58 ℃ annealing 30 seconds (20 ℃/s) detect
72 ℃ extend 10 seconds (20 ℃/s)
Melt
95 ℃ 10 seconds 20
35 ℃ 20 seconds 20
95 ℃ 0 second 0,1
sEQ ID NO:1 measures 2 (MSP mensuration)
Reaction soln:
Water (3315932)
MgCl
2(2239272) 3.50MM(*)
Forward primer 0.60 μ M
Reverse primer 0.60 μ M
Detection probes 0.30 μ M
1a+1b reagent FastStart mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 15 seconds
62 ℃ 45 seconds
Cooling: 40 ℃ 5 seconds
sEQ ID NO:1 measures 7 (LiqhtCycler probe HeawMethyl mensuration)
Reaction soln:
Water
MgCl
23.50mM (damping fluid, comprise mM! )
Primer 1 0.30 μ M
Primer 2 0.30 μ M
Blocker 4 μ M
Detection probes (fluorescence) 0.15 μ M
Detection probes (red) 0.15 μ M
1a+1b reagent (FastStart) mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
sEQ ID NO:1 measures 7 (Taqman HeavyMethyl mensuration)
Reaction soln:
Water
MgCl
23.50mM (damping fluid, comprise mM! )
Primer 1 0.30 μ M
Primer 2 0.30 μ M
Blocker 4.00 μ M
Detection probes 10.15 μ M
Detection probes 20.15 μ M
1a+1b reagent mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
sEQ ID NO:28 measures 2 (HeavyMethy) and measures)
Reaction soln:
Water
MgCl
23.50mM (damping fluid, comprise mM! )
Primer 1 0.30 μ M
Primer 2 0.30 μ M
Blocker 4.00 μ M
Detection probes 1 0.15 μ M
Detection probes 2 0.15 μ M
1a+1b reagent mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
sEQ ID NO:29 measures 2B (HeavvMethyl mensuration)
Reaction soln:
Water
MgCl
23.00mM (damping fluid, comprise mM! )
Primer 1 0.30 μ M
Primer 2 0.30 μ M
Blocker 4.00 μ M
Detection probes (fluorescence) 0.15 μ M
Detection probes (red) 0.15 μ M
1a+1b reagent (FastStart) 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
50 circulations: 95 ℃ 10 seconds (20 ℃/s)
58 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
sEQ ID NO:29 measures 2 (HeavvMethyl mensuration)
Reaction soln:
Water
MgCl
23.50mM (damping fluid, comprise mM! )
Primer 1 0.30 μ M
Primer 2 0.30 μ M
Blocker 4.00 μ M
Detection probes 1 0.15 μ M
Detection probes 2 0.15 μ M
1a+1b reagent mixture 1.00x
Thermal cycle conditions:
Activation: 95 ℃ 10 minutes
55 circulations: 95 ℃ 10 seconds (20 ℃/s)
56 ℃ 30 seconds (20 ℃/s) detect
72 ℃ 10 seconds (20 ℃/s)
Melting curve: 95 ℃ 10 seconds 20
40 ℃ 10 seconds 20
70 ℃ 0 second 0,1
Cooling: 40 ℃ 5 seconds
data interpretation
The calculating of DNA concentration
The Cp (intersection point value) that Lightcycler instrument software calculates is used to determine DNA concentration.Measure for methylation assay and C3, all calculate DNA concentration by the CP value reference standard curve that makes every hole
As a rule, every kind of mensuration all will, to every kind of sample operation twice, obtain multiple measuring results to every kind of sample.
Per-cent methylates
Adopt all samples that is less than 4ng DNA through measuring that C3 measures no longer to further consider.For every kind of sample, the per-cent that methylates detecting is calculated as the DNA relative concentration DNA concentration in the sample of measuring quantitative measurment by C3 that adopts methylation assay quantitative measurment.
On multiple different threshold levels (referring to table) and (detect methylated any sample and be all regarded as the positive) in all methylation level and determine methylated detection.
The sensitivity of every kind of mensuration determines from colorectal carcinoma sample positive rate, and wherein Sensitivity determination is detected the sample % of (being true positives) by the positive for methylating.
The specificity of every kind of mensuration picks rate (being true negative recall rate) from whole blood sample feminine gender to be determined, wherein from analyzed total number of samples, deducts false positive.
result
Ratio or the quantity of the sample by analysis that what each assay method was measured methylate is arranged in given threshold value are presented at table 15 (sample sets 1) and 16 (sample sets 2).Wherein at least twice repetition is once positive at given threshold value build-in test, and this sample is considered to positive.Be measured as and there are the ratio of the methylated analyzed sample in given threshold value or quantity this group data that collect by determining by least one mensuration of this group.In the time that at least one in two repetitions is tested as positive in given threshold value, this sample is considered to positive.
In 14 mammary cancer samples, 12 colorectal carcinoma samples and 10 whole blood samples (sample sets 3), further test SEQ ID NO:1 measures 2.Ratio or the quantity of the sample by analysis that what each assay method was measured methylate is arranged in given threshold value are presented at table 18.
embodiment 4: other cancer
Carry out following analysis, to confirm that gene 9 (comprising its transcript variant Q9HC74) of Septin and group thereof are as for examination and/or diagnose the applicable mark of other cancer, its DNA methylation analysis based in whole blood, by the performance that checking is measured in a large amount of sample sets.
The SEQ ID NO:1 HeavyMethyl of employing table 2 measures the performance of 7 analysis mark things, and reaction conditions is according to embodiment 2.
Table 20 has shown the sample size of testing in every class, and the methylate quantity of positive sample of twice replication.Fig. 3 has shown the methylation level of measuring in other cancer, can see that this gene is methylated in polytype cancer.But, only have liver cancer to methylate with the ratio that is equal to or higher than colorectal carcinoma.Fig. 4 has shown the methylation level of measuring in other non-cancer disease, can find out and only have pyelonephritis to be methylated with the ratio that is equal to or higher than colorectal carcinoma.
embodiment 5: bisulfite sequencing
the order-checking of Septin 9 genes
By inference Septin 9 have 4 (referring to the discussion about Ensembl database before) at least 6 different transcript variants (at 5 ' end, referring to Russell, Oncogene.2001 Sep13; 20 (41): 5930-9).For the mentioned variant of the people such as Russell, amplicon is designed to cover four kinds of variants (α, β, γ and ε) CpG island or is rich in CpG region.There are overlapping 2 variants in Liang Ge CpG island, ε and γ.β variant seems that Beiγ CPG island regulates.
Analyzed the sample from 12 patients, the methylated level of Septin 9 is analyzed by quantitative, as mentioned above by HeavyMethyl.Two samples have methylate (the sample C group) that is greater than 20%, and 4 samples have 10% to 20% and methylate (sample B group) and 6 samples have shown at the most 10% methylate (sample A group) before having.
In addition, from there is no the DNA of 3 individual whole blood samples of obvious disease also for α and β amplicon (sample N group).
DNA extraction and bisulf iotate-treated
Adopt QIAGEN Genomic-Tip 500/G or 100/G, according to the specification sheets DNA isolation of manufacturers.Transform purified genomic dna according to following bisulfite reaction subsequently.
2 μ l DNA in 100 μ l and the bisulfite solution of 354 μ l (10.36g sodium bisulfite and 2.49g S-WAT in 22ml nuclease free water) and contain free-radical scavengers (6-hydroxyl-2,5,7,8-tetramethyl-chromanane 2-carboxylic acid, 323mg in 8.2ml diox) 146 μ l dioxs mix.This bisulfite reaction is as follows:
Reaction mixture adopts Millipore Microcon subsequently
tMpost passes through ultrafiltration purification.This purifying carries out according to the specification sheets of manufacturers.More specifically, with desulfonation and washing:
Then, the hydrosulphite TE damping fluid of 50 μ l (is preheated to 50 ℃; 0.1mM EDTA in 10mM Tris) add to film, and (1000rpm) hatches 10 minutes under stirring.This post is oppositely put into low the holding back of 1.7ml manages and rotates 7 minutes with eluted dna with 1000g.Adopt the PCR in real time of control sequence (HB14) to measure definite DNA concentration.
amplification
Amplicon and PCR primer are referring to table 21.Amplicon with " rc " in its title increases from Bis2 chain, and other increases from Bis1 chain.
Object fragment adopts the amplification in 25 μ l reactions of following condition.
PCR reaction:
Cycling condition:
94 ℃ of 3min; 94 ℃ of 20s; 54 ℃ of 30s; 72 ℃ of 45s (38-42 circulation); 72 ℃ of 10min
The purifying of PCR product
Adopt Montage
tMdNA gel extracts test kit, according to the specification sheets purified pcr product of manufacturers.In brief, PCR reactant runs glue on the TAE of 1% improvement (contain 0.1mM EDTA, rather than in standard TAE 1.0mM EDTA) sepharose.Cut target DNA band and shred.Blob of viscose, as in Montage gel extraction equipment, and is collected to DNA solution for 10 minutes with 5000g rotation.The DNA of purifying is further concentrated to 10 μ l.
TA clone
Adopt Invitrogen TOPO
_tA clones test kit, according to the explanation of the manufacturers described PCR product of cloning and increase.In brief, 2 μ l purifying and concentrated PCR product are used in TOPO cloning reaction to be cloned into carrier pCR
_2.1-TOPO.Transform and adopt chemically competent E.coli strain TOP10 to carry out.
Order-checking
The single clone of picking also cultivates at LB (50 μ g Anabacty/ml LB are for selecting).The overnight culture of 1 μ l is used to the bacterium colony PCR in 20 μ l volumes:
PCR mixture
2.5 μ l 10 × DyNAzyme damping fluids
2.5μl 2mM dNTPs
1.25 μ l M13F primers (10 μ M)
1.25 μ l M13R primers (10 μ M)
0.25 μ l DyNAzyme polysaccharase
12.25μl ddH20
Cycling condition:
94 ℃ of 3min; 94 ℃ of 1min; 55 ℃ of 1min; 72 ℃ of 1min (36 circulation); 10min72 ℃
Adopt standard operation to carry out the sub-purifying of colony PCR amplification and sequence reading.Sequencing primer used is M13 reverse primer or produces one of amplicon special primer of initial PCR product.
Result
The matrix that Fig. 5 to 29 provides the bisulfite sequencing data of the γ amplicon from analyzing by applicant's intellectual property software (further information, referring to WO 2004/000463) to produce.The sequencing data that every row representative of matrix repeats for a sample, the used of every kind of sample repeats to be divided in a piece.Every row of matrix represents the single CpG site in fragment.The CpG number of amplified production is presented at the left side of matrix.
The methylated amount of each CpG position measurement by from light grey (0% methylates), to ash (50% methylates), represent to grey black (100% methylates).Correctly do not checked order in some amplified productions, sample or CpG position, they are shown as white,
Fig. 5 to 29 provides the matrix of the bisulfite sequencing data of embodiment 5.Every row of this matrix represent the repetition sequencing data of a sample, and all of each sample repeat to be in a piece.Every row of matrix represents the single CpG site in fragment.The CpG digital display of amplified production is shown in the left side of matrix.
The methylated amount of each CpG position measurement by from light grey (0% methylates), to ash (50% methylates), represent to grey black (100% methylates).Successfully do not checked order in some amplified productions, sample or CpG position, they are shown as white.
Fig. 5 to 12 provided before 4 to be had in 10% to 20% methylated sample by quantitative (analyzing by HeavyMethyl), transforms the order-checking overview of amplified production according to the hydrosulphite of the genome sequence of table 21.
Figure 13 to 20 provided before 2 to be had higher than in 20% methylated sample by quantitative (analyzing by HeavyMethyl), transforms the order-checking overview of amplified production according to the hydrosulphite of the genome sequence of table 21.
Figure 21 to 22 provides the order-checking overview that transforms amplified production in 3 healthy individuals blood samples according to the hydrosulphite of the genome sequence of table 21.
Figure 23 to 29 provided before 6 by quantitatively (analyzing by HeavyMethyl) to be had and methylates in the sample of (but higher than 0%) lower than 10%, transforms the order-checking overview of amplified production according to the hydrosulphite of the genome sequence of table 21.
embodiment 6
Other mensuration that is suitable for the variant through bisulf iotate-treated of the genome sequence of analyzing SEQ ID NO:159 to SEQ ID NO:163 is presented in table 22.The bisulf iotate-treated of genomic dna can such as, by the scheme well known in the prior art (people such as Olek A, A modifiedand improved method for bisulfite based cytosine methylation analysis (methods of the changes and improvements of the cytosine methylation analysis based on hydrosulphite), Nucleic Acids Res.24:5064-6,1996) carry out.Applicable cycling condition is known to those skilled in the art, and can draw from the melting temperature (Tm) of oligomer, as shown in Table 22.
table 1: according to the genome sequence of sequence table
*ensembl database
table 2
table 3: the sample of analyzing according to embodiment 2
Table 4: there is the ratio that is positioned at the methylated colorectal carcinoma sample of different threshold values
Table 5: there is the ratio that is positioned at the methylated normal adjacent tissue sample of different threshold values
Table 6: there is the ratio that is positioned at the methylated whole blood sample of different threshold values
Table 7: according to the methylate ratio of the colorectal carcinoma in threshold value of the difference of disease stage
Table 8: 1% to 10% ratio of colorectal carcinoma sample that methylates threshold value that is positioned at detecting
15% to 25% ratio of colorectal carcinoma sample that methylates threshold value that what table 9 detected be positioned at
30% to 50% ratio of colorectal carcinoma sample that methylates threshold value that what table 10 detected be positioned at
Table 11 is positioned at 0.01% to the 0.1% whole blood sample ratio that methylates threshold value after testing
table 12: as the difference * between blood and the colorectal carcinoma sample of explanation in Figure 30-37
* fiducial interval is presented in bracket
The sample sets 1 of table 13: embodiment 3
the sample sets 2 of table 14: embodiment 3
Table 15: there is the methylated sample ratio from embodiment 3 sample sets 1 of different threshold values that is positioned at
The * twice repeated test positive
* one of * twice repeated test is positive or be positioned at threshold value through measuring
Table 16 has the methylated sample ratio from embodiment 3 sample sets 2 of different threshold values that is positioned at
* one of twice repeated test is positive or be positioned at threshold value through measuring
Table 17 is according to the mensuration of embodiment 3
Table 18: there is the methylated sample ratio from embodiment 3 sample sets 1 that is positioned at different threshold values
The sample sets 3 of table 19: embodiment 3
the result of table 20: embodiment 4
table 21 is according to the genome Equivalent of the primer of embodiment 5 and amplified production
Attention: in title, there is the amplicon of " rc " from the amplification of Bis2 chain,
And other increases from Bis1.
Table 22: according to the oligomer of embodiment 6