WO2001062967A2 - A method that compares genomic sequences - Google Patents

A method that compares genomic sequences Download PDF

Info

Publication number
WO2001062967A2
WO2001062967A2 PCT/IL2001/000151 IL0100151W WO0162967A2 WO 2001062967 A2 WO2001062967 A2 WO 2001062967A2 IL 0100151 W IL0100151 W IL 0100151W WO 0162967 A2 WO0162967 A2 WO 0162967A2
Authority
WO
WIPO (PCT)
Prior art keywords
primers
pcr
primer
ofthe
gene family
Prior art date
Application number
PCT/IL2001/000151
Other languages
French (fr)
Other versions
WO2001062967A3 (en
Inventor
Ben-Zion Vider
Nurit Katzir
Original Assignee
Genena Ltd.
Agricultural Research Organization Newe Ya'ar Research Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Genena Ltd., Agricultural Research Organization Newe Ya'ar Research Center filed Critical Genena Ltd.
Priority to AU2001234048A priority Critical patent/AU2001234048A1/en
Publication of WO2001062967A2 publication Critical patent/WO2001062967A2/en
Publication of WO2001062967A3 publication Critical patent/WO2001062967A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6853Nucleic acid amplification reactions using modified primers or templates
    • C12Q1/6855Ligating adaptors

Definitions

  • the invention relates to the field of DNA fingerprinting, and genomic polymorphism
  • Mapping plant genes with molecular markers has many applications for agriculture: these include marking useful genes to assist breeding, QTL (Quantitative Trait Loci) analysis and map based cloning of important genes and QTLs.
  • Genetic markers when closely associated with economically important traits, increase breeding efficiency by allowing earlier selection in smaller 10 populations, as compared with conventional breeding techniques.
  • Genetic markers when combined with genetic maps, can be employed in marker-assisted selection (MAS) strategies for yield and quality components. As genetic maps become saturated, their value to programs of plant improvement increases. The most interesting applications of molecular markers require reasonably saturated maps (markers ⁇ 1 cM apart).
  • RAPDs Random Amplified Polymorphic DNA
  • PCR Polymerase Chain Reaction
  • SCARs Sequence Characterized Amplified Regions
  • Microsatellites are repetitive sequences that consist of simple repeats of di-, tri-, or tetra-nucleotides interspersed in the genome.
  • Single-locus markers also called SSR-Simple Sequence Repeats
  • SSRs 10 are highly polymorphic, highly reproducible, co-dominant markers that are frequently used in mapping projects, although the initial investment in developing each marker is high.
  • Inter-SSR-PCR Another marker system based on microsatellites was developed and termed Inter-SSR-PCR: here, the microsatellite repeat is used as the amplification primer, and a multibanded, polymo ⁇ hic pattern is obtained.
  • AFLP Aminified Fragment 15 Length Polymorphism
  • subsets of restriction fragments are amplified by PCR from artificially ligated linkers, giving rise to multibanded patterns analyzed on sequencing gels.
  • each technique has its advantages and drawbacks, and for any botanical taxon or specific application one should select the methodology of choice.
  • a large percentage of actively expressed human and plant genes are members of families of DNA 20 sequences that show a high degree of sequence similarity. However, the extent of sequence sharing and the organization of family members may vary widely.
  • Classical gene families are distinguished by members that exhibit a high degree of sequence homology over most ofthe gene length or at least the coding DNA component, a feature which automatically identifies such sequences as being closely related evolutionary as well as 25 functionally. In some gene families there is particularly pronounced homology within specific, strongly conserved, regions ofthe genes. Such families may include genes possessing large, highly conserved sequence motifs, such as the paired box of PAX genes ( ⁇ 390 bp) or the homeobox of the homeobox containing genes( ⁇ 180 bp). On the oher hand the sequence similarity between the full length of family members may be low to an extent that the members of some gene families 30 may not be obviously related at the DNA sequence level.
  • Gene families can occur as closely clustered genes at specific subchromosomal localization, or as widely dispersed genes.
  • the clustered genes possess higher degree of similarity in the coding sequence than the interspersed genes, and they also demonstrate related pattern of expression.
  • Gene amplification the mechanism responsible for the evolvement of multigene families, may 5 result from large scale gene duplication events that involve the whole genome, or large chromosomal segments. Selective duplication of specific genes can occur by copy transposition events and also by tandem gene duplication.
  • the present invention describes a new method for obtaining DNA polymo ⁇ hism that is related to gene families, and also describes several accompanied novel technologies related to it. 10
  • the present invention is based on a novel method that uses in PCR a mix of primers matching conserved regions in the sequence of gene family members for DNA finge ⁇ rinting of cells and tissues.
  • the method includes several complementary methods and techniques that complement the basic method.
  • primers that match conserved regions in gene families as a tool for detecting polymo ⁇ hic markers in PCR may be advantageous. This is because the isolated polymo ⁇ hic 30 marker encode the sequence of a gene that contribute or is responsible for traits that vary between the compared strains. Thus, either the genetic markers themselves are responsible to economically important traits, or they might turn out to have a significant contribution to the specific trait. For example.
  • Kinase genes are associated with resistance to many plant diseases, and (2) various flavors and tastes of plant fruits are related to the cytochrome P450 enzyme system.
  • Another advantage of finding a specific marker that is a gene family member is that its homologous genes can be identified in other strains and species. 5
  • kinase super family is an excellent candidate containing more then 1000 genes in each organism and a very conserved catalytic domain.
  • the 10 homeobox gene family and the MADS box gene family are also good candidates fulfilling especially the requirement for a high degree of conservation between the various genes, while cytochrome P450 enzymes are much less conserved but are highly dispersed throughout the genome.
  • the method might assist in finding true orthologous genes between different species in 15 the milieu of genes that belong to gene families as will be explained later.
  • the PCR reaction contains a mix ofthe gene family conserved primers together with the primer that match any kind of frequently repeating sequences, e.g. the ISSR.
  • the ability ofthe gene family primers to generate PCR products in combination with the ISSR primer extends the feasibility of creating a variable 20 spectrum of PCR fragments rather than just by using sense and anti-sense primers ofthe gene family. If the mix of primers contains the gene family sense and anti sense primers together with a repeat primer, these PCR fragments will be basically of three types.
  • PCR products that are produced by the repeat primers (2) PCR products that contain on one end primer ofthe gene family and on the other end the repeat primer, thus containing part ofthe gene sequence and an 25 adjacent genomic sequence. (3) PCR products that contain both primers from the gene family and thus the sequence might be of an exon or of two parts of an exon separated by an intron.
  • PCR primers that contains only the gene family sense or anti sense primers along with the repeat primer these PCR fragments will basically result in two types of products: 1 ) Products that are produced by the repeat primers alone. 2) Products that contain one primer of 30 the gene family and one primer that match the repeat sequence.
  • the PCR products that contain at least one gene family primer it is necessary to perform the following steps:
  • the method can differentiate between the different DNA finge ⁇ rinting products, by performing the finge ⁇ rinting of a sample using primers labeled with different 10 fluorochromes for the repeat group of primers, and for the group of primers that match gene family conserved regions. In this way PCR products that contain different combination of primer pairs will have different wavelength emissions and thus can be easily differentiated.
  • each primer is composed of three parts. 1) At the 3' side of each ofthe primers there is a sequence that is identical to the sequence of one ofthe primary primers. 2) At the 5' sequence of each ofthe primers an additional 18-21 nucleotides 25 are added. The sequence of this additive is the same for all the primers that belong to one set that matches a first conserved region, and is different from the sequence added to the second set matching the second conserved sequence. At the 5' side ofthe other set of primers a unique and common sequence to that set is added and it is different from that ofthe first set. 3) At the most 5 " end of these secondary primers the restriction site of a non- frequent cutter 30 restriction enzyme is added.
  • the site is common to the primers that belong to one conserved region, and is different from the restriction site added to the primers that match the other conserved region ( Figure 1 ). Items 2.2 and 2.3 5' parts ofthe gene family primers can be designated together the 5 " anchor sequences.
  • Three steps of DNA finge ⁇ rinting are then performed in the following manner: a. DNA samples of compared strains are amplified using a gene family set of primers and a repeat primer. b. Secondary PCR is performed using the secondary primers to re-amplify the products of the 10 first round of PCR. Alternatively the primary PCR of item a can be omitted. c. PCR is performed on either a selected PCR product from the secondary PCR, or using the whole secondary PCR reaction, with selected combination of tertiary primers. These primer combinations and PCR steps have several advantages:
  • a PCR product that was differentially expressed is isolated the three step system enables to 15 verify the content of primers that created it.
  • a selected PCR product is purified from the gel, divided into parts that are amplified using different combination of tertiary primers.
  • the resulting PCR product teaches us about the composition of the primers that created the selected primary PCR product. For example if the tertiary PCR product was obtained only when both gene family matching tertiary primers were used, then the initial PCR product was created with both a sense 20 and an anti-sense primers.
  • the tertiary primers can also serve as sequencing primers and as amplification primers for cloning.
  • tertiary PCR with a subset ofthe tertiary primers to amplify the whole PCR mix can intensify any selected group of PCR from the secondary PCR.
  • it can be performed on the secondary PCR with either combination of tertiary primers, one gene family 25 tertiary primer and a repeat primer, a repeat primer alone, or two gene family tertiary primers, resulting in clear and strong PCR bands exhibiting the desired combination.
  • PCR DNA finge ⁇ rinting method can be modified and performed in combination with DNA digestion method.
  • DNA is digested with 30 restriction endonucleases, (generally 2) and double stranded DNA adapters are ligated to the DNA fragments to generate template DNA for PCR amplification.
  • restriction endonucleases generally 2
  • double stranded DNA adapters are ligated to the DNA fragments to generate template DNA for PCR amplification.
  • the cleavage step will be performed with only one restriction enzyme, which is not a frequent cutter, for example EcoR-I.
  • Ligation is then performed with the EcoR I adapter.
  • the PCR reaction ofthe DNA finge ⁇ rinting is then performed with one primer that matches the ligated adapter and the pool of primers that matches the conserved sequences ofthe selected gene family. 5
  • G-box G-box
  • Proteins 10 known as G-box factors bind to G-boxes in context-specific manner, mediating a wide variety of gene expression patterns.
  • G-box factors bind to G-boxes in context-specific manner, mediating a wide variety of gene expression patterns.
  • Other most ubiquitous elements in animals include the GC box the CAAT box and the response elements to cAMP, AP2, HNF-1 , glucocorticoid and retinoic acid (Table 1 ).
  • the method of invention For the first time according to the method of invention we offer a new coupling between the results obtained by using gene family conserved primers at the DNA finge ⁇ rinting and between the expression pattern analysis. Basically the method is aimed for identifying the parallel genes 20 responsible for the DNA expression pattern obtained.
  • the primer can be either a sense or an anti-sense primer according to the situation. In either case it must match a sequence in close proximity to the primer matching the conserved region and it is synthesized in such a way that its 30 3' side cannot prime PCR.
  • This side ofthe primer is either blocked by dideoxy residue or the last two nucleotides ofthe 3 " end are pu ⁇ osely designed not to complement the template in a way that it can not prime the PCR. or it is chemically modified by any other method.
  • a sense oriented primer blocking the extension reaction primed by the upstream sense primer it is designed to match a sequence in close proximity and upstream to the anti-sense primers ofthe gene family expression kit.
  • the blocking primer must be added in high concentrations.
  • the specific primer is used together with either the set of sense or anti-sense conserved primers of a gene family in order to obtain a differential new PCR product typical to one ofthe compared species.
  • micro-arrays technology becomes highly applicable the use of micro-array is another approach 5 for DNA finge ⁇ rinting. According to the method of invention the following highlights are described.
  • the application ofthe technique in micro-arrays is based on PCR products obtained for various genotypes, using a set of gene family conserved primers together with a repeat primer.
  • the probes are 10 sequenced. and are attached to the appropriate surface. Many sequences are obtained from many genotypes reflecting the sequence variability. The probes are thus distinguishing between the various genotypes, as there is variability between the genotypes not only in respect to the products size but also in respect to their base composition.
  • Probes can be obtained from different varieties or genotypes by performing PCR with primer sets 15 from two conserved regions within a gene family. Although these PCR products are more homogenous in their size between the various strains however the base composition ofthe ortholog genes might be quite different and thus will result in different degrees of annealing of the various strains DNA templates to the micro-array probes.
  • the method for characterizing the changes of tumor versus normal cells can also be performed using micro-arrays as was described before. It can be done by either using both sets of conserved primers of a gene family in the DNA finge ⁇ rinting step or one set of conserved primers and a primer that matches a repeat sequence. Either way the PCR reactions are performed over a wide range of samples to a specific cancer, including normal cells and various cancer cells of tumors at various cancer stages and grades. The various PCR products obtained from the various cells are then used as probes by any method of probes production and 5 attachment, and serve as gene family DNA finge ⁇ rinting probes in the micro-array.
  • Figure 1 Schematic presentation of three step primer design.
  • I C The first secondary set of primers composed of A 1 -A3 sequences, of C sequence which 15 represents first common sequence added to all the set of primers, and RSI the added restriction site.
  • Figure 2 The PCR DNA finge ⁇ rinting was performed on two genotypes of com and one genotype of melon, using various combinations ofthe ISSR primer and the kinase conserved primers. The products were separated on 2.5% Nussive agarose gel.
  • the sequence ofthe ISSR primer was (AC) 8 YT, where Y represents the bases C or T. Each triplet lanes contained from left 25 to right the com genotypes A and B and the melon Pi genotype.
  • Lanes 1 -3 Kinase group A sense and anti-sense primers.
  • Lanes 4-6 the ISSR primer.
  • Lanes 7-9 ISSR and kinase group B anti-sense primers.
  • Lanes 10-12 ISSR and kinase group B sense primers.
  • Lane 13 Marker PBR digested with Alul.
  • Figure 4 The PCR DNA finge ⁇ rinting was performed on the two strains of co and the two strains of melon as in figure 3. using a mix of ten base random primer and the kinase group A 15 conserved anti-sense primers. The sequence ofthe random primer was CGTCACAGAG.
  • Lane 1 the PBR Alu I digested marker. Lane 2: melon Pi. Lane 3: melon dul. Lane 4: com strain
  • Lane 5 com strain B. Marked polymo ⁇ hism appears between the species and also between the genotypes. 20
  • Figure 5 Multiple amino acid sequence alignment of the kinase catalytic domains of several genes from subdomain VIb to subdomain XI is presented. Primers core amino acid sequence match catalytic subdomain VIb (region 1 ) and catalytic subdomain IX (region 2). When the blocking primer is a sense primer it should be designed to match sequences in close proximity and 25 upstream to the anti-sense primer sequence (region 3). When the blocking primer is an anti-sense primer it should be designed to match sequence in close proximity and downstream to the sense primer sequence (region 4). The designing ofthe sequence that the blocking primers match will have major influence on the specificity ofthe blocking. For example blocking primers from the core conserved region of sub-domain VII containing the DFG motive will block many genes 30
  • region 5 primers from region 1 and 2 will block the expression of fewer genes.
  • the primers used as a prototype in our experiments are primers that match sequences within the kinase conserved domain in plants and conserved sequences within and in adjacent to the homeodomain in the plant homeobox genes.
  • the sequences ofthe primers that match the kinase 5 domain are derived from the catalytic subdomain VIb and subdomain IX, and contain the core motive ofthe amino acid HRDL/IK for the sense primers and DV/I MWS for the anti-sense primers.
  • the primers were subdivided into two groups:
  • group A contained the sense primers numbered 3001 -3006, 3013-3015. 10 3032-3033 and the anti-sense primers numbered 3051 -3054, 3060- 3062. 3081-3082.
  • Group B contained the sense primers numbered 3006-3012. 3015-3018. 3034-3035. and the anti-sense primers 3055-3057. 3063-3066, 3083-3084. Both groups contained equal number of primers that match the above described conserved sequences of various plant kinase gene subfamilies.
  • MAPK. ERK SNF related kinase. Calcium dependent kinases, and shaggy like kinases. Within each group 15 similar sequences ofthe subfamilies were grouped together.
  • the sequence ofthe sense primers match conserved regions upstream and adjacent to the homeodomain. and the anti sense primers match a conserved motives within the homeodomain ofthe homeobox genes.
  • the primers were designed to match conserved sequences in two large gene families: the Knotted like and the leucine ZIP like.
  • the 20 core amino acid motives for the sense primers included ELK and KL/RRL respectively, and the core amino acid motives for the anti-sense primers included NQRKR and QNRR respectively.
  • group A contained the sense primers numbered 3201 -3206, 3215-3221, and the anti-sense primers numbered 3251 - 3255, 3260-3264.
  • Group B contained the sense primers numbered 3207-3212. 3222-3228, and the anti-sense primers 3255-3259. 25
  • PCR analysis conditions were as follows: PCR mixtures contained 30 ng of plant genomic DNA. Reaction mixtures contained the various reagents in the following final concentration: I. 10 mM Tris-HCl (pH 8.3). 50 mM Kcl. 1.5 mM MgCl II . 0.2 uM - of each PCR primer.
  • the total volume of each PCR was 25-50 uL. 5
  • the amplification program was as follows: 7 min. at 94°C. 45 s at 94°C, 60 s at 45°C, and 2 min. at 72°C for 30- 35 cycles on an Eppendorf thermocycler. PCR products were in most cases separated on 2.5% ethidium bromide agarose gels.
  • Table 1 contains cis-regulatory sequences first those ofthe animal kingdom and then those ofthe plant kingdom.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Described herein is a PCR method of DNA fingerprinting using primers that match conserved regions of a gene family. In this manner the resulting PCR products represents genomic DNA fragments that are parts of genes. The DNA fingerprints can be those of any source, plant, animal and human. The method uses a plurality of sense primers that match a first conserved region in the gene family, and a plurality of anti-sense primers that match another conserved region in the gene family. By several modifications enabling increased spectrum of variability one set of conserved primers can be replaced by primers that match the sequences of microsatellite repeat, or cis-acting regulatory region or any other random primer. The DNA fingerprinting system can be linked to these genes expression patterns in various organs and in various biological situations.

Description

A METHOD THAT COMPARE GENOMIC SEQUENCES
FIELD OF THE INVENTION
The invention relates to the field of DNA fingerprinting, and genomic polymorphism
5
BACKGROUND OF THE INVENTION
Mapping plant genes with molecular markers has many applications for agriculture: these include marking useful genes to assist breeding, QTL (Quantitative Trait Loci) analysis and map based cloning of important genes and QTLs. Genetic markers, when closely associated with economically important traits, increase breeding efficiency by allowing earlier selection in smaller 10 populations, as compared with conventional breeding techniques. Genetic markers, when combined with genetic maps, can be employed in marker-assisted selection (MAS) strategies for yield and quality components. As genetic maps become saturated, their value to programs of plant improvement increases. The most interesting applications of molecular markers require reasonably saturated maps (markers < 1 cM apart). Following a decade of initial investment to develop 15 marker technology, even the most demanding and sophisticated applications (QTL mapping, positional cloning) are now materializing in crops such as tomato, maize, rice and lettuce, and in the model plant, Arabidopsis. The dynamic field of plant genome analysis is reaching a point where many ofthe initial expectations begin to materialize. An array of molecular marker techniques (see below) enable "seeing" the genome, and following of chromosomal segments that 20 carry a desired trait through sexual crosses. It has been demonstrated that QTLs (Quantitative Trait Loci) can be mapped, and that many valuable alleles are hidden in exotic germplasm items - even when their phenotype for that trait is inferior. Comparatives mapping of genomes from crops that do not cross with each other but are phylogenetically related will allow the mapping of agronomic traits in one crop and subsequent finding of their homologues in other crops. The 25 various marker techniques involve heritable intra-specific variation in the DNA that can be detected using various means. RFLP (Restriction Fragment Length Polymorphism) has been used since the 80's to provide genetic markers that are presumably random, co-dominant, and unlimited in number. They are detected on Southern blots using clones of DNA (genomic, cDNA, ESTs etc.) as probes. Their main disadvantages are the technical difficulty of application and high cost. 30 RAPDs (Random Amplified Polymorphic DNA) are yet another class of frequent polymorphisms. They involve differences in the banding pattern of PCR (Polymerase Chain Reaction) amplification products, applying random primers to genomic DNA samples. Their main advantage over RFLPs is the simple, non-radioactive protocol requiring less DNA. The disadvantages are that most polymorphisms are dominant (band versus no band) and thus less informative in linkage analysis, and they are less reproducible. Some investigators use RAPDs at an initial screening stage, and then develop SCARs (Sequence Characterized Amplified Regions) ofthe most 5 promising PCR products. These consist of PCR products produced by longer, more specific primers that are derived by sequencing RAPD bands. Microsatellites are repetitive sequences that consist of simple repeats of di-, tri-, or tetra-nucleotides interspersed in the genome. Single-locus markers (also called SSR-Simple Sequence Repeats) were derived from cloned microsatellites by sequencing the flanking DNA and synthesizing specific PCR primers that flank the repeats. SSRs 10 are highly polymorphic, highly reproducible, co-dominant markers that are frequently used in mapping projects, although the initial investment in developing each marker is high. Another marker system based on microsatellites was developed and termed Inter-SSR-PCR: here, the microsatellite repeat is used as the amplification primer, and a multibanded, polymoφhic pattern is obtained. Still another technique applied in mapping projects, is the AFLP (Amplified Fragment 15 Length Polymorphism). Here, subsets of restriction fragments are amplified by PCR from artificially ligated linkers, giving rise to multibanded patterns analyzed on sequencing gels. Obviously, each technique has its advantages and drawbacks, and for any botanical taxon or specific application one should select the methodology of choice.
A large percentage of actively expressed human and plant genes are members of families of DNA 20 sequences that show a high degree of sequence similarity. However, the extent of sequence sharing and the organization of family members may vary widely.
Classical gene families are distinguished by members that exhibit a high degree of sequence homology over most ofthe gene length or at least the coding DNA component, a feature which automatically identifies such sequences as being closely related evolutionary as well as 25 functionally. In some gene families there is particularly pronounced homology within specific, strongly conserved, regions ofthe genes. Such families may include genes possessing large, highly conserved sequence motifs, such as the paired box of PAX genes (Ξ390 bp) or the homeobox of the homeobox containing genes(≡180 bp). On the oher hand the sequence similarity between the full length of family members may be low to an extent that the members of some gene families 30 may not be obviously related at the DNA sequence level. Nevertheless, they encode gene products that are characterized by a common general function and the presence of short conserved sequence motifs( e.g. the DEAD box RNA helicases gene family). Gene families can occur as closely clustered genes at specific subchromosomal localization, or as widely dispersed genes.
In many instances the clustered genes possess higher degree of similarity in the coding sequence than the interspersed genes, and they also demonstrate related pattern of expression.
Gene amplification, the mechanism responsible for the evolvement of multigene families, may 5 result from large scale gene duplication events that involve the whole genome, or large chromosomal segments. Selective duplication of specific genes can occur by copy transposition events and also by tandem gene duplication.
The present invention describes a new method for obtaining DNA polymoφhism that is related to gene families, and also describes several accompanied novel technologies related to it. 10
SUMMARY OF THE INVENTION
The present invention is based on a novel method that uses in PCR a mix of primers matching conserved regions in the sequence of gene family members for DNA fingeφrinting of cells and tissues. 15
The method includes several complementary methods and techniques that complement the basic method.
These methods are in the following subjects: (1) The use ofthe gene family conserved primers alone or in combination with microsatellite sequences, or with RAPD primers both in PCR method or with AFLP. 20
(2) The use ofthe gene family conserved primers in combination with primers that match ubiquitous cis-acting regulatory elements. (3) Coupling between DNA fingeφrinting analysis and the gene family expression pattern analysis of cells and tissues. (4) The use ofthe gene family conserved primers in micro-array DNA fingeφrinting (5) Gene family DNA fingeφrinting and cancer. 25
1. The use of the gene family conserved primers in combination with primers of microsatellite sequences, or RAPD. or in combination with a DNA digestion method
The use of primers that match conserved regions in gene families as a tool for detecting polymoφhic markers in PCR may be advantageous. This is because the isolated polymoφhic 30 marker encode the sequence of a gene that contribute or is responsible for traits that vary between the compared strains. Thus, either the genetic markers themselves are responsible to economically important traits, or they might turn out to have a significant contribution to the specific trait. For example. ( 1 ) Kinase genes are associated with resistance to many plant diseases, and (2) various flavors and tastes of plant fruits are related to the cytochrome P450 enzyme system.
Another advantage of finding a specific marker that is a gene family member is that its homologous genes can be identified in other strains and species. 5
For PCR DNA fingeφrinting of gene family it is preferable to design primers that match the conserved sequences of gene families having many gene members, dispersed around the genome, and in which high degree of conservation exist between the gene members, preferably at least in two conserved domains. In both respects the kinase super family is an excellent candidate containing more then 1000 genes in each organism and a very conserved catalytic domain. The 10 homeobox gene family and the MADS box gene family are also good candidates fulfilling especially the requirement for a high degree of conservation between the various genes, while cytochrome P450 enzymes are much less conserved but are highly dispersed throughout the genome.
In addition the method might assist in finding true orthologous genes between different species in 15 the milieu of genes that belong to gene families as will be explained later. According to the method of invention, for the purpose of DNA fingeφrinting analysis, the PCR reaction contains a mix ofthe gene family conserved primers together with the primer that match any kind of frequently repeating sequences, e.g. the ISSR. The ability ofthe gene family primers to generate PCR products in combination with the ISSR primer extends the feasibility of creating a variable 20 spectrum of PCR fragments rather than just by using sense and anti-sense primers ofthe gene family. If the mix of primers contains the gene family sense and anti sense primers together with a repeat primer, these PCR fragments will be basically of three types. (1) PCR products that are produced by the repeat primers. (2) PCR products that contain on one end primer ofthe gene family and on the other end the repeat primer, thus containing part ofthe gene sequence and an 25 adjacent genomic sequence. (3) PCR products that contain both primers from the gene family and thus the sequence might be of an exon or of two parts of an exon separated by an intron. Using a mix of PCR primers that contains only the gene family sense or anti sense primers along with the repeat primer these PCR fragments will basically result in two types of products: 1 ) Products that are produced by the repeat primers alone. 2) Products that contain one primer of 30 the gene family and one primer that match the repeat sequence. In either alternative, in order to differentiate between these fragments and to utilize for the analyses the PCR products that contain at least one gene family primer, it is necessary to perform the following steps:
(1) To perform the PCR fingeφrinting separately with the repeat primer alone, and with gene family primers alone, and then with their combinations. (2) To subtract from the fingeφrint ofthe 5 combination of primers, the fingeφrint results produced with the repeat along, thus identifying the relevant fragments that contain part ofthe sequence of a gene family member.
Alternatively, it is possible to use fiorescent primers in order to differentiate between relevant and irrelevant products. The method can differentiate between the different DNA fingeφrinting products, by performing the fingeφrinting of a sample using primers labeled with different 10 fluorochromes for the repeat group of primers, and for the group of primers that match gene family conserved regions. In this way PCR products that contain different combination of primer pairs will have different wavelength emissions and thus can be easily differentiated.
Using sets of many primers also makes it impossible to perform direct sequence on product obtained as one do not know which ofthe many primers is the one used in a specific PCR 15 product. Also the cloning procedure is a little bit more complicated as the amplification of a specific band for cloning should be performed with two sets of many primers and thus might become ineffective.
In order to overcome these problems and to find the identity of a specific PCR product we offer the designing ofthe following set of primers for each ofthe gene families: 20
1. Design primary sets of primers one set that matches the sequences of a first conserved region and a second set that matches a second conserved region of a gene family.
2. Design secondary sets of primers of which each primer is composed of three parts. 1) At the 3' side of each ofthe primers there is a sequence that is identical to the sequence of one ofthe primary primers. 2) At the 5' sequence of each ofthe primers an additional 18-21 nucleotides 25 are added. The sequence of this additive is the same for all the primers that belong to one set that matches a first conserved region, and is different from the sequence added to the second set matching the second conserved sequence. At the 5' side ofthe other set of primers a unique and common sequence to that set is added and it is different from that ofthe first set. 3) At the most 5" end of these secondary primers the restriction site of a non- frequent cutter 30 restriction enzyme is added. The site is common to the primers that belong to one conserved region, and is different from the restriction site added to the primers that match the other conserved region (Figure 1 ). Items 2.2 and 2.3 5' parts ofthe gene family primers can be designated together the 5" anchor sequences.
3. Design tertiary primers and usually three primers. One having the sequence of the 5' anchor sequence ofthe first set ofthe secondary primers. A second primer having the sequence of the
5' anchor sequences ofthe second set of secondary primers, and the third primer that is the 5 repeat or RAPD primer with a third restriction site (figure 1 ).
Three steps of DNA fingeφrinting are then performed in the following manner: a. DNA samples of compared strains are amplified using a gene family set of primers and a repeat primer. b. Secondary PCR is performed using the secondary primers to re-amplify the products of the 10 first round of PCR. Alternatively the primary PCR of item a can be omitted. c. PCR is performed on either a selected PCR product from the secondary PCR, or using the whole secondary PCR reaction, with selected combination of tertiary primers. These primer combinations and PCR steps have several advantages:
If a PCR product that was differentially expressed is isolated the three step system enables to 15 verify the content of primers that created it. A selected PCR product is purified from the gel, divided into parts that are amplified using different combination of tertiary primers. The resulting PCR product teaches us about the composition of the primers that created the selected primary PCR product. For example if the tertiary PCR product was obtained only when both gene family matching tertiary primers were used, then the initial PCR product was created with both a sense 20 and an anti-sense primers. The tertiary primers can also serve as sequencing primers and as amplification primers for cloning.
Alternatively performing tertiary PCR with a subset ofthe tertiary primers to amplify the whole PCR mix can intensify any selected group of PCR from the secondary PCR. Thus, it can be performed on the secondary PCR with either combination of tertiary primers, one gene family 25 tertiary primer and a repeat primer, a repeat primer alone, or two gene family tertiary primers, resulting in clear and strong PCR bands exhibiting the desired combination.
In addition the gene family PCR DNA fingeφrinting method can be modified and performed in combination with DNA digestion method. In brief, using existing method DNA is digested with 30 restriction endonucleases, (generally 2) and double stranded DNA adapters are ligated to the DNA fragments to generate template DNA for PCR amplification. According to the method of invention a modification is suggested. The cleavage step will be performed with only one restriction enzyme, which is not a frequent cutter, for example EcoR-I.
Ligation is then performed with the EcoR I adapter. The PCR reaction ofthe DNA fingeφrinting is then performed with one primer that matches the ligated adapter and the pool of primers that matches the conserved sequences ofthe selected gene family. 5
2. The use of the gene family conserved primers in combination with primers that match ubiquitous cis-acting regulatory elements
Other sequences that are quite dispersed in the genome are the ubiquitous cis-acting regulatory elements. For example in plants the G-box (CACGTG) is such a ubiquitous element. Proteins 10 known as G-box factors bind to G-boxes in context-specific manner, mediating a wide variety of gene expression patterns. Other most ubiquitous elements in animals include the GC box the CAAT box and the response elements to cAMP, AP2, HNF-1 , glucocorticoid and retinoic acid (Table 1 ).
As the prevalence of these sequences is not arbitrary and as they are more concentrated in the 15 coding regions of various organisms' genome, thus though they are less common than the classical repetitive sequences they are still present in high numbers in the genomic coding sequences. As a result of that they will be able to amplify DNA segments with the primers that anneal to adjacent gene family conserved sequences.
Performing the PCR using these primers together with the gene family conserved primers, has a 20 major advantage that the differential PCR products between the samples will contain information that might be significant not only in regard to a differential gene but also regarding to a differential regulatory element.
It is thus technically performed in a way that a mix of primers that match regulatory elements can be used and replace one set ofthe gene family conserved primers. In the same manner the 25 secondary and tertiary PCR set of primers should follow the scheme of Figure 1 while a mix of primers ofthe regulatory elements having the same parts described in Figure 1 replaces one set of the gene family conserved region primers.
It is an option to add at the 5' end of these regulatory sequence primers the Sal I 6 nucleotides 30 restriction site, which enables two main advantages:
1. As the DNA fingeφrinting might be performed with a pool of primers there is difficulty to perform the PCR sequencing of a desired PCR product with a pool of primers. The existence of a restriction site enables the ligation of an adapter to generate template for further DNA amplification and sequencing.
2. As many ofthe primers that match the cis-regulatory sequences are too short, the six nucleotides addition elongates the primers. It was found that in many cases the Sal I sequence or part of it. complement the sequences of many cis-regulatory elements of plants, in a higher 5 degree of alignment then the sequence of many other restriction enzymes.
3. Coupling of DNA fingerprinting analysis and gene family expression pattern analysis
In a previous method we used sets of gene family conserved primers for the characterization of various gene family expression patterns in various cells and tissues in order to identify the 10 similarities and dissimilarities between the gene expression patterns under different situations. It is a universal method that for example can identify and allow measuring the expression of genes among the vast and diversified families and species of plants. According to the method, conserved pairs of primers are used in RT-PCR. Among other advantages, overcome this obstacle of species diversification. The various sets of primer pairs create gene expression kits to the various gene 15 families. Each kit is composed of carefully planned sense and anti-sense conserved primers cross-reacting with each other, and in each PCR reaction only one pair of primers is used.
For the first time according to the method of invention we offer a new coupling between the results obtained by using gene family conserved primers at the DNA fingeφrinting and between the expression pattern analysis. Basically the method is aimed for identifying the parallel genes 20 responsible for the DNA expression pattern obtained.
According to the method when a researcher finds that a specific PCR product is interesting in either DNA fingeφrinting analysis or in gene expression pattern analysis, after sequencing the product from either source, he can identify the same PCR product in the other analysis. In this way a rapid switch can be made from the RNA level to the DNA level and vice versa. 25
According to the method of invention one can identify the identical gene in the two analysis by designing a blocking primer specific to the gene identified and sequenced either from the DNA fingeφrinting or from the gene expression analysis. In principle the primer can be either a sense or an anti-sense primer according to the situation. In either case it must match a sequence in close proximity to the primer matching the conserved region and it is synthesized in such a way that its 30 3' side cannot prime PCR. This side ofthe primer is either blocked by dideoxy residue or the last two nucleotides ofthe 3" end are puφosely designed not to complement the template in a way that it can not prime the PCR. or it is chemically modified by any other method. Thus, it anneals to the DNA disturbing the DNA polymerase from elongating the DNA by physically standing in its way. For example if it is decided to use a sense oriented primer blocking the extension reaction primed by the upstream sense primer, it is designed to match a sequence in close proximity and upstream to the anti-sense primers ofthe gene family expression kit. In addition the blocking primer must be added in high concentrations. As a result of using a blocking primer designed from 5 one analysis to a gene will block the creation of a product in the other analysis, thus revealing the desired PCR product.
For example, when an interesting difference is found in gene expression analysis between the same plant in two different situations, according to the method finding the gene's band in the DNA fingeφrinting ofthe plant is possible, and will result in isolating part ofthe genomic DNA of this 10 gene. When an interesting difference is found between two strains of a specific plant in their DNA fingeφrints, it is possible to identify the PCR reaction expressing that gene, and to screen its possible differential expression at various situation in the specific plant, or in different organs of that plant. Further, differential expression pattern of this gene between the DNA fingeφrint of different strains might explain part ofthe different phenotype of these different plant strains. 15
Finally the system might help in identifying orthologous genes that are members of a gene family in different species. The following flow charts explain in more detail the necessary steps for obtaining data on differential genes and on orthologous genes between species. For identifying differential genes that are members in gene families as candidate polymoφhic markers between different species the following steps are required: 20 a. Identifying within the DNA fingeφrinting of several plant or animal species a PCR fragment of different size in one ofthe species. b. Isolating the differential PCR fragment in the species, sequencing and designing a blocking primer that matches a unique region in the gene of interest. c. Repeating the DNA fingeφrinting ofthe several plant or animal species as performed in 25 section a in the presence ofthe blocking primer, and watching for a differential blocking effect obtained by the blocking primer on the PCR product ofthe specific specie. d. Performing the gene expression PCR analysis ofthe various species, in several different plant or animal organs. e. Performing the same PCR analysis as in section d in the presence ofthe blocking primer, and 30 detecting that the expression of a differential PCR reaction between the species is now turned off or diminished in the specific species. The same steps can be performed when differential genes are looked for between compared genotypes.
For example if we examine the DNA fingeφrints of one melon genotype and of one cucumber genotype, both belonging to the cucurbitaceous family, by performing DNA 5 fingeφrinting with primer sets for a specific gene family, equal sized and different sized PCR products are obtained. When we focus on the differential products and isolate the sequence of products that appear in the melon and not in the cucumber we can design blocking primers to these genes and then investigate the expression patterns of genes in both plants in various organs. If the genes" expression pattern in the melon is such that a specific RT-PCR product 10 that is highly expressed in fruit versus leaf, is selectively blocked by the blocking primer, and there is no effect on the same RT- PCR product in the cucumber, a clue for the function of this gene in melon fruit development is suggested.
Instead of using the blocking primer, using a specific primer to a differentially expressed gene between various compared species is an alternative. According to this method at the second 15 step instead of using the blocking primer, the specific primer is used together with either the set of sense or anti-sense conserved primers of a gene family in order to obtain a differential new PCR product typical to one ofthe compared species.
Still an advantage gained by using a blocking primer, results from the fact that all the previous results obtained by PCR can be referred relative to the specific situation. 20
For identifying ortholog genes that are members of gene families between different species the following steps are required: a. Identifying within the PCR DNA fingerprinting of several plant or animal species PCR fragments having the same size b. Isolating a PCR fragment in one specie sequencing it and designing a blocking primer that 25 match the gene of interest sequence in a region of a moderately conserved motive. c. Repeating the DNA PCR fingeφrinting of several plant species as performed in section a in the presence ofthe blocking primer, and watching for a comparable effect obtained by the blocking primer on the PCR fragments in the compared plant or animal species. d. Performing the gene expression PCR analysis ofthe various species, in several different plant 30 organs for each ofthe species. e. Performing the same PCR reactions analysis as those of section d in the presence ofthe blocking primer and watching in the various species that the same PCR reactions, that are also with the same pattern of expression in the various organs or situations, are turned off or diminished.
4. The use of the gene family conserved primers in micro-array technology
As micro-arrays technology becomes highly applicable the use of micro-array is another approach 5 for DNA fingeφrinting. According to the method of invention the following highlights are described.
The application ofthe technique in micro-arrays is based on PCR products obtained for various genotypes, using a set of gene family conserved primers together with a repeat primer. The probes are 10 sequenced. and are attached to the appropriate surface. Many sequences are obtained from many genotypes reflecting the sequence variability. The probes are thus distinguishing between the various genotypes, as there is variability between the genotypes not only in respect to the products size but also in respect to their base composition.
Probes can be obtained from different varieties or genotypes by performing PCR with primer sets 15 from two conserved regions within a gene family. Although these PCR products are more homogenous in their size between the various strains however the base composition ofthe ortholog genes might be quite different and thus will result in different degrees of annealing of the various strains DNA templates to the micro-array probes.
20
5. Gene family DNA fingerprinting and cancer
The same approach of gene family DNA fingeφrinting described in the previous sections can be applied to the subject of analyzing the changes that occur in the process of carcinogenesis. It is well known that cancer formation is accompanied by chromosomal changes like translocations. rearrangements, amplifications, duplications and deletions. These changes can be monitored by 25 various direct assays that stains the chromosomes and by indirect methods including various DNA fingeφrinting assays that inspect for consistent changes in the fingeφrints. Applying the gene family DNA fingerprinting method in order to characterize different DNA fingeφrints of normal versus cancer cells of a tissue has the benefit of detecting typical and consistent changes at the genetic level that might provide an explanation based on a set of related genes. 30
This is possible for any gene family and might be more relevant to the kinases and the G protein families. The method for characterizing the changes of tumor versus normal cells can also be performed using micro-arrays as was described before. It can be done by either using both sets of conserved primers of a gene family in the DNA fingeφrinting step or one set of conserved primers and a primer that matches a repeat sequence. Either way the PCR reactions are performed over a wide range of samples to a specific cancer, including normal cells and various cancer cells of tumors at various cancer stages and grades. The various PCR products obtained from the various cells are then used as probes by any method of probes production and 5 attachment, and serve as gene family DNA fingeφrinting probes in the micro-array.
BRIEF DESCRIPTION OF THE FIGURES
Figure 1: Schematic presentation of three step primer design.
I. A Schematic presentation of a gene with a conserved domain signified by the rectangular 10 region with two highly conserved regions A and B. 1 β Primary set of conserved primers A1-A3 represents various primers sequences that match the first conserved region, and primary set of conserved primers B1 -B3 represents various primers sequences that match the second conserved region. I C The first secondary set of primers composed of A 1 -A3 sequences, of C sequence which 15 represents first common sequence added to all the set of primers, and RSI the added restriction site. The second set of secondary primers composed of B1-B3 sequences, of D sequence which represents the second common sequence added to all the set of primers, and RS2 the added restriction site. I ) Describes the gene family tertiary primers. 20
Figure 2: The PCR DNA fingeφrinting was performed on two genotypes of com and one genotype of melon, using various combinations ofthe ISSR primer and the kinase conserved primers. The products were separated on 2.5% Nussive agarose gel. The sequence ofthe ISSR primer was (AC)8YT, where Y represents the bases C or T. Each triplet lanes contained from left 25 to right the com genotypes A and B and the melon Pi genotype. Lanes 1 -3: Kinase group A sense and anti-sense primers. Lanes 4-6, the ISSR primer. Lanes 7-9: ISSR and kinase group B anti-sense primers. Lanes 10-12: ISSR and kinase group B sense primers. Lane 13: Marker PBR digested with Alul.
Clear differences between the two com genotypes and the melon genotype is demonstrated in all 30 triplets.
Differences are noticed between co genotypes A and B using the ISSR and kinase group B anti-sense primers, lanes 7 and 8. Figure 3: The PCR DNA fingerprinting was performed on two genotypes of com and two genotypes of melon, using various combinations that include ten base random primer and the homeobox gene group B conserved primers. The products were separated on 2.5% Nussive agarose gel. The sequence ofthe ten base random primer was CCACTCACCG. Each ofthe four 5 lanes contained from left to right the com genotypes A and B and the dul and Pi genotypes ofthe melon. Lanes 1 -4 homeobox group B sense and anti-sense primers. Lanes 5-8, the ten base random primer. Lanes 9-12. the ten base random primer and homeobox group B sense primers. Lanes 13-16, the ten base random primer and homeobox group B anti- sense primers. Lane 17 Marker PBR digested with Alul. 10
Differences are noticed between co genotypes A and B using the mix ofthe ten base random primer and the homeobox group B anti-sense primers, lanes 13 and 14.
Figure 4: The PCR DNA fingeφrinting was performed on the two strains of co and the two strains of melon as in figure 3. using a mix of ten base random primer and the kinase group A 15 conserved anti-sense primers. The sequence ofthe random primer was CGTCACAGAG.
Lane 1 the PBR Alu I digested marker. Lane 2: melon Pi. Lane 3: melon dul. Lane 4: com strain
A.
Lane 5: com strain B. Marked polymoφhism appears between the species and also between the genotypes. 20
Figure 5: Multiple amino acid sequence alignment of the kinase catalytic domains of several genes from subdomain VIb to subdomain XI is presented. Primers core amino acid sequence match catalytic subdomain VIb (region 1 ) and catalytic subdomain IX (region 2). When the blocking primer is a sense primer it should be designed to match sequences in close proximity and 25 upstream to the anti-sense primer sequence (region 3). When the blocking primer is an anti-sense primer it should be designed to match sequence in close proximity and downstream to the sense primer sequence (region 4). The designing ofthe sequence that the blocking primers match will have major influence on the specificity ofthe blocking. For example blocking primers from the core conserved region of sub-domain VII containing the DFG motive will block many genes 30
(region 5) while primers from region 1 and 2 will block the expression of fewer genes.
EXAMPLES Several aspects ofthe invention will now be demonstrated by way of non-limiting examples.
Materials and method
The primers used as a prototype in our experiments are primers that match sequences within the kinase conserved domain in plants and conserved sequences within and in adjacent to the homeodomain in the plant homeobox genes. The sequences ofthe primers that match the kinase 5 domain are derived from the catalytic subdomain VIb and subdomain IX, and contain the core motive ofthe amino acid HRDL/IK for the sense primers and DV/I MWS for the anti-sense primers.
The primers were subdivided into two groups:
In the kinase gene family group A contained the sense primers numbered 3001 -3006, 3013-3015. 10 3032-3033 and the anti-sense primers numbered 3051 -3054, 3060- 3062. 3081-3082. Group B contained the sense primers numbered 3006-3012. 3015-3018. 3034-3035. and the anti-sense primers 3055-3057. 3063-3066, 3083-3084. Both groups contained equal number of primers that match the above described conserved sequences of various plant kinase gene subfamilies. MAPK. ERK, SNF related kinase. Calcium dependent kinases, and shaggy like kinases. Within each group 15 similar sequences ofthe subfamilies were grouped together.
In relation to the homeobox genes the sequence ofthe sense primers match conserved regions upstream and adjacent to the homeodomain. and the anti sense primers match a conserved motives within the homeodomain ofthe homeobox genes. The primers were designed to match conserved sequences in two large gene families: the Knotted like and the leucine ZIP like. The 20 core amino acid motives for the sense primers included ELK and KL/RRL respectively, and the core amino acid motives for the anti-sense primers included NQRKR and QNRR respectively. In the homeobox gene family group A contained the sense primers numbered 3201 -3206, 3215-3221, and the anti-sense primers numbered 3251 - 3255, 3260-3264. Group B contained the sense primers numbered 3207-3212. 3222-3228, and the anti-sense primers 3255-3259. 25
3265-3266. 3274-3275. Both groups contained equal number of primers that matched the above-described conserved sequences ofthe Knotted like and the Leucine ZIP like plant homeobox gene subfamilies. Other group of primers included the ISSR dinucleotide repeat primers and ten base random primers, which are described separately at the relevant figures.
30
PCR analysis conditions were as follows: PCR mixtures contained 30 ng of plant genomic DNA. Reaction mixtures contained the various reagents in the following final concentration: I. 10 mM Tris-HCl (pH 8.3). 50 mM Kcl. 1.5 mM MgCl II . 0.2 uM - of each PCR primer.
III . 0.2 mM of each ofthe four nucleotides- A ,C ,G, -IT.
IV . 1.25 unit Taq polymerase.
The total volume of each PCR was 25-50 uL. 5
The amplification program was as follows: 7 min. at 94°C. 45 s at 94°C, 60 s at 45°C, and 2 min. at 72°C for 30- 35 cycles on an Eppendorf thermocycler. PCR products were in most cases separated on 2.5% ethidium bromide agarose gels.
Results 10
We conducted experiments that were aimed to check the variability obtained by PCR with two different sets of sense and anti-sense primers from two gene families to amplify DNA from the same plant tissue, and also to use these two sets to amplify DNA from different plants. In this way we could answer two questions: 1. Do the application of two sets of primers designed to the same core motives of a gene family result in different DNA patterns of a plant? And could one ofthe 15 two sets or both of them differentiate between different plant species and even between different plant genotypes?
In our experiments these questions were referred to in regard to the use ofthe kinase or homeobox genes primer mix A or B alone, and in combination with the microsatellite dinucleotide repeat or random ten base primers. 20
From the results, which part of them are summarized in figures 2-6 several interesting preliminary conclusions can be made regarding DNA fingeφrinting:
1. The use of a mix of primers that match two gene family conserved regions differentiate between various plant species in all instances, and in many cases between the various genotypes. 25
2. In most instances the use of a mix of either sense or anti-sense primers that match a gene family conserved region together with ISSR primer or ten base random primers resulted in larger spectrum of PCR products as compared with the mix described in item 1.
3. The use ofthe mixes described in item 2 could generally differentiate better between various genotypes then the mixes of item 1. 30
4. As was explained before for each gene family the primers were divided into two groups. These two groups of primers were basically derived from the same conserved regions in each family, and they also contained representative primers from the same subgroups of genes, though primers with similar sequences were grouped together. The mixes resulted in different DNA fingeφrints for the same plant in all o the cases examined, emphasizing the different annealing properties ofthe various primers, though all were derived from similar sequences (data no shown).
Table 1
Table 1 contains cis-regulatory sequences first those ofthe animal kingdom and then those ofthe plant kingdom.
Figure imgf000018_0001
Figure imgf000019_0001
15

Claims

CLAIMSWhat is claimed:
1. A PCR based method of DNA fingeφrinting using primers that match conserved regions of a gene family.
2. The method of Claim 1, wherein the DNA that is used is that of a plant or an animal or human. 5
3. A method according to claim 1 or 2 further characterized in that a plurality of sense primers are used that match a first conserved region ofthe gene family, and a plurality of anti-sense primers are used that match another conserved region ofthe gene family
4. A method according to claim 3 wherein the gene family is a kinase gene family.
5. A method according to claim 3 wherein the gene family is a homeobox gene family. 10
6. A method according to claim 3 wherein the gene family is the MADS box gene family.
7. A method according to claim 3 wherein the gene family is the phosphatase gene family.
8. A method according to claim 3 wherein the gene family is the cytochrome P450 gene family.
9. A method according to claim 3 wherein the gene family is the steroid receptor super family.
10. A method according to claim 3 wherein the gene family is the G protein coupled receptor 15 gene family.
11. A method according to any one of claims 1-10 further characterized in that it uses a microsatellite, or random primer.
12. A method according to claims 1 or 2, wherein sense primers that match a conserved region within a gene family are used in a mix together with a microsatellite, or random primers. 20
13. A method according to claims 1 or 2 wherein anti-sense primers that match a conserved region in a gene family are used in a mix with a microsatellite. or random primers.
14. A method according to any one ofthe previous claims wherein a mix ofthe microsatellite alone, a mix ofthe gene family conserved primers alone, and the mix of either claims 1 1 or 12 or 13 is used. 25
15. A method according to any one of claims 1 to 13 wherein the microsatellite group of primers are labeled with a first fluorochrome, and the primers that match the gene family conserved region are labeled with a different fluorochrome.
16. A method that uses a common set of primers that belong to a gene family according to claim
3, both for gene expression analysis of any cell and tissue, and for the performance of DNA 30 fϊngeφrinting analysis ofthe same organism in order that one will reveal the function of a gene that produced differential product between genotypes by the DNA fingeφrinting analysis.
17. A method according to claim 16 wherein the organism is a plant or an animal or human.
18. A method according to any one of claims 1 -17 for identifying PCR reactions that contain a gene of interest in a gene family RT-PCR expression analysis, and in a PCR of a DNA fingeφrinting analysis.
19. A method for designing a blocking primer comprising steps of: a. Sequencing a gene of interest by DNA fingeφrinting or by gene expression analysis. 5 b. Providing a blocking primer to match an internal sequence between a sequence of a sense primer and a sequence of an anti-sense primer of a gene family. c. The 3" ofthe blocking primer cannot prime PCR as this side ofthe primer is either blocked by dideoxy residue, or it is chemically modified by any other method. d. Carrying out PCR reactions using the blocking primer in a concentrated form, preferably at a 10 concentration ten times and higher, than the concentration ofthe other primers.
20. The method of Claim 19 wherein the primer is designed to block transcription initiated by the sense primer and the blocking primer is a sense primer that matches a sequence in close proximity and upstream to the sequence that matches the gene family anti-sense primers.
21. The method of Claim 19 wherein the primer is designed to block transcription initiated by the 15 anti-sense primer, and the blocking primer is an anti-sense primer that matches a sequence in close proximity and downstream to the sequence that matches the gene family sense primers.
22. A method according to claims 16-21 for identifying PCR reactions that contain a gene of interest, comprising steps of: a. Performing the first step PCR of gene expression analysis and of DNA fingeφrinting analysis of a cell or tissue, b. Performing PCR of gene expression 20 analysis and of DNA fingeφrinting in the presence of a blocking primer that diminishes, abolishes or significantly reduces the size ofthe PCR reactions containing the gene of interest in a cell or tissue.
23. A method for identifying polymoφhic markers that are genes in gene families comprising a step of performing the method described in any one of claims 1-15 on two or more different 25 species or genotypes and identifying differential PCR products in the DNA fingeφrints.
24. A method for identifying the function of the gene obtained in claim 23 comprising a step of coupling ofthe gene identified in the DNA fingeφrinting and the gene expression analysis method of any one of claims 16-22.
25. A method for identifying differential genes that are members in gene families between different 30 strains or species, and the method comprising steps of: a. Identifying within the DNA fingeφrints of several plant or animal genotypes, or species, a PCR fragment that is of a different size only in one species. b. Isolating a differential PCR fragment in the one species, sequencing it and designing a blocking primer according to claim 19-21 that matches a unique region in the gene of interest. c. Repeating the DNA fingeφrinting of a plurality of plant or animal strains or species according to step a in the presence of a blocking primer, and detecting a differential blocking 5 effect obtained by the blocking primer on the PCR amplified fragments ofthe specific specie. d. Performing gene expression PCR analysis, in several organs of different plants or animals. e. Performing the PCR reaction analysis according to step d in the presence ofthe blocking primer and determining whether the expression of a differential DNA pattern is turned off or diminished, or whether the expression of a DNA pattern previously expressed at about the 10 same level is selectively blocked in the species.
26. A method for identifying orthologous genes that are members of a gene family between different genotypes or species, the method comprising steps of: a. Identifying within the PCR DNA fingeφrints of several plant genotypes or species. PCR fragments ofthe same size. 15 b. Isolating PCR amplified product in a plant, sequencing it and designing a blocking primer according to claims 19-21 that matches the sequence of a unique region in the gene of interest. c. Repeating the DNA fingeφrinting of several plant genotypes or species according to step a in the presence ofthe blocking primer, and determining whether an effect is obtained by the 20 blocking primer on the PCR fragments ofthe various samples. d. Performing the gene expression PCR analysis ofthe various species, in a plurality of organs for each ofthe species or genotypes, and e. Performing the PCR of step d in the presence ofthe blocking primer and determining whether the products that have the same pattern of expression in the various species are 25 turned off or diminished.
27. A method for identifying differential genes that are members of gene families between different genotypes, that uses the set of conserved primers together with a specific primer of a differentially identified gene, and react the specific primer with either the set of conserved primers in order to obtain a differential new PCR product between the samples. 30
28. A method for DNA fingeφrinting using primers, or a mix of primers that match the sequence of ubiquitous cis-acting regulatory elements, and the cis-acting primers replaces the repeat or RAPD primers of claims 1 1-15.
29. A method according to claim 28 using for the PCR DNA fingeφrinting a mix of cis-acting regulatory element primers, such that each primer matches a sequence listed in table 1.
30. A method for the DNA fingeφrinting using primers each of which matches one ofthe sequences of claims 28-29 in a direct repeat form of 2 or more repetitions.
31. A method according to claims 28-29 using a mix of primers that are at least 10 nucleotides 5 long, wherein each primer is comprised of a random hexamer sequences at the 5" end, and continues with a sequence that matches a ubiquitous cis-acting regulatory element at the 3'.
32. A method for DNA fingeφrinting wherein primer sequences include a site of a non frequent cutter non blunt ended restriction enzyme such that a. The restriction site is positioned at the oligo-primers 5' end. 10 b. Within each gene family the sense and anti-sense primers contain different restriction sites but all ofthe sense primers contain the same restriction site and all ofthe anti-sense primers contain the same restriction site, and the repeat or RAPD or cis-acting pools of primers also contain a different restriction site, thus for example all the sense primers ofthe kinase gene family contain the restriction site of EcoRI, and all the anti- sense primers of 15 the kinase gene family contain the restriction site of Hindlll, and the repeat contain the restriction site of Sal I.
33. A method according to claim 32 using a set of dsDNA adapters being 20-30 nucleotides long, that contains at one end restriction sites equivalent to the gene family oligo-primers restriction site, and for each restriction site sequence there is only one adapter with a 20 sequence that is different from that ofthe other adapters.
34. A method according to claim 33 comprising steps of performing a DNA fingeφrinting with sets of sense and anti-sense primers, identifying the mix of primers that resulted in a specific DNA fingeφrinting PCR product, and sequencing it using a primer with a sequence complementary to the adapters' sequence, the method comprising steps of: 25 a. Isolating the PCR product from a gel and dividing it into three micro-tubes. b. Digesting the PCR product in each micro-tube with a different combination of restriction enzymes i.e. EcoRI alone, Hindlll alone and EcoRI + Hindlll. c. Ligating to restriction products in each tube the matching adapters, i.e. the first micro- tube the adapters that match EcoRI restriction site, the second tube the adapters to 30
Hindlll. and the third tube the adapters to EcoRI + Hindlll. d. Performing PCR using primers with a sequence complementary to the adapters' sequence, and observing in which micro-tube the expected size PCR product is produced. ~n
35. A method for DNA fingeφrinting comprising steps of: a. Performing DNA digestion with a restriction enzyme, which is a 6 nucleotides cutter and not frequent cutter, for example EcoR I. b. Ligating a matching adapter to the cleaved restriction site ofthe genomic DNA fragments. c. Performing PCR with a primer that matches the ligated adapter and a pool of primers that 5 matches the conserved sequence within a selected gene family
36. A method according to any ofthe previous claims for identifying genes that belong to a gene family that might be involved in cancer formation, the method comprising steps of: a. Using a set of conserved primers of a gene family for DNA fingeφrinting and for determining gene family expression patterns in normal and cancerous tissues, 10 b. Detecting correlation between gene expression changes and gene deletion, amplification or rearrangements as determined by the DNA fingeφrinting assay.
37. A method according to claims 3 or 16 that uses as common sets of primers that match gene family conserved regions for both gene expression analysis and DNA fingeφrinting of an organisms, and the primers are listed in the tables ofthe following patent applications: 15 Israeli applications 130932, 133684, 135889, and the PCT WO9934016
38. A method for DNA fingeφrinting wherein primer sets and their sequences are the following: a. Primary sets of primers sense and anti-sense, that match the two gene family conserved regions, and a repeat or RAPD or a regulatory region matching primer. 20 b. Secondary sets of primers to the gene family sense and anti-sense primers of which each primer contains three parts: bl) At the 3' side of each primer there is a sequence that is identical to the sequence of one ofthe primary primers, and the whole set ofthe secondary primers at their 3' side have the same sequences as the whole set ofthe primary primers. 25 b2) For each ofthe primers at its 5" side an additional 18-21 nucleotides are added, and the sequence of this additive is the same for one set that matches one conserved region, but is different from the sequence added to the second set that matches a second conserved region. b3) At the most 5' end ofthe secondary primers the restriction site of a non frequent cutter restriction enzyme is added and the site is common to the primers that belong to one conserved 30 region, and is different from the restriction site added to the primers that match the other conserved region. c. Designing tertiary primers and basically three primers of which one has the same sequence as the common anchor sequence of the first set of gene family secondary primers including their restriction site, the second has the sequence ofthe common anchor sequence ofthe second set of secondary primers including their restriction site, and the third which is the repeat or cis-acting regulatory region primer. 5
39. A method according to claim 38 comprising steps of performing a PCR DNA fingeφrinting with sets of sense and anti-sense primers of a gene family, or one of these two sets together with a repeat, or RAPD, or cis-acting regulatory element matching primer, that result in a specific DNA fingerprinting PCR product that can be sequenced or easily amplified, the method comprising steps of: 10 a. Performing on the compared samples the PCR with any two ofthe three primary primer sets. b. Isolating a differential PCR product from a gel. c. Performing on it the secondary PCR with the secondary sets of primers matching the sequence of the primary sets of primers. d. Dividing the PCR product into three micro-tubes and performing PCR with two ofthe 15 tertiary primers that match the common sequence of secondary primers, in a first micro-tube the first primer, in the second micro-tube the second primer, and in the third micro-tube both first and second primers, and observing in which micro-tube the expected size PCR product is produced.
40. The method of claim39 that omit the primary set of primers, and the primary PCR of item 39a 20 from the DNA fingeφrinting. and also enable the performance of the PCR of item 39d on the whole PCR products obtained in item 39c.
41. A method according to any one of claims 1.2. 4-10 and 37 that uses gene family conserved regions as DNA probes in micro-arrays for DNA fingeφrinting.
42. A method according to claim 41 that uses as imprinted probes in DNA fingeφrinting 25 micro-arrays, the sequence of PCR products obtained from various varieties or genotypes, or from normal and cancerous cells with conserved primers from one conserved region of a gene family together with a repeat primer.
43. A method according to claim 41 that uses as probes in DNA fingeφrinting micro-arrays the sequence of PCR products obtained from different varieties or strains or from normal and 30 cancerous cells with primer sets from two conserved regions within a gene family.
PCT/IL2001/000151 2000-02-22 2001-02-19 A method that compares genomic sequences WO2001062967A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001234048A AU2001234048A1 (en) 2000-02-22 2001-02-19 A method that compares genomic sequences

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
IL134660 2000-02-22
IL13466000 2000-02-22
IL13712400 2000-07-02
IL137124 2000-07-02
IL13795900 2000-08-20
IL137959 2000-08-20

Publications (2)

Publication Number Publication Date
WO2001062967A2 true WO2001062967A2 (en) 2001-08-30
WO2001062967A3 WO2001062967A3 (en) 2002-08-01

Family

ID=27271923

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2001/000151 WO2001062967A2 (en) 2000-02-22 2001-02-19 A method that compares genomic sequences

Country Status (2)

Country Link
AU (1) AU2001234048A1 (en)
WO (1) WO2001062967A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110172525A (en) * 2019-06-26 2019-08-27 广西壮族自治区林业科学研究院 Forest difference expression gene SSR primer sets and polymorphism SSR marker development approach

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5876932A (en) * 1995-05-19 1999-03-02 Max-Planc-Gesellschaft Zur Forderung Der Wissenschaften E V. Berlin Method for gene expression analysis
WO1999034016A2 (en) * 1997-12-29 1999-07-08 Genena Ltd. A method for identifying and characterizing cells and tissues
FR2779154A1 (en) * 1998-05-27 1999-12-03 Bio Merieux PROCESS FOR AMPLIFYING AT LEAST ONE PARTICULAR NUCLEOTIDE SEQUENCE AND PRIMERS FOR IMPLEMENTATION

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5876932A (en) * 1995-05-19 1999-03-02 Max-Planc-Gesellschaft Zur Forderung Der Wissenschaften E V. Berlin Method for gene expression analysis
WO1999034016A2 (en) * 1997-12-29 1999-07-08 Genena Ltd. A method for identifying and characterizing cells and tissues
FR2779154A1 (en) * 1998-05-27 1999-12-03 Bio Merieux PROCESS FOR AMPLIFYING AT LEAST ONE PARTICULAR NUCLEOTIDE SEQUENCE AND PRIMERS FOR IMPLEMENTATION

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LOPEZ-NIETO C E AND NIGAM S K: "Selective amplificationof protein-coding regions of large sets of genes using statistically designed primer sets" NATURE BIOTECHNOLOGY, NATURE PUBLISHING, US, vol. 14, 1 July 1996 (1996-07-01), pages 857-861, XP002090066 ISSN: 1087-0156 *
STONE B AND WHARTON W: "TARGETED RNA FINGERPRINTING: THE CLONING OF DIFFERENTIALLY-EXPRESSED cDNA FRAGMENTS ENRICHED FOR MEMBERS OF THE ZINC FINGER GENE FAMILY" NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, SURREY, GB, vol. 13, no. 22, 1994, pages 2612-2618, XP002076945 ISSN: 0305-1048 *
VIDER B ET AL: "Human colorectal carcinogenesis is associated with deregulation of homeobox gene expression" BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, ACADEMIC PRESS INC. ORLANDO, FL, US, vol. 232, no. 3, March 1997 (1997-03), pages 742-748, XP002104685 ISSN: 0006-291X *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110172525A (en) * 2019-06-26 2019-08-27 广西壮族自治区林业科学研究院 Forest difference expression gene SSR primer sets and polymorphism SSR marker development approach

Also Published As

Publication number Publication date
WO2001062967A3 (en) 2002-08-01
AU2001234048A1 (en) 2001-09-03

Similar Documents

Publication Publication Date Title
Singh et al. CAAT box-derived polymorphism (CBDP): a novel promoter-targeted molecular marker for plants
Agarwal et al. Advances in molecular marker techniques and their applications in plant sciences
Idrees et al. Molecular markers in plants for analysis of genetic diversity: a review
Saha et al. Simple sequence repeats as useful resources to study transcribed genes of cotton
JP7047373B2 (en) Next-generation sequencer primer and its manufacturing method, DNA library using next-generation sequencer primer, its manufacturing method, and genomic DNA analysis method using the DNA library.
İpek et al. Transcriptome-based SNP discovery by GBS and the construction of a genetic map for olive
JP7343264B2 (en) Method for preparing a DNA library and method for analyzing genomic DNA using a DNA library
KR100781205B1 (en) SSR primer isolated from Perilla spp. and use thereof
JP4739222B2 (en) CMV Resistance-Associated Molecular Marker and Its Use (Molecular Marker Associated With CMVR ResistanceAndUseThereof)
KR100781206B1 (en) SSR primer isolated from Sesamum sp. and use thereof
KR20160053420A (en) SNP molecular markers associated with the pungency of chili and uses thereof
Kuras et al. Application of five DNA marker techniques to distinguish between five apple (Malus× domestica Borkh.) cultivars and their sports
JP2007521003A (en) Splice site AFLP
Ahn et al. Microsatellite marker information from high-throughput next-generation sequence data of Capsicum annuum varieties Mandarin and Blackcluster
WO2001062967A2 (en) A method that compares genomic sequences
JP2022044587A (en) Method of making dna library and genome dna analyzing method using dna library
Falistocco et al. Cytogenetic characterization by in situ hybridization techniques and molecular analysis of 5S rRNA genes of the European hazelnut (Corylus avellana)
JP7056012B2 (en) Random primer set and method for preparing a DNA library using it
JP2010154802A (en) Method for identifying species of plant of genus chrysanthemum
KR100769367B1 (en) Ssr primer derived from common millet and use thereof
KR102524050B1 (en) Molecular marker for discriminating sex of Actinidia arguta and uses thereof
Ceccarelli et al. Characterization of the chromosome complement of Helianthus annuus by in situ hybridization of a tandemly repeated DNA sequence
JP3836451B2 (en) Genotyping method using a DNA marker located near the pungent locus of Capsicum plants, primers used for detecting the DNA marker, and genotyping method using the same
Undal et al. The methods of molecular markers in genome analysis: a brief biotechnological approach
Prasad et al. Advancement in molecular tools of plant population genetics

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP