AU750707B2 - Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis - Google Patents

Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis Download PDF

Info

Publication number
AU750707B2
AU750707B2 AU70191/98A AU7019198A AU750707B2 AU 750707 B2 AU750707 B2 AU 750707B2 AU 70191/98 A AU70191/98 A AU 70191/98A AU 7019198 A AU7019198 A AU 7019198A AU 750707 B2 AU750707 B2 AU 750707B2
Authority
AU
Australia
Prior art keywords
nucleic acid
seq
sequence
set forth
sequence set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU70191/98A
Other versions
AU750707C (en
AU7019198A (en
Inventor
Ljerka Kunst
Anthony A. Millar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of British Columbia
Original Assignee
University of British Columbia
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of British Columbia filed Critical University of British Columbia
Publication of AU7019198A publication Critical patent/AU7019198A/en
Application granted granted Critical
Publication of AU750707B2 publication Critical patent/AU750707B2/en
Publication of AU750707C publication Critical patent/AU750707C/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8247Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8222Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • C12N15/8223Vegetative tissue-specific promoters
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8287Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for fertility modification, e.g. apomixis
    • C12N15/8289Male sterility
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Organic Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Plant Pathology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Cell Biology (AREA)
  • Oil, Petroleum & Natural Gas (AREA)
  • Nutrition Science (AREA)
  • Medicinal Chemistry (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Fats And Perfumes (AREA)
  • Enzymes And Modification Thereof (AREA)

Description

WO 98/46766 PCT/CA98/00343 NUCLEIC ACIDS ENCODING A PLANT ENZYME INVOLVED IN VERY LONG CHAIN FATTY ACID SYNTHESIS Technical Field This invention relates to DNA molecules cloned from plants and methods of using such DNA molecules to produce transgenic plants with altered fatty acid composition.
Background Epicuticular waxes form the outermost layer of the aerial portion of the plant and are thus the first line of interaction between the plant and its environment. The physical properties of this wax layer protect the plant from numerous environmental stresses. For example, the hydrophobic nature of wax prevents dehydration (nonstomatal water loss) and aids in shedding rainwater. The reflective nature of wax protects the plant against UV radiation (Reicosky and Hanover, 1978). Waxes are also known to protect against acid rain (Percy and Baker, 1990) and, because they are a good solvent for organic pollutants, they are able to impede the uptake of aqueous foliar sprays (Schreiber and Schonherr, 1992).
Furthermore, surface waxes protect plants from bacterial and fungal (Jenks et al., 1994) pathogens ad play a role in plant-insect interactions (Eigenbrode and Espelie, 1995). Recently it has been shown that some of the compounds found in epicuticular waxes are also present in the tryphine layer of pollen grains (Preuss et al., 1993). Without these compounds the tryphine layer erodes, resulting in pollen that is unable to function causing male sterility.
Epicuticular waxes are composed of long chain, hydrophobic compounds all derived from saturated very long chain fatty acids (VLCFAs), that are synthesized within and then secreted from the epidermis. VLCFAs are defined as those fatty acids whose chain length is 20 or more carbons long.
The lengths will vary from plant to plant, but typically, the wax VLCFAs are approximately 26-34 carbon long. These VLCFAs are synthesized by a microsomal fatty acid elongation (FAE) system by sequential additions of C2 moieties from malonyl-coenzyme A (CoA) to pre-existing fatty acids derived from the de novo fatty acid synthesis (FAS) pathway of the plastid. Analogous to de novo FAS it is thought that each cycle of FAE involves four enzymatic reactions; condensation of malonyl-CoA with a log chain acyl-CoA, reduction to P-hydroxyacyl-CoA, dehydration to an enoyl-CoA and (4) reduction of the enoyl-CoA, resulting in the elongated acyl-CoA (Fehling and Mukherjee, 1991).
Together these four activities are termed the elongase (von Wettstein-Knowles, 1982). VLCFAs in the epidermis are then converted to the other wax components through a number of pathways consisting of multienzyme complexes. For example VLCFAs are converted to aldehydes by fatty acyl-CoA reductase (Kolattukudy, 1971). These aldehydes can either be reduced by aldehyde reductase to produce primary alcohols (Kolattukudy, 1971), or decarbonylated by an aldehyde decarbonylase to produce odd chained alkanes (Cheesbrough and Kolattukudy, 1984). Alkanes can then undergo oxidation to form firstly WO 98/46766 PCT/CA98/00343 secondary alcohols and then ketones (for review see Post-Beittenmiller, 1996). Very little is known at the molecular level about the components that are involved in the biosynthesis of wax specific compounds and their secretion onto the plant surface. Genetic studies have shown that there are a large number of genes involved in these processes (for example, 22 loci have been reported in Arabidopsis, 84 in barley). However only a few of these genes have been isolated so far and the biochemical role of their gene products remains unknown (Lemieux, 1996).
In addition to being made in the epidermal cells, VLCFAs also accumulate in the seed oil of some plant species. To date, developing seeds have been the primary focus of research into VLCFA biosynthesis. In seeds VLCFAs are incorporated into triacylglyerols (TAGs), as in the Brassicaceae, or into wax esters, as in Jojoba. The seed VLCFAs include the agronomically important erucic acid (C22:1), with oils containing this fatty acid used in the manufacture of lubricants, nylon, cosmetics, pharmaceuticals and plasticisers (Battey et al., 1989); Johnston and Fritz, 1989). Conversely, VLCFAs have detrimental nutritional effects and are therefore undesirable in edible oils. This has led to the breeding of Canola rapeseed varieties that are almost devoid of VLCFAs (Stefansson et al., 1961).
The seeds of Arabidopsis contain approximately 28% [w/wt of total fatty acids of VLCFAs, eicosenoic acid (20:1) being the predominant VLCFA (21% of wt/wt of total FA). To identify the gene products that are involved in the synthesis of seed VLCFAs and establish the VLCFA biosynthetic pathway, several groups performed mutational analysis and screened for seed that had reduced VLCFA content. Each group independently identified the FATTYACID ELONGATIONI gene (FAE1; James and Dooner, 1990; Kunst et al., 1992; Lemieux et al., 1990). A mutation at this locus resulted in reduced VLCFA levels 1% wt/wt of total FA) in the seed. Several other mutations that were-non-allelic to FAE1 were also isolated. However, these mutations had a less pronounced effect in that VLCFAs still constituted 6.7% (wt/wt of total FA) of the seed fatty acid (Katavic et al., 1995; Kunst et al., 1992). Thus, despite the fact that four enzymatic activities are required for each elongation step, the FAE1 gene was the only one found by mutant analysis that resulted in almost complete loss of VLCFA synthesis in the seed.
The Arabidopsis FAE1 gene was subsequently cloned (James et al., 1995; WO 96/13582), and showed homology to three condensing enzymes: chalcone synthase, stilbene synthase and P-ketoacyl- [acyl carrier protein] synthase III (17 amino acids were identical to a 50 amino acid region of a consensus sequence for condensing enzymes). Based on this homology it was proposed that FAE1 encodes a P-ketoacyl-coenzyme A synthase (KCS), the condensing enzyme which catalyzes the first reaction of the microsomal fatty acid elongation system (James et al., 1995). As determined by Northern analysis, the FAE1 gene is expressed in seeds of Arabidopsis, but is absent from leaves (James et al., 1995). This result is consistent with the fact that thefael mutation affects only the fatty acid composition of the developing seed, having no pleiotropic effects on fatty acid composition of the vegetative, or floral parts of the plant. Thus, FAE1 is regarded as a seed-specific condensing enzyme.
Recently a cDNA from Jojoba seeds involved in the syntheses of VLCFAs has been isolated (Lassner et al., 1996; WO 95/15387). The protein encoded by this cDNA showed high homology to WO 98/46766 PCT/CA98/00343 FAE1 (52% amino acid identity), and biochemical analysis demonstrated that it has a KCS activity.
Using Jojoba KCS cDNA, Lassner et al. (1996) were able to complement the mutation in a Canola variety of Brassica napus, restoring a low erucic acid rapeseed line to a line that contained higher levels of VLCFAs. This suggests that in Canola, the mutation is in the structural gene encoding KCS, or a gene affecting KCS activity. Thus, both in Arabidopsis and Brassica napus, the mutations that result in the abolition of VLCFA synthesis seem to affect the condensing enzyme.
If four enzyme activities are necessary for an elongation step, and FAE1 and Jojoba-KCS only encode the KCS activity, one might expect to find other complementation groups that result in very low levels of VLCFAs synthesis. Because these complementation groups were not found in mutation screenings, Millar and Kunst (1997) have hypothesized that these three activities are not seed specific, but ubiquitously present throughout the plant and shared with other FAE systems involved in VLCFA formation including wax biosynthesis. To test this FAE1 was ecotopically expressed in yeast and in tissues of Arabidopsis and tobacco, where significant quantities of VLCFAs are not found. Expression of FAE1 alone in these cells resulted in the biosynthesis and accumulation of VLCFAs. This demonstrated that the condensing enzyme is the pivotal control point of the elongase, controlling not only the amounts of VLCFAs produced, but also their chain lengths. In contrast, it appears that the other three enzyme activities of the elongase are found ubiquitously throughout the plant, are not rate limiting and play no role in the control of VLCFA synthesis. The ability of yeast containing FAE1 to synthesize VLCFAs suggests that the expression, and the acyl chain length specificity of the condensing enzyme, along with the apparent broad specificities of the other three FAE activities, may be universal eukaryotic mechanism for regulating the amounts and acyl chain length of VLCFAs synthesized in any given cell (Millar and Kunst, 1997).
Thus, considering the central role of the condensing enzyme for VLCFA synthesis, the isolation of genes encoding condensing enzymes involved in the production of wax specific VLCFAs would facilitate the modification of wax composition through genetic engineering. Furthermore, since the majority of wax components are derived from VLCFAs, the availability of such genes would offer the potential to modify the wax load itself. This offers the potential to modify the susceptibility of plants to environmental stresses such as ultraviolet light, heat and drought, as well as the ability of plants to withstand insects and pathogens. The present invention is directed towards nucleic acids that encode condensing enzymes for VLCFA synthesis.
Summary of the Invention The present invention provides nucleic acids (cDNAs and genomic clones) that encode a key enzyme in the synthesis of VLCFAs in plant epidermal cells. The activity of this enzyme is referred to as very long chain fatty acid elongase; the activity is required for synthesis of VLCFAs of greater than 24 carbons in length. It is shown that co-suppression of the CUT1 gene in plants can disrupt VLCFA synthesis which results in plants having none of the protective wax usually found on stem surfaces. In addition, it is shown that such plants are conditionally male sterile: when grown under normal humidity, the plants are male sterile, but fertility can be restored by growth in an elevated humidity environment.
WO 98/46766 PCT/CA98/00343 The invention thus provides the CUT cDNA and gene nucleotide sequences ("CUT1 nucleic acids") and the amino acid sequence of the CUT1 protein. In one embodiment, the CUTI nucleic acids disclosed are from Arabidopsis thaliana. The open reading frame of the Arabidopsis CUT1 cDNA molecule encodes an enzyme of 497 amino acids which catalyzes the addition of 2C units to pre-existing C24 or longer fatty acids.
Also encompassed within the scope of this invention are transformation vectors that include at least a portion of the CUT nucleic acid molecules. Such vectors may be transformed into plants to produce transgenic plants with modified VLCFA compositions (relative to non-transgenic plants of the same species). Depending on the particular sequences incorporated into the vector, transformation with the CUT1 cDNA, gene or derivatives thereof can be used to modify agronomically important traits, including the presence, composition and thickness of epicuticular wax layers on leaves and stems, seed coat fatty acids, seed oil composition and male sterility. Typically, such vectors include regulatory sequences, such as promoters, operably linked to the CUT1 open reading frame or a derivative of the CUT1 nucleic acids. For example, VLCFA synthesis may be altered by introducing into a plant a transformation vector that includes a sense or antisense version of the CUTI cDNA. Transgenic plants having modified VLCFA compositions and which are transformed with such recombinant transformation vectors are also provided by this invention.
In one aspect of the invention, transformation with-sense or antisense versions of the CUT1 nucleic acids may be used to produce plants having modified epicuticular wax layers on the aerial parts of the plants, such as the leaves and stems. A modified epicuticular wax layer may be modified in physical respects, such as thickness of the wax layer, or in composition. Because these layers play a role in the ability of plants to resist environmental stresses, such as drought and ultraviolet light, as well as insects and pathogens, transformation with vectors including forms of the CUT1 nucleic acids may be used to produce plants with particular agronomic advantages. Producing plants with modified epicuticular wax composition may be achieved by introducing into the plants a vector in which the CUTI nucleic acid (or a derivative thereof) is operably linked to a promoter that directs expression of the open reading frame in the epidermal cells. The CaMV 35S promoter and the endogenous CUT1 gene promoter are examples of regulatory sequences that may be suitable for this purpose.
Agronomically important traits in addition to wax composition may also be modified using the CUT1 nucleic acids of the present invention. For example, the fatty acid composition of the seed coat and the fatty acid composition of seed oil may be modified by transforming plants with the CUT1 cDNA or derivatives thereof. Preferably, where it is desired to modify aspects of seed VLCFA composition, the introduced CUTI nucleic acid sequence will be operably linked to a promoter known to direct expression in seed tissues. Seed-specific promoters include the napin promoter of Brassica napus (Lee et al., 1991). In addition, transformation with the CUT1 nucleic acids or derivatives thereof may be used to disrupt VLCFA synthesis in pollen, resulting in conditionally male sterile plants. Such plants are useful in plant breeding programs.
While the invention provides CUTI-encoding nucleic acids from Arabidopsis, it additionally WO 98/46766 PCT/CA98/00343 encompasses homologs, orthologs and variants and derivatives of these sequences, as well as homologs, orthologs and variants of the CUT1 polypeptide sequence. Thus, in one aspect of the invention, nucleic acid molecules that comprise specified regions of these sequences are provided. Exemplary of such nucleic acid molecules are oligonucleotides that are useful as probes or primers to detect and amplify CUTIencoding nucleic acids from other plant species. Such oligonucleotides are useful as hybridization probes or PCR primers, and typically comprise at least 15 consecutive bases of the disclosed CUT1 nucleic acid sequences. In other embodiments, such oligonucleotides comprise longer regions of the disclosed CUT1 sequences, such as at least 20, 25 or 30 consecutive nucleotides.
In another aspect, the invention provides compositions and methods for isolating nucleic acid sequences that encode enzymes having CUT1 activity from other plant species. Typically, such methods involve hybridizing probes or primers derived from the disclosed Arabidopsis sequences to nucleic acids obtained or derived from such other plant species.
Homologous and orthologous sequences to Arabidopsis CUT] nucleic acid and CUT1 amino acid sequences share key functional and structural characteristics with the disclosed Arabidopsis sequences.
Functionally, such sequences encode (or comprise) a polypeptide that catalyzes the very long chain fatty acid elongation as described above. Structurally, such sequences share a specified structural relationship with the disclosed sequences. By way of example, in certain embodiments, homologous amino acid sequences have at least 70% sequence identity with the Arabidopsis CUT1 amino acid sequence. In other embodiments, homologous nucleic acid sequences hybridize under stringent conditions to the disclosed Arabidopsis CUTI nucleic acid sequences.
Another aspect of the invention relates to the purified CUT1 enzyme itself. Having provided nucleic acid molecules that encode this enzyme, the invention also facilitates the expression of CUT1 enzyme in heterologous systems, including E. coli, yeast and baculovirus expression systems. Thus, the invention permits the large scale production of the enzyme for agricultural and other applications.
In another aspect of the invention the promoter sequence of the CUT1 gene is disclosed. This promoter sequence confers epidermis-specific expression, and may be used to express a variety of nucleic acids in an epidermis-specific manner.
Detailed Description of the Invention I. Definitions Unless otherwise noted, technical terms are used according to conventional usage. Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8). The nomenclature for DNA bases as set forth at 37 CFR 1.822 and the standard three letter codes for amino acid residues are used herein.
In order to facilitate review of the various embodiments of the invention, the following WO 98/46766 PCT/CA98/00343 definitions of terms are provided: CUT1 protein: The defining functional characteristic of a CUT1 protein is its enzymatic activity, specifically its very long chain fatty acid elongase activity. This activity is manifested as the catalysis of one or more steps in the addition of 2 carbon moieties (such as malonyl-coenzyme A) to preexisting very long chain fatty acids (VLCFAs). In a preferred embodiment, a CUT1 protein catalyzes one or more steps in the addition of 2 carbon moieties to pre-existing long chain fatty acids of at least 24 carbon units in length. This activity can be measured by the assay described below.
This invention provides a cDNA and a gene encoding a CUTI enzyme from Arabidopsis thaliana. However the invention is not limited to this particular CUTI protein: other nucleotide sequences which encode CUTI proteins are also part of the invention, including variants on the disclosed Arabidopsis cDNA and gene sequences and orthologous sequences from other plant species, including naturally occurring variants, such as sequences from other ecotypes, species and natural polymorphisms, the cloning of which is now enabled. Such sequences share the essential functional characteristic of encoding an enzyme having very long chain fatty acid elongase activity. Nucleic acid sequences that encode CUT1 proteins and the proteins encoded by such nucleic acids share not only this functional characteristic, but also a specified level of sequence similarity (or sequence identity), as addressed below.
The concept of sequence identity can also be expressed in the ability of two sequences to hybridize to each other under stringent conditions.
Sequence identity: the similarity between two nucleic acid sequences, or two amino acid sequences is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are.
Methods of alignment of sequences for comparison are well-known in the art. Various programs and alignment algorithms are described in: Smith and Waterman (1981); Needleman and Wunsch (1970); Pearson and Lipman (1988); Higgins and Sharp (1988); Higgins and Sharp (1989); Corpet et al. (1988); and Pearson et al. (1994). Altschul et al. (1994) presents a detailed consideration of sequence alignment methods and homology calculations.
The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., 1990) is available from several sources, including the National Center for Biological Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. It can be accessed at htp://www.ncbi.nlm.nih.gov/BLAST/. A description of how to determine sequence identity using this program is available at http://www.ncbi.nlm.nih.gov/BLAST/blast help.html.
Homologs of the Arabidopsis CUTI protein are characterized by possession of at least sequence identity counted over the full length alignment with the disclosed Arabidopsis CUT1 amino acid sequence using the NCBI Blast 2.0, gapped blastp set to default parameters. Such homologous peptides will more preferably possess at least 75%, more preferably at least 80% and still more preferably at least or 95% sequence identity with the Arabidopsis CUT1 amino acid sequence determined by this method. When less than the entire sequence is being compared for sequence identity, homologs will WO 98/46766 PCT/CA98/00343 possess at least 75% and more preferably at least 85% and more preferably still at least 90% or sequence identity over short windows of 10-20 amino acids. Methods for determining sequence identity over such short windows are described at http://www.ncbi.nlm.nih.gov/BLAST/blast FAQs.html.
Homologs having the sequence identities described above will, in some embodiments, also possess VLCFA elongase activity. One of skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant homologs could be obtained that fall outside of the ranges provided. The present invention provides not only the peptide homologs are described above, but also nucleic acid molecules that encode such homologs.
Homologs of the Arabidopsis CUT cDNA and gene are similarly characterized by possession of at least 60% sequence identity counted over the full length alignment with the disclosed Arabidopsis cDNA or gene sequence using the NCBI Blast 2.0, gapped blastn set to default parameters. Such homologous nucleic acids will more preferably possess at least 70%, more preferably at least 80% and still more preferably at least 90% or 95% sequence identity determined by this method. When less than the entire sequence is being compared for sequence identity, homologs will possess at least 85% and more preferably at least 90% and more preferably still at least 95% sequence identity over 30 nucleotide windows. Homologs having the sequence identities described above will, in some embodiments, also encode a polypeptide having VLCFA elongase activity. However, homologs as defined above are useful for modifying VLCFA elongase activity in transgenic plants (for example, as used in antisense constructs) even when they do not encode a functional peptide. Again, one of skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant nucleic acid homologs could be obtained that fall outside of the ranges provided.
Another indication that two nucleic acid molecules are substantially homologous is that the two molecules hybridize to each other under stringent conditions when one molecule is used as a hybridization probe, and the other is present in a biological sample, genomic material from a cell. Specific hybridization means that the molecules hybridize substantially only to each other and not to other molecules that may be present in the genomic material. Stringent conditions are sequence dependent and are different under different environmental parameters. Generally, stringent conditions are selected to be about 5°C to 20 0 C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The T. is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Conditions for nucleic acid hybridization and calculation of stringencies can be found in Sambrook et al. (1989) and Tijssen (1993). Hybridization conditions and stringencies are further discussed below.
Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequence that all encode substantially the same protein.
Probes and primers: Nucleic acid probes and primers may readily be prepared based on the nucleic acids provided by this invention. A probe comprises an isolated nucleic acid attached to a WO 98/46766 PCT/CA98/00343 detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are discussed, in Sambrook et al. (1989) and Ausubel et al. (1987).
Primers are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more in length. Primers may be annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, and then extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid sequence, by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art.
Methods for preparing and using probes and primers are described, for example, in Sambrook et al. (1989), Ausubel et al. (1987), and Innis et al., (1990). PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge, MA). One of skill in the art will appreciate that the specificity of a particular probe or primer increases with its length. Thus, for example, a primer comprising 20 consecutive nucleotides of the Arabidopsis CUTI cDNA or gene will anneal to a target sequence a corresponding CUTI gene from Zea mays) with a higher specificity than a corresponding primer of only 15 nucleotides. Thus, in order to obtain greater specificity, probes and primers may be selected that comprise 20,25, 30, 35, 40, 50 or more consecutive nucleotides of the Arabidopsis CUT cDNA or gene sequences. Such probes and primers are useful for obtaining CUTI nucleic acid molecules (cDNA, genomic sequences, and portions of these molecules) both from Arabidopsis and other plant species.
Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in the host cell, such as an origin of replication. A vector may also include one or more selectable marker genes and other genetic elements known in the art.
Transformed: A transformed cell is a cell into which has been introduced a nucleic acid molecule by molecular biology techniques. As used herein, the term transformation encompasses all techniques by which a nucleic acid molecule might be introduced into such a cell, including transformation with Agrobacterium vectors, transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA by electroporation, lipofection, and particle gun acceleration.
Isolated: An "isolated" biological component (such as a nucleic acid or protein) has been substantially separated or purified away from other biological components in the cell of the organism in which the component naturally occurs, other chromosomal and extrachromosomal DNA and RNA, and proteins. Nucleic acids and proteins which have been "isolated" thus include nucleic acids and proteins purified by standard purification methods. The term also embraces nucleic acids and proteins prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids.
Purified: The term purified does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified CUTI protein preparation is one in which the CUTI protein is more WO 98/46766 PCT/CA98/00343 enriched than the protein is in its natural environment within a cell. Preferably, a preparation of CUTI protein is purified such that CUT I protein represents at least 50% of the total protein content of the preparation.
Operably linked: A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter effects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.
Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, by genetic engineering techniques.
Ortholog: two nucleotide or amino acid sequences are orthologs of each other if they share a common ancestral sequence and diverged when a species carrying that ancestral sequence split into two species. Orthologous sequences are also homologous sequences.
Transgenic plant: as used herein, this term refers to a plant that contains recombinant genetic material not normally found in plants of this type and which has been introduced into the plant in question (or into progenitors of the plant) by human manipulation. Thus, a plant that is grown from a plant cell into which recombinant DNA is introduced by transformation is a transgenic plant, as are all offspring of that plant which contain the introduced DNA (whether produced sexually or asexually).
II. Sequence Listing and Figures The nucleic and amino acid sequences listed in the accompanying sequence listing are showed using standard letter abbreviations for nucleotide bases, and three letter code for amino acids. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood to be included by any reference to the displayed strand.
Seq. I.D. No. 1 shows the nucleotide sequence of the CUT1 gene and the encoded amino acid sequence.
Seq. I.D. No. 2 shows the nucleotide sequence of the CUTI cDNA.
Seq. I.D. No. 3 shows the nucleotide sequence of the CUT1 open reading frame.
Seq. I.D. No. 4 shows the amino acid sequence of the CUT1 protein.
Seq. I.D. Nos. 5 11 show primers useful in PCR amplification of various regions of the CUT1 gene, cDNA or ORF.
Seq. I.D. No. 12 shows the promoter region of the CUT1 genomic clone.
Fig. 1 shows the pathways of wax biosynthesis in Arabidopsis.
WO 98/46766 PCT/CA98/00343 EIl. Isolation and Characterization of the CUTI cDNA The CUTI cDNA was initially identified using a TBLASTN homology search (Altschul et al., 1990) of the database of expressed sequenced tags (ESTs) of anonymous Arabidopsis cDNA clones (Newman et al., 1994) using the deduced amino acid sequence of the FAE1 gene. The search found 14 ESTs in the database which had open reading frames with significant homology to FAE1. These ESTs did not correspond to known condensing enzymes such as chalcone synthase or 3-ketoacyl-acyl carrier protein synthase III.
One of these ESTs was selected for further investigation, and the corresponding full length cDNA was isolated. This cDNA is herein referred to as the CUTI cDNA. Sequencing demonstrated that the CUT1 cDNA was 1829 nucleotides long, approximately the size of the FAE1 transcript (James et al., 1995). The CUTI cDNA contains one open reading frame of 497 amino acids, which is shorter than both the FAE1 sequence (506 amino acids) and the jojoba KCS (521 amino acids). The CUTI cDNA and the protein it encodes are shown in Seq. I.D. Nos. 2 and 4, respectively.
There is an in frame stop codon, TAA, 15 nucleotides upstream of the most 5' ATG, suggesting that this sequence indeed represents the full length amino acid sequence of the protein. Thus, the COT1 cDNA as depicted in Seq. I.D. No. 2 has a 5' untranslated region of 58 nucleotides, an open reading frame of 1491 nucleotides and a 3' untranslated region of 258 nucleotides, excluding the poly(A) tail (22 As). Comparison of the deduced amino acid sequence of the CUT1 protein to FAE1 revealed that they are 50.0% identical and 74.7% similar.
IV. Isolation and Characterization of the CUTI Gene An Arabidopsis CUT1 genomic clone was isolated from a genomic library in .GEMI 1 by probing nitrocellulose plaque lifts with a full-length CUTI cDNA clone. A 2.5 kb long Sall fragment containing 580 bp of the coding sequence and 1951 bp of the 5' upstream region was subcloned into the Sail site of pT7T3 18U plasmid (Pharmacia), followed by complete sequencing on both strands. The sequence of this genomic clone is shown in Seq. I.D. No. 1.
In situ hybridization studies in developing shoots, leaves and siliques of Arabidopsis indicated epidermis-specific expression of the CUTI gene, as expected of a gene encoding an enzyme involved in wax biosynthesis.
V. Analysis of the CUT1 Promoter In order to confirm the tissue and cell specificity of the CUTI promoter, 5' flanking sequences from the CUT1 genomic clone were operably linked to the uidA reporter gene encoding p-glucuronidase (GUS). Two constructs were made, one having a 1.9 kb promoter fragment and the second containing a truncated 1.2 kb promoter. These promoter-GUS fusions were introduced into Arabidopsis and tobacco by Agrobacterium-mediated transformation and the promoter function characterized in transgenic plants.
To obtain the 1.9 and 1.2 kb regions of the CUTI promoter sequence, synthetic oligonucleotides WO 98/46766 PCT/CA98/00343 homologous to portions of the 5' untranslated region of the genomic clone were used as primers to amplify either a 1949 bp or a 1209 bp promoter fragment by PCR. As shown in Figure 1, the upstream primer was 5'-GTGCTTTATATATGTTTG-3' (cutpro3) (Seq. I.D. No. 5) in combination with the downstream primer 5'-CGTCGGAGAGTTTTAATG-3' (cutpro 1) (Seq. I.D. No. 6) for the PCR-synthesis of the 1949 bp fragment, and 5'-CTTCGATATCGGTTGTTG-3' (cutpro2) (Seq. I.D. No. 7) and cutpro I for the amplification of the 1209 bp fragment. In both cases, the amplified products were subcloned in the HincII site of the plasmid pT7T318U (Pharmacia). The inserts were then cleaved out with HindIII and XbaI and directionally subcloned into the corresponding sites of the binary Ti plasmid pBl 01 (Clontech), which contains a promoterless GUS gene (Jefferson et al. 1987). The pCUTI-GUS fusion constructs in pBll01 were introduced into Agrobacterium tumefaciens strain GV3101 (Koncz and Schell, 1986) by electroporation and selected for resistance to kanamycin (50 Jg/ml).
For transformation of tobacco, Agrobacterium harbouring the pCUTI-GUS construct was cocultivated with leaf pieces of Nicotiana tabacum SRI and transformants were selected with kanamycin (100mg/mL) on solid medium (Lee and Douglas, 1996). Arabidopsis thaliana Heynh. ecotype Columbia was transformed with pCUTI-GUS binary vector using a combination of in planta (Chang et al., 1994, Katavic et al., 1994) and vacuum inflitration methods (Bechtold et al., 1993). Plants were grown until the primary inflorescence shoots reached 1-2 cm in height, when this bolts were cut off. The wound site was inoculated with 50 mL of an overnight Agrobacterium culture. After 4-6 days a number of secondary inflorescences that appeared were cut off, and vacuum inflitration was performed on these plants using the conditions described by Bechtold et al. (1993). Screening for transformed seed was done on 50g/mL kanamycin as described previously (Katavic et al., 1994).
Tissue sections of transgenic plants containing the pCUTI-GUS constructs were placed in 100 mM NaPO 4 (pH7) and 1 mM spermidine for 15 min, then incubated at 37 C in 0.5 K 3 0.01 Triton X-100, ImM EDTA, 10 mM ,f-mercaptoethanol, 5-bromo-4-chloro-3-indolyl-f3-D-glucuronide in 100 mM NaPO 4 (pH7), until a blue color appeared (after approximately 1 hr). Following incubation with the substrate, chlorophyll was removed from the sections using a graded ethanol series.
In both recipient plant species, Arabidopsis and tobacco, CUT] expression pattern mirrored that observed in the in situ experiments. Furthermore, both long and short CUTI promoter fragments targeted expression of the uidA gene exclusively to the epidermis. No GUS expression was detected in any of the other cell types in the stems or leaves of transgenic plants. Thus, the Arabidopsis CUT1 promoter is regulated in a tissue specific, and cell specific manner, and epidermis specificity appears to be retained even in unrelated plant species like tobacco. In addition, no differences in the strength of expression were detected between the 1.9 kb and 1.2 kb promoter.
VI. Preferred Methods for Producing CUT1 Nucleic Acids With the provision of the CUT1 cDNA and gene (the "CUTI nucleic acids") the polymerase chain reaction (PCR) may now be utilized in a preferred method for producing the CUT1 nucleic acids.
PCR amplification of the CUTI cDNA sequence may be accomplished either by direct PCR from a plant 11 WO 98/46766 PCT/CA98/00343 cDNA library or by Reverse-Transcription PCR (RT-PCR) using RNA extracted from plant cells as a template. Methods and conditions for both direct PCR and RT-PCR are known in the art and are described in Innis et al. (1990). Suitable plant cDNA libraries for direct PCR include the Arabidopsis cDNA library described by Newman et al. (1994). Similarly, the CUTI genomic sequence may be amplified directly from genomic DNA extracted from plants, or from plant genomic DNA libraries.
Amplification may be used to obtain the full length cDNA or genomic sequence, or may be used to amplify selected portions of these molecules (for example for use in antisense constructs) The selection of PCR primers will be made according to the portions of the CUT1 nucleic acids which are to be amplified. Variations in amplification conditions may be required to accommodate primers of differing lengths; such considerations are well known in the art and are discussed in Innis et al. (1990), Sambrook et al. (1989), and Ausubel et al (1987). By way of example only, the entire CUT1 cDNA molecule as shown in Seq. I.D. No. 2 may be amplified using the following combination of primers: primer 1 5' AAATACCCTAATCACATTTTGTAA 3' (Seq. I.D. No. 8) primer 2 5' TTTAAACAGAGAGAAATATTCTTA 3' (Seq. I.D. No. 9) The open reading frame portion of the cDNA may be amplified using the following primer pair: primer 3 5' ATGCCTCAGGCACCGATGCCAGAG 3' (Seq. I.D. No. primer 4 5' CAGCACGAGAAACTAAAAAATACC 3' (Seq. I.D. No. 11) These primers are illustrative only; it will be appreciated by one skilled in the art that many different primers may be derived from the provided sequences in order to amplify particular regions of the CUTI sequences. Resequencing of PCR products obtained by these amplification procedures is recommended; this will facilitate confirmation of the amplified CUTI sequence and will also provide information on natural variation on this sequence in different ecotypes and plant populations.
Oligonucleotides which are derived from the CUT1 nucleic acid sequences and which are suitable for use as PCR primers to amplify the CUT1 nucleic acid sequences are encompassed within the scope of the present invention. Preferably, such oligonucleotide primers will comprise a sequence of consecutive nucleotides of the CUTI nucleic acid sequences. To enhance amplification specificity, primers comprising at least 20-30 consecutive nucleotides of these sequences may also be used.
VII. Cloning CUTI Variants With the provision herein of the CUTI nucleic acid sequences, the cloning by standard methodologies of corresponding cDNAs and genes from other ecotypes and plant species, as well as polymorphic forms of the disclosed sequences is now enabled. Thus, the present invention includes methods of isolating a nucleotide sequence encoding a plant very long chain fatty acid elongation enzyme from a plant. Both conventional hybridization and PCR amplification procedures may be utilized to clone such sequences. Common to both of these techniques is the hybridization of probes or primers derived from the disclosed CUT1 nucleic acid sequences to a target nucleotide preparation, which may WO 98/46766 PCT/CA98/00343 be, in the case of conventional hybridization approaches, a cDNA or genomic library or, in the in the case of PCR amplification, extracted genomic DNA, mRNA, a cDNA library or a genomic library.
Direct PCR amplification may be performed on cDNA libraries prepared from the plant species in question, or RT-PCR may be performed using mRNA extracted from the plant cells using standard methods. PCR primers will comprise at least 15 consecutive nucleotides of the CUTI nucleic acid sequences. One of skill in the art will appreciate that sequence differences between the disclosed CUT1 nucleic acid sequences and the target gene to be amplified may result in lower amplification efficiencies.
To compensate for this, longer PCR primers or lower annealing temperatures may be used during the amplification cycle. Where lower annealing temperatures are used, sequential rounds of amplification using nested primer pairs may be necessary to enhance specificity.
For conventional hybridization techniques, the hybridization probe is preferably labeled with a detectable label such as a radioactive label, and the probe is of at least 20 nucleotides in length. As is well known in the art, increasing length of hybridization probes tends to give enhanced specificity. The labeled probe derived from, for example, the CUT1 cDNA sequence may be hybridized to a plant cDNA or genomic library and the hybridization signal detected using means known in the art. The hybridizing colony or plaque (depending on the type of library used) is then purified and the cloned sequence contained in that colony or plaque isolated and characterized.
VII. Use of the CUT1 Nucleic Acids to Produce Plants with Modified VLCFA Composition Once a gene or cDNA ("nucleic acid") encoding a protein involved in the determination of a particular plant characteristic has been isolated, standard techniques may be used to express the nucleic acid in transgenic plants in order to modify that particular plant characteristic. The basic approach is to clone the nucleic acid into a transformation vector, such that it is operably linked to control sequences a promoter) which direct expression of the open reading frame in plant cells. The transformation vector is then introduced into plant cells by one of a number of techniques electroporation) and progeny plants containing the introduced nucleic acid are selected. Preferably all or part of the transformation vector will stably integrate into the genome of the plant cell. That part of the transformation vector which integrates into the plant cell and which contains the introduced nucleic acid and associated sequences for controlling expression (the introduced "transgene") may be referred to as the recombinant expression cassette.
Selection of progeny plants containing the introduced transgene may be made based upon the detection of an altered phenotype. Such a phenotype may result directly from the nucleic acid cloned into the transformation vector or may be manifested as enhanced resistance to a chemical agent (such as an antibiotic) as a result of the inclusion of a dominant selectable marker gene incorporated into the transformation vector.
The choice of control sequences and how the nucleic acid (or selected portions of the nucleic acid) are arranged in the transformation vector relative to the control sequences determine, in WO 98/46766 PCT/CA98/00343 part, how the plant characteristic affected by the introduced nucleic acid is modified. For example, the control sequences may be tissue specific, such that the nucleic acid is only expressed in particular tissues of the plant pollen) and so the affected characteristic will be modified only in those tissues. The nucleic acid sequence may be arranged relative to the control sequence such that the nucleic acid transcript is expressed normally, or in an antisense orientation. Expression of an antisense RNA corresponding to the cloned nucleic acid will result in a reduction of the targeted gene product (the targeted gene product being the protein encoded by the plant gene from which the introduced nucleic acid was derived). Over-expression of the introduced nucleic acid, resulting from a plus-sense orientation of the nucleic acid relative to the control sequences in the vector, may lead to an increase in the level of the gene product, or may result in co-suppression (also termed "sense suppression") of that gene product.
Successful examples of the modification of plant characteristics by transformation with cloned nucleic acid sequences are replete in the technical and scientific literature. Selected examples, which serve to illustrate the current knowledge in this field of technology, and which are herein incorporated by reference, include: U.S. Patent No. 5,451,514 to Boudet (modification of lignin synthesis using antisense RNA and co-suppression); U.S. Patent No. 5,443,974 to Hitz (modification of saturated and unsaturated fatty acid levels using antisense RNA and co-suppression); U.S. Patent No. 5,530,192 to Murase (modification of amino acid and fatty acid composition using antisense RNA); U.S. Patent No. 5,455,167 to Voelker (modification of medium chain fatty acids) U.S. Patent No. 5,231,020 to Jorgensen (modification of flavonoids using co-suppression); U.S. Patent No. 5,583,021 to Dougherty (modification of virus resistance by expression of plussense untranslatable RNA); WO 96/13582 (modification of seed VLCFA composition using over expression, co-suppression and antisense RNA in conjunction with the Arabidopsis FAE1 gene); and WO 95/15387 (modification of seed VLCFA composition using over expression of jojoba wax synthesis gene).
These examples include descriptions of transformation vector selection, transformation techniques and the construction of constructs designed to over-express the introduced nucleic acid or to express antisense RNA corresponding to the nucleic acid. In light of the foregoing and the provision herein of the CUTI nucleic acids, it is thus apparent that one of skill in the art will be able to introduce these nucleic acids, or derivative forms of these molecules antisense forms), into plants in order to produce plants having modified VLCFA compositions. Examples one and two below provides illustrations of this in which the CUTI cDNA is operably linked to the CaMV 35S promoter sequence, cloned into the pBIN19 transformation vector and introduced into Arabidopsis using a vacuum infiltration method.
WO 98/46766 PCT/CA98/00343 As reported in Example one, certain of the plants transformed in this way had no detectable epicuticular wax layers, indicating that transformation with the CUT1 cDNA had disrupted normal VLCFA synthesis in the plant epidermal cells. Such disruption is likely attributable to the phenomenon termed co-suppression (or sense-suppression). These plants are thus referred to as "CUT1-suppressed".
This phenomenon may be affected by factors such as positional location of the introduced sequences in the plant genome.
Over-expression of CUT1 protein in transgenic plants, resulting in plants enhanced epicuticular wax layers will be a useful agronomic trait, providing increased drought and insect resistance. For example, drought resistance in rice is associated with high wax lines rich in C 29
C
33 and C 35 alkanes (O'Toole and Cruz, 1983; Haque et al., 1992). Increased wax deposition in transgenic plants can be accomplished by overexpression of CUTI protein, while the identification of the CUT1 promoter allows targeting of lipid modification enzymes such as desaturases, thioesterases and other condensing enzymes with different specificities to the epidermal cells to modify wax composition.
Transformation of plants with the CUT1 nucleic acids or derivatives thereof may be used to modify other plant characteristics, such as seed coat composition and seed oil composition. Because condensing enzymes are pivotal enzymes in the synthesis of VLCFAs, controlling levels of accumulation of VLCFAs and their acyl chain length (Millar and Kunst, 1997) through the manipulation of CUT1 expression will permit the production of plants having novel fatty acid compositions. For instance, the accumulation of VLCFAs in tobacco seed expressing FAE1 from Arabidopsis (Millar and Kunst, 1997) raises the possibility of producing VLCFAs in plant species that currently do not synthesize VLCFAs. In addition, targeting of CUTI to seeds will be useful to produce crop plants capable of synthesising new, agronomically important VLCFAs in seed oil.
Disruption of CUT1 activity in transgenic plants also provides a simple means for obtaining conditional male sterility in plants (see Example two). One of the major factors contributing to increases in crop productivity is the development of hybrid varieties of crops. Several different breeding strategies have been used to produce hybrid seed, but none of these strategies can be used as a general approach in all crop plants (Goldberg et al.,1993). As an alternative, genetically engineered systems and strategies for male fertility control that are applicable to a wide range of crops have recently been developed. For example, nuclear male sterility has been engineered by tapetum-specific expression of a bacterial RNAse gene (Mariani et al., 1990, 1992), overexpression of the rolC gene from Agrobacterium rhizogenes (Fladung, 1990; Schmiilling et al., 1988, 1992), expression of glucanase that desrupts the callose wall of the microsporophyte prematurely (Tsuchiya et al., 1995; Worrall et al., 1992), the inhibition of flavonoid biosynthetic genes like chalcone synthase and dihydroflavolon 4-reductase (van der Krol et al., 1988, 1990; van der Meer et al, 1992; Napoli et al. 1990; Taylor and Jorgensen, 1992), and altered expression ofstilbene synthase (Fischer et al., 1997). However, in most of these cases the restoration of fertility is not simple, or not easily controlled. In contrast, conditional male sterility caused by suppression of CUTI activity is easily reversible under high relative humidity.
The selection of vectors and promoters appropriate for targeting particular characteristics for WO 98/46766 PCT/CA98/00343 modification (such as seed-specific expression) are well known; the following paragraphs set forth general guidance on the various options available in producing transgenic plants having modified VLCFA composition.
a. Plant Types VLCFAs are found in all plant types, and thus DNA molecules according to the present invention the CUT1 cDNA, gene, homologs and antisense forms thereof) may be introduced into any plant type in order to modify the VLCFA composition of the plant. Thus, the sequences of the present invention may be used to modify VLCFA composition in any higher plant, including monocotyledonous and dicotyledenous plants, including, but not limited to maize, wheat, rice, barley, soybean, beans in general, rape/canola, alfalfa, flax, sunflower, safflower, brassica, cotton, flax, peanut, clover; vegetables such as lettuce, tomato, cucurbits, potato, carrot, radish, pea, lentils, cabbage, broccoli, brussel sprouts, peppers; tree fruits such as apples, pears, peaches, apricots; flowers such as carnations and roses.
b. Vector Construction, Choice of Promoters A number of recombinant vectors suitable for stable transfection of plant cells or for the establishment of transgenic plants have been described including those described in Pouwels et al., (1987), Weissbach and Weissbach, (1989), and Gelvin et al., (1990). Typically, plant transformation vectors include one or more cloned plant genes (or cDNAs) under the transcriptional control of 5' and 3' regulatory sequences and a dominant selectable marker. Such plant transformation vectors typically also contain a promoter regulatory region a regulatory region controlling inducible or constitutive, environmentally-or developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start site, a ribosome binding site, an RNA processing signal, a transcription termination site, and/or a polyadenylation signal.
Examples of constitutive plant promoters which may be useful for expressing CUT1 nucleic acids include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers constitutive, high-level expression in most plant tissues (see, Odel et al., 1985, Dekeyser et al., 1990, Terada and Shimamoto, 1990); the nopaline synthase promoter (An et al., 1988); and the octopine synthase promoter (Fromm et al., 1989).
A variety of plant gene promoters that are regulated in response to environmental, hormonal, chemical, and/or developmental signals, also can be used for expression of CUTI nucleic acids in plant cells, including promoters regulated by: heat (Callis et al., 1988); light the pea rbcS-3A promoter, Kuhlemeier et al., 1989, the maize rbcS promoter, Schaffner and Sheen, 1991, and the chlorophyll a/b binding protein promoter, Simpson et al., 1985); hormones, such as abscisic acid (Marcotte et al., 1989); wounding wuni, Siebertz et al., 1989); and chemicals such as methyl jasmonate or salicylic acid. It may also be advantageous to employ tissue-specific promoters, such as those described by Roshal et al., (1987), Schernthaner et al., (1988), and Bustos et al., (1989).
WO 98/46766 PCT/CA98/00343 Alternatively, tissue specific (root, leaf, flower, and seed for example) promoters (Carpenter.et al.
1992, Denis et al. 1993, Opperman et al. 1993, Stockhause et al. 1997; Roshal et al., 1987; Schernthaner et al., 1988; and Bustos et al., 1989) can be fused to the coding sequence to obtained particular expression in respective organs. In addition, the timing of the expression can be controlled by using promoters such as those acting at senescencing (Gan and Amasino 1995) or late seed development (Odell et al. 1994). The promoter region of the CUTI genomic sequence disclosed herein confers epidermis-specific expression in Arabidopsis and tobacco. Accordingly, the native promoter may be used to obtain epidermis-specific expression of the introduced transgene.
For producing conditionally male sterile plants by blocking CUT1 activity in pollen, it is preferable to use a pollen-specific promoter (so as to avoid pleiotropic effects). Thus, the CUT1 coding region may be expressed under the control of the tapetum-specific promoters such as TA29 (Mariani et al.,1990, 1992), MS2 (Aarts et al., 1997), and tap (Nacken et al., 1991).
Plant transformation vectors may also include RNA processing signals, for example, introns, which may be positioned upstream or downstream of the CUTI nucleic acid sequence in the transgene.
In addition, the expression vectors may also include additional regulatory sequences from the 3'untranslated region of plant genes, a 3' terminator region to increase mRNA stability of the mRNA, such as the PI-II terminator region of potato or the octopine or nopaline synthase 3' terminator regions.
Finally, as noted above, plant transformation vectors may also include dominant selectable marker genes to allow for the ready selection of transformants. Such genes include those encoding antibiotic resistance genes resistance to hygromycin, kanamycin, bleomycin, G418, streptomycin or spectinomycin) and herbicide resistance genes phosphinothricin acetyltransferase).
c. Arrangement of CUTI Nucleic Acids in Vector As noted above, the particular arrangement of the CUT1 nucleic acid in the transformation vector will be selected according to the expression of the nucleic acid desired.
Where enhanced VLCFA synthesis is desired, the CUT1 nucleic acid may be operably linked to a constitutive high-level promoter such as the CaMV 35S promoter. Modification of VLCFA synthesis may also be achieved by introducing into a plant a transformation vector containing a variant form of the CUT1 nucleic acid, for example a form which varies from the exact nucleotide sequence of the CUT1 nucleic acid, but which encodes a protein that retains the functional characteristic of the CUT1 protein, very long chain fatty acid elongation activity.
In contrast, a reduction of VLCFA synthesis may be obtained by introducing antisense constructs based on the CUT1 nucleic acid sequence into plants. For antisense suppression, the CUT1 nucleic acid is arranged in reverse orientation relative to the promoter sequence in the transformation vector. The introduced sequence need not be the full length CUT1 nucleic acid, and need not be exactly homologous to the CUT1 nucleic acid. Generally, however, where the introduced sequence is of shorter length, a higher degree of homology to the native CUTI sequence will be needed for effective antisense WO 98/46766 PCT/CA98/00343 suppression. Preferably, the introduced antisense sequence in the vector will be at least 30 nucleotides in length, and improved antisense suppression will typically be observed as the length of the antisense sequence increases. Preferably, the length of the antisense sequence in the vector will be greater than 100 nucleotides. Transcription of an antisense construct as described results in the production of RNA molecules that are the reverse complement of mRNA molecules transcribed from the endogenous CUT1 gene in the plant cell. Although the exact mechanism by which antisense RNA molecules interfere with gene expression has not been elucidated, it is believed that antisense RNA molecules bind to the endogenous mRNA molecules and thereby inhibit translation of the endogenous mRNA.
Suppression of endogenous CUT1 geneexpression can also be achieved using ribozymes.
Ribozymes are synthetic RNA molecules that possess highly specific endoribonuclease activity. The production and use of ribozymes are disclosed in U.S. Patent No. 4,987,071 to Cech and U.S. Patent No. 5,543,508 to Haselhoff, which are hereby incorporated by reference. The inclusion of ribozyme sequences within antisense RNAs may be used to confer RNA cleaving activity on the antisense RNA, such that endogenous mRNA molecules that bind to the antisense RNA are cleaved, which in turn leads to an enhanced antisense inhibition of endogenous gene expression.
Constructs in which the CUTI nucleic acid (or variants thereon) are over-expressed may also be used to obtain co-suppression of the endogenous CUTI gene in the manner described in U.S. Patent No.
5,231,021 to Jorgensen. Such co-suppression (also termed sense suppression) does not require that the entire CUT1 nucleic acid be introduced into the plant cells, nor does it require that the introduced sequence be exactly identical to the CUTI nucleic acid. However, as with antisense suppression, the suppressive efficiency will be enhanced as the introduced sequence is lengthened and the sequence similarity between the introduced sequence and the endogenous CUTI gene is increased. Example I below provides an illustration of co-suppression of the endogenous CUTI gene by transformation of plants with the CUT1 cDNA.
d. Transformation and Regeneration Techniques Transformation and regeneration of both monocotyledonous and dicotyledonous plant cells is now routine, and the selection of the most appropriate transformation technique will be determined by the practitioner. The choice of method will vary with the type of plant to be transformed; those skilled in the art will recognize the suitability of particular methods for given plant types. Suitable methods may include, but are not limited to: electroporation of plant protoplasts; liposome-mediated transformation; polyethylene mediated transformation; transformation using viruses; micro-injection of plant cells; micro-projectile bombardment of plant cells; vacuum infiltration; and Agrobacterium tureficiens (AT) mediated transformation. Typical procedures for transforming and regenerating plants are described in the patent documents listed at the beginning of this section.
e. Selection of Transformed Plants Following transformation and regeneration of plants with the transformation vector, transformed WO 98/46766 PCT/CA98/00343 plants are preferably selected using a dominant selectable marker incorporated into the transformation vector. Typically, such a marker will confer antibiotic resistance on the seedlings of transformed plants, and selection of transformants can be accomplished by exposing the seedlings to appropriate concentrations of the antibiotic. Example I provides an example of such an approach in which seedlings were selected using kanamycin.
After transformed plants are selected and grown to maturity, they can be assayed to determine whether VLCFA synthesis has been altered as a result of the introduced transgene. This can be done in several ways, including, as described in Example 1, microscopic examination of the epicuticular wax layer and chromatographic analysis. Lipids may also be extracted from plant material and analyzed by gas chromatography as described by Dooner (1990). In addition, antisense or sense suppression of the endogenous CUT1 gene may be detected by analyzing mRNA expression on Northern blots.
IX. Production of Sequence Variants As noted above, modification of VLCFA synthesis in plant cells can be achieved by transforming plants with CUT1 nucleic acids, antisense constructs based on CUT] nucleic acid sequences or other variants on CUT1 nucleic acid sequences. With the provision of the CUTI cDNA and genomic sequences herein, the creation of variants on these CUT1 nucleic acid sequences by standard mutagenesis techniques is now enabled.
Variant DNA molecules include those created by standard DNA mutagenesis techniques, for example, M13 primer mutagenesis. Details of these techniques are provided in Sambrook et al. (1989), Ch. 15. By the use of such techniques, variants may be created which differ in minor ways from the disclosed CUT1 nucleic acids. DNA molecules and nucleotide sequences that are derivatives of those specifically disclosed herein and which differ from those disclosed by the deletion, addition or substitution of nucleotides while still encoding a protein which possesses the functional characteristic of the CUT1 protein very long chain fatty acid elongation activity) are comprehended by this invention. DNA molecules and nucleotide sequences which are derived from the CUT1 nucleic acids include DNA sequences which hybridize under moderately stringent conditions to the DNA sequences disclosed, or fragments thereof.
Hybridization conditions resulting in particular degrees of stringency will vary depending upon the nature of the hybridization method of choice and the composition and length of the hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength (especially the Na' concentration) of the hybridization buffer will determine the stringency of hybridization. Calculations regarding hybridization conditions required for attaining particular degrees of stringency are discussed by Sambrook et al. (1989), chapters 9 and 11, herein incorporated by reference. By way of illustration only, a hybridization experiment may be performed by hybridization of a CUTI-derived probe (for example, the CUT1 cDNA sequence) to a target DNA molecule (for example, the CUTI homolog from Zea Mays) which has been electrophoresed in an agarose gel and transferred to a nitrocellulose membrane by Southern blotting (Southern, 1975), a technique well known in the art and described in WO 98/46766 PCT/CA98/00343 (Sambrook et al., 1989). Hybridization with a target probe labeled with 32 P)-dCTP is generally carried out in a solution of high ionic strength such as 6xSSC at a temperature that is 20-25"C below the melting temperature, Tm, described below. For such Southern hybridization experiments where the target DNA molecule on the Southern blot contains 10 ng of DNA or more, hybridization is typically carried out for 6-8 hours using 1-2 ng/ml radiolabeled probe (of specific activity equal to 10 9 CPM/gg or greater).
Following hybridization, the nitrocellulose filter is washed to remove background hybridization. The washing conditions should be as stringent as possible to remove background hybridization but to retain a specific hybridization signal. The term Tm represents the temperature above which, under the prevailing ionic conditions, the radiolabeled probe molecule will not hybridize to its target DNA molecule. The Tm of such a hybrid molecule may be estimated from the following equation (Bolton and McCarthy, 1962): Tm 81.5C 16.6(loglo[Na+]) 0.41(%G+C) 0.63(% formamide) (600/1) Where I the length of the hybrid in base pairs.
This equation is valid for concentrations of Na in the range of 0.01 M to 0.4 M, and it is less accurate for calculations of Tm in solutions of higher The equation is also primarily valid for DNAs whose G+C content is in the range of 30% to 75%, and it applies to hybrids greater than 100 nucleotides in length (the behavior of oligonucleotide probes is described in detail in Ch. 11 of Sambrook et al., 1989).
Thus, by way of example, for a 150 base pair DNA probe derived from the first 150 base pairs of the open reading frame of the CUT1 cDNA (with a hypothetical %GC a calculation of hybridization conditions required to give particular stringencies may be made as follows: For this example, it is assumed that the filter will be washed in 0.3 xSSC solution following hybridization, thereby 0.045M, %GC 45%, Formamide concentration 0, I 150 base pairs, Tm 81.5 16(logJ 0 (0.41 x 45) (600/150) and so Tm 74.4*C.
The Tm of double-stranded DNA decreases by 1-1.5°C with every 1% decrease in homology (Bonner et al., 1973). Therefore, for this given example, washing the filter in 0.3 xSSC at 59.4-64.4°C will produce a stringency of hybridization equivalent to 90%. Alternatively, washing the hybridized filter in 0.3 xSSC at a temperature of 65.4-68.4*C will yield a hybridization stringency of 94%. The above example is given entirely by way of theoretical illustration. One skilled in the art will appreciate that other hybridization techniques may be utilized and that variations in experimental conditions will necessitate alternative calculations for stringency.
DNA sequences that encode a protein having VLCFA elongase activity and which hybridize to the disclosed CUT1 nucleic acid sequences under hybridization conditions of at least 75%, more preferably at least 80%, 85% or 90% stringency, and most preferably at least 95% stringency are encompassed within the present invention.
The degeneracy of the genetic code further widens the scope of the present invention as it enables WO 98/46766 PCT/CA98/00343 major variations in the nucleotide sequence of a DNA molecule while maintaining the amino acid sequence of the encoded protein. For example, the fourth amino acid residue of the CUTI protein is alanine. This is encoded in the CUTI ORF by the nucleotide codon triplet GCA. Because of the degeneracy of the genetic code, three other nucleotide codon triplets--GCT, GCC and GCG--also code for alanine. Thus, the nucleotide sequence of the CUTI ORF could be changed at this position to any of these three codons without affecting the amino acid composition of the encoded protein or the characteristics of the protein. Based upon the degeneracy of the genetic code, variant DNA molecules may be derived from the CUT1 nucleic acid molecules disclosed herein using standard DNA mutagenesis techniques as described above, or by synthesis of DNA sequences. Thus, this invention also encompasses DNA sequences which encode the CUTI protein but which vary from the CUTI nucleic acid sequences by virtue of the degeneracy of the genetic code.
One skilled in the art will recognize that DNA mutagenesis techniques may be used not only to produce variant DNA molecules, but will also facilitate the production of proteins which differ in certain structural aspects from the CUTI protein, yet which proteins are clearly derivative of this protein and which maintain the essential characteristics of the CUT1 protein. Newly derived proteins may also be selected in order to obtain variations on the characteristic of the CUT1 protein, as will be more fully described below. Such derivatives include those with variations in amino acid sequence including minor deletions, additions and substitutions.
While the site for introducing an amino acid sequence variation is predetermined, the mutation per se need not be predetermined. For example, in order to optimize the performance of a mutation at a given site, random mutagenesis may be conducted at the target codon or region and the expressed protein variants screened for the optimal combination of desired activity. Techniques for making substitution mutations at predetermined sites in DNA having a known sequence as described above are well known.
Amino acid substitutions are typically of single residues; insertions usually will be on the order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues.
Deletions or insertions preferably are made in adjacent pairs, a deletion of 2 residues or insertion of 2 residues. Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final construct. Obviously, the mutations that are made in the DNA encoding the protein must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure.
Substitutional variants are those in which at least one residue in the amino acid sequence has been removed and a different residue inserted in its place. Such substitutions generally are made in accordance with the following Table 1 when it is desired to finely modulate the characteristics of the protein. Table 1 shows amino acids which may be substituted for an original amino acid in a protein and which are regarded as conservative substitutions.
WO 98/46766 PCT/CA98/00343 Table 1.
Original Residue Conservative Substitutions Ala ser Arg lys Asn gin; his Asp glu Cys ser Gin asn Glu asp Gly pro His asn; gin lie leu, val Leu ile; val Lys arg; gin; glu Met leu; ile Phe met; leu; tyr Ser thr Thr ser Trp tyr Tyr trp; phe Val ile; leu Substantial changes in enzymatic function or other features are made by selecting substitutidns that are less conservative than those in Table 1, selecting residues that differ more significantly in their effect on maintaining the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, the charge or hydrophobicity of the molecule at the target site, or the bulk of the side chain. The substitutions which in general are expected to produce the greatest changes in protein properties will be those in which a hydrophilic residue, seryl or threonyl, is substituted for (or by) a hydrophobic residue, leucyl, isoleucyl, phenylalanyl, valyl or alanyl; a cysteine or proline is substituted for (or by) any other residue; a residue having an electropositive side chain, lysyl, arginyl, or histadyl, is substituted for (or by) an electronegative residue, glutamyl or aspartyl; or a residue having a bulky side chain, phenylalanine, is substituted for (or by) one not having a side chain, glycine.
The effects of these amino acid substitutions or deletions or additions may be assessed for derivatives of the CUT1 protein by analyzing the ability of the derivative proteins to catalyze the addition of C2 units to existing VLCFA units. These assays may conveniently be performed using the yeastbased systems for assaying fatty acid elongation described below.
X. Production of recombinant CUT1 protein using heterologous expression systems Many different expression systems are available for expressing cloned nucleic acid molecules.
Examples of prokaryotic and eukaryotic expression systems that are routinely used in laboratories are described in Chapters 16-17 of Sambrook et al. (1989), which are herein incorporated by reference. Such systems may be used to express CUTI protein and derivatives at this protein at high levels to facilitate purification and functional analysis of the enzyme. Apart from permitting the activity of the enzyme to be 22 WO 98/46766 PCT/CA98/00343 determined (which is particularly useful to assess the activity of homologous and derivative proteins), heterologous expression facilitates other uses of the purified enzyme. For example the purified enzyme produced by recombinant means may be used to synthesize VLCFAs and other fatty acid metabolites in vitro, particularly radio- or fluorescent- labeled forms of VLCFAs and metabolites. These molecules may be used as tracers to determine the location in plant tissues and cells of VLCFAs and their metabolites.
The purified recombinant enzyme may also be used as an immunogen to raise enzyme-specific antibodies.
Such antibodies are useful as both research reagents (such as in the study of VLCFA regulation in plants) as well as diagnostically to determine expression levels of the enzyme in agricultural products, including pollen.
By way of example only, high level expression of the CUTI protein may be achieved by cloning and expressing the cDNA in yeast cells using the pYES2 yeast expression vector (Invitrogen, San Diego, CA). Secretion of the recombinant CUTI from the yeast cells may be achieved by placing a yeast signal sequence adjacent to the CUTI coding region. A number of yeast signal sequences have been characterized, including the signal sequence for yeast invertase. This sequence has been successfully used to direct the secretion of heterologous proteins from yeast cells, including such proteins as human interferon (Chang et al., 1986), human lactoferrin (Liang and Richardson, 1993) and prochymosin (Smith et al., 1985). Alternatively, the enzyme may be expressed at high level in standard prokaryotic expression systems, such as E. coli.
XI. Assays forVLCFA elongase activity To aid the biochemical characterization of the CUTI protein, or variants of this protein, the very long chain fatty acid elongase activity of the proteins may be determined by expressing the cDNA molecule which encodes protein in question in yeast. For that purpose, the full-length coding region of the cDNA may be linked to the galactose inducible GAL1 promoter in the Saccharomyces cerevisiae expression vector, pYES2 (Invitrogen). The yeast expressing the subject protein may then be employed to determine the substrate specificity of the CUT1 protein by one of the following approaches.
a. In vitro assay for VLCFA elongase activity using cell-free yeast homogenate To determine the range of substrates recognized by the subject protein, acyl elongation activity is measured using substrates of varying carbon chain lengths and degrees of unsaturation. In each case, gM of an[l-"C]acyl CoA (C18, C20, C22, C24 in 0.005% Triton X-100) is added to a standard assay mixture containing 80 mM Hepes-KOH, pH 7.2, 5% glycerol, ImM DTT, 0.5 mM NADPH, 1 mM ATP, 5 mM MgCl 2 1 mM malonyl-CoA, and an aliquot of cell free extract (50 gg protein) in a final volume of 50 pL. Incubation is carried out at 30 0 C for 1 h. The reaction is stopped with 100 ML of4 N KOH in 80% methanol and the lipids saponified for 1 h at 80 0 C. The mixture is then acidified by adding 100 uL of cold 6N HCL and extracted twice with 500 ML of cold hexane. The pooled hexane fractions are dried under N 2 followed by transmethylation for product analyses.
23 WO 98/46766 PCT/CA98/00343 b. In vivo assay: Feeding of transformed yeast cells with radiolabelled acyl-tween substrates A second approach for determining substrate specificity involves growth of yeast cells in the presence of various [l-C"]acyl-Tween substrates (C18, C20, C22, C24; Terzaghi, 1986). Fatty acyl substrates provided in the growth medium as Tween-fatty acid esters are readily taken up from the medium and used by the cells. For each FAE protein, yeast cells are initially grown in the presence of several concentrations of a single acyl-Tween substrate for different lengths of time to determine the optimal substrate concentration and the duration of the feeding assays. Once these parameters are established, yeast cells expressing the subject protein and control cells containing empty pYES2 plasmid are grown in a defined medium in the presence of a single radiolabelled acyl-Tween substrate. At the end of the experiment, cells are pelleted, and then resuspended in 1 mL of 1 N methanolic-HCl (Supelco). Treatment with methanolic-HC1 converts fatty acids to methyl esters (FAME). Radiolabelled FAMEs are analyzed as described bellow, to characterize the products generated by elongation of each acyl-Tween substrate. A comparison of radiolabelled FAMEs from CUT1 containing yeast with FAMEs isolated from control cells allows the determination of the elongation specificity of the subject FAE protein.
c. Product analyses The products of the elongation assays obtained in or pelleted yeast cells from experiment are transmethylated in a sealed tube using 1 N methanolic-HCI (Supelco) at 80 0 C for 1 h. Samples are then extracted twice with 500 tL of hexane after the addition of 1 mL of 0.9% NaC1, and the pooled extracts containing FAMEs concentrated under N 2 Radiolabelled FAMEs are applied on KC,, reversephase TLC plates (Whatman), and separated in acetonitrile:tetrahydrofuran (85:15, Products of TLC separation are identified by co-chromatography with FAME standards, or by GC-MS. In addition, FAMEs may be scraped from the TLC plates and their radioactivity determined by liquid scintillation counting.
EXAMPLES
The following examples serve to illustrate various applications of the present invention.
Example one: Modification of A. thaliana Wax Production By Transformation with the CUTI cDNA a. Construction of binary transformation vector WO 98/46766 PCT/CA98/00343 The CUT1 cDNA was cleaved out of the vector XZipLox (with Kpnl-BamHl) and the resulting 1.85 kb fragment was directionally subcloned into the Kpnl-BamHl sites of pGEM7z(f) (Promega, Madison, WI). The resulting plasmid was then fully cleaved with Xhol, but only partially cleaved with Sstl, (since the CUT cDNA has an internal Sstl site). The 1.9 kb product was isolated on an agarose gel and directionally subcloned into the Sail and Sstl sites of the vector pJD330 (Shaul and Galili 1992).
This vector contains the 355 promoter of the cauliflower mosaic virus (CaMV) which provides constitutive expression in Arabidopsis. The subcloning results in the CUTI cDNA being inserted in a sense orientation with respect to the CaMV 35S promoter. The JD330-CUTI cDNA construct was ligated with pBIN19 and the resulting binary vector was designated p35S-CUT1. This binary vector was transformed into the Agrobacterium tumefaciens strain GV3101 (Koncz and Schell, 1986), and transformants were selected on LB medium containing 25 ,g/mL gentamycin and 50 pg/mL kanamycin.
b. Transformation of Arabidopsis with the p35S-CUT transgene Arabidopsis thaliana Heynh. ecotype Columbia was transformed using a combination of in planta (Chang et al., 1994, Katavic et al., 1994) and vacuum infiltration methods (Bechtold et al., 1993). Plants were grown until the primary inflorescence shoots reached 1-2 cm in height, and then these bolts were cut off. The wound site was inoculated with 50 mL of an overnight Agrobacterium culture harbouring the p35S-CUTI plasmid. After 4-6 days a number of secondary inflorescences that appeared were cut off, and vacuum infiltration was performed on these plants using the conditions described by Bechtold et al. (1993). Screening for transformed seed was done as described previously (Katavic et al., 1994). Briefly, seed from infiltrated plants were plated out (approximately 1500 seeds/plate) on solid minimal salts nutrient medium supplemented with 50 ug/mL kanamycin. Seedlings that showed resistance were visible after approximately 8 days, because they turned green and elongated.
Plants that were derived from seed harvested from different pots were considered as independent lines.
Designations of transformed plants were as follows: the infiltrated plant--T primary transformants--T2; etc., as outlined in Katavic et al. (1994). Plants were grown at 20 0 C under continuous fluorescent illumination (100 tEm/s).
c. CUTI-suppressed plants have altered wax composition Using the above transformation methods 46 kanamycin-resistant plants were obtained from seven different pots of Arabidopsis. Of the 46 plants obtained, 36 appeared waxless, having a glossy or eceriferum (cer) phenotype. At least one cer line was obtained from each pot implying that at least seven independent events had occurred in obtaining these lines. The surfaces of these cer plants were examined by a scanning electron (SE) microscope. SE micrographs clearly demonstrate that while wildtype plants were covered with the characteristic crystals of the epicuticular wax layer, transgenic cer plants were completely devoid of any wax crystals, implying that a severe cer phenotype has been created.
WO 98/46766 PCT/CA98/00343 Plant tissue from the transgenic lines was analyzed for fatty acid composition. Plant tissue was immersed for 10 seconds in a 2:1 chloroform:methanol solution to remove surface waxes. Extracts were then evaporated to dryness under a stream of nitrogen. Waxes were dissolved in 100 pl ofN, Obis(Trimethylsilyl)trifluoroacetamide with 1% Trimethylchlorosilane (Pierce), and derivatized at 80 °C for 1 hour. Samples were analyzed in a Hewlett-Packard 5890 series II gas chromatograph equipped with a flame ionization detector, using either a DB-I column or a DB-5 column.
GLC analyses were performed at the initial temperature of 150 followed by a ramping of 4 "C/min to 320 where it was held for 10 min. Peaks were identified by the comparison of retention times to reference standards, and mass spectrometry. Quantification was based on flame ionization detector peak areas, which were converted to mass units by comparison to the internal standard, 17:0-methylester, which was added to each sample prior to the extraction.
For wax load determinations only the principal surface lipids were measured, n-nonacosane (C29 alkane), 14- and 15-nonacosanol (C29 secondary alcohol), 15-nonacosanone (C29 ketone), C22-C30 aldehydes, C22-C30 primary alcohols and C16-C30 fatty acids (Hannoufa et al., 1993). The total area of these peaks accounted for more than 90 of the total area of the sample.
The wax constituents that are found on the stems of Arabidopsis plants originate from two biosynthetic pathways (Figure The decarbonylation pathway is the major pathway, which utilizes aldehydes to produce alkanes, secondary alcohols and ketones. In Arabidopsis (ecotype Columbia), the C29 species of the wax components produced by this pathway account for almost 90% of all the stem wax.
The second pathway, the acyl-reduction pathway, produces primary alcohols, which account for approximately 5% of the total stem wax. Fatty acids and aldehydes, which are precursors for all the other wax components, are shared by both biosynthetic pathways and make up the remaining Wax composition and quantity on the stems of wild-type and several transgenic lines were examined. Wild-type Arabidopsis stems contained on average 7106 1184 mg of wax/ g dry wt. In contrast, wax loads on the stems of all shiny CUTI-suppressed lines were severely reduced. For example, the wax load on the stems on the most severe line 5 totals 483 83, only 6-7 of the wild-type wax accumulation.
Analysis of wax composition of CUTI-suppressed plants revealed that the decarbonylation pathway is almost completely shut down. The C30 aldehyde, C29 alkane, C29 secondary alcohol and C29 ketone reach only 3.5 1.4% and 2.2% of the levels found on wild-type plants, respectively.
CUTI-suppression also has a major effect on the acyl-reduction pathway, causing a reduction in the levels of primary alcohols of over 50%. In addition, the relative abundance of different classes of alcohols is changed. C30 and C28 alcohols, the major alcohol species in wild type stems, have decreased by 90%, and C24 alcohol is the most abundant class in CUTI suppressed lines. The C24 species are also the most abundant classes of aldehydes and fatty acids in waxless transgenic plants. The described compositional changes were consistent in all 13 different CUTI-suppressed lines analyzed. These changes support the proposal that the role of the CUT1 enzyme is elongation of the fatty acyl chain beyond 24 carbons.
WO 98/46766 PCT/CA98/00343 Example two: Production of conditionally male sterile CUTI-suppressed plants CUTI-suppressed Arabidopsis plants were produced as described in Example one and analyzed for male sterility. This analysis demonstrated that, in addition to stem and leaf wax synthesis, the CUTI gene product has an essential role in pollen development. Similar to cer6-2 (Preuss et al., 1993) and cerl (Aarts et al., 1995) wax-deficient mutants of Arabidopsis, CUTI-suppressed plants are completely male sterile under normal growth conditions (30 to 40% relative humidity) although they produce normal amounts of pollen. However, when grown under high humidity (90 to 100%), pollen fertility is restored to the wild-type level, indicating that male sterility/fertility is conditional and environmentally controlled, just like in cer6-2 and cerl mutants. For these two mutants, conditional male sterility is explained by alterations in the composition and content of the wax components of the tryphine layer covering the pollen grain. These long chain lipid molecules, produced in the tapetum layer of the anther, (Preuss et al., 1993) are needed in the tryphine for proper pollen-pistil signalling and pollen germination. Thus, in their absence, sterility occurs. Conditional male sterility is a valuable trait for plant breeders; being able to selectively inhibit self-fertilization of plants facilitates the production of hybrid plants. Accordingly, the CUT1 cDNA and derivatives thereof may be useful in producing conditionally male sterile plants useful in breeding programs.
Taken together, the results of Examples one and two confirm that CUT1 encodes a condensing enzyme that is involved in VLCFA biosynthesis of waxes which accumulate in the plant epidermis, as well as waxes required for the development of functional pollen grains. In addition the results show that transformation of plants using the CUT1 cDNA is useful to produce plants having modified VLCFA compositions, as well as plants that exhibit conditional male sterility.
Example three: Use of CUT1 gene promoter sequence The promoter of the CUT1 gene confers epidermis-specific expression. Accordingly, this promoter sequence may be used to produce transgene constructs that are specifically expressed in epidermal cells.
Effective epidermis-specific expression may be achieved with less than the entire 1951 bases of sequence upstream of the CUTI ORF shown in Seq. I.D. No. 12. Thus, by way of example, epidermis-specific expression may be obtained by employing the 1209 base pair promoter fragment. One of skill in the art will recognize that still smaller regions of the sequence upstream of the CUTI ORF may be used to obtain epidermis-specific expression, such as a 50 base pair or 100 base pair region of the disclosed promoter sequence.
The determination of whether a particular sub-region of the disclosed sequence operates to confer effective epidermis-specific expression in a particular system (taking into account the plant species into which the construct is being introduced, the level of expression required, etc.) will be performed using known methods, such as operably linking the promoter sub-region to a marker gene GUS), introducing such constructs into plants and then determining the level of expression of the marker gene in epidermis and other WO 98/46766 PCT/CA98/00343 plant tissues.
The present invention therefore facilitates the production, by standard molecular biology techniques, of nucleic acid molecules comprising this promoter sequence operably linked to a nucleic acid sequence, such as an open reading frame. Suitable open reading frames include open reading frames encoding any protein for which epidermis-specific expression is desired.
Having illustrated and described the principles of isolating CUTI nucleic acids, the CUTI protein and modes of use of these biological molecules, it should be apparent to one skilled in the art that the invention can be modified in arrangement and detail without departing from such principles. We claim all modifications coming within the spirit and scope of the claims presented herein.
References Aarts et al., (1997). The Arabidopsis MALE STERILITY 2 protein shares similarity with reductases in elongation/condensation complexes. Plant J. 12:615-623.
Aarts et al., (1995). Molecular characterization of the CERI gene of Arabidopsis involved in epicuticular.wax biosynthesis and pollen fertility. Plant Cell 7:2115-2127.
Altschul et al., (1994). Nature Genet., 6:119-129.
An et al., (1988). Plant Physiol. 88:547.
Ausubel et al., (1987). In Current Protocols in Molecular Biology, Greene Publishin Associates and Wiley-Intersciences.
Battey et al., (1989). "Genetic Engineering for Plant Oils: Potential and Limitations." TIBTECH 7:122- 125.
Bechtold and Pelletier, (1993). "In planta Agrobacterium Mediated Gene Transfer by Infiltration of Adult Arabidopsis thaliana Plants." C.R. Acad. Sci. Paris 316:1194-1199.
Bolton and McCarthy, (1962). Proc. Natl. Acad. Sci. USA 48:1390.
Bonner et al., (1973). J. Mol. Biol. 81:123.
Bustos et al., (1989). Plant Cell 1:839.
Callis et al., (1988). Plant Physiol. 88:965.
Carpetner et al. (1992). Preferential epxression of an -tubulin gene of Arabidopsis in pollen. The Plant Cell 4:557-571.
Chang et al., (1994). "Stable Genetic Transformation of Arabidopsis thaliana by Agrobacterium innoculation in planta." _Plant J. 5: 551-558.
Chang et al., (1994). Stable genetic transformation ofArabidopsisthaliana by Agrobacterium inoculation in planta. Plant J. 5:551-558.
Chang et al. (1986) Saccharomyces cerevisiae secretes and correctly processes human interferon hybrid 28 WO 98/46766 PCT/CA98/00343 protein containing yeast invertase signal peptides. Mol. And Cell. Biol. 6:1812-1819.
Cheesbrough and Kolattukudy, (1984). Proc. Natl. Acad. Sci. 81:6613-6617.
Corpet et al. (1988). Nucleic Acids Research 16:10881-90.
Dekeyser et al., (1990). Plant Cell 2:591.
Denis et al. (1993). Expression of engineered nuclear male sterility in Brassica napus. Plant Physiol.
101:1295-1304.
Dooner, (1990). Theor. Appl. Genet. 80: 241-245.
Eigenbrode and Espelie, (1995). Effects of plant epicuticular lipids on insect herbivores. Annu. Rev. Entomol.
40:117-142.
Fehling and Mukherjee, (1991). "Acyl-CoA Elongase From a Higher Plant (Lunaria annua): Metabolic Intermediates of Very-long-chain Acyl-CoA Products and Substrate Specificity." Biochem. Biophys.
Acta 1082: 239-246.
Fischer et al., (1997). Stilbene synthase gene expression causes changes in flower colour and male sterility in tobacco. Plant J. 11:489-498.
Fromm et al., (1989). Plant Cell 1:977.
Gan Amansino, (1995). Inhibition of leaf senescence by autoregulated production of cytokinin.
Science 270:1986-1988.
Gelvin et al., (1990). Plant Molecular Biology Manual, Kluwer Academic Publishers.
Goldberg et al., (1993). Anther development: Basic principles and practical applications. Plant Cell 5:1217- 1229.
Hannoufa et al., (1993). Epicuticular waxes of Eceriferum mutants of Arabidopsis thaliana. Phytochemistry 33: 851-855.
Haque et al., (1992). Inheritance of leaf epicuticular wax content in rice. Crop Sci. 32:865-868.
Higgins Sharp, (1988). Gene, 73:237-244.
Innis et al. (1990). PCR Protocols, A Guide to Methods and Applications, Innis et al. Academic Press, Inc., San Diego, California.
James et al. (1995). "Directed Tagging of the Arabidopsis FATTYACID ELONGATION (FAEI) Gene With the Maize Transposon Activator." Plant Cell 7:309-319.
James and Dooner, (1990). "Isolation of EMS-induced Mutants in Arabidopsis Altered in Seed Fatty Acid Composition." Theor. Appl. Genet. 80:241-245.
Jefferson et al., (1987). GUS fusions: P-glucuronidase as a sensitive and versatile gene fusion marker system in higher plants. EMBOJ. 6:3901-3907.
Jenks et al., (1994). Plant Physiol. 105:1239-1245.
Johnson and Fritz, (1989). "Fatty Acids in Industry." New York:Marcel Dekker.
29 WO 98/46766 PCT/CA98/00343 Katavic et al. (1994). "In planta Transformation of Arabidopsis thaliana." Mol. Gen. Genet. 245: 363- 370.
Katavic et al., (1994). In planta transformation of Arabidopsis thaliana. MoLGen.Genet. 245: 363-370.
Kolattukudy, (1971). Arch. Biochem. Biophys. 142:701-709.
Koncz and Schell, (1986). The promoter of TL-DNA gene 5 controls the tissue-specific expression of chimaeric genes carried by a novel type of Agrobacterium binary vector. Mol. Gen. Genet. 204:383-396.
Kuhlemeier et al., (1989). Plant Cell 1:471.
Kunst et al., (1992). "Fatty Acid Elongation in Developing Seeds of Arabidopsis thaliana." Plant Physiol. Biochem. 30:425-434.
Lassner et al., (1996). "A Jojoba P-ketoacyl-CoA Synthase cDNA Complements the Canola Fatty Acid Elongation Mutation in Transgenic Plants." Plant Cell 8: 281-292.
Lee and Douglas (1996). Manipulation of plant gene expression using antisense RNA. In: Plant Biochemistry/Molecular Biology LaboratoryManual, pp. 423-439, Dashek, ed., CRC Press, Inc., Boca Raton.
Lee et al., (1991). Proc. Natl. Acad. Sci. USA 88:6181-6185.
Lemieux et al., (1990). "Mutants of Arabidopsis With Alterations in Seed Lipid Fatty Acid Composition." Theor. Appl. Genet. 80:234-240.
Lemieux, (1996). Trends in Plant Sci. 1:312-318.
Liang Richardson, (1993). Expression and characterization of human lactoferrin in yeast (Saccharomyces cerevisiae). J. Agric. Food Chem. 41:1800-1807.
Marcotte et al., (1989). Plant Cell 1:969.
Mariani et al., (1990). Induction of male sterility in plants by a chimaeric ribonuclease gene. Nature 347:737- 741.
Millar and Kunst, (1997). Very-long-chain fatty acid biosynthesis is controlled through the expression and specificity of the condensing enzyme. Plant J. 12:121-131.
Nacken et al., (1991). Molecular characterization of two stamen-specific genes, tapl and fill, that are expressed wild type, but not the deficiens matant ofAnthirrhinum majus. Mol. Gen. Genet. 229:129-136.
Needleman Wunsch, (1970). J. Mol. Biol. 48:443.
O'Toole and Cruz, (1983). Genotypic variation in epicuticular wax of rice. Crop Sci. 23:392-394.
Odel et al., (1985). Nature 313:810.
Odell et al., (1994). Seed specific gene activation mediated by the Cre/lox site-specific recombination system. Plant Physiol. 106:447-458.
Opperman et al., (1993). Root knot nematode directed expression of a plant root specific gene. Science 263:221-223.
Pearson Lipman, (1988). Proc. Natl. Acad. Sci. USA 85:2444.
WO 98/46766 PCT/CA98/00343 Pearson et al., (1994). Methods in Molecular Biology 24:307-31.
Percy and Baker, (1990). New Phytol. 116:79-87.
Post-Beittenmiller, (1996). "Biochemistry and Molecular Biology of Wax Production in Plants." Annu.
Rev. Plant Physiol. Mol. Biol. 47: 405-430.
Pouwels et al., (1987). Cloning Vectors: A Laboratory Manual, 1985, supp.
Preuss et al., (1993). A conditional sterile mutation eliminates surface components from Arabidopsis pollen and disrupts cell signalling during fertilization. Genes Development 7:974-985.
Reicosky and Hanover, (1978). "Physiological Effects of Surface Waxes. 1. Light Reflectance for Glaucous and Nonglaucous Picea Pungens". Plant Physiol. 62:101-104.
Roshal et al., (1987). EMBO J. 6:1155.
Sambrook et al., (1989). In Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York.
Schaffner and Sheen, (1991). Plant Cell 3:997.
Scherthaner et al., (1988). EMBO J. 7:1249.
Schreiber and Schonherr, (1992). Pesti. Sci. 36:213-221.
Siebertz et al., (1989). Plant Cell 1:961.
Smith Waterman, (1981). Adv. Appl. Math. 2:482.
Smith et al., (1985). Heterologous protein secretion from yeast. Science 229:1219-1224.
Southern, (1975). J. Mol. Biol. 98:503.
Simpson et al., (1985). EMBO J. 4:2723.
Stefansson et al., (1961). "Note on the Isolation of Rape Plants With Seed Oil Free From Erucic Acid." Can. J. Plant Sci. 41:218-219.
Stockhause et al., (1997). The promoter of the gene encoding the C 4 form of phosphoenolpyruvate carboxylase directs mesophyll-specific expression in transgenic C 4 Flaveria spp. The Plant Cell 9:479-489.
Terada and Shimamoto, (1990). Mol. Gen. Genet. 220:389.
Tijssen, (1993). Laboratory Techniques in Biochemistry and Molecular Biology Hybridization with Nucleic Acid Probes Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York.
Tsuchiya et al., (1995). Tapetum-specific expression of the gene for an endo-betabeta-l,3-glucanase causes male-sterility in transgenic tobacco. Plant Cell Physiol. 36:487-494.
van der Krol et al., (1988). An antisense chalcone synthase gene in transgenic plants inhibits flower pigmentation. Nature 333:866-869.
WO 98/46766 PCT/CA98/00343 van der Krol et al., (1990). Inhibition of Plower pigmentation by antisense chs genes: promoter and minimal sequence requirements for the anitsense effect. Plant Mpl. Biol. 14:457-466.
van der Meer et al., (1992). Antisense inhibition of flavonoid biosynthesis in petunia anthers results in male sterility. Plant Cell 253:-262.
Voelker et al., (1992). Fatty acid biosynthesis redirected to medium chains in transgenic oilseed plants.
Science 257:72-74.
von Wettstein-Knowles, (1982). "Elongase and Epicuticular Wax Biosynthesis." Physiol._Veg. 20:797- 809.
Weissbach and Weissbach, (1989). Methods for Plant Molecular Biology, Academic Press.
Worrall et al., (1992). Premature dissolution of the microsporocyte callose wall causes male sterility in transgenic tobacco. Plant Cell 4:759-771.
IMI
WO 98/46766 PCT/CA98/00343 SEQUENCE LISTING GENERAL INFORMATION APPLICANT: The University of British Columbia (ii) TITLE OF INVENTION: Nucleic Acids Encoding Plant Enzyme Involved In Very Long Chain Fatty Acid Synthesis (iii) NUMBER OF SEQUENCES: 12 (iv) CORRESPONDENCE ADDRESS: ADDRESSEE: Sim McBurney STREET: 6th Floor, 330 University Avenue CITY: Toronto PROVINCE: Ontario COUNTRY: Canada POSTAL CODE: M5G 1R7 COMPUTER READABLE FORM: MEDIUM TYPE: Disk, COMPUTER: IBM PC compatible OPERATING SYSTEM: Windows SOFTWARE: ASCII (vi) CURRENT APPLICATION DATA: APPLICATION NUMBER: FILING DATE: 14 April 1998
CLASSIFICATION:
(vii) PRIOR APPLICATION DATA: APPLICATION NUMBER:60/043,831 FILING DATE: April 14, 1997 (vii) PRIOR APPLICATION DATA: APPLICATION NUMBER: FILING DATE: April 10, 1998 (viii) PATENT AGENT INFORMATION NAME: RAE, Patricia A.
REFERENCE/DOCKET NUMBER: 3055-18/PAR (ix) TELECOMMUNICATION
INFORMATION:
TELEPHONE: (416) 595-1155 TELEFAX: (416) 595-1163 INFORMATION FOR SEQ ID NO: 1: SEQUENCE CHARACTERISTICS: LENGTH: 3712 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: TAGTGCTTTA TATATGTTTG ATACTTCTGT TTGGCAATAT CAATCATAGT AGAAAAGATA TGGACTTCAT TTGAGGTTTT TGGTGGATTG TGTCTATATG 100 TGAAATCATG GGATCTCAAG ATTTGTCTGC ATTCAGTTTC CAAGTCAAAC 150 ATCGTAACTA CTGTTTGATT TTCCCTCATG CTTGCAGTTT TCATGGATAT 200 CTCAAGATTT GTCTTCTTGC ACTTTCCAAG TCAAACATAA AGTAACTACT 250 WO 98/46766 WO 9846766PCT/CA98/00343 GATTGATATT CCCTCGTGTA CAAGTAGAGG AATTTCATAG TCGTATTTTG ATAACATTTA CAGTTTTTTT TTAATACATT ATGTTACTTC TTTTTTTGGA GGAATTGTTC ATGCTTTTTT CATAGACCAG TTATTACATG TCAGTATATT TTGGTATAGT AATTTAAAAT CTATATTTGA TAATTACCTA AATTTTAAGT TCGGTTGTTG ACGATTAACC AAATAACAAA ACATGTAACT CCCGAATATA TATGTATACC CACGTACATG GGTGATAGGT TGAATTCGTC TTTTTGGGTA GGATTACCAA TTCTTTCATT GAGAAACATA TAGAGTTTTG TGCAATATTT AGGGATGGAC GTGCTATATG AATCGTTTCG TATATAAACA AATTCCAACA TGCGGGTTAT TTAATTACCT TAACGGGTAA ACCAAACCAA AATTTGACAC AAACTAATGA GACCATTTTT GTTTTTGAGA GTACTAAGTA TTTATATCCA CCGGGCAACG TGAACGTGAT GATCTCGATA AAACTAAAAG
TTACCCTCTT
TGAATTCAAA
GTTATTCCTT
TAGTGTTGGT
AATAAATTAT
GATACAATAG
AATCGCCAAA
CTCCAACATA
CATTTCAAAG
CAAATGTGAA
ATGCAAAAAA
CTTGTAGATA
TATAATTTCT
CCAXACTCAC
TAAACGTACA
TATGGTACCA
ATATGTTTTC
ACAAGGTAAT
CATGGGTACT
AAATCAAGTT
ATCATATTAC
ACCGGATATT
ATATCTAAAT
ACCATAATAT
CCTTTAGTCA
CATCAAATCA
CTGACACGTC
TCAAATGACA
AGATTAACTG
TTCTTTTTTT
TTGGTTCAAT
TCATTCTTTC
TATACCATTT
ACAACACTAA
CAATCATAAA
TTTAACAACA
TTATATTTTA
GAAACATTAA
TACATGTATC
CTGATTTTCA
AAGTAAAAGT
TTTAATTTAC
GACAGAGTTA
TTGGATAAAT
ATATGCCTTT
AAAATTATTT
TTTGCTAAAA
TTGTAATATC
GAACTATTAA
TATGTTACTG
AAATTACAGG
CAGTACCAAT
AAGTAGTTAC
TTGCTGTTTC
CAATTGGGCC
TATTCCACCG
TCTTCTGCAA
GAAATATTAT
TACTATAAAA
CAAAAGATAC
AATCAGAAAA
ACCTCTGTGA
TAGTTCTAAA
CTCTTCGATA
TTGCGAATGT
GACATTTAAA
CGCTACCTGC
TTACGTACAG
ACGTAAGAAA
AGGCAAACAA
ATTAAATTGA
TAAGGTATAT
GTCCTTACTT
CTAGTTTATT
ATTCGTATGT
AAATCTTGTA
CTATGATAAC
TACGTGACAA
ATTGCGCCTA
CAAACGCTTT
TTAATTTATT
300 350 400 450 500 550 600 650 700 750 800 850 900 950 1000 1050 1100 1150 1200 1250 1300 1350 1400 1450 1500 1550 1600 WO 98/46766 WO 9846766PCT/CA98/00343 TCTCTTACAA CGACAATTTT AACAGTCCTT ATCATTTGCT GGAAATCGAA GAGAAGTATT AAATGACAAA AAATTAAATA GGTCTCTTCA TTAACTCCTC ATCCTTCACC TTCCCTCTCT CTAATCACAT TTTGTAACAA G ATG CCT CAG GCA CCG Met Pro Gin Ala Pro GAGAAATATG AAATTTTTAT ATCGAAAGGG CCCATCACTT GCTTTTGTCT AGTTACAACT ACAAAAACAT TTTTCTCGTC ATTTATAAAA GAGAGCAAAG CAAGAGCGTT GGGTGACGTT TCATCTACCC CTTCCTCTGT TCGCCTTTAT CATCTTCATT AACTCATCTT CAAAAATACC TAATACAATT ATACATTAAA ACTCTCCGAC ATG CCA GAG TTC TCT AGC TCG GTG Met Pro Glu Phe Ser Ser Ser Val 1650 1700 1750 1800 1850 1900 1950 1990 AAG CTC Lys Leu CAT TTC His Phe AAG TAC GTG AAA Lys Tyr Val Lys
CTT
Leu
TTG
Leu GGT TAC CAA TAT TTG GTT AAC Gly Tyr Gin Tyr Leu Val Asn
TTG
Leu AGT TTT CTT Ser Phe Leu GCC GTT GAG Ala Val Glu GTT TGG AAT Val Trp Asn
ATC
Ile 35
GGT
Gly CCG ATC ATG GCT Pro Ile Met Ala ATT GTC Ile Val CTT AAT Leu Asn
CTT
Leu
TCA
Ser CTT CGG ATG Leu Arg Met
CCT
Pro 50
CTA
Leu GAA GAG ATC Glu Glu Ile
TGT
Cys
CTC
Leu
GTC
Val CAG TTT GAC Gin Phe Asp
GTT
Val
ACT
Thr CAG GTT CTA Gin Val Leu
TCT
Ser
TCC
Ser TCC TTC TTT Ser Phe Phe
ATC
Ile 75
ATC
Ile TTC ATC TCC Phe Ilie Ser
GTT
Val1
TAT
Tyr
AAG
Lys CCA CGC ACC Pro Arg Thr AAG CCA CCT Lys Pro Pro 100 GAA CAC TCT Giu His Ser GAG TTC CAA Glu Phe Gin
TAC
Tyr 90
CGT
Arg CTC GTT GAC Leu Val Asp TAC TTC ATG Tyr Phe Met TCT TGT TAC Ser Cys Tyr ACT TTC ATG Thr Phe Met 2032 2074 2116 2158 2200 2242 2284 2326 2368 GTC ACG TGT Val Thr Cys
GTC
Val1 105
AAG
Lys CCC TTC GCA Pro Phe Ala 110 AGC GTC Ser Val
CGT
Arg 115
ATG
Met TTG ATC CTC Leu Ilie Leu
GAC
Asp 120
CGT
Arg AAG CCT AAG Lys Pro Lys 125
GAG
Giu
AGA
Arg 130 ATC CTT GAA Ile Leu Glu
TCT
Ser 135 GGC CTC GGT Gly Leu Gly WO 98/46766 PCT/CA98/00343
GAG
Glu 140 ACT TGT CTC CCT CCG GCT ATT CAT TAT Thr Cys Leu Pro Pro Ala Ile His Tyr 145
ATT
Ile 150 CCT CCC ACA Pro Pro Thr ATG GTT ATC Met Val Ile 165 2410 CCA ACC Pro Thr 155 ATG GAC GCG GCT Met Asp Ala Ala
AGA
Arg 160 AGC GAG GCT CAG Ser Glu Ala Gin 2452 TTC GAG GCC Phe Glu Ala 170 CCT AAA GAC Pro Lys Asp ATG GAC GAT CTT Met Asp Asp Leu
TTC
Phe 175 AAG AAA ACC GGT Lys Lys Thr Gly CTT AAA Leu Lys 180 2494 2536 GTC Val 185 GAC ATC CTT ATC Asp Ile Leu Ile
GTC
Val 190 AAC TGC TCT CTT Asn Cys Ser Leu
TTC
Phe 195 TCT CCC ACA CCA Ser Pro Thr Pro
TCG
Ser 200 CTC TCA GCT ATG Leu Ser Ala Met
GTC
Val 205 ATC AAC AAA TAT Ile Asn Lys Tyr 2578 2620
AAG
Lys 210 CTT AGG AGT AAT Leu Arg Ser Asn
ATC
Ile 215 AAG AGC TTC AAT Lys Ser Phe Asn
CTT
Leu 220 TCG GGG ATG Ser Gly Met GCC CGC GAC Ala Arg Asp 235 GGC TGC Gly Cys 225 AGC GCG GGC CTG Ser Ala Gly Leu
ATC
Ile 230 TCA GTT GAT CTA Ser Val Asp Leu 2662 TTG CTC CAA Leu Leu Gin 240 ACG GAG ATC Thr Glu Ile GTT CAT CCC AAT Val His Pro Asn
TCA
Ser 245 AAT GCA ATC ATC Asn Ala Ile Ile GTC AGC Val Ser 250 2704 2746
ATA
Ile 255 ACG CCT AAT TAC Thr Pro Asn Tyr
TAT
Tyr 260 CAA GGC AAC GAG Gin Gly Asn Glu
AGA
Arg 265 GCC ATG TTG TTA Ala Met Leu Leu
CCC
Pro 270 AAT TGT CTC TTC Asn Cys Leu Phe
CGC
Arg 275 ATG GGT GCG GCA Met Gly Ala Ala CGG TGG CGA GCC Arg Trp Arg Ala 290 2788 2830
GCC
Ala 280 ATA CAC ATG TCA Ile His Met Ser
AAC
Asn 285 CGC CGG TCT GAC Arg Arg Ser Asp AAA TAC Lys Tyr 295 AAG CTT TCC CAC Lys Leu Ser His
CTC
Leu 300 GTC CGG ACA CAC Val Arg Thr His CGT GGC GCT Arg Gly Ala 305 2872 GAC GAC AAG Asp Asp Lys 310 TCT TTC TAC TGT Ser Phe Tyr Cys
GTC
Val 315 TAC GAA CAG GAA Tyr Glu Gin Glu GAC AAA Asp Lys 320 2914 GAA GGA CAC GTT GGC ATC AAC TTG TCC AAA GAT CTC ATG GCC Glu Gly His Val Gly Ile Asn Leu Ser Lys Asp Leu Met Ala 2956 WO 98/46766 WO 9846766PCT/CA98/00343 325 330 335 ATC GCC GGT GAA Ile Ala Gly Giu
GCC
Al a 340 CTC AAG GCA AAC ATC ACC ACA ATA GGT Leu Lys Ala Asn Ile Thr Thr Ile Gly 345 2998
CCT
Pro 350 TTG GTC CTA CCG Leu Val Leu Pro
GCG
Al a 355 TCA GAA CAA CTT Ser Giu Gin Leu
CTC
Leu 360 TTC CTC ACG Phe Leu Thr TGG AAA CCA Trp Lys Pro 375 3040 3082 TCC CTA Ser Leu 365 ATC GGA CGT AAA Ile Gly Arg Lys
ATC
Ile 370 TTC AAC CCG AAA Phe Asn Pro Lys TAC ATA CCG Tyr Ilie Pro 380 CAC GCA GGA His Ala Gly GAT TTC AAG CTG Asp Phe Lys Leu
GCC
Al a 385 TTC GAA CAC TTT Phe Glu His Phe TGC ATT Cys Ile 390
GGC
Giy 395 AGA GCG GTG ATC Arg Ala Val Ile
GAC
Asp 400 GAG CTC CAA AAG Giu Leu Gin Lys
AAT
Asn 405 3124 3166 3208 CTA CAA CTA TCA Leu Gin Leu Ser
GGA
Gly 410 GAA CAC GTT GAG Glu His Val Glu
GCC
Al a 415 TCA AGA ATG ACA Ser Arg Met Thr
CTA
Leu 420 CAT CGT TTT GGT AAC ACG TCA TCT TCA His Arg Phe Gly Asn Thr Ser Ser Ser
TCG
Ser 430 TTA TGG TAC Leu Trp Tyr AGG AGA GGC Arg Arg Gly 445 3250 3292 GAG CTT Giu Leu 435 AGC TAC ATC GAG Ser Tyr Ile Glu
TCT
Ser 440 AAA GGG AGA ATG Lys Gly Arg Met GAT CGC Asp Arg
GTT
Val 450 TGG CAA ATC GCG Trp Gin Ile Ala
TTT
Phe 455 GGG AGT GGT TTC Gly Ser Gly Phe AAG TGT Lys Cys 460 3334 AAC TCT GCC Asn Ser Ala
GTG
Val 465 TGG AAA TGT Trp Lys Cys AAC CGT ACG ATT Asn Arg Thr Ile 470 AAG ACA CCT Lys Thr Pro 475 3376 AAG GAC GGA CCA Lys Asp Gly Pro
TGG
Trp 480 TCC GAT TGT ATC GAC CGT TAC CCT GTC Ser Asp Cys Ile Asp Arg Tyr Pro Val 485 3418
TTT
Phe 490 ATT CCC GAA GTT GTC AAA CTC Ile Pro Giu Val Val Lys Leu 495 TAA ACTGA 3450 AAACGTCTTT GAACGGTTTA GTAACGGTTT GATTTTGTGT TACGGTTTAG 3500 GTTTATTTGG TCTCGGGATT TGGTTTAAAG GGGATTGAGA AATGGGAAGT 3550 WO 98/46766 PCT/CA98/00343 TAGAGAGAAG AAAAAGCAAA GCATAAATGT TTGTATTTAA TTGCTCTGCA TATACTTAAA TCTCTGCTTT TCATTTGGGG TATTTTTTAG TTTCTCGTGC TGTAATTAAT AACTTGTGGT GTACTCAAAT AAGAATATTT CTCTCTGTTT AAAAAAAAAA AAAAAAAAAA AA INFORMATION FOR SEQ ID NO: 2 SEQUENCE CHARACTERISTICS: LENGTH: 1807 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
AAATACC
CTAATCACAT TTTGTAACAA TAATACAATT ATACATTAAA ACTCTCCGAC G ATG CCT CAG GCA CCG ATG CCA GAG TTC TCT AGC TCG GTG Met Pro Gin Ala Pro Met Pro Glu Phe Ser Ser Ser Val 3600 3650 3700 3712 AAG CTC Lys Leu AAG TAC GTG AAA Lys Tyr Vai Lys
CTT
Leu 20 GGT TAC CAA TAT Gly Tyr Gln Tyr TTG GTT AAC Leu Vai Asn 139 CAT TTC TTG AGT His Phe Leu Ser GCC GTT GAG CTT Ala Val Giu Leu 45 TTT CTT TTG Phe Leu Leu
ATC
Ile 35 CCG ATC ATG GCT ATT GTC Pro Ile Met Ala Ile Val CTT CGG ATG GGT Leu Arg Met Gly
CCT
Pro GAA GAG ATC CTT Glu Giu Ile Leu 223 GTT TGG AAT TCA Val Trp AsnSer
CTC
Leu CAG TTT GAC CTA Gin Phe Asp Leu
GTT
Val CAG GTT CTA TGT Gin Val Leu Cys 265
TCT
Ser TCC TTC TTT GTC Ser Phe Phe Val
ATC
Ile 75 TTC ATC TCC ACT Phe Ile Ser Thr
GTT
Val TAC TTC ATG Tyr Phe Met TCT TGT TAC Ser Cys Tyr 307 TCC AAG Ser Lys CCA CGC ACC ATO Pro Arg Thr Ile
TAC
Tyr 90 CTC GTT GAC TAT Leu Val Asp Tyr 349 AAG CCA CCT GTC Lys Pro Pro Val 100 ACG TGT CGT Thr Cys Arg
GTC
Val 105 CCC TTC GCA ACT Pro Phe Ala Thr TTC ATG Phe Met 110 GAA CAC TCT CGT TTG ATC CTC AAG GAC AAG CCT AAG AGC GTC Glu His Ser Arg Leu Ile Leu Lys Asp Lys Pro Lys Ser Vai 433 WO 98/46766 WO 9846766PCT/CA98/00343 120 GAG TTC CAA ATG Glu Phe Gin Met
AGA
Arg 130 ATC CTT GAA CGT Ile Leu Glu Arg
TCT
Ser 135 GGC CTC GGT GAG Gly Leu Gly Glu 475 517
GAG
Glu 140 ACT TGT CTC CCT Thr Cys Leu Pro
CCG
Pro 145 GCT ATT CAT TAT Ala Ile His Tyr
ATT
Ile 150 CCT CCC ACA Pro Pro Thr ATG GTT ATC Met Val Ile 165 CCA ACC Pro Thr 155 ATG GAC GCG GCT AGA !1et Asp Ala Ala Arg 160 AGC GAG GCT CAG Ser Glu Ala Gin 559 TTC GAG GCC Phe Glu Ala 170 CCT AAA GAC Pro Lys Asp ATG GAC GAT CTT Met Asp Asp Leu
TTC
Phe 175 AAG AAA ACC GGT Lys Lys Thr Gly OTT AAA Leu Lys 180
GTC
Val 185 GAC ATC CTT ATC Asp Ile Leu Ile
GTC
Val1 190 AAC TGC TCT CTT Asn Cys Ser Leu
TTC
Phe 195 TCT CCC ACA CCA Ser Pro Thr Pro
TCG
S er 200 CTC TCA GCT ATG Leu Ser Ala Met ATC AAC AAA TAT Ile Asn Lys Tyr 601 643 685 727 769
AAG
Lys 210 CTT AGG AGT AAT Leu Arg Ser Asn
ATC
Ile 215 AAG AGC TTC AAT Lys Ser Phe Asn
CTT
Leu 220 TCG GGG ATG Ser Gly Met GOC CGC GAC Ala Arg Asp 235 GGC TGC Gly Cys 225 AGO GCG GGC CTG Ser Ala Gly Leu
ATC
Ile 230 TCA GTT GAT CTA Ser Val Asp Leu TTG CTC CAA Leu Leu Gin 240 ACG GAG ATC Thr Giu Ile GTT CAT CCC AAT Val His Pro Asn
TCA
Ser 245 AAT GCA ATC ATC Asn Ala Ile Ile GTC AGC Val Ser 250
ATA
Ile 255 ACG CCT AAT TAC Thr Pro Asn Tyr
TAT
Tyr 260 CAA GGC AAC GAG Gin Gly Asn Glu
AGA
Arg 265 GCC ATG TTG TTA Ala Met Leu Leu
COO
Pro 270 AAT TGT CTC TTC Asn Cys Leu Phe
CGC
Arg 275 ATG GGT GCG GCA Met Gly Ala Ala 811 853 895 937 979
GC
Al a 280 ATA CAC ATG TCA Ile His Met Ser
AAC
Asn 285 CGC CGG TCT GAC Arg Arg Ser Asp
CGG
Arg 290 TGG CGA GCC Trp Arg Ala CGT GGC GCT Arg Gly Ala 305 AAA TAC Lys Tyr 295 AAG CTT TCC CAC Lys Leu Ser His CTC GTC Leu Val 300 CGG ACA CAC Arg Thr His WO 98/46766 WO 9846766PCT/CA98/00343 GAC GAC AAG Asp Asp Lys 310 TCT TTC TAC TGT Ser Phe Tyr Cys
GTC
Val 315 TAC GAA CAG GAA Tyr Glu Gin Glu GAC AAA Asp Lys 320 1021 GAA GGA CAC Glu Gly His
GTT
Val 325 GGC ATC AAC TTG TCC AAA GAT CTC ATG Gly Ile Asn Leu Ser Lys Asp Leu Met 330
GCC
Al a 335 1063 ATC GCC GGT GAA Ile Ala Gly Giu
GCC
Al a 340 CTC AAG GCA AAC Leu Lys Ala Asn
ATC
Ile 345 ACC ACA ATA GGT Thr Thr Ile Gly
CCT
Pro 350 TTG GTC CTA CCG Leu Val Leu Pro
GCG
Al a 355 TCA GAA CAA CTT Ser Glu Gin Leu
CTC
Leu 360 TTC CTC ACG Phe Leu Thr TGG AAA CCA Trp Lys Pro 375 TCC CTA Ser Leu 365 ATC GGA CGT AAA Ile Gly Arg Lys
ATC
Ile 370 TTC AAC CCG AAA Phe Asn Pro Lys TAC ATA ccc; Tyr Ie Pro 380 CAC GCA GGA His Ala Giy GAT TTC AAG CTG Asp Phe Lys Leu
GCC
Al a 385 TTC GAA CAC TTT Phe Giu His Phe TGC ATT Cys Ile 390 1105 1147 1189 1231 1273 1315 1357
GGC
Gly 395 AGA GCG GTG ATC Arg Ala Val Ile
GAC
Asp 400 GAG CTC CAA AAG Giu Leu Gin Lys
AAT
Asn 405 CTA CAA CTA TCA Leu Gin Leu Ser
GGA
Gly 410 GAA CAC GTT GAG Giu His Val Glu
GCC
Al a 415 TCA AGA ATG ACA Ser Arg Met Thr
CTA
Leu 420 CAT CGT TTT GGT His Arg Phe Gly
AAC
Asn 425 ACG TCA TCT TCA Thr Ser Ser Ser
TCG
Ser 430 TTA TGG TAC Leu Trp Tyr AGG AGA GGC Arg Arg Gly 445 GAG CTT Glu Leu 435 AGC TAC ATC GAG TCT AAA GGG AGA ATG Ser Tyr Ile Glu Ser Lys Gly Arg Met 440 1399 GAT CGC GTT Asp Arg Val 450 AAC TCT GCC Asn Ser Ala TGG CAA ATC GCG Trp Gin Ile Ala
TTT
Phe 455 GGG AGT GGT TTC Gly Ser Gly Phe AAG TGT Lys Cys 460
GTG
Val 465 TGG AAA TGT AAC Trp Lys Cys Asn
CGT
Arg 470 ACG ATT AAG ACA Thr Ile Lys Thr
CCT
Pro 475 1441 1483 1525 AAG GAC GGA CCA Lys Asp Gly Pro
TGG
Trp 480 TCC GAT TGT ATC Ser Asp Cys Ile
GAC
Asp 485 COT TAC CCT GTC Arg Tyr Pro Val TTT ATT CCC GAA GTT GTC AAA CTC Phe Ile Pro Glu Val Val Lys Leu TAA ACTGA 1557 WO 98/46766 WO 9846766PCT/CA98/00343 *AAACGTCTTT GAACGGTTTA GTAACGGTTT GATTTTGTGT TACGGTTTAG GTTTATTTGG TCTCGGGATT TGGTTTAAAG GGGATTGAGA AATGGGAAGT TAGAGAGAAG AAAAAGCAAA GCATAAATGT TTGTATTTAA TTGCTCTGCA TATACTTAAA TCTCTGCTTT TCATTTGGGG TATTTTTTAG TTTCTCGTGC TGTAATTAAT AACTTGTGGT GTACTCAAAT AAGAATATTT CTCTCTGTTT 1607 1657 1707 1757 1807 INFORMATION FOR SEQ ID NO: 3: SEQUENCE CHARACTERISTICS: LENGTH: 1491 TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ATG
Met 1 CCT CAG GCA Pro Gin Ala
CCG
Pro 5 ATG CCA GAG TTC Met Pro Glu Phe
TCT
Ser AGC TCG GTG Ser Ser Val AAG CTC Lys Leu 1s AAG TAC GTG AAA Lys Tyr Val Lys
CTT
Leu 20 GGT TAC CAA TAT Gly Tyr Gin Tyr TTG GTT AAC Leu Val Asn CAT TTC TTG His Phe Leu AGT TTT CTT TTG Ser Phe Leu Leu
ATC
Ile 35 CCG ATC ATG GCT Pro Ile Met Ala ATT GTC Ile Val GCC GTT GAG CTT CTT CGG ATG GGT Ala Val Giu Leu Leu Arg Met Gly
CCT
Pro GAA GAG ATC CTT Giu Glu Ile Leu
AAT
Asn 165 207 GTT TGG AAT TCA Val Trp Asn Ser
CTC
Leu CAG TTT GAC CTA Gln Phe Asp Leu CAG GTT CTA TGT Gln Val Leu Cys
TCT
Ser TCC TTC TTT GTC Ser Phe Phe Val
ATC
Ile 75 TTC ATC TCC ACT Phe Ile Ser Thr
GTT
Val TAC TTC ATG Tyr Phe Met TCT TGT TAC Ser Cys Tyr 249 291 TCC AAG Ser Lys CCA CGC ACC ATC Pro Arg Thr Ile
TAC
Tyr 90 CTC GTT GAC TAT Leu Val Asp Tyr AAG CCA CCT Lys Pro Pro 100 GTC ACG TGT CGT Val Thr Cys Arg CCC TTC GCA ACT Pro Phe Ala Thr TTC ATG Phe Met 110 333 GAA CAC TCT CGT TTG ATC CTC AAG GAC AAG CCT AAG AGC Glu His Ser Arg Leu Ile Leu Lys Asp Lys Pro Lys Ser 115 120
GTC
Val1 125 375 WO 98/46766 PCT/CA98/00343 GAG TTC CAA ATG Glu Phe Gin Met
AGA
Arg 130 ATC CTT GAA CGT Ile Leu Glu Arg
TCT
Ser 135 GGC CTC GGT GAG Gly Leu Gly Glu 417 459
GAG
Glu 140 ACT TGT CTC CCT Thr Cys Leu Pro
CCG
Pro 145 GCT ATT CAT TAT Ala Ile His Tyr
ATT
Ile 150 CCT CCC ACA Pro Pro Thr ATG GTT ATC Met Val Ile 165 CCA ACC Pro Thr 155 ATG GAC GCG GCT Met Asp Ala Ala
AGA
Arg 160 AGC GAG GCT CAG Ser Glu Ala Gin TTC GAG GCC Phe Glu Ala 170 ATG GAC GAT CTT Met Asp Asp Leu
TTC
Phe 175 AAG AAA ACC GGT Lys Lys Thr Gly CTT AAA Leu Lys 180 CCT AAA GAC Pro Lys Asp
GTC
Val 185 GAC ATC CTT ATC Asp Ile Leu Ile
GTC
Val 190 AAC TGC TCT CTT Asn Cys Ser Leu
TTC
Phe 195 543 585 627 669 TCT CCC ACA CCA Ser Pro Thr Pro
TCG
Ser 200 CTC TCA GCT ATG Leu Ser Ala Met
GTC
Val 205 ATC AAC AAA TAT Ile Asn Lys Tyr
AAG
Lys 210 CTT AGG AGT AAT Leu Arg Ser Asn
ATC
Ile 215 AAG AGC TTC AAT Lys Ser Phe Asn
CTT
Leu 220 TCG GGG ATG Ser Gly Met GCC CGC GAC Ala Arg Asp 235 GGC TGC Gly Cys 225 AGC GCG GGC CTG Ser Ala Gly Leu
ATC
Ile 230 TCA GTT GAT CTA Ser Val Asp Leu TTG CTC CAA Leu Leu Gin 240 GTT CAT CCC AAT Val His Pro Asn
TCA
Ser 245 AAT GCA ATC ATC Asn Ala Ile Ile GTC AGC Val Ser 250 ACG GAG ATC Thr Glu Ile
ATA
Ile 255 ACG CCT AAT TAC Thr Pro Asn Tyr
TAT
Tyr 260 CAA GGC AAC GAG Gin Gly Asn Glu
AGA
Arg 265 753 795 837 879 GCC ATG TTG TTA Ala Met Leu Leu
CCC
Pro 270 AAT TGT CTC TTC Asn Cys Leu Phe
CGC
Arg 275 ATG GGT GCG GCA Met Gly Ala Ala
GCC
Ala 280 ATA CAC ATG TCA Ile His Met Ser
AAC
Asn 285 CGC CGG TCT GAC Arg Arg Ser Asp
CGG
Arg 290 TGG CGA GCC Trp Arg Ala CGT GGC GCT Arg Gly Ala 305 AAA TAC Lys Tyr 295 AAG CTT TCC CAC Lys Leu Ser His
CTC
Leu 300 GTC CGG ACA CAC Val Arg Thr His GAC GAC AAG TCT TTC TAC TGT GTC TAC GAA CAG GAA GAC AAA WO 98/46766 WO 9846766PCT/CA98/00343 Asp Asp Lys Ser Phe Tyr Cys 310 Val1 315 Tyr Glu Gin Glu Asp Lys 320 GAA GGA CAC Glu Gly His
GTT
Val 325 GGC ATC AAC TTG Gly Ile Asn Leu
TCC
Ser 330 AAA GAT CTC ATG Lys Asp Leu Met
GCC
Al a 335 ATO GCC GGT GAA Ile Ala Gly Glu
GCC
Ala 340 CTC AAG GCA AAC Leu Lys Ala Asn
ATC
Ile 345 ACC ACA ATA GGT Thr Thr Ile Gly 1005 1047 1089 1131
CCT
Pro 350 TTG GTC CTA CCG Leu Val Leu Pro
GCG
Al a 355 TCA GAA CAA CTT Ser Glu Gln Leu
CTC
Leu 360 TTC CTC ACG Phe Leu Thr TGG AAA CCA Trp Lys Pro 375 TCC CTA Ser Leu 365 ATC GGA CGT AAA Ile Gly Arg Lys
ATC
Ile 370 TTC AAC CCG AAA Phe Asn Pro Lys TAC ATA CCG Tyr Ile Pro 380 CAC GCA GGA His Ala Gly GAT TTC AAG CTG Asp Phe Lys Leu
GCC
Al a 385 TTC GAA CAC TTT Phe Glu His Phe TGC ATT Cys Ile 390
GGC
Gly 395 AGA GCG GTG ATC Arg Ala Val Ile
GAC
Asp 400 GAG CTC CAA AAG, Giu Leu Gin Lys
AAT
Asn 405 CTA CAA CTA TCA Leu Gin Leu Ser
GGA
Gly 410 GAA CAC GTT GAG Giu His Val Giu
GCC
Aia 415 TCA AGA ATG ACA Ser Arg Met Thr 1173 1215 1257 1299 1341
CTA
Leu 420 CAT*CGT TTT GGT His Arg Phe Gly
AAC
Asn 425 ACG TCA TCT TCA Thr Ser Ser Ser
TCG
Ser 430 TTA TGG TAC Leu Trp Tyr AGG AGA GGC Arg Arg Gly 445 GAG CTT Giu Leu 435 AGC TAC ATC GAG Ser Tyr Ile Glu
TCT
Ser 440 AAA GGG AGA ATG Lys Gly Arg Met GAT CGC GTT Asp Arg Val 450 AAC TCT GCC Asn Ser Ala TGG CAA ATC GCG Trp Gln Ile Ala
TTT
Phe 455 GGG ACT GCT TTC Gly Ser Gly Phe AAG TGT Lys Cys 460
GTG
Val 465 TGG AAA TGT AAC CGT ACG Trp Lys Cys Asn Arg Thr ATT AAC ACA CCT Ile Lys Thr Pro 475 CGT TAC CCT GTC Arg Tyr Pro Val 1383 1425 1467 AAG CAC GCA CCA Lys Asp Gly Pro
TGG
Trp 480 TCC CAT TGT ATC Ser Asp Cys Ile
GAC
Asp 485 TTT ATT CCC CPA GTT GTC AAA CTC Phe Ile Pro Ciu Val Val Lys Leu 1491 WO 98/46766 WO 9846766PCT/CA98/00343 INFORMATION FOR SEQ ID NO: 4 SEQUENCE CHARACTERISTICS: LENGTH: 497 TYPE: amino acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: Met Pro Gin Ala Pro Met 1 Tyr Leu Gly Val 65 Tyr Tyr His Met Pro 145 Ser Lys Ser Tyr Cys 225 Val Val Leu Pro Gin Phe Lys Ser Arg 130 Ala Giu Thr Leu Lys 210 Ser His Lys Ile 35 Giu Val Met Pro Arg 115 Ile Ile Al a Gly Phe 195 Leu Al a Pro Leu Pro Giu Leu Ser Pro 100 Leu Leu His Gin Leu 180 Ser Arg Giy Asn 5 Gly Ile Ile Cys Lys Vai Ile Glu Tyr Met 165 Lys Pro Ser Leu Ser Tyr Met Leu Ser 70 Pro Thr Leu Arg Ile 150 Vai Pro Thr Asn Ile 230 Asn Pro Glu Phe Gin Tyr Leu 25 Ala Ile Val 40 Asn Vai Trp 55 Ser Phe Phe Arg Thr Ilie Cys Arg Vai 105 Lys Asp Lys 120 Ser Giy Leu 135 Pro Pro Thr Ile Phe Glu Lys Asp Vai i85 Pro Ser Leu 200 Ilie Lys Ser 215 Ser Val Asp Aia Ile Ile Asn Giu Arg 265 10 VJal Al a Asn Val1 Tyr 90 Pro Pro Gly Pro Ala 170 Asp Ser Phe Leu Val 250 Al a Asn Val1 Ser Ile 75 Leu Phe Lys Glu Thr 155 Met Ile Al a Asn Al a 235 Ser Met Ser Ser Ser Val Lys Leu Lys His Giu Leu Phe Vai Ala Ser Glu 140 Met Asp Leu Met Leu 220 Arg Thr Leu Phe Leu Gin Ile Asp Thr Val 125 Thr Asp Asp Ile Val 205 Ser Asp Glu Leu Leu Leu Phe Ser Tyr Phe 110 Giu Cys Ala Leu Val 190 Ile Gly Leu Ile Pro 270 is Ser Arg Asp Thr Ser Met Phe Leu Aila Phe 175 Asn Asn Met Leu Ile 255 Asn Phe Met Leu Val Cys JGlu Gin Pro Arg 160 Lys Cys Lys Giy Gin 240 Thr Cys 245 Pro Asn Tyr Tyr Gin Gly 260 WO 98/46766 Leu Asp Arg 305 Lys Al a Leu Lys Al a 385 Giu Arg Tyr Arg Val1 465 Ser Leu 497 Phe Arg 290 Gly Glu Gly Pro Ile 370 Phe Leu Met Glu Val 450 Trp Asp Arg 275 Trp Ala Gly Glu Al a 355 Phe Glu Gin Thr Leu 435 Trp Lys Cys Met Arg Asp His Ala 340 Ser Asn His Lys Leu 420 Ser Gln Cys Ile Gly Al a Asp Val 325 Leu Glu Pro Phe Asn 405 His Tyr Ile Asn Asp 485 Ala Ala Lys Tyr 295 Lys Ser 310 Gly Ile Lys Ala Gin Leu Lys Trp 375 Cys Ile 390 Leu Gin Arg Phe Ile Glu Ala Phe 455 Arg Thr 570 Arg Tyr Al a 280 Lys Phe Asn Asn Leu 360 Lys His Leu Giy Ser 440 Gly Ile Pro Ile Leu Tyr Leu Ile 345 Phe Pro Al a Ser Asn 425 Lys Ser Lys Val His Met Ser His Cys Val 315 Ser Lys 330 Thr Thr Leu Thr Tyr Ile Gly Gly 395 Gly Glu 410 Thr Ser Gly Arg Gly Phe Thr Pro 475 Phe Ile 490 Ser Leu 300 Tyr Asp Ile Ser Pro 380 Arg His Ser Met Lys 460 Lys Asn 285 Val Giu Leu Gly Leu 365 Asp Al a Val Ser Arg 445 Cys Asp PCT/CA98/00343 Arg Arg Ser Arg Thr His Gin Glu Asp 320 Met Ala Ile 335 Pro Leu Val 350 Ile Gly Arg Phe Lys Leu Vai Ile Asp 400 Giu Ala Ser 415 Ser Leu Trp 430 Arg Gly Asp Asn Ser Ala Gly Pro Trp Pro Glu Val 480 Val Lys 495 INFORMATION FOR SEQ ID NO: SEQUENCE CHARACTERISTICS: LENGTH: 18 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: iinear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: GTGCTTTATA TATGTTTG 18 WO 98/46766 PCT/CA98/00343 INFORMATION FOR SEQ ID NO: 6: SEQUENCE CHARACTERISTICS: LENGTH: 18 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: CGTCGGAGAG TTTTAATG 18 INFORMATION FOR SEQ ID NO: 7: SEQUENCE CHARACTERISTICS: LENGTH: 18 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: CTTCGATATC GGTTGTTG 18 INFORMATION FOR SEQ ID NO: 8: SEQUENCE CHARACTERISTICS: LENGTH: 24 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: AAATACCCTA ATCACATTTT GTAA 24 INFORMATION FOR SEQ ID NO: 9: SEQUENCE CHARACTERISTICS: LENGTH: 24 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: TTTAAACAGA GAGAAATATT CTTA 24 INFORMATION FOR SEQ ID NO: SEQUENCE CHARACTERISTICS: LENGTH: 24 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: ATGCCTCAGG CACCGATGCC AGAG 24 INFORMATION FOR SEQ ID NO: 11: SEQUENCE CHARACTERISTICS: LENGTH: 24 WO 98/46766 WO 9846766PCT/CA98/00343 TYPE: nucleic acid STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: CAGCACGAGA AACTAAAA)A TACC 24 INFORMATION FOR SEQ ID NO: SEQUENCE CHARACTERISTICS: .0 LENGTH: 1951 TYPE: nucleic acid 11: 12: STRANDEDNESS: single TOPOLOGY: linear (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: TAGTGCTTTA TATATGTTTG ATACTTCTGT TTGGCAATAT AGAAAAGATA TGGACTTCAT TTGAGGTTTT TGGTGGATTG TGAAATCATG GGATCTCAAG ATTTGTCTGC ATTCAGTTTC ATCGTAACTA CTGTTTGATT TTCCCTCATG CTTGCAGTTT CTCAAGATTT GTCTTCTTGC ACTTTCCAAG TCAAACATAA GATTGATATT CCCTCGTGTA TTACCCTCTT TCAAATGACA CAAGTAGAGG AATTTCATAG TGAATTCAAA AGATTAACTG TCGTATTTTG ATAACATTTA GTTATTCCTT TTCTTTTTTT CAGTTTTTTT TTAATACATT TAGTGTTGGT TTGGTTCAAT ATGTTACTTC TTTTTTTGGA AATAAATTAT TCATTCTTTC GGAATTGTTC ATGCTTTTTT GATACAATAG TATACCATTT CATAGACCAG TTATTACATG AATCGCCAAA ACAACACTAA TCAGTATATT TTGGTATAGT CTCCAACATA CAATCATAAA AATTTAAAAT CTATATTTGA CATTTCAAAG TTTAACAACA TAATTACCTA AATTTTAAGT CAAATGTGAA TTATATTTTA TCGGTTGTTG ACGATTAACC ATGCAAAAAA GAAACATTAA AAATAACAAA ACATGTAACT CTTGTAGATA TACATGTATC CCCGAATATA TATGTATACC TATAATTTCT CTGATTTTCA CACGTACATG GGTGATAGGT CCAAACTCAC AAGTAAAAGT TGAATTCGTC TTTTTGGGTA TAAACGTACA TTTAATTTAC
CAATCATAGT
TGTCTATATG
CAAGTCAAAC
TCATGGATAT
AGTAACTACT
CAATTGGGCC
TATTCCACCG
TCTTCTGCAA
GAAATATTAT
TACTATAAAA
CAAAAGATAC
AATCAGAAAA
ACCTCTGTGA
TAGTTCTAAA
CTCTTCGATA
TTGCGAATGT
GACATTTAAA
CGCTACCTGC
TTACGTACAG
ACGTAAGAAA
100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 1000 WO 98/46766 WO 9846766PCT/CA98/00343 GGATTACCAA TTCTTTCATT GAGAAACATA TAGAGTTTTG TGCAATATTT AGGGATGGAC GTGCTATATG AATCGTTTCG TATATAAACA AATTCCAACA TGCGGGTTAT TTAATTACCT TAACGGGTAA ACCAZ4ACCAA AATTTGACAC AAACTAATGA GACCATTTTT GTTTTTGAGA GTACTAAGTA TTTATATCCA CCGGGCAACG TGAACGTGAT GATCTCGATA AAACTAAAAG TCTCTTACAA CGACAATTTT AACAGTCCTT ATCATTTGCT GGAAATCGAA GAGAAGTATT AAATGACAAA AAATTAAATA GGTCTCTTCA TTAACTCCTC ATCCTTCACC TTCCCTCTCT
TATGGTACCA
ATATGTTTTC
ACAAGGTAAT
CATGGGTACT
AAATCAAGTT
ATCATATTAC
ACCGGATATT
ATATCTAAAT
ACCATAATAT
CCTTTAGTCA
CATCAAATCA
CTGACACGTC
GAGAAATATG
CCCATCACTT
ACAAAAACAT
GAGAGCAAAG
TCATCTACCC
CATCTTCATT
GACAGAGTTA
TTGGATAAAT
ATATGCCTTT
AAAATTATTT
TTTGCTAAAA
TTGTAATATC
GAACTATTAA
TATGTTACTG
AAATTACAGG
CAGTACCAAT
AAGTAGTTAC
TTGCTGTTTC
AAATTTTTAT
GCTTTTGTCT
TTTTCTCGTC
CAAGAGCGTT
CTTCCTCTGT
AACTCATCTT
AGGCAAACAA
ATTAAATTGA
TAAGGTATAT
GTCCTTACTT
CTAGTTTATT
ATTCGTATGT
AAATCTTGTA
CTATGATAAC
TACGTGACAA
ATTGCGCCTA
CAAACGCTTT
TTAATTTATT
ATCGAAAGGG
AGTTACAACT
ATTTATAAAA
GGGTGACGTT
TCGCCTTTAT
CAAAAATACC
1050 1100 1150 1200 1250 1300 1350 1400 1450 1500 1550 16 QO 1650 1700 1750 1800 1850 1900 1950 1951 CTAATCACAT TTTGTAACAA TAATACAATT ATACATTAAA ACTCTCCGAC
G

Claims (42)

1. A transgenic plant comprising a recombinant expression cassette comprising a promoter sequence operably linked to a nucleic acid sequence that encodes a protein having very long chain fatty acid elongase activity selected from the group consisting of: nucleic acids comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 3; nucleic acids possessing at least sequence identity with the sequence set forth in Seq. I.D. No. 3; and nucleic acids that hybridize under conditions of at least 75% stringency with the sequence set forth in Seq. I.D. No. 3. 2_ A transgenic plant according to claim 1, wherein the plant has a modified phenotype compared to a non- transgenic plant of the same species. 20 3. A transgenic plant according to claim 2 wherein the modified phenotype is a modified very long chain fatty Sacid composition.
4. A transgenic plant according to claim 2 wherein the modified phenotype is a modified epicuticular wax layer. A transgenic plant according to claim 2 wherein the modified phenotype is modified seed oil composition.
6. A transgenic plant according to claim 2 wherein the modified phenotype is conditional male sterility.
7. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in SSeq. I.D. No. 1. \\mlbfiles\hm~.e pnAnaAcrpGv.i\o9.so.v 11(6/fl 11/06 2002 12:23 FAX 61 3 92438333 GRIFFITH HACK 0007 50
8. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3.
9. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 80% stringency with the sequence 10 set forth in Seq. I.D. No. 3.
10- A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
11. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. 20 No. 3. 12 A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 75% stringency with the sequence set forth in Seq. I.D. No. 3.
13. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
14. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. NO. 1. A transgenic plant according to any one of claims \\melbfile3\hmeS\Ananas\xccp\spcci\70191.0.dc 11/D6/02 11/06 2002 12:24 FAX 61 3 92438333 GRIFFITH HACK IM008 51 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
16. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
17. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence comprises at least 50 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
18. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3. 20 19. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3.
20. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3. 21 A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 85% stringency with the sequence set forth in Seq. I.D. No. 3. 22 A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 90% stringency with the sequence \\mlb_file\hoeS\Ananoc\eep\speci\70191.9B.doc 11/D6/02 11/06 2002 12:24 FAX 61 3 92438333 GRIFFITH HACK 1009 52 set forth in Seq. I.D. No. 3. 23 A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 95% stringency with the sequence set forth in Seq. I.D. No. 3.
24. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid molecule comprises the 10 sequence set forth in Seq. I.D. No. 3.
25. A transgenic plant comprising a recombinant expression cassette comprising a promoter sequence operably linked to a nucleic acid sequence, wherein expression of the nucleic acid sequence modifies very long chain fatty acid elongase activity in the transgenic plant, wherein the nucleic acid sequence is selected from the group consisting of: nucleic acids comprising at least 20 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1; nucleic acids possessing at least sequence identity with the sequence set forth in Seq. I.D. No. 3; and nucleic acids that hybridize under conditions of at least 75% stringency with the sequence set forth in Seq. I.D. No. 3.
26. A transgenic plant according to claim 25, wherein the plant has a modified phenotype compared to a non- transgenic plant of the same species.
27. A transgenic plant according to claim 26 wherein the modified phenotype is a modified very long chain fatty acid composition.
28. A transgenic plant according to claim 26 wherein \\tmlb_filec\hom6c\Ananoeg\eep\Speci\ 7 0 1 9 1 .9 .doc lX/06/02 11/06 2002 12:24 FAX 61 3 92438333 GRIFFITH HACK 0010 53 the modified phenotype is a modified epicuticular wax layer.
29. A transgenic plant according to claim 26 wherein the modified phenotype is modified seed oil composition. A transgenic plant according to claim 26 wherein the modified phenotype is conditional male sterility.
31. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least 30 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
32. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence possess at least 80% sequence identity with the sequence set forth in Seq. SI.D. No. 3. 20 33. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence hybridizes under conditions of at least 80% stringency with the sequence set forth in Seq. I.D. No. 3.
34. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
35. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3.
36- A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence hybridizes under conditions of at least 75% stringency with the sequence \\imelb_eiler\hais\A nanoe\Keep\Speci\70191. S.4 oc 11/06/02 11/06 2002 12:24 FAX 61 3 92438333 GRIFFITH HACK l011 54 set forth in Seq. I.D. No. 3.
37. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least 20 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
38. A transgenic plant according to any one of claims *e**9 25-30 wherein the nucleic acid sequence comprises at least 25 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
39. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least 35 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
40. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least 20 40 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
41. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence comprises at least 50 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
42. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence possess at least 60% sequence identity with the sequence set forth in Seq. I.D. No. 3.
43. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence possess at least 90% sequence identity with the sequence set forth in Seq. I.D. No. 3. \\uelb-fiec\howes\Ananog\grp\sl\eei\70olha1g .dac 11/06/02 11/06 2002 12:25 FAX 61 3 92438333 GRIFFITH HACK 1@012 55
44. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence possess at least sequence identity with the sequence set forth in Seq. I.D. No. 3. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence hybridizes under conditions of at least 85% stringency with the sequence set forth in Seq. I.D. No. 3.
46. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence hybridizes under '9 conditions of at least 90% stringency with the sequence set forth in Seq. I.D. No. 3.
47. A transgenic plant according to any one of claims 25-30 wherein the nucleic acid sequence hybridizes under conditions of at least 95% stringency with the sequence set forth in Seq. I.D. No. 3. S*
48. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid molecule comprises the sequence set forth in Seq. I.D. No. 3.
49. A transgenic plant substantially as hereinbefore described and with reference to the examples and drawings. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 5 1 A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid \\Aelb_£ile\ho Uto\Aanc\n.carp\speci\7a0191.A.doc 11/06/02 11/06 2002 12:25 FAX 61 3 92438333 GRIFFITH HACK @o013 56 preparation with a DNA molecule comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
52. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least
53. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least 30 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 0
54. A method of isolating a nucleic acid molecule 20 encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least 0consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
55. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least 4035 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1.
556. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least consecutive nucleotides of the sequence set forth in Seq. consecutive nucleotides of the sequence set forth in Seq. \\melU_ile\homs\Wnas\ os\Keep\pcci\c191,39 .doc 1/G6/0IQ 11/06 2002 12:25 FAX 61 3 92438333GRFIhHC J01 GRIFFITH HACK I [a 014 57 I. D. No. 1. 57. A method according to any one of claims 50 to 56 wherein the stringency of hybridization is at least 58. A method according to any one of claims 50 to 56 wherein the stringency of hybridization is at least *59. A method according to any one of claims 50 to 56 wherein the stringency of hybridization is at least :5c60. A method according to any one of claims 50 to 56 *wherein the stringency of hybridization is at least 61. A method according to any one of claims 50 to 56 wherein the stringency of hybridization is at least be62. An isolated nucleic acid molecule isolated according to the method of any one of claims 57 through G1- 0*S@0 *a 63. A recombinant nucleic acid molecule comprising a Gc666. promoter sequence operably linked to a nucleic acid sequence, wherein the promoter sequence comprises a CUTi promoter selected from the gr oup consisting of: nucleic acids comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 12; nucleic acids possessing at least sequence identity with the sequence set forth in Seq. I.D. No. 12; and nucleic acids that hybridize under conditions of at least 75% stringency with the sequence set forth in Seq. V-D. No. 12. 64, A recombinant nucleic acid molecule according to claim G3 wherein the promoter sequence comprises at least \-lalb_9 Xes\omne$An&no\Xeep\speci\o02g9 1. 9 a.O4c 1,1/06/02 11/06 2002 12:25 FAX 61 3 92438333 GRIFFITH HACK 58 consecutive nucleotides of the sequence shown in Seq. I.D. No. 12. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises at least 100 consecutive nucleotides of the sequence shown in Seq. I.D. No. 12. 66. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids possessing at least 70% sequence identity with the sequence set forth in Seq. I.D. No. 12. 67. A recombinant nucleic acid molecule according to 1 5 claim 63 wherein the promoter sequence comprises nucleic acids possessing at least 75% sequence identity with the sequence set forth in Seq. I.D. No. 12. 68. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids possessing at least 85% sequence identity with the sequence set forth in Seq. I.D. No. 12. S 69. A recombinant nucleic acid molecule according to 25 claim 63 wherein the promoter sequence comprises nucleic acids possessing at least 90% sequence identity with the sequence set forth in Seq. I.D. No. 12. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids possessing at least 95% sequence identity with the sequence set forth in Seq. I.D. No. 12. 71. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids that hybridize under conditions of at least stringency with the sequence set forth in Seq. I.D. No. b_fil\ho \Annoc\cc\eci\70191.9.doc 11/06/02 11/06 2002 12:26 FAX 61 3 92438333 GRIFFITH HACK @~016 59 12. 72. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids that hybridize under conditions of at least stringency with the sequence set forth in Seq- I.D. No. 12. 73. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids that hybridize under conditions of at least stringency with the sequence set forth in Seq. I.D. No. 12. 15 74. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic S. acids that hybridize under conditions of at least stringency with the sequence set forth in Seq. I.D. No. 12. A recombinant nucleic acid molecule according to claim 63 wherein the promoter sequence comprises nucleic acids that hybridize under conditions of at least stringency with the sequence set forth in Seq. I.D. No. 25 12. 76. An isolated nucleic acid molecule that encodes a protein having very long chain fatty acid elongase activity, wherein the nucleic acid molecule is selected from the group consisting of: nucleic acids comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 3; nucleic acids possessing at least sequence identity with the sequence set forth in Seq. I.D. No. 3; and nucleic acids that hybridize under conditions \\meulb_tile \homee \An sano\xep\speei\7 0 1 9 1 a. OC 11/06/02 11/06 2002 12:26 FAX 61 3 92438333 GRIFFITH HACK Q 017 60 of at least 75% stringency with the sequence set forth in Seq. I.D. No. 3. 77. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence comprises at least 15 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 3. 78. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence comprises at least 30 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 3. 79. An isolated nucleic acid molecule according to 15 claim 76 wherein the nucleic acid sequence comprises at least 20 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence comprises at least 25 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 81. An isolated nucleic acid molecule according to 25 claim 76 wherein the nucleic acid sequence comprises at least 35 consecutive nucleotides of the sequence set forth 82. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence comprises at least 40 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 83. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence comprises at least 50 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. \\melfj ,ea\ho™g \Anas eeq\spiei\7o 9i.sa.dc 11/06/02 11/06 2002 12:26 FAX 61 3 92438333 GRIFFITH HACK 0018 61 84. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence possess at least 70% sequence identity with the sequence set forth in Seq. I.D. No. 3. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence possess at least 80% sequence identity with the sequence set forth in Seq. l.D. No. 3. 86. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence possess at least 90% sequence identity with the sequence set forth in Seq. l.D. No. 3. 87. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence possess at least 95% sequence identity with the sequence set forth in Seq. I.D. No. 3. 88. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence hybridizes under conditions of at least 75% stringency with the sequence set forth in Seq. I.D. No. 3. 89. A transgenic plant according to any one of claims 1-6 wherein the nucleic acid sequence hybridizes under conditions of at least 80% stringency with the sequence set forth in Seq. I.D. No. 3. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence hybridizes under conditions of at least 85% stringency with the sequence set forth in Seq. I.D. No. 3. 91. An isolated nucleic acid molecule according to \\melbtilc aium\KMiee\Speci\701 il.98.doc 11/D6/a 11/06 2002 12:26 FAX 61 3 92438333 GRIFFITH HACK 1019 62 claim 76 wherein the nucleic acid sequence hybridizes under conditions of at least 90% stringency with the sequence set forth in Seq. I.D. No. 3. 92. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid sequence hybridizes under conditions of at least 95% stringency with the sequence set forth in Seq. I.D. No. 3. 93. An isolated nucleic acid molecule according to claim 76 wherein the nucleic acid molecule comprises the sequence set forth in Seq. I.D. No. 3. 94. A purified protein encoded by a nucleic acid 15 molecule according to any one of claims 76 to 93. A recombinant vector comprising a nucleic acid molecule according to any one of claims 76 to 93. 96. A recombinant vector according to claim 96 wherein the nucleic acid molecule is in reverse orientation relative to an adjacent promoter sequence of the vector. 25 97. A transgenic plant comprising a recombinant vector according to claim 95 or 96. 98. A method of producing a plant with a modified very long chain fatty acid composition relative to a non- transgenic plant of the same species, comprising introducing into the plant a recombinant vector according to claim 95 or 96. 99. A transgenic plant produced by a method according to claim 98. 100. A transgenic plant produced by sexual or asexual \\mb_iles\hs*e4\Ananna\Keep\spiec\ 7 019 1 pS.g.doc 13106/02 11/06 2002 12:27 FAX 61 3 92438333 GRIFFITH RACK I020 63 propagation of a transgenic plant according to claim 99, or of the progeny of said plant. 101. An isolated nucleic acid molecule that encodes a protein having an amino acid sequence as shown in Seq. I.D. No. 4. 102. A method of isolating a nucleic acid molecule encoding a plant very long chain fatty acid elongation enzyme, the method comprising hybridizing a nucleic acid preparation with a DNA molecule comprising at least consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 15 103. A method according to claim 102 wherein the DNA p. :molecule comprises at least 20 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 104. A method according to claim 102 wherein the DNA molecule comprises at least 30 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 3. 105. A method according to claim 102 wherein the DNA S" molecule comprises at least 25 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 1 06. A method according to claim 102 wherein the DNA molecule comprises at least 35 consecutive nucleotides of o•*oe the sequence set forth in Seq. I.D. No. 1'. 107. A method according to claim 102 wherein the DNA molecule comprises at least 40 consecutive nucleotides of the sequence set forth in Seq. I.D. No. 1. 108. A method according to claim 102 wherein the DNA molecule comprises at least 50 consecutive nucleotides of the sequence set forth in Seq. I.D. No- i. \\mlb_rlIe \hoas\Anno\kp\p c \Sa .l .91.9.dOC 11/06/02 11/06 2002 12:27 FAX 61 3 92438333 GRIFFITH HACK 0021 64 109_ An isolated nucleic acid molecule isolated according to a method according to any one of claims 102 to 108. 110. A purified peptide having an amino acid sequence that is at least 70% identical to the sequence set forth in Seq. I.D. No. 4. ill. A purified peptide having an amino acid sequence that is at least 75% identical to the sequence set forth in Seq. I.D. No. 4. 112. A purified peptide having an amino acid sequence that is at least 80% identical to the sequence set forth in Seq. I.D. No. 4. 113. A purified peptide having an amino acid sequence that is at least 90% identical to the sequence set forth in Seq I.D. No. 4. 114. A purified peptide having an amino acid sequence that is at least 95% identical to the sequence set forth in Seq. I.D. No. 4. •Dated this lth day of June 2002 UNIVERSITY OF BRITISH COLUMBIA By their Patent Attorneys SGRIFFITH HACK Fellows Institute of Patent and Trade Mark Attorneys of Australia \\mwltfile\homess\Ananots\Keep\spaci\7 0 1 91, .9e. oc j/qG/03
AU70191/98A 1997-04-14 1998-04-14 Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis Ceased AU750707C (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US4383197P 1997-04-14 1997-04-14
US60/043831 1997-04-14
US95894798A 1998-04-10 1998-04-10
US09/958947 1998-04-10
PCT/CA1998/000343 WO1998046766A1 (en) 1997-04-14 1998-04-14 Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis

Publications (3)

Publication Number Publication Date
AU7019198A AU7019198A (en) 1998-11-11
AU750707B2 true AU750707B2 (en) 2002-07-25
AU750707C AU750707C (en) 2003-05-15

Family

ID=26720861

Family Applications (1)

Application Number Title Priority Date Filing Date
AU70191/98A Ceased AU750707C (en) 1997-04-14 1998-04-14 Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis

Country Status (3)

Country Link
AU (1) AU750707C (en)
CA (1) CA2285970A1 (en)
WO (1) WO1998046766A1 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AR013633A1 (en) 1997-04-11 2001-01-10 Calgene Llc METHOD FOR THE ALTERATION OF THE COMPOSITION OF AVERAGE CHAIN FAT ACIDS IN VEGETABLE SEEDS THAT EXPRESS A THIOESTERASE THAT PREFERS HETEROLOGICAL VEGETABLE AVERAGE CHAIN.
WO2001011061A2 (en) * 1999-08-04 2001-02-15 The University Of British Columbia Regulation of embryonic transcription in plants
US6784342B1 (en) 1999-08-04 2004-08-31 The University Of British Columbia Regulation of embryonic transcription in plants
CA2409885A1 (en) * 2000-05-24 2001-11-29 The University Of British Columbia Nucleic acid encoding a plant very long chain fatty acid biosynthetic enzyme
US7253337B2 (en) * 2000-05-24 2007-08-07 The University Of British Columbia Gene regulatory region that promotes early seed-specific transcription
BR0111081A (en) * 2000-05-24 2003-04-08 Univ British Columbia Gene regulatory region that specifically promotes root transcription and uses
DE60122719T2 (en) 2000-06-08 2008-04-17 Miami University, Oxford FATTY ACID ELONGASE 3-ketoacyl-CoA synthase polypeptides
DE10034804A1 (en) 2000-07-18 2002-01-31 Bayer Ag Use of VLCFAE to identify herbicidally active compounds
US6706950B2 (en) 2000-07-25 2004-03-16 Calgene Llc Nucleic acid sequences encoding β-ketoacyl-ACP synthase and uses thereof
EP1699927A4 (en) * 2003-11-25 2009-05-06 Ca Nat Research Council Fatty acid elongase (fae) genes and their utility in increasing erucic acid and other very long-chain fatty acid proportions in seed oil
AR051846A1 (en) * 2004-12-20 2007-02-14 Basf Plant Science Gmbh NUCLEIC ACID MOLECULES CODING KCS TYPE POLYPEPTIDES AND METHODS OF USE
CN113583990B (en) * 2021-06-04 2023-06-23 西南大学 Rice full-fertility half-dwarf phenotype regulatory gene SD38 and application thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5679881A (en) * 1991-11-20 1997-10-21 Calgene, Inc. Nucleic acid sequences encoding a plant cytoplasmic protein involved in fatty acyl-CoA metabolism
DE4433307A1 (en) * 1994-09-19 1996-03-21 Norddeutsche Pflanzenzucht Han An isolated nucleic acid fragment and products derived from it
AU703957B2 (en) * 1994-10-26 1999-04-01 Cargill Incorporated FAE1 genes and their uses

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GENBANK ACCESSION NO T22193 *
GENBANK ACCESSION NO T76616 *

Also Published As

Publication number Publication date
CA2285970A1 (en) 1998-10-22
AU750707C (en) 2003-05-15
AU7019198A (en) 1998-11-11
WO1998046766A1 (en) 1998-10-22

Similar Documents

Publication Publication Date Title
US6274790B1 (en) Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis
Millar et al. Very‐long‐chain fatty acid biosynthesis is controlled through the expression and specificity of the condensing enzyme
AU640644B2 (en) Genetic engineering of novel plant phenotypes
US5231020A (en) Genetic engineering of novel plant phenotypes
Rossak et al. Expression of the FAE1 gene and FAE1 promoter activity in developing seeds of Arabidopsis thaliana
EP0832262B1 (en) Modification of plant lipids and seed oils utilizing yeast slc genes
EP0788542B1 (en) Fae1 genes and their uses
EP2018397A2 (en) MOLECULAR CLONING AND SEQUENCING OF ACETYL CoA CARBOXYLASE (ACCase) GENE FROM JATROPHA CURCAS
AU750707B2 (en) Nucleic acids encoding a plant enzyme involved in very long chain fatty acid synthesis
CA2547320C (en) Fatty acid elongase (fae) genes and their utility in increasing erucic acid and other very long-chain fatty acid proportions in seed oil
CA2287914C (en) Raffinose synthase gene, method for producing raffinose, and transgenic plant
US5859342A (en) Antisense nucleotide sequences affecting fatty acid catabolism in plants
US6437219B1 (en) Nucleic acids encoding sucrose-binding proteins
US7932433B2 (en) Plant cyclopropane fatty acid synthase genes, proteins, and uses thereof
HUT71783A (en) Crucifer acc-synthase, gene coding it, the promoter of the gene, and recombinant methods for production of acc-synthase and transgenic plants expressing acc-synthase
US6600091B1 (en) Enzymes responsible for the metabolism of zeatin
US7148405B2 (en) Enzymes responsible for the metabolism of cis-zeatin
WO2001007586A2 (en) A plant long chain fatty acid biosynthetic enzyme
WO2000001828A1 (en) A modified arabidopsis thaliana cac1, cac2 or cac3 promoter and an arabidopsis thaliana cac1, cac2 or cac3 suppressor element and methods of use thereof
EP1389904A2 (en) Dwf12 and mutants thereof
WO2012030628A1 (en) Wax esters from crambe

Legal Events

Date Code Title Description
DA2 Applications for amendment section 104

Free format text: THE NATURE OF THE PROPOSED AMENDMENT IS AS SHOWN IN THE STATEMENT(S) FILED 20020827

FGA Letters patent sealed or granted (standard patent)
DA3 Amendments made section 104

Free format text: THE NATURE OF THE AMENDMENT IS AS WAS NOTIFIED IN THE OFFICIAL JOURNAL DATED 20021010