CN116529378A - Plant regulatory element and use thereof for automatic excision - Google Patents
Plant regulatory element and use thereof for automatic excision Download PDFInfo
- Publication number
- CN116529378A CN116529378A CN202180071059.8A CN202180071059A CN116529378A CN 116529378 A CN116529378 A CN 116529378A CN 202180071059 A CN202180071059 A CN 202180071059A CN 116529378 A CN116529378 A CN 116529378A
- Authority
- CN
- China
- Prior art keywords
- seq
- site
- plant
- recombinant dna
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000001105 regulatory effect Effects 0.000 title claims description 121
- 230000014509 gene expression Effects 0.000 claims abstract description 279
- 108020004511 Recombinant DNA Proteins 0.000 claims abstract description 120
- 108700019146 Transgenes Proteins 0.000 claims abstract description 108
- 102000053602 DNA Human genes 0.000 claims abstract description 86
- 230000009261 transgenic effect Effects 0.000 claims abstract description 85
- 238000000034 method Methods 0.000 claims abstract description 59
- 230000006798 recombination Effects 0.000 claims abstract description 42
- 238000005215 recombination Methods 0.000 claims abstract description 42
- 239000013598 vector Substances 0.000 claims abstract description 37
- 241000196324 Embryophyta Species 0.000 claims description 311
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 184
- 108020004414 DNA Proteins 0.000 claims description 154
- 210000004027 cell Anatomy 0.000 claims description 149
- 108090000623 proteins and genes Proteins 0.000 claims description 127
- 230000009466 transformation Effects 0.000 claims description 92
- 239000003550 marker Substances 0.000 claims description 76
- 108010051219 Cre recombinase Proteins 0.000 claims description 74
- 244000068988 Glycine max Species 0.000 claims description 71
- 108010052160 Site-specific recombinase Proteins 0.000 claims description 68
- 101710163270 Nuclease Proteins 0.000 claims description 45
- 108020005004 Guide RNA Proteins 0.000 claims description 41
- 239000012634 fragment Substances 0.000 claims description 33
- 230000001404 mediated effect Effects 0.000 claims description 33
- 240000008042 Zea mays Species 0.000 claims description 31
- 235000010469 Glycine max Nutrition 0.000 claims description 28
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 23
- 230000009286 beneficial effect Effects 0.000 claims description 23
- 238000010362 genome editing Methods 0.000 claims description 23
- 230000010354 integration Effects 0.000 claims description 18
- 241000589158 Agrobacterium Species 0.000 claims description 17
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 17
- 108010042407 Endonucleases Proteins 0.000 claims description 16
- 102000004533 Endonucleases Human genes 0.000 claims description 16
- -1 cas1B Proteins 0.000 claims description 15
- 230000002363 herbicidal effect Effects 0.000 claims description 15
- 239000004009 herbicide Substances 0.000 claims description 15
- 240000000385 Brassica napus var. napus Species 0.000 claims description 14
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims description 14
- 235000009973 maize Nutrition 0.000 claims description 14
- 108091033409 CRISPR Proteins 0.000 claims description 12
- 230000009418 agronomic effect Effects 0.000 claims description 12
- 230000001131 transforming effect Effects 0.000 claims description 12
- 229920000742 Cotton Polymers 0.000 claims description 10
- 230000001580 bacterial effect Effects 0.000 claims description 9
- 239000002245 particle Substances 0.000 claims description 9
- 230000008685 targeting Effects 0.000 claims description 9
- 108010014458 Gin recombinase Proteins 0.000 claims description 7
- 230000003115 biocidal effect Effects 0.000 claims description 7
- 102100025053 Cell division control protein 45 homolog Human genes 0.000 claims description 6
- 101100329224 Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003) cpf1 gene Proteins 0.000 claims description 6
- 101000934421 Homo sapiens Cell division control protein 45 homolog Proteins 0.000 claims description 6
- 101150059443 cas12a gene Proteins 0.000 claims description 6
- 108091070501 miRNA Proteins 0.000 claims description 6
- 108010046276 FLP recombinase Proteins 0.000 claims description 5
- 241000238631 Hexapoda Species 0.000 claims description 5
- 108020004459 Small interfering RNA Proteins 0.000 claims description 5
- 108010087512 R recombinase Proteins 0.000 claims description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 4
- 238000003306 harvesting Methods 0.000 claims description 4
- 230000001172 regenerating effect Effects 0.000 claims description 4
- 101150017047 CSM3 gene Proteins 0.000 claims description 3
- 101150069031 CSN2 gene Proteins 0.000 claims description 3
- 101150078885 CSY3 gene Proteins 0.000 claims description 3
- 208000035240 Disease Resistance Diseases 0.000 claims description 3
- 101100275895 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) csnB gene Proteins 0.000 claims description 3
- 101100007788 Escherichia coli (strain K12) casA gene Proteins 0.000 claims description 3
- 101100007792 Escherichia coli (strain K12) casB gene Proteins 0.000 claims description 3
- 101100219622 Escherichia coli (strain K12) casC gene Proteins 0.000 claims description 3
- 101100382541 Escherichia coli (strain K12) casD gene Proteins 0.000 claims description 3
- 101100326871 Escherichia coli (strain K12) ygbF gene Proteins 0.000 claims description 3
- 101100005249 Escherichia coli (strain K12) ygcB gene Proteins 0.000 claims description 3
- 101100387128 Myxococcus xanthus (strain DK1622) devR gene Proteins 0.000 claims description 3
- 101100387131 Myxococcus xanthus (strain DK1622) devS gene Proteins 0.000 claims description 3
- 101100059152 Thermococcus onnurineus (strain NA1) csm1 gene Proteins 0.000 claims description 3
- 101100273269 Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) cse3 gene Proteins 0.000 claims description 3
- 101150090505 cas10 gene Proteins 0.000 claims description 3
- 101150117416 cas2 gene Proteins 0.000 claims description 3
- 101150055191 cas3 gene Proteins 0.000 claims description 3
- 101150111685 cas4 gene Proteins 0.000 claims description 3
- 101150049463 cas5 gene Proteins 0.000 claims description 3
- 101150106467 cas6 gene Proteins 0.000 claims description 3
- 101150044165 cas7 gene Proteins 0.000 claims description 3
- 101150038500 cas9 gene Proteins 0.000 claims description 3
- 101150100788 cmr3 gene Proteins 0.000 claims description 3
- 101150040342 cmr4 gene Proteins 0.000 claims description 3
- 101150095330 cmr5 gene Proteins 0.000 claims description 3
- 101150034961 cmr6 gene Proteins 0.000 claims description 3
- 101150085344 csa5 gene Proteins 0.000 claims description 3
- 101150089829 csc-1 gene Proteins 0.000 claims description 3
- 101150088639 csm4 gene Proteins 0.000 claims description 3
- 101150022488 csm5 gene Proteins 0.000 claims description 3
- 101150064365 csm6 gene Proteins 0.000 claims description 3
- 101150056210 csx1 gene Proteins 0.000 claims description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 2
- 101100438439 Escherichia coli (strain K12) ygbT gene Proteins 0.000 claims description 2
- 241000589180 Rhizobium Species 0.000 claims description 2
- 101100222745 Schizosaccharomyces pombe (strain 972 / ATCC 24843) met17 gene Proteins 0.000 claims description 2
- 101100329497 Thermoproteus tenax (strain ATCC 35583 / DSM 2078 / JCM 9277 / NBRC 100435 / Kra 1) cas2 gene Proteins 0.000 claims description 2
- 101150000705 cas1 gene Proteins 0.000 claims description 2
- 244000299507 Gossypium hirsutum Species 0.000 claims 3
- 240000006064 Urena lobata Species 0.000 claims 1
- 239000002679 microRNA Substances 0.000 claims 1
- 108010091086 Recombinases Proteins 0.000 abstract description 34
- 102000018120 Recombinases Human genes 0.000 abstract description 30
- 108091026890 Coding region Proteins 0.000 description 45
- 101000650775 Boana raniceps Raniseptin-1 Proteins 0.000 description 38
- 108020005345 3' Untranslated Regions Proteins 0.000 description 37
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 30
- 239000002773 nucleotide Substances 0.000 description 29
- 125000003729 nucleotide group Chemical group 0.000 description 29
- 210000001519 tissue Anatomy 0.000 description 28
- 102000004169 proteins and genes Human genes 0.000 description 27
- 230000004568 DNA-binding Effects 0.000 description 25
- 210000004602 germ cell Anatomy 0.000 description 23
- 108091092195 Intron Proteins 0.000 description 22
- 108091036066 Three prime untranslated region Proteins 0.000 description 22
- 101150067314 aadA gene Proteins 0.000 description 22
- 238000003776 cleavage reaction Methods 0.000 description 22
- 230000000694 effects Effects 0.000 description 22
- 230000007017 scission Effects 0.000 description 22
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 20
- 238000013518 transcription Methods 0.000 description 20
- 230000035897 transcription Effects 0.000 description 20
- 239000011701 zinc Substances 0.000 description 20
- 229910052725 zinc Inorganic materials 0.000 description 20
- 238000003556 assay Methods 0.000 description 18
- 108020004999 messenger RNA Proteins 0.000 description 18
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 15
- 230000001976 improved effect Effects 0.000 description 15
- 240000002024 Gossypium herbaceum Species 0.000 description 13
- 235000004341 Gossypium herbaceum Nutrition 0.000 description 13
- 244000061456 Solanum tuberosum Species 0.000 description 13
- 235000002595 Solanum tuberosum Nutrition 0.000 description 13
- 229960000268 spectinomycin Drugs 0.000 description 13
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 13
- 238000013519 translation Methods 0.000 description 13
- 230000006870 function Effects 0.000 description 12
- 238000003780 insertion Methods 0.000 description 12
- 230000037431 insertion Effects 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- 101000583839 Homo sapiens Muscleblind-like protein 1 Proteins 0.000 description 11
- 102100030965 Muscleblind-like protein 1 Human genes 0.000 description 11
- 238000010459 TALEN Methods 0.000 description 11
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 11
- 230000027455 binding Effects 0.000 description 11
- 102000053187 Glucuronidase Human genes 0.000 description 10
- 108010060309 Glucuronidase Proteins 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 238000004519 manufacturing process Methods 0.000 description 10
- 238000010354 CRISPR gene editing Methods 0.000 description 9
- 241000209094 Oryza Species 0.000 description 9
- 235000007164 Oryza sativa Nutrition 0.000 description 9
- 150000001413 amino acids Chemical class 0.000 description 9
- 210000000349 chromosome Anatomy 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- 102000040430 polynucleotide Human genes 0.000 description 9
- 108091033319 polynucleotide Proteins 0.000 description 9
- 239000002157 polynucleotide Substances 0.000 description 9
- 235000009566 rice Nutrition 0.000 description 9
- 241000894007 species Species 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 8
- 239000003623 enhancer Substances 0.000 description 8
- 241000219194 Arabidopsis Species 0.000 description 7
- 241000219146 Gossypium Species 0.000 description 7
- 101150086211 OLR1 gene Proteins 0.000 description 7
- 244000038559 crop plants Species 0.000 description 7
- 101150081158 crtB gene Proteins 0.000 description 7
- 150000007523 nucleic acids Chemical class 0.000 description 7
- 108090000765 processed proteins & peptides Proteins 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000000692 anti-sense effect Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 210000001161 mammalian embryo Anatomy 0.000 description 6
- 239000000178 monomer Substances 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 230000005030 transcription termination Effects 0.000 description 6
- 108700028317 Arabidopsis CDC45 Proteins 0.000 description 5
- 108091035707 Consensus sequence Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 239000005562 Glyphosate Substances 0.000 description 5
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 5
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 5
- 239000000470 constituent Substances 0.000 description 5
- 101150036876 cre gene Proteins 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 5
- 229940097068 glyphosate Drugs 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 101150095667 splA gene Proteins 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 4
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 4
- 241000701489 Cauliflower mosaic virus Species 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 102000008579 Transposases Human genes 0.000 description 4
- 108010020764 Transposases Proteins 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 230000010307 cell transformation Effects 0.000 description 4
- 210000003763 chloroplast Anatomy 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 230000005782 double-strand break Effects 0.000 description 4
- 238000010353 genetic engineering Methods 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 210000001938 protoplast Anatomy 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 239000004055 small Interfering RNA Substances 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 3
- 241000219195 Arabidopsis thaliana Species 0.000 description 3
- 102000008682 Argonaute Proteins Human genes 0.000 description 3
- 108010088141 Argonaute Proteins Proteins 0.000 description 3
- 108091079001 CRISPR RNA Proteins 0.000 description 3
- 108010049994 Chloroplast Proteins Proteins 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- 102100034343 Integrase Human genes 0.000 description 3
- 108010061833 Integrases Proteins 0.000 description 3
- 108090000854 Oxidoreductases Proteins 0.000 description 3
- 102000009572 RNA Polymerase II Human genes 0.000 description 3
- 108010009460 RNA Polymerase II Proteins 0.000 description 3
- 102100024544 SURP and G-patch domain-containing protein 1 Human genes 0.000 description 3
- 108020004566 Transfer RNA Proteins 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 235000016709 nutrition Nutrition 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 238000012552 review Methods 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 238000012250 transgenic expression Methods 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- 108020003589 5' Untranslated Regions Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101150005250 CDC45 gene Proteins 0.000 description 2
- 108090000994 Catalytic RNA Proteins 0.000 description 2
- 102000053642 Catalytic RNA Human genes 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 230000007018 DNA scission Effects 0.000 description 2
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 2
- 241000702189 Escherichia virus Mu Species 0.000 description 2
- 102100029880 Glycodelin Human genes 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 238000002944 PCR assay Methods 0.000 description 2
- 108091036407 Polyadenylation Proteins 0.000 description 2
- 102000014450 RNA Polymerase III Human genes 0.000 description 2
- 108010078067 RNA Polymerase III Proteins 0.000 description 2
- 108091030071 RNAI Proteins 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 108020000005 Sucrose phosphorylase Proteins 0.000 description 2
- 108700026226 TATA Box Proteins 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- 239000002551 biofuel Substances 0.000 description 2
- 229920001222 biopolymer Polymers 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 108010031100 chloroplast transit peptides Proteins 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000013065 commercial product Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 235000014113 dietary fatty acids Nutrition 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000006353 environmental stress Effects 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 229930195729 fatty acid Natural products 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 150000004665 fatty acids Chemical class 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 230000008571 general function Effects 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 230000035764 nutrition Effects 0.000 description 2
- 238000003976 plant breeding Methods 0.000 description 2
- 210000002706 plastid Anatomy 0.000 description 2
- 102000028499 poly(A) binding Human genes 0.000 description 2
- 108091023021 poly(A) binding Proteins 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 230000004850 protein–protein interaction Effects 0.000 description 2
- 238000004064 recycling Methods 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 230000014493 regulation of gene expression Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 108020004418 ribosomal RNA Proteins 0.000 description 2
- 108091092562 ribozyme Proteins 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000010153 self-pollination Effects 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 230000003007 single stranded DNA break Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000001629 suppression Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 238000011426 transformation method Methods 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- YLYPIBBGWLKELC-RMKNXTFCSA-N 2-[2-[(e)-2-[4-(dimethylamino)phenyl]ethenyl]-6-methylpyran-4-ylidene]propanedinitrile Chemical compound C1=CC(N(C)C)=CC=C1\C=C\C1=CC(=C(C#N)C#N)C=C(C)O1 YLYPIBBGWLKELC-RMKNXTFCSA-N 0.000 description 1
- 108020005176 AU Rich Elements Proteins 0.000 description 1
- 101100001031 Acetobacter aceti adhA gene Proteins 0.000 description 1
- 101710197633 Actin-1 Proteins 0.000 description 1
- 101710197650 Actin-7 Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102100033647 Activity-regulated cytoskeleton-associated protein Human genes 0.000 description 1
- OIRDTQYFTABQOQ-KQYNXXCUSA-N Adenosine Natural products C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 1
- 101150021974 Adh1 gene Proteins 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- 102100034613 Annexin A2 Human genes 0.000 description 1
- 108090000668 Annexin A2 Proteins 0.000 description 1
- 102100034612 Annexin A4 Human genes 0.000 description 1
- 108090000669 Annexin A4 Proteins 0.000 description 1
- 108700001058 Arabidopsis SWI1 Proteins 0.000 description 1
- BHELIUBJHYAEDK-OAIUPTLZSA-N Aspoxicillin Chemical compound C1([C@H](C(=O)N[C@@H]2C(N3[C@H](C(C)(C)S[C@@H]32)C(O)=O)=O)NC(=O)[C@H](N)CC(=O)NC)=CC=C(O)C=C1 BHELIUBJHYAEDK-OAIUPTLZSA-N 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- 101100454194 Caenorhabditis elegans mei-1 gene Proteins 0.000 description 1
- 101710163595 Chaperone protein DnaK Proteins 0.000 description 1
- 108010007108 Chloroplast Thioredoxins Proteins 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 description 1
- 230000008265 DNA repair mechanism Effects 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- 101500023984 Drosophila melanogaster Synapsin-1 Proteins 0.000 description 1
- 101100162704 Emericella nidulans I-AniI gene Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 101100437498 Escherichia coli (strain K12) uidA gene Proteins 0.000 description 1
- 101000851432 Escherichia coli Transposase for transposon Tn5 Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010074122 Ferredoxins Proteins 0.000 description 1
- 244000228957 Ferula foetida Species 0.000 description 1
- 101150106478 GPS1 gene Proteins 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 241000626621 Geobacillus Species 0.000 description 1
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 description 1
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- QTDMGAWIBXJNRR-UHFFFAOYSA-N Mangostin Natural products CC(=CCc1c(O)cc2Oc3cc(C)c(O)c(CC=C(C)C)c3C(=O)c2c1O)C QTDMGAWIBXJNRR-UHFFFAOYSA-N 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- 101100219625 Mus musculus Casd1 gene Proteins 0.000 description 1
- 101001094044 Mus musculus Solute carrier family 26 member 6 Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 101710173432 Phytoene synthase Proteins 0.000 description 1
- 108020005120 Plant DNA Proteins 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 102100026090 Polyadenylate-binding protein 1 Human genes 0.000 description 1
- 101710103012 Polyadenylate-binding protein, cytoplasmic and nuclear Proteins 0.000 description 1
- 241000205156 Pyrococcus furiosus Species 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 108020004422 Riboswitch Proteins 0.000 description 1
- 101710199588 Ribulose bisphosphate carboxylase small subunit, chloroplastic 3 Proteins 0.000 description 1
- 101100139878 Schizosaccharomyces pombe (strain 972 / ATCC 24843) ran1 gene Proteins 0.000 description 1
- 108091027967 Small hairpin RNA Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- GNRIZKKCNOBBMO-UHFFFAOYSA-N alpha-mangostin Chemical compound OC1=C(CC=C(C)C)C(O)=C2C(=O)C3=C(CC=C(C)C)C(OC)=C(O)C=C3OC2=C1 GNRIZKKCNOBBMO-UHFFFAOYSA-N 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 238000012742 biochemical analysis Methods 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 101150055766 cat gene Proteins 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000027288 circadian rhythm Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 101150088252 csy1 gene Proteins 0.000 description 1
- 101150016576 csy2 gene Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000003436 cytoskeletal effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 235000019621 digestibility Nutrition 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 108010050663 endodeoxyribonuclease CreI Proteins 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000007380 fibre production Methods 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 210000002980 germ line cell Anatomy 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 239000003262 industrial enzyme Substances 0.000 description 1
- 108091006086 inhibitor proteins Proteins 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000013332 literature search Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000021121 meiosis Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 238000009401 outcrossing Methods 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 210000002824 peroxisome Anatomy 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 239000004069 plant analysis Substances 0.000 description 1
- 210000000745 plant chromosome Anatomy 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000001581 pretranslational effect Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000026447 protein localization Effects 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- HBMJWWWQQXIZIP-UHFFFAOYSA-N silicon carbide Chemical compound [Si+]#[C-] HBMJWWWQQXIZIP-UHFFFAOYSA-N 0.000 description 1
- 229910010271 silicon carbide Inorganic materials 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 108091006107 transcriptional repressors Proteins 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8237—Externally regulated expression systems
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/823—Reproductive tissue-specific promoters
- C12N15/8234—Seed-specific, e.g. embryo, endosperm
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Developmental Biology & Embryology (AREA)
- Pregnancy & Childbirth (AREA)
- Reproductive Health (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
Recombinant DNA molecules and constructs for modulating gene expression in plants are provided. By expressing the recombinase encoded by the recombinant DNA molecule or construct, one or more expression cassettes of the recombinant DNA molecule or construct can be excised from the transformed transgenic plant by means of the presence of flanking site-specific recombination sites in the recombinant DNA molecule or construct. Such expression cassettes can be removed from plants transformed with the recombinant DNA constructs or vectors using such recombinase systems. The recombinase transgene may be operably linked to a tissue-preferred or tissue-specific promoter for automatic excision in the transformed plant without crossing with a different transgenic line expressing the recombinase. Also provided are methods of causing automatic excision of one or more expression cassettes in transgenic plants and cells containing or transformed with the recombinant DNA molecules or constructs of the disclosure.
Description
Citation of related application
The present application claims the benefit of U.S. provisional application No. 63/093,893, filed on even date 10/20 in 2020, which is incorporated herein by reference in its entirety.
Incorporation of the sequence Listing
The sequence listing contained in the file named "MONS512WO_ST25.Txt" is 111,683 bytes (as in Microsoft @Measured in (a), created at 10.18 of 2021, which is submitted together in an electronic submission and incorporated herein by reference.
Technical Field
The present invention relates to the fields of plant molecular biology and plant genetic engineering. More specifically, the present invention relates to DNA molecules for modulating the expression of a site-specific recombinase gene in plants.
Background
Regulatory elements are genetic elements that regulate gene activity by regulating transcription of an operably linked transcribable DNA sequence. Such elements may include promoters, leader sequences, introns and 3' untranslated regions, and may be used in the fields of plant molecular biology and plant genetic engineering.
The use of transgenic technology provides many beneficial traits for agricultural purposes, but also encounters some challenges. One concern relates to the presence of marker genes conferring antibiotic or herbicide resistance in transgenic crop plants. In addition, there may be other transgene cassettes or DNA sequences that are designed for specific purposes and are present in the initial transformation, but are not required in the final transgene product. In the field of plant biotechnology, it is highly desirable to remove such marker genes, as well as other unwanted expression cassettes and DNA sequences.
Many strategies have been devised for generating marker-free transgenic plants. For example, removal of the marker gene expression cassette may be performed using a double T-DNA transformation system or a site-specific recombinase system.
Double T-DNA transformation systems utilize two comprising two independent T-DNAsMeta-plant transformation vectors (double T-DNA transformation system). One T-DNA comprises a marker gene expression cassette. The other T-DNA comprises an expression cassette for the gene of interest that is expected to remain in the transgenic plant. Plant cell transformation may be performed by agrobacterium-mediated transformation. Each T-DNA may be integrated into a separate chromosome of the genome of the transformed plant cell. After transformation and plant regeneration, R is allowed to stand 0 Plant selfing to produce R 1 And (5) offspring. Selecting R having T-DNA comprising an expression cassette intended for the final transgene product but lacking T-DNA comprising a marker gene expression cassette 1 Progeny plants (see, e.g., komari, T.et al, (1996) Vectors carrying two separate T-DNAs for co-transformation of higher plants mediated by Agrobacterium tumefaciens and segregation of transformants free from selection mark ers, the Plant Journal,10 (1): 165-174). The double T-DNA transformation system has some drawbacks in terms of efficiency. In the double T-DNA transformation system, transformant R 0 Plants may have more than one copy of one or two T-DNA, which may have to be excluded, and the percentage of plants selected with only one copy of each T-DNA may be low.
Another system for removing marker gene expression cassettes from transgenic plants relies on excision by use of a site-specific recombinase. A number of Site-specific recombinases may be used, such as Cre-recombinase, flp-recombinase (Lyznik, L. Et al, (2000) Gene Transfer Mediat ed by Site-Specific Recombination Systems, plant Molecular Biology Manual N1, 1-26), R-recombinase (Machida, C. Et al, (2000) Use of the R-RS Site-Specific Recombination System in Plants, plant Molecular Biol ogy Manual N2, 1-23) or Gin-recombinase (Maeser, S. Et al, (1991) Gin recombinase of phage Mu may catalyze Site-specific recombination in plant protoplasts, mol Gen Genet, 230:170-176). Essentially, in constructs such as T-DNA insertions, marker gene expression cassettes are flanked by site-specific recombinase recognition sequences such that the construct sequence between the site-specific recombinase recognition sequences can be excised by expression of the recombinase. The expression cassette that remains in the transgenic plant after excision is expected to be present in the construct outside the site-specific recombinase recognition sequence of the construct.
Removal of the expression cassette flanked by site-specific recombinase recognition sequences can be accomplished using hybridization strategies or by automated excision. In a crossing strategy, a plant (e.g., R1 progeny) that is preferably homozygous for the presence of the construct is crossed with another transgenic plant line transformed with an expression cassette for expression of the site-specific recombinase. The resulting F is then selected for the presence of the construct 1 The progeny, the construct has had the expression cassette flanked by site-specific recombinase recognition sequences excised. In the case of automatic excision, another expression cassette encoding a site-specific recombinase is present in the construct, wherein the other expression cassette will be excised either between or on both sides of the site-specific recombinase recognition sequence, such that all such expression cassettes are excised by the site-specific recombinase. Promoters typically have a preference or specificity for driving expression in a particular type of cell or tissue. Not all promoters and expression elements are suitable for efficient automatic excision, and extensive experimentation is required to identify the correct promoter to drive expression of the recombinase and additional expression elements to regulate expression of the recombinase, such as introns and 3' UTRs, to achieve the desired excision frequency and result.
There is a need to drive expression elements in crop plants that are effectively self-resected. The present disclosure provides several expression elements identified by years of experimentation that can be used to drive expression of a recombinase and produce efficient automatic excision of markers and/or recombinase transgenes and possibly other expression cassettes in many crop species after transformation.
Disclosure of Invention
The present invention provides gene regulatory elements for driving site-specific recombinases in plants that will result in efficient automatic excision of marker gene expression cassettes as well as expression cassettes for genome editing. The invention also provides recombinant DNA molecule constructs comprising the regulatory elements. The invention also provides constructs comprising the regulatory elements. In one embodiment, the regulatory element is operably linked to the site-specific recombinase. In certain embodiments, the regulatory element is a meiosis promoter. In other embodiments, the regulatory element is comprised in a construct comprising at least three transgene cassettes. The invention also provides methods of using the regulatory elements and of making and using recombinant DNA molecules and constructs comprising the regulatory elements.
Thus, in one aspect, the present invention provides a recombinant DNA molecule comprising a DNA regulatory sequence selected from the group consisting of: (a) A sequence having at least about 80% sequence identity to any one of SEQ ID NOs 1-26, 59-62 and 64-66; (b) A sequence comprising any one of SEQ ID NOs 1-26, 59-62 and 64-66; and (c) (i) any of SEQ ID NOS: 1-26, 59-62 and 64-66 or (ii) a fragment of any sequence having at least 80% sequence identity to any of SEQ ID NOS: 1-26, 59-62 and 64-66, wherein the fragment has gene regulatory activity; wherein the sequence is operably linked to a heterologous transcribable DNA sequence encoding a site-specific recombinase. In particular embodiments, the recombinant DNA molecule comprises a DNA regulatory sequence having at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the DNA sequence of any one of SEQ ID NOs 1-26, 59-62 and 64-66. In particular embodiments, the DNA regulatory sequences include regulatory elements having gene regulatory activity. In some embodiments, the regulatory element comprises a promoter. In still other embodiments, the regulatory element comprises an intron. In still other embodiments, the regulatory element comprises a 3' utr. In still other embodiments, the DNA regulatory sequence is a germline-preferred promoter. In other embodiments, the germline preference promoter is selected from the group consisting of: SEQ ID NO. 2, SEQ ID NO. 5, SEQ ID NO. 7, SEQ ID NO. 10, SEQ ID NO. 13, SEQ ID NO. 14, SEQ ID NO. 17, SEQ ID NO. 21 and SEQ ID NO. 65. In other embodiments, the germline-preferred promoter is the CDC45 promoter. In yet a further embodiment, the CDC45 promoter is selected from the group consisting of: SEQ ID NO. 2, SEQ ID NO. 5 and SEQ ID NO. 10, and sequences having at least 80% sequence identity to any of SEQ ID NO. 2, 5, 7, 10, 13, 14, 17, 21 and 65. In still other embodiments, the DNA regulatory sequence is an embryo-preferred promoter. In other embodiments, the DNA regulatory sequence is SEQ ID NO. 60 or a sequence having at least 80% sequence identity to SEQ ID NO. 60. In still other embodiments, the heterologous transcribable DNA sequence comprises a gene encoding a site-specific recombinase. In other embodiments, the site-specific recombinase is selected from the group consisting of: cre-recombinase, flp-recombinase, R-recombinase and Gin-recombinase. In yet another embodiment, the site-specific recombinase is Cre-recombinase.
In another aspect, the recombinant DNA construct further comprises one or both of the following expression cassettes: a selectable marker transgene; and/or transgenes of agronomic interest. In another embodiment, the recombinant DNA construct further comprises a pair of site-specific recombination site sequences flanking one or both transcribable DNA sequences encoding a site-specific recombinase and/or a selectable marker transgene, wherein the site-specific recombination site is cleavable by the site-specific recombinase. In a further embodiment, the selectable marker transgene of the recombinant DNA construct confers resistance to a herbicide or antibiotic. In other embodiments, the site-specific recombination site sequences of the recombinant DNA constructs are each selected from the group consisting of: loxP, lox.TATA-R9, FRT, RS and GIX. In a specific embodiment, the site-specific recombination site sequences of the recombinant DNA constructs are each LoxP or Lox. TATA-R9 sites. In other embodiments, the site-specific recombination site sequences of the recombinant DNA constructs each comprise SEQ ID NO. 44 or SEQ ID NO. 45.
In another aspect, the agronomically beneficial transgene of the recombinant DNA construct confers herbicide tolerance in the plant. In some embodiments, the agronomically beneficial transgene of the recombinant DNA construct confers insect resistance or disease resistance in plants. In further embodiments, agronomically beneficial transgenes of the recombinant DNA constructs confer increased yield or stress tolerance in plants. In still other embodiments, the agronomically beneficial transgene of the recombinant DNA construct encodes a dsRNA, miRNA, or siRNA.
In another aspect, the recombinant DNA construct further comprises one or both of: an expression cassette encoding a guide RNA; and/or an expression cassette encoding a site-specific nuclease. The recombinant DNA construct further comprises a site-specific recombination site sequence flanking the one or more transcribable DNA sequences encoding the site-specific recombinase, a selectable marker transgene, an expression cassette encoding the guide RNA, and/or an expression cassette encoding the site-specific nuclease, wherein the site-specific recombination site is cleavable by the site-specific recombinase. In further embodiments, the guide RNA comprises a targeting sequence that targets a sequence in the genome of the eukaryotic cell for genome editing or site-specific integration. In another embodiment, the eukaryotic cell is a plant cell. In yet another embodiment, the recombinant DNA construct comprises two or more expression cassettes encoding two or more guide RNAs. In a further embodiment, the recombinant DNA construct comprises two, three, four, five, six, seven, eight, nine or ten different expression cassettes encoding guide RNAs. In further embodiments, the site-specific nuclease is an RNA-guided endonuclease or a CRISPR-associated nuclease. In another embodiment, the RNA guided endonuclease or CRISPR-associated nuclease is selected from the group consisting of: cas1, cas1B, cas2, cas3, cas4, cas5, cas6, cas7, cas8, cas9, cas10, cpf1, cys2, csy3, cse1, cse2, csc1, csc2, csa5, csn2, csm3, csm4, csm5, csm6, cmr1, cmr3, cmr4, cmr5, cmr6, csb1, csb2, csb3, csx17, csx14, csx10, csx16, csaX, csx3, csx1, csx15, csf1, csf2, csf3, csf4, casX and CasY. In specific embodiments, the RNA guided endonuclease or CRISPR-associated nuclease is Cpf1 or Cas9.
In another aspect, provided herein are DNA molecules or vectors comprising recombinant DNA constructs. In another embodiment, the DNA transformation vector comprises a recombinant DNA construct and a T-DNA segment bounded by a left border and a right border. In a further embodiment, the transcribable DNA sequence encoding the site-specific recombinase is located between the left and right boundaries of the T-DNA segment within the DNA transfer vector. In yet another embodiment, the DNA transformation vector comprises a recombinant DNA construct and a T-DNA segment having a left border and a right border, wherein one or more of a transcribable DNA sequence encoding a site-specific recombinase, a selectable marker transgene, and/or an agronomically beneficial transgene is located between the left border and the right border of the T-DNA segment. In further embodiments, the DNA transformation vector comprises a recombinant DNA construct and a T-DNA segment having a left border and a right border, wherein one or more of a transcribable DNA sequence encoding a site-specific recombinase, a selectable marker transgene, an agronomically beneficial transgene, an expression cassette encoding a guide RNA, and/or an expression cassette encoding a site-specific nuclease is located between the left border and the right border of the T-DNA segment.
In another aspect, provided herein are transgenic plants, plant parts, or plant cells comprising the recombinant DNA constructs. The recombinant DNA construct is stably transformed into the genome of a transgenic plant, plant part or plant cell. The transgenic plant, plant part or plant cell is a maize, soybean, cotton or canola plant, plant part or plant cell. Also provided herein are bacterial cells comprising the recombinant DNA constructs or transformation vectors.
In another aspect, provided herein is a method for producing a transgenic plant or plant part, comprising (a) transforming a plant cell of an explant with a DNA molecule or vector comprising a recombinant DNA construct to produce one or more transformed plant cells comprising the recombinant DNA construct stably transformed into the genome of the one or more transformed plant cells; (b) Regenerating or producing a transgenic plant from an explant, wherein the transgenic plant comprises a recombinant DNA construct stably transformed into the genome of one or more cells of the transgenic plant. In one embodiment, the plant cells are transformed via agrobacterium-mediated transformation or rhizobia-mediated transformation. In another embodiment, plant cells are transformed via particle-mediated transformation or particle bombardment-mediated transformation. In yet another embodiment, the transgenic plant and plant cell are maize, soybean, cotton or canola plants and plant cells, respectively. In yet another embodiment, the method further comprises: (c) Isolating or harvesting a plant part from said transgenic plant.
In another aspect, provided herein is a method of excision of an expression cassette from the genome of a transgenic plant, the method comprising: (a) Transforming a plant cell with a DNA molecule or vector comprising the recombinant DNA construct of any one of claims 13-30 to produce one or more transformed plant cells comprising the recombinant DNA construct stably transformed into the genome of the one or more transformed plant cells; (b) Regenerating or producing a transgenic plant at least in part from the one or more stably transformed plant cells; (c) Crossing the transgenic plant with itself or another plant; and (d) selecting one or more progeny plants in which one or both of the transcribable DNA sequences encoding the site-specific recombinase and/or the selectable marker transgene between the pair of site-specific recombination site sequences of the recombinant DNA construct is excised and no longer present in the genome of the progeny plant. In a further embodiment of the method, the recombinant DNA construct further comprises one or both of the following expression cassettes located between the pair of site-specific recombination site sequences of the recombinant DNA construct: and wherein one or more progeny plants are selected in which one or more of the transcribable DNA sequence encoding the site-specific recombinase, the selectable marker transgene, the expression cassette encoding the guide RNA, and/or the expression cassette encoding the site-specific nuclease of the recombinant DNA construct is excised and no longer present in the genome of the progeny plant. In particular embodiments, the transgenic plant and plant cell are maize, soybean, cotton or canola plants and plant cells, respectively. In another embodiment, the method further comprises (e) isolating or harvesting plant parts from one or more of the progeny plants. In yet another embodiment, the method further comprises (f) crossing one or more of the progeny plants with itself or another plant.
Brief description of the sequence
SEQ ID NO. 1 is a DNA sequence of the regulatory expression element group (EXP) EXP-Zm.Cdc45-1+Zm.DnaK:1:1 comprising a promoter (P-Zm.Cdc45-1:8) operably linked to 5 'of a leader sequence (L-Zm.Cdc45-1:1) operably linked to 5' of an intron (I-Zm.DnaK:1).
SEQ ID NO. 2 is the DNA sequence of the promoter (P-Zm.Cdc45-1:8).
SEQ ID NO. 3 is the DNA sequence of the leader sequence (L-Zm.Cdc45-1:1).
SEQ ID NO. 4 is a DNA sequence of EXP (EXP-Os.Cdc45-1:1:1) comprising a promoter (P-Os.Cdc45-1-1:1:1) operably linked to 5' of the leader sequence (L-Os.Cdc45-1-1:1:1).
SEQ ID NO. 5 is the DNA sequence of the promoter (P-Os.Cdc45-1-1:1:1).
SEQ ID NO. 6 is the DNA sequence of the leader sequence (L-Zm.Cdc45-1:1).
SEQ ID NO. 7 is the DNA sequence of EXP (EXP-at. Mei 1) consisting of a promoter and a leader sequence.
SEQ ID NO. 8 is the DNA sequence of the 3' UTR (T-at. Mei 1-1:2:1).
SEQ ID NO. 9 is the DNA sequence of EXP (EXP-At.Cdc45:1:1) comprising a promoter (P-At.Cdc45-1:1:1) operably linked to 5' of the leader sequence (L-At.Cdc45-1:1:1).
SEQ ID NO. 10 is the DNA sequence of the promoter (P-at. Cdc45-1:1:1:1).
SEQ ID NO. 11 is the DNA sequence of the leader sequence (L-At.Cdc45-1:1:1).
SEQ ID NO. 12 is the DNA sequence of the 3' UTR (T-At.Cdc45:1).
SEQ ID NO. 13 is the DNA sequence of EXP (EXP-at. Swi 1) consisting of a promoter and a leader sequence.
SEQ ID NO. 14 is the DNA sequence of EXP (EXP-at. Swi1a) consisting of a promoter and a leader sequence.
SEQ ID NO. 15 is the DNA sequence of the 3' UTR (T-At. Swi 1-1:2:1).
SEQ ID NO. 16 is a DNA sequence of EXP (EXP-At. Asy1:1:1) comprising a promoter (P-At. Asy1-1:1:1) operably linked to 5' of the leader sequence (L-At. Asy1-1:1:1).
SEQ ID NO. 17 is the DNA sequence of the promoter (P-At. Asy1-1:1:1).
SEQ ID NO. 18 is the DNA sequence of the leader sequence (L-At. Asy1-1:1:1).
SEQ ID NO. 19 is the DNA sequence of the 3' UTR (T-At. Asy1-1:1:1).
SEQ ID NO. 20 is a DNA sequence of EXP (EXP-Gm. Rsp-1:1) comprising a promoter (P-Gm. Rsp-1-1:1:1) operably linked to 5' of a leader sequence (L-Gm. Rsp-1-1:1:1).
SEQ ID NO. 21 is the DNA sequence of the promoter (P-Gm. Rsp-1-1:1:1).
SEQ ID NO. 22 is a DNA sequence of the leader sequence (L-Gm. Rsp-1-1:1:1).
SEQ ID NO. 23 is the DNA sequence of the 3' UTR (T-At.Cdc45:3).
SEQ ID NO. 24 is the DNA sequence of the 3' UTR (T-At.Cdc45:4).
SEQ ID NO. 25 is the DNA sequence of EXP (EXP-Gm. Rsp-1+Gm. Rsp-1+AtpE: 1) comprising a promoter (P-Gm. Rsp-1-1:1:1) operably linked to the 5 'of a leader sequence (L-Gm. Rsp-1-1:1:1) operably linked to the 5' of an intron (I-at. AtpE: 1).
SEQ ID NO. 26 is the DNA sequence of the intron (I-At. AtpE: 1).
SEQ ID NO. 27 is a DNA sequence of EXP (EXP-Zm.Cdc45-2+Zm.DnaK: 1: 2) comprising a promoter (P-Zm.Cdc45-2-1: 3) operably linked to 5 'of a leader sequence (L-Zm.Cdc45-2-1: 1) operably linked to 5' of an intron (I-Zm.DnaK: 1).
SEQ ID NO. 28 is the DNA sequence of the promoter (P-Zm.Cdc45-2-1:1:3).
SEQ ID NO. 29 is the DNA sequence of the leader sequence (L-Zm.Cdc45-2-1:1:1).
SEQ ID NO. 30 is the DNA sequence of EXP (EXP-Zm.Zm13:2) comprising a promoter (P-Zm.Zm13:2) operably linked to 5' of the leader sequence (L-Zm.Zm13:2).
SEQ ID NO. 31 is the DNA sequence of the promoter (P-Zm.Zm13:2).
SEQ ID NO. 32 is the DNA sequence of the leader sequence (L-Zm.Zm13:2).
SEQ ID NO. 33 is a DNA sequence of EXP (EXP-Zm.wall+Zm.DnaK: 1:5) comprising a promoter (P-Zm.wall-1:1:9) operably linked to 5 'of a leader sequence (P-Zm.wall-1:1:1) operably linked to 5' of an intron (I-Zm.DnaK: 1).
SEQ ID NO. 34 is the DNA sequence of the promoter (P-Zm.wall-1:1:9).
SEQ ID NO. 35 is the DNA sequence of the leader sequence (L-Zm.wall-1:1:1).
SEQ ID NO. 36 is the DNA sequence of EXP (EXP-Syn 1) consisting of a promoter and a leader sequence.
SEQ ID NO. 37 is the DNA sequence of EXP (EXP-Syn 1 a) consisting of a promoter and a leader sequence.
SEQ ID NO. 38 is the DNA sequence of the 3' UTR (T-At. Syn 1-1:2:1).
SEQ ID NO. 39 is the DNA sequence of the intron (I-Zm.DnaK: 1).
SEQ ID NO. 40 is the DNA sequence of EXP (EXP-At.Dmc1+Zm.DnaK: 1:1) comprising a promoter (P-At.Dmc1:1) operably linked to 5 'of a leader sequence (L-At.Dmc1-1:1:1) operably linked to 5' of an intron (I-Zm.DnaK: 1).
SEQ ID NO. 41 is the DNA sequence of the promoter (P-at. Dmc1:1).
SEQ ID NO. 42 is the DNA sequence of the leader sequence (L-at. Dmc1-1:1:1).
SEQ ID NO. 43 is the coding sequence of Cre-recombinase (Cre) with a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (Genbank accession number X04753).
SEQ ID NO. 44 is the DNA sequence of the Cre-recombinase site-specific recombination site (RS-P1. Lox1: 1).
SEQ ID NO. 45 is the DNA sequence of the Cre-recombinase site-specific recombination site (RS-P1. Lox. TA TA-R9-1:1:1).
SEQ ID NO. 46 is the DNA sequence of EXP (EXP-Os. Act1:1) consisting of the promoter, leader sequence and intron from the rice actin 1 gene.
SEQ ID NO. 47 is the coding sequence of plastid-targeted EPSPS (CP 4) which confers tolerance to the herbicide glyphosate.
SEQ ID NO. 48 shows the DNA sequence of the 3' UTR (T-AGRtu. Nos. 13).
SEQ ID NO. 49 is the DNA sequence of EXP (EXP-Os. Act1+CaMV.35S.2xA1-B3+Ta. Lhc B1:1) consisting of the enhanced promoter and leader sequence.
SEQ ID NO. 50 is a coding sequence for beta-Glucuronidase (GUS) having a processable intron derived from the potato light-induced tissue specific ST-LS1 gene (Genbank accession number X04753).
SEQ ID NO. 51 is the DNA sequence of the 3' UTR (T-St. Pis 4-1:4:1).
SEQ ID NO. 52 is the DNA sequence of EXP (EXP-Os. TubA-3:1) consisting of the promoter, leader sequence and intron from the rice tubulin gene.
SEQ ID NO. 53 is the DNA sequence of EXP (EXP-at. Act7:2) consisting of a promoter, a leader sequence and an intron derived from the Arabidopsis actin 7 gene.
SEQ ID NO. 54 is a plastid-targeted coding sequence of GOI-at. ShkG-CTP2+ec. AadA-SPC/STR 1:1, which confers resistance to the antibiotic spectinomycin.
SEQ ID NO. 55 is the DNA sequence of EXP (EXP-CaMV. 35S-enh: 1:2) consisting of the enhanced promoter and leader sequence.
SEQ ID NO. 56 is the DNA sequence of the promoter (P-Br. Snap 2-1:1:20).
SEQ ID NO. 57 is a DNA sequence encoding a chloroplast transit peptide (TS-Ps. RbcS-3C-1:3:1).
SEQ ID NO. 58 is a DN A coding sequence encoding the crtB gene (CR-PANag. CrtB. Nno-1:4:1).
SEQ ID NO. 59 is the DNA sequence of the 3' UTR (T-Br. Snap 2-1:3:6).
SEQ ID NO. 60 is the DNA sequence of EXP (EXP-vf. Usp88-enh: 1:1) consisting of an enhancer, a chimeric promoter (P-vf. Usp 88-chimera) and a leader sequence.
SEQ ID NO. 61 is the DNA sequence of the chimeric promoter P-vf. Usp88-chimera, consisting of the following elements: an enhancer derived from the vf. Usp88 promoter operably linked to 5' of the vf. Usp88 promoter.
SEQ ID NO. 62 is the DNA sequence of the leader sequence (L-vf. Usp-1:1:1).
SEQ ID NO. 63 is a DNA coding sequence encoding the splA gene (CR-AGRtu. SplA-C58:1:3).
SEQ ID NO. 64 is the DNA sequence of EXP (EXP-Gm. Nmh7:1) consisting of the following elements: a promoter (P-Gm.Nmh7-1:1:12) operably linked to 5' of the leader sequence (L-Gm.Nmh7:1).
SEQ ID NO. 65 is the DNA sequence of the promoter (P-Gm.Nmh7-1:1:12).
SEQ ID NO. 66 is the DNA sequence of the leader sequence (L-Gm.Nmh7:1).
SEQ ID NO. 67 is the DNA sequence of the 3' UTR (T-Gb.E6-3 b: 1:1).
Detailed Description
The present invention provides gene regulatory elements for driving the expression of site-specific recombinases in plants which will result in efficient automatic excision of marker gene cassettes. The invention also provides constructs and recombinant DNA molecules comprising the regulatory elements. The invention also provides methods of automatically excision of at least two transgene expression cassettes from the genome of a transgenic plant by using constructs comprising the transgene cassettes, wherein the gene regulatory elements described herein are operably linked to a site-specific recombinase gene.
The following definitions are provided for certain terms and phrases used herein. Unless otherwise defined in the present disclosure, terms and phrases used herein should be understood by those of ordinary skill and knowledge in the relevant art in light of their ordinary meaning.
Site-specific recombinase and excision of DNA segments
As used herein, a "site-specific recombinase" is an enzyme that binds to a particular DNA recognition sequence and catalyzes DNA cleavage, DNA strand exchange, and recombination of DNA between two site-specific recombinase site sequences. "site-specific recombination" or "site-specific recombinase system" or "site-specific recombinase technology" or "site-directed recombination" or "site-directed recombinase system" or "site-directed recombinase technology" describe various specialized recombination processes involving the exchange between defined DNA sites. As used herein, the term "flanking" refers to two or more sequences, such as site-specific recombination site sequences, located on either side of one or more specific loci/gene sites, genes, sequences, transgenes, or expression cassettes. Site-specific recombination site sequences can be cloned into a recombinant DNA construct relative to 5 'and 3' of the DNA segment comprising the expression cassette under which recombination will occur (i.e., flanking the segment of DNA). Depending on the initial arrangement of parent site-specific recombination sites, site-specific recombination has one of three possible outcomes: integration (insertion of foreign DNA segment), excision (removal of DNA segment) or inversion (rotation of DNA segment 180 degrees before re-ligating the two end segments). Integration results from recombination between sites on different DNA molecules (provided that at least one parent chromosome is circular) and occurs in a uniquely defined orientation.
For recombination sites located on the same DNA molecule or chromosome, the outcome may depend on their relative orientation. Although inversion of a DNA segment may result from exchange between inverted (head-to-head) sites, excision may result from recombination between sites in the head-to-tail direction (Nigel et al (2006) Mechanisms of Site-Specific recombination. Annu. Rev. Bioche m, 75:567-605). A number of Site-specific recombinases can be used to cleave DNA between two Site-specific recombinase recognition sites, such as Cre-recombinase recognizing Lox sites, flp-recombinase recognizing FR T sites (see, e.g., lyznik, L. Et al, (2000) Gene Transf er Mediated by Site-Specific Recombination Systems, plant Molecular Biology Manual N, 1-26), R-recombinase recognizing RS sites (see, e.g., machida, C. Et al, (2000) Use of the R-RS Site-Specific Recombination Sy stem in Plants, plant Molecular Biology Manual N2, 1-23), or Gin-recombinase recognizing GIX sites (see, e.g., maeser, S.et al, (1991) The Gin reco mbinase of phage Mu can catalyze Site-specific recombination in plant protoplasts, mol Genet, 230:170-176). Each of the above-described site-specific recombinase systems has been shown to function in plants. The Cre/Lox site-specific recombinase system is the most commonly relied upon marker excision system in plant biotechnology.
Site-specific recombinases can be used in plant biotechnology to remove marker gene expression cassettes as well as other expression cassettes and DNA segments from transgenic plants. Typically, plants are transformed with a recombinant DNA construct or vector comprising a plurality of expression cassettes. The expression cassettes can be used to express transgenes that provide beneficial properties to plants as well as transgenes that serve as markers to select transformed plant cells, such as antibiotic resistance genes, herbicide tolerance genes, or other transgenes useful in the selection process. The transgene cassette for the marker gene is flanked by a pair of site-specific recombinase recognition sites. Following transformation and selection, regenerated transformed plants are grown. Excision of the marker gene can then be removed by various hybridization strategies (by hybridization with a plant site-specific recombinase expression line or by automatic excision).
Hybridization using plant site-specific recombinase expression lines is generally performed as follows. Let R be 0 The transformed plants were selfed. R is then selected for the presence of the recombinant DNA construct 1 And (5) offspring plants. Then let the selected R 1 Progeny plants are selfed and R is selected to be homozygous for the recombinant DNA construct insertion 2 And (5) offspring plants. Then homozygously R 2 The progeny plant is crossed to another line expressing the recombinase. As a result of this hybridization, the recombinase cleaves the marker gene cassette flanked by site-specific recombinase recognition sequences, producing F comprising the desired cassette but with the marker gene cassette excised from the genome 1 And (5) offspring plants. Then let the obtained F 1 Offspring were selfed and F was selected that lacks the recombinase but is homozygous for the now modified recombinant DNA construct insertion 2 And (5) offspring plants.
Another strategy to remove marker gene expression cassettes is by means of automatic excision. Similar to the excision methods described above, the expressed recombinase is used to excision of the marker gene cassette, but rather than hybridizing the transformed plant to another line expressing the recombinase, the recombinase gene cassette is located in the same recombinant DNA construct and flanked by the site-specific recombinase site sequence and the marker gene cassette. The expression cassette that remains in the transgenic plant after automatic excision is expected to be present in the recombinant DNA construct outside the site-specific recombinase site sequence. After transformation and plant regeneration, R containing the recombinant DNA construct is generated 0 And (5) a plant. Those R can then be made 0 Plant selfing and the resulting R can be selected for the presence of altered recombinant DNA constructs in which the marker gene expression cassette and recombinase expression cassette have been excised 1 And (5) offspring plants. An advantage of the automatic excision system is that one can eliminate the marker gene expression cassette by means of fewer generations than if a site-specific recombinase excision system were used in which hybridization to another line expressing the site-specific recombinase is required.
In automatic excision to produce label-free R 1 One complicating factor in the development of progeny plants, automatic excision, is the finding of expression elements that provide for the expression of the site-specific recombinase at the correct time. One approach is to use expression elements that are active in the germ line or embryo phase of the plant, but not all germ line or embryo preferred expression elements will provide successful results for efficient automatic excision to occur. Some germline-preferred or embryo-preferred expression elements may have a leak in their expression, and some are unable to express the site-specific recombinase at sufficient levels to effectively excise the marker and/or recombinase gene. In addition, the germline or embryo expression elements may provide effective automatic excision only in specific crop species (such as corn, soybean, or cotton) but not in all three crops.
DNA molecules
As used herein, the term "DNA" or "DNA molecule" refers to a double-stranded DNA molecule of genomic or synthetic origin, i.e., a polymer of deoxyribonucleotide bases or a DNA molecule. As used herein, the term "DNA sequence" refers to the nucleotide sequence of a DNA molecule read from the 5 '(upstream) end to the 3' (downstream) end.
As used herein, a "recombinant DNA molecule" or "recombinant DNA construct" is a DNA molecule or construct, respectively, that comprises a combination of DNA sequences that do not naturally occur together without human intervention. For example, a recombinant DNA molecule may comprise at least two DNA sequences that are heterologous to each other: DNA sequences derived from DNA sequences present in nature, synthetic DNA sequences, and/or DNA sequences that have been incorporated into the genomic DNA of a host cell by genetic transformation, genome editing, or site-specific integration.
In this application, reference to an "isolated DNA molecule" or equivalent term or phrase is intended to mean that the DNA molecule is one that exists alone or in combination with other components but is not present in its natural environment. For example, a nucleic acid element that naturally occurs in the genome of an organism, such as a coding sequence, an intron sequence, an untranslated sequence, a leader sequence, a promoter sequence, a transcription termination sequence, etc., is not considered "isolated" as long as the element is native to the genome of the organism and in a location within the genome where it naturally occurs. However, each of these elements, as well as sub-portions of these elements, will be "isolated" within the scope of this disclosure as long as the elements are not within their native genome and/or are in their naturally occurring locations within the genome. For the purposes of this disclosure, any transgenic nucleotide sequence, i.e., a nucleotide sequence of DNA inserted into the genome of a plant or bacterial cell or present in an extrachromosomal vector, will be considered an isolated nucleotide sequence, whether it is present within a plasmid or similar vector used to transform the cell within the genome of the plant or bacterium, or is present in a detectable amount in a tissue, progeny, biological sample, or commodity product derived from the plant or bacterium.
As used herein, the term "sequence identity" refers to the degree to which two optimally aligned polynucleotide sequences or two optimally aligned polypeptide sequences are identical. By aligning two sequences, e.g., a reference sequence and another sequence, to maximize the number of nucleotide matches in a sequence alignment with the appropriate internal nucleotide insertions, deletions, or gaps, an optimal sequence alignment of the two sequences is produced. As used herein, the term "reference sequence" may refer to a DNA sequence comprising one or more of SEQ ID NOS 1-26, 59-62 and 64-66.
As used herein, the term "percent sequence identity" or "percent identity" is the identity score of two optimally aligned sequences multiplied by 100. The "identity score" of a sequence optimally aligned to a reference sequence is the number of nucleotide matches in the optimal alignment divided by the total number of nucleotides in the reference sequence (i.e., the total number of nucleotides in the entire reference sequence over the entire length). Thus, some embodiments of the present disclosure provide a DNA molecule comprising a regulatory sequence having at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity, or 100% identity to a reference sequence, such as one of SEQ ID NOs 1-26, 59-62, and 64-66, when optimally aligned with said reference sequence. According to embodiments of the invention, the regulatory sequences may be operably linked to a transcribable DNA sequence that may encode a site-specific recombinase.
Regulatory element
Regulatory elements such as promoters, leader sequences (also known as 5 'UTRs), enhancers, introns, and transcription termination regions (or 3' UTRs) play an indispensable role in the overall expression of genes in living cells. As used herein, the term "regulatory element" refers to a DNA molecule or sequence or DNA segment having gene regulatory activity. As used herein, the term "gene regulatory activity" refers to the ability to affect expression of an operably linked transcribable DNA molecule, for example by affecting transcription and/or translation of the operably linked transcribable DNA molecule. Regulatory elements that function in plants, such as promoters, leader sequences, enhancers, introns and 3' utrs, may be used to modify plant phenotypes by genetic engineering. According to an embodiment of the disclosure, the regulatory element is a promoter having a sequence comprising SEQ ID No. 2, 5, 10, 17, 21, 61 or 65, or a sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity or 100% identity to SEQ ID No. 2, 5, 10, 17, 21, 61 or 65, or a functional fragment or portion of any of the foregoing sequences. According to an embodiment of the present disclosure, the regulatory element is a leader sequence having a sequence comprising SEQ ID NO 3, 6, 11, 18, 22, 61 or 66, or a sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity or 100% identity to SEQ ID NO 3, 6, 11, 18, 22, 61 or 66, or a functional fragment or portion of any of the foregoing sequences that can affect expression of an operably linked transcribable DNA sequence.
As used herein, a "fragment" of a promoter (or promoter sequence) or regulatory element comprises a fragment or portion of a promoter (or promoter sequence) or regulatory element, respectively, and a "functional fragment" of a promoter (or promoter sequence) or regulatory element comprises a fragment or portion of a promoter (or promoter sequence) or regulatory element, respectively, that affects, regulates or drives expression of an operably linked transcribable DNA sequence. According to some embodiments, a "functional fragment" of a promoter (or promoter sequence) or regulatory element affects, modulates or drives expression of an operably linked transcribable DNA sequence in a similar manner to a promoter (or promoter sequence) or regulatory element.
As used herein, a "set of regulatory expression elements" or "EXP" sequence refers to a set of two or more operably linked regulatory elements, such as enhancers, promoters, leader sequences, and introns. Such two or more operably linked regulatory elements may typically be present together in the same construct and each operably linked to a transcribable DNA sequence. For example, a set of regulatory expression elements may consist of, for example, a promoter operably linked to the 5' of the leader sequence. EXP used to practice embodiments of the present invention can comprise SEQ ID NO:1, 4, 7, 9, 13, 14, 16, 20, 25, 60, or 64, and sequences having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity, or 100% identity to SEQ ID NO:1, 4, 7, 9, 13, 14, 16, 20, 25, 60, or 64.
Regulatory elements may be characterized by their associated gene expression patterns in plants, plant tissues and plant cells, for example, their positive and/or negative effects on expression, such as constitutive expression or specific expression patterns, such as temporal, spatial, developmental, tissue, environmental, physiological, pathological or cell cycle expression, and/or chemical reaction or inducible expression, and any combination thereof, as well as quantitative or qualitative indications or patterns of expression. As used herein, a "gene expression pattern" is any pattern of transcription of an operably linked DNA molecule into a transcribed RNA molecule that results in the relative levels and abundance of the transcribed RNA molecule in various plant tissues and cells during development. Regulatory elements may include enhancers, promoters, leader sequences, 5 'UTRs, introns and/or 3' UTRs. Regulatory elements of the present disclosure may include regulatory elements or promoters of germline preference or embryo preference.
As used herein, the term "promoter" generally refers to a DNA molecule, segment, or sequence that is involved in the recognition and binding of RNA polymerase II and other proteins (such as trans-acting transcription factors) to initiate or regulate transcription. The promoter may be initially isolated upstream or 5 'untranslated region (5' UTR) of the genomic copy of the gene. Alternatively, the promoter may be a synthetically produced or engineered DNA molecule. Promoters may also be chimeric. Chimeric promoters are produced by fusion of two or more heterologous DNA molecules. Promoters useful in practicing embodiments of the present invention may include promoter elements comprising SEQ ID NOs 2, 5, 10, 17, 21, 61 or 65; or within any one of SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64; or a sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity, or 100% identity to SEQ ID No. 2, 5, 10, 17, 21, 61, or 65; or a sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity, or 100% identity to a sequence within any one of SEQ ID nos. 1, 4, 7, 9, 13, 14, 16, 20, 25, 60, and 64; or a functional fragment or portion of any of the foregoing sequences. In particular embodiments, a DNA molecule as described herein, and any variant, fragment, portion or derivative thereof, is further defined as comprising promoter activity, i.e., being capable of acting as a promoter in a host cell, such as in a transgenic plant. In still further embodiments, a fragment of a promoter sequence may be defined as exhibiting promoter activity possessed by the starting promoter molecule from which it is derived, or the fragment may comprise a "minimal promoter" that provides basal levels of transcription and is composed of a TATA box or equivalent DNA sequence for identifying and binding to RNA polymerase II complexes for initiating transcription.
In one embodiment, fragments of the promoter sequences disclosed herein are provided. Promoter fragments may comprise promoter activity as described above and may be used alone or in combination with other promoters and/or promoter fragments, for example, for the construction of chimeric promoters, or in combination with other expression or regulatory elements and expression or regulatory element fragments. In particular embodiments, there is provided a promoter fragment comprising at least about 50, at least about 75, at least about 95, at least about 100, at least about 125, at least about 150, at least about 175, at least about 200, at least about 225, at least about 250, at least about 275, at least about 300, at least about 500, at least about 600, at least about 700, at least about 750, at least about 800, at least about 900, or at least about 1000 or more contiguous nucleotides of a promoter, promoter sequence, or DNA molecule having promoter activity as disclosed herein.
For example, modifying or altering expression using methods known in the art, such as by removing elements or element portions or nonfunctional spacer sequences that may have a positive or negative effect on expression; replicating elements that have a positive or negative effect on expression; inserting elements that have a positive or negative effect on expression; and/or replication or removal of elements having tissue-specific, developmental or cell-specific effects on expression, recombinant DNA molecules or constructs comprising promoter or regulatory elements derived from any of the promoter elements provided as SEQ ID NOs 2, 5, 10, 17, 21, 61 and 65, or from any sequence within any of SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64, such as internal or truncated sequences or sequences with 5' deletions, may be produced. The enhancer element can be made, for example, using any recombinant DNA construct or molecule comprising a promoter or regulatory element derived from any promoter element provided as SEQ ID No. 2, 5, 10, 17, 21, 61 or 65, or from any sequence within SEQ ID No. 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 or 64, including 3' deletions, wherein the TATA box element or equivalent sequence thereof and downstream sequences are removed. Further deletions may be made to remove any positive or negative effects on expression; tissue specificity; cell specificity; or time-specific (such as, but not limited to, circadian rhythm or developmental time) acting elements. Any promoter element provided as or contained within any one of SEQ ID NOs.2, 5, 10, 17, 21, 61 and 65, as well as fragments or enhancers derived therefrom, may be used to prepare chimeric transcription regulatory element compositions.
According to the invention, promoters or promoter fragments may be analyzed for the presence of known promoter elements (i.e., DNA sequence features such as TATA boxes and other known transcription factor binding site motifs). The skilled artisan can use the identification of such known promoter elements to design promoter variants having similar expression patterns to the original promoter.
As used herein, a "germline preferred promoter" is defined as a promoter that drives expression of an operably linked gene (or transgene) primarily in one or more germline cells of a plant but may also drive expression of an operably linked gene (or transgene) in other cells or tissues of a plant. "seed line" is used as a collective term for these cells, which are gamete cells or gamete cell progenitors, differentiate into gamete cells or gamete cell progenitors, or have at least one progeny that is gamete cells or gamete cell progenitors. Genetic modifications in germ line cells can be transferred to progeny plants by gametes derived or inherited from such germ line cells. The use of a germline preferred promoter to drive expression of a site-specific recombinase allows removal of markers and/or recombinase genes flanking the site-specific recombinase recognition sequence in the germline cell in which the site-specific recombinase is expressed. The resulting gametes will have an altered transgene in which the marker gene expression cassette and/or recombinase expression cassette are no longer present, and such altered transgene can then be transferred to the progeny plant. By using Germ line preferred promoters to drive expression of site-specific recombinases for automatic excision can be found in the gene from R 0 R of parent plant for selfing or outcrossing 1 Removal of the marker and/or recombinase gene is accomplished in the generation, as opposed to the next generation in the case where the site-specific recombinase may be introduced by hybridization with a different system having a recombinase expression construct.
The germ line preferred promoters that effectively drive automatic excision in plants are not identical, depending on the type of crop in which the excision is expected. For example, while the Arabidopsis DCM1 promoter has been shown to drive excision of the GUS marker gene in Arabidopsis germline cells (see, e.g., van Ex et al (2009) "Evaluation of seven promoters to achieve germline directed Cre-lox recombination in Arabidopsis thaliana" Plant Cell Rep.28:1509-1520), such a promoter does not drive automatic excision in stably transformed soybean plants (see, e.g., example 4 below). Other Arabidopsis derived germline preferred promoters may drive automatic excision in transgenic soybean plants, such as those comprising the promoter sequence of SEQ ID NO. 10 or SEQ ID NO. 17, as well as those within SEQ ID NO. 7, SEQ ID NO. 13 or SEQ ID NO. 14 (see, e.g., example 3). Promoters derived from the plant-derived germ-line-preferred CDC45 gene have been shown to be capable of driving automatic excision in crop plants. The Arabidopsis CDC45 promoter SEQ ID NO 10 was shown to drive efficient automatic excision in transgenic soybean (example 3) and cotton plants (example 5). The CDC45-1 promoter of maize (SEQ ID NO: 2) and rice (SEQ ID NO: 5) was shown to drive efficient automatic excision in transgenic maize plants (example 2). Although the soybean Rsp1 promoter (SEQ ID NO: 21) is shown to drive automatic excision in soybean plants, the Rsp1 promoter requires an intron (I-at. AtpE:1, SEQ ID NO: 26) operably linked to the Rsp1 promoter and leader sequence to drive efficient automatic excision in transgenic cotton plants. However, operably linking the I-at.atpe intron to the Rsp1 promoter and leader sequence failed to drive efficient automatic excision in transgenic soybean. It was shown that the soybean Rsp1 promoter and P-Gm.Nmh7-1:1:12 (SEQ ID NO: 65) driven automatic excision of the four transgene cassettes: cre-recombinase; marker gene, cpf1 and guide RNA expression cassette.
As used herein, an "embryo-preferred promoter" is defined as a promoter that drives expression of an operably linked gene (or transgene) primarily in one or more cells of a embryo but may also drive expression of an operably linked gene (or transgene) in other cells or tissues of a seed or plant. A chimeric promoter P-vf. Usp 88-chimera (SEQ ID NO: 61) showing embryo preference driven automatic excision in transgenic canola plants.
As used herein, the term "leader sequence" refers to a DNA molecule isolated from the untranslated 5 'region (5' utr) of a gene, and is generally defined as a segment of nucleotides between a Transcription Start Site (TSS) and a protein coding sequence start site. Alternatively, the leader sequence may be a synthetically produced or engineered DNA element. The leader sequence may be used as a 5' regulatory element for regulating expression of the operably linked transcribable DNA sequence. The leader sequence may be used with a heterologous promoter or its native promoter. Leader sequences for practicing embodiments of the invention may include SEQ ID NOs 3, 6, 11, 18, 22, 62 or 66; or a sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity, or 100% identity to SEQ ID No. 3, 6, 11, 18, 22, 62, or 66; or a functional fragment or portion of any of the foregoing sequences; or any leader element contained within any of SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64 or within any sequence having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity or 100% identity to SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64, or a functional fragment or portion thereof. In particular embodiments, such DNA sequences may be defined as being capable of acting as leader sequences in host cells, including, for example, transgenic plant cells. In one embodiment, such sequences are defined as comprising leader sequence activity.
Leader sequences (also referred to as 5' UTRs) presented as SEQ ID NOs 3, 6, 11, 18, 22, 62 and 66; or any leader element contained within any of SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64 may be comprised of regulatory elements and/or may employ secondary structures that regulate or affect transcription or translation of operably linked transcribable DNA sequences. Leader sequences presented as SEQ ID NOs 3, 6, 11, 18, 22, 62 and 66 or any fragment thereof may be made according to the present disclosure; or any leader sequence element contained within any one of SEQ ID NOs 1, 4, 7, 9, 13, 14, 16, 20, 25, 60 and 64 or any fragment thereof, to produce chimeric regulatory elements that affect transcription or translation of an operably linked transcribable DNA sequence.
As used herein, the term "intron" refers to a DNA molecule or sequence that can be isolated or identified from a gene and can be generally defined as a region that is spliced out during the processing of pre-translational messenger RNA (mRNA). Alternatively, the introns may be synthetically produced or engineered DNA elements. Introns may contain enhancer elements that affect transcription of an operably linked gene or a transcribable DNA sequence. Introns may be used as regulatory elements for regulating expression of operably linked transcribable DNA sequences. A construct may comprise introns, and the introns may or may not be heterologous with respect to the transcribable DNA sequence. Examples of introns in the art include the rice actin intron and the maize HSP70 intron.
In plants, the inclusion of some introns in the genetic construct results in increased mRNA and protein accumulation relative to constructs lacking introns. This effect is known as "intron-mediated enhancement" (IME) of gene expression. Introns known to stimulate expression in plants have been identified in maize genes (e.g., tubA1, adh1, sh1, and Ubi 1), rice genes (e.g., tpi), and dicotyledonous plant genes such as those from petunia (e.g., rbcS), potato (e.g., st-ls 1), and from arabidopsis thaliana (e.g., ubq3 and pat 1). Deletions or mutations within the splice sites of introns have been shown to reduce gene expression, suggesting that splicing may be required for IME. However, IME in dicots has been shown by point mutations within the splice site of the pat1 gene from Arabidopsis thaliana. The same intron has been shown to exhibit disadvantages for multiple uses in one plant. In those cases, it is necessary to collect the basic control elements to construct the appropriate recombinant DNA elements. Exemplary introns for use in practicing the invention are presented as SEQ ID NOs 26 and 39.
As used herein, the term "3 'transcription termination sequence", "3' untranslated region" or "3'utr" refers to a DNA sequence transcribed into an untranslated region within the 3' portion of an mRNA molecule, as is commonly understood in the art. The 3 'untranslated region of an mRNA molecule can be generated by specific cleavage and 3' polyadenylation (also known as polyA tail formation). The 3' utr may be operably linked to and downstream of the RNA or protein encoding portion of the transcribable DNA sequence and may include polyadenylation signals and other regulatory elements or signals capable of affecting transcription, mRNA processing, and/or gene expression. PolyA tails are thought to play a role in mRNA stability and translation initiation. Examples of 3 'transcription termination molecules in the art are the nopaline synthase 3' region, the wheat hsp17 'region, the pea rubisco small subunit 3' region, the cotton E6 3 'region and the coixoxin 3' UTR.
The 3' UTR generally finds advantageous use in the recombinant expression of specific DNA molecules. A weak 3' utr may create readthrough, which may affect the expression of DNA molecules located in adjacent expression cassettes. Proper control of transcription termination may prevent read-through into downstream located DNA sequences (e.g., other expression cassettes), and may further allow efficient recycling of RNA polymerase to increase gene expression. Efficient termination of transcription (release of RNA polymerase II from DNA) is a prerequisite for re-initiation of transcription, thus directly affecting overall transcription levels. Upon termination of transcription, mature mRNA is released from the synthesis site and the template is transported into the cytoplasm. Eukaryotic mRNA accumulates in vivo in the form of poly (A), making it difficult to detect transcription termination sites by conventional methods. However, it is difficult to predict functional and effective 3'UTRs by bioinformatics methods, as there is no conserved DNA sequence that allows for easy prediction of effective 3' UTRs.
Modulation of gene function by 3'utrs is a relatively new area, since only recent sequencing technologies provide us with a panoramic view of 3' utrs across species and cell types. Detailed functional and mechanistic studies were performed on only a few 3' utr models before sequencing technology was available. While these model 3'utrs make a significant contribution to our understanding of 3' utr biology, conclusions regarding their regulatory function are limited and more focused on mRNA stability. Computer analysis of the genome-wide region (Mayr, christine (2017) Regulation by 3'-Untranslated regions. Annual Review of Genetics, 51:171-194) revealed that motifs in the 3' UTR were predominantly conserved on one strand, consistent with the role of 3'UTR in regulating gene expression at the posttranscriptional level (Xie, X. Et al, (2005) Systematic discovery of regulatory motifs in human promoters and 3'UTRs by comparison of several mammals,Nature 434:338-345). The 3' utr determines protein levels by regulating mRNA stability and translation mediated primarily by AU-rich elements and mirnas. The 3' UTR also achieves local translation by modulating mRNA localization. The length of the 3' UTR can be regulated by selective cleavage and polyadenylation. The 3' utr mediates protein-protein interactions (PPI), with a broad impact on protein complex formation, protein localization and protein function. The 3' UTR regulates gene expression by binding to RNA Binding Proteins (RBPs). RBP binds to the 3'utr cis element and mediates 3' utr function by recruiting effector proteins. The RBP cooperates with other bound RBPs to achieve in vivo functional specificity. The composition of RBPs that bind to the 3' utr at a given moment is dynamic and can vary depending on the local environment, for example, by adding post-translational modifications, local expression of other RBPs, and interactions with membrane and cytoskeletal wires. RBP binding is also affected by the formation of secondary and tertiary RNA structures that regulate accessibility of the 3'UTR (Mayr, christine (2017) Regulation by 3' -Untranslipated regions. Annual Review of Genetics, 51:171-194).
A series of adenosine bases are added to the 3' end of the RNA molecule to produce a poly (A) tail. This provides the mR NA with binding sites for a class of regulatory factors known as poly (a) binding proteins (PABPs), which play a role in the regulation of gene expression, including mRNA export, stability and decay, and translation. The 5' cap structure of the mRNA and the poly-A tail act synergistically to control mRNA translation. Binding of PABP to the poly (a) tail promotes interaction with eIF4F bound to the 5' cap structure, resulting in mRNA cyclization that promotes translation initiation and ensures ribosome recycling and efficient translation. This interaction also allows inhibition of translation by inhibitor proteins that bind to the 3' UTR (Barret, L et al (2012) Regulation of eukaryotic gene ex pression by the untranslated regions and other non-coding elements.cell.mol.Life Sci.69:3613-3634).
From a practical point of view, it is often beneficial that the 3' UTR used in the expression cassette has the following characteristics. First, the 3' UTR should be able to terminate transcription of the transgene efficiently and effectively and prevent the transcription from being read through into any adjacent DNA sequence that may consist of one expression cassette if the cassettes are located in another, or into adjacent chromosomal DNA in which the constructs are inserted. Second, the 3' UTR should not cause a decrease in transcriptional activity conferred by promoters, leader sequences, enhancers and introns used to drive expression of the DNA sequence. Finally, in plant biotechnology, the 3' utr is commonly used to initiate an amplification reaction of reverse transcribed RNA extracted from transformed plants, and to: (1) Once the expression cassette is integrated into the plant chromosome, its transcriptional activity or expression is assessed; (2) assessing the number of copies inserted in the plant DNA; and (3) evaluating zygosity of the resulting seeds after breeding. The 3' UTR is also used in amplification reactions of DNA extracted from transformed plants to characterize the integrity of the insert cassette. The 3' UTRs useful in the practice of the present invention are presented as SEQ ID NOs 12, 15, 19, 23, 24 and 59.
As used herein, the term "chimeric" refers to a single DNA molecule produced by fusing a first DNA molecule to a second DNA molecule, wherein neither the first nor the second DNA molecule is typically found in this configuration (i.e., fused to another). Thus, chimeric DNA molecules are novel DNA molecules that are not normally found in nature. As used herein, the term "chimeric promoter" refers to a promoter produced by such manipulation of a DNA molecule. Chimeric promoters may combine two or more DNA fragments, e.g., a promoter fused to an enhancer element. Thus, the invention includes the design, construction and use of chimeric promoters for regulating expression of operably linked transcribable DNA molecules according to the methods disclosed herein. The chimeric promoter used in the practice of the present invention is presented as SEQ ID NO. 61.
Chimeric regulatory elements can be designed to comprise various constituent elements that can be operably linked by various methods known in the art, such as restriction enzyme digestion and ligation, ligation-independent cloning, modular assembly of PCR products during amplification, or direct chemical synthesis of regulatory elements, among other methods known in the art. The resulting various chimeric regulatory elements may consist of the same constituent elements or variants of the same constituent elements, but differ in terms of DNA sequence or DNA sequences comprising linked DNA sequences or sequences allowing the operative linkage of the constituent parts. In the present invention, the DNA sequences as provided in SEQ ID NOS.1-26, 59-62 and 64-66 may provide regulatory element reference sequences, wherein the constituent elements comprising said reference sequences may be linked by methods known in the art and may comprise substitutions, deletions and/or insertions or mutations of one or more nucleotides naturally occurring in bacterial and plant cell transformation.
As used herein, the term "variant" refers to a second DNA molecule, such as a regulatory element, which is similar but not identical in composition to the first DNA molecule, and wherein the second DNA molecule still retains the general function of the first DNA molecule, i.e. the same or similar expression pattern, e.g. by more or less equivalent transcriptional activity. The variant may be a shorter or truncated form of the first DNA molecule, or a variant of the sequence of the first DNA molecule, such as a variant having different restriction enzyme sites and/or internal deletions, substitutions or insertions. A "variant" may also encompass regulatory elements having a nucleotide sequence comprising a substitution, deletion or insertion of one or more nucleotides of a reference sequence, wherein the derived regulatory element has more or less or equivalent transcriptional or translational activity than the corresponding parent regulatory molecule. Regulatory element "variants" will also include variants resulting from mutations that occur naturally in bacterial and plant cell transformation. In the present invention, the polynucleotide sequences provided as SEQ ID NOS.1-26, 59-62 and 64-66 can be used to produce variants that are similar in composition but not identical to the DNA sequence of the original regulatory element, but which still retain the general function (i.e., the same or similar expression pattern) of the original regulatory element. The creation of such variants of the invention is well within the ordinary skill in the art in light of this disclosure and is included within the scope of the invention.
As used herein, a "transcribable DNA sequence" is any DNA sequence that, when operably linked to a promoter, can be transcribed into RNA. A transcribed RNA molecule encoded by a transcribable DNA sequence operably linked to regulatory elements provided herein may be translated to produce a protein molecule or may provide an antisense or other functional or regulatory RNA molecule, such as a double-stranded hairpin RNA (dsRNA), a transfer RNA (tRNA), a ribosomal RNA (rRNA), a microrna (miRNA), a small interfering RNA (siRNA), etc.
As used herein, the term "protein expression" is any pattern of translation of a transcribed RNA molecule into a protein molecule. Protein expression is characterized by its temporal, spatial, developmental or morphological properties, as well as its quantitative or qualitative indications or expression patterns.
The efficacy of modifications, duplications or deletions described herein in terms of the desired expression of a particular transgene can be empirically tested in stable and transient plant assays (such as those described in the working examples herein), thereby validating the results, which can vary depending on the changes made in the starting DNA molecule and the purpose of the changes.
Constructs
As used herein, the term "construct" refers to any DNA molecule or vector, or segment or portion of a DNA molecule, vector or chromosome, that is derived from any one or more sources and that is capable of transfection or genomic integration, comprising at least two DNA sequences that are functionally operably linked to each other. For example, a construct may comprise two operably linked sequences, such as a regulatory element or promoter operably linked to a coding sequence or a transcribable DNA sequence. The construct may be a recombinant DNA construct. An example of a construct that is a linear recombinant DNA segment is T-DNA. As used herein, "vector" refers to a DNA molecule, such as a plasmid, cosmid, virus, phage, or other linear or circular DNA molecule, that may contain or comprise a construct of the present disclosure, and "DNA transformation vector" refers to any DNA molecule or vector that comprises a recombinant DNA construct that can be used for transformation purposes-i.e., for introducing the recombinant DNA molecule or construct into a host cell, such as a plant cell. According to some embodiments, the DNA transformation vector may comprise T-DNA fragments bounded by left and/or right border sequences, which may be used for bacterial-mediated transformation, such as rhizobia-mediated or agrobacterium-mediated transformation. Constructs typically include one or more expression cassettes, a gene coding sequence operably linked to one or more regulatory sequences such as a promoter, or a transcribable DNA sequence. As used herein, an "expression cassette" refers to a DNA sequence comprising at least one transcribable DNA sequence operably linked to one or more regulatory elements (typically at least one promoter and one 3' utr).
As used herein, the term "operably linked" refers to a functional relationship between two or more physically linked DNA sequences comprising a first and a second DNA sequence arranged such that the first DNA sequence affects the function or expression of the second DNA sequence. The two DNA sequences may or may not be part of a single contiguous DNA molecule, and may or may not be contiguous. For example, a promoter is operably linked to a transcribable DNA sequence if the promoter regulates the transcription of the transcribable DNA sequence of interest in a cell. For example, a leader sequence is operably linked to a transcribable DNA sequence when it is capable of affecting the transcription or translation of the DNA sequence.
In one embodiment, the constructs of the invention may be provided as dual tumor-inducing (Ti) plasmid border constructs having right (RB or agrtu. RB) and left border (LB or agrtu. LB) regions of Ti or Ri plasmids isolated from agrobacterium species (e.g., agrobacterium tumefaciens or agrobacterium rhizogenes) comprising T-DNA that allows integration of the T-DNA into the genome of a plant cell along with the transfer molecules provided by the agrobacterium cells (see, e.g., U.S. Pat. No. 6,603,061). Constructs may also contain a plasmid backbone DNA segment that provides replication functions and antibiotic selection in bacterial cells, e.g., an escherichia coli origin of replication, such as ori322; a broad host range origin of replication, such as oriV or oriRi (see, e.g., ye et al, transgenic Research (4): 773-86, 2011); and coding regions for selectable markers, such as Spec/Strp encoding Tn7 aminoglycoside adenylate transferase (aadA) which confers resistance to spectinomycin or streptomycin; or gentamicin (Gm, gent) selectable marker gene. For plant transformation, the host bacterial strain is typically Agrobacterium tumefaciens ABI, C58 or LBA4404, however, other strains known to those skilled in the art of plant transformation may also function in the present invention.
Methods for assembling constructs and introducing them into cells in such a way that a transcribable DNA molecule is transcribed into a functional mRNA molecule, which is translated and expressed as a protein, are known in the art. Conventional compositions and methods for making and using constructs and host cells are well known to those of skill in the art for practicing the present invention. Typical vectors for expressing nucleic acids in higher plants are well known in the art and include vectors derived from the Ti plasmid of Agrobacterium tumefaciens and pCaMVCN transfer control vectors.
Various regulatory elements may be included in the construct, including any of those provided herein. Any such regulatory element may be provided in combination with other regulatory elements. Such combinations may be designed or modified to produce desired regulatory features. In one embodiment, the construct of the invention comprises at least one regulatory element operably linked to a transcribable DNA molecule operably linked to a 3' utr.
Constructs of the invention may include any promoter or leader sequence provided herein or known in the art. For example, the promoters of the invention may be operably linked to a heterologous, non-translated 5' leader sequence, such as a leader sequence derived from a heat shock protein gene. Alternatively, the leader sequence of the present invention may be operably linked to a heterologous promoter, such as the cauliflower mosaic virus 35S transcript promoter.
The expression cassette may also include a transit peptide coding sequence encoding a peptide useful for subcellular targeting of an operably linked protein, particularly to a chloroplast, leucoplast, or other plastid organelle; mitochondria; a peroxisome; cavitation; or an extracellular location. Many chloroplast-localized proteins are expressed from nuclear genes in a precursor fashion and are targeted to the chloroplast by a Chloroplast Transit Peptide (CTP). Examples of such isolated chloroplast proteins include, but are not limited to, those proteins associated with the small subunit of ribulose-1, 5-bisphosphate carboxylase (SSU), ferredoxin oxidoreductase, light harvesting complex protein I and protein II, thioredoxin F, enolpyruvylshikimate phosphate synthase (EPSPS). Chloroplast transit peptides are described, for example, in U.S. patent No. 7,193,133. It has been demonstrated that non-chloroplast proteins can be targeted to chloroplasts by expression of heterologous CTPs operably linked to transgenes encoding the non-chloroplast proteins.
Transcribable DNA sequences
As used herein, the term "transcribable DNA sequence" refers to any DNA sequence capable of being transcribed into an RNA molecule, including, but not limited to, those having protein coding sequences and those that produce an RNA molecule having sequences for gene suppression. The type of DNA sequence may include, but is not limited to, a DNA sequence from the same plant, a DNA sequence from another plant, a DNA sequence from a different organism, or a synthetic DNA sequence, such as a DNA sequence containing antisense information to a gene, or a DNA sequence encoding a transgene in an artificial, synthetic, or other modified form. Exemplary transcribable DNA sequences for incorporation into constructs of the invention include, for example, DNA sequences or genes from a different species than the species into which the DNA sequences are incorporated, or genes derived from or present in the same species but integrated into recipient cells by genetic engineering methods other than classical breeding techniques.
By "transgene" is meant a transcribable DNA sequence that is heterologous to the host cell, at least in terms of its location in the host cell genome, and/or a transcribable DNA sequence that is artificially incorporated into the cell genome of the current or any previous generation host cell.
Regulatory elements, such as promoters of the invention, may be operably linked to a transcribable DNA sequence that is heterologous with respect to the regulatory element. As used herein, the term "heterologous" refers to a combination of two or more DNA molecules, where such a combination is not typically found in nature. For example, the two DNA molecules may be from different species and/or the two DNA molecules may be from different genes, e.g., different genes from the same species or the same gene from different species. Thus, if such a combination is not normally found in nature, the regulatory element is heterologous with respect to the operably linked transcribable DNA molecule, i.e., the sequence of the transcribable DNA molecule is not naturally operably linked to the regulatory element. By "heterologous transcribable DNA sequence" is meant that the transcribable DNA sequence is heterologous with respect to the polynucleotide sequence to which it is operably linked.
The transcribable DNA sequence may generally be any DNA sequence that requires expression of the transcript. Such expression of transcripts can lead to translation of the resulting mRNA molecules, resulting in protein expression. Alternatively, for example, a transcribable DNA sequence may be designed to ultimately result in reduced expression of a particular gene or protein. In one embodiment, this may be accomplished by using a transcribable DNA sequence oriented in the antisense orientation. The use of such antisense techniques is familiar to those of ordinary skill in the art. Any gene can be negatively regulated in this manner, and in one embodiment, the transcribable DNA sequence can be designed to inhibit a specific gene by expression of dsRNA, siRNA or miRNA molecules.
Thus, one embodiment of the invention is a recombinant DNA molecule comprising a regulatory element of the invention, such as those provided as SEQ ID NOS 1-26, 59-62 and 64-66, or fragments thereof, or a sequence or fragment thereof having at least 80% identity, at least 85% identity, at least 86% identity, at least 87% identity, at least 88% identity, at least 89% identity, at least 90% identity, at least 91% identity, at least 92% identity, at least 93% identity, at least 94% identity, at least 95% identity, at least 96% identity, at least 97% identity, at least 98% identity, at least 99% identity or 100% identity to any one of SEQ ID NOS 1-26, 59-62 and 64-66, so as to modulate transcription of said transcribable DNA sequence at a desired level or in a desired pattern upon integration of said construct in the genome of a transgenic plant cell. In one embodiment, the transcribable DNA sequence comprises a protein coding region of a gene, while in another embodiment, the transcribable DNA sequence comprises an antisense region of a gene or any other transcribable DNA sequence that causes inhibition of a specific target gene.
Genes of agronomic interest
The transcribable DNA sequence may be an agronomically advantageous gene. As used herein, the term "agronomically beneficial gene" or "agronomically beneficial transgene" refers to a transcribable DNA sequence that confers a desired characteristic or trait when expressed in a particular plant tissue, cell or cell type. The agronomically beneficial gene or transgene product may act in plants to affect plant morphology, physiology, growth, development, yield, grain composition, nutritional profile, disease or pest resistance and/or environmental or chemical tolerance, or may act as a pesticide in the diet of the plant-feeding pests. In one embodiment of the invention, the regulatory element of the invention is incorporated into a construct such that the regulatory element is operably linked to a transcribable DNA sequence, which is an agronomically beneficial gene or transgene. In transgenic plants containing such constructs, the expression of genes of agronomic interest may confer beneficial agronomic traits. Beneficial agronomic traits may include, for example, but are not limited to, herbicide tolerance, insect control, improved or increased yield, disease resistance, pathogen resistance, improved plant growth and development, improved starch content, improved oil content, improved fatty acid content, improved protein content, improved fruit maturity, enhanced animal and human nutrition, biopolymer production, environmental stress tolerance or resistance, pharmaceutical peptides, improved processing quality, improved flavor, hybrid seed production utility, improved fiber production, and ideal biofuel production.
Non-limiting examples of agronomically beneficial genes (or transgenes) known in the art include those directed against: herbicide resistance (U.S. patent No.; a first number; 6,248,876; increased yield (U.S. Pat. No. 5,633,435, and 5,463,175), increased yield (U.S. Pat. No. USRE; first; second; first; second; and first), insect control (U.S. Pat. No. 5; first; 6,620,988; first; second; third; second; first; second), second; third; second; fourth; fifth, fourth; fifth, fifth; fifth, fourth; fifth, first, second, fifth, fifth, improved fatty acid content (U.S. patent no; a first number; a first number; number, and number), high protein yield (U.S. patent number), fruit maturity (U.S. patent number), enhanced animal and human nutrition (U.S. patent number, and 6,171,640), biopolymers (U.S. patent USRE number, and number), environmental stress resistance (U.S. patent number), pharmaceutical and secretable peptides (U.S. patent number, 6,140,075, and number), improved processing traits (U.S. patent number), improved digestibility (U.S. patent number), low raffinose (U.S. patent number), industrial enzyme production (U.S. patent number), improved taste (U.S. patent number), nitrogen fixation (U.S. patent number), hybrid seed production (U.S. patent number 5,689,041), fiber yield (U.S. patent number; number, and number), and biofuel production (U.S. patent number).
Alternatively, agronomically advantageous genes or transgenes may affect the above plant characteristics or phenotypes by encoding RNA molecules that cause targeted modulation of gene expression of endogenous genes, for example by antisense (see, e.g., U.S. Pat. No. 5,107,065); inhibitory RNAs ("RNAi", including regulation of gene expression by miRNA-, siRNA-, trans-acting siRNA-, and staged sRNA-mediated mechanisms, e.g., as described in published applications u.s.2006/0200878 and u.s.2008/0066206, and U.S. patent application 11/974,469); or co-suppression mediated mechanisms. The RNA may also be a catalytic RNA molecule (e.g., a ribozyme or riboswitch; see, e.g., US 2006/0200878) engineered to cleave a desired endogenous mRNA product. Methods for constructing constructs and introducing them into cells in such a way that a transcribable DNA sequence is transcribed into an RNA molecule capable of causing gene suppression are known in the art.
Selectable markers
Selectable marker transgenes may also be used with the regulatory elements of the present invention. As used herein, the term "selectable marker transgene" refers to any transcribable DNA sequence whose expression or lack of expression in a transgenic plant, tissue or cell can be screened or scored in some manner. Selectable marker genes and their associated selection and screening techniques for practicing the present invention are known in the art and include, but are not limited to, transcribable DNA sequences encoding β -Glucuronidase (GUS), green Fluorescent Protein (GFP), proteins conferring antibiotic resistance, and proteins conferring herbicide resistance. Examples of selectable marker transgenes are provided as GOI-At.ShkG-CTP2+AGRtu. AroA-CP4.Nat:1 (SEQ ID NOs: 47) for selection of transformed plant cells by glyphosate selection, GOI-At.ShkG-CTP2+ec.aadA-SPC/STR:1:1 (SEQ ID NO: 54) for selection of transformed plant cells by spectinomycin selection, and GOI-ec.ui dA+St.LS1:3 (SEQ ID NO: 50) for the GUS reporter gene in the transgene expression cassette in the examples below, which are intended to be retained in the integration construct after automatic excision to demonstrate retention of the cassette and to determine zygosity.
Site-specific nucleases
As used herein, the term "genome editing" refers to modifying the genetic sequence of a target site in a DNA molecule or genome or chromosome of a living organism or cell (such as the genome of an agricultural crop) by deletion, substitution, and/or insertion of the DNA sequence at or near the target site (which may be generated using a site-specific nuclease). "site-specific integration" or "site-directed integration" refers to the term of inserting a DNA sequence or construct into the genome or chromosome of a living organism or cell at a target site. As used herein, the term "site-specific nuclease" refers to a DNA cleaving nuclease that produces a double strand break or nick at or near a particular target site or position of a DNA molecule, chromosome or genome. As used herein, a "target site" for genome editing refers to a location of a polynucleotide sequence within a plant genome that is bound and cleaved by a site-specific nuclease, thereby introducing a double-strand break (or single-strand nick) into the nucleic acid backbone of the polynucleotide sequence and/or its complementary DNA strand. After making the break or cleavage, the DNA repair mechanism of the cell can recognize and repair the break or cleavage via non-homologous end joining (NHEJ) or homology directed repair, and possibly introduce mutations and/or insertions at the target site, as understood in the art.
The site-specific nucleases provided herein may be selected from the group consisting of: zinc Finger Nucleases (ZFNs), meganucleases, RNA-guided endonucleases, such as CRISPR-associated nucleases, TALE endonucleases (TALENs), recombinases, transposases or any other endonuclease that is likely. See, e.g., khandagaale, K.et al, "Genome editing for targeted improvement in plants," Plant Biotechnol Rep 10:327-343 (2016); and Gaj, t et al, "ZFN, TALEN and CRISPR/Cas-based methodsfor genome engineering," Trends biotechnol.31 (7): 397-405 (2013), the contents and disclosure of which are incorporated herein by reference. The expression cassettes provided herein may encode a site-specific nuclease. Such an expression cassette may comprise a transcribable DNA sequence encoding a site-specific nuclease operably linked to a plant-expressible promoter. In another aspect, a recombinant DNA construct provided herein may comprise at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten expression cassettes encoding one or more site-specific nucleases.
According to embodiments of the present disclosure, the recombinase may be a serine recombinase linked to a DNA recognition motif, a tyrosine recombinase linked to a DNA recognition motif, or other recombinases known in the art. The recombinase or transposase may be a DNA transposase or a recombinase linked to a DNA binding domain. The tyrosine recombinase linked to the DNA recognition motif may be selected from the group consisting of: cre recombinase, flp recombinase and Tnp 1 recombinase. According to some embodiments, the Cre recombinase or Gin recombinase provided herein is tethered to a zinc finger DNA binding domain. In another embodiment, the serine recombinases provided herein linked to a DNA recognition motif are selected from the group consisting of: phiC31 integrase, R4 integrase and TP-901 integrase. In another embodiment, the DNA transposase provided herein that is linked to a DNA binding domain is selected from the group consisting of TALE-piggyBac and TALE-multiplexer.
According to embodiments of the present disclosure, the RNA guided endonuclease or CRISPR-associated nuclease may be selected from the group consisting of: non-limiting examples of Cas1, cas1B, cas2, cas3, cas4, cas5, cas6, cas7, cas8, cas9 (also known as Csn1 and Csx 12), cas10, csy1, csy2, csy3, cse1, cse2, csc1, csc2, csa5, csn2, csm3, csm4, csm5, csm6, cmr1, cmr3, cmr4, cmr5, cmr6, csb1, csb2, csb3, csx17, csx14, csx10, csx16, csaX, csx3, csx1, csx15, csf1, csf2, csf3, csf4, cpf1, casX, casY and their homologs or modified forms, argonaute (Argonaute proteins) include thermophilic bacteria (84) Argonaute (TtAgo), geobacillus (Pyrococcus furiosus) and other variants of the same, or other saline-alkaline and alkaline variants (according to the mangostin) of the mangrover 35, in another aspect, the site-specific nucleases provided herein are selected from the group consisting of Cas9 or Cpf1 enzymes.
For RNA-guided endonucleases or CRISPR-associated nucleases, a guide RNA (gRNA) molecule may be required to direct the endonuclease to a target site in a DNA molecule, chromosome or genome of a plant by base pairing or hybridization to cause DSBs or nicks at or near the target site. The gRNA may be transformed or introduced into a plant cell or tissue as a recombinant DNA construct (possibly together with a nuclease or a DNA construct encoding a nuclease) comprising a transcribable DNA sequence encoding a guide RNA operably linked to a plant-expressible promoter. As understood in the art, a "guide RNA" may include, for example, CRISPR RNA (crRNA), single stranded guide RNA (sgRNA) or any other RNA molecule that can direct or direct an endonuclease to a particular target site in the genome. A "single stranded guide RNA" (or "sgRNA") is an RNA molecule comprising crRNA covalently linked to tracrRNA via a linker sequence, which can be expressed as a single RNA transcript or molecule. The guide RNA comprises a guide or targeting sequence that is identical or complementary to a target site within a DNA molecule, chromosome, or plant genome. Protospacer Adjacent Motifs (PAMs) may be present in the genome immediately 5 'to and upstream of the genomic target site sequence complementary to the targeting sequence of the guide RNA-i.e., immediately downstream (3') of the sense (+) strand of the genomic target site (relative to the targeting sequence of the guide RNA), as is known in the art. See, e.g., wu, x. Et al, "Target specificity of the CRISPR-Cas9 system," Quant biol.2 (2): 59-70 (2014), the contents and disclosure of which are incorporated herein by reference. Genomic PAM sequences on the sense (+) strand adjacent to the target site (relative to the targeting sequence of the guide RNA) may comprise 5'-NGG-3'. However, the corresponding sequence of the guide RNA (i.e., immediately downstream (3') of the targeting sequence of the guide RNA) may not generally be complementary to the genomic PAM sequence. The guide RNA may typically be a non-coding RNA molecule that does not code for a protein. The guide sequence of the guide RNA can be at least 10 nucleotides in length, such as 12-40 nucleotides, 12-30 nucleotides, 12-20 nucleotides, 12-35 nucleotides, 12-30 nucleotides, 15-30 nucleotides, 17-30 nucleotides, or 17-25 nucleotides in length, or about 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or more nucleotides in length. The guide sequence may have at least 95%, at least 96%, at least 97%, at least 99% or 100% identity or complementarity to at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25 or more consecutive nucleotides of the DNA sequence at the target site. The expression cassettes provided herein can encode guide RNAs. Such an expression cassette may comprise a transcribable DNA sequence encoding a guide RNA operably linked to a plant-expressible promoter. In another aspect, a recombinant DNA construct provided herein may comprise at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten expression cassettes encoding one or more guide RNAs.
Zinc Finger Nucleases (ZFNs) are synthetic proteins consisting of an engineered zinc finger DNA binding domain fused to a cleavage domain (or cleavage half-domain), which can be derived from a restriction endonuclease (e.g., fok 1). The DNA binding domain may be canonical (C2H 2) or atypical (e.g., C3H or C4). Depending on the target site, the DNA binding domain may comprise one or more zinc fingers (e.g., 2, 3, 4, 5, 6, 7, 8, 9, or more zinc fingers). Multiple zinc fingers in a DNA binding domain may be separated by one or more linker sequences. ZFNs can be designed to cleave almost any segment of double stranded DNA by modification of the zinc finger DNA binding domain. ZFNs form dimers from monomers that include a non-specific DNA cleavage domain (e.g., derived from fokl nuclease) fused to a DNA binding domain that includes a zinc finger array engineered to bind to a target site DNA sequence. The DNA-binding domain of ZFNs can typically consist of 3-4 (or more) zinc fingers. Amino acids at the-1, +2, +3, and +6 positions relative to the start of the zinc finger alpha-helix that promotes site-specific binding to the target site can be varied and tailored to suit the particular target sequence. Other amino acids may form a consensus backbone to generate ZFNs with different sequence specificities. Methods and rules for designing ZFNs that target and bind to specific target sequences are known in the art. See, for example, U.S. patent application nos. 2005/0064474, 2009/011717, and 2012/0142062, the contents and disclosures of which are incorporated herein by reference. The Fok1 nuclease domain may need to dimerize in order to cleave DNA, and thus two ZFNs with their C-terminal regions are required to bind opposite DNA strands (5-7 bp apart) of the cleavage site. If the double ZF binding site is palindromic, ZFN monomers can cleave the target site. As used herein, ZFNs are broad and include monomeric ZFNs that can cleave double-stranded DNA without the aid of another ZFN. The term ZFN may also be used to refer to one or both members of a pair of ZFNs engineered to co-act to cleave DNA at the same site.
Without being limited by any scientific theory, because the DNA binding specificity of the zinc finger domain can be re-engineered using one of a variety of methods, custom ZFNs can in theory be constructed to target virtually any target sequence (e.g., at or near the GA oxidase gene in the plant genome). Publicly available methods for engineering zinc finger domains include Context-dependent assembly (Context-dependent Assembly (CoDA)), oligomerization pool engineering (Oligomerized Pool Engineering (OPEN)), and modular assembly. In one aspect, the methods and/or compositions provided herein comprise one or more, two or more, three or more, four or more, or five or more ZFNs. In another aspect, ZFNs provided herein are capable of generating targeted DSBs or nicks. In one aspect, a vector comprising a polynucleotide encoding one or more, two or more, three or more, four or more, or five or more ZFNs is provided to a cell by transformation methods known in the art (e.g., without limitation, viral transfection, particle bombardment, PEG-mediated protoplast transfection, or agrobacterium-mediated transformation). ZFNs can be introduced as ZFN proteins, as polynucleotides encoding ZFN proteins, and/or as a combination of proteins and polynucleotides encoding proteins.
Meganucleases, such as the LAGLIDADG family of homing endonucleases, typically identified in microorganisms, are unique enzymes with high activity and long recognition sequences (> 14 bp) that lead to site-specific digestion of target DNA. Engineered forms of naturally occurring meganucleases typically have an extended DNA recognition sequence (e.g., 14 to 40 bp). According to some embodiments, the meganuclease may comprise a scaffold or base enzyme selected from the group consisting of: I-CreI, I-CeuI, I-MsoI, I-SeeI, I-AniI and I-Dmo I. The engineering of meganucleases can be more challenging than ZFNs and TALENs because the DNA recognition and cleavage functions of meganucleases are interleaved together in a single domain. Specific mutagenesis and high throughput screening methods have been used to create novel meganuclease variants that recognize unique sequences and have improved nuclease activity. Thus, meganucleases can be selected or engineered to bind genomic target sequences in plants. In one aspect, the methods and/or compositions provided herein comprise one or more, two or more, three or more, four or more, or five or more meganucleases. In another aspect, meganucleases can generate targeted nicks or breaks.
Zinc Finger Nucleases (ZFNs) and TAL effector nucleases (TALENs) are chimeric enzymes that combine nucleases and DNA binding domains. TALENs are a class of sequence-specific nucleases that can be used to create double strand breaks at specific target sequences in the genome of a plant or other organism. TALENs are restriction enzymes produced by fusing a transcription activator-like effector (TALE) DNA binding domain with a nuclease domain (e.g., fokl). When each member of the TALEN pair binds to a DNA site flanking the target site, the fokl monomers dimerize and cause double-stranded DNA breaks at the target site. In addition to the wild-type fokl cleavage domain, variants with mutated fokl cleavage domains have been designed to improve cleavage specificity and cleavage activity. The fokl domain functions as a dimer, requiring two constructs with unique DNA binding domains for sites in the target genome with the proper orientation and spacing. The number of amino acid residues between the TALEN DNA binding domain and the fokl cleavage domain, as well as the number of bases between two separate TALEN binding sites, are parameters to achieve high activity levels. TALENs are artificial restriction enzymes produced by fusing a transcription activator-like effector (TALE) DNA binding domain to a nuclease domain. In some aspects, the nuclease is selected from the group consisting of: pvuII, mutH, tevI, fokI, alwI, mlyI, sbfI, sdaI, stsI, cleDORF, clo051 and Pept071. When each member of the TALEN pair binds to a DNA site flanking the target site, the fokl monomers dimerize and cause double-stranded DNA breaks at the target site. As used herein, the term TALEN is broad and includes monomeric TALENs that can cleave double-stranded DNA without the aid of another TALEN. The term TALEN also refers to one or both members of a pair of TALENs that co-act at the same site to cleave DNA.
Transcription activator-like effectors (TALEs) can be engineered to bind virtually any DNA sequence at or near the genomic locus of a GA oxidase gene, such as in a plant. TALE has a central DNA binding domain consisting of 13-28 repeat monomers of 33-34 amino acids. The amino acids of each monomer are highly conserved except for the hypervariable amino acid residues at positions 12 and 13. The DNA binding domain of TAL effectors may contain 33-35 amino acid sequence repeats, including repeated variable double Residues (RVDs) at residues 12 and 13, which determine their specificity in DNA binding. Each repeat binds to a specific nucleotide that facilitates engineering of a specific DNA binding domain by selecting a combination of repeat segments containing the appropriate RVDs. The number of repeats of the RVD sequence determines the length and sequence of the target sequence to be identified (Podevin et al, (2013) Trends in Biotechnology31 (6): 375-383). Two variable amino acids are referred to as repeated variable double Residues (RVD). Amino acid pairs of RVDs preferentially recognize certain nucleotide bases, and modulation of RVDs can recognize consecutive DNA bases. This simple relationship between amino acid sequence and DNA recognition allows for engineering of a particular DNA binding domain by selecting a combination of repeat segments containing the appropriate RVDs.
In addition to the wild-type fokl cleavage domain, variants with mutated fokl cleavage domains have been designed to improve cleavage specificity and cleavage activity. The fokl domain functions as a dimer, requiring two constructs with unique DNA binding domains for sites in the target genome with the proper orientation and spacing. The number of amino acid residues between the TALEN DNA binding domain and the fokl cleavage domain, as well as the number of bases between two separate TALEN binding sites, are parameters to achieve high activity levels. PvuII MutH and TevI cleavage domains are useful alternatives to FokI and FokI variants for use with TALE. When coupled to TALE, pvuII functions as a highly specific cleavage domain (see Yank et al 2013.PLoS One.8:e82539). MutH is capable of introducing strand-specific nicks in DNA (see Gabsalilow et al, 2013.Nucleic Acids Research.41:e83). TevI introduces a double strand break in DNA at the targeting site (see beudeley et al, 2013.Nature Comm.4:1762). The relationship between the amino acid sequence and DNA recognition of the TALE binding domain allows for a protein to be designed. The TALE construct may be designed using a software program such as DNA Works. Other methods of designing TALE constructs are known to those skilled in the art. See Doyle et al, nucleic Acids Research (2012) 40:wl 17-122; cermak et al, nucleic Acids Research (2011) 39:e82; and tale-nt.cac.comell.edu/about. In one aspect, the recombinant DNA constructs provided herein comprise one or more, two or more, three or more, four or more, or five or more TALENs. In another aspect, a TALEN provided herein is capable of generating a targeted nick or break at a target site.
A Zinc Finger Nuclease (ZFN) comprises a zinc finger DNA binding domain and a double cleavage inducing domain. Recognition site specificity is conferred by zinc finger domains, which may contain two, three, or four zinc fingers, e.g., having a C2H2 structure, although other zinc finger structures are known and have been engineered. The zinc finger domain is suitable for use in designing polypeptides that specifically bind to selected polynucleotide recognition sequences. ZFNs consist of engineered DNA-binding zinc finger domains linked to non-specific endonuclease domains (e.g., nuclease domains from type IIs endonucleases, such as fokl). Other functionalities may be fused to the zinc finger binding domain, including transcriptional activator domains, transcriptional repressor domains, and methylases. In some examples, dimerization of nuclease domains is essential for cleavage activity. Each zinc finger recognizes three consecutive base pairs in the target DNA. For example, the three-finger domain recognizes a sequence of nine consecutive nucleotides with dimerization requirements of nucleases, using two sets of zinc finger triplets to bind to an eighteen nucleotide recognition sequence (Gaj et al, (2013) Trends Biotechnology,31 (7): 397-405; and Urnov et al, (2010) Nature Reviews Genetics,11: 636-646).
Cell transformation
The invention also relates to a method of producing transformed cells and plants comprising one or more regulatory elements operably linked to a transcribable DNA sequence.
The term "transformation" refers to the introduction of a DNA molecule into a recipient host. As used herein, the term "host" refers to a bacterium, fungus, or plant, including any cell, tissue, organ, or progeny of a bacterium, fungus, or plant. Plant tissues and cells of particular interest include protoplasts, calli, roots, tubers, seeds, stems, leaves, seedlings, embryos and pollen.
As used herein, the term "transformed" refers to a cell, tissue, organ or organism into which a foreign DNA molecule (such as a construct) has been introduced. The introduced DNA molecule may be integrated into the genomic DNA of a recipient cell, tissue, organ or organism such that the introduced DNA molecule is inherited by subsequent offspring. "transgenic" or "transformed" cells or organisms may also include progeny of a cell or organism, as well as progeny resulting from a breeding program that uses such transgenic organisms as parents for hybridization and that exhibits an altered phenotype due to the presence of foreign DNA. The introduced DNA molecule may also be transiently introduced into the recipient cell such that the introduced DNA molecule is not inherited by subsequent offspring. The term "transgenic" refers to a bacterium, fungus, or plant that contains one or more heterologous DNA molecules.
There are many methods known to those skilled in the art for introducing DNA molecules into plant cells and transforming plant cells. The process generally includes the steps of: selecting a suitable host cell or explant, transforming said cell or explant with a molecule or vector, and obtaining a transformed cell. In the practice of the present invention, the methods and materials for transforming plant cells by introducing plant constructs into the plant genome may include any well known and proven methods. Suitable methods include, but are not limited to, bacterial infection (e.g., agrobacterium), binary BAC vectors, direct delivery of DNA (e.g., through PEG-mediated transformation, drying/inhibition-mediated DNA uptake, electroporation, agitation by silicon carbide fibers and acceleration by DNA coated particles), gene editing (e.g., CRISPR-Cas system), and the like. According to certain embodiments, the transformation method comprises agrobacterium-or rhizobium-mediated transformation or particle bombardment or particle-mediated transformation.
The host cell may be any cell or organism, such as a plant cell, an algal cell, an algae, a fungal cell, a fungus, a bacterial cell or an insect cell. In particular embodiments, host cells and transformed cells may include cells from crop plants.
Transgenic plants can then be regenerated from the transgenic plant cells of the invention. Seeds may be produced from such transgenic plants using conventional breeding techniques or self-pollination. Such seeds and progeny plants grown from such seeds will contain the recombinant DNA molecules of the invention and will therefore be transgenic.
The transgenic plants of the invention may be self-pollinated to provide seeds for homozygous transgenic plants of the invention (homozygous for the recombinant DNA molecule) or may be crossed with non-transgenic plants or different transgenic plants to provide seeds for heterozygous transgenic plants of the invention (heterozygous for the recombinant DNA molecule). Such homozygous and heterozygous transgenic plants are referred to herein as "progeny plants". The progeny plant is a transgenic plant that is inherited from the original transgenic plant and contains the recombinant DNA molecules of the invention. Seeds produced using the transgenic plants of the invention can be harvested and used to culture the generations of the transgenic plants, i.e., the progeny plants of the invention, which comprise the constructs of the invention and express genes of agronomic interest. Descriptions of breeding methods commonly used for different crops can be found in one of several references, see, for example, al lard, principles of Plant Breeding, john Wiley & Sons, NY, u.of CA, davis, CA,50-98 (1960); simmonds, principles of Crop Improveme nt, longman, inc, NY,369-399 (1979); sneep and Hendriksen, plant breeding Perspectives, wageningen (edit), center for Agricultural Publis hing and Documentation (1979); fehr, soybeans: improvement, product ion and Uses, 2 nd edition, monograph,16:249 (1987); fehr, principles of Variety Development, theory and Technique, (volume 1) and Crop Species Soybean (volume 2), iowa State uni, macmillan pub.co., NY,360-376 (1987).
The transformed plant may be analyzed for the presence of the gene or genes of interest and the level and/or profile of expression conferred by the regulatory elements of the invention. Those skilled in the art will be aware of numerous methods that can be used to analyze transformed plants. For example, methods for plant analysis include, but are not limited to, southern or Northern blotting, PCR-based methods, biochemical analysis, phenotypic screening methods, field evaluation, and immunodiagnostic assays. Can be used as described by the manufacturer(Applied Biosystems, foster City, calif.) reagents and methods to measure expression of transcribable DNA sequences and use +.>The Testing Matrix determines the number of PCR cycles. Alternatively, +.>(Third Wave Technologies, madison, wis.) reagents and methods to assess transgene expression.
The invention also provides parts of the plants of the invention. Plant parts include, but are not limited to, leaves, stems, roots, tubers, seeds, endosperm, ovules, and pollen. Plant parts of the invention may be viable, non-viable, renewable and/or non-renewable. The invention also includes and provides transformed plant cells comprising the DNA molecules of the invention. Transformed or transgenic plant cells of the invention include regenerable and/or non-regenerable plant cells.
The invention also provides commercial products produced from transgenic plants or parts thereof comprising the recombinant DNA molecules of the invention. The commercial product of the present invention contains a detectable amount of DNA comprising a DNA sequence selected from the group consisting of: SEQ ID NOS 1-26, 59-62 and 64-66. As used herein, "commodity product" refers to any composition or product that consists of material derived from a transgenic plant, seed, plant cell, or plant part that contains a recombinant DNA molecule of the invention. Commercial products include, but are not limited to, processed seeds, kernels, plant parts, and meal. The commercial product of the present invention will contain a detectable amount of DNA corresponding to the recombinant DNA molecule of the present invention. Detection of one or more such DNA in a sample may be used to determine the content or source of a commodity product. Any standard method of DNA molecule detection may be used, including the detection methods disclosed herein.
The invention may be understood more readily by reference to the following examples, which are provided by way of illustration and are not intended to limit the invention unless otherwise indicated. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the invention. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention, and therefore all matter set forth is to be interpreted as illustrative and not in a limiting sense.
Examples
Example 1
Identification of regulatory elements capable of driving automatic excision in crop plants
This example provides regulatory elements identified through years of experimentation that can drive efficient automatic excision in transgenic corn, soybean and cotton.
By combining literature searches with public and proprietary database searches, regulatory elements that are likely to drive efficient automatic excision in transgenic crop plants were first identified. Effective automatic excision has been determined using the Cre/Lox recombinase system on seventy soybean, twenty cotton and one hundred maize binary transformation vector constructs containing different regulatory elements and combinations. From these studies, a small number of regulatory elements that provide effective automatic excision were identified and presented in table 1 below.
Table 1. Regulatory elements providing efficient automatic excision in crop plants.
/>
Example 2
Automatic excision in maize and Rice CDC45-1 promoters driven stably transformed maize plants
Maize plants are transformed with recombinant DNA molecules, particularly plant transformation constructs comprising different regulatory elements that drive Cre-recombinase expression, to assess the ability and efficiency of automatic excision of Cre-recombinase driven Cre-recombinase expression cassettes and marker gene expression cassettes expressed under the control of the different regulatory elements.
Transformation of maize plants with binary plant transformation constructs comprising the following three transgene expression cassettes: a selectable marker gene expression cassette and a Cre-recombinase expression cassette, both flanked by two LoxP sites (RS-P1. Lox1:1, SEQ ID NO: 44), and a third expression cassette located outside the LoxP sites, which expresses a beta-Glucuronidase (GUS) transgene. The Cre-recombinase expression cassettes were used to determine the different EXPs to test their ability to drive efficient automatic excision of the Cre-recombinase expression cassette and the marker gene expression cassette located between LoxP sites. The Cre-recombinase expression cassette consists of the following elements: the EXP to be tested is operably linked to 5 'of a synthetic coding sequence encoding Cre-recombinase (e.g., a codon redesigned for expression in plant cells) (GOI-P1. Cre-St. LS1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number: X04753) operably linked to 5' of a 3 'termination or 3' UTR (T-AGRtu. Nos:13, SEQ ID NO: 48). Each plant transformation construct contained one of two different marker gene expression cassettes, designated "marker-1" and "marker-2". The marker-1 marker gene expression cassette comprises the following elements: constitutive EXP (EXP-Os. Act1:1:1 (SEQ ID NO: 46)), operably linked to 5' of a synthetic coding sequence encoding a plastid-targeted CP4 coding sequence (GOI-At. ShkG-CTP2+AGRtu. AroA-CP4.Nat:1, SEQ ID NO: 47) that provides tolerance to the herbicide glyphosate, operably linked to 5' of a 3' termination region T-AGRtu. Nos. 13 (SEQ ID NO: 48). The marker-2 gene expression cassette comprises the following elements: constitutive EXP (EXP-Os. TubA-3:1 (SEQ ID NO: 52)), operably linked to 5 'of a synthetic coding sequence encoding a plastid-targeted CP4 coding sequence (GOI-At. ShkG-CTP2+AGRtu. AroA-CP4.Nat:1, SEQ ID NO: 47) that provides tolerance to the herbicide glyphosate, operably linked to a 3' termination region
T-AGRtu. Nos. 13 (SEQ ID NO: 48).
The marker gene cassette and the Cre-recombinase cassette are flanked by two loxP Cre-recombinase recognition sequences (RS-P1. Lox1:1, SEQ ID NO: 44) in the same head-to-tail direction. If automatic excision is effective, expression of Cre-recombinase in the transformed plant cell is expected to result in excision of both cassettes. The GUS expression cassette was cloned outside the LoxP Cre-recombinase recognition sequence and contains the following elements: EXP (EXP-Os.Act1+CaMV.35S.2xA1-B3+Ta.Lhcb1:1:1, SEQ ID NO: 49) comprising chimeric promoter and leader sequence is operably linked to 5' of a synthetic coding sequence encoding β -glucuronidase (GOI-ec.uidA+St.LS1:3, SEQ ID NO: 50) containing a processable intron derived from potato light-induced tissue specific ST-LS1 gene (GenBank accession number: X04753) operably linked to 5' of a 3' termination region (T-St.Pis 4-1:4:1,SEQ ID NO:51).
Three EXPs derived from corn and rice CDC45 gene homologs were operably linked to a transcribable DNA sequence encoding Cre recombinase and their ability to drive automatic excision in stably transformed corn plants was determined. EXP-Zm.Cdc45-1+Zm.DnaK:1:1 (SEQ ID NO: 1) comprises a promoter (P-Zm.Cdc45-1:8, SEQ ID NO: 2) operably linked to 5 'of a synthetic leader sequence (L-Zm.Cdc45-1:1, SEQ ID NO: 3) operably linked to 5' of an intron (I-Zm.DnaK: 1, SEQ ID NO: 39). EX P-Os.Cdc45-1:1:1 (SEQ ID NO: 4) comprises a promoter (P-Os.Cdc45-1-1:1:1,SEQ ID NO:5) operably linked to 5' of the leader sequence (L-Os.Cdc45-1-1:1:1,SEQ ID NO:6). EXP-Zm.Cdc45-2+Zm.DnaK:1:2 (SEQ ID NO: 27) comprises a promoter (P-Zm.Cdc45-2-1:1:3,SEQ ID NO:28) operably linked to 5 'of a leader sequence (L-Zm.Cdc45-2-1:1:1,SEQ ID NO:29) operably linked to 5' of an intron (I-Zm.DnaK: 1, SEQ ID NO: 39).
Two additional EXPs were also operably linked to a transcribable DN a sequence encoding Cre recombinase and their ability to drive automatic excision in stably transformed maize plants was determined. EXP-Zm.Zm13:2 (SEQ ID NO: 30) from the germline preference gene expressed in pollen comprises a promoter (P-Zm.Zm13:2, SEQ ID NO: 31) operably linked to 5' of the leader sequence (L-Zm.Zm13:2, SEQ ID NO: 32). EXP-Zm.wall+Zm.DnaK from endosperm preference gene 1:5 (SEQ ID NO: 33) comprises a promoter (P-Zm.wall-1:1:9,SEQ ID NO:34) operably linked to 5 'of a leader sequence (L-Zm.wall-1:1:1,SEQ ID NO:35) operably linked to 5' of an intron (I-Zm.DnaK: 1, SEQ ID NO: 39).
Transformation of maize plant cells by agrobacterium-mediated transformation using the binary plant transformation construct described above, wherein each of the EXPs described above to be tested is operably linked to a transcribable DNA sequence encoding a Cre recombinase. Methods for agrobacterium-mediated transformation are well known in the art. The resulting transformed plant cells are regenerated into maize plants under glyphosate selection.
Selecting R with single copy event 0 Plants and allowed to self-pollinate. Then use TAQMAnalysis of the obtained R by assay 1 The presence of Cre, CP4 and GUS transgenes in seeds. Also using GUS transgene +.>Determination of R by assay 1 Zygosity of seeds for the integration construct. According to each selected R 0 Event selfing, forty-one R was determined 1 Seed. Those producing at least two R's that are homozygous for GUS and lack Cre and CP4 transgenes 1 The event of the seed is believed to successfully automatically cut off the Cre and CP4 transgene expression cassettes. Table 2 below shows the results for each EXP. Data corresponding to two different CP4 marker transgene cassettes (marker-1 and marker-2) are also presented in table 2. Only one of the CP4 transgene cassettes was used for EXP-Zm.Zm13:2 (marker-1) and EXP-Zm.wall+Zm.DnaK:1:5 (marker-2). In Table 2, the column "percent of homozygous event/analyzed 41 seeds" indicates the production of at least two R 1 Percentage of events for seeds that are homozygous for GUS and lack Cre and CP4 transgenes. In brackets, the first digit indicates that at least two R particles are produced 1 R of seed 0 Number of events, the seed was homozygous for GUS and lacks the Cre and CP4 transgenes, the second number represents R where each event analyzed 41 seeds 0 Number of events.
TABLE 2 demonstration of efficient autoproteological R in stably transformed maize plants with maize and Rice CDC45-1 promoter driven autoproteological excision 0 Percentage of events.
As can be seen from Table 2 above, the promoters contained within EXP-Zm.Cdc45-1+Zm.DnaK:1:1 (P-Zm.Cdc45-1:8, SEQ ID NO: 2) and EXP-Os.Cdc45-1:1:1 (P-Os.Cdc45-1-1:1:1,SEQ ID NO:5) were very effective in driving automatic excision in stably transformed maize plants. Promoters contained within EXP-zm.Cdc45-2+zm.DnaK:1:2 (P-zm.Cdc45-2-1:1:3,SEQ ID NO:28), EXP-zm.zm13:2 (P-zm.zm13:2, SEQ ID NO: 31) and EXP-zm.wall+zm.DnaK:1:5 (P-zm.wall-1:1:9,SEQ ID NO:34) were unable to drive automatic excision in stably transformed maize plants in this experiment.
Example 3
Arabidopsis CDC45 promoter and other germ line preferred promoters drive automatic excision in stably transformed soybean plants
Soybean plants were transformed with recombinant DNA molecules, particularly plant transformation constructs comprising different regulatory elements that drive Cre-recombinase expression, to assess the ability and efficiency of Cre-recombinase expression cassettes and marker gene expression cassettes expressed under the control of the different regulatory elements to automatically excise.
Transforming a soybean plant with a binary plant transformation construct comprising the following four transgene expression cassettes: generating a selectable marker gene expression cassette (crtB) for the color phenotype; cre-recombinase expression cassette and a selectable marker expression cassette (aadA) for selecting transformed soybean cells flanked by two loxP sites (RS-P1. Lox. TATA-R9-1:1:1,SEQ ID NO:45); and a fourth expression cassette located outside the LoxP site, which expresses a β -Glucuronidase (GUS) transgene. The Cre-recombinase expression cassettes were used to determine the different EXPs to test their ability to drive efficient automatic excision of crtB, cre-recombinase and marker gene expression cassettes all located between LoxP sites. The Cre-recombinase expression cassette consists of the following elements: EXP to be tested, operatively linked to 5' of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St. LS1:1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number: X04753), operatively linked to 5' of a 3' termination region (T-AGRtu. Nos:13, SEQ ID NO: 48). The color marker gene expression cassette comprises a seed-preferred promoter and leader sequence (P-Br. Snap2-1:1:20,SEQ ID NO:56) operably linked to 5' of a synthetic coding sequence encoding a chloroplast-targeted (TS-Ps. RbcS-3C-1:3:1,SEQ ID NO:57) phytoene synthase (crtB) (CR-PANag. CrtB. Nno-1:4:1,SEQ ID NO:58) operably linked to 5' of a 3' termination region (T-Br. Snap2-1:3:6,SEQ ID NO:59). Expression of the crtB gene in seeds produced orange colored seeds and was an indication that the transgene cassette remained between LoxP sites due to failure to excise the cassette after automatic excision. The transformation aadA selection marker cassette consists of the following elements: EXP (EXP-at. Act7:2, SEQ ID NO: 53) operably linked to 5' encoding a chloroplast-targeted Tn7 adenylyl transferase coding sequence (GOI-at. ShkG-CTP2+ec. AadA-SPC/STR:1:1, SEQ ID NO: 54) conferring resistance to spectinomycin and used to select transformed plant cells operably linked to 5' of a 3' termination region (T-AGRtu. Nos:13, SEQ ID NO: 48). The GUS transgene expression cassette comprises an enhanced cauliflower mosaic virus 35S promoter and leader sequence (EXP-CaMV. 35S-enh:1:2, SEQ ID NO: 55) operably linked to 5' of a synthetic coding sequence encoding β -glucuronidase (GOI-ec. UidA+St. LS1:3, SEQ ID NO: 50) containing a processable intron derived from the potato light-induced tissue specific ST-LS1 gene (GenBank accession number: X04753), operably linked to 5' of a 3' termination region (T-AGRtu. Nos:13, SEQ ID NO: 48).
Seven EXPs derived from arabidopsis genes with germ line preferred expression were operably linked to a transcribable DNA sequence encoding Cre recombinase and their ability to drive automatic excision in stably transformed soybean plants was determined. See table 3 below. Each Cre-recombinase expression cassette also comprises a native 3' termination region (also presented in Table 3) corresponding to the gene from which the EXP is derived. Two of EXPs, EXP-at.Swi1 (SEQ ID NO: 13) and EXP-at.Swi1s (SEQ ID NO: 14), are variants of different lengths derived from the same germline bias gene and comprise a promoter operably linked to a leader sequence. The 3' UTR, i.e.T-At.Swi1-1:2:1 (SEQ ID NO: 14) was used in both EXP-At.Swi1 expression cassettes. Likewise, the other two EXPs, EXP-Syn1 (SEQ ID NO: 36) and EXP-Syn1a (SEQ ID NO: 37), are variants of different lengths derived from the same germline bias gene and comprise a promoter operably linked to a leader sequence. The 3' UTR, i.e.T-At.Syn1-1:2:1 (SEQ ID NO: 38) was used in both EXP-At.Syn1 expression cassettes.
Transformation of soybean plant cells by agrobacterium-mediated transformation using the binary plant transformation construct described above, wherein each of the EXPs described above to be tested is operably linked to a transcribable DNA sequence encoding a Cre recombinase. Methods for agrobacterium-mediated transformation are well known in the art. The resulting transformed plant cells are regenerated into soybean plants under selection of spectinomycin.
Selecting R with single copy event 0 Plants and allowed to self-pollinate. And then useAnalysis of the obtained R by assay 1 Presence of GUS transgene in seeds. Successful auto excision of Cre, aadA and crtB expression cassettes between LoxP sites was deduced from the lack of orange color conferred to seeds by the retained crtB expression cassette. Table 3 below shows the results for each EXP/3' UTR combination.
TABLE 3 production of marker-free R in stably transformed soybean plants 1 R of seed 0 Percentage of events.
/>
As can be seen from Table 3 above, in this experimentAll EXPs (except EXP-Syn1 and EXP-Syn1 a) were able to drive automatic excision in stably transformed soybean plants. The combination of EXP and 3' UTR derived from the Arabidopsis CDC45 gene (EXP-at. Cdc45:1:1, SEQ ID NO:9 and T-at. Cdc45:1, SEQ ID NO: 12) provided the highest percentage of production of marker-free R 1 R of seed 0 An event. Two variants EXP derived from the Arabidopsis Swi1 gene (EXP-At.Swi1, SEQ ID NO:13 and EXP-At.Swi1a, SEQ ID NO: 14) and the same corresponding 3' UTR (T-At.Swi1-1:2:1,SEQ ID NO:15) also provided a high percentage of the generation of marker-free R 1 R of seed 0 An event.
Example 4
Soybean Rsp-1 promoter drives automatic excision in stably transformed soybean plants
Soybean plants are transformed with constructs, particularly plant binary transformation constructs comprising test regulatory elements that drive Cre-recombinase expression, and used to assess the ability and efficiency of automatic excision of Cre-recombinase expression cassettes and marker gene expression cassettes.
Transforming a soybean plant with a binary plant transformation construct comprising the following three transgene expression cassettes: cre-recombinase expression cassette and a selectable marker expression cassette (aadA) for selecting transformed soybean cells flanked by two loxP sites (RS-P1. Lox1:1, SEQ ID NO: 44); and a third expression cassette outside the LoxP site for expressing a beta-Glucuronidase (GUS) transgene. Cre-recombinase expression cassettes are used to determine their ability of different EXPs to drive efficient automatic excision of Cre-recombinase expression cassettes and marker gene expression cassettes. The Cre-recombinase expression cassette consists of the following elements: test EXP operably linked to 5' of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St. LS1:1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number X04753) operably linked to 5' of a 3' termination region (T-At. Cd c45:1, SEQ ID NO: 12). The transformation selection marker cassette aadA consists of the following elements: EXP (EXP-at. Act7:2, SEQ ID NO: 54) operably linked to 5' encoding a chloroplast-targeted Tn7 adenylyl transferase coding sequence (GOI-at. ShkG-CTP2+ec. AadA-SPC/STR:1:1, SEQ ID NO: 54) that confers resistance to spectinomycin and is used to select transformed plant cells, said coding sequence being operably linked to 5' of a 3' termination region (T-AGRtu. Nos. 13, SEQ ID NO: 48). The GUS transgene expression cassette consists of the following elements: an enhanced cauliflower mosaic virus 35S promoter and leader sequence (EXP-CaMV. 35S-enh:1:2, SEQ ID NO: 55) operably linked to 5' of a synthetic coding sequence encoding β -glucuronidase (GOI-ec. UidA+St. LS1:3, SEQ ID NO: 50) containing a processable intron derived from the potato light-induced tissue specific ST-LS1 gene (GenBank accession number: X04753), operably linked to 5' of a 3' termination region (T-AGRtu. Nos:13, SEQ ID NO: 48).
EXP-Gm.Rsp-1:1 driven efficient automatic excision in stably transformed soybean plants.
Soybean plants were transformed with the binary construct as described above, wherein EXP (EXP-gm. Rsp-1:1 (SEQ ID NO: 20)) driven the expression of the Cre-recombinase transgene. The Cre-recombinase expression cassette consists of EXP (EXP-Gm. Rsp-1:1 (SEQ ID NO: 20)) consisting of the following elements: a promoter (P-Gm.Rsp-1-1:1:1,SEQ ID NO:21) operably linked to 5 'of a leader sequence (L-Gm.Rsp-1-1:1:1,SEQ ID NO:22) operably linked to 5' of a synthetic coding sequence encoding Cre-recombinase operably linked to 5 'of a 3' termination region (T-At.Cdc45:1, SEQ ID NO: 12).
The binary plant transformation constructs described above are used to transform soybean plant cells by agrobacterium-mediated transformation, as is well known in the art. The resulting transformed plant cells were induced under selection for spectinomycin to form whole soybean plants.
Fifty R are allowed 0 Single copy events self-pollinate. And then useAnalysis of the obtained R by assay 1 The presence of Cre, aadA and GUS transgenes in plants. Also use +.> The assay determines the zygosity of the GUS transgene cassette. Will be for fifty R 0 GUS positive (GUS+) and unlabeled (aadA-) R for each of the events 1 The percentages of the offspring are presented in table 4 below.
TABLE 4 GUS positive (GUS+) and unlabeled (aadA-) R in stably transformed soybean plants with EXP-Gm.Rsp-1:1 driven automatic excision 1 Percentage of offspring
/>
/>
/>
As can be seen from Table 4 above, except for one R 0 Except for event (event-26), all events produced a marker-free R after self-pollination 1 And (5) offspring. Some R 0 Events provide a large proportion of unlabeled R 1 Progeny plants, many of which are homozygous for the GUS marker expression cassette, such as event-12, event-15, event-18, event-20, event-37, event-41, event-44, event-46 and event-48. The P-Gm.Rsp-1:1:1 promoter (SEQ ID NO: 21) contained in EXP-Gm.Rsp-1:1 (SEQ ID NO: 20) provides efficient automatic excision in stably transformed soybean plants.
The germ line preferred Dmc1 promoter does not drive efficient automatic excision in stably transformed soybean plants.
The Dmc1 promoter, which has been demonstrated to be preferred by Arabidopsis germline, drives efficient automatic excision in transformed Arabidopsis plants. To see if such a promoter is capable of driving automatic excision in stably transformed soybean plants, binary plant transformation vectors as described above were constructed in which EXP (EXP-at. Dmc1+Zm. DnaK:1:1 (SEQ ID NO: 40)) was operably linked to 5' of the synthetic coding sequence encoding Cre-recombinase.
The binary plant transformation constructs described above are used to transform soybean plant cells by agrobacterium-mediated transformation, as is well known in the art. The resulting transformed plant cells were induced under selection for spectinomycin to form whole soybean plants.
Allow for selection of R 0 Single copy events self-pollinate. And then useAnalysis of the obtained R by assay 1 The presence of Cre, aadA and GUS transgenes in plants. Also use +.>The assay determines the zygosity of the GUS transgene cassette. Will be directed to the R of the selection 0 GUS positive (GUS+) and unlabeled (aadA-) R for each of the events 1 The percentages of the offspring are presented in table 5 below.
TABLE 5 GUS positive (GUS+) and unlabeled (aadA-) R in stably transformed soybean plants with automatic excision driven by the Dmc1 promoter 1 Percentage of offspring.
As can be seen from Table 5 above, there are only a few R 0 Single copy events provide unmarked R 1 And (5) offspring. Twenty R 0 Event(s)Only three events in (a) produced homozygous GUS positive, unlabeled plants. Compared to EXP-Gm.Rsp-1:1 (SEQ ID NO: 20), R comprising EXP-at.Dmc1+Zm.DnaK 1:1 (SEQ ID NO: 40) driving Cre-recombinase 0 The event did not provide efficient automatic excision in stably transformed plants.
The introduction of the Kozak consensus sequence into 3' of EXP-gm. Rsp-1:1 did not significantly affect the efficiency of automatic excision in stably transformed soybean plants.
The leader sequence of EXP-Gm.Rsp-1:1 (L-Gm.Rsp-1-1:1:1,SEQ ID NO:22) comprises a ninety three base pair small fragment of the Open Reading Frame (ORF) of protein 9 containing the soybean BURP domain. The small ORF fragment is in frame with the Cre-recombinase coding sequence, so this fragment does not interfere with R 1 Efficient automatic excision in progeny plants. To shift the bias of translation initiation to the initiation codon of the Cre-recombinase coding sequence, a small Kozak consensus sequence (5 '-GCAAAA-3) was operably linked to 3' of EXP-Gm.Rsp-1:1 (EXP-Gm.Rsp-1:1+Kozak) according to Nakagawa et al, 2007 (Nakagawa et al, (2007) Di versity of preferred nucleotide sequences around the translation initiati on codon in eukaryote genome.nucleic Acids Research,36 (3): 861-871).
Four binary plant transformation constructs were similarly constructed as described above, each comprising EXP-Gm.Rsp-1:1 (SEQ ID NO: 20) operably linked to 5' of the synthetic coding sequence encoding Cre-recombinase. One construct also comprises a 3' small Kozak consensus sequence operably linked to EXP-gm.rsp-1:1 (EX P-gm.rsp-1:1+kozak). The other two constructs contained truncated variants of T-At.Cdc45:1 (SEQ ID NO: 12), T-At.Cdc45:3 (SEQ ID NO: 23) and T-At.Cdc45:4 (SEQ ID NO: 24). GUS expression cassettes outside the LoxP site are replaced with expression cassettes expressing herbicide tolerance genes.
The binary plant transformation constructs described above are used to transform soybean plant cells by agrobacterium-mediated transformation, as is well known in the art. The resulting transformed plant cells were induced under selection for spectinomycin to form whole soybean plants.
Allow for selection of R 0 Single copy events self-pollinate. Then makeBy usingAnalysis of the obtained R by assay 1 The presence of Cre, aadA and herbicide tolerance transgenes in plants. Also use +.>The assay determines the zygosity of Herbicide Tolerance (HT) transgene cassettes. Will be directed to the R of the selection 0 Homozygous HT positive and marker-free R for each of the events 1 The percentages of the offspring are presented in table 6 below.
TABLE 6 homozygous HT positive and marker-free R in stably transformed soybean plants 1 Percentage of offspring.
As can be seen in Table 6 above, the introduction of the Kozak consensus sequence into the 3' of EXP-Gm.Rsp-1:1 had minimal effect on the efficiency of automatic excision in stably transformed soybean plants (23%, 29% relative to no Kozak consensus sequence). Truncated variants of T-at.Cdc45:1' UTR reduce the efficiency of automatic excision. However, for both truncated variants of T-At.Cdc45:1, automatic excision still occurs.
Introduction of introns 3' of EXP-gm. Rsp-1:1 significantly reduced automatic excision in stably transformed soybean plants.
Introns are known in the art to improve expression of transgenes. To evaluate the role of introns in automatic excision, the intron I-at. AtpE 1 (SEQ ID NO: 26) was cloned into 3' of EXP-Gm. R sp-1:1 in binary plant transformation constructs similar to those described above. In addition to Cre, aadA and GUS transgene cassettes, a third transgene cassette was cloned between LoxP sites for expression of sucrose phosphorylase genes; visual marking. When splA expression cassettes are present in the seed, the seed will shrink. Such visual markers allow for a rapid assessment of the presence of marker gene cassettes that were not removed by automated excision. The expression cassette consists of the following elements: seed enhanced EXP (EXP)Vf. Usp88-enh:1:1 (SEQ ID NO: 60)), said EXP being operably linked to 5' of a coding sequence encoding a sucrose phosphorylase (CR-AGRtu. SplA-C58:1:3, SEQ ID NO: 61), said coding sequence being operably linked to 5' of a 3' termination region (T-AGRtu. Nos. 13, SEQ ID NO: 48). Allow for selection of R 0 Single copy events self-pollinate. UsingThe assay determines the presence of and zygosity of the GUS transgene cassette. By visual inspection R 1 Offspring seeds and observed for lack of the shrunken seed phenotype, the lack of Cre, aadA and splA expression cassettes was determined. It was inferred that those seeds with the appearance of shrunken seeds still contained Cre, aadA and splA expression cassettes. Will be directed to the R of the selection 0 GUS positive (GUS+) and unlabeled (aadA-) R for each of the events 1 The percentages of the offspring are presented in table 7 below.
TABLE 7 GUS positive (GUS+) and unlabeled (aadA-) R in stably transformed soybean plants with EXP-Gm.Rsp-1:1 and I-AtpE:1 driven automatic excision 1 Percentage of offspring.
As can be seen in Table 7 above, the addition of intron I-at. AtpE 1 (SEQ ID NO: 26) to 3' of EXP-Gm. Rsp-1:1 significantly reduced the efficiency of automatic excision in stably transformed soybean plants compared to the results without introns (see Table 4 above). Minority R 0 The event produced a homozygous GUS positive unlabeled event. In contrast, as seen in example 5 below, the introduction of the I-At.AtpE 1 intron to 3' of EXP-Gm.Rsp-1:1 enhances the ability of EXP-Gm.Rsp-1:1 to effectively drive automatic excision in cotton plants.
Example 5
Automatic excision in stably transformed cotton plants is driven when the soybean Rsp-1 promoter is operably linked to the intron I-At.AtpE:1
Cotton plants were transformed with the same binary plant transformation construct comprising intron I-at. AtpE 1 (SEQ ID NO: 26) cloned into the 3' of EXP-Gm. Rsp-1:1 as described in the previous examples. Allow for selection of R 0 Single copy events self-pollinate. Using The assay determines the presence of and zygosity of the GUS transgene cassette. By visual inspection R 1 Offspring seeds and observed for lack of the shrunken seed phenotype, the lack of Cre, aadA and splA expression cassettes was determined. Will be directed to the R of the selection 0 GUS positive (GUS+) and unlabeled (aadA-) R for each of the events 1 The percentages of the offspring are presented in table 8 below.
TABLE 8 GUS positive (GUS+) and unlabeled (aadA-) R in stably transformed cotton plants with EXP-Gm.Rsp-1:1 and I-AtpE:1 driven automatic excision 1 Percentage of offspring.
As can be seen from Table 8 above, cloning the I-at. AtpE 1 (SEQ ID NO: 26) intron to 3' of EXP-Gm. Rsp-1:1 resulted in efficient automatic excision in stably transformed cotton plants, compared to the results obtained using the same binary plant transformation construct in soybean. All but one event (event-3) provides an unlabeled R 1 Offspring seed and homozygous GUS positive, unlabeled negative R 1 Offspring seed.
Example 6
Arabidopsis CDC45 promoter drives auto excision in stably transformed cotton plants
Cotton plants were transformed with constructs, particularly plant binary transformation constructs comprising EXP (EXP-at. Cdc45:1:1 (SEQ ID NO: 9)) presented in example 3, driving Cre-recombinase expression, and used to assess the ability and efficiency of automatic excision of Cre-recombinase expression cassettes and marker gene expression cassettes.
Binary plant transformation constructs as described in example 4 were cloned using methods well known in the art. EXP (EXP-At.Cdc45:1:1 (SEQ ID NO: 9)) is operably linked to 5' of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St.LS1:1:1, SEQ ID NO: 43) operably linked to 5' of a 3' termination region (T-At.Cdc45:1, SEQ ID NO: 12).
Cotton plant cells were transformed with the binary plant transformation constructs described above by agrobacterium-mediated transformation. Under the selection of spectinomycin, the resulting transformed plant cells are induced to form whole cotton plants.
Allow for selection of R 0 Single copy events self-pollinate. And then useAnalysis of the obtained R by assay 1 The presence of Cre, aadA and GUS transgenes in plants. Also use +.>The assay determines the zygosity of the GUS transgene cassette. Will be directed to the R of the selection 0 GUS positive (GUS+) and unlabeled (aadA-) R for each of the events 1 The percentages of the offspring are presented in table 9 below.
TABLE 9 GUS positive (GUS+) and unlabeled (aadA-) R in stably transformed cotton plants with automatic excision driven by the Arabidopsis CDC45 promoter 1 Percentage of offspring.
As can be seen from Table 9 above, except for one R 0 Except for event (event-14), all events produced GUS positive (GUS+) and unlabeled (aadA-) R 1 And (5) offspring. Some R 0 Events produce a significant proportion of unlabeled and homozygous unlabeled events, such as event-2, event-7, and event-11. The CDC45 promoter (P-at. Cdc45-1:1:1,SEQ ID NO:10) contained in EXP-at. Cdc45:1:1 (SEQ ID NO: 9) was able to efficiently drive automatic excision in stably transformed cotton plants.
Example 7
Chimeric P-vf. Usp 88-chimeric promoters drive efficient automatic excision in stably transformed canola plants
The canola plants were transformed with constructs, particularly plant binary transformation constructs comprising EXP (EXP-vf. Usp88-enh:1:1 (SEQ ID NO: 60)) to drive Cre-recombinase expression, and used to assess the ability and efficiency of automatic excision of Cre-recombinase expression cassettes and marker gene expression cassettes. EXP (EXP-vf. Usp88-enh:1:1 (SEQ ID NO: 60)) consists of the following elements: chimeric promoter P-vf. Usp 88-chimeric (SEQ ID NO: 61) operably linked to 5' of leader sequence L-vf. Usp-1:1:1 (SEQ ID NO: 62).
Transforming a canola plant with a binary plant transformation construct comprising the following three transgene expression cassettes: cre-recombinase expression cassette and a selectable marker expression cassette (aadA) for selecting transformed soybean cells flanked by two loxP sites (RS-P1. Lox1:1, SEQ ID NO: 44); and a third expression cassette outside the LoxP site for expressing a beta-Glucuronidase (GUS) transgene. The Cre-recombinase expression cassette consists of the following elements: EXP (EXP-vf. Usp88-enh:1:1 (SEQ ID NO: 60)), operably linked to 5' of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St. LS1:1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number X04753), operably linked to 5' of a 3' termination region (T-Br. Snap2-1:3:6,SEQ ID NO:59). The transformation selection marker cassette aadA consists of the following elements: EXP (EXP-at. Act7:2, SEQ ID NO: 53) operably linked to 5' encoding a chloroplast-targeted Tn7 adenylyl transferase coding sequence (GOI-at. ShkG-CTP2+ec. AadA-SPC/STR:1:1, SEQ ID NO: 54) that confers resistance to spectinomycin and is used to select transformed plant cells, said coding sequence being operably linked to 5' of a 3' termination region (T-AGRt u. Nos. 13, SEQ ID NO: 48). The GUS transgene expression cassette consists of the following elements: an enhanced cauliflower mosaic virus 35S promoter and leader sequence (EXP-CaMV. 35S-enh:1:2, SEQ ID NO: 55) operably linked to 5' of a synthetic coding sequence encoding β -glucuronidase (GOI-ec. UidA+St. LS1:3, SEQ ID NO: 50) containing a processable intron derived from the potato light-induced tissue specific ST-LS1 gene (GenBank accession number: X04753), operably linked to 5' of a 3' termination region (T-AG Rtu. Nos:13, SEQ ID NO: 48).
The binary plant transformation constructs described above are used to transform canola plant cells by agrobacterium-mediated transformation, as is well known in the art. The resulting transformed plant cells were induced to form whole canola plants under the selection of spectinomycin (aadA).
Allow for selection of R 0 Single copy events self-pollinate. And then useAnalysis of the obtained R by assay 1 Presence of aadA and GUS transgenes in seeds. Also using GUS transgene +.>Determination of R by assay 1 Zygosity of seeds for the integration construct. According to each selected R 0 Event selfing, eighty-eight R was determined 1 Seed. Table 10 below shows the number of GUS+/aadA-events of hemizygous and homozygous observed for each event seed sample.
Table 10. Number and percentage of GUS+/aadA-R1 progeny seeds of hemizygous and homozygous samples in stably transformed canola plants with automatic excision driven by chimeric P-vf. Usp 88-chimeric promoter.
/>
As can be seen from Table 9 above, except for one R 0 Except for event (event-3), all events produced homozygous GUS positive (GUS+) and unlabeled (aadA-) R 1 And (5) offspring. Event-3 may be a chimeric event in which some cells are not transformed, as happens occasionally in plant transformation. Some R 0 Events produce a significant proportion of unmarked events of hemizygous events, such as event-2, event-4, event-5, event-7, and event-8. EXP (EXP-vf. Usp88-enh:1:1 (SEQ ID NO: 61)) comprising the chimeric promoter P-vf. Usp 88-chimera (SEQ ID NO: 61) was able to efficiently drive automatic excision in stably transformed canola plants.
Example 8
Automatic excision of EXP-Gm.Rsp-1:1 driven selection markers and genome editing transgene cassettes in stably transformed soybean plants
Soybean plants were transformed with constructs, particularly plant binary transformation constructs comprising EXP (EXP-Gm. Rsp-1:1 (SEQ ID NO: 20)) driving Cre-recombinase expression, and used to evaluate the ability and efficiency of automatic excision of Cre-recombinase expression cassettes, marker gene expression cassettes, and expression cassettes for soybean genome editing.
Transforming a soybean plant with a binary plant transformation construct comprising the following five transgenic expression cassettes; cre-recombinase expression cassette; a selectable marker expression cassette (aadA) for selecting transformed soybean cells; an expression cassette for expressing a Cpf1 CRISPR-associated nuclease; an expression cassette for expression of guide RN A flanked by two LoxP sites (RS-P1. Lox1:1, SEQ ID NO: 44); and a fifth expression cassette located outside the LoxP site for expressing genes of agronomic interest. The Cre-recombinase expression cassette consists of the following elements: EXP (EXP-Gm. Rsp-1:1 (SEQ ID NO: 20)) operably linked to 5 'of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St. LS1:1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number X04753) operably linked to a 3' termination region (T-at. Cdc45:3, SE Q ID NO: 23). The transformation selection marker cassette aadA consists of the following elements: EXP (EXP-at. Act7:2, SEQ ID NO: 54) operably linked to 5' encoding a chloroplast-targeted Tn7 adenylyl transferase coding sequence (GOI-at. ShkG-CTP2+ec. AadA-SPC/STR:1:1, SEQ ID NO: 54) that confers resistance to spectinomycin and is used to select transformed plant cells, said coding sequence being operably linked to 5' of a 3' termination region (T-AGRtu. Nos. 13, SEQ ID NO: 48). The Cpf1 expression cassette consists of the following elements: constitutively expressed EXP is provided operably linked to the 5' of a nuclear targeted Cpf1 coding sequence operably linked to the 5' of a 3' plant termination region. The guide RNA expression cassette consists of the following elements: an RNA polymerase III promoter operably linked to 5' of a guide RNA coding sequence operably linked to 5' of a 3' RNA polymerase III termination region. All four expression cassettes are flanked by LoxP sites. The fifth transgene expression cassette consists of the following elements: plants are constitutively EXP operably linked to 5' of a coding sequence of a gene encoding an agronomic benefit operably linked to 5' of a 3' plant termination region.
The guide RNA transgene cassette comprises a guide RNA designed to provide site-specific integration of the construct at a specific location within the soybean genome. The construct is designed such that upon integration of the construct into R 1 The specific region of the soybean genome of the generation soybean seed is then provided with Cre, aadA, cpf1 and automatic excision of the guide RNA transgene expression cassette. After automatic excision, at R 1 Only the fifth transgenic expression cassette for expression of genes of agronomic interest should be maintained in the transgenic soybean plants.
All R's are determined 0 Transgenic copy number of transgenic event. They were also analyzed for site-specific integration using flanking PCR assays, in which amplification was performed across the target site ligation. By bringing selected R 0 Transgenic events selfed to generate R 1 Seed generation.Determination of R 1 Random samples of seed generation were marked and genome editing cassettes were present. Table 11 below shows the number and percentage of seeds without hemizygous and homozygous markers and genome editing cassettes from eight selected events.
Table 11 number and percentage of R1 progeny seeds without hemizygous and homozygous markers and genome editing cassettes sampled from stably transformed soybean plants with EXP-Gm.Rsp-1:1 driven automatic excision.
As can be seen in Table 11 above, EXP (EXP-Gm. Rsp-1:1 (SEQ ID NO: 20)) is capable of driving the automatic excision of markers and genome editing cassettes with an efficiency similar to that shown in example 4 above.
Example 9
EXP-Gm.Nmh7:1 driven automatic excision of selectable markers and genome editing transgene cassettes in stably transformed soybean plants
Soybean plants were transformed with constructs, particularly plant binary transformation constructs comprising EXP (EXP-Gm. Nmh7:1 (SEQ ID NO: 64)) driving Cre-recombinase expression, and used to evaluate the ability and efficiency of automatic excision of Cre-recombinase expression cassettes, marker gene expression cassettes, and expression cassettes for soybean genome editing.
Transforming a soybean plant with a binary plant transformation construct comprising five transgenic expression cassettes; cre-recombinase expression cassette; a selectable marker expression cassette (aadA) for selecting transformed soybean cells; an expression cassette for expressing a Cpf1 CRISPR-associated nuclease; an expression cassette for expressing a guide RNA flanked by two LoxP sites (RS-P1. Lox1:1, SEQ ID NO: 44); and a fifth expression cassette located outside the LoxP site for expressing genes of agronomic interest. All expression cassettes were similar to those described in example 7 above. The Cre-recombinase expression cassette consists of the following elements: EXP (EXP-Gm. Nmh7:1 (SEQ ID NO: 64)) operably linked to 5' of a synthetic coding sequence encoding Cre-recombinase (GOI-P1. Cre-St. LS1:1:1, SEQ ID NO: 43) containing a processable intron derived from the potato light-induced tissue-specific ST-LS1 gene (GenBank accession number X04753) operably linked to 5' of a 3' termination region (T-Gb. E6-3b:1:1,SEQ ID NO:67). Two different constructs were used in the transformation. Each construct comprises a different constitutive promoter driving expression of the Cpf1 CRISPR-associated nuclease.
All R's are determined 0 Transgenic copy number of transgenic event. They were also analyzed for site-specific integration using flanking PCR assays, in which amplification was performed across the target site ligation. By bringing selected R 0 Transgenic events selfed to generate R 1 Seed generation. Determination of R 1 Random samples of seed generation were marked and genome editing cassettes were present. Table 12 below shows the number and percentage of seeds without hemizygous and homozygous markers and genome editing cassettes from eight selected events.
Table 12 number and percentage of R1 progeny seeds without hemizygous and homozygous markers and genome editing cassettes sampled from stably transformed soybean plants with EXP-Gm.Nmh7:1 driven automatic excision.
As can be seen in Table 12 above, two events derived from transformation using each construct resulted in R without hemizygous markers and genome editing cassettes 1 And (5) offspring. Event 2, resulting from transformation using construct-1, even produced several R's without homozygous markers and genome editing cassettes 1 And (5) offspring.
Example 10
Automatic excision of selectable markers and genome editing transgene cassettes in stably transformed maize plants driven by maize and rice CDC45-1 promoters
Maize plants are transformed with a construct, in particular a plant binary transformation construct comprising EXP-Zm.Cdc45-1+Zm.DnaK:1:1 (SEQ ID NO: 1) or EXP-Os.Cdc45-1:1:1 (SEQ ID NO: 4) driving Cre-recombinase expression.
A maize plant is transformed with a construct comprising at least five expression cassettes. The construct comprises a Cre-recombinase expression cassette, at least one selectable marker cassette, an expression cassette for expressing a CRISP R-associated nuclease such as Cpf1, at least one guide RNA cassette; all these cassettes flank the loxp site. The construct further comprises at least one expression cassette outside the LoxP site for expression of genes of agronomic interest. Each construct contained a Cre-recombinase expression cassette similar to that described in example 2, with EXP-Zm.Cdc45-1+Zm.DnaK:1:1 (SEQ ID NO: 1) or EXP-Os.Cdc45-1:1:1 (SEQ ID NO: 4) used to drive Cre-recombinase expression.
Transformation of maize plants with either of the two constructs and selection of R using molecular assays 0 Transgenic events to determine copy number, insert integrity. If the construct is designed to provide site-specific integration of the construct; confirmation of the insertion site within the maize genome is confirmed by amplification of the insertion site ligation. By bringing selected R 0 Transgenic event selfing to generate R 1 Seed generation. Seeds or germinated offspring were analyzed for removal of Cre, marker, cpf1 and guide RNA cassette. They were also analyzed for the presence of one or more expression cassettes for one or more agronomically beneficial genes.
Having illustrated and described the principles of the present invention, it will be apparent to those skilled in the art that the invention may be modified in arrangement and detail without departing from such principles. We claim all such modifications as fall within the spirit and scope of the following claims. All publications and published patent documents cited herein are incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
Claims (50)
1. A recombinant DNA construct comprising a DNA regulatory sequence comprising:
a. a sequence having at least 80% sequence identity to any one of SEQ ID NOs 1-26, 59-62 and 64-66;
b. a sequence comprising any one of SEQ ID NOs 1-26, 59-62 and 64-66; and
c. (i) A fragment of any one of SEQ ID NOs 1-26, 59-62 and 64-66 or (ii) any sequence having at least 80% sequence identity to any one of SEQ ID NOs 1-26, 59-62 and 64-66, wherein said fragment has gene regulatory activity;
Wherein the DNA regulatory sequence is operably linked to a heterologous transcribable DNA sequence encoding a site-specific recombinase.
2. The recombinant DNA molecule of claim 1, wherein said DNA regulatory sequence has at least 90% sequence identity to the DNA sequence of any one of SEQ ID NOs 1-26, 59-62 and 64-66.
3. The recombinant DNA molecule of claim 1 or 2, wherein said DNA regulatory sequence has at least 95% sequence identity to the DNA sequence of any one of SEQ ID NOs 1-26, 59-62 and 64-66.
4. The recombinant DNA molecule of claim 1, 2 or 3, wherein said DNA regulatory sequence has gene regulatory activity.
5. The recombinant DNA molecule of any one of claims 1-4, wherein the DNA regulatory sequence is a germline-preferred promoter.
6. The recombinant DNA molecule of any one of claims 1-5, wherein the germline-preferred promoter is selected from the group consisting of: SEQ ID NO. 2, SEQ ID NO. 5, SEQ ID NO. 7, SEQ ID NO. 10, SEQ ID NO. 13, SEQ ID NO. 14, SEQ ID NO. 17, SEQ ID NO. 21 and SEQ ID NO. 65, or a sequence having at least 80% sequence identity to any of SEQ ID NO. 2, 5, 7, 10, 13, 14, 17, 21 and 65.
7. The recombinant DNA molecule of any one of claims 1-6, wherein the germline-preferred promoter is a CDC45 promoter.
8. The CDC45 promoter of any one of claims 1-7, wherein the CDC45 promoter is selected from the group consisting of: SEQ ID NO. 2, SEQ ID NO. 5 and SEQ ID NO. 10, or a sequence having at least 80% sequence identity to any of SEQ ID NO. 2, 5 and 10.
9. The recombinant DNA molecule of any one of claims 1-5, wherein the DNA regulatory sequence is SEQ ID No. 60 or a sequence having at least 80% sequence identity to SEQ ID No. 60.
10. The recombinant DNA molecule of any one of claims 1-9, wherein the site-specific recombinase is selected from the group consisting of: cre-recombinase, flp-recombinase, R-recombinase and Gin-recombinase.
11. The recombinant DNA molecule of any one of claims 1-10, wherein the site-specific recombinase is Cre-recombinase.
12. The recombinant DNA construct of any one of claims 1-11, further comprising one or both of the following expression cassettes: a selectable marker transgene; and/or transgenes of agronomic interest.
13. The recombinant DNA construct according to any one of claims 1-12, further comprising a pair of site-specific recombination site sequences flanking one or both transcribable DNA sequences encoding the site-specific recombinase and/or the selectable marker transgene, wherein the site-specific recombination site is cleavable by the site-specific recombinase.
14. The recombinant DNA construct according to claim 13, wherein said pair of site-specific recombination site sequences are oriented in a head-to-tail arrangement.
15. The recombinant DNA construct according to claim 13 or 14, wherein said selectable marker transgene confers resistance to a herbicide or antibiotic.
16. The recombinant DNA construct according to any one of claims 13, 14 or 15, wherein said pair of site-specific recombination site sequences are each selected from the group consisting of: loxP, lox.TATA-R9, FRT, RS and GIX.
17. The recombinant DNA construct according to any one of claims 12-16, wherein the pair of site-specific recombination sites are each LoxP or lox.tata-R9 sites.
18. The recombinant DNA construct according to any one of claims 12-17, wherein said pair of site-specific recombination site sequences each comprises SEQ ID No. 44 or SEQ ID No. 45.
19. The recombinant DNA construct according to any one of claims 12-18, wherein said agronomically beneficial transgene confers herbicide tolerance in a plant.
20. The recombinant DNA construct according to any one of claims 12-18, wherein said agronomically beneficial transgene confers insect resistance or disease resistance in a plant.
21. The recombinant DNA construct according to any one of claims 12-18, wherein said agronomically beneficial transgene confers increased yield or stress tolerance in a plant.
22. The recombinant DNA construct according to any one of claims 12-21, wherein said agronomically beneficial transgene encodes a dsRNA, miRNA or siRNA.
23. The recombinant DNA construct of any one of claims 1-22, further comprising one or both of: an expression cassette encoding a guide RNA; and/or an expression cassette encoding a site-specific nuclease.
24. The recombinant DNA construct according to claim 23, further comprising a pair of site-specific recombination site sequences flanking one or more transcribable DNA sequences encoding the site-specific recombinase, a selectable marker transgene, an expression cassette encoding the guide RNA, and/or an expression cassette encoding the site-specific nuclease, wherein the site-specific recombination site is cleavable by the site-specific recombinase.
25. The recombinant DNA construct according to claim 23 or 24, wherein said guide RNA comprises a targeting sequence that targets a sequence in the genome of a eukaryotic cell for genome editing or site-specific integration.
26. The recombinant DNA construct according to claim 25, wherein said eukaryotic cell is a plant cell.
27. The recombinant DNA construct according to any one of claims 23-26, comprising two or more expression cassettes encoding two or more guide RNAs.
28. The recombinant DNA construct according to any one of claims 23-27, wherein there are two, three, four, five, six, seven, eight, nine or ten different expression cassettes encoding guide RNAs.
29. The recombinant DNA construct according to any one of claims 23-28, wherein said site-specific nuclease is an RNA-guided endonuclease.
30. The recombinant DNA construct according to any one of claims 23-29, wherein said RNA-guided endonuclease is selected from the group consisting of: cas1, cas1B, cas2, cas3, cas4, cas5, cas6, cas7, cas8, cas9, cas10, cpf1, cys2, csy3, cse1, cse2, csc1, csc2, csa5, csn2, csm3, csm4, csm5, csm6, cmr1, cmr3, cmr4, cmr5, cmr6, csb1, csb2, csb3, csx17, csx14, csx10, csx16, csaX, csx3, csx1, csx15, csf1, csf2, csf3, csf4, casX and CasY.
31. The recombinant DNA construct of claim 30, wherein the RNA-guided endonuclease is Cas9 or Cpf1.
32. A DNA molecule or vector comprising the recombinant DNA construct of any one of claims 1-31.
33. A DNA transformation vector comprising the recombinant DNA construct of any one of claims 1-31 and a T-DNA segment bounded by a left border and a right border.
34. The DNA transformation vector of claim 33, wherein the transcribable DNA sequence encoding the site-specific recombinase is located between the left and right boundaries of the T-DNA segment.
35. A DNA transformation vector comprising the recombinant DNA construct of any one of claims 12-31 and a T-DNA segment having a left border and a right border, wherein one or more of the transcribable DNA sequence encoding the site-specific recombinase, the selectable marker transgene, and/or the agronomically beneficial transgene is located between the left border and the right border of the T-DNA segment.
36. A DNA transformation vector comprising the recombinant DNA construct of any one of claims 23-31 and a T-DNA segment having a left border and a right border, wherein one or more of the transcribable DNA sequence encoding the site-specific recombinase, the selectable marker transgene, the agronomically beneficial transgene, the expression cassette encoding the guide RNA, and/or the expression cassette encoding the site-specific nuclease is located between the left border and the right border of the T-DNA segment.
37. A transgenic plant, plant part or plant cell comprising the recombinant DNA construct of any one of claims 1-31.
38. The transgenic plant, plant part, or plant cell of claim 37, wherein the recombinant DNA construct is stably transformed into the genome of the transgenic plant, plant part, or plant cell.
39. The transgenic plant, plant part or plant cell according to claim 37 or 38, wherein the transgenic plant, plant part or plant cell is a maize, soybean, cotton or canola plant, plant part or plant cell.
40. A bacterial cell comprising the recombinant DNA construct of any one of claims 1-31, the DNA molecule or vector of claim 32, or the transformation vector of any one of claims 33-36.
41. A method of producing a transgenic plant or plant part comprising:
a. transforming plant cells of an explant with a DNA molecule or vector comprising the recombinant DNA construct of any one of claims 1-30 to produce one or more transformed plant cells comprising the recombinant DNA construct stably transformed into the genome of the one or more transformed plant cells;
b. Regenerating or developing a transgenic plant from said explant, wherein said transgenic plant comprises said recombinant DNA construct stably transformed into the genome of one or more cells of said transgenic plant.
42. The method of claim 41, wherein the plant cell is transformed via Agrobacterium-mediated transformation or rhizobium-mediated transformation.
43. The method of claim 41, wherein the plant cell is transformed via particle-mediated transformation or particle bombardment-mediated transformation.
44. The method of claim 41, wherein the transgenic plant and plant cell are maize, soybean, cotton or canola plants and plant cells, respectively.
45. The method of claim 41, further comprising:
a. isolating or harvesting a plant part from said transgenic plant.
46. A method for excision of an expression cassette from the genome of a transgenic plant comprising:
a. transforming a plant cell with a DNA molecule or vector comprising the recombinant DNA construct of any one of claims 13-29 to produce one or more transformed plant cells comprising the recombinant DNA construct stably transformed into the genome of the one or more transformed plant cells;
b. Regenerating or developing a transgenic plant at least partially from the one or more stably transformed plant cells;
c. crossing the transgenic plant with itself or another plant; and
d. selecting one or more progeny plants, wherein one or both of the transcribable DNA sequence encoding the site-specific recombinase and/or a selectable marker transgene between a pair of site-specific recombination site sequences of the recombinant DNA construct is excised and no longer present in the genome of the progeny plant.
47. The method according to claim 46,
wherein the recombinant DNA construct further comprises one or both of the following expression cassettes located between the pair of DNA site-specific recombination site sequences of the recombinant DNA construct: an expression cassette encoding a guide RNA and/or an expression cassette encoding a site-specific nuclease, and wherein one or more progeny plants are selected, wherein one or more of the transcribable DNA sequence encoding the site-specific recombinase, the selectable marker transgene, the expression cassette encoding the guide RNA, and/or the expression cassette encoding the site-specific nuclease of the recombinant DNA construct is excised and no longer present in the genome of the progeny plants.
48. The method of claim 46, wherein the transgenic plant and plant cell are maize, soybean, cotton or canola plants and plant cells, respectively.
49. The method of claim 46, further comprising:
a. plant parts are isolated or harvested from one or more of the progeny plants.
50. The method of claim 46, further comprising:
a. crossing one or more of said progeny plants with itself or another plant.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063093893P | 2020-10-20 | 2020-10-20 | |
US63/093,893 | 2020-10-20 | ||
PCT/US2021/055596 WO2022086951A1 (en) | 2020-10-20 | 2021-10-19 | Plant regulatory elements and uses thereof for autoexcision |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116529378A true CN116529378A (en) | 2023-08-01 |
Family
ID=81291290
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202180071059.8A Pending CN116529378A (en) | 2020-10-20 | 2021-10-19 | Plant regulatory element and use thereof for automatic excision |
Country Status (5)
Country | Link |
---|---|
US (1) | US20240035039A1 (en) |
EP (1) | EP4232587A1 (en) |
CN (1) | CN116529378A (en) |
CA (1) | CA3196238A1 (en) |
WO (1) | WO2022086951A1 (en) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AUPN903196A0 (en) * | 1996-03-29 | 1996-04-26 | Australian National University, The | Single-step excision means |
EP3830277A4 (en) * | 2018-08-03 | 2022-05-04 | Monsanto Technology LLC | Plant regulatory elements and uses thereof |
-
2021
- 2021-10-19 EP EP21883690.6A patent/EP4232587A1/en active Pending
- 2021-10-19 CA CA3196238A patent/CA3196238A1/en active Pending
- 2021-10-19 WO PCT/US2021/055596 patent/WO2022086951A1/en active Application Filing
- 2021-10-19 CN CN202180071059.8A patent/CN116529378A/en active Pending
- 2021-10-19 US US18/032,313 patent/US20240035039A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2022086951A1 (en) | 2022-04-28 |
CA3196238A1 (en) | 2022-04-28 |
EP4232587A1 (en) | 2023-08-30 |
US20240035039A1 (en) | 2024-02-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20240124884A1 (en) | Methods and compositions for use of directed recombination in plant breeding | |
JP6505599B2 (en) | A specially designed transgene integration platform (ETIP) for gene targeting and trait stacking | |
US20240110197A1 (en) | Expression modulating elements and use thereof | |
US20210348179A1 (en) | Compositions and methods for regulating gene expression for targeted mutagenesis | |
EP3472315A1 (en) | Methodologies and compositions for creating targeted recombination and breaking linkage between traits | |
JP2022534381A (en) | Methods and compositions for generating dominant alleles using genome editing | |
US20210123067A1 (en) | Compositions and methods for transferring cytoplasmic or nuclear traits or components | |
WO2020072418A1 (en) | Compositions and methods for transferring biomolecules to wounded cells | |
CN116529378A (en) | Plant regulatory element and use thereof for automatic excision | |
WO2023201186A1 (en) | Plant regulatory elements and uses thereof for autoexcision | |
US20230340517A1 (en) | Compositions and methods for enhancing corn traits and yield using genome editing | |
US20230313216A1 (en) | Compositions and methods for enhancing corn traits and yield using genome editing | |
US20230354762A1 (en) | Compositions and methods for enhancing corn traits and yield using genome editing | |
WO2023230459A2 (en) | Compositions and methods for targeting donor polynucelotides in soybean genomic loci | |
US20230242928A1 (en) | Modulating nucleotide expression using expression modulating elements and modified tata and use thereof | |
WO2022103878A1 (en) | Methods to improve site-directed integration frequency | |
EP3975701A1 (en) | Methods and compositions for generating dominant short stature alleles using genome editing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |