WO2021175288A1 - 改进的胞嘧啶碱基编辑*** - Google Patents
改进的胞嘧啶碱基编辑*** Download PDFInfo
- Publication number
- WO2021175288A1 WO2021175288A1 PCT/CN2021/079086 CN2021079086W WO2021175288A1 WO 2021175288 A1 WO2021175288 A1 WO 2021175288A1 CN 2021079086 W CN2021079086 W CN 2021079086W WO 2021175288 A1 WO2021175288 A1 WO 2021175288A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- amino acid
- apobec3b deaminase
- ha3bctd
- ha3b
- base editing
- Prior art date
Links
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 title abstract description 18
- 229940104302 cytosine Drugs 0.000 title abstract description 9
- 101000964385 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3B Proteins 0.000 claims abstract description 164
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 83
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 83
- 108091033409 CRISPR Proteins 0.000 claims abstract description 73
- 102100040262 DNA dC->dU-editing enzyme APOBEC-3B Human genes 0.000 claims abstract description 70
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 68
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 53
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 37
- 239000012636 effector Substances 0.000 claims abstract description 29
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 103
- 102000048415 human APOBEC3B Human genes 0.000 claims description 94
- 238000006467 substitution reaction Methods 0.000 claims description 54
- 210000004899 c-terminal region Anatomy 0.000 claims description 48
- 210000004027 cell Anatomy 0.000 claims description 46
- 108020005004 Guide RNA Proteins 0.000 claims description 43
- 239000002773 nucleotide Substances 0.000 claims description 34
- 125000003729 nucleotide group Chemical group 0.000 claims description 34
- 230000014509 gene expression Effects 0.000 claims description 33
- 238000000034 method Methods 0.000 claims description 30
- 150000001413 amino acids Chemical group 0.000 claims description 23
- 101710163270 Nuclease Proteins 0.000 claims description 21
- 108010008532 Deoxyribonuclease I Proteins 0.000 claims description 7
- 102000007260 Deoxyribonuclease I Human genes 0.000 claims description 7
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 claims description 6
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 claims description 4
- 235000003869 genetically modified organism Nutrition 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 230000030648 nucleus localization Effects 0.000 claims description 3
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 claims description 2
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 claims description 2
- 239000003112 inhibitor Substances 0.000 claims 1
- 230000009437 off-target effect Effects 0.000 abstract description 6
- 235000001014 amino acid Nutrition 0.000 description 65
- 241000196324 Embryophyta Species 0.000 description 56
- 235000018102 proteins Nutrition 0.000 description 44
- 230000035772 mutation Effects 0.000 description 33
- 230000000694 effects Effects 0.000 description 22
- 150000007523 nucleic acids Chemical class 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 17
- 230000007018 DNA scission Effects 0.000 description 17
- 230000009466 transformation Effects 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 15
- 102000039446 nucleic acids Human genes 0.000 description 14
- 108020004707 nucleic acids Proteins 0.000 description 14
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 12
- 230000001105 regulatory effect Effects 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 11
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 10
- 240000007594 Oryza sativa Species 0.000 description 10
- 235000007164 Oryza sativa Nutrition 0.000 description 10
- 238000010362 genome editing Methods 0.000 description 10
- 235000009566 rice Nutrition 0.000 description 10
- 102000053602 DNA Human genes 0.000 description 9
- 108090000765 processed proteins & peptides Proteins 0.000 description 9
- 210000001938 protoplast Anatomy 0.000 description 8
- 230000008685 targeting Effects 0.000 description 8
- 108020004682 Single-Stranded DNA Proteins 0.000 description 7
- 244000098338 Triticum aestivum Species 0.000 description 7
- 102000004196 processed proteins & peptides Human genes 0.000 description 7
- 241000894007 species Species 0.000 description 7
- 241000272517 Anseriformes Species 0.000 description 6
- 244000062793 Sorghum vulgare Species 0.000 description 6
- 235000021307 Triticum Nutrition 0.000 description 6
- 240000008042 Zea mays Species 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 244000038559 crop plants Species 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 101000860090 Acidaminococcus sp. (strain BV3L6) CRISPR-associated endonuclease Cas12a Proteins 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 101000860092 Francisella tularensis subsp. novicida (strain U112) CRISPR-associated endonuclease Cas12a Proteins 0.000 description 4
- 241000287828 Gallus gallus Species 0.000 description 4
- 244000068988 Glycine max Species 0.000 description 4
- 235000010469 Glycine max Nutrition 0.000 description 4
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 description 4
- 240000005979 Hordeum vulgare Species 0.000 description 4
- 235000007340 Hordeum vulgare Nutrition 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 4
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 235000013330 chicken meat Nutrition 0.000 description 4
- 235000005822 corn Nutrition 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 241000283690 Bos taurus Species 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- 241000282472 Canis lupus familiaris Species 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 108010080611 Cytosine Deaminase Proteins 0.000 description 3
- 102000000311 Cytosine Deaminase Human genes 0.000 description 3
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- 241000282326 Felis catus Species 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 241001494479 Pecora Species 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 241000282887 Suidae Species 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 108020004566 Transfer RNA Proteins 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 230000009615 deamination Effects 0.000 description 3
- 238000006481 deamination reaction Methods 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 230000005017 genetic modification Effects 0.000 description 3
- 235000013617 genetically modified food Nutrition 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 244000144977 poultry Species 0.000 description 3
- 235000013594 poultry meat Nutrition 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 241000093740 Acidaminococcus sp. Species 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 108091093088 Amplicon Proteins 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- 235000017060 Arachis glabrata Nutrition 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- 235000010777 Arachis hypogaea Nutrition 0.000 description 2
- 235000018262 Arachis monticola Nutrition 0.000 description 2
- 101000909256 Caldicellulosiruptor bescii (strain ATCC BAA-1888 / DSM 6725 / Z-1320) DNA polymerase I Proteins 0.000 description 2
- 229920000742 Cotton Polymers 0.000 description 2
- 101710177611 DNA polymerase II large subunit Proteins 0.000 description 2
- 101710184669 DNA polymerase II small subunit Proteins 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 241000588088 Francisella tularensis subsp. novicida U112 Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 241000219146 Gossypium Species 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 102000029812 HNH nuclease Human genes 0.000 description 2
- 108060003760 HNH nuclease Proteins 0.000 description 2
- 244000020551 Helianthus annuus Species 0.000 description 2
- 235000003222 Helianthus annuus Nutrition 0.000 description 2
- 241000689670 Lachnospiraceae bacterium ND2006 Species 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 240000003183 Manihot esculenta Species 0.000 description 2
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 2
- 240000004658 Medicago sativa Species 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- 244000061176 Nicotiana tabacum Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 101000902592 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) DNA polymerase Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 230000009418 agronomic effect Effects 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 230000033590 base-excision repair Effects 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 230000005782 double-strand break Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 241001233957 eudicotyledons Species 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 235000020232 peanut Nutrition 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-ULQXZJNLSA-N 4-amino-1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-tritiopyrimidin-2-one Chemical compound O=C1N=C(N)C([3H])=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-ULQXZJNLSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 102000012758 APOBEC-1 Deaminase Human genes 0.000 description 1
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 108010052875 Adenine deaminase Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101150069031 CSN2 gene Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- 101150074775 Csf1 gene Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 102000005381 Cytidine Deaminase Human genes 0.000 description 1
- 108010031325 Cytidine deaminase Proteins 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 208000035240 Disease Resistance Diseases 0.000 description 1
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 1
- 241000221785 Erysiphales Species 0.000 description 1
- 108010002537 Fruit Proteins Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 108700041896 Zea mays Ubi-1 Proteins 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 101150055601 cops2 gene Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- QDOXWKRWXJOMAK-UHFFFAOYSA-N dichromium trioxide Chemical compound O=[Cr]O[Cr]=O QDOXWKRWXJOMAK-UHFFFAOYSA-N 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 238000012165 high-throughput sequencing Methods 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 102000048646 human APOBEC3A Human genes 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000033607 mismatch repair Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/04—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
- C12Y305/04001—Cytosine deaminase (3.5.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y305/00—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
- C12Y305/04—Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
- C12Y305/04005—Cytidine deaminase (3.5.4.5)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Definitions
- the invention belongs to the field of gene editing. Specifically, the present invention relates to an improved cytosine base editing system, which has a significantly reduced genome-wide off-target effect and a narrow editing window.
- Genome editing technology is a genetic engineering technology based on the targeted modification of the genome by artificial nucleases, and it is playing an increasingly powerful role in agricultural and medical research.
- Clustered regularly spaced short palindromic repeats and its related system are currently the most widely used genome editing tools.
- Cas The protein Under the guidance of the artificially designed guide RNA, Cas The protein can be targeted to any location in the genome.
- the base editing system is a new gene editing technology developed based on the CRISPR system. It is divided into cytosine base editing system and adenine base editing system.
- Cytosine deaminase and adenine deaminase are respectively combined with Cas9 single-stranded nickase. Fusion, under the targeting action of the guide RNA, Cas9 single-stranded nickase produces a single-stranded DNA region, so the deaminase can efficiently deaminate the C and A nucleotides on the single-stranded DNA at the targeted position. , Become U bases and I bases, and then are repaired into T bases and G bases in the process of cell self-repair.
- the cytosine base editing system has been found to produce unpredictable off-target phenomena in the genome. This is probably due to the overexpression of cytosine deaminase in the genome and random deamination in the highly active regions of the genome. Caused. In addition, if there are multiple Cs in the working window of the target site, the existing high-efficiency base editing system often obtains products with multiple C changes at the same time, and cannot obtain products with only a single C mutation. The specificity of the genome and the accuracy of the target site greatly influence the use of cytosine base editing systems.
- the specificity and accuracy of the cytosine base editing system may be related to the ability of cytosine deaminase to bind to single-stranded DNA, changing or weakening the ability of deaminase to bind to single-stranded DNA without reducing the ability of deaminase With the ability of deamination, it is possible to obtain a cytosine single-base editing system that is both high-efficiency, specific and precise.
- the inventors optimized Loop1 and Loop7 in the human-derived hA3Bctd (APOBEC3B C-terminal domain) domain that binds to single-stranded DNA, and tested the obtained variants through rice protoplast transformation. The efficiency and accuracy of the variants were obtained, and the specificity of the obtained variants was tested, thereby obtaining a series of highly efficient, highly specific, and highly accurate base editing systems.
- Figure 1 Shows the selection of A3Bctd mutation sites.
- Figure 2 Shows the targeting efficiency and off-target efficiency of the base editing system to be tested.
- Figure 3 Shows the average targeting efficiency and average off-target efficiency of the base editing system to be tested.
- Figure 4. Shows the combination of double protrusion and triple mutant.
- Figure 5 Shows the protoplast transformation to verify the targeting and off-target efficiency of the double and triple mutants.
- Figure 6 Shows the average targeting efficiency and average off-target efficiency of the double and triple mutants to be tested.
- Figure 7 Shows the working efficiency of different base editing systems for different Cs at the four targeted sites.
- Figure 8 Shows the average mutation types of editing products of different base editing systems at four targeted sites.
- the term “and/or” encompasses all combinations of items connected by the term, and should be treated as if each combination has been individually listed herein.
- “A and/or B” encompasses “A”, “A and B”, and “B”.
- “A, B, and/or C” encompasses "A”, “B”, “C”, “A and B”, “A and C”, “B and C”, and "A and B and C”.
- the protein or nucleic acid may be composed of the sequence, or may have additional amino acids or nuclei at one or both ends of the protein or nucleic acid. Glycolic acid, but still has the activity described in the present invention.
- methionine encoded by the start codon at the N-terminus of the polypeptide will be retained under certain actual conditions (for example, when expressed in a specific expression system), but does not substantially affect the function of the polypeptide.
- CRISPR effector protein generally refers to the nuclease present in the naturally-occurring CRISPR system, as well as its modified form, its variant, its catalytically active fragment, and the like.
- the term covers any effector protein based on the CRISPR system that can achieve gene targeting (such as gene editing, gene targeted regulation, etc.) in cells.
- Cas9 nuclease examples include Cas9 nuclease or variants thereof.
- the Cas9 nuclease may be a Cas9 nuclease from different species, such as spCas9 from S. pyogenes or SaCas9 derived from S. aureus.
- Cas9 nuclease and Cas9 are used interchangeably herein, and refer to RNA comprising Cas9 protein or fragments thereof (for example, a protein containing the active DNA cleavage domain of Cas9 and/or the gRNA binding domain of Cas9) Guided nuclease.
- Cas9 is a component of the CRISPR/Cas (clustered regularly spaced short palindrome repeats and related systems) genome editing system, which can target and cleave the DNA target sequence under the guidance of the guide RNA to form a DNA double-strand break (DSB) ).
- CRISPR/Cas clustered regularly spaced short palindrome repeats and related systems
- CRISPR effector proteins may also include Cpf1 nuclease or variants thereof such as highly specific variants.
- the Cpf1 nuclease may be Cpf1 nuclease from different species, for example, Cpf1 nuclease from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006.
- CRISPR effector protein can also be derived from Cas3, Cas8a, Cas5, Cas8b, Cas8c, Cas10d, Cse1, Cse2, Csy1, Csy2, Csy3, GSU0054, Cas10, Csm2, Cmr5, Cas10, Csx11, Csx10, Csf1, Csn2, Cas4 , C2c1, C2c3 or C2c2 nucleases, for example, include these nucleases or functional variants thereof.
- Gene as used herein not only covers chromosomal DNA present in the nucleus, but also includes organelle DNA present in subcellular components of the cell (such as mitochondria, plastids).
- organism includes any organism suitable for genome editing, preferably eukaryotes.
- organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats; poultry such as chickens, ducks, and geese; plants include monocots and dicots, For example, rice, corn, wheat, sorghum, barley, soybean, peanut, Arabidopsis and so on.
- Genetically modified organism or “genetically modified cell” means an organism or cell that contains exogenous polynucleotides or modified genes or expression control sequences in its genome.
- exogenous polynucleotides can be stably integrated into the genome of organisms or cells, and inherited for successive generations.
- the exogenous polynucleotide can be integrated into the genome alone or as part of a recombinant DNA construct.
- the modified gene or expression control sequence contains single or multiple deoxynucleotide substitutions, deletions and additions in the organism or cell genome.
- Form in terms of sequence means a sequence from a foreign species, or if from the same species, a sequence that has undergone significant changes in composition and/or locus from its natural form through deliberate human intervention.
- nucleic acid sequence is used interchangeably and are single-stranded or double-stranded RNA or DNA polymers, optionally containing synthetic, non-natural Or changed nucleotide bases.
- Nucleotides are referred to by their single letter names as follows: “A” is adenosine or deoxyadenosine (respectively RNA or DNA), “C” is cytidine or deoxycytidine, and “G” is guanosine or Deoxyguanosine, “U” means uridine, “T” means deoxythymidine, “R” means purine (A or G), “Y” means pyrimidine (C or T), “K” means G or T, “ H” means A or C or T, “I” means inosine, and “N” means any nucleotide.
- Polypeptide “peptide”, and “protein” are used interchangeably in the present invention and refer to a polymer of amino acid residues.
- the term applies to amino acid polymers in which one or more amino acid residues are corresponding artificial chemical analogs of naturally occurring amino acids, as well as to naturally occurring amino acid polymers.
- the terms "polypeptide”, “peptide”, “amino acid sequence” and “protein” may also include modified forms, including but not limited to glycosylation, lipid linkage, sulfation, gamma carboxylation of glutamic acid residues, hydroxyl And ADP-ribosylation.
- Suitable conservative amino acid substitutions are known to those skilled in the art and can generally be made without changing the biological activity of the resulting molecule.
- those skilled in the art recognize that a single amino acid substitution in a non-essential region of a polypeptide does not substantially change the biological activity (see, for example, Watson et al., Molecular Biology of the Gene, 4th Edition, 1987, The Benjamin/Cummings Pub .co.,p.224).
- expression construct refers to a vector suitable for expression of a nucleotide sequence of interest in an organism, such as a recombinant vector.
- “Expression” refers to the production of a functional product.
- the expression of a nucleotide sequence may refer to the transcription of the nucleotide sequence (such as transcription to generate mRNA or functional RNA) and/or the translation of RNA into a precursor or mature protein.
- the "expression construct" of the present invention can be a linear nucleic acid fragment, a circular plasmid, a viral vector, or, in some embodiments, can be an RNA (such as mRNA) that can be translated.
- the "expression construct" of the present invention may contain regulatory sequences and nucleotide sequences of interest from different sources, or regulatory sequences and nucleotide sequences of interest from the same source but arranged in a manner different from those normally occurring in nature.
- regulatory sequence and “regulatory element” are used interchangeably and refer to the upstream (5' non-coding sequence), middle or downstream (3' non-coding sequence) of the coding sequence, and affect the transcription, RNA processing, or processing of the related coding sequence. Stability or translated nucleotide sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.
- Promoter refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment.
- the promoter is a promoter capable of controlling gene transcription in a cell, regardless of whether it is derived from the cell.
- the promoter can be a constitutive promoter or a tissue-specific promoter or a developmentally regulated promoter or an inducible promoter.
- tissue-specific promoter and “tissue-preferred promoter” are used interchangeably, and refer to mainly but not necessarily exclusively expressed in a tissue or organ, and can also be expressed in a specific cell or cell type The promoter.
- tissue-preferred promoter refers to a promoter whose activity is determined by developmental events.
- inducible promoters selectively express operably linked DNA sequences in response to endogenous or exogenous stimuli (environment, hormones, chemical signals, etc.).
- operably linked refers to the connection of regulatory elements (for example, but not limited to, promoter sequences, transcription termination sequences, etc.) to nucleic acid sequences (for example, coding sequences or open reading frames) such that the nucleotides The transcription of the sequence is controlled and regulated by the transcription control element.
- regulatory elements for example, but not limited to, promoter sequences, transcription termination sequences, etc.
- nucleic acid sequences for example, coding sequences or open reading frames
- "Introducing" nucleic acid molecules such as plasmids, linear nucleic acid fragments, RNA, etc.
- proteins into an organism refers to transforming the cells of the organism with the nucleic acid or protein so that the nucleic acid or protein can function in the cell.
- the "transformation” used in the present invention includes stable transformation and transient transformation.
- “Stable transformation” refers to the introduction of an exogenous nucleotide sequence into the genome, resulting in the stable inheritance of the exogenous nucleotide sequence. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and any successive generations thereof.
- Transient transformation refers to the introduction of nucleic acid molecules or proteins into cells to perform functions without stable inheritance of exogenous nucleotide sequences. In transient transformation, the foreign nucleic acid sequence is not integrated into the genome.
- Proteins refer to the physiological, morphological, biochemical or physical characteristics of cells or organisms.
- “Agronomic traits” especially refer to the measurable index parameters of crop plants, including but not limited to: leaf green, grain yield, growth rate, total biomass or accumulation rate, fresh weight at maturity, dry weight at maturity, fruit Yield, seed yield, plant total nitrogen content, fruit nitrogen content, seed nitrogen content, plant nutrient tissue nitrogen content, plant total free amino acid content, fruit free amino acid content, seed free amino acid content, plant nutrient tissue free amino acid content, plant total protein Content, fruit protein content, seed protein content, plant nutrient tissue protein content, herbicide resistance, drought resistance, nitrogen absorption, root lodging, harvest index, stem lodging, plant height, ear height, ear length, disease resistance Resistance, cold resistance, salt resistance and tiller number.
- the present invention provides a base editing fusion protein, which comprises APOBEC3B deaminase or a mutant of APOBEC3B deaminase fused with a CRISPR effector protein.
- base editing fusion protein and “base editor” can be used interchangeably.
- the base editing fusion protein containing APOBEC3B deaminase or its mutants of the present invention can perform efficient base editing on target sequences, and at the same time has significantly reduced genome-wide random off-target effects compared with other base editors.
- the base editing fusion protein comprising APOBEC3B deaminase or a mutant thereof of the present invention has a shortened editing window for the target sequence, and can achieve more precise base editing.
- the APOBEC3B deaminase mutant is or derived from human APOBEC3B deaminase.
- An exemplary wild-type human APOBEC3B deaminase comprises the amino acid sequence shown in SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is or derived from the C-terminal domain (hA3Bctd, APOBEC3B C-terminal domain) of human APOBEC3B deaminase.
- An exemplary wild-type hA3Bctd includes the amino acid sequence of SEQ ID NO: 2.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions at one or more of the following positions: 210th, 211th, 214th, 230th, 240th, 281th, 308th, 311th, 313th, 314th And the 315th position, wherein the amino acid position is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions at one or more of the following positions: position 211, position 214, position 308, position 311, position 313, position 314, and position 315, wherein the amino acid positions refer to SEQ ID NO: 19 Sure.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 211 and 311, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 211 and 313, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 211 and 314, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 311 and 313, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 214 and 314, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 314 and 315, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 211, 311 and 314, wherein the positions of the amino acids are determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 211, 214 and 313, wherein the positions of the amino acids are determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- the amino acid substitutions at positions 214, 314 and 315, wherein the positions of the amino acids are determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- One or more amino acid substitutions selected from: R210A, R210K3, R211K, T214C, T214G, T214S, T214V, L230K, N240A, W281H, F308K, R311K, Y313F, D314R, D314H, Y315M, wherein the amino acid positions refer to SEQ ID NO: 19 confirmed.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd
- One or more amino acid substitutions selected from the following: R211K, T214V, F308K, R311K, Y313F, D314R, D314H, and Y315M, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions R211K and R311K, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions R211K and Y313F, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Contains amino acid substitutions R211K and D314R, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions R311K and Y313F, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions T214V and D314R, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions D314R and Y315M, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions R211K, R311K and D314R, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions R211K, T214V and Y313F, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant is derived from human APOBEC3B deaminase (hA3B) or the C-terminal domain (hA3Bctd) of human APOBEC3B deaminase, and contains relative to wild-type hA3B or hA3Bctd Amino acid substitutions T214V, D314H and Y315M, wherein the position of the amino acid is determined with reference to SEQ ID NO: 19.
- the APOBEC3B deaminase mutant comprises an amino acid sequence selected from SEQ ID NO: 3-18, 26-31, and 32-34.
- the CRISPR effector protein is a "nuclease-inactivated CRISPR effector protein".
- nuclease-inactivated CRISPR effector protein refers to the loss of the double-stranded nucleic acid cleavage activity of the CRISPR effector protein, but still retains the gRNA-directed DNA targeting ability.
- CRISPR effector proteins lacking double-stranded nucleic acid cleavage activity also encompass nickases, which form a nick in the double-stranded nucleic acid molecule, but do not completely cut the double-stranded nucleic acid.
- the nuclease-inactivated CRISPR effector protein of the present invention has nickase activity. Without being limited by any theory, it is believed that mismatch repair in eukaryotes guides the removal and repair of mismatched bases in the DNA strand through nicks in the strand.
- the U:G mismatch formed by the action of cytidine deaminase may be repaired to C:G. By introducing a cut on a chain containing an unedited G, it will be possible to preferentially repair the U:G mismatch to the desired U:A or T:A.
- the nuclease-inactivated CRISPR effector protein is nuclease-inactivated Cas9.
- the DNA cleavage domain of Cas9 nuclease is known to contain two subdomains: HNH nuclease subdomain and RuvC subdomain.
- the HNH subdomain cleaves the strand complementary to gRNA, while the RuvC subdomain cleaves the non-complementary strand. Mutations in these subdomains can inactivate the nuclease activity of Cas9, forming "nuclease-inactivated Cas9".
- the nuclease-inactivated Cas9 still retains the gRNA-directed DNA binding ability. Therefore, in principle, when fused with another protein, the nuclease-inactivated Cas9 can target the additional protein to almost any DNA sequence simply by co-expression with a suitable guide RNA.
- the nuclease-inactivated Cas9 of the present invention can be derived from Cas9 of different species, for example, derived from S. pyogenes Cas9 (SpCas9), or derived from Staphylococcus aureus (S. aureus) Cas9 (SaCas9). ). Simultaneously mutating the HNH nuclease subdomain and RuvC subdomain of Cas9 (for example, including mutations D10A and H840A) inactivates the nuclease of Cas9 and becomes nuclease death Cas9 (dCas9). Mutation and inactivation of one of the subdomains can make Cas9 have nickase activity, that is, obtain Cas9 nickase (nCas9), for example, nCas9 with only mutation D10A.
- SpCas9 S. pyogenes Cas9
- SaCas9 Staphylococc
- the nuclease-inactivated Cas9 of the present invention contains the amino acid substitution D10A and/or H840A relative to the wild-type Cas9.
- the nuclease-inactivated Cas9 may also contain additional mutations.
- SpCas9 with nuclease inactivation may also include EQR, VQR, or VRER mutations
- SaCas9 may also include KKH mutations (Kim et al. Nat. Biotechnol. 35, 371-376.).
- the nuclease-inactivated SpCas9 includes the amino acid sequence shown in SEQ ID NO:35.
- the nuclease-inactivated CRISPR effector protein is nuclease-inactivated Cpf1.
- Cpf1 contains a DNA cleavage domain (RuvC), which can be mutated to delete the DNA cleavage activity of Cpf1, forming "Cpf1 with lack of DNA cleavage activity".
- the Cpf1 lacking DNA cleavage activity still retains the DNA binding ability guided by gRNA. Therefore, in principle, when fused with another protein, Cpf1 lacking DNA cleavage activity can target the additional protein to almost any DNA sequence simply by co-expression with a suitable guide RNA.
- the Cpf1 lacking DNA cleavage activity of the present invention can be derived from Cpf1 of different species, for example, Cpf1 proteins derived from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006 called FnCpf1, AsCpf1 and LbCpf1, respectively.
- the Cpf1 lacking DNA cleavage activity is FnCpf1 lacking DNA cleavage activity. In some embodiments, the FnCpf1 lacking DNA cleavage activity comprises a D917A mutation relative to the wild-type FnCpf1.
- the Cpf1 lacking DNA cleavage activity is AsCpf1 lacking DNA cleavage activity. In some embodiments, the AsCpf1 lacking DNA cleavage activity comprises a D908A mutation relative to the wild-type AsCpf1.
- the Cpf1 lacking DNA cleavage activity is LbCpf1 lacking DNA cleavage activity.
- the LbCpf1 lacking DNA cleavage activity comprises a D832A mutation relative to the wild-type LbCpf1.
- the APOBEC3B deaminase or APOBEC3B deaminase mutant is fused to the N-terminus of the CRISPR effector protein (for example, a nuclease-inactivated CRISPR effector protein, such as Cas9 or Cpf1).
- the CRISPR effector protein for example, a nuclease-inactivated CRISPR effector protein, such as Cas9 or Cpf1.
- the APOBEC3B deaminase or APOBEC3B deaminase mutant and the CRISPR effector protein are fused via a linker.
- the joint can be 1-50 long (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 20-25, 25-50) or more amino acids, non-functional amino acid sequences without secondary or higher structure.
- the joint may be a flexible joint.
- the linker is 16 or 32 amino acids long.
- the linker is the XTEN linker shown in SEQ ID NO: 36 or 37.
- uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), resulting in the repair of U:G to C:G. Therefore, without being limited by any theory, the inclusion of a uracil DNA glycosylase inhibitor in the base editing fusion protein of the present invention will be able to increase the efficiency of base editing.
- the base editing fusion protein further comprises Uracil DNA Glycosylase Inhibitor (UGI).
- URI Uracil DNA Glycosylase Inhibitor
- the uracil DNA glycosylase inhibitor comprises the amino acid sequence shown in SEQ ID NO: 38.
- the base editing fusion protein of the present invention further comprises a nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- one or more NLS in the base editing fusion protein should have sufficient strength to drive the base editing fusion protein to accumulate in the nucleus of the cell in an amount that can realize its base editing function.
- the strength of nuclear localization activity is determined by the number and position of NLS in the base editing fusion protein, one or more specific NLS used, or a combination of these factors.
- the NLS of the base editing fusion protein of the present invention may be located at the N-terminus and/or C-terminus. In some embodiments of the present invention, the NLS of the base editing fusion protein of the present invention may be located between the APOBEC3B deaminase or APOBEC3B deaminase mutant and the CRISPR effector protein. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS at or near the N-terminus.
- the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS at or near the C-terminus. In some embodiments, the base editing fusion protein includes a combination of these, such as one or more NLS at the N-terminus and one or more NLS at the C-terminus. When there is more than one NLS, each one can be selected as not dependent on the other NLS. In some preferred embodiments of the present invention, the base editing fusion protein comprises at least 2 NLS, for example, the at least 2 NLS are located at the C-terminus. In some embodiments, the NLS is located at the C-terminus of the base editing fusion protein. In some embodiments, the base editing fusion protein comprises at least 3 NLS.
- NLS consists of one or more short sequences of positively charged lysine or arginine exposed on the surface of the protein, but other types of NLS are also known.
- Non-limiting examples of NLS include: PKKKRKV or KRPAATKKAGQAKKKK.
- the N-terminus of the base editing fusion protein includes the NLS of the amino acid sequence shown in PKKKRKV. In some embodiments of the present invention, the C-terminus of the base editing fusion protein includes the NLS of the amino acid sequence shown in KRPAATKKAGQAKKKK. In some embodiments of the present invention, the C-terminus of the base editing fusion protein includes the NLS of the amino acid sequence shown in PKKKRKV.
- the base editing fusion protein of the present invention may also include other positioning sequences, such as cytoplasmic positioning sequence, chloroplast positioning sequence, mitochondrial positioning sequence, and the like.
- the present invention also provides the use of the base editing fusion protein of the present invention to base edit the target sequence in the cell genome.
- the present invention also provides a system for base editing a target sequence in a cell genome, which comprises at least one of the following i) to v):
- the base editing fusion protein of the present invention and an expression construct containing a nucleotide sequence encoding a guide RNA;
- An expression construct comprising a nucleotide sequence encoding the base editing fusion protein of the present invention, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
- the guide RNA can target the base editing fusion protein to the target sequence in the cell genome.
- base editing system refers to a combination of components required for base editing of the genome of a cell or organism.
- the various components of the system such as base editing fusion protein and one or more guide RNAs, may exist independently of each other, or may exist in any combination as a composition.
- guide RNA and “gRNA” are used interchangeably, and refer to RNA that can form a complex with the CRISPR effector protein and can target the complex to the target sequence due to a certain identity with the target sequence molecular.
- the guide RNA targets the target sequence by base pairing with the complementary strand of the target sequence.
- the gRNA used by Cas9 nuclease or its functional variants is usually composed of crRNA and tracrRNA molecules that are partially complementary to form a complex, wherein the crRNA contains sufficient identity with the target sequence to hybridize with the complementary strand of the target sequence and guide
- the CRISPR complex (Cas9+crRNA+tracrRNA) is a guide sequence (also called a seed sequence) that specifically binds to the target sequence.
- sgRNA single guide RNA
- the gRNA used by Cpf1 nuclease or its functional variants is usually composed of mature crRNA molecules only, which can also be called sgRNA. Designing a suitable gRNA based on the CRISPR effector protein used and the target sequence to be edited is within the abilities of those skilled in the art.
- the base editing system of the present invention contains more than one guide RNA, so that more than one target sequence can be base edited at the same time.
- the nucleotide sequence encoding the base editing fusion protein is codon-optimized for the organism from which the cell to be base edited is derived.
- Codon optimization refers to replacing at least one codon of the natural sequence with a codon that is used more frequently or most frequently in the gene of the host cell (e.g., about or more than about 1, 2, 3, 4, 5, 10 , 15, 20, 25, 50 or more codons while maintaining the natural amino acid sequence to modify the nucleic acid sequence to enhance expression in the host cell of interest.
- a codon that is used more frequently or most frequently in the gene of the host cell e.g., about or more than about 1, 2, 3, 4, 5, 10 , 15, 20, 25, 50 or more codons
- Codon preference is often related to the translation efficiency of messenger RNA (mRNA), and the translation efficiency is considered to depend on the nature and the nature of the codon being translated
- mRNA messenger RNA
- tRNA transfer RNA
- genes can be tailored to be the best in a given organism based on codon optimization. Good gene expression. Codon utilization tables can be easily obtained, such as the "Codon Usage Database” available at www.kazusa.orjp/codon/ , and these tables can be adjusted in different ways Applicable. See, Nakamura Y. et al., "Codon usage tabulated from the international DNA sequence databases: status for the year 2000. Nucl. Acids Res., 28:292 (2000).
- the guide RNA is a single guide RNA (sgRNA).
- sgRNA single guide RNA
- the method of constructing a suitable sgRNA based on a given target sequence is known in the art. For example, see the literature: Wang, Y. et al. Simultaneous editing of three homoeoalleles in hexaploid bread wheat conflicts heritable resistance to powdery mildew. Nat. Biotechnol. 32,947-951 (2014); Shan, Q.et gen. Target modified.
- the nucleotide sequence encoding the base editing fusion protein and/or the nucleotide sequence encoding the guide RNA are operably linked to an expression control element such as a promoter.
- promoters examples include, but are not limited to, polymerase (pol) I, pol II, or pol III promoters.
- the pol I promoter include chicken RNA pol I promoter.
- pol II promoters include, but are not limited to, cytomegalovirus immediate early (CMV) promoter, Rous sarcoma virus long terminal repeat (RSV-LTR) promoter, and simian virus 40 (SV40) immediate early promoter.
- pol III promoters include U6 and H1 promoters.
- An inducible promoter such as a metallothionein promoter can be used.
- promoters include T7 phage promoter, T3 phage promoter, ⁇ -galactosidase promoter, and Sp6 phage promoter.
- the promoter can be cauliflower mosaic virus 35S promoter, maize Ubi-1 promoter, wheat U6 promoter, rice U3 promoter, maize U3 promoter, rice actin promoter.
- Organisms whose genome can be modified by the base editing system of the present invention include any organisms suitable for base editing, preferably eukaryotes.
- organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats; poultry such as chickens, ducks, and geese; plants, including monocots and dicots
- the plant is a crop plant, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava, and potato.
- the organism is a plant. More preferably, the organism is rice.
- the present invention provides a method for producing a genetically modified organism, comprising the base editing fusion protein of the present invention, or an expression construct comprising the base editing fusion protein of the present invention or the present invention
- the system for base editing of target sequences in the cell genome is introduced into organism cells.
- the guide RNA can target the base editing fusion protein to the target sequence in the cell genome of the organism, resulting in the target sequence
- One or more of C is replaced by T.
- the organism is a plant.
- target sequences that can be recognized and targeted by the CRISPR effector protein and the guide RNA complex are within the skill of those of ordinary skill in the art.
- the method of the present invention also includes screening for organisms such as plants with the desired nucleotide substitutions.
- the nucleotide substitution in organisms such as plants can be detected by T7EI, PCR/RE or sequencing methods, for example, see Shan, Q., Wang, Y., Li, J. & Gao, C. Genome editing in rice and wheat using the CRISPR/Cas system. Nat. Protoc. 9, 2395-2410 (2014).
- the target sequence to be modified can be located anywhere in the genome, for example, in a functional gene such as a protein-coding gene, or, for example, can be located in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve Modification of gene function or modification of gene expression.
- the C to T base editing in the cell target sequence can be detected by T7EI, PCR/RE or sequencing methods.
- the base editing system can be introduced into cells by various methods well known to those skilled in the art.
- Methods that can be used to introduce the genome editing system of the present invention into cells include, but are not limited to: calcium phosphate transfection, protoplast fusion, electroporation, liposome transfection, microinjection, viral infection (such as baculovirus, vaccinia virus, adenovirus) Viruses, adeno-associated viruses, lentiviruses and other viruses), gene bombardment, PEG-mediated transformation of protoplasts, and Agrobacterium-mediated transformation.
- Cells that can be genome edited by the method of the present invention can be derived from, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cows, and cats; poultry such as chickens, ducks, and geese; plants, including monads.
- mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cows, and cats
- poultry such as chickens, ducks, and geese
- plants including monads.
- Leafy plants and dicotyledonous plants such as rice, corn, wheat, sorghum, barley, soybean, peanut, Arabidopsis, etc.
- the method of the present invention is particularly suitable for producing genetically modified plants, such as crop plants.
- the base editing system can be introduced into the plant by various methods well known to those skilled in the art.
- the methods that can be used to introduce the base editing system of the present invention into plants include, but are not limited to: gene bombardment, PEG-mediated transformation of protoplasts, Agrobacterium-mediated transformation, plant virus-mediated transformation, pollen tube passage method, and seed Room injection.
- the base editing system is introduced into the plant by transient transformation.
- the target sequence can be modified by introducing or producing the base editing fusion protein and guide RNA into plant cells, and the modification can be inherited stably without the need to edit the base.
- the system stably transforms plants. This avoids the potential off-target effects of the stable base editing system, and also avoids the integration of exogenous nucleotide sequences in the plant genome, thereby having higher biological safety.
- the introduction is performed in the absence of selective pressure, so as to avoid the integration of foreign nucleotide sequences in the plant genome.
- the introduction includes transforming the base editing system of the present invention into an isolated plant cell or tissue, and then regenerating the transformed plant cell or tissue into a whole plant.
- the regeneration is performed in the absence of selective pressure, that is, no selective agent for the selective gene carried on the expression vector is used during the tissue culture process. Not using selection agents can improve plant regeneration efficiency and obtain modified plants that do not contain exogenous nucleotide sequences.
- the base editing system of the present invention can be transformed into specific parts on the whole plant, such as leaves, stem tips, pollen tubes, young ears or hypocotyls. This is particularly suitable for the transformation of plants that are difficult to undergo tissue culture regeneration.
- the protein expressed in vitro and/or the RNA molecule transcribed in vitro is directly transformed into the plant.
- the protein and/or RNA molecule can realize base editing in plant cells and then be degraded by the cell, avoiding the integration of foreign nucleotide sequences in the plant genome.
- genetic modification and breeding of plants using the method of the present invention can obtain plants without foreign DNA integration, that is, transgene-free modified plants.
- the base editing system of the present invention has high specificity (low off-target rate) when performing base editing in plants, which also improves biological safety.
- Plants that can be base edited by the method of the present invention include monocotyledonous plants and dicotyledonous plants.
- the plant may be a crop plant such as wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava, or potato.
- the target sequence is related to plant traits such as agronomic traits, whereby the base editing causes the plant to have an altered trait relative to a wild-type plant.
- the target sequence to be modified can be located anywhere in the genome, for example, in a functional gene such as a protein-coding gene, or, for example, can be located in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve Modification of gene function or modification of gene expression.
- the C to T substitution results in an amino acid substitution in the target protein.
- the C to T substitution results in a change in the expression of the target gene.
- the method further includes obtaining progeny of the genetically modified plant.
- the present invention also provides a genetically modified plant or its progeny or part thereof, wherein the plant is obtained by the above-mentioned method of the present invention.
- the genetically modified plant or progeny or part thereof is non-transgenic.
- the present invention also provides a plant breeding method, comprising crossing the genetically modified first plant obtained by the above-mentioned method of the present invention with a second plant not containing the genetic modification, thereby combining the The genetic modification is introduced into the second plant.
- Example 1 Selection of A3Bctd mutation site based on protein structure information
- the candidate single-base editing system is optimized on the A3A-BE3 vector backbone (SEQ ID NO: 1, including the base editor of human APOBEC3A), using artificially synthesized A3Bctd DNA fragments (SEQ ID NO: 2) using the Gbison method Replace the APOBEC3A sequence in the A3A-BE3 vector to obtain the A3Bctd-BE3 vector.
- A3Bctd-BE3 vector fusion PCR and Gbison were used to perform point mutations on the encoded amino acids on A3Bctd to obtain A3Bctd-R210A-BE3, A3Bctd-R210K-BE3, A3Bctd-R211K-BE3, A3Bctd-T214C-BE3, A3Bctd- T214G-BE3, A3Bctd-T214S-BE3, A3Bctd-T214V-BE3, A3Bctd-L230K-BE3, A3Bctd-N240A-BE3, A3Bctd-W281H-BE3, A3Bctd-F308K-BE3, A3Bctd-A3Bctd-R311K-BE3 BE3, A3Bctd-D314R-BE3, A3Bctd-D314H-BE3, A3Bctd-Y315M-BE3 point mutation
- the constructed control plasmids include A3A-BE3, YEE-BE3, RK-BE3, eA3A-BE3, A3A-R128A-BE3, A3A-Y130F-BE3, and untruncated APOBEC3B-BE3 (see the deaminase sequence in SEQ ID NO: 19-25), where YEE and RK are two variants of APOBEC1 deaminase on the BE3 vector, constructed by fusion PCR and Gbison.
- the A3A deaminase sequence is artificially synthesized, and the R128A and Y130F of A3A are constructed by fusion PCR and Gbison.
- the guide RNA vectors used in this experiment include pSp-sgRNA and pSa-sgRNA vectors.
- the eight targets shown in Table 1 were constructed respectively.
- the target of -T1 was constructed into pSp-sgRNA vector using restriction enzyme digestion and ligation method, as The guide RNA vector for detecting the targeting efficiency, the target at the end of -SaT1 or -SaT2 is constructed into the pSa-sgRNA vector by restriction enzyme digestion and ligation method, as a vector for detecting random off-target ability by the TA-AS method.
- the principle of the TA-AS method is to co-transfect with the base editing system to be tested (such as the base editing system based on nSpCas9 in this experiment) with its orthogonal (that is, cannot share gRNA), and can produce other single-stranded regions.
- a CRISPR system such as the nSaCas9 system, whereby other orthogonal CRISPR systems select a location in the genome to produce a long-term stable single-stranded region. If the base editing system to be tested has a random off-target effect in the genome, it will Deamination at the C base of this single-stranded region causes undesirable editing. The random off-target effect of the single-base editing system can be detected efficiently and simply by the high-throughput sequencing of the amplicons at selected sites.
- each single base The base editing system and its own guide RNA vector pSp-sgRNA and pnSaCsa9 and the corresponding pSa-sgRNA in the TA-AS system are used to transform rice protoplasts. After two days of culture, the target site amplicons are sequenced, and four targets are selected. The average value of the site and the four off-target sites to evaluate the targeting efficiency and off-target efficiency.
- Each single-base editing system has at least three biological repetitions for each target site.
- the results are shown in Figure 2 and Figure 3. It was found that there are eight point mutations R211K, T214V, F308K, R311K, Y313F, D314R, D314H, and Y315M that can maintain high mutation efficiency while reducing off-target efficiency. Seven of them are on Loop1 and Loop7. These seven variants were combined to further improve specificity.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Crystallography & Structural Chemistry (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Peptides Or Proteins (AREA)
- Enzymes And Modification Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Document Processing Apparatus (AREA)
- Saccharide Compounds (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims (39)
- 一种碱基编辑融合蛋白,其包含与CRISPR效应蛋白融合的APOBEC3B脱氨酶或APOBEC3B脱氨酶突变体。
- 权利要求1的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体是或衍生自人APOBEC3B脱氨酶,例如,所述人APOBEC3B脱氨酶包含SEQ ID NO:19所示氨基酸序列。
- 权利要求1的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体是或衍生自人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),例如,所述A3Bctd包含SEQ ID NO:2的氨基酸序列。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在以下一或多个位置的氨基酸取代:第210位、第211位、第214位、第230位、第240位、第281位、第308位、第311位、第313位、第314位和第315位,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在以下一或多个位置的氨基酸取代:第211位、第214位、第308位、第311位、第313位、第314位和第315位,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第211位和第311位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第211位和第313位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第211位和第314位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第311位和第313位的氨基酸取代,,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第214位和第314位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第314位和第315位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第211位、第311位和第314位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第211位、第214位和第313位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含在第214位、第314位和第315位的氨基酸取代,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含选自以下的一或多个氨基酸取代:R210A、R210K3、R211K、T214C、T214G、T214S、T214V、L230K、N240A、W281H、F308K、R311K、Y313F、D314R、D314H、Y315M,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含选自以下的一或多个氨基酸取代:R211K、T214V、F308K、R311K、Y313F、D314R、D314H与Y315M,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代R211K和R311K,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍 生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代R211K和Y313F,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含包含氨基酸取代R211K和D314R,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代R311K和Y313F,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代T214V和D314R,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代D314R和Y315M,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代R211K、R311K和D314R,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代R211K、T214V和Y313F,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求2或3的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体衍生自人APOBEC3B脱氨酶(hA3B)或人APOBEC3B脱氨酶的C-末端结构域(hA3Bctd),且相对于野生型的hA3B或hA3Bctd包含氨基酸取代T214V、D314H和Y315M,其中所述氨基酸位置参考SEQ ID NO:19确定。
- 权利要求1的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶突变体包含选自SEQ ID NO:3-18、26-31和32-34的氨基酸序列。
- 权利要求1-26中任一项的碱基编辑融合蛋白,其中所述CRISPR效应蛋白是核酸酶失活的CRISPR效应蛋白,例如,其是CRISPR切口酶。
- 权利要求27的碱基编辑融合蛋白,其中所述核酸酶失活的CRISPR效应蛋白是 核酸酶失活的Cas9,其相对于野生型Cas9包含氨基酸取代D10A和/或H840A,所述核酸酶失活的Cas9包含SEQ ID NO:35所示的氨基酸序列。
- 权利要求1-28中任一项的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶或APOBEC3B脱氨酶突变体融合至所述CRISPR效应蛋白的N端。
- 权利要求1-29中任一项的碱基编辑融合蛋白,其中所述APOBEC3B脱氨酶或APOBEC3B脱氨酶突变体和所述CRISPR效应蛋白通过接头融合,例如所述接头是SEQ ID NO:36或37所示的接头。
- 权利要求1-30中任一项的碱基编辑融合蛋白,其中所述碱基编辑融合蛋白还包含尿嘧啶DNA糖基化酶抑制剂(UGI),例如,所述尿嘧啶DNA糖基化酶抑制剂包含SEQ ID NO:38所示的氨基酸序列。
- 权利要求1-31中任一项的碱基编辑融合蛋白,其中所述碱基编辑融合蛋白还包含核定位序列(NLS)。
- 一种用于对细胞基因组中的靶序列进行碱基编辑的***,其包含以下i)至v)中至少一项:i)权利要求1-32中任一项的碱基编辑融合蛋白,和向导RNA;ii)包含编码权利要求1-32中任一项的碱基编辑融合蛋白的核苷酸序列的表达构建体,和向导RNA;iii)权利要求1-32中任一项的碱基编辑融合蛋白,和包含编码向导RNA的核苷酸序列的表达构建体;iv)包含编码权利要求1-32中任一项的碱基编辑融合蛋白的核苷酸序列的表达构建体,和包含编码向导RNA的核苷酸序列的表达构建体;v)包含编码权利要求1-32中任一项的碱基编辑融合蛋白的核苷酸序列和编码向导RNA的核苷酸序列的表达构建体;其中所述向导RNA能够将所述碱基编辑融合蛋白靶向细胞基因组中的靶序列。
- 权利要求33的***,其包含多于一种向导RNA或其表达构建体,从而可以同时对多于一个靶序列进行碱基编辑。
- 权利要求33或34的***,其中所述编码碱基编辑融合蛋白的核苷酸序列针对待进行碱基编辑的细胞所来自的生物体进行密码子优化。
- 权利要求33-35中任一项的***,所述向导RNA是单向导RNA(sgRNA)。
- 权利要求33-36中任一项的***,其中所述编码碱基编辑融合蛋白的核苷酸序列和/或所述编码向导RNA的核苷酸序列与表达调控元件如启动子可操作地连接。
- 一种产生经遗传修饰的生物体的方法,包括将权利要求1-32中任一项的碱基编辑融合蛋白、或者包含编码权利要求1-32中任一项的碱基编辑融合蛋白的核苷酸序列的表达构建体、或者权利要求33-36中任一项的用于对细胞基因组中的靶序列进行碱基编辑的***导入生物体细胞。
- 权利要求38的方法,其中所述生物体是植物。
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022553071A JP2023517890A (ja) | 2020-03-04 | 2021-03-04 | 改善されたシトシン塩基編集システム |
KR1020227034519A KR20220150363A (ko) | 2020-03-04 | 2021-03-04 | 개선된 사이토신 염기 편집 시스템 |
BR112022017732A BR112022017732A2 (pt) | 2020-03-04 | 2021-03-04 | Sistema aperfeiçoado de edição de base de citosina |
AU2021229415A AU2021229415A1 (en) | 2020-03-04 | 2021-03-04 | Improved cytosine base editing system |
EP21764693.4A EP4130257A4 (en) | 2020-03-04 | 2021-03-04 | IMPROVED CYTOSINE BASE EDITING SYSTEM |
CN202180019220.7A CN115427564A (zh) | 2020-03-04 | 2021-03-04 | 改进的胞嘧啶碱基编辑*** |
CA3174615A CA3174615A1 (en) | 2020-03-04 | 2021-03-04 | Improved cytosine base editing system |
US17/909,570 US20230313234A1 (en) | 2020-03-04 | 2021-03-04 | Improved cytosine base editing system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010145047.2 | 2020-03-04 | ||
CN202010145047 | 2020-03-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021175288A1 true WO2021175288A1 (zh) | 2021-09-10 |
Family
ID=77613907
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/079086 WO2021175288A1 (zh) | 2020-03-04 | 2021-03-04 | 改进的胞嘧啶碱基编辑*** |
Country Status (9)
Country | Link |
---|---|
US (1) | US20230313234A1 (zh) |
EP (1) | EP4130257A4 (zh) |
JP (1) | JP2023517890A (zh) |
KR (1) | KR20220150363A (zh) |
CN (1) | CN115427564A (zh) |
AU (1) | AU2021229415A1 (zh) |
BR (1) | BR112022017732A2 (zh) |
CA (1) | CA3174615A1 (zh) |
WO (1) | WO2021175288A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114686456A (zh) * | 2022-05-10 | 2022-07-01 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑***及其应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018218188A2 (en) * | 2017-05-25 | 2018-11-29 | The General Hospital Corporation | Base editors with improved precision and specificity |
WO2019042284A1 (en) * | 2017-09-01 | 2019-03-07 | Shanghaitech University | FUSION PROTEINS FOR ENHANCED PRECISION IN THE BASIC EDITION |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016046635A1 (en) * | 2014-09-25 | 2016-03-31 | Institut Pasteur | Methods for characterizing human papillomavirus associated cervical lesions |
IL294014B2 (en) * | 2015-10-23 | 2024-07-01 | Harvard College | Nucleobase editors and their uses |
US10961525B2 (en) * | 2017-07-05 | 2021-03-30 | The Trustees Of The University Of Pennsylvania | Hyperactive AID/APOBEC and hmC dominant TET enzymes |
US11332749B2 (en) * | 2017-07-13 | 2022-05-17 | Regents Of The University Of Minnesota | Real-time reporter systems for monitoring base editing |
EP3841203A4 (en) * | 2018-08-23 | 2022-11-02 | The Broad Institute Inc. | CAS9 VARIANTS WITH NON-CANONICAL PAM SPECIFICITIES AND USES OF THEM |
-
2021
- 2021-03-04 WO PCT/CN2021/079086 patent/WO2021175288A1/zh unknown
- 2021-03-04 CA CA3174615A patent/CA3174615A1/en active Pending
- 2021-03-04 CN CN202180019220.7A patent/CN115427564A/zh active Pending
- 2021-03-04 JP JP2022553071A patent/JP2023517890A/ja active Pending
- 2021-03-04 BR BR112022017732A patent/BR112022017732A2/pt unknown
- 2021-03-04 KR KR1020227034519A patent/KR20220150363A/ko unknown
- 2021-03-04 EP EP21764693.4A patent/EP4130257A4/en not_active Withdrawn
- 2021-03-04 US US17/909,570 patent/US20230313234A1/en active Pending
- 2021-03-04 AU AU2021229415A patent/AU2021229415A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018218188A2 (en) * | 2017-05-25 | 2018-11-29 | The General Hospital Corporation | Base editors with improved precision and specificity |
WO2019042284A1 (en) * | 2017-09-01 | 2019-03-07 | Shanghaitech University | FUSION PROTEINS FOR ENHANCED PRECISION IN THE BASIC EDITION |
Non-Patent Citations (9)
Title |
---|
COELHO, MA ET AL.: "BE-FLARE: a Fluorescent Reporter of Base Editing Activity Reveals Editing Characteristics of APOBEC3A and APOBEC3B", BMC BIOLOGY, vol. 16, 28 December 2018 (2018-12-28), XP055751951, ISSN: 1741-7007, DOI: 10.1186/s12915-018-0617-1 * |
LIANG, Z. ET AL.: "Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system", J GENET GENOMICS, vol. 41, 2014, pages 63 - 68 |
NAKAMURA, Y. ET AL.: "Codon usage tabulated from the international DNA sequence databases: status for the year 2000", NUCL. ACIDS RES., vol. 28, 2000, pages 292, XP002941557, DOI: 10.1093/nar/28.1.292 |
SAMBROOK, J.FRITSCH, E.F.MANIATIS, T.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
See also references of EP4130257A4 |
SHAN, Q. ET AL.: "Targeted genome modification of crop plants using a CRISPR-Cas system", NAT. BIOTECHNOL., vol. 31, 2013, pages 686 - 688, XP055216828, DOI: 10.1038/nbt.2650 |
SHAN, Q.WANG, Y.LI, J.GAO, C.: "Genome editing in rice and wheat using the CRISPR/Cas system", NAT. PROTOC., vol. 9, 2014, pages 2395 - 2410, XP055927452, DOI: 10.1038/nprot.2014.157 |
WANG, Y. ET AL.: "Simultaneous editing of three homoeoalleles in hexaploid bread wheat confers heritable resistance to powdery mildew", NAT. BIOTECHNOL., vol. 32, 2014, pages 947 - 951, XP055390922, DOI: 10.1038/nbt.2969 |
WATSON ET AL.: "Molecular Biology of the Gene", 1987, THE BENJAMIN/CUMMINGS PUB. CO., pages: 224 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114686456A (zh) * | 2022-05-10 | 2022-07-01 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑***及其应用 |
CN114686456B (zh) * | 2022-05-10 | 2023-02-17 | 中山大学 | 基于双分子脱氨酶互补的碱基编辑***及其应用 |
Also Published As
Publication number | Publication date |
---|---|
US20230313234A1 (en) | 2023-10-05 |
CN115427564A (zh) | 2022-12-02 |
JP2023517890A (ja) | 2023-04-27 |
AU2021229415A1 (en) | 2022-10-06 |
EP4130257A4 (en) | 2024-05-01 |
KR20220150363A (ko) | 2022-11-10 |
BR112022017732A2 (pt) | 2023-01-17 |
EP4130257A1 (en) | 2023-02-08 |
EP4130257A9 (en) | 2024-04-24 |
CA3174615A1 (en) | 2021-09-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019120310A1 (en) | Base editing system and method based on cpf1 protein | |
WO2021032155A1 (zh) | 一种碱基编辑***和其使用方法 | |
BR112020010594A2 (pt) | método para edição de bases em plantas | |
WO2023169454A1 (zh) | 腺嘌呤脱氨酶及其在碱基编辑中的用途 | |
WO2021185358A1 (zh) | 一种提高植物遗传转化和基因编辑效率的方法 | |
WO2021082830A1 (zh) | 靶向性修饰植物基因组序列的方法 | |
US20240117368A1 (en) | Multiplex genome editing method and system | |
WO2023169410A1 (zh) | 胞嘧啶脱氨酶及其在碱基编辑中的用途 | |
WO2020087631A1 (zh) | 基于C2c1核酸酶的基因组编辑***和方法 | |
CN112805385B (zh) | 基于人apobec3a脱氨酶的碱基编辑器及其用途 | |
CN117264998A (zh) | 双功能基因组编辑***及其用途 | |
WO2021175288A1 (zh) | 改进的胞嘧啶碱基编辑*** | |
EP4242237A1 (en) | Foki nuclease domain variant | |
JP2024501892A (ja) | 新規の核酸誘導型ヌクレアーゼ | |
WO2022188816A1 (zh) | 改进的cg碱基编辑*** | |
WO2024051850A1 (zh) | 基于dna聚合酶的基因组编辑***和方法 | |
WO2020117837A1 (en) | Methods and compositions for improving silage | |
WO2022199665A1 (zh) | 一种提高植物遗传转化和基因编辑效率的方法 | |
WO2023227050A1 (zh) | 一种在基因组中定点***外源序列的方法 | |
WO2021098709A1 (zh) | 衍生自黄杆菌的基因编辑*** | |
US20230357756A1 (en) | Compositions, methods, and systems for cell labeling | |
WO2023232109A1 (zh) | 新的crispr基因编辑*** | |
US20230407278A1 (en) | Compositions and methods for cas9 molecules with improved gene editing properties | |
CN116622758A (zh) | 一种提高植物遗传转化和基因编辑效率的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21764693 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2022553071 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 3174615 Country of ref document: CA |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112022017732 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 20227034519 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2021229415 Country of ref document: AU Date of ref document: 20210304 Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2021764693 Country of ref document: EP Effective date: 20221004 |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01E Ref document number: 112022017732 Country of ref document: BR Free format text: 1) COM BASE NA PORTARIA 48 DE 20/06/2022, SOLICITA-SE QUE SEJA APRESENTADO, EM ATE 60 (SESSENTA) DIAS, NOVO CONTEUDO DE LISTAGEM DE SEQUENCIA POIS O CONTEUDO APRESENTADO NA PETICAO NO 870220085893 DE 20/09/2022 NAO POSSUI TODOS OS CAMPOS OBRIGATORIOS INFORMADOS, NAO CONSTANDO O CAMPO 150 / 151 .2) FAVOR EFETUAR, EM ATE 60 (SESSENTA) DIAS, O PAGAMENTO DE GRU CODIGO DE SERVICO 260 PARA A REGULARIZACAO DO PEDIDO, CONFORME ART 2O 1O DA RESOLUCAO 189/2017 E NOTA DE ESCLARECIMENTO PUBLICADA NA RPI 2421 DE 30/05/2017, UMA VEZ QUE A PETICAO NO 870220085893 DE 20/09/2022 APRESENTA DOCUMENTOS REFERENTES A DOIS SERVICOS DIVERSOS (COMPLEMENTACAO E MODIFICACAO DO RESUMO) TENDO SIDO PAGA SOMENTE UMA RET |
|
ENP | Entry into the national phase |
Ref document number: 112022017732 Country of ref document: BR Kind code of ref document: A2 Effective date: 20220902 |