CN109957569A - Base editing system and method based on Cpf1 protein

Publication number
CN109957569A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811578853.8A
Other languages
English (en)
Other versions
CN109957569B (zh)
Inventor
Gao Caixia (高彩霞)
Wang Yanpeng (王延鹏)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Qihe Biotechnology Co ltd
Original Assignee
Institute of Genetics and Developmental Biology of CAS

Classifications

    • C12N15/102 Mutagenizing nucleic acids
    • C12N15/113 Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex-forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • C12N15/8213 Targeted insertion of genes into the plant genome by homologous recombination
    • C12N15/8216 Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/85 Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/90 Stable introduction of foreign DNA into chromosome
    • C12N15/907 Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
    • C12N9/22 Ribonucleases RNAses, DNAses
    • C12N9/78 Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
    • C12Y305/04001 Cytosine deaminase (3.5.4.1)
    • C07K2319/00 Fusion polypeptide
    • C12N2310/20 Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]

Abstract

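The present invention relates to the field of genetic engineering. Specifically, it relates to a base editing method based on the Cpf1 protein. More specifically, it relates to a method for efficient base editing of a target sequence in the genome of an organism (e.g., a plant) using a guide RNA-directed Cpf1-deaminase fusion protein, and to genetically modified organisms (e.g., plants) produced by the method and their progeny.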

Description

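Base editing system and method based on Cpf1 protein
Technical Field
The present invention relates to the field of genetic engineering. Specifically, it relates to a base editing system and method based on the Cpf1 protein. More specifically, it relates to a system and method for efficient base editing of a target sequence in the genome of an organism (e.g., a plant) using a guide RNA-directed Cpf1-deaminase fusion protein, and to genetically modified organisms (e.g., plants) produced by the method and their progeny.
Background Art
Efficient crop improvement requires access to new genetic mutations that can be readily introduced into modern cultivars. Genetic studies, especially genome-wide association studies, indicate that single-nucleotide changes are the main cause of differences in crop traits. Single-base variation can cause amino acid substitutions, giving rise to superior alleles and superior traits. Before genome editing emerged, Targeting Induced Local Lesions IN Genomes (TILLING) could serve as a method for generating the mutations urgently needed in crop improvement. However, TILLING screening is time-consuming and laborious, and the point mutations identified are often limited in number and type. Genome editing technologies, particularly those based on the CRISPR/Cas9 system, can introduce specific base substitutions at genomic loci through the homologous recombination (HR)-mediated DNA repair pathway. At present, however, successful use of this approach is severely limited, mainly because HR-mediated repair of double-strand breaks occurs at very low frequency in plants. In addition, effectively delivering sufficient DNA repair template remains a major obstacle. These problems make it a major challenge to achieve site-directed mutagenesis in plants efficiently and simply by HR.
In recent years, by exploiting the DNA-binding properties of Cas9 and the properties of DNA deaminases, fusions of Cas9 with a deaminase have enabled precise single-base conversion of cytosine (C) to thymine (T) and of adenine (A) to guanine (G) in target genes. Current systems for C-to-T conversion mainly include SpnCas9-BE3, SpnCas9-AID, and fusions with Cas9 variants such as VQR-BE3, EQR-BE3 and VRER-BE3, as well as SaCas9-BE3 and its variant SaKKH-BE3; these combinations relax the PAM restriction on C-to-T conversion and make the editing window more variable. In addition, David Liu's laboratory at Harvard University recently used artificial evolution to develop an adenine deaminase that acts on ssDNA; fused to Cas9, it gives the Cas9-ABE system, which converts A to G in DNA and further extends the reach of single-base editing. Although these studies have greatly broadened the scope of single-base editing of DNA, current single-base editing technology still has many problems. First, because the PAMs of Cas9 and Cas9 variants are generally restricted to G/C-rich regions, the PAM types available to single-base editing systems still need to be expanded. Second, because the specificity of Cas9 and of the editing itself is relatively poor, the specificity of single-base editing systems still needs to be improved. Third, because nCas9-BE3 and its BE3 variants, as well as nCas9-ABE, usually produce a single-strand nick on the non-targeted strand at the target site, mismatch repair readily produces DNA deletions and insertions alongside the single-base mutation, so the fidelity of single-base editing also needs to be improved. Therefore, there remains an urgent need in the art for new systems and methods for base editing of plant genomes.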
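Description of the Drawings
Figure 1. Optimization of CPF1-mediated cleavage activity in plant genomes.
Figure 2. CPF1-mediated C-to-T mutation in plant genomes.
Figure 3. CPF1-mediated A-to-G mutation in plant genomes.
Figure 4. Simultaneous base editing at multiple sites using the RNA cleavage activity of CPF1.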
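Summary of the Invention
I. Definitions
In the present invention, unless otherwise specified, the scientific and technical terms used herein have the meanings commonly understood by those skilled in the art. In addition, the terms and laboratory procedures relating to protein and nucleic acid chemistry, molecular biology, cell and tissue culture, microbiology and immunology used herein are terms and routine procedures widely used in the corresponding fields. For example, the standard recombinant DNA and molecular cloning techniques used in the present invention are well known to those skilled in the art and are described more fully in: Sambrook, J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter “Sambrook”). To aid understanding of the present invention, definitions and explanations of relevant terms are provided below.
“Cpf1 nuclease”, “Cpf1 protein” and “Cpf1” are used interchangeably herein and refer to an RNA-guided nuclease comprising a Cpf1 protein or a fragment thereof. Cpf1 is a component of the CRISPR-Cpf1 genome editing system; guided by a guide RNA (crRNA), it targets and cleaves a DNA target sequence to form a DNA double-strand break (DSB). The Cpf1 protein contains a DNA cleavage domain and a separate RNA cleavage domain. The RNA cleavage domain of Cpf1 can process pre-crRNA into mature crRNA.
“Guide RNA” and “gRNA” are used interchangeably herein. The guide RNA of a Cpf1-mediated genome editing system usually consists of a mature crRNA molecule alone, where the crRNA contains a sequence sufficiently identical to the target sequence to hybridize with the complement of the target sequence and direct sequence-specific binding of the complex (Cpf1 + crRNA) to the target sequence.
“Deaminase” refers to an enzyme that catalyzes a deamination reaction. In some embodiments of the present invention, the deaminase is a cytosine deaminase, which catalyzes the deamination of cytidine or deoxycytidine to uridine or deoxyuridine, respectively. In some embodiments of the present invention, the deaminase is an adenine deaminase, which can catalyze the conversion of adenosine or deoxyadenosine (A) to inosine (I).
“Genome”, when applied to plant cells, covers not only chromosomal DNA present in the nucleus but also organellar DNA present in subcellular components of the cell (e.g., mitochondria, plastids).
As used herein, “organism” includes any organism suitable for genome editing, preferably a eukaryote. Examples of organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle and cats; poultry such as chickens, ducks and geese; and plants, including monocots and dicots, such as rice, maize, wheat, sorghum, barley, soybean, peanut and Arabidopsis thaliana.
“Genetically modified organism” or “genetically modified cell” means an organism or cell that contains within its genome an exogenous polynucleotide or a modified gene or expression regulatory sequence. For example, an exogenous polynucleotide can be stably integrated into the genome of the organism or cell and inherited through successive generations. The exogenous polynucleotide may be integrated into the genome alone or as part of a recombinant DNA construct. A modified gene or expression regulatory sequence is one in which the sequence in the genome of the organism or cell contains one or more deoxynucleotide substitutions, deletions or additions. For example, a genetically modified organism obtained by the present invention may contain one or more C-to-T or A-to-G substitutions relative to the wild type (the corresponding organism without the genetic modification).
“Exogenous”, with respect to a sequence, means a sequence from a foreign species or, if from the same species, a sequence whose composition and/or genomic locus has been substantially altered from its native form by deliberate human intervention.
“Polynucleotide”, “nucleic acid sequence”, “nucleotide sequence” and “nucleic acid fragment” are used interchangeably and denote a single- or double-stranded RNA or DNA polymer, optionally containing synthetic, non-natural or altered nucleotide bases. Nucleotides are referred to by their single-letter designations as follows: “A” for adenosine or deoxyadenosine (for RNA or DNA, respectively), “C” for cytidine or deoxycytidine, “G” for guanosine or deoxyguanosine, “U” for uridine, “T” for deoxythymidine, “R” for purines (A or G), “Y” for pyrimidines (C or T), “K” for G or T, “H” for A or C or T, “I” for inosine, and “N” for any nucleotide.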
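“Polypeptide”, “peptide” and “protein” are used interchangeably in the present invention and refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residues are artificial chemical analogues of the corresponding naturally occurring amino acids, as well as to naturally occurring amino acid polymers. The terms “polypeptide”, “peptide”, “amino acid sequence” and “protein” may also include modified forms, including but not limited to glycosylation, lipid attachment, sulfation, γ-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation.
As used in the present invention, “expression construct” refers to a vector, such as a recombinant vector, suitable for expressing a nucleotide sequence of interest in an organism. “Expression” refers to the production of a functional product. For example, expression of a nucleotide sequence may refer to transcription of the nucleotide sequence (e.g., to produce mRNA or functional RNA) and/or translation of RNA into a precursor or mature protein.
The “expression construct” of the present invention may be a linear nucleic acid fragment, a circular plasmid or a viral vector or, in some embodiments, a translatable RNA (e.g., mRNA).
The “expression construct” of the present invention may comprise regulatory sequences and nucleotide sequences of interest from different sources, or regulatory sequences and nucleotide sequences of interest from the same source but arranged differently from how they normally occur in nature.
“Regulatory sequence” and “regulatory element” are used interchangeably and refer to a nucleotide sequence located upstream of (5′ non-coding sequence), within, or downstream of (3′ non-coding sequence) a coding sequence, which influences the transcription, RNA processing or stability, or translation of the associated coding sequence.
Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns and polyadenylation recognition sequences.
“Promoter” refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment. In some embodiments of the present invention, the promoter is one capable of controlling gene transcription in cells of the organism, whether or not it is derived from that organism. The promoter may be a constitutive, tissue-specific, developmentally regulated or inducible promoter.
“Constitutive promoter” refers to a promoter that generally causes a gene to be expressed in most cell types under most conditions. “Tissue-specific promoter” and “tissue-preferred promoter” are used interchangeably and refer to a promoter that is expressed mainly, but not necessarily exclusively, in one tissue or organ, and which may also be expressed in one specific cell or cell type. “Developmentally regulated promoter” refers to a promoter whose activity is determined by developmental events. An “inducible promoter” selectively expresses an operably linked DNA sequence in response to an endogenous or exogenous stimulus (environment, hormone, chemical signal, etc.).
As used herein, the term “operably linked” means that a regulatory element (for example, but not limited to, a promoter sequence or a transcription termination sequence) is linked to a nucleic acid sequence (e.g., a coding sequence or an open reading frame) such that transcription of the nucleotide sequence is controlled and regulated by the transcriptional regulatory element. Techniques for operably linking a regulatory element region to a nucleic acid molecule are known in the art.
“Introducing” a nucleic acid molecule (e.g., a plasmid, linear nucleic acid fragment or RNA) or a protein into an organism means transforming cells of the organism with the nucleic acid or protein so that it can function in the cells. “Transformation” as used in the present invention includes stable transformation and transient transformation.
“Stable transformation” refers to introducing an exogenous nucleotide sequence into the genome so that the exogenous gene is stably inherited. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and of any successive generations.
“Transient transformation” refers to introducing a nucleic acid molecule or protein into a cell, where it performs its function without stable inheritance of an exogenous gene. In transient transformation, the exogenous nucleic acid sequence is not integrated into the genome.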
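As used herein, the term “plant” includes a whole plant and any progeny, cell, tissue or part of a plant. The term “plant part” includes any part of a plant, including, for example but not limited to: seeds (including mature seeds, immature embryos without seed coats, and immature seeds); plant cuttings; plant cells; plant cell cultures; and plant organs (e.g., pollen, embryos, flowers, fruits, shoots, leaves, roots, stems and related explants). A plant tissue or plant organ may be a seed, a callus, or any other population of plant cells organized into a structural or functional unit. A plant cell or tissue culture can regenerate a plant having the physiological and morphological characteristics of the plant from which the cell or tissue was derived, and can regenerate a plant having substantially the same genotype as that plant. In contrast, some plant cells cannot regenerate plants. Regenerable cells in a plant cell or tissue culture may be embryos, protoplasts, meristematic cells, callus, pollen, leaves, anthers, roots, root tips, silks, flowers, kernels, ears, cobs, husks or stalks.
Plant parts include harvestable parts and parts that can be used to propagate progeny plants. Plant parts that can be used for propagation include, for example but not limited to: seeds; fruits; cuttings; seedlings; tubers; and rootstocks. A harvestable part of a plant may be any useful part of the plant, including, for example but not limited to: flowers; pollen; seedlings; tubers; leaves; stems; fruits; seeds; and roots.
A plant cell is the structural and physiological unit of a plant. As used herein, plant cells include protoplasts and protoplasts with a partial cell wall. A plant cell may be in the form of an isolated single cell or a cell aggregate (e.g., friable callus and cultured cells), and may be part of a higher-order organizational unit (e.g., a plant tissue, plant organ or plant). Thus, a plant cell may be a protoplast, a gamete-producing cell, or a cell or collection of cells capable of regenerating into a whole plant. Accordingly, in the embodiments herein, a seed, which contains multiple plant cells and is capable of regenerating into a whole plant, is considered a “plant part”.
As used herein, the term “protoplast” refers to a plant cell whose cell wall has been completely or partially removed and whose lipid bilayer membrane is exposed. Typically, a protoplast is an isolated plant cell without a cell wall that has the potential to regenerate into a cell culture or a whole plant.
“Progeny” of a plant includes any subsequent generation of the plant.
“Trait” refers to a physiological, morphological, biochemical or physical characteristic of a plant or of a particular plant material or cell. In some embodiments, these characteristics may be visible to the naked eye, such as the size of seeds or plants; indicators measurable by biochemical techniques, such as the protein, starch or oil content of seeds or leaves; observable metabolic or physiological processes, such as measured tolerance to water stress or to particular salt, sugar or nitrogen concentrations; detectable gene expression levels; or observable agronomic traits such as tolerance to osmotic stress or yield. In some embodiments, traits also include plant ploidy, for example haploidy, which is important for plant breeding. In some embodiments, traits also include resistance of the plant to herbicides.
“Agronomic traits” are measurable parameters, including but not limited to: leaf greenness, grain yield, growth rate, total biomass or rate of accumulation, fresh weight at maturity, dry weight at maturity, fruit yield, seed yield, total plant nitrogen content, fruit nitrogen content, seed nitrogen content, nitrogen content of vegetative tissue, total plant free amino acid content, fruit free amino acid content, seed free amino acid content, free amino acid content of vegetative tissue, total plant protein content, fruit protein content, seed protein content, protein content of vegetative tissue, drought tolerance, nitrogen uptake, root lodging, harvest index, stalk lodging, plant height, ear height, ear length, disease resistance, cold tolerance, salt tolerance and tiller number.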
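II. Base editing system based on Cpf1 protein
The present invention provides a system for base editing of a target sequence in the genome of an organism, comprising at least one of the following i) to v):
i) a base editing fusion protein, and a guide RNA;
ii) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and a guide RNA;
iii) a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
iv) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
v) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein and a nucleotide sequence encoding a guide RNA;
wherein the base editing fusion protein comprises a Cpf1 lacking DNA cleavage activity and a deaminase, and the guide RNA is capable of targeting the base editing fusion protein to a target sequence in the genome, resulting in one or more C-to-T or A-to-G substitutions in the target sequence.
Cpf1 contains a DNA cleavage domain (RuvC); mutating it abolishes the DNA cleavage activity of Cpf1, giving a “Cpf1 lacking DNA cleavage activity”. The Cpf1 lacking DNA cleavage activity still retains gRNA-directed DNA-binding ability. Thus, in principle, when fused to another protein, the Cpf1 lacking DNA cleavage activity can target that protein to almost any DNA sequence simply by co-expression with a suitable guide RNA.
The Cpf1 lacking DNA cleavage activity of the present invention may be derived from Cpf1 of different species, for example the Cpf1 proteins derived from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006, referred to respectively as FnCpf1 (e.g., wild-type amino acid sequence shown in SEQ ID NO:19), AsCpf1 (e.g., wild-type amino acid sequence shown in SEQ ID NO:18) and LbCpf1 (e.g., wild-type amino acid sequence shown in SEQ ID NO:20).
In some embodiments, the Cpf1 lacking DNA cleavage activity is an FnCpf1 lacking DNA cleavage activity. In some specific embodiments, the FnCpf1 lacking DNA cleavage activity comprises a D917A mutation relative to wild-type FnCpf1.
In some embodiments, the Cpf1 lacking DNA cleavage activity is an AsCpf1 lacking DNA cleavage activity. In some specific embodiments, the AsCpf1 lacking DNA cleavage activity comprises a D908A mutation relative to wild-type AsCpf1.
In some preferred embodiments, the Cpf1 lacking DNA cleavage activity is an LbCpf1 lacking DNA cleavage activity. In some specific embodiments, the LbCpf1 lacking DNA cleavage activity comprises a D832A mutation relative to wild-type LbCpf1.
In some embodiments, the Cpf1 lacking DNA cleavage activity retains its RNA cleavage activity and can therefore process pre-crRNA into mature crRNA. Accordingly, in some embodiments, the expression construct comprising a nucleotide sequence encoding the guide RNA in the system of the present invention may comprise a sequence encoding multiple different guide RNA (crRNA) precursors in tandem; after transcription, these can be processed by the Cpf1 lacking DNA cleavage activity into multiple different guide RNAs (crRNAs), thereby targeting multiple different target sequences simultaneously.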
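In some embodiments of the present invention, the deaminase in the fusion protein is a cytidine deaminase, for example an apolipoprotein B mRNA editing complex (APOBEC) family deaminase.
Cytidine deaminase can catalyze the deamination of cytidine (C) in DNA to form uracil (U). The present inventors surprisingly found that when a Cpf1 lacking DNA cleavage activity is fused to a cytidine deaminase, the fusion protein, directed by a guide RNA, can target a target sequence in the genome; because the DNA cleavage activity of Cpf1 is absent, the DNA double strand is not cleaved, while the cytidine deaminase in the fusion protein deaminates cytidine in the single-stranded DNA produced on formation of the Cpf1-guide RNA-DNA complex to U, and the C-to-T substitution is then achieved through base mismatch repair.
The cytidine deaminase of the present invention is in particular a cytidine deaminase that can accept single-stranded DNA as a substrate. Examples of cytidine deaminases usable in the present invention include, but are not limited to: APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G or CDA1. In some specific embodiments of the present invention, the cytidine deaminase comprises the amino acid sequence shown in SEQ ID NO:1.
Where the deaminase in the fusion protein is a cytidine deaminase, the base editing system of the present invention can mutate one or more C's in a genomic target sequence to T, and is also referred to as the Cpf1-PBE system.
In cells, uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), which repairs U:G back to C:G. Therefore, without being bound by any theory, including a uracil DNA glycosylase inhibitor in the base editing fusion protein or in the system of the present invention would increase the efficiency of base editing.
Accordingly, in some embodiments of the present invention relating to the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI). In some specific embodiments, the uracil DNA glycosylase inhibitor comprises the amino acid sequence shown in SEQ ID NO:2.
In some embodiments of the present invention, the deaminase is an adenine deaminase.
Naturally occurring adenine deaminases usually act on RNA, converting adenosine in single-stranded RNA to inosine (I) by deamination. Recently, directed evolution starting from the Escherichia coli tRNA adenine deaminase TadA has yielded DNA-dependent adenine deaminases that accept single-stranded DNA as a substrate and convert deoxyadenosine in single-stranded DNA to inosine (I). See Nicole M. Gaudelli et al., doi:10.1038/nature24644, 2017.
The present inventors surprisingly found that when a Cpf1 lacking DNA cleavage activity is fused to a DNA-dependent adenine deaminase, the fusion protein, directed by a guide RNA, can target a target sequence in the plant genome; because the DNA cleavage activity of Cpf1 is absent, the DNA double strand is not cleaved, while the DNA-dependent adenine deaminase in the fusion protein deaminates adenosine in the single-stranded DNA produced on formation of the Cpf1-guide RNA-DNA complex to inosine (I). Because DNA polymerase treats inosine (I) as guanine (G), an A-to-G substitution can be achieved through base mismatch repair. Therefore, where the deaminase in the fusion protein is a DNA-dependent adenine deaminase, the base editing system of the present invention can mutate one or more A's in a genomic target sequence to G, and is also referred to as the Cpf1-ABE system.
In some embodiments of the present invention, the DNA-dependent adenine deaminase is a variant of the E. coli tRNA adenine deaminase TadA (ecTadA), in particular a variant that can accept single-stranded DNA as a substrate, the variant comprising, relative to wild-type ecTadA, one or more sets of mutations selected from the following:
1) A106V and D108N;
2) D147Y and E155V;
3) L84F, H123Y and I156F;
4) A142N;
5) H36L, R51L, S146C and K157N;
6) P48S/T/A;
7) A142N;
8) W23L/R;
9) R152H/P.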
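In one specific embodiment of the present invention, the DNA-dependent adenine deaminase (ABE version 7.9) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, A142N, D147Y, E155V and R152P.
In one specific embodiment of the present invention, the DNA-dependent adenine deaminase (ABE version 7.10) comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, D147Y, E155V and R152P.
The wild-type ecTadA amino acid sequence is as follows: MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALLSDFFRMRRQEIKAQKKAQSSTD (SEQ ID NO:3). In some embodiments, the initial methionine may be absent.
The amino acid sequence of the preferred ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) is as follows: MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECAALLCYFFRMPRQVFNAQKKAQSSTD (SEQ ID NO:4). In some embodiments, the initial methionine may be absent.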
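The relationship between the mutation list and the two sequences can be checked mechanically. The following is a minimal sketch, not part of the patent: it applies the ABE 7.10 substitutions listed above to SEQ ID NO:3 and compares the result with SEQ ID NO:4. The function name `apply_mutations` and the constant names are illustrative assumptions; positions are taken as 1-based and count the initial methionine.

```python
import re

# Wild-type ecTadA (SEQ ID NO:3) and the ABE 7.10 variant (SEQ ID NO:4),
# copied from the text with the line-break spaces removed.
WT_ECTADA = (
    "MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNR"
    "PIGRHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIH"
    "SRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALL"
    "SDFFRMRRQEIKAQKKAQSSTD"
)
ABE710_ECTADA = (
    "MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNR"
    "AIGLHDPTAHAEIMALRQGGLVMQNYRLIDATLYVTFEPCVMCAGAMIH"
    "SRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGILADECAALL"
    "CYFFRMPRQVFNAQKKAQSSTD"
)
# Substitutions listed in the text for ABE version 7.10.
ABE710_MUTATIONS = [
    "W23R", "H36L", "R51L", "S146C", "K157N", "A106V", "D108N",
    "P48A", "L84F", "H123Y", "I156F", "D147Y", "E155V", "R152P",
]


def apply_mutations(sequence, mutations):
    """Apply substitutions such as 'W23R' (1-based positions) to a protein sequence."""
    residues = list(sequence)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"unrecognized mutation: {mutation}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        if residues[position - 1] != old:
            raise ValueError(f"{mutation}: found {residues[position - 1]} at position {position}")
        residues[position - 1] = new
    return "".join(residues)


# The mutated wild-type sequence should reproduce SEQ ID NO:4.
print(apply_mutations(WT_ECTADA, ABE710_MUTATIONS) == ABE710_ECTADA)
```

The check that the expected wild-type residue is present at each position guards against off-by-one numbering, for example when the initial methionine is omitted.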
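In some embodiments of the present invention, the deaminase is fused to the N-terminus of the Cpf1 lacking DNA cleavage activity. In some embodiments, the deaminase is fused to the C-terminus of the Cpf1 lacking DNA cleavage activity.
In some preferred embodiments, the corresponding wild-type adenine deaminase is fused to the N-terminus of the DNA-dependent adenine deaminase. Formation of a heterodimer between the DNA-dependent adenine deaminase and the wild-type adenine deaminase is expected to significantly increase the A-to-G editing activity of the fusion protein.
In some embodiments of the present invention, the deaminase and the Cpf1 lacking DNA cleavage activity are fused via a linker. The linker may be a non-functional amino acid sequence 1-50 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 20-25 or 25-50) or more amino acids in length, with no secondary or higher-order structure. For example, the linker may be a flexible linker such as GGGGS, GS, GAP, (GGGGS)x3, GGS or (GGS)x7. In some specific embodiments, the linker is an XTEN linker. In some specific embodiments, the linker is 32 amino acids long. In some specific embodiments, the amino acid sequence of the linker is: SGGSSGGSSGSETPGTSESATPESSGGSSGGS.
In some embodiments of the present invention, the base editing fusion protein further comprises a nuclear localization sequence (NLS). In general, the one or more NLSs in the base editing fusion protein should be strong enough to drive accumulation of the fusion protein in the nucleus of plant cells in an amount sufficient for its base editing function. In general, the strength of nuclear localization activity is determined by the number and position of NLSs in the base editing fusion protein, the particular NLS(s) used, or a combination of these factors.
In some embodiments of the present invention, the NLS of the base editing fusion protein may be located at the N-terminus and/or the C-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the N-terminus. In some embodiments, the base editing fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLSs at or near the C-terminus. In some embodiments, the base editing fusion protein comprises a combination of these, such as one or more NLSs at the N-terminus and one or more NLSs at the C-terminus. When more than one NLS is present, each may be selected independently of the others. In some preferred embodiments of the present invention, the base editing fusion protein comprises 2 NLSs, for example one at the N-terminus and one at the C-terminus.
In general, an NLS consists of one or more short sequences of positively charged lysines or arginines exposed on the protein surface, but other types of NLS are also known. Non-limiting examples of NLSs include: KKRKV (nucleotide sequence 5'-AAGAAGAGAAAGGTC-3'), PKKKRKV (nucleotide sequence 5'-CCCAAGAAGAAGAGGAAGGTG-3' or CCAAAGAAGAAGAGGAAGGTT), and SGGSPKKKRKV (nucleotide sequence 5'-TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG-3').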
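As a quick consistency check on the NLS examples above, the nucleotide sequences can be translated with the standard genetic code and compared with the stated peptides. This is a minimal sketch, not part of the patent; the names `CODON_TABLE`, `translate` and `NLS_EXAMPLES` are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence (length divisible by 3) into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# (nucleotide sequence, peptide) pairs as given in the text.
NLS_EXAMPLES = [
    ("AAGAAGAGAAAGGTC", "KKRKV"),
    ("CCCAAGAAGAAGAGGAAGGTG", "PKKKRKV"),
    ("CCAAAGAAGAAGAGGAAGGTT", "PKKKRKV"),
    ("TCGGGGGGGAGCCCAAAGAAGAAGCGGAAGGTG", "SGGSPKKKRKV"),
]

for dna, peptide in NLS_EXAMPLES:
    print(dna, translate(dna), translate(dna) == peptide)
```

The two nucleotide sequences given for PKKKRKV differ only in synonymous codons, which is why both translate to the same peptide.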
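In some embodiments of the present invention, the N-terminus of the base editing fusion protein comprises an NLS with the amino acid sequence PKKKRKV. In some embodiments of the present invention, the C-terminus of the base editing fusion protein comprises an NLS with the amino acid sequence SGGSPKKKRKV or KRPAATKKAGQAKKKK.
In addition, depending on the location of the DNA to be edited, the base editing fusion protein of the present invention may also include other localization sequences, such as a cytoplasmic localization sequence, a chloroplast localization sequence or a mitochondrial localization sequence.
In some embodiments of the present invention relating to the Cpf1-PBE system, the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI), with two NLSs immediately adjacent to the N-terminus or C-terminus of the UGI. In some preferred embodiments, the base editing fusion protein of the present invention comprises an amino acid sequence selected from SEQ ID NO:24-29.
To obtain efficient expression, in some embodiments of the present invention, the nucleotide sequence encoding the base editing fusion protein is codon-optimized for the species in which base editing is to be performed.
Codon optimization refers to a method of modifying a nucleic acid sequence to enhance expression in a host cell of interest by replacing at least one codon of the native sequence (e.g., about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50 or more codons) with codons that are more frequently or most frequently used in the genes of that host cell, while maintaining the native amino acid sequence. Different species show particular preferences for certain codons of a given amino acid. Codon bias (differences in codon usage between organisms) often correlates with the translation efficiency of messenger RNA (mRNA), which in turn is thought to depend on the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell generally reflects the codons used most frequently in peptide synthesis. Genes can therefore be tailored for optimal expression in a given organism on the basis of codon optimization. Codon usage tables are readily available, for example in the Codon Usage Database at www.kazusa.or.jp/codon/, and these tables can be adapted in different ways. See Nakamura, Y. et al., “Codon usage tabulated from the international DNA sequence databases: status for the year 2000”, Nucl. Acids Res., 28:292 (2000).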
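The idea of codon optimization described above can be illustrated with a small sketch: each codon is replaced by a preferred synonymous codon while the encoded amino acid sequence stays the same. This is only an illustration, not the procedure used in the patent. The `PREFERRED_CODON` table is hypothetical and chosen for the example; real preferences would be taken from a codon usage table for the host species.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# HYPOTHETICAL preferred codons for an unnamed host (illustration only).
PREFERRED_CODON = {"P": "CCC", "K": "AAG", "R": "AGG", "V": "GTG"}


def translate(dna):
    """Translate a coding DNA sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def codon_optimize(dna, preferred):
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    optimized = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        # Keep the original codon when no preference is given for this residue.
        optimized.append(preferred.get(amino_acid, codon))
    return "".join(optimized)


original = "CCAAAGAAGAAGAGGAAGGTT"  # one of the PKKKRKV coding sequences from the text
recoded = codon_optimize(original, PREFERRED_CODON)
print(recoded, translate(original) == translate(recoded))
```

Because only synonymous codons are exchanged, the translated peptide is unchanged, which is the defining constraint of codon optimization.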
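In some specific embodiments, the base editing fusion protein of the present invention is encoded by a nucleotide sequence selected from SEQ ID NO:8-9, 11-12 or 14-15.
In some embodiments of the present invention, the nucleotide sequence encoding the base editing fusion protein and/or the nucleotide sequence encoding the guide RNA is operably linked to an expression regulatory element such as a promoter.
Examples of promoters usable in the present invention include, but are not limited to, polymerase (pol) I, pol II and pol III promoters. Examples of pol I promoters include the chicken RNA pol I promoter. Examples of pol II promoters include, but are not limited to, the cytomegalovirus immediate early (CMV) promoter, the Rous sarcoma virus long terminal repeat (RSV-LTR) promoter and the simian virus 40 (SV40) immediate early promoter. Examples of pol III promoters include the U6 and H1 promoters. Inducible promoters such as the metallothionein promoter may be used. Other examples of promoters include the T7 phage promoter, the T3 phage promoter, the β-galactosidase promoter and the Sp6 phage promoter. For use in plants, the promoter may be the cauliflower mosaic virus 35S promoter, the maize Ubi-1 promoter, the wheat U6 promoter, the rice U3 promoter, the maize U3 promoter or the rice actin promoter.
Preferably, the guide RNA (crRNA) is expressed from the Ubi-1 promoter and matured by cleavage with a ribozyme such as the HDV ribozyme.
In one embodiment, adding an intron after the Ubi-1 promoter can enhance expression of the protein or RNA of interest.
In some specific embodiments, the expression construct for expressing the base editing fusion protein of the present invention comprises the expression cassette shown in SEQ ID NO:10 or 13. Alternatively, the expression construct comprises the expression regulatory sequence shown in SEQ ID NO:30.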
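III. Method for producing a genetically modified organism
In another aspect, the present invention provides a method for producing a genetically modified organism (e.g., a plant), comprising introducing the system of the present invention for base editing of a target sequence in the genome of an organism into cells of the organism, whereby the guide RNA targets the base editing fusion protein to the target sequence in the plant genome, resulting in one or more C's being replaced by T or one or more A's being replaced by G in the target sequence.
For the design of target sequences that can be recognized and targeted by the complex of Cpf1 protein and guide RNA (i.e., crRNA), or of crRNA coding sequences, see for example Zhang et al., Cell 163, 1-13, October 22, 2015. In general, the 5' end of a target sequence targeted by the genome editing system of the present invention must contain a protospacer adjacent motif (PAM), 5'-TTTN or 5'-YTN, where each N is independently selected from A, G, C and T, and Y is selected from C and T.
For example, in some embodiments of the present invention, the target sequence has the following structure: 5'-TTTN-Nx-3' or 5'-YTN-Nx-3', where each N is independently selected from A, G, C and T; Y is selected from C and T; X is an integer with 15 ≤ X ≤ 35; and Nx denotes X consecutive nucleotides.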
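The target structure 5'-TTTN-Nx-3' or 5'-YTN-Nx-3' translates directly into a pattern search. The sketch below is an illustration under stated assumptions, not a design tool from the patent: it scans only the given strand, the default protospacer length of 23 is an arbitrary value inside the stated range 15 ≤ X ≤ 35, and the names `PAM_REGEX` and `find_targets` are hypothetical.

```python
import re

# PAM motifs from the text written as regular expressions (N = any base, Y = C or T).
PAM_REGEX = {"TTTN": "TTT[ACGT]", "YTN": "[CT]T[ACGT]"}


def find_targets(sequence, pam="TTTN", spacer_len=23):
    """Return (start, PAM, protospacer) for every 5'-PAM-Nx-3' site on the given strand.

    spacer_len is X in the text and must satisfy 15 <= X <= 35.
    """
    if not 15 <= spacer_len <= 35:
        raise ValueError("spacer_len must be between 15 and 35")
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(r"(?=(%s)([ACGT]{%d}))" % (PAM_REGEX[pam], spacer_len))
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(sequence.upper())]


example = "AATTTC" + "ACGT" * 5 + "GG"  # toy sequence, not from the patent
for start, pam_seq, protospacer in find_targets(example, pam="TTTN", spacer_len=20):
    print(start, pam_seq, protospacer)
```

A practical search would also scan the reverse complement, since a target may lie on either strand; that step is omitted here to keep the sketch short.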
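In the present invention, the target sequence to be modified may be located anywhere in the genome, for example within a functional gene such as a protein-coding gene, or in a gene expression regulatory region such as a promoter or enhancer region, thereby modifying the function of the gene or its expression.
A-to-G or C-to-T base editing in the target sequence of the cell can be detected by T7EI, PCR/RE or sequencing methods.
In the method of the present invention, the base editing system can be introduced into cells by various methods well known to those skilled in the art. Methods that can be used to introduce the genome editing system of the present invention into cells include, but are not limited to: calcium phosphate transfection, protoplast fusion, electroporation, lipofection, microinjection, viral infection (e.g., baculovirus, vaccinia virus, adenovirus, adeno-associated virus, lentivirus and other viruses), biolistics (gene gun), PEG-mediated protoplast transformation and Agrobacterium-mediated transformation.
Cells that can be genome-edited by the method of the present invention may come from, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle and cats; poultry such as chickens, ducks and geese; and plants, including monocots and dicots, such as rice, maize, wheat, sorghum, barley, soybean, peanut and Arabidopsis thaliana.
The method of the present invention is particularly suitable for producing genetically modified plants, such as crop plants. In the method of the present invention for producing a genetically modified plant, the base editing system can be introduced into the plant by various methods well known to those skilled in the art. Methods that can be used to introduce the base editing system of the present invention into plants include, but are not limited to: biolistics, PEG-mediated protoplast transformation, Agrobacterium-mediated transformation, plant virus-mediated transformation, the pollen tube pathway method and the ovary injection method.
In the method of the present invention for producing a genetically modified plant, the target sequence can be modified simply by introducing or producing the base editing fusion protein and guide RNA in plant cells, and the modification can be stably inherited without stably transforming the plant with the base editing system. This avoids potential off-target effects of a stably present base editing system and also avoids integration of exogenous nucleotide sequences into the plant genome, and thus offers greater biosafety.
In some preferred embodiments, the introduction is performed in the absence of selection pressure, thereby avoiding integration of exogenous nucleotide sequences into the plant genome.
In some embodiments, the introduction comprises transforming isolated plant cells or tissues with the base editing system of the present invention and then regenerating the transformed plant cells or tissues into whole plants. Preferably, the regeneration is performed in the absence of selection pressure, that is, without using any selection agent against the selection gene carried on the expression vector during tissue culture. Omitting the selection agent can increase the regeneration efficiency of the plants and yields modified plants free of exogenous nucleotide sequences.
In other embodiments, the base editing system of the present invention can be transformed into a specific part of an intact plant, such as a leaf, shoot apex, pollen tube, young ear or hypocotyl. This is particularly suitable for transforming plants that are difficult to regenerate by tissue culture.
In some embodiments of the present invention, in vitro-expressed protein and/or in vitro-transcribed RNA molecules are transformed directly into the plant. The protein and/or RNA molecules can effect base editing in plant cells and are subsequently degraded by the cells, avoiding integration of exogenous nucleotide sequences into the plant genome.
Plants that can be base-edited by the method of the present invention include monocots and dicots. For example, the plant may be a crop plant such as wheat, rice, maize, soybean, sunflower, sorghum, oilseed rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava or potato.
In some embodiments of the present invention, the target sequence is associated with a plant trait such as an agronomic trait, whereby the base editing results in the plant having an altered trait relative to a wild-type plant.
In the present invention, the target sequence to be modified may be located anywhere in the genome, for example within a functional gene such as a protein-coding gene, or in a gene expression regulatory region such as a promoter or enhancer region, thereby modifying the function of the gene or its expression. Accordingly, in some embodiments of the present invention, the C-to-T or A-to-G substitution results in an amino acid substitution in the target protein or truncation of the target protein (by generating a stop codon). In other embodiments of the present invention, the C-to-T or A-to-G substitution results in a change in expression of the target gene.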
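To make the truncation case concrete for C-to-T editing, the sense codons that can become a stop codon can be enumerated. The sketch below is an illustration, not part of the patent: it tries every combination of C-to-T changes within a codon (editing on the coding strand) and, separately, every combination of G-to-A changes (the coding-strand view of C-to-T editing on the opposite strand). The function name `codons_convertible_to_stop` is hypothetical.

```python
from itertools import combinations, product

STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons_convertible_to_stop(from_base, to_base):
    """Sense codons that become a stop codon when one or more from_base are changed to to_base."""
    hits = set()
    for codon in map("".join, product("ACGT", repeat=3)):
        if codon in STOP_CODONS:
            continue  # already a stop codon, nothing to create
        sites = [i for i, base in enumerate(codon) if base == from_base]
        # Try every non-empty subset of editable positions.
        for count in range(1, len(sites) + 1):
            for chosen in combinations(sites, count):
                edited = "".join(
                    to_base if i in chosen else base for i, base in enumerate(codon)
                )
                if edited in STOP_CODONS:
                    hits.add(codon)
    return hits


print(sorted(codons_convertible_to_stop("C", "T")))  # C-to-T on the coding strand
print(sorted(codons_convertible_to_stop("G", "A")))  # C-to-T on the opposite strand
```

Under the standard genetic code this yields the Gln codons CAA and CAG and the Arg codon CGA for coding-strand edits, and the Trp codon TGG for opposite-strand edits.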
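In some embodiments of the present invention, the method further comprises obtaining progeny of the genetically modified plant.
In another aspect, the present invention also provides a genetically modified plant, or progeny or a part thereof, wherein the plant is obtained by the above method of the present invention.
In another aspect, the present invention also provides a plant breeding method, comprising crossing a first, genetically modified plant obtained by the above method of the present invention with a second plant that does not contain the genetic modification, thereby introducing the genetic modification into the second plant.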
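Examples
Construction of Ubi-CPF1-PBE/ABE expression vectors
The ABE, XTEN and dCPF1 sequences were codon-optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-ABE fragment was amplified using the primer pair HindIII-F (carrying a HindIII restriction site) and EcoRI (carrying an EcoRI restriction site). The PCR product was digested with HindIII and EcoRI and then inserted into the pJIT163-GFP vector (vector sequence shown in SEQ ID NO:16) digested with the same two enzymes, to produce the fusion expression vector dCPF1-ABE.
The PBE, XTEN and dCPF1 sequences were codon-optimized for plants and ordered from GenScript (Nanjing). The full-length dCPF1-PBE fragment was amplified using the primer pair HindIII-F (carrying a HindIII restriction site) and EcoRI (carrying an EcoRI restriction site). The PCR product was digested with HindIII and EcoRI and then inserted into the pJIT163-GFP vector (vector sequence shown in SEQ ID NO:16) digested with the same two enzymes, to produce the fusion expression vector dCPF1-PBE.
Construction of sgRNA expression vectors
sgRNA expression vectors were constructed as described previously (Wang, Y. et al. Simultaneous editing of three homoeoalleles in hexaploid bread wheat confers heritable resistance to powdery mildew. Nat. Biotechnol. 32, 947-951, 2014; Shan, Q. et al. Targeted genome modification of crop plants using a CRISPR-Cas system. Nat. Biotechnol. 31, 686-688, 2013; and Liang, Z. et al. Targeted mutagenesis in Zea mays using TALENs and the CRISPR/Cas system. J Genet Genomics. 41, 63-68, 2014) on the basis of pTaU6-sgRNA (Addgene ID53062), pOsU3-sgRNA (Addgene ID53063), pZmU3-sgRNA (Addgene ID5306) or OsU3/TaU6-tRNA-sgRNA (Zhang et al. 2017. Genome Biology. DOI:10.1186/s13059-017-1325-9). In addition, crRNA was produced by using a type II (pol II) promoter to drive a hammerhead ribozyme and the crRNA (Tang et al. Nature Plants, doi:10.1038/nplants.2017.18).
pUbi-mGFPP-crRNA, pUbi-DEP1-sgRNA, pUbi-DEP1-crRNA, pUbi-DME-crRNA.
BFP and GFP expression vectors
pUbi-mGFP; the vector sequence is shown in SEQ ID NO:17.
Protoplast assays
The wheat cultivar Bobwhite and the rice cultivar Nipponbare were used in this study. Protoplasts were transformed as described below. The average transformation efficiency was 55-70%. 10 μg of each plasmid was used for transformation by the PEG-mediated method; after 48 hours, the protoplasts were collected and DNA was extracted for T7EI and PCR-RE assays.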
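Wheat protoplast preparation and transformation
1) Take young wheat leaves, cut the middle section into 0.5-1 mm strips, place them in 0.6 M mannitol solution for 10 minutes in the dark, filter through a strainer, then transfer to 50 ml of enzyme solution and digest for 5 hours at 20-25°C in the dark with gentle shaking at 10 rpm.
2) Add 10 ml W5 to dilute the digest, and filter it through a 75 μm nylon mesh into a round-bottom centrifuge tube (50 ml).
3) Centrifuge at 100 g and 23°C for 3 min; discard the supernatant.
4) Gently resuspend in 10 ml W5, leave on ice for 30 min so that the protoplasts gradually settle, and discard the supernatant.
5) Resuspend in an appropriate volume of MMG and keep on ice until transformation.
6) To a 2 ml centrifuge tube add 10-20 μg plasmid, 200 μl protoplasts (about 4×10^5 cells) and 220 μl freshly prepared PEG solution; mix and leave at room temperature in the dark for 10-20 minutes to induce transformation.
7) After induction, slowly add 880 μl W5 solution, mix by gentle inversion, centrifuge horizontally at 100 g for 3 min, and aspirate and discard the supernatant.
8) Resuspend in 2 ml W5 solution, transfer to a six-well plate and culture in the dark at room temperature (or 25°C). If protoplast genomic DNA is to be extracted, culture for 48 h.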
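Rice protoplast preparation and transformation:
1) Isolate protoplasts from seedling leaf sheaths, cutting them into strips about 0.5 mm wide with a sharp blade.
2) Immediately after cutting, transfer to 0.6 M mannitol solution and leave in the dark for 10 min.
3) Filter off the mannitol solution, transfer to enzyme solution and vacuum-infiltrate in the dark for 30 min.
4) Digest in the dark for 5-6 h with gentle shaking (decolorizing shaker, speed 10).
5) After digestion, add an equal volume of W5 and shake horizontally for 10 sec to release the protoplasts.
6) Filter the protoplasts through a 40 μm nylon mesh into a 50 ml round-bottom centrifuge tube and rinse with additional W5 solution.
7) Pellet the protoplasts by horizontal centrifugation at 250 g for 3 min; aspirate and discard the supernatant.
8) Resuspend the protoplasts in 10 ml W5, centrifuge at 250 g for 3 min and discard the supernatant.
9) Resuspend the protoplasts in an appropriate volume of MMG solution to a concentration of 2×10^6/ml.
Note: all of the above steps are carried out at room temperature.
10) To a 2 ml centrifuge tube add 10-20 μg plasmid, 200 μl protoplasts (about 4×10^5 cells) and 220 μl freshly prepared PEG solution; mix and leave at room temperature in the dark for 10-20 minutes to induce transformation.
11) After induction, slowly add 880 μl W5 solution, mix by gentle inversion, centrifuge horizontally at 250 g for 3 min, and aspirate and discard the supernatant.
12) Resuspend in 2 ml WI solution, transfer to a six-well plate and culture in the dark at room temperature (or 25°C); if protoplast genomic DNA is to be extracted, culture for 48 h.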
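The cell number quoted in the transformation step follows from the suspension density in step 9: 200 μl of a 2×10^6 cells/ml suspension. A short sketch of that arithmetic, with a hypothetical helper name, is given here only as a convenience for scaling the protocol:

```python
def cells_in_aliquot(volume_ul, density_per_ml):
    """Number of cells in an aliquot, given its volume (microlitres) and density (cells per ml)."""
    return volume_ul * density_per_ml / 1000.0


# 200 ul of protoplasts at 2 x 10^6 cells/ml, as in the rice protocol.
print(cells_in_aliquot(200, 2e6))
```

This gives 4×10^5 cells per transformation, matching the figure stated in the protocol.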
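PCR/RE assay:
1) Extract plant genomic DNA.
2) Synthesize gene-specific primers and amplify a fragment containing the target site, 350-1000 bp in length:
10×EasyTaq Buffer        5 μl
dNTP (2.5 mM)            4 μl
Forward primer (10 μM)   2 μl
Reverse primer (10 μM)   2 μl
Easy Taq                 0.5 μl
DNA                      2 μl
ddH2O                    to 50 μl
3) Typical reaction conditions: denaturation at 94°C for 5 min; 30 to 35 cycles of denaturation at 94°C for 30 s, annealing at 58°C for 30 s and extension at 72°C for 30 s; 72°C for 5 min; hold at 12°C. Check 5 μl of the PCR product by electrophoresis.
4) Digest the PCR product with the restriction enzyme; a typical digestion reaction is as follows:
10×Fastdigest Buffer     2 μl
Restriction enzyme       1 μl
PCR product              3-5 μl
ddH2O                    to 20 μl
5) Digest at 37°C for 2-3 h. Analyze by 1.2% agarose gel electrophoresis.
6) Recover and purify the uncut mutant band from the PCR product and perform TA cloning. The reaction is as follows:
pEasy-T Vector                   1 μl
Recovered uncut PCR product      4 μl
7) Ligate at 22°C for 10 min, transform E. coli competent cells, plate on LB agar (Amp100, IPTG and X-gal), culture for 12-16 h, pick white colonies to identify positive clones, and send for sequencing.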
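The "to 50 μl" and "to 20 μl" entries in the reaction tables above mean that water makes up the remaining volume. The sketch below works that out for the PCR mix and the digestion mix. It is a convenience calculation, not part of the patent; the helper name `water_to_volume` is hypothetical, the second primer line of the PCR table is taken to be the reverse primer, and the digestion example uses 5 μl of PCR product from the stated 3-5 μl range.

```python
# Component volumes in microlitres, taken from the PCR table in the text.
PCR_MIX_UL = {
    "10x EasyTaq Buffer": 5.0,
    "dNTP (2.5 mM)": 4.0,
    "forward primer (10 uM)": 2.0,
    "reverse primer (10 uM)": 2.0,  # assumed: the text lists the forward primer twice
    "Easy Taq": 0.5,
    "template DNA": 2.0,
}

# Digestion mix with 5 ul of PCR product (upper end of the 3-5 ul range).
DIGEST_MIX_UL = {
    "10x Fastdigest Buffer": 2.0,
    "restriction enzyme": 1.0,
    "PCR product": 5.0,
}


def water_to_volume(components_ul, final_volume_ul):
    """Volume of water needed to bring the listed components up to the final volume."""
    used = sum(components_ul.values())
    if used > final_volume_ul:
        raise ValueError("components exceed the final reaction volume")
    return final_volume_ul - used


print(water_to_volume(PCR_MIX_UL, 50.0))     # water for the 50 ul PCR
print(water_to_volume(DIGEST_MIX_UL, 20.0))  # water for the 20 ul digestion
```

With the volumes above, the PCR needs 34.5 μl of water and the digestion 12 μl (14 μl if only 3 μl of PCR product is used).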
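Deep sequencing
Different sgRNA expression vectors were each co-transformed with the Ubi-CPF1-PBE/ABE expression vector into wheat and rice protoplasts; after 48 hours, the protoplasts were collected and DNA was extracted for deep sequencing. In the first round of PCR, the target region was amplified with site-specific primers. In the second round of PCR, forward and reverse barcodes were added to the ends of the PCR products for library construction. Equal amounts of the different PCR products were pooled. The samples were then sequenced at the Beijing Genomics Institute on an Illumina HiSeq 4000.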
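Once amplicon reads are available, the base editing frequency at a given position is the fraction of reads carrying the edited base. The patent does not describe its analysis pipeline, so the following is only a minimal sketch under simplifying assumptions: reads are already trimmed and aligned to the reference amplicon without indels (reads of a different length are skipped), and all names are hypothetical.

```python
def substitution_frequency(reads, reference, position, ref_base, alt_base):
    """Fraction of usable reads with alt_base at a 0-based position where the reference has ref_base."""
    if reference[position] != ref_base:
        raise ValueError("reference does not carry ref_base at this position")
    # Keep only reads that span the amplicon without length changes.
    usable = [read for read in reads if len(read) == len(reference)]
    if not usable:
        return 0.0
    edited = sum(1 for read in usable if read[position] == alt_base)
    return edited / len(usable)


# Toy data, not from the patent: two of four full-length reads carry a C-to-T change.
reference = "ACGT"
reads = ["ACGT", "ATGT", "ATGT", "ACGT", "AC"]
print(substitution_frequency(reads, reference, 1, "C", "T"))
```

Real data would additionally need quality filtering and proper alignment so that reads with insertions or deletions are counted rather than discarded.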
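Examples
Example 1. Optimization of CPF1-mediated cleavage activity in plant genomes.
The editing activity of CPF1 in plant cells varies considerably between published reports, and cleavage activity also differs greatly between different types of CPF1.
In this example, the nuclear import of AsCPF1, FnCPF1 and LbCPF1 was optimized, together with the promoter driving the crRNA, to increase the cleavage activity of CPF1 in plant cells. Vectors carrying AsCPF1, FnCPF1 and LbCPF1 with 1-4 NLSs were constructed, along with different vectors in which crRNA is produced by ribozyme processing of transcripts driven by the U3/U6 or UBI promoter (Figure 1). The PCR/RE results show that all three CPF1 proteins with 2 NLSs are functional and that LbCPF1 has relatively high efficiency (SEQ ID NO:5-7 are the coding sequences of ASCPF1-2NLS, FNCPF1-2NLS and LBCPF1-2NLS, respectively; the corresponding amino acid sequences are readily derived). At the target site in the OsPDS gene, 2NLS-LbCPF1 was more efficient than NLS-LbCPF1 and than some other reported constructs.
Example 2. CPF1-mediated C-to-T mutation in plant genomes (CPF1-PBE)
Based on the characteristics of CPF1 cleavage activity in plant cells, the following dCPF1-PBE systems were constructed: dAsCPF1-2NLS-PBE, dFnCPF1-2NLS-PBE and dLbCPF1-2NLS-PBE. The C-terminal NLSs were placed either both at one end of the UGI or one at each end of the UGI. The crRNA was driven by UBI1 and cleaved by a ribozyme. PCR/RE assays showed editing activity for dFnCPf1 and dLbCPF1, with higher activity when the NLSs were at one end of the UGI (SEQ ID NO:8 and 9 show the coding sequences of dFNCPF1-PBE-2NLS and dLbCPF1-2NLS-PBE, respectively; the corresponding amino acid sequences are readily derived). In addition, an enhanced version, dCPF1-PBE2-X, was constructed in which an intron was added after the ZmUbi-1 promoter to increase expression of dCPF1-PBE (SEQ ID NO:10 shows the dLBCPF1-PBE-2NLS expression cassette containing the ZmUbi-1 promoter and intron sequence).
Example 3. CPF1-mediated A-to-G mutation in plant genomes (CPF1-ABE)
The following CPF1-ABE systems were constructed: dAsCPF1-1NLS-ABE, dFnCPF1-NLS-ABE and dLbCPF1-1NLS-ABE, as well as dAsCPF1-2NLS-ABE, dFnCPF1-2NLS-ABE and dLbCPF1-2NLS-ABE, where ABE includes two versions, ABE7.9 and ABE7.10. The crRNA was driven by UBI1 and cleaved by a ribozyme.
Results from the GFP base editing reporter system of Figure 3E show that dFnCPF1-ABE7.10 (SEQ ID NO:11), dLbCPF1-ABE7.9 and dLbCPF1-ABE7.10 (SEQ ID NO:12) are all functional, and that 7.10 is more active than 7.9 (Figure 3F).
PCR/RE assays detected activity for dLbCPF1-ABE7.10, with 2NLS higher than 1NLS. In addition, two kinds of enhanced dCPF1-ABE2 were constructed: one in which an intron was added after the UBI1 promoter to increase expression of dCPF1-ABE (dCPF1-ABE2-X1) (SEQ ID NO:13), and one in which the ABE was placed at the C-terminus of CPF1 (dCPF1-ABE2-X2/X3) (SEQ ID NO:14, 15). Results from the GFP base editing reporter system of Figure 3E show that the editing activity of dCPF1-ABE2-X2/X3 is higher than that of dLbCPF1-ABE7.10 (Figure 3G).
Example 4. Optimization of CPF1-mediated gene editing
To further improve the editing efficiency of CPF1, we continued to optimize the CPF1 system. First, all expression vectors for CPF1-mediated editing were driven by the BdUbi10 promoter to increase expression. In addition, the crRNA was driven by a type II (pol II) promoter, and the crRNA array was placed in the 5' UTR or 3' UTR of the expressed gene, so as to raise CPF1 editing efficiency by increasing mRNA expression.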
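Description of the sequences:
SEQ ID NO:1 cytidine deaminase amino acid sequence
SEQ ID NO:2 uracil DNA glycosylase inhibitor (UGI) amino acid sequence
SEQ ID NO:3 wild-type ecTadA amino acid sequence
SEQ ID NO:4 ecTadA-derived DNA-dependent adenine deaminase (ABE version 7.10) amino acid sequence
SEQ ID NO:5 ASCPF1-2NLS coding sequence
SEQ ID NO:6 FNCPF1-2NLS coding sequence
SEQ ID NO:7 LBCPF1-2NLS coding sequence
SEQ ID NO:8 dFNCPF1-PBE-2NLS coding sequence
SEQ ID NO:9 dLBCPF1-PBE-2NLS coding sequence
SEQ ID NO:10 promoter + intron + dLBCPF1-PBE-2NLS coding sequence
SEQ ID NO:11 dFNCPF1-ABE7.10-2NLS coding sequence
SEQ ID NO:12 dLBCPF1-ABE7.10-2NLS coding sequence
SEQ ID NO:13 promoter + intron + dLBCPF1-ABE2-x coding sequence
SEQ ID NO:14 LBCPF1-ABE2-x2 coding sequence
SEQ ID NO:15 LBCPF1-ABE2-x3 coding sequence
SEQ ID NO:16 PJIT163-GFP
SEQ ID NO:17 pBUI-mGFP
SEQ ID NO:18 ASCPF1 amino acid sequence
SEQ ID NO:19 FNCPF1 amino acid sequence
SEQ ID NO:20 LBCPF1 amino acid sequence
SEQ ID NO:21 ASCPF1-2NLS amino acid sequence
SEQ ID NO:22 FNCPF1-2NLS amino acid sequence
SEQ ID NO:23 LBCPF1-2NLS amino acid sequence
SEQ ID NO:24 dFNCPF1-PBE-2NLS amino acid sequence
SEQ ID NO:25 dLBCPF1-PBE-2NLS amino acid sequence
SEQ ID NO:26 dFNCPF1-ABE7.10-2NLS amino acid sequence
SEQ ID NO:27 dLBCPF1-ABE7.10-2NLS amino acid sequence
SEQ ID NO:28 LBCPF1-ABE2-x2 amino acid sequence
SEQ ID NO:29 LBCPF1-ABE2-x3 amino acid sequence
SEQ ID NO:30 promoter + intron nucleotide sequence.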
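Sequence Listing
<110> Institute of Genetics and Developmental Biology, Chinese Academy of Sciences
<120> Base editing system and method based on CPF1 protein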
<130> 149132
<150> 201711403490.X
<151> 2017-12-22
<160> 30
<170> PatentIn version 3.5
<210> 1
<211> 227
<212> PRT
<213> Rattus norvegicus
<400> 1
Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg Arg
1 5 10 15
Ile Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu Arg
20 25 30
Lys Glu Thr Cys Leu Leu Tyr Glu Ile Asn Trp Gly Gly Arg His Ser
35 40 45
Ile Trp Arg His Thr Ser Gln Asn Thr Asn Lys His Val Glu Val Asn
50 55 60
Phe Ile Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr Arg
65 70 75 80
Cys Ser Ile Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys Ser
85 90 95
Arg Ala Ile Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu Phe
100 105 110
Ile Tyr Ile Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg Gln
115 120 125
Gly Leu Arg Asp Leu Ile Ser Ser Gly Val Thr Ile Gln Ile Met Thr
130 135 140
Glu Gln Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser Pro
145 150 155 160
Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg Leu
165 170 175
Tyr Val Leu Glu Leu Tyr Cys Ile Ile Leu Gly Leu Pro Pro Cys Leu
180 185 190
Asn Ile Leu Arg Arg Lys Gln Pro Gln Leu Thr Phe Phe Thr Ile Ala
195 200 205
Leu Gln Ser Cys His Tyr Gln Arg Leu Pro Pro His Ile Leu Trp Ala
210 215 220
Thr Gly Leu
225
<210> 2
<211> 87
<212> PRT
<213> Bacillus phage
<400> 2
Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val
1 5 10 15
Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile
20 25 30
Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu
35 40 45
Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr
50 55 60
Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile
65 70 75 80
Lys Met Leu Ser Gly Gly Ser
85
<210> 3
<211> 167
<212> PRT
<213> Escherichia coli
<400> 3
Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu
1 5 10 15
Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala
20 25 30
Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro
35 40 45
Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
50 55 60
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu
65 70 75 80
Tyr Val Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His
85 90 95
Ser Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly
100 105 110
Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn His
115 120 125
Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu
130 135 140
Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys
145 150 155 160
Lys Ala Gln Ser Ser Thr Asp
165
<210> 4
<211> 167
<212> PRT
<213> Escherichia coli
<400> 4
Met Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu
1 5 10 15
Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala
20 25 30
Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala
35 40 45
Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
50 55 60
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu
65 70 75 80
Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His
85 90 95
Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr Gly
100 105 110
Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn His
115 120 125
Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu
130 135 140
Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys
145 150 155 160
Lys Ala Gln Ser Ser Thr Asp
165
<210> 5
<211> 3990
<212> DNA
<213> Artificial Sequence
<220>
<223> ASCPF1-2NLS
<400> 5
atgacgcagt tcgaggggtt caccaacctc taccaggtca gcaagacgct gcggttcgag 60
ctgattccgc agggcaagac cctcaagcac atccaggagc agggctttat cgaggaggac 120
aaagcgcgga acgaccacta caaggagctc aagccgatca tcgaccggat ctacaagacc 180
tacgcggacc agtgcctgca gctcgtgcag ctcgactggg agaacctctc ggccgccatc 240
gactcctacc gcaaggagaa gacggaggag acacgcaacg cgctcatcga agaacaggct 300
acgtatcgca acgctatcca cgactacttc atcgggcgca cagataatct caccgacgcc 360
atcaacaagc gccacgccga aatctataag gggctgttta aggccgagct gttcaatggc 420
aaagtgctga agcaactggg caccgtcacg accacagagc atgagaacgc cctcctccgg 480
tcgttcgaca agtttacgac atactttagc ggcttttacg agaaccgcaa aaacgtgttc 540
agcgccgagg acattagcac cgccatcccg cataggatcg tgcaagacaa cttcccgaag 600
ttcaaggaga actgccacat cttcacccgc ctcatcacgg ccgtgccgtc cctccgcgag 660
cacttcgaga acgtgaagaa ggccatcggg atcttcgtgt ccacctcgat cgaagaggtg 720
ttctccttcc ccttctacaa tcagctgctg acgcagaccc agatcgacct ctacaaccag 780
ctcctcggcg gcatctcccg cgaggccggc accgagaaga tcaagggcct gaacgaggtg 840
ctcaatctcg ccatccagaa gaacgacgaa accgcgcaca tcattgcctc actcccacat 900
aggtttatcc ccctgtttaa gcagatcctc tccgaccgca acacgttgtc cttcatcctc 960
gaggagttca agagcgacga ggaggtcatc cagtccttct gcaagtacaa gaccctcctc 1020
cgcaacgaga atgtgctcga aaccgccgag gcgctgttca atgagctcaa tagcatcgat 1080
ctcacccaca tcttcatctc ccataagaag ctggaaacca tctccagcgc cctgtgcgac 1140
cactgggaca ccctccgcaa cgccctctac gagcggcgca tcagcgagct caccggcaag 1200
atcacgaagt cggcgaaaga gaaagtgcaa aggagcctca agcacgagga cattaacctg 1260
caggagatca tctccgccgc gggcaaggag ctgtccgagg ctttcaagca gaagacctcg 1320
gagatcctct ctcacgccca cgcggccctg gaccagccgc tgccgacgac cctgaaaaag 1380
caagaggaaa aggaaatcct caagtcccag ctggacagcc tcctggggct gtaccacctt 1440
ctcgactggt tcgccgtgga cgagtccaac gaggtcgatc cggagtttag cgcccgcctc 1500
accgggatca agcttgagat ggaacctagc ttgagcttct ataataaggc gcgcaattac 1560
gcgaccaaga agccgtattc cgtggagaag ttcaagctga acttccaaat gcccaccctg 1620
gccagcgggt gggacgttaa caaagagaag aacaacggag ccattctttt cgtgaaaaat 1680
gggttgtatt atttgggaat tatgccgaaa caaaaaggca ggtacaaggc gctcagcttc 1740
gagccaactg agaagacctc cgagggcttc gacaagatgt actacgatta ttttcctgac 1800
gctgcaaaga tgataccgaa gtgcagcact cagcttaagg cggtgacggc gcactttcag 1860
acccatacca cccccatcct cctctccaac aacttcatcg agccgctcga gatcaccaag 1920
gagatatacg atctgaataa tccagaaaag gaacccaaga agttccagac cgcctacgcc 1980
aagaagacgg gcgatcaaaa ggggtataga gaggcgctct gcaagtggat cgacttcacg 2040
cgcgatttcc tcagcaagta caccaagaca acctccatcg atctctcttc cctccgcccc 2100
tcttcccagt acaaggacct cggggagtac tacgccgaac tcaacccact cctgtatcac 2160
atctcgtttc agcgtatcgc ggaaaaggag atcatggacg ccgtcgaaac cggcaagttg 2220
tatcttttcc aaatctataa caaggacttc gcgaagggcc accacgggaa gccaaacctg 2280
cacaccctct actggacagg cctcttcagc ccggaaaatc tcgcgaagac gagcataaag 2340
ctgaacggcc aggcagaact cttctacagg ccgaagtcca ggatgaagcg catggctcat 2400
cgcctcggtg agaagatgct gaacaagaag ctgaaagatc aaaagacgcc aatccctgat 2460
acactgtatc aggagctgta cgattacgtg aaccaccgcc tctcacacga cctcagcgac 2520
gaggcccgcg cgctcctgcc aaacgtcatc acgaaggagg tcagccacga gatcataaag 2580
gatcggcggt ttacctctga caagttcttt ttccatgtcc ccatcacgct gaactaccag 2640
gccgcgaata gcccgtccaa gttcaaccag cgggtcaacg cgtatctcaa ggagcaccca 2700
gagacaccca taatcgggat tgaccggggg gagcggaacc tcatctacat caccgtcatc 2760
gactccaccg gaaagattct cgagcaacgg tcgctcaata ccatccagca gttcgactac 2820
cagaagaagc tcgacaaccg ggagaaggaa cgcgtcgccg cgaggcaggc ctggtccgta 2880
gtgggcacga tcaaagacct gaagcagggc tatctcagcc aggtcatcca tgagatagtg 2940
gatctcatga tccactacca agccgtcgtg gtcctcgaga atctcaattt cggattcaaa 3000
tccaagcgca caggcatcgc cgagaaggcg gtgtaccaac agttcgagaa aatgcttatc 3060
gacaagctca attgcctggt gctcaaggac tatccggcgg agaaggtcgg gggggtcctc 3120
aatccgtatc agctgaccga ccagtttacg tcatttgcga agatgggcac ccagagcggc 3180
ttccttttct atgtcccggc cccatatacc tcaaagattg atcccttgac cggattcgtg 3240
gacccgtttg tctggaagac catcaagaac catgagtcgc gtaagcattt cctggagggt 3300
ttcgacttcc tgcactatga tgtaaaaacc ggagacttca tcctgcattt caagatgaat 3360
cggaacctct ccttccagcg gggactccct ggcttcatgc ccgcttggga tatcgttttt 3420
gagaaaaatg aaacccaatt cgacgccaaa ggcacgcctt tcatcgcggg caagaggatt 3480
gtccctgtaa ttgagaacca tagattcacc gggcgttacc gtgacctgta ccccgcaaac 3540
gaactcatcg ccctcctgga ggagaaaggc atcgttttcc gcgacgggtc aaatatcctc 3600
cccaaactgc tcgagaacga tgatagccac gctattgaca cgatggtagc gctcatcaga 3660
tccgtgctgc aaatgagaaa ttcaaatgct gccactgggg aggattacat caactcccct 3720
gtgcgtgatc tcaatggcgt gtgcttcgat tctagatttc agaatcctga gtggccgatg 3780
gatgccgatg ctaacggcgc ataccacata gcattgaaag gacaactgtt gttgaaccat 3840
ctcaaggaga gcaaggacct taagctgcag aacggcatca gcaaccagga ttggcttgcc 3900
tatatccaag agctccgcaa ttccggcggc agcccaaaga agaagaggaa ggtgagcggc 3960
ggcagcccaa agaagaagcg caaggtctag 3990
<210> 6
<211> 3969
<212> DNA
<213> Artificial Sequence
<220>
<223> FNCPF1-2NLS
<400> 6
atgtccatct accaggagtt cgtcaataag tactcactct ctaagaccct gcggttcgag 60
ctgatcccgc agggcaagac actcgagaac atcaaggcgc gcggcctgat tctcgacgat 120
gagaagcggg ccaaggacta caagaaggcg aagcagatca ttgataagta ccaccagttc 180
ttcatcgagg agattctgtc cagcgtgtgc atctctgagg atctcctgca gaattactcc 240
gacgtctact tcaagctcaa gaagtctgac gatgacaacc tgcagaagga tttcaagtcc 300
gccaaggaca ccatcaagaa gcagatttct gagtacatca aggattccga gaagttcaag 360
aatctcttca accagaatct gattgatgcg aagaagggcc aggagtctga cctgatcctc 420
tggctgaagc agtccaagga caatggcatt gagctgttca aggccaacag cgatatcacc 480
gatattgacg aggcgctgga gatcattaag tcattcaagg gctggaccac atacttcaag 540
ggcttccatg agaaccggaa gaatgtgtac tcatctaacg acattccgac ctccatcatc 600
tacaggatcg tcgatgacaa tctgccaaag ttcctcgaga acaaggccaa gtacgagtcc 660
ctcaaggaca aggccccgga ggcgattaat tacgagcaga tcaagaagga tctggcggag 720
gagctgacct tcgatatcga ctacaagaca agcgaggtga accagagggt gttctccctc 780
gatgaggtgt tcgagatcgc caatttcaac aattacctga accagtccgg cattaccaag 840
ttcaatacaa tcattggcgg caagttcgtc aacggcgaga ataccaagcg caagggcatt 900
aacgagtaca tcaatctcta ctcccagcag atcaacgaca agaccctgaa gaagtacaag 960
atgtctgtgc tcttcaagca gatcctgtcc gatacagagt ccaagagctt cgtcattgat 1020
aagctcgagg acgacagcga cgtggtcacc acaatgcagt cattctacga gcagatcgcc 1080
gcgttcaaga ccgtggagga gaagagcatt aaggagacac tctcactcct gttcgatgac 1140
ctgaaggccc agaagctcga cctgagcaag atctacttca agaacgataa gagcctcaca 1200
gacctgtcac agcaggtgtt cgatgactac tcagtgattg gcaccgccgt cctcgagtac 1260
attacacagc agatcgcgcc aaagaacctc gataatcctt ctaagaagga gcaggagctg 1320
atcgccaaga aaaccgagaa ggcgaagtac ctctccctgg agacaattaa gctcgccctg 1380
gaggagttca ataagcacag ggatattgac aagcagtgcc gcttcgagga gatcctcgcg 1440
aacttcgccg cgatcccaat gattttcgat gagatcgccc agaacaagga caatctggcg 1500
cagatctcta ttaagtacca gaaccagggc aagaaggacc tcctgcaggc ctccgcagag 1560
gacgacgtga aggccatcaa ggatctcctg gaccagacca acaatctcct gcacaagctc 1620
aagatcttcc atatttcaca gtctgaggat aaggccaata tcctcgataa ggacgagcat 1680
ttctacctgg tgttcgagga gtgctacttc gagctggcga acattgtccc tctgtacaac 1740
aagattagga attacatcac acagaagccg tacagcgacg agaagttcaa gctcaacttc 1800
gagaattcaa ccctggccaa cggctgggat aagaataagg agcctgacaa cacagcgatc 1860
ctcttcatca aggacgacaa gtactacctg ggcgtgatga ataagaagaa caataagatc 1920
ttcgatgaca aggccattaa ggagaacaag ggcgagggct acaagaagat cgtgtacaag 1980
ctcctgcctg gcgccaataa gatgctcccg aaggtgttct tctccgcgaa gtccattaag 2040
ttctacaacc caagcgagga tatcctcagg atcaggaacc actctaccca tacaaagaac 2100
ggctcccctc agaagggcta cgagaagttc gagttcaata tcgaggattg ccggaagttc 2160
attgacttct acaagcagtc catcagcaag caccctgagt ggaaggattt cggcttccgc 2220
ttcagcgaca cccagcggta caactcaatc gatgagttct acagggaggt ggagaatcag 2280
ggctacaagc tcacattcga gaacatttca gagtcttaca tcgactccgt ggtcaatcag 2340
ggcaagctct acctgttcca gatctacaac aaggatttca gcgcctactc aaagggcagg 2400
ccgaacctcc ataccctgta ctggaaggcg ctcttcgatg agcgcaatct gcaggacgtg 2460
gtctacaagc tcaacggcga ggccgagctg ttctaccgca agcagtctat tccgaagaag 2520
atcacacacc cagcgaagga ggccatcgcg aacaagaata aggacaatcc gaagaaggag 2580
tccgtgttcg agtacgatct cattaaggac aagcggttca ccgaggataa gttcttcttc 2640
cattgcccaa tcacaattaa cttcaagtcc agcggcgcca acaagttcaa tgacgagatc 2700
aatctcctgc tcaaggagaa ggcgaacgat gtgcatatcc tctccattga ccgcggcgag 2760
cggcatctcg cctactacac cctggtcgat ggcaagggca atatcattaa gcaggatacc 2820
ttcaacatca ttggcaatga ccgcatgaaa accaactacc acgataagct cgccgcgatc 2880
gagaaggata gggacagcgc gcgcaaggac tggaagaaga tcaacaatat taaggagatg 2940
aaggagggct acctctcaca ggtggtccat gagattgcca agctcgtgat cgagtacaat 3000
gcgattgtgg tgttcgagga cctcaacttc ggcttcaagc ggggcaggtt caaggtggag 3060
aagcaggtct accagaagct ggagaagatg ctcatcgaga agctcaatta cctggtgttc 3120
aaggataacg agttcgacaa gaccggagga gtcctcaggg cataccagct gaccgcgcca 3180
ttcgagacat tcaagaagat gggcaagcag acaggcatca tctactacgt gccagccggc 3240
ttcacctcca agatctgccc tgtgacaggc ttcgtcaacc agctctaccc taagtacgag 3300
tccgtcagca agtcacagga gttcttcagc aagttcgata agatttgcta caatctggac 3360
aagggctact tcgagttctc attcgattac aagaacttcg gcgacaaggc cgcgaagggc 3420
aagtggacca ttgcgtcttt cggctcccgg ctcatcaact tcaggaatag cgataagaac 3480
cacaattggg acaccaggga ggtgtacccg acaaaggagc tggagaagct gctcaaggac 3540
tactcaatcg agtacggcca tggcgagtgc attaaggccg cgatctgcgg cgagagcgat 3600
aagaagttct tcgccaagct gacctcagtc ctcaatacaa tcctgcagat gaggaactct 3660
aagaccggca cagagctgga ctacctgatt tccccagtgg ccgatgtcaa cggcaatttc 3720
ttcgacagcc gccaggcgcc gaagaatatg ccacaggacg ccgacgccaa cggcgcatac 3780
cacatcggcc tcaagggcct gatgctgctc ggacgcatta agaacaatca ggagggcaag 3840
aagctcaacc tcgtgatcaa gaacgaggag tacttcgagt tcgtccaaaa ccggaacaat 3900
tccggcggca gcccaaagaa gaagaggaag gtgagcggcg gcagcccaaa gaagaagcgc 3960
aaggtctag 3969
<210> 7
<211> 3753
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-2NLS
<400> 7
atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60
gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120
gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180
tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240
ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300
ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360
aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420
gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480
atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540
actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600
gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660
gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720
atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780
ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840
ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900
ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960
ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020
ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080
aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140
tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200
caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260
aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320
gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380
gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440
aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500
gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560
ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620
gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680
aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740
atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800
aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860
ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920
gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980
gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040
tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100
atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160
acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220
ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280
gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340
tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400
gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460
aagcatgacg acaatcctta cgtcattggg attgatcggg gcgagaggaa cctcctctat 2520
attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580
aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640
aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700
gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760
gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820
caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880
aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940
gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000
acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060
attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120
gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180
aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240
aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300
aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360
aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420
attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480
ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540
gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600
gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660
tatgctcaga catccgtaaa gcattccggc ggcagcccaa agaagaagag gaaggtgagc 3720
ggcggcagcc caaagaagaa gcgcaaggtc tag 3753
<210> 8
<211> 4995
<212> DNA
<213> Artificial Sequence
<220>
<223> dFNCPF1-PBE-2NLS
<400> 8
atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60
ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120
aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180
accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240
aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300
ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360
atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420
ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480
aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540
tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600
aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660
cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720
acgccaggca cctccgagag cgctacgcct gaacttaagt ccatctacca ggagttcgtc 780
aataagtact cactctctaa gaccctgcgg ttcgagctga tcccgcaggg caagacactc 840
gagaacatca aggcgcgcgg cctgattctc gacgatgaga agcgggccaa ggactacaag 900
aaggcgaagc agatcattga taagtaccac cagttcttca tcgaggagat tctgtccagc 960
gtgtgcatct ctgaggatct cctgcagaat tactccgacg tctacttcaa gctcaagaag 1020
tctgacgatg acaacctgca gaaggatttc aagtccgcca aggacaccat caagaagcag 1080
atttctgagt acatcaagga ttccgagaag ttcaagaatc tcttcaacca gaatctgatt 1140
gatgcgaaga agggccagga gtctgacctg atcctctggc tgaagcagtc caaggacaat 1200
ggcattgagc tgttcaaggc caacagcgat atcaccgata ttgacgaggc gctggagatc 1260
attaagtcat tcaagggctg gaccacatac ttcaagggct tccatgagaa ccggaagaat 1320
gtgtactcat ctaacgacat tccgacctcc atcatctaca ggatcgtcga tgacaatctg 1380
ccaaagttcc tcgagaacaa ggccaagtac gagtccctca aggacaaggc cccggaggcg 1440
attaattacg agcagatcaa gaaggatctg gcggaggagc tgaccttcga tatcgactac 1500
aagacaagcg aggtgaacca gagggtgttc tccctcgatg aggtgttcga gatcgccaat 1560
ttcaacaatt acctgaacca gtccggcatt accaagttca atacaatcat tggcggcaag 1620
ttcgtcaacg gcgagaatac caagcgcaag ggcattaacg agtacatcaa tctctactcc 1680
cagcagatca acgacaagac cctgaagaag tacaagatgt ctgtgctctt caagcagatc 1740
ctgtccgata cagagtccaa gagcttcgtc attgataagc tcgaggacga cagcgacgtg 1800
gtcaccacaa tgcagtcatt ctacgagcag atcgccgcgt tcaagaccgt ggaggagaag 1860
agcattaagg agacactctc actcctgttc gatgacctga aggcccagaa gctcgacctg 1920
agcaagatct acttcaagaa cgataagagc ctcacagacc tgtcacagca ggtgttcgat 1980
gactactcag tgattggcac cgccgtcctc gagtacatta cacagcagat cgcgccaaag 2040
aacctcgata atccttctaa gaaggagcag gagctgatcg ccaagaaaac cgagaaggcg 2100
aagtacctct ccctggagac aattaagctc gccctggagg agttcaataa gcacagggat 2160
attgacaagc agtgccgctt cgaggagatc ctcgcgaact tcgccgcgat cccaatgatt 2220
ttcgatgaga tcgcccagaa caaggacaat ctggcgcaga tctctattaa gtaccagaac 2280
cagggcaaga aggacctcct gcaggcctcc gcagaggacg acgtgaaggc catcaaggat 2340
ctcctggacc agaccaacaa tctcctgcac aagctcaaga tcttccatat ttcacagtct 2400
gaggataagg ccaatatcct cgataaggac gagcatttct acctggtgtt cgaggagtgc 2460
tacttcgagc tggcgaacat tgtccctctg tacaacaaga ttaggaatta catcacacag 2520
aagccgtaca gcgacgagaa gttcaagctc aacttcgaga attcaaccct ggccaacggc 2580
tgggataaga ataaggagcc tgacaacaca gcgatcctct tcatcaagga cgacaagtac 2640
tacctgggcg tgatgaataa gaagaacaat aagatcttcg atgacaaggc cattaaggag 2700
aacaagggcg agggctacaa gaagatcgtg tacaagctcc tgcctggcgc caataagatg 2760
ctcccgaagg tgttcttctc cgcgaagtcc attaagttct acaacccaag cgaggatatc 2820
ctcaggatca ggaaccactc tacccataca aagaacggct cccctcagaa gggctacgag 2880
aagttcgagt tcaatatcga ggattgccgg aagttcattg acttctacaa gcagtccatc 2940
agcaagcacc ctgagtggaa ggatttcggc ttccgcttca gcgacaccca gcggtacaac 3000
tcaatcgatg agttctacag ggaggtggag aatcagggct acaagctcac attcgagaac 3060
atttcagagt cttacatcga ctccgtggtc aatcagggca agctctacct gttccagatc 3120
tacaacaagg atttcagcgc ctactcaaag ggcaggccga acctccatac cctgtactgg 3180
aaggcgctct tcgatgagcg caatctgcag gacgtggtct acaagctcaa cggcgaggcc 3240
gagctgttct accgcaagca gtctattccg aagaagatca cacacccagc gaaggaggcc 3300
atcgcgaaca agaataagga caatccgaag aaggagtccg tgttcgagta cgatctcatt 3360
aaggacaagc ggttcaccga ggataagttc ttcttccatt gcccaatcac aattaacttc 3420
aagtccagcg gcgccaacaa gttcaatgac gagatcaatc tcctgctcaa ggagaaggcg 3480
aacgatgtgc atatcctctc cattgcccgc ggcgagcggc atctcgccta ctacaccctg 3540
gtcgatggca agggcaatat cattaagcag gataccttca acatcattgg caatgaccgc 3600
atgaaaacca actaccacga taagctcgcc gcgatcgaga aggataggga cagcgcgcgc 3660
aaggactgga agaagatcaa caatattaag gagatgaagg agggctacct ctcacaggtg 3720
gtccatgaga ttgccaagct cgtgatcgag tacaatgcga ttgtggtgtt cgaggacctc 3780
aacttcggct tcaagcgggg caggttcaag gtggagaagc aggtctacca gaagctggag 3840
aagatgctca tcgagaagct caattacctg gtgttcaagg ataacgagtt cgacaagacc 3900
ggaggagtcc tcagggcata ccagctgacc gcgccattcg agacattcaa gaagatgggc 3960
aagcagacag gcatcatcta ctacgtgcca gccggcttca cctccaagat ctgccctgtg 4020
acaggcttcg tcaaccagct ctaccctaag tacgagtccg tcagcaagtc acaggagttc 4080
ttcagcaagt tcgataagat ttgctacaat ctggacaagg gctacttcga gttctcattc 4140
gattacaaga acttcggcga caaggccgcg aagggcaagt ggaccattgc gtctttcggc 4200
tcccggctca tcaacttcag gaatagcgat aagaaccaca attgggacac cagggaggtg 4260
tacccgacaa aggagctgga gaagctgctc aaggactact caatcgagta cggccatggc 4320
gagtgcatta aggccgcgat ctgcggcgag agcgataaga agttcttcgc caagctgacc 4380
tcagtcctca atacaatcct gcagatgagg aactctaaga ccggcacaga gctggactac 4440
ctgatttccc cagtggccga tgtcaacggc aatttcttcg acagccgcca ggcgccgaag 4500
aatatgccac aggacgccga cgccaacggc gcataccaca tcggcctcaa gggcctgatg 4560
ctgctcggac gcattaagaa caatcaggag ggcaagaagc tcaacctcgt gatcaagaac 4620
gaggagtact tcgagttcgt ccaaaaccgg aacaatacgc gtgactccgg cggcagcacc 4680
aacctgtccg acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc 4740
ctcatgctgc cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg 4800
gtgcacaccg cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc 4860
ccagagtaca agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag 4920
atgctgtcgg gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag 4980
aagcgcaagg tctag 4995
<210> 9
<211> 4779
<212> DNA
<213> Artificial Sequence
<220>
<223> dLBCPF1-PBE-2NLS
<400> 9
atgccaaaga agaagaggaa ggtttcatcg gagaccggcc ctgttgctgt tgaccccacc 60
ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120
aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180
accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240
aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300
ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360
atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420
ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480
aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540
tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600
aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660
cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720
acgccaggca cctccgagag cgctacgcct gaacttaagt caaagctcga gaaattcacc 780
aactgttatt cgttgagcaa aacactgcgg tttaaagcga ttccagtcgg caagactcaa 840
gagaatatag acaataagcg gctgttggtg gaagatgaaa agcgcgcgga agactacaaa 900
ggggtgaaga agttgttgga cagatactac ctctctttta tcaatgatgt cttgcactca 960
atcaaattga agaatctgaa caactacatc tccctcttca gaaagaaaac aaggacagaa 1020
aaggagaata aggaacttga aaatttggag atcaatctga ggaaagagat cgcgaaagcc 1080
tttaaaggca acgaaggata caaaagtctg ttcaagaagg atataattga gacaattttg 1140
ccagagttcc tcgatgacaa ggacgagatt gcgctggtca attcgttcaa cggattcaca 1200
acagcattca caggcttctt tgataatcgg gaaaatatgt tctctgagga ggcaaagtcc 1260
acttctattg cgttcaggtg tatcaatgag aatctcacta ggtacatttc caacatggat 1320
atctttgaga aggttgacgc aatttttgac aagcacgaag ttcaggagat taaggagaag 1380
atcctcaatt ccgattatga cgttgaggac ttcttcgaag gtgagttttt taatttcgtg 1440
ctcactcaag agggtatcga cgtgtataat gcgatcatcg gtgggttcgt gactgagtcc 1500
ggtgaaaaga ttaagggatt gaacgagtat atcaaccttt acaaccaaaa gacgaaacag 1560
aagctgccaa agttcaagcc tctttacaaa caggttcttt cagaccgcga gtcactctcg 1620
ttctatgggg agggctacac ttcggatgag gaagtcctgg aggtgttcag gaatactctc 1680
aataagaatt cggagatttt ctcttctata aaaaaactgg aaaagttgtt taagaatttt 1740
gacgaatact ctagcgccgg catatttgtg aaaaacggcc cggccatatc aacgataagt 1800
aaagatatct tcggcgaatg gaacgtgatc agagacaaat ggaacgcgga gtatgacgat 1860
attcacctga agaagaaggc tgtcgtaacg gagaagtacg aggatgatcg caggaaaagc 1920
ttcaaaaaga tcggaagttt cagcctggaa cagttgcagg agtatgctga cgccgatctt 1980
agcgtcgtcg agaagttgaa ggagataatc atccaaaagg tcgacgagat atataaagtc 2040
tatggatcaa gtgaaaaact gttcgacgcc gacttcgttt tggagaagtc cctgaagaag 2100
aacgacgctg ttgttgccat tatgaaggat ctgctcgaca gcgtgaagag tttcgagaac 2160
tatattaagg cttttttcgg ggaggggaag gagactaaca gagatgagtc cttctacgga 2220
gacttcgtcc tcgcgtacga tatactcctt aaggtagacc acatctacga cgcaatcaga 2280
aattacgtga cacaaaagcc gtacagcaag gacaagttca aactctactt ccagaacccc 2340
cagttcatgg gcggctggga caaggacaag gaaacggatt acagggctac gatcctgagg 2400
tatggttcaa aatactactt ggcgattatg gacaagaagt acgccaagtg tctccagaag 2460
attgacaaag acgatgtcaa tggcaattat gagaagatca actacaagct gcttccgggt 2520
ccgaacaaga tgctcccaaa ggttttcttc agcaagaaat ggatggccta ctataaccca 2580
agcgaggaca tccagaagat ttataagaac ggtacgttca agaagggcga catgttcaat 2640
cttaacgact gtcacaagct gatcgacttc ttcaaagact caattagccg gtacccaaag 2700
tggtctaacg cctatgactt caacttttcg gaaaccgaga agtacaagga tatagccgga 2760
ttttatagag aggtggaaga gcagggctac aaggtgtcat tcgagtccgc cagcaagaag 2820
gaagtggaca agctcgtgga agagggtaag ctctacatgt tccagattta taataaagac 2880
tttagcgata agagccacgg gacacctaat ctccacacaa tgtatttcaa gctgctcttc 2940
gacgagaata accacggcca aatcaggttg tcaggagggg ctgaactctt catgcggcgc 3000
gctagcctta agaaggagga gcttgtagtc caccctgcga atagtccaat tgcgaataag 3060
aacccggaca atcctaaaaa gactacaaca ttgagctacg acgtgtacaa ggataagagg 3120
ttttccgagg atcagtacga gctccacatc ccgattgcga tcaacaagtg cccaaagaat 3180
attttcaaga taaacacaga ggtgcgtgta ctcctgaagc atgacgacaa tccttacgtc 3240
attgggattg ctcggggcga gaggaacctc ctctatattg tggtggtgga cgggaagggg 3300
aacatagtcg aacagtactc ccttaacgaa ataattaaca atttcaacgg catccgtatc 3360
aagaccgact accattcgtt gctggacaag aaggagaagg agagatttga ggcgcggcaa 3420
aattggacaa gtatcgagaa catcaaggaa ctcaaagcag gttatatctc tcaagttgtg 3480
cataagatat gcgagctggt tgagaagtat gacgcagtga tcgctcttga ggacctcaac 3540
tcgggcttta agaattctag agttaaagtg gagaagcagg tctatcaaaa gttcgagaag 3600
atgcttatag ataagctcaa ctacatggtc gataagaaat cgaacccatg tgccaccggc 3660
ggcgcactca aaggttacca aataacaaac aaattcgagt ccttcaaatc gatgagtact 3720
cagaatgggt tcatatttta tataccggcg tggcttacgt ctaagatcga cccgtcaact 3780
ggttttgtca acctgttgaa gacgaaatac acgtccattg ccgattcgaa aaagttcata 3840
tctagttttg atcgtattat gtacgtccca gaggaagatc ttttcgagtt tgctctcgac 3900
tacaaaaact tttcgcggac cgatgcggat tacattaaaa aatggaaact ctattcgtac 3960
ggcaacagaa tcaggatttt tcgcaaccct aagaagaata acgtctttga ttgggaggaa 4020
gtttgcttga ctagcgcgta caaggagctc tttaataagt atggcattaa ctaccaacag 4080
ggtgatatca gagcactgct ttgcgaacaa tctgacaagg ctttctactc atccttcatg 4140
gctttgatga gcctgatgct ccagatgaga aattcaatta caggcagaac cgacgtggat 4200
ttcttgatct ccccggttaa aaattctgat ggcatctttt acgatagcag gaactatgaa 4260
gcgcaagaga atgcgattct gccaaaaaat gcagacgcca acggtgccta taacatcgcc 4320
aggaaagtcc tgtgggcgat cggccagttc aaaaaggccg aagacgaaaa attggacaag 4380
gtcaaaatcg ctatcagcaa caaagagtgg ctggagtatg ctcagacatc cgtaaagcat 4440
acgcgtgact ccggcggcag caccaacctg tccgacatca tcgagaagga gacgggcaag 4500
caactcgtga tccaggagag catcctcatg ctgccagagg aggtggagga ggtcatcggc 4560
aacaagccag agtccgacat cctggtgcac accgcctacg acgagtccac cgacgagaac 4620
gtcatgctcc tgaccagcga cgccccagag tacaagccat gggccctcgt catccaggac 4680
agcaacgggg agaacaagat caagatgctg tcggggggga gcccaaagaa gaagcggaag 4740
gtgagcggcg gcagcccaaa gaagaagcgc aaggtctag 4779
<210> 10
<211> 6305
<212> DNA
<213> Artificial Sequence
<220>
<223> BDUBI10-dLBCPF1-PBE-2NLS
<400> 10
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaacatgc caaagaagaa gaggaaggtt tcatcggaga 1560
ccggccctgt tgctgttgac cccaccctgc ggcggagaat cgagccacac gagttcgagg 1620
tgttcttcga cccaagggag ctccgcaagg agacgtgcct cctgtacgag atcaactggg 1680
gcggcaggca ctccatctgg aggcacacca gccaaaacac caacaagcac gtggaggtca 1740
acttcatcga gaagttcacc accgagaggt acttctgccc aaacacccgc tgctccatca 1800
cctggttcct gtcctggagc ccatgcggcg agtgctccag ggccatcacc gagttcctca 1860
gccgctaccc acacgtcacc ctgttcatct acatcgccag gctctaccac cacgccgacc 1920
caaggaacag gcagggcctc cgcgacctga tctccagcgg cgtgaccatc caaatcatga 1980
ccgagcagga gtccggctac tgctggagga acttcgtcaa ctactcccca agcaacgagg 2040
cccactggcc aaggtaccca cacctctggg tgcgcctcta cgtgctcgag ctgtactgca 2100
tcatcctcgg cctgccacca tgcctcaaca tcctgaggcg caagcaacca cagctgacct 2160
tcttcaccat cgccctccaa agctgccact accagaggct cccaccacac atcctgtggg 2220
ctaccggcct caagtccggc agcgagacgc caggcacctc cgagagcgct acgcctgaac 2280
ttaagtcaaa gctcgagaaa ttcaccaact gttattcgtt gagcaaaaca ctgcggttta 2340
aagcgattcc agtcggcaag actcaagaga atatagacaa taagcggctg ttggtggaag 2400
atgaaaagcg cgcggaagac tacaaagggg tgaagaagtt gttggacaga tactacctct 2460
cttttatcaa tgatgtcttg cactcaatca aattgaagaa tctgaacaac tacatctccc 2520
tcttcagaaa gaaaacaagg acagaaaagg agaataagga acttgaaaat ttggagatca 2580
atctgaggaa agagatcgcg aaagccttta aaggcaacga aggatacaaa agtctgttca 2640
agaaggatat aattgagaca attttgccag agttcctcga tgacaaggac gagattgcgc 2700
tggtcaattc gttcaacgga ttcacaacag cattcacagg cttctttgat aatcgggaaa 2760
atatgttctc tgaggaggca aagtccactt ctattgcgtt caggtgtatc aatgagaatc 2820
tcactaggta catttccaac atggatatct ttgagaaggt tgacgcaatt tttgacaagc 2880
acgaagttca ggagattaag gagaagatcc tcaattccga ttatgacgtt gaggacttct 2940
tcgaaggtga gttttttaat ttcgtgctca ctcaagaggg tatcgacgtg tataatgcga 3000
tcatcggtgg gttcgtgact gagtccggtg aaaagattaa gggattgaac gagtatatca 3060
acctttacaa ccaaaagacg aaacagaagc tgccaaagtt caagcctctt tacaaacagg 3120
ttctttcaga ccgcgagtca ctctcgttct atggggaggg ctacacttcg gatgaggaag 3180
tcctggaggt gttcaggaat actctcaata agaattcgga gattttctct tctataaaaa 3240
aactggaaaa gttgtttaag aattttgacg aatactctag cgccggcata tttgtgaaaa 3300
acggcccggc catatcaacg ataagtaaag atatcttcgg cgaatggaac gtgatcagag 3360
acaaatggaa cgcggagtat gacgatattc acctgaagaa gaaggctgtc gtaacggaga 3420
agtacgagga tgatcgcagg aaaagcttca aaaagatcgg aagtttcagc ctggaacagt 3480
tgcaggagta tgctgacgcc gatcttagcg tcgtcgagaa gttgaaggag ataatcatcc 3540
aaaaggtcga cgagatatat aaagtctatg gatcaagtga aaaactgttc gacgccgact 3600
tcgttttgga gaagtccctg aagaagaacg acgctgttgt tgccattatg aaggatctgc 3660
tcgacagcgt gaagagtttc gagaactata ttaaggcttt tttcggggag gggaaggaga 3720
ctaacagaga tgagtccttc tacggagact tcgtcctcgc gtacgatata ctccttaagg 3780
tagaccacat ctacgacgca atcagaaatt acgtgacaca aaagccgtac agcaaggaca 3840
agttcaaact ctacttccag aacccccagt tcatgggcgg ctgggacaag gacaaggaaa 3900
cggattacag ggctacgatc ctgaggtatg gttcaaaata ctacttggcg attatggaca 3960
agaagtacgc caagtgtctc cagaagattg acaaagacga tgtcaatggc aattatgaga 4020
agatcaacta caagctgctt ccgggtccga acaagatgct cccaaaggtt ttcttcagca 4080
agaaatggat ggcctactat aacccaagcg aggacatcca gaagatttat aagaacggta 4140
cgttcaagaa gggcgacatg ttcaatctta acgactgtca caagctgatc gacttcttca 4200
aagactcaat tagccggtac ccaaagtggt ctaacgccta tgacttcaac ttttcggaaa 4260
ccgagaagta caaggatata gccggatttt atagagaggt ggaagagcag ggctacaagg 4320
tgtcattcga gtccgccagc aagaaggaag tggacaagct cgtggaagag ggtaagctct 4380
acatgttcca gatttataat aaagacttta gcgataagag ccacgggaca cctaatctcc 4440
acacaatgta tttcaagctg ctcttcgacg agaataacca cggccaaatc aggttgtcag 4500
gaggggctga actcttcatg cggcgcgcta gccttaagaa ggaggagctt gtagtccacc 4560
ctgcgaatag tccaattgcg aataagaacc cggacaatcc taaaaagact acaacattga 4620
gctacgacgt gtacaaggat aagaggtttt ccgaggatca gtacgagctc cacatcccga 4680
ttgcgatcaa caagtgccca aagaatattt tcaagataaa cacagaggtg cgtgtactcc 4740
tgaagcatga cgacaatcct tacgtcattg ggattgctcg gggcgagagg aacctcctct 4800
atattgtggt ggtggacggg aaggggaaca tagtcgaaca gtactccctt aacgaaataa 4860
ttaacaattt caacggcatc cgtatcaaga ccgactacca ttcgttgctg gacaagaagg 4920
agaaggagag atttgaggcg cggcaaaatt ggacaagtat cgagaacatc aaggaactca 4980
aagcaggtta tatctctcaa gttgtgcata agatatgcga gctggttgag aagtatgacg 5040
cagtgatcgc tcttgaggac ctcaactcgg gctttaagaa ttctagagtt aaagtggaga 5100
agcaggtcta tcaaaagttc gagaagatgc ttatagataa gctcaactac atggtcgata 5160
agaaatcgaa cccatgtgcc accggcggcg cactcaaagg ttaccaaata acaaacaaat 5220
tcgagtcctt caaatcgatg agtactcaga atgggttcat attttatata ccggcgtggc 5280
ttacgtctaa gatcgacccg tcaactggtt ttgtcaacct gttgaagacg aaatacacgt 5340
ccattgccga ttcgaaaaag ttcatatcta gttttgatcg tattatgtac gtcccagagg 5400
aagatctttt cgagtttgct ctcgactaca aaaacttttc gcggaccgat gcggattaca 5460
ttaaaaaatg gaaactctat tcgtacggca acagaatcag gatttttcgc aaccctaaga 5520
agaataacgt ctttgattgg gaggaagttt gcttgactag cgcgtacaag gagctcttta 5580
ataagtatgg cattaactac caacagggtg atatcagagc actgctttgc gaacaatctg 5640
acaaggcttt ctactcatcc ttcatggctt tgatgagcct gatgctccag atgagaaatt 5700
caattacagg cagaaccgac gtggatttct tgatctcccc ggttaaaaat tctgatggca 5760
tcttttacga tagcaggaac tatgaagcgc aagagaatgc gattctgcca aaaaatgcag 5820
acgccaacgg tgcctataac atcgccagga aagtcctgtg ggcgatcggc cagttcaaaa 5880
aggccgaaga cgaaaaattg gacaaggtca aaatcgctat cagcaacaaa gagtggctgg 5940
agtatgctca gacatccgta aagcatacgc gtgactccgg cggcagcacc aacctgtccg 6000
acatcatcga gaaggagacg ggcaagcaac tcgtgatcca ggagagcatc ctcatgctgc 6060
cagaggaggt ggaggaggtc atcggcaaca agccagagtc cgacatcctg gtgcacaccg 6120
cctacgacga gtccaccgac gagaacgtca tgctcctgac cagcgacgcc ccagagtaca 6180
agccatgggc cctcgtcatc caggacagca acggggagaa caagatcaag atgctgtcgg 6240
gggggagccc aaagaagaag cggaaggtga gcggcggcag cccaaagaag aagcgcaagg 6300
tctag 6305
<210> 11
<211> 5196
<212> DNA
<213> Artificial Sequence
<220>
<223> dFNCPF1-ABE7.10-2NLS
<400> 11
atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60
tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120
gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180
ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240
atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300
tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360
aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420
gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480
aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540
ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600
gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660
atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720
gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780
catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840
aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900
ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960
ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020
atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080
ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140
tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200
tctggaggat ctagcggagg atctcttaag tccatctacc aggagttcgt caataagtac 1260
tcactctcta agaccctgcg gttcgagctg atcccgcagg gcaagacact cgagaacatc 1320
aaggcgcgcg gcctgattct cgacgatgag aagcgggcca aggactacaa gaaggcgaag 1380
cagatcattg ataagtacca ccagttcttc atcgaggaga ttctgtccag cgtgtgcatc 1440
tctgaggatc tcctgcagaa ttactccgac gtctacttca agctcaagaa gtctgacgat 1500
gacaacctgc agaaggattt caagtccgcc aaggacacca tcaagaagca gatttctgag 1560
tacatcaagg attccgagaa gttcaagaat ctcttcaacc agaatctgat tgatgcgaag 1620
aagggccagg agtctgacct gatcctctgg ctgaagcagt ccaaggacaa tggcattgag 1680
ctgttcaagg ccaacagcga tatcaccgat attgacgagg cgctggagat cattaagtca 1740
ttcaagggct ggaccacata cttcaagggc ttccatgaga accggaagaa tgtgtactca 1800
tctaacgaca ttccgacctc catcatctac aggatcgtcg atgacaatct gccaaagttc 1860
ctcgagaaca aggccaagta cgagtccctc aaggacaagg ccccggaggc gattaattac 1920
gagcagatca agaaggatct ggcggaggag ctgaccttcg atatcgacta caagacaagc 1980
gaggtgaacc agagggtgtt ctccctcgat gaggtgttcg agatcgccaa tttcaacaat 2040
tacctgaacc agtccggcat taccaagttc aatacaatca ttggcggcaa gttcgtcaac 2100
ggcgagaata ccaagcgcaa gggcattaac gagtacatca atctctactc ccagcagatc 2160
aacgacaaga ccctgaagaa gtacaagatg tctgtgctct tcaagcagat cctgtccgat 2220
acagagtcca agagcttcgt cattgataag ctcgaggacg acagcgacgt ggtcaccaca 2280
atgcagtcat tctacgagca gatcgccgcg ttcaagaccg tggaggagaa gagcattaag 2340
gagacactct cactcctgtt cgatgacctg aaggcccaga agctcgacct gagcaagatc 2400
tacttcaaga acgataagag cctcacagac ctgtcacagc aggtgttcga tgactactca 2460
gtgattggca ccgccgtcct cgagtacatt acacagcaga tcgcgccaaa gaacctcgat 2520
aatccttcta agaaggagca ggagctgatc gccaagaaaa ccgagaaggc gaagtacctc 2580
tccctggaga caattaagct cgccctggag gagttcaata agcacaggga tattgacaag 2640
cagtgccgct tcgaggagat cctcgcgaac ttcgccgcga tcccaatgat tttcgatgag 2700
atcgcccaga acaaggacaa tctggcgcag atctctatta agtaccagaa ccagggcaag 2760
aaggacctcc tgcaggcctc cgcagaggac gacgtgaagg ccatcaagga tctcctggac 2820
cagaccaaca atctcctgca caagctcaag atcttccata tttcacagtc tgaggataag 2880
gccaatatcc tcgataagga cgagcatttc tacctggtgt tcgaggagtg ctacttcgag 2940
ctggcgaaca ttgtccctct gtacaacaag attaggaatt acatcacaca gaagccgtac 3000
agcgacgaga agttcaagct caacttcgag aattcaaccc tggccaacgg ctgggataag 3060
aataaggagc ctgacaacac agcgatcctc ttcatcaagg acgacaagta ctacctgggc 3120
gtgatgaata agaagaacaa taagatcttc gatgacaagg ccattaagga gaacaagggc 3180
gagggctaca agaagatcgt gtacaagctc ctgcctggcg ccaataagat gctcccgaag 3240
gtgttcttct ccgcgaagtc cattaagttc tacaacccaa gcgaggatat cctcaggatc 3300
aggaaccact ctacccatac aaagaacggc tcccctcaga agggctacga gaagttcgag 3360
ttcaatatcg aggattgccg gaagttcatt gacttctaca agcagtccat cagcaagcac 3420
cctgagtgga aggatttcgg cttccgcttc agcgacaccc agcggtacaa ctcaatcgat 3480
gagttctaca gggaggtgga gaatcagggc tacaagctca cattcgagaa catttcagag 3540
tcttacatcg actccgtggt caatcagggc aagctctacc tgttccagat ctacaacaag 3600
gatttcagcg cctactcaaa gggcaggccg aacctccata ccctgtactg gaaggcgctc 3660
ttcgatgagc gcaatctgca ggacgtggtc tacaagctca acggcgaggc cgagctgttc 3720
taccgcaagc agtctattcc gaagaagatc acacacccag cgaaggaggc catcgcgaac 3780
aagaataagg acaatccgaa gaaggagtcc gtgttcgagt acgatctcat taaggacaag 3840
cggttcaccg aggataagtt cttcttccat tgcccaatca caattaactt caagtccagc 3900
ggcgccaaca agttcaatga cgagatcaat ctcctgctca aggagaaggc gaacgatgtg 3960
catatcctct ccattgcccg cggcgagcgg catctcgcct actacaccct ggtcgatggc 4020
aagggcaata tcattaagca ggataccttc aacatcattg gcaatgaccg catgaaaacc 4080
aactaccacg ataagctcgc cgcgatcgag aaggataggg acagcgcgcg caaggactgg 4140
aagaagatca acaatattaa ggagatgaag gagggctacc tctcacaggt ggtccatgag 4200
attgccaagc tcgtgatcga gtacaatgcg attgtggtgt tcgaggacct caacttcggc 4260
ttcaagcggg gcaggttcaa ggtggagaag caggtctacc agaagctgga gaagatgctc 4320
atcgagaagc tcaattacct ggtgttcaag gataacgagt tcgacaagac cggaggagtc 4380
ctcagggcat accagctgac cgcgccattc gagacattca agaagatggg caagcagaca 4440
ggcatcatct actacgtgcc agccggcttc acctccaaga tctgccctgt gacaggcttc 4500
gtcaaccagc tctaccctaa gtacgagtcc gtcagcaagt cacaggagtt cttcagcaag 4560
ttcgataaga tttgctacaa tctggacaag ggctacttcg agttctcatt cgattacaag 4620
aacttcggcg acaaggccgc gaagggcaag tggaccattg cgtctttcgg ctcccggctc 4680
atcaacttca ggaatagcga taagaaccac aattgggaca ccagggaggt gtacccgaca 4740
aaggagctgg agaagctgct caaggactac tcaatcgagt acggccatgg cgagtgcatt 4800
aaggccgcga tctgcggcga gagcgataag aagttcttcg ccaagctgac ctcagtcctc 4860
aatacaatcc tgcagatgag gaactctaag accggcacag agctggacta cctgatttcc 4920
ccagtggccg atgtcaacgg caatttcttc gacagccgcc aggcgccgaa gaatatgcca 4980
caggacgccg acgccaacgg cgcataccac atcggcctca agggcctgat gctgctcgga 5040
cgcattaaga acaatcagga gggcaagaag ctcaacctcg tgatcaagaa cgaggagtac 5100
ttcgagttcg tccaaaaccg gaacaattcc ggcggcagcc caaagaagaa gaggaaggtg 5160
agcggcggca gcccaaagaa gaagcgcaag gtctag 5196
<210> 12
<211> 4980
<212> DNA
<213> Artificial Sequence
<220>
<223> dLBCPF1-ABE7.10-2NLS
<400> 12
atgccaaaaa agaagagaaa ggtttcaggc ggctcctccg aggtggagtt ctctcacgag 60
tattggatga ggcacgctct tacacttgct aagagagctt gggacgaaag agaagtgcca 120
gttggcgccg ttcttgtgca taataatagg gtgatcggcg agggttggaa tagaccaatt 180
ggaaggcatg atccaacagc tcacgcagag attatggctc tcagacaagg cggcctcgtt 240
atgcagaact acaggctcat tgacgctaca ctctacgtga cactcgaacc ttgcgttatg 300
tgcgccggag ctatgattca ttctaggatt ggcagggtcg tgtttggagc tagggacgct 360
aaaacaggag ccgccggatc tcttatggac gtgttgcatc atccaggcat gaaccatagg 420
gtggagatta cagagggcat tcttgcagac gagtgcgctg ctcttctttc cgatttcttc 480
aggatgagaa ggcaggagat taaggcccag aagaaggctc agtcttctac agatagcgga 540
ggatcttccg gaggatctag cggctccgag acaccaggaa catccgaaag cgctacacca 600
gaatctagcg gaggctcttc cggaggatct tctgaagtgg agttctccca cgagtattgg 660
atgaggcacg ctcttacact tgctaaaagg gctagggacg aaagggaagt tccagttgga 720
gctgttctcg tgctcaataa cagggtgatt ggcgagggtt ggaatagagc cattggactc 780
catgatccaa cagctcacgc agagattatg gctcttagac aaggcggcct cgttatgcag 840
aattacagac tcatcgacgc cacactctac gttaccttcg aaccttgcgt tatgtgcgcc 900
ggagctatga tccattctag gattggcagg gtcgtgttcg gcgttagaaa cgctaagaca 960
ggagctgcag gctctcttat ggacgttctt cattacccag gcatgaatca tagagtggag 1020
atcacagaag gcattcttgc agacgagtgc gcagctctcc tttgctattt cttcaggatg 1080
ccgaggcaag ttttcaacgc tcagaagaag gcccagtctt ctacagattc cggcggatct 1140
tctggaggat ctagcggctc cgagacacca ggaacatccg aatccgctac accagagtct 1200
tctggaggat ctagcggagg atctcttaag tcaaagctcg agaaattcac caactgttat 1260
tcgttgagca aaacactgcg gtttaaagcg attccagtcg gcaagactca agagaatata 1320
gacaataagc ggctgttggt ggaagatgaa aagcgcgcgg aagactacaa aggggtgaag 1380
aagttgttgg acagatacta cctctctttt atcaatgatg tcttgcactc aatcaaattg 1440
aagaatctga acaactacat ctccctcttc agaaagaaaa caaggacaga aaaggagaat 1500
aaggaacttg aaaatttgga gatcaatctg aggaaagaga tcgcgaaagc ctttaaaggc 1560
aacgaaggat acaaaagtct gttcaagaag gatataattg agacaatttt gccagagttc 1620
ctcgatgaca aggacgagat tgcgctggtc aattcgttca acggattcac aacagcattc 1680
acaggcttct ttgataatcg ggaaaatatg ttctctgagg aggcaaagtc cacttctatt 1740
gcgttcaggt gtatcaatga gaatctcact aggtacattt ccaacatgga tatctttgag 1800
aaggttgacg caatttttga caagcacgaa gttcaggaga ttaaggagaa gatcctcaat 1860
tccgattatg acgttgagga cttcttcgaa ggtgagtttt ttaatttcgt gctcactcaa 1920
gagggtatcg acgtgtataa tgcgatcatc ggtgggttcg tgactgagtc cggtgaaaag 1980
attaagggat tgaacgagta tatcaacctt tacaaccaaa agacgaaaca gaagctgcca 2040
aagttcaagc ctctttacaa acaggttctt tcagaccgcg agtcactctc gttctatggg 2100
gagggctaca cttcggatga ggaagtcctg gaggtgttca ggaatactct caataagaat 2160
tcggagattt tctcttctat aaaaaaactg gaaaagttgt ttaagaattt tgacgaatac 2220
tctagcgccg gcatatttgt gaaaaacggc ccggccatat caacgataag taaagatatc 2280
ttcggcgaat ggaacgtgat cagagacaaa tggaacgcgg agtatgacga tattcacctg 2340
aagaagaagg ctgtcgtaac ggagaagtac gaggatgatc gcaggaaaag cttcaaaaag 2400
atcggaagtt tcagcctgga acagttgcag gagtatgctg acgccgatct tagcgtcgtc 2460
gagaagttga aggagataat catccaaaag gtcgacgaga tatataaagt ctatggatca 2520
agtgaaaaac tgttcgacgc cgacttcgtt ttggagaagt ccctgaagaa gaacgacgct 2580
gttgttgcca ttatgaagga tctgctcgac agcgtgaaga gtttcgagaa ctatattaag 2640
gcttttttcg gggaggggaa ggagactaac agagatgagt ccttctacgg agacttcgtc 2700
ctcgcgtacg atatactcct taaggtagac cacatctacg acgcaatcag aaattacgtg 2760
acacaaaagc cgtacagcaa ggacaagttc aaactctact tccagaaccc ccagttcatg 2820
ggcggctggg acaaggacaa ggaaacggat tacagggcta cgatcctgag gtatggttca 2880
aaatactact tggcgattat ggacaagaag tacgccaagt gtctccagaa gattgacaaa 2940
gacgatgtca atggcaatta tgagaagatc aactacaagc tgcttccggg tccgaacaag 3000
atgctcccaa aggttttctt cagcaagaaa tggatggcct actataaccc aagcgaggac 3060
atccagaaga tttataagaa cggtacgttc aagaagggcg acatgttcaa tcttaacgac 3120
tgtcacaagc tgatcgactt cttcaaagac tcaattagcc ggtacccaaa gtggtctaac 3180
gcctatgact tcaacttttc ggaaaccgag aagtacaagg atatagccgg attttataga 3240
gaggtggaag agcagggcta caaggtgtca ttcgagtccg ccagcaagaa ggaagtggac 3300
aagctcgtgg aagagggtaa gctctacatg ttccagattt ataataaaga ctttagcgat 3360
aagagccacg ggacacctaa tctccacaca atgtatttca agctgctctt cgacgagaat 3420
aaccacggcc aaatcaggtt gtcaggaggg gctgaactct tcatgcggcg cgctagcctt 3480
aagaaggagg agcttgtagt ccaccctgcg aatagtccaa ttgcgaataa gaacccggac 3540
aatcctaaaa agactacaac attgagctac gacgtgtaca aggataagag gttttccgag 3600
gatcagtacg agctccacat cccgattgcg atcaacaagt gcccaaagaa tattttcaag 3660
ataaacacag aggtgcgtgt actcctgaag catgacgaca atccttacgt cattgggatt 3720
gctcggggcg agaggaacct cctctatatt gtggtggtgg acgggaaggg gaacatagtc 3780
gaacagtact cccttaacga aataattaac aatttcaacg gcatccgtat caagaccgac 3840
taccattcgt tgctggacaa gaaggagaag gagagatttg aggcgcggca aaattggaca 3900
agtatcgaga acatcaagga actcaaagca ggttatatct ctcaagttgt gcataagata 3960
tgcgagctgg ttgagaagta tgacgcagtg atcgctcttg aggacctcaa ctcgggcttt 4020
aagaattcta gagttaaagt ggagaagcag gtctatcaaa agttcgagaa gatgcttata 4080
gataagctca actacatggt cgataagaaa tcgaacccat gtgccaccgg cggcgcactc 4140
aaaggttacc aaataacaaa caaattcgag tccttcaaat cgatgagtac tcagaatggg 4200
ttcatatttt atataccggc gtggcttacg tctaagatcg acccgtcaac tggttttgtc 4260
aacctgttga agacgaaata cacgtccatt gccgattcga aaaagttcat atctagtttt 4320
gatcgtatta tgtacgtccc agaggaagat cttttcgagt ttgctctcga ctacaaaaac 4380
ttttcgcgga ccgatgcgga ttacattaaa aaatggaaac tctattcgta cggcaacaga 4440
atcaggattt ttcgcaaccc taagaagaat aacgtctttg attgggagga agtttgcttg 4500
actagcgcgt acaaggagct ctttaataag tatggcatta actaccaaca gggtgatatc 4560
agagcactgc tttgcgaaca atctgacaag gctttctact catccttcat ggctttgatg 4620
agcctgatgc tccagatgag aaattcaatt acaggcagaa ccgacgtgga tttcttgatc 4680
tccccggtta aaaattctga tggcatcttt tacgatagca ggaactatga agcgcaagag 4740
aatgcgattc tgccaaaaaa tgcagacgcc aacggtgcct ataacatcgc caggaaagtc 4800
ctgtgggcga tcggccagtt caaaaaggcc gaagacgaaa aattggacaa ggtcaaaatc 4860
gctatcagca acaaagagtg gctggagtat gctcagacat ccgtaaagca ttccggcggc 4920
agcccaaaga agaagaggaa ggtgagcggc ggcagcccaa agaagaagcg caaggtctag 4980
<210> 13
<211> 6506
<212> DNA
<213> Artificial Sequence
<220>
<223> dLBCPF1-ABE2-X
<400> 13
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaacatgc caaaaaagaa gagaaaggtt tcaggcggct 1560
cctccgaggt ggagttctct cacgagtatt ggatgaggca cgctcttaca cttgctaaga 1620
gagcttggga cgaaagagaa gtgccagttg gcgccgttct tgtgcataat aatagggtga 1680
tcggcgaggg ttggaataga ccaattggaa ggcatgatcc aacagctcac gcagagatta 1740
tggctctcag acaaggcggc ctcgttatgc agaactacag gctcattgac gctacactct 1800
acgtgacact cgaaccttgc gttatgtgcg ccggagctat gattcattct aggattggca 1860
gggtcgtgtt tggagctagg gacgctaaaa caggagccgc cggatctctt atggacgtgt 1920
tgcatcatcc aggcatgaac catagggtgg agattacaga gggcattctt gcagacgagt 1980
gcgctgctct tctttccgat ttcttcagga tgagaaggca ggagattaag gcccagaaga 2040
aggctcagtc ttctacagat agcggaggat cttccggagg atctagcggc tccgagacac 2100
caggaacatc cgaaagcgct acaccagaat ctagcggagg ctcttccgga ggatcttctg 2160
aagtggagtt ctcccacgag tattggatga ggcacgctct tacacttgct aaaagggcta 2220
gggacgaaag ggaagttcca gttggagctg ttctcgtgct caataacagg gtgattggcg 2280
agggttggaa tagagccatt ggactccatg atccaacagc tcacgcagag attatggctc 2340
ttagacaagg cggcctcgtt atgcagaatt acagactcat cgacgccaca ctctacgtta 2400
ccttcgaacc ttgcgttatg tgcgccggag ctatgatcca ttctaggatt ggcagggtcg 2460
tgttcggcgt tagaaacgct aagacaggag ctgcaggctc tcttatggac gttcttcatt 2520
acccaggcat gaatcataga gtggagatca cagaaggcat tcttgcagac gagtgcgcag 2580
ctctcctttg ctatttcttc aggatgccga ggcaagtttt caacgctcag aagaaggccc 2640
agtcttctac agattccggc ggatcttctg gaggatctag cggctccgag acaccaggaa 2700
catccgaatc cgctacacca gagtcttctg gaggatctag cggaggatct cttaagtcaa 2760
agctcgagaa attcaccaac tgttattcgt tgagcaaaac actgcggttt aaagcgattc 2820
cagtcggcaa gactcaagag aatatagaca ataagcggct gttggtggaa gatgaaaagc 2880
gcgcggaaga ctacaaaggg gtgaagaagt tgttggacag atactacctc tcttttatca 2940
atgatgtctt gcactcaatc aaattgaaga atctgaacaa ctacatctcc ctcttcagaa 3000
agaaaacaag gacagaaaag gagaataagg aacttgaaaa tttggagatc aatctgagga 3060
aagagatcgc gaaagccttt aaaggcaacg aaggatacaa aagtctgttc aagaaggata 3120
taattgagac aattttgcca gagttcctcg atgacaagga cgagattgcg ctggtcaatt 3180
cgttcaacgg attcacaaca gcattcacag gcttctttga taatcgggaa aatatgttct 3240
ctgaggaggc aaagtccact tctattgcgt tcaggtgtat caatgagaat ctcactaggt 3300
acatttccaa catggatatc tttgagaagg ttgacgcaat ttttgacaag cacgaagttc 3360
aggagattaa ggagaagatc ctcaattccg attatgacgt tgaggacttc ttcgaaggtg 3420
agttttttaa tttcgtgctc actcaagagg gtatcgacgt gtataatgcg atcatcggtg 3480
ggttcgtgac tgagtccggt gaaaagatta agggattgaa cgagtatatc aacctttaca 3540
accaaaagac gaaacagaag ctgccaaagt tcaagcctct ttacaaacag gttctttcag 3600
accgcgagtc actctcgttc tatggggagg gctacacttc ggatgaggaa gtcctggagg 3660
tgttcaggaa tactctcaat aagaattcgg agattttctc ttctataaaa aaactggaaa 3720
agttgtttaa gaattttgac gaatactcta gcgccggcat atttgtgaaa aacggcccgg 3780
ccatatcaac gataagtaaa gatatcttcg gcgaatggaa cgtgatcaga gacaaatgga 3840
acgcggagta tgacgatatt cacctgaaga agaaggctgt cgtaacggag aagtacgagg 3900
atgatcgcag gaaaagcttc aaaaagatcg gaagtttcag cctggaacag ttgcaggagt 3960
atgctgacgc cgatcttagc gtcgtcgaga agttgaagga gataatcatc caaaaggtcg 4020
acgagatata taaagtctat ggatcaagtg aaaaactgtt cgacgccgac ttcgttttgg 4080
agaagtccct gaagaagaac gacgctgttg ttgccattat gaaggatctg ctcgacagcg 4140
tgaagagttt cgagaactat attaaggctt ttttcgggga ggggaaggag actaacagag 4200
atgagtcctt ctacggagac ttcgtcctcg cgtacgatat actccttaag gtagaccaca 4260
tctacgacgc aatcagaaat tacgtgacac aaaagccgta cagcaaggac aagttcaaac 4320
tctacttcca gaacccccag ttcatgggcg gctgggacaa ggacaaggaa acggattaca 4380
gggctacgat cctgaggtat ggttcaaaat actacttggc gattatggac aagaagtacg 4440
ccaagtgtct ccagaagatt gacaaagacg atgtcaatgg caattatgag aagatcaact 4500
acaagctgct tccgggtccg aacaagatgc tcccaaaggt tttcttcagc aagaaatgga 4560
tggcctacta taacccaagc gaggacatcc agaagattta taagaacggt acgttcaaga 4620
agggcgacat gttcaatctt aacgactgtc acaagctgat cgacttcttc aaagactcaa 4680
ttagccggta cccaaagtgg tctaacgcct atgacttcaa cttttcggaa accgagaagt 4740
acaaggatat agccggattt tatagagagg tggaagagca gggctacaag gtgtcattcg 4800
agtccgccag caagaaggaa gtggacaagc tcgtggaaga gggtaagctc tacatgttcc 4860
agatttataa taaagacttt agcgataaga gccacgggac acctaatctc cacacaatgt 4920
atttcaagct gctcttcgac gagaataacc acggccaaat caggttgtca ggaggggctg 4980
aactcttcat gcggcgcgct agccttaaga aggaggagct tgtagtccac cctgcgaata 5040
gtccaattgc gaataagaac ccggacaatc ctaaaaagac tacaacattg agctacgacg 5100
tgtacaagga taagaggttt tccgaggatc agtacgagct ccacatcccg attgcgatca 5160
acaagtgccc aaagaatatt ttcaagataa acacagaggt gcgtgtactc ctgaagcatg 5220
acgacaatcc ttacgtcatt gggattgctc ggggcgagag gaacctcctc tatattgtgg 5280
tggtggacgg gaaggggaac atagtcgaac agtactccct taacgaaata attaacaatt 5340
tcaacggcat ccgtatcaag accgactacc attcgttgct ggacaagaag gagaaggaga 5400
gatttgaggc gcggcaaaat tggacaagta tcgagaacat caaggaactc aaagcaggtt 5460
atatctctca agttgtgcat aagatatgcg agctggttga gaagtatgac gcagtgatcg 5520
ctcttgagga cctcaactcg ggctttaaga attctagagt taaagtggag aagcaggtct 5580
atcaaaagtt cgagaagatg cttatagata agctcaacta catggtcgat aagaaatcga 5640
acccatgtgc caccggcggc gcactcaaag gttaccaaat aacaaacaaa ttcgagtcct 5700
tcaaatcgat gagtactcag aatgggttca tattttatat accggcgtgg cttacgtcta 5760
agatcgaccc gtcaactggt tttgtcaacc tgttgaagac gaaatacacg tccattgccg 5820
attcgaaaaa gttcatatct agttttgatc gtattatgta cgtcccagag gaagatcttt 5880
tcgagtttgc tctcgactac aaaaactttt cgcggaccga tgcggattac attaaaaaat 5940
ggaaactcta ttcgtacggc aacagaatca ggatttttcg caaccctaag aagaataacg 6000
tctttgattg ggaggaagtt tgcttgacta gcgcgtacaa ggagctcttt aataagtatg 6060
gcattaacta ccaacagggt gatatcagag cactgctttg cgaacaatct gacaaggctt 6120
tctactcatc cttcatggct ttgatgagcc tgatgctcca gatgagaaat tcaattacag 6180
gcagaaccga cgtggatttc ttgatctccc cggttaaaaa ttctgatggc atcttttacg 6240
atagcaggaa ctatgaagcg caagagaatg cgattctgcc aaaaaatgca gacgccaacg 6300
gtgcctataa catcgccagg aaagtcctgt gggcgatcgg ccagttcaaa aaggccgaag 6360
acgaaaaatt ggacaaggtc aaaatcgcta tcagcaacaa agagtggctg gagtatgctc 6420
agacatccgt aaagcattcc ggcggcagcc caaagaagaa gaggaaggtg agcggcggca 6480
gcccaaagaa gaagcgcaag gtctag 6506
<210> 14
<211> 4971
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X2
<400> 14
atgtcaaagc tcgagaaatt caccaactgt tattcgttga gcaaaacact gcggtttaaa 60
gcgattccag tcggcaagac tcaagagaat atagacaata agcggctgtt ggtggaagat 120
gaaaagcgcg cggaagacta caaaggggtg aagaagttgt tggacagata ctacctctct 180
tttatcaatg atgtcttgca ctcaatcaaa ttgaagaatc tgaacaacta catctccctc 240
ttcagaaaga aaacaaggac agaaaaggag aataaggaac ttgaaaattt ggagatcaat 300
ctgaggaaag agatcgcgaa agcctttaaa ggcaacgaag gatacaaaag tctgttcaag 360
aaggatataa ttgagacaat tttgccagag ttcctcgatg acaaggacga gattgcgctg 420
gtcaattcgt tcaacggatt cacaacagca ttcacaggct tctttgataa tcgggaaaat 480
atgttctctg aggaggcaaa gtccacttct attgcgttca ggtgtatcaa tgagaatctc 540
actaggtaca tttccaacat ggatatcttt gagaaggttg acgcaatttt tgacaagcac 600
gaagttcagg agattaagga gaagatcctc aattccgatt atgacgttga ggacttcttc 660
gaaggtgagt tttttaattt cgtgctcact caagagggta tcgacgtgta taatgcgatc 720
atcggtgggt tcgtgactga gtccggtgaa aagattaagg gattgaacga gtatatcaac 780
ctttacaacc aaaagacgaa acagaagctg ccaaagttca agcctcttta caaacaggtt 840
ctttcagacc gcgagtcact ctcgttctat ggggagggct acacttcgga tgaggaagtc 900
ctggaggtgt tcaggaatac tctcaataag aattcggaga ttttctcttc tataaaaaaa 960
ctggaaaagt tgtttaagaa ttttgacgaa tactctagcg ccggcatatt tgtgaaaaac 1020
ggcccggcca tatcaacgat aagtaaagat atcttcggcg aatggaacgt gatcagagac 1080
aaatggaacg cggagtatga cgatattcac ctgaagaaga aggctgtcgt aacggagaag 1140
tacgaggatg atcgcaggaa aagcttcaaa aagatcggaa gtttcagcct ggaacagttg 1200
caggagtatg ctgacgccga tcttagcgtc gtcgagaagt tgaaggagat aatcatccaa 1260
aaggtcgacg agatatataa agtctatgga tcaagtgaaa aactgttcga cgccgacttc 1320
gttttggaga agtccctgaa gaagaacgac gctgttgttg ccattatgaa ggatctgctc 1380
gacagcgtga agagtttcga gaactatatt aaggcttttt tcggggaggg gaaggagact 1440
aacagagatg agtccttcta cggagacttc gtcctcgcgt acgatatact ccttaaggta 1500
gaccacatct acgacgcaat cagaaattac gtgacacaaa agccgtacag caaggacaag 1560
ttcaaactct acttccagaa cccccagttc atgggcggct gggacaagga caaggaaacg 1620
gattacaggg ctacgatcct gaggtatggt tcaaaatact acttggcgat tatggacaag 1680
aagtacgcca agtgtctcca gaagattgac aaagacgatg tcaatggcaa ttatgagaag 1740
atcaactaca agctgcttcc gggtccgaac aagatgctcc caaaggtttt cttcagcaag 1800
aaatggatgg cctactataa cccaagcgag gacatccaga agatttataa gaacggtacg 1860
ttcaagaagg gcgacatgtt caatcttaac gactgtcaca agctgatcga cttcttcaaa 1920
gactcaatta gccggtaccc aaagtggtct aacgcctatg acttcaactt ttcggaaacc 1980
gagaagtaca aggatatagc cggattttat agagaggtgg aagagcaggg ctacaaggtg 2040
tcattcgagt ccgccagcaa gaaggaagtg gacaagctcg tggaagaggg taagctctac 2100
atgttccaga tttataataa agactttagc gataagagcc acgggacacc taatctccac 2160
acaatgtatt tcaagctgct cttcgacgag aataaccacg gccaaatcag gttgtcagga 2220
ggggctgaac tcttcatgcg gcgcgctagc cttaagaagg aggagcttgt agtccaccct 2280
gcgaatagtc caattgcgaa taagaacccg gacaatccta aaaagactac aacattgagc 2340
tacgacgtgt acaaggataa gaggttttcc gaggatcagt acgagctcca catcccgatt 2400
gcgatcaaca agtgcccaaa gaatattttc aagataaaca cagaggtgcg tgtactcctg 2460
aagcatgacg acaatcctta cgtcattggg attgctcggg gcgagaggaa cctcctctat 2520
attgtggtgg tggacgggaa ggggaacata gtcgaacagt actcccttaa cgaaataatt 2580
aacaatttca acggcatccg tatcaagacc gactaccatt cgttgctgga caagaaggag 2640
aaggagagat ttgaggcgcg gcaaaattgg acaagtatcg agaacatcaa ggaactcaaa 2700
gcaggttata tctctcaagt tgtgcataag atatgcgagc tggttgagaa gtatgacgca 2760
gtgatcgctc ttgaggacct caactcgggc tttaagaatt ctagagttaa agtggagaag 2820
caggtctatc aaaagttcga gaagatgctt atagataagc tcaactacat ggtcgataag 2880
aaatcgaacc catgtgccac cggcggcgca ctcaaaggtt accaaataac aaacaaattc 2940
gagtccttca aatcgatgag tactcagaat gggttcatat tttatatacc ggcgtggctt 3000
acgtctaaga tcgacccgtc aactggtttt gtcaacctgt tgaagacgaa atacacgtcc 3060
attgccgatt cgaaaaagtt catatctagt tttgatcgta ttatgtacgt cccagaggaa 3120
gatcttttcg agtttgctct cgactacaaa aacttttcgc ggaccgatgc ggattacatt 3180
aaaaaatgga aactctattc gtacggcaac agaatcagga tttttcgcaa ccctaagaag 3240
aataacgtct ttgattggga ggaagtttgc ttgactagcg cgtacaagga gctctttaat 3300
aagtatggca ttaactacca acagggtgat atcagagcac tgctttgcga acaatctgac 3360
aaggctttct actcatcctt catggctttg atgagcctga tgctccagat gagaaattca 3420
attacaggca gaaccgacgt ggatttcttg atctccccgg ttaaaaattc tgatggcatc 3480
ttttacgata gcaggaacta tgaagcgcaa gagaatgcga ttctgccaaa aaatgcagac 3540
gccaacggtg cctataacat cgccaggaaa gtcctgtggg cgatcggcca gttcaaaaag 3600
gccgaagacg aaaaattgga caaggtcaaa atcgctatca gcaacaaaga gtggctggag 3660
tatgctcaga catccgtaaa gcataagctt atgccaaaaa agaagagaaa ggtttcaggc 3720
ggctcctccg aggtggagtt ctctcacgag tattggatga ggcacgctct tacacttgct 3780
aagagagctt gggacgaaag agaagtgcca gttggcgccg ttcttgtgca taataatagg 3840
gtgatcggcg agggttggaa tagaccaatt ggaaggcatg atccaacagc tcacgcagag 3900
attatggctc tcagacaagg cggcctcgtt atgcagaact acaggctcat tgacgctaca 3960
ctctacgtga cactcgaacc ttgcgttatg tgcgccggag ctatgattca ttctaggatt 4020
ggcagggtcg tgtttggagc tagggacgct aaaacaggag ccgccggatc tcttatggac 4080
gtgttgcatc atccaggcat gaaccatagg gtggagatta cagagggcat tcttgcagac 4140
gagtgcgctg ctcttctttc cgatttcttc aggatgagaa ggcaggagat taaggcccag 4200
aagaaggctc agtcttctac agatagcgga ggatcttccg gaggatctag cggctccgag 4260
acaccaggaa catccgaaag cgctacacca gaatctagcg gaggctcttc cggaggatct 4320
tctgaagtgg agttctccca cgagtattgg atgaggcacg ctcttacact tgctaaaagg 4380
gctagggacg aaagggaagt tccagttgga gctgttctcg tgctcaataa cagggtgatt 4440
ggcgagggtt ggaatagagc cattggactc catgatccaa cagctcacgc agagattatg 4500
gctcttagac aaggcggcct cgttatgcag aattacagac tcatcgacgc cacactctac 4560
gttaccttcg aaccttgcgt tatgtgcgcc ggagctatga tccattctag gattggcagg 4620
gtcgtgttcg gcgttagaaa cgctaagaca ggagctgcag gctctcttat ggacgttctt 4680
cattacccag gcatgaatca tagagtggag atcacagaag gcattcttgc agacgagtgc 4740
gcagctctcc tttgctattt cttcaggatg ccgaggcaag ttttcaacgc tcagaagaag 4800
gcccagtctt ctacagattc cggcggatct tctggaggat ctagcggctc cgagacacca 4860
ggaacatccg aatccgctac accagagtct tctggaggat ctagcggagg atctcttaag 4920
aagagaccag cagctacaaa gaaggccgga caagctaaga agaagaagta g 4971
<210> 15
<211> 4992
<212> DNA
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X3
<400> 15
atgccaaaaa agaagagaaa ggtttcaaag ctcgagaaat tcaccaactg ttattcgttg 60
agcaaaacac tgcggtttaa agcgattcca gtcggcaaga ctcaagagaa tatagacaat 120
aagcggctgt tggtggaaga tgaaaagcgc gcggaagact acaaaggggt gaagaagttg 180
ttggacagat actacctctc ttttatcaat gatgtcttgc actcaatcaa attgaagaat 240
ctgaacaact acatctccct cttcagaaag aaaacaagga cagaaaagga gaataaggaa 300
cttgaaaatt tggagatcaa tctgaggaaa gagatcgcga aagcctttaa aggcaacgaa 360
ggatacaaaa gtctgttcaa gaaggatata attgagacaa ttttgccaga gttcctcgat 420
gacaaggacg agattgcgct ggtcaattcg ttcaacggat tcacaacagc attcacaggc 480
ttctttgata atcgggaaaa tatgttctct gaggaggcaa agtccacttc tattgcgttc 540
aggtgtatca atgagaatct cactaggtac atttccaaca tggatatctt tgagaaggtt 600
gacgcaattt ttgacaagca cgaagttcag gagattaagg agaagatcct caattccgat 660
tatgacgttg aggacttctt cgaaggtgag ttttttaatt tcgtgctcac tcaagagggt 720
atcgacgtgt ataatgcgat catcggtggg ttcgtgactg agtccggtga aaagattaag 780
ggattgaacg agtatatcaa cctttacaac caaaagacga aacagaagct gccaaagttc 840
aagcctcttt acaaacaggt tctttcagac cgcgagtcac tctcgttcta tggggagggc 900
tacacttcgg atgaggaagt cctggaggtg ttcaggaata ctctcaataa gaattcggag 960
attttctctt ctataaaaaa actggaaaag ttgtttaaga attttgacga atactctagc 1020
gccggcatat ttgtgaaaaa cggcccggcc atatcaacga taagtaaaga tatcttcggc 1080
gaatggaacg tgatcagaga caaatggaac gcggagtatg acgatattca cctgaagaag 1140
aaggctgtcg taacggagaa gtacgaggat gatcgcagga aaagcttcaa aaagatcgga 1200
agtttcagcc tggaacagtt gcaggagtat gctgacgccg atcttagcgt cgtcgagaag 1260
ttgaaggaga taatcatcca aaaggtcgac gagatatata aagtctatgg atcaagtgaa 1320
aaactgttcg acgccgactt cgttttggag aagtccctga agaagaacga cgctgttgtt 1380
gccattatga aggatctgct cgacagcgtg aagagtttcg agaactatat taaggctttt 1440
ttcggggagg ggaaggagac taacagagat gagtccttct acggagactt cgtcctcgcg 1500
tacgatatac tccttaaggt agaccacatc tacgacgcaa tcagaaatta cgtgacacaa 1560
aagccgtaca gcaaggacaa gttcaaactc tacttccaga acccccagtt catgggcggc 1620
tgggacaagg acaaggaaac ggattacagg gctacgatcc tgaggtatgg ttcaaaatac 1680
tacttggcga ttatggacaa gaagtacgcc aagtgtctcc agaagattga caaagacgat 1740
gtcaatggca attatgagaa gatcaactac aagctgcttc cgggtccgaa caagatgctc 1800
ccaaaggttt tcttcagcaa gaaatggatg gcctactata acccaagcga ggacatccag 1860
aagatttata agaacggtac gttcaagaag ggcgacatgt tcaatcttaa cgactgtcac 1920
aagctgatcg acttcttcaa agactcaatt agccggtacc caaagtggtc taacgcctat 1980
gacttcaact tttcggaaac cgagaagtac aaggatatag ccggatttta tagagaggtg 2040
gaagagcagg gctacaaggt gtcattcgag tccgccagca agaaggaagt ggacaagctc 2100
gtggaagagg gtaagctcta catgttccag atttataata aagactttag cgataagagc 2160
cacgggacac ctaatctcca cacaatgtat ttcaagctgc tcttcgacga gaataaccac 2220
ggccaaatca ggttgtcagg aggggctgaa ctcttcatgc ggcgcgctag ccttaagaag 2280
gaggagcttg tagtccaccc tgcgaatagt ccaattgcga ataagaaccc ggacaatcct 2340
aaaaagacta caacattgag ctacgacgtg tacaaggata agaggttttc cgaggatcag 2400
tacgagctcc acatcccgat tgcgatcaac aagtgcccaa agaatatttt caagataaac 2460
acagaggtgc gtgtactcct gaagcatgac gacaatcctt acgtcattgg gattgctcgg 2520
ggcgagagga acctcctcta tattgtggtg gtggacggga aggggaacat agtcgaacag 2580
tactccctta acgaaataat taacaatttc aacggcatcc gtatcaagac cgactaccat 2640
tcgttgctgg acaagaagga gaaggagaga tttgaggcgc ggcaaaattg gacaagtatc 2700
gagaacatca aggaactcaa agcaggttat atctctcaag ttgtgcataa gatatgcgag 2760
ctggttgaga agtatgacgc agtgatcgct cttgaggacc tcaactcggg ctttaagaat 2820
tctagagtta aagtggagaa gcaggtctat caaaagttcg agaagatgct tatagataag 2880
ctcaactaca tggtcgataa gaaatcgaac ccatgtgcca ccggcggcgc actcaaaggt 2940
taccaaataa caaacaaatt cgagtccttc aaatcgatga gtactcagaa tgggttcata 3000
ttttatatac cggcgtggct tacgtctaag atcgacccgt caactggttt tgtcaacctg 3060
ttgaagacga aatacacgtc cattgccgat tcgaaaaagt tcatatctag ttttgatcgt 3120
attatgtacg tcccagagga agatcttttc gagtttgctc tcgactacaa aaacttttcg 3180
cggaccgatg cggattacat taaaaaatgg aaactctatt cgtacggcaa cagaatcagg 3240
atttttcgca accctaagaa gaataacgtc tttgattggg aggaagtttg cttgactagc 3300
gcgtacaagg agctctttaa taagtatggc attaactacc aacagggtga tatcagagca 3360
ctgctttgcg aacaatctga caaggctttc tactcatcct tcatggcttt gatgagcctg 3420
atgctccaga tgagaaattc aattacaggc agaaccgacg tggatttctt gatctccccg 3480
gttaaaaatt ctgatggcat cttttacgat agcaggaact atgaagcgca agagaatgcg 3540
attctgccaa aaaatgcaga cgccaacggt gcctataaca tcgccaggaa agtcctgtgg 3600
gcgatcggcc agttcaaaaa ggccgaagac gaaaaattgg acaaggtcaa aatcgctatc 3660
agcaacaaag agtggctgga gtatgctcag acatccgtaa agcataagct tatgccaaaa 3720
aagaagagaa aggtttcagg cggctcctcc gaggtggagt tctctcacga gtattggatg 3780
aggcacgctc ttacacttgc taagagagct tgggacgaaa gagaagtgcc agttggcgcc 3840
gttcttgtgc ataataatag ggtgatcggc gagggttgga atagaccaat tggaaggcat 3900
gatccaacag ctcacgcaga gattatggct ctcagacaag gcggcctcgt tatgcagaac 3960
tacaggctca ttgacgctac actctacgtg acactcgaac cttgcgttat gtgcgccgga 4020
gctatgattc attctaggat tggcagggtc gtgtttggag ctagggacgc taaaacagga 4080
gccgccggat ctcttatgga cgtgttgcat catccaggca tgaaccatag ggtggagatt 4140
acagagggca ttcttgcaga cgagtgcgct gctcttcttt ccgatttctt caggatgaga 4200
aggcaggaga ttaaggccca gaagaaggct cagtcttcta cagatagcgg aggatcttcc 4260
ggaggatcta gcggctccga gacaccagga acatccgaaa gcgctacacc agaatctagc 4320
ggaggctctt ccggaggatc ttctgaagtg gagttctccc acgagtattg gatgaggcac 4380
gctcttacac ttgctaaaag ggctagggac gaaagggaag ttccagttgg agctgttctc 4440
gtgctcaata acagggtgat tggcgagggt tggaatagag ccattggact ccatgatcca 4500
acagctcacg cagagattat ggctcttaga caaggcggcc tcgttatgca gaattacaga 4560
ctcatcgacg ccacactcta cgttaccttc gaaccttgcg ttatgtgcgc cggagctatg 4620
atccattcta ggattggcag ggtcgtgttc ggcgttagaa acgctaagac aggagctgca 4680
ggctctctta tggacgttct tcattaccca ggcatgaatc atagagtgga gatcacagaa 4740
ggcattcttg cagacgagtg cgcagctctc ctttgctatt tcttcaggat gccgaggcaa 4800
gttttcaacg ctcagaagaa ggcccagtct tctacagatt ccggcggatc ttctggagga 4860
tctagcggct ccgagacacc aggaacatcc gaatccgcta caccagagtc ttctggagga 4920
tctagcggag gatctcttaa gaagagacca gcagctacaa agaaggccgg acaagctaag 4980
aagaagaagt ag 4992
<210> 16
<211> 5686
<212> DNA
<213> Artificial Sequence
<220>
<223> PJIT163-GFP
<400> 16
gagctcggta cctgacccgg tcgtgcccct ctctagagat aatgagcatt gcatgtctaa 60
gttataaaaa attaccacat attttttttg tcacacttgt ttgaagtgca gtttatctat 120
ctttatacat atatttaaac tttactctac gaataatata atctatagta ctacaataat 180
atcagtgttt tagagaatca tataaatgaa cagttagaca tggtctaaag gacaattgag 240
tattttgaca acaggactct acagttttat ctttttagtg tgcatgtgtt ctcctttttt 300
tttgcaaata gcttcaccta tataatactt catccatttt attagtacat ccatttaggg 360
tttagggtta atggttttta tagactaatt tttttagtac atctatttta ttctatttta 420
gcctctaaat taagaaaact aaaactctat tttagttttt ttatttaata atttagatat 480
aaaatagaat aaaataaagt gactaaaaat taaacaaata ccctttaaga aattaaaaaa 540
actaaggaaa catttttctt gtttcgagta gataatgcca gcctgttaaa cgccgtcgac 600
gagtctaacg gacaccaacc agcgaaccag cagcgtcgcg tcgggccaag cgaagcagac 660
ggcacggcat ctctgtcgct gcctctggac ccctctcgat cgagagttcc gctccaccgt 720
tggacttgct ccgctgtcgg catccagaaa ttgcgtggcg gagcggcaga cgtgagccgg 780
cacggcaggc ggcctcctcc tcctctcacg gcaccggcag ctacggggga ttcctttccc 840
accgctcctt cgctttccct tcctcgcccg ccgtaataaa tagacacccc ctccacaccc 900
tctttcccca acctcgtgtt gttcggagcg cacacacaca caaccagatc tcccccaaat 960
ccacccgtcg gcacctccgc ttcaaggtac gccgctcgtc ctcccccccc ccccctctct 1020
accttctcta gatcggcgtt ccggtccatg gttagggccc ggtagttcta cttctgttca 1080
tgtttgtgtt agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc 1140
gacctgtacg tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc 1200
tgggatggct ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg 1260
catagggttt ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg 1320
gtcatctttt catgcttttt tttgtcttgg ttgtgatgat gtggtctggt tgggcggtcg 1380
ttctagatcg gagtagaatt aattctgttt caaactacct ggtggattta ttaattttgg 1440
atctgtatgt gtgtgccata catattcata gttacgaatt gaagatgatg gatggaaata 1500
tcgatctagg ataggtatac atgttgatgc gggttttact gatgcatata cagagatgct 1560
ttttgttcgc ttggttgtga tgatgtggtg tggttgggcg gtcgttcatt cgttctagat 1620
cggagtagaa tactgtttca aactacctgg tgtatttatt aattttggaa ctgtatgtgt 1680
gtgtcataca tcttcatagt tacgagttta agatggatgg aaatatcgat ctaggatagg 1740
tatacatgtt gatgtgggtt ttactgatgc atatacatga tggcatatgc agcatctatt 1800
catatgctct aaccttgagt acctatctat tataataaac aagtatgttt tataattatt 1860
ttgatcttga tatacttgga tgatggcata tgcagcagct atatgtggat ttttttagcc 1920
ctgccttcat acgctattta tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt 1980
gtttggtgtt acttctgcaa agcttgtcga cggatccatg gtgagcaagg gcgaggagct 2040
gttcaccggg gtggtgccca tcctggtcga gctggacggc gacgtaaacg gccacaagtt 2100
cagcgtgtcc ggcgagggcg agggcgatgc cacctacggc aagctgaccc tgaagttcat 2160
ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc gtgaccacct tcacctacgg 2220
cgtgcagtgc ttcagccgct accccgacca catgaagcag cacgacttct tcaagtccgc 2280
catgcccgaa ggctacgtcc aggagcgcac catcttcttc aaggacgacg gcaactacaa 2340
gacccgcgcc gaggtgaagt tcgagggcga caccctggtg aaccgcatcg agctgaaggg 2400
catcgacttc aaggaggacg gcaacatcct ggggcacaag ctggagtaca actacaacag 2460
ccacaacgtc tatatcatgg ccgacaagca gaagaacggc atcaaggtga acttcaagat 2520
ccgccacaac atcgaggacg gcagcgtgca gctcgccgac cactaccagc agaacacccc 2580
catcggcgac ggccccgtgc tgctgcccga caaccactac ctgagcaccc agtccgccct 2640
gagcaaagac cccaacgaga agcgcgatca catggtcctg ctggagttcg tgaccgccgc 2700
cgggatcact cacggcatgg acgagctgta caagtaaccc gggaattcgg tacgctgaaa 2760
tcaccagtct ctctctacaa atctatctct ctctattttc tccataaata atgtgtgagt 2820
agtttcccga taagggaaat tagggttctt atagggtttc gctcatgtgt tgagcatata 2880
agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa tttctaattc 2940
ctaaaaccaa aatccagtac taaaatccag atctcctaaa gtccctatag atctttgtcg 3000
tgaatataaa ccagacacga gacgactaaa cctggagccc agacgccgtt cgaagctaga 3060
agtaccgctt aggcaggagg ccgttaggga aaagatgcta aggcagggtt ggttacgttg 3120
actcccccgt aggtttggtt taaatatgat gaagtggacg gaaggaagga ggaagacaag 3180
gaaggataag gttgcaggcc ctgtgcaagg taagaagatg gaaatttgat agaggtacgc 3240
tactatactt atactatacg ctaagggaat gcttgtattt ataccctata ccccctaata 3300
accccttatc aatttaagaa ataatccgca taagcccccg cttaaaaatt ggtatcagag 3360
ccatgaatag gtctatgacc aaaactcaag aggataaaac ctcaccaaaa tacgaaagag 3420
ttcttaactc taaagataaa agatctttca agatcaaaac tagttccctc acaccggagc 3480
atgcgatatc ctcgagagat ctaggcgtaa tcatggtcat agctgtttcc tgtgtgaaat 3540
tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg 3600
ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag 3660
tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 3720
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 3780
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 3840
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 3900
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 3960
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 4020
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 4080
tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta tctcagttcg 4140
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 4200
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 4260
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 4320
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 4380
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 4440
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 4500
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 4560
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 4620
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 4680
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 4740
gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 4800
gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 4860
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 4920
attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 4980
gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 5040
tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 5100
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 5160
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 5220
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 5280
tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 5340
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 5400
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 5460
tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 5520
aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 5580
tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 5640
cgcacatttc cccgaaaagt gccacctgcc agtgccaagc taattc 5686
<210> 17
<211> 720
<212> DNA
<213> Artificial Sequence
<220>
<223> pBUI-mGFP
<400> 17
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180
ctcgtgacca ccttcaccta cggcgtgtag tgcttcagcc gctaccccga ccacatgaag 240
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660
ctgctggagt tcgtgaccgc cgccgggatc actcacggca tggacgagct gtacaagtaa 720
<210> 18
<211> 1307
<212> PRT
<213> Acidaminococcus sp.
<400> 18
Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln
20 25 30
Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys
35 40 45
Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln
50 55 60
Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile
65 70 75 80
Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile
85 90 95
Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly
100 105 110
Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile
115 120 125
Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys
130 135 140
Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg
145 150 155 160
Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg
165 170 175
Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg
180 185 190
Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe
195 200 205
Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn
210 215 220
Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val
225 230 235 240
Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp
245 250 255
Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu
260 265 270
Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn
275 280 285
Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro
290 295 300
Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu
305 310 315 320
Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr
325 330 335
Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu
340 345 350
Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His
355 360 365
Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr
370 375 380
Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys
385 390 395 400
Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu
405 410 415
Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser
420 425 430
Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala
435 440 445
Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys
450 455 460
Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu
465 470 475 480
Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe
485 490 495
Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser
500 505 510
Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val
515 520 525
Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp
530 535 540
Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn
545 550 555 560
Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys
565 570 575
Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys
580 585 590
Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys
595 600 605
Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr
610 615 620
Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys
625 630 635 640
Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln
645 650 655
Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala
660 665 670
Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr
675 680 685
Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr
690 695 700
Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His
705 710 715 720
Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu
725 730 735
Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys
740 745 750
Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu
755 760 765
Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln
770 775 780
Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His
785 790 795 800
Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr
805 810 815
Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His
820 825 830
Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn
835 840 845
Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe
850 855 860
Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln
865 870 875 880
Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu
885 890 895
Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg
900 905 910
Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu
915 920 925
Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu
930 935 940
Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val
945 950 955 960
Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile
965 970 975
His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu
980 985 990
Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu
995 1000 1005
Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu
1010 1015 1020
Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly
1025 1030 1035
Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala
1040 1045 1050
Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro
1055 1060 1065
Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe
1070 1075 1080
Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu
1085 1090 1095
Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe
1100 1105 1110
Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly
1115 1120 1125
Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn
1130 1135 1140
Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys
1145 1150 1155
Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr
1160 1165 1170
Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu
1175 1180 1185
Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu
1190 1195 1200
Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu
1205 1210 1215
Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly
1220 1225 1230
Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys
1235 1240 1245
Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp
1250 1255 1260
Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu
1265 1270 1275
Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile
1280 1285 1290
Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn
1295 1300 1305
<210> 19
<211> 1300
<212> PRT
<213> Francisella novicida
<400> 19
Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys
20 25 30
Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys
35 40 45
Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu
50 55 60
Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser
65 70 75 80
Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys
85 90 95
Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr
100 105 110
Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile
115 120 125
Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln
130 135 140
Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr
145 150 155 160
Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr
165 170 175
Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser
180 185 190
Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu
195 200 205
Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys
210 215 220
Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu
225 230 235 240
Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg
245 250 255
Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr
260 265 270
Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys
275 280 285
Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile
290 295 300
Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys
305 310 315 320
Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser
325 330 335
Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met
340 345 350
Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys
355 360 365
Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln
370 375 380
Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr
385 390 395 400
Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala
405 410 415
Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn
420 425 430
Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala
435 440 445
Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn
450 455 460
Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala
465 470 475 480
Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys
485 490 495
Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys
500 505 510
Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp
515 520 525
Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His
530 535 540
Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His
545 550 555 560
Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val
565 570 575
Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser
580 585 590
Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly
595 600 605
Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys
610 615 620
Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile
625 630 635 640
Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys
645 650 655
Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val
660 665 670
Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile
675 680 685
Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
690 695 700
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe
705 710 715 720
Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp
725 730 735
Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu
740 745 750
Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn
755 760 765
Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr
770 775 780
Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg
785 790 795 800
Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
805 810 815
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr
820 825 830
Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala
835 840 845
Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu
850 855 860
Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe
865 870 875 880
His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe
885 890 895
Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His
900 905 910
Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu
915 920 925
Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
930 935 940
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile
945 950 955 960
Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn
965 970 975
Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile
980 985 990
Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu
995 1000 1005
Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val
1010 1015 1020
Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu
1025 1030 1035
Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg
1040 1045 1050
Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly
1055 1060 1065
Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser
1070 1075 1080
Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys
1085 1090 1095
Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp
1100 1105 1110
Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe
1115 1120 1125
Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr
1130 1135 1140
Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp
1145 1150 1155
Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu
1160 1165 1170
Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly
1175 1180 1185
Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe
1190 1195 1200
Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg
1205 1210 1215
Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val
1220 1225 1230
Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys
1235 1240 1245
Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly
1250 1255 1260
Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu
1265 1270 1275
Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu
1280 1285 1290
Phe Val Gln Asn Arg Asn Asn
1295 1300
<210> 20
<211> 1228
<212> PRT
<213> Lachnospiraceae bacterium
<400> 20
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
165 170 175
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
180 185 190
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
195 200 205
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
210 215 220
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
225 230 235 240
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
245 250 255
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
260 265 270
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
275 280 285
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
290 295 300
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
305 310 315 320
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
325 330 335
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
340 345 350
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
355 360 365
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
370 375 380
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
385 390 395 400
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
405 410 415
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
420 425 430
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
435 440 445
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
450 455 460
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
465 470 475 480
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
485 490 495
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
500 505 510
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
515 520 525
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
530 535 540
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
545 550 555 560
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
565 570 575
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
580 585 590
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
595 600 605
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
610 615 620
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
625 630 635 640
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
645 650 655
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
660 665 670
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
675 680 685
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
705 710 715 720
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
725 730 735
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
740 745 750
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
755 760 765
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
770 775 780
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
785 790 795 800
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
805 810 815
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp
820 825 830
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
835 840 845
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
850 855 860
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
865 870 875 880
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
885 890 895
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
900 905 910
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
915 920 925
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
930 935 940
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
945 950 955 960
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
965 970 975
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
980 985 990
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
995 1000 1005
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp
1010 1015 1020
Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro
1025 1030 1035
Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser
1040 1045 1050
Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr
1055 1060 1065
Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val
1070 1075 1080
Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu
1085 1090 1095
Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala
1100 1105 1110
Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met
1115 1120 1125
Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly
1130 1135 1140
Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp
1145 1150 1155
Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala
1160 1165 1170
Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala
1175 1180 1185
Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp
1190 1195 1200
Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp
1205 1210 1215
Leu Glu Tyr Ala Gln Thr Ser Val Lys His
1220 1225
<210> 21
<211> 1329
<212> PRT
<213> Artificial Sequence
<220>
<223> ASCPF1-2NLS
<400> 21
Met Thr Gln Phe Glu Gly Phe Thr Asn Leu Tyr Gln Val Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Lys His Ile Gln
20 25 30
Glu Gln Gly Phe Ile Glu Glu Asp Lys Ala Arg Asn Asp His Tyr Lys
35 40 45
Glu Leu Lys Pro Ile Ile Asp Arg Ile Tyr Lys Thr Tyr Ala Asp Gln
50 55 60
Cys Leu Gln Leu Val Gln Leu Asp Trp Glu Asn Leu Ser Ala Ala Ile
65 70 75 80
Asp Ser Tyr Arg Lys Glu Lys Thr Glu Glu Thr Arg Asn Ala Leu Ile
85 90 95
Glu Glu Gln Ala Thr Tyr Arg Asn Ala Ile His Asp Tyr Phe Ile Gly
100 105 110
Arg Thr Asp Asn Leu Thr Asp Ala Ile Asn Lys Arg His Ala Glu Ile
115 120 125
Tyr Lys Gly Leu Phe Lys Ala Glu Leu Phe Asn Gly Lys Val Leu Lys
130 135 140
Gln Leu Gly Thr Val Thr Thr Thr Glu His Glu Asn Ala Leu Leu Arg
145 150 155 160
Ser Phe Asp Lys Phe Thr Thr Tyr Phe Ser Gly Phe Tyr Glu Asn Arg
165 170 175
Lys Asn Val Phe Ser Ala Glu Asp Ile Ser Thr Ala Ile Pro His Arg
180 185 190
Ile Val Gln Asp Asn Phe Pro Lys Phe Lys Glu Asn Cys His Ile Phe
195 200 205
Thr Arg Leu Ile Thr Ala Val Pro Ser Leu Arg Glu His Phe Glu Asn
210 215 220
Val Lys Lys Ala Ile Gly Ile Phe Val Ser Thr Ser Ile Glu Glu Val
225 230 235 240
Phe Ser Phe Pro Phe Tyr Asn Gln Leu Leu Thr Gln Thr Gln Ile Asp
245 250 255
Leu Tyr Asn Gln Leu Leu Gly Gly Ile Ser Arg Glu Ala Gly Thr Glu
260 265 270
Lys Ile Lys Gly Leu Asn Glu Val Leu Asn Leu Ala Ile Gln Lys Asn
275 280 285
Asp Glu Thr Ala His Ile Ile Ala Ser Leu Pro His Arg Phe Ile Pro
290 295 300
Leu Phe Lys Gln Ile Leu Ser Asp Arg Asn Thr Leu Ser Phe Ile Leu
305 310 315 320
Glu Glu Phe Lys Ser Asp Glu Glu Val Ile Gln Ser Phe Cys Lys Tyr
325 330 335
Lys Thr Leu Leu Arg Asn Glu Asn Val Leu Glu Thr Ala Glu Ala Leu
340 345 350
Phe Asn Glu Leu Asn Ser Ile Asp Leu Thr His Ile Phe Ile Ser His
355 360 365
Lys Lys Leu Glu Thr Ile Ser Ser Ala Leu Cys Asp His Trp Asp Thr
370 375 380
Leu Arg Asn Ala Leu Tyr Glu Arg Arg Ile Ser Glu Leu Thr Gly Lys
385 390 395 400
Ile Thr Lys Ser Ala Lys Glu Lys Val Gln Arg Ser Leu Lys His Glu
405 410 415
Asp Ile Asn Leu Gln Glu Ile Ile Ser Ala Ala Gly Lys Glu Leu Ser
420 425 430
Glu Ala Phe Lys Gln Lys Thr Ser Glu Ile Leu Ser His Ala His Ala
435 440 445
Ala Leu Asp Gln Pro Leu Pro Thr Thr Leu Lys Lys Gln Glu Glu Lys
450 455 460
Glu Ile Leu Lys Ser Gln Leu Asp Ser Leu Leu Gly Leu Tyr His Leu
465 470 475 480
Leu Asp Trp Phe Ala Val Asp Glu Ser Asn Glu Val Asp Pro Glu Phe
485 490 495
Ser Ala Arg Leu Thr Gly Ile Lys Leu Glu Met Glu Pro Ser Leu Ser
500 505 510
Phe Tyr Asn Lys Ala Arg Asn Tyr Ala Thr Lys Lys Pro Tyr Ser Val
515 520 525
Glu Lys Phe Lys Leu Asn Phe Gln Met Pro Thr Leu Ala Ser Gly Trp
530 535 540
Asp Val Asn Lys Glu Lys Asn Asn Gly Ala Ile Leu Phe Val Lys Asn
545 550 555 560
Gly Leu Tyr Tyr Leu Gly Ile Met Pro Lys Gln Lys Gly Arg Tyr Lys
565 570 575
Ala Leu Ser Phe Glu Pro Thr Glu Lys Thr Ser Glu Gly Phe Asp Lys
580 585 590
Met Tyr Tyr Asp Tyr Phe Pro Asp Ala Ala Lys Met Ile Pro Lys Cys
595 600 605
Ser Thr Gln Leu Lys Ala Val Thr Ala His Phe Gln Thr His Thr Thr
610 615 620
Pro Ile Leu Leu Ser Asn Asn Phe Ile Glu Pro Leu Glu Ile Thr Lys
625 630 635 640
Glu Ile Tyr Asp Leu Asn Asn Pro Glu Lys Glu Pro Lys Lys Phe Gln
645 650 655
Thr Ala Tyr Ala Lys Lys Thr Gly Asp Gln Lys Gly Tyr Arg Glu Ala
660 665 670
Leu Cys Lys Trp Ile Asp Phe Thr Arg Asp Phe Leu Ser Lys Tyr Thr
675 680 685
Lys Thr Thr Ser Ile Asp Leu Ser Ser Leu Arg Pro Ser Ser Gln Tyr
690 695 700
Lys Asp Leu Gly Glu Tyr Tyr Ala Glu Leu Asn Pro Leu Leu Tyr His
705 710 715 720
Ile Ser Phe Gln Arg Ile Ala Glu Lys Glu Ile Met Asp Ala Val Glu
725 730 735
Thr Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ala Lys
740 745 750
Gly His His Gly Lys Pro Asn Leu His Thr Leu Tyr Trp Thr Gly Leu
755 760 765
Phe Ser Pro Glu Asn Leu Ala Lys Thr Ser Ile Lys Leu Asn Gly Gln
770 775 780
Ala Glu Leu Phe Tyr Arg Pro Lys Ser Arg Met Lys Arg Met Ala His
785 790 795 800
Arg Leu Gly Glu Lys Met Leu Asn Lys Lys Leu Lys Asp Gln Lys Thr
805 810 815
Pro Ile Pro Asp Thr Leu Tyr Gln Glu Leu Tyr Asp Tyr Val Asn His
820 825 830
Arg Leu Ser His Asp Leu Ser Asp Glu Ala Arg Ala Leu Leu Pro Asn
835 840 845
Val Ile Thr Lys Glu Val Ser His Glu Ile Ile Lys Asp Arg Arg Phe
850 855 860
Thr Ser Asp Lys Phe Phe Phe His Val Pro Ile Thr Leu Asn Tyr Gln
865 870 875 880
Ala Ala Asn Ser Pro Ser Lys Phe Asn Gln Arg Val Asn Ala Tyr Leu
885 890 895
Lys Glu His Pro Glu Thr Pro Ile Ile Gly Ile Asp Arg Gly Glu Arg
900 905 910
Asn Leu Ile Tyr Ile Thr Val Ile Asp Ser Thr Gly Lys Ile Leu Glu
915 920 925
Gln Arg Ser Leu Asn Thr Ile Gln Gln Phe Asp Tyr Gln Lys Lys Leu
930 935 940
Asp Asn Arg Glu Lys Glu Arg Val Ala Ala Arg Gln Ala Trp Ser Val
945 950 955 960
Val Gly Thr Ile Lys Asp Leu Lys Gln Gly Tyr Leu Ser Gln Val Ile
965 970 975
His Glu Ile Val Asp Leu Met Ile His Tyr Gln Ala Val Val Val Leu
980 985 990
Glu Asn Leu Asn Phe Gly Phe Lys Ser Lys Arg Thr Gly Ile Ala Glu
995 1000 1005
Lys Ala Val Tyr Gln Gln Phe Glu Lys Met Leu Ile Asp Lys Leu
1010 1015 1020
Asn Cys Leu Val Leu Lys Asp Tyr Pro Ala Glu Lys Val Gly Gly
1025 1030 1035
Val Leu Asn Pro Tyr Gln Leu Thr Asp Gln Phe Thr Ser Phe Ala
1040 1045 1050
Lys Met Gly Thr Gln Ser Gly Phe Leu Phe Tyr Val Pro Ala Pro
1055 1060 1065
Tyr Thr Ser Lys Ile Asp Pro Leu Thr Gly Phe Val Asp Pro Phe
1070 1075 1080
Val Trp Lys Thr Ile Lys Asn His Glu Ser Arg Lys His Phe Leu
1085 1090 1095
Glu Gly Phe Asp Phe Leu His Tyr Asp Val Lys Thr Gly Asp Phe
1100 1105 1110
Ile Leu His Phe Lys Met Asn Arg Asn Leu Ser Phe Gln Arg Gly
1115 1120 1125
Leu Pro Gly Phe Met Pro Ala Trp Asp Ile Val Phe Glu Lys Asn
1130 1135 1140
Glu Thr Gln Phe Asp Ala Lys Gly Thr Pro Phe Ile Ala Gly Lys
1145 1150 1155
Arg Ile Val Pro Val Ile Glu Asn His Arg Phe Thr Gly Arg Tyr
1160 1165 1170
Arg Asp Leu Tyr Pro Ala Asn Glu Leu Ile Ala Leu Leu Glu Glu
1175 1180 1185
Lys Gly Ile Val Phe Arg Asp Gly Ser Asn Ile Leu Pro Lys Leu
1190 1195 1200
Leu Glu Asn Asp Asp Ser His Ala Ile Asp Thr Met Val Ala Leu
1205 1210 1215
Ile Arg Ser Val Leu Gln Met Arg Asn Ser Asn Ala Ala Thr Gly
1220 1225 1230
Glu Asp Tyr Ile Asn Ser Pro Val Arg Asp Leu Asn Gly Val Cys
1235 1240 1245
Phe Asp Ser Arg Phe Gln Asn Pro Glu Trp Pro Met Asp Ala Asp
1250 1255 1260
Ala Asn Gly Ala Tyr His Ile Ala Leu Lys Gly Gln Leu Leu Leu
1265 1270 1275
Asn His Leu Lys Glu Ser Lys Asp Leu Lys Leu Gln Asn Gly Ile
1280 1285 1290
Ser Asn Gln Asp Trp Leu Ala Tyr Ile Gln Glu Leu Arg Asn Ser
1295 1300 1305
Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro
1310 1315 1320
Lys Lys Lys Arg Lys Val
1325
<210> 22
<211> 1322
<212> PRT
<213> Artificial Sequence
<220>
<223> FNCPF1-2NLS
<400> 22
Met Ser Ile Tyr Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Glu Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys
20 25 30
Ala Arg Gly Leu Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys
35 40 45
Lys Ala Lys Gln Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu
50 55 60
Ile Leu Ser Ser Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser
65 70 75 80
Asp Val Tyr Phe Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys
85 90 95
Asp Phe Lys Ser Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr
100 105 110
Ile Lys Asp Ser Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile
115 120 125
Asp Ala Lys Lys Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln
130 135 140
Ser Lys Asp Asn Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr
145 150 155 160
Asp Ile Asp Glu Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr
165 170 175
Thr Tyr Phe Lys Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser
180 185 190
Asn Asp Ile Pro Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu
195 200 205
Pro Lys Phe Leu Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys
210 215 220
Ala Pro Glu Ala Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu
225 230 235 240
Glu Leu Thr Phe Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg
245 250 255
Val Phe Ser Leu Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr
260 265 270
Leu Asn Gln Ser Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys
275 280 285
Phe Val Asn Gly Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile
290 295 300
Asn Leu Tyr Ser Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys
305 310 315 320
Met Ser Val Leu Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser
325 330 335
Phe Val Ile Asp Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met
340 345 350
Gln Ser Phe Tyr Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys
355 360 365
Ser Ile Lys Glu Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln
370 375 380
Lys Leu Asp Leu Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr
385 390 395 400
Asp Leu Ser Gln Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala
405 410 415
Val Leu Glu Tyr Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn
420 425 430
Pro Ser Lys Lys Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala
435 440 445
Lys Tyr Leu Ser Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn
450 455 460
Lys His Arg Asp Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala
465 470 475 480
Asn Phe Ala Ala Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys
485 490 495
Asp Asn Leu Ala Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys
500 505 510
Asp Leu Leu Gln Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp
515 520 525
Leu Leu Asp Gln Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His
530 535 540
Ile Ser Gln Ser Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His
545 550 555 560
Phe Tyr Leu Val Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val
565 570 575
Pro Leu Tyr Asn Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser
580 585 590
Asp Glu Lys Phe Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly
595 600 605
Trp Asp Lys Asn Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys
610 615 620
Asp Asp Lys Tyr Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile
625 630 635 640
Phe Asp Asp Lys Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys
645 650 655
Ile Val Tyr Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val
660 665 670
Phe Phe Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile
675 680 685
Leu Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
690 695 700
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe
705 710 715 720
Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp
725 730 735
Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu
740 745 750
Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn
755 760 765
Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr
770 775 780
Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg
785 790 795 800
Pro Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
805 810 815
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe Tyr
820 825 830
Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys Glu Ala
835 840 845
Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser Val Phe Glu
850 855 860
Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe
865 870 875 880
His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe
885 890 895
Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His
900 905 910
Ile Leu Ser Ile Asp Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu
915 920 925
Val Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
930 935 940
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala Ile
945 950 955 960
Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn
965 970 975
Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile
980 985 990
Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu
995 1000 1005
Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val
1010 1015 1020
Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu
1025 1030 1035
Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg
1040 1045 1050
Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly
1055 1060 1065
Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser
1070 1075 1080
Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys
1085 1090 1095
Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp
1100 1105 1110
Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe
1115 1120 1125
Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr
1130 1135 1140
Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp
1145 1150 1155
Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu
1160 1165 1170
Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly
1175 1180 1185
Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe
1190 1195 1200
Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg
1205 1210 1215
Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val
1220 1225 1230
Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys
1235 1240 1245
Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly
1250 1255 1260
Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu
1265 1270 1275
Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu
1280 1285 1290
Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser Pro Lys Lys Lys
1295 1300 1305
Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1310 1315 1320
<210> 23
<211> 1252
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-2NLS
<400> 23
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Glu Thr Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg
165 170 175
Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe
180 185 190
Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys
195 200 205
Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly
210 215 220
Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn
225 230 235 240
Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly
245 250 255
Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu
260 265 270
Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser
275 280 285
Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu
290 295 300
Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile
305 310 315 320
Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala
325 330 335
Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp
340 345 350
Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr
355 360 365
Asp Asp Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu
370 375 380
Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu
385 390 395 400
Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu
405 410 415
Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly
420 425 430
Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu
435 440 445
Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser
450 455 460
Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys
465 470 475 480
Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr
485 490 495
Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr
500 505 510
Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln
515 520 525
Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr
530 535 540
Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met
545 550 555 560
Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val
565 570 575
Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn
580 585 590
Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr
595 600 605
Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys
610 615 620
Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe
625 630 635 640
Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp
645 650 655
Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr
660 665 670
Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser
675 680 685
Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe
690 695 700
Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn
705 710 715 720
Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly
725 730 735
Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser
740 745 750
Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala
755 760 765
Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp
770 775 780
Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile
785 790 795 800
Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr
805 810 815
Glu Val Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly
820 825 830
Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly
835 840 845
Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn
850 855 860
Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys
865 870 875 880
Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu
885 890 895
Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys
900 905 910
Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp
915 920 925
Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val
930 935 940
Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val
945 950 955 960
Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr
965 970 975
Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn
980 985 990
Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro
995 1000 1005
Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile
1010 1015 1020
Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr
1025 1030 1035
Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn
1040 1045 1050
Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr
1055 1060 1065
Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn
1070 1075 1080
Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys
1085 1090 1095
Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile
1100 1105 1110
Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser
1115 1120 1125
Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile
1130 1135 1140
Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn
1145 1150 1155
Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu
1160 1165 1170
Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn
1175 1180 1185
Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala
1190 1195 1200
Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys
1205 1210 1215
Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser Gly Gly
1220 1225 1230
Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys
1235 1240 1245
Lys Arg Lys Val
1250
<210> 24
<211> 1664
<212> PRT
<213> Artificial Sequence
<220>
<223> dFNCPF1-PBE-2NLS
<400> 24
Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala
1 5 10 15
Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val
20 25 30
Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu
35 40 45
Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn
50 55 60
Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu
65 70 75 80
Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser
85 90 95
Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser
100 105 110
Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His
115 120 125
His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser
130 135 140
Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp
145 150 155 160
Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg
165 170 175
Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile
180 185 190
Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro
195 200 205
Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg
210 215 220
Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu
225 230 235 240
Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Ile Tyr
245 250 255
Gln Glu Phe Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu
260 265 270
Leu Ile Pro Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu
275 280 285
Ile Leu Asp Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln
290 295 300
Ile Ile Asp Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser
305 310 315 320
Val Cys Ile Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe
325 330 335
Lys Leu Lys Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser
340 345 350
Ala Lys Asp Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser
355 360 365
Glu Lys Phe Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys
370 375 380
Gly Gln Glu Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn
385 390 395 400
Gly Ile Glu Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu
405 410 415
Ala Leu Glu Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys
420 425 430
Gly Phe His Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro
435 440 445
Thr Ser Ile Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu
450 455 460
Glu Asn Lys Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala
465 470 475 480
Ile Asn Tyr Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe
485 490 495
Asp Ile Asp Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu
500 505 510
Asp Glu Val Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser
515 520 525
Gly Ile Thr Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly
530 535 540
Glu Asn Thr Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser
545 550 555 560
Gln Gln Ile Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu
565 570 575
Phe Lys Gln Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp
580 585 590
Lys Leu Glu Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr
595 600 605
Glu Gln Ile Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu
610 615 620
Thr Leu Ser Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu
625 630 635 640
Ser Lys Ile Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln
645 650 655
Gln Val Phe Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr
660 665 670
Ile Thr Gln Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys
675 680 685
Glu Gln Glu Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser
690 695 700
Leu Glu Thr Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp
705 710 715 720
Ile Asp Lys Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala
725 730 735
Ile Pro Met Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala
740 745 750
Gln Ile Ser Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln
755 760 765
Ala Ser Ala Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln
770 775 780
Thr Asn Asn Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser
785 790 795 800
Glu Asp Lys Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val
805 810 815
Phe Glu Glu Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn
820 825 830
Lys Ile Arg Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe
835 840 845
Lys Leu Asn Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn
850 855 860
Lys Glu Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr
865 870 875 880
Tyr Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys
885 890 895
Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr Lys
900 905 910
Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe Ser Ala
915 920 925
Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu Arg Ile Arg
930 935 940
Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln Lys Gly Tyr Glu
945 950 955 960
Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys Phe Ile Asp Phe Tyr
965 970 975
Lys Gln Ser Ile Ser Lys His Pro Glu Trp Lys Asp Phe Gly Phe Arg
980 985 990
Phe Ser Asp Thr Gln Arg Tyr Asn Ser Ile Asp Glu Phe Tyr Arg Glu
995 1000 1005
Val Glu Asn Gln Gly Tyr Lys Leu Thr Phe Glu Asn Ile Ser Glu
1010 1015 1020
Ser Tyr Ile Asp Ser Val Val Asn Gln Gly Lys Leu Tyr Leu Phe
1025 1030 1035
Gln Ile Tyr Asn Lys Asp Phe Ser Ala Tyr Ser Lys Gly Arg Pro
1040 1045 1050
Asn Leu His Thr Leu Tyr Trp Lys Ala Leu Phe Asp Glu Arg Asn
1055 1060 1065
Leu Gln Asp Val Val Tyr Lys Leu Asn Gly Glu Ala Glu Leu Phe
1070 1075 1080
Tyr Arg Lys Gln Ser Ile Pro Lys Lys Ile Thr His Pro Ala Lys
1085 1090 1095
Glu Ala Ile Ala Asn Lys Asn Lys Asp Asn Pro Lys Lys Glu Ser
1100 1105 1110
Val Phe Glu Tyr Asp Leu Ile Lys Asp Lys Arg Phe Thr Glu Asp
1115 1120 1125
Lys Phe Phe Phe His Cys Pro Ile Thr Ile Asn Phe Lys Ser Ser
1130 1135 1140
Gly Ala Asn Lys Phe Asn Asp Glu Ile Asn Leu Leu Leu Lys Glu
1145 1150 1155
Lys Ala Asn Asp Val His Ile Leu Ser Ile Ala Arg Gly Glu Arg
1160 1165 1170
His Leu Ala Tyr Tyr Thr Leu Val Asp Gly Lys Gly Asn Ile Ile
1175 1180 1185
Lys Gln Asp Thr Phe Asn Ile Ile Gly Asn Asp Arg Met Lys Thr
1190 1195 1200
Asn Tyr His Asp Lys Leu Ala Ala Ile Glu Lys Asp Arg Asp Ser
1205 1210 1215
Ala Arg Lys Asp Trp Lys Lys Ile Asn Asn Ile Lys Glu Met Lys
1220 1225 1230
Glu Gly Tyr Leu Ser Gln Val Val His Glu Ile Ala Lys Leu Val
1235 1240 1245
Ile Glu Tyr Asn Ala Ile Val Val Phe Glu Asp Leu Asn Phe Gly
1250 1255 1260
Phe Lys Arg Gly Arg Phe Lys Val Glu Lys Gln Val Tyr Gln Lys
1265 1270 1275
Leu Glu Lys Met Leu Ile Glu Lys Leu Asn Tyr Leu Val Phe Lys
1280 1285 1290
Asp Asn Glu Phe Asp Lys Thr Gly Gly Val Leu Arg Ala Tyr Gln
1295 1300 1305
Leu Thr Ala Pro Phe Glu Thr Phe Lys Lys Met Gly Lys Gln Thr
1310 1315 1320
Gly Ile Ile Tyr Tyr Val Pro Ala Gly Phe Thr Ser Lys Ile Cys
1325 1330 1335
Pro Val Thr Gly Phe Val Asn Gln Leu Tyr Pro Lys Tyr Glu Ser
1340 1345 1350
Val Ser Lys Ser Gln Glu Phe Phe Ser Lys Phe Asp Lys Ile Cys
1355 1360 1365
Tyr Asn Leu Asp Lys Gly Tyr Phe Glu Phe Ser Phe Asp Tyr Lys
1370 1375 1380
Asn Phe Gly Asp Lys Ala Ala Lys Gly Lys Trp Thr Ile Ala Ser
1385 1390 1395
Phe Gly Ser Arg Leu Ile Asn Phe Arg Asn Ser Asp Lys Asn His
1400 1405 1410
Asn Trp Asp Thr Arg Glu Val Tyr Pro Thr Lys Glu Leu Glu Lys
1415 1420 1425
Leu Leu Lys Asp Tyr Ser Ile Glu Tyr Gly His Gly Glu Cys Ile
1430 1435 1440
Lys Ala Ala Ile Cys Gly Glu Ser Asp Lys Lys Phe Phe Ala Lys
1445 1450 1455
Leu Thr Ser Val Leu Asn Thr Ile Leu Gln Met Arg Asn Ser Lys
1460 1465 1470
Thr Gly Thr Glu Leu Asp Tyr Leu Ile Ser Pro Val Ala Asp Val
1475 1480 1485
Asn Gly Asn Phe Phe Asp Ser Arg Gln Ala Pro Lys Asn Met Pro
1490 1495 1500
Gln Asp Ala Asp Ala Asn Gly Ala Tyr His Ile Gly Leu Lys Gly
1505 1510 1515
Leu Met Leu Leu Gly Arg Ile Lys Asn Asn Gln Glu Gly Lys Lys
1520 1525 1530
Leu Asn Leu Val Ile Lys Asn Glu Glu Tyr Phe Glu Phe Val Gln
1535 1540 1545
Asn Arg Asn Asn Thr Arg Asp Ser Gly Gly Ser Thr Asn Leu Ser
1550 1555 1560
Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val Ile Gln Glu
1565 1570 1575
Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val Ile Gly Asn
1580 1585 1590
Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr Asp Glu Ser
1595 1600 1605
Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala Pro Glu Tyr
1610 1615 1620
Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly Glu Asn Lys
1625 1630 1635
Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1640 1645 1650
Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1655 1660
<210> 25
<211> 1592
<212> PRT
<213> Artificial Sequence
<220>
<223> dLBCPF1-PBE-2NLS
<400> 25
Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala
1 5 10 15
Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val
20 25 30
Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu
35 40 45
Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn
50 55 60
Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu
65 70 75 80
Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser
85 90 95
Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser
100 105 110
Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His
115 120 125
His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser
130 135 140
Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp
145 150 155 160
Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg
165 170 175
Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile
180 185 190
Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro
195 200 205
Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg
210 215 220
Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu
225 230 235 240
Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Ser Lys Leu
245 250 255
Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys
260 265 270
Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu
275 280 285
Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys
290 295 300
Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser
305 310 315 320
Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys
325 330 335
Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn
340 345 350
Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys
355 360 365
Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu
370 375 380
Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr
385 390 395 400
Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu
405 410 415
Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu
420 425 430
Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile
435 440 445
Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser
450 455 460
Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val
465 470 475 480
Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe
485 490 495
Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn
500 505 510
Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu
515 520 525
Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu
530 535 540
Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu
545 550 555 560
Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu
565 570 575
Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn
580 585 590
Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn
595 600 605
Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys
610 615 620
Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser
625 630 635 640
Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala
645 650 655
Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln
660 665 670
Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe
675 680 685
Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val
690 695 700
Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn
705 710 715 720
Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu
725 730 735
Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val
740 745 750
Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr
755 760 765
Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly
770 775 780
Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg
785 790 795 800
Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys
805 810 815
Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys
820 825 830
Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val
835 840 845
Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile
850 855 860
Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn
865 870 875 880
Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser
885 890 895
Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr
900 905 910
Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln
915 920 925
Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys
930 935 940
Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp
945 950 955 960
Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe
965 970 975
Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly
980 985 990
Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu
995 1000 1005
Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp
1010 1015 1020
Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp
1025 1030 1035
Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile Ala
1040 1045 1050
Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
1055 1060 1065
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile
1070 1075 1080
Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly
1085 1090 1095
Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn
1100 1105 1110
Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu
1115 1120 1125
Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr
1130 1135 1140
Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln
1145 1150 1155
Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val
1160 1165 1170
Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val
1175 1180 1185
Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile
1190 1195 1200
Asp Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala
1205 1210 1215
Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu
1220 1225 1230
Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile
1235 1240 1245
Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr Gly Phe Val
1250 1255 1260
Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys
1265 1270 1275
Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro Glu Glu Asp
1280 1285 1290
Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp
1295 1300 1305
Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg
1310 1315 1320
Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val Phe Asp Trp
1325 1330 1335
Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys
1340 1345 1350
Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys
1355 1360 1365
Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met Ala Leu Met
1370 1375 1380
Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly Arg Thr Asp
1385 1390 1395
Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp Gly Ile Phe
1400 1405 1410
Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro
1415 1420 1425
Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val
1430 1435 1440
Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu
1445 1450 1455
Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr
1460 1465 1470
Ala Gln Thr Ser Val Lys His Thr Arg Asp Ser Gly Gly Ser Thr
1475 1480 1485
Asn Leu Ser Asp Ile Ile Glu Lys Glu Thr Gly Lys Gln Leu Val
1490 1495 1500
Ile Gln Glu Ser Ile Leu Met Leu Pro Glu Glu Val Glu Glu Val
1505 1510 1515
Ile Gly Asn Lys Pro Glu Ser Asp Ile Leu Val His Thr Ala Tyr
1520 1525 1530
Asp Glu Ser Thr Asp Glu Asn Val Met Leu Leu Thr Ser Asp Ala
1535 1540 1545
Pro Glu Tyr Lys Pro Trp Ala Leu Val Ile Gln Asp Ser Asn Gly
1550 1555 1560
Glu Asn Lys Ile Lys Met Leu Ser Gly Gly Ser Pro Lys Lys Lys
1565 1570 1575
Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys Arg Lys Val
1580 1585 1590
<210> 26
<211> 1731
<212> PRT
<213> Artificial Sequence
<220>
<223> dFNCPF1-ABE7.10-2NLS
<400> 26
Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu
1 5 10 15
Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg
20 25 30
Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
35 40 45
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp
50 55 60
Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val
65 70 75 80
Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu
85 90 95
Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg
100 105 110
Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu
115 120 125
Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr
130 135 140
Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe
145 150 155 160
Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
165 170 175
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro
180 185 190
Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly
195 200 205
Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
210 215 220
Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly
225 230 235 240
Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg
245 250 255
Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu
260 265 270
Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
275 280 285
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile
290 295 300
His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr
305 310 315 320
Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn
325 330 335
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala
340 345 350
Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln
355 360 365
Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser
370 375 380
Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser
385 390 395 400
Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Ile Tyr Gln Glu Phe
405 410 415
Val Asn Lys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Glu Leu Ile Pro
420 425 430
Gln Gly Lys Thr Leu Glu Asn Ile Lys Ala Arg Gly Leu Ile Leu Asp
435 440 445
Asp Glu Lys Arg Ala Lys Asp Tyr Lys Lys Ala Lys Gln Ile Ile Asp
450 455 460
Lys Tyr His Gln Phe Phe Ile Glu Glu Ile Leu Ser Ser Val Cys Ile
465 470 475 480
Ser Glu Asp Leu Leu Gln Asn Tyr Ser Asp Val Tyr Phe Lys Leu Lys
485 490 495
Lys Ser Asp Asp Asp Asn Leu Gln Lys Asp Phe Lys Ser Ala Lys Asp
500 505 510
Thr Ile Lys Lys Gln Ile Ser Glu Tyr Ile Lys Asp Ser Glu Lys Phe
515 520 525
Lys Asn Leu Phe Asn Gln Asn Leu Ile Asp Ala Lys Lys Gly Gln Glu
530 535 540
Ser Asp Leu Ile Leu Trp Leu Lys Gln Ser Lys Asp Asn Gly Ile Glu
545 550 555 560
Leu Phe Lys Ala Asn Ser Asp Ile Thr Asp Ile Asp Glu Ala Leu Glu
565 570 575
Ile Ile Lys Ser Phe Lys Gly Trp Thr Thr Tyr Phe Lys Gly Phe His
580 585 590
Glu Asn Arg Lys Asn Val Tyr Ser Ser Asn Asp Ile Pro Thr Ser Ile
595 600 605
Ile Tyr Arg Ile Val Asp Asp Asn Leu Pro Lys Phe Leu Glu Asn Lys
610 615 620
Ala Lys Tyr Glu Ser Leu Lys Asp Lys Ala Pro Glu Ala Ile Asn Tyr
625 630 635 640
Glu Gln Ile Lys Lys Asp Leu Ala Glu Glu Leu Thr Phe Asp Ile Asp
645 650 655
Tyr Lys Thr Ser Glu Val Asn Gln Arg Val Phe Ser Leu Asp Glu Val
660 665 670
Phe Glu Ile Ala Asn Phe Asn Asn Tyr Leu Asn Gln Ser Gly Ile Thr
675 680 685
Lys Phe Asn Thr Ile Ile Gly Gly Lys Phe Val Asn Gly Glu Asn Thr
690 695 700
Lys Arg Lys Gly Ile Asn Glu Tyr Ile Asn Leu Tyr Ser Gln Gln Ile
705 710 715 720
Asn Asp Lys Thr Leu Lys Lys Tyr Lys Met Ser Val Leu Phe Lys Gln
725 730 735
Ile Leu Ser Asp Thr Glu Ser Lys Ser Phe Val Ile Asp Lys Leu Glu
740 745 750
Asp Asp Ser Asp Val Val Thr Thr Met Gln Ser Phe Tyr Glu Gln Ile
755 760 765
Ala Ala Phe Lys Thr Val Glu Glu Lys Ser Ile Lys Glu Thr Leu Ser
770 775 780
Leu Leu Phe Asp Asp Leu Lys Ala Gln Lys Leu Asp Leu Ser Lys Ile
785 790 795 800
Tyr Phe Lys Asn Asp Lys Ser Leu Thr Asp Leu Ser Gln Gln Val Phe
805 810 815
Asp Asp Tyr Ser Val Ile Gly Thr Ala Val Leu Glu Tyr Ile Thr Gln
820 825 830
Gln Ile Ala Pro Lys Asn Leu Asp Asn Pro Ser Lys Lys Glu Gln Glu
835 840 845
Leu Ile Ala Lys Lys Thr Glu Lys Ala Lys Tyr Leu Ser Leu Glu Thr
850 855 860
Ile Lys Leu Ala Leu Glu Glu Phe Asn Lys His Arg Asp Ile Asp Lys
865 870 875 880
Gln Cys Arg Phe Glu Glu Ile Leu Ala Asn Phe Ala Ala Ile Pro Met
885 890 895
Ile Phe Asp Glu Ile Ala Gln Asn Lys Asp Asn Leu Ala Gln Ile Ser
900 905 910
Ile Lys Tyr Gln Asn Gln Gly Lys Lys Asp Leu Leu Gln Ala Ser Ala
915 920 925
Glu Asp Asp Val Lys Ala Ile Lys Asp Leu Leu Asp Gln Thr Asn Asn
930 935 940
Leu Leu His Lys Leu Lys Ile Phe His Ile Ser Gln Ser Glu Asp Lys
945 950 955 960
Ala Asn Ile Leu Asp Lys Asp Glu His Phe Tyr Leu Val Phe Glu Glu
965 970 975
Cys Tyr Phe Glu Leu Ala Asn Ile Val Pro Leu Tyr Asn Lys Ile Arg
980 985 990
Asn Tyr Ile Thr Gln Lys Pro Tyr Ser Asp Glu Lys Phe Lys Leu Asn
995 1000 1005
Phe Glu Asn Ser Thr Leu Ala Asn Gly Trp Asp Lys Asn Lys Glu
1010 1015 1020
Pro Asp Asn Thr Ala Ile Leu Phe Ile Lys Asp Asp Lys Tyr Tyr
1025 1030 1035
Leu Gly Val Met Asn Lys Lys Asn Asn Lys Ile Phe Asp Asp Lys
1040 1045 1050
Ala Ile Lys Glu Asn Lys Gly Glu Gly Tyr Lys Lys Ile Val Tyr
1055 1060 1065
Lys Leu Leu Pro Gly Ala Asn Lys Met Leu Pro Lys Val Phe Phe
1070 1075 1080
Ser Ala Lys Ser Ile Lys Phe Tyr Asn Pro Ser Glu Asp Ile Leu
1085 1090 1095
Arg Ile Arg Asn His Ser Thr His Thr Lys Asn Gly Ser Pro Gln
1100 1105 1110
Lys Gly Tyr Glu Lys Phe Glu Phe Asn Ile Glu Asp Cys Arg Lys
1115 1120 1125
Phe Ile Asp Phe Tyr Lys Gln Ser Ile Ser Lys His Pro Glu Trp
1130 1135 1140
Lys Asp Phe Gly Phe Arg Phe Ser Asp Thr Gln Arg Tyr Asn Ser
1145 1150 1155
Ile Asp Glu Phe Tyr Arg Glu Val Glu Asn Gln Gly Tyr Lys Leu
1160 1165 1170
Thr Phe Glu Asn Ile Ser Glu Ser Tyr Ile Asp Ser Val Val Asn
1175 1180 1185
Gln Gly Lys Leu Tyr Leu Phe Gln Ile Tyr Asn Lys Asp Phe Ser
1190 1195 1200
Ala Tyr Ser Lys Gly Arg Pro Asn Leu His Thr Leu Tyr Trp Lys
1205 1210 1215
Ala Leu Phe Asp Glu Arg Asn Leu Gln Asp Val Val Tyr Lys Leu
1220 1225 1230
Asn Gly Glu Ala Glu Leu Phe Tyr Arg Lys Gln Ser Ile Pro Lys
1235 1240 1245
Lys Ile Thr His Pro Ala Lys Glu Ala Ile Ala Asn Lys Asn Lys
1250 1255 1260
Asp Asn Pro Lys Lys Glu Ser Val Phe Glu Tyr Asp Leu Ile Lys
1265 1270 1275
Asp Lys Arg Phe Thr Glu Asp Lys Phe Phe Phe His Cys Pro Ile
1280 1285 1290
Thr Ile Asn Phe Lys Ser Ser Gly Ala Asn Lys Phe Asn Asp Glu
1295 1300 1305
Ile Asn Leu Leu Leu Lys Glu Lys Ala Asn Asp Val His Ile Leu
1310 1315 1320
Ser Ile Ala Arg Gly Glu Arg His Leu Ala Tyr Tyr Thr Leu Val
1325 1330 1335
Asp Gly Lys Gly Asn Ile Ile Lys Gln Asp Thr Phe Asn Ile Ile
1340 1345 1350
Gly Asn Asp Arg Met Lys Thr Asn Tyr His Asp Lys Leu Ala Ala
1355 1360 1365
Ile Glu Lys Asp Arg Asp Ser Ala Arg Lys Asp Trp Lys Lys Ile
1370 1375 1380
Asn Asn Ile Lys Glu Met Lys Glu Gly Tyr Leu Ser Gln Val Val
1385 1390 1395
His Glu Ile Ala Lys Leu Val Ile Glu Tyr Asn Ala Ile Val Val
1400 1405 1410
Phe Glu Asp Leu Asn Phe Gly Phe Lys Arg Gly Arg Phe Lys Val
1415 1420 1425
Glu Lys Gln Val Tyr Gln Lys Leu Glu Lys Met Leu Ile Glu Lys
1430 1435 1440
Leu Asn Tyr Leu Val Phe Lys Asp Asn Glu Phe Asp Lys Thr Gly
1445 1450 1455
Gly Val Leu Arg Ala Tyr Gln Leu Thr Ala Pro Phe Glu Thr Phe
1460 1465 1470
Lys Lys Met Gly Lys Gln Thr Gly Ile Ile Tyr Tyr Val Pro Ala
1475 1480 1485
Gly Phe Thr Ser Lys Ile Cys Pro Val Thr Gly Phe Val Asn Gln
1490 1495 1500
Leu Tyr Pro Lys Tyr Glu Ser Val Ser Lys Ser Gln Glu Phe Phe
1505 1510 1515
Ser Lys Phe Asp Lys Ile Cys Tyr Asn Leu Asp Lys Gly Tyr Phe
1520 1525 1530
Glu Phe Ser Phe Asp Tyr Lys Asn Phe Gly Asp Lys Ala Ala Lys
1535 1540 1545
Gly Lys Trp Thr Ile Ala Ser Phe Gly Ser Arg Leu Ile Asn Phe
1550 1555 1560
Arg Asn Ser Asp Lys Asn His Asn Trp Asp Thr Arg Glu Val Tyr
1565 1570 1575
Pro Thr Lys Glu Leu Glu Lys Leu Leu Lys Asp Tyr Ser Ile Glu
1580 1585 1590
Tyr Gly His Gly Glu Cys Ile Lys Ala Ala Ile Cys Gly Glu Ser
1595 1600 1605
Asp Lys Lys Phe Phe Ala Lys Leu Thr Ser Val Leu Asn Thr Ile
1610 1615 1620
Leu Gln Met Arg Asn Ser Lys Thr Gly Thr Glu Leu Asp Tyr Leu
1625 1630 1635
Ile Ser Pro Val Ala Asp Val Asn Gly Asn Phe Phe Asp Ser Arg
1640 1645 1650
Gln Ala Pro Lys Asn Met Pro Gln Asp Ala Asp Ala Asn Gly Ala
1655 1660 1665
Tyr His Ile Gly Leu Lys Gly Leu Met Leu Leu Gly Arg Ile Lys
1670 1675 1680
Asn Asn Gln Glu Gly Lys Lys Leu Asn Leu Val Ile Lys Asn Glu
1685 1690 1695
Glu Tyr Phe Glu Phe Val Gln Asn Arg Asn Asn Ser Gly Gly Ser
1700 1705 1710
Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro Lys Lys Lys
1715 1720 1725
Arg Lys Val
1730
<210> 27
<211> 1659
<212> PRT
<213> Artificial Sequence
<220>
<223> dLBCPF1-ABE7.10-2NLS
<400> 27
Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu
1 5 10 15
Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg
20 25 30
Ala Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
35 40 45
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His Asp
50 55 60
Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val
65 70 75 80
Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu
85 90 95
Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg
100 105 110
Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu
115 120 125
Met Asp Val Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr
130 135 140
Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe
145 150 155 160
Arg Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
165 170 175
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro
180 185 190
Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly
195 200 205
Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
210 215 220
Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly
225 230 235 240
Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg
245 250 255
Ala Ile Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu
260 265 270
Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
275 280 285
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile
290 295 300
His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala Lys Thr
305 310 315 320
Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro Gly Met Asn
325 330 335
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala
340 345 350
Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln
355 360 365
Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser
370 375 380
Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser
385 390 395 400
Ser Gly Gly Ser Ser Gly Gly Ser Leu Lys Ser Lys Leu Glu Lys Phe
405 410 415
Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro
420 425 430
Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu
435 440 445
Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp
450 455 460
Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu
465 470 475 480
Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr
485 490 495
Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys
500 505 510
Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe
515 520 525
Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys
530 535 540
Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe
545 550 555 560
Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys
565 570 575
Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr
580 585 590
Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys
595 600 605
His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp
610 615 620
Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln
625 630 635 640
Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu
645 650 655
Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn
660 665 670
Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln
675 680 685
Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr
690 695 700
Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn
705 710 715 720
Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn
725 730 735
Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala
740 745 750
Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg
755 760 765
Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala
770 775 780
Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys
785 790 795 800
Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp
805 810 815
Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp
820 825 830
Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp
835 840 845
Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile
850 855 860
Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys
865 870 875 880
Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr
885 890 895
Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile
900 905 910
Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp
915 920 925
Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp
930 935 940
Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser
945 950 955 960
Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln
965 970 975
Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr
980 985 990
Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser
995 1000 1005
Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys
1010 1015 1020
Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu
1025 1030 1035
Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser
1040 1045 1050
Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu
1055 1060 1065
Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu
1070 1075 1080
Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu
1085 1090 1095
Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
1100 1105 1110
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu
1115 1120 1125
His Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly
1130 1135 1140
Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala
1145 1150 1155
Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro
1160 1165 1170
Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu
1175 1180 1185
Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr
1190 1195 1200
Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile
1205 1210 1215
Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp
1220 1225 1230
Asn Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu
1235 1240 1245
Tyr Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr
1250 1255 1260
Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys
1265 1270 1275
Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe
1280 1285 1290
Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu
1295 1300 1305
Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys Glu Leu
1310 1315 1320
Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn Ser
1325 1330 1335
Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
1340 1345 1350
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp
1355 1360 1365
Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr
1370 1375 1380
Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln
1385 1390 1395
Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile
1400 1405 1410
Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr
1415 1420 1425
Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile
1430 1435 1440
Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr
1445 1450 1455
Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys
1460 1465 1470
Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys
1475 1480 1485
Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala
1490 1495 1500
Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly
1505 1510 1515
Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr
1520 1525 1530
Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn
1535 1540 1545
Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val
1550 1555 1560
Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala
1565 1570 1575
Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala
1580 1585 1590
Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys
1595 1600 1605
Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser
1610 1615 1620
Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Ser
1625 1630 1635
Gly Gly Ser Pro Lys Lys Lys Arg Lys Val Ser Gly Gly Ser Pro
1640 1645 1650
Lys Lys Lys Arg Lys Val
1655
<210> 28
<211> 1656
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X2
<400> 28
Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
1 5 10 15
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
20 25 30
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
35 40 45
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
50 55 60
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
65 70 75 80
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
85 90 95
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
100 105 110
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
115 120 125
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
130 135 140
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
145 150 155 160
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
165 170 175
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
180 185 190
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
195 200 205
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
210 215 220
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
225 230 235 240
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
245 250 255
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
260 265 270
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
275 280 285
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
290 295 300
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
305 310 315 320
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
325 330 335
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
340 345 350
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
355 360 365
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
370 375 380
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
385 390 395 400
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
405 410 415
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
420 425 430
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
435 440 445
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
450 455 460
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
465 470 475 480
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
485 490 495
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
500 505 510
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
515 520 525
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
530 535 540
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
545 550 555 560
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
565 570 575
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
580 585 590
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
595 600 605
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
610 615 620
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
625 630 635 640
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
645 650 655
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
660 665 670
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
675 680 685
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
690 695 700
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
705 710 715 720
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
725 730 735
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
740 745 750
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
755 760 765
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
770 775 780
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
785 790 795 800
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
805 810 815
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Ala
820 825 830
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
835 840 845
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
850 855 860
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
865 870 875 880
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
885 890 895
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
900 905 910
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
915 920 925
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
930 935 940
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
945 950 955 960
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
965 970 975
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
980 985 990
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
995 1000 1005
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp
1010 1015 1020
Ser Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro
1025 1030 1035
Glu Glu Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser
1040 1045 1050
Arg Thr Asp Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr
1055 1060 1065
Gly Asn Arg Ile Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val
1070 1075 1080
Phe Asp Trp Glu Glu Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu
1085 1090 1095
Phe Asn Lys Tyr Gly Ile Asn Tyr Gln Gln Gly Asp Ile Arg Ala
1100 1105 1110
Leu Leu Cys Glu Gln Ser Asp Lys Ala Phe Tyr Ser Ser Phe Met
1115 1120 1125
Ala Leu Met Ser Leu Met Leu Gln Met Arg Asn Ser Ile Thr Gly
1130 1135 1140
Arg Thr Asp Val Asp Phe Leu Ile Ser Pro Val Lys Asn Ser Asp
1145 1150 1155
Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu Ala Gln Glu Asn Ala
1160 1165 1170
Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala Tyr Asn Ile Ala
1175 1180 1185
Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys Ala Glu Asp
1190 1195 1200
Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys Glu Trp
1205 1210 1215
Leu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Leu Met Pro Lys
1220 1225 1230
Lys Lys Arg Lys Val Ser Gly Gly Ser Ser Glu Val Glu Phe Ser
1235 1240 1245
His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala Lys Arg Ala
1250 1255 1260
Trp Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu Val His Asn
1265 1270 1275
Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Pro Ile Gly Arg His
1280 1285 1290
Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg Gln Gly Gly
1295 1300 1305
Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val
1310 1315 1320
Thr Leu Glu Pro Cys Val Met Cys Ala Gly Ala Met Ile His Ser
1325 1330 1335
Arg Ile Gly Arg Val Val Phe Gly Ala Arg Asp Ala Lys Thr Gly
1340 1345 1350
Ala Ala Gly Ser Leu Met Asp Val Leu His His Pro Gly Met Asn
1355 1360 1365
His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala
1370 1375 1380
Ala Leu Leu Ser Asp Phe Phe Arg Met Arg Arg Gln Glu Ile Lys
1385 1390 1395
Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser
1400 1405 1410
Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala
1415 1420 1425
Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser Ser Glu Val
1430 1435 1440
Glu Phe Ser His Glu Tyr Trp Met Arg His Ala Leu Thr Leu Ala
1445 1450 1455
Lys Arg Ala Arg Asp Glu Arg Glu Val Pro Val Gly Ala Val Leu
1460 1465 1470
Val Leu Asn Asn Arg Val Ile Gly Glu Gly Trp Asn Arg Ala Ile
1475 1480 1485
Gly Leu His Asp Pro Thr Ala His Ala Glu Ile Met Ala Leu Arg
1490 1495 1500
Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu Ile Asp Ala Thr
1505 1510 1515
Leu Tyr Val Thr Phe Glu Pro Cys Val Met Cys Ala Gly Ala Met
1520 1525 1530
Ile His Ser Arg Ile Gly Arg Val Val Phe Gly Val Arg Asn Ala
1535 1540 1545
Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val Leu His Tyr Pro
1550 1555 1560
Gly Met Asn His Arg Val Glu Ile Thr Glu Gly Ile Leu Ala Asp
1565 1570 1575
Glu Cys Ala Ala Leu Leu Cys Tyr Phe Phe Arg Met Pro Arg Gln
1580 1585 1590
Val Phe Asn Ala Gln Lys Lys Ala Gln Ser Ser Thr Asp Ser Gly
1595 1600 1605
Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr Pro Gly Thr Ser
1610 1615 1620
Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser Ser Gly Gly Ser
1625 1630 1635
Leu Lys Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys
1640 1645 1650
Lys Lys Lys
1655
<210> 29
<211> 1663
<212> PRT
<213> Artificial Sequence
<220>
<223> LBCPF1-ABE2-X3
<400> 29
Met Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr Asn
1 5 10 15
Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val Gly
20 25 30
Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp Glu
35 40 45
Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg Tyr
50 55 60
Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys Asn
65 70 75 80
Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu Lys
85 90 95
Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu Ile
100 105 110
Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys Lys
115 120 125
Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp Glu
130 135 140
Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr Gly
145 150 155 160
Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser Thr
165 170 175
Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile Ser
180 185 190
Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His Glu
195 200 205
Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val Glu
210 215 220
Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu Gly
225 230 235 240
Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser Gly
245 250 255
Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln Lys
260 265 270
Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val Leu
275 280 285
Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser Asp
290 295 300
Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser Glu
305 310 315 320
Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe Asp
325 330 335
Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile Ser
340 345 350
Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp Lys
355 360 365
Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val Val
370 375 380
Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile Gly
385 390 395 400
Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu Ser
405 410 415
Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu Ile
420 425 430
Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe Val
435 440 445
Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met Lys
450 455 460
Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala Phe
465 470 475 480
Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly Asp
485 490 495
Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr Asp
500 505 510
Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys Phe
515 520 525
Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys Asp
530 535 540
Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys Tyr
545 550 555 560
Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys Ile
565 570 575
Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys Leu
580 585 590
Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys Lys
595 600 605
Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr Lys
610 615 620
Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys His
625 630 635 640
Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys Trp
645 650 655
Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys Asp
660 665 670
Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val Ser
675 680 685
Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu Gly
690 695 700
Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys Ser
705 710 715 720
His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe Asp
725 730 735
Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu Phe
740 745 750
Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro Ala
755 760 765
Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr Thr
770 775 780
Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp Gln
785 790 795 800
Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn Ile
805 810 815
Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp Asn
820 825 830
Pro Tyr Val Ile Gly Ile Ala Arg Gly Glu Arg Asn Leu Leu Tyr Ile
835 840 845
Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu Asn
850 855 860
Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr His
865 870 875 880
Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln Asn
885 890 895
Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile Ser
900 905 910
Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala Val
915 920 925
Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val Lys
930 935 940
Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp Lys
945 950 955 960
Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly Gly
965 970 975
Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys Ser
980 985 990
Met Ser Thr Gln Asn Gly Phe Ile Phe Tyr Ile Pro Ala Trp Leu Thr
995 1000 1005
Ser Lys Ile Asp Pro Ser Thr Gly Phe Val Asn Leu Leu Lys Thr
1010 1015 1020
Lys Tyr Thr Ser Ile Ala Asp Ser Lys Lys Phe Ile Ser Ser Phe
1025 1030 1035
Asp Arg Ile Met Tyr Val Pro Glu Glu Asp Leu Phe Glu Phe Ala
1040 1045 1050
Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp Ala Asp Tyr Ile Lys
1055 1060 1065
Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg Ile Arg Ile Phe Arg
1070 1075 1080
Asn Pro Lys Lys Asn Asn Val Phe Asp Trp Glu Glu Val Cys Leu
1085 1090 1095
Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile Asn Tyr
1100 1105 1110
Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp Lys
1115 1120 1125
Ala Phe Tyr Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln
1130 1135 1140
Met Arg Asn Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile
1145 1150 1155
Ser Pro Val Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn
1160 1165 1170
Tyr Glu Ala Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala
1175 1180 1185
Asn Gly Ala Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly
1190 1195 1200
Gln Phe Lys Lys Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile
1205 1210 1215
Ala Ile Ser Asn Lys Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val
1220 1225 1230
Lys His Lys Leu Met Pro Lys Lys Lys Arg Lys Val Ser Gly Gly
1235 1240 1245
Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met Arg His Ala
1250 1255 1260
Leu Thr Leu Ala Lys Arg Ala Trp Asp Glu Arg Glu Val Pro Val
1265 1270 1275
Gly Ala Val Leu Val His Asn Asn Arg Val Ile Gly Glu Gly Trp
1280 1285 1290
Asn Arg Pro Ile Gly Arg His Asp Pro Thr Ala His Ala Glu Ile
1295 1300 1305
Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn Tyr Arg Leu
1310 1315 1320
Ile Asp Ala Thr Leu Tyr Val Thr Leu Glu Pro Cys Val Met Cys
1325 1330 1335
Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val Val Phe Gly
1340 1345 1350
Ala Arg Asp Ala Lys Thr Gly Ala Ala Gly Ser Leu Met Asp Val
1355 1360 1365
Leu His His Pro Gly Met Asn His Arg Val Glu Ile Thr Glu Gly
1370 1375 1380
Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Ser Asp Phe Phe Arg
1385 1390 1395
Met Arg Arg Gln Glu Ile Lys Ala Gln Lys Lys Ala Gln Ser Ser
1400 1405 1410
Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Ser Glu Thr
1415 1420 1425
Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser Gly Gly Ser
1430 1435 1440
Ser Gly Gly Ser Ser Glu Val Glu Phe Ser His Glu Tyr Trp Met
1445 1450 1455
Arg His Ala Leu Thr Leu Ala Lys Arg Ala Arg Asp Glu Arg Glu
1460 1465 1470
Val Pro Val Gly Ala Val Leu Val Leu Asn Asn Arg Val Ile Gly
1475 1480 1485
Glu Gly Trp Asn Arg Ala Ile Gly Leu His Asp Pro Thr Ala His
1490 1495 1500
Ala Glu Ile Met Ala Leu Arg Gln Gly Gly Leu Val Met Gln Asn
1505 1510 1515
Tyr Arg Leu Ile Asp Ala Thr Leu Tyr Val Thr Phe Glu Pro Cys
1520 1525 1530
Val Met Cys Ala Gly Ala Met Ile His Ser Arg Ile Gly Arg Val
1535 1540 1545
Val Phe Gly Val Arg Asn Ala Lys Thr Gly Ala Ala Gly Ser Leu
1550 1555 1560
Met Asp Val Leu His Tyr Pro Gly Met Asn His Arg Val Glu Ile
1565 1570 1575
Thr Glu Gly Ile Leu Ala Asp Glu Cys Ala Ala Leu Leu Cys Tyr
1580 1585 1590
Phe Phe Arg Met Pro Arg Gln Val Phe Asn Ala Gln Lys Lys Ala
1595 1600 1605
Gln Ser Ser Thr Asp Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly
1610 1615 1620
Ser Glu Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Ser Ser
1625 1630 1635
Gly Gly Ser Ser Gly Gly Ser Leu Lys Lys Arg Pro Ala Ala Thr
1640 1645 1650
Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys
1655 1660
<210> 30
<211> 1526
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial Sequence
<400> 30
ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60
agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120
tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa 180
tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga 240
gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt 300
ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg 360
gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt 420
agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata 480
taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa 540
aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga 600
tcgacgagtc taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag 660
cagacggcac ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg 720
ttggacttgc tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg 780
gcacggcagg cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc 840
caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc 900
ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa 960
tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc 1020
taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc 1080
atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg 1140
cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc 1200
ctgggatggc tctagccgtt ccgcagacgg gatcgatcta ggataggtat acatgttgat 1260
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1320
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1380
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1440
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1500
tctgcaggtc gaagcttgaa gcaaac 1526
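
The listing rows above follow a fixed layout: protein rows carry three-letter residue codes with a numeric ruler row beneath them, and nucleotide rows carry groups of ten bases followed by a running base count. As a minimal sketch, assuming that layout holds for every row, the Python helpers below flatten such rows into plain strings. The function names `protein_from_listing` and `dna_from_listing` are illustrative and are not part of the patent.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def protein_from_listing(lines):
    """Join three-letter residue rows, skipping the numeric ruler rows."""
    residues = []
    for line in lines:
        tokens = line.split()
        if not tokens or all(t.isdigit() for t in tokens):
            continue  # blank row or ruler row such as "945 950 955 960"
        residues.extend(THREE_TO_ONE[t] for t in tokens)
    return "".join(residues)


def dna_from_listing(lines):
    """Join nucleotide rows and check each trailing running count."""
    seq = ""
    for line in lines:
        match = re.fullmatch(r"\s*([acgtn ]+?)\s+(\d+)\s*", line)
        if match is None:
            raise ValueError(f"unrecognized row: {line!r}")
        seq += match.group(1).replace(" ", "")
        if len(seq) != int(match.group(2)):
            raise ValueError(f"running count mismatch at {match.group(2)}")
    return seq


# First two rows of SEQ ID NO: 30, copied from the listing above.
rows = [
    "ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta 60",
    "agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta 120",
]
print(len(dna_from_listing(rows)))  # 120
```

Feeding all rows of SEQ ID NO: 30 to `dna_from_listing` and comparing the result length with the `<211>` value (1526) would be a quick consistency check on the transcription.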

Claims (20)

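1. A system for base editing of a target sequence in the genome of an organism, comprising at least one of the following i) to v):
i) a base editing fusion protein, and a guide RNA;
ii) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and a guide RNA;
iii) a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
iv) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein, and an expression construct comprising a nucleotide sequence encoding a guide RNA;
v) an expression construct comprising a nucleotide sequence encoding a base editing fusion protein and a nucleotide sequence encoding a guide RNA;
wherein the base editing fusion protein comprises a Cpf1 lacking DNA cleavage activity and a deaminase, and the guide RNA is capable of targeting the base editing fusion protein to the target sequence in the genome, resulting in one or more C-to-T or A-to-G substitutions in the target sequence.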
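2. The system of claim 1, wherein the Cpf1 lacking DNA cleavage activity is an FnCpf1 lacking DNA cleavage activity, for example an FnCpf1 that comprises a D917A mutation relative to wild-type FnCpf1.
3. The system of claim 1, wherein the Cpf1 lacking DNA cleavage activity is an AsCpf1 lacking DNA cleavage activity, for example an AsCpf1 that comprises a D908A mutation relative to wild-type AsCpf1.
4. The system of claim 1, wherein the Cpf1 lacking DNA cleavage activity is an LbCpf1 lacking DNA cleavage activity, for example an LbCpf1 that comprises a D832A mutation relative to wild-type LbCpf1.
5. The system of claim 1, wherein the deaminase is a cytidine deaminase, for example an apolipoprotein B mRNA editing complex (APOBEC) family deaminase.
6. The system of claim 5, wherein the cytidine deaminase is APOBEC1 deaminase or activation-induced cytidine deaminase (AID).
7. The system of claim 5, wherein the base editing fusion protein further comprises a uracil DNA glycosylase inhibitor (UGI).
8. The system of claim 1, wherein the deaminase is a DNA-dependent adenine deaminase, preferably a single-stranded DNA-dependent adenine deaminase.
9. The system of claim 8, wherein the DNA-dependent adenine deaminase is a variant of the E. coli tRNA adenine deaminase TadA (ecTadA), in particular a variant that accepts single-stranded DNA as a substrate.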
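10. The system of claim 9, wherein the DNA-dependent adenine deaminase comprises, relative to wild-type ecTadA, one or more groups of mutations selected from the following:
1) A106V and D108N;
2) D147Y and E155V;
3) L84F, H123Y and I156F;
4) A142N;
5) H36L, R51L, S146C and K157N;
6) P48S/T/A;
7) A142N;
8) W23L/R; and
9) R152H/P.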
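11. The system of claim 10, wherein the DNA-dependent adenine deaminase comprises the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, D147Y, E155V and R152P.
12. The system of claim 9, wherein the N-terminus of the DNA-dependent adenine deaminase is fused to the corresponding wild-type adenine deaminase, preferably through a linker.
13. The system of claim 1, wherein the deaminase is fused to the N-terminus of the Cpf1 lacking DNA cleavage activity, or wherein the deaminase is fused to the C-terminus of the Cpf1 lacking DNA cleavage activity.
14. The system of claim 1, wherein the deaminase and the Cpf1 lacking DNA cleavage activity are fused through a linker.
15. The system of claim 1, wherein the base editing fusion protein further comprises a nuclear localization sequence (NLS) at its N-terminus and/or C-terminus.
16. The system of claim 1, wherein the nucleotide sequence encoding the base editing fusion protein is codon-optimized for the organism to be base edited.
17. The system of claim 1, wherein the nucleotide sequence encoding the base editing fusion protein and/or the nucleotide sequence encoding the guide RNA is operably linked to an expression regulatory element.
18. The system of claim 17, wherein the regulatory element is a promoter, for example the 35S promoter, the maize Ubi-1 promoter, the wheat U6 promoter, the rice U3 promoter or the maize U3 promoter.
19. A method of producing a genetically modified organism, comprising introducing the system of any one of claims 1-18 into a cell of the organism, whereby the guide RNA targets the base editing fusion protein to a target sequence in the genome of the cell, resulting in one or more C-to-T or A-to-G substitutions in the target sequence.
20. The method of claim 19, wherein the organism is selected from mammals such as human, mouse, rat, monkey, dog, pig, sheep, cattle and cat; poultry such as chicken, duck and goose; and plants, including monocotyledons and dicotyledons, for example rice, maize, wheat, sorghum, barley, soybean, peanut and Arabidopsis thaliana.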
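
The substitution notation used in claims 2-4, 10 and 11 (wild-type residue, position, replacement residue, e.g. W23R) can be applied mechanically to a sequence. The sketch below is illustrative only: the helper name `apply_substitutions` is hypothetical, and the two fragments are one-letter transcriptions of the two TadA copies that appear in the fusion-protein listing above (listing residues 1247-1412 and 1445-1610). It assumes that the first copy is wild-type ecTadA, that the second is the evolved variant, and that ecTadA numbering starts at an initiator Met that the listing omits, so each fragment begins at position 2. Under those assumptions, applying the fourteen substitutions of claim 11 to the first copy reproduces the second.

```python
import re

# Assumption: the listing copies start at Ser, taken here as ecTadA position 2.
FIRST_POSITION = 2

# First TadA copy in the listing (assumed wild type), 15 residues per row.
WT_TADA = (
    "SEVEFSHEYWMRHA"
    "LTLAKRAWDEREVPV" "GAVLVHNNRVIGEGW" "NRPIGRHDPTAHAEI"
    "MALRQGGLVMQNYRL" "IDATLYVTLEPCVMC" "AGAMIHSRIGRVVFG"
    "ARDAKTGAAGSLMDV" "LHHPGMNHRVEITEG" "ILADECAALLSDFFR"
    "MRRQEIKAQKKAQSS" "TD"
)

# Second TadA copy in the listing (assumed evolved variant).
EVOLVED_TADA = (
    "SEVEFSHEYWMRHA"
    "LTLAKRARDEREVPV" "GAVLVLNNRVIGEGW" "NRAIGLHDPTAHAEI"
    "MALRQGGLVMQNYRL" "IDATLYVTFEPCVMC" "AGAMIHSRIGRVVFG"
    "VRNAKTGAAGSLMDV" "LHYPGMNHRVEITEG" "ILADECAALLCYFFR"
    "MPRQVFNAQKKAQSS" "TD"
)

# Substitutions recited in claim 11.
CLAIM_11 = (
    "W23R H36L R51L S146C K157N A106V D108N "
    "P48A L84F H123Y I156F D147Y E155V R152P"
).split()


def apply_substitutions(seq, substitutions, first_position=1):
    """Apply substitutions written as <wild type><position><replacement>."""
    residues = list(seq)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        index = pos - first_position
        if not 0 <= index < len(residues):
            raise ValueError(f"{sub}: position {pos} is outside the sequence")
        if residues[index] != old:
            raise ValueError(f"{sub}: found {residues[index]} at position {pos}")
        residues[index] = new
    return "".join(residues)


mutated = apply_substitutions(WT_TADA, CLAIM_11, FIRST_POSITION)
print(mutated == EVOLVED_TADA)  # True
```

The wild-type check inside the loop is the useful part of the design: a numbering offset or a transcription slip raises an error instead of silently mutating the wrong residue. The same helper could be applied to D917A, D908A or D832A given the corresponding wild-type Cpf1 sequences, which are not reproduced in this section.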
CN201811578853.8A 2017-12-22 2018-12-21 Cpf1 protein-based base editing system and method Active CN109957569B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711403490X 2017-12-22
CN201711403490 2017-12-22

Publications (2)

Publication Number Publication Date
CN109957569A (zh) 2019-07-02
CN109957569B CN109957569B (zh) 2022-10-25

Family

ID=66992485

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811578853.8A Active CN109957569B (zh) 2017-12-22 2018-12-21 Cpf1 protein-based base editing system and method

Country Status (3)

Country Link
CN (1) CN109957569B (zh)
AR (1) AR114014A1 (zh)
WO (1) WO2019120310A1 (zh)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL312708A (en) * 2018-12-27 2024-07-01 Lifeedit Therapeutics Inc Polypeptides useful for gene editing and methods of use
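CN110551752B (zh) * 2019-08-30 2023-03-14 Beijing Academy of Agriculture and Forestry Sciences xCas9n-epBE base editing system and its use in genomic base substitution
BR112022004586A2 (pt) 2019-09-12 2022-06-14 Inst Genetics & Developmental Biology Cas Herbicide-resistant plants
AU2021313163A1 (en) 2020-07-21 2023-02-16 Pairwise Plants Services, Inc. Optimized protein linkers and methods of use
CN114317596B (zh) * 2020-09-30 2024-01-16 Beijing Academy of Agriculture and Forestry Sciences Method for mutating A to G in a plant genome target sequence
CN114317589B (zh) * 2020-09-30 2024-01-16 Beijing Academy of Agriculture and Forestry Sciences Use of the SpRYn-ABE base editing system in plant genome base substitution
CN114317518B (zh) * 2020-09-30 2024-01-12 Beijing Academy of Agriculture and Forestry Sciences Use of the SpRYn-CBE base editing system in plant genome base substitution
CN112430622A (zh) * 2020-10-26 2021-03-02 Yangzhou University FokI and dCpf1 fusion protein expression vector and site-directed gene editing method mediated thereby
WO2023187027A1 (en) * 2022-03-30 2023-10-05 BASF Agricultural Solutions Seed US LLC Optimized base editors
CN114686456B (zh) * 2022-05-10 2023-02-17 Sun Yat-sen University Base editing system based on bimolecular deaminase complementation and its use
CN115820691B (zh) * 2022-07-25 2023-08-22 Anhui Agricultural University Rice base editing system based on an LbCpf1 variant and its use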

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017184786A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Cpf1 complexes with reduced indel activity
WO2017184768A1 (en) * 2016-04-19 2017-10-26 The Broad Institute Inc. Novel crispr enzymes and systems
WO2017189308A1 (en) * 2016-04-19 2017-11-02 The Broad Institute Inc. Novel crispr enzymes and systems
CN110214183A (zh) * 2016-08-03 2019-09-06 President and Fellows of Harvard College Adenosine nucleobase editors and uses thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3374494A4 (en) * 2015-11-11 2019-05-01 Coda Biotherapeutics, Inc. CRISPR COMPOSITIONS AND METHODS OF USE FOR GENE THERAPY
US20190264186A1 (en) * 2016-01-22 2019-08-29 The Broad Institute Inc. Crystal structure of crispr cpf1

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Nicole M. Gaudelli et al.: "Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage", Nature *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12016908B2 (en) 2019-02-13 2024-06-25 Beam Therapeutics Inc. Compositions and methods for treating hemoglobinopathies
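CN116497067A (zh) * 2019-02-13 2023-07-28 Beam Therapeutics Inc. Compositions and methods for treating hemoglobinopathies
CN114375335A (zh) * 2019-07-19 2022-04-19 Pairwise Plants Services, Inc. Optimized protein linkers and methods of use
WO2021032155A1 (zh) * 2019-08-20 2021-02-25 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences Base editing system and method of use thereof
CN114945670A (zh) * 2019-08-20 2022-08-26 Shanghai Blue Cross Medical Science Research Institute Base editing system and method of use thereof
CN114667149A (zh) * 2019-09-09 2022-06-24 Beam Therapeutics Inc. Novel nucleobase editors and methods of use thereof
CN114761547A (zh) * 2019-09-26 2022-07-15 Syngenta Crop Protection AG Methods and compositions for DNA base editing
CN114829595A (zh) * 2019-10-17 2022-07-29 Pairwise Plants Services, Inc. Variants of Cas12a nucleases and methods of making and using the same
CN114867852A (zh) * 2019-10-30 2022-08-05 Pairwise Plants Services, Inc. Type V CRISPR-Cas base editors and methods of use thereof
CN111019967A (zh) * 2019-11-27 2020-04-17 Nanjing Agricultural University Use of the GmU3-19g-1 and GmU6-16g-1 promoters in a soybean multiplex gene editing system
CN110964741B (zh) * 2019-12-20 2022-03-01 Beijing Academy of Agriculture and Forestry Sciences Nuclear localization signal FNB and its use in improving base editing efficiency
CN110964741A (zh) * 2019-12-20 2020-04-07 Beijing Academy of Agriculture and Forestry Sciences Nuclear localization signal FNB and its use in improving base editing efficiency
CN111518794A (zh) * 2020-04-13 2020-08-11 Sun Yat-sen University Preparation and use of mutation-inducing proteins based on activation-induced cytidine deaminase
WO2021208501A1 (zh) * 2020-04-13 2021-10-21 Sun Yat-sen University Preparation and use of mutation-inducing proteins based on activation-induced cytidine deaminase
CN111518794B (zh) * 2020-04-13 2023-05-16 Sun Yat-sen University Preparation and use of mutation-inducing proteins based on activation-induced cytidine deaminase
CN112851776A (zh) * 2020-04-20 2021-05-28 Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences Site-directed gene mutation method and its use in stress-resistance breeding
CN113005141A (zh) * 2021-01-05 2021-06-22 Wenzhou Medical University Gene editing tool composed of a high-activity mutant, preparation method thereof, and method for repairing the pathogenic gene of congenital retinoschisis
WO2022223010A1 (zh) * 2021-04-21 2022-10-27 Zhejiang University Negative-strand RNA virus vector and transformation-free plant genome editing method
CN115704015A (zh) * 2021-08-12 2023-02-17 Tsinghua University Targeted mutagenesis system based on an adenine and cytosine dual base editor
CN114045302A (zh) * 2021-11-12 2022-02-15 National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences Single-base editing vector and its construction and use
CN114835818A (zh) * 2022-03-17 2022-08-02 Jiangnan University Gene editing fusion protein, adenine base editor constructed therefrom, and use thereof
CN114835818B (zh) * 2022-03-17 2024-03-22 Jiangnan University Gene editing fusion protein, adenine base editor constructed therefrom, and use thereof
WO2023207607A1 (zh) * 2022-04-29 2023-11-02 Peking University Deaminase mutants, compositions and methods for modifying mitochondrial DNA
CN116376948B (zh) * 2022-07-25 2023-12-15 Guangzhou Medical University Plasmid vector and method for preparing MS2 phage-like particles displaying a foreign protein
CN116376948A (zh) * 2022-07-25 2023-07-04 Guangzhou Medical University Plasmid vector and method for preparing MS2 phage-like particles displaying a foreign protein
CN116286734B (zh) * 2022-11-29 2024-04-02 Wuhan University Mutants of wild-type LbCas12a protein and their use in SNP detection
CN116286734A (zh) * 2022-11-29 2023-06-23 Wuhan University Mutants of wild-type LbCas12a protein and their use in SNP detection
CN116751799A (zh) * 2023-06-14 2023-09-15 Jiangnan University Multi-site dual base editor and use thereof
CN116751799B (zh) * 2023-06-14 2024-01-26 Jiangnan University Multi-site dual base editor and use thereof
CN116836962A (zh) * 2023-06-28 2023-10-03 Weiguang Gene (Suzhou) Co., Ltd. Engineered adenosine deaminase and base editor
CN116836962B (zh) * 2023-06-28 2024-04-05 Weiguang Gene (Suzhou) Co., Ltd. Engineered adenosine deaminase and base editor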

Also Published As

Publication number Publication date
WO2019120310A1 (en) 2019-06-27
CN109957569B (zh) 2022-10-25
AR114014A1 (es) 2020-07-08

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20220623

Address after: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120

Applicant after: Shanghai Blue Cross Medical Science Research Institute

Address before: No. 2, No. 1, Beichen West Road, Beichen, Beijing

Applicant before: INSTITUTE OF GENETICS AND DEVELOPMENTAL BIOLOGY, CHINESE ACADEMY OF SCIENCES

TA01 Transfer of patent application right

Effective date of registration: 20220921

Address after: Unit E598, 5th Floor, Lecheng Plaza, Phase II, Biomedical Industrial Park, No. 218, Sangtian Street, Suzhou Industrial Park, Suzhou Area, China (Jiangsu) Pilot Free Trade Zone, Suzhou City, Jiangsu Province, 215127

Applicant after: Suzhou Qihe Biotechnology Co.,Ltd.

Address before: Room D340, F3, building 2, No. 2250, Pudong South Road, Pudong New Area, Shanghai 200120

Applicant before: Shanghai Blue Cross Medical Science Research Institute

GR01 Patent grant