CN110734900A - 一种胞嘧啶碱基编辑工具及其用途 - Google Patents

一种胞嘧啶碱基编辑工具及其用途 Download PDF

Info

Publication number
CN110734900A
CN110734900A CN201911075141.9A CN201911075141A CN110734900A CN 110734900 A CN110734900 A CN 110734900A CN 201911075141 A CN201911075141 A CN 201911075141A CN 110734900 A CN110734900 A CN 110734900A
Authority
CN
China
Prior art keywords
fragment
apobec3g
nucleotide sequence
fusion protein
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911075141.9A
Other languages
English (en)
Other versions
CN110734900B (zh
Inventor
李佳楠
于文霞
黄行许
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Shanghai for Science and Technology
Original Assignee
University of Shanghai for Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Shanghai for Science and Technology filed Critical University of Shanghai for Science and Technology
Priority to CN201911075141.9A priority Critical patent/CN110734900B/zh
Publication of CN110734900A publication Critical patent/CN110734900A/zh
Application granted granted Critical
Publication of CN110734900B publication Critical patent/CN110734900B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/113Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/78Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y305/00Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5)
    • C12Y305/04Hydrolases acting on carbon-nitrogen bonds, other than peptide bonds (3.5) in cyclic amidines (3.5.4)
    • C12Y305/04001Cytosine deaminase (3.5.4.1)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/09Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/22Vectors comprising a coding region that has been codon optimised for expression in a respective host

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Wood Science & Technology (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明涉及生物技术领域,特别是涉及一种胞嘧啶碱基编辑工具及其用途。本发明提供一种融合蛋白,包括APOBEC3G片段和SpCas9‑D10A nickase片段。本发明所提供的APOBEC3G系列胞嘧啶碱基编辑工具在APOBEC3G片段中相对于野生型存在R24A、W94L、Y124A、W127L、P200K氨基酸突变,其中R24A、W94L、Y124A、W127L四个氨基酸突变可以限制APOBEC3G与RNA的结合,D128K,P199A,P200K,Q322K四个氨基酸突变可以提高APOBEC3G与DNA的结合,在将sgRNA 5’端4‑7位的C突变为T的碱基编辑中,可以提高编辑效率,同时极大地降低甚至消除胞嘧啶碱基编辑工具的RNA脱靶效应。

Description

一种胞嘧啶碱基编辑工具及其用途
技术领域
本发明涉及生物技术领域,特别是涉及一种胞嘧啶碱基编辑工具及其用途。
背景技术
CRISPR/Cas9是目前最有效、最便捷的基因组编辑技术。Cas9核酸酶在引导RNA(guide RNA,sgRNA)引导下,可以到达基因组特定靶点,对其进行切割,从而产生DNA双链断裂(double strandbreaks,DSB),然后通过内源的DNA修复机制来实现编辑。DNA修复机制包括非同源末端连接(Non-Homologous End Join,NHEJ)和同源重组修复(HomologousDirectly Repair,HDR)。其中,NHEJ修复的结果是随机引入***、缺失,导致基因的失活,这在基因组修复中占据主要地位。而HDR可利用模版进行准确修复,从而完成基因突变的校正。
但实际上,HDR介导的准确修复的几率很低,通常小于5%,因此极大限制了CRISPR/Cas9从科研向应用的转化方面的应用。尤其是基因精准治疗方面,一直是基因编辑领域的一大难题。
最近新开发的碱基编辑器(Base Editor,BE)成功解决了上述问题,大大提高了基因突变的校正效率。现有的碱基编辑器有胞嘧啶碱基编辑工具(Cytosine Base Editor,CBE)和腺嘌呤碱基编辑工具(Adenine Base Editor,ABE)两种。
CBE和ABE是将RuvC结构域失活的Cas9D10Anickase(nCas9)和胞嘧啶脱氨酶/腺嘌呤脱氨酶整合在一起,在sgRNA的引导下到达靶向位点并与sgRNA互补的DNA链进行结合,胞嘧啶脱氨酶对周边一定范围的胞嘧啶C进行脱氨成为尿嘧啶U,U可以与胞嘧啶A互补配对,经过DNA的复制,U最终会被A的互补配对碱基T所取代;类似的,腺嘌呤脱氨酶对周边一定范围的腺嘌呤A进行脱氨成为次黄嘌呤I,I可以与胞嘧啶C互补配对,经过DNA的复制,I最终会被C的互补配对碱基G所取代。从而达到C-to-T或A-to-G的目的。目前最先发明的BE3和ABE,都具有非常广泛的应用。其中,BE3对应的编辑窗口是sgRNA 5’端的4-8位,ABE对应的编辑窗口是sgRNA 5’端的4-7位。
BE3中的脱氨酶rAPOBEC1是大鼠胞嘧啶脱氨酶,其中,内源状态的rAPOBEC1除了上述表示可以编辑单链DNA,也可以编辑RNA,将C变为U。近期研究发现BE3碱基编辑器作用过程中会产生严重的RNA脱靶效应(off-Target)5,极大的限制了此碱基编辑器的应用。
发明内容
本发明的目的在于提供一种胞嘧啶碱基编辑工具及其用途。
为了达到上述目的,本发明提供了一种融合蛋白,其特征在于,所述融合蛋白自N端至C端依次包括APOBEC3G(A3G)片段和SpCas9-D10A nickase片段,所述APOBEC3G片段具有胞嘧啶脱氨酶活性;所述APOBEC3G片段存在R24A、W94L、Y124A、W127L、D128K、P199A、P199W、P200A、P200K、Q322K中的至少一个氨基酸突变或者APOBEC3G片段为自APOBEC3G第一位起始密码子至第190位或197位删除的截短APOBEC3G片段。
优选地,所述APOBEC3G片段源自人(Homo sapiens)。
优选地,所述APOBEC3G片段的核苷酸序列包括:
a)如SEQ ID NO.27-36所示的核苷酸序列;或,
b)与SEQ ID NO.27-36具有80%以上序列相似性的核苷酸序列、且具有a)所限定的核苷酸序列的功能。
更优选地,所述b)中的核苷酸序列可与a)中SEQ ID NO.27-36具有80%、85%、90%、93%、95%、97%、或99%以上的相似性。
更优选地,所述b)中的核苷酸序列具体包括:如SEQ ID NO.27-36所示的核苷酸序列经过取代、缺失或者添加一个或多个(具体可以是1-50、1-30个、1-20个、1-10个、1-5个、1-3个、1个、2个、或3个)氨基酸密码子而得到的,或者在N-末端和/或C-末端添加一个或多个(具体可以是1-50个、1-30个、1-20个、1-10个、1-5个、1-3个、1个、2个、或3个)氨基酸密码子而得到的。
优选地,所述SpCas9-D10A nickase片段的核苷酸序列包括:
c)如SEQ ID NO.37-38所示的核苷酸序列;或,
d)与SEQ ID NO.37-38具有80%以上序列相似性的核苷酸序列、且具有d)所限定的核苷酸序列的功能。
更优选地,所述d)中的核苷酸序列可与SEQ ID NO.37-38具有80%、85%、90%、93%、95%、97%、或99%以上的相似性。
更优选地,所述d)中的核苷酸序列具体包括如SEQ ID NO.37-38所示的核苷酸序列经过取代、缺失或者添加一个或多个(具体可以是1-50、1-30个、1-20个、1-10个、1-5个、1-3个、1个、2个、或3个)氨基酸密码子而得到的,或者在N-末端和/或C-末端添加一个或多个(具体可以是1-50个、1-30个、1-20个、1-10个、1-5个、1-3个、1个、2个、或3个)氨基酸密码子而得到的。
优选地,所述融合蛋白还包括核定位信号片段,所述核定位信号片段位于APOBEC3G片段的N端或SpCas9-D10A nickase片段的C端。
更优选地,所述核定位信号片段的核苷酸序列如SEQ ID NO.39所示。
优选地,所述融合蛋白还包括柔性连接肽片段,所述柔性连接肽段位于APOBEC3G片段的N端、APOBEC3G片段和SpCas9-D10A nickase之间、或SpCas9-D10A nickase的C端。
更优选地,所述柔性连接肽片段的核苷酸序列如SEQ ID NO.40-41所示。
本发明还提供了一种编码上述融合蛋白的分离的多核苷酸。
本发明还提供了一种构建体,其特征在于,所述构建体通过上述的分离的多核苷酸***表达载体中构建获得;所述构建体的多核苷酸序列如SEQ ID NO.1~14所示。
优选地,所述的表达载体包括但不限于pCMV表达载体、pSV2表达载体、pGL3表达载体等。
本发明还提供了一种表达***,其特征在于,所述表达***为宿主细胞,所述宿主细胞含有上述构建体或基因组中整合有上述分离的多核苷酸。所述宿主细胞可以表达如上所述的融合蛋白,所述融合蛋白可以与sgRNA相配合,从而可以将所述融合蛋白定位到目标区域,实现目标区域的碱基编辑。
优选地,所述宿主细胞选自真核细胞或原核细胞。
更优选地,所述宿主细胞选自小鼠细胞或人细胞。
进一步地,所述宿主细胞选自小鼠脑神经瘤细胞、人胚胎肾细胞、人***细胞、人结肠癌细胞、人骨肉瘤细胞。
更进一步地,所述表达***的宿主细胞选自N2a细胞、HEK293FT细胞、Hela细胞、HCT116细胞、或U2OS细胞。
一种碱基编辑工具,其特征在于,包括上述融合蛋白和sgRNA。
本发明还通过了上述碱基编辑工具在真核生物的基因编辑中的用途。
优选地,所述基因编辑为靶点区域内sgRNA 5’端4-7位的C-to-T的碱基编辑。
APOBEC3G是人的APOBEC家族中的成员,可以结合单链DNA或RNA,产生脱氨作用,将C突变为U,在抗病毒过程中发挥重要作用。APOBEC3G的脱氨作用倾向于在CC序列发生。APOBEC3G有两个功能域,早前研究认为,氨基端的主要作用是结合RNA,羧基端的主要作用是结合DNA,以及脱氨作用,这也是所有双结构域APOBEC的共同特点。其中,值得注意的是,早前研究认为APOBEC3G不具有RNA编辑功能。近期研究指出,APOBEC3G的DNA和RNA结合域存在竞争作用,并且与之前研究结果不同的是,将APOBEC3G过表达,发现其具有RNA脱氨酶活性。
与现有技术相比,本发明的有益效果在:
(1)本发明提供了新一代胞嘧啶碱基编辑工具,将APOBEC3G与spCas9-D10Anickase片段相连,然后将APOBEC3G中负责RNA结合的功能域突变,RNA脱氨功能破坏,提高DNA脱氨活性。随后,将几种APOBEC家族的脱氨酶的脱氨功能域进行比对,针对APOBEC3G脱氨酶进行突变,提高其活性。新一代的胞嘧啶编辑工具相比于BE3有更窄的编辑窗口,相似的编辑效率,极低甚至完全消除RNA脱靶活性。
(2)本发明所提供的碱基编辑体系地拓宽了基因组可靶向的范围,可以将NGG序列作为PAM,实现sgRNA靶点区域内5’端4-7位的C-to-T的碱基,且突变具有很高的精准性,,可以极大降低甚至消除RNA脱靶效应。
(3)本发明中的APOBEC3G片段相对于野生型存在R24A、W94L、Y124A、W127L等氨基酸突变,可以限制APOBEC3G与RNA的结合,在将sgRNA 5’端4-7位的C突变为T的碱基编辑中,可以极大地降低甚至消除胞嘧啶碱基编辑工具的RNA脱靶效应;另外,本发明所提供的胞嘧啶碱基编辑工具在DNA上的编辑效率比经典的胞嘧啶碱基编辑工具(BE3)高,在保证了编辑效率的前提下,大大提升了突变的精准性,具有良好的产业化前景。
附图说明
图1显示为本发明实施例中所使用的APOBEC3G-BE3、APOBEC3G-BE4系列质粒结构示意图;
图2显示为本发明A3G-BE3,191-BE3,198-BE3,BE3在HEK293T细胞中对内源基因位点的编辑能力统计图;其中,a为A3G-BE3,191-BE3,198-BE3,BE3在HEK293Site3位点的C-to-T编辑效率统计图;b为A3G-BE3,191-BE3,198-BE3,BE3的RNA脱靶效率统计图;C4,C5代表靶位点处C的位置,计数从PAM远端碱基算起;191-BE3,198-BE3分别为自APOBEC3G第一位至第190位、197位删除的的截短APOBEC3G-BE3;
图3显示为本发明4M-BE3,BE3在HEK293T细胞中对内源基因位点的编辑能力统计图;其中,a为4M-BE3,A3G-BE3,BE3在HEK293Site3,HEK293site2,EMX1三个位点的C-to-T编辑效率统计图;b为4M-BE3,A3G-BE3,BE3的RNA脱靶效率统计图;C3,C4,C5,C6,C8代表靶位点处C的位置,计数从PAM远端碱基算起;
图4显示为本发明4M-BE3基础上一系列突变体在HEK293T细胞中对内源基因位点的编辑能力统计图;其中,a为4M-BE3基础上进行的突变位点结构示意图;b为7个突变质粒以及4M-BE3,BE3在HEK293Site2,HEK293Site3,EMX1三个位点的C-to-T编辑效率统计图;c为在b基础上的两个组合突变质粒以及4M-BE3,BE3在HEK293Site2,HEK293Site3,EMX1三个位点的C-to-T编辑效率统计图;d为b,c中编辑效率最高的两个质粒4M-BE3,4M+P199A+P200K-BE3,4M+D128K+P199A+P200K-BE3,BE3的RNA脱靶效率统计图;C3,C4,C5,C6,C8代表靶位点处C的位置,计数从PAM远端碱基算起;
图5显示为本发明中优化质粒4M+P199A+P200K-BE4,4M+D128K+P199A+P200K-BE4在HEK293T细胞中对内源基因位点的编辑能力统计图;其中,
a为4M+P199A+P200K-BE4,4M+D128K+P199A+P200K-BE4的结构示意图;
b为4M+P199A+P200K-BE4,4M+D128K+P199A+P200K-BE4在HEK293Site3位点的C-to-T编辑效率统计图;
c为4M+P199A+P200K-BE4,4M+D128K+P199A+P200K-BE4BE3的RNA脱靶效率统计图;C4,C5代表靶位点处C的位置,计数从PAM远端碱基算起。
具体实施方式
在进一步描述本发明具体实施方式之前,应理解,本发明的保护范围不局限于下述特定的具体实施方案;还应当理解,本发明实施例中使用的术语是为了描述特定的具体实施方案,而不是为了限制本发明的保护范围;在本发明说明书和权利要求书中,除非文中另外明确指出,单数形式“一个”、“一”和“这个”包括复数形式。
当实施例给出数值范围时,应理解,除非本发明另有说明,每个数值范围的两个端点以及两个端点之间任何一个数值均可选用。除非另外定义,本发明中使用的所有技术和科学术语与本技术领域技术人员通常理解的意义相同。除实施例中使用的具体方法、设备、材料外,根据本技术领域的技术人员对现有技术的掌握及本发明的记载,还可以使用与本发明实施例中所述的方法、设备、材料相似或等同的现有技术的任何方法、设备和材料来实现本发明。
除非另外说明,本发明中所公开的实验方法、检测方法、制备方法均采用本技术领域常规的分子生物学、生物化学、染色质结构和分析、分析化学、细胞培养、重组DNA技术及相关领域的常规技术。这些技术在现有文献中已有完善说明,具体可参见Sambrook等MOLECULAR CLONINGG:A LABORATORY MANUAL,Second edition,Cold SpriNGG HarborLaboratory Press,1989and Third edition,2001;Ausubel等,CURRENT PROTOCOLS INMOLECULAR BIOLOGY,John Wiley&Sons,New York,1987andperiodic updates;the seriesMETHODS IN ENZYMOLOGY,Academic Press,San Diego;Wolffe,CHROMATIN STRUCTURE ANDFUNCTION,Third edition,Academic Press,San Diego,1998;METHODS IN ENZYMOLOGY,Vol.304,Chromatin(P.M.Wassarman and A.P.Wolffe,eds.),Academic Press,SanDiego,1999;和METHODS IN MOLECULAR BIOLOGY,Vol.119,Chromatin Protocols(P.B.Becker,ed.)Humana Press,Totowa,1999等。
实施例1
本实施例中,分别将APOBEC3G部分中RNA结合位点DNA结合位点等进行点突变(包括R24A、W94L、Y124A、W127L、D128K、P199A、P199W、P200A、P200K、Q322K),或者将APOBEC3G片段为自APOBEC3G第一位起始密码子至第190位或197位删除,得到截短APOBEC3G片段;将APOBEC3G氨基端截断构建C-APOBEC3G-BE3,或将BE3中D10Anickase Cas9部分替换为表达效率更高的D10Anickase Cas9(BE4)。
相关质粒示意图如图1所示,其中4M为R24A、W94L、Y124A、W127四个点突变,A3G(OP)为优化密码子后的A3G。
实施例中所使用的突变质粒的构建方法如下:通过Mut Express II FastMutagenesis Kit V2(Vazyme,C214-02)将氨基酸突变引入A3G-BE3质粒或A3G-BE4质粒中APOBEC3G部分(图1)。
构建得到的质粒C-191-BE3,C-198-BE3,4M-BE3,4M+D128K-BE3,4M+P199A-BE3,4M+P199W-BE3,4M+P200A-BE3,4M+P200K-BE3,4M+Q322K-BE3,4M+D128K+P199A+P200A-BE3,A3G-BE4max,4M+P199A+P200K-BE4max,4M+D128K+P199A+P200K-BE4max,A3G(OP)+P199A+P200K-BE4max,序列如SEQ ID NO.1-14所示。
实施例2
本实施例中,利用APOBEC3G系列工具在HEK293T细胞上对内源基因位点进行编辑,检测编辑效率以及RNA脱靶效率。
2.1sgRNA质粒的构建
挑选3个人源内源基因位点,设计sgRNA,3条sgRNA在基因组的位置分别为NC_000015.10:107422339-107422361;NC_000013.11:87944780-87944802;NC_000014.9:72917055-72917077。
sgRNA的上下游序列通过程序(95℃,5min;95℃-85℃at-2℃/s;85℃-25℃at-0.1℃/s;hold at 4℃)退火,连接到经过BsaI(NEB:R0539L)线性化的pGL3-U6-sgRNA(Addgene#51133)载体上。所用到的多核苷酸序列如SEQ ID NO.15~20所示。线性化体系如下所示:pGL3-U6-sgRNA2μg;buffer(NEB:R0539L)6μL;BsaI 2μL;ddH2O补齐到60μL。37℃酶切过夜。连接体系如下:T4连接buffer(NEB:M0202L)1μL,线性化载体20NGG,退火的oligo片段(10μM)5μL,T4连接酶(NEB:M0202L)0.5μL,ddH2O补齐到10μL.16℃连接过夜。连接的载体通过转化,挑菌,鉴定。对阳性克隆摇菌提取质粒(Axygene:AP-MN-P-250G)并测定浓度。
2.2细胞的培养转染与收取
HEK293T细胞(购自ATCC)接种培养于添加10%FBS的DMEM高糖培养液中(HyClone,SH30022.01B),其中含1%Penicillin Streptomycin(v/v)(Gibco)。当细胞浓度为80%时,用10%血清的DMEM培养基换液,培养2小时使细胞状态恢复最佳。每孔转染的质粒的量分别是实施例1制备的APOBEC3G系列编辑工具质粒(如图1)4μg,sgRNA质粒2μg。分别将质粒混在250μl的Opti-MEM(Gibco,11058021)培养基。将6μl的Lipofectamine 2000转染试剂(Thermo,11668019)混入250μl的Opti-MEM培养基并混匀,静置5分钟。将混有质粒的Opti-MEM加入混有Lipofectamine 2000的Opti-MEM,慢速吹打混匀,静置20分钟。将混有质粒和Lipofectamine 2000的Opti-MEM分别加入6cm盘(盘中细胞浓度为80%转染)。转染6小时后用10%FBS的DMEM换液。转染48小时后分选出阳性率最高的5%的细胞,其中5000个细胞用于检测DNA编辑效率,其余50万细胞用于抽取RNA检测RNA脱靶效率。
2.3DNA编辑效率检测
DNA检测先是通过裂解得到基因组,裂解液的成分为50mM KCl,1.5mM MgCl2,10mMTris pH 8.0,0.5%Nonidet P-40,0.5%Tween 20,100g/ml protease K。对靶点附近序列进行PCR扩增,将扩增产物纯化后用SaNGGer测序进行鉴定。扩增体系如下:2Xbuffer(Vazyme,P505)25μL;dNTP 1μL;F(10pmol/μL)1μL;R(10pmol/μL)1μL;模板1μL;DNA聚合酶(Vazyme,P505)0.5μL;ddH2O补齐到50μL。扩增出来的PCR产物经过下述步骤纯化:加入三倍体积的PCR-A(Axygen:AP-PCR-250G)过柱,离心,12000转/分钟离心1分钟;加入700μLW2,离心1分钟;弃废液,加入700μLW2,离心1分钟;弃废液,空转1分钟;加入20μL水洗脱。所用到的PCR扩增引物如SEQ ID NO.21-26所示。将得到的PCR产物用PCR扩增单向引物进行Sanger测序,后比对测序结果,比较编辑效率。结果如图2A,3A,4所示。
2.4RNA脱靶效应效率检测
在RNA检测中,先用Trizol(Vazyme,R401-01)抽取总RNA,抽取步骤如下:每孔加入1ml Trizol,混匀后收集,加入200ul三氯甲烷,上下颠倒充分混匀后4℃温度下离心,12000转/分钟离心15分钟;吸取上清400ul,加入等体积异丙醇,上下颠倒混匀后4℃温度下离心,12000转/分钟离心10分钟;弃上清,加入1mL75%乙醇,上下颠倒混匀后,4℃温度下离心,12000转/分钟离心10分钟;弃上清,风干后加水溶解。取2ug进行RNA-seq,分析脱靶效果,得到具体脱靶(即突变数量),结果如图2B,3B,5所示。
从DNA编辑效率和RNA脱靶效率来看,APOBEC3G系列编辑工具具有编辑效率高,脱靶效率极低甚至消除的优点。
综上所述,本发明有效克服了现有技术中的种种缺点而具高度产业利用价值。
上述实施例仅例示性说明本发明的原理及其功效,而非用于限制本发明。任何熟悉此技术的人士皆可在不违背本发明的精神及范畴下,对上述实施例进行修饰或改变。因此,举凡所属技术领域中具有通常知识者在未脱离本发明所揭示的精神与技术思想下所完成的一切等效修饰或改变,仍应由本发明的权利要求所涵盖。
SEQUENCE LISTING
<110> 上海科技大学
<120> 一种胞嘧啶碱基编辑工具及其应用
<160> 41
<170> PatentIn version 3.5
<210> 1
<211> 8430
<212> DNA
<213> Artificial Sequence
<220>
<223> C-191-BE3
<400> 1
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggagattctc 420
agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc ttgggtcaga 480
ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga cacctgggtc 540
ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca cggtttcctt 600
gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa gctggacctg 660
gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag ctgtgcccag 720
gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt cactgcccgc 780
atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga ggctggggcc 840
aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt tgtggaccac 900
cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga cctgagtggg 960
aggctgcggg ccattctcca gaatcaggaa aacagcggca gcgagactcc cgggacctca 1020
gagtccgcca cacccgaaag tgataaaaag tattctattg gtttagccat cggcactaat 1080
tccgttggat gggctgtcat aaccgatgaa tacaaagtac cttcaaagaa atttaaggtg 1140
ttggggaaca cagaccgtca ttcgattaaa aagaatctta tcggtgccct cctattcgat 1200
agtggcgaaa cggcagaggc gactcgcctg aaacgaaccg ctcggagaag gtatacacgt 1260
cgcaagaacc gaatatgtta cttacaagaa atttttagca atgagatggc caaagttgac 1320
gattctttct ttcaccgttt ggaagagtcc ttccttgtcg aagaggacaa gaaacatgaa 1380
cggcacccca tctttggaaa catagtagat gaggtggcat atcatgaaaa gtacccaacg 1440
atttatcacc tcagaaaaaa gctagttgac tcaactgata aagcggacct gaggttaatc 1500
tacttggctc ttgcccatat gataaagttc cgtgggcact ttctcattga gggtgatcta 1560
aatccggaca actcggatgt cgacaaactg ttcatccagt tagtacaaac ctataatcag 1620
ttgtttgaag agaaccctat aaatgcaagt ggcgtggatg cgaaggctat tcttagcgcc 1680
cgcctctcta aatcccgacg gctagaaaac ctgatcgcac aattacccgg agagaagaaa 1740
aatgggttgt tcggtaacct tatagcgctc tcactaggcc tgacaccaaa ttttaagtcg 1800
aacttcgact tagctgaaga tgccaaattg cagcttagta aggacacgta cgatgacgat 1860
ctcgacaatc tactggcaca aattggagat cagtatgcgg acttattttt ggctgccaaa 1920
aaccttagcg atgcaatcct cctatctgac atactgagag ttaatactga gattaccaag 1980
gcgccgttat ccgcttcaat gatcaaaagg tacgatgaac atcaccaaga cttgacactt 2040
ctcaaggccc tagtccgtca gcaactgcct gagaaatata aggaaatatt ctttgatcag 2100
tcgaaaaacg ggtacgcagg ttatattgac ggcggagcga gtcaagagga attctacaag 2160
tttatcaaac ccatattaga gaagatggat gggacggaag agttgcttgt aaaactcaat 2220
cgcgaagatc tactgcgaaa gcagcggact ttcgacaacg gtagcattcc acatcaaatc 2280
cacttaggcg aattgcatgc tatacttaga aggcaggagg atttttatcc gttcctcaaa 2340
gacaatcgtg aaaagattga gaaaatccta acctttcgca taccttacta tgtgggaccc 2400
ctggcccgag ggaactctcg gttcgcatgg atgacaagaa agtccgaaga aacgattact 2460
ccatggaatt ttgaggaagt tgtcgataaa ggtgcgtcag ctcaatcgtt catcgagagg 2520
atgaccaact ttgacaagaa tttaccgaac gaaaaagtat tgcctaagca cagtttactt 2580
tacgagtatt tcacagtgta caatgaactc acgaaagtta agtatgtcac tgagggcatg 2640
cgtaaacccg cctttctaag cggagaacag aagaaagcaa tagtagatct gttattcaag 2700
accaaccgca aagtgacagt taagcaattg aaagaggact actttaagaa aattgaatgc 2760
ttcgattctg tcgagatctc cggggtagaa gatcgattta atgcgtcact tggtacgtat 2820
catgacctcc taaagataat taaagataag gacttcctgg ataacgaaga gaatgaagat 2880
atcttagaag atatagtgtt gactcttacc ctctttgaag atcgggaaat gattgaggaa 2940
agactaaaaa catacgctca cctgttcgac gataaggtta tgaaacagtt aaagaggcgt 3000
cgctatacgg gctggggacg attgtcgcgg aaacttatca acgggataag agacaagcaa 3060
agtggtaaaa ctattctcga ttttctaaag agcgacggct tcgccaatag gaactttatg 3120
cagctgatcc atgatgactc tttaaccttc aaagaggata tacaaaaggc acaggtttcc 3180
ggacaagggg actcattgca cgaacatatt gcgaatcttg ctggttcgcc agccatcaaa 3240
aagggcatac tccagacagt caaagtagtg gatgagctag ttaaggtcat gggacgtcac 3300
aaaccggaaa acattgtaat cgagatggca cgcgaaaatc aaacgactca gaaggggcaa 3360
aaaaacagtc gagagcggat gaagagaata gaagagggta ttaaagaact gggcagccag 3420
atcttaaagg agcatcctgt ggaaaatacc caattgcaga acgagaaact ttacctctat 3480
tacctacaaa atggaaggga catgtatgtt gatcaggaac tggacataaa ccgtttatct 3540
gattacgacg tcgatcacat tgtaccccaa tcctttttga aggacgattc aatcgacaat 3600
aaagtgctta cacgctcgga taagaaccga gggaaaagtg acaatgttcc aagcgaggaa 3660
gtcgtaaaga aaatgaagaa ctattggcgg cagctcctaa atgcgaaact gataacgcaa 3720
agaaagttcg ataacttaac taaagctgag aggggtggct tgtctgaact tgacaaggcc 3780
ggatttatta aacgtcagct cgtggaaacc cgccaaatca caaagcatgt tgcacagata 3840
ctagattccc gaatgaatac gaaatacgac gagaacgata agctgattcg ggaagtcaaa 3900
gtaatcactt taaagtcaaa attggtgtcg gacttcagaa aggattttca attctataaa 3960
gttagggaga taaataacta ccaccatgcg cacgacgctt atcttaatgc cgtcgtaggg 4020
accgcactca ttaagaaata cccgaagcta gaaagtgagt ttgtgtatgg tgattacaaa 4080
gtttatgacg tccgtaagat gatcgcgaaa agcgaacagg agataggcaa ggctacagcc 4140
aaatacttct tttattctaa cattatgaat ttctttaaga cggaaatcac tctggcaaac 4200
ggagagatac gcaaacgacc tttaattgaa accaatgggg agacaggtga aatcgtatgg 4260
gataagggcc gggacttcgc gacggtgaga aaagttttgt ccatgcccca agtcaacata 4320
gtaaagaaaa ctgaggtgca gaccggaggg ttttcaaagg aatcgattct tccaaaaagg 4380
aatagtgata agctcatcgc tcgtaaaaag gactgggacc cgaaaaagta cggtggcttc 4440
gatagcccta cagttgccta ttctgtccta gtagtggcaa aagttgagaa gggaaaatcc 4500
aagaaactga agtcagtcaa agaattattg gggataacga ttatggagcg ctcgtctttt 4560
gaaaagaacc ccatcgactt ccttgaggcg aaaggttaca aggaagtaaa aaaggatctc 4620
ataattaaac taccaaagta tagtctgttt gagttagaaa atggccgaaa acggatgttg 4680
gctagcgccg gagagcttca aaaggggaac gaactcgcac taccgtctaa atacgtgaat 4740
ttcctgtatt tagcgtccca ttacgagaag ttgaaaggtt cacctgaaga taacgaacag 4800
aagcaacttt ttgttgagca gcacaaacat tatctcgacg aaatcataga gcaaatttcg 4860
gaattcagta agagagtcat cctagctgat gccaatctgg acaaagtatt aagcgcatac 4920
aacaagcaca gggataaacc catacgtgag caggcggaaa atattatcca tttgtttact 4980
cttaccaacc tcggcgctcc agccgcattc aagtattttg acacaacgat agatcgcaaa 5040
cgatacactt ctaccaagga ggtgctagac gcgacactga ttcaccaatc catcacggga 5100
ttatatgaaa ctcggataga tttgtcacag cttgggggtg actctggtgg ttctactaat 5160
ctgtcagata ttattgaaaa ggagaccggt aagcaactgg ttatccagga atccatcctc 5220
atgctcccag aggaggtgga agaagtcatt gggaacaagc cggaaagcga tatactcgtg 5280
cacaccgcct acgacgagag caccgacgag aatgtcatgc ttctgactag cgacgcccct 5340
gaatacaagc cttgggctct ggtcatacag gatagcaacg gtgagaacaa gattaagatg 5400
ctctctggtg gttctcccaa gaagaagagg aaagtctaac cggtcatcat caccatcacc 5460
attgagttta aacccgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg 5520
tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct 5580
aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg 5640
gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg 5700
cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctcgata ccgtcgacct 5760
ctagctagag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 5820
tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctag ggtgcctaat 5880
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 5940
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 6000
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 6060
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 6120
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 6180
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 6240
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 6300
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 6360
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 6420
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 6480
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 6540
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 6600
ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 6660
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 6720
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 6780
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 6840
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 6900
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 6960
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 7020
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 7080
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 7140
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 7200
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg 7260
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 7320
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 7380
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 7440
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 7500
actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt 7560
caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac 7620
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 7680
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 7740
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 7800
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 7860
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 7920
cccgaaaagt gccacctgac gtcgacggat cgggagatcg atctcccgat cccctagggt 7980
cgactctcag tacaatctgc tctgatgccg catagttaag ccagtatctg ctccctgctt 8040
gtgtgttgga ggtcgctgag tagtgcgcga gcaaaattta agctacaaca aggcaaggct 8100
tgaccgacaa ttgcatgaag aatctgctta gggttaggcg ttttgcgctg cttcgcgatg 8160
tacgggccag atatacgcgt tgacattgat tattgactag ttattaatag taatcaatta 8220
cggggtcatt agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg 8280
gcccgcctgg ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc 8340
ccatagtaac gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa 8400
ctgcccactt ggcagtacat caagtgtatc 8430
<210> 2
<211> 8409
<212> DNA
<213> Artificial Sequence
<220>
<223> C-198-BE3
<400> 2
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat ggatccaaag 420
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 480
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 540
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 600
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 660
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 720
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 780
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 840
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 900
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 960
aatcaggaaa acagcggcag cgagactccc gggacctcag agtccgccac acccgaaagt 1020
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 1080
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 1140
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 1200
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 1260
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 1320
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 1380
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 1440
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 1500
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 1560
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 1620
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 1680
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 1740
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 1800
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 1860
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 1920
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 1980
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 2040
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt 2100
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 2160
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 2220
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 2280
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 2340
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 2400
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 2460
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 2520
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 2580
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 2640
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 2700
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 2760
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 2820
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 2880
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 2940
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 3000
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 3060
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 3120
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 3180
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 3240
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 3300
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 3360
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 3420
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 3480
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 3540
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 3600
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 3660
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 3720
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 3780
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 3840
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 3900
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 3960
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 4020
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 4080
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 4140
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 4200
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 4260
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 4320
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 4380
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 4440
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 4500
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 4560
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 4620
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 4680
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 4740
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 4800
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 4860
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 4920
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 4980
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 5040
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 5100
ttgtcacagc ttgggggtga ctctggtggt tctactaatc tgtcagatat tattgaaaag 5160
gagaccggta agcaactggt tatccaggaa tccatcctca tgctcccaga ggaggtggaa 5220
gaagtcattg ggaacaagcc ggaaagcgat atactcgtgc acaccgccta cgacgagagc 5280
accgacgaga atgtcatgct tctgactagc gacgcccctg aatacaagcc ttgggctctg 5340
gtcatacagg atagcaacgg tgagaacaag attaagatgc tctctggtgg ttctcccaag 5400
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 5460
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 5520
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 5580
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 5640
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 5700
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 5760
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 5820
gagccggaag cataaagtgt aaagcctagg gtgcctaatg agtgagctaa ctcacattaa 5880
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 5940
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 6000
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 6060
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 6120
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 6180
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 6240
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 6300
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 6360
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 6420
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 6480
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 6540
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 6600
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 6660
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 6720
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 6780
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 6840
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 6900
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 6960
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 7020
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 7080
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 7140
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 7200
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 7260
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 7320
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 7380
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 7440
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 7500
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 7560
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 7620
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 7680
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 7740
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 7800
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 7860
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 7920
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 7980
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 8040
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 8100
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 8160
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 8220
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 8280
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 8340
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 8400
aagtgtatc 8409
<210> 3
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M-BE3
<400> 3
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 4
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+D128K-BE3
<400> 4
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 5
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+P199A-BE3
<400> 5
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgcccccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 6
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+P199W-BE3
<400> 6
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg attggcccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 7
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+P200A-BE3
<400> 7
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccagccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 8
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+P200K-BE3
<400> 8
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccaaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 9
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+Q322K-BE3
<400> 9
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttccttg acccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atccacccac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg taaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 10
<211> 8997
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+D128K+P199A+P200A-BE3
<400> 10
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaagcctcac 420
ttcagaaaca cagtggagcg aatgtatcga gacacattct cctacaactt ttataatgca 480
cccatccttt ctcgtcggaa taccgtctgg ctgtgctacg aagtgaaaac aaagggtccc 540
tcaaggcccc ctttggacgc aaagatcttt cgaggccagg tgtattccga acttaagtac 600
cacccagaga tgagattctt ccactggttc agcaagtgga ggaagctgca tcgtgaccag 660
gagtatgagg tcacctggta catatccttg agcccctgca caaagtgtac aagggatatg 720
gccacgttcc tggccgagga cccgaaggtt accctgacca tctttgttgc ccgcctcgcc 780
tacttcctta agccagatta ccaggaggcg cttcgcagcc tgtgtcagaa aagagacggt 840
ccgcgtgcca ccatgaagat catgaattat gacgaatttc agcactgttg gagcaagttc 900
gtgtacagcc aaagagagct atttgagcct tggaataatc tgcctaaata ttatatatta 960
ctgcacatca tgctggggga gattctcaga cactcgatgg atgccaagac attcactttc 1020
aactttaaca atgaaccttg ggtcagagga cggcatgaga cttacctgtg ttatgaggtg 1080
gagcgcatgc acaatgacac ctgggtcctg ctgaaccagc gcaggggctt tctatgcaac 1140
caggctccac ataaacacgg tttccttgaa ggccgccatg cagagctgtg cttcctggac 1200
gtgattccct tttggaagct ggacctggac caggactaca gggttacctg cttcacctcc 1260
tggagcccct gcttcagctg tgcccaggaa atggctaaat tcatttcaaa aaacaaacac 1320
gtgagcctgt gcatcttcac tgcccgcatc tatgatgatc aaggaagatg tcaggagggg 1380
ctgcgcaccc tggccgaggc tggggccaaa atttcaataa tgacatacag tgaatttaag 1440
cactgctggg acacctttgt ggaccaccag ggatgtccct tccagccctg ggatggacta 1500
gatgagcaca gccaagacct gagtgggagg ctgcgggcca ttctccagaa tcaggaaaac 1560
agcggcagcg agactcccgg gacctcagag tccgccacac ccgaaagtga taaaaagtat 1620
tctattggtt tagccatcgg cactaattcc gttggatggg ctgtcataac cgatgaatac 1680
aaagtacctt caaagaaatt taaggtgttg gggaacacag accgtcattc gattaaaaag 1740
aatcttatcg gtgccctcct attcgatagt ggcgaaacgg cagaggcgac tcgcctgaaa 1800
cgaaccgctc ggagaaggta tacacgtcgc aagaaccgaa tatgttactt acaagaaatt 1860
tttagcaatg agatggccaa agttgacgat tctttctttc accgtttgga agagtccttc 1920
cttgtcgaag aggacaagaa acatgaacgg caccccatct ttggaaacat agtagatgag 1980
gtggcatatc atgaaaagta cccaacgatt tatcacctca gaaaaaagct agttgactca 2040
actgataaag cggacctgag gttaatctac ttggctcttg cccatatgat aaagttccgt 2100
gggcactttc tcattgaggg tgatctaaat ccggacaact cggatgtcga caaactgttc 2160
atccagttag tacaaaccta taatcagttg tttgaagaga accctataaa tgcaagtggc 2220
gtggatgcga aggctattct tagcgcccgc ctctctaaat cccgacggct agaaaacctg 2280
atcgcacaat tacccggaga gaagaaaaat gggttgttcg gtaaccttat agcgctctca 2340
ctaggcctga caccaaattt taagtcgaac ttcgacttag ctgaagatgc caaattgcag 2400
cttagtaagg acacgtacga tgacgatctc gacaatctac tggcacaaat tggagatcag 2460
tatgcggact tatttttggc tgccaaaaac cttagcgatg caatcctcct atctgacata 2520
ctgagagtta atactgagat taccaaggcg ccgttatccg cttcaatgat caaaaggtac 2580
gatgaacatc accaagactt gacacttctc aaggccctag tccgtcagca actgcctgag 2640
aaatataagg aaatattctt tgatcagtcg aaaaacgggt acgcaggtta tattgacggc 2700
ggagcgagtc aagaggaatt ctacaagttt atcaaaccca tattagagaa gatggatggg 2760
acggaagagt tgcttgtaaa actcaatcgc gaagatctac tgcgaaagca gcggactttc 2820
gacaacggta gcattccaca tcaaatccac ttaggcgaat tgcatgctat acttagaagg 2880
caggaggatt tttatccgtt cctcaaagac aatcgtgaaa agattgagaa aatcctaacc 2940
tttcgcatac cttactatgt gggacccctg gcccgaggga actctcggtt cgcatggatg 3000
acaagaaagt ccgaagaaac gattactcca tggaattttg aggaagttgt cgataaaggt 3060
gcgtcagctc aatcgttcat cgagaggatg accaactttg acaagaattt accgaacgaa 3120
aaagtattgc ctaagcacag tttactttac gagtatttca cagtgtacaa tgaactcacg 3180
aaagttaagt atgtcactga gggcatgcgt aaacccgcct ttctaagcgg agaacagaag 3240
aaagcaatag tagatctgtt attcaagacc aaccgcaaag tgacagttaa gcaattgaaa 3300
gaggactact ttaagaaaat tgaatgcttc gattctgtcg agatctccgg ggtagaagat 3360
cgatttaatg cgtcacttgg tacgtatcat gacctcctaa agataattaa agataaggac 3420
ttcctggata acgaagagaa tgaagatatc ttagaagata tagtgttgac tcttaccctc 3480
tttgaagatc gggaaatgat tgaggaaaga ctaaaaacat acgctcacct gttcgacgat 3540
aaggttatga aacagttaaa gaggcgtcgc tatacgggct ggggacgatt gtcgcggaaa 3600
cttatcaacg ggataagaga caagcaaagt ggtaaaacta ttctcgattt tctaaagagc 3660
gacggcttcg ccaataggaa ctttatgcag ctgatccatg atgactcttt aaccttcaaa 3720
gaggatatac aaaaggcaca ggtttccgga caaggggact cattgcacga acatattgcg 3780
aatcttgctg gttcgccagc catcaaaaag ggcatactcc agacagtcaa agtagtggat 3840
gagctagtta aggtcatggg acgtcacaaa ccggaaaaca ttgtaatcga gatggcacgc 3900
gaaaatcaaa cgactcagaa ggggcaaaaa aacagtcgag agcggatgaa gagaatagaa 3960
gagggtatta aagaactggg cagccagatc ttaaaggagc atcctgtgga aaatacccaa 4020
ttgcagaacg agaaacttta cctctattac ctacaaaatg gaagggacat gtatgttgat 4080
caggaactgg acataaaccg tttatctgat tacgacgtcg atcacattgt accccaatcc 4140
tttttgaagg acgattcaat cgacaataaa gtgcttacac gctcggataa gaaccgaggg 4200
aaaagtgaca atgttccaag cgaggaagtc gtaaagaaaa tgaagaacta ttggcggcag 4260
ctcctaaatg cgaaactgat aacgcaaaga aagttcgata acttaactaa agctgagagg 4320
ggtggcttgt ctgaacttga caaggccgga tttattaaac gtcagctcgt ggaaacccgc 4380
caaatcacaa agcatgttgc acagatacta gattcccgaa tgaatacgaa atacgacgag 4440
aacgataagc tgattcggga agtcaaagta atcactttaa agtcaaaatt ggtgtcggac 4500
ttcagaaagg attttcaatt ctataaagtt agggagataa ataactacca ccatgcgcac 4560
gacgcttatc ttaatgccgt cgtagggacc gcactcatta agaaataccc gaagctagaa 4620
agtgagtttg tgtatggtga ttacaaagtt tatgacgtcc gtaagatgat cgcgaaaagc 4680
gaacaggaga taggcaaggc tacagccaaa tacttctttt attctaacat tatgaatttc 4740
tttaagacgg aaatcactct ggcaaacgga gagatacgca aacgaccttt aattgaaacc 4800
aatggggaga caggtgaaat cgtatgggat aagggccggg acttcgcgac ggtgagaaaa 4860
gttttgtcca tgccccaagt caacatagta aagaaaactg aggtgcagac cggagggttt 4920
tcaaaggaat cgattcttcc aaaaaggaat agtgataagc tcatcgctcg taaaaaggac 4980
tgggacccga aaaagtacgg tggcttcgat agccctacag ttgcctattc tgtcctagta 5040
gtggcaaaag ttgagaaggg aaaatccaag aaactgaagt cagtcaaaga attattgggg 5100
ataacgatta tggagcgctc gtcttttgaa aagaacccca tcgacttcct tgaggcgaaa 5160
ggttacaagg aagtaaaaaa ggatctcata attaaactac caaagtatag tctgtttgag 5220
ttagaaaatg gccgaaaacg gatgttggct agcgccggag agcttcaaaa ggggaacgaa 5280
ctcgcactac cgtctaaata cgtgaatttc ctgtatttag cgtcccatta cgagaagttg 5340
aaaggttcac ctgaagataa cgaacagaag caactttttg ttgagcagca caaacattat 5400
ctcgacgaaa tcatagagca aatttcggaa ttcagtaaga gagtcatcct agctgatgcc 5460
aatctggaca aagtattaag cgcatacaac aagcacaggg ataaacccat acgtgagcag 5520
gcggaaaata ttatccattt gtttactctt accaacctcg gcgctccagc cgcattcaag 5580
tattttgaca caacgataga tcgcaaacga tacacttcta ccaaggaggt gctagacgcg 5640
acactgattc accaatccat cacgggatta tatgaaactc ggatagattt gtcacagctt 5700
gggggtgact ctggtggttc tactaatctg tcagatatta ttgaaaagga gaccggtaag 5760
caactggtta tccaggaatc catcctcatg ctcccagagg aggtggaaga agtcattggg 5820
aacaagccgg aaagcgatat actcgtgcac accgcctacg acgagagcac cgacgagaat 5880
gtcatgcttc tgactagcga cgcccctgaa tacaagcctt gggctctggt catacaggat 5940
agcaacggtg agaacaagat taagatgctc tctggtggtt ctcccaagaa gaagaggaaa 6000
gtctaaccgg tcatcatcac catcaccatt gagtttaaac ccgctgatca gcctcgactg 6060
tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg 6120
aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga 6180
gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg 6240
aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa 6300
ccagctgggg ctcgataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 6360
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 6420
taaagtgtaa agcctagggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 6480
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 6540
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 6600
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 6660
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 6720
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 6780
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 6840
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6900
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6960
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 7020
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 7080
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 7140
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag 7200
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 7260
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 7320
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 7380
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 7440
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 7500
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 7560
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 7620
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 7680
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 7740
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 7800
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 7860
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 7920
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 7980
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 8040
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 8100
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 8160
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 8220
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 8280
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 8340
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 8400
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 8460
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gacggatcgg 8520
gagatcgatc tcccgatccc ctagggtcga ctctcagtac aatctgctct gatgccgcat 8580
agttaagcca gtatctgctc cctgcttgtg tgttggaggt cgctgagtag tgcgcgagca 8640
aaatttaagc tacaacaagg caaggcttga ccgacaattg catgaagaat ctgcttaggg 8700
ttaggcgttt tgcgctgctt cgcgatgtac gggccagata tacgcgttga cattgattat 8760
tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 8820
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 8880
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 8940
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatc 8997
<210> 11
<211> 9429
<212> DNA
<213> Artificial Sequence
<220>
<223> A3G-BE4max
<400> 11
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atcctggagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctctactac 840
ttctgggacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatc cacccacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210> 12
<211> 9429
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+P199A+P200K-BE4max
<400> 12
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttgacc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210> 13
<211> 9429
<212> DNA
<213> Artificial Sequence
<220>
<223> 4M+D128K+P199A+P200K-BE4max
<400> 13
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gcctcacttc 480
agaaacacag tggagcgaat gtatcgagac acattctcct acaactttta taatgcaccc 540
atcctttctc gtcggaatac cgtctggctg tgctacgaag tgaaaacaaa gggtccctca 600
aggccccctt tggacgcaaa gatctttcga ggccaggtgt attccgaact taagtaccac 660
ccagagatga gattcttcca ctggttcagc aagtggagga agctgcatcg tgaccaggag 720
tatgaggtca cctggtacat atccttgagc ccctgcacaa agtgtacaag ggatatggcc 780
acgttcctgg ccgaggaccc gaaggttacc ctgaccatct ttgttgcccg cctcgcctac 840
ttccttaagc cagattacca ggaggcgctt cgcagcctgt gtcagaaaag agacggtccg 900
cgtgccacca tgaagatcat gaattatgac gaatttcagc actgttggag caagttcgtg 960
tacagccaaa gagagctatt tgagccttgg aataatctgc ctaaatatta tatattactg 1020
cacatcatgc tgggggagat tctcagacac tcgatggatg ccaagacatt cactttcaac 1080
tttaacaatg aaccttgggt cagaggacgg catgagactt acctgtgtta tgaggtggag 1140
cgcatgcaca atgacacctg ggtcctgctg aaccagcgca ggggctttct atgcaaccag 1200
gctccacata aacacggttt ccttgaaggc cgccatgcag agctgtgctt cctggacgtg 1260
attccctttt ggaagctgga cctggaccag gactacaggg ttacctgctt cacctcctgg 1320
agcccctgct tcagctgtgc ccaggaaatg gctaaattca tttcaaaaaa caaacacgtg 1380
agcctgtgca tcttcactgc ccgcatctat gatgatcaag gaagatgtca ggaggggctg 1440
cgcaccctgg ccgaggctgg ggccaaaatt tcaataatga catacagtga atttaagcac 1500
tgctgggaca cctttgtgga ccaccaggga tgtcccttcc agccctggga tggactagat 1560
gagcacagcc aagacctgag tgggaggctg cgggccattc tccagaatca ggaaaactct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210> 14
<211> 9429
<212> DNA
<213> Artificial Sequence
<220>
<223> A3G(OP)+P199A+P200K-BE4max
<400> 14
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 60
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 120
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 180
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 240
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 300
ggcgtgtacg gtgggaggtc tatataagca gagctggttt agtgaaccgt cagatccgct 360
agagatccgc ggccgctaat acgactcact atagggagag ccgccaccat gaaacggaca 420
gccgacggaa gcgagttcga gtcaccaaag aagaagcgga aagtcatgaa gccccacttt 480
cggaacaccg tggagcggat gtacagagat accttcagct acaacttcta taatagacct 540
atcctgtccc ggagaaatac cgtgtggctg tgctatgagg tgaagacaaa gggcccatct 600
cggccccctc tggatgccaa gatctttaga ggccaggtgt acagcgagct gaagtatcac 660
cctgagatga ggttctttca ctggttctcc aagtggagga agctgcaccg cgaccaggag 720
tacgaggtga cctggtatat cagctggtcc ccctgcacca agtgtacacg cgatatggcc 780
acatttctgg ccgaggaccc taaggtgacc ctgacaatct ttgtggccag gctgtactat 840
ttccgggacc cagattacca ggaggccctg cgctctctgt gccagaagcg ggatggcccc 900
agagccacca tgaagatcat gaactacgac gagtttcagc actgttggag caagttcgtg 960
tattcccagc gggagctgtt cgagccttgg aacaatctgc caaagtacta tatcctgctg 1020
cacatcatgc tgggcgagat cctgagacac agcatggatg ccaagacctt caccttcaac 1080
ttcaacaatg agccatgggt gcggggcaga cacgagacct acctgtgcta tgaggtggag 1140
cggatgcaca acgacacatg ggtgctgctg aatcagaggc gcggctttct gtgcaatcag 1200
gcaccacaca agcacggctt cctggagggc aggcacgcag agctgtgctt cctggatgtg 1260
atccctttct ggaagctgga cctggatcag gactaccgcg tgacctgttt tacatcttgg 1320
agcccatgct tctcctgtgc ccaggagatg gccaagttta tctccaagaa taagcacgtg 1380
tctctgtgca tcttcaccgc caggatctac gacgatcagg gcaggtgtca ggagggactg 1440
cgcacactgg cagaggcagg agccaagatc tctatcatga cctatagcga gtttaagcac 1500
tgctgggata cattcgtgga ccaccagggc tgtccattcc agccctggga tggcctggac 1560
gagcactccc aggacctgtc tggcaggctg agggccatcc tgcagaacca ggagaattct 1620
ggaggatcta gcggaggatc ctctggcagc gagacaccag gaacaagcga gtcagcaaca 1680
ccagagagca gtggcggcag cagcggcggc agcgacaaga agtacagcat cggcctggcc 1740
atcggcacca actctgtggg ctgggccgtg atcaccgacg agtacaaggt gcccagcaag 1800
aaattcaagg tgctgggcaa caccgaccgg cacagcatca agaagaacct gatcggagcc 1860
ctgctgttcg acagcggcga aacagccgag gccacccggc tgaagagaac cgccagaaga 1920
agatacacca gacggaagaa ccggatctgc tatctgcaag agatcttcag caacgagatg 1980
gccaaggtgg acgacagctt cttccacaga ctggaagagt ccttcctggt ggaagaggat 2040
aagaagcacg agcggcaccc catcttcggc aacatcgtgg acgaggtggc ctaccacgag 2100
aagtacccca ccatctacca cctgagaaag aaactggtgg acagcaccga caaggccgac 2160
ctgcggctga tctatctggc cctggcccac atgatcaagt tccggggcca cttcctgatc 2220
gagggcgacc tgaaccccga caacagcgac gtggacaagc tgttcatcca gctggtgcag 2280
acctacaacc agctgttcga ggaaaacccc atcaacgcca gcggcgtgga cgccaaggcc 2340
atcctgtctg ccagactgag caagagcaga cggctggaaa atctgatcgc ccagctgccc 2400
ggcgagaaga agaatggcct gttcggaaac ctgattgccc tgagcctggg cctgaccccc 2460
aacttcaaga gcaacttcga cctggccgag gatgccaaac tgcagctgag caaggacacc 2520
tacgacgacg acctggacaa cctgctggcc cagatcggcg accagtacgc cgacctgttt 2580
ctggccgcca agaacctgtc cgacgccatc ctgctgagcg acatcctgag agtgaacacc 2640
gagatcacca aggcccccct gagcgcctct atgatcaaga gatacgacga gcaccaccag 2700
gacctgaccc tgctgaaagc tctcgtgcgg cagcagctgc ctgagaagta caaagagatt 2760
ttcttcgacc agagcaagaa cggctacgcc ggctacattg acggcggagc cagccaggaa 2820
gagttctaca agttcatcaa gcccatcctg gaaaagatgg acggcaccga ggaactgctc 2880
gtgaagctga acagagagga cctgctgcgg aagcagcgga ccttcgacaa cggcagcatc 2940
ccccaccaga tccacctggg agagctgcac gccattctgc ggcggcagga agatttttac 3000
ccattcctga aggacaaccg ggaaaagatc gagaagatcc tgaccttccg catcccctac 3060
tacgtgggcc ctctggccag gggaaacagc agattcgcct ggatgaccag aaagagcgag 3120
gaaaccatca ccccctggaa cttcgaggaa gtggtggaca agggcgcttc cgcccagagc 3180
ttcatcgagc ggatgaccaa cttcgataag aacctgccca acgagaaggt gctgcccaag 3240
cacagcctgc tgtacgagta cttcaccgtg tataacgagc tgaccaaagt gaaatacgtg 3300
accgagggaa tgagaaagcc cgccttcctg agcggcgagc agaaaaaggc catcgtggac 3360
ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga ctacttcaag 3420
aaaatcgagt gcttcgactc cgtggaaatc tccggcgtgg aagatcggtt caacgcctcc 3480
ctgggcacat accacgatct gctgaaaatt atcaaggaca aggacttcct ggacaatgag 3540
gaaaacgagg acattctgga agatatcgtg ctgaccctga cactgtttga ggacagagag 3600
atgatcgagg aacggctgaa aacctatgcc cacctgttcg acgacaaagt gatgaagcag 3660
ctgaagcggc ggagatacac cggctggggc aggctgagcc ggaagctgat caacggcatc 3720
cgggacaagc agtccggcaa gacaatcctg gatttcctga agtccgacgg cttcgccaac 3780
agaaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga catccagaaa 3840
gcccaggtgt ccggccaggg cgatagcctg cacgagcaca ttgccaatct ggccggcagc 3900
cccgccatta agaagggcat cctgcagaca gtgaaggtgg tggacgagct cgtgaaagtg 3960
atgggccggc acaagcccga gaacatcgtg atcgaaatgg ccagagagaa ccagaccacc 4020
cagaagggac agaagaacag ccgcgagaga atgaagcgga tcgaagaggg catcaaagag 4080
ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca gaacgagaag 4140
ctgtacctgt actacctgca gaatgggcgg gatatgtacg tggaccagga actggacatc 4200
aaccggctgt ccgactacga tgtggaccat atcgtgcctc agagctttct gaaggacgac 4260
tccatcgaca acaaggtgct gaccagaagc gacaagaacc ggggcaagag cgacaacgtg 4320
ccctccgaag aggtcgtgaa gaagatgaag aactactggc ggcagctgct gaacgccaag 4380
ctgattaccc agagaaagtt cgacaatctg accaaggccg agagaggcgg cctgagcgaa 4440
ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat tacaaagcac 4500
gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga caagctgatc 4560
cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg gaaggatttc 4620
cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc ctacctaaac 4680
gccgtcgtgg gaaccgcact gatcaaaaag taccctaagc tggaaagcga gttcgtgtac 4740
ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca ggaaatcggc 4800
aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa gaccgagatt 4860
accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg cgaaaccggg 4920
gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct gagcatgccc 4980
caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa agagtctatc 5040
agacccaaga ggaacagcga taagctgatc gccagaaaga aggactggga ccctaagaag 5100
tacggcggct tcgtgagccc caccgtggcc tattctgtgc tggtggtggc caaagtggaa 5160
aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac catcatggaa 5220
agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta caaagaagtg 5280
aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga aaacggccgg 5340
aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc cctgccctcc 5400
aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg ctcccccgag 5460
gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga cgagatcatc 5520
gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg 5580
ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga gaatatcatc 5640
cacctgttta ccctgaccaa tctgggagcc cctagagcct tcaagtactt tgacaccacc 5700
atcgaccgga aggtgtacag aagcaccaaa gaggtgctgg acgccaccct gatccaccag 5760
agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg tgacagcggc 5820
gggagcggcg ggagcggggg gagcactaat ctgagcgaca tcattgagaa ggagactggg 5880
aaacagctgg tcattcagga gtccatcctg atgctgcctg aggaggtgga ggaagtgatc 5940
ggcaacaagc cagagtctga catcctggtg cacaccgcct acgacgagtc cacagatgag 6000
aatgtgatgc tgctgacctc tgacgccccc gagtataagc cttgggccct ggtcatccag 6060
gattctaacg gcgagaataa gatcaagatg ctgagcggag gatccggagg atctggaggc 6120
agcaccaacc tgtctgacat catcgagaag gagacaggca agcagctggt catccaggag 6180
agcatcctga tgctgcccga agaagtcgaa gaagtgatcg gaaacaagcc tgagagcgat 6240
atcctggtcc ataccgccta cgacgagagt accgacgaaa atgtgatgct gctgacatcc 6300
gacgccccag agtataagcc ctgggctctg gtcatccagg attccaacgg agagaacaaa 6360
atcaaaatgc tgtctggcgg ctcaaaaaga accgccgacg gcagcgaatt cgagcccaag 6420
aagaagagga aagtctaacc ggtcatcatc accatcacca ttgagtttaa acccgctgat 6480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 6540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 6600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 6660
gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct atggcttctg 6720
aggcggaaag aaccagctgg ggctcgatac cgtcgacctc tagctagagc ttggcgtaat 6780
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca cacaacatac 6840
gagccggaag cataaagtgt aaagcctagg atgcctaatg agtgagctaa ctcacattaa 6900
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag ctgcattaat 6960
gaatcggcca acgcgcggga agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc 7020
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7080
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7140
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7200
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7260
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7320
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 7380
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 7440
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 7500
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 7560
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 7620
ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 7680
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 7740
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 7800
ggtctgacac tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa 7860
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta 7920
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag 7980
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga 8040
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac 8100
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc 8160
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta 8220
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac 8280
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat 8340
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa 8400
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg 8460
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag 8520
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc 8580
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct 8640
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat 8700
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg 8760
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc 8820
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta 8880
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg 8940
tcgacggatc gggagatcga tctcccgatc ccctagggtc gactctcagt acaatctgct 9000
ctgatgccgc atagttaagc cagtatctgc tccctgcttg tgtgttggag gtcgctgagt 9060
agtgcgcgag caaaatttaa gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga 9120
atctgcttag ggttaggcgt tttgcgctgc ttcgcgatgt acgggccaga tatacgcgtt 9180
gacattgatt attgactagt tattaatagt aatcaattac ggggtcatta gttcatagcc 9240
catatatgga gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca 9300
acgacccccg cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga 9360
ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg gcagtacatc 9420
aagtgtatc 9429
<210> 15
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 15
accgggccca gactgagcac gtga 24
<210> 16
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 16
aaactcacgt gctcagtctg ggcc 24
<210> 17
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 17
accgtgcccc tccctccctg gccc 24
<210> 18
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 18
aaacgggcca gggagggagg ggca 24
<210> 19
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 19
accggaacac aaagcataga ctgc 24
<210> 20
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> 多核苷酸序列
<400> 20
aaacgcagtc tatgctttgt gttc 24
<210> 21
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 21
gcccatgcaa ttagtctatt tctgctg 27
<210> 22
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 22
gcaggagctg cacatactag cc 22
<210> 23
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 23
ggggccccta accctatgta gc 22
<210> 24
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 24
ccattggcct gcttcgtggc 20
<210> 25
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 25
gttactgcag cccaagcctc ag 22
<210> 26
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> 引物
<400> 26
gtccagcccc atctgtcaaa ctg 23
<210> 27
<211> 583
<212> DNA
<213> APOBEC3G片段
<400> 27
ggagattctc agacactcga tggatccaaa gacattcact ttcaacttta acaatgaacc 60
ttgggtcaga ggacggcatg agacttacct gtgttatgag gtggagcgca tgcacaatga 120
cacctgggtc ctgctgaacc agcgcagggg ctttctatgc aaccaggctc cacataaaca 180
cggtttcctt gaaggccgcc atgcagagct gtgcttcctg gacgtgattc ccttttggaa 240
gctggacctg gaccaggact acagggttac ctgcttcacc tcctggagcc cctgcttcag 300
ctgtgcccag gaaatggcta aattcatttc aaaaaacaaa cacgtgagcc tgtgcatctt 360
cactgcccgc atctatgatg atcaaggaag atgtcaggag gggctgcgca ccctggccga 420
ggctggggcc aaaatttcaa taatgacata cagtgaattt aagcactgct gggacacctt 480
tgtggaccac cagggatgtc ccttccagcc ctgggatgga ctagatgagc acagccaaga 540
cctgagtggg aggctgcggg ccattctcca gaatcaggaa aac 583
<210> 28
<211> 564
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 28
atggatccaa agacattcac tttcaacttt aacaatgaac cttgggtcag aggacggcat 60
gagacttacc tgtgttatga ggtggagcgc atgcacaatg acacctgggt cctgctgaac 120
cagcgcaggg gctttctatg caaccaggct ccacataaac acggtttcct tgaaggccgc 180
catgcagagc tgtgcttcct ggacgtgatt cccttttgga agctggacct ggaccaggac 240
tacagggtta cctgcttcac ctcctggagc ccctgcttca gctgtgccca ggaaatggct 300
aaattcattt caaaaaacaa acacgtgagc ctgtgcatct tcactgcccg catctatgat 360
gatcaaggaa gatgtcagga ggggctgcgc accctggccg aggctggggc caaaatttca 420
ataatgacat acagtgaatt taagcactgc tgggacacct ttgtggacca ccagggatgt 480
cccttccagc cctgggatgg actagatgag cacagccaag acctgagtgg gaggctgcgg 540
gccattctcc agaatcagga aaac 564
<210> 29
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 29
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 30
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 30
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 31
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 31
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 32
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 32
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggattggccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 33
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 33
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccagcc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 34
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 34
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 35
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 35
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct tgacccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatccaccc 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtaaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 36
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> APOBEC3G片段
<400> 36
atgaagcctc acttcagaaa cacagtggag cgaatgtatc gagacacatt ctcctacaac 60
ttttataatg cacccatcct ttctcgtcgg aataccgtct ggctgtgcta cgaagtgaaa 120
acaaagggtc cctcaaggcc ccctttggac gcaaagatct ttcgaggcca ggtgtattcc 180
gaacttaagt accacccaga gatgagattc ttccactggt tcagcaagtg gaggaagctg 240
catcgtgacc aggagtatga ggtcacctgg tacatatcct tgagcccctg cacaaagtgt 300
acaagggata tggccacgtt cctggccgag gacccgaagg ttaccctgac catctttgtt 360
gcccgcctcg cctacttcct taagccagat taccaggagg cgcttcgcag cctgtgtcag 420
aaaagagacg gtccgcgtgc caccatgaag atcatgaatt atgacgaatt tcagcactgt 480
tggagcaagt tcgtgtacag ccaaagagag ctatttgagc cttggaataa tctgcctaaa 540
tattatatat tactgcacat catgctgggg gagattctca gacactcgat ggatgccaag 600
acattcactt tcaactttaa caatgaacct tgggtcagag gacggcatga gacttacctg 660
tgttatgagg tggagcgcat gcacaatgac acctgggtcc tgctgaacca gcgcaggggc 720
tttctatgca accaggctcc acataaacac ggtttccttg aaggccgcca tgcagagctg 780
tgcttcctgg acgtgattcc cttttggaag ctggacctgg accaggacta cagggttacc 840
tgcttcacct cctggagccc ctgcttcagc tgtgcccagg aaatggctaa attcatttca 900
aaaaacaaac acgtgagcct gtgcatcttc actgcccgca tctatgatga tcaaggaaga 960
tgtcaggagg ggctgcgcac cctggccgag gctggggcca aaatttcaat aatgacatac 1020
agtgaattta agcactgctg ggacaccttt gtggaccacc agggatgtcc cttccagccc 1080
tgggatggac tagatgagca cagccaagac ctgagtggga ggctgcgggc cattctccag 1140
aatcaggaaa ac 1152
<210> 37
<211> 4101
<212> DNA
<213> Artificial Sequence
<220>
<223> SpCas9-D10A nickase片段
<400> 37
gataaaaagt attctattgg tttagccatc ggcactaatt ccgttggatg ggctgtcata 60
accgatgaat acaaagtacc ttcaaagaaa tttaaggtgt tggggaacac agaccgtcat 120
tcgattaaaa agaatcttat cggtgccctc ctattcgata gtggcgaaac ggcagaggcg 180
actcgcctga aacgaaccgc tcggagaagg tatacacgtc gcaagaaccg aatatgttac 240
ttacaagaaa tttttagcaa tgagatggcc aaagttgacg attctttctt tcaccgtttg 300
gaagagtcct tccttgtcga agaggacaag aaacatgaac ggcaccccat ctttggaaac 360
atagtagatg aggtggcata tcatgaaaag tacccaacga tttatcacct cagaaaaaag 420
ctagttgact caactgataa agcggacctg aggttaatct acttggctct tgcccatatg 480
ataaagttcc gtgggcactt tctcattgag ggtgatctaa atccggacaa ctcggatgtc 540
gacaaactgt tcatccagtt agtacaaacc tataatcagt tgtttgaaga gaaccctata 600
aatgcaagtg gcgtggatgc gaaggctatt cttagcgccc gcctctctaa atcccgacgg 660
ctagaaaacc tgatcgcaca attacccgga gagaagaaaa atgggttgtt cggtaacctt 720
atagcgctct cactaggcct gacaccaaat tttaagtcga acttcgactt agctgaagat 780
gccaaattgc agcttagtaa ggacacgtac gatgacgatc tcgacaatct actggcacaa 840
attggagatc agtatgcgga cttatttttg gctgccaaaa accttagcga tgcaatcctc 900
ctatctgaca tactgagagt taatactgag attaccaagg cgccgttatc cgcttcaatg 960
atcaaaaggt acgatgaaca tcaccaagac ttgacacttc tcaaggccct agtccgtcag 1020
caactgcctg agaaatataa ggaaatattc tttgatcagt cgaaaaacgg gtacgcaggt 1080
tatattgacg gcggagcgag tcaagaggaa ttctacaagt ttatcaaacc catattagag 1140
aagatggatg ggacggaaga gttgcttgta aaactcaatc gcgaagatct actgcgaaag 1200
cagcggactt tcgacaacgg tagcattcca catcaaatcc acttaggcga attgcatgct 1260
atacttagaa ggcaggagga tttttatccg ttcctcaaag acaatcgtga aaagattgag 1320
aaaatcctaa cctttcgcat accttactat gtgggacccc tggcccgagg gaactctcgg 1380
ttcgcatgga tgacaagaaa gtccgaagaa acgattactc catggaattt tgaggaagtt 1440
gtcgataaag gtgcgtcagc tcaatcgttc atcgagagga tgaccaactt tgacaagaat 1500
ttaccgaacg aaaaagtatt gcctaagcac agtttacttt acgagtattt cacagtgtac 1560
aatgaactca cgaaagttaa gtatgtcact gagggcatgc gtaaacccgc ctttctaagc 1620
ggagaacaga agaaagcaat agtagatctg ttattcaaga ccaaccgcaa agtgacagtt 1680
aagcaattga aagaggacta ctttaagaaa attgaatgct tcgattctgt cgagatctcc 1740
ggggtagaag atcgatttaa tgcgtcactt ggtacgtatc atgacctcct aaagataatt 1800
aaagataagg acttcctgga taacgaagag aatgaagata tcttagaaga tatagtgttg 1860
actcttaccc tctttgaaga tcgggaaatg attgaggaaa gactaaaaac atacgctcac 1920
ctgttcgacg ataaggttat gaaacagtta aagaggcgtc gctatacggg ctggggacga 1980
ttgtcgcgga aacttatcaa cgggataaga gacaagcaaa gtggtaaaac tattctcgat 2040
tttctaaaga gcgacggctt cgccaatagg aactttatgc agctgatcca tgatgactct 2100
ttaaccttca aagaggatat acaaaaggca caggtttccg gacaagggga ctcattgcac 2160
gaacatattg cgaatcttgc tggttcgcca gccatcaaaa agggcatact ccagacagtc 2220
aaagtagtgg atgagctagt taaggtcatg ggacgtcaca aaccggaaaa cattgtaatc 2280
gagatggcac gcgaaaatca aacgactcag aaggggcaaa aaaacagtcg agagcggatg 2340
aagagaatag aagagggtat taaagaactg ggcagccaga tcttaaagga gcatcctgtg 2400
gaaaataccc aattgcagaa cgagaaactt tacctctatt acctacaaaa tggaagggac 2460
atgtatgttg atcaggaact ggacataaac cgtttatctg attacgacgt cgatcacatt 2520
gtaccccaat cctttttgaa ggacgattca atcgacaata aagtgcttac acgctcggat 2580
aagaaccgag ggaaaagtga caatgttcca agcgaggaag tcgtaaagaa aatgaagaac 2640
tattggcggc agctcctaaa tgcgaaactg ataacgcaaa gaaagttcga taacttaact 2700
aaagctgaga ggggtggctt gtctgaactt gacaaggccg gatttattaa acgtcagctc 2760
gtggaaaccc gccaaatcac aaagcatgtt gcacagatac tagattcccg aatgaatacg 2820
aaatacgacg agaacgataa gctgattcgg gaagtcaaag taatcacttt aaagtcaaaa 2880
ttggtgtcgg acttcagaaa ggattttcaa ttctataaag ttagggagat aaataactac 2940
caccatgcgc acgacgctta tcttaatgcc gtcgtaggga ccgcactcat taagaaatac 3000
ccgaagctag aaagtgagtt tgtgtatggt gattacaaag tttatgacgt ccgtaagatg 3060
atcgcgaaaa gcgaacagga gataggcaag gctacagcca aatacttctt ttattctaac 3120
attatgaatt tctttaagac ggaaatcact ctggcaaacg gagagatacg caaacgacct 3180
ttaattgaaa ccaatgggga gacaggtgaa atcgtatggg ataagggccg ggacttcgcg 3240
acggtgagaa aagttttgtc catgccccaa gtcaacatag taaagaaaac tgaggtgcag 3300
accggagggt tttcaaagga atcgattctt ccaaaaagga atagtgataa gctcatcgct 3360
cgtaaaaagg actgggaccc gaaaaagtac ggtggcttcg atagccctac agttgcctat 3420
tctgtcctag tagtggcaaa agttgagaag ggaaaatcca agaaactgaa gtcagtcaaa 3480
gaattattgg ggataacgat tatggagcgc tcgtcttttg aaaagaaccc catcgacttc 3540
cttgaggcga aaggttacaa ggaagtaaaa aaggatctca taattaaact accaaagtat 3600
agtctgtttg agttagaaaa tggccgaaaa cggatgttgg ctagcgccgg agagcttcaa 3660
aaggggaacg aactcgcact accgtctaaa tacgtgaatt tcctgtattt agcgtcccat 3720
tacgagaagt tgaaaggttc acctgaagat aacgaacaga agcaactttt tgttgagcag 3780
cacaaacatt atctcgacga aatcatagag caaatttcgg aattcagtaa gagagtcatc 3840
ctagctgatg ccaatctgga caaagtatta agcgcataca acaagcacag ggataaaccc 3900
atacgtgagc aggcggaaaa tattatccat ttgtttactc ttaccaacct cggcgctcca 3960
gccgcattca agtattttga cacaacgata gatcgcaaac gatacacttc taccaaggag 4020
gtgctagacg cgacactgat tcaccaatcc atcacgggat tatatgaaac tcggatagat 4080
ttgtcacagc ttgggggtga c 4101
<210> 38
<211> 4101
<212> DNA
<213> Artificial Sequence
<220>
<223> SpCas9-D10A nickase片段
<400> 38
gacaagaagt acagcatcgg cctggccatc ggcaccaact ctgtgggctg ggccgtgatc 60
accgacgagt acaaggtgcc cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac 120
agcatcaaga agaacctgat cggagccctg ctgttcgaca gcggcgaaac agccgaggcc 180
acccggctga agagaaccgc cagaagaaga tacaccagac ggaagaaccg gatctgctat 240
ctgcaagaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccacagactg 300
gaagagtcct tcctggtgga agaggataag aagcacgagc ggcaccccat cttcggcaac 360
atcgtggacg aggtggccta ccacgagaag taccccacca tctaccacct gagaaagaaa 420
ctggtggaca gcaccgacaa ggccgacctg cggctgatct atctggccct ggcccacatg 480
atcaagttcc ggggccactt cctgatcgag ggcgacctga accccgacaa cagcgacgtg 540
gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga aaaccccatc 600
aacgccagcg gcgtggacgc caaggccatc ctgtctgcca gactgagcaa gagcagacgg 660
ctggaaaatc tgatcgccca gctgcccggc gagaagaaga atggcctgtt cggaaacctg 720
attgccctga gcctgggcct gacccccaac ttcaagagca acttcgacct ggccgaggat 780
gccaaactgc agctgagcaa ggacacctac gacgacgacc tggacaacct gctggcccag 840
atcggcgacc agtacgccga cctgtttctg gccgccaaga acctgtccga cgccatcctg 900
ctgagcgaca tcctgagagt gaacaccgag atcaccaagg cccccctgag cgcctctatg 960
atcaagagat acgacgagca ccaccaggac ctgaccctgc tgaaagctct cgtgcggcag 1020
cagctgcctg agaagtacaa agagattttc ttcgaccaga gcaagaacgg ctacgccggc 1080
tacattgacg gcggagccag ccaggaagag ttctacaagt tcatcaagcc catcctggaa 1140
aagatggacg gcaccgagga actgctcgtg aagctgaaca gagaggacct gctgcggaag 1200
cagcggacct tcgacaacgg cagcatcccc caccagatcc acctgggaga gctgcacgcc 1260
attctgcggc ggcaggaaga tttttaccca ttcctgaagg acaaccggga aaagatcgag 1320
aagatcctga ccttccgcat cccctactac gtgggccctc tggccagggg aaacagcaga 1380
ttcgcctgga tgaccagaaa gagcgaggaa accatcaccc cctggaactt cgaggaagtg 1440
gtggacaagg gcgcttccgc ccagagcttc atcgagcgga tgaccaactt cgataagaac 1500
ctgcccaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtat 1560
aacgagctga ccaaagtgaa atacgtgacc gagggaatga gaaagcccgc cttcctgagc 1620
ggcgagcaga aaaaggccat cgtggacctg ctgttcaaga ccaaccggaa agtgaccgtg 1680
aagcagctga aagaggacta cttcaagaaa atcgagtgct tcgactccgt ggaaatctcc 1740
ggcgtggaag atcggttcaa cgcctccctg ggcacatacc acgatctgct gaaaattatc 1800
aaggacaagg acttcctgga caatgaggaa aacgaggaca ttctggaaga tatcgtgctg 1860
accctgacac tgtttgagga cagagagatg atcgaggaac ggctgaaaac ctatgcccac 1920
ctgttcgacg acaaagtgat gaagcagctg aagcggcgga gatacaccgg ctggggcagg 1980
ctgagccgga agctgatcaa cggcatccgg gacaagcagt ccggcaagac aatcctggat 2040
ttcctgaagt ccgacggctt cgccaacaga aacttcatgc agctgatcca cgacgacagc 2100
ctgaccttta aagaggacat ccagaaagcc caggtgtccg gccagggcga tagcctgcac 2160
gagcacattg ccaatctggc cggcagcccc gccattaaga agggcatcct gcagacagtg 2220
aaggtggtgg acgagctcgt gaaagtgatg ggccggcaca agcccgagaa catcgtgatc 2280
gaaatggcca gagagaacca gaccacccag aagggacaga agaacagccg cgagagaatg 2340
aagcggatcg aagagggcat caaagagctg ggcagccaga tcctgaaaga acaccccgtg 2400
gaaaacaccc agctgcagaa cgagaagctg tacctgtact acctgcagaa tgggcgggat 2460
atgtacgtgg accaggaact ggacatcaac cggctgtccg actacgatgt ggaccatatc 2520
gtgcctcaga gctttctgaa ggacgactcc atcgacaaca aggtgctgac cagaagcgac 2580
aagaaccggg gcaagagcga caacgtgccc tccgaagagg tcgtgaagaa gatgaagaac 2640
tactggcggc agctgctgaa cgccaagctg attacccaga gaaagttcga caatctgacc 2700
aaggccgaga gaggcggcct gagcgaactg gataaggccg gcttcatcaa gagacagctg 2760
gtggaaaccc ggcagattac aaagcacgtg gcacagatcc tggactcccg gatgaacact 2820
aagtacgacg agaatgacaa gctgatccgg gaagtgaaag tgatcaccct gaagtccaag 2880
ctggtgtccg atttccggaa ggatttccag ttttacaaag tgcgcgagat caacaactac 2940
caccacgccc acgacgccta cctaaacgcc gtcgtgggaa ccgcactgat caaaaagtac 3000
cctaagctgg aaagcgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg 3060
atcgccaaga gcgagcagga aatcggcaag gctaccgcca agtacttctt ctacagcaac 3120
atcatgaact ttttcaagac cgagattacc ctggccaacg gcgagatccg gaagcggcct 3180
ctgatcgaga caaacggcga aaccggggag atcgtgtggg ataagggccg ggattttgcc 3240
accgtgcgga aagtgctgag catgccccaa gtgaatatcg tgaaaaagac cgaggtgcag 3300
acaggcggct tcagcaaaga gtctatcaga cccaagagga acagcgataa gctgatcgcc 3360
agaaagaagg actgggaccc taagaagtac ggcggcttcg tgagccccac cgtggcctat 3420
tctgtgctgg tggtggccaa agtggaaaag ggcaagtcca agaaactgaa gagtgtgaaa 3480
gagctgctgg ggatcaccat catggaaaga agcagcttcg agaagaatcc catcgacttt 3540
ctggaagcca agggctacaa agaagtgaaa aaggacctga tcatcaagct gcctaagtac 3600
tccctgttcg agctggaaaa cggccggaag agaatgctgg cctctgccag attcctgcag 3660
aagggaaacg aactggccct gccctccaaa tatgtgaact tcctgtacct ggccagccac 3720
tatgagaagc tgaagggctc ccccgaggat aatgagcaga aacagctgtt tgtggaacag 3780
cacaagcact acctggacga gatcatcgag cagatcagcg agttctccaa gagagtgatc 3840
ctggccgacg ctaatctgga caaagtgctg tccgcctaca acaagcaccg ggataagccc 3900
atcagagagc aggccgagaa tatcatccac ctgtttaccc tgaccaatct gggagcccct 3960
agagccttca agtactttga caccaccatc gaccggaagg tgtacagaag caccaaagag 4020
gtgctggacg ccaccctgat ccaccagagc atcaccggcc tgtacgagac acggatcgac 4080
ctgtctcagc tgggaggtga c 4101
<210> 39
<211> 57
<212> DNA
<213> Artificial Sequence
<220>
<223> 核定位信号片段
<400> 39
atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtc 57
<210> 40
<211> 96
<212> DNA
<213> Artificial Sequence
<220>
<223> 柔性连接肽片段
<400> 40
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96
<210> 41
<211> 96
<212> DNA
<213> Artificial Sequence
<220>
<223> 柔性连接肽片段
<400> 41
tctggaggat ctagcggagg atcctctgga agcgagacac caggcacaag cgagtccgcc 60
acaccagaga gctccggcgg ctcctccgga ggatcc 96

Claims (16)

1.一种融合蛋白,其特征在于,所述融合蛋白自N端至C端依次包括APOBEC3G片段和SpCas9-D10A nickase片段,所述APOBEC3G片段具有胞嘧啶脱氨酶活性;所述APOBEC3G片段存在R24A、W94L、Y124A、W127L、D128K、P199A、P199W、P200A、P200K、Q322K中的至少一个氨基酸突变或者APOBEC3G片段为自APOBEC3G第一位起始密码子至第190位或197位删除的截短APOBEC3G片段。
2.如权利要求1所述的融合蛋白,其特征在于,所述APOBEC3G片段的核苷酸序列包括:
a)如SEQ ID NO.27-36所示的核苷酸序列;或,
b)与SEQ ID NO.27-36具有80%以上序列相似性的核苷酸序列、且具有a)所限定的核苷酸序列的功能。
3.如权利要求1所述的融合蛋白,其特征在于,所述SpCas9-D10A nickase片段的核苷酸序列包括:
c)如SEQ ID NO.37-38所示的核苷酸序列;或,
d)与SEQ ID NO.37-38具有80%以上序列相似性的核苷酸序列、且具有d)所限定的核苷酸序列的功能。
4.如权利要求1所述的融合蛋白,其特征在于,所述融合蛋白还包括核定位信号片段,所述核定位信号片段位于APOBEC3G片段的N端或SpCas9-D10A nickase片段的C端。
5.如权利要求4所述的融合蛋白,其特征在于,所述核定位信号片段的核苷酸序列如SEQ ID NO.39所示。
6.如权利要求1所述的融合蛋白,其特征在于,所述融合蛋白还包括柔性连接肽片段,所述柔性连接肽段位于APOBEC3G片段的N端、APOBEC3G片段和SpCas9-D10A nickase之间、或SpCas9-D10A nickase的C端。
7.如权利要求6所述的融合蛋白,其特征在于,所述柔性连接肽片段的核苷酸序列如SEQ ID NO.40-41所示。
8.一种分离的多核苷酸,其特征在于,所述的分离的多核苷酸编码权利要求1所述的融合蛋白。
9.一种构建体,其特征在于,所述构建体通过将权利要求8所述的分离的多核苷酸***表达载体中构建获得;所述构建体的多核苷酸序列如SEQ ID NO.1~14所示。
10.如权利要求9所述的构建体,其特征在于,所述的表达载体为pCMV表达载体、pSV2表达载体或pGL3表达载体中的一种。
11.一种表达***,其特征在于,所述表达***为宿主细胞,所述宿主细胞含有权利要求9所述的构建体,或者所述宿主细胞的基因组中整合有权利要求8所述的分离的多核苷酸。
12.如权利要求11所述的表达***,其特征在于,所述宿主细胞选自小鼠细胞或人细胞。
13.如权利要求11所述的表达***,其特征在于,所述宿主细胞选自小鼠脑神经瘤细胞、人胚胎肾细胞、人***细胞、人结肠癌细胞、人骨肉瘤细胞。
14.一种碱基编辑工具,其特征在于,包括权利要求1所述的融合蛋白和sgRNA。
15.权利要求14所述的碱基编辑工具在真核生物的基因编辑中的用途。
16.如权利要求15所述的碱基编辑工具在真核生物的基因编辑中的用途,其特征在于,所述基因编辑为靶点区域内sgRNA 5’端4-7位的C-to-T的碱基编辑。
CN201911075141.9A 2019-11-06 2019-11-06 一种胞嘧啶碱基编辑工具及其用途 Active CN110734900B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911075141.9A CN110734900B (zh) 2019-11-06 2019-11-06 一种胞嘧啶碱基编辑工具及其用途

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911075141.9A CN110734900B (zh) 2019-11-06 2019-11-06 一种胞嘧啶碱基编辑工具及其用途

Publications (2)

Publication Number Publication Date
CN110734900A true CN110734900A (zh) 2020-01-31
CN110734900B CN110734900B (zh) 2022-09-30

Family

ID=69272245

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911075141.9A Active CN110734900B (zh) 2019-11-06 2019-11-06 一种胞嘧啶碱基编辑工具及其用途

Country Status (1)

Country Link
CN (1) CN110734900B (zh)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113249362A (zh) * 2020-02-07 2021-08-13 辉大(上海)生物科技有限公司 经改造的胞嘧啶碱基编辑器及其应用
CN114058607A (zh) * 2020-07-31 2022-02-18 上海科技大学 一种用于c到u碱基编辑的融合蛋白及其制备方法和应用
CN114561392A (zh) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 一种基于碱基编辑技术关闭靶基因清除HBV e抗原的方法
CN114561429A (zh) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 一种基于碱基编辑atg起始密码子抑制hbv表面抗原的治疗方法
CN114606265A (zh) * 2022-04-07 2022-06-10 吉林大学 一种能够实现单个aav病毒包被的迷你碱基编辑器
CN116555237A (zh) * 2022-03-08 2023-08-08 中国科学院遗传与发育生物学研究所 胞嘧啶脱氨酶及其在碱基编辑中的用途

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090269831A1 (en) * 2008-02-07 2009-10-29 Harris Reuben S Modified cytosine deaminases
CN102482639A (zh) * 2009-04-03 2012-05-30 医学研究会 活化诱导胞苷脱氨酶(aid)突变体及使用方法
US20160022737A1 (en) * 2014-07-25 2016-01-28 Sangamo Biosciences, Inc. Gene editing for hiv gene therapy
CN108513575A (zh) * 2015-10-23 2018-09-07 哈佛大学的校长及成员们 核碱基编辑器及其用途

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090269831A1 (en) * 2008-02-07 2009-10-29 Harris Reuben S Modified cytosine deaminases
CN102482639A (zh) * 2009-04-03 2012-05-30 医学研究会 活化诱导胞苷脱氨酶(aid)突变体及使用方法
US20160022737A1 (en) * 2014-07-25 2016-01-28 Sangamo Biosciences, Inc. Gene editing for hiv gene therapy
WO2016014837A1 (en) * 2014-07-25 2016-01-28 Sangamo Biosciences, Inc. Gene editing for hiv gene therapy
CN108513575A (zh) * 2015-10-23 2018-09-07 哈佛大学的校长及成员们 核碱基编辑器及其用途

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIAO WANG ET AL.: "Efficient base editing in methylated regions with a human APOBEC3A-Cas9 fusion", 《NATURE BIOTECHNOLOGY》 *
赵亚伟等: "碱基编辑器的开发及其在细菌基因组编辑中的应用", 《微生物学通报》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113249362A (zh) * 2020-02-07 2021-08-13 辉大(上海)生物科技有限公司 经改造的胞嘧啶碱基编辑器及其应用
CN113249362B (zh) * 2020-02-07 2023-04-14 辉大(上海)生物科技有限公司 经改造的胞嘧啶碱基编辑器及其应用
CN114058607A (zh) * 2020-07-31 2022-02-18 上海科技大学 一种用于c到u碱基编辑的融合蛋白及其制备方法和应用
CN114058607B (zh) * 2020-07-31 2024-02-27 上海科技大学 一种用于c到u碱基编辑的融合蛋白及其制备方法和应用
CN116555237A (zh) * 2022-03-08 2023-08-08 中国科学院遗传与发育生物学研究所 胞嘧啶脱氨酶及其在碱基编辑中的用途
CN114561392A (zh) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 一种基于碱基编辑技术关闭靶基因清除HBV e抗原的方法
CN114561429A (zh) * 2022-03-22 2022-05-31 绍兴市妇幼保健院 一种基于碱基编辑atg起始密码子抑制hbv表面抗原的治疗方法
CN114606265A (zh) * 2022-04-07 2022-06-10 吉林大学 一种能够实现单个aav病毒包被的迷你碱基编辑器
CN114606265B (zh) * 2022-04-07 2024-01-30 吉林大学 一种能够实现单个aav病毒包被的迷你碱基编辑器

Also Published As

Publication number Publication date
CN110734900B (zh) 2022-09-30

Similar Documents

Publication Publication Date Title
CN110734900B (zh) 一种胞嘧啶碱基编辑工具及其用途
RU2763170C2 (ru) Производство олигосахаридов человеческого молока в микроорганизмах-хозяевах с модифицированным импортом/экспортом
KR102147005B1 (ko) Fad2 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질
KR20220141332A (ko) 홍역-벡터화된 covid-19 면역원성 조성물 및 백신
AU2020264412B2 (en) Dna-binding protein using ppr motif, and use thereof
US20030167538A1 (en) Use of the maize x112 mutant ahas 2 gene and imidazolinone herbicides for selection of transgenic monocots, maize, rice and wheat plants resistant to the imidazolinone herbicides
US20040013648A1 (en) Vector system
KR102494564B1 (ko) 말라리아 백신
KR20190120287A (ko) 게놈 편집 시스템 및 방법
CN112204147A (zh) 基于Cpf1的植物转录调控***
CN107002095A (zh) 用于治疗溶酶体贮积症的腺伴随病毒载体
KR20210105382A (ko) 단백질을 코딩하는 rna
CN101827938A (zh) 涉及rt1基因、相关的构建体和方法的具有改变的根构造的植物
CN113604469B (zh) 基于CRISPR/CasRx的基因编辑方法及其应用
CN110305901A (zh) 一种基于人tlr4基因启动子区的双荧光素酶报告基因载体及其构建方法与应用
JP2024037797A (ja) がんを処置するための感染性核酸の使用
CN101868545A (zh) 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法
CN112626035A (zh) 一种新冠肺炎疫苗以及疫苗套件
CN111378626B (zh) 一种cho细胞系、构建方法、重组蛋白表达***、应用
US6730481B2 (en) Primers-attached vector elongation (PAVE): a 5′-directed cDNA cloning strategy
KR20210005167A (ko) 리소좀 축적병을 완화시키기 위한 렌티벡터-형질도입된 t-rapa 세포의 용도
CN113621650B (zh) 一种高效丝素重链启动子分泌表达***的建立与应用
CN113005092A (zh) 一种敲除pd1的靶向lmp1的car-t细胞的制备方法和应用
JPH1175859A (ja) アポト−シス関連遺伝子発現性のウイルスベクター系
CN114836473A (zh) 用于构建筛选药物活性的细胞株模型的慢病毒载体与应用

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant