CN111549062A - 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法 - Google Patents

家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法 Download PDF

Info

Publication number
CN111549062A
CN111549062A CN202010379321.2A CN202010379321A CN111549062A CN 111549062 A CN111549062 A CN 111549062A CN 202010379321 A CN202010379321 A CN 202010379321A CN 111549062 A CN111549062 A CN 111549062A
Authority
CN
China
Prior art keywords
vector
crispr
cas9
zeocin
ser1pa
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010379321.2A
Other languages
English (en)
Inventor
马三垣
常珈菘
夏庆友
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southwest University
Original Assignee
Southwest University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southwest University filed Critical Southwest University
Priority to CN202010379321.2A priority Critical patent/CN111549062A/zh
Publication of CN111549062A publication Critical patent/CN111549062A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/04Libraries containing only organic compounds
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Mycology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明涉及家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法,包含用于递送CRISPR/Cas9***的递送载体***piggyBac转座子载体***和家蚕全部编码蛋白的基因的打靶位点的sgRNA序列共同组成的载体文库pB‑CRISPR‑library。本发明基于piggyBac转座子***的强大的承载容量,可以将CRISPR/Cas9***的两个组成原件Cas9蛋白表达框和sgRNA表达框整合到一个载体上,同时该载体还包含筛选标记Zeocin抗性基因表达框,共同组成了一个用于家蚕的CRISPR/Cas9敲除的all‑in‑one载体pB‑CRISPR。

Description

家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建 方法
技术领域
本发明属于真核生物基因敲除技术领域,涉及家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法。
背景技术
家蚕不仅是重要的经济昆虫,也是鳞翅目模式生物。家蚕是最早完成全基因组测序的生物之一,家蚕基因组计划完成后,家蚕功能基因组研究就成为了家蚕研究领域的重要方向,目前已经建立了转基因、RNAi、基因编辑等遗传操作技术用于研究家蚕功能基因,但是,面对数以万计的家蚕编码蛋白基因的功能,传统的正向遗传学耗时耗力,迫切需要一种高通量研究手段来研究数量巨大的家蚕功能基因。
高通量RNAi是最早用于研究真核生物功能基因的高通量研究方法(PMID:25145850),已经在人、鼠等物种取得了很多研究成果,但是RNAi天然存在的脱靶效应、无法完全敲除基因功能而只能敲低功能基因的mRNA表达水平等缺点,其高通量筛选研究受到限制。CRISPR/Cas9***是近年来开发出来的基因敲除技术,已经在包括人、鼠、果蝇、拟南芥、水稻等模式生物钟取得了显著成果,目前CRISPR/Cas9***在家蚕中的应用也已经取得了成功。
CRISPR/Cas9***是由两部分元件组成,用于执行核酸内切酶功能的Cas9蛋白和用于执行引导Cas9蛋白到打靶位点的功能的sgRNA。基于其设计和构建的简便性,CRISPR/Cas9***不仅能够实现单个基因的敲除,还可以非常方便地设计并构建用于全基因组敲除的sgRNA文库,从而实现全基因组编辑,目前,在真核生物中主要是运用慢病毒载体***来递送CRISPR/Cas9全基因组敲除文库。但是,慢病毒***有着固有的缺点,一、慢病毒载体承载容量有限,一般只能够携带几kbp的外源基因,而且随着***片段长度增加病毒滴度急剧降低,因此,慢病毒介导的CRISPR/Cas9基因敲除***一般是先将Cas9表达框整合到宿主细胞的基因组上,然后再将sgRNA表达框整合到宿主细胞的基因组上,增加了构建CRISPR/Cas9基因敲除文库的周期、成本和难度;二、无法广泛应用于多种类型真核生物,主要应用在哺乳动物,慢病毒***无法在昆虫等物种中递送CRISPR/Cas9敲除文库。
piggyBac转座子是最初发现于鳞翅目昆虫,是真核生物的第二类转座子,以“剪切-黏贴”的模式来实现转座。piggyBac***的宿主范围广泛,从低昆虫哺乳动物均可实现高效转座,piggyBac转座子可以携带的外源基因片段极大,可达207kb。应用piggyBac***可以将Cas9蛋白表达框和sgRNA表达框构建到同一个piggyBac转座载体上,只需要一步转染就可以CRISPR***整合到宿主细胞的基因组上,极大缩短实验周期,降低实验成本和操作难度。由于piggyBac转座子是以“剪切-黏贴”的方式来实现来实现转座,因此可以通过控制转座子浓度等方式来控制piggyBac转座子***携带的外源基因整合到宿主细胞的拷贝数。
在家蚕中还没有任何高通量研究功能基因组的技术手段,构建家蚕基于piggyBac***和CRISPR/Cas9***的全基因组敲除的载体文库非常必要。
发明内容
有鉴于此,本发明的目的在于提供一种家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法。
为达到上述目的,本发明提供如下技术方案:
1、家蚕基于CRISPR/Cas9***的全基因组敲除载体文库的构建方法,具体步骤如下:
(1)构建piggyBac转座子***介导的真核生物CRISPR/Cas9敲除载体,命名为pB-CRISPR,其核苷酸序列如SEQ ID NO.1所示,将其作为骨架载体;
(2)设计敲除家蚕全部基因的sgRNA,其核苷酸序列是位于基因区符合NNNNNNNNNNNNNNNNNNNNNGG特征的序列;
(3)将步骤(2)的sgRNA整合到步骤(1)的骨架载体上,构建得到所述载体文库。
作为优选的技术方案之一,步骤(1)中,载体pB-CRISPR包括Hr3 CQ Enhancer-Hsp70启动子启动的spCas9蛋白编码序列,IE2启动子启动的Zeocin抗性蛋白编码序列,U6启动子启动的sgRNA表达框和piggyBac转座子的转座臂。
作为优选的技术方案之一,步骤(1)的具体方法是:
(1-1)合成包含Zeocin抗性基因表达框的载体PUC57-IE2-Zeocin-Ser1PA;
(1-2)将载体PUC57-IE2-Zeocin-Ser1PA上的Zeocin抗性基因表达框IE2-Zeocin-Ser1PA表达框连接到piggyBac转座子基础载体piggyBacModify上,构建成中间载体pB-Modified{IE2-Zeocin-Ser1PA};
(1-3)将hr3-hsp70-Cas9-sv40表达框从载体pUC57-hr3-hsp70-Cas9-sv40上扩增出来,然后用无缝克隆的方法连接到pB-Modified{IE2-Zeocin-Ser1PA}上,构建成中间载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40};
(1-4)将U6-gRNA从载体pUC57-U6-gRNA扩增出来,用酶切连接的方法连接到载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40}的AscI/NheI位点,构建成真核生物基因敲除基础载体pB-Modified{IE2-Zeocin-Ser1PA}{U6-gRNA}{hr3-hsp70-Cas9-SV40},命名为pB-CRISPR;
其中,载体PUC57-IE2-Zeocin-Ser1PA,核苷酸序列如SEQ ID NO.2所示;
piggyBac转座子基础载体piggyBacModify,核苷酸序列如SEQ ID NO.3所示;
中间载体pB-Modified{IE2-Zeocin-Ser1PA},核苷酸序列如SEQ ID NO.4所示;
载体pUC57-hr3-hsp70-Cas9-sv40,核苷酸序列如SEQ ID NO.5所示;
载体PUC57-Hr3-Hsp70-Cas9-SV40,由pUC57-hA4-Cas9(PMID:24671069,核苷酸序列如SEQ ID NO.6所示)将启动子A4更换为Hsp70得到;
中间载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40},核苷酸序列如SEQ ID NO.7所示;
载体pUC57-U6-gRNA,核苷酸序列如SEQ ID NO.8所示。
载体图谱如图1所示。
作为优选的技术方案之一,步骤(1)中,pB-CRISPR包含家蚕U6启动子和sgRNA的骨架,它们的核苷酸序列分别如SEQ ID NO.9和SEQ ID NO.10所示。
作为进一步优选的技术方案之一,基于家蚕U6启动子在第一个核苷酸为鸟嘌呤核苷酸“G”时的启动效率最高,全部sgRNA序列的5’端添加一个鸟嘌呤核苷酸“G”。
作为优选的技术方案之一,步骤(2)的具体方法是:基于spCas9作用规律,设计家蚕全部编码蛋白的基因的打靶位点,每个基因设计约6个打靶位点,总计94000个。
作为进一步优选的技术方案之一,所述打靶位点包括23个核苷酸,具有如下规律:
5’-NNNNNNNNNNNNNNNNNNNN-NGG-3’,N表示碱基A、T、G或C,所有打靶位点尽量在CDS序列的前半部分,靠近PAM区的种子序列部分12bp核苷酸不能在基因组上有重复区域。
2、利用上述方法构建得到的家蚕基于CRISPR/Cas9***的全基因组敲除载体文库。
3、上述载体文库在家蚕基因组突变中的应用。
本发明的有益效果在于:
本发明包含用于递送CRISPR/Cas9***的递送载体***piggyBac转座子载体***和家蚕全部编码蛋白的基因的打靶位点的sgRNA序列共同组成的载体文库pB-CRISPR-library。
本发明基于piggyBac转座子***的强大的承载容量,可以将CRISPR/Cas9***的两个组成原件Cas9蛋白表达框和sgRNA表达框整合到一个载体上,同时该载体还包含筛选标记Zeocin抗性基因表达框,共同组成了一个用于家蚕的CRISPR/Cas9敲除的all-in-one载体pB-CRISPR。另外,基于piggyBac转座子***能够在家蚕中实现转座,本发明能够在家蚕中实现全基因组敲除。
附图说明
为了使本发明的目的、技术方案和有益效果更加清楚,本发明提供如下附图进行说明:
图1为pB-CRISPR载体图谱,piggyBacL/piggyBacR,piggyBac转座臂;IE2,IE2启动子;Zeocin,Zeocin抗性基因;Ser1PA,家蚕丝胶1(Ser1)基因polyA;U6,家蚕U6启动子;Hr3-Hsp70,Hr3增强子和Hsp70启动子;spCas9,SpCas9蛋白;SV40PA,SV40 polyA。
图2为家蚕基于CRISPR/Cas9***的全基因组敲除的载体文库的建库流程。
图3为基于spCas9作用规律的家蚕全部编码蛋白的基因的打靶位点。表示设计不同数量sgRNA的基因的数量。
图4为筛选家蚕全部编码蛋白的基因的打靶位点,筛选标准是:1、所有打靶位点尽量在CDS序列的前半部分,2、靠近PAM区的种子序列部分12bp核苷酸不能在基因组上有重复区域。绝大多数蛋白的打靶位点为6个,总共设计94000个打靶位点。
具体实施方式
下面将结合附图,对本发明的优选实施例进行详细的描述。
实施例:
本实施例中所用到的家蚕胚胎细胞系(The Bombyx mori embryonic cell line,BmE)为生物实验中常用细胞系(PMID:17570024)。
1、本发明的目的是提供家蚕基于CRISPR/Cas9***的全基因组敲除的载体文库,建库流程如图2所示。
2、为了实现家蚕CRISPR/Cas9***的递送,本发明提供了一种基于piggyBac转座子***的递送载体pB-CRISPR,其核苷酸序列如SEQ ID NO.1所示,载体图谱见图1。具体如下:
以piggyBac转座子***基础载体piggyBacModify(由金士瑞公司合成)为初始载体,构建一个piggyBac转座子***介导的家蚕CRISPR/Cas9基因敲除载体骨架,主要包含piggyBac转座臂(包含两个piggyBac转座子末端反向重复序列,inverted terminalrepeat,ITR)、筛选标记Zeocin抗性基因表达框、Cas9蛋白表达框、U6-gRNA表达框,命名为pB-CRISPR。
为了构建家蚕基于CRISPR/Cas9***的全基因组敲除的载体文库,本发明提供的递送载体pB-CRISPR包含家蚕U6启动子和sgRNA的骨架,序列如SEQ ID NO.9和SEQ IDNO.10所示。
3、构建两个载体用于后续建库实验。一个是含有U6启动子核苷酸序列的载体,命名为T-U6。其构建方法具体是设计一对引物:
U6-F,5’-AGCTGTCCAAGGAATGCGT-3’,如SEQ ID NO.11所示;
U6-R,5’-ATATACAAAATATCGTGCTCTACAAGT-3’,如SEQ ID NO.12所示;
以pB-CRISPR为模板扩增U6启动子序列,然后连接到T载体上,sanger测序验证正确性。第二个载体是含有sgRNA scaffold核苷酸序列的载体,命名为T-sgRNA scaffold。具体构建方法是设计一对引物:
sgRNA-scaffold-F,5’-GTTTTAGAGCTAGAAATAGCAAGTTAAA-3’,如SEQ ID NO.13所示;
sgRNA-scaffold-R,5’-GCTAAATCGATAAAGATCTTTCATT-3’,如SEQ ID NO.14所示;
以pB-CRISPR为模板扩增sgRNA scaffold序列,然后连接到T载体上,sanger测序验证正确性。PCR扩增酶是高保真热启动酶,反应总体系50μL,包括引物各1μL,模板1μL,2×酶预混液25μL,水22μL,反应条件如下:98℃预变性4min;98℃变性10s,55℃退火5s,72℃延伸5s;35个循环;72℃延伸10min;12℃保存。T载体选择全式金公司的Trans-T1载体,按照公式说明书操作,sanger测序由华大公司完成。
4、为了构建家蚕基于CRISPR/Cas9***的全基因组敲除的载体文库,本发明提供了基于spCas9作用规律的家蚕全部编码蛋白的基因的打靶位点的DNA序列,其特征在于,绝大多数蛋白的打靶位点为6个,包括23个核苷酸,其核苷酸具有如下规律:
5’-NNNNNNNNNNNNNNNNNNNN-NGG-3’,所有打靶位点尽量在CDS序列的前半部分,靠近PAM区的种子序列部分12bp核苷酸不能在基因组上有重复区域,总共设计了94000个打靶位点。然后根据U6启动子的核苷酸序列和sgRNA scaffold的核苷酸序列,设计sgRNA的侧翼序列,其特征如下:5′-TACAA AATAT CGTGC TCTAC AAGTG NNNNN NNNNN NNNNN NNNNNGTTTT AGAGC TAGAA ATAGC AAGTT-3′,其中“N”为sgRNA序列,用基因芯片的方式合成全部sgRNA序列(包含侧翼区),命名为单链寡核苷酸库(The pool of sgRNAoligonucleotides),核苷酸序列分别如SEQ ID NO.15。如图3和图4所示。
5、为了构建家蚕基于CRISPR/Cas9***的全基因组敲除的载体文库,基于家蚕U6启动子在第一个核苷酸为鸟嘌呤核苷酸“G”时的启动效率最高,本发明的载体库的全部sgRNA序列的5’端添加一个鸟嘌呤核苷酸“G”。
6、根据骨架载体pB-CRISPR载体序列、T-U6载体序列、T-sgRNA scaffold载体序列和The pool of sgRNA oligonucleotides序列,设计用于做搭桥PCR的引物,引物序列如下:
>KU-1R,5’-TCGATGATGATGATCAATTGTGGCGCGCCAAGCTGTCCAAGGAATGCGT-3’,如SEQID NO.16所示;
>DG-1R,5’-TTGTAGAGCACGATATTTTGTATAT-3’,如SEQ ID NO.17所示;
>DP-2F,5’-TCAATAGTTTAGTTTTTTTAGGTATATATACAAAATATCGTGCTCTACAA-3’,如SEQID NO.18所示;
>DP-2R,5’-AAGTTGATAACGGACTAGCCTTATTTTAACTTGCTATTTCTAGCTCTAA-3’,如SEQID NO.19所示;
>DG-3F,5’-TTAGAGCTAGAAATAGCAAGTTAAA-3’,如SEQ ID NO.20所示;
>KU-1F,5’-ACCGATCGATCCTAGGCGCTAGCTAATGAAAGATCTTTATCGATTTAGC-3’如SEQID NO.21所示。
7、应用搭桥PCR方法,构建包含全部sgRNA库的U6-sgRNA文库片段。如图2所示。
1)以载体T-U6为模板,用引物KU-1R和DG-1R来扩增U6启动子片段,命名为DGP-1。PCR扩增酶是高保真热启动酶,反应总体系50μL,包括引物各1μL,模板1μL,2×酶预混液25μL,水22μL,反应条件如下:98℃预变性4min;98℃变性10s,55℃退火5s,72℃延伸5s;35个循环;72℃延伸10min;12℃保存,总共扩增30管。
2)以合成的单链寡核苷酸文库(The pool of sgRNA oligonucleotides)为模板用引物DP-2F和DP-2R来扩增sgRNA片段,命名为DGP-2。PCR扩增酶是高保真热启动酶,反应总体系50μL,包括引物各1μL,模板1μL,2×酶预混液25μL,水22μL,反应条件如下:98℃预变性4min;98℃变性10s,55℃退火5s,72℃延伸5s;35个循环;72℃延伸10min;12℃保存,总共扩增30管。
3)以载体T-sgRNA scaffold为模板,用引物DG-3F和KU-1F来扩增sgRNA scaffold片段,命名为DGP-3。PCR扩增酶是高保真热启动酶,反应总体系50μL,包括引物各1μL,模板1μL,2×酶预混液25μL,水22μL,反应条件如下:98℃预变性4min;98℃变性10s,55℃退火5s,72℃延伸5s;35个循环;72℃延伸10min;12℃保存,总共扩增30管。
4)以PCR片段DGP-1/DGP-2/DGP-3混合物为模板,用引物KU-1F和KU-1R来来进行搭桥PCR实验,扩增出包含sgRNA库的U6-sgRNA文库片段,命名为DGP-4。其中,DGP-1/DGP-2/DGP-3按照摩尔比1:1:1混合。PCR扩增酶是高保真热启动酶,反应总体系50μL,包括引物各1μL,模板1μL,2×酶预混液25μL,水22μL,反应条件如下:98℃预变性4min;98℃变性10s,55℃退火5s,72℃延伸5s;35个循环;72℃延伸10min;12℃保存,总共扩增50管。
8、将载体pB-CRISPR用AscI/NheI双酶切,回收12187bp的骨架用于下一步建库实验。其中,酶切条件为50μL体系,包含1μg载体,5μL的CutSmart缓冲液,AscI和NheI各1μL,酶切条件为37℃过夜。将片段DGP-4用AscI/NheI双酶切,其中,酶切条件为50μL体系,包含1μg的DGP-4片段,AscI和NheI各1μL,酶切条件为37℃过夜。
9、将步骤6描述的酶切完成的载体骨架和酶切完成的DGP-4片段按照摩尔比1:10混合后用DNA连接酶连接,然后要过柱方式纯化浓缩,用双蒸水溶解,溶解体积为30μL;然后通过电转方式建库,抽提质粒,完成建库,建库覆盖度大于100×。DNA连接酶为T4 DNA连接酶,连接总体系为50μL,其中骨架和片段按照摩尔比1:10添加,总质量为2μg,T4连接酶2.5μL,连接条件是16℃过夜连接。电转感受态为E.coli HST08 Premium Electro-Cells,电转仪器为Gene Pulser Xcell,电转50管。具体电转步骤按照公式提供说明书操作。
用梯度稀释的方法统计电转后单克隆的数量,保证单克隆数量能够满足建库最低要求,即单克隆数量>100×。
10、质粒抽提采用试剂盒抽提的方法,将抽提好的质粒全部混匀后取部分用设计好的引物扩增下来载体文库的sgRNA片段,建库后执行高通量测序,分析sgRNA丰度,检测文库质量。测序引物核苷酸序列如下:
正向引物>gD-F,5-NNNNNNNNNNNNTAAATCACGCTTTCAATA,N表示碱基A、T、G或C,如SEQ ID NO.22所示;
反向引物>gD-R,5-NNNNNNNNNNNNCGACTCGGTGCCACTTT,N表示碱基A、T、G或C,如SEQID NO.23所示。
载体文库质量分析如图3所示。
最后说明的是,以上优选实施例仅用以说明本发明的技术方案而非限制,尽管通过上述优选实施例已经对本发明进行了详细的描述,但本领域技术人员应当理解,可以在形式上和细节上对其作出各种各样的改变,而不偏离本发明权利要求书所限定的范围。
序列表
<110> 西南大学
<120> 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法
<130> 2020
<160> 23
<170> SIPOSequenceListing 1.0
<210> 1
<211> 13408
<212> DNA
<213> Artificial
<400> 1
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt ggagatcggt acttcgcgaa 420
tgcgtcgaga taagagggtt aaaaaatata ttttacgcac catatacgca tcgggttgat 480
atcgttaata tggatcaatt tgaacagttg attaacgtgt ctctgctcaa gtctttgatc 540
aaaacgcaaa tcgacgaaaa tgtgtcggac aatatcaagt cgatgagcga aaaactaaaa 600
aggctagaat acgacaatct cacagacagc gttgagatat acggtattca cgacagcagg 660
ctgaataata aaaaaattag aaactattat ttaaccctag aaagataatc atattgtgac 720
gtacgttaaa gataatcatg cgtaaaattg acgcatgtgt tttatcggtc tgtatatcga 780
ggtttattta ttaatttgaa tagatattaa gttttattat atttacactt acatactaat 840
aataaattca acaaacaatt tatttatgtt tatttattta ttaaaaaaaa acaaaaactc 900
aaaatttctt ctataaagta acaaaacttt taaacattct ctcttttaca aaaataaact 960
tattttgtac tttaaaaaca gtcatgttgt attataaaat aagtaattag cttaacttat 1020
acataataga aacaaattat acttattagt cagtcagaaa caactttggc acatatcaat 1080
attatgctct cgacaaataa cttttttgca ttttttgcac gatgcatttg cctttcgcct 1140
tattttagag gggcagtaag tacagtaagt acgttttttc attactggct cttcagtact 1200
gtcatctgat gtaccaggca cttcatttgg caaaatatta gagatattat cgcgcaaata 1260
tctcttcaaa gtaggagctt ctaaacgctt acgcataaac gatgacgtca ggctcatgta 1320
aaggtttctc ataaattttt tgcgactttg aaccttttct cccttgctac tgacattatg 1380
gctgtatata ataaaagaat ttatgcaggc aatgtttatc attccgtaca ataatgccat 1440
aggccaccta ttcgtcctcc tactgcaggt catcacagaa cacatttggt ctagcgtgtc 1500
cactccgcct ttagtttgat tataatacat aaccatttgc ggtttaccgg tactttcgtt 1560
gatagaagca tcctcatcac aagatgataa taagtatacc atcttagctg gcttcggttt 1620
atatgagacg agagtaaggg gtccgtcaaa acaaaacatc gatgttccca ctggcctgga 1680
gcgactgttt ttcagtactt ccggtatctc gcgtttgttc ctgcaggatc atgatgataa 1740
acaatgtatg gtgctaatgt tgcttcaaca acaattctgt tgaactgtgt tttcatgttt 1800
gccaacaagc acctttatac tcggtggcct ccccaccacc aacttttttg cactgcaaaa 1860
aaacacgctt ttgcacgcgg gcccatacat agtacaaact ctacgtttcg tagactattt 1920
tacataaata gtctacaccg ttgtatacgc tccaaataca ctaccacaca ttgaaccttt 1980
ttgcagtgca aaaaagtacg tgtcggcagt cacgtaggcc ggccttatcg ggtcgcgtcc 2040
tgtcacgtac gaatcacatt atcggaccgg acgagtgttg tcttatcgtg acaggacgcc 2100
agcttcctgt gttgctaacc gcagccggac gcaactcctt atcggaacag gacgcgcctc 2160
catatcagcc gcgcgttatc tcatgcgcgt gaccggacac gaggcgcccg tcccgcttat 2220
cgcgcctata aatacagccc gcaacgatct ggtaaacaca gttgaacagc atctgttcga 2280
aatggccaag ttgaccagtg ccgttccggt gctcaccgcg cgcgacgtcg ccggagcggt 2340
cgagttctgg accgaccggc tcgggttctc ccgggacttc gtggaggacg acttcgccgg 2400
tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc caggaccagg tggtgccgga 2460
caacaccctg gcctgggtgt gggtgcgcgg cctggacgag ctgtacgccg agtggtcgga 2520
ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atgaccgaga tcggcgagca 2580
gccgtggggg cgggagttcg ccctgcgcga cccggccggc aactgcgtgc acttcgtggc 2640
cgaggagcag gactaaagct ttacaactaa acacgacttg gagtattcct tgtagtgttt 2700
aagattttaa atcttactta atgacttcga acgattttaa cgataacttt ctctttgttt 2760
aactttaatc agcatacata aaaagccccg gttttgtatc gggaagaaaa aaaatgtaat 2820
tgtgttgcct agataataaa cgtattatca aagtgtgtgg ttttccttta ccaaagaccc 2880
ctttaagatg ggcctaatgg gcttaagtcg agtcctttcc gatgtgttaa atacacattt 2940
attacactga tgcgtcgaat gtacactttt aataggatag ctccactaaa aattatttta 3000
tttatttaat ttgttgcacc aaaactgata cattgacgaa acgcgtatgc tagcaatgaa 3060
agatctttat cgatttagcc aaaagcaaaa gcttgaccaa aaataggata atatttgttt 3120
ttttatttaa aaaaataaac aattttttat acataaactg tttatctagt attaatattt 3180
atgttaacat ttgataacga atcaaatata tttttaaact aattaaaaaa tccgatgtat 3240
gttataaaat tgttctagaa aaaaagcacc gactcggtgc cactttttca agttgataac 3300
ggactagcct tattttaact tgctatttct agctctaaaa cactggcagg tgtcttgacg 3360
agttcttctg aattattaac gcttacaatt tcctgatgcg gtattttctc cttacgcatc 3420
tgtgcggtat ttcacaccgc atcaggtggc acttttcggg gaaatgtgcg cggaacccct 3480
atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagatt atcaaaaagg 3540
atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 3600
gagtaaactt ggtctgacag ttaccaatgc cacctgccat cacttgtaga gcacgatatt 3660
ttgtatatat acctaaaaaa actaaactat tgaaagcgtg atttacaaca acactcgact 3720
ttacaaagat tattcaaaaa gagcaaaaac tcttaacata ttctattaaa gatatataat 3780
ataattaaaa cgaaattaaa taataacaat aaaaccttta gaatttgtaa taaaatccat 3840
aaaaacaaat gaaaacagtt atggtttgta cagcgccatc tgttattact ttgacaaaat 3900
cactatgact atctgacctt gtcttacacg ttaacaattc ttattctgtc cttatctata 3960
agccaagtac caagcttaaa ttcgtatggc ttatagttga cgatttttaa attctcaagg 4020
tatgtactta tttaatatta ataagtacta attgttaaaa tcatctaaaa caattcagtg 4080
atttacaaca atgtgtacta cataacctaa tacttataaa tttattaaac tgtattgatt 4140
cttttaggtc aatcatcatg actttaggag acttggtgtc tcaggaaaaa ggaacgcaaa 4200
aagattgagg cgtttgaaat gtattgctgg agaaagctgc tacgcattcc ttggacagct 4260
tggcgcgccc agcgtcgtga aaagaggcaa tgacaaatac aaaacgacgt atgagcagac 4320
ccgtcgccaa gacgggtcta cctctaagat gatgtcattt gttttttaaa actaactcgc 4380
tttacgagta gaattctacg tgtaaaacat aatcaagaga tgatgtcatt tgtttttcaa 4440
aaccaaactc gctttacgag tagaattcta cgtgtaaaac acaatcaaaa gatgatgtca 4500
ttcgtttttc aaaaccgaat ttaagaaatg atgtcatttg tttttcaaaa ccaaactcgc 4560
tttacgagca gaattctacg tgtaaaacac aatcaagaga tgatgtcatt tgtttttcaa 4620
aactgaatga tgtcatttgt ttttcaaaac taaacttgct ttgcgagtag aattctacgt 4680
gtaaaacaca gtcaagagat gatgtcattt gtttttcaaa actgaaccgg ctttacgagt 4740
agaattctac ttgtaaaaca taatcaagag atgatgtcat ttgtttttca aaactgaact 4800
ggctttacga gtagaattct acgtgtaaaa cataatcaag agatgatgtc atcattaaac 4860
tgatgtcatt ttatacacga ttgttaacat gtttaataat gactaatttg tttttccaaa 4920
ttaaactcgc tttacgagta gaattctact tgtaacgcac gattaagtat gaatcataag 4980
ctgatgtcat ttgttttcga cataaaatgt ttatacaatg gaatcttctt gtaaattatc 5040
caaataatat aatttatccg attctacgtt acatttaaat tcgttgttat cgtacaattc 5100
ttcaggacac gccatgtatt ggtcattttt agcgtgcaac caacgattgt atttgacgcc 5160
gtcgttggat tgcgtgttca ggttggcgta cacgtgactg ggcacggctt ctttttccat 5220
gggacgtcga ccgagaaatt tctctggccg ttattcgtta ttctctcttt tctttttggg 5280
tctctccctc tctgcactaa tgctctctca ctctgtcaca cagtaaacgg catactgctc 5340
tcgttggttc gagagagcgc gcctcgaatg ttcgcgaaaa gagcgccgga gtataaatag 5400
aggcgcttcg tctacggagc gacaattcaa ttcaaacaag caaagtgaac acgtcgctaa 5460
gcgaaagcta agcaaataaa caagcgcagc tgaacaagct aaacaatctg cagtaaagtg 5520
caagttaaag tgaatcaatt aaaagtaacc agcaaccaag taaatcaact gcaactactg 5580
aaatctgcca agaagtaatt attgaataca agaagagaac tctgggggat ctctagtcca 5640
gtgtggtgga attcgccatg gccccaaaga aaaagagaaa ggttgattac aaagaccacg 5700
acggagacta caaagaccac gacattgatt ataaagatga tgatgataaa ggaacgatgg 5760
acaaaaagta tagcatcggt ctggatattg gaactaactc cgtcggctgg gctgtaatca 5820
ccgacgaata caaggtcccg tcaaaaaagt tcaaggtatt gggtaacaca gatcgtcact 5880
ctatcaaaaa gaatctcatt ggagctctgt tgttcgacag cggcgaaaca gctgaggcca 5940
ctagactgaa gcgcaccgcc agacgccgtt acacgaggag aaagaacaga atctgctact 6000
tgcaagaaat attctcaaac gagatggcca aagtggacga ttcgttcttt cataggttag 6060
aagagagttt ccttgttgaa gaggataaaa agcacgaaag acatccgata tttggaaaca 6120
tcgtggacga agttgcttat cacgagaagt accccacgat ctatcatctg cgtaaaaagt 6180
tggtggactc gacagataag gccgacctca ggttaatata ccttgcactg gcgcacatga 6240
tcaaattcag aggccatttt ctgattgaag gtgacctgaa ccctgacaat agtgatgtgg 6300
acaaactctt cattcaatta gttcagacct acaatcaact gtttgaagag aaccctatca 6360
acgcttcagg agttgacgct aaggccatcc ttagtgcgag actgagcaaa tcccgccgtc 6420
tcgaaaactt aatcgcacag ttgcctggag agaaaaagaa cggtttgttc ggaaatctca 6480
ttgcgttgtc actcggactc acgccaaact tcaagtctaa cttcgatttg gcagaagacg 6540
cgaaactgca actgagcaaa gacacatatg acgatgacct cgataacctc ttagctcaga 6600
tcggcgatca atacgccgac ttgttcctcg ctgccaaaaa tctgtcggac gctatacttc 6660
tgagtgatat cttgcgcgtc aacacagaaa ttactaaggc tcctctgtcg gccagtatga 6720
taaaacgcta tgacgaacac catcaggatt tgacattgct caaagccctc gtgcgtcaac 6780
agctcccaga aaagtacaag gagattttct ttgatcagtc caagaatggc tacgcaggtt 6840
atatagacgg tggagcgtcg caagaagagt tctacaagtt catcaagcca atattagaaa 6900
agatggacgg cacggaagag ttacttgtta agctgaatcg tgaggacctg ttgcgtaaac 6960
agaggacatt cgataacgga tcaattccgc accaaataca tcttggcgaa ctgcacgcta 7020
tcctcaggag acaagaggac ttctacccct ttttaaagga taaccgtgaa aagatcgaga 7080
aaatcctgac tttcaggatt ccttactatg tcggcccact ggctcgtggt aatagcaggt 7140
ttgcctggat gaccaggaag tccgaagaga caattactcc gtggaacttc gaagaggtgg 7200
ttgataaagg agcatcagcg cagtctttca tagaacgcat gacaaatttt gacaagaact 7260
taccgaatga gaaggtcctt cccaaacact cactcctcta cgaatacttc acagtataca 7320
acgagctcac taaagtcaag tacgtaaccg agggtatgcg caaacccgct ttcctgtctg 7380
gagagcagaa aaaggccatc gtggaccttc tgttcaagac aaaccgtaag gtcactgtaa 7440
agcaactcaa ggaagactac ttcaaaaaga tagagtgttt cgattcagtg gaaatctctg 7500
gcgttgagga cagatttaac gcttccttgg gtacttacca cgatttgctc aagatcatta 7560
aagataagga cttcctcgac aacgaagaga acgaagatat cttagaggac atagttctca 7620
cccttacgct gtttgaagat agagagatga ttgaagagcg cctgaagact tatgctcatt 7680
tgttcgatga caaagtcatg aagcaactga aacgccgtag gtacaccggc tggggtagat 7740
tatcgcgcaa acttattaat ggtataaggg acaagcagtc gggaaaaacg atattggact 7800
ttctcaagag tgatggtttc gccaacagaa attttatgca actcatacac gatgacagct 7860
taacattcaa ggaagatatc caaaaagcac aggtgtcggg acagggcgac agtttgcacg 7920
aacatattgc taacctcgcc ggctccccgg cgataaaaaa gggtatcctt cagactgtga 7980
aagtcgtaga tgaactggtg aaggttatgg gtcgtcataa acccgagaac atagttatcg 8040
aaatggctag ggagaatcaa acaactcaga agggacagaa aaactcaaga gaacgcatga 8100
agcgcattga agagggtatc aaagagcttg gcagtcaaat cctgaaggaa caccctgtcg 8160
agaacacgca acttcagaac gaaaaattgt acctctacta tctgcagaat ggtagagata 8220
tgtacgtaga ccaagaattg gatattaacc gcctctcaga ttacgacgtg gatcatatag 8280
ttccgcagtc attcttgaag gatgactcta tcgacaacaa agtcctcaca agatcagaca 8340
agaaccgcgg aaaatcagat aatgtaccct ctgaagaggt ggttaaaaag atgaaaaact 8400
actggagaca gttacttaac gctaagttga tcacgcaaag aaagttcgat aacctcacaa 8460
aggctgaacg cggcggttta agcgagcttg acaaggccgg tttcataaaa cgtcagttag 8520
tcgaaaccag gcaaattacg aaacacgtag cccaaatatt ggattcccgc atgaacacta 8580
aatacgatga aaatgacaag ctcatccgtg aggtcaaagt aattaccctg aaaagcaagt 8640
tggtgtccga cttcagaaag gatttccagt tctacaaagt tcgcgaaatc aacaactacc 8700
accatgcaca tgacgcttac ctgaacgcag tcgtaggcac tgcgttaatt aaaaagtacc 8760
ctaaactgga atctgagttc gtgtacggtg actataaagt gtacgatgtt agaaagatga 8820
tcgctaaaag cgaacaggag attggaaagg ctaccgccaa gtatttcttt tactccaaca 8880
tcatgaattt ctttaagacc gaaatcacgt tagcaaatgg cgagatacgt aaaaggccac 8940
ttatcgaaac aaacggagaa actggcgaga tagtgtggga caagggtaga gattttgcca 9000
ctgtccgcaa agtactgtcg atgccgcaag tgaatatcgt taaaaagacc gaagttcaaa 9060
cgggaggctt cagcaaagag tccatcctgc ccaagcgtaa cagtgataaa ttgatagcta 9120
ggaaaaagga ctgggatcct aaaaagtatg gtggattcga cagcccaact gtcgcatact 9180
ccgtattggt ggttgcgaaa gtcgaaaaag gaaagagcaa aaagctcaag tccgtaaaag 9240
agctgttggg cattaccata atggaaagat catctttcga gaagaatcct atcgattttc 9300
tggaagccaa gggatataaa gaggtcaaaa aggacctcat aatcaagtta ccaaaataca 9360
gtctgttcga attggagaac ggcagaaaac gcatgcttgc atcagcgggt gaactgcaaa 9420
agggaaatga gttagcactt ccttctaaat acgtcaactt cctgtatttg gcgtcacact 9480
acgaaaaact gaagggctct ccagaagata acgagcaaaa gcagttattt gtggaacagc 9540
acaaacatta ccttgacgaa attatagagc aaatctcgga gttcagtaag agagtgattt 9600
tggctgacgc caatcttgat aaagttctgt ctgcttacaa caagcaccgt gataaaccga 9660
ttagggaaca ggccgagaac atcatacatc tcttcacact cactaacctt ggtgcacccg 9720
cagcgttcaa atattttgac accacgatag atcgtaagag gtacaccagc acgaaagaag 9780
ttttggacgc gacactcatc catcaatcaa tcacgggcct gtacgagacc agaatcgacc 9840
tgtcccagct cggtggcgac tagcggccgc gactctagat cataatcagc catgcggccg 9900
cgactctaga ccacatttgt agaggtttta cttgctttaa aaaacctccc acacctcccc 9960
ctgaacctga aacataaaat gaatgcaatt gttgttgtta acttgtttat tgcagcttat 10020
aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 10080
cattctagtt gtggtttgtc caaactcatc aatgtatctt aaagcttatc gatacgcgta 10140
cctaggccgg ccgatctcgg atctgacaat gttcagtgca gagactcggc tacgcctcgt 10200
ggactttgaa gttgaccaac aatgtttatt cttacctcta atagtcctct gtggcaaggt 10260
caagattctg ttagaagcca atgaagaacc tggttgttca ataacatttt gttcgtctaa 10320
tatttcacta ccgcttgacg ttggctgcac ttcatgtacc tcatctataa acgcttcttc 10380
tgtatcgctc tggacgtcat cttcacttac gtgatctgat atttcactgt cagaatcctc 10440
accaacaagc tcgtcatcgc tttgcagaag agcagagagg atatgctcat cgtctaaaga 10500
actacccatt ttattatata ttagtcacga tatctataac aagaaaatat atatataata 10560
agttatcacg taagtagaac atgaaataac aatataatta tcgtatgagt taaatcttaa 10620
aagtcacgta aaagataatc atgcgtcatt ttgactcacg cggtcgttat agttcaaaat 10680
cagtgacact taccgcattg acaagcacgc ctcacgggag ctccaagcgg cgactgagat 10740
gtcctaaatg cacagcgacg gattcgcgct atttagaaag agagagcaat atttcaagaa 10800
tgcatgcgtc aattttacgc agactatctt tctagggtta aaaaagattt gcgctttact 10860
cgacctaaac tttaaacacg tcatagaatc ttcgtttgac aaaaaccaca ttgtggccaa 10920
gctgtgtgac gcgacgcgcg ctaaagaatg gcaaaccaag tcgcgcgagc gtcgactcta 10980
gaggatcccc gggtaccgag ctcgaattcg taatcatggt catagctgtt tcctgtgtga 11040
aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc 11100
tggggtgcct aatgagtgag ctaactcaca tcggatgccg ggaccgacga gtgcagaggc 11160
gtgcaagcga gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg 11220
ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg gggtgcctaa 11280
tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac 11340
ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt 11400
gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga 11460
gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca 11520
ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg 11580
ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 11640
cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc 11700
ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct 11760
tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc 11820
gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta 11880
tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 11940
gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag 12000
tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc tctgctgaag 12060
ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt 12120
agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa 12180
gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 12240
attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 12300
agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta 12360
atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc 12420
cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg 12480
ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga 12540
agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt 12600
tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt 12660
gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc 12720
caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc 12780
ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca 12840
gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag 12900
tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg 12960
tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa 13020
cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa 13080
cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga 13140
gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga 13200
atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg 13260
agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt 13320
ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt aacctataaa 13380
aataggcgta tcacgaggcc ctttcgtc 13408
<210> 2
<211> 1383
<212> DNA
<213> Artificial
<400> 2
atcgatgttc ccactggcct ggagcgactg tttttcagta cttccggtat ctcgcgtttg 60
ttcctgcagg atcatgatga taaacaatgt atggtgctaa tgttgcttca acaacaattc 120
tgttgaactg tgttttcatg tttgccaaca agcaccttta tactcggtgg cctccccacc 180
accaactttt ttgcactgca aaaaaacacg cttttgcacg cgggcccata catagtacaa 240
actctacgtt tcgtagacta ttttacataa atagtctaca ccgttgtata cgctccaaat 300
acactaccac acattgaacc tttttgcagt gcaaaaaagt acgtgtcggc agtcacgtag 360
gccggcctta tcgggtcgcg tcctgtcacg tacgaatcac attatcggac cggacgagtg 420
ttgtcttatc gtgacaggac gccagcttcc tgtgttgcta accgcagccg gacgcaactc 480
cttatcggaa caggacgcgc ctccatatca gccgcgcgtt atctcatgcg cgtgaccgga 540
cacgaggcgc ccgtcccgct tatcgcgcct ataaatacag cccgcaacga tctggtaaac 600
acagttgaac agcatctgtt cgaaatggcc aagttgacca gtgccgttcc ggtgctcacc 660
gcgcgcgacg tcgccggagc ggtcgagttc tggaccgacc ggctcgggtt ctcccgggac 720
ttcgtggagg acgacttcgc cggtgtggtc cgggacgacg tgaccctgtt catcagcgcg 780
gtccaggacc aggtggtgcc ggacaacacc ctggcctggg tgtgggtgcg cggcctggac 840
gagctgtacg ccgagtggtc ggaggtcgtg tccacgaact tccgggacgc ctccgggccg 900
gccatgaccg agatcggcga gcagccgtgg gggcgggagt tcgccctgcg cgacccggcc 960
ggcaactgcg tgcacttcgt ggccgaggag caggactaaa gctttacaac taaacacgac 1020
ttggagtatt ccttgtagtg tttaagattt taaatcttac ttaatgactt cgaacgattt 1080
taacgataac tttctctttg tttaacttta atcagcatac ataaaaagcc ccggttttgt 1140
atcgggaaga aaaaaaatgt aattgtgttg cctagataat aaacgtatta tcaaagtgtg 1200
tggttttcct ttaccaaaga cccctttaag atgggcctaa tgggcttaag tcgagtcctt 1260
tccgatgtgt taaatacaca tttattacac tgatgcgtcg aatgtacact tttaatagga 1320
tagctccact aaaaattatt ttatttattt aatttgttgc accaaaactg atacattgac 1380
gaa 1383
<210> 3
<211> 6291
<212> DNA
<213> Artificial
<400> 3
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt ggagatcggt acttcgcgaa 420
tgcgtcgaga taagagggtt aaaaaatata ttttacgcac catatacgca tcgggttgat 480
atcgttaata tggatcaatt tgaacagttg attaacgtgt ctctgctcaa gtctttgatc 540
aaaacgcaaa tcgacgaaaa tgtgtcggac aatatcaagt cgatgagcga aaaactaaaa 600
aggctagaat acgacaatct cacagacagc gttgagatat acggtattca cgacagcagg 660
ctgaataata aaaaaattag aaactattat ttaaccctag aaagataatc atattgtgac 720
gtacgttaaa gataatcatg cgtaaaattg acgcatgtgt tttatcggtc tgtatatcga 780
ggtttattta ttaatttgaa tagatattaa gttttattat atttacactt acatactaat 840
aataaattca acaaacaatt tatttatgtt tatttattta ttaaaaaaaa acaaaaactc 900
aaaatttctt ctataaagta acaaaacttt taaacattct ctcttttaca aaaataaact 960
tattttgtac tttaaaaaca gtcatgttgt attataaaat aagtaattag cttaacttat 1020
acataataga aacaaattat acttattagt cagtcagaaa caactttggc acatatcaat 1080
attatgctct cgacaaataa cttttttgca ttttttgcac gatgcatttg cctttcgcct 1140
tattttagag gggcagtaag tacagtaagt acgttttttc attactggct cttcagtact 1200
gtcatctgat gtaccaggca cttcatttgg caaaatatta gagatattat cgcgcaaata 1260
tctcttcaaa gtaggagctt ctaaacgctt acgcataaac gatgacgtca ggctcatgta 1320
aaggtttctc ataaattttt tgcgactttg aaccttttct cccttgctac tgacattatg 1380
gctgtatata ataaaagaat ttatgcaggc aatgtttatc attccgtaca ataatgccat 1440
aggccaccta ttcgtcctcc tactgcaggt catcacagaa cacatttggt ctagcgtgtc 1500
cactccgcct ttagtttgat tataatacat aaccatttgc ggtttaccgg tactttcgtt 1560
gatagaagca tcctcatcac aagatgataa taagtatacc atcttagctg gcttcggttt 1620
atatgagacg agagtaaggg gtccgtcaaa acaaaacatc gatgttccca ctggcctgga 1680
gcgactgttt ttcagtactt ccggtatctc gcgtttgttt gatcgcacgg ttcccacaat 1740
ggttaattcg agctcgcccg gggatctaat tcaattagag actaattcaa ttagagctaa 1800
ttcaattagg atccaagctt atcgatttcg aaccctcgac cgccggagta taaatagagg 1860
cgcttcgtct acggagcgac aattcaattc aaacaagcaa agtgaacacg tcgctaagcg 1920
aaagctaagc aaataaacaa gcgcagctga acaagctaaa caatcggggt accgctagag 1980
tcgacggtac cgcgggcccg ggatccaccg gtcgccacca tggtgagcaa gggcgaggag 2040
ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa cggccacaag 2100
ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac cctgaagttc 2160
atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac cctgacctac 2220
ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt cttcaagtcc 2280
gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga cggcaactac 2340
aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat cgagctgaag 2400
ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta caactacaac 2460
agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt gaacttcaag 2520
atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca gcagaacacc 2580
cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac ccagtccgcc 2640
ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt cgtgaccgcc 2700
gccgggatca ctctcggcat ggacgagctg tacaagtaac ggccgcgact ctagatcata 2760
atcagccatg cggccgcgac tctagaccac atttgtagag gttttacttg ctttaaaaaa 2820
cctcccacac ctccccctga acctgaaaca taaaatgaat gcaattgttg ttgttaactt 2880
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 2940
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttaaag 3000
cttatcgata cgcgtacggc gcgcctaggc cggccgatct cggatctgac aatgttcagt 3060
gcagagactc ggctacgcct cgtggacttt gaagttgacc aacaatgttt attcttacct 3120
ctaatagtcc tctgtggcaa ggtcaagatt ctgttagaag ccaatgaaga acctggttgt 3180
tcaataacat tttgttcgtc taatatttca ctaccgcttg acgttggctg cacttcatgt 3240
acctcatcta taaacgcttc ttctgtatcg ctctggacgt catcttcact tacgtgatct 3300
gatatttcac tgtcagaatc ctcaccaaca agctcgtcat cgctttgcag aagagcagag 3360
aggatatgct catcgtctaa agaactaccc attttattat atattagtca cgatatctat 3420
aacaagaaaa tatatatata ataagttatc acgtaagtag aacatgaaat aacaatataa 3480
ttatcgtatg agttaaatct taaaagtcac gtaaaagata atcatgcgtc attttgactc 3540
acgcggtcgt tatagttcaa aatcagtgac acttaccgca ttgacaagca cgcctcacgg 3600
gagctccaag cggcgactga gatgtcctaa atgcacagcg acggattcgc gctatttaga 3660
aagagagagc aatatttcaa gaatgcatgc gtcaatttta cgcagactat ctttctaggg 3720
ttaaaaaaga tttgcgcttt actcgaccta aactttaaac acgtcataga atcttcgttt 3780
gacaaaaacc acattgtggc caagctgtgt gacgcgacgc gcgctaaaga atggcaaacc 3840
aagtcgcgcg agcgtcgact ctagaggatc cccgggtacc gagctcgaat tcgtaatcat 3900
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 3960
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acatcggatg 4020
ccgggaccga cgagtgcaga ggcgtgcaag cgagcttggc gtaatcatgg tcatagctgt 4080
ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 4140
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 4200
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 4260
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 4320
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 4380
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 4440
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 4500
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 4560
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 4620
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta 4680
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 4740
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 4800
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 4860
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 4920
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 4980
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 5040
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 5100
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 5160
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 5220
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 5280
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 5340
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 5400
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 5460
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 5520
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 5580
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 5640
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 5700
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 5760
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 5820
gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 5880
taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 5940
tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 6000
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 6060
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 6120
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 6180
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa gaaaccatta 6240
ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt c 6291
<210> 4
<211> 6334
<212> DNA
<213> Artificial
<400> 4
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt ggagatcggt acttcgcgaa 420
tgcgtcgaga taagagggtt aaaaaatata ttttacgcac catatacgca tcgggttgat 480
atcgttaata tggatcaatt tgaacagttg attaacgtgt ctctgctcaa gtctttgatc 540
aaaacgcaaa tcgacgaaaa tgtgtcggac aatatcaagt cgatgagcga aaaactaaaa 600
aggctagaat acgacaatct cacagacagc gttgagatat acggtattca cgacagcagg 660
ctgaataata aaaaaattag aaactattat ttaaccctag aaagataatc atattgtgac 720
gtacgttaaa gataatcatg cgtaaaattg acgcatgtgt tttatcggtc tgtatatcga 780
ggtttattta ttaatttgaa tagatattaa gttttattat atttacactt acatactaat 840
aataaattca acaaacaatt tatttatgtt tatttattta ttaaaaaaaa acaaaaactc 900
aaaatttctt ctataaagta acaaaacttt taaacattct ctcttttaca aaaataaact 960
tattttgtac tttaaaaaca gtcatgttgt attataaaat aagtaattag cttaacttat 1020
acataataga aacaaattat acttattagt cagtcagaaa caactttggc acatatcaat 1080
attatgctct cgacaaataa cttttttgca ttttttgcac gatgcatttg cctttcgcct 1140
tattttagag gggcagtaag tacagtaagt acgttttttc attactggct cttcagtact 1200
gtcatctgat gtaccaggca cttcatttgg caaaatatta gagatattat cgcgcaaata 1260
tctcttcaaa gtaggagctt ctaaacgctt acgcataaac gatgacgtca ggctcatgta 1320
aaggtttctc ataaattttt tgcgactttg aaccttttct cccttgctac tgacattatg 1380
gctgtatata ataaaagaat ttatgcaggc aatgtttatc attccgtaca ataatgccat 1440
aggccaccta ttcgtcctcc tactgcaggt catcacagaa cacatttggt ctagcgtgtc 1500
cactccgcct ttagtttgat tataatacat aaccatttgc ggtttaccgg tactttcgtt 1560
gatagaagca tcctcatcac aagatgataa taagtatacc atcttagctg gcttcggttt 1620
atatgagacg agagtaaggg gtccgtcaaa acaaaacatc gatgttccca ctggcctgga 1680
gcgactgttt ttcagtactt ccggtatctc gcgtttgttc ctgcaggatc atgatgataa 1740
acaatgtatg gtgctaatgt tgcttcaaca acaattctgt tgaactgtgt tttcatgttt 1800
gccaacaagc acctttatac tcggtggcct ccccaccacc aacttttttg cactgcaaaa 1860
aaacacgctt ttgcacgcgg gcccatacat agtacaaact ctacgtttcg tagactattt 1920
tacataaata gtctacaccg ttgtatacgc tccaaataca ctaccacaca ttgaaccttt 1980
ttgcagtgca aaaaagtacg tgtcggcagt cacgtaggcc ggccttatcg ggtcgcgtcc 2040
tgtcacgtac gaatcacatt atcggaccgg acgagtgttg tcttatcgtg acaggacgcc 2100
agcttcctgt gttgctaacc gcagccggac gcaactcctt atcggaacag gacgcgcctc 2160
catatcagcc gcgcgttatc tcatgcgcgt gaccggacac gaggcgcccg tcccgcttat 2220
cgcgcctata aatacagccc gcaacgatct ggtaaacaca gttgaacagc atctgttcga 2280
aatggccaag ttgaccagtg ccgttccggt gctcaccgcg cgcgacgtcg ccggagcggt 2340
cgagttctgg accgaccggc tcgggttctc ccgggacttc gtggaggacg acttcgccgg 2400
tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc caggaccagg tggtgccgga 2460
caacaccctg gcctgggtgt gggtgcgcgg cctggacgag ctgtacgccg agtggtcgga 2520
ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atgaccgaga tcggcgagca 2580
gccgtggggg cgggagttcg ccctgcgcga cccggccggc aactgcgtgc acttcgtggc 2640
cgaggagcag gactaaagct ttacaactaa acacgacttg gagtattcct tgtagtgttt 2700
aagattttaa atcttactta atgacttcga acgattttaa cgataacttt ctctttgttt 2760
aactttaatc agcatacata aaaagccccg gttttgtatc gggaagaaaa aaaatgtaat 2820
tgtgttgcct agataataaa cgtattatca aagtgtgtgg ttttccttta ccaaagaccc 2880
ctttaagatg ggcctaatgg gcttaagtcg agtcctttcc gatgtgttaa atacacattt 2940
attacactga tgcgtcgaat gtacactttt aataggatag ctccactaaa aattatttta 3000
tttatttaat ttgttgcacc aaaactgata cattgacgaa acgcgtatgc tagcaatgaa 3060
ggcgcgccta ggccggccga tctcggatct gacaatgttc agtgcagaga ctcggctacg 3120
cctcgtggac tttgaagttg accaacaatg tttattctta cctctaatag tcctctgtgg 3180
caaggtcaag attctgttag aagccaatga agaacctggt tgttcaataa cattttgttc 3240
gtctaatatt tcactaccgc ttgacgttgg ctgcacttca tgtacctcat ctataaacgc 3300
ttcttctgta tcgctctgga cgtcatcttc acttacgtga tctgatattt cactgtcaga 3360
atcctcacca acaagctcgt catcgctttg cagaagagca gagaggatat gctcatcgtc 3420
taaagaacta cccattttat tatatattag tcacgatatc tataacaaga aaatatatat 3480
ataataagtt atcacgtaag tagaacatga aataacaata taattatcgt atgagttaaa 3540
tcttaaaagt cacgtaaaag ataatcatgc gtcattttga ctcacgcggt cgttatagtt 3600
caaaatcagt gacacttacc gcattgacaa gcacgcctca cgggagctcc aagcggcgac 3660
tgagatgtcc taaatgcaca gcgacggatt cgcgctattt agaaagagag agcaatattt 3720
caagaatgca tgcgtcaatt ttacgcagac tatctttcta gggttaaaaa agatttgcgc 3780
tttactcgac ctaaacttta aacacgtcat agaatcttcg tttgacaaaa accacattgt 3840
ggccaagctg tgtgacgcga cgcgcgctaa agaatggcaa accaagtcgc gcgagcgtcg 3900
actctagagg atccccgggt accgagctcg aattcgtaat catggtcata gctgtttcct 3960
gtgtgaaatt gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt 4020
aaagcctggg gtgcctaatg agtgagctaa ctcacatcgg atgccgggac cgacgagtgc 4080
agaggcgtgc aagcgagctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt 4140
tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt 4200
gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg 4260
ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg 4320
cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 4380
cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 4440
aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 4500
gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 4560
tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 4620
agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 4680
ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg 4740
taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc 4800
gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg 4860
gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc 4920
ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat ctgcgctctg 4980
ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc 5040
gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct 5100
caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt 5160
taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa 5220
aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 5280
tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 5340
tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 5400
gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 5460
gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 5520
aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt 5580
gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 5640
ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 5700
tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 5760
atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 5820
ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 5880
ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 5940
ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 6000
atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 6060
gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 6120
tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 6180
ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 6240
acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc 6300
tataaaaata ggcgtatcac gaggcccttt cgtc 6334
<210> 5
<211> 5898
<212> DNA
<213> Artificial
<400> 5
ggcgcgccta tactcgagca gcgtcgtgaa aagaggcaat gacaaataca aaacgacgta 60
tgagcagacc cgtcgccaag acgggtctac ctctaagatg atgtcatttg ttttttaaaa 120
ctaactcgct ttacgagtag aattctacgt gtaaaacata atcaagagat gatgtcattt 180
gtttttcaaa accaaactcg ctttacgagt agaattctac gtgtaaaaca caatcaaaag 240
atgatgtcat tcgtttttca aaaccgaatt taagaaatga tgtcatttgt ttttcaaaac 300
caaactcgct ttacgagcag aattctacgt gtaaaacaca atcaagagat gatgtcattt 360
gtttttcaaa actgaatgat gtcatttgtt tttcaaaact aaacttgctt tgcgagtaga 420
attctacgtg taaaacacag tcaagagatg atgtcatttg tttttcaaaa ctgaaccggc 480
tttacgagta gaattctact tgtaaaacat aatcaagaga tgatgtcatt tgtttttcaa 540
aactgaactg gctttacgag tagaattcta cgtgtaaaac ataatcaaga gatgatgtca 600
tcattaaact gatgtcattt tatacacgat tgttaacatg tttaataatg actaatttgt 660
ttttccaaat taaactcgct ttacgagtag aattctactt gtaacgcacg attaagtatg 720
aatcataagc tgatgtcatt tgttttcgac ataaaatgtt tatacaatgg aatcttcttg 780
taaattatcc aaataatata atttatccga ttctacgtta catttaaatt cgttgttatc 840
gtacaattct tcaggacacg ccatgtattg gtcattttta gcgtgcaacc aacgattgta 900
tttgacgccg tcgttggatt gcgtgttcag gttggcgtac acgtgactgg gcacggcttc 960
tttttccatg ggacgtcgac cgagaaattt ctctggccgt tattcgttat tctctctttt 1020
ctttttgggt ctctccctct ctgcactaat gctctctcac tctgtcacac agtaaacggc 1080
atactgctct cgttggttcg agagagcgcg cctcgaatgt tcgcgaaaag agcgccggag 1140
tataaataga ggcgcttcgt ctacggagcg acaattcaat tcaaacaagc aaagtgaaca 1200
cgtcgctaag cgaaagctaa gcaaataaac aagcgcagct gaacaagcta aacaatctgc 1260
agtaaagtgc aagttaaagt gaatcaatta aaagtaacca gcaaccaagt aaatcaactg 1320
caactactga aatctgccaa gaagtaatta ttgaatacaa gaagagaact ctgggggatc 1380
tctagtccag tgtggtggaa ttcgccatgg ccccaaagaa aaagagaaag gttgattaca 1440
aagaccacga cggagactac aaagaccacg acattgatta taaagatgat gatgataaag 1500
gaacgatgga caaaaagtat agcatcggtc tggatattgg aactaactcc gtcggctggg 1560
ctgtaatcac cgacgaatac aaggtcccgt caaaaaagtt caaggtattg ggtaacacag 1620
atcgtcactc tatcaaaaag aatctcattg gagctctgtt gttcgacagc ggcgaaacag 1680
ctgaggccac tagactgaag cgcaccgcca gacgccgtta cacgaggaga aagaacagaa 1740
tctgctactt gcaagaaata ttctcaaacg agatggccaa agtggacgat tcgttctttc 1800
ataggttaga agagagtttc cttgttgaag aggataaaaa gcacgaaaga catccgatat 1860
ttggaaacat cgtggacgaa gttgcttatc acgagaagta ccccacgatc tatcatctgc 1920
gtaaaaagtt ggtggactcg acagataagg ccgacctcag gttaatatac cttgcactgg 1980
cgcacatgat caaattcaga ggccattttc tgattgaagg tgacctgaac cctgacaata 2040
gtgatgtgga caaactcttc attcaattag ttcagaccta caatcaactg tttgaagaga 2100
accctatcaa cgcttcagga gttgacgcta aggccatcct tagtgcgaga ctgagcaaat 2160
cccgccgtct cgaaaactta atcgcacagt tgcctggaga gaaaaagaac ggtttgttcg 2220
gaaatctcat tgcgttgtca ctcggactca cgccaaactt caagtctaac ttcgatttgg 2280
cagaagacgc gaaactgcaa ctgagcaaag acacatatga cgatgacctc gataacctct 2340
tagctcagat cggcgatcaa tacgccgact tgttcctcgc tgccaaaaat ctgtcggacg 2400
ctatacttct gagtgatatc ttgcgcgtca acacagaaat tactaaggct cctctgtcgg 2460
ccagtatgat aaaacgctat gacgaacacc atcaggattt gacattgctc aaagccctcg 2520
tgcgtcaaca gctcccagaa aagtacaagg agattttctt tgatcagtcc aagaatggct 2580
acgcaggtta tatagacggt ggagcgtcgc aagaagagtt ctacaagttc atcaagccaa 2640
tattagaaaa gatggacggc acggaagagt tacttgttaa gctgaatcgt gaggacctgt 2700
tgcgtaaaca gaggacattc gataacggat caattccgca ccaaatacat cttggcgaac 2760
tgcacgctat cctcaggaga caagaggact tctacccctt tttaaaggat aaccgtgaaa 2820
agatcgagaa aatcctgact ttcaggattc cttactatgt cggcccactg gctcgtggta 2880
atagcaggtt tgcctggatg accaggaagt ccgaagagac aattactccg tggaacttcg 2940
aagaggtggt tgataaagga gcatcagcgc agtctttcat agaacgcatg acaaattttg 3000
acaagaactt accgaatgag aaggtccttc ccaaacactc actcctctac gaatacttca 3060
cagtatacaa cgagctcact aaagtcaagt acgtaaccga gggtatgcgc aaacccgctt 3120
tcctgtctgg agagcagaaa aaggccatcg tggaccttct gttcaagaca aaccgtaagg 3180
tcactgtaaa gcaactcaag gaagactact tcaaaaagat agagtgtttc gattcagtgg 3240
aaatctctgg cgttgaggac agatttaacg cttccttggg tacttaccac gatttgctca 3300
agatcattaa agataaggac ttcctcgaca acgaagagaa cgaagatatc ttagaggaca 3360
tagttctcac ccttacgctg tttgaagata gagagatgat tgaagagcgc ctgaagactt 3420
atgctcattt gttcgatgac aaagtcatga agcaactgaa acgccgtagg tacaccggct 3480
ggggtagatt atcgcgcaaa cttattaatg gtataaggga caagcagtcg ggaaaaacga 3540
tattggactt tctcaagagt gatggtttcg ccaacagaaa ttttatgcaa ctcatacacg 3600
atgacagctt aacattcaag gaagatatcc aaaaagcaca ggtgtcggga cagggcgaca 3660
gtttgcacga acatattgct aacctcgccg gctccccggc gataaaaaag ggtatccttc 3720
agactgtgaa agtcgtagat gaactggtga aggttatggg tcgtcataaa cccgagaaca 3780
tagttatcga aatggctagg gagaatcaaa caactcagaa gggacagaaa aactcaagag 3840
aacgcatgaa gcgcattgaa gagggtatca aagagcttgg cagtcaaatc ctgaaggaac 3900
accctgtcga gaacacgcaa cttcagaacg aaaaattgta cctctactat ctgcagaatg 3960
gtagagatat gtacgtagac caagaattgg atattaaccg cctctcagat tacgacgtgg 4020
atcatatagt tccgcagtca ttcttgaagg atgactctat cgacaacaaa gtcctcacaa 4080
gatcagacaa gaaccgcgga aaatcagata atgtaccctc tgaagaggtg gttaaaaaga 4140
tgaaaaacta ctggagacag ttacttaacg ctaagttgat cacgcaaaga aagttcgata 4200
acctcacaaa ggctgaacgc ggcggtttaa gcgagcttga caaggccggt ttcataaaac 4260
gtcagttagt cgaaaccagg caaattacga aacacgtagc ccaaatattg gattcccgca 4320
tgaacactaa atacgatgaa aatgacaagc tcatccgtga ggtcaaagta attaccctga 4380
aaagcaagtt ggtgtccgac ttcagaaagg atttccagtt ctacaaagtt cgcgaaatca 4440
acaactacca ccatgcacat gacgcttacc tgaacgcagt cgtaggcact gcgttaatta 4500
aaaagtaccc taaactggaa tctgagttcg tgtacggtga ctataaagtg tacgatgtta 4560
gaaagatgat cgctaaaagc gaacaggaga ttggaaaggc taccgccaag tatttctttt 4620
actccaacat catgaatttc tttaagaccg aaatcacgtt agcaaatggc gagatacgta 4680
aaaggccact tatcgaaaca aacggagaaa ctggcgagat agtgtgggac aagggtagag 4740
attttgccac tgtccgcaaa gtactgtcga tgccgcaagt gaatatcgtt aaaaagaccg 4800
aagttcaaac gggaggcttc agcaaagagt ccatcctgcc caagcgtaac agtgataaat 4860
tgatagctag gaaaaaggac tgggatccta aaaagtatgg tggattcgac agcccaactg 4920
tcgcatactc cgtattggtg gttgcgaaag tcgaaaaagg aaagagcaaa aagctcaagt 4980
ccgtaaaaga gctgttgggc attaccataa tggaaagatc atctttcgag aagaatccta 5040
tcgattttct ggaagccaag ggatataaag aggtcaaaaa ggacctcata atcaagttac 5100
caaaatacag tctgttcgaa ttggagaacg gcagaaaacg catgcttgca tcagcgggtg 5160
aactgcaaaa gggaaatgag ttagcacttc cttctaaata cgtcaacttc ctgtatttgg 5220
cgtcacacta cgaaaaactg aagggctctc cagaagataa cgagcaaaag cagttatttg 5280
tggaacagca caaacattac cttgacgaaa ttatagagca aatctcggag ttcagtaaga 5340
gagtgatttt ggctgacgcc aatcttgata aagttctgtc tgcttacaac aagcaccgtg 5400
ataaaccgat tagggaacag gccgagaaca tcatacatct cttcacactc actaaccttg 5460
gtgcacccgc agcgttcaaa tattttgaca ccacgataga tcgtaagagg tacaccagca 5520
cgaaagaagt tttggacgcg acactcatcc atcaatcaat cacgggcctg tacgagacca 5580
gaatcgacct gtcccagctc ggtggcgact agcggccgcg actctagatc ataatcagcc 5640
atgcggccgc gactctagac cacatttgta gaggttttac ttgctttaaa aaacctccca 5700
cacctccccc tgaacctgaa acataaaatg aatgcaattg ttgttgttaa cttgtttatt 5760
gcagcttata atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt 5820
ttttcactgc attctagttg tggtttgtcc aaactcatca atgtatctta aagcttatcg 5880
atacgcgtac ggcgcgcc 5898
<210> 6
<211> 6247
<212> DNA
<213> Artificial
<400> 6
ggcgcgccta tactcgagca gcgtcgtgaa aagaggcaat gacaaataca aaacgacgta 60
tgagcagacc cgtcgccaag acgggtctac ctctaagatg atgtcatttg ttttttaaaa 120
ctaactcgct ttacgagtag aattctacgt gtaaaacata atcaagagat gatgtcattt 180
gtttttcaaa accaaactcg ctttacgagt agaattctac gtgtaaaaca caatcaaaag 240
atgatgtcat tcgtttttca aaaccgaatt taagaaatga tgtcatttgt ttttcaaaac 300
caaactcgct ttacgagcag aattctacgt gtaaaacaca atcaagagat gatgtcattt 360
gtttttcaaa actgaatgat gtcatttgtt tttcaaaact aaacttgctt tgcgagtaga 420
attctacgtg taaaacacag tcaagagatg atgtcatttg tttttcaaaa ctgaaccggc 480
tttacgagta gaattctact tgtaaaacat aatcaagaga tgatgtcatt tgtttttcaa 540
aactgaactg gctttacgag tagaattcta cgtgtaaaac ataatcaaga gatgatgtca 600
tcattaaact gatgtcattt tatacacgat tgttaacatg tttaataatg actaatttgt 660
ttttccaaat taaactcgct ttacgagtag aattctactt gtaacgcacg attaagtatg 720
aatcataagc tgatgtcatt tgttttcgac ataaaatgtt tatacaatgg aatcttcttg 780
taaattatcc aaataatata atttatccga ttctacgtta catttaaatt cgttgttatc 840
gtacaattct tcaggacacg ccatgtattg gtcattttta gcgtgcaacc aacgattgta 900
tttgacgccg tcgttggatt gcgtgttcag gttggcgtac acgtgactgg gcacggcttc 960
tttttccatg ggacgtcgac tcatcttgtc acacctacat cttactaatt tcgtaagtag 1020
attttttttt acacgtataa tgtatgtatt ctttccttaa ttaacttatt ttgaaacgaa 1080
ataaataggc tattaatatt tggaactagg ttgcggtcaa tgtcaatgtc tgtctcaact 1140
ttaattcaga atgccttgtg ttccgtagat gctataaatc aatcaagatg catcttggat 1200
tgttgccaac tcgcagctac aaaatttgtt tccaagccta agcatagtgc tgtacccgtt 1260
cccgtgtatt caaatcccgt ataatagtat aatatactcc gtaaatgtag tgtcactgct 1320
tgctgaaatg atattgcaag ttccgttggg aatcttgccg ttatcaagca atgcgatatt 1380
agcggtatgg cgggaggggg acgcgcagac tccctctgct gtattaccat atatggacac 1440
aaaacttcgt gtattgtacc ctagcgcgcg attggaggag agtctgcggc ggcggggcag 1500
gggcgccccg ataaccggcc tcatttatat agtccgccaa gcgcactcac caacattcca 1560
cgaagtgagc ttgggtcgtt gcgttgtaca gcaataacga agctgtgcaa tagcaagtta 1620
atttatttat ttataataga actatttaat taaaagtaag ttattttcat tgtgtcttca 1680
aatatattaa gtgattgtga taacggttaa cggttgttag aggattggta ctagtccagt 1740
gtggtggaat tcgccatggc cccaaagaaa aagagaaagg ttgattacaa agaccacgac 1800
ggagactaca aagaccacga cattgattat aaagatgatg atgataaagg aacgatggac 1860
aaaaagtata gcatcggtct ggatattgga actaactccg tcggctgggc tgtaatcacc 1920
gacgaataca aggtcccgtc aaaaaagttc aaggtattgg gtaacacaga tcgtcactct 1980
atcaaaaaga atctcattgg agctctgttg ttcgacagcg gcgaaacagc tgaggccact 2040
agactgaagc gcaccgccag acgccgttac acgaggagaa agaacagaat ctgctacttg 2100
caagaaatat tctcaaacga gatggccaaa gtggacgatt cgttctttca taggttagaa 2160
gagagtttcc ttgttgaaga ggataaaaag cacgaaagac atccgatatt tggaaacatc 2220
gtggacgaag ttgcttatca cgagaagtac cccacgatct atcatctgcg taaaaagttg 2280
gtggactcga cagataaggc cgacctcagg ttaatatacc ttgcactggc gcacatgatc 2340
aaattcagag gccattttct gattgaaggt gacctgaacc ctgacaatag tgatgtggac 2400
aaactcttca ttcaattagt tcagacctac aatcaactgt ttgaagagaa ccctatcaac 2460
gcttcaggag ttgacgctaa ggccatcctt agtgcgagac tgagcaaatc ccgccgtctc 2520
gaaaacttaa tcgcacagtt gcctggagag aaaaagaacg gtttgttcgg aaatctcatt 2580
gcgttgtcac tcggactcac gccaaacttc aagtctaact tcgatttggc agaagacgcg 2640
aaactgcaac tgagcaaaga cacatatgac gatgacctcg ataacctctt agctcagatc 2700
ggcgatcaat acgccgactt gttcctcgct gccaaaaatc tgtcggacgc tatacttctg 2760
agtgatatct tgcgcgtcaa cacagaaatt actaaggctc ctctgtcggc cagtatgata 2820
aaacgctatg acgaacacca tcaggatttg acattgctca aagccctcgt gcgtcaacag 2880
ctcccagaaa agtacaagga gattttcttt gatcagtcca agaatggcta cgcaggttat 2940
atagacggtg gagcgtcgca agaagagttc tacaagttca tcaagccaat attagaaaag 3000
atggacggca cggaagagtt acttgttaag ctgaatcgtg aggacctgtt gcgtaaacag 3060
aggacattcg ataacggatc aattccgcac caaatacatc ttggcgaact gcacgctatc 3120
ctcaggagac aagaggactt ctaccccttt ttaaaggata accgtgaaaa gatcgagaaa 3180
atcctgactt tcaggattcc ttactatgtc ggcccactgg ctcgtggtaa tagcaggttt 3240
gcctggatga ccaggaagtc cgaagagaca attactccgt ggaacttcga agaggtggtt 3300
gataaaggag catcagcgca gtctttcata gaacgcatga caaattttga caagaactta 3360
ccgaatgaga aggtccttcc caaacactca ctcctctacg aatacttcac agtatacaac 3420
gagctcacta aagtcaagta cgtaaccgag ggtatgcgca aacccgcttt cctgtctgga 3480
gagcagaaaa aggccatcgt ggaccttctg ttcaagacaa accgtaaggt cactgtaaag 3540
caactcaagg aagactactt caaaaagata gagtgtttcg attcagtgga aatctctggc 3600
gttgaggaca gatttaacgc ttccttgggt acttaccacg atttgctcaa gatcattaaa 3660
gataaggact tcctcgacaa cgaagagaac gaagatatct tagaggacat agttctcacc 3720
cttacgctgt ttgaagatag agagatgatt gaagagcgcc tgaagactta tgctcatttg 3780
ttcgatgaca aagtcatgaa gcaactgaaa cgccgtaggt acaccggctg gggtagatta 3840
tcgcgcaaac ttattaatgg tataagggac aagcagtcgg gaaaaacgat attggacttt 3900
ctcaagagtg atggtttcgc caacagaaat tttatgcaac tcatacacga tgacagctta 3960
acattcaagg aagatatcca aaaagcacag gtgtcgggac agggcgacag tttgcacgaa 4020
catattgcta acctcgccgg ctccccggcg ataaaaaagg gtatccttca gactgtgaaa 4080
gtcgtagatg aactggtgaa ggttatgggt cgtcataaac ccgagaacat agttatcgaa 4140
atggctaggg agaatcaaac aactcagaag ggacagaaaa actcaagaga acgcatgaag 4200
cgcattgaag agggtatcaa agagcttggc agtcaaatcc tgaaggaaca ccctgtcgag 4260
aacacgcaac ttcagaacga aaaattgtac ctctactatc tgcagaatgg tagagatatg 4320
tacgtagacc aagaattgga tattaaccgc ctctcagatt acgacgtgga tcatatagtt 4380
ccgcagtcat tcttgaagga tgactctatc gacaacaaag tcctcacaag atcagacaag 4440
aaccgcggaa aatcagataa tgtaccctct gaagaggtgg ttaaaaagat gaaaaactac 4500
tggagacagt tacttaacgc taagttgatc acgcaaagaa agttcgataa cctcacaaag 4560
gctgaacgcg gcggtttaag cgagcttgac aaggccggtt tcataaaacg tcagttagtc 4620
gaaaccaggc aaattacgaa acacgtagcc caaatattgg attcccgcat gaacactaaa 4680
tacgatgaaa atgacaagct catccgtgag gtcaaagtaa ttaccctgaa aagcaagttg 4740
gtgtccgact tcagaaagga tttccagttc tacaaagttc gcgaaatcaa caactaccac 4800
catgcacatg acgcttacct gaacgcagtc gtaggcactg cgttaattaa aaagtaccct 4860
aaactggaat ctgagttcgt gtacggtgac tataaagtgt acgatgttag aaagatgatc 4920
gctaaaagcg aacaggagat tggaaaggct accgccaagt atttctttta ctccaacatc 4980
atgaatttct ttaagaccga aatcacgtta gcaaatggcg agatacgtaa aaggccactt 5040
atcgaaacaa acggagaaac tggcgagata gtgtgggaca agggtagaga ttttgccact 5100
gtccgcaaag tactgtcgat gccgcaagtg aatatcgtta aaaagaccga agttcaaacg 5160
ggaggcttca gcaaagagtc catcctgccc aagcgtaaca gtgataaatt gatagctagg 5220
aaaaaggact gggatcctaa aaagtatggt ggattcgaca gcccaactgt cgcatactcc 5280
gtattggtgg ttgcgaaagt cgaaaaagga aagagcaaaa agctcaagtc cgtaaaagag 5340
ctgttgggca ttaccataat ggaaagatca tctttcgaga agaatcctat cgattttctg 5400
gaagccaagg gatataaaga ggtcaaaaag gacctcataa tcaagttacc aaaatacagt 5460
ctgttcgaat tggagaacgg cagaaaacgc atgcttgcat cagcgggtga actgcaaaag 5520
ggaaatgagt tagcacttcc ttctaaatac gtcaacttcc tgtatttggc gtcacactac 5580
gaaaaactga agggctctcc agaagataac gagcaaaagc agttatttgt ggaacagcac 5640
aaacattacc ttgacgaaat tatagagcaa atctcggagt tcagtaagag agtgattttg 5700
gctgacgcca atcttgataa agttctgtct gcttacaaca agcaccgtga taaaccgatt 5760
agggaacagg ccgagaacat catacatctc ttcacactca ctaaccttgg tgcacccgca 5820
gcgttcaaat attttgacac cacgatagat cgtaagaggt acaccagcac gaaagaagtt 5880
ttggacgcga cactcatcca tcaatcaatc acgggcctgt acgagaccag aatcgacctg 5940
tcccagctcg gtggcgacta gcggccgcga ctctagatca taatcagcca tgcggccgcg 6000
actctagacc acatttgtag aggttttact tgctttaaaa aacctcccac acctccccct 6060
gaacctgaaa cataaaatga atgcaattgt tgttgttaac ttgtttattg cagcttataa 6120
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 6180
ttctagttgt ggtttgtcca aactcatcaa tgtatcttaa agcttatcga tacgcgtacg 6240
gcgcgcc 6247
<210> 7
<211> 12207
<212> DNA
<213> Artificial
<400> 7
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt ggagatcggt acttcgcgaa 420
tgcgtcgaga taagagggtt aaaaaatata ttttacgcac catatacgca tcgggttgat 480
atcgttaata tggatcaatt tgaacagttg attaacgtgt ctctgctcaa gtctttgatc 540
aaaacgcaaa tcgacgaaaa tgtgtcggac aatatcaagt cgatgagcga aaaactaaaa 600
aggctagaat acgacaatct cacagacagc gttgagatat acggtattca cgacagcagg 660
ctgaataata aaaaaattag aaactattat ttaaccctag aaagataatc atattgtgac 720
gtacgttaaa gataatcatg cgtaaaattg acgcatgtgt tttatcggtc tgtatatcga 780
ggtttattta ttaatttgaa tagatattaa gttttattat atttacactt acatactaat 840
aataaattca acaaacaatt tatttatgtt tatttattta ttaaaaaaaa acaaaaactc 900
aaaatttctt ctataaagta acaaaacttt taaacattct ctcttttaca aaaataaact 960
tattttgtac tttaaaaaca gtcatgttgt attataaaat aagtaattag cttaacttat 1020
acataataga aacaaattat acttattagt cagtcagaaa caactttggc acatatcaat 1080
attatgctct cgacaaataa cttttttgca ttttttgcac gatgcatttg cctttcgcct 1140
tattttagag gggcagtaag tacagtaagt acgttttttc attactggct cttcagtact 1200
gtcatctgat gtaccaggca cttcatttgg caaaatatta gagatattat cgcgcaaata 1260
tctcttcaaa gtaggagctt ctaaacgctt acgcataaac gatgacgtca ggctcatgta 1320
aaggtttctc ataaattttt tgcgactttg aaccttttct cccttgctac tgacattatg 1380
gctgtatata ataaaagaat ttatgcaggc aatgtttatc attccgtaca ataatgccat 1440
aggccaccta ttcgtcctcc tactgcaggt catcacagaa cacatttggt ctagcgtgtc 1500
cactccgcct ttagtttgat tataatacat aaccatttgc ggtttaccgg tactttcgtt 1560
gatagaagca tcctcatcac aagatgataa taagtatacc atcttagctg gcttcggttt 1620
atatgagacg agagtaaggg gtccgtcaaa acaaaacatc gatgttccca ctggcctgga 1680
gcgactgttt ttcagtactt ccggtatctc gcgtttgttc ctgcaggatc atgatgataa 1740
acaatgtatg gtgctaatgt tgcttcaaca acaattctgt tgaactgtgt tttcatgttt 1800
gccaacaagc acctttatac tcggtggcct ccccaccacc aacttttttg cactgcaaaa 1860
aaacacgctt ttgcacgcgg gcccatacat agtacaaact ctacgtttcg tagactattt 1920
tacataaata gtctacaccg ttgtatacgc tccaaataca ctaccacaca ttgaaccttt 1980
ttgcagtgca aaaaagtacg tgtcggcagt cacgtaggcc ggccttatcg ggtcgcgtcc 2040
tgtcacgtac gaatcacatt atcggaccgg acgagtgttg tcttatcgtg acaggacgcc 2100
agcttcctgt gttgctaacc gcagccggac gcaactcctt atcggaacag gacgcgcctc 2160
catatcagcc gcgcgttatc tcatgcgcgt gaccggacac gaggcgcccg tcccgcttat 2220
cgcgcctata aatacagccc gcaacgatct ggtaaacaca gttgaacagc atctgttcga 2280
aatggccaag ttgaccagtg ccgttccggt gctcaccgcg cgcgacgtcg ccggagcggt 2340
cgagttctgg accgaccggc tcgggttctc ccgggacttc gtggaggacg acttcgccgg 2400
tgtggtccgg gacgacgtga ccctgttcat cagcgcggtc caggaccagg tggtgccgga 2460
caacaccctg gcctgggtgt gggtgcgcgg cctggacgag ctgtacgccg agtggtcgga 2520
ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atgaccgaga tcggcgagca 2580
gccgtggggg cgggagttcg ccctgcgcga cccggccggc aactgcgtgc acttcgtggc 2640
cgaggagcag gactaaagct ttacaactaa acacgacttg gagtattcct tgtagtgttt 2700
aagattttaa atcttactta atgacttcga acgattttaa cgataacttt ctctttgttt 2760
aactttaatc agcatacata aaaagccccg gttttgtatc gggaagaaaa aaaatgtaat 2820
tgtgttgcct agataataaa cgtattatca aagtgtgtgg ttttccttta ccaaagaccc 2880
ctttaagatg ggcctaatgg gcttaagtcg agtcctttcc gatgtgttaa atacacattt 2940
attacactga tgcgtcgaat gtacactttt aataggatag ctccactaaa aattatttta 3000
tttatttaat ttgttgcacc aaaactgata cattgacgaa acgcgtatgc tagcaatgaa 3060
ggcgcgccca gcgtcgtgaa aagaggcaat gacaaataca aaacgacgta tgagcagacc 3120
cgtcgccaag acgggtctac ctctaagatg atgtcatttg ttttttaaaa ctaactcgct 3180
ttacgagtag aattctacgt gtaaaacata atcaagagat gatgtcattt gtttttcaaa 3240
accaaactcg ctttacgagt agaattctac gtgtaaaaca caatcaaaag atgatgtcat 3300
tcgtttttca aaaccgaatt taagaaatga tgtcatttgt ttttcaaaac caaactcgct 3360
ttacgagcag aattctacgt gtaaaacaca atcaagagat gatgtcattt gtttttcaaa 3420
actgaatgat gtcatttgtt tttcaaaact aaacttgctt tgcgagtaga attctacgtg 3480
taaaacacag tcaagagatg atgtcatttg tttttcaaaa ctgaaccggc tttacgagta 3540
gaattctact tgtaaaacat aatcaagaga tgatgtcatt tgtttttcaa aactgaactg 3600
gctttacgag tagaattcta cgtgtaaaac ataatcaaga gatgatgtca tcattaaact 3660
gatgtcattt tatacacgat tgttaacatg tttaataatg actaatttgt ttttccaaat 3720
taaactcgct ttacgagtag aattctactt gtaacgcacg attaagtatg aatcataagc 3780
tgatgtcatt tgttttcgac ataaaatgtt tatacaatgg aatcttcttg taaattatcc 3840
aaataatata atttatccga ttctacgtta catttaaatt cgttgttatc gtacaattct 3900
tcaggacacg ccatgtattg gtcattttta gcgtgcaacc aacgattgta tttgacgccg 3960
tcgttggatt gcgtgttcag gttggcgtac acgtgactgg gcacggcttc tttttccatg 4020
ggacgtcgac cgagaaattt ctctggccgt tattcgttat tctctctttt ctttttgggt 4080
ctctccctct ctgcactaat gctctctcac tctgtcacac agtaaacggc atactgctct 4140
cgttggttcg agagagcgcg cctcgaatgt tcgcgaaaag agcgccggag tataaataga 4200
ggcgcttcgt ctacggagcg acaattcaat tcaaacaagc aaagtgaaca cgtcgctaag 4260
cgaaagctaa gcaaataaac aagcgcagct gaacaagcta aacaatctgc agtaaagtgc 4320
aagttaaagt gaatcaatta aaagtaacca gcaaccaagt aaatcaactg caactactga 4380
aatctgccaa gaagtaatta ttgaatacaa gaagagaact ctgggggatc tctagtccag 4440
tgtggtggaa ttcgccatgg ccccaaagaa aaagagaaag gttgattaca aagaccacga 4500
cggagactac aaagaccacg acattgatta taaagatgat gatgataaag gaacgatgga 4560
caaaaagtat agcatcggtc tggatattgg aactaactcc gtcggctggg ctgtaatcac 4620
cgacgaatac aaggtcccgt caaaaaagtt caaggtattg ggtaacacag atcgtcactc 4680
tatcaaaaag aatctcattg gagctctgtt gttcgacagc ggcgaaacag ctgaggccac 4740
tagactgaag cgcaccgcca gacgccgtta cacgaggaga aagaacagaa tctgctactt 4800
gcaagaaata ttctcaaacg agatggccaa agtggacgat tcgttctttc ataggttaga 4860
agagagtttc cttgttgaag aggataaaaa gcacgaaaga catccgatat ttggaaacat 4920
cgtggacgaa gttgcttatc acgagaagta ccccacgatc tatcatctgc gtaaaaagtt 4980
ggtggactcg acagataagg ccgacctcag gttaatatac cttgcactgg cgcacatgat 5040
caaattcaga ggccattttc tgattgaagg tgacctgaac cctgacaata gtgatgtgga 5100
caaactcttc attcaattag ttcagaccta caatcaactg tttgaagaga accctatcaa 5160
cgcttcagga gttgacgcta aggccatcct tagtgcgaga ctgagcaaat cccgccgtct 5220
cgaaaactta atcgcacagt tgcctggaga gaaaaagaac ggtttgttcg gaaatctcat 5280
tgcgttgtca ctcggactca cgccaaactt caagtctaac ttcgatttgg cagaagacgc 5340
gaaactgcaa ctgagcaaag acacatatga cgatgacctc gataacctct tagctcagat 5400
cggcgatcaa tacgccgact tgttcctcgc tgccaaaaat ctgtcggacg ctatacttct 5460
gagtgatatc ttgcgcgtca acacagaaat tactaaggct cctctgtcgg ccagtatgat 5520
aaaacgctat gacgaacacc atcaggattt gacattgctc aaagccctcg tgcgtcaaca 5580
gctcccagaa aagtacaagg agattttctt tgatcagtcc aagaatggct acgcaggtta 5640
tatagacggt ggagcgtcgc aagaagagtt ctacaagttc atcaagccaa tattagaaaa 5700
gatggacggc acggaagagt tacttgttaa gctgaatcgt gaggacctgt tgcgtaaaca 5760
gaggacattc gataacggat caattccgca ccaaatacat cttggcgaac tgcacgctat 5820
cctcaggaga caagaggact tctacccctt tttaaaggat aaccgtgaaa agatcgagaa 5880
aatcctgact ttcaggattc cttactatgt cggcccactg gctcgtggta atagcaggtt 5940
tgcctggatg accaggaagt ccgaagagac aattactccg tggaacttcg aagaggtggt 6000
tgataaagga gcatcagcgc agtctttcat agaacgcatg acaaattttg acaagaactt 6060
accgaatgag aaggtccttc ccaaacactc actcctctac gaatacttca cagtatacaa 6120
cgagctcact aaagtcaagt acgtaaccga gggtatgcgc aaacccgctt tcctgtctgg 6180
agagcagaaa aaggccatcg tggaccttct gttcaagaca aaccgtaagg tcactgtaaa 6240
gcaactcaag gaagactact tcaaaaagat agagtgtttc gattcagtgg aaatctctgg 6300
cgttgaggac agatttaacg cttccttggg tacttaccac gatttgctca agatcattaa 6360
agataaggac ttcctcgaca acgaagagaa cgaagatatc ttagaggaca tagttctcac 6420
ccttacgctg tttgaagata gagagatgat tgaagagcgc ctgaagactt atgctcattt 6480
gttcgatgac aaagtcatga agcaactgaa acgccgtagg tacaccggct ggggtagatt 6540
atcgcgcaaa cttattaatg gtataaggga caagcagtcg ggaaaaacga tattggactt 6600
tctcaagagt gatggtttcg ccaacagaaa ttttatgcaa ctcatacacg atgacagctt 6660
aacattcaag gaagatatcc aaaaagcaca ggtgtcggga cagggcgaca gtttgcacga 6720
acatattgct aacctcgccg gctccccggc gataaaaaag ggtatccttc agactgtgaa 6780
agtcgtagat gaactggtga aggttatggg tcgtcataaa cccgagaaca tagttatcga 6840
aatggctagg gagaatcaaa caactcagaa gggacagaaa aactcaagag aacgcatgaa 6900
gcgcattgaa gagggtatca aagagcttgg cagtcaaatc ctgaaggaac accctgtcga 6960
gaacacgcaa cttcagaacg aaaaattgta cctctactat ctgcagaatg gtagagatat 7020
gtacgtagac caagaattgg atattaaccg cctctcagat tacgacgtgg atcatatagt 7080
tccgcagtca ttcttgaagg atgactctat cgacaacaaa gtcctcacaa gatcagacaa 7140
gaaccgcgga aaatcagata atgtaccctc tgaagaggtg gttaaaaaga tgaaaaacta 7200
ctggagacag ttacttaacg ctaagttgat cacgcaaaga aagttcgata acctcacaaa 7260
ggctgaacgc ggcggtttaa gcgagcttga caaggccggt ttcataaaac gtcagttagt 7320
cgaaaccagg caaattacga aacacgtagc ccaaatattg gattcccgca tgaacactaa 7380
atacgatgaa aatgacaagc tcatccgtga ggtcaaagta attaccctga aaagcaagtt 7440
ggtgtccgac ttcagaaagg atttccagtt ctacaaagtt cgcgaaatca acaactacca 7500
ccatgcacat gacgcttacc tgaacgcagt cgtaggcact gcgttaatta aaaagtaccc 7560
taaactggaa tctgagttcg tgtacggtga ctataaagtg tacgatgtta gaaagatgat 7620
cgctaaaagc gaacaggaga ttggaaaggc taccgccaag tatttctttt actccaacat 7680
catgaatttc tttaagaccg aaatcacgtt agcaaatggc gagatacgta aaaggccact 7740
tatcgaaaca aacggagaaa ctggcgagat agtgtgggac aagggtagag attttgccac 7800
tgtccgcaaa gtactgtcga tgccgcaagt gaatatcgtt aaaaagaccg aagttcaaac 7860
gggaggcttc agcaaagagt ccatcctgcc caagcgtaac agtgataaat tgatagctag 7920
gaaaaaggac tgggatccta aaaagtatgg tggattcgac agcccaactg tcgcatactc 7980
cgtattggtg gttgcgaaag tcgaaaaagg aaagagcaaa aagctcaagt ccgtaaaaga 8040
gctgttgggc attaccataa tggaaagatc atctttcgag aagaatccta tcgattttct 8100
ggaagccaag ggatataaag aggtcaaaaa ggacctcata atcaagttac caaaatacag 8160
tctgttcgaa ttggagaacg gcagaaaacg catgcttgca tcagcgggtg aactgcaaaa 8220
gggaaatgag ttagcacttc cttctaaata cgtcaacttc ctgtatttgg cgtcacacta 8280
cgaaaaactg aagggctctc cagaagataa cgagcaaaag cagttatttg tggaacagca 8340
caaacattac cttgacgaaa ttatagagca aatctcggag ttcagtaaga gagtgatttt 8400
ggctgacgcc aatcttgata aagttctgtc tgcttacaac aagcaccgtg ataaaccgat 8460
tagggaacag gccgagaaca tcatacatct cttcacactc actaaccttg gtgcacccgc 8520
agcgttcaaa tattttgaca ccacgataga tcgtaagagg tacaccagca cgaaagaagt 8580
tttggacgcg acactcatcc atcaatcaat cacgggcctg tacgagacca gaatcgacct 8640
gtcccagctc ggtggcgact agcggccgcg actctagatc ataatcagcc atgcggccgc 8700
gactctagac cacatttgta gaggttttac ttgctttaaa aaacctccca cacctccccc 8760
tgaacctgaa acataaaatg aatgcaattg ttgttgttaa cttgtttatt gcagcttata 8820
atggttacaa ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc 8880
attctagttg tggtttgtcc aaactcatca atgtatctta aagcttatcg atacgcgtac 8940
ctaggccggc cgatctcgga tctgacaatg ttcagtgcag agactcggct acgcctcgtg 9000
gactttgaag ttgaccaaca atgtttattc ttacctctaa tagtcctctg tggcaaggtc 9060
aagattctgt tagaagccaa tgaagaacct ggttgttcaa taacattttg ttcgtctaat 9120
atttcactac cgcttgacgt tggctgcact tcatgtacct catctataaa cgcttcttct 9180
gtatcgctct ggacgtcatc ttcacttacg tgatctgata tttcactgtc agaatcctca 9240
ccaacaagct cgtcatcgct ttgcagaaga gcagagagga tatgctcatc gtctaaagaa 9300
ctacccattt tattatatat tagtcacgat atctataaca agaaaatata tatataataa 9360
gttatcacgt aagtagaaca tgaaataaca atataattat cgtatgagtt aaatcttaaa 9420
agtcacgtaa aagataatca tgcgtcattt tgactcacgc ggtcgttata gttcaaaatc 9480
agtgacactt accgcattga caagcacgcc tcacgggagc tccaagcggc gactgagatg 9540
tcctaaatgc acagcgacgg attcgcgcta tttagaaaga gagagcaata tttcaagaat 9600
gcatgcgtca attttacgca gactatcttt ctagggttaa aaaagatttg cgctttactc 9660
gacctaaact ttaaacacgt catagaatct tcgtttgaca aaaaccacat tgtggccaag 9720
ctgtgtgacg cgacgcgcgc taaagaatgg caaaccaagt cgcgcgagcg tcgactctag 9780
aggatccccg ggtaccgagc tcgaattcgt aatcatggtc atagctgttt cctgtgtgaa 9840
attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 9900
ggggtgccta atgagtgagc taactcacat cggatgccgg gaccgacgag tgcagaggcg 9960
tgcaagcgag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc 10020
tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat 10080
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 10140
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 10200
ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag 10260
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 10320
gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 10380
tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc 10440
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc 10500
tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt 10560
cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg 10620
ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat 10680
ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag 10740
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 10800
ggtggcctaa ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc 10860
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 10920
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag 10980
atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga 11040
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa 11100
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 11160
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 11220
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 11280
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa 11340
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt 11400
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg 11460
ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 11520
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 11580
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 11640
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 11700
actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt 11760
caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac 11820
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 11880
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 11940
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 12000
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 12060
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 12120
cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa 12180
ataggcgtat cacgaggccc tttcgtc 12207
<210> 8
<211> 3947
<212> DNA
<213> Artificial
<400> 8
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180
aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240
ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300
ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360
tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420
tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480
actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540
gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600
acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660
gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720
acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780
gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840
ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960
cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020
agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080
catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140
tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 1380
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860
gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920
ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980
cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040
cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160
cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220
accatgatta cgccaagctc gcttgcacgc ctctgcactc gtcggtcccg gcatccgatg 2280
acccaatctc gagttgctag caatgaaaga tctttatcga tttagccaaa agcaaaagct 2340
tgaccaaaaa taggataata tttgtttttt tatttaaaaa aataaacaat tttttataca 2400
taaactgttt atctagtatt aatatttatg ttaacatttg ataacgaatc aaatatattt 2460
ttaaactaat taaaaaatcc gatgtatgtt ataaaattgt tctagaaaaa aagcaccgac 2520
tcggtgccac tttttcaagt tgataacgga ctagccttat tttaacttgc tatttctagc 2580
tctaaaacac tggcaggtgt cttgacgagt tcttctgaat tattaacgct tacaatttcc 2640
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcatc aggtggcact 2700
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 2760
tatccgctca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga 2820
agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgccac 2880
ctgccatcac ttgtagagca cgatattttg tatatatacc taaaaaaact aaactattga 2940
aagcgtgatt tacaacaaca ctcgacttta caaagattat tcaaaaagag caaaaactct 3000
taacatattc tattaaagat atataatata attaaaacga aattaaataa taacaataaa 3060
acctttagaa tttgtaataa aatccataaa aacaaatgaa aacagttatg gtttgtacag 3120
cgccatctgt tattactttg acaaaatcac tatgactatc tgaccttgtc ttacacgtta 3180
acaattctta ttctgtcctt atctataagc caagtaccaa gcttaaattc gtatggctta 3240
tagttgacga tttttaaatt ctcaaggtat gtacttattt aatattaata agtactaatt 3300
gttaaaatca tctaaaacaa ttcagtgatt tacaacaatg tgtactacat aacctaatac 3360
ttataaattt attaaactgt attgattctt ttaggtcaat catcatgact ttaggagact 3420
tggtgtctca ggaaaaagga acgcaaaaag attgaggcgt ttgaaatgta ttgctggaga 3480
aagctgctac gcattccttg gacagcttgg cgcgccatct cgacgcattc gcgaagtacc 3540
gatctccaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 3600
ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 3660
ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc gcctgatgcg 3720
gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatggtgca ctctcagtac 3780
aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 3840
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 3900
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcga 3947
<210> 9
<211> 619
<212> DNA
<213> Artificial
<400> 9
agctgtccaa ggaatgcgta gcagctttct ccagcaatac atttcaaacg cctcaatctt 60
tttgcgttcc tttttcctga gacaccaagt ctcctaaagt catgatgatt gacctaaaag 120
aatcaataca gtttaataaa tttataagta ttaggttatg tagtacacat tgttgtaaat 180
cactgaattg ttttagatga ttttaacaat tagtacttat taatattaaa taagtacata 240
ccttgagaat ttaaaaatcg tcaactataa gccatacgaa tttaagcttg gtacttggct 300
tatagataag gacagaataa gaattgttaa cgtgtaagac aaggtcagat agtcatagtg 360
attttgtcaa agtaataaca gatggcgctg tacaaaccat aactgttttc atttgttttt 420
atggatttta ttacaaattc taaaggtttt attgttatta tttaatttcg ttttaattat 480
attatatatc tttaatagaa tatgttaaga gtttttgctc tttttgaata atctttgtaa 540
agtcgagtgt tgttgtaaat cacgctttca atagtttagt ttttttaggt atatatacaa 600
aatatcgtgc tctacaagt 619
<210> 10
<211> 82
<212> DNA
<213> Artificial
<400> 10
gttttagagc tagaaatagc aagttaaaat aaggctagtc cgttatcaac ttgaaaaagt 60
ggcaccgagt cggtgctttt tt 82
<210> 11
<211> 19
<212> DNA
<213> Artificial
<400> 11
agctgtccaa ggaatgcgt 19
<210> 12
<211> 27
<212> DNA
<213> Artificial
<400> 12
atatacaaaa tatcgtgctc tacaagt 27
<210> 13
<211> 28
<212> DNA
<213> Artificial
<400> 13
gttttagagc tagaaatagc aagttaaa 28
<210> 14
<211> 25
<212> DNA
<213> Artificial
<400> 14
gctaaatcga taaagatctt tcatt 25
<210> 15
<211> 70
<212> DNA
<213> Artificial
<220>
<221> misc_feature
<222> (26)..(45)
<223> n is a, c, g, or t
<400> 15
tacaaaatat cgtgctctac aagtgnnnnn nnnnnnnnnn nnnnngtttt agagctagaa 60
atagcaagtt 70
<210> 16
<211> 49
<212> DNA
<213> Artificial
<400> 16
tcgatgatga tgatcaattg tggcgcgcca agctgtccaa ggaatgcgt 49
<210> 17
<211> 25
<212> DNA
<213> Artificial
<400> 17
ttgtagagca cgatattttg tatat 25
<210> 18
<211> 50
<212> DNA
<213> Artificial
<400> 18
tcaatagttt agttttttta ggtatatata caaaatatcg tgctctacaa 50
<210> 19
<211> 49
<212> DNA
<213> Artificial
<400> 19
aagttgataa cggactagcc ttattttaac ttgctatttc tagctctaa 49
<210> 20
<211> 25
<212> DNA
<213> Artificial
<400> 20
ttagagctag aaatagcaag ttaaa 25
<210> 21
<211> 49
<212> DNA
<213> Artificial
<400> 21
accgatcgat cctaggcgct agctaatgaa agatctttat cgatttagc 49
<210> 22
<211> 30
<212> DNA
<213> Artificial
<220>
<221> misc_feature
<222> (1)..(12)
<223> n is a, c, g, or t
<400> 22
nnnnnnnnnn nntaaatcac gctttcaata 30
<210> 23
<211> 29
<212> DNA
<213> Artificial
<220>
<221> misc_feature
<222> (1)..(12)
<223> n is a, c, g, or t
<400> 23
nnnnnnnnnn nncgactcgg tgccacttt 29

Claims (9)

1.家蚕基于CRISPR/Cas9***的全基因组敲除载体文库的构建方法,其特征在于,具体步骤如下:
(1)构建piggyBac转座子***介导的真核生物CRISPR/Cas9敲除载体,命名为pB-CRISPR,其核苷酸序列如SEQ ID NO.1所示,将其作为骨架载体;
(2)设计敲除家蚕全部基因的sgRNA,其核苷酸序列是位于基因区符合NNNNNNNNNNNNNNNNNNNNNGG特征的序列;
(3)将步骤(2)的sgRNA整合到步骤(1)的骨架载体上,构建得到所述载体文库。
2.根据权利要求1所述的构建方法,其特征在于,步骤(1)中,载体pB-CRISPR包括Hr3CQ Enhancer-Hsp70启动子启动的spCas9蛋白编码序列,IE2启动子启动的Zeocin抗性蛋白编码序列,U6启动子启动的sgRNA表达框和piggyBac转座子的转座臂。
3.根据权利要求1所述的构建方法,其特征在于,步骤(1)的具体方法是:
(1-1)合成包含Zeocin抗性基因表达框的载体PUC57-IE2-Zeocin-Ser1PA;
(1-2)将载体PUC57-IE2-Zeocin-Ser1PA上的Zeocin抗性基因表达框IE2-Zeocin-Ser1PA表达框连接到piggyBac转座子基础载体piggyBacModify上,构建成中间载体pB-Modified{IE2-Zeocin-Ser1PA};
(1-3)将hr3-hsp70-Cas9-sv40表达框从载体pUC57-hr3-hsp70-Cas9-sv40上扩增出来,然后用无缝克隆的方法连接到pB-Modified{IE2-Zeocin-Ser1PA}上,构建成中间载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40};
(1-4)将U6-gRNA从载体pUC57-U6-gRNA扩增出来,用酶切连接的方法连接到载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40}的AscI/NheI位点,构建成真核生物基因敲除基础载体pB-Modified{IE2-Zeocin-Ser1PA}{U6-gRNA}{hr3-hsp70-Cas9-SV40},命名为pB-CRISPR;
其中,载体PUC57-IE2-Zeocin-Ser1PA,核苷酸序列如SEQ ID NO.2所示;
piggyBac转座子基础载体piggyBacModify,核苷酸序列如SEQ ID NO.3所示;
中间载体pB-Modified{IE2-Zeocin-Ser1PA},核苷酸序列如SEQ ID NO.4所示;
载体pUC57-hr3-hsp70-Cas9-sv40,核苷酸序列如SEQ ID NO.5所示;
中间载体pB-Modified{IE2-Zeocin-Ser1PA}{hr3-hsp70-Cas9-SV40},核苷酸序列如SEQ ID NO.7所示;
载体pUC57-U6-gRNA,核苷酸序列如SEQ ID NO.8所示。
4.根据权利要求1所述的构建方法,其特征在于,步骤(1)中,pB-CRISPR包含家蚕U6启动子和sgRNA的骨架,它们的核苷酸序列分别如SEQ ID NO.9和SEQ ID NO.10所示。
5.根据权利要求4所述的构建方法,其特征在于,基于家蚕U6启动子在第一个核苷酸为鸟嘌呤核苷酸“G”时的启动效率最高,全部sgRNA序列的5’端添加一个鸟嘌呤核苷酸“G”。
6.根据权利要求4所述的构建方法,其特征在于,步骤(2)的具体方法是:基于spCas9作用规律,设计家蚕全部编码蛋白的基因的打靶位点,每个基因设计约6个打靶位点,总计94000个。
7.根据权利要求4所述的构建方法,其特征在于,所述打靶位点包括23个核苷酸,具有如下规律:5’-NNNNNNNNNNNNNNNNNNNN-NGG-3’,N表示碱基A、T、G或C,所有打靶位点尽量在CDS序列的前半部分,靠近PAM区的种子序列部分12bp核苷酸不能在基因组上有重复区域。
8.利用权利要求1~7中任一项所述方法构建得到的家蚕基于CRISPR/Cas9***的全基因组敲除载体文库。
9.权利要求8所述载体文库在家蚕基因组突变中的应用。
CN202010379321.2A 2020-05-07 2020-05-07 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法 Pending CN111549062A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010379321.2A CN111549062A (zh) 2020-05-07 2020-05-07 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010379321.2A CN111549062A (zh) 2020-05-07 2020-05-07 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法

Publications (1)

Publication Number Publication Date
CN111549062A true CN111549062A (zh) 2020-08-18

Family

ID=72007935

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010379321.2A Pending CN111549062A (zh) 2020-05-07 2020-05-07 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法

Country Status (1)

Country Link
CN (1) CN111549062A (zh)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111534542A (zh) * 2020-05-07 2020-08-14 西南大学 piggyBac转座子***介导的真核生物转基因细胞系及构建方法
CN112981544A (zh) * 2021-02-25 2021-06-18 通用生物***(安徽)有限公司 一种sgRNA文库构建方法
CN114015690A (zh) * 2021-11-08 2022-02-08 南阳师范学院 一种家蚕shRNA表达载体构建方法及应用
CN114540421A (zh) * 2022-03-04 2022-05-27 西南大学 一种针对家蚕msg和psg表达基因的可控编辑方法
CN116356431A (zh) * 2023-03-30 2023-06-30 内蒙古农业大学 基于牛全基因组CRISPR-Cas9敲除文库、敲除细胞库及筛选目标基因的方法

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102492692A (zh) * 2011-12-16 2012-06-13 西南大学 增强子Hr3
WO2014204727A1 (en) * 2013-06-17 2014-12-24 The Broad Institute Inc. Functional genomics using crispr-cas systems, compositions methods, screens and applications thereof
CN107043782A (zh) * 2017-04-10 2017-08-15 西南大学 一种基因敲除方法及其sgRNA片段与应用
CN107119021A (zh) * 2017-07-04 2017-09-01 王小平 Pd‑1敲除cd19car‑t细胞的制备
CN107699547A (zh) * 2017-09-05 2018-02-16 上海科技大学 Pd‑1基因沉默的靶向cd133的car t细胞及其应用
CN108192900A (zh) * 2018-01-16 2018-06-22 西南大学 烟草烟碱去甲基化酶基因cyp82e4的编辑失活载体及其应用
US20180288988A1 (en) * 2017-03-30 2018-10-11 Utah State University Transgenic silkworms expressing spider silk
CN109652458A (zh) * 2018-12-28 2019-04-19 郑敦武 基于piggyBAC-Cas9***构建基因敲除细胞株的方法

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102492692A (zh) * 2011-12-16 2012-06-13 西南大学 增强子Hr3
WO2014204727A1 (en) * 2013-06-17 2014-12-24 The Broad Institute Inc. Functional genomics using crispr-cas systems, compositions methods, screens and applications thereof
US20180288988A1 (en) * 2017-03-30 2018-10-11 Utah State University Transgenic silkworms expressing spider silk
CN107043782A (zh) * 2017-04-10 2017-08-15 西南大学 一种基因敲除方法及其sgRNA片段与应用
CN107119021A (zh) * 2017-07-04 2017-09-01 王小平 Pd‑1敲除cd19car‑t细胞的制备
CN107699547A (zh) * 2017-09-05 2018-02-16 上海科技大学 Pd‑1基因沉默的靶向cd133的car t细胞及其应用
CN108192900A (zh) * 2018-01-16 2018-06-22 西南大学 烟草烟碱去甲基化酶基因cyp82e4的编辑失活载体及其应用
CN109652458A (zh) * 2018-12-28 2019-04-19 郑敦武 基于piggyBAC-Cas9***构建基因敲除细胞株的方法

Non-Patent Citations (11)

* Cited by examiner, † Cited by third party
Title
BAOSHENG等: "Expansion of CRISPR targeting sites in Bombyx mori", 《INSECT BIOCHEMISTRY & MOLECULAR BIOLOGY》 *
CHANG J等: "Genome-wide CRISPR screening reveals genes essential for cell viability and resistance to abiotic and biotic stresses in Bombyx mori", 《GENOME RESEARCH》 *
CHUNLONG XU等: "piggyBac mediates efficient in vivo CRISPR library screening for tumorigenesis in mice.", 《PNAS》 *
GANG W等: "Efficient, footprint-free human iPSC genome editing by consolidation of Cas9/CRISPR and piggyBac technologies", 《 NATURE PROTOCOLS》 *
LIU T H等: "A newly discovered member of the Atlastin family, BmAtlastin-n, has an antiviral effect against BmNPV in Bombyx mori", 《SCIENTIFIC REPORTS》 *
NCBI: "Spodoptera aff. frugiperda 2 RZ-2014 B2AR-Fusion promoter region", 《GENBANK DATEBASE》 *
SANYUANMA等: "An integrated CRISPR Bombyx mori genome editing system with improved efficiency and expanded target sites", 《INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY》 *
常珈菘: "家蚕全基因组编辑细胞库的构建及其应用", 《中国博士学位论文全文数据库(电子期刊)》 *
王彦芹等: "《现代分子生物学实验指导》", 31 March 2017, 西安交通大学出版社 *
王珏 等: "利用CRISPR/Cas9和piggyBac实现果蝇基因组无缝编辑", 《遗传》 *
赵爱春等: "《家蚕转基因技术及应用》", 30 December 2017, 上海科学技术出版社 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111534542A (zh) * 2020-05-07 2020-08-14 西南大学 piggyBac转座子***介导的真核生物转基因细胞系及构建方法
CN112981544A (zh) * 2021-02-25 2021-06-18 通用生物***(安徽)有限公司 一种sgRNA文库构建方法
CN114015690A (zh) * 2021-11-08 2022-02-08 南阳师范学院 一种家蚕shRNA表达载体构建方法及应用
CN114540421A (zh) * 2022-03-04 2022-05-27 西南大学 一种针对家蚕msg和psg表达基因的可控编辑方法
CN114540421B (zh) * 2022-03-04 2024-04-16 西南大学 一种针对家蚕msg和psg表达基因的可控编辑方法
CN116356431A (zh) * 2023-03-30 2023-06-30 内蒙古农业大学 基于牛全基因组CRISPR-Cas9敲除文库、敲除细胞库及筛选目标基因的方法

Similar Documents

Publication Publication Date Title
CN113227368B (zh) 工程化酶
CN111549062A (zh) 家蚕基于CRISPR/Cas9***的全基因组敲除载体文库及构建方法
DK2087106T3 (en) MUTATING DELTA8 DESATURATION GENES CONSTRUCTED BY TARGETED MUTAGENES AND USE THEREOF IN THE MANUFACTURE OF MULTI-Saturated FAT ACIDS
CN112639104B (zh) 源自耐有机酸的酵母的新型启动子及使用其表达靶基因的方法
CN109689856A (zh) 用于海藻宿主细胞的CRISPR-Cas***
WO2009056423A2 (de) Fermentative gewinnung von aceton aus erneuerbaren rohstoffen mittels neuen stoffwechselweges
CA2486392A1 (en) Method for the stable expression of nucleic acids in transgenic plants, controlled by a parsley-ubiquitin promoter
CN101827938A (zh) 涉及rt1基因、相关的构建体和方法的具有改变的根构造的植物
CN113584033B (zh) 一种CRISPR/Cpf1基因编辑***及其构建方法和在赤霉菌中的应用
CN115698297A (zh) 多模块生物合成酶基因组合文库的制备方法
CN111549060A (zh) 一种真核生物CRISPR/Cas9全基因组编辑细胞文库及构建方法
CN111534543A (zh) 一种真核生物CRISPR/Cas9敲除***、基础载体、载体及细胞系
CN113549562B (zh) 一种高效生产广藿香醇的工程菌及其构建方法和应用
CN101868545B (zh) 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法
CN116200368A (zh) 一种基于c2c9核酸酶的新型基因组编辑***及其应用
CN111534541A (zh) 一种真核生物CRISPR-Cas9双gRNA载体及构建方法
CN106086054A (zh) 一种幽门螺杆菌基因无痕敲除的方法
CN101848931B (zh) 具有改变的根构造的植物、涉及编码exostosin家族多肽及其同源物的基因的相关的构建体和方法
CN113186140B (zh) 用于预防和/或治疗宿醉和肝病的基因工程细菌
CN106399373B (zh) 一种Cas9表达载体
CN111041039B (zh) 一种嗜热厌氧乙醇杆菌基因组编辑载体及其应用
CN115093482A (zh) 一种高精确性腺嘌呤碱基编辑器及其用途
CN112209883B (zh) 一种与rna特异性结合的荧光素染料及其应用
CN114058607B (zh) 一种用于c到u碱基编辑的融合蛋白及其制备方法和应用
KR102194368B1 (ko) 미생물의 유전체를 개량하기 위한 변이 균주, 이의 제조방법 및 이를 이용한 미생물의 유전체 개량 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200818