CN108431221A - 用于转化梭状芽胞杆菌属细菌的遗传工具 - Google Patents

用于转化梭状芽胞杆菌属细菌的遗传工具 Download PDF

Info

Publication number
CN108431221A
CN108431221A CN201680068812.7A CN201680068812A CN108431221A CN 108431221 A CN108431221 A CN 108431221A CN 201680068812 A CN201680068812 A CN 201680068812A CN 108431221 A CN108431221 A CN 108431221A
Authority
CN
China
Prior art keywords
bacterium
cas9
tool
sequence
clostruidium
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201680068812.7A
Other languages
English (en)
Inventor
弗朗索瓦·瓦塞尔斯
尼古拉斯·洛普斯费雷拉
弗洛伦特·科拉斯
安娜·洛佩斯孔特拉斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wageningen Research Foundation
IFP Energies Nouvelles IFPEN
Stichting Dienst Landbouwkundig Onderzoek DLO
Original Assignee
Wageningen Research Foundation
IFP Energies Nouvelles IFPEN
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wageningen Research Foundation, IFP Energies Nouvelles IFPEN filed Critical Wageningen Research Foundation
Publication of CN108431221A publication Critical patent/CN108431221A/zh
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/102Mutagenizing nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • C12N1/205Bacterial isolates
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/16Hydrolases (3) acting on ester bonds (3.1)
    • C12N9/22Ribonucleases RNAses, DNAses
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • C12P7/06Ethanol, i.e. non-beverage
    • C12P7/065Ethanol, i.e. non-beverage with microorganisms other than yeasts
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/24Preparation of oxygen-containing organic compounds containing a carbonyl group
    • C12P7/26Ketones
    • C12P7/28Acetone-containing products
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/20Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/145Clostridium
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E50/00Technologies for the production of fuel of non-fossil origin
    • Y02E50/10Biofuels, e.g. bio-diesel

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Mycology (AREA)
  • Virology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

本发明涉及包含至少两种不同核酸的遗传工具,其允许通过同源重组来转化梭状芽胞杆菌(Clostridium)属细菌、通常为产溶剂细菌。

Description

用于转化梭状芽胞杆菌属细菌的遗传工具
本发明涉及包含至少两种不同核酸的遗传工具,其允许通过同源重组来转化梭状芽胞杆菌(Clostridium)属细菌、通常为梭状芽胞杆菌属的产溶剂细菌。
背景技术
属于厚壁菌(Firmicutes)门梭状芽胞杆菌属的细菌是能够形成内生孢子的专性厌氧革兰氏阳性杆菌。该属包含众多因其致病性质或其工业和医学利益而被研究的物种。例如,破伤风梭状芽胞杆菌(Clostridium tetani)、肉毒梭状芽胞杆菌(Clostridiumbotulinum)、产气荚膜梭状芽胞杆菌(Clostridium perfringens)和难辨梭状芽胞杆菌(Clostridium difficile)分别是造成破伤风、肉毒中毒、气性坏疽和假膜性结肠炎的因素。同时,其他物种例如丙酮丁醇梭状芽胞杆菌(Clostridium acetobutylicum)、丁酸梭状芽胞杆菌(Clostridium butyricum)和拜氏梭状芽胞杆菌(Clostridium beijerinckii)对人类无致病性,被用于发酵。最近,诺维氏梭状芽胞杆菌(Clostridium novyi)和产芽孢梭状芽胞杆菌(Clostridium sporogenes)近来已被用于旨在开发抗癌疗法的研究中。
工业上感兴趣的梭状芽胞杆菌属物种能够从各种各样的从葡萄糖到纤维素范围内的糖和底物产生溶剂。产生溶剂的梭状芽胞杆菌以二次生长(diauxic growth)为特征。酸(乙酸和丁酸)在指数生长期期间产生。然后,当细胞生长停止并且所述细菌进入静止期时,它们产生溶剂。
大多数产溶剂梭状芽胞杆菌菌株产生丙酮、丁醇和乙醇作为最终产物。这些菌株被称为“ABE菌株”。例如,丙酮丁醇梭状芽胞杆菌(C.acetobutylicum)ATCC824和拜氏梭状芽胞杆菌(C.beijerinckii)NCIMB 8052菌株就是这种情况。其他菌株也能够将丙酮还原成异丙醇,并被称为“IBE菌株”。例如,拜氏梭状芽胞杆菌(C.beijerinckii)DSM 6423(NRRLB593)菌株就是这种情况,其在基因组中具有编码伯/仲醇脱氢酶的adh基因,所述伯/仲醇脱氢酶使丙酮还原为异丙醇。
在引入来自拜氏梭状芽胞杆菌DSM 6423的表达adh基因的质粒后,由丙酮丁醇梭状芽胞杆菌ATCC 824产生异丙醇是可能的。这样的遗传修饰菌株表现的方式与DSM 6423菌株相同。在分别编码参与酸的再同化的CoA转移酶、和乙酰乙酸脱羧酶的ctfA、ctfB和adc基因的操纵子结构内过表达之后,可以改善这种性能(Collas等人,2012)。引入含有这些基因的质粒修饰了ATCC824菌株的发酵概况以产生IBE混合物。然而,需要在生长培养基中存在抗生素来维持这种遗传构建物,使得不可能将该菌株用于工业应用。
尽管梭状芽胞杆菌属细菌有着不可否认的利益,但由于获得遗传修饰菌株的难度,研究和/或修改其代谢的工作做得很少。最稳固的***是基于允许精准和稳定修饰基因组的同源重组事件。然而,在梭状芽胞杆菌中观察到的同源重组频率非常低,选择标记(例如抗生素抗性基因)和反选择标记(例如编码毒素的基因)被证明是必要的。近来已经开发了两种工具(Al-Hinai等人,2012;Cartman等人,2012),每种都基于使用单一质粒。虽然有创新,但这些***有缺点。第一个***的缺点是在修饰位点留下一个FRT盒(在切除可选择性标记期间使用),它可改变突变体的遗传环境并阻止了重复使用该工具进行大量修饰。这些缺点是本领域技术人员公知的。需要两个顺序的同源重组事件的第二个***,不能用于修饰基因组的必需区域。然后开发了涉及顺序使用两种质粒的工具,其中一种编码大范围核酸酶I-SceI,其能够在特定靶位点处诱导双链DNA的断裂并促进同源重组事件(Zhang等人,2015)。这里也一样,因需要进行两个顺序的同源重组事件,使某些必需基因的修饰变得不可能。迄今为止,开发的最新一代工具(Wang等人,2015;Xu等人,2015)是基于CRISPR(成簇规律间隔短回文重复序列(Clustered Regularly Interspaced Short PalindromicRepeats))技术,其以类似于在真核生物中观察到的RNA干扰的方式工作(Barrangou等人,2007)。Wang等人和Xu等人描述的工具,分别适于拜氏梭状芽胞杆菌和解纤维梭状芽胞杆菌(C.cellulolyticum),基于使用单一质粒。Xu等人使用Cas9酶的修饰形式产生单链断裂而不是双链断裂。这两种可利用的基于CRISPR技术的工具的主要缺点是明显限制可***细菌基因组中的目标核酸的大小(并因此限制编码序列或基因的数量)(根据Xu等人,最多约1.8kb)。
发明内容
本发明的目的是提供一种适用于整个梭状芽胞杆菌属的遗传工具,特别是适用于梭状芽胞杆菌属的产溶剂细菌,使得修饰细菌基因组以允许在工业规模上使用梭状芽胞杆菌属细菌第一次成为可能。
本发明因此涉及允许通过同源重组来转化梭状芽胞杆菌属细菌、优选梭状芽胞杆菌属的产溶剂细菌的遗传工具,其特征在于所述遗传工具包含至少两种不同的核酸。这种工具例如能够修饰梭状芽胞杆菌属细菌基因组的区域,包括细菌存活所必需的序列,或允许***大片段的核酸序列,这是使用现有工具不可能的。
因此,本发明的第一个目的涉及允许通过同源重组来转化梭状芽胞杆菌属的产溶剂细菌的遗传工具,其特征在于所述遗传工具包含:
-至少编码Cas9的第一核酸,其中所述Cas9编码序列被置于启动子的控制下,和
-至少一种含有修复模板的第二核酸,其允许通过同源重组机制使Cas9靶向的细菌DNA的一部分被目标序列替换,
并且i)所述核酸中的至少一种还编码一种或多种向导RNA(gRNA),或ii)所述遗传工具还包含一种或多种向导RNA,每种向导RNA包含Cas9酶结合性RNA结构以及与所述细菌DNA的靶向部分互补的序列。
本发明进一步涉及通过同源重组来转化和/或遗传修饰梭状芽胞杆菌属细菌、通常为梭状芽胞杆菌属的产溶剂细菌的方法,其特征在于它包含将本发明的遗传工具引入所述细菌的步骤。本发明还涉及如此转化和/或遗传修饰的梭状芽胞杆菌属细菌。
发明人还公开了一种用于转化和/或遗传修饰梭状芽胞杆菌属细菌或使用梭状芽胞杆菌属细菌产生至少一种溶剂例如溶剂混合物的试剂盒,其包含本发明的遗传工具的组分,并任选地,特别是一种或多种与在所述工具内使用的选定诱导型启动子相适应的诱导物。
还公开了本发明的遗传工具、通过同源重组来转化和/或遗传修饰梭状芽胞杆菌属细菌的方法、如此转化和/或遗传修饰的细菌在以工业规模生产溶剂或溶剂混合物中的应用,所述溶剂或溶剂混合物优选为丙酮、丁醇、乙醇、异丙醇或其混合物,通常为异丙醇/丁醇混合物。
具体实施方式
CRISPR(成簇规律间隔短回文重复序列)是指在细菌和古细菌中具有针对噬菌体和质粒的免疫防御作用的基因的基因座。CRISPR-Cas9***本质上是基于诱导靶基因组中双链断裂的Cas9蛋白和负责切割位点特异性的向导RNA(gRNA)的组合。这种在DNA中产生靶向的双链断裂的能力使得有可能促进将突变引入目标菌株的基因组中所必需的同源重组事件。细胞活性取决于基因组的完整性。细菌必须修复其DNA中的任何断裂,无论是通过非同源末端连接机制还是通过需要修复模板的同源重组机制。通过给细胞提供这样的模板,就有可能修饰断裂位点所对应的区域(图2)。CRISPR在DNA分子中制造双链断裂的能力使它最近在各种生物体中被用作遗传工具,所述生物体特别是化脓性链球菌(Streptococcuspyogenes)、大肠杆菌(E.coli)和真核细胞,其中CRIPSPR在化脓性链球菌中被首次鉴定。(Jiang等人,2013;Cong等人,2013;Hwang等人,2013;Hsu等人,2013)。最近,借助于遗传工具,它被用于拜氏梭状芽胞杆菌和解纤维梭状芽胞杆菌,所述遗传工具仅允许有限修饰所述细菌基因组,其在工业规模上是不切实际的(Wang等人,2015;Xu等人,2015)。
在本文中首次公开了允许通过同源重组来转化梭状芽胞杆菌属细菌并包含至少两种不同核酸的遗传工具。本发明人已经证明,这种工具使得有可能以足够有效的方式转化和/或遗传修饰梭状芽胞杆菌属的产溶剂细菌,使它们从工业角度来看尤为有用,从而满足了长期以来所表达的需求。
本发明的独特的遗传工具,允许通过同源重组来转化梭状芽胞杆菌属细菌,所述遗传工具包含:
-至少编码Cas9的第一核酸,其中所述Cas9编码序列被置于启动子的控制下,和
-至少一种含有修复模板的第二核酸,其允许通过同源重组机制使Cas9靶向的细菌DNA的一部分被目标序列替换,
条件在于i)所述核酸中的至少一种还编码一种或多种向导RNA(gRNA),或ii)所述遗传工具还包含一种或多种向导RNA。在该工具中,每种向导RNA包含Cas9酶结合性RNA结构以及与所述细菌DNA的靶向部分互补的序列。
用语“梭状芽胞杆菌属细菌”旨在特别表示工业感兴趣的梭状芽胞杆菌物种,通常为梭状芽胞杆菌属的产溶剂细菌。词语“梭状芽胞杆菌属细菌”包括野生型细菌以及源自于其的遗传修饰的菌株,所述遗传修饰目的在于在不用暴露于CRISPR***的情况下改善它们的表现(例如过表达ctfA、ctfB和adc基因)。
用语“工业感兴趣的梭状芽胞杆菌物种”旨在表示能够通过发酵从单糖例如葡萄糖、木糖、果糖或甘露糖、从多糖例如纤维素或半纤维素、从酸例如丁酸或乙酸、或从梭状芽胞杆菌属细菌可吸收和可用的任何其它碳源(例如CO、CO2和甲醇)产生溶剂的物种。感兴趣的产溶剂细菌的例子是产生丙酮、丁醇、乙醇和/或异丙醇的梭状芽胞杆菌属细菌,例如文献中被定为“ABE菌株”的菌株[产生丙酮、丁醇和乙醇作为发酵产物的菌株]和“IBE菌株”[产生异丙醇(通过还原丙酮)、丁醇和乙醇作为发酵产物的菌株]。梭状芽胞杆菌属的产溶剂细菌可选自丙酮丁醇梭状芽胞杆菌、解纤维梭状芽胞杆菌、C.phytofermentans、拜氏梭状芽胞杆菌、糖丁酸梭状芽胞杆菌(C.saccharobutylicum)、糖丁基丙酮梭状芽胞杆菌(C.saccharoperbutylacetonicum)、产芽孢梭状芽胞杆菌(C.sporogenes)、丁酸梭状芽胞杆菌(C.butyricum)、金黄丁酸梭状芽胞杆菌(C.aurantibutyricum)和酪丁酸梭状芽胞杆菌(C.tyrobutyricum),优选选自丙酮丁醇梭状芽胞杆菌、拜氏梭状芽胞杆菌、丁酸梭状芽胞杆菌和酪丁酸梭状芽胞杆菌以及解纤维梭状芽胞杆菌,并更优选选自丙酮丁醇梭状芽胞杆菌和拜氏梭状芽胞杆菌。
在一个具体实施方式中,涉及的梭状芽胞杆菌属细菌是“ABE菌株”,优选丙酮丁醇梭状芽胞杆菌ATCC824菌株或拜氏梭状芽胞杆菌NCIMB 8052菌株。
在另一个具体实施方式中,涉及的梭状芽胞杆菌属细菌是“IBE菌株”,优选拜氏梭状芽胞杆菌DSM 6423(也被识别为NRRL B593菌株)。
CRISPR***含有两种不同的组分,即i)内切核酸酶,在目前的情况下是与CRISPR***相关的核酸酶(Cas或“CRISPR相关蛋白”),Cas9,和ii)向导RNA。所述向导RNA是嵌合RNA的形式,其由细菌CRISPR RNA(crRNA)和tracrRNA(反式激活CRISPR RNA)的组合组成(Jinek等人,Science 2012–参见图3)。所述gRNA将充当Cas蛋白的向导的“间隔序列”所对应的crRNA的靶向特异性和tracrRNA的构象性质结合在单个转录本中。当所述gRNA和Cas9蛋白在细胞中同时表达时,靶基因组序列可被永久修饰或中断。所述修饰有利地由修复模板引导。
本发明的遗传工具包含至少编码Cas9的第一核酸。术语“Cas9”旨在表示Cas9蛋白质(也称为Csn1或Csx12)或其功能性蛋白、肽或多肽片段,“功能性”是指能够与所述一种或多种向导RNA相互作用并且能够实施酶(核酸酶)活性使该酶能够在靶基因组的DNA中产生双链断裂。“Cas9”因而可以表示修饰的蛋白质,例如被截短以去除对于蛋白质的预定义功能并非必不可少的蛋白质结构域,特别是对于与所述一种或多种gRNA相互作用而言不必要的结构域。
如本发明的环境中使用的编码Cas9的序列(整个蛋白质或其片段)可以从任何已知的Cas9蛋白获得(Makarova等人,2011)。可用于本发明的Cas9蛋白的例子包括但不限于来自化脓性链球菌、嗜热链球菌(Streptococcus thermophilus)、变异链球菌(Streptococcus mutans)、空肠弯曲杆菌(Campylobacter jejuni)、新凶手弗朗西斯菌(Francisella novicida)和脑膜炎奈瑟氏球菌(Neisseria meningitidis)的Cas9蛋白。其他可用于本发明的Cas9蛋白也在Fonfara等人,2013的文章中描述。
在一个具体实施方式中,由本发明的遗传工具的一种核酸编码的Cas9蛋白或其功能性蛋白、肽或多肽片段包含氨基酸序列SEQ ID NO:1或与其具有至少50%、优选至少60%同一性并至少含有占据氨基酸序列SEQ ID NO:1的第10位(“D10”)和第840位(“D840”)的两个天冬氨酸(“D”)的任何其它氨基酸序列,或由前述的氨基酸序列组成。
在一个优选实施方式中,Cas9包含由来自化脓性链球菌M1 GAS菌株(NCBI登录号:NC_002737.2SPy_1046,SEQ ID NO:2)的cas9基因或其经历优化的形式(“优化形式”)编码的Cas9蛋白(NCBI登录号:WP_010922251.1,SEQ ID NO:1),或由所述Cas9蛋白组成,所述优化形式产生含有优先被梭状芽胞杆菌属细菌使用的密码子的转录本,所述密码子通常是富含腺嘌呤(“A”)和胸腺嘧啶(“T”)碱基的密码子,从而允许在该细菌属内促进Cas9蛋白的表达。这些优化的密码子遵守每种细菌菌株特定的密码子使用偏倚,这对本领域技术人员来说是公知的。
在本文件公开的肽序列中,氨基酸根据以下命名法由它们的单字母代码表示:C:半胱氨酸;D:天冬氨酸;E:谷氨酸;F:苯丙氨酸;G:甘氨酸;H:组氨酸;I:异亮氨酸;K:赖氨酸;L:亮氨酸;M:甲硫氨酸;N:天冬酰胺;P:脯氨酸;Q:谷氨酰胺;R:精氨酸;S:丝氨酸;T:苏氨酸;V:缬氨酸;W:色氨酸和Y:酪氨酸。
根据一个具体实施方式,Cas9结构域由完整的Cas9蛋白、优选来自化脓链球菌的Cas9蛋白或其优化形式组成。
在本发明的遗传工具的核酸之一内存在的Cas9编码序列被置于启动子的控制下。该启动子可以是组成型启动子或诱导型启动子。在一个优选实施方式中,控制Cas9表达的启动子是诱导型启动子。
在本发明的情况下有用的组成型启动子的例子可选自th1基因、ptb基因、adc基因、BCS操纵子的启动子或其衍生物,优选有功能但较短(截短)的衍生物,例如来自丙酮丁醇梭状芽胞杆菌的th1基因启动子的“miniPth1”衍生物(Dong等人,2012),或者本领域技术人员公知的允许在梭状芽胞杆菌内表达蛋白质的任何其他启动子。
在本发明的情况下有用的诱导型启动子的例子可选自例如表达受转录阻遏子TetR控制的启动子,例如tetA基因(最初存在于大肠杆菌转座子Tn10上的四环素抗性基因)的启动子;表达受L-***糖控制的启动子,例如基因ptk的启动子(参见Zhang J.等人,2015),优选与来自丙酮丁醇梭状芽胞杆菌的araR调节子表达盒组合以构建ARAi***(参见Zhang J.等人,2015);表达受昆布二糖(β-1,3葡萄糖二聚体)控制的启动子,例如celC基因启动子,优选紧随其后的是阻遏基因glyR3和目标基因(参见Mearls EB等人(2015)),或celC基因启动子(参见Newcomb M.等人,2011);表达受乳糖控制的启动子,例如bgaL基因启动子,优选紧随其后的是AdhE1(醛/醇脱氢酶)基因(参见Banerjee等人,2014);表达受木糖控制的启动子,例如xylB基因启动子(see Nariya H等人,2011);以及表达受UV暴露控制的启动子,例如bcn启动子(see Dupuy等人,2005)。
由上述启动子之一衍生的启动子,优选有功能但较短(截短的)衍生物也可有利地用于本发明的情况中。
在本发明中有用的其它诱导型启动子也在例如Ransom EM等人(2015)、Currie DH等人(2013)、D’Urzo N等人(2013)和Hartman AH等人(2011)的文章中描述。
优选的诱导型启动子是由tetA衍生的无水四环素(aTc)诱导型启动子(aTc毒性低于四环素,并且能够在较低浓度下解除转录阻遏子TetR的抑制),其选自Pcm–2tetO1和Pcm–2tetO2/1(Dong等人,2012)。
另一种优选的诱导型启动子是由xylB衍生的木糖诱导型启动子,例如来自难辨梭状芽胞杆菌630的xylB启动子(Nariya等人,2011)。
如本发明中公开的诱导型启动子使得有可能有利地控制所述酶的作用并便于选择已经历了期望的遗传修饰的转化体。
术语“向导RNA”或“gRNA”在本发明的含义内是指能够与“Cas9”相互作用以便将其引导到细菌染色体的靶区域的RNA分子。断裂的特异性由gRNA决定。如上所说明的,每个gRNA包含两个区域:
–第一区域(通常称为“SDS”区域),在gRNA的5’末端处,其与靶染色体区域互补并模拟内源性CRISPR***的crRNA,和
–第二区域(通常称为“处理”区域),在gRNA的3’末端,其模拟内源性CRISPR***的tracrRNA(反式激活crRNA)和crRNA之间的碱基配对相互作用并具有在3’方向上以基本单链序列结束的双链茎环结构。该第二区域对于gRNA与Cas9的结合是必不可少的。
所述gRNA的第一区域(SDS区域)根据靶向的染色体序列而变化。
所述gRNA的与靶染色体区域互补的SDS区域包含至少1个核苷酸,优选至少1、2、3、4、5、10、15、20、25、30、35或40个核苷酸,通常在1和40个核苷酸之间。优选地,该区域具有20、21、22、23、24、25、26、27、28、29或30个核苷酸的长度。
所述gRNA的第二区域(处理区域)具有茎环(或发夹)结构。不同gRNA的处理区域不取决于所选择的染色体靶标。
根据一个具体实施方式,所述处理区域包含至少1个核苷酸、优选至少1、50、100、200、500和1000个核苷酸、通常在1和1000个核苷酸之间的序列或由其组成。优选地,该区域具有40至120个核苷酸的长度。
gRNA的总长度一般为50至1000个核苷酸,优选80至200个核苷酸,并且更特别优选90至120个核苷酸。根据一个具体实施方式,如本发明所用的gRNA具有范围在95和110个核苷酸之间的长度,例如约100或约110个核苷酸的长度。
本领域技术人员可以通过使用公知的技术根据待靶向的染色体区域容易地定义gRNA的序列和结构(参见例如DiCarlo等人,2013的文章)。
细菌染色体内的靶向DNA区域/部分/序列可以对应于非编码DNA的一部分或编码DNA的一部分。在一个具体实施方式中,所述细菌DNA的靶向部分包含一个或多个对细菌存活必需的基因或基因部分或者一个或多个其失活允许选择已整合了目标核酸的细菌的基因或DNA序列。
梭状芽胞杆菌属细菌内靶向DNA部分的具体例子是在实验章节中使用的序列。它们是例如编码upp(SEQ ID NO:3)和adhE1(SEQ ID NO:4)基因的序列。
所述靶向的DNA区域/部分/序列后面跟着一个在Cas9结合中起作用的前间区序列邻近基序(protospacer adjacent motif)(PAM)序列。
给定gRNA的SDS区域与细菌染色体内的靶向DNA区域/部分/序列有100%同一性或至少80%同一性,优选至少85%、90%、95%、96%、97%、98%或99%同一性,并且能够与所述区域/部分/序列的互补序列的全部或部分杂交,通常与包含至少1个核苷酸、优选至少1、2、3、4、5、10、15、20、25、30、35或40个核苷酸、通常在1和40个核苷酸之间的序列、优选与包含20、21、22、23、24、25、26、27、28、29或30个核苷酸的序列杂交。
在本发明的方法中,可同时使用一种或多种gRNA。这些不同的gRNA可以靶向相同或不同的、优选不同的染色体区域。
可以将所述gRNA以gRNA分子(成熟或前体)的形式、以前体形式或以一种或多种编码所述gRNA的核酸的形式引入所述细菌细胞。优选将所述gRNA以一种或多种编码所述gRNA的核酸的形式引入到所述细菌细胞中。
当所述一种或多种gRNA以RNA分子的形式被直接引入细胞时,这些gRNA(成熟或前体)可含有修饰的核苷酸或化学修饰,从而使它们例如增加它们对核酸酶的抗性并因此增加它们在细胞中的寿命。它们特别可以包含至少一个经修饰的或非天然核苷酸,例如包含修饰碱基的核苷酸,如肌苷、甲基-5-脱氧胞苷、二甲基氨基–5–脱氧尿苷、脱氧尿苷、二氨基–2,6–嘌呤、溴代–5–脱氧尿苷或允许杂交的任何其他修饰碱基。本发明使用的gRNA也可以在核苷酸间键例如硫代磷酸酯、H-膦酸酯或烷基-膦酸酯水平上或在主链例如α–寡核苷酸、2’–O–烷基核糖或肽核酸水平(PNA)(Egholm等人,1992)上修饰。
所述gRNA可以是天然RNA、合成RNA或通过重组技术产生的RNA。所述gRNA可以通过本领域技术人员已知的任何方法例如化学合成、体内转录或扩增技术来制备。
当所述gRNA以一种或多种核酸的形式被引入所述细菌细胞时,编码所述一种或多种gRNA的所述一种或多种序列被置于表达启动子的控制下。所述启动子可以是组成型的或诱导型的。
当使用几种gRNA时,每种gRNA的表达可以由不同的启动子控制。优选地,所有gRNA使用的启动子是相同的。在一种具体实施方式中,可以使用相同的启动子来允许表达若干种、例如仅数种打算表达的gRNA。
在一个优选实施方式中,控制所述一种或多种gRNA表达的所述一种或多种启动子是组成型启动子。
在本发明的背景下有用的组成型启动子的例子可选自th1基因、ptb基因或bcs操纵子的启动子或其衍生物,优选miniPth1,或者本领域技术人员公知的允许在梭状芽胞杆菌内合成RNA(编码或非编码)的任何其他启动子。
在本发明的环境内有用的诱导型启动子的例子可选自tetA基因、xylA基因、lacI基因或bgaL基因的启动子或其衍生物,优选2tetO1或tetO2/1。优选的诱导型启动子是tetO2/1。
控制Cas9和所述一种或多种gRNA的表达的启动子可以是相同或不同的,并且可以是组成型或诱导型的。在本发明的一个具体和优选的实施方式中,分别控制Cas9或所述一种或多种gRNA的表达的启动子中仅有一个是诱导型启动子。
在本发明的含义内,术语“核酸”旨在表示任选天然的、合成的、半合成的或重组的DNA或RNA分子,其任选被化学修饰(即包括非天然碱基、含有例如修饰键、修饰碱基和/或修饰糖的修饰核苷酸)、或被优化使得从编码序列合成的转录本的密码子是其中使用的梭状芽胞杆菌属细菌中最常见的密码子。如上所述,在梭状芽胞杆菌属的情况下,优化的密码子通常是富含腺嘌呤(“A”)和胸腺嘧啶(“T”)碱基的密码子。
存在于本发明的遗传工具内的每种核酸,通常为“第一”核酸和“第二”核酸,由不同的实体组成,并且对应于例如:i)表达盒(或“构建体”),例如包含至少一个转录启动子的核酸,所述转录启动子与一个或多个目标(编码)序列可操作地连接(以本领域技术人员理解的含义),通常与包含几个目标编码序列的操纵子可操作地连接,所述目标编码序列的表达产物有助于在所述细菌内产生目标功能,或者例如还包含激活序列和/或转录终止子的核酸;或ii)环状或线性的单链或双链载体,例如质粒、噬菌体、粘粒、人造或合成染色体,其包含一个或多个如上定义的表达盒。优选地,所述载体是质粒。
所述表达盒和载体可通过本领域技术人员公知的常规程序构建,并且可包含一个或多个启动子、细菌复制起点(ORI序列)、终止序列、选择基因例如抗生素抗性基因、和允许靶向***所述盒或载体的序列(“侧翼区”)。另外,所述表达盒和载体可以通过本领域技术人员公知的技术被整合到基因组中。
目标ORI序列可选自pIP404、pAMβ1、repH(丙酮丁醇梭状芽胞杆菌中的复制起点)、ColE1或rep(大肠杆菌中的复制起点)或允许所述载体、通常为质粒在梭状芽胞杆菌细胞内维持的任何其他复制起点。
目标终止序列可选自adc和thl基因、bcs操纵子的终止序列,或本领域技术人员公知的用于停止在梭状芽胞杆菌内转录的任何其他终止子的终止序列。
目标选择基因可选自ermB、catP、bla、tetA、tetM、以及对氨苄西林、红霉素、氯霉素、甲砜霉素、四环素或本领域技术人员公知的可用于选择梭状芽胞杆菌属细菌的任何其他抗生素具有抗性的任何其它基因。
一种特定的载体包含一个或多个表达盒,每个表达盒编码gRNA。
在一个具体实施方式中,本发明涉及一种遗传工具,其包含作为如权利要求中所述的“第一”核酸的质粒载体,所述质粒载体的序列选自序列SEQ ID NO:5、SEQ ID NO:6和SEQ ID NO:7之一。
在一个具体实施方式中,本发明涉及一种遗传工具,其包含作为“第二”或“第n”核酸的质粒载体,所述质粒载体的序列选自序列SEQ ID NO:8、SEQ ID NO:9、SEQ ID NO:10和SEQ ID NO:11之一。
所述目标序列通过由所选择的修复模板(根据CRISPR技术)引导的同源重组机制被引入细菌基因组中。所述目标序列代替了细菌基因组内的靶向部分。因此,重组过程允许所述细菌的基因组内的靶向部分的全部或部分修饰或缺失,或允许核酸片段(在特定实施方式中为大片段)***所述细菌的基因组中。所选择的修复模板实际上可根据期望转化的性质而包含所述细菌基因组的全部或部分靶向序列或者其经或多或少地修饰的形式。就像DNA的靶向部分那样,所述模板因此可以自身包含与天然和/或合成的、编码和/或非编码序列相对应的一个或多个核酸序列或核酸序列部分。所述模板可以例如包含一个或多个序列或序列部分,所述序列或序列部分对应于细菌存活、特别是梭状芽胞杆菌属细菌的存活所必需的基因或对应于其失活允许选择已经整合了一种或多种目标核酸的梭状芽胞杆菌属细菌的一个或多个基因或DNA序列。所述模板也可以包含一个或多个“外来”序列,即,属于梭状芽胞杆菌属的细菌的基因组中或所述属的特定物种的基因组中天然不存在的序列。所述模板也可以包含如上所述的序列的组合。
目标序列的具体例子是实验章节中使用的序列。它们是例如序列upp_del(SEQ IDNO:12)和upp_stop(SEQ ID NO:13)。
本发明的遗传工具允许所述修复模板引导目标核酸合并到梭状芽胞杆菌属细菌的细菌基因组内,所述目标核酸通常是包含至少1个碱基对(bp),优选至少1、2、3、4、5、10、15、20、50、100、1,000、10,000、100,000或1,000,000bp,通常在1bp和20kb之间或1bp和10kb之间,优选在10bp和20kb之间或10bp和10kb之间,例如在1bp和2kb之间的DNA序列或序列部分。
在一个具体实施方式中,所述目标DNA序列编码至少一种目标产物,优选促进溶剂产生的产物,通常是至少一种目标蛋白质,例如酶;膜蛋白例如转运蛋白;转录因子;或其组合。
在一个优选实施方式中,所述目标DNA序列促进溶剂产生并且通常选自编码以下各项的序列:i)酶,优选参与醛向醇转化的酶,例如选自编码醇脱氢酶的序列(例如选自adh、adhE、adhE1、adhE2、bdhA和bdhB的序列)、编码转移酶的序列(例如选自ctfA、ctfB、atoA和atoB的序列)、编码脱羧酶的序列(例如adc)、编码氢化酶的序列(例如选自etfA、etfB和hydA的序列)及其组合,ii)膜蛋白,例如编码磷酸转移酶的序列(例如选自glcG、bglC、cbe4532、cbe4533、cbe4982、cbe4983、cbe0751的序列),和iii)转录因子(例如选自sigE、sigF、sigG、sigH、sigK的序列)。
本发明进一步涉及通过同源重组转化和/或遗传修饰梭状芽胞杆菌属细菌、优选梭状芽胞杆菌属的产溶剂细菌的方法。所述方法包含将如本申请所公开的本发明的遗传工具引入细菌的步骤。该方法可进一步包含获得转化的细菌、即具有所述一种或多种期望的重组/优化的细菌的步骤。
通过同源重组转化和/或遗传修饰梭状芽胞杆菌属的产溶剂细菌的具体的本发明方法依次包含以下步骤:
a)将如本申请公开的本发明的遗传工具引入所述细菌中,所述遗传工具包含至少一个诱导型启动子,和
b)诱导所述诱导型启动子的表达以遗传修饰所述细菌。
将本发明的遗传工具的组分(核酸或gRNA)引入所述细菌中是通过本领域技术人员已知的任何直接或间接方法进行的,例如通过转化、缀合、显微注射、转染、电穿孔等,优选通过转化(Lütke–Eversloh,2014)。
诱导步骤,当需要时,可在将本发明的遗传工具引入靶细菌后通过本领域技术人员已知的任何方法实施。例如通过使细菌接触以足量存在的合适物质或通过暴露于紫外光来进行。所述物质解除与所选的诱导型启动子相关联的表达的抑制。当所选的启动子是选自Pcm–2tetO1和Pcm–tetO2/1的脱水四环素(aTc)–诱导型启动子时,aTc优选以约1ng/ml和约5000ng/ml之间,优选约100ng/ml和约500ng/ml之间或约200ng/ml和约300ng/ml之间,例如约250ng/ml的浓度使用。
在一个具体实施方式中,所述方法包含一个或多个附加步骤,当存在步骤b)时所述附加步骤在步骤b)之后,所述附加步骤引入第n、例如第三、第四、第五等的核酸,所述核酸编码i)与已经引入的修复模板不同的修复模板,和ii)一种或多种向导RNA,所述向导RNA允许它们整合到细菌基因组的靶向区域中,各附加步骤优选有利地在去除先前引入的编码修复模板的核酸的步骤(于是,细菌细胞被视为所述核酸“清除的”)之后,并且优选在去除先前引入的一种或多种向导RNA或编码一种或多种向导RNA的序列的步骤之后。
在一个特别有利的方式中并且与现有技术中可用的工具形成对照,本发明的遗传工具允许小尺寸以及大尺寸的目标序列以一个步骤引入,即使用单一核酸(通常是本文中公开的“第二”核酸),或以几个步骤,即使用几种核酸(通常是本文中公开的“第二”和所述一个或多个“第n”核酸),优选以一个步骤。
在本发明的具体实施方式中,这“第n”核酸删除了细菌DNA的靶向部分或由较短的(例如由缺失至少一个碱基对的序列)和/或非功能性的序列代替它。在本发明的一个特别优选的实施方式中,所述“第二”或“第n”核酸有利地将包含至少一个碱基对并多达2、3、4、5、6、7、8、9、10、11、12或13kb的目标核酸引入所述细菌基因组中。
根据所使用的gRNA,所述目标核酸可以***到细菌染色体的相同或不同的区域,并且如果证明有用的话,***到包含对细菌存活必要的基因如基因gyrA、pfkA、hydA、crt、thl、hbd之一的细菌基因组部分中,或本领域技术人员已知的对梭状芽胞杆菌属细菌存活必要的任何其它基因中,和/或***到其失活允许选择已整合了所述一种或多种目标核酸的细菌的基因或DNA序列例如upp基因中。
依靠本发明,通常依靠本发明的遗传工具和方法,现在有可能有效地(高频率的同源重组)、可观地(有可能在所述细菌的基因组内并入大尺寸的目标核酸)和稳定地(不需要维持转化细菌与抗生素接触)修饰梭状芽胞杆菌属细菌以获得目标转化细菌,例如相对于本源细菌具有基因型或表型差异的增强变体,通常是工业上有用的细菌,例如可用于生产溶剂或生物燃料的细菌。
本发明的另一个目的涉及利用本发明的方法和/或遗传工具转化的梭状芽胞杆菌属细菌,通常为梭状芽胞杆菌属的产溶剂细菌。这样的细菌表达利用所述修复模板通过同源重组而引入其基因组中的所述一种或多种目标核酸。这样的细菌可包含本发明的全部或部分遗传工具,通常是Cas9或编码Cas9的核酸。
利用本发明的方法和遗传工具转化的本发明的梭状芽胞杆菌属特定细菌,例如细菌ATCC824,不再含有pSOL大质粒。
在一个具体实施方式中,利用本发明的方法和遗传工具转化的本发明的梭状芽胞杆菌属细菌,能够只依靠自发引入其基因组的所述一种或多种目标核酸的表达而产生一种或多种溶剂。
本发明还涉及用于转化和/或遗传修饰梭状芽胞杆菌属细菌的试剂盒,其包含本文中公开的遗传工具的全部或部分组分,通常是i)编码Cas9的第一核酸,其中所述Cas9编码序列置于启动子的控制下,和ii)至少一种编码修复模板的第二核酸,其允许通过同源重组机制使Cas9靶向的细菌DNA的一部分被目标序列替换,以及任选的一种或多种与在所述工具内任选使用的选定诱导型启动子相适应的诱导物。
本发明的特定试剂盒允许表达包含标签的Cas9蛋白。
本发明的试剂盒还可包含一种或多种消耗品,例如培养基、至少一种梭状芽胞杆菌属的感受态细菌(即为用于转化而包装的)、至少一种gRNA、Cas9蛋白、一种或多种选择分子、或一套说明书。
本发明典型地涉及用于实施本文中公开的转化方法或使用梭状芽胞杆菌属细菌生产溶剂(至少一种溶剂)的试剂盒。
本发明最后涉及本发明的遗传工具或方法或试剂盒在转化和/或遗传修饰梭状芽胞杆菌属细菌、通常为梭状芽胞杆菌属的产溶剂细菌中的潜在用途,例如用于产生梭状芽胞杆菌属细菌的增强变体。
最后,本发明涉及本发明的遗传工具、方法、试剂盒或所转化的梭状芽胞杆菌属细菌的潜在用途,特别是用于生产溶剂或生物燃料或其混合物,通常在工业规模上。可能生产的溶剂通常是丙酮、丁醇、乙醇、异丙醇或其混合物,通常是乙醇/异丙醇、丁醇/异丙醇或乙醇/丁醇混合物,优选异丙醇/丁醇混合物。
在一个具体实施方式中,乙醇/异丙醇混合物的比率至少等于1/4。所述比率优选在1/3和1之间,更优选等于1。
在一个具体实施方式中,乙醇/丁醇混合物的比率至少等于1/4。所述比率优选在1/3和1之间,更优选等于1。
在一个具体实施方式中,异丙醇/丁醇混合物的比率至少等于1/4。所述比率优选在1/3和1之间,更优选等于1。
使用本发明的转化细菌通常允许在工业规模上每年生产至少100吨丙酮、至少100吨乙醇、至少1000吨异丙醇、至少1800吨丁醇、或至少40000吨的其混合物。
以下实施例和附图的目的是为了更充分地说明本发明而不是限制其范围。
附图说明
图1:梭状芽胞杆菌产溶剂菌株的代谢。ABE菌株产生丙酮、乙醇和丁醇,而IBE菌株具有将丙酮转化为异丙醇的adh基因。根据Lee等人,2012修改。
图2:CRISPR作用模式。Mali等人
图3:使用CRISPR–Cas9进行基因组编辑。双链断裂是通过由gRNA指导的Cas9核酸酶产生的。通过同源重组修复该断裂变允许将修复模板中包含的修饰引入基因组。图根据Ann Ran等人,2013修改。
图4:upp靶向质粒。pIP404,丙酮丁醇梭状芽胞杆菌中的复制起点。ColE1,大肠杆菌中的复制起点。catP,氯霉素乙酰转移酶基因(氯霉素/甲砜霉素抗性基因)。CDS,编码序列。
图5:pSOL靶向质粒。pIP404,丙酮丁醇梭状芽胞杆菌中的复制起点。ColE1,大肠杆菌中的复制起点。catP,氯霉素乙酰转移酶基因(氯霉素/甲砜霉素抗性基因)。CDS,编码序列。
图6:pEC500E–miniPthl–Cas9载体图谱。pAMβ1,丙酮丁醇梭状芽胞杆菌中的复制起点。rep,大肠杆菌中的复制起点。bla,β–内酰胺酶基因(氨苄西林抗性)。ermB,甲基化酶(红霉素抗性)。CDS,编码序列。
图7:在野生型菌株中和获得的转化体中被cas9靶向的区域的测序。NC_003030,丙酮丁醇梭状芽胞杆菌ATCC824的序列(GenBank);crRNA,gRNA的识别位点;PAM,前间区序列邻近基序,在Cas9结合中发挥作用。CDS,编码序列。SEQ ID NO:22对应于图7中出现的NC_003030片段,SEQ ID NO:23分别对应于图7中出现的被定为“upp_stop修复模板”、“ATCC824”和“ATCC824upp–”的序列。
图8:扩增结果。
A:catP_fwd x catP_rev(预期大小:709bp)
B:RH_ctfB_R x V–CTFA–CAC2707_R(预期大小:351bp)
1:2–对数标志物(NEB)。2:H20,阴性对照。3:未转化的ATCC824。4:用pEC500E–miniPthl–cas9转化的ATCC824。5&6:用pEC500E–miniPthl–cas9和pEC750C转化的ATCC824(2个独立转化体)。7&8:用pEC500E–miniPthl–cas9和pEC750C–gRNA_adhE转化的ATCC824(2个独立转化体)。
图9:由ATCC824衍生的菌株的α-淀粉酶活性检测。1:ATCC824;2:用pEC500E和pEC750C转化的ATCC824;3:用pEC500E–miniPthl–cas9和pEC750C转化的ATCC824;4:用pEC500E和pEC750C–gRNA_adhE转化的ATCC824;5:用pEC500E–miniPthl–cas9和pEC750C–gRNA_adhE转化的ATCC824。
图10:野生型菌株和转化体的发酵结果(两种技术复证)。
图11:A/B.cas9的诱导型表达质粒。repH,丙酮丁醇梭状芽胞杆菌中的复制起点。ColE1,大肠杆菌中的复制起点。ermB,甲基化酶(红霉素抗性)。tetR,编码转录抑制子TetR的基因。CDS,编码序列。
图12:诱导对从启动子Pcm–2tetO1和Pcm–tetO2/1开始的表达的影响。所述启动子置于gusA基因的下游,并在不存在或存在100ng/mL的aTc的情况下,在丙酮丁醇梭状芽胞杆菌ATCC824细胞中测量GusA活性。根据Dong等人,2012修改。
图13:aTc浓度对含有pEC750C–gRNA_upp的转化体活力的影响。
图14:5–FU–抗性突变体的产生。将液体培养物的连续稀释物沉积在各种培养基上。只有其中同源重组事件允许***所述修复模板的转化体能够在2YTG+5–FU上生长。白色箭头指示被选择用于后面的实验的菌落。ND,未稀释。
图15:upp_del转化体的PCR分析。
A.upp基因周围的基因结构。编码序列用箭头表示。灰色的长方形表明upp_del模板中不存在的区域。所用的引物由三角形表示。CDS,编码序列。PAM,前间区序列邻近基序,在Cas9结合中发挥作用。
B.扩增结果。M:2–对数标志物(NEB)。1:H20,阴性对照。2:未转化的ATCC824。
3:在暴露于aTc之前用pFW0001–Pcm–2tetO1–cas9和pEC750CgRNA_upp–upp_del转化的ATCC824。4&5:在2YTG+5–FU上分离的在暴露于aTc之前用pFW0001–Pcm–2tetO1–cas9和pEC750CgRNA_upp–upp_del转化的ATCC824(2个独立转化体)。
图16:在2YTG+5–FU上分离的菌落中的cas9靶向区域的测序。
NC_003030,丙酮丁醇梭状芽胞杆菌ATCC824的序列(GenBank);crRNA,gRNA识别位点;PAM,前间区序列邻近基序,在Cas9结合中发挥作用。CDS,编码序列。SEQ ID NO:24对应于图16中出现的菌株ATCC824的基因组序列的片段,SEQ ID NO:25分别对应于图16中出现的被定为“upp_stop模板”、“克隆pFW0001–Pcm–2tetO1–cas9–pEC750C–gRNA_upp–upp_stop1”和“克隆pFW0001–Pcm–tetO2/1–cas9–pEC750C–gRNA_upp–upp_stop1”和“克隆pFW0001–Pcm–tetO2/1–cas9–pEC750C–gRNA_upp–upp_stop2”的序列的片段。
图17:pEC750C–gRNA_upp–Δupp::ipa8质粒。pIP404,丙酮丁醇梭状芽胞杆菌中的复制起点。ColE1,大肠杆菌中的复制起点。乙酰转移酶(氯霉素/甲砜霉素抗性基因)。CDS,编码序列。RHA/LHA:upp基因的侧翼序列(ca_c2879)。
图18:使用引物CA_C2877和CA_C2882的扩增结果。M,2–对数大小标志物(NEB)。P,pEC750C–gRNA_upp–Δupp::ipa8;WT,ATCC 824。
图19:由ATCC 824以及由突变体upp_del和Δupp::ipa8产生的溶剂。误差棒表示一式两份进行的实验的标准偏差。对upp_del和Δupp::ipa8获得的数据是在各自情况下对两种生物独立突变体获得的平均值。
图20:表达内切葡聚糖酶CelA(pWUR3)CelD(pWUR4)的各种菌株和表达空质粒的对照菌株在琼脂上的内切葡聚糖酶活性测量。将这些各种菌株在含有0.2%CMC的皮氏培养皿(petri dish)上温育48h。通过刚果红染色显现的水解晕圈表征每种菌株的内切葡聚糖酶活性。晕圈在对照菌株上未检测到,但在表达内切葡聚糖酶CelA或CelD的菌株中清晰可见。
图21:pSEC500E_X_Cas9质粒。pAMB1,拜氏梭状芽胞杆菌中的复制起点;PxylB,xylB启动子;ColE1起点,大肠杆菌中的复制起点;AmpR,氨苄西林抗性基因;PermB,ermB基因启动子;ermB,红霉素抗性基因;2微米ori,酵母中的复制起点;URA3,营养缺陷型标志物。
图22:pS_celAS1质粒。HS1和HS2,同源序列;aad9,奇霉素抗性基因;pCB102,拜氏梭状芽胞杆菌中的复制起点;ColE1起点,大肠杆菌中的复制起点。PeglA,eglA基因启动子。
图23:在含有浓度渐增的木糖的CGM琼脂上选择拜氏梭状芽胞杆菌重组菌株NCIMB8052(pEC500E_Xcas9,pS_celAS1)。
图24:已经整合celA基因的菌株NCIMB8052和菌株NCIMB8052的PCR验证。M,GeneRuler 1kb DNA梯(ThermoFisher)。
实施例
本发明人在两个靶标:upp和adhE基因上测试了在本文中公开并要求保护的遗传工具。
upp基因的失活
所选的第一个靶标使得通过简单的筛选来验证所设计的遗传修饰技术成为可能。upp基因编码尿嘧啶磷酸核糖基转移酶。这种酶从尿嘧啶形成一磷酸尿嘧啶(UMP),但也从5-氟尿嘧啶(5–FU)形成一磷酸5-氟尿嘧啶(5–FUMP)。5–FUMP对于细胞是毒性化合物,其阻断RNA合成。因此,基因组中含有upp基因的细菌与不表达该基因的菌株相反,将不能在含有5-FU的培养基上生长。
靶向该基因使得通过简单的表型观察就有可能简单快速地确定修饰策略是否有效。构建了三种用于靶向upp的质粒(参见图4+SEQ ID NO:9、10和11)。所有三种都含有靶向所述基因的相同gRNA,它们中的两种也含有不同的修复模板用于显示所述工具的产生缺失或点突变的能力:
·upp_del模板(SEQ ID NO:12)含有两个500核苷酸(nt)的片段,所述片段位于距由所述gRNA决定的断裂位点每侧150-nt处。使用这种模板修复所述断裂导致upp基因编码序列内的300nt缺失,使得后者之后将编码无活性蛋白。
·upp_stop模板(SEQ ID NO:13)包含两个位于所述断裂位点每侧的650-nt片段,其在gRNA识别位点处通过存在无义突变(诱导将编码氨基酸的密码子替换为终止密码子)而被修饰,以使得Cas9不再能靶向将编码不完整和失活蛋白质的基因。
pSOL质粒的丢失
所选的第二个靶标在发酵过程中令人感兴趣:参与产溶剂的一组基因,特别是adhE,位于pSOL大质粒上,并且已经显示它的丢失消除了丙酮和丁醇的产生。在利用pSOL靶向质粒(参见图5)去除这些发酵途径后,有可能将目标基因直接再引入基因组中。为了获得不再含有pSOL的菌株,构建了用于靶向adhE的质粒。本发明人表明pSOL不含细胞的必要功能,因此细胞将能够在它不存在的情况下存活。
cas9在丙酮丁醇梭状芽胞杆菌ATCC824中的组成型表达
所选的策略需要同时使用两种质粒:
–用于所述核酸酶的组成型表达的载体,由pEC500E质粒衍生而来:
pEC500E–miniPthl–Cas9(参见图6+SEQ ID NO:5);
–所述靶向载体之一,其确定核酸酶断裂位点并任选允许修复断裂,由pEC750C衍生而来:
·pEC750C–gRNA_upp(SEQ ID NO:9),含有靶向upp的gRNA;
·pEC750C–gRNA_upp–upp_del(SEQ ID NO:10),含有靶向upp的gRNA和upp_del模板;
·pEC750C–gRNA_upp–upp_stop(SEQ ID NO:11),含有靶向upp的gRNA和upp_stop模板;
·pEC750C–gRNA_adhE(SEQ ID NO:8),含有靶向adhE的gRNA。
pEC500E–miniPthl–Cas9表达载体以及对应于空载体(pEC500E)的对照质粒被引入菌株ATCC824。然后将获得的两个菌株用由载体pEC750C衍生的靶向载体转化,载体pEC750C用作对照。这个第二转化步骤的结果显示在下面的表1中。
表1:转化结果。++,获得许多转化体(得到102和103之间的菌落/转化);–,没有获得转化体
获得的转化结果表明Cas9是功能性的。实际上,当表达核酸酶并通过所述gRNA靶向upp基因时,由于在基因组DNA中引起的断裂以及细菌在没有修复模板的情况下不能进行基因组的修复(用pEC500E–miniPthl–cas9和pEC750C–gRNA_upp转化),所以没有获得转化体。
靶向upp
将所述靶向载体引入含有pEC500E的菌株时获得的结果表明,菌株ATCC824的基因组不含cas9同源物,因为转化体是用每种靶向载体获得的。
然后将含有pEC500E–miniPthl–cas9和pEC750C–gRNA_upp–upp_stop的转化体在非选择性培养基上重新铺板几次,以使其丢失所含的质粒。一旦菌落清除了它们的质粒并对抗生素敏感,就对upp基因(SEQ ID NO:3)测序(参见图7)。
期望的修饰确实存在。因此,包含引入的两种质粒并通过强组成型启动子表达cas9基因的CRISPR-Cas9遗传工具确实是功能性的。
靶向adhE
在用pEC500E–miniPthl–cas9和pEC750C_gRNA_adhE质粒转化野生型菌株期间获得转化体。由于cas9是活性的,该结果表明pSOL大质粒中的断裂不影响ATCC824的活力。
为了证实可能的pSOL丢失,进行了各种测试:
–PCR检测存在于pSOL上的基因:ctfB
使用catP_fwd x catP_rev的PCR允许检测存在于pEC750C质粒上的甲砜霉素抗性基因。它的检出证实靶向载体存在。
使用RH_ctfB_R x V–CTFA–CAC2707_R的PCR允许检测存在于pSOL大质粒上的ctfB基因的一部分,并使得有可能知道pSOL大质粒是否存在于细胞中。
扩增似乎显示了pSOL大质粒不再存在于用pEC500E–miniPthl–cas9和pEC750C–gRNA_adhE转化的克隆中(参见图8)。
–检测由存在于pSOL的基因编码的酶活性
在pSOL大质粒上含有的基因中,amyP编码具有α-淀粉酶活性的胞外酶。这种活性可以在含有淀粉和葡萄糖的固体培养基上检测(Sabathé等人,2002)。将液体培养物的稀释液沉积在含有0.2%葡萄糖和2%淀粉的琼脂平板上,并在37℃培养72h。然后通过碘染色显现α-淀粉酶活性。细菌菌落周围的透明晕圈表明存在α-淀粉酶活性。含有pEC500E–miniPthl–cas9和pEC750C–gRNA_adhE的ATCC824周围不存在活性表明amyP基因在该菌株中不表达,证实所述大质粒不再存在(参见图9)。
–发酵结果
将ATCC824野生型菌株和转化体在Gapes培养基中生长24h,以建立这两种菌株的发酵结果。所获得的发酵结果显示,由于在转化体中不存在adhE、adhE1和adc基因(存在于pSOL大质粒上),乙醇产生减少并且丁醇和乙酸产生消失(参见图10)。
因此,Cas9能够作用于ATCC824菌株的染色体或天然质粒,这使得将它的作用拓宽到染色体以及所述菌株中存在的任何染色体外遗传物质(质粒,噬菌体等)成为可能。
在丙酮丁醇梭状芽胞杆菌ATCC824中cas9的诱导型表达
为了能够实现基因组和修复模板之间的同源重组事件,有必要增加其中核酸酶具有活性的细胞数量(当用靶向载体转化含有pEC500E–miniPthl–cas9的ATCC824菌株时高达103)。为此,需要其中核酸酶表达受控的***。构建两种载体,其中cas9基因置于无水四环素诱导型启动子的控制下,其由载体pFW0001衍生而来:
·pFW0001–Pcm–2tetO1–cas9(参见图11A+SEQ ID NO:6);
·pFW0001–Pcm–tetO2/1–cas9(参见图11B+SEQ ID NO:7);
控制cas9表达的启动子含有操纵子序列tetO1和tetO2,转录阻遏子TetR与其结合。这种阻遏通过无水四环素(aTc)的存在而被解除。该***允许受控表达,几乎没有遗漏。在aTc存在下,从启动子Pcm–2tetO1开始的合成更高(参见图12)。
丙酮丁醇梭状芽胞杆菌ATCC824的转化:
将表达载体和空载体(pFW0001)引入ATCC824。随后,将以下质粒引入各类型的转化体中(参见表2):
·pEC750C–gRNA_upp,含有靶向upp的gRNA;
·pEC750C–gRNA_upp–upp_del,含有靶向upp的gRNA和upp_del模板;
·pEC750C–gRNA_upp–upp_stop,含有靶向upp的gRNA和upp_stop模板;
将转化的菌落以不同的稀释度在各种固体培养基上划线:
·2YTG+红霉素来验证细胞活力,稀释106倍;
·2YTG+红霉素+甲砜霉素来选择转化体;
·2YTG+红霉素+甲砜霉素+aTc(200ng/mL)以在诱导物存在下选择转化体。
表2:每种转化在每种类型的培养基上获得的菌落数。
ery,红霉素。thiam,甲砜霉素。aTc,无水四环素。括号内,稀释倍数。ND,未稀释。–,未测试。
观察到aTc的毒性作用,因为当它存在时,即使当使用空靶向载体(pEC750C,对照)时,获得的转化体也很少。正如预期,当含有gRNA_upp盒的pEC750C被引入到表达cas9的细胞中时,在含有aTc的培养基上没有获得转化体。另一方面,在没有aTc的培养基上获得每种质粒组合的大量转化体,表明cas9没有表达。
在aTc存在下的Cas9表达
将在含有红霉素和甲砜霉素的平板上获得的各种转化体在相同类型的培养基上重新铺板,然后用于接种含有这两种抗生素的液体预培养物。然后将这些预培养物用于接种含有不同浓度的aTc的其他液体培养物以确定所述***是否是功能性的。
cas9的诱导
使用三种转化体来分析在aTc存在下诱导cas9表达的能力:
含有pFW0001和pEC750C–gRNA_upp的ATCC824;
含有pFW0001–Pcm–2tetO1–cas9和pEC750C–gRNA_upp的ATCC824;
含有pFW0001–Pcm–tetO2/1–cas9和pEC750C–gRNA_up的ATCC824;
由这些转化体的液体培养物接种含有红霉素、甲砜霉素和浓度渐增的aTc的液体培养基(参见图13)。通过测量培养72h后的光密度来评价细胞生长的能力。
不表达核酸酶的转化体受aTc存在的影响很小或者根本不受影响。另一方面,即使在低aTc浓度下,含有经由启动子Pcm–2tetO1表达cas9的质粒(pFW0001–Pcm–2tetO1–cas9)的转化体和仅含有gRNA的质粒(pEC750C–gRNA_upp)也表现出显著的生长延迟。含有经由启动子Pcm–tetO2/1表达cas9的质粒(pFW0001–Pcm–tetO2/1–cas9)的转化体和仅含有gRNA的质粒(pEC750C–gRNA_upp)在低aTc浓度下不受影响。
然而,从150ng/mL开始观察到强生长延迟,并且在300ng/mL下观察不到生长。因此在不存在诱导物的情况下启动子Pcm–tetO2/1似乎允许比Pcm–2tetO1更好地抑制表达。
突变体的生成
在不存在或存在(100ng/mL)aTc的情况下,还制备了含有用于修复双链断裂的靶向质粒的转化体的液体培养物。所用的转化体含有表3中出现的12种质粒组合之一。
表3:转化体中存在的质粒组合。
培养72h后,将等份试样沉积在各种固体培养基上:
·含有甲砜霉素和红霉素的2YTG;
·含有甲砜霉素、红霉素和100ng/mL aTc的2YTG;
·含有5–氟尿嘧啶的2YTG。
只有其中同源重组事件允许***修复模板的转化体能够在2YTG+5–FU上生长(参见图14)。
upp_del转化体的分析
通过PCR分析在2YTG+5–FU上分离的克隆(参见图15)。
使用catP_fwd x catP_rev的PCR允许检测存在于pEC750C质粒上的甲砜霉素抗性基因。它的检出证实靶向载体存在。
使用LHA_upp_fwd x RHA_upp_rev的PCR允许扩增upp基因以及侧翼区域。出现在下面的引物被用于构建upp_del修复模板(参见图15+SEQ ID NO:14–21):
表4
使用upp_template_fwd x upp_template_rev的PCR允许扩增upp基因中的内部片段,该片段在upp_del修复模板中不存在。
获得的结果证实了所分析的转化体中upp基因内的缺失。
upp_stop突变体的分析
在2YTG+5–FU上暴露于aTc后分离的三个克隆中测序upp基因(参见图16):
·一个包含经由启动子Pcm–2tetO1表达cas9的质粒(pFW0001–Pcm–2tetO1–cas9)和含有gRNA以及upp_stop修复模板的质粒(pEC750CgRNA_upp–upp_stop);
·两个包含经由启动子Pcm–tetO2/1表达cas9的质粒(pFW0001–Pcm–tetO2/1–cas9)和含有gRNA以及upp_stop修复模板的质粒(pEC750CgRNA_upp–upp_stop)。
因此,旨在通过使用在诱导型启动子控制下的cas9基因和存在于第二质粒中的gRNA来开发遗传修饰***的策略是有作用的。与使用组成型启动子miniPth1相比,诱导cas9基因使得有可能控制酶的作用并且便于选择经历了所期望的修饰的转化体。
由操纵子ipa8替换upp基因
所做的修饰由操纵子ipa8在丙酮丁醇梭状芽胞杆菌ATCC 824基因组内的***构成,所述操纵子含有在th1基因的组成型启动子控制下的来自拜氏梭状芽胞杆菌DSM6423的adh基因(允许丙酮转化为异丙醇)和菌株ATCC 824的adc、ctfA、ctfB基因(允许所产生的酸再同化和形成丙酮)。***这个3614-bp操纵子来代替upp基因。
将由侧翼带有位于upp基因每侧的1-kb序列的操纵子组成的修复模板***pEC750–gRNA_upp质粒中而获得pEC750C–gRNA_upp–Δupp::ipa8质粒(参见图17和SEQ IDNO:26)。
将该质粒引入含有pFW0001–Pcm–tetO2/1–cas9质粒的感受态ATCC 824细胞中。在含有甲砜霉素、红霉素和诱导物aTc的2YTG培养基上进行cas9表达的诱导。
获得的菌落通过用引物对CA_C2877和CA_C2882的PCR进行分析,所述引物对允许在菌株ATCC 824中扩增2720bp产物:
CA_C2877:5’–CTTTTTAAAAAAGTTAAATAAGGAAGG–3’(SEQ ID NO:27);
CA_C2882:5’–GTTTAACTTAAGTTACAGAAAAGCTAGG–3’(SEQ ID NO:28)
对各种对照和诱导后获得的4个独立菌落上进行PCR测定的结果证实了upp基因***纵子ipa8替换(图18)。
对获得的突变体以及用作对照的WT菌株和Δupp突变体在Gapes培养基(Gapes等人,1996)中在34℃和150rpm下进行发酵72h。发酵上清液通过使用0.5g/L丙醇溶液作为内标的HPLC进行分析。碳水化合物浓度在配备折射率检测器(Varian 350 RI)的HXP-87P柱(Biorad,300mm X 7.8mm)上定量。柱温为80℃,洗脱液由硫酸组成,流速为0.4mL/min(Spectra System RI 150)。
获得的结果显示,与WT菌株或Δupp突变体形成对比,Δupp::ipa8突变体能够将丙酮还原成异丙醇(图19)。
将celA基因***拜氏梭状芽胞杆菌NCIMB8052的基因组
所做的修饰由在拜氏梭状芽胞杆菌NCIMB8052基因组内***在来自糖丁酸梭状芽胞杆菌NCP262的eglA启动子控制下的来自Neocallimastix patriciarum的celA基因构成。该基因编码能够降解纤维素底物、称为CMC(羧甲基纤维素)的酶。所述基因及其启动子的大小为1667bp,***在hbd基因之后,并允许所述菌株降解CMC(图20,Lopez–Contreras等人,2001)。
为了进行这种***,使用了两种质粒:
–pEC500E_X_cas9质粒:在来自难辨梭状芽胞杆菌630的xylB诱导型启动子的控制下表达cas9基因(Nariya等人,2011)(参见图21和SEQ ID NO:29)。
–pS_XR_celAS1质粒,在xylB启动子控制下表达向导RNA并靶向来自拜氏梭状芽胞杆菌NCIMB8052的hbd基因。所述质粒还含有由在egl2启动子控制下的celA基因组成的修复模板,所述基因侧翼带有位于所述向导RNA靶向的区域每侧的1001和1017个碱基对的两个同源区域(参见图22和SEQ ID NO:30)。
将这两种质粒依次引入NCIMB8052中。在含有奇霉素和红霉素以及浓度渐增的木糖诱导物的的CGM(6.25g/L酵母抽提物;0.5g/L MgSO4·7H2O;0.95g/L KH2HPO4;0.95g/LK2HPO4;0.013g/L MnSO4·7H2O;0.013g/L FeSO4·7H2O;1.25g/L NaCl;2.5g/L(NH4)2SO4;2.5g/L天冬酰胺)上进行cas9表达的诱导(图24)。
在含有6%木糖的CGM上诱导后获得的菌落通过使用引物对Cbei_325_F和Cbei_325_R的PCR进行分析,所述引物对允许在菌株NCIMB8052中扩增2070个碱基对的产物,并允许在整合了celA基因的情况下扩增3718个碱基对的产物:
Cbei_325_F(celA)5’–AGATAATTATGAAGTTAATCCTTAG–3’(SEQ ID NO:31)
Cbei_326_R(celA)5’–CATTTGCTTTCAGGTCTTCTTTTGCTG–3’(SEQ ID NO:32)
在对照菌株上和诱导后获得的独立菌落上进行PCR测定的结果证实celA基因在NCIMB8052中的***(图25)。
参考文献
-Al–Hinai,M.A.等人,用于有效分离梭状芽胞杆菌双交叉等位基因交换突变体的新***,能够实现无标记染色体基因缺失和DNA整合(Novel system for efficientisolation of Clostridium double–crossover allelic exchange mutants enablingmarkerless chromosomal gene deletions and DNA integration).Appl.EnvironMicrobiol.2012.78(22):8112–21.
-Ran,F.A.等人,使用CRISPR–Cas9***的基因组工程(Genome engineeringusing the CRISPR–Cas9system).Nature protocols.2013.8(11):2281–308.
-Barrangou,R.等人,CRISPR在原核生物中提供针对病毒的获得性抗性(CRISPRprovides acquired resistance against viruses in prokaryotes).Science.2007,315(5819):1709–12.
-Banerjee,A.等人,用于杨氏梭状芽胞杆菌代谢工程的乳糖诱导***(Lactose–inducible system for metabolic engineering of Clostridium ljungdahlii),Appl.Environ.Microbiol.2014.80(8):2410–6.
-Cartman,S.T.等人,难辨梭状芽胞杆菌染色体的精确操纵揭示在tcdC基因型和毒素产生之间缺乏关联(Precise manipulation of the Clostridium difficilechromosome reveals a lack of association between the tcdC genotype and toxinproduction).Appl.Environ.Microbiol.2012.78(13):4683–90.
-Currie,D.H.等人,来自嗜热纤维梭状芽胞杆菌的工程化全长CipA在解糖嗜热厌氧菌中的功能性异源表达(Functional heterologous expression of an engineeredfull length CipA from Clostridium thermocellum in Thermoanaerobacteriumsaccharolyticum),Biotechnol.Biofuels.2013.6(1):32.
-DiCarlo,J.E.等人,利用CRISPR-Cas***在酿酒酵母中进行的基因组工程(Genome engineering in Saccharomyces cerevisiae using CRISPR–Cas systems).Nucleic Acids Res.2013,41(7):4336–43
-Dong,H.等人,用于产溶剂型丙酮丁醇梭状芽胞杆菌的脱水四环素诱导型基因表达***的开发:菌株工程的有用工具(Development of an anhydrotetracycline–inducible gene expression system for solvent–producing Clostridiumacetobutylicum:A useful tool for strain engineering).Metab.Eng.2012.14(1):59–67.
-D’Urzo,N.等人,在木糖诱导型启动子控制下的异源蛋白在桥石短芽孢杆菌SP3中的高水平细胞内表达(High–level intracellular expression of heterologousproteins in Brevibacillus choshinensis SP3under the control of a xyloseinducible promoter).Microb.Cell Fact.2013.12:12
-Egholm,M.等人,肽核酸(PNA).具有非手性肽主链的寡核苷酸类似物(Peptidenucleic acids(PNA).Oligonucleotide analogs with an achiral peptide backbone).J.Am.Chem.Soc.1992.114(5):1895–7.
-Fonfara,I.等人,Cas9的***发育决定了双向RNA和Cas9在直系同源II型CRISPR-Cas***中的功能可交换性(Phylogeny of Cas9 determines functionalexchangeability of dual–RNA and Cas9 among orthologous type II CRISPR–Cassystems).Nucleic acids res.2013,42(4):2577–90.
-Gapes,J.R.,Nimcevic,D.,&Friedl,A.(1996).拜氏梭状芽胞杆菌在具有在线脱溶剂的两级恒化器中的长期连续培养(Long–Term Continuous Cultivation ofClostridium beijerinckii in a Two–Stage Chemostat with On–Line SolventRemoval).Applied and environmental microbiology,62(9),3210–3219.
-Hartman,A.H.等人,用于在产气荚膜梭状芽胞杆菌中受控基因表达的乳糖诱导型启动子***的构建和表征(Construction and characterization of a lactose–inducible promoter system for controlled gene expression in Clostridiumperfringens).Appl.Environ.Microbiol.77(2):471–8
-Jinek,M.等人,在适应性细菌免疫中的可编程的双RNA引导的DNA内切核酸酶(Aprogrammable dual–RNA–guided DNA endonuclease in adaptive bacterialimmunity).Science 2012.337(6096):816–21.
-Lee,J.等人,丙酮丁醇梭状芽胞杆菌ATCC 824用于异丙醇-丁醇-乙醇发酵的代谢工程(Metabolic engineering of Clostridium acetobutylicum ATCC 824forisopropanol–butanol–ethanol fermentation).Appl.Environ.Microbiol;2012.78(5):1416–23.
-Lopez–Contreras AM,Smidt H,van der Oost J,Claassen PAM,Mooibroek H,de Vos WM.2001.表达Neocallismatix patriciarum糖苷水解酶的拜氏梭状芽胞杆菌细胞显示增强的地衣多糖利用和溶剂生产(Clostridium beijerinckii Cells ExpressingNeocallismatix patriciarum Glycoside Hydrolases Show Enhanced LichenanUtilization and Solvent Production).Appl Environ Microbiol 67:5127–5133.
-Lütke–Eversloh,T.新型代谢工程工具对于丙酮丁醇梭状芽胞杆菌的应用(Application of new metabolic engineering tools for Clostridiumacetobutylicum).Appl.Microbiol.Biotechnol.2014.98(13):5823–37
-Mali,P.等人,Cas9作为工程生物学的多用途工具(Cas9as a versatile toolfor engineering biology).Nat.methods.2013,10(10):957–63.
-Makarova,K.S.等人,CRISPR-Cas***的演变和分类(Evolution andclassification of the CRISPR–Cas systems).Nat.Rev.Microbiol.2011,9(6):467–77.
-Mearls,E.B.等人,用于嗜热纤维梭状芽胞杆菌的基于可调控质粒的基因表达***的开发(Development of a regulatable plasmid–based gene expression systemfor Clostridium thermocellum),Appl.Microbiol.Biotechnol.2015.99(18):7589–99.
-Nariya,H.等人,用于产气荚膜梭状芽胞杆菌的木糖诱导型基因表达***的开发和表征(Development and characterization of a xylose–inducible gene expressionsystem for Clostridium perfringens),Appl.Environ.Microbiol.2011.77(23):8439–41.
-Nariya H,Miyata S,Kuwahara T,Okabe A.2011.用于产气荚膜梭状芽胞杆菌的木糖诱导型基因表达***的开发和表征(Development and characterization of axylose–inducible gene expression system for Clostridium perfringens).ApplEnviron Microbiol 77:8439–8441.
-Newcomb,M.等人,celC基因簇在嗜热纤维梭状芽胞杆菌中的共转录(Co–transcription of the celC gene cluster in Clostridium thermocellum),Appl.Microbiol.Biotechnol.2011.90(2):625–34.
-Wang,Y.等人,在拜氏梭状芽胞杆菌中利用CRISPR/Cas9***的无标记染色体基因缺失(Markerless chromosomal gene deletion in Clostridium beijerinckii usingCRISPR/Cas9system).J.Biotechnol.2015.200:1–5.
-Xu,T.等人,经由CRISPR-Cas9切口酶在解纤维梭状芽胞杆菌中的有效基因组编辑(Efficient genome editing in Clostridium cellulolyticum via CRISPR–Cas9nickase).Appl.Environ.Microbiol.2015AEM–00873.
-Zhang,N.等人,在梭状芽胞杆菌中经由等位基因交换的I–SceI–介导无痕基因修饰(I–SceI–mediated scarless gene modification via allelic exchange inClostridium).J.Microbiol.Methods.2015,108:49–60.
-Zhang,J.等人,为解纤维梭状芽胞杆菌开发的新***糖诱导型遗传操作***(A novel arabinose–inducible genetic operation system developed forClostridium cellulolyticum),Biotechnol.Biofuels.2015.8:36.
序列表
<110> IFP新能源公司(IFP Energies Nouvelles)
<120> 用于转化梭状芽胞杆菌属细菌的遗传工具
<130> B2115PC00
<160> 32
<170> PatentIn version 3.5
<210> 1
<211> 1368
<212> PRT
<213> 化脓性链球菌(Streptococcus pyogenes)
<400> 1
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala
1010 1015 1020
Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe
1025 1030 1035
Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala
1040 1045 1050
Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu
1055 1060 1065
Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val
1070 1075 1080
Arg Lys Val Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr
1085 1090 1095
Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys
1100 1105 1110
Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro
1115 1120 1125
Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val
1130 1135 1140
Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys
1145 1150 1155
Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser
1160 1165 1170
Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys
1175 1180 1185
Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu
1190 1195 1200
Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly
1205 1210 1215
Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val
1220 1225 1230
Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys
1250 1255 1260
His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys
1265 1270 1275
Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala
1280 1285 1290
Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn
1295 1300 1305
Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala
1310 1315 1320
Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser
1325 1330 1335
Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr
1340 1345 1350
Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp
1355 1360 1365
<210> 2
<211> 4107
<212> DNA
<213> 化脓性链球菌(Streptococcus pyogenes)
<400> 2
atggataaga aatactcaat aggcttagat atcggcacaa atagcgtcgg atgggcggtg 60
atcactgatg aatataaggt tccgtctaaa aagttcaagg ttctgggaaa tacagaccgc 120
cacagtatca aaaaaaatct tataggggct cttttatttg acagtggaga gacagcggaa 180
gcgactcgtc tcaaacggac agctcgtaga aggtatacac gtcggaagaa tcgtatttgt 240
tatctacagg agattttttc aaatgagatg gcgaaagtag atgatagttt ctttcatcga 300
cttgaagagt cttttttggt ggaagaagac aagaagcatg aacgtcatcc tatttttgga 360
aatatagtag atgaagttgc ttatcatgag aaatatccaa ctatctatca tctgcgaaaa 420
aaattggtag attctactga taaagcggat ttgcgcttaa tctatttggc cttagcgcat 480
atgattaagt ttcgtggtca ttttttgatt gagggagatt taaatcctga taatagtgat 540
gtggacaaac tatttatcca gttggtacaa acctacaatc aattatttga agaaaaccct 600
attaacgcaa gtggagtaga tgctaaagcg attctttctg cacgattgag taaatcaaga 660
cgattagaaa atctcattgc tcagctcccc ggtgagaaga aaaatggctt atttgggaat 720
ctcattgctt tgtcattggg tttgacccct aattttaaat caaattttga tttggcagaa 780
gatgctaaat tacagctttc aaaagatact tacgatgatg atttagataa tttattggcg 840
caaattggag atcaatatgc tgatttgttt ttggcagcta agaatttatc agatgctatt 900
ttactttcag atatcctaag agtaaatact gaaataacta aggctcccct atcagcttca 960
atgattaaac gctacgatga acatcatcaa gacttgactc ttttaaaagc tttagttcga 1020
caacaacttc cagaaaagta taaagaaatc ttttttgatc aatcaaaaaa cggatatgca 1080
ggttatattg atgggggagc tagccaagaa gaattttata aatttatcaa accaatttta 1140
gaaaaaatgg atggtactga ggaattattg gtgaaactaa atcgtgaaga tttgctgcgc 1200
aagcaacgga cctttgacaa cggctctatt ccccatcaaa ttcacttggg tgagctgcat 1260
gctattttga gaagacaaga agacttttat ccatttttaa aagacaatcg tgagaagatt 1320
gaaaaaatct tgacttttcg aattccttat tatgttggtc cattggcgcg tggcaatagt 1380
cgttttgcat ggatgactcg gaagtctgaa gaaacaatta ccccatggaa ttttgaagaa 1440
gttgtcgata aaggtgcttc agctcaatca tttattgaac gcatgacaaa ctttgataaa 1500
aatcttccaa atgaaaaagt actaccaaaa catagtttgc tttatgagta ttttacggtt 1560
tataacgaat tgacaaaggt caaatatgtt actgaaggaa tgcgaaaacc agcatttctt 1620
tcaggtgaac agaagaaagc cattgttgat ttactcttca aaacaaatcg aaaagtaacc 1680
gttaagcaat taaaagaaga ttatttcaaa aaaatagaat gttttgatag tgttgaaatt 1740
tcaggagttg aagatagatt taatgcttca ttaggtacct accatgattt gctaaaaatt 1800
attaaagata aagatttttt ggataatgaa gaaaatgaag atatcttaga ggatattgtt 1860
ttaacattga ccttatttga agatagggag atgattgagg aaagacttaa aacatatgct 1920
cacctctttg atgataaggt gatgaaacag cttaaacgtc gccgttatac tggttgggga 1980
cgtttgtctc gaaaattgat taatggtatt agggataagc aatctggcaa aacaatatta 2040
gattttttga aatcagatgg ttttgccaat cgcaatttta tgcagctgat ccatgatgat 2100
agtttgacat ttaaagaaga cattcaaaaa gcacaagtgt ctggacaagg cgatagttta 2160
catgaacata ttgcaaattt agctggtagc cctgctatta aaaaaggtat tttacagact 2220
gtaaaagttg ttgatgaatt ggtcaaagta atggggcggc ataagccaga aaatatcgtt 2280
attgaaatgg cacgtgaaaa tcagacaact caaaagggcc agaaaaattc gcgagagcgt 2340
atgaaacgaa tcgaagaagg tatcaaagaa ttaggaagtc agattcttaa agagcatcct 2400
gttgaaaata ctcaattgca aaatgaaaag ctctatctct attatctcca aaatggaaga 2460
gacatgtatg tggaccaaga attagatatt aatcgtttaa gtgattatga tgtcgatcac 2520
attgttccac aaagtttcct taaagacgat tcaatagaca ataaggtctt aacgcgttct 2580
gataaaaatc gtggtaaatc ggataacgtt ccaagtgaag aagtagtcaa aaagatgaaa 2640
aactattgga gacaacttct aaacgccaag ttaatcactc aacgtaagtt tgataattta 2700
acgaaagctg aacgtggagg tttgagtgaa cttgataaag ctggttttat caaacgccaa 2760
ttggttgaaa ctcgccaaat cactaagcat gtggcacaaa ttttggatag tcgcatgaat 2820
actaaatacg atgaaaatga taaacttatt cgagaggtta aagtgattac cttaaaatct 2880
aaattagttt ctgacttccg aaaagatttc caattctata aagtacgtga gattaacaat 2940
taccatcatg cccatgatgc gtatctaaat gccgtcgttg gaactgcttt gattaagaaa 3000
tatccaaaac ttgaatcgga gtttgtctat ggtgattata aagtttatga tgttcgtaaa 3060
atgattgcta agtctgagca agaaataggc aaagcaaccg caaaatattt cttttactct 3120
aatatcatga acttcttcaa aacagaaatt acacttgcaa atggagagat tcgcaaacgc 3180
cctctaatcg aaactaatgg ggaaactgga gaaattgtct gggataaagg gcgagatttt 3240
gccacagtgc gcaaagtatt gtccatgccc caagtcaata ttgtcaagaa aacagaagta 3300
cagacaggcg gattctccaa ggagtcaatt ttaccaaaaa gaaattcgga caagcttatt 3360
gctcgtaaaa aagactggga tccaaaaaaa tatggtggtt ttgatagtcc aacggtagct 3420
tattcagtcc tagtggttgc taaggtggaa aaagggaaat cgaagaagtt aaaatccgtt 3480
aaagagttac tagggatcac aattatggaa agaagttcct ttgaaaaaaa tccgattgac 3540
tttttagaag ctaaaggata taaggaagtt aaaaaagact taatcattaa actacctaaa 3600
tatagtcttt ttgagttaga aaacggtcgt aaacggatgc tggctagtgc cggagaatta 3660
caaaaaggaa atgagctggc tctgccaagc aaatatgtga attttttata tttagctagt 3720
cattatgaaa agttgaaggg tagtccagaa gataacgaac aaaaacaatt gtttgtggag 3780
cagcataagc attatttaga tgagattatt gagcaaatca gtgaattttc taagcgtgtt 3840
attttagcag atgccaattt agataaagtt cttagtgcat ataacaaaca tagagacaaa 3900
ccaatacgtg aacaagcaga aaatattatt catttattta cgttgacgaa tcttggagct 3960
cccgctgctt ttaaatattt tgatacaaca attgatcgta aacgatatac gtctacaaaa 4020
gaagttttag atgccactct tatccatcaa tccatcactg gtctttatga aacacgcatt 4080
gatttgagtc agctaggagg tgactga 4107
<210> 3
<211> 630
<212> DNA
<213> 梭状芽胞杆菌(clostridium)
<400> 3
atgagtaaag ttacacaaat atcacatcca cttatattac acaaattagc atttatgaga 60
gataaaaaaa caggatctaa agattttaga gagatggtag aagaagtagc aatgctaatg 120
gcatatgaag taacaagaga aatgcagctt gaaactgttg aaatagaaac tcctatatgt 180
ataactaaat gtaaaatgtt agcaggaaaa aaggtagcta tagttcctat acttagagca 240
ggacttggaa tggtaaatgg agtattaaaa ttaatacctg ctgctaaggt tggacatata 300
ggattatata gagatgaaaa gacattaaaa cctgtagaat acttctgtaa acttcctcaa 360
gatatagagg aaagagacat aatagtaact gatccaatgc ttgcaactgg tgggtcagca 420
atagatgcaa taacacttct taagaaaaga ggagcaaaat acataagact tatgtgtctt 480
ataggagcac cagaaggtat agcagcagta caagaagcac atccagatgt agatatatac 540
ctcgcatcaa tagatgaaaa gttagatgaa aatggatata tagttcctgg tcttggagat 600
gctggagata gattattcgg tacaaaataa 630
<210> 4
<211> 2589
<212> DNA
<213> 梭状芽胞杆菌(clostridium)
<400> 4
atgaaagtca caacagtaaa ggaattagat gaaaaactca aggtaattaa agaagctcaa 60
aaaaaattct cttgttactc gcaagaaatg gttgatgaaa tctttagaaa tgcagcaatg 120
gcagcaatcg acgcaaggat agagctagca aaagcagctg ttttggaaac cggtatgggc 180
ttagttgaag acaaggttat aaaaaatcat tttgcaggcg aatacatcta taacaaatat 240
aaggatgaaa aaacctgcgg tataattgaa cgaaatgaac cctacggaat tacaaaaata 300
gcagaaccta taggagttgt agctgctata atccctgtaa caaaccccac atcaacaaca 360
atatttaaat ccttaatatc ccttaaaact agaaatggaa ttttcttttc gcctcaccca 420
agggcaaaaa aatccacaat actagcagct aaaacaatac ttgatgcagc cgttaagagt 480
ggtgccccgg aaaatataat aggttggata gatgaacctt caattgaact aactcaatat 540
ttaatgcaaa aagcagatat aacccttgca actggtggtc cctcactagt taaatctgct 600
tattcttccg gaaaaccagc aataggtgtt ggtccgggta acaccccagt aataattgat 660
gaatctgctc atataaaaat ggcagtaagt tcaattatat tatccaaaac ctatgataat 720
ggtgttatat gtgcttctga acaatctgta atagtcttaa aatccatata taacaaggta 780
aaagatgagt tccaagaaag aggagcttat ataataaaga aaaacgaatt ggataaagtc 840
cgtgaagtga tttttaaaga tggatccgta aaccctaaaa tagtcggaca gtcagcttat 900
actatagcag ctatggctgg cataaaagta cctaaaacca caagaatatt aataggagaa 960
gttacctcct taggtgaaga agaacctttt gcccacgaaa aactatctcc tgttttggct 1020
atgtatgagg ctgacaattt tgatgatgct ttaaaaaaag cagtaactct aataaactta 1080
ggaggcctcg gccatacctc aggaatatat gcagatgaaa taaaagcacg agataaaata 1140
gatagattta gtagtgccat gaaaaccgta agaacctttg taaatatccc aacctcacaa 1200
ggtgcaagtg gagatctata taattttaga ataccacctt ctttcacgct tggctgcgga 1260
ttttggggag gaaattctgt ttccgagaat gttggtccaa aacatctttt gaatattaaa 1320
accgtagctg aaaggagaga aaacatgctt tggtttagag ttccacataa agtatatttt 1380
aagttcggtt gtcttcaatt tgctttaaaa gatttaaaag atctaaagaa aaaaagagcc 1440
tttatagtta ctgatagtga cccctataat ttaaactatg ttgattcaat aataaaaata 1500
cttgagcacc tagatattga ttttaaagta tttaataagg ttggaagaga agctgatctt 1560
aaaaccataa aaaaagcaac tgaagaaatg tcctccttta tgccagacac tataatagct 1620
ttaggtggta cccctgaaat gagctctgca aagctaatgt gggtactata tgaacatcca 1680
gaagtaaaat ttgaagatct tgcaataaaa tttatggaca taagaaagag aatatatact 1740
ttcccaaaac tcggtaaaaa ggctatgtta gttgcaatta caacttctgc tggttccggt 1800
tctgaggtta ctccttttgc tttagtaact gacaataaca ctggaaataa gtacatgtta 1860
gcagattatg aaatgacacc aaatatggca attgtagatg cagaacttat gatgaaaatg 1920
ccaaagggat taaccgctta ttcaggtata gatgcactag taaatagtat agaagcatac 1980
acatccgtat atgcttcaga atacacaaac ggactagcac tagaggcaat acgattaata 2040
tttaaatatt tgcctgaggc ttacaaaaac ggaagaacca atgaaaaagc aagagagaaa 2100
atggctcacg cttcaactat ggcaggtatg gcatccgcta atgcatttct aggtctatgt 2160
cattccatgg caataaaatt aagttcagaa cacaatattc ctagtggcat tgccaatgca 2220
ttactaatag aagaagtaat aaaatttaac gcagttgata atcctgtaaa acaagcccct 2280
tgcccacaat ataagtatcc aaacaccata tttagatatg ctcgaattgc agattatata 2340
aagcttggag gaaatactga tgaggaaaag gtagatctct taattaacaa aatacatgaa 2400
ctaaaaaaag ctttaaatat accaacttca ataaaggatg caggtgtttt ggaggaaaac 2460
ttctattcct cccttgatag aatatctgaa cttgcactag atgatcaatg cacaggcgct 2520
aatcctagat ttcctcttac aagtgagata aaagaaatgt atataaattg ttttaaaaaa 2580
caaccttaa 2589
<210> 5
<211> 10469
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 5
catggataaa aagtacagta ttggtctaga cataggaact aactctgttg ggtgggctgt 60
tataacagat gaatataaag ttccatcaaa aaaatttaaa gtattaggaa acactgatag 120
acattcaata aaaaaaaact tgataggtgc tttattattc gattcaggag agactgctga 180
agctacacgt ttaaaaagaa cagctagacg tagatataca agaagaaaaa ataggatatg 240
ttatcttcaa gaaattttta gtaatgaaat ggcaaaagtt gatgattcat tctttcacag 300
actagaagaa agtttcttag ttgaagaaga taagaagcat gaaagacacc ctatttttgg 360
taatatcgta gatgaagtag catatcatga gaagtatcca actatctatc atttaagaaa 420
gaaattagtt gattctacag ataaagctga tctgagatta atatatttag ctttagctca 480
tatgattaaa tttagaggac attttttaat agaaggtgat ttaaacccag acaacagcga 540
tgtagataaa ttatttatcc aattagttca aacttataat caattattcg aagagaatcc 600
aattaatgca agtggtgtag acgctaaggc tatattatca gctagattat caaaatctag 660
aagattagaa aatctaatag ctcaacttcc tggagaaaag aaaaatggac tttttgggaa 720
cctaatagct ctctcactcg gactaacacc aaattttaaa agcaattttg atcttgctga 780
agacgcaaag ttacaactat caaaggatac atacgatgat gatttagata atttgttagc 840
tcaaataggt gatcaatatg ctgatttgtt tcttgcagca aaaaacttaa gtgatgcaat 900
tttactatca gatatactta gagtaaatac agaaataaca aaggctcctt tatcagcaag 960
tatgattaaa cgatatgatg agcatcatca agatttaaca ttattaaagg cacttgtaag 1020
acaacaatta ccagaaaaat ataaagaaat tttctttgat caatctaaaa atggatatgc 1080
tggatatata gacggtggag caagtcaaga agagttttat aaatttataa agcctatttt 1140
agaaaaaatg gatggaactg aagaattact tgttaaactt aacagagaag atttacttag 1200
aaaacaaaga acttttgata atggttcaat tcctcaccaa attcatttag gagaattaca 1260
tgctatacta agaagacaag aagattttta tccatttctt aaagataata gagaaaaaat 1320
tgaaaaaatt ttaactttta gaataccata ttatgtagga ccacttgcaa ggggaaattc 1380
aagatttgca tggatgacta gaaaatcaga agaaactata accccgtgga attttgaaga 1440
agtagtagat aaaggagcta gtgctcaatc atttatagaa agaatgacaa attttgataa 1500
gaatcttcct aacgaaaagg ttttgccaaa gcatagcctt ctttatgagt attttacagt 1560
ttataatgag cttactaaag taaaatacgt tacagaagga atgagaaaac cagcattttt 1620
gtctggtgaa caaaagaaag caatagtaga cctattattt aaaacaaata ggaaggttac 1680
cgtaaagcaa cttaaagaag attacttcaa aaaaattgaa tgctttgata gtgttgaaat 1740
atcaggagtt gaagatagat ttaatgcttc acttggtaca tatcacgatc tcttaaaaat 1800
tataaaagat aaggattttt tagataatga agaaaatgaa gatattcttg aagatatagt 1860
attaacattg acactttttg aagatagaga aatgatagaa gaaagattaa aaacatatgc 1920
acatcttttt gatgataagg ttatgaagca acttaaaaga agaagatata caggttgggg 1980
acgtttgtca agaaagctaa ttaatggtat tagagataaa caatcaggaa agactattct 2040
cgattttctt aaatcagatg gatttgctaa tagaaacttt atgcaattaa ttcatgatga 2100
ttctcttact ttcaaagagg atattcaaaa ggctcaagtt tctggacaag gcgatagctt 2160
acacgaacac attgctaacc ttgcagggag ccccgctatc aaaaaaggaa ttttacaaac 2220
agttaaagtt gtagatgaac ttgttaaagt tatgggaaga cacaaacctg agaatatagt 2280
tatagaaatg gccagagaaa atcaaacaac acaaaaagga caaaaaaatt ctagagagag 2340
aatgaagaga attgaagaag gaataaaaga gctaggatca caaatattaa aagaacatcc 2400
agttgaaaat actcaattgc aaaatgaaaa gttatatttg tattacttac aaaatggaag 2460
agatatgtat gttgatcaag aactcgatat taatagatta agtgactatg atgttgatca 2520
tattgttcct caatcatttt taaaagatga ttcaatcgat aacaaagtat taactagatc 2580
agataaaaat agaggaaagt cagataatgt accatctgaa gaagttgtta aaaaaatgaa 2640
gaactattgg agacaacttt taaatgcaaa gctaattaca caaagaaaat ttgacaattt 2700
aacaaaagca gaaagaggag gattaagcga attagacaaa gctggattta taaaaagaca 2760
acttgttgag acaagacaaa taactaagca tgttgctcaa atacttgatt caagaatgaa 2820
tacaaaatat gatgaaaatg ataaattaat cagagaagta aaagtaataa cattaaagtc 2880
aaaattagta tcagatttca gaaaggattt tcaattttac aaagttcgtg aaataaataa 2940
ctatcatcat gctcatgatg catacttaaa tgctgttgta ggaactgctc ttattaagaa 3000
atatcctaaa ctagaaagcg aatttgttta tggagattat aaagtttatg atgtgcgcaa 3060
aatgatcgcg aaatccgaac aagaaatcgg taaggctaca gcaaaatatt tcttttatag 3120
taatataatg aattttttta agacagaaat aactttggct aatggtgaaa tcagaaaaag 3180
accacttatc gaaacaaatg gagagacagg agaaatagta tgggataaag gaagagattt 3240
tgctactgtt agaaaagtac taagtatgcc acaagtaaat atcgtaaaga aaactgaagt 3300
tcaaactgga ggtttctcta aggaatcaat tttacctaag agaaattcag ataagttaat 3360
tgcaaggaaa aaagattggg acccaaaaaa atacggtggt tttgatagtc caacagttgc 3420
ctatagtgtt cttgtagtag cgaaagttga gaaaggtaag tcaaaaaagt tgaaaagcgt 3480
aaaagaactt cttggtatca caattatgga aagatcttca tttgaaaaaa atccaattga 3540
ctttttagaa gctaagggtt ataaagaagt taaaaaggat ttaatcataa aactaccaaa 3600
gtatagtcta tttgaactcg aaaacggaag aaaacgaatg ctcgctagcg caggagaact 3660
tcaaaaagga aatgaacttg cgctgccatc aaagtatgta aatttcttat atttagcttc 3720
tcattatgag aaattaaaag gatcaccaga ggataatgaa caaaagcaac tatttgtaga 3780
acaacacaaa cattatttag atgaaataat agaacaaata tctgaatttt ctaaaagagt 3840
tatacttgcc gacgcaaatc tagataaggt gctttcagcg tataataaac acagagataa 3900
accaataaga gaacaagcag aaaacattat ccatcttttt acattaacta atcttggtgc 3960
accagctgca tttaagtact ttgatacaac aatagataga aaaagataca catctactaa 4020
agaagtatta gacgcaactt taatacatca atctattaca gggctttatg aaacaagaat 4080
tgatttaagt caactaggcg gagattaagt cgacaaagta ttgttaaaaa taactctgta 4140
gaattataaa ttagttctac agagttattt tttgacccgg gtaccgagct cgaattcgta 4200
atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat 4260
acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt 4320
aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta 4380
atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc 4440
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 4500
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 4560
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 4620
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 4680
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 4740
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 4800
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 4860
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 4920
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 4980
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 5040
cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 5100
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 5160
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 5220
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 5280
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 5340
tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc 5400
agcgatctgt ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac 5460
gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc 5520
accggctcca gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg 5580
tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag 5640
tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc 5700
acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac 5760
atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag 5820
aagtaagttg gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac 5880
tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg 5940
agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc 6000
gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact 6060
ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg 6120
atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa 6180
tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt 6240
tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 6300
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 6360
ctgccgggcc tcttgcggga tcaaaagaaa aacgaaatga tacaccaatc agtgcaaaaa 6420
aagatataat gggagataag acggttcgtg ttcgtgctga cttgcaccat atcataaaaa 6480
tcgaaacagc aaagaatggc ggaaacgtaa aagaagttat ggaaataaga cttagaagca 6540
aacttaagag tgtgttgata gtgcagtatc ttaaaatttt gtataatagg aattgaagtt 6600
aaattagatg ctaaaaattt gtaattaaga aggagtgatt acatgaacaa aaatataaaa 6660
tattctcaaa actttttaac gagtgaaaaa gtactcaacc aaataataaa acaattgaat 6720
ttaaaagaaa ccgataccgt ttacgaaatt ggaacaggta aagggcattt aacgacgaaa 6780
ctggctaaaa taagtaaaca ggtaacgtct attgaattag acagtcatct attcaactta 6840
tcgtcagaaa aattaaaact gaatactcgt gtcactttaa ttcaccaaga tattctacag 6900
tttcaattcc ctaacaaaca gaggtataaa attgttggga gtattcctta ccatttaagc 6960
acacaaatta ttaaaaaagt ggtttttgaa agccatgcgt ctgacatcta tctgattgtt 7020
gaagaaggat tctacaagcg taccttggat attcaccgaa cactagggtt gctcttgcac 7080
actcaagtct cgattcagca attgcttaag ctgccagcgg aatgctttca tcctaaacca 7140
aaagtaaaca gtgtcttaat aaaacttacc cgccatacca cagatgttcc agataaatat 7200
tggaagctat atacgtactt tgtttcaaaa tgggtcaatc gagaatatcg tcaactgttt 7260
actaaaaatc agtttcatca agcaatgaaa cacgccaaag taaacaattt aagtaccgtt 7320
acttatgagc aagtattgtc tatttttaat agttatctat tatttaacgg gaggaaataa 7380
ttctatgagt ccctaggccc aactaactca acgctagtag tggatttaat cccaaatgag 7440
ccaacagaac cagaaccaga aacagaatca gaacaagtaa cattggattt agaaatggaa 7500
gaagaaaaaa gcaatgactt cgtgtgaata atgcacgaaa tcgttgctta ttttttttta 7560
aaagcggtat actagatata acgaaacaac gaactgaata gaaacgaaaa aagagccatg 7620
acacatttat aaaatgtttg acgacatttt ataaatgcat agcccgataa gattgccaaa 7680
ccaacgctta tcagttagtc agatgaactc ttccctcgta agaagttatt taattaactt 7740
tgtttgaaga cggtatataa ccgtactatc attatatagg gaaatcagag agttttcaag 7800
tatctaagct actgaattta agaattgtta agcaatcaat cggaaatcgt ttgattgctt 7860
tttttgtatt catttataga aggtggagtt tgtatgaatc atgatgaatg taaaacttat 7920
ataaaaaata gtttattgga gataagaaaa ttagcaaata tctatacact agaaacgttt 7980
aagaaagagt tagaaaagag aaatatctac ttagaaacaa aatcagataa gtatttttct 8040
tcggaggggg aagattatat atataagtta atagaaaata acaaaataat ttattcgatt 8100
agtggaaaaa aattgactta taaaggaaaa aaatcttttt caaaacatgc aatattgaaa 8160
cagttgaatg aaaaagcaaa ccaagttaat taaacaacct attttatagg atttatagga 8220
aaggagaaca gctgaatgaa tatccctttt gttgtagaaa ctgtgcttca tgacggcttg 8280
ttaaagtaca aatttaaaaa tagtaaaatt cgctcaatca ctaccaagcc aggtaaaagc 8340
aaaggggcta tttttgcgta tcgctcaaaa tcaagcatga ttggcggtcg tggtgttgtt 8400
ctgacttccg aggaagcgat tcaagaaaat caagatacat ttacacattg gacacccaac 8460
gtttatcgtt atggaacgta tgcagacgaa aaccgttcat acacgaaagg acattctgaa 8520
aacaatttaa gacaaatcaa taccttcttt attgattttg atattcacac ggcaaaagaa 8580
actatttcag caagcgatat tttaacaacc gctattgatt taggttttat gcctactatg 8640
attatcaaat ctgataaagg ttatcaagca tattttgttt tagaaacgcc agtctatgtg 8700
acttcaaaat cagaatttaa atctgtcaaa gcagccaaaa taatttcgca aaatatccga 8760
gaatattttg gaaagtcttt gccagttgat ctaacgtgta atcattttgg tattgctcgc 8820
ataccaagaa cggacaatgt agaatttttt gatcctaatt accgttattc tttcaaagaa 8880
tggcaagatt ggtctttcaa acaaacagat aataagggct ttactcgttc aagtctaacg 8940
gttttaagcg gtacagaagg caaaaaacaa gtagatgaac cctggtttaa tctcttattg 9000
cacgaaacga aattttcagg agaaaagggt ttaatagggc gtaataacgt catgtttacc 9060
ctctctttag cctactttag ttcaggctat tcaatcgaaa cgtgcgaata taatatgttt 9120
gagtttaata atcgattaga tcaaccctta gaagaaaaag aagtaatcaa aattgttaga 9180
agtgcctatt cagaaaacta tcaaggggct aatagggaat acattaccat tctttgcaaa 9240
gcttgggtat caagtgattt aaccagtaaa gatttatttg tccgtcaagg gtggtttaaa 9300
ttcaagaaaa aaagaagcga acgtcaacgt gttcatttgt cagaatggaa agaagattta 9360
atggcttata ttagcgaaaa aagcgatgta tacaagcctt atttagtgac gaccaaaaaa 9420
gagattagag aagtgctagg cattcctgaa cggacattag ataaattgct gaaggtactg 9480
aaggcgaatc aggaaatttt ctttaagatt aaaccaggaa gaaatggtgg cattcaactt 9540
gctagtgtta aatcattgtt gctatcgatc attaaagtaa aaaaagaaga aaaagaaagc 9600
tatataaagg cgctgacaaa ttcttttgac ttagagcata cattcattca agagacttta 9660
aacaagctag cagaacgccc taaaacggac acacaactcg atttgtttag ctatgataca 9720
ggctgaaaat aaaacccgca ctatgccatt acatttatat ctatgatacg tgtttgtttt 9780
ttctttgctg tttagcgaat gattagcaga aatatacaga gtaagatttt aattaattat 9840
tagggggaga aggagagagt agcccgaaaa cttttagttg gcttggactg aacgaagtga 9900
gggaaaggct actaaaacgt cgaggggcag tgagagcgaa gcgaacactt gattttttaa 9960
ttttctatct tttataggtc attagagtat acttatttgt cctataaact atttagcagc 10020
ataatagatt tattgaatag gtcatttaag ttgagcatat tagaggagga aaatcttgga 10080
gaaatatttg aagaacccga ttacatggat tggattagtt cttgtggtta cgtggttttt 10140
aactaaaagt agtgaatttt tgatttttgg tgtgtgtgtc ttgttgttag tatttgctag 10200
tcaaagtgat taaatagaat tctagcgcca ttcgccattc aggctgcgca actgttggga 10260
agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg gatgtgctgc 10320
aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta aaacgacggc 10380
cagtgccaag cttgcatgcc tgcaggcctc gagtatattg ataaaaataa taatagtggg 10440
tataattaag ttgttaggag gttagttac 10469
<210> 6
<211> 8876
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 6
tcgagactct atcattgata gagtttgaaa ctctatcatt gatagagtat aatatctttg 60
ttcattagag cgataaactt gaatttgaga gggaacttcc atggataaaa agtacagtat 120
tggtctagac ataggaacta actctgttgg gtgggctgtt ataacagatg aatataaagt 180
tccatcaaaa aaatttaaag tattaggaaa cactgataga cattcaataa aaaaaaactt 240
gataggtgct ttattattcg attcaggaga gactgctgaa gctacacgtt taaaaagaac 300
agctagacgt agatatacaa gaagaaaaaa taggatatgt tatcttcaag aaatttttag 360
taatgaaatg gcaaaagttg atgattcatt ctttcacaga ctagaagaaa gtttcttagt 420
tgaagaagat aagaagcatg aaagacaccc tatttttggt aatatcgtag atgaagtagc 480
atatcatgag aagtatccaa ctatctatca tttaagaaag aaattagttg attctacaga 540
taaagctgat ctgagattaa tatatttagc tttagctcat atgattaaat ttagaggaca 600
ttttttaata gaaggtgatt taaacccaga caacagcgat gtagataaat tatttatcca 660
attagttcaa acttataatc aattattcga agagaatcca attaatgcaa gtggtgtaga 720
cgctaaggct atattatcag ctagattatc aaaatctaga agattagaaa atctaatagc 780
tcaacttcct ggagaaaaga aaaatggact ttttgggaac ctaatagctc tctcactcgg 840
actaacacca aattttaaaa gcaattttga tcttgctgaa gacgcaaagt tacaactatc 900
aaaggataca tacgatgatg atttagataa tttgttagct caaataggtg atcaatatgc 960
tgatttgttt cttgcagcaa aaaacttaag tgatgcaatt ttactatcag atatacttag 1020
agtaaataca gaaataacaa aggctccttt atcagcaagt atgattaaac gatatgatga 1080
gcatcatcaa gatttaacat tattaaaggc acttgtaaga caacaattac cagaaaaata 1140
taaagaaatt ttctttgatc aatctaaaaa tggatatgct ggatatatag acggtggagc 1200
aagtcaagaa gagttttata aatttataaa gcctatttta gaaaaaatgg atggaactga 1260
agaattactt gttaaactta acagagaaga tttacttaga aaacaaagaa cttttgataa 1320
tggttcaatt cctcaccaaa ttcatttagg agaattacat gctatactaa gaagacaaga 1380
agatttttat ccatttctta aagataatag agaaaaaatt gaaaaaattt taacttttag 1440
aataccatat tatgtaggac cacttgcaag gggaaattca agatttgcat ggatgactag 1500
aaaatcagaa gaaactataa ccccgtggaa ttttgaagaa gtagtagata aaggagctag 1560
tgctcaatca tttatagaaa gaatgacaaa ttttgataag aatcttccta acgaaaaggt 1620
tttgccaaag catagccttc tttatgagta ttttacagtt tataatgagc ttactaaagt 1680
aaaatacgtt acagaaggaa tgagaaaacc agcatttttg tctggtgaac aaaagaaagc 1740
aatagtagac ctattattta aaacaaatag gaaggttacc gtaaagcaac ttaaagaaga 1800
ttacttcaaa aaaattgaat gctttgatag tgttgaaata tcaggagttg aagatagatt 1860
taatgcttca cttggtacat atcacgatct cttaaaaatt ataaaagata aggatttttt 1920
agataatgaa gaaaatgaag atattcttga agatatagta ttaacattga cactttttga 1980
agatagagaa atgatagaag aaagattaaa aacatatgca catctttttg atgataaggt 2040
tatgaagcaa cttaaaagaa gaagatatac aggttgggga cgtttgtcaa gaaagctaat 2100
taatggtatt agagataaac aatcaggaaa gactattctc gattttctta aatcagatgg 2160
atttgctaat agaaacttta tgcaattaat tcatgatgat tctcttactt tcaaagagga 2220
tattcaaaag gctcaagttt ctggacaagg cgatagctta cacgaacaca ttgctaacct 2280
tgcagggagc cccgctatca aaaaaggaat tttacaaaca gttaaagttg tagatgaact 2340
tgttaaagtt atgggaagac acaaacctga gaatatagtt atagaaatgg ccagagaaaa 2400
tcaaacaaca caaaaaggac aaaaaaattc tagagagaga atgaagagaa ttgaagaagg 2460
aataaaagag ctaggatcac aaatattaaa agaacatcca gttgaaaata ctcaattgca 2520
aaatgaaaag ttatatttgt attacttaca aaatggaaga gatatgtatg ttgatcaaga 2580
actcgatatt aatagattaa gtgactatga tgttgatcat attgttcctc aatcattttt 2640
aaaagatgat tcaatcgata acaaagtatt aactagatca gataaaaata gaggaaagtc 2700
agataatgta ccatctgaag aagttgttaa aaaaatgaag aactattgga gacaactttt 2760
aaatgcaaag ctaattacac aaagaaaatt tgacaattta acaaaagcag aaagaggagg 2820
attaagcgaa ttagacaaag ctggatttat aaaaagacaa cttgttgaga caagacaaat 2880
aactaagcat gttgctcaaa tacttgattc aagaatgaat acaaaatatg atgaaaatga 2940
taaattaatc agagaagtaa aagtaataac attaaagtca aaattagtat cagatttcag 3000
aaaggatttt caattttaca aagttcgtga aataaataac tatcatcatg ctcatgatgc 3060
atacttaaat gctgttgtag gaactgctct tattaagaaa tatcctaaac tagaaagcga 3120
atttgtttat ggagattata aagtttatga tgtgcgcaaa atgatcgcga aatccgaaca 3180
agaaatcggt aaggctacag caaaatattt cttttatagt aatataatga atttttttaa 3240
gacagaaata actttggcta atggtgaaat cagaaaaaga ccacttatcg aaacaaatgg 3300
agagacagga gaaatagtat gggataaagg aagagatttt gctactgtta gaaaagtact 3360
aagtatgcca caagtaaata tcgtaaagaa aactgaagtt caaactggag gtttctctaa 3420
ggaatcaatt ttacctaaga gaaattcaga taagttaatt gcaaggaaaa aagattggga 3480
cccaaaaaaa tacggtggtt ttgatagtcc aacagttgcc tatagtgttc ttgtagtagc 3540
gaaagttgag aaaggtaagt caaaaaagtt gaaaagcgta aaagaacttc ttggtatcac 3600
aattatggaa agatcttcat ttgaaaaaaa tccaattgac tttttagaag ctaagggtta 3660
taaagaagtt aaaaaggatt taatcataaa actaccaaag tatagtctat ttgaactcga 3720
aaacggaaga aaacgaatgc tcgctagcgc aggagaactt caaaaaggaa atgaacttgc 3780
gctgccatca aagtatgtaa atttcttata tttagcttct cattatgaga aattaaaagg 3840
atcaccagag gataatgaac aaaagcaact atttgtagaa caacacaaac attatttaga 3900
tgaaataata gaacaaatat ctgaattttc taaaagagtt atacttgccg acgcaaatct 3960
agataaggtg ctttcagcgt ataataaaca cagagataaa ccaataagag aacaagcaga 4020
aaacattatc catcttttta cattaactaa tcttggtgca ccagctgcat ttaagtactt 4080
tgatacaaca atagatagaa aaagatacac atctactaaa gaagtattag acgcaacttt 4140
aatacatcaa tctattacag ggctttatga aacaagaatt gatttaagtc aactaggcgg 4200
agattaagtc gacaaagtat tgttaaaaat aactctgtag aattataaat tagttctaca 4260
gagttatttt ttgacccggg tatattgata aaaataataa tagtgggtat aattaagttg 4320
ttaggaggtt agttagaatg atgtcaagat tagataaaag taaagtgatt aacagcgcat 4380
tagagctgct taatgaggtc ggaatcgaag gtttaacaac ccgtaaactc gcccagaagc 4440
taggtgtaga gcagcctaca ttgtattggc atgtaaaaaa taagcgggct ttgctcgacg 4500
ccttagccat tgagatgtta gataggcacc atactcactt ttgcccttta gaaggggaaa 4560
gctggcaaga ttttttacgt aataacgcta aaagttttag atgtgcttta ctaagtcatc 4620
gcgatggagc aaaagtacat ttaggtacac ggcctacaga aaaacagtat gaaactctcg 4680
aaaatcaatt agccttttta tgccaacaag gtttttcact agagaatgca ttatatgcac 4740
tcagcgctgt ggggcatttt actttaggtt gcgtattgga agatcaagag catcaagtcg 4800
ctaaagaaga aagggaaaca cctactactg atagtatgcc gccattatta cgacaagcta 4860
tcgaattatt tgatcaccaa ggtgcagagc cagccttctt attcggcctt gaattgatca 4920
tatgcggatt agaaaaacaa cttaaatgtg aaagtgggtc ttaaaagcag cataaccttt 4980
ttccgtgatg gtaacttcac ggtaaccaag atgtcgagtt gagctcgaat tcgtaatcat 5040
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 5100
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 5160
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 5220
tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 5280
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 5340
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 5400
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 5460
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 5520
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 5580
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 5640
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 5700
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 5760
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 5820
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 5880
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 5940
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 6000
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 6060
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 6120
ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 6180
atgagtaaac ttggtctgac agttaccagg tccactgccg ggcctcttgc gggatcaaaa 6240
gaaaaacgaa atgatacacc aatcagtgca aaaaaagata taatgggaga taagacggtt 6300
cgtgttcgtg ctgacttgca ccatatcata aaaatcgaaa cagcaaagaa tggcggaaac 6360
gtaaaagaag ttatggaaat aagacttaga agcaaactta agagtgtgtt gatagtgcag 6420
tatcttaaaa ttttgtataa taggaattga agttaaatta gatgctaaaa atttgtaatt 6480
aagaaggagt gattacatga acaaaaatat aaaatattct caaaactttt taacgagtga 6540
aaaagtactc aaccaaataa taaaacaatt gaatttaaaa gaaaccgata ccgtttacga 6600
aattggaaca ggtaaagggc atttaacgac gaaactggct aaaataagta aacaggtaac 6660
gtctattgaa ttagacagtc atctattcaa cttatcgtca gaaaaattaa aactgaatac 6720
tcgtgtcact ttaattcacc aagatattct acagtttcaa ttccctaaca aacagaggta 6780
taaaattgtt gggagtattc cttaccattt aagcacacaa attattaaaa aagtggtttt 6840
tgaaagccat gcgtctgaca tctatctgat tgttgaagaa ggattctaca agcgtacctt 6900
ggatattcac cgaacactag ggttgctctt gcacactcaa gtctcgattc agcaattgct 6960
taagctgcca gcggaatgct ttcatcctaa accaaaagta aacagtgtct taataaaact 7020
tacccgccat accacagatg ttccagataa atattggaag ctatatacgt actttgtttc 7080
aaaatgggtc aatcgagaat atcgtcaact gtttactaaa aatcagtttc atcaagcaat 7140
gaaacacgcc aaagtaaaca atttaagtac cgttacttat gagcaagtat tgtctatttt 7200
taatagttat ctattattta acgggaggaa ataattctat gagtccctag gcaggcctcc 7260
gccattattt ttttgaacaa ttgacaattc atttcttatt ttttattaag tgatagtcaa 7320
aaggcataac agtgctgaat agaaagaaat ttacagaaaa gaaaattata gaatttagta 7380
tgattaatta tactcattta tgaatgttta attgaataca aaaaaaaata cttgttatgt 7440
attcaattac gggttaaaat atagacaagt tgaaaaattt aataaaaaaa taagtcctca 7500
gctcttatat attaagctac caacttagta tataagccaa aacttaaatg tgctaccaac 7560
acatcaagcc gttagagaac tctatctata gcaatatttc aaatgtaccg acatacaaga 7620
gaaacattaa ctatatatat tcaatttatg agattatctt aacagatata aatgtaaatt 7680
gcaataagta agatttagaa gtttatagcc tttgtgtatt ggaagcagta cgcaaaggct 7740
tttttatttg ataaaaatta gaagtatatt tattttttca taattaattt atgaaaatga 7800
aagggggtga gcaaagtgac agaggaaagc agtatcttat caaataacaa ggtattagca 7860
atatcattat tgactttagc agtaaacatt atgactttta tagtgcttgt agctaagtag 7920
tacgaaaggg ggagctttaa aaagctcctt ggaatacata gaattcataa attaatttat 7980
gaaaagaagg gcgtatatga aaacttgtaa aaattgcaaa gagtttatta aagatactga 8040
aatatgcaaa atacattcgt tgatgattca tgataaaaca gtagcaacct attgcagtaa 8100
atacaatgag tcaagatgtt tacataaagg gaaagtccaa tgtattaatt gttcaaagat 8160
gaaccgatat ggatggtgtg ccataaaaat gagatgtttt acagaggaag aacagaaaaa 8220
agaacgtaca tgcattaaat attatgcaag gagctttaaa aaagctcatg taaagaagag 8280
taaaaagaaa aaataattta tttattaatt taatattgag agtgccgaca cagtatgcac 8340
taaaaaatat atctgtggtg tagtgagccg atacaaaagg atagtcactc gcattttcat 8400
aatacatctt atgttatgat tatgtgtcgg tgggacttca cgacgaaaac ccacaataaa 8460
aaaagagttc ggggtagggt taagcatagt tgaggcaact aaacaatcaa gctaggatat 8520
gcagtagcag accgtaaggt cgttgtttag gtgtgttgta atacatacgc tattaagatg 8580
taaaaatacg gataccaatg aagggaaaag tataattttt ggatgtagtt tgtttgttca 8640
tctatgggca aactacgtcc aaagccgttt ccaaatctgc taaaaagtat atcctttcta 8700
aaatcaaagt caagtatgaa atcataaata aagtttaatt ttgaagttat tatgatatta 8760
tgtttttcta ttaaaataaa ttaagtatat agaatagttt aataatagta tatacttaat 8820
gtgataagtg tctgacagtg tcacagaaag gatgattgtt atggattata agcggc 8876
<210> 7
<211> 8874
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 7
catggataaa aagtacagta ttggtctaga cataggaact aactctgttg ggtgggctgt 60
tataacagat gaatataaag ttccatcaaa aaaatttaaa gtattaggaa acactgatag 120
acattcaata aaaaaaaact tgataggtgc tttattattc gattcaggag agactgctga 180
agctacacgt ttaaaaagaa cagctagacg tagatataca agaagaaaaa ataggatatg 240
ttatcttcaa gaaattttta gtaatgaaat ggcaaaagtt gatgattcat tctttcacag 300
actagaagaa agtttcttag ttgaagaaga taagaagcat gaaagacacc ctatttttgg 360
taatatcgta gatgaagtag catatcatga gaagtatcca actatctatc atttaagaaa 420
gaaattagtt gattctacag ataaagctga tctgagatta atatatttag ctttagctca 480
tatgattaaa tttagaggac attttttaat agaaggtgat ttaaacccag acaacagcga 540
tgtagataaa ttatttatcc aattagttca aacttataat caattattcg aagagaatcc 600
aattaatgca agtggtgtag acgctaaggc tatattatca gctagattat caaaatctag 660
aagattagaa aatctaatag ctcaacttcc tggagaaaag aaaaatggac tttttgggaa 720
cctaatagct ctctcactcg gactaacacc aaattttaaa agcaattttg atcttgctga 780
agacgcaaag ttacaactat caaaggatac atacgatgat gatttagata atttgttagc 840
tcaaataggt gatcaatatg ctgatttgtt tcttgcagca aaaaacttaa gtgatgcaat 900
tttactatca gatatactta gagtaaatac agaaataaca aaggctcctt tatcagcaag 960
tatgattaaa cgatatgatg agcatcatca agatttaaca ttattaaagg cacttgtaag 1020
acaacaatta ccagaaaaat ataaagaaat tttctttgat caatctaaaa atggatatgc 1080
tggatatata gacggtggag caagtcaaga agagttttat aaatttataa agcctatttt 1140
agaaaaaatg gatggaactg aagaattact tgttaaactt aacagagaag atttacttag 1200
aaaacaaaga acttttgata atggttcaat tcctcaccaa attcatttag gagaattaca 1260
tgctatacta agaagacaag aagattttta tccatttctt aaagataata gagaaaaaat 1320
tgaaaaaatt ttaactttta gaataccata ttatgtagga ccacttgcaa ggggaaattc 1380
aagatttgca tggatgacta gaaaatcaga agaaactata accccgtgga attttgaaga 1440
agtagtagat aaaggagcta gtgctcaatc atttatagaa agaatgacaa attttgataa 1500
gaatcttcct aacgaaaagg ttttgccaaa gcatagcctt ctttatgagt attttacagt 1560
ttataatgag cttactaaag taaaatacgt tacagaagga atgagaaaac cagcattttt 1620
gtctggtgaa caaaagaaag caatagtaga cctattattt aaaacaaata ggaaggttac 1680
cgtaaagcaa cttaaagaag attacttcaa aaaaattgaa tgctttgata gtgttgaaat 1740
atcaggagtt gaagatagat ttaatgcttc acttggtaca tatcacgatc tcttaaaaat 1800
tataaaagat aaggattttt tagataatga agaaaatgaa gatattcttg aagatatagt 1860
attaacattg acactttttg aagatagaga aatgatagaa gaaagattaa aaacatatgc 1920
acatcttttt gatgataagg ttatgaagca acttaaaaga agaagatata caggttgggg 1980
acgtttgtca agaaagctaa ttaatggtat tagagataaa caatcaggaa agactattct 2040
cgattttctt aaatcagatg gatttgctaa tagaaacttt atgcaattaa ttcatgatga 2100
ttctcttact ttcaaagagg atattcaaaa ggctcaagtt tctggacaag gcgatagctt 2160
acacgaacac attgctaacc ttgcagggag ccccgctatc aaaaaaggaa ttttacaaac 2220
agttaaagtt gtagatgaac ttgttaaagt tatgggaaga cacaaacctg agaatatagt 2280
tatagaaatg gccagagaaa atcaaacaac acaaaaagga caaaaaaatt ctagagagag 2340
aatgaagaga attgaagaag gaataaaaga gctaggatca caaatattaa aagaacatcc 2400
agttgaaaat actcaattgc aaaatgaaaa gttatatttg tattacttac aaaatggaag 2460
agatatgtat gttgatcaag aactcgatat taatagatta agtgactatg atgttgatca 2520
tattgttcct caatcatttt taaaagatga ttcaatcgat aacaaagtat taactagatc 2580
agataaaaat agaggaaagt cagataatgt accatctgaa gaagttgtta aaaaaatgaa 2640
gaactattgg agacaacttt taaatgcaaa gctaattaca caaagaaaat ttgacaattt 2700
aacaaaagca gaaagaggag gattaagcga attagacaaa gctggattta taaaaagaca 2760
acttgttgag acaagacaaa taactaagca tgttgctcaa atacttgatt caagaatgaa 2820
tacaaaatat gatgaaaatg ataaattaat cagagaagta aaagtaataa cattaaagtc 2880
aaaattagta tcagatttca gaaaggattt tcaattttac aaagttcgtg aaataaataa 2940
ctatcatcat gctcatgatg catacttaaa tgctgttgta ggaactgctc ttattaagaa 3000
atatcctaaa ctagaaagcg aatttgttta tggagattat aaagtttatg atgtgcgcaa 3060
aatgatcgcg aaatccgaac aagaaatcgg taaggctaca gcaaaatatt tcttttatag 3120
taatataatg aattttttta agacagaaat aactttggct aatggtgaaa tcagaaaaag 3180
accacttatc gaaacaaatg gagagacagg agaaatagta tgggataaag gaagagattt 3240
tgctactgtt agaaaagtac taagtatgcc acaagtaaat atcgtaaaga aaactgaagt 3300
tcaaactgga ggtttctcta aggaatcaat tttacctaag agaaattcag ataagttaat 3360
tgcaaggaaa aaagattggg acccaaaaaa atacggtggt tttgatagtc caacagttgc 3420
ctatagtgtt cttgtagtag cgaaagttga gaaaggtaag tcaaaaaagt tgaaaagcgt 3480
aaaagaactt cttggtatca caattatgga aagatcttca tttgaaaaaa atccaattga 3540
ctttttagaa gctaagggtt ataaagaagt taaaaaggat ttaatcataa aactaccaaa 3600
gtatagtcta tttgaactcg aaaacggaag aaaacgaatg ctcgctagcg caggagaact 3660
tcaaaaagga aatgaacttg cgctgccatc aaagtatgta aatttcttat atttagcttc 3720
tcattatgag aaattaaaag gatcaccaga ggataatgaa caaaagcaac tatttgtaga 3780
acaacacaaa cattatttag atgaaataat agaacaaata tctgaatttt ctaaaagagt 3840
tatacttgcc gacgcaaatc tagataaggt gctttcagcg tataataaac acagagataa 3900
accaataaga gaacaagcag aaaacattat ccatcttttt acattaacta atcttggtgc 3960
accagctgca tttaagtact ttgatacaac aatagataga aaaagataca catctactaa 4020
agaagtatta gacgcaactt taatacatca atctattaca gggctttatg aaacaagaat 4080
tgatttaagt caactaggcg gagattaagt cgacaaagta ttgttaaaaa taactctgta 4140
gaattataaa ttagttctac agagttattt tttgacccgg gtatattgat aaaaataata 4200
atagtgggta taattaagtt gttaggaggt tagttagaat gatgtcaaga ttagataaaa 4260
gtaaagtgat taacagcgca ttagagctgc ttaatgaggt cggaatcgaa ggtttaacaa 4320
cccgtaaact cgcccagaag ctaggtgtag agcagcctac attgtattgg catgtaaaaa 4380
ataagcgggc tttgctcgac gccttagcca ttgagatgtt agataggcac catactcact 4440
tttgcccttt agaaggggaa agctggcaag attttttacg taataacgct aaaagtttta 4500
gatgtgcttt actaagtcat cgcgatggag caaaagtaca tttaggtaca cggcctacag 4560
aaaaacagta tgaaactctc gaaaatcaat tagccttttt atgccaacaa ggtttttcac 4620
tagagaatgc attatatgca ctcagcgctg tggggcattt tactttaggt tgcgtattgg 4680
aagatcaaga gcatcaagtc gctaaagaag aaagggaaac acctactact gatagtatgc 4740
cgccattatt acgacaagct atcgaattat ttgatcacca aggtgcagag ccagccttct 4800
tattcggcct tgaattgatc atatgcggat tagaaaaaca acttaaatgt gaaagtgggt 4860
cttaaaagca gcataacctt tttccgtgat ggtaacttca cggtaaccaa gatgtcgagt 4920
tgagctcgaa ttcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 4980
caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 5040
tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 5100
cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 5160
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 5220
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 5280
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 5340
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 5400
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 5460
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 5520
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 5580
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 5640
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 5700
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 5760
ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 5820
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 5880
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 5940
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 6000
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 6060
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccag gtccactgcc 6120
gggcctcttg cgggatcaaa agaaaaacga aatgatacac caatcagtgc aaaaaaagat 6180
ataatgggag ataagacggt tcgtgttcgt gctgacttgc accatatcat aaaaatcgaa 6240
acagcaaaga atggcggaaa cgtaaaagaa gttatggaaa taagacttag aagcaaactt 6300
aagagtgtgt tgatagtgca gtatcttaaa attttgtata ataggaattg aagttaaatt 6360
agatgctaaa aatttgtaat taagaaggag tgattacatg aacaaaaata taaaatattc 6420
tcaaaacttt ttaacgagtg aaaaagtact caaccaaata ataaaacaat tgaatttaaa 6480
agaaaccgat accgtttacg aaattggaac aggtaaaggg catttaacga cgaaactggc 6540
taaaataagt aaacaggtaa cgtctattga attagacagt catctattca acttatcgtc 6600
agaaaaatta aaactgaata ctcgtgtcac tttaattcac caagatattc tacagtttca 6660
attccctaac aaacagaggt ataaaattgt tgggagtatt ccttaccatt taagcacaca 6720
aattattaaa aaagtggttt ttgaaagcca tgcgtctgac atctatctga ttgttgaaga 6780
aggattctac aagcgtacct tggatattca ccgaacacta gggttgctct tgcacactca 6840
agtctcgatt cagcaattgc ttaagctgcc agcggaatgc tttcatccta aaccaaaagt 6900
aaacagtgtc ttaataaaac ttacccgcca taccacagat gttccagata aatattggaa 6960
gctatatacg tactttgttt caaaatgggt caatcgagaa tatcgtcaac tgtttactaa 7020
aaatcagttt catcaagcaa tgaaacacgc caaagtaaac aatttaagta ccgttactta 7080
tgagcaagta ttgtctattt ttaatagtta tctattattt aacgggagga aataattcta 7140
tgagtcccta ggcaggcctc cgccattatt tttttgaaca attgacaatt catttcttat 7200
tttttattaa gtgatagtca aaaggcataa cagtgctgaa tagaaagaaa tttacagaaa 7260
agaaaattat agaatttagt atgattaatt atactcattt atgaatgttt aattgaatac 7320
aaaaaaaaat acttgttatg tattcaatta cgggttaaaa tatagacaag ttgaaaaatt 7380
taataaaaaa ataagtcctc agctcttata tattaagcta ccaacttagt atataagcca 7440
aaacttaaat gtgctaccaa cacatcaagc cgttagagaa ctctatctat agcaatattt 7500
caaatgtacc gacatacaag agaaacatta actatatata ttcaatttat gagattatct 7560
taacagatat aaatgtaaat tgcaataagt aagatttaga agtttatagc ctttgtgtat 7620
tggaagcagt acgcaaaggc ttttttattt gataaaaatt agaagtatat ttattttttc 7680
ataattaatt tatgaaaatg aaagggggtg agcaaagtga cagaggaaag cagtatctta 7740
tcaaataaca aggtattagc aatatcatta ttgactttag cagtaaacat tatgactttt 7800
atagtgcttg tagctaagta gtacgaaagg gggagcttta aaaagctcct tggaatacat 7860
agaattcata aattaattta tgaaaagaag ggcgtatatg aaaacttgta aaaattgcaa 7920
agagtttatt aaagatactg aaatatgcaa aatacattcg ttgatgattc atgataaaac 7980
agtagcaacc tattgcagta aatacaatga gtcaagatgt ttacataaag ggaaagtcca 8040
atgtattaat tgttcaaaga tgaaccgata tggatggtgt gccataaaaa tgagatgttt 8100
tacagaggaa gaacagaaaa aagaacgtac atgcattaaa tattatgcaa ggagctttaa 8160
aaaagctcat gtaaagaaga gtaaaaagaa aaaataattt atttattaat ttaatattga 8220
gagtgccgac acagtatgca ctaaaaaata tatctgtggt gtagtgagcc gatacaaaag 8280
gatagtcact cgcattttca taatacatct tatgttatga ttatgtgtcg gtgggacttc 8340
acgacgaaaa cccacaataa aaaaagagtt cggggtaggg ttaagcatag ttgaggcaac 8400
taaacaatca agctaggata tgcagtagca gaccgtaagg tcgttgttta ggtgtgttgt 8460
aatacatacg ctattaagat gtaaaaatac ggataccaat gaagggaaaa gtataatttt 8520
tggatgtagt ttgtttgttc atctatgggc aaactacgtc caaagccgtt tccaaatctg 8580
ctaaaaagta tatcctttct aaaatcaaag tcaagtatga aatcataaat aaagtttaat 8640
tttgaagtta ttatgatatt atgtttttct attaaaataa attaagtata tagaatagtt 8700
taataatagt atatacttaa tgtgataagt gtctgacagt gtcacagaaa ggatgattgt 8760
tatggattat aagcggctcg agtccctatc agtgatagat tgaaactcta tcattgatag 8820
agtataatat ctttgttcat tagagcgata aacttgaatt tgagagggaa cttc 8874
<210> 8
<211> 4938
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 8
tcgactctag aggatccccg ggtaccgagc tcgaattcgt aatcatggtc atagctgttt 60
cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 120
tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 180
cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 240
gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 300
tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 360
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 420
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 480
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 540
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 600
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 660
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 720
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 780
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 840
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag aacagtattt 900
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 960
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1020
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1080
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1140
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1200
tctgacagtt accaaagcta gcttaatact agtatatact taatgtgata agtgtctgac 1260
agctgaccgg tctaaagagg tccctagcgc ctacggggaa tttgtatcga taaggggtac 1320
aaattcccac taagcgctcg gccggggatc gatccccggg tacgtacccg gcagtttttc 1380
tttttcggca agtgttcaag aagttattaa gtcgggagtg cagtcgaagt gggcaagttg 1440
aaaaattcac aaaaatgtgg tataatatct ttgttcatta gagcgataaa cttgaatttg 1500
agagggaact tagatggtat ttgaaaaaat tgataaaaat agttggaaca gaaaagagta 1560
ttttgaccac tactttgcaa gtgtaccttg tacctacagc atgaccgtta aagtggatat 1620
cacacaaata aaggaaaagg gaatgaaact atatcctgca atgctttatt atattgcaat 1680
gattgtaaac cgccattcag agtttaggac ggcaatcaat caagatggtg aattggggat 1740
atatgatgag atgataccaa gctatacaat atttcacaat gatactgaaa cattttccag 1800
cctttggact gagtgtaagt ctgactttaa atcattttta gcagattatg aaagtgatac 1860
gcaacggtat ggaaacaatc atagaatgga aggaaagcca aatgctccgg aaaacatttt 1920
taatgtatct atgataccgt ggtcaacctt cgatggcttt aatctgaatt tgcagaaagg 1980
atatgattat ttgattccta tttttactat ggggaaatat tataaagaag ataacaaaat 2040
tatacttcct ttggcaattc aagttcatca cgcagtatgt gacggatttc acatttgccg 2100
ttttgtaaac gaattgcagg aattgataaa tagttaactt caggtttgtc tgtaactaaa 2160
aactagtatt taacctagga tcaaaaaaat ttccaataat cccactctaa gccacaaaca 2220
cgccctataa aatcccgctt taatcccact ttgagacaca tgtaatatta ctttacgccc 2280
tagtatagtg ataatttttt acattcaatg ccacgcaaaa aaataaaggg gcactataat 2340
aaaagttcct tcggaactaa ctaaagtaaa aaattatctt tacaacctcc ccaaaaaaaa 2400
gaacaggtac aaagtaccct ataatacaag cgtaaaaaaa atgagggtaa aaataaaaaa 2460
ataaaaaaat aaaaaaataa aaaaataaaa aaataaaaaa ataaaaaaat ataaaaataa 2520
aaaaatataa aaataaaaaa atataaaaat aaaaaaataa aaaaatataa aaataaaaaa 2580
ataaaaaaat ataaaaatat tttttattta aagtttgaaa aaaatttttt tatattatat 2640
aatctttgaa gaaaagaata taaaaaatga gcctttataa aagcccattt tttttcatat 2700
acgtaatatg acgttctaat gtttttattg gtacttctaa cattagagta atttctttat 2760
ttttaaagcc tttttcttta agggctttta ttttttttct taatacattt aattcctctt 2820
tttttgttgc ttttccttta gcttttaatt gctcttgata atttttttta cctctaatat 2880
tttctcttct cttatattcc tttttagaaa ttattattgt catatatttt tgttcttctt 2940
ctgtaatttc taataactct ataagagttt cattcttata cttatattgc ttatttttat 3000
ctaaataaca tctttcagca cttctagttg ctcttataac ttctctttca cttaaatgtt 3060
gtctaaacat actattaagt tctaaaacat catttaatgc cttctcaatg tcttctgtaa 3120
agctacaaag ataatatcta tataaaaata atataagctc tctgtgtcct tttaaatcat 3180
attctcttag ttcacaaagt tttattatgt cttgtattct tccataatat aaacttcttt 3240
ctctataaat ataatttatt ttgcttggtc tacccttttt cctttcatat ggttttaatt 3300
caggtaaaaa tccattttgt atttctctta agtcataaat atattcgtac tcatctaata 3360
tattgactac tgtttttgat ttagagttta tacttcctgg aactcttaat attctcgttg 3420
catctaaggc ttgtctatct gctccaaagt attttaattg attatataaa tattcttgaa 3480
ccgctttcca taatggtaat gctttactag gtactgcatt tattatccat attaaataca 3540
ttcctcttcc actatctatt acatagtttg gtataggaat actttgatta aaataattct 3600
tttctaagtc cattaatacc tggtctttag ttttgccagt tttataataa tccaagtcta 3660
taaacagtgt atttaactct tttatatttt ctaatcgcct acacggctta taaaaggtat 3720
ttagagttat atagatattt tcatcactca tatctaaatc ttttaattca gcgtatttat 3780
agtgccattg gctatatcct tttttatcta taacgctcct ggttatccac cctttacttc 3840
tactatgaat attatctata tagttctttt tattcagctt taatgcgttt ctcacttatt 3900
cacctcccct tctgtaaaac taagaaaatt atatcatatt ttcaataatt attaactatt 3960
cttaaactct taataaaaaa tagagtaagt ccccaattga aacttaatct attttttatg 4020
ttttaattta ttatttttat taaaatattt taaactaaat taaatgattc tttttaattt 4080
tttactattt cattccataa tatattacta taattattta caaataatat ttcttcattt 4140
gtaatattta gatgatttac taattttagt ttttatatat taaataatta atgtataatt 4200
tatataaaaa atcaaaggag cttataaatt atgattattt ccaaagatac taaagattta 4260
atttttttca attttaacaa tactttttgt aatattatgt ttaaatttaa ttgtattttt 4320
ttcatataat aaagccgttg aagtaaacca atccattttc cttatgatgt tattattaaa 4380
tttaagtttt ataataatat ctttattata tttattgttt ttaaaaaaac tagtgaaatt 4440
tctagtgaaa tttccggctt tattaaactt atttttagga attttatttt cattttcatc 4500
tttacaggat ttgattatat ctttaaatat gttttatcaa atattatctt tttctaaatt 4560
tatatatatt tttattatat ttattattat atatatttta tttttaagtt tctttctaac 4620
agctattaaa aagaaactta aaaataaaaa cacgtactct aaaccaataa ataaaactat 4680
ttttattatt gctgccttga ttggaatagt ttttagtaaa attaatttca atattccaca 4740
atattatatt ataagctagc acgcctcgag tatattgata aaaataataa tagtgggtat 4800
aattaagttg ttaggaggtt agttagactt aaaatccata tataacagtt ttagagctag 4860
aaatagcaag ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg 4920
tgcttttttt gaagcttg 4938
<210> 9
<211> 4938
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 9
tcgactctag aggatccccg ggtaccgagc tcgaattcgt aatcatggtc atagctgttt 60
cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 120
tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 180
cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 240
gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 300
tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 360
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 420
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 480
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 540
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 600
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 660
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 720
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 780
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 840
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag aacagtattt 900
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 960
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1020
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1080
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1140
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1200
tctgacagtt accaaagcta gcttaatact agtatatact taatgtgata agtgtctgac 1260
agctgaccgg tctaaagagg tccctagcgc ctacggggaa tttgtatcga taaggggtac 1320
aaattcccac taagcgctcg gccggggatc gatccccggg tacgtacccg gcagtttttc 1380
tttttcggca agtgttcaag aagttattaa gtcgggagtg cagtcgaagt gggcaagttg 1440
aaaaattcac aaaaatgtgg tataatatct ttgttcatta gagcgataaa cttgaatttg 1500
agagggaact tagatggtat ttgaaaaaat tgataaaaat agttggaaca gaaaagagta 1560
ttttgaccac tactttgcaa gtgtaccttg tacctacagc atgaccgtta aagtggatat 1620
cacacaaata aaggaaaagg gaatgaaact atatcctgca atgctttatt atattgcaat 1680
gattgtaaac cgccattcag agtttaggac ggcaatcaat caagatggtg aattggggat 1740
atatgatgag atgataccaa gctatacaat atttcacaat gatactgaaa cattttccag 1800
cctttggact gagtgtaagt ctgactttaa atcattttta gcagattatg aaagtgatac 1860
gcaacggtat ggaaacaatc atagaatgga aggaaagcca aatgctccgg aaaacatttt 1920
taatgtatct atgataccgt ggtcaacctt cgatggcttt aatctgaatt tgcagaaagg 1980
atatgattat ttgattccta tttttactat ggggaaatat tataaagaag ataacaaaat 2040
tatacttcct ttggcaattc aagttcatca cgcagtatgt gacggatttc acatttgccg 2100
ttttgtaaac gaattgcagg aattgataaa tagttaactt caggtttgtc tgtaactaaa 2160
aactagtatt taacctagga tcaaaaaaat ttccaataat cccactctaa gccacaaaca 2220
cgccctataa aatcccgctt taatcccact ttgagacaca tgtaatatta ctttacgccc 2280
tagtatagtg ataatttttt acattcaatg ccacgcaaaa aaataaaggg gcactataat 2340
aaaagttcct tcggaactaa ctaaagtaaa aaattatctt tacaacctcc ccaaaaaaaa 2400
gaacaggtac aaagtaccct ataatacaag cgtaaaaaaa atgagggtaa aaataaaaaa 2460
ataaaaaaat aaaaaaataa aaaaataaaa aaataaaaaa ataaaaaaat ataaaaataa 2520
aaaaatataa aaataaaaaa atataaaaat aaaaaaataa aaaaatataa aaataaaaaa 2580
ataaaaaaat ataaaaatat tttttattta aagtttgaaa aaaatttttt tatattatat 2640
aatctttgaa gaaaagaata taaaaaatga gcctttataa aagcccattt tttttcatat 2700
acgtaatatg acgttctaat gtttttattg gtacttctaa cattagagta atttctttat 2760
ttttaaagcc tttttcttta agggctttta ttttttttct taatacattt aattcctctt 2820
tttttgttgc ttttccttta gcttttaatt gctcttgata atttttttta cctctaatat 2880
tttctcttct cttatattcc tttttagaaa ttattattgt catatatttt tgttcttctt 2940
ctgtaatttc taataactct ataagagttt cattcttata cttatattgc ttatttttat 3000
ctaaataaca tctttcagca cttctagttg ctcttataac ttctctttca cttaaatgtt 3060
gtctaaacat actattaagt tctaaaacat catttaatgc cttctcaatg tcttctgtaa 3120
agctacaaag ataatatcta tataaaaata atataagctc tctgtgtcct tttaaatcat 3180
attctcttag ttcacaaagt tttattatgt cttgtattct tccataatat aaacttcttt 3240
ctctataaat ataatttatt ttgcttggtc tacccttttt cctttcatat ggttttaatt 3300
caggtaaaaa tccattttgt atttctctta agtcataaat atattcgtac tcatctaata 3360
tattgactac tgtttttgat ttagagttta tacttcctgg aactcttaat attctcgttg 3420
catctaaggc ttgtctatct gctccaaagt attttaattg attatataaa tattcttgaa 3480
ccgctttcca taatggtaat gctttactag gtactgcatt tattatccat attaaataca 3540
ttcctcttcc actatctatt acatagtttg gtataggaat actttgatta aaataattct 3600
tttctaagtc cattaatacc tggtctttag ttttgccagt tttataataa tccaagtcta 3660
taaacagtgt atttaactct tttatatttt ctaatcgcct acacggctta taaaaggtat 3720
ttagagttat atagatattt tcatcactca tatctaaatc ttttaattca gcgtatttat 3780
agtgccattg gctatatcct tttttatcta taacgctcct ggttatccac cctttacttc 3840
tactatgaat attatctata tagttctttt tattcagctt taatgcgttt ctcacttatt 3900
cacctcccct tctgtaaaac taagaaaatt atatcatatt ttcaataatt attaactatt 3960
cttaaactct taataaaaaa tagagtaagt ccccaattga aacttaatct attttttatg 4020
ttttaattta ttatttttat taaaatattt taaactaaat taaatgattc tttttaattt 4080
tttactattt cattccataa tatattacta taattattta caaataatat ttcttcattt 4140
gtaatattta gatgatttac taattttagt ttttatatat taaataatta atgtataatt 4200
tatataaaaa atcaaaggag cttataaatt atgattattt ccaaagatac taaagattta 4260
atttttttca attttaacaa tactttttgt aatattatgt ttaaatttaa ttgtattttt 4320
ttcatataat aaagccgttg aagtaaacca atccattttc cttatgatgt tattattaaa 4380
tttaagtttt ataataatat ctttattata tttattgttt ttaaaaaaac tagtgaaatt 4440
tctagtgaaa tttccggctt tattaaactt atttttagga attttatttt cattttcatc 4500
tttacaggat ttgattatat ctttaaatat gttttatcaa atattatctt tttctaaatt 4560
tatatatatt tttattatat ttattattat atatatttta tttttaagtt tctttctaac 4620
agctattaaa aagaaactta aaaataaaaa cacgtactct aaaccaataa ataaaactat 4680
ttttattatt gctgccttga ttggaatagt ttttagtaaa attaatttca atattccaca 4740
atattatatt ataagctagc acgcctcgag tatattgata aaaataataa tagtgggtat 4800
aattaagttg ttaggaggtt agttagaact aaatgtaaaa tgttagcgtt ttagagctag 4860
aaatagcaag ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg 4920
tgcttttttt gaagcttg 4938
<210> 10
<211> 5911
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 10
aattcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 60
cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 120
ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 180
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 240
gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 300
cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 360
tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 420
cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 480
aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 540
cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 600
gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 660
ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 720
cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 780
aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 840
tacggctaca ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc 900
ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 960
tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 1020
ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 1080
agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 1140
atctaaagta tatatgagta aacttggtct gacagttacc aaagctagct taatactagt 1200
atatacttaa tgtgataagt gtctgacagc tgaccggtct aaagaggtcc ctagcgccta 1260
cggggaattt gtatcgataa ggggtacaaa ttcccactaa gcgctcggcc ggggatcgat 1320
ccccgggtac gtacccggca gtttttcttt ttcggcaagt gttcaagaag ttattaagtc 1380
gggagtgcag tcgaagtggg caagttgaaa aattcacaaa aatgtggtat aatatctttg 1440
ttcattagag cgataaactt gaatttgaga gggaacttag atggtatttg aaaaaattga 1500
taaaaatagt tggaacagaa aagagtattt tgaccactac tttgcaagtg taccttgtac 1560
ctacagcatg accgttaaag tggatatcac acaaataaag gaaaagggaa tgaaactata 1620
tcctgcaatg ctttattata ttgcaatgat tgtaaaccgc cattcagagt ttaggacggc 1680
aatcaatcaa gatggtgaat tggggatata tgatgagatg ataccaagct atacaatatt 1740
tcacaatgat actgaaacat tttccagcct ttggactgag tgtaagtctg actttaaatc 1800
atttttagca gattatgaaa gtgatacgca acggtatgga aacaatcata gaatggaagg 1860
aaagccaaat gctccggaaa acatttttaa tgtatctatg ataccgtggt caaccttcga 1920
tggctttaat ctgaatttgc agaaaggata tgattatttg attcctattt ttactatggg 1980
gaaatattat aaagaagata acaaaattat acttcctttg gcaattcaag ttcatcacgc 2040
agtatgtgac ggatttcaca tttgccgttt tgtaaacgaa ttgcaggaat tgataaatag 2100
ttaacttcag gtttgtctgt aactaaaaac tagtatttaa cctaggatca aaaaaatttc 2160
caataatccc actctaagcc acaaacacgc cctataaaat cccgctttaa tcccactttg 2220
agacacatgt aatattactt tacgccctag tatagtgata attttttaca ttcaatgcca 2280
cgcaaaaaaa taaaggggca ctataataaa agttccttcg gaactaacta aagtaaaaaa 2340
ttatctttac aacctcccca aaaaaaagaa caggtacaaa gtaccctata atacaagcgt 2400
aaaaaaaatg agggtaaaaa taaaaaaata aaaaaataaa aaaataaaaa aataaaaaaa 2460
taaaaaaata aaaaaatata aaaataaaaa aatataaaaa taaaaaaata taaaaataaa 2520
aaaataaaaa aatataaaaa taaaaaaata aaaaaatata aaaatatttt ttatttaaag 2580
tttgaaaaaa atttttttat attatataat ctttgaagaa aagaatataa aaaatgagcc 2640
tttataaaag cccatttttt ttcatatacg taatatgacg ttctaatgtt tttattggta 2700
cttctaacat tagagtaatt tctttatttt taaagccttt ttctttaagg gcttttattt 2760
tttttcttaa tacatttaat tcctcttttt ttgttgcttt tcctttagct tttaattgct 2820
cttgataatt ttttttacct ctaatatttt ctcttctctt atattccttt ttagaaatta 2880
ttattgtcat atatttttgt tcttcttctg taatttctaa taactctata agagtttcat 2940
tcttatactt atattgctta tttttatcta aataacatct ttcagcactt ctagttgctc 3000
ttataacttc tctttcactt aaatgttgtc taaacatact attaagttct aaaacatcat 3060
ttaatgcctt ctcaatgtct tctgtaaagc tacaaagata atatctatat aaaaataata 3120
taagctctct gtgtcctttt aaatcatatt ctcttagttc acaaagtttt attatgtctt 3180
gtattcttcc ataatataaa cttctttctc tataaatata atttattttg cttggtctac 3240
cctttttcct ttcatatggt tttaattcag gtaaaaatcc attttgtatt tctcttaagt 3300
cataaatata ttcgtactca tctaatatat tgactactgt ttttgattta gagtttatac 3360
ttcctggaac tcttaatatt ctcgttgcat ctaaggcttg tctatctgct ccaaagtatt 3420
ttaattgatt atataaatat tcttgaaccg ctttccataa tggtaatgct ttactaggta 3480
ctgcatttat tatccatatt aaatacattc ctcttccact atctattaca tagtttggta 3540
taggaatact ttgattaaaa taattctttt ctaagtccat taatacctgg tctttagttt 3600
tgccagtttt ataataatcc aagtctataa acagtgtatt taactctttt atattttcta 3660
atcgcctaca cggcttataa aaggtattta gagttatata gatattttca tcactcatat 3720
ctaaatcttt taattcagcg tatttatagt gccattggct atatcctttt ttatctataa 3780
cgctcctggt tatccaccct ttacttctac tatgaatatt atctatatag ttctttttat 3840
tcagctttaa tgcgtttctc acttattcac ctccccttct gtaaaactaa gaaaattata 3900
tcatattttc aataattatt aactattctt aaactcttaa taaaaaatag agtaagtccc 3960
caattgaaac ttaatctatt ttttatgttt taatttatta tttttattaa aatattttaa 4020
actaaattaa atgattcttt ttaatttttt actatttcat tccataatat attactataa 4080
ttatttacaa ataatatttc ttcatttgta atatttagat gatttactaa ttttagtttt 4140
tatatattaa ataattaatg tataatttat ataaaaaatc aaaggagctt ataaattatg 4200
attatttcca aagatactaa agatttaatt tttttcaatt ttaacaatac tttttgtaat 4260
attatgttta aatttaattg tatttttttc atataataaa gccgttgaag taaaccaatc 4320
cattttcctt atgatgttat tattaaattt aagttttata ataatatctt tattatattt 4380
attgttttta aaaaaactag tgaaatttct agtgaaattt ccggctttat taaacttatt 4440
tttaggaatt ttattttcat tttcatcttt acaggatttg attatatctt taaatatgtt 4500
ttatcaaata ttatcttttt ctaaatttat atatattttt attatattta ttattatata 4560
tattttattt ttaagtttct ttctaacagc tattaaaaag aaacttaaaa ataaaaacac 4620
gtactctaaa ccaataaata aaactatttt tattattgct gccttgattg gaatagtttt 4680
tagtaaaatt aatttcaata ttccacaata ttatattata agctagcacg cctcgagtat 4740
attgataaaa ataataatag tgggtataat taagttgtta ggaggttagt tagaactaaa 4800
tgtaaaatgt tagcgtttta gagctagaaa tagcaagtta aaataaggct agtccgttat 4860
caacttgaaa aagtggcacc gagtcggtgc tttttttgaa gcttgtcgac atgaagatag 4920
caataggtag tgatcatgca ggattttcat tgaaaaagga agttataaaa catttagaga 4980
gtaaaaatat tgaggttaaa gattttggca ctctaactga tgaatcatgt gattatccag 5040
attatgcatt aaaagtagca gaggaagttg ctcaaaaaaa ctttgagttt ggaatactca 5100
tttgtggaac aggtatagga ataagcatat cagcaaataa ggtgccagga ataagagcag 5160
ctgtatgtac agatacattc tgtgctcatg catcaagaga acataacaat gcaaatatac 5220
ttgcaatggg agaaagagtt gtaggacctg gattagcaat tgatatagta gatacatttt 5280
taaattcaaa atttcaggga gataggcatc aaagaagaat agacaagatt acacaaatag 5340
aaaaaaaata caatggagga atgaaataat gagtaaagtt acacaaatat cacatccact 5400
tatattacac aaacttcctc aagatataga ggaaagagac ataatagtaa ctgatccaat 5460
gcttgcaact ggtgggtcag caatagatgc aataacactt cttaagaaaa gaggagcaaa 5520
atacataaga cttatgtgtc ttataggagc accagaaggt atagcagcag tacaagaagc 5580
acatccagat gtagatatat acctcgcatc aatagatgaa aagttagatg aaaatggata 5640
tatagttcct ggtcttggag atgctggaga tagattattc ggtacaaaat aaattgcata 5700
aataaaaagg gctgaaaaat aaatttcagt ccttttattt atattttaac tttattccat 5760
gccactgcct cttctgataa taatagaaat tattaaagtt aatacagatg taagggcaac 5820
tgttccacct ggagcacaat ttaagtaata agagctaaat aatccaacga gtacatctat 5880
aaagctaaat aatattgata atataagcgt g 5911
<210> 11
<211> 6217
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 11
aattcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 60
cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 120
ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 180
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 240
gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 300
cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 360
tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 420
cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 480
aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 540
cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 600
gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 660
ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 720
cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 780
aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 840
tacggctaca ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc 900
ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 960
tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 1020
ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 1080
agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 1140
atctaaagta tatatgagta aacttggtct gacagttacc aaagctagct taatactagt 1200
atatacttaa tgtgataagt gtctgacagc tgaccggtct aaagaggtcc ctagcgccta 1260
cggggaattt gtatcgataa ggggtacaaa ttcccactaa gcgctcggcc ggggatcgat 1320
ccccgggtac gtacccggca gtttttcttt ttcggcaagt gttcaagaag ttattaagtc 1380
gggagtgcag tcgaagtggg caagttgaaa aattcacaaa aatgtggtat aatatctttg 1440
ttcattagag cgataaactt gaatttgaga gggaacttag atggtatttg aaaaaattga 1500
taaaaatagt tggaacagaa aagagtattt tgaccactac tttgcaagtg taccttgtac 1560
ctacagcatg accgttaaag tggatatcac acaaataaag gaaaagggaa tgaaactata 1620
tcctgcaatg ctttattata ttgcaatgat tgtaaaccgc cattcagagt ttaggacggc 1680
aatcaatcaa gatggtgaat tggggatata tgatgagatg ataccaagct atacaatatt 1740
tcacaatgat actgaaacat tttccagcct ttggactgag tgtaagtctg actttaaatc 1800
atttttagca gattatgaaa gtgatacgca acggtatgga aacaatcata gaatggaagg 1860
aaagccaaat gctccggaaa acatttttaa tgtatctatg ataccgtggt caaccttcga 1920
tggctttaat ctgaatttgc agaaaggata tgattatttg attcctattt ttactatggg 1980
gaaatattat aaagaagata acaaaattat acttcctttg gcaattcaag ttcatcacgc 2040
agtatgtgac ggatttcaca tttgccgttt tgtaaacgaa ttgcaggaat tgataaatag 2100
ttaacttcag gtttgtctgt aactaaaaac tagtatttaa cctaggatca aaaaaatttc 2160
caataatccc actctaagcc acaaacacgc cctataaaat cccgctttaa tcccactttg 2220
agacacatgt aatattactt tacgccctag tatagtgata attttttaca ttcaatgcca 2280
cgcaaaaaaa taaaggggca ctataataaa agttccttcg gaactaacta aagtaaaaaa 2340
ttatctttac aacctcccca aaaaaaagaa caggtacaaa gtaccctata atacaagcgt 2400
aaaaaaaatg agggtaaaaa taaaaaaata aaaaaataaa aaaataaaaa aataaaaaaa 2460
taaaaaaata aaaaaatata aaaataaaaa aatataaaaa taaaaaaata taaaaataaa 2520
aaaataaaaa aatataaaaa taaaaaaata aaaaaatata aaaatatttt ttatttaaag 2580
tttgaaaaaa atttttttat attatataat ctttgaagaa aagaatataa aaaatgagcc 2640
tttataaaag cccatttttt ttcatatacg taatatgacg ttctaatgtt tttattggta 2700
cttctaacat tagagtaatt tctttatttt taaagccttt ttctttaagg gcttttattt 2760
tttttcttaa tacatttaat tcctcttttt ttgttgcttt tcctttagct tttaattgct 2820
cttgataatt ttttttacct ctaatatttt ctcttctctt atattccttt ttagaaatta 2880
ttattgtcat atatttttgt tcttcttctg taatttctaa taactctata agagtttcat 2940
tcttatactt atattgctta tttttatcta aataacatct ttcagcactt ctagttgctc 3000
ttataacttc tctttcactt aaatgttgtc taaacatact attaagttct aaaacatcat 3060
ttaatgcctt ctcaatgtct tctgtaaagc tacaaagata atatctatat aaaaataata 3120
taagctctct gtgtcctttt aaatcatatt ctcttagttc acaaagtttt attatgtctt 3180
gtattcttcc ataatataaa cttctttctc tataaatata atttattttg cttggtctac 3240
cctttttcct ttcatatggt tttaattcag gtaaaaatcc attttgtatt tctcttaagt 3300
cataaatata ttcgtactca tctaatatat tgactactgt ttttgattta gagtttatac 3360
ttcctggaac tcttaatatt ctcgttgcat ctaaggcttg tctatctgct ccaaagtatt 3420
ttaattgatt atataaatat tcttgaaccg ctttccataa tggtaatgct ttactaggta 3480
ctgcatttat tatccatatt aaatacattc ctcttccact atctattaca tagtttggta 3540
taggaatact ttgattaaaa taattctttt ctaagtccat taatacctgg tctttagttt 3600
tgccagtttt ataataatcc aagtctataa acagtgtatt taactctttt atattttcta 3660
atcgcctaca cggcttataa aaggtattta gagttatata gatattttca tcactcatat 3720
ctaaatcttt taattcagcg tatttatagt gccattggct atatcctttt ttatctataa 3780
cgctcctggt tatccaccct ttacttctac tatgaatatt atctatatag ttctttttat 3840
tcagctttaa tgcgtttctc acttattcac ctccccttct gtaaaactaa gaaaattata 3900
tcatattttc aataattatt aactattctt aaactcttaa taaaaaatag agtaagtccc 3960
caattgaaac ttaatctatt ttttatgttt taatttatta tttttattaa aatattttaa 4020
actaaattaa atgattcttt ttaatttttt actatttcat tccataatat attactataa 4080
ttatttacaa ataatatttc ttcatttgta atatttagat gatttactaa ttttagtttt 4140
tatatattaa ataattaatg tataatttat ataaaaaatc aaaggagctt ataaattatg 4200
attatttcca aagatactaa agatttaatt tttttcaatt ttaacaatac tttttgtaat 4260
attatgttta aatttaattg tatttttttc atataataaa gccgttgaag taaaccaatc 4320
cattttcctt atgatgttat tattaaattt aagttttata ataatatctt tattatattt 4380
attgttttta aaaaaactag tgaaatttct agtgaaattt ccggctttat taaacttatt 4440
tttaggaatt ttattttcat tttcatcttt acaggatttg attatatctt taaatatgtt 4500
ttatcaaata ttatcttttt ctaaatttat atatattttt attatattta ttattatata 4560
tattttattt ttaagtttct ttctaacagc tattaaaaag aaacttaaaa ataaaaacac 4620
gtactctaaa ccaataaata aaactatttt tattattgct gccttgattg gaatagtttt 4680
tagtaaaatt aatttcaata ttccacaata ttatattata agctagcacg cctcgagtat 4740
attgataaaa ataataatag tgggtataat taagttgtta ggaggttagt tagaactaaa 4800
tgtaaaatgt tagcgtttta gagctagaaa tagcaagtta aaataaggct agtccgttat 4860
caacttgaaa aagtggcacc gagtcggtgc tttttttgaa gcttgtcgac atgaagatag 4920
caataggtag tgatcatgca ggattttcat tgaaaaagga agttataaaa catttagaga 4980
gtaaaaatat tgaggttaaa gattttggca ctctaactga tgaatcatgt gattatccag 5040
attatgcatt aaaagtagca gaggaagttg ctcaaaaaaa ctttgagttt ggaatactca 5100
tttgtggaac aggtatagga ataagcatat cagcaaataa ggtgccagga ataagagcag 5160
ctgtatgtac agatacattc tgtgctcatg catcaagaga acataacaat gcaaatatac 5220
ttgcaatggg agaaagagtt gtaggacctg gattagcaat tgatatagta gatacatttt 5280
taaattcaaa atttcaggga gataggcatc aaagaagaat agacaagatt acacaaatag 5340
aaaaaaaata caatggagga atgaaataat gagtaaagtt acacaaatat cacatccact 5400
tatattacac aaattagcat ttatgagaga taaaaaaaca ggatctaaag attttagaga 5460
gatggtagaa gaagtagcaa tgctaatggc atatgaagta acaagagaaa tgcagcttga 5520
aactgttgaa atagaaactc ctatatgtat aactaaatgt aagatgtaag caggaaaaaa 5580
ggtagctata gttcctatac ttagagcagg acttggaatg gtaaatggag tattaaaatt 5640
aatacctgct gctaaggttg gacatatagg attatataga gatgaaaaga cattaaaacc 5700
tgtagaatac ttctgtaaac ttcctcaaga tatagaggaa agagacataa tagtaactga 5760
tccaatgctt gcaactggtg ggtcagcaat agatgcaata acacttctta agaaaagagg 5820
agcaaaatac ataagactta tgtgtcttat aggagcacca gaaggtatag cagcagtaca 5880
agaagcacat ccagatgtag atatatacct cgcatcaata gatgaaaagt tagatgaaaa 5940
tggatatata gttcctggtc ttggagatgc tggagataga ttattcggta caaaataaat 6000
tgcataaata aaaagggctg aaaaataaat ttcagtcctt ttatttatat tttaacttta 6060
ttccatgcca ctgcctcttc tgataataat agaaattatt aaagttaata cagatgtaag 6120
ggcaactgtt ccacctggag cacaatttaa gtaataagag ctaaataatc caacgagtac 6180
atctataaag ctaaataata ttgataatat aagcgtg 6217
<210> 12
<211> 1002
<212> DNA
<213> 人工序列
<220>
<223> upp_del
<400> 12
catgaagata gcaataggta gtgatcatgc aggattttca ttgaaaaagg aagttataaa 60
acatttagag agtaaaaata ttgaggttaa agattttggc actctaactg atgaatcatg 120
tgattatcca gattatgcat taaaagtagc agaggaagtt gctcaaaaaa actttgagtt 180
tggaatactc atttgtggaa caggtatagg aataagcata tcagcaaata aggtgccagg 240
aataagagca gctgtatgta cagatacatt ctgtgctcat gcatcaagag aacataacaa 300
tgcaaatata cttgcaatgg gagaaagagt tgtaggacct ggattagcaa ttgatatagt 360
agatacattt ttaaattcaa aatttcaggg agataggcat caaagaagaa tagacaagat 420
tacacaaata gaaaaaaaat acaatggagg aatgaaataa tgagtaaagt tacacaaata 480
tcacatccac ttatattaca caaacttcct caagatatag aggaaagaga cataatagta 540
actgatccaa tgcttgcaac tggtgggtca gcaatagatg caataacact tcttaagaaa 600
agaggagcaa aatacataag acttatgtgt cttataggag caccagaagg tatagcagca 660
gtacaagaag cacatccaga tgtagatata tacctcgcat caatagatga aaagttagat 720
gaaaatggat atatagttcc tggtcttgga gatgctggag atagattatt cggtacaaaa 780
taaattgcat aaataaaaag ggctgaaaaa taaatttcag tccttttatt tatattttaa 840
ctttattcca tgccactgcc tcttctgata ataatagaaa ttattaaagt taatacagat 900
gtaagggcaa ctgttccacc tggagcacaa tttaagtaat aagagctaaa taatccaacg 960
agtacatcta taaagctaaa taatattgat aatataagcg tg 1002
<210> 13
<211> 1306
<212> DNA
<213> 人工序列
<220>
<223> upp_stop
<400> 13
atgaagatag caataggtag tgatcatgca ggattttcat tgaaaaagga agttataaaa 60
catttagaga gtaaaaatat tgaggttaaa gattttggca ctctaactga tgaatcatgt 120
gattatccag attatgcatt aaaagtagca gaggaagttg ctcaaaaaaa ctttgagttt 180
ggaatactca tttgtggaac aggtatagga ataagcatat cagcaaataa ggtgccagga 240
ataagagcag ctgtatgtac agatacattc tgtgctcatg catcaagaga acataacaat 300
gcaaatatac ttgcaatggg agaaagagtt gtaggacctg gattagcaat tgatatagta 360
gatacatttt taaattcaaa atttcaggga gataggcatc aaagaagaat agacaagatt 420
acacaaatag aaaaaaaata caatggagga atgaaataat gagtaaagtt acacaaatat 480
cacatccact tatattacac aaattagcat ttatgagaga taaaaaaaca ggatctaaag 540
attttagaga gatggtagaa gaagtagcaa tgctaatggc atatgaagta acaagagaaa 600
tgcagcttga aactgttgaa atagaaactc ctatatgtat aactaaatgt aagatgtaag 660
caggaaaaaa ggtagctata gttcctatac ttagagcagg acttggaatg gtaaatggag 720
tattaaaatt aatacctgct gctaaggttg gacatatagg attatataga gatgaaaaga 780
cattaaaacc tgtagaatac ttctgtaaac ttcctcaaga tatagaggaa agagacataa 840
tagtaactga tccaatgctt gcaactggtg ggtcagcaat agatgcaata acacttctta 900
agaaaagagg agcaaaatac ataagactta tgtgtcttat aggagcacca gaaggtatag 960
cagcagtaca agaagcacat ccagatgtag atatatacct cgcatcaata gatgaaaagt 1020
tagatgaaaa tggatatata gttcctggtc ttggagatgc tggagataga ttattcggta 1080
caaaataaat tgcataaata aaaagggctg aaaaataaat ttcagtcctt ttatttatat 1140
tttaacttta ttccatgcca ctgcctcttc tgataataat agaaattatt aaagttaata 1200
cagatgtaag ggcaactgtt ccacctggag cacaatttaa gtaataagag ctaaataatc 1260
caacgagtac atctataaag ctaaataata ttgataatat aagcgt 1306
<210> 14
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 14
gtgggcaagt tgaaaaattc ac 22
<210> 15
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 15
ttaactattt atcaattcct gcaattcg 28
<210> 16
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 16
cttgagactt tgccgtgagg g 21
<210> 17
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 17
tagttggaat gggcgctagt 20
<210> 18
<211> 29
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 18
atgaagatag caataggtag tgatcatgc 29
<210> 19
<211> 34
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 19
acgcttatat tatcaatatt atttagcttt atag 34
<210> 20
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 20
tgtccaacct tagcagcagg 20
<210> 21
<211> 26
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 21
gtagaagaag tagcaatgct aatggc 26
<210> 22
<211> 62
<212> DNA
<213> 丙酮丁醇梭状芽胞杆菌(Clostridium acetobutylicum)
<400> 22
aaactcctat atgtataact aaatgtaaaa tgttagcagg aaaaaaggta gctatagttc 60
ct 62
<210> 23
<211> 62
<212> DNA
<213> 人工序列
<220>
<223> 片段upp_stop图7
<400> 23
aaactcctat atgtataact aaatgtaaga tgtaagcagg aaaaaaggta gctatagttc 60
ct 62
<210> 24
<211> 86
<212> DNA
<213> 丙酮丁醇梭状芽胞杆菌(Clostridium acetobutylicum)
<400> 24
aaactgttga aatagaaact cctatatgta taactaaatg taaaatgtta gcaggaaaaa 60
aggtagctat agttcctata cttaga 86
<210> 25
<211> 86
<212> DNA
<213> 人工序列
<220>
<223> 片段upp_stop图16
<400> 25
aaactgttga aatagaaact cctatatgta taactaaatg taagatgtaa gcaggaaaaa 60
aggtagctat agttcctata cttaga 86
<210> 26
<211> 10537
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 26
tcgactctag aggatcctac ttggttttat agagatttta aggaggaagc tatgaagata 60
ctttttgttt gtactggtaa tacatgtaga agctgtatgg cagaagctat gtttaatagt 120
atgtgtaata tagatggtat agaagctttt tcagcaggtg catctgctat tcatggaagc 180
aaaacttcct taaattcggc tgtggtagtt agggagaact taggcaaaga tatatcaaat 240
agaaaggcta tacagttgac tactttttta gttgatgaat gtgatttagt tttaacaatg 300
acatctttat taagtgacgt gcttagaagt caaatggaga agaactctaa taagatatat 360
tcattaagtg aatatacagg tgtagagggt gacattactg acccatatgg tagaagtgta 420
gagatctaca gacaaactta taaagacatt gaaaagaggc taaaaatact tattaaaaaa 480
ctaaatgaag atagaagtat ttaatcatat actcttatct tctttttttg cgattatgta 540
ttaagtggag gtgtttttat gaagatagca ataggtagtg atcatgcagg attttcattg 600
aaaaaggaag ttataaaaca tttagagagt aaaaatattg aggttaaaga ttttggcact 660
ctaactgatg aatcatgtga ttatccagat tatgcattaa aagtagcaga ggaagttgct 720
caaaaaaact ttgagtttgg aatactcatt tgtggaacag gtataggaat aagcatatca 780
gcaaataagg tgccaggaat aagagcagct gtatgtacag atacattctg tgctcatgca 840
tcaagagaac ataacaatgc aaatatactt gcaatgggag aaagagttgt aggacctgga 900
ttagcaattg atatagtaga tacattttta aattcaaaat ttcagggaga taggcatcaa 960
agaagaatag acaagattac acaaatagaa aaaaaataca atggaggaat gaaataaaaa 1020
aataagagtt accatttaag gtaactctta tttttattac ttaagataat catatataac 1080
ttcagctcta ggcaatatta tatctgcaag aatgtgagag ctagaaacaa tctcttttac 1140
tggcaaatca ttaagtggcg ccatagcgtg atcaaataac tgcagtcgag ttggtcctgt 1200
ccaagcttca tgtacggtaa catctgtgat tttcgcattt ataagctcac atattctagg 1260
gcttccatca taattgggta ttattttcaa catataatta gggcgacaaa tttgatcctt 1320
tgcttcatta gcatctaagg ctttatgttt gtaccccatt gtagctgtcg caactctaag 1380
ttttccatag tctaaagttc ctactaaagt atctgaatcc acaaaaagct ttggataccc 1440
gagcttttta ggatatgcac ttaattccct tcctactgca attgcaggct cattatctaa 1500
atacatcata tgaagataat ctcccttaac tccattaaag cttacgggaa tagcctgtcc 1560
gctttctgta taacaaccaa gtccactcgt atcatgcatt gccataattt caaacctgac 1620
taagggctca tcaatttcta aaggctctgg cacaacttta cgaagtgcat ccatatctgt 1680
acgatataca atgttaaaat actcacgatt atgaaattta tagggtcctc taggaaatgc 1740
aggcgaagtt aatggcgtgc taatttgttt aattacttca tcctttaaca taaaagtcac 1800
cttcctaaat ttaataatgt ttagcttttc taacatactt tatcttcaca ttataaatcg 1860
cctctagtcc tatttattta tatctagtta ttttttgtcg actgtttcat agtatttctt 1920
tctaaacagc catgggtcta agttcattgg atatgagtaa atctgcagca gttaaagacc 1980
ttatttcatc aatggttgtg tttttattaa tttcagtgag aagtaaacca tcattaataa 2040
cctcaattac tccaagttct gttacaatta gatttgcttg agactttgcc gtgaggggaa 2100
gtgtacattt ttttaaaatt ttaggttgac ctttatttgt atgtctcatt gcaattatta 2160
ctttcttagc tccatttact aaatccatag ctccacccat accagagagc atttttccag 2220
gaacaatcca attggctata ttaccctttt catctacctg gagagcccct aaaacagtaa 2280
catctacgtg accaccacgg attagtgaaa acgaaactga gctatcgaaa aatgtgccgt 2340
caggaagtac tgttgtatag tctcctcctg catttactac atctttatct gcctcattta 2400
ttttaggact agcgcccatt ccaactattc cgttttctga ttggaaagta attttgaaat 2460
tttttggtat ataatctgca accatggtag gaagacctac acctaagttt acaagttgac 2520
cattttttaa ttctcttgca actcttttgg ctattatttc tttcgctagg tttttatcat 2580
taatcatttt atgcaggctc ctttactata taatttataa gaactccggg ggtcattgct 2640
ttttcctttt ctagtttttc acagctaact aaattttcag cttcaactat tacggtttta 2700
gctgccattg ccatataggg attaaagttt ttagtagtac ctttatagaa ggtgtttccg 2760
gcctcatcta caatactacc tttaattaat gctacatcgg ctgtaagagg tagctctaac 2820
aaatattccg ttccatttat agatattttt ttctttcctt tttcaatcaa agttcctaaa 2880
cctgttttag ttagtacacc acctaagcca gatccgcctg cacgtattct ttccactaga 2940
gttccttggg gagagagctc tacttcaagt tcattattaa aaagtttttt gccagtatct 3000
gggttgctgc ctatatatga agcaataagc ttttttactt gattatttga tattaactta 3060
ccaatacctg tattaggata acatgtatca ttacttataa tcgttaaatt ctttatattt 3120
aaattaacta aaaaatcaat taatttggtt ggagtgccac agtttaaaaa acctccaatc 3180
ataattgtca tcccatcttt aaagaatgac cttaaatttt caaatctaat tattttagag 3240
ttcattttaa tccctccttt taaattctct agaaaaatta tctcgaggta taatcctcca 3300
tgatctatta tgttataata taactactgc tttaattaag tcttttggct tgtctttcat 3360
taataacagt gcttcttcta tgtgatcaaa tccatgatat acatgtgtaa ctaatttact 3420
tagatcaaca cgattatata ctaccatatc tcttaacatt tctgctctca aacgtccccc 3480
aggacaaaga cctcctttta tagtcttgtg agccattcca catccccatt ctacacgtgg 3540
tattagtaaa gcatctccac ttccatgata atttatatta gaaattattc ctcctggttt 3600
aaccatagat actgcttggg ataatgtttc agaaccaccg cctgccataa ttacgcggtc 3660
aacgcctttt ccattcgtta atttcataac ttgatcaact atatgaccat ttttataatt 3720
tagaatatct gttgctccat aaaattttgc agcctcaaca caaatcggcc tgctccccac 3780
tccaattatt ctacctgctc cacgtaattt agcacctgct attcccatta agccaacagc 3840
tccaatgcca attaccacaa cacttgaacc catttgaata tctgcaagtt ctgctccatg 3900
aaatccagta gtcatcatat ctgttatcat aacagcattt tctaatggca tgtctttagg 3960
tagaatcgca agattcatat ccgcatcatt tacatgaaaa tattcaccaa aaactccatc 4020
cttgaaattt gaaaatttcc atcctgcgag cataccgttt gagtgctgtt gaaaaccagc 4080
ttgaacttcc aaagatctcc aatctggagt tgtacaagga actataactc tgtcaccagg 4140
tttaaaatcc ttcacttcac ttcctacttc aacaacttca cctacagctt catgccctaa 4200
aatcatattc ttcctatctc caagagctcc ctcaaaaaca gtatgtatat ctgatgtaca 4260
cggagatact gctaatgggc gtacaatagc atcatatgaa cccgcaactg gcctttcttt 4320
ttcgatccat cctaacttat taatacctag cattgcaaaa cctttcataa aatatgttcc 4380
tccttaaaaa tattccttta atagtctaag ggcccccata gtttatccct aatttatacg 4440
ttttctctaa caacttaatt atacccacta ttattatttt tatcaatata ttttgttaaa 4500
aaaaacacaa agtttaatat tttttaacaa aaaaattaaa cttatgcata tgctaacccc 4560
cttaaaatca tgttttagca taaatacata agtttatatt ttaactatat taatgtaata 4620
aaaaatactt gattgcataa ataaaaaggg ctgaaaaata aatttcagtc cttttattta 4680
tattttaact ttattccatg ccactgcctc ttctgataat aatagaaatt attaaagtta 4740
atacagatgt aagggcaact gttccacctg gagcacaatt taagtaataa gagctaaata 4800
atccaacgag tacatctata aagctaaata atattgataa tataagcgtt gacttaaatc 4860
cctttttaaa ctgcatagca gcagctacag gtacagctaa tagtgaagag acaactaata 4920
tccctgttat acgtattgaa acggatataa cgaaaccgac aacgatagaa aatatgtaat 4980
ttaaaaattt tgtgtttatg ccagataccc ttgcaccttg ctcatcaaat gttgtatata 5040
aaaagttgtt ataaaatata gctataagaa atatagatac tacacttatt attagtataa 5100
gtacgagatc acttttacca acagttaaaa tacttccaaa aagaaaagat tcaatgtttc 5160
cgcttgcacg tccacttgtt gttaaggtta tagctattcc aacagataga gttaatataa 5220
ttggaagtat taagtccgaa tactgcttaa aggagtttct aagtaattct attaagacag 5280
cacaaatagt tacaaaaata aaggttgtta tcataggatt tacgccaata aagataccta 5340
tagctacacc ggcaaaagaa gcatgtgata atgcatcacc taccattgag tgtcttttta 5400
aaactaagaa aatacctata agtggacaca aaattgcaat cattaccgag gcaagtaaag 5460
cgttttgcat aaaacttaga cgaaatagtt caatcattat ttaacaacct ccatattgct 5520
ttttaaaaaa tcgttaatgt taagcaaaga gccactttct ccatggaaat tgaaaatatg 5580
agttgaatta gaataagcag cctttaaatt atgttctaca catattattg tgaattcgta 5640
atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat 5700
acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt 5760
aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta 5820
atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc 5880
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 5940
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 6000
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 6060
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 6120
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 6180
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 6240
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 6300
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 6360
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 6420
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 6480
cactagaaga acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 6540
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 6600
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 6660
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 6720
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 6780
tatatatgag taaacttggt ctgacagtta ccaaagctag cttaatacta gtatatactt 6840
aatgtgataa gtgtctgaca gctgaccggt ctaaagaggt ccctagcgcc tacggggaat 6900
ttgtatcgat aaggggtaca aattcccact aagcgctcgg ccggggatcg atccccgggt 6960
acgtacccgg cagtttttct ttttcggcaa gtgttcaaga agttattaag tcgggagtgc 7020
agtcgaagtg ggcaagttga aaaattcaca aaaatgtggt ataatatctt tgttcattag 7080
agcgataaac ttgaatttga gagggaactt agatggtatt tgaaaaaatt gataaaaata 7140
gttggaacag aaaagagtat tttgaccact actttgcaag tgtaccttgt acctacagca 7200
tgaccgttaa agtggatatc acacaaataa aggaaaaggg aatgaaacta tatcctgcaa 7260
tgctttatta tattgcaatg attgtaaacc gccattcaga gtttaggacg gcaatcaatc 7320
aagatggtga attggggata tatgatgaga tgataccaag ctatacaata tttcacaatg 7380
atactgaaac attttccagc ctttggactg agtgtaagtc tgactttaaa tcatttttag 7440
cagattatga aagtgatacg caacggtatg gaaacaatca tagaatggaa ggaaagccaa 7500
atgctccgga aaacattttt aatgtatcta tgataccgtg gtcaaccttc gatggcttta 7560
atctgaattt gcagaaagga tatgattatt tgattcctat ttttactatg gggaaatatt 7620
ataaagaaga taacaaaatt atacttcctt tggcaattca agttcatcac gcagtatgtg 7680
acggatttca catttgccgt tttgtaaacg aattgcagga attgataaat agttaacttc 7740
aggtttgtct gtaactaaaa actagtattt aacctaggat caaaaaaatt tccaataatc 7800
ccactctaag ccacaaacac gccctataaa atcccgcttt aatcccactt tgagacacat 7860
gtaatattac tttacgccct agtatagtga taatttttta cattcaatgc cacgcaaaaa 7920
aataaagggg cactataata aaagttcctt cggaactaac taaagtaaaa aattatcttt 7980
acaacctccc caaaaaaaag aacaggtaca aagtacccta taatacaagc gtaaaaaaaa 8040
tgagggtaaa aataaaaaaa taaaaaaata aaaaaataaa aaaataaaaa aataaaaaaa 8100
taaaaaaata taaaaataaa aaaatataaa aataaaaaaa tataaaaata aaaaaataaa 8160
aaaatataaa aataaaaaaa taaaaaaata taaaaatatt ttttatttaa agtttgaaaa 8220
aaattttttt atattatata atctttgaag aaaagaatat aaaaaatgag cctttataaa 8280
agcccatttt ttttcatata cgtaatatga cgttctaatg tttttattgg tacttctaac 8340
attagagtaa tttctttatt tttaaagcct ttttctttaa gggcttttat tttttttctt 8400
aatacattta attcctcttt ttttgttgct tttcctttag cttttaattg ctcttgataa 8460
ttttttttac ctctaatatt ttctcttctc ttatattcct ttttagaaat tattattgtc 8520
atatattttt gttcttcttc tgtaatttct aataactcta taagagtttc attcttatac 8580
ttatattgct tatttttatc taaataacat ctttcagcac ttctagttgc tcttataact 8640
tctctttcac ttaaatgttg tctaaacata ctattaagtt ctaaaacatc atttaatgcc 8700
ttctcaatgt cttctgtaaa gctacaaaga taatatctat ataaaaataa tataagctct 8760
ctgtgtcctt ttaaatcata ttctcttagt tcacaaagtt ttattatgtc ttgtattctt 8820
ccataatata aacttctttc tctataaata taatttattt tgcttggtct accctttttc 8880
ctttcatatg gttttaattc aggtaaaaat ccattttgta tttctcttaa gtcataaata 8940
tattcgtact catctaatat attgactact gtttttgatt tagagtttat acttcctgga 9000
actcttaata ttctcgttgc atctaaggct tgtctatctg ctccaaagta ttttaattga 9060
ttatataaat attcttgaac cgctttccat aatggtaatg ctttactagg tactgcattt 9120
attatccata ttaaatacat tcctcttcca ctatctatta catagtttgg tataggaata 9180
ctttgattaa aataattctt ttctaagtcc attaatacct ggtctttagt tttgccagtt 9240
ttataataat ccaagtctat aaacagtgta tttaactctt ttatattttc taatcgccta 9300
cacggcttat aaaaggtatt tagagttata tagatatttt catcactcat atctaaatct 9360
tttaattcag cgtatttata gtgccattgg ctatatcctt ttttatctat aacgctcctg 9420
gttatccacc ctttacttct actatgaata ttatctatat agttcttttt attcagcttt 9480
aatgcgtttc tcacttattc acctcccctt ctgtaaaact aagaaaatta tatcatattt 9540
tcaataatta ttaactattc ttaaactctt aataaaaaat agagtaagtc cccaattgaa 9600
acttaatcta ttttttatgt tttaatttat tatttttatt aaaatatttt aaactaaatt 9660
aaatgattct ttttaatttt ttactatttc attccataat atattactat aattatttac 9720
aaataatatt tcttcatttg taatatttag atgatttact aattttagtt tttatatatt 9780
aaataattaa tgtataattt atataaaaaa tcaaaggagc ttataaatta tgattatttc 9840
caaagatact aaagatttaa tttttttcaa ttttaacaat actttttgta atattatgtt 9900
taaatttaat tgtatttttt tcatataata aagccgttga agtaaaccaa tccattttcc 9960
ttatgatgtt attattaaat ttaagtttta taataatatc tttattatat ttattgtttt 10020
taaaaaaact agtgaaattt ctagtgaaat ttccggcttt attaaactta tttttaggaa 10080
ttttattttc attttcatct ttacaggatt tgattatatc tttaaatatg ttttatcaaa 10140
tattatcttt ttctaaattt atatatattt ttattatatt tattattata tatattttat 10200
ttttaagttt ctttctaaca gctattaaaa agaaacttaa aaataaaaac acgtactcta 10260
aaccaataaa taaaactatt tttattattg ctgccttgat tggaatagtt tttagtaaaa 10320
ttaatttcaa tattccacaa tattatatta taagctagca cgcctcgagt atattgataa 10380
aaataataat agtgggtata attaagttgt taggaggtta gttagaacta aatgtaaaat 10440
gttagcgttt tagagctaga aatagcaagt taaaataagg ctagtccgtt atcaacttga 10500
aaaagtggca ccgagtcggt gctttttttg aagcttg 10537
<210> 27
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 27
ctttttaaaa aagttaaata aggaagg 27
<210> 28
<211> 28
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 28
gtttaactta agttacagaa aagctagg 28
<210> 29
<211> 12849
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 29
ctagtagtgg atttaatccc aaatgagcca acagaaccag aaccagaaac agaatcagaa 60
caagtaacat tggatttaga aatggaagaa gaaaaaagca atgacttcgt gtgaataatg 120
cacgaaatcg ttgcttattt ttttttaaaa gcggtatact agatataacg aaacaacgaa 180
ctgaatagaa acgaaaaaag agccatgaca catttataaa atgtttgacg acattttata 240
aatgcatagc ccgataagat tgccaaacca acgcttatca gttagtcaga tgaactcttc 300
cctcgtaaga agttatttaa ttaactttgt ttgaagacgg tatataaccg tactatcatt 360
atatagggaa atcagagagt tttcaagtat ctaagctact gaatttaaga attgttaagc 420
aatcaatcgg aaatcgtttg attgcttttt ttgtattcat ttatagaagg tggagtttgt 480
atgaatcatg atgaatgtaa aacttatata aaaaatagtt tattggagat aagaaaatta 540
gcaaatatct atacactaga aacgtttaag aaagagttag aaaagagaaa tatctactta 600
gaaacaaaat cagataagta tttttcttcg gagggggaag attatatata taagttaata 660
gaaaataaca aaataattta ttcgattagt ggaaaaaaat tgacttataa aggaaaaaaa 720
tctttttcaa aacatgcaat attgaaacag ttgaatgaaa aagcaaacca agttaattaa 780
acaacctatt ttataggatt tataggaaag gagaacagct gaatgaatat cccttttgtt 840
gtagaaactg tgcttcatga cggcttgtta aagtacaaat ttaaaaatag taaaattcgc 900
tcaatcacta ccaagccagg taaaagcaaa ggggctattt ttgcgtatcg ctcaaaatca 960
agcatgattg gcggtcgtgg tgttgttctg acttccgagg aagcgattca agaaaatcaa 1020
gatacattta cacattggac acccaacgtt tatcgttatg gaacgtatgc agacgaaaac 1080
cgttcataca cgaaaggaca ttctgaaaac aatttaagac aaatcaatac cttctttatt 1140
gattttgata ttcacacggc aaaagaaact atttcagcaa gcgatatttt aacaaccgct 1200
attgatttag gttttatgcc tactatgatt atcaaatctg ataaaggtta tcaagcatat 1260
tttgttttag aaacgccagt ctatgtgact tcaaaatcag aatttaaatc tgtcaaagca 1320
gccaaaataa tttcgcaaaa tatccgagaa tattttggaa agtctttgcc agttgatcta 1380
acgtgtaatc attttggtat tgctcgcata ccaagaacgg acaatgtaga attttttgat 1440
cctaattacc gttattcttt caaagaatgg caagattggt ctttcaaaca aacagataat 1500
aagggcttta ctcgttcaag tctaacggtt ttaagcggta cagaaggcaa aaaacaagta 1560
gatgaaccct ggtttaatct cttattgcac gaaacgaaat tttcaggaga aaagggttta 1620
atagggcgta ataacgtcat gtttaccctc tctttagcct actttagttc aggctattca 1680
atcgaaacgt gcgaatataa tatgtttgag tttaataatc gattagatca acccttagaa 1740
gaaaaagaag taatcaaaat tgttagaagt gcctattcag aaaactatca aggggctaat 1800
agggaataca ttaccattct ttgcaaagct tgggtatcaa gtgatttaac cagtaaagat 1860
ttatttgtcc gtcaagggtg gtttaaattc aagaaaaaaa gaagcgaacg tcaacgtgtt 1920
catttgtcag aatggaaaga agatttaatg gcttatatta gcgaaaaaag cgatgtatac 1980
aagccttatt tagtgacgac caaaaaagag attagagaag tgctaggcat tcctgaacgg 2040
acattagata aattgctgaa ggtactgaag gcgaatcagg aaattttctt taagattaaa 2100
ccaggaagaa atggtggcat tcaacttgct agtgttaaat cattgttgct atcgatcatt 2160
aaagtaaaaa aagaagaaaa agaaagctat ataaaggcgc tgacaaattc ttttgactta 2220
gagcatacat tcattcaaga gactttaaac aagctagcag aacgccctaa aacggacaca 2280
caactcgatt tgtttagcta tgatacaggc tgaaaataaa acccgcacta tgccattaca 2340
tttatatcta tgatacgtgt ttgttttttc tttgctgttt agcgaatgat tagcagaaat 2400
atacagagta agattttaat taattattag ggggagaagg agagagtagc ccgaaaactt 2460
ttagttggct tggactgaac gaagtgaggg aaaggctact aaaacgtcga ggggcagtga 2520
gagcgaagcg aacacttgat tttttaattt tctatctttt ataggtcatt agagtatact 2580
tatttgtcct ataaactatt tagcagcata atagatttat tgaataggtc atttaagttg 2640
agcatattag aggaggaaaa tcttggagaa atatttgaag aacccgatta catggattgg 2700
attagttctt gtggttacgt ggtttttaac taaaagtagt gaatttttga tttttggtgt 2760
gtgtgtcttg ttgttagtat ttgctagtca aagtgattaa atagaattct agcgccattc 2820
gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg 2880
ccagctggcg aaagggggat gaaataaaag gttattttgc attgacaaag ataattaaat 2940
attttattat tagttcataa gttagtttaa tatactaaca aaaataaagc aagtaaaata 3000
tacctaaaat ataaaaaaat taggatagga aaacgatagt tatgaagtgg cattcaagga 3060
gggatggata aaaagtacag tattggtcta gacataggaa ctaactctgt tgggtgggct 3120
gttataacag atgaatataa agttccatca aaaaaattta aagtattagg aaacactgat 3180
agacattcaa taaaaaaaaa cttgataggt gctttattat tcgattcagg agagactgct 3240
gaagctacac gtttaaaaag aacagctaga cgtagatata caagaagaaa aaataggata 3300
tgttatcttc aagaaatttt tagtaatgaa atggcaaaag ttgatgattc attctttcac 3360
agactagaag aaagtttctt agttgaagaa gataagaagc atgaaagaca ccctattttt 3420
ggtaatatcg tagatgaagt agcatatcat gagaagtatc caactatcta tcatttaaga 3480
aagaaattag ttgattctac agataaagct gatctgagat taatatattt agctttagct 3540
catatgatta aatttagagg acatttttta atagaaggtg atttaaaccc agacaacagc 3600
gatgtagata aattatttat ccaattagtt caaacttata atcaattatt cgaagagaat 3660
ccaattaatg caagtggtgt agacgctaag gctatattat cagctagatt atcaaaatct 3720
agaagattag aaaatctaat agctcaactt cctggagaaa agaaaaatgg actttttggg 3780
aacctaatag ctctctcact cggactaaca ccaaatttta aaagcaattt tgatcttgct 3840
gaagacgcaa agttacaact atcaaaggat acatacgatg atgatttaga taatttgtta 3900
gctcaaatag gtgatcaata tgctgatttg tttcttgcag caaaaaactt aagtgatgca 3960
attttactat cagatatact tagagtaaat acagaaataa caaaggctcc tttatcagca 4020
agtatgatta aacgatatga tgagcatcat caagatttaa cattattaaa ggcacttgta 4080
agacaacaat taccagaaaa atataaagaa attttctttg atcaatctaa aaatggatat 4140
gctggatata tagacggtgg agcaagtcaa gaagagtttt ataaatttat aaagcctatt 4200
ttagaaaaaa tggatggaac tgaagaatta cttgttaaac ttaacagaga agatttactt 4260
agaaaacaaa gaacttttga taatggttca attcctcacc aaattcattt aggagaatta 4320
catgctatac taagaagaca agaagatttt tatccatttc ttaaagataa tagagaaaaa 4380
attgaaaaaa ttttaacttt tagaatacca tattatgtag gaccacttgc aaggggaaat 4440
tcaagatttg catggatgac tagaaaatca gaagaaacta taaccccgtg gaattttgaa 4500
gaagtagtag ataaaggagc tagtgctcaa tcatttatag aaagaatgac aaattttgat 4560
aagaatcttc ctaacgaaaa ggttttgcca aagcatagcc ttctttatga gtattttaca 4620
gtttataatg agcttactaa agtaaaatac gttacagaag gaatgagaaa accagcattt 4680
ttgtctggtg aacaaaagaa agcaatagta gacctattat ttaaaacaaa taggaaggtt 4740
accgtaaagc aacttaaaga agattacttc aaaaaaattg aatgctttga tagtgttgaa 4800
atatcaggag ttgaagatag atttaatgct tcacttggta catatcacga tctcttaaaa 4860
attataaaag ataaggattt tttagataat gaagaaaatg aagatattct tgaagatata 4920
gtattaacat tgacactttt tgaagataga gaaatgatag aagaaagatt aaaaacatat 4980
gcacatcttt ttgatgataa ggttatgaag caacttaaaa gaagaagata tacaggttgg 5040
ggacgtttgt caagaaagct aattaatggt attagagata aacaatcagg aaagactatt 5100
ctcgattttc ttaaatcaga tggatttgct aatagaaact ttatgcaatt aattcatgat 5160
gattctctta ctttcaaaga ggatattcaa aaggctcaag tttctggaca aggcgatagc 5220
ttacacgaac acattgctaa ccttgcaggg agccccgcta tcaaaaaagg aattttacaa 5280
acagttaaag ttgtagatga acttgttaaa gttatgggaa gacacaaacc tgagaatata 5340
gttatagaaa tggccagaga aaatcaaaca acacaaaaag gacaaaaaaa ttctagagag 5400
agaatgaaga gaattgaaga aggaataaaa gagctaggat cacaaatatt aaaagaacat 5460
ccagttgaaa atactcaatt gcaaaatgaa aagttatatt tgtattactt acaaaatgga 5520
agagatatgt atgttgatca agaactcgat attaatagat taagtgacta tgatgttgat 5580
catattgttc ctcaatcatt tttaaaagat gattcaatcg ataacaaagt attaactaga 5640
tcagataaaa atagaggaaa gtcagataat gtaccatctg aagaagttgt taaaaaaatg 5700
aagaactatt ggagacaact tttaaatgca aagctaatta cacaaagaaa atttgacaat 5760
ttaacaaaag cagaaagagg aggattaagc gaattagaca aagctggatt tataaaaaga 5820
caacttgttg agacaagaca aataactaag catgttgctc aaatacttga ttcaagaatg 5880
aatacaaaat atgatgaaaa tgataaatta atcagagaag taaaagtaat aacattaaag 5940
tcaaaattag tatcagattt cagaaaggat tttcaatttt acaaagttcg tgaaataaat 6000
aactatcatc atgctcatga tgcatactta aatgctgttg taggaactgc tcttattaag 6060
aaatatccta aactagaaag cgaatttgtt tatggagatt ataaagttta tgatgtgcgc 6120
aaaatgatcg cgaaatccga acaagaaatc ggtaaggcta cagcaaaata tttcttttat 6180
agtaatataa tgaatttttt taagacagaa ataactttgg ctaatggtga aatcagaaaa 6240
agaccactta tcgaaacaaa tggagagaca ggagaaatag tatgggataa aggaagagat 6300
tttgctactg ttagaaaagt actaagtatg ccacaagtaa atatcgtaaa gaaaactgaa 6360
gttcaaactg gaggtttctc taaggaatca attttaccta agagaaattc agataagtta 6420
attgcaagga aaaaagattg ggacccaaaa aaatacggtg gttttgatag tccaacagtt 6480
gcctatagtg ttcttgtagt agcgaaagtt gagaaaggta agtcaaaaaa gttgaaaagc 6540
gtaaaagaac ttcttggtat cacaattatg gaaagatctt catttgaaaa aaatccaatt 6600
gactttttag aagctaaggg ttataaagaa gttaaaaagg atttaatcat aaaactacca 6660
aagtatagtc tatttgaact cgaaaacgga agaaaacgaa tgctcgctag cgcaggagaa 6720
cttcaaaaag gaaatgaact tgcgctgcca tcaaagtatg taaatttctt atatttagct 6780
tctcattatg agaaattaaa aggatcacca gaggataatg aacaaaagca actatttgta 6840
gaacaacaca aacattattt agatgaaata atagaacaaa tatctgaatt ttctaaaaga 6900
gttatacttg ccgacgcaaa tctagataag gtgctttcag cgtataataa acacagagat 6960
aaaccaataa gagaacaagc agaaaacatt atccatcttt ttacattaac taatcttggt 7020
gcaccagctg catttaagta ctttgataca acaatagata gaaaaagata cacatctact 7080
aaagaagtat tagacgcaac tttaatacat caatctatta cagggcttta tgaaacaaga 7140
attgatttaa gtcaactagg cggagattaa gaattttttt agaaagaaca tgtgagcaaa 7200
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 7260
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 7320
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 7380
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc 7440
tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 7500
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga 7560
gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta acaggattag 7620
cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta 7680
cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag 7740
agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 7800
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac 7860
ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc 7920
aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag 7980
tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc 8040
agcgatctgt ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac 8100
gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc 8160
accggctcca gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg 8220
tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag 8280
tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc 8340
acgctcgtcg tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac 8400
atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag 8460
aagtaagttg gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac 8520
tgtcatgcca tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg 8580
agaatagtgt atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc 8640
gccacatagc agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact 8700
ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg 8760
atcttcagca tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa 8820
tgccgcaaaa aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt 8880
tcaatattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 8940
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 9000
ctgccgggcc tcttgcggga tcaaaagaaa aacgaaatga tacaccaatc agtgcaaaaa 9060
aagatataat gggagataag acggttcgtg ttcgtgctga cttgcaccat atcataaaaa 9120
tcgaaacagc aaagaatggc ggaaacgtaa aagaagttat ggaaataaga cttagaagca 9180
aacttaagag tgtgttgata gtgcagtatc ttaaaatttt gtataatagg aattgaagtt 9240
aaattagatg ctaaaaattt gtaattaaga aggagtgatt acatgaacaa aaatataaaa 9300
tattctcaaa actttttaac gagtgaaaaa gtactcaacc aaataataaa acaattgaat 9360
ttaaaagaaa ccgataccgt ttacgaaatt ggaacaggta aagggcattt aacgacgaaa 9420
ctggctaaaa taagtaaaca ggtaacgtct attgaattag acagtcatct attcaactta 9480
tcgtcagaaa aattaaaact gaatactcgt gtcactttaa ttcaccaaga tattctacag 9540
tttcaattcc ctaacaaaca gaggtataaa attgttggga gtattcctta ccatttaagc 9600
acacaaatta ttaaaaaagt ggtttttgaa agccatgcgt ctgacatcta tctgattgtt 9660
gaagaaggat tctacaagcg taccttggat attcaccgaa cactagggtt gctcttgcac 9720
actcaagtct cgattcagca attgcttaag ctgccagcgg aatgctttca tcctaaacca 9780
aaagtaaaca gtgtcttaat aaaacttacc cgccatacca cagatgttcc agataaatat 9840
tggaagctat atacgtactt tgtttcaaaa tgggtcaatc gagaatatcg tcaactgttt 9900
actaaaaatc agtttcatca agcaatgaaa cacgccaaag taaacaattt aagtaccgtt 9960
acttatgagc aagtattgtc tatttttaat agttatctat tatttaacgg gaggaaataa 10020
ttctatgagt ccctaggggt tccgcgcaca tttccccgaa aagtgccacc tgaacgaagc 10080
atctgtgctt cattttgtag aacaaaaatg caacgcgaga gcgctaattt ttcaaacaaa 10140
gaatctgagc tgcattttta cagaacagaa atgcaacgcg aaagcgctat tttaccaacg 10200
aagaatctgt gcttcatttt tgtaaaacaa aaatgcaacg cgagagcgct aatttttcaa 10260
acaaagaatc tgagctgcat ttttacagaa cagaaatgca acgcgagagc gctattttac 10320
caacaaagaa tctatacttc ttttttgttc tacaaaaatg catcccgaga gcgctatttt 10380
tctaacaaag catcttagat tacttttttt ctcctttgtg cgctctataa tgcagtctct 10440
tgataacttt ttgcactgta ggtccgttaa ggttagaaga aggctacttt ggtgtctatt 10500
ttctcttcca taaaaaaagc ctgactccac ttcccgcgtt tactgattac tagcgaagct 10560
gcgggtgcat tttttcaaga taaaggcatc cccgattata ttctataccg atgtggattg 10620
cgcatacttt gtgaacagaa agtgatagcg ttgatgattc ttcattggtc agaaaattat 10680
gaacggtttc ttctattttg tctctatata ctacgtatag gaaatgttta cattttcgta 10740
ttgttttcga ttcactctat gaatagttct tactacaatt tttttgtcta aagagtaata 10800
ctagagataa acataaaaaa tgtagaggtc gagtttagat gcaagttcaa ggagcgaaag 10860
gtggatgggt aggttatata gggatatagc acagagatat atagcaaaga gatacttttg 10920
agcaatgttt gtggaagcgg tattcgcaat attttagtag ctcgttacag tccggtgcgt 10980
ttttggtttt ttgaaagtgc gtcttcagag cgcttttggt tttcaaaagc gctctgaagt 11040
tcctatactt tctagagaat aggaacttcg gaataggaac ttcaaagcgt ttccgaaaac 11100
gagcgcttcc gaaaatgcaa cgcgagctgc gcacatacag ctcactgttc acgtcgcacc 11160
tatatctgcg tgttgcctgt atatatatat acatgagaag aacggcatag tgcgtgttta 11220
tgcttaaatg cgtacttata tgcgtctatt tatgtaggat gaaaggtagt ctagtacctc 11280
ctgtgatatt atcccattcc atgcggggta tcgtatgctt ccttcagcac taccctttag 11340
ctgttctata tgctgccact cctcaattgg attagtctca tccttcaatg ctatcatttc 11400
ctttgatatt ggatcatact aagaaaccat tattatcatg acattaacct ataaaaatag 11460
gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca 11520
catgcagctc ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc 11580
ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc 11640
agagcagatt gtactgagag tgcaccatac cacagctttt caattcaatt catcattttt 11700
tttttattct tttttttgat ttcggtttct ttgaaatttt tttgattcgg taatctccga 11760
acagaaggaa gaacgaagga aggagcacag acttagattg gtatatatac gcatatgtag 11820
tgttgaagaa acatgaaatt gcccagtatt cttaacccaa ctgcacagaa caaaaacctg 11880
caggaaacga agataaatca tgtcgaaagc tacatataag gaacgtgctg ctactcatcc 11940
tagtcctgtt gctgccaagc tatttaatat catgcacgaa aagcaaacaa acttgtgtgc 12000
ttcattggat gttcgtacca ccaaggaatt actggagtta gttgaagcat taggtcccaa 12060
aatttgttta ctaaaaacac atgtggatat cttgactgat ttttccatgg agggcacagt 12120
taagccgcta aaggcattat ccgccaagta caatttttta ctcttcgaag acagaaaatt 12180
tgctgacatt ggtaatacag tcaaattgca gtactctgcg ggtgtataca gaatagcaga 12240
atgggcagac attacgaatg cacacggtgt ggtgggccca ggtattgtta gcggtttgaa 12300
gcaggcggca gaagaagtaa caaaggaacc tagaggcctt ttgatgttag cagaattgtc 12360
atgcaagggc tccctatcta ctggagaata tactaagggt actgttgaca ttgcgaagag 12420
cgacaaagat tttgttatcg gctttattgc tcaaagagac atgggtggaa gagatgaagg 12480
ttacgattgg ttgattatga cacccggtgt gggtttagat gacaagggag acgcattggg 12540
tcaacagtat agaaccgtgg atgatgtggt ctctacagga tctgacatta ttattgttgg 12600
aagaggacta tttgcaaagg gaagggatgc taaggtagag ggtgaacgtt acagaaaagc 12660
aggctgggaa gcatatttga gaagatgcgg ccagcaaaac taaaaaactg tattataagt 12720
aaatgcatgt atactaaact cacaaattag agcttcaatt taattatatc agttattacc 12780
ctatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgccct aggcccaact 12840
aactcaacg 12849
<210> 30
<211> 9367
<212> DNA
<213> 人工序列
<220>
<223> 质粒
<400> 30
aaaaataatg gccctaggga tatacaattt ttaaaaaata tagctaatct tgcataaact 60
atataataaa tctaatttta gaatgtataa tttaaagggt tatggctgta tatagtcata 120
acccttatat ttcattaatt aaagttaaat tttgtatatt caaaaagctc ttgatattta 180
cggcagagcc accatataga gatgcctcag accctaatga tgaaatttct atatttattc 240
ctttgttaaa agagctaact aacatatctt ttgtaatctg taatatctca gggatatcac 300
taattatttg gctatttaaa tatattattt caggagcata agttgtaata gcattattta 360
tagctattgt taagtaacta caaaactcat gaataacttt tttggcattc tggttatctt 420
cataatatag ttgctttact atatcagaat ctattttagg aatattttct aaagaagaaa 480
gctgttcaaa tacctttttt tctgaacaat actgttctaa acagccacga tttccacaag 540
gacatagttt accattaggc atgatgatag tgtgaccaat ttctccactc ataccatttc 600
tgccactata taatttatta tttattataa tgcctgaacc aaatccactg tgaatactga 660
gactaagtag tgagttatgt acagttgaaa aagtattctc agctaaagct gttaagtttg 720
cttcattttc tatgtgaata ggaaagtcat actttttgct aaggatgcta tataagtcaa 780
tctcattaag attgtaatat ggagtaaaca atactttatt ttcacaagta atcccatgta 840
tagccaaagt aagaccaatt accttataag gagtgtctat tttggatatg ttataactat 900
ttataatttc atctatcaat tgtataacat tgtctttact tacttgtata tctgttaatt 960
tcttagaatt tataatagtt ccatctaagt aagatagaga agaaaatata tagtcgtatc 1020
caatgtccat acttaaagaa atccctgcac atttattaaa tactaataat atgggttttc 1080
ttccgccact atgagtacta tttccaattc ctatttcatg aactagagat tcatcaataa 1140
gttttttggt aatagcggat atagttgctt tattcaatcc tatagtagaa gctatacttg 1200
ccctagaaat aggaccattt tttataattt gttcaagtac caatctttca ttcatttctc 1260
gaatagtgta tttatcagta accaatttga tattcctcct taaaataata ttgtaatact 1320
ttttacacaa aaataaaagg ttattttgca ttgacaaaga taattaaata ttttattatt 1380
agttcataag ttagtttaat atactaacaa aaataaagca agtaaaatat acctaaaata 1440
taaaaaaatt aggataggaa aacgatagtt atgaagtggc attcaaggag tatggaaaaa 1500
ctcaagttta tgcttgtttt agagctagaa atagcaagtt aaaataaggc tagtccgtta 1560
tcaacttgaa aaagtggcac cgagtcggtg cttttaaagt attgttaaaa ataactctgt 1620
agaattataa attagttcta cagagttatt ttttaaaaaa attcaagctt gcatgcctgc 1680
aggccatttt agctaaggat atctataatg aaaagattta gaggaggaat aatttatgaa 1740
aaagattttt gtacttggag caggaacaat gggtgctggt atcgttcaag cattcgctca 1800
aaaaggttgt gaagtaattg taagagacat aaaggaagaa tttgttgaca gaggaatagc 1860
tggaatcact aaaggattag aaaagcaagt tgctaaagga aaaatgtctg aagaagataa 1920
agaagctata ctttcaagaa tttcaggaac aactgatatg aaattagctg ctgactgtga 1980
tttagtagtt gaagctgcaa tcgaaaacat gaaaattaag aaggaaatct tcgctgaatt 2040
agatggaatt tgtaagccag aagcgatttt agcttcaaac acttcatctt tatcaattac 2100
tgaagttgct tcagctacaa agagacctga taaagttatc ggaatgcatt tctttaatcc 2160
agctccagta atgaagcttg ttgaaattat taaaggaata gctacttctc aagaaacttt 2220
tgatgctgtt aaggaattat cagttgctat tggaaaagaa ccagtagaag ttgcagaagc 2280
tccaggattc gttgtaaaca gaatattaat cccaatgatt aacgaagctt catttatcct 2340
acaagaagga atagcttcag ttgaagatat tgatacagct atgaaatatg gtgctaacca 2400
tccaatggga cctttagctt taggagatct tattggatta gacgtttgct tagctatcat 2460
ggatgtttta ttcactgaaa caggtgataa caagtacaga gctagcagca tattaagaaa 2520
atatgttaga gctggatggc ttggaagaaa atcaggaaaa ggattctatg attactctaa 2580
ataatatcat gaatatatgg aaaaactcaa gtttatgctt caaactgctt cccctaattc 2640
cccattttta tatatatttg tattaaatta actttaatgt tacaatgttc ttagtatatt 2700
ttccttaata ttacattagt tctataaact ttattgttct taatatttaa ataaaatcca 2760
tgaagggagg aaaaaactat cttttaaaag tttatagtaa ataaaaaaaa attattaatg 2820
taaaaatata ctaagtatag aatatttata atagggggta ttaacttgtt ttcaaaaatc 2880
aaaaaaatta atttttttaa aaaaacattt tcttttttaa ttgctgttgt aatgatgttg 2940
tttacagtat taggaacaaa tacttataaa gctgaagctg caagtggtgg tgcctgggct 3000
caatgtggag gtgaaaactt ccatggtgat aaatgttgtg tttccggtca cacttgtgtt 3060
agtattaacc aatggtattc acaatgtcaa ccaggaggtg ctccaagcaa taatgcttca 3120
aacaataata ataacaataa caataataac aacaacaata ataacaacaa caataatcac 3180
aacaacaaca acaataacaa caataacaat aacaatggtg gtagtggtag tactaaaaac 3240
ttcttcgata accaaattta tgctaaccca aagtttattg aagaagtcaa ttcttctatt 3300
ccaagattaa gttatgattt acaacaaaag gctcaaaagg ttaagaatgt tccaactgcc 3360
gtttggttag cttgggatgg agccactgga gaagttgctc aacatcttaa agctgctggt 3420
tctaaaactg ttgtcttcat catgtacatg attccaactc gtgattgtaa cgctaatgcc 3480
tctgctggtg gtgctggtaa cctcaacact tacaagggat acgttgataa tattgctaga 3540
actattcgta gttatccaaa ttcaaaggtt gttatgattc ttgaaccaga tactcttggt 3600
aaccttgtta ctgctaatag tgctaactgt caaaacgttc gtaacttaca taagaatgct 3660
ttatcttatg gtgttaatgt tttcggtagc atgagtaatg ttagtgttta ccttgatgct 3720
gctcatggtg cttggttagg tagctctact gataaggttg cttctgttgt taaggaaatc 3780
ttaaataatg ctccaaatgg aaagattcgt ggtttaagta ctaacatttc taactaccaa 3840
tcaatttctt ctgaatacca ataccaccaa aaacttgcct ctgctcttgc tgctgtcggt 3900
gttccaaaca tgcacttcat tgttgatact ggtcgtaatg gtgttactat taattctgga 3960
acatggtgta acttagtcgg tactggtctt ggtgaacgtc caagaggtaa tccaaatgct 4020
ggtatgccat tattagatgc ttacatgtgg cttaagactc caggagaatc tgacggttca 4080
tcctctggtt ctagagctga tccaaattgt tctagtaatg attctcttag aggtgctcca 4140
gatgctggtc aatggttcca tgattacttc gctcaattag taagaaatgc tagaccatca 4200
ttttaagcaa atttctaaat gattgaattt aacaaaaatt actaatctag aggatccccg 4260
ggtaccgagc tcgaattcgt aatcatggaa atatataaat aactactaga ttacataaat 4320
aagtttaaat taaagaatat tgtaaaattc aacaaaacat tttattactg ttaagtaagt 4380
gatataatat attaaataat aactattttc ttaagattaa ttgtataatt taattgaata 4440
ttggatatta aaaaaataaa tagttaaaaa atatgatata ggtggtacac atcagaaaat 4500
gggcaatata aactgtgaca taatctttag aaaaaaactt tagtgtagaa tatattattt 4560
aaagaagaat atatgaatga aataagttaa aactaaggtt gatatgatat tatgtttatc 4620
aatgatttta ctaaaaagat aaaaatatat tttgaatatt cataatcaaa gggtagggtt 4680
catcaatatg catattagat tatgagatga tgcaaactaa ataattttat actgtaaggg 4740
agacggataa taaaatggat tataaatctg tagaatttta tgataattat ttaataataa 4800
aagaattaag aaattttaaa ttaaaacata tatttgagtg cggacagatt tttaggtttg 4860
aagaggtagc agaaaatgat tttattgtaa ttgcgtttgg aagattaatt gaagttaaag 4920
aagatggaaa tgacgtaata atttataatt ctacaaagga agattttaaa aatatttggc 4980
ttaagtattt tgatttagat agagattact cagttataaa agatgaactt tcaaaagatg 5040
ttttacttaa acaaagtatt gaatttggat atggtgttag agtcttgaat caagatccat 5100
ttgaaatgtt gcttagcttt attatttcag cgcgaaataa tataccatca ataaagaaga 5160
ctgtaaataa aatatctaat aaatggggaa aagaaattat ttataaggat aaaacctact 5220
atgcgtttcc taatataggt gaaataaaag atgctacact agaagaaata caggagacag 5280
gagcatcttc catggacgcg tgacgtcgac tctagaaatt cttctaaata taagaatatt 5340
ttaaagaaat atctttatat attagttatt aaaatttata agattataag aaacattata 5400
acatatttta gaacttttta actattctaa aagattaatt tacatattaa catttaatta 5460
tgggtaaaaa ctattttgaa aaatgattta tatggaatta tgtttcttaa atatacaatc 5520
atgtttcatg aatacataat tattttaaat gtattgggag ggtaaattga ttgtgaaacg 5580
cggcgatgtt tattttgctg atttatctcc tgttgttggc tcagagcaag gcggggtgcg 5640
cccggtttta gtgatccaaa atgacatcgg aaatcgcttc agcccaactg ctattgttgc 5700
agccataaca gcacaaatac agaaagcgaa attaccaacc cacgtcgaaa tcgatgcaaa 5760
acgctacggt tttgaaagag attccgttat tttgctggag caaattcgga cgattgacaa 5820
gcaaaggtta acggataaga ttactcatct ggatgatgaa atgatggata aggttgatga 5880
agccttacaa atcagtttgg cactcattga tttttagaca tatttgcagg ttgctcaaat 5940
agagcgaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 6000
gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 6060
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 6120
ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 6180
cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 6240
ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 6300
cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 6360
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 6420
gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct gcgctctgct 6480
gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 6540
tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 6600
agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 6660
agggattttg gtcatgagat tatcaaaaag gagtttaaac ttctaaaatc tgattaccaa 6720
ttagaatgaa tatttcccaa atattaaata ataaaacaaa aaaattgaaa aaagtgtttc 6780
caccattttt tcaatttttt tataattttt ttaatctgtt atttaaatag tttatagtta 6840
aatttacatt ttcattagtc cattcaatat tctctccaag ataactacga actgctaaca 6900
aaattctctc cctatgttct aatggagaag attcagccac tgcatttccc gcaatatctt 6960
ttggtatgat tttacccgtg tccatagtta aaatcatacg gcataaagtt aatatagagt 7020
tggtttcatc atcctgataa ttatctatta attcctctga cgaatccata atggctcttc 7080
tcacatcaga aaatggaata tcaggtagta attcctctaa gtcataattt ccgtatattc 7140
ttttattttt tcgttttgct tggtaaagca ttatggttaa atctgaattt aattccttct 7200
gaggaatgta tccttgttca taaagctctt gtaaccattc tccataaata aattcttgtt 7260
tgggaggatg attccacggt accatttctt gctgaataat aattgttaat tcaatatatc 7320
gtaagttgct tttatctcct attttctttg aaataggtct aattttttgt ataagtattt 7380
ctttactttg atctgtcaat ggttcagata cgacgactaa aaagtcaaga tcactatttg 7440
gttttagtcc actctcaact cctgatccaa acatgtaagt accaataagg ttatttttta 7500
aatgtttccg aagtattttt ttcactttat taatttgttc gtatgtattc aaatatatcc 7560
tcctcactat tttgattagt acctatttta tatccatagt tgttaattaa ataaacttaa 7620
tttagtttat ttatagattt cattggcttc taaatttttt atctagataa taattatttt 7680
agttaatttt attctagatt atatatgata tgatctttca tttccataaa actaaagtaa 7740
gtgtaaacct attcattggg ccggccgctt ataatccata acaatcatcc tttctgtgac 7800
actgtcagac acttatcaca ttaagtatat actattatta aactattcta tatacttaat 7860
ttattttaat agaaaaacat aatatcataa taacttcaaa attaaacttt atttatgatt 7920
tcatacttga ctttgatttt agaaaggata tactttttag cagatttgga aacggctttg 7980
gacgtagttt gcccatagat gaacaaacaa actacatcca aaaattatac ttttcccttc 8040
attggtatcc gtatttttac atcttaatag cgtatgtatt acaacacacc taaacaacga 8100
ccttacggtc tgctactgca tatcctagct tgattgttta gttgcctcaa ctatgcttaa 8160
ccctaccccg aactcttttt ttattgtggg ttttcgtcgt gaagtcccac cgacacataa 8220
tcataacata agatgtatta tgaaaatgcg agtgactatc cttttgtatc ggctcactac 8280
accacagata tattttttag tgcatactgt gtcggcactc tcaatattaa attaataaat 8340
aaattatttt ttctttttac tcttctttac atgagctttt ttaaagctcc ttgcataata 8400
tttaatgcat gtacgttctt ttttctgttc ttcctctgta aaacatctca tttttatggc 8460
acaccatcca tatcggttca tctttgaaca attaatacat tggactttcc ctttatgtaa 8520
acatcttgac tcattgtatt tactgcaata ggttgctact gttttatcat gaatcatcaa 8580
cgaatgtatt ttgcatattt cagtatcttt aataaactct ttgcaatttt tacaagtttt 8640
catatacgcc cttcttttca taaattaatt tatgaattct atgtattcca aggagctttt 8700
taaagctccc cctttcgtac tacttagcta caagcactat aaaagtcata atgtttactg 8760
ctaaagtcaa taatgatatt gctaatacct tgttatttga taagatactg ctttcctctg 8820
tcactttgct cacccccttt cattttcata aattaattat gaaaaaataa atatacttct 8880
aatttttatc aaataaaaaa gcctttgcgt actgcttcca atacacaaag gctataaact 8940
tctaaatctt acttattgca atttacattt atatctgtta agataatctc ataaattgaa 9000
tatatatagt taatgtttct cttgtatgtc ggtacatttg aaatattgct atagatagag 9060
ttctctaacg gcttgatgtg ttggtagcac atttaagttt tggcttatat actaagttgg 9120
tagcttaata tataagagct gaggacttat ttttttatta aatttttcaa cttgtctata 9180
ttttaacccg taattgaata cataacaagt attttttttt gtattcaatt aaacattcat 9240
aaatgagtat aattaatcat actaaattct ataattttct tttctgtaaa tttctttcta 9300
ttcagcactg ttatgccttt tgactatcac ttaataaaaa ataagaaatg aattgtcaat 9360
tgttcaa 9367
<210> 31
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 31
agataattat gaagttaatc cttag 25
<210> 32
<211> 27
<212> DNA
<213> 人工序列
<220>
<223> 引物
<400> 32
catttgcttt caggtcttct tttgctg 27

Claims (15)

1.一种允许通过同源重组来转化梭状芽胞杆菌(Clostridium)属的产溶剂细菌的遗传工具,其特征在于所述遗传工具包含:
-至少编码Cas9的第一核酸,其中所述Cas9编码序列被置于启动子的控制下,和
-至少一种含有修复模板的第二核酸,其允许通过同源重组机制使Cas9靶向的细菌DNA的一部分被目标序列替换,
并且i)所述核酸中的至少一种还编码一种或多种向导RNA(gRNA),或ii)所述遗传工具还包含一种或多种向导RNA,每种向导RNA包含Cas9酶结合性RNA结构以及与所述细菌DNA的靶向部分互补的序列。
2.权利要求1的工具,其特征在于编码所述一种或多种向导RNA的序列在启动子之后,并且所述启动子或控制Cas9的启动子是诱导型启动子。
3.权利要求1或2的工具,其特征在于所述细菌DNA的靶向部分包含细菌存活所必需的基因或基因部分。
4.权利要求1至3任一项的工具,其特征在于所述梭状芽胞杆菌属的产溶剂细菌选自丙酮丁醇梭状芽胞杆菌(C.acetobutylicum)、解纤维梭状芽胞杆菌(C.cellulolyticum)、C.phytofermentans、拜氏梭状芽胞杆菌(C.beijerinckii)、糖丁酸梭状芽胞杆菌(C.saccharobutylicum)、糖丁基丙酮梭状芽胞杆菌(C.saccharoperbutylacetonicum)、产芽孢梭状芽胞杆菌(C.sporogenes)、丁酸梭状芽胞杆菌(C.butyricum)、金黄丁酸梭状芽胞杆菌(C.aurantibutyricum)和酪丁酸梭状芽胞杆菌(C.tyrobutyricum)。
5.权利要求4的工具,其特征在于当所述产溶剂细菌是丙酮丁醇梭状芽胞杆菌时,所述丙酮丁醇梭状芽胞杆菌细菌是菌株ATCC824,并且当所述产溶剂细菌是拜氏梭状芽胞杆菌时,所述拜氏梭状芽胞杆菌细菌是菌株DSM 6423。
6.权利要求1至5任一项的工具,其特征在于所述Cas9蛋白包含序列SEQ ID NO:1。
7.权利要求1至6任一项的工具,其特征在于所述Cas9启动子是诱导型启动子。
8.权利要求1至7任一项的工具,其特征在于所述目标DNA序列编码至少一种促进溶剂产生的产物,通常是至少一种参与醛转化为醇的酶,膜蛋白,转录因子,或其组合。
9.权利要求1至8任一项的工具,其特征在于所述工具内存在的核酸各自属于不同的表达盒或不同的载体,例如质粒。
10.一种通过同源重组来转化梭状芽胞杆菌属的产溶剂细菌的方法,其特征在于所述方法包含将权利要求1至9任一项的遗传工具引入所述细菌中的步骤。
11.权利要求10的方法,其特征在于所述方法包含以下步骤:
a)将权利要求1至9任一项的遗传工具引入所述细菌中,所述遗传工具包含至少一个诱导型启动子,和
b)诱导所述诱导型启动子的表达以遗传修饰所述细菌。
12.权利要求10或11的方法,其特征在于所述方法包含一个或多个附加步骤,当存在步骤b)时所述附加步骤在步骤b)之后,所述附加步骤引入第n个核酸,所述第n个核酸编码i)与已经引入的修复模板不同的修复模板,和ii)一种或多种向导RNA,所述向导RNA允许它们整合到细菌基因组的靶向区域中,各附加步骤在去除先前引入的编码修复模板的核酸的步骤之后,并优选在去除先前引入的一种或多种向导RNA或编码一种或多种向导RNA的序列的步骤之后。
13.一种梭状芽胞杆菌属的产溶剂细菌,其是使用权利要求10至12任一项的方法转化过的。
14.一种用于转化梭状芽胞杆菌属细菌或使用梭状芽胞杆菌属细菌产生至少一种溶剂的试剂盒,其包含权利要求1至9任一项所公开的遗传工具的组分和任选的一种或多种与在所述工具内使用的选定诱导型启动子相适应的诱导物。
15.权利要求1至9任一项的遗传工具、权利要求10或11任一项的方法、或根据权利要求12转化的细菌在以工业规模生产溶剂或溶剂混合物中的应用,所述溶剂或溶剂混合物优选为丙酮、丁醇、乙醇、异丙醇或其混合物,通常为异丙醇/丁醇混合物。
CN201680068812.7A 2015-10-16 2016-10-14 用于转化梭状芽胞杆菌属细菌的遗传工具 Pending CN108431221A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FR1559846A FR3042506B1 (fr) 2015-10-16 2015-10-16 Outil genetique de transformation de bacteries clostridium
FR1559846 2015-10-16
PCT/FR2016/052663 WO2017064439A1 (fr) 2015-10-16 2016-10-14 Outil genetique de transformation de bacteries clostridium

Publications (1)

Publication Number Publication Date
CN108431221A true CN108431221A (zh) 2018-08-21

Family

ID=55072908

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201680068812.7A Pending CN108431221A (zh) 2015-10-16 2016-10-14 用于转化梭状芽胞杆菌属细菌的遗传工具

Country Status (9)

Country Link
US (2) US11746346B2 (zh)
EP (1) EP3362559B1 (zh)
KR (1) KR20180081527A (zh)
CN (1) CN108431221A (zh)
BR (1) BR112018007622A2 (zh)
CA (1) CA3001815A1 (zh)
DK (1) DK3362559T3 (zh)
FR (1) FR3042506B1 (zh)
WO (1) WO2017064439A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113614229A (zh) * 2018-12-20 2021-11-05 Ifp新能源公司 遗传修饰的梭菌属细菌、其制备和用途
CN114286857A (zh) * 2019-05-24 2022-04-05 Ifp新能源公司 用于修饰细菌的优化的遗传工具

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3613852A3 (en) 2011-07-22 2020-04-22 President and Fellows of Harvard College Evaluation and improvement of nuclease cleavage specificity
US20150044192A1 (en) 2013-08-09 2015-02-12 President And Fellows Of Harvard College Methods for identifying a target site of a cas9 nuclease
US9359599B2 (en) 2013-08-22 2016-06-07 President And Fellows Of Harvard College Engineered transcription activator-like effector (TALE) domains and uses thereof
US9322037B2 (en) 2013-09-06 2016-04-26 President And Fellows Of Harvard College Cas9-FokI fusion proteins and uses thereof
US9737604B2 (en) 2013-09-06 2017-08-22 President And Fellows Of Harvard College Use of cationic lipids to deliver CAS9
US9228207B2 (en) 2013-09-06 2016-01-05 President And Fellows Of Harvard College Switchable gRNAs comprising aptamers
US9068179B1 (en) 2013-12-12 2015-06-30 President And Fellows Of Harvard College Methods for correcting presenilin point mutations
AU2015298571B2 (en) 2014-07-30 2020-09-03 President And Fellows Of Harvard College Cas9 proteins including ligand-dependent inteins
FR3042506B1 (fr) 2015-10-16 2018-11-30 IFP Energies Nouvelles Outil genetique de transformation de bacteries clostridium
CN108513575A (zh) 2015-10-23 2018-09-07 哈佛大学的校长及成员们 核碱基编辑器及其用途
WO2018027078A1 (en) 2016-08-03 2018-02-08 President And Fellows Of Harard College Adenosine nucleobase editors and uses thereof
CA3033327A1 (en) 2016-08-09 2018-02-15 President And Fellows Of Harvard College Programmable cas9-recombinase fusion proteins and uses thereof
WO2018039438A1 (en) 2016-08-24 2018-03-01 President And Fellows Of Harvard College Incorporation of unnatural amino acids into proteins using base editing
KR20240007715A (ko) 2016-10-14 2024-01-16 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 핵염기 에디터의 aav 전달
US10745677B2 (en) 2016-12-23 2020-08-18 President And Fellows Of Harvard College Editing of CCR5 receptor gene to protect against HIV infection
EP3592853A1 (en) 2017-03-09 2020-01-15 President and Fellows of Harvard College Suppression of pain by gene editing
JP2020510439A (ja) 2017-03-10 2020-04-09 プレジデント アンド フェローズ オブ ハーバード カレッジ シトシンからグアニンへの塩基編集因子
SG11201908658TA (en) 2017-03-23 2019-10-30 Harvard College Nucleobase editors comprising nucleic acid programmable dna binding proteins
US11560566B2 (en) 2017-05-12 2023-01-24 President And Fellows Of Harvard College Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation
US11732274B2 (en) 2017-07-28 2023-08-22 President And Fellows Of Harvard College Methods and compositions for evolving base editors using phage-assisted continuous evolution (PACE)
US11319532B2 (en) 2017-08-30 2022-05-03 President And Fellows Of Harvard College High efficiency base editors comprising Gam
CN111757937A (zh) 2017-10-16 2020-10-09 布罗德研究所股份有限公司 腺苷碱基编辑器的用途
US20210047653A1 (en) * 2018-01-30 2021-02-18 The University Of Memphis Research Foundation Compositions and methods for regulating a biological process
FR3081881B1 (fr) 2018-06-04 2024-05-24 Ifp Energies Now Outil genetique optimise pour modifier les bacteries du genre clostridium
DE112020001342T5 (de) 2019-03-19 2022-01-13 President and Fellows of Harvard College Verfahren und Zusammensetzungen zum Editing von Nukleotidsequenzen
WO2021123391A1 (en) 2019-12-18 2021-06-24 Exomnis Biotech B.V. Genetically modified clostridium strains and uses thereof
EP4146804A1 (en) 2020-05-08 2023-03-15 The Broad Institute Inc. Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence
FR3111642A1 (fr) * 2020-06-23 2021-12-24 IFP Energies Nouvelles Souches de bacteriesclostridiumresistantes au 5-fluorouracile, outils genetiques et utilisations de ceux-ci
FR3137108A1 (fr) 2022-06-27 2023-12-29 IFP Energies Nouvelles Bacteries clostridium modifiees pour assimiler plusieurs sucres simultanement, preparation et utilisations de celles-ci

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK3401400T3 (da) * 2012-05-25 2019-06-03 Univ California Fremgangsmåder og sammensætninger til rna-styret mål-dna-modifikation og til rna-styret transskriptionsmodulering
CN111206032A (zh) * 2013-12-12 2020-05-29 布罗德研究所有限公司 用于基因组编辑的crispr-cas***和组合物的递送、用途和治疗应用
FR3042506B1 (fr) 2015-10-16 2018-11-30 IFP Energies Nouvelles Outil genetique de transformation de bacteries clostridium
FR3081881B1 (fr) 2018-06-04 2024-05-24 Ifp Energies Now Outil genetique optimise pour modifier les bacteries du genre clostridium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
E CORNILLOT等: "The genes for butanol and acetone formation in Clostridium acetobutylicum ATCC 824 reside on a large plasmid whose loss leads to degeneration of the strain", 《J BACTERIOL》 *
FLORENT COLLAS等: "Simultaneous production of isopropanol, butanol, ethanol and 2,3-butanediol by Clostridium acetobutylicum ATCC 824 engineered strains", 《AMB EXPRESS》 *
XU, T等: "Efficient Genome Editing in Clostridium cellulolyticum via CRISPR-Cas9 Nickase", 《APPLIED AND ENVIRONMENTAL MICROBIOLOGY》 *
YI WANG等: "Markerless chromosomal gene deletion in Clostridium beijerinckii using CRISPR/Cas9 system", 《J BIOTECHNOL》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113614229A (zh) * 2018-12-20 2021-11-05 Ifp新能源公司 遗传修饰的梭菌属细菌、其制备和用途
CN114286857A (zh) * 2019-05-24 2022-04-05 Ifp新能源公司 用于修饰细菌的优化的遗传工具

Also Published As

Publication number Publication date
US20240002835A1 (en) 2024-01-04
FR3042506A1 (fr) 2017-04-21
DK3362559T3 (da) 2020-06-22
US11746346B2 (en) 2023-09-05
BR112018007622A2 (pt) 2018-10-30
EP3362559A1 (fr) 2018-08-22
EP3362559B1 (fr) 2020-05-06
WO2017064439A1 (fr) 2017-04-20
US20180305680A1 (en) 2018-10-25
KR20180081527A (ko) 2018-07-16
FR3042506B1 (fr) 2018-11-30
CA3001815A1 (fr) 2017-04-20

Similar Documents

Publication Publication Date Title
CN108431221A (zh) 用于转化梭状芽胞杆菌属细菌的遗传工具
RU2763170C2 (ru) Производство олигосахаридов человеческого молока в микроорганизмах-хозяевах с модифицированным импортом/экспортом
AU2021203937B2 (en) Compositions and methods for rapid and dynamic flux control using synthetic metabolic valves
AU2013344512B2 (en) Recombinant adenoviruses and use thereof
DK2087105T3 (da) Delta 17-desaturase og anvendelse heraf ved fremstilling af flerumættede fedtsyrer
CN110551713A (zh) 用于修饰梭状芽孢杆菌属细菌的优化的遗传工具
KR20140113997A (ko) 부탄올 생성을 위한 유전자 스위치
IL236992A (en) Genetically modified cyanobacteria that produce ethanol
KR20140092759A (ko) 숙주 세포 및 아이소부탄올의 제조 방법
KR20130027063A (ko) Fe-s 클러스터 요구성 단백질의 활성 향상
KR20120099509A (ko) 재조합 숙주 세포에서 육탄당 키나아제의 발현
BRPI0719748A2 (pt) Microrganismo modificados por engenharia para produzir n-butanol e métodos relacionados
CN101815432A (zh) 涉及编码核苷二磷酸激酶(ndk)多肽及其同源物的基因的用于修改植物根构造的方法
CN101827938A (zh) 涉及rt1基因、相关的构建体和方法的具有改变的根构造的植物
CN101802183A (zh) 高保真度限制性内切核酸酶
CN115927299A (zh) 增加双链rna产生的方法和组合物
CN114729387A (zh) 遗传修饰真菌和与其相关方法和用途
CN115698297A (zh) 多模块生物合成酶基因组合文库的制备方法
CN101868545B (zh) 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法
AU2017252409A1 (en) Compositions and methods for nucleic acid expression and protein secretion in bacteroides
CN101848931B (zh) 具有改变的根构造的植物、涉及编码exostosin家族多肽及其同源物的基因的相关的构建体和方法
CN113186140B (zh) 用于预防和/或治疗宿醉和肝病的基因工程细菌
CN101627109A (zh) 用于生产正丁醇的工程化改造的微生物及相关方法
CN113614229A (zh) 遗传修饰的梭菌属细菌、其制备和用途
KR20180038462A (ko) 재조합 세포, 재조합 세포의 제조 방법, 및 1,4-부탄디올의 생산 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination