CN106755100A - The genomes of controllability HIV 1 target editing system and its targeting carrier system - Google Patents
The genomes of controllability HIV 1 target editing system and its targeting carrier system Download PDFInfo
- Publication number
- CN106755100A CN106755100A CN201710089568.9A CN201710089568A CN106755100A CN 106755100 A CN106755100 A CN 106755100A CN 201710089568 A CN201710089568 A CN 201710089568A CN 106755100 A CN106755100 A CN 106755100A
- Authority
- CN
- China
- Prior art keywords
- plasmid
- sequence
- expression
- seq
- hiv
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/15011—Lentivirus, not HIV, e.g. FIV, SIV
- C12N2740/15041—Use of virus, viral particle or viral elements as a vector
- C12N2740/15043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/60—Vectors containing traps for, e.g. exons, promoters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/80—Vectors containing sites for inducing double-stranded breaks, e.g. meganuclease restriction sites
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2810/00—Vectors comprising a targeting moiety
- C12N2810/10—Vectors comprising a non-peptidic targeting moiety
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/40—Vector systems having a special element relevant for transcription being an insulator
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/48—Vector systems having a special element relevant for transcription regulating transport or export of RNA, e.g. RRE, PRE, WPRE, CTE
Landscapes
- Genetics & Genomics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Virology (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present invention relates to a kind of genomes of controllability HIV 1 targeting editing system, including the multiple gRNA expression plasmids of spCAS9 enzymes collocation;Expression of the pUC pUC to spCAS9 with the addition of control element;Targeting sequences of the multiple gRNA for the encoder block genome sequence design of the genomes of HIV 1 and/or CCR5.To improve its targeting, the liposome element auxiliary that above-mentioned targeting editor pUC pUC has installed multiple target spot additional is delivered to the positive cell of CD4, CCR5 and/or CXCR4;To reduce the Non-specific integration to other cells, the plasmid of the system only exists in HIV 1 and can be integrated when active;Moreover, it is achieved that the expression of the CAS9 controlled by TAT protein and R element could only start in the presence of the live viruses of HIV 1, expression is then closed after virus lays dormant or inactivation, reduce the potentially danger to target cell.
Description
Technical field
The invention belongs to genome editing technique field, and in particular to the genome editor of HIV-1 mediations and pDNA medicines
Targeted delivery system.
Background technology
Now with the progress of the targeting operating technology of gene, remove external source infection viral genome or be adapted from body
The dream of interior some deleterious genes is increasingly becoming reality, such as TALEN technologies and CRISPR technologies.The two respectively has advantage and disadvantage, than
As TALEN technology miss rates are very low, the DNA element for being limited to targeting is larger, beyond the capacity of majority carrier, thus is difficult to
Multiple target practice;And CRISPR technologies carry multiple gRNA only with a nucleic acid cleaving enzymatic CAS9, it is possible to achieve many target position are cut
Cut, but at present there is a shortcoming in CRISPR, that is, and effect of missing the target, the cutting to non-specific sites sequence can cause larger
Potential danger.SpCAS9 is bacterium (streptococcus pyogenes) albumen, and its long-term expression in human body cell will be made to body genome
Into the risk that the immunizing antigen inside certain threat and active cell is offered.Or needed when gene target medicine is made
To express in short-term, or realizing controlled expression, and the strategy expressed in short-term be realized, it is possible to use plasmid DNA vectors technology, in machine
Make circles expression in vivo, expression time is short in the cell for the plasmid of the method, the cutting effect of spCAS9 can be influenceed, to this
Need to improve plasmid RT in the cell.Although occurring in that some episome (episome) technologies have this energy at present
Power, but the episome for being used at present both from virus, it is necessary to express some virus proteins to maintain episomal duplication, but
These virus proteins have larger potential hazard to cell, such as, with certain carcinogenic potential, greatly limit it clinical
Use scope.There is the expression of stabilization for this integrated plasmid, if realizing that controllability is expressed, CAS9 cutting effects will be solved
Not enough shortcoming.But integrated plasmid generally requires specific enzyme to mediate, be typically transferred to using virus-mediated at present and
Integrate, using viral carrying DNA to intracellular, there is efficient stable, but there is also cell nonspecific infection, base
Lack because the antigenicity of a group random integration, virus is high, viral infection cell-specific is low, viral production is cumbersome, be difficult accumulating etc.
Point.Expression of the virus of random integration to neighbouring gene is interfered, and causing the expression of gene increases, is mutated or inactivates, and is facing
Certain risk is faced on bed, limitation threshold is of a relatively high.Such as adeno-associated virus AAV is imitated to the site-directed integration in AAVS1 sites
Rate is very low, and other are viral, such as slow virus, the characteristics of do not have site-directed integration substantially.
In view of the carrying shortcoming of above-mentioned pDNA medicines, targeting modification and cutting for some special genes, existing skill
Art cannot realize site-directed integration, then we can contemplate the medicine loading and integration for reducing non-specific cell, only will be special
Property integrate be confined to specific cells group, be so greatly lowered occur Non-specific integration cell context, reduce correlation
The danger of medicine.Such as, for the targeting CRISPR elements of AIDS virus HIV-1, if realized only in the thin of virus infection
Integrated in born of the same parents, then can just substantially reduce the harm to non-infected cell.Virus-mediated is selectively targeted, at present can be right
The capsid protein of virus makees the modification of targeting engineering, it is had affinity to some cell surface receptors, right so as to realize
The cell for expressing special receptor is integrated.But the capsid protein gp120 that HIV-1 is combined to CD4, CCR5 and CXCR4 is in patient
In be possible to that neutralizing antibody occurs, and the coated virion of capsid protein of HIV-1 itself is more fragile, is grasped intolerant to purifying
Make and preserve, it is limited to a certain extent is used for the virus-mediated DNA inputs of targeting and integrates.Therefore people employ
Capsid protein VSVG from bubble type Stomatovirus, not only increases stability and also enhances appeal and security, but by
This is lost to the specific of CD4 positive cells infection and greatly reduces the lymphoid stem cell positive to some CXCR4
Dip-dye ability, and can have certain dip-dye energy to above-mentioned T cell using the coated slow virus of measles virus MV capsid proteins
Power, but there is larger Immunoresistance to the viral capsid proteins in crowd, limit its extensive use.And suitable one
Part HIV-1 viruses can enter lymphatic system, infection CXCR4 positive T lymphoid progenitor cells, especially to the later stage bounce-back rank of infection
Section, it is hiding and constantly to blood supplement and releasing virus particle shelter it that these CXCR4 positive lymphocyte turns into virus
Ground, and traditional chemicals is also unable to do what one wishes to this.In this regard, the CRISPR targetings of the genome targeting cutting for HIV-1
Element, develops a kind of cell that can only integrate those HIV-1 positives and security carrier technique again high, will be with important
Application value.
In addition to the pDNA targeting inputs to specific cells monoid, also there is the expression of controllability further to increase its peace
Quan Xing.In this regard, people devise the expression control element of various controllabilitys or inductivity at present, such as induced using tetracycline
Expression system, estrogen inducible expression, interferon-induced expression system, moulting hormone inducible expression, Cumate are lured
Expression system, FK506/ rapamycins inducible expression and RU486 inducible expressions etc. are led, above-mentioned inducible system respectively has excellent
Shortcoming.Only biocompatibility is preferable, the less inducible system of toxic and side effect just has a Clinical practice value, such as, tetracycline is lured
Expression system is led, is induced using tetracycline and the like, the expression to gene can be just realized in relatively low concentration range
(Tet-ON) or (Tet-OFF) is closed, and the tetracycline of low concentration and the like is to the biocompatibility and peace of human body
Full property is higher, and the system has larger clinical practice potentiality.In addition, new, the safe inducible expression of exploitation also has
There is important application value, the inducible expression of TAT protein and its R control element such as HIV-1, to HIV-1 sun
Property cell gene target operation have certain application value.R element is to be present in viral genome LTR
(long terminal repeat, LTR) the preceding paragraph DNA sequence dna, the RNA of the sequential coding is exactly in its rna plymerase ii
Initiation complex faces position, can form a kind of special higher structure TAR, what the special RNA structures can be encoded with HIV-1
Albumen TAT is combined, and controls the translation of viral gene to extend, using the DNA control elements and TAT protein, it is possible to achieve to gene
Controlling expression, be applied to the gene expression of HIV-1 positive cells.
Because DNA or RNA etc. belongs to macromolecular substances, and the means of traditional small-molecule drug delivering then cannot be accurate
Efficiently be delivered to it is intracellular, if without virus-mediated channel genes, needing the new non-viral carrying technology of exploitation to use
In Intracellular delivery, this is accomplished by special carrying carrier and targeted molecular technology, and this is a key point of such medicine.
And the delivering of pDNA/RNA medicines is carried out using liposome nanometer technology or non-liposomal cation nanometer technology, presently, there are
The shortcoming that targeting efficiency is low, cell absorptivity is low, circles DNA/RNA half-life shorts, expression efficiency are low, especially cation
Nano particle technology, also has the shortcomings that liver renal toxicity higher, it is necessary to make technical further raising and optimization.But its
Have the advantages that safe, cost of manufacture relatively it is viral it is relatively low, beneficial to industrially scalable manufacture, accumulating can be freezed, because of this
A little medicines have the advantages that clinical threshold is low, room for improvement is big, easy large-scale commercial production with respect to viral vector.
It is main using virus, liposome and nanometer polymerization currently for the conveying of the mcroorganism molecule such as DNA, RNA and albumen
The means such as thing.The good characteristic that wherein good biocompatibility of liposome, useful load are big, toxicity is low, technology is more ripe, at present
Develop various liposomes targeting carrying methods, including temperature sensitive type, immunologic pattern, pH responsive types, PEG modification long circulating liposome,
Flexible lipidosome etc..The above-mentioned type liposome respectively has its advantage, and reality is increased using often several technical methods are used in combination
The good characteristic of stuffing plastid.The cell-targeting that the liposome delivery technique is applied to the macromoleculars such as the DNA of non-viral mediation is defeated
Send, with larger technology development space and application potential, be expected to be used for the targeting conveying of related drugs molecule.
The content of the invention
Regarding to the issue above, the present invention is intended to provide one kind can realize specific cutting, controlled expression, safely and effectively
Multiple gene targeting system, is that the gene therapy of the virus types such as AIDS infectious disease lays the foundation.
To achieve the above object, present invention firstly provides a kind of controllability HIV-1 genomes targeting editing system.
A kind of controllability HIV-1 genomes targeting editing system, including utilization high specific spCAS9 (spCAS9-HF1,
SpCAS9 containing N497A/R661A/Q695A/Q926A mutation) (Nature.2016Jan 28;529(7587):490-5.doi:
10.1038/nature16526.) or other spCAS9 editing enzymeses modified version expression plasmid, the control system can answer
For other editing enzymeses, such as cpf1 classes, others C2C2 classes, C2C1 classes, the editing enzymes of CASx and CASy classes, collocation gRNA expression
The combination of plasmid, to reduce potential Non-specific integration;It is potential non-specific to reduce using the gRNA technologies of 17-19nt
Property targeting cutting power (Nat Biotechnol.2014Mar;32(3):279-84.doi:10.1038/nbt.2808.).
The targeting editing system can integrator gene group, its integration to genome depends on the HIV-1 of cell infection
Live virus;
Expression of the targeting editing system to spCAS9/spCAS9-HF1 with the addition of control element, make the table of Cas9 enzymes
Up to or by HIV-1 promoter and TAT protein regulating and expressing, or by the strong promoter and R element of built-in mammal
The chimeric promoters of composition control it to express, or by built-in strong mammalian promoters and TetO/TetR systems constitute it is embedding
Promoter control table is closed to reach;
Expression of the targeting editing system to multiple gRNA uses the promoter of mammal;The multiple gRNA refers to
The encoder block genome sequence for being directed to HIV-1 genomes and/or CCR5 design targeting sequence.
SpCAS9 the or spCAS9-HF1 expression plasmids will start containing HIV-1 genomes controlling element, HIV-1 expression
The fragment of element, genome conformity element and spCAS9/spCAS9-HF1 expression cassettes is inserted into carrier;The Insert Fragment according to
It is secondary to contain 5 '-LTR, packaging signal Ψ, spCAS9/spCAS9-HF1 expression cassette, RRE, cPPT and 3 '-LTR elements;Described 5 '-
LTR elements have been sequentially connected FIV U3 elements, R element and U5 elements, the 3 '-LTR elements be sequentially connected FIV U3 elements,
R element and U5 elements;
Be inserted into fragment containing multiple gRNA expression cassettes in carrier by the multiple gRNA expression plasmids;The insertion piece
Section is the one kind in following two:One, Insert Fragment contains 5 '-LTR, packaging signal Ψ, RRE, cPPT, multiple gRNA tables successively
Up to frame and Sin3 ' LTR elements;Two, Insert Fragment contains ITR, multiple gRNA expression cassettes, ITR, REP expression cassette successively;It is described
5 '-LTR elements have been sequentially connected FIV U3 elements, R element and U5 elements;The multiple gRNA expression cassettes contain and are directed to respectively
The gRNA expression cassettes in the U3 areas, protease P ro code areas and REV protein-coding regions of HIV-1, each gRNA expression cassette is by difference
Promoter control respectively;Sin3 ' the LTR elements have been sequentially connected Δ U3 elements, R element and U5 elements, the Δ U3 units
Part is the U3 areas for losing transcripting starting ability.
Further, spCAS9/spCAS9-HF1 and gRNA is expressed respectively on two kinds of expression plasmids, or in same table
It is co-expressed on up to plasmid, the co-expression plasmid is to insert the fragment of spCAS9/spCAS9-HF1 and multiple gRNA expression cassettes
Between unique 5 '-LTR and 3 '-LTR on to same carrier;The U3 that the 5 '-LTR originate containing FIV, the 3 '-LTR contain
The U3 or Δ U3 in FIV sources.
Further, in spCAS9/spCAS9-HF1 expression plasmids and/or gRNA expression plasmids or co-expression plasmid also
One or several in elements below can be added:RRE, cPPT, wPRE element;Eukaryotic promoter can with R element composition
Regulation and control chimeric promoters, the controllable chimeric promoters of eukaryotic promoter and TetO sequences composition, TetR expression cassettes, S/MAR units
Part and insulator Ins elements.
Further, the eukaryotic promoter includes that EF1a promoters, actin promoters, Ubc promoters, PGK start
Son.
Further, in the sequence containing Seq ID No.1-5 in ordered list in the multiple gRNA expression vectors
At least 2 kinds.
Further, the spCAS9 expression plasmids include following plasmid:p1、p7;The gRNA expression plasmids are included such as
Lower plasmid:p5;The spCAS9, multiple gRNA co-expression plasmids include following plasmid:P6-1, p6-2 and p6-3, described p6-
Seq ID No.21 in 35 heavy gRNA coded sequences such as sequence table;Seq ID No.7 institutes in the sequence of the p1 such as sequence table
Show, the sequence of p5 as shown in Seq ID No.20 in sequence table, the sequence of p6-1 as shown in Seq ID No.8 in sequence table, p7
Sequence as shown in Seq ID No.9 in sequence table.
Above-mentioned targeting editing system can be infected with liposome, cationic nano-grain or lentivirus mediated to HIV-1
Cell in delivered, the delivering of wherein lentivirus mediated can be the slow virus of HIV-1 integrase inactivation types, such as Yin Jiyin
The HIV-1 integrases that the site such as missing or D64/D116/E152 occurs point mutation and inactivates, it is to avoid non-specific gene group is whole
Close, only to the genome conformity containing HIV-1 live virus cells.
Thus, present invention also offers a kind of targeting carrier system for conveying above-mentioned targeting editing system, the load
Fortune system is liposome or slow virus.
The slow virus is that the targeting editing system is arranged in pairs or groups with the second generation or third generation slow virus packaging system,
Carry out slow virus packaging;The integrase inactivation of the slow virus system is lacked, and its integration to genome depends on cell
The HIV-1 live viruses for being infected;Described packaging virus capsid protein is VSVG, or is measles virus (Measles
Virus) the capsid protein in source, or be sindbis alphavirus (Sindbis Virus) and the capsid of other viral sources
Albumen.The targeting editing system of the lentivirus mediated can be used to prevent the genome conformity of the cell to being uninfected by HIV-1, and energy
Enough integration that genome is carried out under the auxiliary of HIV-1 infection viruses.
A kind of liposome for conveying above-mentioned targeting editing system, is mainly prepared from the following components:
DSPC (DSPC):Cholesterol:Polypeptide-PEG- phosphatide mass ratio=20-70:10-30:
0.0001-2;
Or, DPPC (DPPC):DSPC (DSPC):1,2- dioleoyl phosphorus
Phosphatidylcholine (DOPC):Cholesterol:Polypeptide-PEG- phospholipid molar ratio=5:0.5-2:0.5-3:0.5-2:0.0001-0.5.
Further, the polypeptide includes anti-CD4 polypeptides (being named as aCD4), anti-CCR5 polypeptides (being named as aCCR5), resists
One or more combination in CXCR4 polypeptides (being named as aCXCR4).Due to HIV-1 viruses be generally divided into CCR5 preferendums and
CXCR4 preferendums, wherein infection late viral often CXCR4 preferendums occurs after morphing, cause virus to appear in the CXCR4 positives
Cell, such as T lymphocyte stem cells, myeloid cell and macrophage, therefore it is directed to the targeting of the positive cells of HIV-1 in design
, it is necessary to consider targeting of the design for CXCR4 when target.Therefore the targeting point for CD4, CCR5 and CXCR4 is devised
Son.
Further, anti-CD4 polypeptides are one section of amino acid sequence of the variable region from antiCD4 mAb ST40, by excellent
Change transformation to obtain, sequence is Seq ID No.10 in sequence table:ARRPKFYRAPYVKNHPNVWGPWVAYGP.
Further, anti-CCR5 polypeptides come from gp120V2 areas, can combine CCR5, and sequence is:ASTTTNYT.
Further, anti-CXCR4 small peptides change structure from T140, and containing alpha-non-natural amino acid modification, sequence is Seq in sequence table
ID No.11:H-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-NH2, wherein
2 cysteines carry out cyclisation treatment in building-up process.
Further, polypeptides in combination mode is that the series connection of two or more polypeptides is synthesized into a double target spot polypeptides or many targets
Point polypeptide;Combination including aCD4 and aCCR5:aCD4_CCR5、aCCR5_CD4;The combination of aCD4 and aCXCR4:aCD4_
CXCR4、aCXCR4_CD4;The combination of aCCR5 and aCXCR4:aCCR5_CXCR4、aCXCR4_CCR5;
The sequence of the aCD4_CCR5 is Seq ID No.12 in sequence table:H-
ARRPKFYRAPYVKNHPNVWGPWVAYGPSASTTTNYT-NH2, wherein its amino can be put into left-hand end or right-hand end,
The sequence of the aCCR5_CD4 is Seq ID No.13 in sequence table:H-
ASTTTNYTSARRPKFYRAPYVKNHPNVWGPWVAYGP-NH2;
The sequence of the aCD4_CXCR4 is Seq ID No.16 in sequence table:Ala-Arg-Arg-Pro-Lys-Phe-
Tyr-Arg-Ala-Pro-Tyr-Val-Lys-Asn-His-Pro-Asn-Val-Trp-Gly-Pro-Trp-Val-Ala-Tyr-
Gly-Pro-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-NH2,
The sequence of the aCXCR4_CD4 is Seq ID No.17 in sequence table:H-Arg-Arg-Nal-Cys-Tyr-Cit-
Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-Ala-Arg-Arg-Pro-Lys-Phe-Tyr-Arg-Ala-Pro-
Tyr-Val-Lys-Asn-His-Pro-Asn-Val-Trp-Gly-Pro-Trp-Val-Ala-Tyr-Gly-Pro-NH2,
The aCCR5_CXCR4 polypeptide sequences are Seq ID No.14 in sequence table:H-Arg-Arg-Nal-Cys-Tyr-
Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-Ser-Ala-Ser-Thr-Thr-Thr-Asn-Tyr-Thr-
NH2,
The sequence of the aCXCR4_CCR5 is Seq ID No.15 in sequence table:Ala-Ser-Thr-Thr-Thr-Asn-
Tyr-Thr-Ser-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-NH2;
The polypeptide aCD4, aCCR5, aCXCR4, aCD4_CCR5, aCCR5_CD4, aCD4_CXCR4, aCXCR4_CD4,
ACCR5_CXCR4, aCXCR4_CCR5 and PEG-PE conjugate may be used alone or in combination when liposome is made and uses;It is preferred that
It is applied in combination, to target more HIV-1 infected cells.
The polypeptide aCD4, aCCR5, aCXCR4, aCD4_CCR5, aCCR5_CD4, aCD4_CXCR4, aCXCR4_CD4,
ACCR5_CXCR4, aCXCR4_CCR5 are using artificial synthesized.
Further, polypeptide is recombinant polypeptide, and expressed sequence inserted into expression vector, and expressed in Host Strains, extracted,
Purifying.
Further, the recombinant polypeptide includes:Anti- CD4 recombinates scFV recombinant polypeptides (abbreviation aCD4scFV), its nucleic acid
Coding expressed sequence be sequence table in Seq ID No.18, its encoding amino acid sequence be sequence table in Seq ID No.21 (contain
E.coli Pel B guide peptide).
Further, aCCR5 expressed sequences are added in the upstream of aCD4scFV, obtains aCCR5_CD4scFV fusions many
Peptide, the nucleic acid of the coding expressed sequence is Seq ID No.19 in sequence table, and amino acid sequence is Seq ID in sequence table
No.22.The sequence can also add (His) in C-terminal6Label.
Further, the coupling that polypeptide can be between PEG molecules is using the one kind in following methods:(1) carboxyl-ammonia is utilized
Condensation reaction between base, the PE-PEG-COOH molecules that will contain are in catalyst n-HOSu NHS and carbodiimide
(EDC) under catalytic reaction and containing-NH2The polypeptide condensation reaction of group, forms amido link;(2) processed using maleimide
Polypeptide and PE-PEG-SH react, or with the maleimide-PEG-PE with maleimide base group and carry in turn
The polypeptide of sulfydryl modification carries out the conjugate that Michael addition reaction forms sulfide based structural;(3) using band p-nitrophenyl carbonic ester
The PE-PEG-pNP of base (p-nitrophenylcarbonyl, pNP) reacts in the basic conditions with the amino of polypeptide, reacts shape
Into the mephenesin Carbamate key conjugate of stabilization;Above-mentioned active group is on PEG molecules, therefore polypeptide is directly coupled at PEG points
On son.
Further, selected from one or more in phosphatidyl-ethanolamine (PE) or its derivative, described spreads out phosphatide
Biology includes DSPE (DSPE), DOPE (DOPE) and two palmitic acid phosphatidyl second
Hydramine (DPPE).
Further, the molecular weight of the PEG (polyethylene glycol) is 200-10000, preferably 200-3000.Use PEG pairs
Surface of liposome carries out modification and can improve the half-life period of liposome, increases the stability of epicyte protein liposome, reduces and adjusts
Reason is acted on, and reduces the phagocytosis of reticuloendothelial system.
Further, penetrating peptide can be also coupled on PEG, be formed " penetrating peptide-PEG- phosphatide ", with target spot polypeptide-PEG-PE
Collocation is used;Penetrating peptide-PEG- phosphatide is 0.0001-2 with the mol ratio of other polypeptides-PEG- phosphatide:1.The penetrating peptide contains
There are the 6-12 amino acid residue of positively charged, such as sequence:H-TyrGlyArgLysLysArgArgGlnArgArgArg-NH2, with
Above-mentioned target spot polypeptide-PEG- phosphatide collocation is used.Penetrating peptide increased the fusion faculty of liposome and cell.
Further, also add the phosphatide of hyaluronic acid in liposome components, the phosphatide of the hyaluronic acid and its
The mol ratio of his polypeptide-PEG- phosphatide is 0.00001-1:1, the phosphatide of the hyaluronic acid is by maleic acid base group modification
Phosphatide (Maleimide- phosphatide) or maleimide-PEG- phosphatide react and to be formed " thoroughly with the hyaluronic acid with sulfydryl modification
Bright matter acid-phosphatide " or " hyaluronic acid-PEG- phosphatide ", for above-mentioned target spot polypeptide-PEG- phosphatide and penetrating peptide-PEG- phosphorus
Fat collocation is used.The addition of hyaluronic acid-phosphatide or hyaluronic acid-PEG- phosphatide can reduce the immunogenicity of liposome and improve
Biocompatibility.
Further, above-mentioned functions polypeptide, can also directly with the coupling of the liposome molecule covalent such as PE, for directly whole
It is incorporated into liposome membrane molecular layer.
Further, can be anti-using film dispersion method, reverse evaporation, gradient using above-mentioned liposome components embedding plasmid
To stowage and double emulsion etc..
Further preferred film dispersion method, comprises the following steps that:
(1) by DSPC:Cholesterol:Polypeptide-PEG2000-DSPE 20-70 in mass ratio:10-30:1 is made uniform mixing
Liquid, or by DPPC, DSPC, DOPC, cholesterol, polypeptide-PEG2000-DSPE in molar ratio 5:1:2:1:0.0001-0.5 is made
Uniform mixed liquor;
(2) chloroform is prepared:Methanol solution, chloroform is 1-3 with the volume ratio of methyl alcohol:1;
(3) by one of above two mixed liquor and the 5-500 times of chloroform of volume:Methyl alcohol (1-2:1) solution is mixed, in rotation
Evaporation solvent in evaporator, forms thin layer;
(4) by above-mentioned dry thin layer rehydration and plasmid is added, 5-50mM HEPES or Tris-Cl is contained wherein in plasmid
(pH7.5-8.5), the concentration of plasmid is in 200ng/ μ l-8000ng/ μ l;
(5) 40-65 DEG C of heating is vortexed and processes 2-6 minutes, repeats 2-5 times, obtains embedding the liposome film bubble of plasmid.
Further, after the completion of film dispersion method prepares liposome membrane bubble, the liposome for obtaining is again by polymeric membrane mistake
The methods such as filter, extruding, remove the liposome of greater particle size, obtain the smaller liposome of particle diameter.The specific method of extruding is:By fat
Plastid squeezes through the cellulose acetate sheets or polycarbonate membrane in 200nm apertures at 40-65 DEG C, is repeated 5-20 times, with
The aperture for reducing liposome and the uniform particle sizes' degree for improving liposome, the unilamellar liposome film bubble for being obtained are dialysed through semi-permeable membrane
Overnight purify, or purified by SephadexG-50 posts or similar gels post, to remove free non-integral protein and miscellaneous
Matter.Compare the making of liposome, in addition to without cell membrane extract, other steps, technique and content all same.
Further, DSPC:Cholesterol:The mass ratio of polypeptide-PEG2000-DSPE is 50:15:1;DPPC、DSPC、
DOPC, cholesterol, the mol ratio of polypeptide-PEG2000-DSPE are 5:1:2:1:0.02.
Further, also fled from and carefully with improving the endocytosis body of plasmid containing auxiliary ingredients in the plasmid of the step (4)
Karyon conveying capacity, the auxiliary ingredients are polyethyleneimine, protamine sulfate, positively charged amino acid polypeptide or cell
Promote one or more in fusogenic peptide, wherein promote fusogenic peptide be divided into viral source, bacterial origin and it is artificial change structure source, be divided into pH
The polypeptide of sensitive and pH non-sensitive types, the preferably pH sensitive rush fused polypeptide of the present invention.
Beneficial effects of the present invention are:The present invention will coding spCAS9 (or high specific spCAS9-HF1), multiple gRNA
Deliver into positive intracellular of CD4, CCR5 and/or CXCR4, rely on the integrase of wild HIV-1, protease, reverse transcriptase,
REV and TAT etc. carries out genome conformity and the expression of target cell, it is to avoid the non-selectable gene group of lentivirus mediated is integrated and not
Controllability is expressed, and the especially expression of spCAS9 enzymes has obtained controllable.Its control system is divided into R element-TAT protein control system
Or Tet-ON control systems, the spCAS9 of controllability avoids the potentially danger to target cell, wherein R element-TAT protein control
The spCAS9 of system just lowers or closes the advantage of expression automatically after HIV genomes inactivation.These plasmids can lose with integrase
Plasmid collocation living, packs integrase inactivation type slow virus, for the gene delivery of lentivirus mediated.Invention also uses many
Weight gRNA technologies, Mutiple Targets cut the key gene group sequence of HIV-1, greatly reduce HIV-1 genomes because single target spot is mutated
And produce the possibility of resistance.Designed multiple gRNA have the genome height phase with various domestic common HIV-1 strains
Like property, the potential ability with broad spectrum activity.For the Mutiple Targets target polypeptide and Pegylation of CD4, CCR5 and CXCR4
(PEG) coupling of-PE, strengthens to X4 and the targeting of R5 biting property HIV-1 virus infected cells, beneficial to follow-up frozen dried,
Reduce immunogenicity.The integration collocation of the PE of the polypeptide-Pegylation of above-mentioned target spot targeting and ratio, do not influence and cell membrane
Fusion, increase selectively targeted effect.The PE for adding the hyaluronic acid of proper proportion is conducive to improving the biology of liposome
The immunogenicity of compatibility and reduction liposome.
Brief description of the drawings
Fig. 1:G1 sequences and 4 kinds of sequence alignments of HIV-1 strains;
Fig. 2:G2 sequences and 4 kinds of sequence alignments of HIV-1 strains;
Fig. 3:G3 reverse complementary sequences and 4 kinds of sequence alignments of HIV-1 strains;
Fig. 4:G4 reverse complementary sequences and 4 kinds of sequence alignments of HIV-1 strains;
Fig. 5:The composition schematic diagram of plasmid p1-p5 Insert Fragments;
Fig. 6:The plasmid map of plasmid p1;
Fig. 7:The plasmid map of plasmid p5;
Fig. 8:The composition schematic diagram of plasmid p6-1, p6-2 and p6-3 Insert Fragment;
Fig. 9:The plasmid map of plasmid p6-1;
Figure 10:The composition schematic diagram of plasmid p7-p12 Insert Fragments;
Figure 11:The plasmid map of plasmid p7;
Figure 12:The plasmid map of plasmid p9;
Figure 13:The composition schematic diagram of plasmid p13, p14 Insert Fragment;
Figure 14:The plasmid map of plasmid p14;
Figure 15:The composition schematic diagram of plasmid p15 Insert Fragments;
Figure 16:The composition schematic diagram of plasmid p16, p17 Insert Fragment;
Figure 17:Uciferase activity after the cationic liposomal transfection of embodiment 6;
Figure 18:Uciferase activity after the slow-virus transfection of embodiment 6;
Figure 19:Uciferase activity after the cationic liposomal transfection of embodiment 7;
Figure 20:Uciferase activity after the slow-virus transfection of embodiment 7;
Figure 21:The uciferase activity of embodiment 9;
Figure 22:The uciferase activity of embodiment 10;
Figure 23:The uciferase activity of embodiment 12;
Figure 24:The uciferase activity of embodiment 13;
Figure 25:The genome conformity efficiency of embodiment 14;
Figure 26:The virus-mediated genome conformity efficiency of the integrase Inactivating mutations packaging system of embodiment 15 packaging.
Specific embodiment
With reference to specific embodiment, the present invention will be further described in detail, but the present invention is not limited to following implementation
Example.In the present invention unless otherwise noted, the commercially available acquisition of raw material, method is the conventional method of this area.
Following instrument, material and method have been applied in embodiments of the invention:
1st, instrument plasmid pLVX_CD4, pLVX_CCR5 and pLVX_CXCR4:For effective analysis of liposomes targeting element
Targeting validity, constructs plasmid pLVX_CD4, pLVX_CCR5 and pLVX_CXCR4.Clone CD4 (GenBank:NM_
000616)、CCR5(GenBank:) and CXCR4 (GenBank NM_000579:NM_001008540 cDNA sequence), inserts respectively
Enter pLVX-acGFPN1 carriers (TAKARA, precious biological), and containing terminator codon, to eliminate the expression of acGFP, only maintain insertion
The expression of gene.It is steady to turn or transiently transfect Hela cells, Hela cells is obtained the expression of associated receptor, it is easy to detection related
Acceptor promotes ability to the targeted fusion of its target liposomes.
2nd, instrument plasmid pLVX_Luc, pTAT, pCCR5_P2A_Luc:Due to pLVX-acGFPN1 carriers contain it is complete
The 5 ' of HIV-1 and 3 ' LTR elements, with the potential ability sheared by target practice plasmid, the luciferase of built-in expression can be used for
Inspection shear effect.In order to detect the control action of TAT gene pairs R elements, luciferase is inserted pLVX- by us
In acGFPN1 carriers, containing terminator codon, terminate the expression of acGFP, structure obtains pLVX_Luc plasmids.By the outer of TAT protein
After aobvious subsequence splicing, in insertion pcDNA3.1 plasmids, in the control of CMV promoter and bovine growth hormone gene termination signal sequence
The lower expression of system, for corotation, structure obtains plasmid pTAT.In order to detect that CCR5gRNA targets the shear effect of element, our structures
PCCR5_P2A_Luc is built, the plasmid is built-up on the basis of pcDNA3.1+ carriers, by the CCR5cDNA encoder blocks of people
Downstream fusion P2A autotomys peptide sequence and luciferase, and three forms a reading frame, occurs after CCR5 above is sheared
There is frameshift mutation in frameshift mutation, P2A and Luc below, so as to lose the activity of luciferase.
3rd, the cell platform construction of target practice efficiency is detected:In order to simulated infection HIV-1 lymphocyte and be easy to quantify point
Analysis, building plasmid p16 and p17 is used to simulate HIV-1 full-length genomes.HIV-1 genomes are inserted into Seq ID in sequence table
On the plasmid backbone of No.6, and by encode gp120 most of DNA sequence dna be replaced into luciferase sequence (plasmid p16) or
EGFP sequences (plasmid p17), such as Figure 16.Retain whole gp41 sequences in p16 and p17, it is therefore an objective to retain the RRE for overlapping
Sequence, the design can eliminate the capacity packing of HIV-1 viruses, eliminate its packaging and appeal, it is ensured that the safety of experimental implementation
Property.And marker genes encoding luciferase luciferase (being abbreviated as Luc) and green fluorescent protein eGFP of insertion can be with
The quantitative of gene targeting efficiency, qualitative detection are provided.P16 or p17 plasmids are made into slow virus, Hela cells are infected, base is obtained
Because of group stable expression cell an integrated clone, for the detection cell platform of target practice efficiency.
The multiple steady structures for turning cell of p16-CD4-CCR5/p17-CD4-CCR5:Slow disease of the packaging containing CD4 sequence plasmids
The slow virus of poison, packaging containing CCR5 sequence plasmids is contaminated p16/p17 and surely turns Hela cells and obtain.Further sense on this basis
The slow virus of dye packaging CXCR4, can obtain p16-CD4-CCR5-CXCR4/p17-CD4-CCR5-CXCR4 stable cell strains.
The puromycin of 0.5-1 μ g/ml is maintained in cell culture fluid, cell total rna, reverse transcription, RT-PCR identifications CD4, CCR5 is extracted
With the expression of CXCR4;The Hela cells of non-infection virus are used to identify positive expression cell line as control.It is steady to turn cell choosing
Select the standard of cell clone:Vitro growth rates, the vigor of cell and cellular morphology are close with normal Hela, amplify culture and are formed
One Cell subline of the stabilization above-mentioned factor of expression.
4th, the packaging of slow virus makes:Plasmid (p1-p17) and the second generation (three pUC pUCs) constructed by the present invention or
The third generation (four pUC pUCs) slow virus packaging system cotransfection 293T cells, slow virus packaging step routinely is carried out, and is turned
Transfection reagent is lipofectamine 2000, and the culture supernatant containing virus, warp are collected in transfection respectively after 48 hours and 72 hours
Cross Purification by filtration, determine the potency of virus, packing freeze -80 DEG C it is standby.
5th, cell culture:Hela cells use the culture of DMEM high glucose mediums (containing dual anti-), the 5- of addition activated carbon treatment
10% high-quality business hyclone, 37 DEG C, 5%CO2Lower culture.The cracking of cell, the measure of luciferase are according to Promega
The explanation of the Luciferase Assay Reagent box of company is operated.Cell transfecting is divided into cationic liposomal transfection method and self-control fat
Plasmids method, cationic liposomal transfection uses lipofectamine 2000, and operating procedure is carried out by reagent specification;From
The transfection ratio of liposome processed, general 48 orifice plate uses 0.5-2 μ l liposomes/hole.
6th, cationic-liposome:Lipofectamine2000 (Life Technology, article No. 11668-019).
7th, Luciferase Assay Reagent box (Promega, article No. E1910).
Design of the embodiment 1 for AIDS virus HIV-1 genomes and the gRNA sequences of CCR5
The design standard of sequence is as follows:Selection 19-28 nt of length, with the various AIDS strains in China's Mainland prevalence
With homology higher, the similarity with human genome is low, at least 4 base differences of more than nt, prevents non-specific
Property cutting generation.Selection in addition is directed to the crucial DNA element of HIV-1, including LTR areas, Pol code areas, Rev protein-coding regions
Deng position.Wherein there is the sequences such as integration and the rna plymerase ii promoter of important genome in LTR areas, containing a U3 area,
RQu He U5 areas, because integrated plasmid needs the presence of RQu He U5 areas, therefore when gRNA is designed, focal selection U3 areas.In
It is to devise 2 difference gRNA for HIV-1U3 areas, sequence is respectively:In sequence table in Seq ID No.2 and sequence table
Seq ID No.3.A gRNA is also devised otherwise for Rev albumen, sequence is Seq ID No.1 in sequence table.In addition also
The gRNA of the protease P ro for Pol code areas is devised, sequence is Seq ID No.4 in sequence table, and the sequence is located at should
Pol albumen upstream, once it is mutated and frameshit occurs, it will cause Pro, Pol and the expression cassette of integrase that frameshift mutation occurs.
We devise 4 kinds of gRNA (Seq ID No.1-4 in sequence table) altogether, carry out Mutiple Targets for the genome to HIV-1 and beat
Target is sheared.We are named as a gRNA targeting sequence is devised near the 186th amino acid codes of CCR5 in addition
G5, for targetting the CCR5 genes in editor inactivation target cell, blocking HIV-1 viruses continue the infection and invasion to cell.
Fig. 1, Fig. 2, Fig. 3 and Fig. 4 are respectively the different sub- of 4 kinds of common AIDS virus strains of 4 gRNA sequences and China
The genome sequence of strain is compared, wherein bThai, b ' Thai, crf01, crf07 and crf08 be the popular AIDS in 5 kinds of China's Mainlands
Virus stain, the character after strain name is clone's number of strain, it was demonstrated that related gRNA sequences are that various country are common
Conserved sequence common to AIDS virus, therefore with the potential ability of the targeting various HIV-1 virus stains genomes of cutting.
Seq ID No.1 in sequence table:G1,5 '-GGAATCGAAGAAGGAGG-3 '
Seq ID No.2 in sequence table:G2,5 '-AAACTACACACCAGGGCC-3 '
Seq ID No.3 in sequence table:G3,5 '-GATAGACCCATAAATCA-3 '
Seq ID No.4 in sequence table:G4,5 '-GCCAAAGAGTGATTTGA-3 '
Seq ID No.5 in sequence table:G5,5 '-GAATTGATACTGACTGTA-3 '
(intracellular HIV-1 virus-mediated plasmid is whole for the double-plasmid expression system of the use of embodiment 2 p1, p2, p3, p4, p5
Close and the mediation of TAT protein/R element the isogenic expression plasmid systems of spCAS9) structure
Plasmid backbone is built first, and its sequence is shown in Seq ID No.6 in sequence table.The plasmid backbone is built upon changing structure
On pUC19 plasmid basics, the MCS of structure pUC19 is changed again, remain original Ori replication orgins and penicillin
Resistant gene.All spCAS9 expression plasmids and multiple gRNA expression plasmids are set on the basis of the plasmid backbone in the present invention
Meter.
The MCS of Seq ID No.6 is in plasmid backbone sequence table:
The design of spCAS9 or spCAS9-HF1 expression plasmids p1, p2 and p3:The pUC pUC is used for genes within cells group
Integration, it is necessary to host cell in exist activity HIV-1 virus, virus mediation under, genome conformity is carried out, in virus
TAT protein and the R element collective effect of its own under, start spCAS9 or spCAS9-HF1 expression.In order to prevent for U3
The gRNA in area is cut to the U3 areas of itself, and U3 areas are replaced with the U3 areas of feline immunodeficiency virus FIV.HIV-1 is also carried in addition
The original U5 sequences of virus and packaging signal Ψ, by REV response elements RRE (234nt), cPPT (central polypurine
Tract) (124nt) is inserted into the downstream of spCAS9 encoder blocks, obtains plasmid p1.Can also be to increase WPRE elements in plasmid
(Woodchuck hepatitis virus post-transcriptional regulatory element, WPRE), is used for
Strengthen the expression of gene, enter core and core output, such as plasmid p3.In order to improve the time that the plasmid before integration exists in the cell,
Can also be to S/MAR sequences (coming from the genes of people IFN-beta 1) be added in plasmid, to improve its half-life period in the cell, such as
Plasmid p2, p3.Plasmid p1, p2 and p3 be coding spCAS9 albumen can integration vector, vector insert composition schematic diagram
As shown in figure 5, wherein the sequence of p1 is Seq ID No.7 in sequence table, plasmid map is shown in Fig. 6.
The building process of plasmid p1, p2 and p3 is as follows:Synthesis FIV5 ' Duan U3 areas, then carry out weight with HIV-15 ' ends U3
Folded PCR amplifications, amplification Chong Die with the U3 downstreams of HIV-1 obtains chimeric 5 '-LTR U3 sequences.Will chimeric 5 '-LTR U3 and R-
U5- Ψ, RRE fragment mix, and whole FIV is fitted together to 5 '-LTR-R-U5- by the primer with two ends respectively with EcoRI, BamHI site
Ψ amplifications turn into a fragment, are inserted into EcoRI, BamHI site of plasmid backbone.Then PCR amplifications two ends carry NLS signals
SpCAS9-HF1, insert plasmid backbone AgeI and KpnI sites.Over-lap PCR expand RRE and cPPT sequences, insertion KpnI and
XbaI sites;3 '-LTR sequences are identical with 5 '-LTR sequences, then by sequence insertion XbaI and PacI sites, obtain plasmid
p1.Plasmid p2 is obtained in the 3 '-LTR downstreams insertion S/MAR sequences of p1 plasmids.Inserted between the cPPT and 3 '-LTR of p1 plasmids
WPRE regulating and controlling sequences obtain plasmid p3.The sequence of above-mentioned plasmid and following each plasmids can also all using artificial synthesized
Method builds.
The design of multiple gRNA expression plasmids p4 and p5:The pUC pUC is also that the HIV-1 by infecting helps be incorporated into place
In key-gene group.The plasmid devises internal promoter, therefore we remove the U3 areas in 3 '-LTR areas, (are lost labeled as Δ U3
Go the autonomous ability for starting transcription), related 3 '-LTR sequence marks for Sin3 ' LTR, 5 '-LTR still using FIV-U3 and
The R-U5 sequences of HIV itself.RRE, cPPT with the addition of wPRE and increase using original sequence of virus HIV-1 in multiple gRNA areas
Strong element, but in order to prevent the reinforcing element to inserting the interference of point gene, increased insulator sequence Ins (from chicken β-
Globin locus HS4 genes), upstream and downstream has been made the insulation processing of correlation.HU6 (the U6 genes from people) and hH1
(the H1 genes from people) promoter is respectively started the expression of g1 and g2, and mU6 (the U6 genes from mouse) and h7SK (come from people
SnRNA 7SK genes promoter) promoter is respectively started the expression of g3 and g4.Other hH1-g2 and h7SK-g4 makees respectively
Reverse insertion, prevent that expression cassette from causing on same positive-sense strand or antisense strand is interfering with each other.Related plasmids composition is illustrated
P4 and p5 in figure such as Fig. 5, wherein p5 are the p4 versions for increasing S/MAR sequences.Fig. 7 is the plasmid map of plasmid p5.
The building process of plasmid p4 and p5 is as follows:By what is be close to after FIV U3-R-U5 sequences and the-LTR of HIV genomes 5 '
Between 1517bp sequences insertion EcoRI and XhoI sites, by between 225bp insulator sequences insertion XhoI and BamHI sites, institute
The plasmid for obtaining between hU6-g1-hH1-g2 insertion BamHI and AgeI sites, obtains plasmid process by BamHI and AgeI digestions
AgeI and KpnI digestions, are connected into mU6-g3 and h7SK-g4, by between wPRE sequences and ins sequences insertion KpnI and XbaI sites,
Sin3 '-LTR are then inserted between XbaI and SpeI sites, identify direction of insertion, choose positive insertion plasmid, are built and are obtained p4 matter
Grain.The genome of PCR methods amplification human archeocyte obtains S/MAR sequences, inserts between PacI the and SpeI sites of p4 plasmids, builds
Obtain p5 plasmids.
When genome editor is applied to, the plasmid that will encode spCAS9 (or spCAS9-HF1) and coding gRNA is taken two-by-two
With using, such as one or more in p1, p2 or p3, with plasmid p4 or/and p5 matched combined.
The structure of the plasmid p6 expression systems of embodiment 3 (spCAS9 or spCAS9-HF1, gRNA are co-expressed in same plasmid)
Build
The design of p6 pUC pUCs:The pUC pUC is in same plasmid by spCAS9 and multiple gRNA co expressions
On, by intracellular active HIV-1 mediated integrations genome.Wherein spCas9 changes and is started by eukaryotic promoters such as EF1a, wherein
Eukaryotic promoter can also be from the isogenic strong promoter of actin, Ubc, PGK, the growth hormone gene of user plus
Tail signal.The R element of HIV is added as expression control element, after HIV is inactivated, the iris action of R element will make spCAS9
Expression minimize level be even switched off expression.In order to prevent the expression from viral LTR from disturbing, employ Sin3 ' LTR and set
Meter.For Enhanced expressing, wPRE elements are with the addition of, to prevent wPRE elements after integrating from, to the conversion risk of body cell, adding
Come from chicken HS4 gene insulator sequence Ins, such as p6-1 plasmids.Simple substance grain sets in the design and implementation example 2 in multiple gRNA areas
Meter is identical.In addition, in order to reduce because the long influence caused to viral packaging efficiency of Insert Fragment, Ins and wPRE elements can
Removal, such as plasmid p6-2;Multiple gRNA can also extend and increase, such as increase the gRNA for CCR5 genes:G5, is placed in silk floss
Under sheep 7SK gene promoters sh7SK, such as plasmid p6-3, the expressed sequence such as Seq ID No.21 of 5 weight gRNA.P6 inserts piece
The schematic diagram of section is shown in Fig. 8.
The structure of plasmid p6-1:The structure of 5 ' FIV-U3-LTR, Ψ, RRE and cPPT is identical with p1, SA (splice
Acceptor) sequence is removed, and inserts EcoRI the and XhoI sites of plasmid backbone.CHS4 insulator sequences PCR introduce XhoI and
BamHI sites, insert XhoI the and BamHI sites of plasmid backbone after digestion.Over-lap PCR expands EF1a-R sequences, inserts BamHI
With AgeI restriction enzyme sites.High-fidelity PCR amplification spCAS9-HF1, inserts AgeI the and KpnI restriction enzyme sites of plasmid backbone.hU6-
G1, hH1-g2 obtain fragment by the method for synthesizing, and insert KpnI the and XbaI sites of plasmid backbone.Synthesis mU6-g3 and
H7SK-g4 fragments, insert XbaI the and SpeI sites of plasmid backbone.Fusion DNA vaccine expands wPRE-Ins-Sin3 ' LTR, inserts matter
Between SpeI the and PacI sites of grain skeleton.The sequence of p6-1 is Seq ID No.8 in sequence table, and plasmid map is shown in Fig. 9.P6-2
It is then to eliminate 2 Ins and wPRE sequences, p6-3 plasmids are then that a sh7SK-g5 is increased on the basis of p6-2, this 5 kinds
The expressed sequence of gRNA such as Seq ID No.21.
When genome editor is applied to, due to p6 pUC pUCs can express simultaneously spCAS9 (or spCAS9-HF1) and
GRNA, therefore can be used alone.
Embodiment 4 uses double-plasmid expression system (HIV-1 mediated integrations, the Tet-ON of p7, p8, p9, p10, p11, p12
Control table reach double-plasmid expression system) structure
The design of expression plasmid p7, p8, p9, p10, p11, p12:The pUC pUC also uses double-mass model pattern and HIV-
The genome conformity of 1 mediation, but spCAS9 expression plasmids employ Tet-on inducible systems and carry out spCAS9 or spCAS9-HF1
Control table reach.It is non-viral in eukaryotics such as EF1a or a TetO sequence of 40nt is added in downstreams of viral promotors, and
Add the termination signal sequence BGHpA of bovine growth hormone gene.In addition under the startup of the eukaryotic promoters such as PGK, a knot is expressed
The albumen TetR of identification TetO is closed, the TetO sequences to the gene of spCAS9 (or spCAS9-HF1) carry out inhibition combination, and
Add the termination signal Glo-pA of people's beta-globin genes.PGK, TetR and Glo-pA have made reverse insertion, prevent expression cassette
That is caused on same positive-sense strand or antisense strand is interfering with each other.The plasmid equally with the addition of RRE, cPPT, WPRE and Ins insulation
Subcomponent, used Sin3 ' the LTR sequences that security is higher, it is to avoid the interference to internal promoters such as EF1a.Obtain plasmid
p7.Because the length of TetR protein expression frames is larger, cause the length of plasmid p7 excessive, it is contemplated that to remove TetR expression cassettes, from
And obtain plasmid p8.And TetR expression cassettes are put into multiple gRNA expression plasmids and get on to express, such as the expression cassette is inserted into
In p4 plasmids.S/MAR elements can be also respectively added on p7, p8 and p9 plasmid in addition, the plasmid before integrating is improved in the cell
Holdup time, respectively obtain plasmid p10, p11 and p12.The composition of plasmid p7, p8, p9, p10, p11, p12 Insert Fragment shows
Intention is shown in Figure 10.
The building process of plasmid p7 is as follows:The structure of 5 ' FIV-U3-LTR, RRE and cPPT is identical with p1, SA (splice
Acceptor) sequence is removed, and inserts EcoRI the and XhoI sites of plasmid backbone.Ins and EF1a-TetO sequences are by overlapping
PCR methods are cloned, and two ends are connected into XhoI and BamHI sites.High-fidelity PCR amplification spCAS9, two ends carry NLS coded sequences, insert
Enter AgeI the and KpnI sites of plasmid backbone.PGK-TetR-GlobinpA is expanded by fusion DNA vaccine method, upstream insertion NotI
Site, downstream introduces KpnI restriction enzyme sites, reversely the KpnI and NotI plasmid enzyme restrictions site of insertion plasmid backbone.BGH-pA is upper and lower
Trip introduces KpnI sites, inserts the KpnI sites of plasmid backbone, identifies direction, the positive insertion person of selection.wPRE-Ins-
Sin3 ' LTR are expanded by over-lap PCR and connected, and two ends introduce SpeI and PacI sites respectively, insert plasmid backbone SpeI and
PacI restriction enzyme sites, sequencing identification.Its sequence is Seq ID No.9 in sequence table, and plasmid map is as shown in figure 11.
The building process of plasmid p9 is as follows:Synthesis FIV5 ' Duan U3 areas, then carry out over-lap PCR expansion with HIV-15 ' ends U3
Increase, amplification Chong Die with the U3 downstreams of HIV-1 obtains chimeric 5 '-LTR U3 sequences.Will chimeric 5 '-LTR U3 and R-U5- Ψ,
Whole FIV is fitted together to 5 '-LTR-R- by the PCR fragment mixing of RRE and cPPT, the primer with two ends respectively with EcoRI, XhoI site
U5- Ψ, RRE and cPPT amplification turn into a fragment, insert EcoRI, XhoI site of plasmid backbone.Then with being respectively provided with
The primer of XhoI and BamHI expands the cHS4 insulators (Ins) of 225bp, and be inserted respectively into plasmid backbone XhoI and
BamHI sites.HU6-g1 and hU1-g2 fusion fragments insert BamHI the and AgeI sites of plasmid backbone by PCR method.g3-
The sequence of mU6 and h7SK-g4 introduces restriction enzyme site by artificial synthesized using PCR, inserts the AgeI and KpnI of plasmid backbone
Site.The expression cassette of TetR, containing PGK, TetR and beta-globin PolyA signal sequences, the method by over-lap PCR synthesizes
It is a complete encoder block sequence, inserts KpnI the and NotI restriction enzyme sites of plasmid backbone.WPRE, Ins and Sin3 '-LTR then leads to
Cross two ends and be respectively provided with the primer in SpeI and PacI sites and directly expand p4 the or p5 plasmids that have connected and obtain, then insert
Enter SpeI the and PacI sites of plasmid backbone.Plasmid p9 confirms that without mutation plasmid map is as shown in figure 12 by sequencing.
Plasmid p10,11,12 are obtained by inserting S/MAR sequences on the basis of plasmid p7, p8 and p9 respectively.
When genome editor is applied to, the plasmid that will encode spCAS9 (or spCAS9-HF1) and coding gRNA is taken two-by-two
With using, such as one or more in p7, p8, p10 or p11, with plasmid p9 or/and p12 matched combined;P7 or/and p10,
With p4 or/and p5 matched combineds.Combinations thereof is capable of achieving tetracycline inducible expression, i.e., without tetracycline and its novel analogs
When, TetR suppresses spCAS9 expression, and when there is certain density tetracycline or its new analog, can release TetR
To the expression inhibiting of spCAS9, start expression.Taken when by p7 and p9 collocation, p7 and p12 collocation, p10 and p9 collocation, p10 and p12
Timing, can improve the expression quantity of TetR albumen.
Other above-mentioned p7, p8, p9, p10, p11, p12 can also collocation be used mutually with p1, p2, p3, p4, p5, p6-1,
Need to only be followed in different collocation and contain at least one spCAS9 expression plasmids and at least one multiple gRNA expression plasmids.
Whether different collocation involves the R element-TAT system controllability and measurabilities or tetracycline induction controllability that can realize spCAS9.
Embodiment 5 uses double-plasmid expression system (HIV mediated integrations, the TAT/R induction spCAS9 tables of p13, p14, p15
Up to mediating multiple gRNA to integrate with AAV, Tet-ON induced expression double-mass models system) structure
The design of expression plasmid p13, p14:The pUC pUC also uses double-mass model and expresses spCAS9 and gRNA respectively.
The genome conformity that the expression plasmid of spCAS9 is mediated using HIV-1, the expression of spCAS9 is melted using R element and EF1a promoters
Control is closed, the induction releasing R element for TAT checks effect to EF1a promoters.And BGHpA is added, use Sin3 ' LTR
Element, RRE, cPPT, wPRE and Ins, such as p13 plasmids.The plasmid for expressing multiple gRNA can be using adeno-associated virus (AAV)
Integrated mechanism carries out the integration of genome, but needs to express the REP albumen of AAV, due to the protein on cells toxic side effect, because
This needs to carry out the expression of REP albumen the control of Tet-ON elements, such as p14, and needs for expressing gene to be inserted into 2
Between set terminal repetitive sequence (Inverted Terminal Rpeats, ITR).AAV viruses etc. can increase to AAVS1 sites
Site-directed integration, reduces Non-specific integration.Plasmid p13, p14 Insert Fragment composition schematic diagram is shown in Figure 13.
The building process of plasmid p14 is as follows:
Left side ITR-hU6-g1-g2-hH1 is inserted XhoI the and BamHI sites of plasmid backbone by over-lap PCR, then
MU6-g3-g4-h7SK-ITR is inserted BamHI the and AgeI restriction enzyme sites of plasmid backbone by over-lap PCR.By over-lap PCR
By the fusion of EF1a and TetO short-movie sections, AgeI the and XbaI sites of plasmid backbone are inserted.PCR expands the REP gene codes of AAV-2
Frame, inserts XbaI the and NotI sites of plasmid backbone.PCR methods expand BGH poly A, insert the NotI and AgeI of plasmid backbone
Site, sequencing identification.
On an other empty carrier, the sense primer with PacI and the anti-sense primer pair with KpnI, PCR amplifications
PGK promoters, PacI the and KpnI sites of insertion vector.There is the primer PCR of KpnI and downstream with HindIII to expand with upstream belt
Increase TetR, KpnI the and HindIII sites of insertion vector.Upstream is expanded with the primer PCR of HindIII and downstream with EcoRI
Increase S/MAR sequences, insertion HindIII and EcoRI sites.Then with the sense primer with EcoRI and with EcoRI and SpeI
The anti-sense primer of (closing on inner side in EcoRI) expands beta-globin PolyA signal sequences to PCR, inserts EcoRI
Point, identifies forward and reverse, takes positive insertion person.SpeI and PacI digestion carriers are finally used, fragment two ends carry SpeI and PacI
Point, on the plasmid built before insertion, sequencing identification.Its plasmid map is as shown in figure 14.
After two sequences of plasmid merge, using SpeI and PacI digestions, small fragment gel extraction is constructed before insertion
SpeI the and PacI restriction enzyme sites of plasmid, sequencing identification excludes mutation.
In addition, devising gRNA target practice sequence g5 for CCR5 in embodiment 1, started by sh7SK promoters and expressed, used
In shearing CCR5 sequences, 4 kinds of gRNA with anti-HIV-1 are shared, and constitute 5 heavy gRNA, and 5 effects of stressing defense are played to AIDS, are contained
There is the plasmid of the sequence in addition to p6-3, also one plasmid from double-mass model system is named as p15, Insert Fragment composition
Schematic diagram is shown in Figure 15, and the multiple gRNA expressed sequences of insertion are Seq ID No.21.Plasmid p15 can be carried with any of above
Encoding plasmids (plasmid p1, p2, p3, p7, p8, p10, the p11 and p13) collocation of spCAS9 is used.
The HIV-1 mediated integrations of embodiment 6, shearing of the TAT induced expression double-mass model systems to HIV-1 genomes
Hela cells (surely turning p16) is transfected using commercial cationic liposome lipofectamine 2000, after 36 hours,
Detect the activity (Promega kits) of luciferase.Result such as Figure 17, display p1-p4 combination, p1-p5 combinations, p2-p5 groups
Close and p3-p5 combines the activity that can reduce the luciferase that p16 is encoded, and singly turning p1, p2, p3, p4 and p5 can not change
P16 surely turns the activity of the intracellular luciferases of Hela, and wherein blank is idle running liposome.
P1, p3, p4, p5 and the p1-p4 packed using slow virus are combined, p3-p4 is combined, p3-p5 is combined and infected respectively
P16- surely turns Hela cells, wherein the virus for combining infection is by infecting the slow virus that shown plasmid is packed, 2 subinfections respectively
Interval 12-24 hours.6th day cell lysis after virus infection, detect the uciferase activity of cell.Figure 18 shows, single matter
The infection of grain can not change the activity of luciferase, can be very after two kinds of virus combinations (p1-p4, p3-p4 or p3-p5) of infection
Significant reduction even eliminates the expression of luciferase to close to background level.
Shearing of the Tet-ON inductivities double-plasmid expression system of embodiment 7 to HIV-1 genomes
P7 and p8 etc. expresses the plasmid of spCAS9, and promoter is built-in eukaryotic promoter, and the promoter is subject to TetO-TetR
Control, when cell tetracycline and the like induction after, combinations of the TetR to TetO elements can be released, eliminate its expression
Inhibitory action.P7, p8 plasmid can be used with the collocation of the plasmid such as p4, p5, p9 and p15.In p16 surely turns Hela cells, single turn
Or the corresponding plasmid of corotation, cell is divided into 2 groups, and one group is processed without Doxycycline (abbreviation Dox), another set 40ng/
Ml Dox are processed 36 hours, and 2 transfections are spaced 12 hours, and the molecular number ratio of plasmid is 1 in many plasmid combinations:1, last time
Cell lysis after transfecting 36 hours, determine the activity of luciferase.Experiment display, is not added with Dox treatment, no matter turn in wink simple substance or
Double-mass model, can not substantially eliminate the expression (Figure 19) of the luciferase of Hela cells (surely turning p16).And with Dox process it is thin
In born of the same parents, double Pignus pignoris grain (p7-p4, p7-p9 or p8-p9) significantly reduce the table of the luciferase of Hela cells (surely turning p16)
Up to (p<0.05) (T inspections).
P7 plasmids and p9 plasmids pack slow virus respectively, and single virus or Geminivirus infection p16 surely turn Hela cells, point
Into 2 groups, one group is not added with Dox, and one group adds Dox (treatment 5 days), and virus determines the activity of luciferase for metainfective 6th day.Figure 20
It has been shown that, adding the double-mass model p7-p9 of Dox groups can effectively eliminate the expression of luciferase.
The expression control of the spCAS9 that 8 R elements of embodiment/TAT is started to built-in promoter
P6-1 plasmids are that spCAS9 and gRNA are incorporated on a plasmid, can so use table by a plasmid
Reach.It is limited to the capacity packing of slow virus, this Large plasmid cannot be controlled table using some elements of more HIV-1 viruses
Reach.Therefore the built-in promoter EF1a that R element is cloned into spCAS9 is up, with built-in eukaryotic promoter co- controlling spCAS9
Expression.Eliminated when by HIV genomes and other genetic constitutions, and automatically after elimination TAT, you can prevent the expression of spCAS9.
P13 plasmids are also to follow the thinking and build.To verify behind EF1a promoters plus whether R element is to the table of spCAS9
Experimental verification is turned using wink up to control action is played.Plasmid is combined using pLVX_Luc, pTAT and p6-1 or p13-p4 for building to be total to
Turn, it is found that p6-1 simple substance grain can suppress the expression of luciferase of pLVX_Luc, the plasmid group of p13 and p4 in the presence of pTAT
Conjunction also has certain inhibitory action in the presence of pTAT, and each experimental group for lacking pTAT plasmids can not effectively suppress fluorescence
The expression (Figure 19) of plain enzyme, p6 is the abbreviation of plasmid p6-1 in figure.
Control actions of the TAT of embodiment 9 to built-in promoter-R element chimeric promoters
P6-1 and p13 plasmids are using the spCAS9 inside the chimeric promoters control of built-in promoter EF1a and R element
Expression, by turning the above-mentioned plasmid of expression and pTAT encoding plasmids and pLVX-Luc plasmids with complete U3 structures wink, due to
U3 areas are present on 5 '-LTR and 3 '-LTR, and targeting shearing easily causes the overall removal of the genome structure on the inside of 2 U3 areas existing
As (Sci Rep.2016Jul 7;6:28213.doi:10.1038/srep28213.), such that it is able to detect built-in gene by fluorescence
Plain enzyme is sheared inactivation.Single turn of cationic-liposome or corotation pLVX_Luc, pTAT and p6-1 or p13-p4 combination plasmids, its
Middle pLVX_Luc plasmids are 1 with the molecular number ratio of other plasmids:3, cell is common Hela, and transfection determines fluorescein after 36 hours
The activity of enzyme, p6 is the abbreviation of plasmid p6-1 in figure.Figure 21 display expression pTAT groups effectively reduce the activity of luciferase
(p<0.05, T inspection).
The effect that the CCR5 of embodiment 10 targetings gRNA is sheared to CCR5
P15 plasmids increased a gRNA, i.e., for the gRNA of CCR5, if the target practice sequence can cut to CCR5
Cut, it will form the effect of similar CCR5 △ 32 mutation, the mutation causes single open reading frame to misplace, lacked logical with G-protein signal
The related extracellular tricyclic structure in road, so as to lose the function that mediation HIV-1 enters cell, HIV-1 senses is prevented so as to play
Contaminate the effect of T lymphocytes.Therefore while gRNA of the design for HIV-1 genomes, one is increased for CCR5's
GRNA sequences, so as to form 5 heavy gRNA, the plasmid is p15, and the plasmid can be with p1, p2, p3, p7, p8, p10, p11 and p13
Plasmid is arranged in pairs or groups, and constitutes double-mass model targeting system.To verify its effect, p7 and p15 combinations are verified.pCCR5_P2A_Luc
The common Hela cells (pCCR5_P2A_ of lipofectamine2000 cotransfections is used in plasmid and p7, p15 or p7-p15 combination respectively
Luc plasmids are 1 with the molecular proportion of other plasmids:3).Transfection determines the activity of luciferase after 36 hours, observation CCR5 is sheared
Afterwards to the influence of luciferase, as a result such as Figure 22, turn relative to simple substance grain wink, p7 and p15 combinations can effectively cut CCR5,
Substantially reduce the activity (p of the luciferase of pCCR5_P2A_Luc<0.01, T inspection).
The film dispersion method of embodiment 11 makes liposome
(1) synthesis of polypeptide-PEG-PE
1. it is respectively synthesized targeted polypeptide with solid-phase synthesis:aCD4、aCCR5、aCXCR4、aCD4_CCR5、aCCR5_
CD4, aCD4_CXCR4, aCXCR4_CD4, aCCR5_CXCR4, aCXCR4_CCR5, sequence are respectively Seq ID in sequence table
No.10、ASTTTNYT、Seq ID No.11、Seq ID No.12、Seq ID No.13、Seq ID No.16、Seq ID
No.17、Seq ID No.14、Seq ID No.15.Wherein aCXCR4, aCD4_CXCR4, aCXCR4_CD4, aCCR5_CXCR4,
The step of aCXCR4_CCR5 also additionally will carry out disulfide bond and is cyclized in building-up process.
2. the anti-CD4 of the anti-CCR5- of targeted polypeptide recombinate scFV recombinant polypeptides (B of Pel containing E.coli guide peptide) (referred to as
ACCR5_CD4scFV synthesis):Wherein anti-CD4scFV fragments are obtained, the table by screening full people's phage antibody displaying storehouse
Optimize by bacterium rare codon up to sequence, express nucleic acid sequence be in sequence table Seq ID No.18 (amino acid sequence is
Seq ID No.21), expression can obtain aCD4scFV polypeptides.Anti- CCR5 polypeptides sequence is added behind E.coli Pel B guiding peptides
Row, sequence is Seq ID No.19 (amino acid sequence is Seq ID No.22) in sequence table, and expression can obtain aCCR5_
CD4scFV polypeptides.The expressed sequence is inserted into sequence table and forms expression vector in Seq ID No.6 plasmid backbones, by this
Expression vector obtains mesh after E.coli BL21 expression by conventional concentration, extraction and purifying (crossing ion column and gel column)
Polypeptide.
3. penetrating peptide is synthesized with solid-phase synthesis, sequence is:H-TyrGlyArgLysLysArgArgGlnArgArgArg-
NH2。
4. the synthesis of pNP-PEG-PE:Take 240mg PE to be put into flask, the imitative preparation of chlorination turns into 50mg/ml solution, plus
Enter 800 μ l triethylamines (TEA), then weigh 10g pNP-PEG-pNP and be dissolved in 50ml chloroforms, be added into above-mentioned preparing
In PE-TEA- chloroform mixtures.Nitrogen will be filled with reaction flask, then room temperature magnetic stirrer over night uses rotary evaporator
(volume 3L, rotating speed 80-180rpm) evaporates chloroform, adds 120ml 0.01M HCl, carries out water bath sonicator, room temperature reaction overnight,
Nitrogen is protected, ultrasonically treated formation emulsion.Using CL-4B gel filtration chromatography reactant mixtures, chromatography solution is 0.01M
HCl, 0.15M NaCl.PNP-PEG-PE components add chloroform purifying pNP-PEG-PE, end-product to be dissolved in chloroform after lyophilized
In, -80 DEG C store for future use.
5. the synthesis of targeted polypeptide-PEG-PE/ penetrating peptides-PEG-PE:Take 200mg pNP-PEG-PE and be dissolved in 20ml chlorine
It is imitative, to put in 100ml flasks, rotary evaporation removes chloroform.Targeted polypeptide/penetrating peptide is dissolved in 0.01M HCl/water solution, adds 10
Targeted polypeptide/the penetrating peptide of times pNP-PEG-PE molal quantitys, shakes 30min.40ml 50mM Tris (pH 8.7) are added, is filled
Enter nitrogen protection, 4 DEG C are reacted 24 hours.Product crosses Sepharose CL-4B column chromatographies, and -20 DEG C of product freezes standby.
The product obtained by the step is had:aCD4-PEG-PE、aCCR5-PEG-PE、aCXCR4-PEG-PE、aCD4_
CCR5-PEG-PE、aCCR5_CD4-PEG-PE、aCD4_CXCR4-PEG-PE、aCXCR4_CD4-PEG-PE、aCCR5_CXCR4-
PEG-PE, aCXCR4_CCR5-PEG-PE, aCCR5_CD4scFV-PEG-PE, penetrating peptide aCCR5_CD4scFV-PEG-PE.
(2) synthesis of hyaluronic acid-PEG-PE:Hyaluronic acid is in EDC (1- (3- dimethylamino-propyls) -3- ethyls first
Carbodiimide hydrochloride)/NHS (N-hydroxy-succinamide) effect under carry out sulfhydrylation, take 100mg hyaluronic acids, be dissolved in
In 10ml EDC/NHS containing 200mM solution, pH5.0 is suspended half an hour, adds 200mg Cys, and room temperature is suspended anti-
Answer 4 hours.The hyaluronic acid of sulfhydrylation is dialysed by the pellicle of MWCO 20kDa, and dialyzate is the hydrochloric acid of pH5.0
The aqueous solution, dialysis 3 times after, be stored in the aqueous solution of pH5.0, -80 DEG C freeze it is standby.Maleimide-PEG-PE and sulfhydrylation
Hyaluronic acid is with mol ratio 1:1 mixing (Hepes pH of buffer 7.4), room temperature reaction 12-16 hours under inflated with nitrogen, product is used
Sepharose CL-4B are purified.
(3) preparation of endotoxin-free plasmid:Plasmid transfection STBL3 Host Strains, positive bacterium colony is seeded in the LB containing penicillin
In culture medium, amplify culture (containing 200 μ g/ml penicillin) overnight, second day centrifugation thalline, using the big upgrading grain examination of endotoxin-free
Agent box cracks thalline, crosses post purifying and wash-out plasmid, by after concentration mensuration and agarose electrophoresis identification, -20 DEG C save backup,
Plasmid concentration is 200ng/ μ l-8000ng/ μ l.The plasmid for treating loading form is pure plasmid, can also add polyethyleneimine, fish
, used as assistant carrier, polyethyleneimine molecular weight is in 600-75000, preferably band for protamine or 5-12 positive charges amino acid polypeptide
The polyethyleneimine (molecular weight 25000) of side chain.
(4) preparation of the liposome of embedding plasmid:By DSPC:Cholesterol:Polypeptide-PEG2000-PE, in mass ratio 50:
15:1 is made uniform mixed liquor, is dissolved in 10 times of chloroforms of volume:Methyl alcohol (2:1, V/V) solution, then in a rotary evaporator
Evaporation solvent, forms thin layer.Plasmid and its auxiliary ingredients are added in reconstitution process, the wherein buffer solution used by plasmid is 5-50mM
HEPES (pH7.5-8.2), the concentration of plasmid is in 200ng/ μ l.50 DEG C of heating are vortexed and process 5 minutes, are repeated 3 times, and are embedded
The liposome film bubble of plasmid.Liposome film bubble is extruded, it is equal with the aperture for reducing liposome and the particle diameter for improving liposome
Evenness, liposome squeezes through the cellulose acetate sheets or polycarbonate membrane in 200nm apertures at 40-65 DEG C, and 5- is repeated
20 times, the unilamellar liposome film bubble for being obtained is purified through semi-permeable membrane dialysed overnight, or by SephadexG-50 posts or class
Like gel column purifying, to remove free non-integral protein and impurity, or purified through semi-permeable membrane dialysed overnight.
The liposome molecule of obtained loading plasmid is determined by laser particle analyzer.Table 1 several represents property for selected
The liposome average size of grain.The liposomal particle size uniformity made using the method is higher, and particle diameter concentrates on 120- substantially
In the range of 180nm, design requirement is basically reached.
Table 1 loads the liposome average size of different plasmids
The plasmid of loading | p1 | p4 | p6-1 | p7 | p13 | p15 |
Particle diameter (nm) | 163.9 | 168.4 | 158.2 | 166.7 | 160.8 | 163.3 |
Conveying effect analysis of liposome of the embodiment 12 with different targeting elements to plasmid
Homemade liposome with different targeting elements loads different plasmids, and transfection carries different expressed receptors
Hela cells, not the Hela cells of isoacceptor be used to simulate lymphocyte of the human body with corresponding acceptor and the endocytosis of liposome made
With.Wherein all cells have surely turned p16 plasmids, to the cellular environment of simulated infection HIV-1, phase are integrated to provide HIV-1
The enzyme of pass TAT protein related to expression control etc..
(A) 4 kinds of liposomes are transfected, empty plasmid (empty), the control liposome of p3-p5 plasmids is respectively loaded, is loaded
The CD4 target liposomes (containing anti-CD4-PEG-PE) of p3-p5 plasmids and the CCR5 target liposomes of loading p3-p5 plasmids are (containing anti-
CCR5-PEG-PE).Transfect 2 kinds of cells respectively, one is that p16 surely turns Hela cell lines, one kind be p16, pLVX-CD4 and
The Hela cell lines of triple steady turn of pLVX-CCR5, transfect 2 times, are spaced 12 hours, and last time transfection determines fluorescence after 36 hours
The activity of plain enzyme.The plasmid p3 collocation plasmid P5 of Figure 23 A display TAT regulation types can shear genome, make the table of luciferase
Up to decline, after the PEG-PE of the small peptide coupling of anti-CD4 and anti-CCR5 loads liposome, it is possible to increase the cell of target practice plasmid is defeated
Enter ability.
(B) 5 kinds of liposomes are transfected, empty plasmid (empty), the control liposome of p3-p5 plasmids is respectively loaded, is loaded
CD4 the and CCR5 target liposomes (aCCR5_CD4scFV-PEG-PE) of p3-p5 plasmids, the CD4 and CCR5 that load p3-p5 plasmids
CD4 the and CCR5 target liposomes (aCCR5_CD4- of target liposomes (aCCR5_CD4-PEG-PE) and loading p3-p5 plasmids
PEG-PE and penetrating peptide-PEG-PE).Transfect 2 kinds of cells respectively, one is that p16 surely turns Hela cell lines, one kind be p16,
The Hela cells of triple steady turn of pLVX-CD4 and pLVX-CCR5, transfect 2 times, are spaced 12 hours, after last time is transfected 36 hours
Determine the activity of luciferase.Figure 23 B display aCCR5_CD4scFV-PEG-PE modified liposomes can improve plasmid p3-p5 groups
The positive Hela cells of input CD4, CCR5 are closed, aCCR5_CD4-PEG-PE modified liposomes can also improve plasmid pair CD4-CCR5
The transfection abilities of positive Hela cells;The liposome effect that aCCR5_CD4-PEG-PE and penetrating peptide-PEG-PE is modified jointly
The transfection of the relatively simple liposome modified using aCCR5_CD4-PEG-PE will get well (p<0.05).
(C) 3 kinds of liposomes are transfected, plasmid p6-1 is respectively loaded, is loaded plasmid p6-1 (coating CXCR4-PEG-PE), dress
Carry the liposome (coating CXCR4-PEG-PE) of p7-p4 plasmid combinations.P6 is the abbreviation of plasmid p6-1 in figure.2 kinds are transfected respectively
Cell, one is that p16 surely turns Hela cell lines, and a kind of is the Hela cells of double steady turn of p16, pLVX-CXCR4.With loading p7-
The cell that the liposome (coating CXCR4-PEG-PE) of p4 plasmid combinations is processed makees 2 kinds for the treatment of, one kind plus Dox treatment, and one kind is not
Plus Dox treatment, transfect 2 times, it is spaced 12 hours, last time transfection determines the activity of luciferase after 36 hours.Figure 23 C show
The liposome of aCXCR4-PEG-PE modifications can also improve the positive cell inputs of target practice plasmid pair CXCR4, also show p6-1 mono-
PUC pUC has genomic modification ability as double-mass model system.
(D) 4 kinds of liposomes are transfected, p6-1 plasmids is loaded respectively, p6-1 plasmids (aCCR5_CD4-PEG-PE) is loaded, is loaded
P7-p4 plasmid combinations, loading p7-p4 plasmid combinations (aCCR5_CD4-PEG-PE, aCXCR4 and penetrating peptide).2 kinds are transfected respectively
Cell, one is that p16 surely turns Hela cell lines, and a kind of is the Hela cells of triple steady turn of p16, pLVX-CD4 and pLVX-CCR5.
Wherein with the cell of the liposome-treated for loading p7-p4 plasmid combinations (containing anti-CD4-CCR5-PEG-PE), 36 are processed with DOX small
When.Transfection 2 times, is spaced 12 hours, and last time transfection determines the activity of luciferase after 36 hours.Figure 23 D show aCCR5_
The liposome of CD4-PEG-PE modifications has the liposome intake ability for improving the positive Hela cells of CD4, CCR5.P6 is in figure
The abbreviation of plasmid p6-1.
In above-mentioned four groups of experiments, the molecular number ratio of plasmid is 1 in all many plasmid combinations:1, * in figure:p<0.05 (T inspections
Test).
The Targeting Effect that embodiment 13 loads the liposome of the target spots of CCR5-CD4-CXCR4 tri- and penetrating peptide is determined:
5 kinds of liposomes of transfection, load empty plasmid, p1-p4 combinations plasmid, p1-p4 combination plasmids (aCCR5_CD4 respectively
Combined peptide-PEG-PE), p1-p4 combine plasmid (aCCR5_CD4 combined peptide-PEG-PE and aCXCR4-PEG-PE) and p1-p4 groups
Conjugative plasmid (aCCR5_CD4 combined peptides-PEG-PE, aCXCR4-PEG-PE and penetrating peptide-PEG-PE).Transfection p16 surely turns respectively
The Hela cells that Hela cells or p16, pLVX-CD4, pLVX-CCR5 and pLVX-CXCR4 quadruple surely turn, transfect 2 times, between transfection
Every 12 hours, the molecular number ratio of plasmid was 1 in many plasmid combinations:1, last time transfection determines the work of luciferase after 36 hours
Property.
As shown in figure 24, naked liposome has certain transfection abilities to cell.ACCR5_CD4- is added in liposome
After PEG-PE, the ability of Hela cell of the transfection without expression CD4, CCR5 and CXCR4 declines on the contrary.But when transfection expression CD4,
After the Hela cells of CCR5 and CXCR4, the transfection abilities of the liposome of aCCR5_CD4-PEG-PE and aCXCR4-PEG-PE are added
Increase.When in liposome simultaneously add penetrating peptide-PEG-PE after, to express CD4, CCR5 and CXCR4 Hela cells turn
Dye ability is most strong, while the transfection abilities of the Hela cells to not expressing CD4-CCR5-CXCR4 are also improved, therefore penetrating peptide-
The addition of PEG-PE is improved the ability of liposome transfection cell.
The genome conformity efficiency of embodiment 14
Representative plasmid p1, p4 and p7 are chosen, above-mentioned plasmid, transfection two is transfected with lipofactamine 2000 respectively
It is secondary, it is spaced 12 hours.Last time transfect after 24 hours after, vitellophag does ten thousand times of 5000-2 dilution, allows cell monoclonal
Grow into macroscopic big clone cell colony (maintaining 0.5-1 μ g/ml puromycins in growth course all the time).After 2-4 weeks,
The cell of clonal growth is drawn with sample-adding pipette tips, 6 orifice plates of inoculation, 12 orifice plates or 24 orifice plates, each clone are inoculated with a hole, often
Individual transfection treatment sample chooses 15 clones, amplifies growth.The cell of last each clone enters performing PCR mirror by extracting STb gene
It is fixed, or a little cell is taken, directly cell is added using the kit of trace P CR templates enters performing PCR amplification in reaction tube.p1
With p7 identifications (sense primer is expanded with one section of primer for cas9:5’-GCCAGCCAGGAAGAGTTCTACA-3’;Draw in downstream
Thing:5 '-TCAGCTCGTTATACACGGTGAAGTA-3 '), p4 with a pair of identification wPRE and ins sequences (sense primer 5 '-
AATAGGGAGGGGGAAAGCGA-3’;- GGAGGGACGTAATTACATC the CCT-3 ' of anti-sense primer 5 ') special primer amplification.
Stable transfection rate is the percentage that positive cell clone number clones number divided by total cell.
The cell clone PCR results of Figure 25 show, after liposome transfection p1, p4 and p7 plasmid, are found by after PCR identifications
In p16 surely turns cell, this 3 representative plasmids there occurs the integration of genome, and it is normal it is non-it is steady turn Hela cells,
Almost there is no integrator cell phenomenon, the intracellular of p16 simulation HIV-1 infection is illustrated, with these plasmid integration genomes of help
Ability.
The virus-mediated genome conformity efficiency of the integrase Inactivating mutations packaging system of embodiment 15 packaging
Plasmid p1, p4 and p7 are packed by second generation slow virus packaging plasmid, and wherein integrase is by transformation, three
Point mutation D64K, D116Q and E152Q.Collect within 48 hours and 72 hours after transfection 293T cells virus, purifying, after determining potency
Hela cells and common Hela cells that infection p16 surely turns, 3 days rear clone cultured cells, specific steps with embodiment 14, carefully
Born of the same parents extract genomic DNA or directly carry out cell PCR measure, with the genome conformity assay method of embodiment 4.
The cell clone PCR results of Figure 26 show that the integrase saltant type slow virus of p1, p4 and p7 plasmid packaging is surely turning
In the Hela cells of p16, there occurs and effectively surely turn phenomenon, imply that there is obvious genome conformity phenomenon, and do not turn
Then there is no visible plasmid and surely turn phenomenon in the Hela cells of dye p16 plasmids, imply do not exist obvious genome conformity phenomenon.
Seq ID No.1:G1,5 '-GGAATCGAAGAAGGAGG-3 '
Seq ID No.2:G2,5 '-AAACTACACACCAGGGCC-3 '
Seq ID No.3:G3,5 '-GATAGACCCATAAATCA-3 '
Seq ID No.4:G4,5 '-GCCAAAGAGTGATTTGA-3 '
Seq ID No.5:G5,5 '-GAATTGATACTGACTGTA-3 '
Seq ID No.6
1 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT
61 TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG
121 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC
181 TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC
241 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC
301 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC
361 TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT
421 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT
481 AACTACGGCT ACACTAGAAG AACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC
541 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT
601 TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG
661 ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC
721 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA
781 TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG
841 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG
901 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA
961 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG
1021 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA
1081 GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC
1141 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA
1201 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG
1261 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT
1321 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC
1381 AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG
1441 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG
1501 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT
1561 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA
1621 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA
1681 CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC
1741 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTT AATACGACTC ACTATAGGGC
1801 GAGGAATTCT CACTCGAGAA GCTTGGATCC TGAACCGGTT GAGGTACCGA TATCTGATCA
1861 TGATCTAGAT CAGCGGCCGC ATACACTAGT TGATTAATTA A
Seq ID No.7:The sequence of plasmid p1
1 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT
61 TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG
121 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC
181 TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC
241 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC
301 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC
361 TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT
421 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT
481 AACTACGGCT ACACTAGAAG AACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC
541 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT
601 TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG
661 ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC
721 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA
781 TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG
841 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG
901 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA
961 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG
1021 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA
1081 GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC
1141 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA
1201 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG
1261 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT
1321 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC
1381 AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG
1441 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG
1501 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT
1561 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA
1621 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA
1681 CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC
1741 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTT AATACGACTC ACTATAGGGC
1801 GAGGAATTCT GGAAGGGCTA ATTTGGTCCC AAAAAAGACA AGAGAATGCT TATGGACTAG
1861 GGACTGTTAC AAACAAATGA TGAATGGAAA CAGCTGAGCA TGACTCATAG CTGAAGCGCT
1921 AGCAGCTGCT TAACCGCAAA ACCACATCCT ATGTAAAGCT TGCTAATGAC GTATAAGTTG
1981 CTCCACTGTA AAAGTATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA
2041 CCAGATCTGA GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA
2101 AAGCTTGCCT TGAGTGCTTC AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA
2161 GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGCG CCCGAACAGG
2221 GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG GCTTGCTGAA
2281 GCGCGCACGG CAAGAGGCGA GGGGCGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC
2341 GGAGGCTAGA AGGAGAGAGA TGGGTGCGAG AGCGTCAGTA TTAAGCGGGG GAGGATCCGG
2401 ATCCTGAACC GGTATGCCCA AGAAGAAACG CAAAGTCGCT TCAGACAAGA AGTACAGCAT
2461 CGGCCTGGAC ATCGGCACCA ACTCTGTGGG CTGGGCCGTG ATCACCGACG AGTACAAGGT
2521 GCCCAGCAAG AAATTCAAGG TGCTGGGCAA CACCGACCGG CACAGCATCA AGAAGAACCT
2581 GATCGGAGCC CTGCTGTTCG ACAGCGGCGA AACAGCCGAG GCCACCCGGC TGAAGAGAAC
2641 CGCCAGAAGA AGATACACCA GACGGAAGAA CCGGATCTGC TATCTGCAAG AGATCTTCAG
2701 CAACGAGATG GCCAAGGTGG ACGACAGCTT CTTCCACAGA CTGGAAGAGT CCTTCCTGGT
2761 GGAAGAGGAT AAGAAGCACG AGCGGCACCC CATCTTCGGC AACATCGTGG ACGAGGTGGC
2821 CTACCACGAG AAGTACCCCA CCATCTACCA CCTGAGAAAG AAACTGGTGG ACAGCACCGA
2881 CAAGGCCGAC CTGCGGCTGA TCTATCTGGC CCTGGCCCAC ATGATCAAGT TCCGGGGCCA
2941 CTTCCTGATC GAGGGCGACC TGAACCCCGA CAACAGCGAC GTGGACAAGC TGTTCATCCA
3001 GCTGGTGCAG ACCTACAACC AGCTGTTCGA GGAAAACCCC ATCAACGCCA GCGGCGTGGA
3061 CGCCAAGGCC ATCCTGTCTG CCAGACTGAG CAAGAGCAGA CGGCTGGAAA ATCTGATCGC
3121 CCAGCTGCCC GGCGAGAAGA AGAATGGCCT GTTCGGAAAC CTGATTGCCC TGAGCCTGGG
3181 CCTGACCCCC AACTTCAAGA GCAACTTCGA CCTGGCCGAG GATGCCAAAC TGCAGCTGAG
3241 CAAGGACACC TACGACGACG ACCTGGACAA CCTGCTGGCC CAGATCGGCG ACCAGTACGC
3301 CGACCTGTTT CTGGCCGCCA AGAACCTGTC CGACGCCATC CTGCTGAGCG ACATCCTGAG
3361 AGTGAACACC GAGATCACCA AGGCCCCCCT GAGCGCCTCT ATGATCAAGA GATACGACGA
3421 GCACCACCAG GACCTGACCC TGCTGAAAGC TCTCGTGCGG CAGCAGCTGC CTGAGAAGTA
3481 CAAAGAGATT TTCTTCGACC AGAGCAAGAA CGGCTACGCC GGCTACATTG ACGGCGGAGC
3541 CAGCCAGGAA GAGTTCTACA AGTTCATCAA GCCCATCCTG GAAAAGATGG ACGGCACCGA
3601 GGAACTGCTC GTGAAGCTGA ACAGAGAGGA CCTGCTGCGG AAGCAGCGGA CCTTCGACAA
3661 CGGCAGCATC CCCCACCAGA TCCACCTGGG AGAGCTGCAC GCCATTCTGC GGCGGCAGGA
3721 AGATTTTTAC CCATTCCTGA AGGACAACCG GGAAAAGATC GAGAAGATCC TGACCTTCCG
3781 CATCCCCTAC TACGTGGGCC CTCTGGCCAG GGGAAACAGC AGATTCGCCT GGATGACCAG
3841 AAAGAGCGAG GAAACCATCA CCCCCTGGAA CTTCGAGGAA GTGGTGGACA AGGGCGCTTC
3901 CGCCCAGAGC TTCATCGAGC GGATGACCGC TTTCGATAAG AACCTGCCCA ACGAGAAGGT
3961 GCTGCCCAAG CACAGCCTGC TGTACGAGTA CTTCACCGTG TATAACGAGC TGACCAAAGT
4021 GAAATACGTG ACCGAGGGAA TGAGAAAGCC CGCCTTCCTG AGCGGCGAGC AGAAAAAGGC
4081 CATCGTGGAC CTGCTGTTCA AGACCAACCG GAAAGTGACC GTGAAGCAGC TGAAAGAGGA
4141 CTACTTCAAG AAAATCGAGT GCTTCGACTC CGTGGAAATC TCCGGCGTGG AAGATCGGTT
4201 CAACGCCTCC CTGGGCACAT ACCACGATCT GCTGAAAATT ATCAAGGACA AGGACTTCCT
4261 GGACAATGAG GAAAACGAGG ACATTCTGGA AGATATCGTG CTGACCCTGA CACTGTTTGA
4321 GGACAGAGAG ATGATCGAGG AACGGCTGAA AACCTATGCC CACCTGTTCG ACGACAAAGT
4381 GATGAAGCAG CTGAAGCGGC GGAGATACAC CGGCTGGGGA GCACTGAGCC GGAAGCTGAT
4441 CAACGGCATC CGGGACAAGC AGTCCGGCAA GACAATCCTG GATTTCCTGA AGTCCGACGG
4501 CTTCGCCAAC AGAAACTTCA TGGCTCTGAT CCACGACGAC AGCCTGACCT TTAAAGAGGA
4561 CATCCAGAAA GCCCAGGTGT CCGGCCAGGG CGATAGCCTG CACGAGCACA TTGCCAATCT
4621 GGCCGGCAGC CCCGCCATTA AGAAGGGCAT CCTGCAGACA GTGAAGGTGG TGGACGAGCT
4681 CGTGAAAGTG ATGGGCCGGC ACAAGCCCGA GAACATCGTG ATCGAAATGG CCAGAGAGAA
4741 CCAGACCACC CAGAAGGGAC AGAAGAACAG CCGCGAGAGA ATGAAGCGGA TCGAAGAGGG
4801 CATCAAAGAG CTGGGCAGCC AGATCCTGAA AGAACACCCC GTGGAAAACA CCCAGCTGCA
4861 GAACGAGAAG CTGTACCTGT ACTACCTGCA GAATGGGCGG GATATGTACG TGGACCAGGA
4921 ACTGGACATC AACCGGCTGT CCGACTACGA TGTGGACCAT ATCGTGCCTC AGAGCTTTCT
4981 GAAGGACGAC TCCATCGACA ACAAGGTGCT GACCAGAAGC GACAAGAACC GGGGCAAGAG
5041 CGACAACGTG CCCTCCGAAG AGGTCGTGAA GAAGATGAAG AACTACTGGC GGCAGCTGCT
5101 GAACGCCAAG CTGATTACCC AGAGAAAGTT CGACAATCTG ACCAAGGCCG AGAGAGGCGG
5161 CCTGAGCGAA CTGGATAAGG CCGGCTTCAT CAAGAGACAG CTGGTGGAAA CCAGGGCAAT
5221 CACAAAGCAC GTGGCACAGA TCCTGGACTC CCGGATGAAC ACTAAGTACG ACGAGAATGA
5281 CAAGCTGATC CGGGAAGTGA AAGTGATCAC CCTGAAGTCC AAGCTGGTGT CCGATTTCCG
5341 GAAGGATTTC CAGTTTTACA AAGTGCGCGA GATCAACAAC TACCACCACG CCCACGACGC
5401 CTACCTGAAC GCCGTCGTGG GAACCGCCCT GATCAAAAAG TACCCTAAGC TGGAAAGCGA
5461 GTTCGTGTAC GGCGACTACA AGGTGTACGA CGTGCGGAAG ATGATCGCCA AGAGCGAGCA
5521 GGAAATCGGC AAGGCTACCG CCAAGTACTT CTTCTACAGC AACATCATGA ACTTTTTCAA
5581 GACCGAGATT ACCCTGGCCA ACGGCGAGAT CCGGAAGCGG CCTCTGATCG AGACAAACGG
5641 CGAAACCGGG GAGATCGTGT GGGATAAGGG CCGGGATTTT GCCACCGTGC GGAAAGTGCT
5701 GAGCATGCCC CAAGTGAATA TCGTGAAAAA GACCGAGGTG CAGACAGGCG GCTTCAGCAA
5761 AGAGTCTATC CTGCCCAAGA GGAACAGCGA TAAGCTGATC GCCAGAAAGA AGGACTGGGA
5821 CCCTAAGAAG TACGGCGGCT TCGACAGCCC CACCGTGGCC TATTCTGTGC TGGTGGTGGC
5881 CAAAGTGGAA AAGGGCAAGT CCAAGAAACT GAAGAGTGTG AAAGAGCTGC TGGGGATCAC
5941 CATCATGGAA AGAAGCAGCT TCGAGAAGAA TCCCATCGAC TTTCTGGAAG CCAAGGGCTA
6001 CAAAGAAGTG AAAAAGGACC TGATCATCAA GCTGCCTAAG TACTCCCTGT TCGAGCTGGA
6061 AAACGGCCGG AAGAGAATGC TGGCCTCTGC CGGCGAACTG CAGAAGGGAA ACGAACTGGC
6121 CCTGCCCTCC AAATATGTGA ACTTCCTGTA CCTGGCCAGC CACTATGAGA AGCTGAAGGG
6181 CTCCCCCGAG GATAATGAGC AGAAACAGCT GTTTGTGGAA CAGCACAAGC ACTACCTGGA
6241 CGAGATCATC GAGCAGATCA GCGAGTTCTC CAAGAGAGTG ATCCTGGCCG ACGCTAATCT
6301 GGACAAAGTG CTGTCCGCCT ACAACAAGCA CCGGGATAAG CCCATCAGAG AGCAGGCCGA
6361 GAATATCATC CACCTGTTTA CCCTGACCAA TCTGGGAGCC CCTGCCGCCT TCAAGTACTT
6421 TGACACCACC ATCGACCGGA AGAGGTACAC CAGCACCAAA GAGGTGCTGG ACGCCACCCT
6481 GATCCACCAG AGCATCACCG GCCTGTACGA GACACGGATC GACCTGTCTC AGCTGGGAGG
6541 CGACTCTAAA AGACCAGCAG CCACTAAAAA GGCCGGACAG GCTAAAAAGA AAAAGTAAGG
6601 TACCAGGAGC TTTGTTCCTT GGGTTCTTGG GAGCAGCAGG AAGCACTATG GGCGCAGCGT
6661 CAATGACGCT GACGGTACAG GCCAGACAAT TATTGTCTGG TATAGTGCAG CAGCAGAACA
6721 ATTTGCTGAG GGCTATTGAG GCGCAACAGC ATCTGTTGCA ACTCACAGTC TGGGGCATCA
6781 AGCAGCTCCA GGCAAGAATC CTGGCTGTGG AAAGATACCT AAAGGATCAA CAGCTCCTTT
6841 TAAAAGAAAA GGGGGGATTG GGGGGTACAG TGCAGGGGAA AGAATAGTAG ACATAATAGC
6901 AACAGACATA CAAACTAAAG AATTACAAAA ACAAATTACA AAAATTCAAA ATTTTCGGGT
6961 TTTCTAGATG GAAGGGCTAA TTTGGTCCCA AAAAAGACAA GAGAATGCTT ATGGACTAGG
7021 GACTGTTACA AACAAATGAT GAATGGAAAC AGCTGAGCAT GACTCATAGC TGAAGCGCTA
7081 GCAGCTGCTT AACCGCAAAA CCACATCCTA TGTAAAGCTT GCTAATGACG TATAAGTTGC
7141 TCCACTGTAA AAGTATATAA GCAGCTGCTT TTTGCCTGTA CTGGGTCTCT CTGGTTAGAC
7201 CAGATCTGAG CCTGGGAGCT CTCTGGCTAA CTAGGGAACC CACTGCTTAA GCCTCAATAA
7261 AGCTTGCCTT GAGTGCTTCA AGTAGTGTGT GCCCGTCTGT TGTGTGACTC TGGTAACTAG
7321 AGATCCCTCA GACCCTTTTA GTCAGTGTGG AAAATCTCTA GCATTAATTA A
Seq ID No.8:The sequence of plasmid p6-1
1 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT
61 TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG
121 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC
181 TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC
241 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC
301 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC
361 TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT
421 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT
481 AACTACGGCT ACACTAGAAG AACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC
541 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT
601 TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG
661 ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC
721 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA
781 TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG
841 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG
901 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA
961 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG
1021 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA
1081 GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC
1141 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA
1201 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG
1261 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT
1321 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC
1381 AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG
1441 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG
1501 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT
1561 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA
1621 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA
1681 CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC
1741 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTT AATACGACTC ACTATAGGGC
1801 GAGGAATTCT GGAAGGGCTA ATTTGGTCCC AAAAAAGACA AGAGAATGCT TATGGACTAG
1861 GGACTGTTAC AAACAAATGA TGAATGGAAA CAGCTGAGCA TGACTCATAG CTGAAGCGCT
1921 AGCAGCTGCT TAACCGCAAA ACCACATCCT ATGTAAAGCT TGCTAATGAC GTATAAGTTG
1981 CTCCACTGTA AAAGTATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA
2041 CCAGATCTGA GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA
2101 AAGCTTGCCT TGAGTGCTTC AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA
2161 GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGCG CCCGAACAGG
2221 GACTTGAAAG CGAAAGGGAA ACCAGAGGAG CTCTCTCGAC GCAGGACTCG GCTTGCTGAA
2281 GCGCGCACGG CAAGAGGCGA GGGGCGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC
2341 GGAGGCTAGA AGGAGAGAGA TGGGTGCGAG AGCGTCAGTA TTAAGCGGGG GAGAATTAGA
2401 TCGCGATGGG AAAAAATTCG GTTAAGGCCA GGGGGAAAGA AAAAATATAA ATTAAAACAT
2461 ATAGTATGGG CAAGCAGGGA GCTAGAACGA TTCGCAGTTA ATCCTGGCCT GTTAGAAACA
2521 TCAGAAGGCT GTAGACAAAT ACTGGGACAG CTACAACCAT CCCTTCAGAC AGGATCAGAA
2581 GAACTTAGAT CATTATATAA TACAGTAGCA ACCCTCTATT GTGTGCATCA AAGGATAGAG
2641 ATAAAAGACA CCAAGGAAGC TTTAGACAAG ATAGAGGAAG AGCAAAACAA AAGTAAGACC
2701 ACCGCACAGC AAGCGGCCGG CCGCTGATCT TCAGACCTGG AGGAGGAGAT ATGAGGGACA
2761 ATTGGAGAAG TGAATTATAT AAATATAAAG TAGTAAAAAT TGAACCATTA GGAGTAGCAC
2821 CCACCAAGGC AAAGAGAAGA GTGGTGCAGA GAGAAAAAAG AGCAGTGGGA ATAGGAGCTT
2881 TGTTCCTTGG GTTCTTGGGA GCAGCAGGAA GCACTATGGG CGCAGCGTCA ATGACGCTGA
2941 CGGTACAGGC CAGACAATTA TTGTCTGGTA TAGTGCAGCA GCAGAACAAT TTGCTGAGGG
3001 CTATTGAGGC GCAACAGCAT CTGTTGCAAC TCACAGTCTG GGGCATCAAG CAGCTCCAGG
3061 CAAGAATCCT GGCTGTGGAA AGATACCTAA AGGATCAACA GCTCCTTTTA AAAGAAAAGG
3121 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
3181 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT CTCGAGACGG
3241 GGACAGCCCC CCCCCAAAGC CCCCAGGGAT GTAATTACGT CCCTCCCCCG CTAGGGGGCA
3301 GCAGCGAGCC GCCCGGGGCT CCGCTCCGGT CCGGCGCTCC CCCCGCATCC CCGAGCCGGC
3361 AGCGTGCGGG GACAGCCCGG GCACGGGGAA GGTGGCACGG GATCGCTTTC CTCTGAACGC
3421 TTCTCGCTGC TCTTTGAGCC TGCAGACACC TGGGGGGATA CGGGGAAAAA GGATCCGTGG
3481 AGAAGAGCAT GCTTGAGGGC TGAGTGCCCC TCAGTGGGCA GAGAGCACAT GGCCCACAGT
3541 CCCTGAGAAG TTGGGGGGAG GGGTGGGCAA TTGAACTGGT GCCTAGAGAA GGTGGGGCTT
3601 GGGTAAACTG GGAAAGTGAT GTGGTGTACT GGCTCCACCT TTTTCCCCAG GGTGGGGGAG
3661 AACCATATAT AAGTGCAGTA GTCTCTGTGA ACATTCGGTC TCTCTGGTTA GACCAGATCT
3721 GAGCCTGGGA GCTCTCTGGC TAACTAGGGA ACCCACTGCT TAAGCCTCAA TAAAGCTTGC
3781 CTTGAGTGCT TCACCGGTAT GCCCAAGAAG AAACGCAAAG TCGCTTCAGA CAAGAAGTAC
3841 AGCATCGGCC TGGACATCGG CACCAACTCT GTGGGCTGGG CCGTGATCAC CGACGAGTAC
3901 AAGGTGCCCA GCAAGAAATT CAAGGTGCTG GGCAACACCG ACCGGCACAG CATCAAGAAG
3961 AACCTGATCG GAGCCCTGCT GTTCGACAGC GGCGAAACAG CCGAGGCCAC CCGGCTGAAG
4021 AGAACCGCCA GAAGAAGATA CACCAGACGG AAGAACCGGA TCTGCTATCT GCAAGAGATC
4081 TTCAGCAACG AGATGGCCAA GGTGGACGAC AGCTTCTTCC ACAGACTGGA AGAGTCCTTC
4141 CTGGTGGAAG AGGATAAGAA GCACGAGCGG CACCCCATCT TCGGCAACAT CGTGGACGAG
4201 GTGGCCTACC ACGAGAAGTA CCCCACCATC TACCACCTGA GAAAGAAACT GGTGGACAGC
4261 ACCGACAAGG CCGACCTGCG GCTGATCTAT CTGGCCCTGG CCCACATGAT CAAGTTCCGG
4321 GGCCACTTCC TGATCGAGGG CGACCTGAAC CCCGACAACA GCGACGTGGA CAAGCTGTTC
4381 ATCCAGCTGG TGCAGACCTA CAACCAGCTG TTCGAGGAAA ACCCCATCAA CGCCAGCGGC
4441 GTGGACGCCA AGGCCATCCT GTCTGCCAGA CTGAGCAAGA GCAGACGGCT GGAAAATCTG
4501 ATCGCCCAGC TGCCCGGCGA GAAGAAGAAT GGCCTGTTCG GAAACCTGAT TGCCCTGAGC
4561 CTGGGCCTGA CCCCCAACTT CAAGAGCAAC TTCGACCTGG CCGAGGATGC CAAACTGCAG
4621 CTGAGCAAGG ACACCTACGA CGACGACCTG GACAACCTGC TGGCCCAGAT CGGCGACCAG
4681 TACGCCGACC TGTTTCTGGC CGCCAAGAAC CTGTCCGACG CCATCCTGCT GAGCGACATC
4741 CTGAGAGTGA ACACCGAGAT CACCAAGGCC CCCCTGAGCG CCTCTATGAT CAAGAGATAC
4801 GACGAGCACC ACCAGGACCT GACCCTGCTG AAAGCTCTCG TGCGGCAGCA GCTGCCTGAG
4861 AAGTACAAAG AGATTTTCTT CGACCAGAGC AAGAACGGCT ACGCCGGCTA CATTGACGGC
4921 GGAGCCAGCC AGGAAGAGTT CTACAAGTTC ATCAAGCCCA TCCTGGAAAA GATGGACGGC
4981 ACCGAGGAAC TGCTCGTGAA GCTGAACAGA GAGGACCTGC TGCGGAAGCA GCGGACCTTC
5041 GACAACGGCA GCATCCCCCA CCAGATCCAC CTGGGAGAGC TGCACGCCAT TCTGCGGCGG
5101 CAGGAAGATT TTTACCCATT CCTGAAGGAC AACCGGGAAA AGATCGAGAA GATCCTGACC
5161 TTCCGCATCC CCTACTACGT GGGCCCTCTG GCCAGGGGAA ACAGCAGATT CGCCTGGATG
5221 ACCAGAAAGA GCGAGGAAAC CATCACCCCC TGGAACTTCG AGGAAGTGGT GGACAAGGGC
5281 GCTTCCGCCC AGAGCTTCAT CGAGCGGATG ACCGCTTTCG ATAAGAACCT GCCCAACGAG
5341 AAGGTGCTGC CCAAGCACAG CCTGCTGTAC GAGTACTTCA CCGTGTATAA CGAGCTGACC
5401 AAAGTGAAAT ACGTGACCGA GGGAATGAGA AAGCCCGCCT TCCTGAGCGG CGAGCAGAAA
5461 AAGGCCATCG TGGACCTGCT GTTCAAGACC AACCGGAAAG TGACCGTGAA GCAGCTGAAA
5521 GAGGACTACT TCAAGAAAAT CGAGTGCTTC GACTCCGTGG AAATCTCCGG CGTGGAAGAT
5581 CGGTTCAACG CCTCCCTGGG CACATACCAC GATCTGCTGA AAATTATCAA GGACAAGGAC
5641 TTCCTGGACA ATGAGGAAAA CGAGGACATT CTGGAAGATA TCGTGCTGAC CCTGACACTG
5701 TTTGAGGACA GAGAGATGAT CGAGGAACGG CTGAAAACCT ATGCCCACCT GTTCGACGAC
5761 AAAGTGATGA AGCAGCTGAA GCGGCGGAGA TACACCGGCT GGGGAGCACT GAGCCGGAAG
5821 CTGATCAACG GCATCCGGGA CAAGCAGTCC GGCAAGACAA TCCTGGATTT CCTGAAGTCC
5881 GACGGCTTCG CCAACAGAAA CTTCATGGCT CTGATCCACG ACGACAGCCT GACCTTTAAA
5941 GAGGACATCC AGAAAGCCCA GGTGTCCGGC CAGGGCGATA GCCTGCACGA GCACATTGCC
6001 AATCTGGCCG GCAGCCCCGC CATTAAGAAG GGCATCCTGC AGACAGTGAA GGTGGTGGAC
6061 GAGCTCGTGA AAGTGATGGG CCGGCACAAG CCCGAGAACA TCGTGATCGA AATGGCCAGA
6121 GAGAACCAGA CCACCCAGAA GGGACAGAAG AACAGCCGCG AGAGAATGAA GCGGATCGAA
6181 GAGGGCATCA AAGAGCTGGG CAGCCAGATC CTGAAAGAAC ACCCCGTGGA AAACACCCAG
6241 CTGCAGAACG AGAAGCTGTA CCTGTACTAC CTGCAGAATG GGCGGGATAT GTACGTGGAC
6301 CAGGAACTGG ACATCAACCG GCTGTCCGAC TACGATGTGG ACCATATCGT GCCTCAGAGC
6361 TTTCTGAAGG ACGACTCCAT CGACAACAAG GTGCTGACCA GAAGCGACAA GAACCGGGGC
6421 AAGAGCGACA ACGTGCCCTC CGAAGAGGTC GTGAAGAAGA TGAAGAACTA CTGGCGGCAG
6481 CTGCTGAACG CCAAGCTGAT TACCCAGAGA AAGTTCGACA ATCTGACCAA GGCCGAGAGA
6541 GGCGGCCTGA GCGAACTGGA TAAGGCCGGC TTCATCAAGA GACAGCTGGT GGAAACCAGG
6601 GCAATCACAA AGCACGTGGC ACAGATCCTG GACTCCCGGA TGAACACTAA GTACGACGAG
6661 AATGACAAGC TGATCCGGGA AGTGAAAGTG ATCACCCTGA AGTCCAAGCT GGTGTCCGAT
6721 TTCCGGAAGG ATTTCCAGTT TTACAAAGTG CGCGAGATCA ACAACTACCA CCACGCCCAC
6781 GACGCCTACC TGAACGCCGT CGTGGGAACC GCCCTGATCA AAAAGTACCC TAAGCTGGAA
6841 AGCGAGTTCG TGTACGGCGA CTACAAGGTG TACGACGTGC GGAAGATGAT CGCCAAGAGC
6901 GAGCAGGAAA TCGGCAAGGC TACCGCCAAG TACTTCTTCT ACAGCAACAT CATGAACTTT
6961 TTCAAGACCG AGATTACCCT GGCCAACGGC GAGATCCGGA AGCGGCCTCT GATCGAGACA
7021 AACGGCGAAA CCGGGGAGAT CGTGTGGGAT AAGGGCCGGG ATTTTGCCAC CGTGCGGAAA
7081 GTGCTGAGCA TGCCCCAAGT GAATATCGTG AAAAAGACCG AGGTGCAGAC AGGCGGCTTC
7141 AGCAAAGAGT CTATCCTGCC CAAGAGGAAC AGCGATAAGC TGATCGCCAG AAAGAAGGAC
7201 TGGGACCCTA AGAAGTACGG CGGCTTCGAC AGCCCCACCG TGGCCTATTC TGTGCTGGTG
7261 GTGGCCAAAG TGGAAAAGGG CAAGTCCAAG AAACTGAAGA GTGTGAAAGA GCTGCTGGGG
7321 ATCACCATCA TGGAAAGAAG CAGCTTCGAG AAGAATCCCA TCGACTTTCT GGAAGCCAAG
7381 GGCTACAAAG AAGTGAAAAA GGACCTGATC ATCAAGCTGC CTAAGTACTC CCTGTTCGAG
7441 CTGGAAAACG GCCGGAAGAG AATGCTGGCC TCTGCCGGCG AACTGCAGAA GGGAAACGAA
7501 CTGGCCCTGC CCTCCAAATA TGTGAACTTC CTGTACCTGG CCAGCCACTA TGAGAAGCTG
7561 AAGGGCTCCC CCGAGGATAA TGAGCAGAAA CAGCTGTTTG TGGAACAGCA CAAGCACTAC
7621 CTGGACGAGA TCATCGAGCA GATCAGCGAG TTCTCCAAGA GAGTGATCCT GGCCGACGCT
7681 AATCTGGACA AAGTGCTGTC CGCCTACAAC AAGCACCGGG ATAAGCCCAT CAGAGAGCAG
7741 GCCGAGAATA TCATCCACCT GTTTACCCTG ACCAATCTGG GAGCCCCTGC CGCCTTCAAG
7801 TACTTTGACA CCACCATCGA CCGGAAGAGG TACACCAGCA CCAAAGAGGT GCTGGACGCC
7861 ACCCTGATCC ACCAGAGCAT CACCGGCCTG TACGAGACAC GGATCGACCT GTCTCAGCTG
7921 GGAGGCGACT CTAAAAGACC AGCAGCCACT AAAAAGGCCG GACAGGCTAA AAAGAAAAAG
7981 TAAGGTACCC GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC
8041 CTTCCTTGAC CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG
8101 CATCGCATTG TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA
8161 AGGGGGAGGA TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGGGT
8221 ACCAAGGTCG GGCAGGAAGA GGGCCTATTT CCCATGATTC CTTCATATTT GCATATACGA
8281 TACAAGGCTG TTAGAGAGAT AATTGGAATT AATTTGACTG TAAACACAAA GATATTAGTA
8341 CAAAATACGT GACGTAGAAA GTAATAATTT CTTGGGTAGT TTGCAGTTTT AAAATTATGT
8401 TTTAAAATGG ACTATCATAT GCTTACCGTA ACTTGAAAGT ATTTCGATTT CTTGGCTTTA
8461 TATATCTTGT GGAAAGGACG AAACACCGGA ATCGAAGAAG GAGGGTTTTA GAGCTAGAAA
8521 TAGCAAGTTA AAATAAGGCT AGTCCGTTAT CAACTTGAAA AAGTGGCACC GAGTCGGTGC
8581 TTTTTTAAAA AAGCACCGAC TCGGTGCCAC TTTTTCAAGT TGATAACGGA CTAGCCTTAT
8641 TTTAACTTGC TATTTCTAGC TCTAAAACGG CCCTGGTGTG TAGTTTGGGA AAGAGTGGTC
8701 TCATACAGAA CTTATAAGAT TCCCAAATCC AAAGACATTT CACGTTTATG GTGATTTCCC
8761 AGAACACATA GCGACATGCA AATATTGCAG GGCGCCACTC CCCTGTCCCT CACAGCCATC
8821 TTCCTGCCAG GGCGCACGCG CGCTGGGTGT TCCCGCCTAG TGACACTGGG CCCGCGATTC
8881 CTTGGAGCGG GTTGATGACG TCAGCGTTCG AATTCCATGG CGGCGCGGCG GCGACGGAGC
8941 ACCGGCGGCG GCAGGGCGAG AGGTTCGGAG CTCAATATCG CGGGACGGCA TGCGGGGGGC
9001 GGGCAGTCAG AAAGGAACGA TGCCACCTAC TGTGACCCCC TTCCCTTCAG CTCCCTATAA
9061 TCTAGAGATC CGACGCCGCC ATCTCTAGGC CCGCGCCGGC CCCCTCGCAC AGACTTGTGG
9121 GAGAAGCTCG GCTACTCCCC TGCCCCGGTT AATTTGCATA TAATATTTCC TAGTAACTAT
9181 AGAGGCTTAA TGTGCGATAA AAGACAGATA ATCTGTTCTT TTTAATACTA GCTACATTTT
9241 ACATGATAGG CTTGGATTTC TATAAGAGAT ACAAATACTA AATTATTATT TTAAAAAACA
9301 GCACAAAAGG AAACTCACCC TAACTGTAAA GTAATTGTGT GTTTTGAGAC TATAAATATC
9361 CCTTGGAGAA AAGCCTTGTT TGATAGACCC ATAAATCAGT TTTAGAGCTA GAAATAGCAA
9421 GTTAAAATAA GGCTAGTCCG TTATCAACTT GAAAAAGTGG CACCGAGTCG GTGCTTTTTT
9481 AAAAAAGCAC CGACTCGGTG CCACTTTTTC AAGTTGATAA CGGACTAGCC TTATTTTAAC
9541 TTGCTATTTC TAGCTCTAAA ACTCAAATCA CTCTTTGGCG AGGTACCCAG GCGGCGCACA
9601 AGCTATATAA ACCTGAAGGA AATCTCAACT TTACACTTAG GTCAAGTTAC TTATCGTACT
9661 AGAGCTTCAG CAGGAAATTT AACTAAAATC TAATTTAACC AGCATAGCAA ATATCATTTA
9721 TTCCCAAAAT GCTAAAGTTT GAGATAAACG GACTTGATTT CCGGCTGTTT TGACACTATC
9781 CAGAATGCCT TGCAGATGGG TGGGGCATGC TAAATACTGC AGACTAGTTG CGGGGAGGCG
9841 GCCCAAAGGG AGATCCGACT CGTCTGAGGG CGAAGGCGAA GACGCGGAAG AGGCCGCAGA
9901 GCCGGCAGCA GGCCGCGGGA AGGAAGGTCC GCTGGATTGA GGGCCGAAGG GACGTAGCAG
9961 AAGGACGTCC CGCGCAGAAT CCAGGTGGCA ACACAGGCGA GCAGCCAAGG AAAGGACGAT
10021 GATTTCCCCG ACAACACCAC GGAATTGTCA GTGCCCAACA GCCGAGCCCC TGTCCAGCAG
10081 CGGGCAAGGC AGGCGGCGAT GAGTTCCGCC GTGGCAATAG GGAGGGGGAA AGCGAAAGTC
10141 CCGGAAAGGA GCTGACAGGT GGTGGCAATG CCCCAACCAG TGGGGGTTGC GTCAGCAAAC
10201 ACAGTGCACA CCACGCCACG TTGCCTGACA ACGGGCCACA ACTCCTCATA AAGAGACAGC
10261 AACCAGGATT TATACAAGGA GGAGAAAATG AAAGCCATAC GGGAAGCAAT AGCATGATAC
10321 AAAGGCATTA AAGCAGCGTA TCCACATAGC GTAAAAGGAG CAACATAGTT AAGAATACCA
10381 GTCAATCTTT CACAAATTTT GTAATCCAGA GGTTGAACGG GGACAGCCCC CCCCCAAAGC
10441 CCCCAGGGAT GTAATTACGT CCCTCCCCCG CTAGGGGGCA GCAGCGAGCC GCCCGGGGCT
10501 CCGCTCCGGT CCGGCGCTCC CCCCGCATCC CCGAGCCGGC AGCGTGCGGG GACAGCCCGG
10561 GCACGGGGAA GGTGGCACGG GATCGCTTTC CTCTGAACGC TTCTCGCTGC TCTTTGAGCC
10621 TGCAGACACC TGGGGGGATA CGGGGAAAAA TGGAAGGGCT AATTTGGTCC CAAAAAAGAC
10681 AAGAGGTATA TAAGCAGCTG CTTTTTGCCT GTACTGGGTC TCTCTGGTTA GACCAGATCT
10741 GAGCCTGGGA GCTCTCTGGC TAACTAGGGA ACCCACTGCT TAAGCCTCAA TAAAGCTTGC
10801 CTTGAGTGCT TCAAGTAGTG TGTGCCCGTC TGTTGTGTGA CTCTGGTAAC TAGAGATCCC
10861 TCAGACCCTT TTAGTCAGTG TGGAAAATCT CTAGCATTAA TTAA
Seq ID No.9:The sequence of plasmid p7
1 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT
61 TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG
121 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC
181 TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC
241 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC
301 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC
361 TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT
421 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT
481 AACTACGGCT ACACTAGAAG AACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC
541 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT
601 TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG
661 ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC
721 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA
781 TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG
841 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG
901 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA
961 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG
1021 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA
1081 GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC
1141 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA
1201 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG
1261 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT
1321 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC
1381 AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG
1441 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG
1501 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT
1561 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA
1621 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA
1681 CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC
1741 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTT AATACGACTC ACTATAGGGC
1801 GAGGAATTCT GGAAGGGCTA ATTTGGTCCC AAAAAAGACA AGAGAATGCT TATGGACTAG
1861 GGACTGTTAC AAACAAATGA TGAATGGAAA CAGCTGAGCA TGACTCATAG CTGAAGCGCT
1921 AGCAGCTGCT TAACCGCAAA ACCACATCCT ATGTAAAGCT TGCTAATGAC GTATAAGTTG
1981 CTCCACTGTA AAAGTATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA
2041 CCAGATCTGA GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA
2101 AAGCTTGCCT TGAGTGCTTC AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA
2161 GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGCG CCCGAACAGG
2221 GACTTGAAAG CGAAAGGGAA ACCAGAGGAG CTCTCTCGAC GCAGGACTCG GCTTGCTGAA
2281 GCGCGCACGG CAAGAGGCGA GGGGCGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC
2341 GGAGGCTAGA AGGAGAGAGA TGGGTGCGAG AGCGTCAGTA TTAAGCGGGG GAGAATTAGA
2401 TCGCGATGGG AAAAAATTCG GTTAAGGCCA GGGGGAAAGA AAAAATATAA ATTAAAACAT
2461 ATAGTATGGG CAAGCAGGGA GCTAGAACGA TTCGCAGTTA ATCCTGGCCT GTTAGAAACA
2521 TCAGAAGGCT GTAGACAAAT ACTGGGACAG CTACAACCAT CCCTTCAGAC AGGATCAGAA
2581 GAACTTAGAT CATTATATAA TACAGTAGCA ACCCTCTATT GTGTGCATCA AAGGATAGAG
2641 ATAAAAGACA CCAAGGAAGC TTTAGACAAG ATAGAGGAAG AGCAAAACAA AAGTAAGACC
2701 ACCGCACAGC AAGCGGCCGG CCGCTGATCT TCAGACCTGG AGGAGGAGAT ATGAGGGACA
2761 ATTGGAGAAG TGAATTATAT AAATATAAAG TAGTAAAAAT TGAACCATTA GGAGTAGCAC
2821 CCACCAAGGC AAAGAGAAGA GTGGTGCAGA GAGAAAAAAG AGCAGTGGGA ATAGGAGCTT
2881 TGTTCCTTGG GTTCTTGGGA GCAGCAGGAA GCACTATGGG CGCAGCGTCA ATGACGCTGA
2941 CGGTACAGGC CAGACAATTA TTGTCTGGTA TAGTGCAGCA GCAGAACAAT TTGCTGAGGG
3001 CTATTGAGGC GCAACAGCAT CTGTTGCAAC TCACAGTCTG GGGCATCAAG CAGCTCCAGG
3061 CAAGAATCCT GGCTGTGGAA AGATACCTAA AGGATCAACA GCTCCTTTTA AAAGAAAAGG
3121 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
3181 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT CTCGAGGTGG
3241 AGAAGAGCAT GCTTGAGGGC TGAGTGCCCC TCAGTGGGCA GAGAGCACAT GGCCCACAGT
3301 CCCTGAGAAG TTGGGGGGAG GGGTGGGCAA TTGAACTGGT GCCTAGAGAA GGTGGGGCTT
3361 GGGTAAACTG GGAAAGTGAT GTGGTGTACT GGCTCCACCT TTTTCCCCAG GGTGGGGGAG
3421 AACCATATAT AAGTGCAGTA GTCTCTGTGA ACATTCTCCC TATCAGTGAT AGAGAACTCC
3481 CTATCAGTGA TAGAGAGGAT CCTGAACCGG TATGCCCAAG AAGAAACGCA AAGTCGCTTC
3541 AGACAAGAAG TACAGCATCG GCCTGGACAT CGGCACCAAC TCTGTGGGCT GGGCCGTGAT
3601 CACCGACGAG TACAAGGTGC CCAGCAAGAA ATTCAAGGTG CTGGGCAACA CCGACCGGCA
3661 CAGCATCAAG AAGAACCTGA TCGGAGCCCT GCTGTTCGAC AGCGGCGAAA CAGCCGAGGC
3721 CACCCGGCTG AAGAGAACCG CCAGAAGAAG ATACACCAGA CGGAAGAACC GGATCTGCTA
3781 TCTGCAAGAG ATCTTCAGCA ACGAGATGGC CAAGGTGGAC GACAGCTTCT TCCACAGACT
3841 GGAAGAGTCC TTCCTGGTGG AAGAGGATAA GAAGCACGAG CGGCACCCCA TCTTCGGCAA
3901 CATCGTGGAC GAGGTGGCCT ACCACGAGAA GTACCCCACC ATCTACCACC TGAGAAAGAA
3961 ACTGGTGGAC AGCACCGACA AGGCCGACCT GCGGCTGATC TATCTGGCCC TGGCCCACAT
4021 GATCAAGTTC CGGGGCCACT TCCTGATCGA GGGCGACCTG AACCCCGACA ACAGCGACGT
4081 GGACAAGCTG TTCATCCAGC TGGTGCAGAC CTACAACCAG CTGTTCGAGG AAAACCCCAT
4141 CAACGCCAGC GGCGTGGACG CCAAGGCCAT CCTGTCTGCC AGACTGAGCA AGAGCAGACG
4201 GCTGGAAAAT CTGATCGCCC AGCTGCCCGG CGAGAAGAAG AATGGCCTGT TCGGAAACCT
4261 GATTGCCCTG AGCCTGGGCC TGACCCCCAA CTTCAAGAGC AACTTCGACC TGGCCGAGGA
4321 TGCCAAACTG CAGCTGAGCA AGGACACCTA CGACGACGAC CTGGACAACC TGCTGGCCCA
4381 GATCGGCGAC CAGTACGCCG ACCTGTTTCT GGCCGCCAAG AACCTGTCCG ACGCCATCCT
4441 GCTGAGCGAC ATCCTGAGAG TGAACACCGA GATCACCAAG GCCCCCCTGA GCGCCTCTAT
4501 GATCAAGAGA TACGACGAGC ACCACCAGGA CCTGACCCTG CTGAAAGCTC TCGTGCGGCA
4561 GCAGCTGCCT GAGAAGTACA AAGAGATTTT CTTCGACCAG AGCAAGAACG GCTACGCCGG
4621 CTACATTGAC GGCGGAGCCA GCCAGGAAGA GTTCTACAAG TTCATCAAGC CCATCCTGGA
4681 AAAGATGGAC GGCACCGAGG AACTGCTCGT GAAGCTGAAC AGAGAGGACC TGCTGCGGAA
4741 GCAGCGGACC TTCGACAACG GCAGCATCCC CCACCAGATC CACCTGGGAG AGCTGCACGC
4801 CATTCTGCGG CGGCAGGAAG ATTTTTACCC ATTCCTGAAG GACAACCGGG AAAAGATCGA
4861 GAAGATCCTG ACCTTCCGCA TCCCCTACTA CGTGGGCCCT CTGGCCAGGG GAAACAGCAG
4921 ATTCGCCTGG ATGACCAGAA AGAGCGAGGA AACCATCACC CCCTGGAACT TCGAGGAAGT
4981 GGTGGACAAG GGCGCTTCCG CCCAGAGCTT CATCGAGCGG ATGACCGCTT TCGATAAGAA
5041 CCTGCCCAAC GAGAAGGTGC TGCCCAAGCA CAGCCTGCTG TACGAGTACT TCACCGTGTA
5101 TAACGAGCTG ACCAAAGTGA AATACGTGAC CGAGGGAATG AGAAAGCCCG CCTTCCTGAG
5161 CGGCGAGCAG AAAAAGGCCA TCGTGGACCT GCTGTTCAAG ACCAACCGGA AAGTGACCGT
5221 GAAGCAGCTG AAAGAGGACT ACTTCAAGAA AATCGAGTGC TTCGACTCCG TGGAAATCTC
5281 CGGCGTGGAA GATCGGTTCA ACGCCTCCCT GGGCACATAC CACGATCTGC TGAAAATTAT
5341 CAAGGACAAG GACTTCCTGG ACAATGAGGA AAACGAGGAC ATTCTGGAAG ATATCGTGCT
5401 GACCCTGACA CTGTTTGAGG ACAGAGAGAT GATCGAGGAA CGGCTGAAAA CCTATGCCCA
5461 CCTGTTCGAC GACAAAGTGA TGAAGCAGCT GAAGCGGCGG AGATACACCG GCTGGGGAGC
5521 ACTGAGCCGG AAGCTGATCA ACGGCATCCG GGACAAGCAG TCCGGCAAGA CAATCCTGGA
5581 TTTCCTGAAG TCCGACGGCT TCGCCAACAG AAACTTCATG GCTCTGATCC ACGACGACAG
5641 CCTGACCTTT AAAGAGGACA TCCAGAAAGC CCAGGTGTCC GGCCAGGGCG ATAGCCTGCA
5701 CGAGCACATT GCCAATCTGG CCGGCAGCCC CGCCATTAAG AAGGGCATCC TGCAGACAGT
5761 GAAGGTGGTG GACGAGCTCG TGAAAGTGAT GGGCCGGCAC AAGCCCGAGA ACATCGTGAT
5821 CGAAATGGCC AGAGAGAACC AGACCACCCA GAAGGGACAG AAGAACAGCC GCGAGAGAAT
5881 GAAGCGGATC GAAGAGGGCA TCAAAGAGCT GGGCAGCCAG ATCCTGAAAG AACACCCCGT
5941 GGAAAACACC CAGCTGCAGA ACGAGAAGCT GTACCTGTAC TACCTGCAGA ATGGGCGGGA
6001 TATGTACGTG GACCAGGAAC TGGACATCAA CCGGCTGTCC GACTACGATG TGGACCATAT
6061 CGTGCCTCAG AGCTTTCTGA AGGACGACTC CATCGACAAC AAGGTGCTGA CCAGAAGCGA
6121 CAAGAACCGG GGCAAGAGCG ACAACGTGCC CTCCGAAGAG GTCGTGAAGA AGATGAAGAA
6181 CTACTGGCGG CAGCTGCTGA ACGCCAAGCT GATTACCCAG AGAAAGTTCG ACAATCTGAC
6241 CAAGGCCGAG AGAGGCGGCC TGAGCGAACT GGATAAGGCC GGCTTCATCA AGAGACAGCT
6301 GGTGGAAACC AGGGCAATCA CAAAGCACGT GGCACAGATC CTGGACTCCC GGATGAACAC
6361 TAAGTACGAC GAGAATGACA AGCTGATCCG GGAAGTGAAA GTGATCACCC TGAAGTCCAA
6421 GCTGGTGTCC GATTTCCGGA AGGATTTCCA GTTTTACAAA GTGCGCGAGA TCAACAACTA
6481 CCACCACGCC CACGACGCCT ACCTGAACGC CGTCGTGGGA ACCGCCCTGA TCAAAAAGTA
6541 CCCTAAGCTG GAAAGCGAGT TCGTGTACGG CGACTACAAG GTGTACGACG TGCGGAAGAT
6601 GATCGCCAAG AGCGAGCAGG AAATCGGCAA GGCTACCGCC AAGTACTTCT TCTACAGCAA
6661 CATCATGAAC TTTTTCAAGA CCGAGATTAC CCTGGCCAAC GGCGAGATCC GGAAGCGGCC
6721 TCTGATCGAG ACAAACGGCG AAACCGGGGA GATCGTGTGG GATAAGGGCC GGGATTTTGC
6781 CACCGTGCGG AAAGTGCTGA GCATGCCCCA AGTGAATATC GTGAAAAAGA CCGAGGTGCA
6841 GACAGGCGGC TTCAGCAAAG AGTCTATCCT GCCCAAGAGG AACAGCGATA AGCTGATCGC
6901 CAGAAAGAAG GACTGGGACC CTAAGAAGTA CGGCGGCTTC GACAGCCCCA CCGTGGCCTA
6961 TTCTGTGCTG GTGGTGGCCA AAGTGGAAAA GGGCAAGTCC AAGAAACTGA AGAGTGTGAA
7021 AGAGCTGCTG GGGATCACCA TCATGGAAAG AAGCAGCTTC GAGAAGAATC CCATCGACTT
7081 TCTGGAAGCC AAGGGCTACA AAGAAGTGAA AAAGGACCTG ATCATCAAGC TGCCTAAGTA
7141 CTCCCTGTTC GAGCTGGAAA ACGGCCGGAA GAGAATGCTG GCCTCTGCCG GCGAACTGCA
7201 GAAGGGAAAC GAACTGGCCC TGCCCTCCAA ATATGTGAAC TTCCTGTACC TGGCCAGCCA
7261 CTATGAGAAG CTGAAGGGCT CCCCCGAGGA TAATGAGCAG AAACAGCTGT TTGTGGAACA
7321 GCACAAGCAC TACCTGGACG AGATCATCGA GCAGATCAGC GAGTTCTCCA AGAGAGTGAT
7381 CCTGGCCGAC GCTAATCTGG ACAAAGTGCT GTCCGCCTAC AACAAGCACC GGGATAAGCC
7441 CATCAGAGAG CAGGCCGAGA ATATCATCCA CCTGTTTACC CTGACCAATC TGGGAGCCCC
7501 TGCCGCCTTC AAGTACTTTG ACACCACCAT CGACCGGAAG AGGTACACCA GCACCAAAGA
7561 GGTGCTGGAC GCCACCCTGA TCCACCAGAG CATCACCGGC CTGTACGAGA CACGGATCGA
7621 CCTGTCTCAG CTGGGAGGCG ACTCTAAAAG ACCAGCAGCC ACTAAAAAGG CCGGACAGGC
7681 TAAAAAGAAA AAGTAAGGTA CCCGACTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC
7741 CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT TTCCTAATAA
7801 AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG
7861 GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GGATGCGGTG
7921 GGCTCTATGG GGTACCGATT CCGGGTCACT GTGAGTGGGG GAGGCAGGGA AGAAGGGCTC
7981 ACAGGACAGT CAAACCATGC CCCCTGTTTT TCCTTCTTCA AGTAGACCTC TATAAGACAA
8041 CAGAGACAAC TAAGGCTGAG TGGCCAGGCG AGGAGAAACC ATCTCGCCGT AAAACATGGA
8101 AGGAACACTT CAGGGGAAAG GTGGTATCTC TAAGCAAGAG AACTGAGTGG AGTCAAGGCT
8161 GAGAGATGCA GGATAAGCAA ATGGGTAGTG AAAAGACATT CATGAGGACA GCTAAAACAA
8221 TAAGTAATGT AAAATACAGC ATAGCAAAAC TTTAACCTCC AAATCAAGCC TCTACTTGAA
8281 TCCTTTTCTG AGGGATGAAT AAGGCATAGG CATCAGGGGC TGTTGCCAAT GTGCATTAGC
8341 TGTTTGCAGC CTCACCTTCT TTCATGGAGT TTAAGATATA GTGTATTTTC CCAAGGTTTG
8401 AACTAGCTCT TCATTTCTTT ATGTTTTAAA TGCACTGACC TCCCACATTC CCTTTTTAGT
8461 AAAATATTCA GAAATAATTT AAATACATCA TTGCAATGAA AATAAATGTT TTTTATTAGG
8521 CAGAATCCAG ATGCTCAAGG CCCTTCATAA TATCCCCCAG TTTAGTAGTT GGACTTAGGG
8581 AACAAAGGAA CCTTTAATAG AAATTGGACA GCAAGAAAGC GAGCTTAGTG ATACTTGTGG
8641 GCCAGGGCAT TAGCCACACC AGCCACCACT TTCTGATAGG CAGCCTGCAC TGGTGGGGTG
8701 AATTAATAAG ATCTGAATTC CCGGGATCCG CTGTACGCGG ACCCACTTTC ACATTTAAGT
8761 TGTTTTTCTA ATCCGCATAT GATCAATTCA AGGCCGAATA AGAAGGCTGG CTCTGCACCT
8821 TGGTGATCAA ATAATTCGAT AGCTTGTCGT AATAATGGCG GCATACTATC AGTAGTAGGT
8881 GTTTCCCTTT CTTCTTTAGC GACTTGATGC TCTTGATCTT CCAATACGCA ACCTAAAGTA
8941 AAATGCCCCA CAGCGCTGAG TGCATATAAT GCATTCTCTA GTGAAAAACC TTGTTGGCAT
9001 AAAAAGGCTA ATTGATTTTC GAGAGTTTCA TACTGTTTTT CTGTAGGCCG TGTACCTAAA
9061 TGTACTTTTG CTCCATCGCG ATGACTTAGT AAAGCACATC TAAAACTTTT AGCGTTATTA
9121 CGTAAAAAAT CTTGCCAGCT TTCCCCTTCT AAAGGGCAAA AGTGAGTATG GTGCCTATCT
9181 AACATCTCAA TGGCTAAGGC GTCGAGCAAA GCCCGCTTAT TTTTTACATG CCAATACAAT
9241 GTAGGCTGCT CTACACCTAG CTTCTGGGCG AGTTTACGGG TTGTTAAACC TTCGATTCCG
9301 ACCTCATTAA GCAGCTCTAA TGCGCTGTTA ATCACTTTAC TTTTATCTAA TCTAGACATG
9361 GTTTAGTTCC TCACCTTGTC GTATTATACT ATGCCGATAT ACTATGCCGA TGATTAATTG
9421 TCAACACGTG CTGCTGCAGG TCGAAAGGCC CGGAGATGAG GAAGAGGAGA ACAGCGCGGC
9481 AGACGTGCGC TTTTGAAGCG TGCAGAATGC CGGGCCTCCG GAGGACCTTC GGGCGCCCGC
9541 CCCGCCCCTG AGCCCGCCCC TGAGCCCGCC CCCGGACCCA CCCCTTCCCA GCCTCTGAGC
9601 CCAGAAAGCG AAGGAGTAAA GCTGCTATTG GCCGCTGCCC CAAAGGCCTA CCCGCTTCCA
9661 TTGCTCAGCG GTGCTGTCCA TCTGCACGAG ACTAGTGAGA CGTGCTACTT CCATTTGTCA
9721 CGTCCTGCAC GACGCGAGCT GCGGGGCGGG GGGGAACTTC CTGACTAGGG GAGGAGTAGA
9781 AGGTGGCGCG AAGGGGCCAC CAAAGAACGG AGCCGGTTGG CGCCTACCGG TGGATGTGGA
9841 ATGTGTGCGA GGCCAGAGGC CACTTGTGTA GCGCCAAGTG CCCAGCGGGG CTGCTAAAGC
9901 GCATGCTCCA GACTGCCTTG GGAAAAGCGC CTCCCCTACC CGGTAGGCGG CCGCATACAC
9961 TAGTTGGAAG GGCTAATTTG GTCCCAAAAA AGACAAGAGG TATATAAGCA GCTGCTTTTT
10021 GCCTGTACTG GGTCTCTCTG GTTAGACCAG ATCTGAGCCT GGGAGCTCTC TGGCTAACTA
10081 GGGAACCCAC TGCTTAAGCC TCAATAAAGC TTGCCTTGAG TGCTTCAAGT AGTGTGTGCC
10141 CGTCTGTTGT GTGACTCTGG TAACTAGAGA TCCCTCAGAC CCTTTTAGTC AGTGTGGAAA
10201 ATCTCTAGCA TTAATTAA
Seq ID No.10:ARRPKFYRAPYVKNHPNVWGPWVAYGP
Seq ID No.11:H-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-
NH2
Seq ID No.12:H-ARRPKFYRAPYVKNHPNVWGPWVAYGPSASTTTNYT-NH2
Seq ID No.13:H-ASTTTNYTSARRPKFYRAPYVKNHPNVWGPWVAYGP-NH2
Seq ID No.14:H-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-
Ser-Ala-Ser-Thr-Thr-Thr-Asn-Tyr-Thr-NH2
Seq ID No.15:Ala-Ser-Thr-Thr-Thr-Asn-Tyr-Thr-Ser-Arg-Arg-Nal-Cys-Tyr-Cit-
Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-NH2
Seq ID No.16:Ala-Arg-Arg-Pro-Lys-Phe-Tyr-Arg-Ala-Pro-Tyr-Val-Lys-Asn-His-
Pro-Asn-Val-Trp-Gly-Pro-Trp-Val-Ala-Tyr-Gly-Pro-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-
D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-NH2
Seq ID No.17:H-Arg-Arg-Nal-Cys-Tyr-Cit-Lys-D-Lys-Pro-Tyr-Arg-Cit-Cys-Arg-
Ala-Arg-Arg-Pro-Lys-Phe-Tyr-Arg-Ala-Pro-Tyr-Val-Lys-Asn-His-Pro-Asn-Val-Trp-
Gly-Pro-Trp-Val-Ala-Tyr-Gly-Pro-NH2
Seq ID No.18:
1 ATGAAATACCTGCTGCCGACGGCTGCTGCTGGTCTGCTGCTCCTCGCTGCCCAGCCGGCG
1 M K Y L L P T A A A G L L L L A A Q P A
61 ATGGCCAGTCAGGTGCAGTTACAGCAATGGGGCGCAGGTCTGTTGAAACCTTCGGAAACC
21 M A S Q V Q L Q Q W G A G L L K P S E T
121 CTGTCCCTTACCTGCGCTGTCTATGGTGGCTCCTTTAGTGGTTACTACTGGAGCTGGATT
41 L S L T C A V Y G G S F S G Y Y W S W I
181 CGTCAGCCGCCAGGTAAAGGGCTGGAATGGATTGGGGAAATCAATCATAGTGGAAGCACC
61 R Q P P G K G L E W I G E I N H S G S T
241 AACTACAACCCGTCCCTGAAGAGTCGTGTCACCATTTCAGTCGATACGTCCAAGAACCAG
81 N Y N P S L K S R V T I S V D T S K N Q
301 TTTTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGATACGGCTGTGTATTACTGTGCGAGA
101 F S L K L S S V T A A D T A V Y Y C A R
361 GTAATTAATTGGTTCGATCCCTGGGGCCAAGGTACCCTGGTCACCGTCTCCTCAGGTGGC
121 V I N W F D P W G Q G T L V T V S S G G
421 GGTGGCTCTGGCGGTGGAGGGAGTGGTGGCGGTGGCTCTGATATTCAGATGACCCAGTCT
141 G G S G G G G S G G G G S D I Q M T Q S
481 CCATCTTCCGTGTCTGCGTCTGTAGGAGACCGCGTCACCATCACTTGTCGGGCGAGTCAG
161 P S S V S A S V G D R V T I T C R A S Q
541 GATATTAGCAGCTGGTTAGCCTGGTATCAGCATAAACCAGGCAAAGCCCCTAAGTTACTG
181 D I S S W L A W Y Q H K P G K A P K L L
601 ATCTATGCTGCGTCCAGTTTGCAAAGTGGGGTCCCATCACGTTTCAGCGGCAGTGGATCT
201 I Y A A S S L Q S G V P S R F S G S G S
661 GGGACAGATTTCACTCTCACCATTAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTAT
221 G T D F T L T I S S L Q P E D F A T Y Y
721 TGTCAACAGGCTAATAGTTTCCCGTACACTTTTGGCCAGGGTACCAAACTGGAGATCAAA
241 C Q Q A N S F P Y T F G Q G T K L E I K
781 TAA
261 *
Seq ID No. 19:
1 ATGAAATACCTGCTGCCGACGGCTGCTGCTGGTCTGCTGCTCCTCGCTGCCCAGCCGGCG
1 M K Y L L P T A A A G L L L L A A Q P A
61 ATGGCCGCAAGCACCACGACTAATTATACCAGTCAGGTGCAGTTACAGCAATGGGGCGCA
21 M A A S T T T N Y T S Q V Q L Q Q W G A
121 GGTCTGTTGAAACCTTCGGAAACCCTGTCCCTTACCTGCGCTGTCTATGGTGGCTCCTTT
41 G L L K P S E T L S L T C A V Y G G S F
181 AGTGGTTACTACTGGAGCTGGATTCGTCAGCCGCCAGGTAAAGGGCTGGAATGGATTGGG
61 S G Y Y W S W I R Q P P G K G L E W I G
241 GAAATCAATCATAGTGGAAGCACCAACTACAACCCGTCCCTGAAGAGTCGTGTCACCATT
81 E I N H S G S T N Y N P S L K S R V T I
301 TCAGTCGATACGTCCAAGAACCAGTTTTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGAT
101 S V D T S K N Q F S L K L S S V T A A D
361 ACGGCTGTGTATTACTGTGCGAGAGTAATTAATTGGTTCGATCCCTGGGGCCAAGGTACC
121 T A V Y Y C A R V I N W F D P W G Q G T
421 CTGGTCACCGTCTCCTCAGGTGGCGGTGGCTCTGGCGGTGGAGGGAGTGGTGGCGGTGGC
141 L V T V S S G G G G S G G G G S G G G G
481 TCTGATATTCAGATGACCCAGTCTCCATCTTCCGTGTCTGCGTCTGTAGGAGACCGCGTC
161 S D I Q M T Q S P S S V S A S V G D R V
541 ACCATCACTTGTCGGGCGAGTCAGGATATTAGCAGCTGGTTAGCCTGGTATCAGCATAAA
181 T I T C R A S Q D I S S W L A W Y Q H K
601 CCAGGCAAAGCCCCTAAGTTACTGATCTATGCTGCGTCCAGTTTGCAAAGTGGGGTCCCA
201 P G K A P K L L I Y A A S S L Q S G V P
661 TCACGTTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTCACCATTAGCAGCCTGCAG
221 S R F S G S G S G T D F T L T I S S L Q
721 CCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAATAGTTTCCCGTACACTTTTGGC
241 P E D F A T Y Y C Q Q A N S F P Y T F G
781 CAGGGTACCAAACTGGAGATCAAATAA
261 Q G T K L E I K *
Seq ID No.20, the sequence of plasmid p5
1 ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT
61 TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG
121 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC
181 TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC
241 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC
301 AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC
361 TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT
421 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT
481 AACTACGGCT ACACTAGAAG AACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC
541 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT
601 TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG
661 ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC
721 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA
781 TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG
841 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG
901 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA
961 GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG
1021 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA
1081 GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC
1141 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA
1201 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG
1261 ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT
1321 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC
1381 AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG
1441 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG
1501 GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT
1561 GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA
1621 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA
1681 CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC
1741 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTT AATACGACTC ACTATAGGGC
1801 GAGGAATTCT GGAAGGGCTA ATTTGGTCCC AAAAAAGACA AGAGAATGCT TATGGACTAG
1861 GGACTGTTAC AAACAAATGA TGAATGGAAA CAGCTGAGCA TGACTCATAG CTGAAGCGCT
1921 AGCAGCTGCT TAACCGCAAA ACCACATCCT ATGTAAAGCT TGCTAATGAC GTATAAGTTG
1981 CTCCACTGTA AAAGTATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA
2041 CCAGATCTGA GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA
2101 AAGCTTGCCT TGAGTGCTTC AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA
2161 GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGCG CCCGAACAGG
2221 GACTTGAAAG CGAAAGGGAA ACCAGAGGAG CTCTCTCGAC GCAGGACTCG GCTTGCTGAA
2281 GCGCGCACGG CAAGAGGCGA GGGGCGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC
2341 GGAGGCTAGA AGGAGAGAGA TGGGTGCGAG AGCGTCAGTA TTAAGCGGGG GAGAATTAGA
2401 TCGCGATGGG AAAAAATTCG GTTAAGGCCA GGGGGAAAGA AAAAATATAA ATTAAAACAT
2461 ATAGTATGGG CAAGCAGGGA GCTAGAACGA TTCGCAGTTA ATCCTGGCCT GTTAGAAACA
2521 TCAGAAGGCT GTAGACAAAT ACTGGGACAG CTACAACCAT CCCTTCAGAC AGGATCAGAA
2581 GAACTTAGAT CATTATATAA TACAGTAGCA ACCCTCTATT GTGTGCATCA AAGGATAGAG
2641 ATAAAAGACA CCAAGGAAGC TTTAGACAAG ATAGAGGAAG AGCAAAACAA AAGTAAGACC
2701 ACCGCACAGC AAGCGGCCGG CCGCTGATCT TCAGACCTGG AGGAGGAGAT ATGAGGGACA
2761 ATTGGAGAAG TGAATTATAT AAATATAAAG TAGTAAAAAT TGAACCATTA GGAGTAGCAC
2821 CCACCAAGGC AAAGAGAAGA GTGGTGCAGA GAGAAAAAAG AGCAGTGGGA ATAGGAGCTT
2881 TGTTCCTTGG GTTCTTGGGA GCAGCAGGAA GCACTATGGG CGCAGCGTCA ATGACGCTGA
2941 CGGTACAGGC CAGACAATTA TTGTCTGGTA TAGTGCAGCA GCAGAACAAT TTGCTGAGGG
3001 CTATTGAGGC GCAACAGCAT CTGTTGCAAC TCACAGTCTG GGGCATCAAG CAGCTCCAGG
3061 CAAGAATCCT GGCTGTGGAA AGATACCTAA AGGATCAACA GCTCCTGGGG ATTTGGGGTT
3121 GCTCTGGAAA ACTCATTTGC ACCACTGCTG TGCCTTGGAA TGCTAGTTGG AGTAATAAAT
3181 CTCTGGAACA GATTTGGAAT CACACGACCT GGATGGAGTG GGACAGAGAA ATTAACAATT
3241 ACACAAGCTT AATACACTCC TTAATTGAAG AATCGCAAAA CCAGCAAGAA AAGAATGAAC
3301 AAGAATTATT GGAATTAGAT AAATGGGCAA GTTTGTGGAA TTGGTTTAAC ATAACAAATT
3361 GGCTGTGGTA TATAAAATTA TTCATAATGA TAGTAGGAGG CTTGGTAGGT TTAAGAATAG
3421 TTTTTGCTGT ACTTTCTATA GTGAATAGAG TTAGGCAGGG ATATTCACCA TTATCGTTTC
3481 AGACCCACCT CCCAACCCCG AGGGGACCCG ACAGGCCCGA AGGAATAGAA GAAGAAGGTG
3541 GAGAGAGAGA CAGAGACAGA TCCATTCGAT TAGTGAACGG ATCTCGACGG TATCGCCTTT
3601 AAAAGAAAAG GGGGGATTGG GGGGTACAGT GCAGGGGAAA GAATAGTAGA CATAATAGCA
3661 ACAGACATAC AAACTAAAGA ATTACAAAAA CAAATTACAA AAATTCAAAA TTTTCGGGTT
3721 TCTCGAGACG GGGACAGCCC CCCCCCAAAG CCCCCAGGGA TGTAATTACG TCCCTCCCCC
3781 GCTAGGGGGC AGCAGCGAGC CGCCCGGGGC TCCGCTCCGG TCCGGCGCTC CCCCCGCATC
3841 CCCGAGCCGG CAGCGTGCGG GGACAGCCCG GGCACGGGGA AGGTGGCACG GGATCGCTTT
3901 CCTCTGAACG CTTCTCGCTG CTCTTTGAGC CTGCAGACAC CTGGGGGGAT ACGGGGAAAA
3961 AGGATCCAAG GTCGGGCAGG AAGAGGGCCT ATTTCCCATG ATTCCTTCAT ATTTGCATAT
4021 ACGATACAAG GCTGTTAGAG AGATAATTGG AATTAATTTG ACTGTAAACA CAAAGATATT
4081 AGTACAAAAT ACGTGACGTA GAAAGTAATA ATTTCTTGGG TAGTTTGCAG TTTTAAAATT
4141 ATGTTTTAAA ATGGACTATC ATATGCTTAC CGTAACTTGA AAGTATTTCG ATTTCTTGGC
4201 TTTATATATC TTGTGGAAAG GACGAAACAC CGGAATCGAA GAAGGAGGGT TTTAGAGCTA
4261 GAAATAGCAA GTTAAAATAA GGCTAGTCCG TTATCAACTT GAAAAAGTGG CACCGAGTCG
4321 GTGCTTTTTT AAAAAAGCAC CGACTCGGTG CCACTTTTTC AAGTTGATAA CGGACTAGCC
4381 TTATTTTAAC TTGCTATTTC TAGCTCTAAA ACGGCCCTGG TGTGTAGTTT GGGAAAGAGT
4441 GGTCTCATAC AGAACTTATA AGATTCCCAA ATCCAAAGAC ATTTCACGTT TATGGTGATT
4501 TCCCAGAACA CATAGCGACA TGCAAATATT GCAGGGCGCC ACTCCCCTGT CCCTCACAGC
4561 CATCTTCCTG CCAGGGCGCA CGCGCGCTGG GTGTTCCCGC CTAGTGACAC TGGGCCCGCG
4621 ATTCCTTGGA GCGGGTTGAT GACGTCAGCG TTCGAATTCC ATGGCGGCGC GGCGGCGACG
4681 GAGCACCGGC GGCGGCAGGG CGAGAGGTTC GGAGCTCAAT ATCGCGGGAC GGCATGCGGG
4741 GGGCGGGCAG TCAGAAAGGA ACGATGCCAC CTACTGTGAC CCCCTTCCCT TCAGCTCCCT
4801 ATAAACCGGT GATCCGACGC CGCCATCTCT AGGCCCGCGC CGGCCCCCTC GCACAGACTT
4861 GTGGGAGAAG CTCGGCTACT CCCCTGCCCC GGTTAATTTG CATATAATAT TTCCTAGTAA
4921 CTATAGAGGC TTAATGTGCG ATAAAAGACA GATAATCTGT TCTTTTTAAT ACTAGCTACA
4981 TTTTACATGA TAGGCTTGGA TTTCTATAAG AGATACAAAT ACTAAATTAT TATTTTAAAA
5041 AACAGCACAA AAGGAAACTC ACCCTAACTG TAAAGTAATT GTGTGTTTTG AGACTATAAA
5101 TATCCCTTGG AGAAAAGCCT TGTTTGATAG ACCCATAAAT CAGTTTTAGA GCTAGAAATA
5161 GCAAGTTAAA ATAAGGCTAG TCCGTTATCA ACTTGAAAAA GTGGCACCGA GTCGGTGCTT
5221 TTTTAAAAAA GCACCGACTC GGTGCCACTT TTTCAAGTTG ATAACGGACT AGCCTTATTT
5281 TAACTTGCTA TTTCTAGCTC TAAAACTCAA ATCACTCTTT GGCGAGGTAC CCAGGCGGCG
5341 CACAAGCTAT ATAAACCTGA AGGAAATCTC AACTTTACAC TTAGGTCAAG TTACTTATCG
5401 TACTAGAGCT TCAGCAGGAA ATTTAACTAA AATCTAATTT AACCAGCATA GCAAATATCA
5461 TTTATTCCCA AAATGCTAAA GTTTGAGATA AACGGACTTG ATTTCCGGCT GTTTTGACAC
5521 TATCCAGAAT GCCTTGCAGA TGGGTGGGGC ATGCTAAATA CTGCAGGGTA CCTGCGGGGA
5581 GGCGGCCCAA AGGGAGATCC GACTCGTCTG AGGGCGAAGG CGAAGACGCG GAAGAGGCCG
5641 CAGAGCCGGC AGCAGGCCGC GGGAAGGAAG GTCCGCTGGA TTGAGGGCCG AAGGGACGTA
5701 GCAGAAGGAC GTCCCGCGCA GAATCCAGGT GGCAACACAG GCGAGCAGCC AAGGAAAGGA
5761 CGATGATTTC CCCGACAACA CCACGGAATT GTCAGTGCCC AACAGCCGAG CCCCTGTCCA
5821 GCAGCGGGCA AGGCAGGCGG CGATGAGTTC CGCCGTGGCA ATAGGGAGGG GGAAAGCGAA
5881 AGTCCCGGAA AGGAGCTGAC AGGTGGTGGC AATGCCCCAA CCAGTGGGGG TTGCGTCAGC
5941 AAACACAGTG CACACCACGC CACGTTGCCT GACAACGGGC CACAACTCCT CATAAAGAGA
6001 CAGCAACCAG GATTTATACA AGGAGGAGAA AATGAAAGCC ATACGGGAAG CAATAGCATG
6061 ATACAAAGGC ATTAAAGCAG CGTATCCACA TAGCGTAAAA GGAGCAACAT AGTTAAGAAT
6121 ACCAGTCAAT CTTTCACAAA TTTTGTAATC CAGAGGTTGA ACGGGGACAG CCCCCCCCCA
6181 AAGCCCCCAG GGATGTAATT ACGTCCCTCC CCCGCTAGGG GGCAGCAGCG AGCCGCCCGG
6241 GGCTCCGCTC CGGTCCGGCG CTCCCCCCGC ATCCCCGAGC CGGCAGCGTG CGGGGACAGC
6301 CCGGGCACGG GGAAGGTGGC ACGGGATCGC TTTCCTCTGA ACGCTTCTCG CTGCTCTTTG
6361 AGCCTGCAGA CACCTGGGGG GATACGGGGA AAAATCTAGA TGGAAGGGCT AATTTGGTCC
6421 CAAAAAAGAC AAGAGGTATA TAAGCAGCTG CTTTTTGCCT GTACTGGGTC TCTCTGGTTA
6481 GACCAGATCT GAGCCTGGGA GCTCTCTGGC TAACTAGGGA ACCCACTGCT TAAGCCTCAA
6541 TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG TGTGCCCGTC TGTTGTGTGA CTCTGGTAAC
6601 TAGAGATCCC TCAGACCCTT TTAGTCAGTG TGGAAAATCT CTAGCAACTA GTAGCAAGGT
6661 CGCCACGCAC AAGATCAATA TTAACAATCA GTCATCTCTC TTTAGCAATA AAAAGGTGAA
6721 AAATTACATT TTAAAAATGA CACCATAGAC GATGTATGAA AATAATCTAC TTGGAAATAA
6781 ATCTAGGCAA AGAAGTGCAA GACTGTTACC CAGAAAACTT ACAAATTGTA AATGAGAGGT
6841 TAGTGAAGAT TTAAATGAAT GAAGATCTAA ATAAACTTAT AAATTGTGAG AGAAATTAAT
6901 GAATGTCTAA GTTAATGCAG AAACGGAGAG ACATACTATA TTCATGAACT AAAAGACTTA
6961 ATATTGTGAA GGTATACTTT CTTTTCACAT AAATTTGTAG TCAATATGTT CACCCCAAAA
7021 AAGCTGTTTG TTAACTTGTC AACCTCATTT CAAAATGTAT ATAGAAAGCC CAAAGACAAT
7081 AACAAAAATA TTCTTGTAGA ACAAAATGGG AAAGAATGTT CCACTAAATA TCAAGATTTA
7141 GAGCAAAGCA TGAGATGTGT GGGGATAGAC AGTGAGGCTG ATAAAATAGA GTAGAGCTCA
7201 GAAACAGACC CATTGATATA TGTAAGTGAC CTATGAAAAA AATATGGCAT TTTACAATGG
7261 GAAAATGATG ATCTTTTTCT TTTTTAGAAA AACAGGGAAA TATATTTATA TGTAAAAAAT
7321 AAAAGGGAAC CCATATGTCA TACCATACAC ACAAAAAAAT TCCAGTGAAT TATAAGTCTA
7381 AATGGAGAAG GCAAAACTTT AAATCTTTTA GAAAATAATA TAGAAGCATG CCATCATGAC
7441 TTCAGTGTAG AGAAAAATTT CTTATGACTC AAAGTCCTAA CCACAAAGAA AAGATTGTTA
7501 ATTAGATTGC ATGAATATTA AGACTTATTT TTAAAATTAA AAAACCATTA AGAAAAGTCA
7561 GGCCATAGAA TGACAGAAAA TATTTGCAAC ACCCCAGTAA AGAGAATTGT AATATGCAGA
7621 TTATAAAAAG AAGTCTTACA AATCAGTAAA AAATAAAACT AGACAAAAAT TTGAACAGAT
7681 GAAAGAGAAA CTCTAAATAA TCATTACACA TGAGAAACTC AATCTCAGAA ATCAGAGAAC
7741 TATCATTGCA TATACACTAA ATTAGAGAAA TATTAAAAGG CTAAGTAACA TCTGTGGCAA
7801 TATTGATGGT ATATAACCTT GATATGATGT GATGAGAACA GTACTTTACC CCATGGGCTT
7861 CCTCCCCAAA CCCTTACCCC AGTATAAATC ATGACAAATA TACTTTAAAA ACCATTACCC
7921 TATATCTAAC CAGTACTCCT CAAAACTGTC AAGGTCATCA AAAATAAGAA AAGTCTGAGG
7981 AACTGTCAAA ACTAAGAGGA ACCCAAGGAG ACATGAGAAT TATATGTAAT GTGGCATTCT
8041 GAATGAGATC CCAGAACAGA AAAAGAACAG TAGCTAAAAA ACTAATGAAA TATAAATAAA
8101 GTTTGAACTT TAGTTTTTTT TAAAAAAGAG TAGCATTAAC ACGGCAAAGT CATTTTCATA
8161 TTTTTCTTGA ACATTAAGTA CAAGTCTATA ATTAAAAATT TTTTAAATGT AGTCTGGAAC
8221 ATTGCCAGAA ACAGAAGTAC AGCAGCTATC TGTGCTGTCG CCTAACTATC CATAGCTGAT
8281 TGGTCTAAAA TGAGATACAT CAACGCTCCT CCATGTTTTT TGTTTTCTTT TTAAATGAAA
8341 AACTTTATTT TTTAAGAGGA GTTTCAGGTT CATAGCAAAA TTGAGAGGAA GGTACATTCA
8401 AGCTGAGGAA GTTTTCCTCT ATTCCTAGTT TACTGAGAGA TTGCATCATG AATGGGTGTT
8461 AAATTTTGTC AAATGCTTTT TCTGTGTCTA TCAATATGAC CATGTGATTT TCTTCTTTAA
8521 CCTGTTGATG GGACAAATTA CGTTAATTGA TTTTCAAACG TTGAACCACC CTTACATATC
8581 TGGAATAAAT TCTACTTGGT TGTGGTGTAT ATTTTTTGAT ACATTCTTGG ATTCTTTTTG
8641 CTAATATTTT GTTGAAAATG TTTGTATCTT TGTTCATGAG AGATATTGGT CTGTTGTTTT
8701 CTTTTCTTGT AATGTCATTT TCTAGTTCCG GTATTAAGGT AATGCTGGCC TAGTTGAATG
8761 ATTTAGGAAG TATTCCCTCT GCTTCTGTCT TCTGAAAGAG ATTGTAGAAA GTTGATACAA
8821 TTTTTTTTTC TTTAAATATC TTGATATTAA TTAA
Seq ID No.21,5 heavy gRNA expressed sequences
1 AAGGTCGGGC AGGAAGAGGG CCTATTTCCC ATGATTCCTT CATATTTGCA TATACGATAC
61 AAGGCTGTTA GAGAGATAAT TGGAATTAAT TTGACTGTAA ACACAAAGAT ATTAGTACAA
121 AATACGTGAC GTAGAAAGTA ATAATTTCTT GGGTAGTTTG CAGTTTTAAA ATTATGTTTT
181 AAAATGGACT ATCATATGCT TACCGTAACT TGAAAGTATT TCGATTTCTT GGCTTTATAT
241 ATCTTGTGGA AAGGACGAAA CACCGGAATC GAAGAAGGAG GGTTTTAGAG CTAGAAATAG
301 CAAGTTAAAA TAAGGCTAGT CCGTTATCAA CTTGAAAAAG TGGCACCGAG TCGGTGCTTT
361 TTTAAAAAAG CACCGACTCG GTGCCACTTT TTCAAGTTGA TAACGGACTA GCCTTATTTT
421 AACTTGCTAT TTCTAGCTCT AAAACGGCCC TGGTGTGTAG TTTGGGAAAG AGTGGTCTCA
481 TACAGAACTT ATAAGATTCC CAAATCCAAA GACATTTCAC GTTTATGGTG ATTTCCCAGA
541 ACACATAGCG ACATGCAAAT ATTGCAGGGC GCCACTCCCC TGTCCCTCAC AGCCATCTTC
601 CTGCCAGGGC GCACGCGCGC TGGGTGTTCC CGCCTAGTGA CACTGGGCCC GCGATTCCTT
661 GGAGCGGGTT GATGACGTCA GCGTTCGAAT TCCATGGCGG CGCGGCGGCG ACGGAGCACC
721 GGCGGCGGCA GGGCGAGAGG TTCGGAGCTC AATATCGCGG GACGGCATGC GGGGGGCGGG
781 CAGTCAGAAA GGAACGATGC CACCTACTGT GACCCCCTTC CCTTCAGCTC CCTATAAACC
841 GGTGATCCGA CGCCGCCATC TCTAGGCCCG CGCCGGCCCC CTCGCACAGA CTTGTGGGAG
901 AAGCTCGGCT ACTCCCCTGC CCCGGTTAAT TTGCATATAA TATTTCCTAG TAACTATAGA
961 GGCTTAATGT GCGATAAAAG ACAGATAATC TGTTCTTTTT AATACTAGCT ACATTTTACA
1021 TGATAGGCTT GGATTTCTAT AAGAGATACA AATACTAAAT TATTATTTTA AAAAACAGCA
1081 CAAAAGGAAA CTCACCCTAA CTGTAAAGTA ATTGTGTGTT TTGAGACTAT AAATATCCCT
1141 TGGAGAAAAG CCTTGTTTGA TAGACCCATA AATCAGTTTT AGAGCTAGAA ATAGCAAGTT
1201 AAAATAAGGC TAGTCCGTTA TCAACTTGAA AAAGTGGCAC CGAGTCGGTG CTTTTTTAAA
1261 AAAGCACCGA CTCGGTGCCA CTTTTTCAAG TTGATAACGG ACTAGCCTTA TTTTAACTTG
1321 CTATTTCTAG CTCTAAAACT CAAATCACTC TTTGGCGAGG TACCCAGGCG GCGCACAAGC
1381 TATATAAACC TGAAGGAAAT CTCAACTTTA CACTTAGGTC AAGTTACTTA TCGTACTAGA
1441 GCTTCAGCAG GAAATTTAAC TAAAATCTAA TTTAACCAGC ATAGCAAATA TCATTTATTC
1501 CCAAAATGCT AAAGTTTGAG ATAAACGGAC TTGATTTCCG GCTGTTTTGA CACTATCCAG
1561 AATGCCTTGC AGATGGGTGG GGCATGCTAA ATACTGCAGG TCGACATGTC ACCAATAGAG
1621 ACGGCAATAT TTAGCATGCT CCACCCATCT GCAAGGCATG CTGGATAATA CGCAAACAAC
1681 TCCAAATCAA GCCCTTTTTA ATAGCAAGTT TTGCAATTTA AGGGAATAAT TTTTAAGTGT
1741 TAGTTAGGTT TGAACCTTAG TTTAAGATAT AACTTAGAGA TCCAGCATTT AGAAAGCAAG
1801 GAGAAAGTCG ACATATGCGT AAAGATCGAA ATTTCTTGTA GCTTATATAC TGCGTGCACT
1861 AGCTCAGTCT TTCGAATTGA TACTGACTGT AGTTTTAGAG CTAGAAATAG CAAGTTAAAA
1921 TAAGGCTAGT CCGTTATCAA CTTGAAAAAG TGGCACCGAG TCGGTGCTTT TTT
Claims (8)
1. a kind of controllability HIV-1 genomes target editing system, it is characterised in that including expressing controllable editing enzymes, collocation
Multiple guiding RNA (gRNA);
Specifically, expression of the pUC pUC to editing enzymes with the addition of control element, makes the expression of editing enzymes or is opened by LTR
Mover and TAT protein regulating and expressing, or strong promoter and the R element of HIV-1 by built-in mammal constitute it is chimeric
Promoter controls it to express, or is reached by the chimeric promoters control table that built-in strong mammalian promoters and TetO are constituted;Its
It is characterised by, the editing enzymes expression plasmid includes following plasmid:P1, p2, p3, p7, p8, p10, p11 and p13;The gRNA
Expression plasmid includes following plasmid:P4, p5, p9, p12, p14 and p15.
The multiple gRNA refers to the encoder block genome sequence design for HIV-1 genomes multiple site and/or CCR5
Targeting sequence.Characterized in that, in sequence containing Seq ID No.1-5 in ordered list in the multiple gRNA expression vectors
At least 2 kinds.The gRNA expression plasmids include following plasmid:P4, p5, p9, p12, p14 and p15.
The editing enzymes, gRNA co-expression plasmids include following plasmid:P6-1, p6-2 and p6-3;
The sequence of the p1 as shown in Seq ID No.7 in sequence table, the sequence of p5 as shown in Seq ID No.20 in sequence table,
The sequence of p6-1 as shown in Seq ID No.8 in sequence table, the sequence of p7 as shown in Seq ID No.9 in sequence table, p6-
The five heavy gRNA expressed sequences of 3 and p15 are as shown in Seq ID No.21.
2. controllability HIV-1 genome editing systems according to claim 1, the plasmid for carrying 5 '-and 3 '-LTR
System can integrator gene group, its integration to genome depends on intracellular infected HIV-1 live viruses.
3. a kind of targeting carrier system for conveying any editing systems of claim 1-2, it is characterised in that the fat
Plastid is mainly prepared from the following components:
DSPC DSPC:Cholesterol:Polypeptide-PEG- phosphatide mass ratio=20-70:10-30:0.0001-2;
Or, DPPC DPPC:DSPC DSPC:1,2- dioleyl phosphatidyl cholines
DOPC:Cholesterol:Mol ratio=5 of polypeptide-PEG- phosphatide:1:0.5-3:0.5-2:0.0001-0.5.
4. the carrier system according to claim 1-3, it is characterised in that the carrier system contains the targeting fat of PEGylation
Plastid, PEG is bi-functional, in addition to being covalently attached phospholipid composition, goes back the special target polypeptide of covalent coupling;The target
Include any one or several combinations in single target spot polypeptide and Mutiple Targets polypeptide to polypeptide;Single target spot polypeptide includes anti-CD4
Polypeptide aCD4 or aCD4scFV, anti-CCR5 polypeptides aCCR5, anti-CXCR4 polypeptides aCXCR4;The Mutiple Targets polypeptide includes aCD4_
CCR5、aCCR5_CD4、aCD4_CXCR4、aCXCR4_CD4、aCCR5_CXCR4、aCXCR4_CCR5;The PEG of the coupling points
Son amount scope is 200-10000, and the phosphatide of the coupling is one or more in phosphatidyl-ethanolamine or its derivative;
The coded sequence of aCD4scFV is Seq ID No.18 in sequence table;
The sequence of aCD4_CCR5 is Seq ID No.12 in sequence table, and wherein its amino can be put into left-hand end or right-hand end;
The sequence of aCCR5_CD4 is Seq ID No.13 in sequence table;
The coded sequence of aCCR5_CD4scFV is Seq ID No.19 in sequence table;
The sequence of aCD4_CXCR4 is Seq ID No.16 in sequence table;
The sequence of aCXCR4_CD4 is Seq ID No.17 in sequence table;
The sequence of aCCR5_CXCR4 is Seq ID No.14 in sequence table;
The sequence of aCXCR4_CCR5 is Seq ID No.15 in sequence table.
Above-mentioned functions polypeptide, can also directly with the lipid molecular covalent coupling such as PE, can directly add and be integrated into liposome membrane
In molecular layer.
5. the carrier system according to claim 3 and 4, it is characterised in that penetrating peptide can be also coupled on PEG- phospholipid molecules,
Formed " penetrating peptide-PEG- phosphatide ", the molecule is used with the collocation of target polypeptide-PEG- phosphatide;Penetrating peptide-PEG- phosphatide and other
The mol ratio of target polypeptide-PEG- phosphatide is 0.0001-2:1.
6. the carrier system according to claim 3-5, it is characterised in that hyaluronic acid can be also added in liposome components
Phosphatide, with target spot polypeptide-PEG- phosphatide and penetrating peptide-PEG- phosphatide collocation use, the phosphatide of the hyaluronic acid and its
The mol ratio of his polypeptide-PEG- phosphatide is 0.00001-1:1.
7. the plasmid and liposome according to claim 1-6, load can also add in plasmid cationic polymer or
Other polypeptide auxiliary ingredients, the auxiliary ingredients are polyethyleneimine, protamine sulfate, positively charged amino acid polypeptide or thin
Born of the same parents promote one or more compositions in fusogenic peptide.
8. a kind of targeting carrier system for conveying any editing systems of claim 1-2, it is characterised in that the load
Fortune system can also be slow virus;
The slow virus is that the targeting editing system is arranged in pairs or groups with the second generation or third generation slow virus packaging system, is carried out
Virus packaging;Described packaging virus capsid protein is VSVG, or is the capsid in measles virus (Measles Virus) source
Albumen, or be sindbis alphavirus (Sindbis Virus) source, and other viral sources capsid protein;It is described slow
The integrase inactivation of virus system is lacked, and its integration to genome depends on the HIV-1 live viruses that cell is infected.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710089568.9A CN106755100A (en) | 2017-02-20 | 2017-02-20 | The genomes of controllability HIV 1 target editing system and its targeting carrier system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710089568.9A CN106755100A (en) | 2017-02-20 | 2017-02-20 | The genomes of controllability HIV 1 target editing system and its targeting carrier system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106755100A true CN106755100A (en) | 2017-05-31 |
Family
ID=58957689
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710089568.9A Pending CN106755100A (en) | 2017-02-20 | 2017-02-20 | The genomes of controllability HIV 1 target editing system and its targeting carrier system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106755100A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108728477A (en) * | 2017-04-24 | 2018-11-02 | 华东理工大学 | A kind of efficient Transpositional mutation system and construction method |
CN110004181A (en) * | 2019-03-20 | 2019-07-12 | 东北农业大学 | A kind of additive type CRISPR/Cas9 expression vector and its construction method and application |
WO2021051484A1 (en) * | 2019-09-20 | 2021-03-25 | 南京启真基因工程有限公司 | CRISPR/CAS9 SYSTEM AND APPLICATION THEREOF IN CONSTRUCTION OF α, β, AND α&β THALASSEMIA MODEL PORCINE CELL LINES |
WO2023050158A1 (en) * | 2021-09-29 | 2023-04-06 | 深圳先进技术研究院 | Method for achieving multi-base editing |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105331607A (en) * | 2015-10-19 | 2016-02-17 | 芜湖医诺生物技术有限公司 | Human CCR5 gene target sequence recognized by streptococcus thermophilus CRISPR (clustered regularly interspaced short palindromic repeat)-Cas9 (CRISPR-associated protein 9) system, sgRNA (single guide ribonucleic acid) and application |
WO2016171980A1 (en) * | 2015-04-24 | 2016-10-27 | Bristol-Myers Squibb Company | Polypeptides targeting hiv fusion |
-
2017
- 2017-02-20 CN CN201710089568.9A patent/CN106755100A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016171980A1 (en) * | 2015-04-24 | 2016-10-27 | Bristol-Myers Squibb Company | Polypeptides targeting hiv fusion |
CN105331607A (en) * | 2015-10-19 | 2016-02-17 | 芜湖医诺生物技术有限公司 | Human CCR5 gene target sequence recognized by streptococcus thermophilus CRISPR (clustered regularly interspaced short palindromic repeat)-Cas9 (CRISPR-associated protein 9) system, sgRNA (single guide ribonucleic acid) and application |
Non-Patent Citations (3)
Title |
---|
LAURAMARTÍNEZ-MUÑOZ ET AL.,: ""CCR5/CD4/CXCR4 oligomerization prevents HIV-1 gp120IIIB binding to the cell surface"", 《PNAS》 * |
PANPAN HOU ET AL.,: ""Genome editing of CXCR4 by CRISPR/cas9 confers cells resistant to HIV-1 infection"", 《SCIENTIFIC REPORTS》 * |
赵萌等: ""一种拮抗 HIV-1 并靶向 DC 的融合基因的设计及表达鉴定"", 《生物工程学报》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108728477A (en) * | 2017-04-24 | 2018-11-02 | 华东理工大学 | A kind of efficient Transpositional mutation system and construction method |
CN110004181A (en) * | 2019-03-20 | 2019-07-12 | 东北农业大学 | A kind of additive type CRISPR/Cas9 expression vector and its construction method and application |
CN110004181B (en) * | 2019-03-20 | 2022-10-11 | 东北农业大学 | Additional CRISPR/Cas9 expression vector and construction method and application thereof |
WO2021051484A1 (en) * | 2019-09-20 | 2021-03-25 | 南京启真基因工程有限公司 | CRISPR/CAS9 SYSTEM AND APPLICATION THEREOF IN CONSTRUCTION OF α, β, AND α&β THALASSEMIA MODEL PORCINE CELL LINES |
WO2023050158A1 (en) * | 2021-09-29 | 2023-04-06 | 深圳先进技术研究院 | Method for achieving multi-base editing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111593073B (en) | Double-reporter gene framework vector, four-plasmid pseudovirus packaging system and new packaging corolla pneumonia pseudovirus | |
US20020068354A1 (en) | Feline immunodeficiency virus gene therapy vectors | |
KR101234981B1 (en) | Nucleic Acid Constructs | |
KR102609858B1 (en) | Adeno-Associated Viral Vectors for the Treatment of Mucopolysaccharidosis | |
KR102266691B1 (en) | Antigen delivery platforms | |
CN106755100A (en) | The genomes of controllability HIV 1 target editing system and its targeting carrier system | |
KR20200074988A (en) | Adenovirus vectors and uses thereof | |
CN110079551B (en) | Circular RNA expression vector and construction method and application thereof | |
CN109069668B (en) | Gene therapy for eye diseases | |
KR20220125332A (en) | Compositions and methods for targeting PCSK9 | |
KR102156290B1 (en) | Microvesicle and method for producing the same | |
CN113831394B (en) | Recombinant virus combination of African swine fever virus ASFV gene and vaccine prepared from recombinant virus combination | |
US20020045185A1 (en) | Secreted neural adhesion proteins | |
CN107090466A (en) | Double sgRNA expression plasmids and its construction method in library | |
WO2008151633A2 (en) | Vectors for vaccines against lentivirus infections | |
CN115044614B (en) | Modified vector of AAV-8 serotype for gene targeting and expression, construction method and application thereof | |
KR20230129162A (en) | RNA targeting composition and method for treating type 1 myotonic dystrophy | |
WO2011110864A1 (en) | Promoter sequence for dna and viral vectors | |
CN111110707A (en) | Application of recombinant oncolytic virus in preparation of medicine for treating digestive tract tumor | |
CN113880957A (en) | Streptolysin O fusion protein | |
DK3063278T3 (en) | PREPARATION OF VIRAL VECTORS | |
US20020048805A1 (en) | Feline immunodeficiency virus gene therapy vectors | |
EP0775203A1 (en) | Method of gene therapy using dna vectors without selection marker gene | |
CN111991550B (en) | Application of HIP-55 in preparation of products for diagnosing and preventing myocardial ischemia reperfusion injury | |
CN109913499A (en) | It is a kind of suitable for long-chain non-coding RNA construct and express integrated slow virus carrier system and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170531 |
|
WD01 | Invention patent application deemed withdrawn after publication |