CN107690482A - Modified microorganism for the optimized production of 2,4-dihydroxybutyric acid - Google Patents


Publication number
CN107690482A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201580080660.8A
Other languages
Chinese (zh)
Other versions
CN107690482B (en)
Inventor
P·索卡耶
G·贝斯泰尔-卡尔
L·迪蒙-赛格诺维特
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Metabolic Explorer SA
Original Assignee
Metabolic Explorer SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Metabolic Explorer SA
Publication of CN107690482A
Application granted
Publication of CN107690482B
Legal status: Active
Anticipated expiration

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/40Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
    • C12P7/42Hydroxy-carboxylic acids
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C31/00Saturated compounds having hydroxy or O-metal groups bound to acyclic carbon atoms
    • C07C31/18Polyhydroxylic acyclic alcohols
    • C07C31/22Trihydroxylic alcohols, e.g. glycerol
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C47/00Compounds having —CHO groups
    • C07C47/02Saturated compounds having —CHO groups bound to acyclic carbon atoms or to hydrogen
    • C07C47/19Saturated compounds having —CHO groups bound to acyclic carbon atoms or to hydrogen containing hydroxy groups
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07CACYCLIC OR CARBOCYCLIC COMPOUNDS
    • C07C59/00Compounds having carboxyl groups bound to acyclic carbon atoms and containing any of the groups OH, O—metal, —CHO, keto, ether, groups, groups, or groups
    • C07C59/01Saturated compounds having only one carboxyl group and containing hydroxy or O-metal groups
    • C07C59/10Polyhydroxy carboxylic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • C12N1/205Bacterial isolates
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0006Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/02Preparation of oxygen-containing organic compounds containing a hydroxy group
    • C12P7/04Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic
    • C12P7/18Preparation of oxygen-containing organic compounds containing a hydroxy group acyclic polyhydric
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y101/00Oxidoreductases acting on the CH-OH group of donors (1.1)
    • C12Y101/01Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
    • C12Y101/01001Alcohol dehydrogenase (1.1.1.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y102/00Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
    • C12Y102/01Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
    • C12Y102/01003Aldehyde dehydrogenase (NAD+) (1.2.1.3)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y102/00Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
    • C12Y102/01Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
    • C12Y102/01004Aldehyde dehydrogenase (NADP+) (1.2.1.4)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y102/00Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
    • C12Y102/03Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with oxygen as acceptor (1.2.3)
    • C12Y102/03001Aldehyde oxidase (1.2.3.1), i.e. retinal oxidase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/07Bacillus
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/145Clostridium
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/01Bacteria or Actinomycetales ; using bacteria or Actinomycetales
    • C12R2001/185Escherichia
    • C12R2001/19Escherichia coli

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biomedical Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Molecular Biology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Virology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The present invention relates to a genetically modified microorganism for producing 2,4-dihydroxybutyric acid by metabolic conversion of xylose via a 1,2,4-butanetriol intermediate. The invention further relates to a method for producing 2,4-dihydroxybutyric acid by culturing said genetically modified microorganism in a fermentation medium and recovering 2,4-DHB from that medium.

Description

Modified microorganism for the optimized production of 2,4-dihydroxybutyric acid
Introduction
The present invention relates to a genetically modified microorganism for producing 2,4-dihydroxybutyric acid by metabolic conversion of xylose via a 1,2,4-butanetriol intermediate. The invention further relates to a method for producing 2,4-dihydroxybutyric acid by culturing said genetically modified microorganism in a fermentation medium and recovering 2,4-DHB from that medium.
2,4-Dihydroxybutyric acid (2,4-DHB or DHB), also known as 2,4-dihydroxybutanoic acid or 2,4-dihydroxybutanoate, is a compound of high economic interest because it can serve as a precursor for the chemical synthesis of 2-hydroxy-4-(methylthio)butyrate (HMTB), a methionine analogue widely used as a feed additive in animal nutrition, particularly for poultry. Indeed, HMTB can be included in animal feed as a low-cost amino acid substitute for methionine in order to significantly increase meat yield (US2009/318715).
Conventionally, HMTB is produced by a chemical synthesis process using α-hydroxy-γ-butyrolactone (2HBL) in the presence of methyl mercaptan and a basic or acidic catalyst. Industrially, 2HBL is prepared in a known manner either in three steps from malic acid (DE19735575A1, AU2004200948A) or in two steps from γ-butyrolactone (WO2008/022953A1). The problems posed by these two routes are the difficulty of isolating 2HBL and the generation of very significant amounts of salts, which must subsequently be removed. In addition, the petrochemical synthesis of HMTB is not economically viable because it relies on hazardous materials and conditions and on expensive materials and reagents.
More recently, it has been shown that 2,4-DHB can be converted into α-hydroxy-γ-butyrolactone (2HBL) in an aqueous medium by adjusting to an appropriate pH (US2013/0204016).
Therefore, even though no natural metabolic pathway exists for the biochemical production of 2,4-dihydroxybutyric acid, synthetic metabolic pathways for producing DHB in microorganisms represent an attractive alternative, because they alleviate many of the problems mentioned above. Indeed, metabolic engineering approaches relying on the recombinant expression of specific enzymes in microorganisms have recently been developed: WO2012/056318, WO2013/160762, WO2014/009435 and EP14306564.7 notably describe the production of 2,4-DHB by fermentation of glucose in genetically modified microorganisms via different metabolic pathways. Most of the enzymes identified in these patent applications were obtained either by rational engineering based on structural and mechanistic knowledge of candidate enzymes acting on sterically similar cognate substrates, or by screening natural enzymes and further improving them by rational design. More specifically, WO2012/056318 identified three non-naturally occurring enzymes (malate kinase, malate semialdehyde dehydrogenase and DHB dehydrogenase, all mutated) that can be overexpressed in a microorganism to convert the metabolic intermediate (L)-malate into 2,4-DHB; WO2013/160762 requires the heterologous expression of several enzymes, some of which are mutated to improve enzymatic activity and/or substrate affinity (malyl-CoA synthetase, or succinyl-CoA:(L)-malate-CoA transferase, and/or malyl-CoA lyase; malyl-CoA reductase; and DHB dehydrogenase), to convert the metabolic intermediate malate, succinyl-CoA or glycolyl-CoA into 2,4-DHB; and the method of WO2014/009435 relies on converting the metabolic intermediate (L)-homoserine into 2-oxo-4-hydroxybutyrate (OHB) and reducing OHB to 2,4-DHB by recombinantly expressing two mutated enzymes (a homoserine transaminase and a DHB reductase).
The present invention proposes an alternative route for the microbial synthesis of 2,4-dihydroxybutyric acid from the metabolic intermediate 1,2,4-butanetriol in two single steps: 1,2,4-butanetriol is first oxidized to 2,4-dihydroxybutanal, which is then oxidized to 2,4-DHB.
The inventors have indeed surprisingly found that the overall production of 2,4-DHB can be substantially improved by genetically engineering the two reactions above in a microorganism capable of converting a carbon source, more particularly xylose, into 1,2,4-butanetriol.
The present invention therefore provides an alternative microorganism that is genetically modified for the optimized production of 2,4-dihydroxybutyric acid from xylose. This microorganism, which is genetically engineered to convert xylose into 1,2,4-butanetriol, is further genetically modified for:
- oxidizing 1,2,4-butanetriol into 2,4-dihydroxybutanal, and
- oxidizing 2,4-dihydroxybutanal into 2,4-DHB.
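The route described above can be written down as a small data structure. The following Python sketch is purely illustrative: the three conversions are taken from the text, while the list layout and the `route` helper are hypothetical names introduced here and are not part of the patent.

```python
# Conversions named in the text, stored as (substrate, product) pairs.
# The representation itself is an assumption made for illustration.
PATHWAY = [
    ("xylose", "1,2,4-butanetriol"),                # conversion engineered in the host
    ("1,2,4-butanetriol", "2,4-dihydroxybutanal"),  # first oxidation step
    ("2,4-dihydroxybutanal", "2,4-DHB"),            # second oxidation step
]


def route(start, end, steps=PATHWAY):
    """Return the ordered metabolites from start to end, or None if unreachable."""
    successors = dict(steps)
    path = [start]
    while path[-1] != end:
        nxt = successors.get(path[-1])
        if nxt is None or nxt in path:  # dead end or cycle
            return None
        path.append(nxt)
    return path


if __name__ == "__main__":
    print(" -> ".join(route("xylose", "2,4-DHB")))
```

Listing the steps explicitly makes it easy to see that the two oxidations claimed by the invention sit downstream of the xylose to 1,2,4-butanetriol conversion.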
The invention also relates to a method for producing 2,4-dihydroxybutyric acid by fermentation, comprising culturing the microorganism of the invention in a culture medium containing xylose and recovering the 2,4-DHB produced from the culture medium.
Detailed description of the invention
It should be understood that the following detailed description is not limiting, and that various modifications, substitutions, omissions and changes may be made without departing from the scope of the invention. It should also be understood that the terminology used herein is for the purpose of describing particular embodiments of the invention and is not intended to be limiting.
All publications, patents and patent applications cited herein, whether above or below, are hereby incorporated by reference in their entirety.
Furthermore, unless otherwise indicated, all technical and scientific terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art. The conventional microbiology and molecular biology techniques are likewise those known and commonly used in the art. Such techniques are well known to the skilled person and are fully explained in the literature (see, for example, Sambrook et al., 2012).
Nevertheless, with respect to the use of different terms throughout this specification, the following definitions apply more particularly.
Unless the context clearly dictates otherwise, the singular forms "a", "an" and "the" include plural reference herein. Thus, for example, a reference to "a microorganism" includes a plurality of such microorganisms, and a reference to "an endogenous gene" is a reference to one or more endogenous genes, and so forth.
The terms "comprise", "contain", "involve" and "include", and variations thereof, are used herein in an inclusive sense, i.e. to specify the presence of the stated features but not to preclude the presence or addition of further features in the various embodiments of the invention.
As used herein, the term "microorganism" refers to a living microscopic organism, which may be single-celled or multicellular and which can generally be found in nature. In the context of the invention, the microorganism is preferably a bacterium, yeast or fungus. More preferably, the microorganism of the invention is selected from the Enterobacteriaceae, Bacillaceae, Clostridiaceae, Streptomycetaceae, Corynebacteriaceae and yeast. Even more preferably, the microorganism of the invention is a species of Escherichia, Klebsiella, Thermoanaerobacterium, Clostridium, Corynebacterium or Saccharomyces. Still more preferably, the microorganism of the invention is selected from Escherichia coli, Klebsiella pneumoniae, Thermoanaerobacterium thermosaccharolyticum, Clostridium acetobutylicum, Corynebacterium glutamicum and Saccharomyces cerevisiae. Most preferably, the microorganism of the invention is Escherichia coli.
As used herein, the terms "recombinant microorganism", "genetically modified microorganism" and "genetically engineered microorganism" refer to a microorganism as defined above that is not found in nature and is therefore genetically different from its natural counterpart. In other words, it refers to a microorganism that has been modified by the introduction and/or deletion and/or modification of its genetic elements. Such modification can be performed by genetic engineering, by forcing the development and evolution of new metabolic pathways through culturing the microorganism under a specific selection pressure, or by combining both methods (see, for example, WO2005/073364 or WO2008/116852).
Accordingly, a microorganism genetically modified for the production of 2,4-DHB according to the invention means a recombinant microorganism as defined above that is capable of producing 2,4-DHB. In other words, the microorganism has been genetically modified to allow the production of 2,4-DHB.
In the context of the invention, the microorganism can be genetically modified by modulating the expression level of one or more endogenous genes. "Modulating" means herein that the expression level of the gene is up-regulated, down-regulated, or even completely abolished compared with its natural expression level. Such modulation can therefore result in an enhancement of the activity of the gene product or, alternatively, in a lower or null activity of the endogenous gene product.
The term "endogenous gene" refers herein to a gene that is naturally present in the microorganism.
An endogenous gene can be overexpressed by introducing heterologous sequences that favour up-regulation in addition to the endogenous regulatory elements, by replacing those endogenous regulatory elements with such heterologous sequences, or by introducing one or more supplementary copies of the endogenous gene into the chromosome or a plasmid within the microorganism. Endogenous gene activity and/or expression level can also be modified by introducing mutations into the coding sequence to modify the gene product. An endogenous gene can also be deleted to totally inhibit its expression within the microorganism. Another way to modulate the expression of an endogenous gene is to exchange its promoter (i.e. the wild-type promoter) for a stronger or weaker promoter to up- or down-regulate expression of the gene. Promoters suitable for this purpose can be homologous or heterologous and are well known in the art. Selecting appropriate promoters to modulate the expression of an endogenous gene is within the skill of the person in the art.
In addition or alternatively, the microorganism can be genetically modified to express one or more exogenous genes, provided that said genes are introduced into the microorganism together with all the regulatory elements necessary for their expression in the host microorganism. Modifying or "transforming" microorganisms with exogenous DNA is a routine task for the skilled person. In the context of the invention, the term "overexpression" is also used herein in relation to the expression of exogenous genes in the microorganism.
An "exogenous gene" or "heterologous gene" means herein a gene that is not naturally present in the microorganism. To express an exogenous gene in a microorganism, the gene can be integrated directly into the microbial chromosome or expressed extrachromosomally from plasmids or vectors within the microorganism. A variety of plasmids, which differ in their origin of replication and in their copy number in the cell, are well known in the art and can easily be selected by the skilled practitioner for this purpose. Exogenous genes according to the invention are advantageously homologous genes.
In the context of the invention, the term "homologous gene" or "homolog" refers not only to a gene inherited by two species (here, microbial species) from a theoretical common genetic ancestor, but also includes genes that may be genetically unrelated yet have evolved to encode proteins that perform similar functions and/or have similar structures (i.e. functional homologs). The term "functional homolog" therefore refers herein to a gene that encodes a functionally homologous protein.
Using the information available in databases such as UniProt (for proteins), GenBank (for genes) or NCBI (for proteins or genes), the skilled person can easily determine the sequence of a specific protein and/or gene of a microorganism and identify, on the basis of this sequence, an equivalent gene or homolog in another microorganism. This routine work can be performed by aligning a specific gene sequence of one microorganism with the gene sequences or the genome of other microorganisms, which can be found in the databases mentioned above. Such sequence alignment can advantageously be performed using the BLAST algorithm developed by Altschul et al. (1990). Once sequence homology has been established between those sequences, a consensus sequence can be derived and used to design degenerate probes in order to clone the corresponding homolog gene of the related microorganism. These routine methods of molecular biology are well known to the skilled person.
It shall be further understood that, in the context of the invention, should an exogenous gene encoding a protein of interest be expressed in a specific microorganism, a synthetic version of this gene is preferably constructed by replacing non-preferred or less preferred codons with the preferred codons of said microorganism that encode the same amino acid. It is indeed well known in the art that codon usage varies between microbial species, which may affect the recombinant expression level of the protein of interest. To overcome this issue, codon optimization methods have been developed and are extensively described by Graf et al. (2000), Deml et al. (2001) and Davis & Olsen (2011). Notably, several software tools have been developed for codon optimization, such as the software from Life Technologies or the OptimumGene™ software (GenScript). In other words, an exogenous gene encoding a protein of interest is preferably codon-optimized for expression in the specific microorganism.
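The codon replacement idea described above can be illustrated with a minimal Python sketch. It is not the algorithm of any of the cited tools: the `PREFERRED` table is a small illustrative assumption (a real table would be derived from codon-usage statistics of the chosen host), only a subset of the standard genetic code is included, and the function names are hypothetical.

```python
# Subset of the standard genetic code (DNA codons), enough for the example.
CODON_TO_AA = {
    "CTT": "L", "CTC": "L", "CTA": "L", "CTG": "L", "TTA": "L", "TTG": "L",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "GTT": "V", "GTC": "V", "GTA": "V", "GTG": "V",
    "GAA": "E", "GAG": "E",
    "AAA": "K", "AAG": "K",
    "ATG": "M",
    "TAA": "*", "TAG": "*", "TGA": "*",
}

# ASSUMPTION: one "preferred" codon per amino acid for a hypothetical host.
PREFERRED = {
    "L": "CTG", "A": "GCG", "G": "GGC", "V": "GTG",
    "E": "GAA", "K": "AAA", "M": "ATG", "*": "TAA",
}


def translate(cds):
    """Translate a coding sequence using the subset table ('?' for unknown codons)."""
    return "".join(CODON_TO_AA.get(cds[i:i + 3].upper(), "?")
                   for i in range(0, len(cds), 3))


def codon_optimize(cds):
    """Replace each codon with the host-preferred synonymous codon."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3].upper()
        aa = CODON_TO_AA.get(codon)
        # Codons outside the subset table are left unchanged.
        out.append(PREFERRED[aa] if aa is not None else codon)
    return "".join(out)


if __name__ == "__main__":
    original = "ATGCTAGCAGGAGTTGAGTAG"
    optimized = codon_optimize(original)
    print(original, "->", optimized)
    # The encoded protein must be unchanged by the substitution.
    print(translate(original) == translate(optimized))
```

The key invariant is that only synonymous codons are exchanged, so the translated amino acid sequence is identical before and after optimization.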
The microorganism according to the invention can also be genetically modified to increase or decrease the activity of one or more proteins.
Such activity can be increased by improving the catalytic efficiency of the protein, by decreasing protein turnover, by decreasing messenger RNA (mRNA) turnover, by increasing transcription of the gene or by increasing translation of the mRNA.
Improving the catalytic efficiency of a protein means increasing kcat and/or decreasing Km for a given substrate and/or a given cofactor, and/or increasing Ki for a given inhibitor. kcat, Km and Ki are Michaelis-Menten constants that the skilled person is able to determine (Segel, 1993). Decreasing protein turnover means stabilizing the protein. Methods for improving protein catalytic efficiency and/or decreasing protein turnover are well known to the skilled person. They include rational engineering with sequence and/or structural analysis and directed mutagenesis, as well as random mutagenesis and screening. Mutations can be introduced by site-directed mutagenesis using conventional methods such as the polymerase chain reaction (PCR), or by random mutagenesis techniques, for example via mutagenic agents (ultraviolet radiation or chemical agents such as nitrosoguanidine (NTG) or ethyl methanesulfonate (EMS)), DNA shuffling or error-prone PCR. The protein can also be stabilized by adding a "tag" peptide sequence at its N-terminus or C-terminus. Such tags are well known in the art and include, among others, glutathione-S-transferase (GST).
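To make the role of kcat, Km and Ki concrete, the sketch below evaluates the Michaelis-Menten rate law in Python. The source does not specify an inhibition mechanism; competitive inhibition is assumed here purely for illustration, and the function name and parameter values are hypothetical.

```python
def initial_rate(kcat, km, enzyme, substrate, inhibitor=0.0, ki=float("inf")):
    """Michaelis-Menten initial rate, with optional competitive inhibition.

    v = kcat * [E] * [S] / (Km * (1 + [I]/Ki) + [S])

    ASSUMPTION: the inhibitor is competitive, so it only raises the apparent Km.
    """
    km_apparent = km * (1.0 + inhibitor / ki)
    return kcat * enzyme * substrate / (km_apparent + substrate)


if __name__ == "__main__":
    base = initial_rate(kcat=10.0, km=1.0, enzyme=1.0, substrate=1.0)
    print("baseline:", base)
    # Each change listed in the text raises the rate at fixed concentrations.
    print("higher kcat:", initial_rate(20.0, 1.0, 1.0, 1.0))
    print("lower Km:", initial_rate(10.0, 0.5, 1.0, 1.0))
    print("low Ki:", initial_rate(10.0, 1.0, 1.0, 1.0, inhibitor=1.0, ki=1.0))
    print("high Ki:", initial_rate(10.0, 1.0, 1.0, 1.0, inhibitor=1.0, ki=10.0))
```

Under this assumed rate law, a higher kcat, a lower Km, or a higher Ki each increase the reaction rate at a given substrate and inhibitor concentration, which is the sense in which they improve catalytic efficiency.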
mRNA turnover can be decreased by modifying the gene sequence of the 5'-untranslated region (5'-UTR) and/or the coding region and/or the 3'-UTR (Carrier and Keasling, 1999).
Transcription of a gene, whether endogenous or exogenous, can be increased by increasing the number of its copies within the microorganism and/or by using a promoter that leads to a higher level of expression than the wild-type promoter. In the context of the invention, the term "overexpression" is also used to designate an increase in transcription of a gene in a microorganism.
As indicated above, to increase the number of copies of a gene in the microorganism, the gene can be encoded chromosomally or extrachromosomally. When the gene of interest is to be encoded on the chromosome, several copies of the gene can be introduced on the chromosome by genetic recombination methods well known in the art (such as gene replacement). When the gene is to be encoded extrachromosomally in the microorganism, it can be carried by different types of plasmids, which differ in their origin of replication, and therefore in the microorganisms in which they can replicate, and in their copy number in the cell. Depending on the nature of the plasmid, a microorganism transformed with it can contain 1 to 5 copies of the plasmid, about 20 copies, or even up to 500 copies. Examples of low-copy-number plasmids that can replicate in E. coli include, without limitation, the pSC101 plasmid (tight replication), the RK2 plasmid (tight replication), and the pACYC and pRSF1010 plasmids; an example of a high-copy-number plasmid that can replicate in E. coli is pSK bluescript II.
Promoters that can increase the expression level of a gene are also well known to the skilled person and can be homologous (originating from the same species) or heterologous (originating from a different species). Examples of such promoters include, without limitation, the promoters Ptrc, Ptac, Plac and the lambda promoter cI. These promoters can also be induced ("inducible promoters") by a particular compound or by a specific external condition such as temperature or light.
Translation of the mRNA can be increased by modifying the ribosome binding site (RBS). An RBS is a sequence on the mRNA that is bound by the ribosome when initiating protein translation. It can be the 5' cap of an mRNA in eukaryotes, a region 6-7 nucleotides upstream of the start codon AUG in prokaryotes (called the Shine-Dalgarno sequence), or an internal ribosome entry site (IRES) in viruses. By modifying this sequence, it is possible to change the translation initiation rate of the protein, proportionally alter its production rate, and control its activity inside the cell. The strength of an RBS sequence can also be optimized to achieve a targeted translation initiation rate by using the RBS CALCULATOR software (Salis, 2011). Selecting the RBS sequence based on the nature of the mRNA is within the skill of the person in the art.
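As a rough illustration of the prokaryotic case, the Python sketch below locates a Shine-Dalgarno-like motif upstream of a given start codon and reports the spacer length between the two. It is not related to the RBS CALCULATOR software: the motif `AGGAGG`, the 20-nucleotide search window and the function name are assumptions chosen for the example.

```python
SD_CORE = "AGGAGG"  # ASSUMPTION: a commonly cited Shine-Dalgarno core motif


def sd_spacer(mrna, start_index, motif=SD_CORE, window=20):
    """Return the number of nucleotides between the motif and the start codon.

    mrna        -- RNA or DNA sequence (T is read as U)
    start_index -- 0-based position of the A in the start codon AUG
    Returns None when the motif is absent from the upstream window.
    """
    rna = mrna.upper().replace("T", "U")
    if rna[start_index:start_index + 3] != "AUG":
        raise ValueError("start_index does not point at an AUG codon")
    lower = max(0, start_index - window)
    # Closest motif occurrence that ends at or before the start codon.
    pos = rna.rfind(motif, lower, start_index)
    if pos == -1:
        return None
    return start_index - (pos + len(motif))


if __name__ == "__main__":
    leader = "GGCU" + "AGGAGG" + "UAACAU"  # motif followed by a 6-nt spacer
    message = leader + "AUGGCU"
    print(sd_spacer(message, start_index=len(leader)))
```

A real design task would score candidate RBS sequences rather than merely locate a fixed motif, but the spacer length computed here is the quantity the text refers to when it places the Shine-Dalgarno region a few nucleotides upstream of AUG.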
Decreasing the activity of a protein can mean decreasing its specific catalytic activity by mutating the gene encoding the protein so as to change the corresponding amino acid sequence, and/or decreasing the concentration of the protein in the cell by mutating the nucleotide sequence or by deleting the coding region of the gene.
As used herein, the term "oxidoreductase" refers to an enzyme that, depending on the operating conditions, reversibly or irreversibly reduces or oxidizes its substrate. More specifically, such an enzyme acts by catalyzing the transfer of electrons from its substrate (the electron donor) to another molecule called the oxidant (the electron acceptor). In the context of the invention, the oxidoreductases are selected to oxidize their respective substrates (1,2,4-butanetriol into 2,4-dihydroxybutanal, and 2,4-dihydroxybutanal into 2,4-DHB), more preferably in an irreversible manner. Adapting the operating conditions so that the oxidoreductases of the invention oxidize their substrates is within the skill of the person in the art.
Oxidoreductases are generally classified as EC 1 in the Enzyme Commission (EC) number classification of enzymes, which groups more than 20 subclasses of enzymes (such as EC 1.1, which includes oxidoreductases acting on the CH-OH group of donors, also known as alcohol oxidoreductases; EC 1.2, which includes oxidoreductases acting on the aldehyde or oxo group of donors; EC 1.3, which includes oxidoreductases acting on the CH-CH group of donors, also known as CH-CH oxidoreductases; EC 1.4, which includes oxidoreductases acting on the CH-NH2 group of donors, also known as amino acid oxidoreductases or monoamine oxidases; EC 1.5, which includes oxidoreductases acting on the CH-NH group of donors; EC 1.6, which includes oxidoreductases acting on NADH or NADPH; EC 1.7, which includes oxidoreductases acting on other nitrogenous compounds as donors; EC 1.8, which includes oxidoreductases acting on a sulfur group of donors; EC 1.9, which includes oxidoreductases acting on a heme group of donors; EC 1.10, which includes oxidoreductases acting on diphenols and related substances as donors; EC 1.11, which includes oxidoreductases acting on peroxide as an acceptor, also known as peroxidases; EC 1.12, which includes oxidoreductases acting on hydrogen as a donor; EC 1.13, which includes oxidoreductases acting on single donors with incorporation of molecular oxygen, also known as oxygenases; EC 1.14, which includes oxidoreductases acting on paired donors with incorporation of molecular oxygen; EC 1.15, which includes oxidoreductases acting on superoxide radicals as acceptors; EC 1.16, which includes oxidoreductases that oxidize metal ions; EC 1.17, which includes oxidoreductases acting on CH or CH2 groups; EC 1.18, which includes oxidoreductases acting on iron-sulfur proteins as donors; EC 1.19, which includes oxidoreductases acting on reduced flavodoxin as a donor; EC 1.20, which includes oxidoreductases acting on phosphorus or arsenic in donors; EC 1.21, which includes oxidoreductases acting on X-H and Y-H to form an X-Y bond; and EC 1.97, which includes other oxidoreductases). Some of these oxidoreductases can use NADP+ or NAD+ as a cofactor and are therefore called NAD+/NADP+ oxidoreductases.
The terms "source of carbon", "carbon source" or "carbon substrate" according to the present invention refer to any carbon source that can be metabolized by a microorganism and that contains at least one carbon atom. A carbon source may be derived from renewable feedstock. Renewable feedstock is defined as raw material required for certain industrial processes that can be regenerated within a brief delay and in sufficient amount to permit its transformation into the desired product. Vegetal biomass, treated or untreated, is an interesting renewable carbon source. In the context of the present invention, the carbon source is xylose and, advantageously, a carbon source other than xylose that is a carbohydrate.
The term "carbohydrate" refers herein to any carbon source that can be metabolized by a microorganism and that contains at least one carbon atom, two hydrogen atoms and one oxygen atom. The carbohydrate of the invention is preferably selected from the group consisting of: monosaccharides such as glucose, fructose, mannose, galactose and the like; disaccharides such as sucrose, cellobiose, maltose, lactose and the like; oligosaccharides such as raffinose, stachyose, maltodextrins and the like; polysaccharides such as cellulose, hemicellulose, starch and the like; methanol; formaldehyde; and glycerol. According to a preferred embodiment of the invention, the carbon source other than xylose advantageously comprises carbohydrates of 3, 6 or 12 carbon atoms, or any combination thereof. In a further preferred embodiment of the invention, the carbon source other than xylose is selected from glycerol, glucose, galactose, fructose, lactose, maltose, sucrose, and any combination thereof. More preferably, the carbon source other than xylose is selected from glycerol, glucose, sucrose, and any combination thereof, and is most preferably glucose.
In a first aspect, the present invention relates to a microorganism genetically modified to produce 2,4-dihydroxybutyric acid by converting xylose into 1,2,4-butanetriol, wherein the microorganism is further genetically modified for:
i) oxidizing 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde; and
ii) oxidizing 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid.
More preferably, oxidation steps i) and ii) respectively involve the exclusive conversion of 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde and of 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid. In other words, it is particularly preferred that, during the oxidation of 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde and of 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid (2,4-DHB), the microorganism produces no other metabolic intermediate, so as to optimize 2,4-DHB production. Such exclusive conversions can be achieved by using the enzymes described further below.
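As an illustration only, the two oxidation steps can be checked for element balance. The sketch below is not part of the disclosure: the molecular formulas are the standard ones for the three compounds, and the `TWO_H` term is an assumed shorthand for the two hydrogen atoms handed over to the acceptor (for example NAD+ or NADP+) in each step.

```python
from collections import Counter

# Standard molecular formulas (assumed here, not quoted from the description).
BUTANETRIOL = Counter({"C": 4, "H": 10, "O": 3})       # 1,2,4-butanetriol
DIHYDROXYBUTANAL = Counter({"C": 4, "H": 8, "O": 3})   # 2,4-dihydroxybutyraldehyde
DHB = Counter({"C": 4, "H": 8, "O": 4})                # 2,4-dihydroxybutyric acid
WATER = Counter({"H": 2, "O": 1})
TWO_H = Counter({"H": 2})  # hypothetical shorthand: hydrogens passed to the acceptor


def total(species):
    """Sum the element counts of a list of species."""
    out = Counter()
    for s in species:
        out.update(s)
    return out


def is_balanced(substrates, products):
    """True when both sides carry the same number of each element."""
    return total(substrates) == total(products)


# Step i): alcohol group oxidized to an aldehyde group.
STEP_I = ([BUTANETRIOL], [DIHYDROXYBUTANAL, TWO_H])
# Step ii): aldehyde group oxidized to a carboxylic acid (water supplies the oxygen).
STEP_II = ([DIHYDROXYBUTANAL, WATER], [DHB, TWO_H])

if __name__ == "__main__":
    print(is_balanced(*STEP_I), is_balanced(*STEP_II))
```

Each step releases one pair of reducing equivalents, which is why the choice of acceptor (NAD+/NADP+, oxygen, quinone and so on) distinguishes the enzyme sub-subclasses discussed below.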
According to a preferred embodiment, genetic modification i) is the overexpression of at least one gene encoding an oxidoreductase acting on the CH-OH group of donors (EC 1.1 enzyme), and/or genetic modification ii) is the overexpression of at least one gene encoding an oxidoreductase acting on the aldehyde or oxo group of donors (EC 1.2 enzyme).
Examples of oxidoreductases acting on the CH-OH group of donors include, but are not limited to, EC 1.1.1 enzymes (oxidoreductases with NAD+ or NADP+ as acceptor, also known as NAD+/NADP+ oxidoreductases), EC 1.1.2 enzymes (oxidoreductases with a cytochrome as acceptor), EC 1.1.3 enzymes (oxidoreductases with oxygen as acceptor), EC 1.1.4 enzymes (oxidoreductases with a disulfide as acceptor), EC 1.1.5 enzymes (oxidoreductases with a quinone or similar compound as acceptor), EC 1.1.98 enzymes (oxidoreductases with other, known, acceptors) and EC 1.1.99 enzymes (oxidoreductases with other acceptors).
Examples of oxidoreductases acting on the aldehyde or oxo group of donors include, but are not limited to, EC 1.2.1 enzymes (oxidoreductases with NAD+ or NADP+ as acceptor, also known as NAD+/NADP+ oxidoreductases), EC 1.2.2 enzymes (oxidoreductases with a cytochrome as acceptor), EC 1.2.3 enzymes (oxidoreductases with oxygen as acceptor), EC 1.2.4 enzymes (oxidoreductases with a disulfide as acceptor), EC 1.2.5 enzymes (oxidoreductases with a quinone or similar compound as acceptor), EC 1.2.7 enzymes (oxidoreductases with an iron-sulfur protein as acceptor) and EC 1.2.99 enzymes (oxidoreductases with other acceptors).
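The two lists above follow the same pattern: the second EC digit names the donor group and the third names the acceptor. A minimal lookup sketch, covering only the sub-subclasses named in this description (the function name `describe` and the table layout are illustrative assumptions, not an official nomenclature API):

```python
# Acceptor named by the third EC digit, as listed in the description.
ACCEPTORS = {
    "1": "NAD+ or NADP+",
    "2": "cytochrome",
    "3": "oxygen",
    "4": "disulfide",
    "5": "quinone or similar compound",
    "7": "iron-sulfur protein",
    "98": "other, known, acceptors",
    "99": "other acceptors",
}

# Donor group named by the first two EC digits.
DONOR_GROUP = {"1.1": "CH-OH group", "1.2": "aldehyde or oxo group"}

# Sub-subclasses actually listed in the text for each subclass.
LISTED = {
    "1.1": {"1", "2", "3", "4", "5", "98", "99"},
    "1.2": {"1", "2", "3", "4", "5", "7", "99"},
}


def describe(ec_number):
    """Return (donor group, acceptor) for an EC number such as 'EC 1.1.1'."""
    parts = ec_number.replace("EC", "").strip().split(".")
    if len(parts) < 3:
        raise ValueError(f"need at least three EC levels: {ec_number!r}")
    subclass, sub_subclass = ".".join(parts[:2]), parts[2]
    if sub_subclass not in LISTED.get(subclass, set()):
        raise KeyError(f"{ec_number!r} is not listed in this description")
    return DONOR_GROUP[subclass], ACCEPTORS[sub_subclass]
```

For instance, `describe("EC 1.2.1")` returns the aldehyde or oxo donor group paired with NAD+ or NADP+ as acceptor, which is the combination the description later names as most preferred for step ii).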
Enzymes displaying the above activities are well known in the art and can easily be identified by the skilled person, for example from publicly available databases such as BRENDA.
In order to oxidize 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde, the oxidoreductase acting on the CH-OH group of donors (EC 1.1 enzyme) is preferably selected from the group consisting of alcohol dehydrogenases (i.e. aldehyde reductases), aldehyde dehydrogenases, lactaldehyde reductases, glyoxylate reductases, didehydrogluconate reductases, and any combination thereof. More preferably, said EC 1.1 oxidoreductase is an alcohol dehydrogenase (i.e. an aldehyde reductase) or a lactaldehyde reductase. Even more preferably, said EC 1.1 oxidoreductase is an NAD+/NADP+ oxidoreductase acting on the CH-OH group of donors (i.e. an EC 1.1.1 enzyme), or an oxidoreductase acting on the CH-OH group of donors with other acceptors (i.e. an EC 1.1.99 enzyme). Most preferably, said EC 1.1 enzyme is an EC 1.1.1 enzyme. Selecting EC 1.1 enzymes suitable for the purposes of the invention and identifying their corresponding gene sequences is within the skill of the person of the art.
Particularly preferred oxidoreductases acting on the CH-OH group of donors (EC 1.1 enzymes) are described in Table 1 below according to their sequence identifier and their accession number in databases: alcohol dehydrogenases include, but are not limited to, the enzymes of sequence SEQ ID NO:1 to SEQ ID NO:40; lactaldehyde reductases include, but are not limited to, the enzyme of sequence SEQ ID NO:41; glyoxylate reductases include, but are not limited to, the enzymes of sequence SEQ ID NO:42 and SEQ ID NO:43; and didehydrogluconate reductases include, but are not limited to, the enzymes of sequence SEQ ID NO:44 and SEQ ID NO:45.
In a particularly preferred embodiment of the invention, the oxidoreductase acting on the CH-OH group of donors (EC 1.1 enzyme) is an enzyme of sequence SEQ ID NO:36 or SEQ ID NO:41, in accordance with the metabolic pathway converting 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde shown in Figure 1.
Genes encoding the above-mentioned oxidoreductases are also well known in the art and are described in Table 1 below:
- genes encoding alcohol dehydrogenases include, but are not limited to, the Escherichia coli genes frmA (also known as adhC), adhP (also known as yddN), yiaY, eutG (also known as yffV), yqhD, yeaE, ydhF, yhdN, ybdR, ybdH, ybjJ, ydjL, ahr, yahK, tas (also known as ygdS) and gldA; the Clostridium acetobutylicum genes bdhB, bdhA and CA_C3392; the Saccharomyces cerevisiae genes ADH1, ADH2, ADH3, ADH4 and bdh1; the Bacillus subtilis genes yhdN and bdhA; the Gluconobacter oxydans gene GOX1615; the Arabidopsis thaliana gene AKR4C9; the Leishmania donovani gene A4UTP6; the Thermoanaerobacter ethanolicus gene adhA; the Corynebacterium glutamicum genes butA and budC; the Klebsiella pneumoniae gene budC; the Serratia marcescens gene slaC; the Hypocrea jecorina gene gld2; the Clostridium butyricum gene dhaT; the Citrobacter freundii gene dhaT; the Klebsiella pneumoniae gene dhaT; the Rhodococcus erythropolis gene encoding an NDMA-dependent alcohol dehydrogenase; and the Amycolatopsis methanolica gene encoding an NDMA-dependent alcohol dehydrogenase;
- genes encoding lactaldehyde reductases include, but are not limited to, the Escherichia coli gene fucO;
- genes encoding glyoxylate reductases include, but are not limited to, the Escherichia coli genes ghrA (also known as ycdW) and ghrB (also known as yiaE); and
- genes encoding didehydrogluconate reductases include, but are not limited to, the Escherichia coli genes dkgA (also known as yqhE) and dkgB (also known as yafB).
In a particularly preferred embodiment of the invention, the oxidoreductase acting on the CH-OH group of donors (EC 1.1 enzyme) is encoded by the fucO gene from Escherichia coli and/or by the dhaT gene from Clostridium butyricum, Citrobacter freundii or Klebsiella pneumoniae, in accordance with the metabolic pathway converting 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde shown in Figure 1.
As indicated above, the microorganism of the invention is also genetically modified for ii) oxidizing 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid. To this end, at least one gene encoding an oxidoreductase acting on the aldehyde or oxo group of donors (EC 1.2 enzyme) can be overexpressed in the microorganism.
Accordingly, the oxidoreductase acting on the aldehyde or oxo group of donors (EC 1.2 enzyme) is preferably selected from the group consisting of aldehyde dehydrogenases, aldehyde oxidases, and any combination thereof. More preferably, said EC 1.2 oxidoreductase is an aldehyde dehydrogenase. Even more preferably, said EC 1.2 oxidoreductase is an NAD+/NADP+ oxidoreductase acting on the aldehyde or oxo group of donors (i.e. an EC 1.2.1 enzyme), or an oxidoreductase acting on the aldehyde or oxo group of donors with oxygen as acceptor (i.e. an EC 1.2.3 enzyme). Most preferably, said EC 1.2 enzyme is an EC 1.2.1 enzyme. Selecting EC 1.2 enzymes suitable for the purposes of the invention and identifying their corresponding gene sequences is within the skill of the person of the art.
Particularly preferred oxidoreductases acting on the aldehyde or oxo group of donors (EC 1.2 enzymes) are described in Table 1 below according to their sequence identifier and their accession number in databases: aldehyde dehydrogenases include, but are not limited to, the enzymes of sequence SEQ ID NO:46 to SEQ ID NO:60; aldehyde oxidases include, but are not limited to, the enzyme of sequence SEQ ID NO:61.
In a particularly preferred embodiment of the invention, the oxidoreductase acting on the aldehyde or oxo group of donors (EC 1.2 enzyme) is an enzyme of sequence SEQ ID NO:46, SEQ ID NO:54 or SEQ ID NO:55, in accordance with the metabolic pathway converting 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid shown in Figure 1.
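The SEQ ID NO ranges given in this description for the EC 1.1 and EC 1.2 enzymes can be gathered into one lookup. The sketch below only restates those ranges and the particularly preferred identifiers; the function name `classify` and the tuple layout are illustrative assumptions.

```python
# (first SEQ ID NO, last SEQ ID NO, enzyme family, EC subclass), as stated in the text.
SEQ_ID_RANGES = [
    (1, 40, "alcohol dehydrogenase", "EC 1.1"),
    (41, 41, "lactaldehyde reductase", "EC 1.1"),
    (42, 43, "glyoxylate reductase", "EC 1.1"),
    (44, 45, "didehydrogluconate reductase", "EC 1.1"),
    (46, 60, "aldehyde dehydrogenase", "EC 1.2"),
    (61, 61, "aldehyde oxidase", "EC 1.2"),
]

# Identifiers singled out as particularly preferred (steps i and ii).
PARTICULARLY_PREFERRED = {36, 41, 46, 54, 55}


def classify(seq_id):
    """Return (enzyme family, EC subclass, particularly preferred?) for a SEQ ID NO."""
    for first, last, family, subclass in SEQ_ID_RANGES:
        if first <= seq_id <= last:
            return family, subclass, seq_id in PARTICULARLY_PREFERRED
    raise ValueError(f"SEQ ID NO:{seq_id} is outside the ranges listed here")
```

Sequences numbered above 61 fall outside the oxidoreductase ranges quoted in this passage, so the sketch deliberately raises an error for them rather than guessing.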
Genes encoding the above-mentioned oxidoreductases are also well known in the art and are described in Table 1 below:
- genes encoding aldehyde dehydrogenases include, but are not limited to, the Escherichia coli genes puuC, sad, gabD, prr (also known as ydcW), aldA, aldB (also known as yiaX), feaB and astD; the Klebsiella pneumoniae gene KPN_01018; the Saccharomyces cerevisiae genes ALD4 and ALD5; the Pseudomonas putida gene davD; the Pseudomonas sp. gene amnC; and the Methanocaldococcus jannaschii gene MJ1411;
- genes encoding dioxovalerate dehydrogenases include, but are not limited to, the Bacillus licheniformis gene ycbD; and
- genes encoding aldehyde oxidases include, but are not limited to, the Zea mays gene AO1.
In a particularly preferred embodiment of the invention, the oxidoreductase acting on the aldehyde or oxo group of donors (EC 1.2 enzyme) is encoded by the aldA, aldB or puuC gene from Escherichia coli, in accordance with the metabolic pathway converting 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid shown in Figure 1.
It should further be noted that, when endogenous genes encoding the above activities are already present in the microorganism of interest, they are advantageously overexpressed in said microorganism. Conversely, when the microorganism does not naturally comprise genes encoding such activities, it is advantageously transformed with one or more exogenous genes encoding said enzymes; it is understood that said exogenous genes are also overexpressed. As explained above, the exogenous genes are preferably synthetic genes that have been codon-optimized for expression in the microorganism of interest.
In the context of the present invention, the microorganism is capable of converting xylose into 1,2,4-butanetriol. Microorganisms genetically modified for converting xylose into 1,2,4-butanetriol, and in particular for exclusively converting xylose into 1,2,4-butanetriol, are known in the art and have notably been described in patent applications WO2008/091288 and US2013/0203141, incorporated herein by reference.
Therefore, in a preferred embodiment, the genetic modification for converting xylose into 1,2,4-butanetriol is the overexpression of at least one of the following genes:
- a gene encoding a xylose dehydrogenase,
- a gene encoding a xylonolactonase,
- a gene encoding a xylonate dehydratase,
- a gene encoding a 3-deoxy-D-glycero-pentulosonate (DGP) decarboxylase,
- a gene encoding a 1,2,4-butanetriol dehydrogenase, and
- any combination thereof.
Genes encoding the enzymes described above are well known in the art:
- genes encoding a xylose dehydrogenase are disclosed in US2013/0203141 and WO2008/091288, incorporated herein by reference, and include, but are not limited to, the xdh genes from Caulobacter crescentus, Haloarcula marismortui, Burkholderia fungorum LB400 and Haloferax volcanii DS2,
- genes encoding a xylonolactonase include, but are not limited to, the xylC gene from Caulobacter crescentus,
- genes encoding a D-xylonate dehydratase are disclosed in US2013/0203141 and WO2008/091288, incorporated herein by reference, and include, but are not limited to, the yjhG and yagF genes from Escherichia coli,
- genes encoding a 3-deoxy-D-glycero-pentulosonate decarboxylase are disclosed in US2013/0203141 and WO2008/091288, incorporated herein by reference, and include, but are not limited to, the mdlC gene from Pseudomonas putida, and
- genes encoding an NADPH-dependent 1,2,4-butanetriol dehydrogenase are disclosed in US2013/0203141 and WO2008/091288, incorporated herein by reference, and include, but are not limited to, the adhP and yqhD genes from Escherichia coli.
In a preferred embodiment, particularly when the genetically modified microorganism is Escherichia coli:
α) the genes encoding the xylose dehydrogenase, the xylonolactonase and the 3-deoxy-D-glycero-pentulosonate decarboxylase are heterologous genes introduced into the microorganism, and
β) the genes encoding the xylonate dehydratase and the 1,2,4-butanetriol dehydrogenase are endogenous genes, which are therefore advantageously overexpressed.
In a particularly preferred embodiment of the invention, the xylose dehydrogenase is encoded by the xdh gene from Caulobacter crescentus, the xylonolactonase is encoded by the xylC gene from Caulobacter crescentus, the xylonate dehydratase is encoded by the yjhG and/or yagF genes from Escherichia coli, the 3-deoxy-D-glycero-pentulosonate decarboxylase is encoded by the mdlC gene from Pseudomonas putida, and the 1,2,4-butanetriol dehydrogenase is encoded by the adhP and/or yqhD genes from Escherichia coli, in accordance with the metabolic pathway converting xylose into 1,2,4-butanetriol shown in Figure 1.
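The particularly preferred xylose to 1,2,4-butanetriol pathway described above can be written down as an ordered list of enzyme steps, from which the heterologous and endogenous genes for a given host follow directly. This is only a sketch: the `Step` record and the `split_by_origin` helper are hypothetical names, and the data are limited to the genes and source organisms named in this passage.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    enzyme: str
    genes: tuple   # gene(s) named in the description for this step
    source: str    # organism the gene(s) come from


HOST = "Escherichia coli"

# Ordered as in the description: xylose is converted step by step into 1,2,4-butanetriol.
PATHWAY = [
    Step("xylose dehydrogenase", ("xdh",), "Caulobacter crescentus"),
    Step("xylonolactonase", ("xylC",), "Caulobacter crescentus"),
    Step("xylonate dehydratase", ("yjhG", "yagF"), "Escherichia coli"),
    Step("3-deoxy-D-glycero-pentulosonate decarboxylase", ("mdlC",), "Pseudomonas putida"),
    Step("1,2,4-butanetriol dehydrogenase", ("adhP", "yqhD"), "Escherichia coli"),
]


def split_by_origin(pathway, host=HOST):
    """Return (heterologous genes, endogenous genes) relative to the host."""
    heterologous = [g for s in pathway if s.source != host for g in s.genes]
    endogenous = [g for s in pathway if s.source == host for g in s.genes]
    return heterologous, endogenous


if __name__ == "__main__":
    het, endo = split_by_origin(PATHWAY)
    print("introduce:", het)
    print("overexpress (endogenous):", endo)
```

With Escherichia coli as host, the split reproduces items α) and β) above: xdh, xylC and mdlC are introduced, while yjhG, yagF, adhP and yqhD are already present and are overexpressed.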
The nucleotide sequences of the genes mentioned above, or the amino acid sequences encoded by said genes, are described in Table 1 below according to their accession number and version in databases and/or according to their sequence identifier.
When endogenous genes encoding the above activities are already present in the microorganism of interest, they are advantageously overexpressed in said microorganism. Conversely, when the microorganism does not naturally comprise genes encoding such activities, it is advantageously transformed with one or more exogenous genes encoding said enzymes; it is understood that said exogenous genes are also overexpressed. As explained above, the exogenous genes are preferably synthetic genes that have been codon-optimized for expression in the microorganism of interest.
According to a preferred embodiment, the microorganism of the invention is further genetically modified for:
iii) providing reducing power and/or energy for 2,4-dihydroxybutyric acid production and microorganism growth from a carbon source other than xylose, and/or
iv) at least partially, preferably totally, inhibiting carbon catabolite repression.
Genetic modification iii) as described above is particularly advantageous because it optimizes 2,4-DHB production: reducing power and/or energy are provided by an alternative carbon source rather than by xylose, so that xylose is exclusively converted into 2,4-dihydroxybutyric acid.
In order to provide reducing power and/or energy for 1,2,4-butanetriol production and microorganism growth from a carbon source other than xylose, the microorganism according to the invention is preferably genetically modified by deleting and/or attenuating genes encoding enzymes that use xylose or other metabolites and thereby generate a flux competing with the conversion of xylose into 2,4-dihydroxybutyric acid.
Therefore, according to a preferred embodiment of the invention, genetic modification iii) is the attenuation and/or deletion of at least one of the following genes:
- a gene encoding a xylose isomerase (e.g. the xylA gene from Escherichia coli),
- a gene encoding a xylulose kinase (e.g. the xylB gene from Escherichia coli),
- a gene encoding a 3-deoxy-D-glycero-pentulosonate aldolase (e.g. the yjhH and/or yagE genes from Escherichia coli),
- a gene encoding a keto-acid dehydrogenase (e.g. the yiaE and/or ycdW genes from Escherichia coli), and
- any combination thereof.
These genes, and methods for deleting or attenuating them, are fully disclosed in US2013/0203141, incorporated herein by reference.
The nucleotide sequences of the genes mentioned above, or the amino acid sequences encoded by said genes, are also described in Table 1 below according to their accession number and version in databases and/or according to their sequence identifier.
If the microorganism of the invention is genetically modified as described above, it preferably uses a carbon source other than xylose to provide the reducing power and/or energy necessary for 2,4-DHB production and for its growth. In this regard, it is well known in the art that some microorganisms prefer specific carbon sources over others. Notably, most naturally occurring microorganisms, Escherichia coli in particular, prefer glucose over other sugars, even though they are able to metabolize a range of monosaccharides (Kim et al., 2010). However, some microorganisms cannot co-utilize glucose and xylose efficiently. Indeed, when both sugars are present in the culture medium, a diauxic fermentation pattern can occur: this regulatory mechanism is known in the art as catabolite repression and was reported for glucose and lactose by Monod in the 1940s. It has been studied extensively in Enterobacteriaceae. In Escherichia coli, catabolite repression is generally understood to occur as follows: glucose enters the cell through the glucose-specific permease EIICBglc (encoded by ptsG) of the phosphoenolpyruvate (PEP):carbohydrate phosphotransferase system (PTS); during glucose transport, dephosphorylated EIIAglc (encoded by crr) lowers the level of cyclic AMP (cAMP), which in turn limits the availability of the complex formed by the catabolite repression protein (CRP) and cAMP (cAMP-CRP); consequently, all genes that are involved in the catabolism of sugars other than glucose and are generally regulated by cAMP-CRP are repressed. Catabolite repression by carbohydrates that are not transported by the PTS has also been reported in the literature, but is not yet fully understood (Hogema et al., 1997).
Therefore, in a preferred embodiment, genetic modification iv), which allows inhibition of carbon catabolite repression, is selected from at least one of the following:
- deleting a gene encoding a glucose permease of the phosphotransferase system,
- deleting a gene encoding the phosphocarrier Hpr protein,
- expressing genes and/or an operon involved in a sugar importer system, preferably from a constitutive or inducible promoter that is not regulated by cAMP-CRP, wherein said sugar is a carbon source other than xylose,
- expressing a gene encoding a xylose transporter, such as a symporter or an ABC transporter, preferably from a constitutive or inducible promoter that is not regulated by cAMP-CRP,
- overexpressing a gene encoding a glucose symporter,
- overexpressing a gene encoding a glucose facilitator,
- overexpressing a gene encoding a glucokinase,
- modulating the expression of a gene involved in cAMP levels, preferably a gene encoding an adenylate cyclase,
- modulating the expression of a gene encoding a CRP and/or a CRP-like protein,
- expressing a gene encoding a cAMP-independent CRP protein, preferably from a constitutive or inducible promoter that is not regulated by cAMP-CRP, and
- any combination thereof.
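The repression mechanism described above, and the way some of the listed modifications bypass it, can be pictured with a deliberately crude Boolean model. This is a toy sketch, not a quantitative description of Escherichia coli regulation: it assumes that glucose uptake through the PTS is the only thing that lowers cAMP, and all function and parameter names are hypothetical.

```python
def secondary_sugar_genes_on(glucose_present,
                             pts_glucose_uptake=True,
                             camp_independent_crp=False,
                             promoter_needs_camp_crp=True):
    """Toy model: are the genes for a non-glucose sugar (e.g. xylose) expressed?

    glucose_present         -- glucose is available in the medium
    pts_glucose_uptake      -- glucose enters through the PTS (False models a ptsG/ptsH deletion)
    camp_independent_crp    -- CRP variant that is active without cAMP
    promoter_needs_camp_crp -- False models a promoter not regulated by cAMP-CRP
    """
    # Glucose transport through the PTS lowers the cAMP level.
    camp_high = not (glucose_present and pts_glucose_uptake)
    # Wild-type CRP needs cAMP; a cAMP-independent variant does not.
    crp_active = camp_high or camp_independent_crp
    # A promoter outside cAMP-CRP control is expressed regardless.
    return crp_active or not promoter_needs_camp_crp
```

In this simplified picture the unmodified strain switches the secondary-sugar genes off whenever glucose is taken up through the PTS, whereas removing PTS glucose uptake, using a cAMP-independent CRP, or using a promoter outside cAMP-CRP control each restores expression. A strain lacking PTS glucose uptake still needs another route for glucose import and phosphorylation, which is what the glucose symporter, facilitator and glucokinase items in the list address.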
More preferably, in order to co-utilize xylose and glucose, the deletion of a gene encoding the glucose permease of the phosphotransferase system and/or the phosphocarrier Hpr protein is advantageously combined with the overexpression of a gene encoding a glucose permease or a glucose facilitator, together with the overexpression of a gene encoding a glucokinase.
Genes encoding the proteins described above are well known in the art:
- genes encoding a glucose permease of the phosphotransferase system include, but are not limited to, the ptsG gene from Escherichia coli,
- genes encoding the phosphocarrier Hpr protein (one of the two sugar-non-specific protein components of the phosphoenolpyruvate:sugar phosphotransferase system, or sugar PTS) include, but are not limited to, the ptsH gene from Escherichia coli,
- genes and operons involved in a sugar importer system (PTS or other systems) include, but are not limited to, the lacY gene from Escherichia coli for lactose assimilation, the malFG genes from Escherichia coli for maltose assimilation, and the scrKYABR or cscBKAR operons, from Salmonella typhimurium and Escherichia coli respectively, for sucrose assimilation,
- genes encoding a xylose transporter include, but are not limited to, the xylFGH operon from Escherichia coli and the xylE and araE genes from Escherichia coli,
- genes encoding a glucose symporter include, but are not limited to, the galP gene from Escherichia coli,
- genes encoding a glucose facilitator include, but are not limited to, the glf gene from Zymomonas mobilis,
- genes encoding a glucokinase include, but are not limited to, the glk gene from Escherichia coli,
- genes encoding an adenylate cyclase include, but are not limited to, the cyaA gene from Escherichia coli,
- genes encoding a CRP and/or a CRP-like protein include, but are not limited to, the crp gene from Escherichia coli and the ccpA gene from Bacillus subtilis and other Firmicutes,
- genes encoding a cAMP-independent CRP protein have been described by Hogema et al. (1997) and include, but are not limited to, genes encoding a CRP protein mutated in two regions: residues 53 to 62, corresponding to the cAMP-binding domain of the CRP protein, and residues 141 to 148, corresponding to the proximal border of the DNA-binding domain (helix-turn-helix). Examples of such mutants include CRP mutants Asp to His at residue 53, Ser to Phe at residue 62, Gly to Asp at residue 141, Arg to Asp at residue 142, Leu to Arg at residue 148, and any combination thereof.
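The CRP substitutions listed above can be written in conventional one-letter notation and assigned to the two regions named in the text. The sketch below only restates the residues and regions given in this description; the one-letter amino acid codes are the standard ones, and the helper names are hypothetical.

```python
# Standard one-letter codes for the amino acids that occur in the listed mutations.
THREE_TO_ONE = {"Asp": "D", "His": "H", "Ser": "S", "Phe": "F",
                "Gly": "G", "Arg": "R", "Leu": "L"}

# Regions of the CRP protein as given in the description (inclusive residue ranges).
REGIONS = {
    "cAMP-binding domain": range(53, 63),                        # residues 53 to 62
    "proximal border of the DNA-binding domain": range(141, 149),  # residues 141 to 148
}

# (wild-type residue, position, mutant residue), as listed in the description.
MUTATIONS = [
    ("Asp", 53, "His"),
    ("Ser", 62, "Phe"),
    ("Gly", 141, "Asp"),
    ("Arg", 142, "Asp"),
    ("Leu", 148, "Arg"),
]


def short_name(wild_type, position, mutant):
    """Format a substitution in one-letter notation, e.g. ('Asp', 53, 'His') -> 'D53H'."""
    return f"{THREE_TO_ONE[wild_type]}{position}{THREE_TO_ONE[mutant]}"


def region_of(position):
    """Return the named region containing the residue, or None if it lies outside both."""
    for name, residues in REGIONS.items():
        if position in residues:
            return name
    return None


if __name__ == "__main__":
    for wt, pos, mut in MUTATIONS:
        print(short_name(wt, pos, mut), "->", region_of(pos))
```

Two of the listed substitutions fall in the cAMP-binding region and three in the region bordering the DNA-binding domain, consistent with the two mutated regions the description attributes to Hogema et al. (1997).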
More preferably, in order to co-utilize xylose and glucose in Escherichia coli, the deletion of the ptsG or ptsH gene can be combined with the overexpression of the galP or glf gene, together with the overexpression of the glk gene.
The nucleotide sequences of the genes mentioned above, or the amino acid sequences encoded by said genes, are described in Table 1 below according to their accession number and version in databases and/or according to their sequence identifier.
As explained above, when endogenous genes encoding the above activities are already present in the microorganism of interest, they are advantageously overexpressed in said microorganism. If, however, the microorganism does not naturally comprise genes encoding such activities, it is advantageously transformed with one or more exogenous genes encoding said enzymes; it is understood that said exogenous genes are also overexpressed. As explained above, the exogenous genes are preferably synthetic genes that have been codon-optimized for expression in the microorganism of interest. Conversely, endogenous genes can be deleted, or their expression attenuated, according to any of the methods described above.
The carbon source other than xylose according to the invention is as defined above and comprises carbohydrates of 3, 6 or 12 carbon atoms, or any combination thereof. Preferably, the carbon source other than xylose is selected from the group consisting of glycerol, glucose, galactose, fructose, lactose, maltose, sucrose, and any combination thereof.
Most preferably, the carbon source other than xylose used in the invention is glucose.
Alternatively, the carbon source other than xylose can be sucrose. Accordingly, in a preferred embodiment of the invention, the microorganism comprises a further genetic modification allowing it to use sucrose as the carbon source other than xylose, as described in WO2012/004247, incorporated herein by reference. To this end, the microorganism preferably comprises functional genes encoding a PTS sucrose utilization system and/or a non-PTS sucrose utilization system.
A PTS sucrose utilization system is a sucrose utilization system based on the transport of sucrose by a phosphoenolpyruvate (PEP)-dependent sucrose phosphotransferase system (sucrose-PTS). A phosphotransferase system couples the transport of a sugar (e.g. sucrose or glucose) with the phosphorylation of that sugar, using PEP as the phosphate donor. After transport into the cell, sucrose-phosphate is cleaved by an invertase into glucose-6-phosphate and fructose. Fructose is then phosphorylated into fructose-6-phosphate by a fructokinase. The genes encoding this PTS sucrose utilization system can be controlled by a regulatory protein.
A non-PTS sucrose utilization system is a sucrose utilization system based on the transport of sucrose by a system independent of phosphoenolpyruvate. After transport into the cell, sucrose is cleaved by an invertase into glucose and fructose. Fructose is then phosphorylated into fructose-6-phosphate by a fructokinase, and glucose is phosphorylated into glucose-6-phosphate by a glucokinase. The genes encoding this non-PTS sucrose utilization system can be controlled by a regulatory protein.
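The two sucrose utilization routes described above end in the same pair of hexose phosphates but differ in where the phosphate groups are added. A compact way to see this is to tabulate each route, as in the following sketch (the function name `sucrose_route` and the dictionary keys are illustrative assumptions; the steps themselves are those stated in the two paragraphs above):

```python
def sucrose_route(system):
    """Summarize the steps of a sucrose utilization system ('PTS' or 'non-PTS')."""
    if system == "PTS":
        return {
            "uptake": "PEP-dependent sucrose-PTS (sucrose is phosphorylated during transport)",
            "pep_dependent": True,
            "invertase_products": ("glucose-6-phosphate", "fructose"),
            "kinase_steps": ("fructokinase",),
            "end_products": ("glucose-6-phosphate", "fructose-6-phosphate"),
        }
    if system == "non-PTS":
        return {
            "uptake": "PEP-independent sucrose transport",
            "pep_dependent": False,
            "invertase_products": ("glucose", "fructose"),
            "kinase_steps": ("fructokinase", "glucokinase"),
            "end_products": ("glucose-6-phosphate", "fructose-6-phosphate"),
        }
    raise ValueError(f"unknown sucrose utilization system: {system!r}")


if __name__ == "__main__":
    for name in ("PTS", "non-PTS"):
        route = sucrose_route(name)
        print(name, "->", route["end_products"], "via", route["kinase_steps"])
```

The PTS route needs only a fructokinase after the invertase step because the glucose moiety is already phosphorylated on entry, while the non-PTS route needs both a fructokinase and a glucokinase.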
In a particularly preferred embodiment of the invention, the microorganism naturally expresses, or has been genetically modified to express, the genes of the scrKYABR operon from Salmonella, i.e. the scrK gene encoding a fructokinase, the scrY gene encoding a porin, the scrA gene encoding the IIBC protein, the scrB gene encoding a sucrose-6-P invertase and the scrR gene encoding a repressor, and any combination thereof. The conjugative plasmid pUR400 bearing scrKYABR can be used to transform the microorganism. These genes can be expressed in the microorganism individually, in any combination comprising at least two of them, or all together. The scrR gene can preferably be omitted.
In a particularly preferred embodiment of the invention, the microorganism naturally expresses, or has been genetically modified to express, the genes from Escherichia coli strain EC3132, i.e. the genes of the cscBKAR operon encoding a sucrose:proton symport transport system (cscB gene), a fructokinase (cscK gene), an invertase (cscA gene) and a sucrose-specific repressor (cscR gene). These genes can be expressed in the microorganism individually, in any combination comprising at least two of them, or all together. The cscR gene can preferably be omitted. Homologous genes from other organisms can also be used.
In a further preferred embodiment, the microorganism of the invention comprises a further genetic modification of at least one gene involved in the production of NADPH as a source of reducing power. Indeed, reductases such as dehydrogenases require reducing power to be available in the microorganism, particularly in the form of NADPH. Strategies for increasing NADPH availability in the cell are well known in the art; they have notably been reviewed by Lee et al. (2013) and are also described in US8088620, WO2012/055798 and EP14305691.9, incorporated herein by reference.
According to the invention, the genetic modification for improving NADPH production, and therefore NADPH availability in the microorganism, is preferably selected from:
- overexpressing a gene or an operon encoding a membrane-bound transhydrogenase,
- deleting or attenuating a gene encoding a soluble transhydrogenase,
- overexpressing a gene encoding an NADPH-generating glyceraldehyde-3-phosphate dehydrogenase,
- deleting or attenuating a gene encoding a phosphoglucose isomerase,
- deleting or attenuating a gene encoding a phosphofructokinase,
- overexpressing a gene encoding a glucose-6-phosphate dehydrogenase,
- overexpressing a mutant gene encoding a lipoamide dehydrogenase capable of generating NADPH,
- overexpressing a gene encoding a bifunctional NAD(P)H-hydrate repair enzyme, and
- any combination thereof.
The deletion or attenuation of a gene encoding a phosphofructokinase is more preferably combined with the overexpression of a gene encoding a glucose-6-phosphate dehydrogenase, in order to increase the NADPH flux through the pentose phosphate pathway.
More preferably, the genetic modification for improving NADPH production is selected from:
- overexpressing a gene encoding a membrane-bound transhydrogenase,
- deleting or attenuating a gene encoding a phosphoglucose isomerase and/or a soluble transhydrogenase, and
- overexpressing a gene encoding an NADPH-generating glyceraldehyde-3-phosphate dehydrogenase.
Genes encoding the proteins described above are well known in the art:
- genes or operons encoding a membrane-bound transhydrogenase include, but are not limited to, the pntAB operon from Escherichia coli, as notably described in WO2012/055798A1,
- genes encoding a soluble transhydrogenase include, but are not limited to, the udhA gene from Escherichia coli,
- genes encoding an NADPH-generating glyceraldehyde-3-phosphate dehydrogenase include, but are not limited to, gapN from Streptococcus mutans (as described by Centeno-Leija et al., 2013), which can be used, for example, to replace the endogenous gapA gene from Escherichia coli,
- genes encoding a phosphoglucose isomerase include, but are not limited to, the pgi gene from Escherichia coli,
- genes encoding a phosphofructokinase include, but are not limited to, the pfkA gene from Escherichia coli, as notably described in WO2005/047498,
- genes encoding a glucose-6-phosphate dehydrogenase include, but are not limited to, the zwf gene from Escherichia coli, as notably described by Lim et al. (2002),
- mutant genes encoding a lipoamide dehydrogenase capable of generating NADPH include, but are not limited to, the mutant lpd gene (lpd*) from Escherichia coli, as notably described by Bocanegra et al. (1993), and
- genes encoding a bifunctional NAD(P)H-hydrate repair enzyme include, but are not limited to, the yjeF gene from Escherichia coli, as notably described by Marbaix et al. (2011).
In E. coli, if the pfkA gene is deleted or attenuated, this genetic modification is preferably combined with overexpression of the zwf gene.
More preferably, the genetic modification for improving NADPH production is selected from:
- overexpression of the pntAB operon from E. coli,
- deletion of the pfkA gene and/or the udhA gene from E. coli,
- substitution of the gapA gene from E. coli with the gapN gene from Streptococcus mutans.
The nucleotide sequences of the genes mentioned above, or the amino acid sequences encoded by these genes, are identified by their accession number and version in databases and/or by their sequence identifier, as described in Table 1 below.
As explained above, when endogenous genes encoding the above activities are already present in the microorganism of interest, they are advantageously overexpressed in said microorganism. However, if the microorganism does not naturally comprise genes encoding such activities, it is advantageously transformed with one or more genes encoding said enzymes; these exogenous genes are then also considered to be overexpressed. As explained above, an exogenous gene is preferably a synthetic gene that has been codon-optimized for expression in the microorganism of interest. Conversely, endogenous genes can be deleted, or their expression attenuated, according to any of the methods described above.
Table 1: Genes and proteins of the invention (ND = not disclosed)
In another aspect, the present invention relates to a method for producing 2,4-dihydroxybutyric acid, comprising:
a) culturing the genetically modified microorganism of the invention as described above in a culture medium comprising xylose, under fermentation conditions allowing the conversion of xylose into 2,4-dihydroxybutyric acid, and
b) recovering the 2,4-dihydroxybutyric acid from said culture medium.
Fermentation media and carbon sources are well known in the art. According to the invention, the terms "fermentation process", "fermentation" and "culture" are used interchangeably and refer to the experimental conditions allowing the growth of a given microorganism. The microorganism is generally grown in fermenters with an appropriate growth medium adapted to the microorganism being used.
An "appropriate culture medium" means herein a medium (e.g. a sterile liquid medium) comprising nutrients essential or beneficial to the maintenance and/or growth of the microorganism, such as carbon sources or carbon substrates; nitrogen sources, for example peptone, yeast extract, meat extract, malt extract, urea, ammonium sulfate, ammonium chloride, ammonium nitrate and ammonium phosphate; phosphorus sources, for example monopotassium phosphate or dipotassium phosphate; trace elements (e.g. metal salts), for example magnesium salts, cobalt salts and/or manganese salts; as well as growth factors such as amino acids and vitamins.
According to a preferred embodiment, the culture medium of step a) further comprises a carbon source other than xylose, which is preferably a carbohydrate.
Said carbohydrate may be selected from the group consisting of: monosaccharides such as glucose, fructose, mannose, arabinose, galactose and the like; disaccharides such as sucrose, cellobiose, maltose, lactose and the like; oligosaccharides such as raffinose, stachyose, maltodextrins and the like; polysaccharides such as cellulose, hemicellulose, starch and the like; methanol, formaldehyde and glycerol. Particularly preferred carbohydrates according to the invention comprise 3, 6 or 12 carbon atoms, and are more preferably glycerol, glucose, fructose, galactose, lactose, maltose, sucrose and any combination thereof. Most preferably, the carbohydrate is glucose. Alternatively, the carbohydrate may be sucrose.
In a preferred embodiment of the invention, the carbon source (preferably the carbohydrate) is derived from a renewable feedstock, such as plant biomass.
Those skilled in the art can readily determine the culture conditions necessary for growing the microorganisms according to the invention. In particular, it is well known that bacteria can be fermented at a temperature between 20°C and 55°C, preferably between 25°C and 40°C. More particularly, E. coli can be cultured at a temperature between about 30°C and about 37°C.
The method of the invention can be carried out as a batch process, a fed-batch process or a continuous process, and under aerobic, micro-aerobic or anaerobic conditions.
A fermentation "under aerobic conditions" means that oxygen is supplied to the culture by dissolving gas into the liquid phase of the culture. This can be achieved by (1) sparging an oxygen-containing gas (e.g. air) into the liquid phase, or (2) shaking the vessel containing the culture medium so that the oxygen contained in the headspace is transferred into the liquid phase. The main advantage of fermentation under aerobic conditions is that the presence of oxygen as an electron acceptor improves the capacity of the strain to produce more energy in the form of ATP for cellular processes, thereby improving the general metabolism of the strain.
Micro-aerobic conditions can be used herein and are defined as culture conditions in which a low percentage of oxygen is dissolved into the liquid phase (e.g. using a gas mixture containing between 0.1% and 10% oxygen, completed to 100% with nitrogen).
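As a purely illustrative aid, the composition of such a gas mixture can be sketched as follows. The function name and the error handling are assumptions made for this example; only the 0.1% to 10% oxygen range and the completion with nitrogen come from the definition above.

```python
def nitrogen_percent(oxygen_percent):
    """Return the nitrogen share of a micro-aerobic gas mixture.

    The mixture is completed to 100% with nitrogen. The accepted oxygen
    range (0.1% to 10%) is the example range given in the text; the
    function itself is a hypothetical helper.
    """
    if not 0.1 <= oxygen_percent <= 10:
        raise ValueError("oxygen share outside the 0.1-10% example range")
    return 100 - oxygen_percent


# Example: a mixture containing 5% oxygen is completed with 95% nitrogen.
print(nitrogen_percent(5))
```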
By contrast, "anaerobic conditions" are defined as culture conditions in which no oxygen is supplied to the culture medium. Strictly anaerobic conditions can be obtained by sparging an inert gas such as nitrogen into the culture medium to remove traces of other gases. Nitrate can be used as an electron acceptor to improve ATP production by the strain and to improve its metabolism.
According to a preferred embodiment, the method of the invention further comprises a step c) of purifying the 2,4-DHB of step b). Methods for purifying carboxylic acids are well known in the art and are notably described in WO2002/090312, WO2002/022544 and WO2012/153042, which are incorporated herein by reference. Preferably, after its recovery from the culture medium, the 2,4-DHB is purified by means known to those skilled in the art, in particular by ion-exchange chromatography, such as ion-exchange resins or fixed or simulated moving bed ion-exchange resins.
Brief description of the drawings
Fig. 1 shows the metabolic pathway for the conversion of D-xylose into 2,4-DHB (*: genes optionally deleted or attenuated).
Examples
The present invention is further defined in the following examples. It should be understood that these examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above disclosure and these examples, those skilled in the art can make various changes to the invention to adapt it to various uses and conditions without modifying its essential means.
Exemplary genes and enzymes required for constructing microorganisms with these capabilities are described, together with methods for cloning and transformation, for monitoring product formation, and for using the engineered microorganisms for production.
In particular, the examples show modified Escherichia coli (E. coli) strains, but these modifications can easily be performed in other microorganisms of the same family.
E. coli belongs to the Enterobacteriaceae family, whose members are Gram-negative, rod-shaped, non-spore-forming and typically 1-5 μm in length. Most members have flagella used for motility, but a few genera are non-motile. Many members of this family are a normal part of the gut flora found in the intestines of humans and other animals, while others are found in water or soil, or are parasites on a variety of different animals and plants. E. coli is one of the most important model organisms, but other important members of the Enterobacteriaceae family include Klebsiella (in particular Klebsiella terrigena, Klebsiella planticola or Klebsiella oxytoca), Pantoea and Salmonella.
Protocols
Several protocols for constructing 2,4-dihydroxybutyric acid producing strains are described in the following examples.
Protocol 1 (chromosomal modification by homologous recombination, selection of recombinants and excision of the antibiotic cassette) and Protocol 2 (transduction with phage P1) used in this invention have been fully described in patent application WO2013/001055.
Protocol 3: Construction of recombinant plasmids
Recombinant DNA technology is well described and known to those skilled in the art.
Briefly, DNA fragments are PCR-amplified using oligonucleotides (which the person skilled in the art is able to design) and, as template, genomic DNA from E. coli K-12 MG1655 or from another microorganism (depending on the target gene to be amplified). The DNA fragments and the selected plasmid are digested with compatible restriction enzymes, ligated, and then transformed into competent cells. Transformants are analysed and the recombinant plasmids of interest are verified by DNA sequencing.
Example 1: Production of 2,4-DHB from xylose by overproduction of enzymes from a new pathway in E. coli MG1655 - Construction of strains 1 to 11.
The E. coli strain MG1655 was modified to produce 2,4-dihydroxybutyric acid (2,4-DHB) from D-xylose using the pathway shown in Fig. 1. The work sought to maximize the carbon flux towards 2,4-DHB production, and therefore to remove the enzymes involved in other xylose-consuming routes or in the conversion of 2,4-DHB intermediate compounds, which represent a loss of product.
In addition to the genes naturally expressed by E. coli (the yjhG gene of sequence SEQ ID NO: 64 and the yagF gene of sequence SEQ ID NO: 65, encoding xylonate dehydratases; and the adhP gene of sequence SEQ ID NO: 66 and the yqhD gene of sequence SEQ ID NO: 67, encoding alcohol dehydrogenases-NAD(P)H-dependent 1,2,4-butanetriol dehydrogenases), the pCL1920 plasmid (Lerner & Inouye, 1990) was used to express, each with the artificial Ptrc promoter (sequence given in patent WO 2007/0770441) and its own ribosome binding site, the genes encoding the following enzymes: the xylose dehydrogenase and the xylonolactonase from Caulobacter crescentus (xdh [CC0821 on CauloCyc, SEQ ID NO: 62] and xylC [CC0820 on CauloCyc, SEQ ID NO: 63], respectively); the 3-deoxy-D-glycero-pentulosonate decarboxylase from Pseudomonas putida (mdlC gene of sequence SEQ ID NO: 68); the alcohol dehydrogenase-NAD(P)+-dependent 2,4-dihydroxybutanal dehydrogenase from E. coli or Clostridium butyricum (the fucO gene from E. coli encoding the enzyme of sequence SEQ ID NO: 41, or the dhaT gene from Clostridium butyricum encoding the enzyme of sequence SEQ ID NO: 36, respectively); and the aldehyde dehydrogenase from E. coli (the puuC gene from E. coli encoding the enzyme of sequence SEQ ID NO: 46, or the aldB gene from E. coli encoding the enzyme of sequence SEQ ID NO: 55). In practice, the genes xdh, xylC and mdlC were first cloned on the pCL1920 plasmid, giving plasmid pDHB0001; fucO or dhaT and puuC or aldB were then cloned sequentially on plasmid pDHB0001, generating four combinations and giving plasmids pDHB0002 to pDHB0005, as described in the table below.
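The four gene combinations mentioned above can be enumerated with the short sketch below. It is given for illustration only: the variable and function names are assumptions, and the sketch deliberately does not assign a plasmid name (pDHB0002 to pDHB0005) to each combination, since that correspondence is given only in the table referred to above.

```python
from itertools import product

# Genes cloned first on pCL1920, giving plasmid pDHB0001.
BACKBONE_GENES = ("xdh", "xylC", "mdlC")
# One gene of each pair is then added to pDHB0001.
ALCOHOL_DEHYDROGENASE_GENES = ("fucO", "dhaT")
ALDEHYDE_DEHYDROGENASE_GENES = ("puuC", "aldB")


def plasmid_gene_sets():
    """Return the four gene sets obtained by adding one gene of each pair."""
    pairs = product(ALCOHOL_DEHYDROGENASE_GENES, ALDEHYDE_DEHYDROGENASE_GENES)
    return [BACKBONE_GENES + pair for pair in pairs]


for gene_set in plasmid_gene_sets():
    print(" + ".join(gene_set))
```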
It is same described in 2000 using by Datsenko&Wanner in addition, in order to block natural wood sugar decomposition metabolic pathway Source restructuring is tactful and according to scheme 1, and encoding D-xylose isomerase (sequence SEQ is lacked from Escherichia coli MG1655 chromosomes ID NO:69 xylA genes) and D- Xylulokinases (sequence SEQ ID NO:70 xylB genes) gene.More accurately, In order to lack xylAB operators, with SEQ ID NO:104 and SEQ ID NO:105 primer, which produces, carries antibiotic resistance base Because of the PCR primer surrounded by the sequence homologous with the upstream and downstream area of xylAB operators together with FRT sites, and led Enter in the previously inverted Escherichia coli MG1655 for there are a pKD46 carriers.Turned with appropriate oligonucleotides checking antibiotic resistance Change body, and be bacterial strain 1 by the Strain Designation of reservation.
In order to avoid the degradation of 3-deoxy-D-glycero-pentulosonic acid (DGP), the genes encoding the keto-acid dehydrogenases (yiaE gene of sequence SEQ ID NO: 73 and ycdW gene of sequence SEQ ID NO: 74) and the DGP aldolases (yjhH gene of sequence SEQ ID NO: 71 and yagE gene of sequence SEQ ID NO: 72) were also deleted using the same homologous recombination strategy. More precisely, to delete the yjhH gene (SEQ ID NO: 71), a PCR product carrying an antibiotic resistance gene together with FRT sites, flanked by sequences homologous to the regions upstream and downstream of the yjhH gene, was generated with the primers of SEQ ID NO: 106 and SEQ ID NO: 107 and introduced into strain 1 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 2. Then, to delete the yagE gene (SEQ ID NO: 72), a PCR product carrying an antibiotic resistance gene together with FRT sites, flanked by sequences homologous to the regions upstream and downstream of the yagE gene, was generated with the primers of SEQ ID NO: 108 and SEQ ID NO: 109 and introduced into strain 2 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 3. Next, to delete the yiaE gene (SEQ ID NO: 73), a PCR product carrying an antibiotic resistance gene together with FRT sites, flanked by sequences homologous to the regions upstream and downstream of the yiaE gene, was generated with the primers of SEQ ID NO: 110 and SEQ ID NO: 111 and introduced into strain 3 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 4. Finally, to delete the ycdW gene (SEQ ID NO: 74), a PCR product carrying an antibiotic resistance gene together with FRT sites, flanked by sequences homologous to the regions upstream and downstream of the ycdW gene, was generated with the primers of SEQ ID NO: 112 and SEQ ID NO: 113 and introduced into strain 4 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 5.
Since the yjhH gene belongs, together with yjhG (SEQ ID NO: 64) and yjhI, to the yjhIHG operon, the sequences homologous to the regions upstream and downstream of yjhH must be chosen so as not to disturb the expression of the surrounding genes. The same applies to the yagE gene (SEQ ID NO: 72), which belongs to the yagEF operon.
Finally, in order to suppress catabolite repression, the glucose phosphotransferase enzyme IIBC(Glc) encoded by the ptsG gene (SEQ ID NO: 75) was deleted with the primers of SEQ ID NO: 114 and SEQ ID NO: 115, using the homologous recombination strategy described by Datsenko & Wanner, 2000 (according to Protocol 1), as described in patent application EP 14305691.9, in particular in Example 2 of that document. The appropriate PCR product was introduced into strain 5 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 6.
A different antibiotic resistance gene was used each time, chosen among the kanamycin, chloramphenicol, gentamicin, tetracycline, blasticidin and spectinomycin resistance genes. Before using strain 6, the antibiotic cassettes were removed from the ΔxylAB, ΔyjhH, ΔyagE, ΔyiaE and ΔycdW modifications using the Flp recombinase as described by Datsenko & Wanner, 2000 (according to Protocol 1), giving rise to strain 7.
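The logic of the two steps used repeatedly above (replacement of a target locus by a resistance cassette flanked by FRT sites, then removal of the cassette by the Flp recombinase) can be illustrated with the toy model below. This is only a sketch under stated assumptions: the chromosome is represented as a list of feature names rather than nucleotides, the names "upstream", "downstream" and "abR" are placeholders, the gene order is not meant to reflect the real locus, and the functions are hypothetical helpers, not part of any protocol cited here.

```python
def replace_with_cassette(chromosome, targets, marker="abR"):
    """Replace a contiguous run of target genes by FRT-marker-FRT.

    Models the homologous recombination step: the flanking features stay
    in place and only the targeted genes are exchanged for the cassette.
    """
    start = chromosome.index(targets[0])
    end = start + len(targets)
    if chromosome[start:end] != list(targets):
        raise ValueError("target genes are not contiguous in this order")
    return chromosome[:start] + ["FRT", marker, "FRT"] + chromosome[end:]


def excise_marker(chromosome):
    """Model Flp-mediated excision: FRT-marker-FRT collapses to one FRT scar.

    Toy rule: any single feature located between two FRT entries is
    treated as a resistance marker and removed.
    """
    result = []
    i = 0
    while i < len(chromosome):
        is_cassette = (
            chromosome[i] == "FRT"
            and i + 2 < len(chromosome)
            and chromosome[i + 2] == "FRT"
        )
        if is_cassette:
            result.append("FRT")
            i += 3
        else:
            result.append(chromosome[i])
            i += 1
    return result


# Placeholder locus; the order of features is illustrative only.
locus = ["upstream", "xylA", "xylB", "downstream"]
with_cassette = replace_with_cassette(locus, ["xylA", "xylB"])
print(with_cassette)
print(excise_marker(with_cassette))
```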
Then, each of the plasmids pDHB0002, pDHB0003, pDHB0004 or pDHB0005 described above was introduced into strain 7, giving rise to strains 8 to 11, as described in the table below.
Example 2: Improving 2,4-dihydroxybutyric acid yield by increasing the NADPH availability of the producing strain - Construction of strains 12 to 20.
The butanetriol dehydrogenases encoded by adhP and yqhD require reducing power to be available in the organism, particularly in the form of NADPH; genes involved in NADPH production were therefore overexpressed.
The membrane-bound proton-translocating pyridine nucleotide transhydrogenase encoded by the pntAB operon (SEQ ID NO: 94) was overproduced by replacing the endogenous promoter and ribosome binding site of the pntA gene of E. coli MG1655 with the inducible Ptrc promoter (from plasmid pTRC99A, Amersham Pharmacia) and the defined ribosome binding site RBS120 (from the RBS Calculator software), as described in patent application EP 14305691.9, in particular in Example 4 of that document (referred to herein as SEQ ID NO: 116). The appropriate PCR product described in patent application EP 14305691.9 was introduced into strain 7 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 12.
The soluble pyridine nucleotide transhydrogenase encoded by the sthA gene (previously known as udhA, sequence SEQ ID NO: 95) was deleted with the primers of SEQ ID NO: 117 and SEQ ID NO: 118, using the homologous recombination strategy described by Datsenko & Wanner, 2000 (according to Protocol 1) and as described in patent application WO2012/055798, in particular in Example 2 of that application. The appropriate PCR product was introduced into strain 12 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 13.
The phosphofructokinase encoded by the pfkA gene (SEQ ID NO: 99) was deleted with the primers of SEQ ID NO: 119 and SEQ ID NO: 120, using the homologous recombination strategy described by Datsenko & Wanner, 2000 (according to Protocol 1) and as described in patent application EP 14305691.9, in particular in Example 5 of that application. The appropriate PCR product was introduced into strain 13 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 14.
The gapA gene from E. coli (SEQ ID NO: 97), encoding the NAD-dependent glyceraldehyde-3-phosphate dehydrogenase, was replaced by the gapN gene from Streptococcus mutans (SEQ ID NO: 96), encoding an NADP-dependent glyceraldehyde-3-phosphate dehydrogenase (Centeno-Leija et al., 2013), using the homologous recombination strategy described by Datsenko & Wanner, 2000 (according to Protocol 1). The substitution of the gapA gene by the gapN gene was carried out as follows: the gapA promoter and ribosome binding site were simultaneously replaced by the thermo-inducible PR01 promoter (SEQ ID NO: 121) with its cognate thermolabile repressor CI857 (SEQ ID NO: 122) (as described in Example 1 of patent application EP 2532751) and by the defined ribosome binding site RBS150 (from the RBS Calculator software, SEQ ID NO: 123). More precisely, a PCR product carrying the CI857 gene, the PR01 promoter, the RBS150 ribosome binding site, the gapN gene from Streptococcus mutans and an antibiotic resistance gene together with FRT sites, flanked by sequences homologous to the regions upstream and downstream of the gapA gene, was generated with the primers of SEQ ID NO: 124 and SEQ ID NO: 125 and introduced into strain 14 previously transformed with the pKD46 vector. The antibiotic-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 15.
Before using strain 15, the antibiotic cassettes were removed from the pntAB, udhA, pfkA and gapA loci using the Flp recombinase according to Protocol 1, giving rise to strain 16. Finally, each of the plasmids pDHB0002, pDHB0003, pDHB0004 or pDHB0005 described above was introduced into strain 16, giving rise to strains 17 to 20, as described in the table below.
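For orientation, the successive chromosomal modifications of Examples 1 and 2 can be tracked with the bookkeeping sketch below. The genotype labels (for instance "gapA replaced by gapN") and the function name are notations chosen for this illustration, not official strain designations; strains 7 and 16 are not listed separately because they carry the same modifications as strains 6 and 15 after removal of the antibiotic cassettes.

```python
# Modifications accumulated in strains 1 to 6 (Example 1), carried by strain 7.
STRAIN_7_BACKGROUND = ("ΔxylAB", "ΔyjhH", "ΔyagE", "ΔyiaE", "ΔycdW", "ΔptsG")

# Modifications added one at a time in Example 2.
EXAMPLE_2_STEPS = (
    ("strain 12", "Ptrc-RBS120-pntAB"),
    ("strain 13", "ΔsthA"),
    ("strain 14", "ΔpfkA"),
    ("strain 15", "gapA replaced by gapN"),
)


def build_lineage(background, steps):
    """Return a dict mapping each strain name to its cumulative modifications."""
    lineage = {}
    genotype = tuple(background)
    for strain_name, modification in steps:
        genotype = genotype + (modification,)
        lineage[strain_name] = genotype
    return lineage


for name, genotype in build_lineage(STRAIN_7_BACKGROUND, EXAMPLE_2_STEPS).items():
    print(name, ":", ", ".join(genotype))
```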
Example 3: Overproduction of enzymes involved in sucrose metabolism in recombinant E. coli strains producing 2,4-dihydroxybutyric acid - Construction of strains 21 to 26.
In order to avoid catabolite repression when the strains are cultured on a mixture of sucrose and xylose, the natural promoter of the xylFGH operon encoding the xylose importer (XylF, the periplasmic protein; XylG, the ATP-binding subunit; XylH, the membrane subunit) was replaced by an artificial promoter using the homologous recombination strategy described by Datsenko & Wanner, 2000 (according to Protocol 1). More precisely, a PCR product carrying the artificial Ptrc promoter (SEQ ID NO: 126) and a kanamycin resistance gene together with FRT sites, flanked by sequences homologous to the xylF gene and to the region upstream of the xylF gene on the chromosome, was generated with the primers of SEQ ID NO: 127 and SEQ ID NO: 128 and introduced into strain 16 previously transformed with the pKD46 vector. The kanamycin-resistant transformants were verified with the appropriate oligonucleotides, and the strain retained was named strain 21.
Before using strain 21, the antibiotic cassette was removed from the xylFGH operon using the Flp recombinase according to Protocol 1, giving rise to strain 22.
In order to allow E. coli to grow on sucrose, the genes scrK encoding the fructokinase, scrY encoding the porin, scrA encoding protein IIBC, scrB encoding the sucrose-6-phosphate invertase and scrR encoding the repressor, from plasmid pUR400 of Salmonella typhimurium (Schmidt et al., 1982, SEQ ID NO: 81), were cloned under their natural promoters on a pBBR1MCS plasmid (Kovach et al., 1995), giving plasmid pBDO0003.
Finally, plasmid pBDO0003, in combination with one of the plasmids pDHB0002, pDHB0003, pDHB0004 or pDHB0005, was introduced into strain 22, giving rise to strains 23 to 26, as described in the table below.
Example 4: Microbial production of 2,4-dihydroxybutyric acid.
Production strains were evaluated in 500 mL baffled Erlenmeyer flasks using modified M9 medium (Anderson, 1946) supplemented with 30 g/L MOPS, 20 g/L D-xylose and 10 g/L glucose, and adjusted to pH 6.8. Spectinomycin at a concentration of 50 mg/L and tetracycline at 5 mg/L were added to the precultures and cultures when necessary. Precultures were grown at 37°C in LB medium (Sigma). After 24 hours of growth, they were used to inoculate 50 mL cultures of modified M9 medium to an OD600 of about 0.2, which were grown at 30°C and 200 rpm. When the sugars in the culture medium were exhausted, the cultures were centrifuged and the broth was analysed for 2,4-dihydroxybutyric acid by LC-MS/MS.
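By way of illustration, the amounts of supplements needed for one 50 mL culture follow directly from the concentrations given above. The sketch below is only a convenience calculation: the function and dictionary names are assumptions, and it covers only the supplements and antibiotics named in this example, not the base composition of the modified M9 medium.

```python
CULTURE_VOLUME_ML = 50

# Concentrations stated in Example 4.
SUPPLEMENTS_G_PER_L = {"MOPS": 30, "D-xylose": 20, "glucose": 10}
ANTIBIOTICS_MG_PER_L = {"spectinomycin": 50, "tetracycline": 5}


def amount_for_volume(concentration_per_litre, volume_ml):
    """Convert a per-litre concentration into the amount for volume_ml."""
    return concentration_per_litre * volume_ml / 1000


def flask_recipe(volume_ml=CULTURE_VOLUME_ML):
    """Return supplement masses (g) and antibiotic masses (mg) for one culture."""
    recipe = {}
    for name, g_per_l in SUPPLEMENTS_G_PER_L.items():
        recipe[name + " (g)"] = amount_for_volume(g_per_l, volume_ml)
    for name, mg_per_l in ANTIBIOTICS_MG_PER_L.items():
        recipe[name + " (mg)"] = amount_for_volume(mg_per_l, volume_ml)
    return recipe


for component, amount in flask_recipe().items():
    print(component, amount)
```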
The 2,4-dihydroxybutyric acid titer is expressed as follows:
Table 2: 2,4-Dihydroxybutyric acid (DHB) titer for each strain evaluated in Erlenmeyer flasks.
The symbol (-) means a production of 2,4-DHB below 5 mg/L, the symbol (+) corresponds to a production of 2,4-DHB between 5 and 15 mg/L, and the symbol (++) corresponds to a production of 2,4-DHB above 15 mg/L.
Strain              DHB titer
MG1655              -
Strains 8 to 11     +
Strains 17 to 20    ++
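The symbol scale used in Table 2 can be written as a small classification function. This is a sketch for illustration: the function name is an assumption, and the treatment of a titer of exactly 15 mg/L as (+) follows from reading "above 15 mg/L" strictly.

```python
def titer_symbol(titer_mg_per_l):
    """Map a 2,4-DHB titer in mg/L to the symbol scale of Table 2."""
    if titer_mg_per_l < 0:
        raise ValueError("a titer cannot be negative")
    if titer_mg_per_l < 5:
        return "-"
    if titer_mg_per_l <= 15:
        return "+"
    return "++"


# Hypothetical titers, chosen only to show each class.
for value in (2, 10, 20):
    print(value, "mg/L ->", titer_symbol(value))
```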
Production of 2,4-dihydroxybutyric acid was observed for all recombinant strains. The yield was enhanced by the genetic modifications made to increase the NADPH pool in the cell.
In Example 5, under the same conditions as above, strains 23 to 26 were tested on a mixture of xylose and sucrose (not shown in Table 2), and their 2,4-DHB production was similar to that of strains 8 to 11.
References
-Altschul S,Gish W,Miller W,Myers E,Lipman DJ,J.Mol.Biol,1990,215(3): 403-410
-Anderson,Proc.Natl.Acad.Sci.USA.,1946,32:120-128.
-Bocanegra J,Scrutton N,Perham R,Biochemistry,1993,32(11):2737-2740.
-Carrier T&Keasling J,Biotechnol Prog.,1999,15(1):58-64.
-Centeno-Leija S,Utrilla J,Flores N,Rodriguez A,Gosset G,Martinez A, Antonie Van Leeuwenhoek.,2013,104(6),913-924.
-Datsenko KA&Wanner BL,Proc Natl Acad Sci U S A.,2000,97:6640-6645.
-Davis JJ&Olsen GJ.,Mol.Biol.Evol.,2011,28(1):211-221.
-Deml L,Bojak A,Steck S,Graf M,Wild J,Schirmbeck R,Wolf H,Wagner R., 2011,J.Virol.,75(22):10991-11001.
-Graf M,Bojak A,Deml L,Bieler K,Wolf H,Wagner R.,2000,J.Virol.,74(22):10822-10826.
-Hogema et al.,Molecular Microbiology,1997,24:857-867.
-Kim J.H. et al.,Appl.Microbiol.Biotechnol.,2010,88:1077-1085.
-Kovach ME,Elzer PH,Hill DS,Robertson GT,Farris MA,Roop RM,Peterson KM.,Gene,1995,166(1):175-6.
-Lee S,McCormick M,Lippard S,Cho U,Nature,2013,494:380-384.
- Lerner C.G. and Inouye M., Nucleic Acids Research, 1990,18 (15):4631.
-Lim S,Jung Y,Shin H,Lee Y,J Biosci Bioeng.,2002,93(6):543-549.
-Marbaix A,Noel G,Detroux A,Vertommen D,Schaftingen E,Linster C,J Biol Chem.,2011,286(48):41246-41252.
-Salis H,Methods Enzymol.,2011,498:19-42.
-Sambrook et al.,Molecular cloning:A laboratory manual,4th edition,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,NY,USA,2012.
-Schmidt K,Schupfner M,Schmitt R,J.Bacteriol.,1982,151:68-76.
- Segel IH, Enzyme kinetics, (1993), John Wiley&Sons, pp.44-54 and 100-112.
Sequence listing
<110>Metabolic Explorer (METABOLIC EXPLORER SA)
<120>The microorganism through modification for the optimization production of 2,4- dihydroxy butyric acid
<130> D34775
<140> PCT IB2015/000618
<141> 2015-04-07
<160> 128
<170>PatentIn 3.5 editions
<210> 1
<211> 369
<212> PRT
<213>Escherichia coli (Escherichia coli)
<400> 1
Met Lys Ser Arg Ala Ala Val Ala Phe Ala Pro Gly Lys Pro Leu Glu
1 5 10 15
Ile Val Glu Ile Asp Val Ala Pro Pro Lys Lys Gly Glu Val Leu Ile
20 25 30
Lys Val Thr His Thr Gly Val Cys His Thr Asp Ala Phe Thr Leu Ser
35 40 45
Gly Asp Asp Pro Glu Gly Val Phe Pro Val Val Leu Gly His Glu Gly
50 55 60
Ala Gly Val Val Val Glu Val Gly Glu Gly Val Thr Ser Val Lys Pro
65 70 75 80
Gly Asp His Val Ile Pro Leu Tyr Thr Ala Glu Cys Gly Glu Cys Glu
85 90 95
Phe Cys Arg Ser Gly Lys Thr Asn Leu Cys Val Ala Val Arg Glu Thr
100 105 110
Gln Gly Lys Gly Leu Met Pro Asp Gly Thr Thr Arg Phe Ser Tyr Asn
115 120 125
Gly Gln Pro Leu Tyr His Tyr Met Gly Cys Ser Thr Phe Ser Glu Tyr
130 135 140
Thr Val Val Ala Glu Val Ser Leu Ala Lys Ile Asn Pro Glu Ala Asn
145 150 155 160
His Glu His Val Cys Leu Leu Gly Cys Gly Val Thr Thr Gly Ile Gly
165 170 175
Ala Val His Asn Thr Ala Lys Val Gln Pro Gly Asp Ser Val Ala Val
180 185 190
Phe Gly Leu Gly Ala Ile Gly Leu Ala Val Val Gln Gly Ala Arg Gln
195 200 205
Ala Lys Ala Gly Arg Ile Ile Ala Ile Asp Thr Asn Pro Lys Lys Phe
210 215 220
Asp Leu Ala Arg Arg Phe Gly Ala Thr Asp Cys Ile Asn Pro Asn Asp
225 230 235 240
Tyr Asp Lys Pro Ile Lys Asp Val Leu Leu Asp Ile Asn Lys Trp Gly
245 250 255
Ile Asp His Thr Phe Glu Cys Ile Gly Asn Val Asn Val Met Arg Ala
260 265 270
Ala Leu Glu Ser Ala His Arg Gly Trp Gly Gln Ser Val Ile Ile Gly
275 280 285
Val Ala Val Ala Gly Gln Glu Ile Ser Thr Arg Pro Phe Gln Leu Val
290 295 300
Thr Gly Arg Val Trp Lys Gly Ser Ala Phe Gly Gly Val Lys Gly Arg
305 310 315 320
Ser Gln Leu Pro Gly Met Val Glu Asp Ala Met Lys Gly Asp Ile Asp
325 330 335
Leu Glu Pro Phe Val Thr His Thr Met Ser Leu Asp Glu Ile Asn Asp
340 345 350
Ala Phe Asp Leu Met His Glu Gly Lys Ser Ile Arg Thr Val Ile Arg
355 360 365
Tyr
<210> 2
<211> 336
<212> PRT
<213>Escherichia coli (Escherichia coli)
<400> 2
Met Lys Ala Ala Val Val Thr Lys Asp His His Val Asp Val Thr Tyr
1 5 10 15
Lys Thr Leu Arg Ser Leu Lys His Gly Glu Ala Leu Leu Lys Met Glu
20 25 30
Cys Cys Gly Val Cys His Thr Asp Leu His Val Lys Asn Gly Asp Phe
35 40 45
Gly Asp Lys Thr Gly Val Ile Leu Gly His Glu Gly Ile Gly Val Val
50 55 60
Ala Glu Val Gly Pro Gly Val Thr Ser Leu Lys Pro Gly Asp Arg Ala
65 70 75 80
Ser Val Ala Trp Phe Tyr Glu Gly Cys Gly His Cys Glu Tyr Cys Asn
85 90 95
Ser Gly Asn Glu Thr Leu Cys Arg Ser Val Lys Asn Ala Gly Tyr Ser
100 105 110
Val Asp Gly Gly Met Ala Glu Glu Cys Ile Val Val Ala Asp Tyr Ala
115 120 125
Val Lys Val Pro Asp Gly Leu Asp Ser Ala Ala Ala Ser Ser Ile Thr
130 135 140
Cys Ala Gly Val Thr Thr Tyr Lys Ala Val Lys Leu Ser Lys Ile Arg
145 150 155 160
Pro Gly Gln Trp Ile Ala Ile Tyr Gly Leu Gly Gly Leu Gly Asn Leu
165 170 175
Ala Leu Gln Tyr Ala Lys Asn Val Phe Asn Ala Lys Val Ile Ala Ile
180 185 190
Asp Val Asn Asp Glu Gln Leu Lys Leu Ala Thr Glu Met Gly Ala Asp
195 200 205
Leu Ala Ile Asn Ser His Thr Glu Asp Ala Ala Lys Ile Val Gln Glu
210 215 220
Lys Thr Gly Gly Ala His Ala Ala Val Val Thr Ala Val Ala Lys Ala
225 230 235 240
Ala Phe Asn Ser Ala Val Asp Ala Val Arg Ala Gly Gly Arg Val Val
245 250 255
Ala Val Gly Leu Pro Pro Glu Ser Met Ser Leu Asp Ile Pro Arg Leu
260 265 270
Val Leu Asp Gly Ile Glu Val Val Gly Ser Leu Val Gly Thr Arg Gln
275 280 285
Asp Leu Thr Glu Ala Phe Gln Phe Ala Ala Glu Gly Lys Val Val Pro
290 295 300
Lys Val Ala Leu Arg Pro Leu Ala Asp Ile Asn Thr Ile Phe Thr Glu
305 310 315 320
Met Glu Glu Gly Lys Ile Arg Gly Arg Met Val Ile Asp Phe Arg His
325 330 335
<210> 3
<211> 383
<212> PRT
<213>Escherichia coli (Escherichia coli)
<400> 3
Met Ala Ala Ser Thr Phe Phe Ile Pro Ser Val Asn Val Ile Gly Ala
1 5 10 15
Asp Ser Leu Thr Asp Ala Met Asn Met Met Ala Asp Tyr Gly Phe Thr
20 25 30
Arg Thr Leu Ile Val Thr Asp Asn Met Leu Thr Lys Leu Gly Met Ala
35 40 45
Gly Asp Val Gln Lys Ala Leu Glu Glu Arg Asn Ile Phe Ser Val Ile
50 55 60
Tyr Asp Gly Thr Gln Pro Asn Pro Thr Thr Glu Asn Val Ala Ala Gly
65 70 75 80
Leu Lys Leu Leu Lys Glu Asn Asn Cys Asp Ser Val Ile Ser Leu Gly
85 90 95
Gly Gly Ser Pro His Asp Cys Ala Lys Gly Ile Ala Leu Val Ala Ala
100 105 110
Asn Gly Gly Asp Ile Arg Asp Tyr Glu Gly Val Asp Arg Ser Ala Lys
115 120 125
Pro Gln Leu Pro Met Ile Ala Ile Asn Thr Thr Ala Gly Thr Ala Ser
130 135 140
Glu Met Thr Arg Phe Cys Ile Ile Thr Asp Glu Ala Arg His Ile Lys
145 150 155 160
Met Ala Ile Val Asp Lys His Val Thr Pro Leu Leu Ser Val Asn Asp
165 170 175
Ser Ser Leu Met Ile Gly Met Pro Lys Ser Leu Thr Ala Ala Thr Gly
180 185 190
Met Asp Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ser Ile Ala Ala
195 200 205
Thr Pro Ile Thr Asp Ala Cys Ala Leu Lys Ala Val Thr Met Ile Ala
210 215 220
Glu Asn Leu Pro Leu Ala Val Glu Asp Gly Ser Asn Ala Lys Ala Arg
225 230 235 240
Glu Ala Met Ala Tyr Ala Gln Phe Leu Ala Gly Met Ala Phe Asn Asn
245 250 255
Ala Ser Leu Gly Tyr Val His Ala Met Ala His Gln Leu Gly Gly Phe
260 265 270
Tyr Asn Leu Pro His Gly Val Cys Asn Ala Val Leu Leu Pro His Val
275 280 285
Gln Val Phe Asn Ser Lys Val Ala Ala Ala Arg Leu Arg Asp Cys Ala
290 295 300
Ala Ala Met Gly Val Asn Val Thr Gly Lys Asn Asp Ala Glu Gly Ala
305 310 315 320
Glu Ala Cys Ile Asn Ala Ile Arg Glu Leu Ala Lys Lys Val Asp Ile
325 330 335
Pro Ala Gly Leu Arg Asp Leu Asn Val Lys Glu Glu Asp Phe Ala Val
340 345 350
Leu Ala Thr Asn Ala Leu Lys Asp Ala Cys Gly Phe Thr Asn Pro Ile
355 360 365
Gln Ala Thr His Glu Glu Ile Val Ala Ile Tyr Arg Ala Ala Met
370 375 380
<210> 4
<211> 395
<212> PRT
<213> Escherichia coli
<400> 4
Met Gln Asn Glu Leu Gln Thr Ala Leu Phe Gln Ala Phe Asp Thr Leu
1 5 10 15
Asn Leu Gln Arg Val Lys Thr Phe Ser Val Pro Pro Val Thr Leu Cys
20 25 30
Gly Pro Gly Ser Val Ser Ser Cys Gly Gln Gln Ala Gln Thr Arg Gly
35 40 45
Leu Lys His Leu Phe Val Met Ala Asp Ser Phe Leu His Gln Ala Gly
50 55 60
Met Thr Ala Gly Leu Thr Arg Ser Leu Thr Val Lys Gly Ile Ala Met
65 70 75 80
Thr Leu Trp Pro Cys Pro Val Gly Glu Pro Cys Ile Thr Asp Val Cys
85 90 95
Ala Ala Val Ala Gln Leu Arg Glu Ser Gly Cys Asp Gly Val Ile Ala
100 105 110
Phe Gly Gly Gly Ser Val Leu Asp Ala Ala Lys Ala Val Thr Leu Leu
115 120 125
Val Thr Asn Pro Asp Ser Thr Leu Ala Glu Met Ser Glu Thr Ser Val
130 135 140
Leu Gln Pro Arg Leu Pro Leu Ile Ala Ile Pro Thr Thr Ala Gly Thr
145 150 155 160
Gly Ser Glu Thr Thr Asn Val Thr Val Ile Ile Asp Ala Val Ser Gly
165 170 175
Arg Lys Gln Val Leu Ala His Ala Ser Leu Met Pro Asp Val Ala Ile
180 185 190
Leu Asp Ala Ala Leu Thr Glu Gly Val Pro Ser His Val Thr Ala Met
195 200 205
Thr Gly Ile Asp Ala Leu Thr His Ala Ile Glu Ala Tyr Ser Ala Leu
210 215 220
Asn Ala Thr Pro Phe Thr Asp Ser Leu Ala Ile Gly Ala Ile Ala Met
225 230 235 240
Ile Gly Lys Ser Leu Pro Lys Ala Val Gly Tyr Gly His Asp Leu Ala
245 250 255
Ala Arg Glu Ser Met Leu Leu Ala Ser Cys Met Ala Gly Met Ala Phe
260 265 270
Ser Ser Ala Gly Leu Gly Leu Cys His Ala Met Ala His Gln Pro Gly
275 280 285
Ala Ala Leu His Ile Pro His Gly Leu Ala Asn Ala Met Leu Leu Pro
290 295 300
Thr Val Met Glu Phe Asn Arg Met Val Cys Arg Glu Arg Phe Ser Gln
305 310 315 320
Ile Gly Arg Ala Leu Arg Thr Lys Lys Ser Asp Asp Arg Asp Ala Ile
325 330 335
Asn Ala Val Ser Glu Leu Ile Ala Glu Val Gly Ile Gly Lys Arg Leu
340 345 350
Gly Asp Val Gly Ala Thr Ser Ala His Tyr Gly Ala Trp Ala Gln Ala
355 360 365
Ala Leu Glu Asp Ile Cys Leu Arg Ser Asn Pro Arg Thr Ala Ser Leu
370 375 380
Glu Gln Ile Val Gly Leu Tyr Ala Ala Ala Gln
385 390 395
<210> 5
<211> 387
<212> PRT
<213> Escherichia coli
<400> 5
Met Asn Asn Phe Asn Leu His Thr Pro Thr Arg Ile Leu Phe Gly Lys
1 5 10 15
Gly Ala Ile Ala Gly Leu Arg Glu Gln Ile Pro His Asp Ala Arg Val
20 25 30
Leu Ile Thr Tyr Gly Gly Gly Ser Val Lys Lys Thr Gly Val Leu Asp
35 40 45
Gln Val Leu Asp Ala Leu Lys Gly Met Asp Val Leu Glu Phe Gly Gly
50 55 60
Ile Glu Pro Asn Pro Ala Tyr Glu Thr Leu Met Asn Ala Val Lys Leu
65 70 75 80
Val Arg Glu Gln Lys Val Thr Phe Leu Leu Ala Val Gly Gly Gly Ser
85 90 95
Val Leu Asp Gly Thr Lys Phe Ile Ala Ala Ala Ala Asn Tyr Pro Glu
100 105 110
Asn Ile Asp Pro Trp His Ile Leu Gln Thr Gly Gly Lys Glu Ile Lys
115 120 125
Ser Ala Ile Pro Met Gly Cys Val Leu Thr Leu Pro Ala Thr Gly Ser
130 135 140
Glu Ser Asn Ala Gly Ala Val Ile Ser Arg Lys Thr Thr Gly Asp Lys
145 150 155 160
Gln Ala Phe His Ser Ala His Val Gln Pro Val Phe Ala Val Leu Asp
165 170 175
Pro Val Tyr Thr Tyr Thr Leu Pro Pro Arg Gln Val Ala Asn Gly Val
180 185 190
Val Asp Ala Phe Val His Thr Val Glu Gln Tyr Val Thr Lys Pro Val
195 200 205
Asp Ala Lys Ile Gln Asp Arg Phe Ala Glu Gly Ile Leu Leu Thr Leu
210 215 220
Ile Glu Asp Gly Pro Lys Ala Leu Lys Glu Pro Glu Asn Tyr Asp Val
225 230 235 240
Arg Ala Asn Val Met Trp Ala Ala Thr Gln Ala Leu Asn Gly Leu Ile
245 250 255
Gly Ala Gly Val Pro Gln Asp Trp Ala Thr His Met Leu Gly His Glu
260 265 270
Leu Thr Ala Met His Gly Leu Asp His Ala Gln Thr Leu Ala Ile Val
275 280 285
Leu Pro Ala Leu Trp Asn Glu Lys Arg Asp Thr Lys Arg Ala Lys Leu
290 295 300
Leu Gln Tyr Ala Glu Arg Val Trp Asn Ile Thr Glu Gly Ser Asp Asp
305 310 315 320
Glu Arg Ile Asp Ala Ala Ile Ala Ala Thr Arg Asn Phe Phe Glu Gln
325 330 335
Leu Gly Val Pro Thr His Leu Ser Asp Tyr Gly Leu Asp Gly Ser Ser
340 345 350
Ile Pro Ala Leu Leu Lys Lys Leu Glu Glu His Gly Met Thr Gln Leu
355 360 365
Gly Glu Asn His Asp Ile Thr Leu Asp Val Ser Arg Arg Ile Tyr Glu
370 375 380
Ala Ala Arg
385
<210> 6
<211> 284
<212> PRT
<213> Escherichia coli
<400> 6
Met Gln Gln Lys Met Ile Gln Phe Ser Gly Asp Val Ser Leu Pro Ala
1 5 10 15
Val Gly Gln Gly Thr Trp Tyr Met Gly Glu Asp Ala Ser Gln Arg Lys
20 25 30
Thr Glu Val Ala Ala Leu Arg Ala Gly Ile Glu Leu Gly Leu Thr Leu
35 40 45
Ile Asp Thr Ala Glu Met Tyr Ala Asp Gly Gly Ala Glu Lys Val Val
50 55 60
Gly Glu Ala Leu Thr Gly Leu Arg Glu Lys Val Phe Leu Val Ser Lys
65 70 75 80
Val Tyr Pro Trp Asn Ala Gly Gly Gln Lys Ala Ile Asn Ala Cys Glu
85 90 95
Ala Ser Leu Arg Arg Leu Asn Thr Asp Tyr Leu Asp Leu Tyr Leu Leu
100 105 110
His Trp Ser Gly Ser Phe Ala Phe Glu Glu Thr Val Ala Ala Met Glu
115 120 125
Lys Leu Ile Ala Gln Gly Lys Ile Arg Arg Trp Gly Val Ser Asn Leu
130 135 140
Asp Tyr Ala Asp Met Gln Glu Leu Trp Gln Leu Pro Gly Gly Asn Gln
145 150 155 160
Cys Ala Thr Asn Gln Val Leu Tyr His Leu Gly Ser Arg Gly Ile Glu
165 170 175
Tyr Asp Leu Leu Pro Trp Cys Gln Gln Gln Gln Met Pro Val Met Ala
180 185 190
Tyr Ser Pro Leu Ala Gln Ala Gly Arg Leu Arg Asn Gly Leu Leu Lys
195 200 205
Asn Ala Val Val Asn Glu Ile Ala His Ala His Asn Ile Ser Ala Ala
210 215 220
Gln Val Leu Leu Ala Trp Val Ile Ser His Gln Gly Val Met Ala Ile
225 230 235 240
Pro Lys Ala Ala Thr Ile Ala His Val Gln Gln Asn Ala Ala Val Leu
245 250 255
Glu Val Glu Leu Ser Ser Ala Glu Leu Ala Met Leu Asp Lys Ala Tyr
260 265 270
Pro Ala Pro Lys Gly Lys Thr Ala Leu Asp Met Val
275 280
<210> 7
<211> 298
<212> PRT
<213> Escherichia coli
<400> 7
Met Val Gln Arg Ile Thr Ile Ala Pro Gln Gly Pro Glu Phe Ser Arg
1 5 10 15
Phe Val Met Gly Tyr Trp Arg Leu Met Asp Trp Asn Met Ser Ala Arg
20 25 30
Gln Leu Val Ser Phe Ile Glu Glu His Leu Asp Leu Gly Val Thr Thr
35 40 45
Val Asp His Ala Asp Ile Tyr Gly Gly Tyr Gln Cys Glu Ala Ala Phe
50 55 60
Gly Glu Ala Leu Lys Leu Ala Pro His Leu Arg Glu Arg Met Glu Ile
65 70 75 80
Val Ser Lys Cys Gly Ile Ala Thr Thr Ala Arg Glu Glu Asn Val Ile
85 90 95
Gly His Tyr Ile Thr Asp Arg Asp His Ile Ile Lys Ser Ala Glu Gln
100 105 110
Ser Leu Ile Asn Leu Ala Thr Asp His Leu Asp Leu Leu Leu Ile His
115 120 125
Arg Pro Asp Pro Leu Met Asp Ala Asp Glu Val Ala Asp Ala Phe Lys
130 135 140
His Leu His Gln Ser Gly Lys Val Arg His Phe Gly Val Ser Asn Phe
145 150 155 160
Thr Pro Ala Gln Phe Ala Leu Leu Gln Ser Arg Leu Pro Phe Thr Leu
165 170 175
Ala Thr Asn Gln Val Glu Ile Ser Pro Val His Gln Pro Leu Leu Leu
180 185 190
Asp Gly Thr Leu Asp Gln Leu Gln Gln Leu Arg Val Arg Pro Met Ala
195 200 205
Trp Ser Cys Leu Gly Gly Gly Arg Leu Phe Asn Asp Asp Tyr Phe Gln
210 215 220
Pro Leu Arg Asp Glu Leu Ala Val Val Ala Glu Glu Leu Asn Ala Gly
225 230 235 240
Ser Ile Glu Gln Val Val Tyr Ala Trp Val Leu Arg Leu Pro Ser Gln
245 250 255
Pro Leu Pro Ile Ile Gly Ser Gly Lys Ile Glu Arg Val Arg Ala Ala
260 265 270
Val Glu Ala Glu Thr Leu Lys Met Thr Arg Gln Gln Trp Phe Arg Ile
275 280 285
Arg Lys Ala Ala Leu Gly Tyr Asp Val Pro
290 295
<210> 8
<211> 122
<212> PRT
<213> Escherichia coli
<400> 8
Met Trp Leu Leu Asp Gln Trp Ala Glu Arg His Ile Ala Glu Ala Gln
1 5 10 15
Ala Lys Gly Glu Phe Asp Asn Leu Ala Gly Ser Gly Glu Pro Leu Ile
20 25 30
Leu Asp Asp Asp Ser His Val Pro Pro Glu Leu Arg Ala Gly Tyr Arg
35 40 45
Leu Leu Lys Asn Ala Gly Cys Leu Pro Pro Glu Leu Glu Gln Arg Arg
50 55 60
Glu Ala Ile Gln Leu Leu Asp Ile Leu Lys Gly Ile Arg His Asp Asp
65 70 75 80
Pro Gln Tyr Gln Glu Val Ser Arg Arg Leu Ser Leu Leu Glu Leu Lys
85 90 95
Leu Arg Gln Ala Gly Leu Ser Thr Asp Phe Leu Arg Gly Asp Tyr Ala
100 105 110
Asp Lys Leu Leu Asp Lys Ile Asn Asp Asn
115 120
<210> 9
<211> 412
<212> PRT
<213> Escherichia coli
<400> 9
Met Lys Ala Leu Thr Tyr His Gly Pro His His Val Gln Val Glu Asn
1 5 10 15
Val Pro Asp Pro Gly Val Glu Gln Ala Asp Asp Ile Ile Leu Arg Ile
20 25 30
Thr Ala Thr Ala Ile Cys Gly Ser Asp Leu His Leu Tyr Arg Gly Lys
35 40 45
Ile Pro Gln Val Lys His Gly Asp Ile Phe Gly His Glu Phe Met Gly
50 55 60
Glu Val Val Glu Thr Gly Lys Asp Val Lys Asn Leu Gln Lys Gly Asp
65 70 75 80
Arg Val Val Ile Pro Phe Val Ile Ala Cys Gly Asp Cys Phe Phe Cys
85 90 95
Arg Leu Gln Gln Tyr Ala Ala Cys Glu Asn Thr Asn Ala Gly Lys Gly
100 105 110
Ala Ala Leu Asn Lys Lys Gln Ile Pro Ala Pro Ala Ala Leu Phe Gly
115 120 125
Tyr Ser His Leu Tyr Gly Gly Val Pro Gly Gly Gln Ala Glu Tyr Val
130 135 140
Arg Val Pro Lys Gly Asn Val Gly Pro Phe Lys Val Pro Pro Leu Leu
145 150 155 160
Ser Asp Asp Lys Ala Leu Phe Leu Ser Asp Ile Leu Pro Thr Ala Trp
165 170 175
Gln Ala Ala Lys Asn Ala Gln Ile Gln Gln Gly Ser Ser Val Ala Val
180 185 190
Tyr Gly Ala Gly Pro Val Gly Leu Leu Thr Ile Ala Cys Ala Arg Leu
195 200 205
Leu Gly Ala Glu Gln Ile Phe Val Val Asp His His Pro Tyr Arg Leu
210 215 220
His Phe Ala Ala Asp Arg Tyr Gly Ala Ile Pro Ile Asn Phe Asp Glu
225 230 235 240
Asp Ser Asp Pro Ala Gln Ser Ile Ile Glu Gln Thr Ala Gly His Arg
245 250 255
Gly Val Asp Ala Val Ile Asp Ala Val Gly Phe Glu Ala Lys Gly Ser
260 265 270
Thr Thr Glu Thr Val Leu Thr Asn Leu Lys Leu Glu Gly Ser Ser Gly
275 280 285
Lys Ala Leu Arg Gln Cys Ile Ala Ala Val Arg Arg Gly Gly Ile Val
290 295 300
Ser Val Pro Gly Val Tyr Ala Gly Phe Ile His Gly Phe Leu Phe Gly
305 310 315 320
Asp Ala Phe Asp Lys Gly Leu Ser Phe Lys Met Gly Gln Thr His Val
325 330 335
His Ala Trp Leu Gly Glu Leu Leu Pro Leu Ile Glu Lys Gly Leu Leu
340 345 350
Lys Pro Glu Glu Ile Val Thr His Tyr Met Pro Phe Glu Glu Ala Ala
355 360 365
Arg Gly Tyr Glu Ile Phe Glu Lys Arg Glu Glu Glu Cys Arg Lys Val
370 375 380
Ile Leu Val Pro Gly Ala Gln Ser Ala Glu Ala Ala Gln Lys Ala Val
385 390 395 400
Ser Gly Leu Val Asn Ala Met Pro Gly Gly Thr Ile
405 410
<210> 10
<211> 362
<212> PRT
<213> Escherichia coli
<400> 10
Met Pro His Asn Pro Ile Arg Val Val Val Gly Pro Ala Asn Tyr Phe
1 5 10 15
Ser His Pro Gly Ser Phe Asn His Leu His Asp Phe Phe Thr Asp Glu
20 25 30
Gln Leu Ser Arg Ala Val Trp Ile Tyr Gly Lys Arg Ala Ile Ala Ala
35 40 45
Ala Gln Thr Lys Leu Pro Pro Ala Phe Gly Leu Pro Gly Ala Lys His
50 55 60
Ile Leu Phe Arg Gly His Cys Ser Glu Ser Asp Val Gln Gln Leu Ala
65 70 75 80
Ala Glu Ser Gly Asp Asp Arg Ser Val Val Ile Gly Val Gly Gly Gly
85 90 95
Ala Leu Leu Asp Thr Ala Lys Ala Leu Ala Arg Arg Leu Gly Leu Pro
100 105 110
Phe Val Ala Val Pro Thr Ile Ala Ala Thr Cys Ala Ala Trp Thr Pro
115 120 125
Leu Ser Val Trp Tyr Asn Asp Ala Gly Gln Ala Leu His Tyr Glu Ile
130 135 140
Phe Asp Asp Ala Asn Phe Met Val Leu Val Glu Pro Glu Ile Ile Leu
145 150 155 160
Asn Ala Pro Gln Gln Tyr Leu Leu Ala Gly Ile Gly Asp Thr Leu Ala
165 170 175
Lys Trp Tyr Glu Ala Val Val Leu Ala Pro Gln Pro Glu Thr Leu Pro
180 185 190
Leu Thr Val Arg Leu Gly Ile Asn Asn Ala Gln Ala Ile Arg Asp Val
195 200 205
Leu Leu Asn Ser Ser Glu Gln Ala Leu Ser Asp Gln Gln Asn Gln Gln
210 215 220
Leu Thr Gln Ser Phe Cys Asp Val Val Asp Ala Ile Ile Ala Gly Gly
225 230 235 240
Gly Met Val Gly Gly Leu Gly Asp Arg Phe Thr Arg Val Ala Ala Ala
245 250 255
His Ala Val His Asn Gly Leu Thr Val Leu Pro Gln Thr Glu Lys Phe
260 265 270
Leu His Gly Thr Lys Val Ala Tyr Gly Ile Leu Val Gln Ser Ala Leu
275 280 285
Leu Gly Gln Asp Asp Val Leu Ala Gln Leu Thr Gly Ala Tyr Gln Arg
290 295 300
Phe His Leu Pro Thr Thr Leu Ala Glu Leu Glu Val Asp Ile Asn Asn
305 310 315 320
Gln Ala Glu Ile Asp Lys Val Ile Ala His Thr Leu Arg Pro Val Glu
325 330 335
Ser Ile His Tyr Leu Pro Val Thr Leu Thr Pro Asp Thr Leu Arg Ala
340 345 350
Ala Phe Lys Lys Val Glu Ser Phe Lys Ala
355 360
<210> 11
<211> 347
<212> PRT
<213> Escherichia coli
<400> 11
Met Lys Asn Ser Lys Ala Ile Leu Gln Val Pro Gly Thr Met Lys Ile
1 5 10 15
Ile Ser Ala Glu Ile Pro Val Pro Lys Glu Asp Glu Val Leu Ile Lys
20 25 30
Val Glu Tyr Val Gly Ile Cys Gly Ser Asp Val His Gly Phe Glu Ser
35 40 45
Gly Pro Phe Ile Pro Pro Lys Asp Pro Asn Gln Glu Ile Gly Leu Gly
50 55 60
His Glu Cys Ala Gly Thr Val Val Ala Val Gly Ser Arg Val Arg Lys
65 70 75 80
Phe Lys Pro Gly Asp Arg Val Asn Ile Glu Pro Gly Val Pro Cys Gly
85 90 95
His Cys Arg Tyr Cys Leu Glu Gly Lys Tyr Asn Ile Cys Pro Asp Val
100 105 110
Asp Phe Met Ala Thr Gln Pro Asn Tyr Arg Gly Ala Leu Thr His Tyr
115 120 125
Leu Cys His Pro Glu Ser Phe Thr Tyr Lys Leu Pro Asp Asn Met Asp
130 135 140
Thr Met Glu Gly Ala Leu Val Glu Pro Ala Ala Val Gly Met His Ala
145 150 155 160
Ala Met Leu Ala Asp Val Lys Pro Gly Lys Lys Ile Ile Ile Leu Gly
165 170 175
Ala Gly Cys Ile Gly Leu Met Thr Leu Gln Ala Cys Lys Cys Leu Gly
180 185 190
Ala Thr Glu Ile Ala Val Val Asp Val Leu Glu Lys Arg Leu Ala Met
195 200 205
Ala Glu Gln Leu Gly Ala Thr Val Val Ile Asn Gly Ala Lys Glu Asp
210 215 220
Thr Ile Ala Arg Cys Gln Gln Phe Thr Glu Asp Met Gly Ala Asp Ile
225 230 235 240
Val Phe Glu Thr Ala Gly Ser Ala Val Thr Val Lys Gln Ala Pro Tyr
245 250 255
Leu Val Met Arg Gly Gly Lys Ile Met Ile Val Gly Thr Val Pro Gly
260 265 270
Asp Ser Ala Ile Asn Phe Leu Lys Ile Asn Arg Glu Val Thr Ile Gln
275 280 285
Thr Val Phe Arg Tyr Ala Asn Arg Tyr Pro Val Thr Ile Glu Ala Ile
290 295 300
Ser Ser Gly Arg Phe Asp Val Lys Ser Met Val Thr His Ile Tyr Asp
305 310 315 320
Tyr Arg Asp Val Gln Gln Ala Phe Glu Glu Ser Val Asn Asn Lys Arg
325 330 335
Asp Ile Ile Lys Gly Val Ile Lys Ile Ser Asp
340 345
<210> 12
<211> 358
<212> PRT
<213> Escherichia coli
<400> 12
Met Lys Ala Leu Ala Arg Phe Gly Lys Ala Phe Gly Gly Tyr Lys Met
1 5 10 15
Ile Asp Val Pro Gln Pro Met Cys Gly Pro Glu Asp Val Val Ile Glu
20 25 30
Ile Lys Ala Ala Ala Ile Cys Gly Ala Asp Met Lys His Tyr Asn Val
35 40 45
Asp Ser Gly Ser Asp Glu Phe Asn Ser Ile Arg Gly His Glu Phe Ala
50 55 60
Gly Cys Ile Ala Gln Val Gly Glu Lys Val Lys Asp Trp Lys Val Gly
65 70 75 80
Gln Arg Val Val Ser Asp Asn Ser Gly His Val Cys Gly Val Cys Pro
85 90 95
Ala Cys Glu Gln Gly Asp Phe Leu Cys Cys Thr Glu Lys Val Asn Leu
100 105 110
Gly Leu Asp Asn Asn Thr Trp Gly Gly Gly Phe Ser Lys Tyr Cys Leu
115 120 125
Val Pro Gly Glu Ile Leu Lys Ile His Arg His Ala Leu Trp Glu Ile
130 135 140
Pro Asp Gly Val Asp Tyr Glu Asp Ala Ala Val Leu Asp Pro Ile Cys
145 150 155 160
Asn Ala Tyr Lys Ser Ile Ala Gln Gln Ser Lys Phe Leu Pro Gly Gln
165 170 175
Asp Val Val Val Ile Gly Thr Gly Pro Leu Gly Leu Phe Ser Val Gln
180 185 190
Met Ala Arg Ile Met Gly Ala Val Asn Ile Val Val Val Gly Leu Gln
195 200 205
Glu Asp Val Ala Val Arg Phe Pro Val Ala Lys Glu Leu Gly Ala Thr
210 215 220
Ala Val Val Asn Gly Ser Thr Glu Asp Val Val Ala Arg Cys Gln Gln
225 230 235 240
Ile Cys Gly Lys Asp Asn Leu Gly Leu Val Ile Glu Cys Ser Gly Ala
245 250 255
Asn Ile Ala Leu Lys Gln Ala Ile Glu Met Leu Arg Pro Asn Gly Glu
260 265 270
Val Val Arg Val Gly Met Gly Phe Lys Pro Leu Asp Phe Ser Ile Asn
275 280 285
Asp Ile Thr Ala Trp Asn Lys Ser Ile Ile Gly His Met Ala Tyr Asp
290 295 300
Ser Thr Ser Trp Arg Asn Ala Ile Arg Leu Leu Ala Ser Gly Ala Ile
305 310 315 320
Lys Val Lys Pro Met Ile Thr His Arg Ile Gly Leu Ser Gln Trp Arg
325 330 335
Glu Gly Phe Asp Ala Met Val Asp Lys Thr Ala Ile Lys Val Ile Met
340 345 350
Thr Tyr Asp Phe Asp Glu
355
<210> 13
<211> 390
<212> PRT
<213> Clostridium acetobutylicum
<400> 13
Met Val Asp Phe Glu Tyr Ser Ile Pro Thr Arg Ile Phe Phe Gly Lys
1 5 10 15
Asp Lys Ile Asn Val Leu Gly Arg Glu Leu Lys Lys Tyr Gly Ser Lys
20 25 30
Val Leu Ile Val Tyr Gly Gly Gly Ser Ile Lys Arg Asn Gly Ile Tyr
35 40 45
Asp Lys Ala Val Ser Ile Leu Glu Lys Asn Ser Ile Lys Phe Tyr Glu
50 55 60
Leu Ala Gly Val Glu Pro Asn Pro Arg Val Thr Thr Val Glu Lys Gly
65 70 75 80
Val Lys Ile Cys Arg Glu Asn Gly Val Glu Val Val Leu Ala Ile Gly
85 90 95
Gly Gly Ser Ala Ile Asp Cys Ala Lys Val Ile Ala Ala Ala Cys Glu
100 105 110
Tyr Asp Gly Asn Pro Trp Asp Ile Val Leu Asp Gly Ser Lys Ile Lys
115 120 125
Arg Val Leu Pro Ile Ala Ser Ile Leu Thr Ile Ala Ala Thr Gly Ser
130 135 140
Glu Met Asp Thr Trp Ala Val Ile Asn Asn Met Asp Thr Asn Glu Lys
145 150 155 160
Leu Ile Ala Ala His Pro Asp Met Ala Pro Lys Phe Ser Ile Leu Asp
165 170 175
Pro Thr Tyr Thr Tyr Thr Val Pro Thr Asn Gln Thr Ala Ala Gly Thr
180 185 190
Ala Asp Ile Met Ser His Ile Phe Glu Val Tyr Phe Ser Asn Thr Lys
195 200 205
Thr Ala Tyr Leu Gln Asp Arg Met Ala Glu Ala Leu Leu Arg Thr Cys
210 215 220
Ile Lys Tyr Gly Gly Ile Ala Leu Glu Lys Pro Asp Asp Tyr Glu Ala
225 230 235 240
Arg Ala Asn Leu Met Trp Ala Ser Ser Leu Ala Ile Asn Gly Leu Leu
245 250 255
Thr Tyr Gly Lys Asp Thr Asn Trp Ser Val His Leu Met Glu His Glu
260 265 270
Leu Ser Ala Tyr Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu
275 280 285
Thr Pro Asn Trp Met Glu Tyr Ile Leu Asn Asn Asp Thr Val Tyr Lys
290 295 300
Phe Val Glu Tyr Gly Val Asn Val Trp Gly Ile Asp Lys Glu Lys Asn
305 310 315 320
His Tyr Asp Ile Ala His Gln Ala Ile Gln Lys Thr Arg Asp Tyr Phe
325 330 335
Val Asn Val Leu Gly Leu Pro Ser Arg Leu Arg Asp Val Gly Ile Glu
340 345 350
Glu Glu Lys Leu Asp Ile Met Ala Lys Glu Ser Val Lys Leu Thr Gly
355 360 365
Gly Thr Ile Gly Asn Leu Arg Pro Val Asn Ala Ser Glu Val Leu Gln
370 375 380
Ile Phe Lys Lys Ser Val
385 390
<210> 14
<211> 389
<212> PRT
<213> Clostridium acetobutylicum
<400> 14
Met Leu Ser Phe Asp Tyr Ser Ile Pro Thr Lys Val Phe Phe Gly Lys
1 5 10 15
Gly Lys Ile Asp Val Ile Gly Glu Glu Ile Lys Lys Tyr Gly Ser Arg
20 25 30
Val Leu Ile Val Tyr Gly Gly Gly Ser Ile Lys Arg Asn Gly Ile Tyr
35 40 45
Asp Arg Ala Thr Ala Ile Leu Lys Glu Asn Asn Ile Ala Phe Tyr Glu
50 55 60
Leu Ser Gly Val Glu Pro Asn Pro Arg Ile Thr Thr Val Lys Lys Gly
65 70 75 80
Ile Glu Ile Cys Arg Glu Asn Asn Val Asp Leu Val Leu Ala Ile Gly
85 90 95
Gly Gly Ser Ala Ile Asp Cys Ser Lys Val Ile Ala Ala Gly Val Tyr
100 105 110
Tyr Asp Gly Asp Thr Trp Asp Met Val Lys Asp Pro Ser Lys Ile Thr
115 120 125
Lys Val Leu Pro Ile Ala Ser Ile Leu Thr Leu Ser Ala Thr Gly Ser
130 135 140
Glu Met Asp Gln Ile Ala Val Ile Ser Asn Met Glu Thr Asn Glu Lys
145 150 155 160
Leu Gly Val Gly His Asp Asp Met Arg Pro Lys Phe Ser Val Leu Asp
165 170 175
Pro Thr Tyr Thr Phe Thr Val Pro Lys Asn Gln Thr Ala Ala Gly Thr
180 185 190
Ala Asp Ile Met Ser His Thr Phe Glu Ser Tyr Phe Ser Gly Val Glu
195 200 205
Gly Ala Tyr Val Gln Asp Gly Ile Ala Glu Ala Ile Leu Arg Thr Cys
210 215 220
Ile Lys Tyr Gly Lys Ile Ala Met Glu Lys Thr Asp Asp Tyr Glu Ala
225 230 235 240
Arg Ala Asn Leu Met Trp Ala Ser Ser Leu Ala Ile Asn Gly Leu Leu
245 250 255
Ser Leu Gly Lys Asp Arg Lys Trp Ser Cys His Pro Met Glu His Glu
260 265 270
Leu Ser Ala Tyr Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu
275 280 285
Thr Pro Asn Trp Met Glu Tyr Ile Leu Asn Asp Asp Thr Leu His Lys
290 295 300
Phe Val Ser Tyr Gly Ile Asn Val Trp Gly Ile Asp Lys Asn Lys Asp
305 310 315 320
Asn Tyr Glu Ile Ala Arg Glu Ala Ile Lys Asn Thr Arg Glu Tyr Phe
325 330 335
Asn Ser Leu Gly Ile Pro Ser Lys Leu Arg Glu Val Gly Ile Gly Lys
340 345 350
Asp Lys Leu Glu Leu Met Ala Lys Gln Ala Val Arg Asn Ser Gly Gly
355 360 365
Thr Ile Gly Ser Leu Arg Pro Ile Asn Ala Glu Asp Val Leu Glu Ile
370 375 380
Phe Lys Lys Ser Tyr
385
<210> 15
<211> 386
<212> PRT
<213> Clostridium acetobutylicum
<400> 15
Met Tyr Asn Phe Asp Phe Phe Asn Pro Thr His Ile Val Phe Gly Lys
1 5 10 15
Asp Arg Leu Asn Glu Leu Asp Asn Leu Val Pro Arg Asp Ala Lys Val
20 25 30
Leu Val Leu Tyr Gly Gly Gly Ser Val Lys Lys Phe Gly Thr Leu Glu
35 40 45
Lys Val Ile Asn Gly Leu Gly Asn Arg Gln Val Ile Glu Phe Gly Gly
50 55 60
Ile Glu Pro Asn Pro Gln Phe Thr Thr Leu Met Lys Ala Val Asp Ile
65 70 75 80
Val Lys Lys Glu Asn Ile Asp Phe Leu Leu Ala Val Gly Gly Gly Ser
85 90 95
Val Met Asp Gly Thr Lys Phe Val Ala Leu Ala Ala Tyr Tyr Glu Gly
100 105 110
Asp Asn Ala Ala Glu Ile Leu Tyr Ser Arg Glu Lys Ala Ala Ala Ile
115 120 125
Asn Lys Ala Val Pro Leu Gly Thr Val Val Thr Leu Pro Ala Thr Gly
130 135 140
Ser Glu Met Asn Asn Gly Gly Val Ile Ser Tyr Glu His Gly Lys Tyr
145 150 155 160
Gly Phe Gly Ser Lys Leu Val Phe Pro Lys Phe Ser Val Leu Asp Pro
165 170 175
Thr Leu Thr Tyr Thr Leu Pro Glu Ser Gln Val Ala Asn Gly Val Ala
180 185 190
Asp Thr Phe Val His Val Leu Glu Gln Tyr Ala Thr Phe Lys Ala Glu
195 200 205
Gly Arg Phe Gln Asp Arg Thr Ala Glu Gly Ile Leu Gln Thr Leu Ile
210 215 220
Glu Ile Gly Arg Lys Thr Ile Asp Asn Pro Thr Asp Tyr Asp Thr Arg
225 230 235 240
Ala Asn Leu Val Trp Cys Ala Thr Met Ala Leu Asn Gly Leu Ile Gly
245 250 255
Ala Gly Val Pro Gln Asp Trp Ser Thr His Met Ile Gly His Glu Leu
260 265 270
Thr Ala Met Phe Gly Ile Asp His Gly Lys Thr Leu Ala Ile Ile Leu
275 280 285
Pro Ser Ile Trp Asn Val Met Arg Glu Gln Lys Lys Gly Lys Ile Leu
290 295 300
Gln Tyr Ala Glu Arg Val Leu Gly Ile Thr Glu Gly Asp Asp Asp Ser
305 310 315 320
Arg Ile Asp Leu Ala Ile Leu Arg Thr Arg Glu Phe Phe Glu Ser Leu
325 330 335
Gly Ile Lys Thr His Leu Ser Glu Tyr Gly Val Thr Ala Asp Lys Ile
340 345 350
Asp Asp Ile Val Asn Ala Leu Asp Lys His Gly Met Lys Ala Leu Ser
355 360 365
Glu Thr Gly Ala Ile Thr Leu Glu Val Ser Arg Lys Ile Leu Glu Gly
370 375 380
Ala Met
385
<210> 16
<211> 348
<212> PRT
<213> Saccharomyces cerevisiae
<400> 16
Met Ser Ile Pro Glu Thr Gln Lys Gly Val Ile Phe Tyr Glu Ser His
1 5 10 15
Gly Lys Leu Glu Tyr Lys Asp Ile Pro Val Pro Lys Pro Lys Ala Asn
20 25 30
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
His Ala Trp His Gly Asp Trp Pro Leu Pro Val Lys Leu Pro Leu Val
50 55 60
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Gln
115 120 125
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
Asp Leu Ala Gln Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
Lys Ala Leu Lys Ser Ala Asn Leu Met Ala Gly His Trp Val Ala Ile
165 170 175
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Glu Gly Lys Glu
195 200 205
Glu Leu Phe Arg Ser Ile Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
Glu Lys Asp Ile Val Gly Ala Val Leu Lys Ala Thr Asp Gly Gly Ala
225 230 235 240
His Gly Val Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
Thr Arg Tyr Val Arg Ala Asn Gly Thr Thr Val Leu Val Gly Met Pro
260 265 270
Ala Gly Ala Lys Cys Cys Ser Asp Val Phe Asn Gln Val Val Lys Ser
275 280 285
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
Val Gly Leu Ser Thr Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
Gln Ile Val Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 17
<211> 348
<212> PRT
<213> Saccharomyces cerevisiae
<400> 17
Met Ser Ile Pro Glu Thr Gln Lys Ala Ile Ile Phe Tyr Glu Ser Asn
1 5 10 15
Gly Lys Leu Glu His Lys Asp Ile Pro Val Pro Lys Pro Lys Pro Asn
20 25 30
Glu Leu Leu Ile Asn Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu
35 40 45
His Ala Trp His Gly Asp Trp Pro Leu Pro Thr Lys Leu Pro Leu Val
50 55 60
Gly Gly His Glu Gly Ala Gly Val Val Val Gly Met Gly Glu Asn Val
65 70 75 80
Lys Gly Trp Lys Ile Gly Asp Tyr Ala Gly Ile Lys Trp Leu Asn Gly
85 90 95
Ser Cys Met Ala Cys Glu Tyr Cys Glu Leu Gly Asn Glu Ser Asn Cys
100 105 110
Pro His Ala Asp Leu Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Glu
115 120 125
Tyr Ala Thr Ala Asp Ala Val Gln Ala Ala His Ile Pro Gln Gly Thr
130 135 140
Asp Leu Ala Glu Val Ala Pro Ile Leu Cys Ala Gly Ile Thr Val Tyr
145 150 155 160
Lys Ala Leu Lys Ser Ala Asn Leu Arg Ala Gly His Trp Ala Ala Ile
165 170 175
Ser Gly Ala Ala Gly Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Lys
180 185 190
Ala Met Gly Tyr Arg Val Leu Gly Ile Asp Gly Gly Pro Gly Lys Glu
195 200 205
Glu Leu Phe Thr Ser Leu Gly Gly Glu Val Phe Ile Asp Phe Thr Lys
210 215 220
Glu Lys Asp Ile Val Ser Ala Val Val Lys Ala Thr Asn Gly Gly Ala
225 230 235 240
His Gly Ile Ile Asn Val Ser Val Ser Glu Ala Ala Ile Glu Ala Ser
245 250 255
Thr Arg Tyr Cys Arg Ala Asn Gly Thr Val Val Leu Val Gly Leu Pro
260 265 270
Ala Gly Ala Lys Cys Ser Ser Asp Val Phe Asn His Val Val Lys Ser
275 280 285
Ile Ser Ile Val Gly Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu
290 295 300
Ala Leu Asp Phe Phe Ala Arg Gly Leu Val Lys Ser Pro Ile Lys Val
305 310 315 320
Val Gly Leu Ser Ser Leu Pro Glu Ile Tyr Glu Lys Met Glu Lys Gly
325 330 335
Gln Ile Ala Gly Arg Tyr Val Val Asp Thr Ser Lys
340 345
<210> 18
<211> 375
<212> PRT
<213> Saccharomyces cerevisiae
<400> 18
Met Leu Arg Thr Ser Thr Leu Phe Thr Arg Arg Val Gln Pro Ser Leu
1 5 10 15
Phe Ser Arg Asn Ile Leu Arg Leu Gln Ser Thr Ala Ala Ile Pro Lys
20 25 30
Thr Gln Lys Gly Val Ile Phe Tyr Glu Asn Lys Gly Lys Leu His Tyr
35 40 45
Lys Asp Ile Pro Val Pro Glu Pro Lys Pro Asn Glu Ile Leu Ile Asn
50 55 60
Val Lys Tyr Ser Gly Val Cys His Thr Asp Leu His Ala Trp His Gly
65 70 75 80
Asp Trp Pro Leu Pro Val Lys Leu Pro Leu Val Gly Gly His Glu Gly
85 90 95
Ala Gly Val Val Val Lys Leu Gly Ser Asn Val Lys Gly Trp Lys Val
100 105 110
Gly Asp Leu Ala Gly Ile Lys Trp Leu Asn Gly Ser Cys Met Thr Cys
115 120 125
Glu Phe Cys Glu Ser Gly His Glu Ser Asn Cys Pro Asp Ala Asp Leu
130 135 140
Ser Gly Tyr Thr His Asp Gly Ser Phe Gln Gln Phe Ala Thr Ala Asp
145 150 155 160
Ala Ile Gln Ala Ala Lys Ile Gln Gln Gly Thr Asp Leu Ala Glu Val
165 170 175
Ala Pro Ile Leu Cys Ala Gly Val Thr Val Tyr Lys Ala Leu Lys Glu
180 185 190
Ala Asp Leu Lys Ala Gly Asp Trp Val Ala Ile Ser Gly Ala Ala Gly
195 200 205
Gly Leu Gly Ser Leu Ala Val Gln Tyr Ala Thr Ala Met Gly Tyr Arg
210 215 220
Val Leu Gly Ile Asp Ala Gly Glu Glu Lys Glu Lys Leu Phe Lys Lys
225 230 235 240
Leu Gly Gly Glu Val Phe Ile Asp Phe Thr Lys Thr Lys Asn Met Val
245 250 255
Ser Asp Ile Gln Glu Ala Thr Lys Gly Gly Pro His Gly Val Ile Asn
260 265 270
Val Ser Val Ser Glu Ala Ala Ile Ser Leu Ser Thr Glu Tyr Val Arg
275 280 285
Pro Cys Gly Thr Val Val Leu Val Gly Leu Pro Ala Asn Ala Tyr Val
290 295 300
Lys Ser Glu Val Phe Ser His Val Val Lys Ser Ile Asn Ile Lys Gly
305 310 315 320
Ser Tyr Val Gly Asn Arg Ala Asp Thr Arg Glu Ala Leu Asp Phe Phe
325 330 335
Ser Arg Gly Leu Ile Lys Ser Pro Ile Lys Ile Val Gly Leu Ser Glu
340 345 350
Leu Pro Lys Val Tyr Asp Leu Met Glu Lys Gly Lys Ile Leu Gly Arg
355 360 365
Tyr Val Val Asp Thr Ser Lys
370 375
<210> 19
<211> 382
<212> PRT
<213> Saccharomyces cerevisiae
<400> 19
Met Ser Ser Val Thr Gly Phe Tyr Ile Pro Pro Ile Ser Phe Phe Gly
1 5 10 15
Glu Gly Ala Leu Glu Glu Thr Ala Asp Tyr Ile Lys Asn Lys Asp Tyr
20 25 30
Lys Lys Ala Leu Ile Val Thr Asp Pro Gly Ile Ala Ala Ile Gly Leu
35 40 45
Ser Gly Arg Val Gln Lys Met Leu Glu Glu Arg Asp Leu Asn Val Ala
50 55 60
Ile Tyr Asp Lys Thr Gln Pro Asn Pro Asn Ile Ala Asn Val Thr Ala
65 70 75 80
Gly Leu Lys Val Leu Lys Glu Gln Asn Ser Glu Ile Val Val Ser Ile
85 90 95
Gly Gly Gly Ser Ala His Asp Asn Ala Lys Ala Ile Ala Leu Leu Ala
100 105 110
Thr Asn Gly Gly Glu Ile Gly Asp Tyr Glu Gly Val Asn Gln Ser Lys
115 120 125
Lys Ala Ala Leu Pro Leu Phe Ala Ile Asn Thr Thr Ala Gly Thr Ala
130 135 140
Ser Glu Met Thr Arg Phe Thr Ile Ile Ser Asn Glu Glu Lys Lys Ile
145 150 155 160
Lys Met Ala Ile Ile Asp Asn Asn Val Thr Pro Ala Val Ala Val Asn
165 170 175
Asp Pro Ser Thr Met Phe Gly Leu Pro Pro Ala Leu Thr Ala Ala Thr
180 185 190
Gly Leu Asp Ala Leu Thr His Cys Ile Glu Ala Tyr Val Ser Thr Ala
195 200 205
Ser Asn Pro Ile Thr Asp Ala Cys Ala Leu Lys Gly Ile Asp Leu Ile
210 215 220
Asn Glu Ser Leu Val Ala Ala Tyr Lys Asp Gly Lys Asp Lys Lys Ala
225 230 235 240
Arg Thr Asp Met Cys Tyr Ala Glu Tyr Leu Ala Gly Met Ala Phe Asn
245 250 255
Asn Ala Ser Leu Gly Tyr Val His Ala Leu Ala His Gln Leu Gly Gly
260 265 270
Phe Tyr His Leu Pro His Gly Val Cys Asn Ala Val Leu Leu Pro His
275 280 285
Val Gln Glu Ala Asn Met Gln Cys Pro Lys Ala Lys Lys Arg Leu Gly
290 295 300
Glu Ile Ala Leu His Phe Gly Ala Ser Gln Glu Asp Pro Glu Glu Thr
305 310 315 320
Ile Lys Ala Leu His Val Leu Asn Arg Thr Met Asn Ile Pro Arg Asn
325 330 335
Leu Lys Glu Leu Gly Val Lys Thr Glu Asp Phe Glu Ile Leu Ala Glu
340 345 350
His Ala Met His Asp Ala Cys His Leu Thr Asn Pro Val Gln Phe Thr
355 360 365
Lys Glu Gln Val Val Ala Ile Ile Lys Lys Ala Tyr Glu Tyr
370 375 380
<210> 20
<211> 331
<212> PRT
<213> Bacillus subtilis
<400> 20
Met Glu Tyr Thr Ser Ile Ala Asp Thr Gly Ile Glu Ala Ser Arg Ile
1 5 10 15
Gly Leu Gly Thr Trp Ala Ile Gly Gly Thr Met Trp Gly Gly Thr Asp
20 25 30
Glu Lys Thr Ser Ile Glu Thr Ile Arg Ala Ala Leu Asp Gln Gly Ile
35 40 45
Thr Leu Ile Asp Thr Ala Pro Ala Tyr Gly Phe Gly Gln Ser Glu Glu
50 55 60
Ile Val Gly Lys Ala Ile Lys Glu Tyr Gly Lys Arg Asp Gln Val Ile
65 70 75 80
Leu Ala Thr Lys Thr Ala Leu Asp Trp Lys Asn Asn Gln Leu Phe Arg
85 90 95
His Ala Asn Arg Ala Arg Ile Val Glu Glu Val Glu Asn Ser Leu Lys
100 105 110
Arg Leu Gln Thr Asp Tyr Ile Asp Leu Tyr Gln Val His Trp Pro Asp
115 120 125
Pro Leu Val Pro Ile Glu Glu Thr Ala Glu Val Met Lys Glu Leu Tyr
130 135 140
Asp Ala Gly Lys Ile Arg Ala Ile Gly Val Ser Asn Phe Ser Ile Glu
145 150 155 160
Gln Met Asp Thr Phe Arg Ala Val Ala Pro Leu His Thr Ile Gln Pro
165 170 175
Pro Tyr Asn Leu Phe Glu Arg Glu Met Glu Glu Ser Val Leu Pro Tyr
180 185 190
Ala Lys Asp Asn Lys Ile Thr Thr Leu Leu Tyr Gly Ser Leu Cys Arg
195 200 205
Gly Leu Leu Thr Gly Lys Met Thr Glu Glu Tyr Thr Phe Glu Gly Asp
210 215 220
Asp Leu Arg Asn His Asp Pro Lys Phe Gln Lys Pro Arg Phe Lys Glu
225 230 235 240
Tyr Leu Ser Ala Val Asn Gln Leu Asp Lys Leu Ala Lys Thr Arg Tyr
245 250 255
Gly Lys Ser Val Ile His Leu Ala Val Arg Trp Ile Leu Asp Gln Pro
260 265 270
Gly Ala Asp Ile Ala Leu Trp Gly Ala Arg Lys Pro Gly Gln Leu Glu
275 280 285
Ala Leu Ser Glu Ile Thr Gly Trp Thr Leu Asn Ser Glu Asp Gln Lys
290 295 300
Asp Ile Asn Thr Ile Leu Glu Asn Thr Ile Ser Asp Pro Val Gly Pro
305 310 315 320
Glu Phe Met Ala Pro Pro Thr Arg Glu Glu Ile
325 330
<210> 21
<211> 332
<212> PRT
<213> Gluconobacter oxydans
<400> 21
Met Ala Ser Asp Thr Ile Arg Ile Pro Gly Ile Asp Thr Pro Leu Ser
1 5 10 15
Arg Val Ala Leu Gly Thr Trp Ala Ile Gly Gly Trp Met Trp Gly Gly
20 25 30
Pro Asp Asp Asp Asn Gly Val Arg Thr Ile His Ala Ala Leu Asp Glu
35 40 45
Gly Ile Asn Leu Ile Asp Thr Ala Pro Val Tyr Gly Phe Gly His Ser
50 55 60
Glu Glu Ile Val Gly Arg Ala Leu Ala Glu Lys Pro Asn Lys Ala His
65 70 75 80
Val Ala Thr Lys Leu Gly Leu His Trp Val Gly Glu Asp Glu Lys Asn
85 90 95
Met Lys Val Phe Arg Asp Ser Arg Pro Ala Arg Ile Arg Lys Glu Val
100 105 110
Glu Asp Ser Leu Arg Arg Leu Arg Val Glu Thr Ile Asp Leu Glu Gln
115 120 125
Ile His Trp Pro Asp Asp Lys Thr Pro Ile Asp Glu Ser Ala Arg Glu
130 135 140
Leu Gln Lys Leu His Gln Asp Gly Lys Ile Arg Ala Leu Gly Val Ser
145 150 155 160
Asn Phe Ser Pro Glu Gln Met Asp Ile Phe Arg Glu Val Ala Pro Leu
165 170 175
Ala Thr Ile Gln Pro Pro Leu Asn Leu Phe Glu Arg Thr Ile Glu Lys
180 185 190
Asp Ile Leu Pro Tyr Ala Glu Lys His Asn Ala Val Val Leu Ala Tyr
195 200 205
Gly Ala Leu Cys Arg Gly Leu Leu Thr Gly Lys Met Asn Arg Asp Thr
210 215 220
Thr Phe Pro Lys Asp Asp Leu Arg Ser Asn Asp Pro Lys Phe Gln Lys
225 230 235 240
Pro Asn Phe Glu Lys Tyr Leu Ala Ala Met Asp Glu Phe Glu Lys Leu
245 250 255
Ala Glu Lys Arg Gly Lys Ser Val Met Ala Phe Ala Val Arg Trp Val
260 265 270
Leu Asp Gln Gly Pro Val Ile Ala Leu Trp Gly Ala Arg Lys Pro Gly
275 280 285
Gln Val Ser Gly Val Lys Asp Val Phe Gly Trp Ser Leu Thr Asp Glu
290 295 300
Glu Lys Lys Ala Val Asp Asp Ile Leu Ala Arg His Val Pro Asn Pro
305 310 315 320
Ile Asp Pro Thr Phe Met Ala Pro Pro Ala Arg Asp
325 330
<210> 22
<211> 339
<212> PRT
<213> Escherichia coli
<400> 22
Met Ser Met Ile Lys Ser Tyr Ala Ala Lys Glu Ala Gly Gly Glu Leu
1 5 10 15
Glu Val Tyr Glu Tyr Asp Pro Gly Glu Leu Arg Pro Gln Asp Val Glu
20 25 30
Val Gln Val Asp Tyr Cys Gly Ile Cys His Ser Asp Leu Ser Met Ile
35 40 45
Asp Asn Glu Trp Gly Phe Ser Gln Tyr Pro Leu Val Ala Gly His Glu
50 55 60
Val Ile Gly Arg Val Val Ala Leu Gly Ser Ala Ala Gln Asp Lys Gly
65 70 75 80
Leu Gln Val Gly Gln Arg Val Gly Ile Gly Trp Thr Ala Arg Ser Cys
85 90 95
Gly His Cys Asp Ala Cys Ile Ser Gly Asn Gln Ile Asn Cys Glu Gln
100 105 110
Gly Ala Val Pro Thr Ile Met Asn Arg Gly Gly Phe Ala Glu Lys Leu
115 120 125
Arg Ala Asp Trp Gln Trp Val Ile Pro Leu Pro Glu Asn Ile Asp Ile
130 135 140
Glu Ser Ala Gly Pro Leu Leu Cys Gly Gly Ile Thr Val Phe Lys Pro
145 150 155 160
Leu Leu Met His His Ile Thr Ala Thr Ser Arg Val Gly Val Ile Gly
165 170 175
Ile Gly Gly Leu Gly His Ile Ala Ile Lys Leu Leu His Ala Met Gly
180 185 190
Cys Glu Val Thr Ala Phe Ser Ser Asn Pro Ala Lys Glu Gln Glu Val
195 200 205
Leu Ala Met Gly Ala Asp Lys Val Val Asn Ser Arg Asp Pro Gln Ala
210 215 220
Leu Lys Ala Leu Ala Gly Gln Phe Asp Leu Ile Ile Asn Thr Val Asn
225 230 235 240
Val Ser Leu Asp Trp Gln Pro Tyr Phe Glu Ala Leu Thr Tyr Gly Gly
245 250 255
Asn Phe His Thr Val Gly Ala Val Leu Thr Pro Leu Ser Val Pro Ala
260 265 270
Phe Thr Leu Ile Ala Gly Asp Arg Ser Val Ser Gly Ser Ala Thr Gly
275 280 285
Thr Pro Tyr Glu Leu Arg Lys Leu Met Arg Phe Ala Ala Arg Ser Lys
290 295 300
Val Ala Pro Thr Thr Glu Leu Phe Pro Met Ser Lys Ile Asn Asp Ala
305 310 315 320
Ile Gln His Val Arg Asp Gly Lys Ala Arg Tyr Arg Val Val Leu Lys
325 330 335
Ala Asp Phe
<210> 23
<211> 315
<212> PRT
<213> Arabidopsis thaliana
<400> 23
Met Ala Asn Ala Ile Thr Phe Phe Lys Leu Asn Thr Gly Ala Lys Phe
1 5 10 15
Pro Ser Val Gly Leu Gly Thr Trp Gln Ala Ser Pro Gly Leu Val Gly
20 25 30
Asp Ala Val Ala Ala Ala Val Lys Ile Gly Tyr Arg His Ile Asp Cys
35 40 45
Ala Gln Ile Tyr Gly Asn Glu Lys Glu Ile Gly Ala Val Leu Lys Lys
50 55 60
Leu Phe Glu Asp Arg Val Val Lys Arg Glu Asp Leu Phe Ile Thr Ser
65 70 75 80
Lys Leu Trp Cys Thr Asp His Asp Pro Gln Asp Val Pro Glu Ala Leu
85 90 95
Asn Arg Thr Leu Lys Asp Leu Gln Leu Glu Tyr Val Asp Leu Tyr Leu
100 105 110
Ile His Trp Pro Ala Arg Ile Lys Lys Gly Ser Val Gly Ile Lys Pro
115 120 125
Glu Asn Leu Leu Pro Val Asp Ile Pro Ser Thr Trp Lys Ala Met Glu
130 135 140
Ala Leu Tyr Asp Ser Gly Lys Ala Arg Ala Ile Gly Val Ser Asn Phe
145 150 155 160
Ser Thr Lys Lys Leu Ala Asp Leu Leu Glu Leu Ala Arg Val Pro Pro
165 170 175
Ala Val Asn Gln Val Glu Cys His Pro Ser Trp Arg Gln Thr Lys Leu
180 185 190
Gln Glu Phe Cys Lys Ser Lys Gly Val His Leu Ser Ala Tyr Ser Pro
195 200 205
Leu Gly Ser Pro Gly Thr Thr Trp Leu Lys Ser Asp Val Leu Lys Asn
210 215 220
Pro Ile Leu Asn Met Val Ala Glu Lys Leu Gly Lys Ser Pro Ala Gln
225 230 235 240
Val Ala Leu Arg Trp Gly Leu Gln Met Gly His Ser Val Leu Pro Lys
245 250 255
Ser Thr Asn Glu Gly Arg Ile Lys Glu Asn Phe Asn Val Phe Asp Trp
260 265 270
Ser Ile Pro Asp Tyr Met Phe Ala Lys Phe Ala Glu Ile Glu Gln Ala
275 280 285
Arg Leu Val Thr Gly Ser Phe Leu Val His Glu Thr Leu Ser Pro Tyr
290 295 300
Lys Ser Ile Glu Glu Leu Trp Asp Gly Glu Ile
305 310 315
<210> 24
<211> 284
<212> PRT
<213> Leishmania donovani
<400> 24
Met Ala Asp Val Gly Lys Ala Met Val Thr Leu Ser Asn Gly Val Gln
1 5 10 15
Met Pro Gln Leu Gly Leu Gly Val Trp Gln Ser Pro Ala Gly Glu Val
20 25 30
Thr Ala Asn Ala Val Lys Trp Ala Leu Cys Ala Gly Tyr Arg His Ile
35 40 45
Asp Thr Ala Ala Ile Tyr Lys Asn Glu Glu Ser Val Gly Ala Gly Leu
50 55 60
Arg Ala Ser Gly Val Pro Arg Glu Asp Val Phe Ile Thr Thr Lys Leu
65 70 75 80
Trp Asn Thr Glu Gln Gly Tyr Glu Ser Thr Leu Ala Ala Phe Glu Glu
85 90 95
Ser Arg Gln Lys Leu Gly Val Asp Tyr Ile Asp Leu Tyr Leu Ile His
100 105 110
Trp Pro Arg Gly Lys Asp Ile Val Ser Lys Glu Gly Lys Lys Tyr Leu
115 120 125
Asp Ser Trp Arg Ala Phe Glu Gln Leu Tyr Lys Asp Lys Lys Val Arg
130 135 140
Ala Ile Gly Val Ser Asn Phe His Ile His His Leu Glu Asp Val Leu
145 150 155 160
Ala Met Cys Thr Val Thr Pro Met Val Asn Gln Val Glu Leu His Pro
165 170 175
Leu Asn Asn Gln Ala Glu Leu Arg Ala Phe Cys Asp Ala Lys Gln Ile
180 185 190
Lys Val Glu Ala Trp Ser Pro Leu Gly Gln Gly Lys Leu Leu Ser Asn
195 200 205
Pro Ile Leu Ala Ala Ile Gly Ala Lys Tyr Asn Lys Thr Ala Ala Gln
210 215 220
Val Ile Leu Arg Trp Asn Ile Gln Lys Asn Leu Ile Thr Ile Pro Lys
225 230 235 240
Ser Val His Lys Glu Arg Ile Glu Glu Asn Ala Asp Val Phe Asn Phe
245 250 255
Glu Leu Asp Ala Glu Asp Val Met Ser Ile Asp Ala Leu Asn Thr Asn
260 265 270
Ser Arg Tyr Gly Pro Asp Pro Asp Glu Ala Gln Phe
275 280
<210> 25
<211> 349
<212> PRT
<213> Escherichia coli
<400> 25
Met Lys Ile Lys Ala Val Gly Ala Tyr Ser Ala Lys Gln Pro Leu Glu
1 5 10 15
Pro Met Asp Ile Thr Arg Arg Glu Pro Gly Pro Asn Asp Val Lys Ile
20 25 30
Glu Ile Ala Tyr Cys Gly Val Cys His Ser Asp Leu His Gln Val Arg
35 40 45
Ser Glu Trp Ala Gly Thr Val Tyr Pro Cys Val Pro Gly His Glu Ile
50 55 60
Val Gly Arg Val Val Ala Val Gly Asp Gln Val Glu Lys Tyr Ala Pro
65 70 75 80
Gly Asp Leu Val Gly Val Gly Cys Ile Val Asp Ser Cys Lys His Cys
85 90 95
Glu Glu Cys Glu Asp Gly Leu Glu Asn Tyr Cys Asp His Met Thr Gly
100 105 110
Thr Tyr Asn Ser Pro Thr Pro Asp Glu Pro Gly His Thr Leu Gly Gly
115 120 125
Tyr Ser Gln Gln Ile Val Val His Glu Arg Tyr Val Leu Arg Ile Arg
130 135 140
His Pro Gln Glu Gln Leu Ala Ala Val Ala Pro Leu Leu Cys Ala Gly
145 150 155 160
Ile Thr Thr Tyr Ser Pro Leu Arg His Trp Gln Ala Gly Pro Gly Lys
165 170 175
Lys Val Gly Val Val Gly Ile Gly Gly Leu Gly His Met Gly Ile Lys
180 185 190
Leu Ala His Ala Met Gly Ala His Val Val Ala Phe Thr Thr Ser Glu
195 200 205
Ala Lys Arg Glu Ala Ala Lys Ala Leu Gly Ala Asp Glu Val Val Asn
210 215 220
Ser Arg Asn Ala Asp Glu Met Ala Ala His Leu Lys Ser Phe Asp Phe
225 230 235 240
Ile Leu Asn Thr Val Ala Ala Pro His Asn Leu Asp Asp Phe Thr Thr
245 250 255
Leu Leu Lys Arg Asp Gly Thr Met Thr Leu Val Gly Ala Pro Ala Thr
260 265 270
Pro His Lys Ser Pro Glu Val Phe Asn Leu Ile Met Lys Arg Arg Ala
275 280 285
Ile Ala Gly Ser Met Ile Gly Gly Ile Pro Glu Thr Gln Glu Met Leu
290 295 300
Asp Phe Cys Ala Glu His Gly Ile Val Ala Asp Ile Glu Met Ile Arg
305 310 315 320
Ala Asp Gln Ile Asn Glu Ala Tyr Glu Arg Met Leu Arg Gly Asp Val
325 330 335
Lys Tyr Arg Phe Val Ile Asp Asn Arg Thr Leu Thr Asp
340 345
<210> 26
<211> 346
<212> PRT
<213> Escherichia coli
<400> 26
Met Gln Tyr His Arg Ile Pro His Ser Ser Leu Glu Val Ser Thr Leu
1 5 10 15
Gly Leu Gly Thr Met Thr Phe Gly Glu Gln Asn Ser Glu Ala Asp Ala
20 25 30
His Ala Gln Leu Asp Tyr Ala Val Ala Gln Gly Ile Asn Leu Ile Asp
35 40 45
Val Ala Glu Met Tyr Pro Val Pro Pro Arg Pro Glu Thr Gln Gly Leu
50 55 60
Thr Glu Thr Tyr Val Gly Asn Trp Leu Ala Lys His Gly Ser Arg Glu
65 70 75 80
Lys Leu Ile Ile Ala Ser Lys Val Ser Gly Pro Ser Arg Asn Asn Asp
85 90 95
Lys Gly Ile Arg Pro Asp Gln Ala Leu Asp Arg Lys Asn Ile Arg Glu
100 105 110
Ala Leu His Asp Ser Leu Lys Arg Leu Gln Thr Asp Tyr Leu Asp Leu
115 120 125
Tyr Gln Val His Trp Pro Gln Arg Pro Thr Asn Cys Phe Gly Lys Leu
130 135 140
Gly Tyr Ser Trp Thr Asp Ser Ala Pro Ala Val Ser Leu Leu Asp Thr
145 150 155 160
Leu Asp Ala Leu Ala Glu Tyr Gln Arg Ala Gly Lys Ile Arg Tyr Ile
165 170 175
Gly Val Ser Asn Glu Thr Ala Phe Gly Val Met Arg Tyr Leu His Leu
180 185 190
Ala Asp Lys His Asp Leu Pro Arg Ile Val Thr Ile Gln Asn Pro Tyr
195 200 205
Ser Leu Leu Asn Arg Ser Phe Glu Val Gly Leu Ala Glu Val Ser Gln
210 215 220
Tyr Glu Gly Val Glu Leu Leu Ala Tyr Ser Cys Leu Gly Phe Gly Thr
225 230 235 240
Leu Thr Gly Lys Tyr Leu Asn Gly Ala Lys Pro Ala Gly Ala Arg Asn
245 250 255
Thr Leu Phe Ser Arg Phe Thr Arg Tyr Ser Gly Glu Gln Thr Gln Lys
260 265 270
Ala Val Ala Ala Tyr Val Asp Ile Ala Arg Arg His Gly Leu Asp Pro
275 280 285
Ala Gln Met Ala Leu Ala Phe Val Arg Arg Gln Pro Phe Val Ala Ser
290 295 300
Thr Leu Leu Gly Ala Thr Thr Met Asp Gln Leu Lys Thr Asn Ile Glu
305 310 315 320
Ser Leu His Leu Glu Leu Ser Glu Asp Val Leu Ala Glu Ile Glu Ala
325 330 335
Val His Gln Val Tyr Thr Tyr Pro Ala Pro
340 345
<210> 27
<211> 399
<212> PRT
<213> Thermoanaerobacter ethanolicus
<400> 27
Met Trp Glu Thr Lys Ile Asn Pro Asn Lys Val Phe Glu Leu Arg Cys
1 5 10 15
Lys Asn Thr Thr Tyr Phe Gly Ile Gly Ser Ile Lys Lys Ile Lys Asp
20 25 30
Ile Leu Glu Val Leu Lys Asn Lys Gly Ile Asn Asn Val Ile Leu Val
35 40 45
Thr Gly Lys Gly Ser Tyr Lys Ala Ser Gly Ala Trp Asp Val Val Lys
50 55 60
Pro Ala Leu Glu Thr Leu Gly Phe Lys Tyr Ser Leu Tyr Asp Lys Val
65 70 75 80
Gly Pro Asn Pro Thr Val Asp Met Ile Asp Glu Ala Ala Lys Ile Gly
85 90 95
Arg Glu Thr Gly Ala Lys Ala Val Ile Gly Ile Gly Gly Gly Ser Pro
100 105 110
Ile Asp Thr Ala Lys Ser Val Ala Val Leu Leu Glu Tyr Thr Asp Lys
115 120 125
Asn Ala Arg Glu Leu Tyr Glu Gln Lys Phe Ile Pro Glu Lys Ala Ala
130 135 140
Pro Ile Ile Ala Ile Asn Leu Thr His Gly Thr Gly Thr Glu Val Asp
145 150 155 160
Arg Phe Ala Val Ala Thr Ile Pro Glu Lys Asn Tyr Lys Pro Ala Ile
165 170 175
Ala Tyr Asp Cys Leu Tyr Pro Met Tyr Ala Ile Asp Asp Pro Ser Leu
180 185 190
Met Thr Lys Leu Asp Lys Lys Gln Thr Ile Ala Val Thr Ile Asp Ala
195 200 205
Leu Asn His Val Thr Glu Ala Ala Thr Thr Leu Val Ala Ser Pro Tyr
210 215 220
Ser Val Leu Met Ala Lys Glu Thr Val Arg Leu Ile Val Arg Tyr Leu
225 230 235 240
Pro Ala Ala Val Asn Asp Pro Glu Asn Leu Val Ala Arg Tyr Tyr Leu
245 250 255
Leu Tyr Ala Ser Ala Leu Ala Gly Ile Ser Phe Asp Asn Gly Leu Leu
260 265 270
His Leu Thr His Ala Leu Glu His Pro Leu Ser Ala Val Lys Pro Glu
275 280 285
Ile Ala His Gly Leu Gly Leu Gly Ala Ile Leu Pro Ala Val Val Lys
290 295 300
Ala Ile Tyr Pro Ser Val Ala Glu Val Leu Ala Glu Val Tyr Ser Pro
305 310 315 320
Ile Val Pro Gly Leu Lys Gly Leu Pro Ala Glu Ala Glu Tyr Val Ala
325 330 335
Lys Lys Val Glu Glu Trp Leu Phe Lys Val Gly Cys Thr Gln Lys Leu
340 345 350
Ser Asp Phe Gly Phe Thr Lys Glu Asp Ile Pro Thr Leu Val Arg Leu
355 360 365
Ala Lys Thr Thr Pro Ser Leu Asp Gly Leu Leu Ser Asn Ala Pro Val
370 375 380
Glu Ala Thr Glu Ala Val Ile Ala Lys Ile Tyr Glu Glu Ser Phe
385 390 395
<210> 28
<211> 346
<212> PRT
<213> Bacillus subtilis
<400> 28
Met Lys Ala Ala Arg Trp His Asn Gln Lys Asp Ile Arg Ile Glu His
1 5 10 15
Ile Glu Glu Pro Lys Thr Glu Pro Gly Lys Val Lys Ile Lys Val Lys
20 25 30
Trp Cys Gly Ile Cys Gly Ser Asp Leu His Glu Tyr Leu Gly Gly Pro
35 40 45
Ile Phe Ile Pro Val Asp Lys Pro His Pro Leu Thr Asn Glu Thr Ala
50 55 60
Pro Val Thr Met Gly His Glu Phe Ser Gly Glu Val Val Glu Val Gly
65 70 75 80
Glu Gly Val Glu Asn Tyr Lys Val Gly Asp Arg Val Val Val Glu Pro
85 90 95
Ile Phe Ala Thr His Gly His Gln Gly Ala Tyr Asn Leu Asp Glu Gln
100 105 110
Met Gly Phe Leu Gly Leu Ala Gly Gly Gly Gly Gly Phe Ser Glu Tyr
115 120 125
Val Ser Val Asp Glu Glu Leu Leu Phe Lys Leu Pro Asp Glu Leu Ser
130 135 140
Tyr Glu Gln Gly Ala Leu Val Glu Pro Ser Ala Val Ala Leu Tyr Ala
145 150 155 160
Val Arg Ser Ser Lys Leu Lys Ala Gly Asp Lys Ala Ala Val Phe Gly
165 170 175
Cys Gly Pro Ile Gly Leu Leu Val Ile Glu Ala Leu Lys Ala Ala Gly
180 185 190
Ala Thr Asp Ile Tyr Ala Val Glu Leu Ser Pro Glu Arg Gln Gln Lys
195 200 205
Ala Glu Glu Leu Gly Ala Ile Ile Val Asp Pro Ser Lys Thr Asp Asp
210 215 220
Val Val Ala Glu Ile Ala Glu Arg Thr Gly Gly Gly Val Asp Val Ala
225 230 235 240
Phe Glu Val Thr Gly Val Pro Val Val Leu Arg Gln Ala Ile Gln Ser
245 250 255
Thr Thr Ile Ala Gly Glu Thr Val Ile Val Ser Ile Trp Glu Lys Gly
260 265 270
Ala Glu Ile His Pro Asn Asp Ile Val Ile Lys Glu Arg Thr Val Lys
275 280 285
Gly Ile Ile Gly Tyr Arg Asp Ile Phe Pro Ala Val Leu Ser Leu Met
290 295 300
Lys Glu Gly Tyr Phe Ser Ala Asp Lys Leu Val Thr Lys Lys Ile Val
305 310 315 320
Leu Asp Asp Leu Ile Glu Glu Gly Phe Gly Ala Leu Ile Lys Glu Lys
325 330 335
Ser Gln Val Lys Ile Leu Val Arg Pro Asn
340 345
<210> 29
<211> 382
<212> PRT
<213> Saccharomyces cerevisiae
<400> 29
Met Arg Ala Leu Ala Tyr Phe Lys Lys Gly Asp Ile His Phe Thr Asn
1 5 10 15
Asp Ile Pro Arg Pro Glu Ile Gln Thr Asp Asp Glu Val Ile Ile Asp
20 25 30
Val Ser Trp Cys Gly Ile Cys Gly Ser Asp Leu His Glu Tyr Leu Asp
35 40 45
Gly Pro Ile Phe Met Pro Lys Asp Gly Glu Cys His Lys Leu Ser Asn
50 55 60
Ala Ala Leu Pro Leu Ala Met Gly His Glu Met Ser Gly Ile Val Ser
65 70 75 80
Lys Val Gly Pro Lys Val Thr Lys Val Lys Val Gly Asp His Val Val
85 90 95
Val Asp Ala Ala Ser Ser Cys Ala Asp Leu His Cys Trp Pro His Ser
100 105 110
Lys Phe Tyr Asn Ser Lys Pro Cys Asp Ala Cys Gln Arg Gly Ser Glu
115 120 125
Asn Leu Cys Thr His Ala Gly Phe Val Gly Leu Gly Val Ile Ser Gly
130 135 140
Gly Phe Ala Glu Gln Val Val Val Ser Gln His His Ile Ile Pro Val
145 150 155 160
Pro Lys Glu Ile Pro Leu Asp Val Ala Ala Leu Val Glu Pro Leu Ser
165 170 175
Val Thr Trp His Ala Val Lys Ile Ser Gly Phe Lys Lys Gly Ser Ser
180 185 190
Ala Leu Val Leu Gly Ala Gly Pro Ile Gly Leu Cys Thr Ile Leu Val
195 200 205
Leu Lys Gly Met Gly Ala Ser Lys Ile Val Val Ser Glu Ile Ala Glu
210 215 220
Arg Arg Ile Glu Met Ala Lys Lys Leu Gly Val Glu Val Phe Asn Pro
225 230 235 240
Ser Lys His Gly His Lys Ser Ile Glu Ile Leu Arg Gly Leu Thr Lys
245 250 255
Ser His Asp Gly Phe Asp Tyr Ser Tyr Asp Cys Ser Gly Ile Gln Val
260 265 270
Thr Phe Glu Thr Ser Leu Lys Ala Leu Thr Phe Lys Gly Thr Ala Thr
275 280 285
Asn Ile Ala Val Trp Gly Pro Lys Pro Val Pro Phe Gln Pro Met Asp
290 295 300
Val Thr Leu Gln Glu Lys Val Met Thr Gly Ser Ile Gly Tyr Val Val
305 310 315 320
Glu Asp Phe Glu Glu Val Val Arg Ala Ile His Asn Gly Asp Ile Ala
325 330 335
Met Glu Asp Cys Lys Gln Leu Ile Thr Gly Lys Gln Arg Ile Glu Asp
340 345 350
Gly Trp Glu Lys Gly Phe Gln Glu Leu Met Asp His Lys Glu Ser Asn
355 360 365
Val Lys Ile Leu Leu Thr Pro Asn Asn His Gly Glu Met Lys
370 375 380
<210> 30
<211> 258
<212> PRT
<213> Corynebacterium glutamicum
<400> 30
Met Ser Lys Val Ala Met Val Thr Gly Gly Ala Gln Gly Ile Gly Arg
1 5 10 15
Gly Ile Ser Glu Lys Leu Ala Ala Asp Gly Phe Asp Ile Ala Val Ala
20 25 30
Asp Leu Pro Gln Gln Glu Glu Gln Ala Ala Glu Thr Ile Lys Leu Ile
35 40 45
Glu Ala Ala Gly Gln Lys Ala Val Phe Val Gly Leu Asp Val Thr Asp
50 55 60
Lys Ala Asn Phe Asp Ser Ala Ile Asp Glu Ala Ala Glu Lys Leu Gly
65 70 75 80
Gly Phe Asp Val Leu Val Asn Asn Ala Gly Ile Ala Gln Ile Lys Pro
85 90 95
Leu Leu Glu Val Thr Glu Glu Asp Leu Lys Gln Ile Tyr Ser Val Asn
100 105 110
Val Phe Ser Val Phe Phe Gly Ile Gln Ala Ala Ser Arg Lys Phe Asp
115 120 125
Glu Leu Gly Val Lys Gly Lys Ile Ile Asn Ala Ala Ser Ile Ala Ala
130 135 140
Ile Gln Gly Phe Pro Ile Leu Ser Ala Tyr Ser Thr Thr Lys Phe Ala
145 150 155 160
Val Arg Gly Leu Thr Gln Ala Ala Ala Gln Glu Leu Ala Pro Lys Gly
165 170 175
His Thr Val Asn Ala Tyr Ala Pro Gly Ile Val Gly Thr Gly Met Trp
180 185 190
Glu Gln Ile Asp Ala Glu Leu Ser Lys Ile Asn Gly Lys Pro Ile Gly
195 200 205
Glu Asn Phe Lys Glu Tyr Ser Ser Ser Ile Ala Leu Gly Arg Pro Ser
210 215 220
Val Pro Glu Asp Val Ala Gly Leu Val Ser Phe Leu Ala Ser Glu Asn
225 230 235 240
Ser Asn Tyr Ile Thr Gly Gln Val Met Leu Val Asp Gly Gly Met Leu
245 250 255
Tyr Asn
<210> 31
<211> 256
<212> PRT
<213> Klebsiella pneumoniae
<400> 31
Met Lys Lys Val Ala Leu Val Thr Gly Ala Gly Gln Gly Ile Gly Lys
1 5 10 15
Ala Ile Ala Leu Arg Leu Val Lys Asp Gly Phe Ala Val Ala Ile Ala
20 25 30
Asp Tyr Asn Asp Ala Thr Ala Lys Ala Val Ala Ser Glu Ile Asn Gln
35 40 45
Ala Gly Gly Arg Ala Met Ala Val Lys Val Asp Val Ser Asp Arg Asp
50 55 60
Gln Val Phe Ala Ala Val Glu Gln Ala Arg Lys Thr Leu Gly Gly Phe
65 70 75 80
Asp Val Ile Val Asn Asn Ala Gly Val Ala Pro Ser Thr Pro Ile Glu
85 90 95
Ser Ile Thr Pro Glu Ile Val Asp Lys Val Tyr Asn Ile Asn Val Lys
100 105 110
Gly Val Ile Trp Gly Ile Gln Ala Ala Val Glu Ala Phe Lys Lys Glu
115 120 125
Gly His Gly Gly Lys Ile Ile Asn Ala Cys Ser Gln Ala Gly His Val
130 135 140
Gly Asn Pro Glu Leu Ala Val Tyr Ser Ser Ser Lys Phe Ala Val Arg
145 150 155 160
Gly Leu Thr Gln Thr Ala Ala Arg Asp Leu Ala Pro Leu Gly Ile Thr
165 170 175
Val Asn Gly Tyr Cys Pro Gly Ile Val Lys Thr Pro Met Trp Ala Glu
180 185 190
Ile Asp Arg Gln Val Ser Glu Ala Ala Gly Lys Pro Leu Gly Tyr Gly
195 200 205
Thr Ala Glu Phe Ala Lys Arg Ile Thr Leu Gly Arg Leu Ser Glu Pro
210 215 220
Glu Asp Val Ala Ala Cys Val Ser Tyr Leu Ala Ser Pro Asp Ser Asp
225 230 235 240
Tyr Met Thr Gly Gln Ser Leu Leu Ile Asp Gly Gly Met Val Phe Asn
245 250 255
<210> 32
<211> 251
<212> PRT
<213> Serratia marcescens
<400> 32
Met Arg Phe Asp Asn Lys Val Val Val Ile Thr Gly Ala Gly Thr Gly
1 5 10 15
Met Gly Glu Ala Ala Ala Arg Arg Phe Ser Ala Glu Gly Ala Ile Val
20 25 30
Val Leu Ala Asp Trp Ala Lys Glu Ala Val Asp Lys Val Ala Ala Ser
35 40 45
Leu Pro Lys Gly Arg Ala Met Ala Val His Ile Asp Val Ser Asp His
50 55 60
Val Ala Val Glu Lys Met Met Asn Glu Val Ala Glu Lys Leu Gly Arg
65 70 75 80
Ile Asp Val Leu Leu Asn Asn Ala Gly Val His Val Ala Gly Ser Val
85 90 95
Leu Glu Thr Ser Val Asp Asp Trp Arg Arg Ile Ala Gly Val Asp Ile
100 105 110
Asp Gly Val Val Phe Cys Ser Lys Phe Ala Leu Pro His Leu Leu Lys
115 120 125
Thr Lys Gly Cys Ile Val Asn Thr Ala Ser Val Ser Gly Leu Gly Gly
130 135 140
Asp Trp Gly Ala Ala Tyr Tyr Cys Ala Ala Lys Gly Ala Val Val Asn
145 150 155 160
Leu Thr Arg Ala Met Ala Leu Asp His Gly Gly Asp Gly Val Arg Ile
165 170 175
Asn Ser Val Cys Pro Ser Leu Val Lys Thr Asn Met Thr Asn Gly Trp
180 185 190
Pro Gln Glu Ile Arg Asp Lys Phe Asn Glu Arg Ile Ala Leu Gly Arg
195 200 205
Ala Ala Glu Pro Glu Glu Val Ala Ala Val Met Ala Phe Leu Ala Ser
210 215 220
Asp Asp Ala Ser Phe Ile Asn Gly Ala Asn Ile Pro Val Asp Gly Gly
225 230 235 240
Ala Thr Ala Ser Asp Gly Gln Gln Asn Ile Val
245 250
<210> 33
<211> 367
<212> PRT
<213> Escherichia coli
<400> 33
Met Asp Arg Ile Ile Gln Ser Pro Gly Lys Tyr Ile Gln Gly Ala Asp
1 5 10 15
Val Ile Asn Arg Leu Gly Glu Tyr Leu Lys Pro Leu Ala Glu Arg Trp
20 25 30
Leu Val Val Gly Asp Lys Phe Val Leu Gly Phe Ala Gln Ser Thr Val
35 40 45
Glu Lys Ser Phe Lys Asp Ala Gly Leu Val Val Glu Ile Ala Pro Phe
50 55 60
Gly Gly Glu Cys Ser Gln Asn Glu Ile Asp Arg Leu Arg Gly Ile Ala
65 70 75 80
Glu Thr Ala Gln Cys Gly Ala Ile Leu Gly Ile Gly Gly Gly Lys Thr
85 90 95
Leu Asp Thr Ala Lys Ala Leu Ala His Phe Met Gly Val Pro Val Ala
100 105 110
Ile Ala Pro Thr Ile Ala Ser Thr Asp Ala Pro Cys Ser Ala Leu Ser
115 120 125
Val Ile Tyr Thr Asp Glu Gly Glu Phe Asp Arg Tyr Leu Leu Leu Pro
130 135 140
Asn Asn Pro Asn Met Val Ile Val Asp Thr Lys Ile Val Ala Gly Ala
145 150 155 160
Pro Ala Arg Leu Leu Ala Ala Gly Ile Gly Asp Ala Leu Ala Thr Trp
165 170 175
Phe Glu Ala Arg Ala Cys Ser Arg Ser Gly Ala Thr Thr Met Ala Gly
180 185 190
Gly Lys Cys Thr Gln Ala Ala Leu Ala Leu Ala Glu Leu Cys Tyr Asn
195 200 205
Thr Leu Leu Glu Glu Gly Glu Lys Ala Met Leu Ala Ala Glu Gln His
210 215 220
Val Val Thr Pro Ala Leu Glu Arg Val Ile Glu Ala Asn Thr Tyr Leu
225 230 235 240
Ser Gly Val Gly Phe Glu Ser Gly Gly Leu Ala Ala Ala His Ala Val
245 250 255
His Asn Gly Leu Thr Ala Ile Pro Asp Ala His His Tyr Tyr His Gly
260 265 270
Glu Lys Val Ala Phe Gly Thr Leu Thr Gln Leu Val Leu Glu Asn Ala
275 280 285
Pro Val Glu Glu Ile Glu Thr Val Ala Ala Leu Ser His Ala Val Gly
290 295 300
Leu Pro Ile Thr Leu Ala Gln Leu Asp Ile Lys Glu Asp Val Pro Ala
305 310 315 320
Lys Met Arg Ile Val Ala Glu Ala Ala Cys Ala Glu Gly Glu Thr Ile
325 330 335
His Asn Met Pro Gly Gly Ala Thr Pro Asp Gln Val Tyr Ala Ala Leu
340 345 350
Leu Val Ala Asp Gln Tyr Gly Gln Arg Phe Leu Gln Glu Trp Glu
355 360 365
<210> 34
<211> 258
<212> PRT
<213> Corynebacterium glutamicum
<400> 34
Met Ser Lys Val Ala Met Val Thr Gly Gly Ala Gln Gly Ile Gly Arg
1 5 10 15
Gly Ile Ser Glu Lys Leu Ala Ala Asp Gly Phe Asp Ile Ala Val Ala
20 25 30
Asp Leu Pro Gln Gln Glu Glu Gln Ala Ala Glu Thr Ile Lys Leu Ile
35 40 45
Glu Ala Ala Asp Gln Lys Ala Val Phe Val Gly Leu Asp Val Thr Asp
50 55 60
Lys Ala Asn Phe Asp Ser Ala Ile Asp Glu Ala Ala Glu Lys Leu Gly
65 70 75 80
Gly Phe Asp Val Leu Val Asn Asn Ala Gly Ile Ala Gln Ile Lys Pro
85 90 95
Leu Leu Glu Val Thr Glu Glu Asp Leu Lys Gln Ile Tyr Ser Val Asn
100 105 110
Val Phe Ser Val Phe Phe Gly Ile Gln Ala Ala Ser Arg Lys Phe Asp
115 120 125
Glu Leu Gly Val Lys Gly Lys Ile Ile Asn Ala Ala Ser Ile Ala Ala
130 135 140
Ile Gln Gly Phe Pro Ile Leu Ser Ala Tyr Ser Thr Thr Lys Phe Ala
145 150 155 160
Val Arg Gly Leu Thr Gln Ala Ala Ala Gln Glu Leu Ala Pro Lys Gly
165 170 175
His Thr Val Asn Ala Tyr Ala Pro Gly Ile Val Gly Thr Gly Met Trp
180 185 190
Glu Gln Ile Asp Ala Glu Leu Ser Lys Ile Asn Gly Lys Pro Ile Gly
195 200 205
Glu Asn Phe Lys Glu Tyr Ser Ser Ser Ile Ala Leu Gly Arg Pro Ser
210 215 220
Val Pro Glu Asp Val Ala Gly Leu Val Ser Phe Leu Ala Ser Glu Asn
225 230 235 240
Ser Asn Tyr Val Thr Gly Gln Val Met Leu Val Asp Gly Gly Met Leu
245 250 255
Tyr Asn
<210> 35
<211> 318
<212> PRT
<213> Hypocrea jecorina
<400> 35
Met Ala Ser Lys Thr Tyr Thr Leu Asn Thr Gly Ala Lys Ile Pro Ala
1 5 10 15
Val Gly Phe Gly Thr Phe Ala Asn Glu Gly Ala Lys Gly Glu Thr Tyr
20 25 30
Ala Ala Val Thr Lys Ala Leu Asp Val Gly Tyr Arg His Leu Asp Cys
35 40 45
Ala Trp Phe Tyr His Asn Glu Asp Glu Val Gly Asp Ala Val Arg Asp
50 55 60
Phe Leu Ala Arg Arg Pro Asp Val Lys Arg Glu Asp Leu Phe Ile Cys
65 70 75 80
Thr Lys Val Trp Asn His Leu His Glu Pro Glu Asp Val Lys Trp Ser
85 90 95
Ala Lys Asn Ser Cys Glu Asn Leu Lys Val Asp Tyr Ile Asp Leu Phe
100 105 110
Leu Val His Trp Pro Ile Ala Ala Glu Lys Asn Ser Asp Arg Ser Val
115 120 125
Lys Leu Gly Pro Asp Gly Lys Tyr Val Ile Asn Gln Ala Leu Thr Glu
130 135 140
Asn Pro Glu Pro Thr Trp Arg Ala Met Glu Glu Leu Val Glu Ser Gly
145 150 155 160
Leu Val Lys Ala Ile Gly Val Ser Asn Trp Thr Ile Pro Gly Leu Lys
165 170 175
Lys Leu Leu Gln Ile Ala Lys Ile Lys Pro Ala Val Asn Gln Ile Glu
180 185 190
Ile His Pro Phe Leu Pro Asn Glu Glu Leu Val Ala Phe Cys Phe Glu
195 200 205
Asn Gly Ile Leu Pro Glu Ala Tyr Ser Pro Leu Gly Ser Gln Asn Gln
210 215 220
Val Pro Ser Thr Gly Glu Arg Val Arg Asp Asn Pro Thr Leu Lys Ala
225 230 235 240
Val Ala Glu Arg Ser Gly Tyr Ser Leu Ala Gln Ile Leu Leu Ala Trp
245 250 255
Gly Leu Lys Arg Gly Tyr Val Val Leu Pro Lys Ser Ser Thr Pro Ser
260 265 270
Arg Ile Glu Ser Asn Phe Asn Ile Pro Glu Leu Ser Asp Glu Asp Phe
275 280 285
Glu Ala Ile Gln Gln Val Ala Lys Gly Arg His Thr Arg Phe Val Asn
290 295 300
Met Lys Asp Thr Phe Gly Tyr Asn Val Trp Pro Glu Glu Glu
305 310 315
<210> 36
<211> 385
<212> PRT
<213> Clostridium butyricum
<400> 36
Met Arg Met Tyr Asp Tyr Leu Val Pro Ser Val Asn Phe Met Gly Ala
1 5 10 15
Asn Ser Val Ser Val Val Gly Glu Arg Cys Lys Ile Leu Gly Gly Lys
20 25 30
Lys Ala Leu Ile Val Thr Asp Lys Phe Leu Lys Asp Met Glu Gly Gly
35 40 45
Ala Val Glu Leu Thr Val Lys Tyr Leu Lys Glu Ala Gly Leu Asp Val
50 55 60
Val Tyr Tyr Asp Gly Val Glu Pro Asn Pro Lys Asp Val Asn Val Ile
65 70 75 80
Glu Gly Leu Lys Ile Phe Lys Glu Glu Asn Cys Asp Met Ile Val Thr
85 90 95
Val Gly Gly Gly Ser Ser His Asp Cys Gly Lys Gly Ile Gly Ile Ala
100 105 110
Ala Thr His Glu Gly Asp Leu Tyr Asp Tyr Ala Gly Ile Glu Thr Leu
115 120 125
Val Asn Pro Leu Pro Pro Ile Val Ala Val Asn Thr Thr Ala Gly Thr
130 135 140
Ala Ser Glu Leu Thr Arg His Cys Val Leu Thr Asn Thr Lys Lys Lys
145 150 155 160
Ile Lys Phe Val Ile Val Ser Trp Arg Asn Leu Pro Leu Val Ser Ile
165 170 175
Asn Asp Pro Met Leu Met Val Lys Lys Pro Ala Gly Leu Thr Ala Ala
180 185 190
Thr Gly Met Asp Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ser Lys
195 200 205
Asp Ala Asn Pro Val Thr Asp Ala Ser Ala Ile Gln Ala Ile Lys Leu
210 215 220
Ile Ser Gln Asn Leu Arg Gln Ala Val Ala Leu Gly Glu Asn Leu Glu
225 230 235 240
Ala Arg Glu Asn Met Ala Tyr Ala Ser Leu Leu Ala Gly Met Ala Phe
245 250 255
Asn Asn Ala Asn Leu Gly Tyr Val His Ala Met Ala His Gln Leu Gly
260 265 270
Gly Leu Tyr Asp Met Pro His Gly Val Ala Asn Ala Met Leu Leu Pro
275 280 285
His Val Glu Arg Tyr Asn Met Leu Ser Asn Pro Lys Lys Phe Ala Asp
290 295 300
Ile Ala Glu Phe Met Gly Glu Asn Ile Ser Gly Leu Ser Val Met Glu
305 310 315 320
Ala Ala Glu Lys Ala Ile Asn Ala Met Phe Arg Leu Ser Glu Asp Val
325 330 335
Gly Ile Pro Lys Ser Leu Lys Glu Met Gly Val Lys Gln Glu Asp Phe
340 345 350
Glu His Met Ala Glu Leu Ala Leu Leu Asp Gly Asn Ala Phe Ser Asn
355 360 365
Pro Arg Lys Gly Asn Ala Lys Asp Ile Ile Asn Ile Phe Lys Ala Ala
370 375 380
Tyr
385
<210> 37
<211> 387
<212> PRT
<213> Citrobacter freundii
<400> 37
Met Ser Tyr Arg Met Phe Asp Tyr Leu Val Pro Asn Val Asn Phe Phe
1 5 10 15
Gly Pro Asn Ala Ile Ser Val Val Gly Glu Arg Cys Lys Leu Leu Gly
20 25 30
Gly Lys Lys Ala Leu Leu Val Thr Asp Lys Gly Leu Arg Ala Ile Lys
35 40 45
Asp Gly Ala Val Asp Lys Thr Leu Thr His Leu Arg Glu Ala Gly Ile
50 55 60
Asp Val Val Val Phe Asp Gly Val Glu Pro Asn Pro Lys Asp Thr Asn
65 70 75 80
Val Arg Asp Gly Leu Glu Val Phe Arg Lys Glu His Cys Asp Ile Ile
85 90 95
Val Thr Val Gly Gly Gly Ser Pro His Asp Cys Gly Lys Gly Ile Gly
100 105 110
Ile Ala Ala Thr His Glu Gly Asp Leu Tyr Ser Tyr Ala Gly Ile Glu
115 120 125
Thr Leu Thr Asn Pro Leu Pro Pro Ile Val Ala Val Asn Thr Thr Ala
130 135 140
Gly Thr Ala Ser Glu Val Thr Arg His Cys Val Leu Thr Asn Thr Lys
145 150 155 160
Thr Lys Val Lys Phe Val Ile Val Ser Trp Arg Asn Leu Pro Ser Val
165 170 175
Ser Ile Asn Asp Pro Leu Leu Met Leu Gly Lys Pro Ala Pro Leu Thr
180 185 190
Ala Ala Thr Gly Met Asp Ala Leu Thr His Ala Val Glu Ala Tyr Ile
195 200 205
Ser Lys Asp Ala Asn Pro Val Thr Asp Ala Ala Ala Ile Gln Ala Ile
210 215 220
Arg Leu Ile Ala Arg Asn Leu Arg Gln Ala Val Ala Leu Gly Ser Asn
225 230 235 240
Leu Lys Ala Arg Glu Asn Met Ala Tyr Ala Ser Leu Leu Ala Gly Met
245 250 255
Ala Phe Asn Asn Ala Asn Leu Gly Tyr Val His Ala Met Ala His Gln
260 265 270
Leu Gly Gly Leu Tyr Asp Met Pro His Gly Val Ala Asn Ala Val Leu
275 280 285
Leu Pro His Val Ala Arg Tyr Asn Leu Ile Ala Asn Pro Glu Lys Phe
290 295 300
Ala Asp Ile Ala Glu Phe Met Gly Glu Asn Thr Asp Gly Leu Ser Thr
305 310 315 320
Met Asp Ala Ala Glu Leu Ala Ile His Ala Ile Ala Arg Leu Ser Ala
325 330 335
Asp Ile Gly Ile Pro Gln His Leu Arg Asp Leu Gly Val Lys Glu Ala
340 345 350
Asp Phe Pro Tyr Met Ala Glu Met Ala Leu Lys Asp Gly Asn Ala Phe
355 360 365
Ser Asn Pro Arg Lys Gly Asn Glu Lys Glu Ile Ala Glu Ile Phe Arg
370 375 380
Gln Ala Phe
385
<210> 38
<211> 387
<212> PRT
<213> Klebsiella pneumoniae
<400> 38
Met Ser Tyr Arg Met Phe Asp Tyr Leu Val Pro Asn Val Asn Phe Phe
1 5 10 15
Gly Pro Asn Ala Ile Ser Val Val Gly Glu Arg Cys Gln Leu Leu Gly
20 25 30
Gly Lys Lys Ala Leu Leu Val Thr Asp Lys Gly Leu Arg Ala Ile Lys
35 40 45
Asp Gly Ala Val Asp Lys Thr Leu His Tyr Leu Arg Glu Ala Gly Ile
50 55 60
Glu Val Ala Ile Phe Asp Gly Val Glu Pro Asn Pro Lys Asp Thr Asn
65 70 75 80
Val Arg Asp Gly Leu Ala Val Phe Arg Arg Glu Gln Cys Asp Ile Ile
85 90 95
Val Thr Val Gly Gly Gly Ser Pro His Asp Cys Gly Lys Gly Ile Gly
100 105 110
Ile Ala Ala Thr His Glu Gly Asp Leu Tyr Gln Tyr Ala Gly Ile Glu
115 120 125
Thr Leu Thr Asn Pro Leu Pro Pro Ile Val Ala Val Asn Thr Thr Ala
130 135 140
Gly Thr Ala Ser Glu Val Thr Arg His Cys Val Leu Thr Asn Thr Glu
145 150 155 160
Thr Lys Val Lys Phe Val Ile Val Ser Trp Arg Asn Leu Pro Ser Val
165 170 175
Ser Ile Asn Asp Pro Leu Leu Met Ile Gly Lys Pro Ala Ala Leu Thr
180 185 190
Ala Ala Thr Gly Met Asp Ala Leu Thr His Ala Val Glu Ala Tyr Ile
195 200 205
Ser Lys Asp Ala Asn Pro Val Thr Asp Ala Ala Ala Met Gln Ala Ile
210 215 220
Arg Leu Ile Ala Arg Asn Leu Arg Gln Ala Val Ala Leu Gly Ser Asn
225 230 235 240
Leu Gln Ala Arg Glu Asn Met Ala Tyr Ala Ser Leu Leu Ala Gly Met
245 250 255
Ala Phe Asn Asn Ala Asn Leu Gly Tyr Val His Ala Met Ala His Gln
260 265 270
Leu Gly Gly Leu Tyr Asp Met Pro His Gly Val Ala Asn Ala Val Leu
275 280 285
Leu Pro His Val Ala Arg Tyr Asn Leu Ile Ala Asn Pro Glu Lys Phe
290 295 300
Ala Asp Ile Ala Glu Leu Met Gly Glu Asn Ile Thr Gly Leu Ser Thr
305 310 315 320
Leu Asp Ala Ala Glu Lys Ala Ile Ala Ala Ile Thr Arg Leu Ser Met
325 330 335
Asp Ile Gly Ile Pro Gln His Leu Arg Asp Leu Gly Val Lys Glu Ala
340 345 350
Asp Phe Pro Tyr Met Ala Glu Met Ala Leu Lys Asp Gly Asn Ala Phe
355 360 365
Ser Asn Pro Arg Lys Gly Asn Glu Gln Glu Ile Ala Ala Ile Phe Arg
370 375 380
Gln Ala Phe
385
<210> 39
<211> 22
<212> PRT
<213> Rhodococcus erythropolis
<400> 39
Met Lys Thr Lys Ala Ala Val Leu Phe Glu Thr His Lys Pro Phe Glu
1 5 10 15
Ile Val Glu Leu Glu Leu
20
<210> 40
<211> 371
<212> PRT
<213> Amycolatopsis methanolica
<400> 40
Met Lys Thr Lys Ala Ala Val Leu His Ser Ala Gly Lys Pro Phe Glu
1 5 10 15
Ile Glu Glu Leu Glu Leu Asp Gly Pro Arg Glu Gly Glu Val Leu Ile
20 25 30
Lys Tyr Thr Ala Ala Gly Leu Cys His Ser Asp Leu His Leu Ile Asp
35 40 45
Asn Asp Leu Val Pro Arg Phe Pro Ile Val Gly Gly His Glu Gly Ala
50 55 60
Gly Val Ile Glu Asp Val Gly Pro Gly Val Thr Lys Val Lys Pro Gly
65 70 75 80
Asp His Val Val Cys Ser Phe Ile Pro Asn Cys Gly Thr Cys Arg Tyr
85 90 95
Cys Ala Thr Gly Arg Ser Asn Leu Cys Asp Met Gly Ala Thr Ile Leu
100 105 110
Asp Gly Gly Met Pro Asp Gly Ser Phe Arg Phe His Arg Gly Gly Thr
115 120 125
Asp Tyr Gly Ala Met Cys Met Leu Gly Thr Phe Ser Glu Arg Ala Thr
130 135 140
Ile Ser Gln His Ser Val Val Lys Val Asp Asp Trp Leu Pro Leu Glu
145 150 155 160
Thr Ala Val Leu Val Gly Cys Gly Val Pro Thr Gly Trp Ala Ser Ala
165 170 175
Asn Tyr Ala Gly Gly Val Arg Ala Gly Asp Thr Cys Val Val Tyr Gly
180 185 190
Ile Gly Gly Ile Gly Ile Asn Ala Val Gln Gly Ala Ala His Ala Gly
195 200 205
Ala Ala Asn Val Ile Ala Val Asp Pro Val Ala Phe Lys Arg Glu Lys
210 215 220
Ala Leu Glu Leu Gly Ala Thr His Ala Phe Ala Ser Ala Asp Glu Ala
225 230 235 240
Ala Ala Lys Val Ala Glu Leu Thr Trp Gly Gln Met Ala Asp Gln Ala
245 250 255
Leu Ile Thr Val Gly Thr Val Val Glu Gln Val Val Thr Asp Ala Phe
260 265 270
Asn Val Ile Gly Lys Gly Gly Thr Val Val Ile Thr Gly Leu Ala Asn
275 280 285
Pro Glu Lys Leu Thr Val His Leu Ser Gly Gly Val Met Thr Leu Phe
290 295 300
Glu Lys Thr Val Lys Gly Thr Leu Phe Gly Ser Ala Asn Pro Gln Tyr
305 310 315 320
Asp Ile Val Arg Leu Leu Arg Leu Tyr Gln Ala Gly His Val Lys Leu
325 330 335
Asp Glu Leu Val Thr Lys Arg Tyr Ser Leu Glu Glu Val Asn Glu Gly
340 345 350
Tyr Gln Asp Leu Arg Asp Gly Lys Asn Ile Arg Gly Val Ile Met His
355 360 365
Ser Ala Asp
370
<210> 41
<211> 382
<212> PRT
<213> Escherichia coli
<400> 41
Met Ala Asn Arg Met Ile Leu Asn Glu Thr Ala Trp Phe Gly Arg Gly
1 5 10 15
Ala Val Gly Ala Leu Thr Asp Glu Val Lys Arg Arg Gly Tyr Gln Lys
20 25 30
Ala Leu Ile Val Thr Asp Lys Thr Leu Val Gln Cys Gly Val Val Ala
35 40 45
Lys Val Thr Asp Lys Met Asp Ala Ala Gly Leu Ala Trp Ala Ile Tyr
50 55 60
Asp Gly Val Val Pro Asn Pro Thr Ile Thr Val Val Lys Glu Gly Leu
65 70 75 80
Gly Val Phe Gln Asn Ser Gly Ala Asp Tyr Leu Ile Ala Ile Gly Gly
85 90 95
Gly Ser Pro Gln Asp Thr Cys Lys Ala Ile Gly Ile Ile Ser Asn Asn
100 105 110
Pro Glu Phe Ala Asp Val Arg Ser Leu Glu Gly Leu Ser Pro Thr Asn
115 120 125
Lys Pro Ser Val Pro Ile Leu Ala Ile Pro Thr Thr Ala Gly Thr Ala
130 135 140
Ala Glu Val Thr Ile Asn Tyr Val Ile Thr Asp Glu Glu Lys Arg Arg
145 150 155 160
Lys Phe Val Cys Val Asp Pro His Asp Ile Pro Gln Val Ala Phe Ile
165 170 175
Asp Ala Asp Met Met Asp Gly Met Pro Pro Ala Leu Lys Ala Ala Thr
180 185 190
Gly Val Asp Ala Leu Thr His Ala Ile Glu Gly Tyr Ile Thr Arg Gly
195 200 205
Ala Trp Ala Leu Thr Asp Ala Leu His Ile Lys Ala Ile Glu Ile Ile
210 215 220
Ala Gly Ala Leu Arg Gly Ser Val Ala Gly Asp Lys Asp Ala Gly Glu
225 230 235 240
Glu Met Ala Leu Gly Gln Tyr Val Ala Gly Met Gly Phe Ser Asn Val
245 250 255
Gly Leu Gly Leu Val His Gly Met Ala His Pro Leu Gly Ala Phe Tyr
260 265 270
Asn Thr Pro His Gly Val Ala Asn Ala Ile Leu Leu Pro His Val Met
275 280 285
Arg Tyr Asn Ala Asp Phe Thr Gly Glu Lys Tyr Arg Asp Ile Ala Arg
290 295 300
Val Met Gly Val Lys Val Glu Gly Met Ser Leu Glu Glu Ala Arg Asn
305 310 315 320
Ala Ala Val Glu Ala Val Phe Ala Leu Asn Arg Asp Val Gly Ile Pro
325 330 335
Pro His Leu Arg Asp Val Gly Val Arg Lys Glu Asp Ile Pro Ala Leu
340 345 350
Ala Gln Ala Ala Leu Asp Asp Val Cys Thr Gly Gly Asn Pro Arg Glu
355 360 365
Ala Thr Leu Glu Asp Ile Val Glu Leu Tyr His Thr Ala Trp
370 375 380
<210> 42
<211> 312
<212> PRT
<213> Escherichia coli
<400> 42
Met Asp Ile Ile Phe Tyr His Pro Thr Phe Asp Thr Gln Trp Trp Ile
1 5 10 15
Glu Ala Leu Arg Lys Ala Ile Pro Gln Ala Arg Val Arg Ala Trp Lys
20 25 30
Ser Gly Asp Asn Asp Ser Ala Asp Tyr Ala Leu Val Trp His Pro Pro
35 40 45
Val Glu Met Leu Ala Gly Arg Asp Leu Lys Ala Val Phe Ala Leu Gly
50 55 60
Ala Gly Val Asp Ser Ile Leu Ser Lys Leu Gln Ala His Pro Glu Met
65 70 75 80
Leu Asn Pro Ser Val Pro Leu Phe Arg Leu Glu Asp Thr Gly Met Gly
85 90 95
Glu Gln Met Gln Glu Tyr Ala Val Ser Gln Val Leu His Trp Phe Arg
100 105 110
Arg Phe Asp Asp Tyr Arg Ile Gln Gln Asn Ser Ser His Trp Gln Pro
115 120 125
Leu Pro Glu Tyr His Arg Glu Asp Phe Thr Ile Gly Ile Leu Gly Ala
130 135 140
Gly Val Leu Gly Ser Lys Val Ala Gln Ser Leu Gln Thr Trp Arg Phe
145 150 155 160
Pro Leu Arg Cys Trp Ser Arg Thr Arg Lys Ser Trp Pro Gly Val Gln
165 170 175
Ser Phe Ala Gly Arg Glu Glu Leu Ser Ala Phe Leu Ser Gln Cys Arg
180 185 190
Val Leu Ile Asn Leu Leu Pro Asn Thr Pro Glu Thr Val Gly Ile Ile
195 200 205
Asn Gln Gln Leu Leu Glu Lys Leu Pro Asp Gly Ala Tyr Leu Leu Asn
210 215 220
Leu Ala Arg Gly Val His Val Val Glu Asp Asp Leu Leu Ala Ala Leu
225 230 235 240
Asp Ser Gly Lys Val Lys Gly Ala Met Leu Asp Val Phe Asn Arg Glu
245 250 255
Pro Leu Pro Pro Glu Ser Pro Leu Trp Gln His Pro Arg Val Thr Ile
260 265 270
Thr Pro His Val Ala Ala Ile Thr Arg Pro Ala Glu Ala Val Glu Tyr
275 280 285
Ile Ser Arg Thr Ile Ala Gln Leu Glu Lys Gly Glu Arg Val Cys Gly
290 295 300
Gln Val Asp Arg Ala Arg Gly Tyr
305 310
<210> 43
<211> 324
<212> PRT
<213> Escherichia coli
<400> 43
Met Lys Pro Ser Val Ile Leu Tyr Lys Ala Leu Pro Asp Asp Leu Leu
1 5 10 15
Gln Arg Leu Gln Glu His Phe Thr Val His Gln Val Ala Asn Leu Ser
20 25 30
Pro Gln Thr Val Glu Gln Asn Ala Ala Ile Phe Ala Glu Ala Glu Gly
35 40 45
Leu Leu Gly Ser Asn Glu Asn Val Asn Ala Ala Leu Leu Glu Lys Met
50 55 60
Pro Lys Leu Arg Ala Thr Ser Thr Ile Ser Val Gly Tyr Asp Asn Phe
65 70 75 80
Asp Val Asp Ala Leu Thr Ala Arg Lys Ile Leu Leu Met His Thr Pro
85 90 95
Thr Val Leu Thr Glu Thr Val Ala Asp Thr Leu Met Ala Leu Val Leu
100 105 110
Ser Thr Ala Arg Arg Val Val Glu Val Ala Glu Arg Val Lys Ala Gly
115 120 125
Glu Trp Thr Ala Ser Ile Gly Pro Asp Trp Tyr Gly Thr Asp Val His
130 135 140
His Lys Thr Leu Gly Ile Val Gly Met Gly Arg Ile Gly Met Ala Leu
145 150 155 160
Ala Gln Arg Ala His Phe Gly Phe Asn Met Pro Ile Leu Tyr Asn Ala
165 170 175
Arg Arg His His Lys Glu Ala Glu Glu Arg Phe Asn Ala Arg Tyr Cys
180 185 190
Asp Leu Asp Thr Leu Leu Gln Glu Ser Asp Phe Val Cys Leu Ile Leu
195 200 205
Pro Leu Thr Asp Glu Thr His His Leu Phe Gly Ala Glu Gln Phe Ala
210 215 220
Lys Met Lys Ser Ser Ala Ile Phe Ile Asn Ala Gly Arg Gly Pro Val
225 230 235 240
Val Asp Glu Asn Ala Leu Ile Ala Ala Leu Gln Lys Gly Glu Ile His
245 250 255
Ala Ala Gly Leu Asp Val Phe Glu Gln Glu Pro Leu Ser Val Asp Ser
260 265 270
Pro Leu Leu Ser Met Ala Asn Val Val Ala Val Pro His Ile Gly Ser
275 280 285
Ala Thr His Glu Thr Arg Tyr Gly Met Ala Ala Cys Ala Val Asp Asn
290 295 300
Leu Ile Asp Ala Leu Gln Gly Lys Val Glu Lys Asn Cys Val Asn Pro
305 310 315 320
His Val Ala Asp
<210> 44
<211> 275
<212> PRT
<213> Escherichia coli
<400> 44
Met Ala Asn Pro Thr Val Ile Lys Leu Gln Asp Gly Asn Val Met Pro
1 5 10 15
Gln Leu Gly Leu Gly Val Trp Gln Ala Ser Asn Glu Glu Val Ile Thr
20 25 30
Ala Ile Gln Lys Ala Leu Glu Val Gly Tyr Arg Ser Ile Asp Thr Ala
35 40 45
Ala Ala Tyr Lys Asn Glu Glu Gly Val Gly Lys Ala Leu Lys Asn Ala
50 55 60
Ser Val Asn Arg Glu Glu Leu Phe Ile Thr Thr Lys Leu Trp Asn Asp
65 70 75 80
Asp His Lys Arg Pro Arg Glu Ala Leu Leu Asp Ser Leu Lys Lys Leu
85 90 95
Gln Leu Asp Tyr Ile Asp Leu Tyr Leu Met His Trp Pro Val Pro Ala
100 105 110
Ile Asp His Tyr Val Glu Ala Trp Lys Gly Met Ile Glu Leu Gln Lys
115 120 125
Glu Gly Leu Ile Lys Ser Ile Gly Val Cys Asn Phe Gln Ile His His
130 135 140
Leu Gln Arg Leu Ile Asp Glu Thr Gly Val Thr Pro Val Ile Asn Gln
145 150 155 160
Ile Glu Leu His Pro Leu Met Gln Gln Arg Gln Leu His Ala Trp Asn
165 170 175
Ala Thr His Lys Ile Gln Thr Glu Ser Trp Ser Pro Leu Ala Gln Gly
180 185 190
Gly Lys Gly Val Phe Asp Gln Lys Val Ile Arg Asp Leu Ala Asp Lys
195 200 205
Tyr Gly Lys Thr Pro Ala Gln Ile Val Ile Arg Trp His Leu Asp Ser
210 215 220
Gly Leu Val Val Ile Pro Lys Ser Val Thr Pro Ser Arg Ile Ala Glu
225 230 235 240
Asn Phe Asp Val Trp Asp Phe Arg Leu Asp Lys Asp Glu Leu Gly Glu
245 250 255
Ile Ala Lys Leu Asp Gln Gly Lys Arg Leu Gly Pro Asp Pro Asp Gln
260 265 270
Phe Gly Gly
275
<210> 45
<211> 267
<212> PRT
<213> Escherichia coli
<400> 45
Met Ala Ile Pro Ala Phe Gly Leu Gly Thr Phe Arg Leu Lys Asp Asp
1 5 10 15
Val Val Ile Ser Ser Val Ile Thr Ala Leu Glu Leu Gly Tyr Arg Ala
20 25 30
Ile Asp Thr Ala Gln Ile Tyr Asp Asn Glu Ala Ala Val Gly Gln Ala
35 40 45
Ile Ala Glu Ser Gly Val Pro Arg His Glu Leu Tyr Ile Thr Thr Lys
50 55 60
Ile Trp Ile Glu Asn Leu Ser Lys Asp Lys Leu Ile Pro Ser Leu Lys
65 70 75 80
Glu Ser Leu Gln Lys Leu Arg Thr Asp Tyr Val Asp Leu Thr Leu Ile
85 90 95
His Trp Pro Ser Pro Asn Asp Glu Val Ser Val Glu Glu Phe Met Gln
100 105 110
Ala Leu Leu Glu Ala Lys Lys Gln Gly Leu Thr Arg Glu Ile Gly Ile
115 120 125
Ser Asn Phe Thr Ile Pro Leu Met Glu Lys Ala Ile Ala Ala Val Gly
130 135 140
Ala Glu Asn Ile Ala Thr Asn Gln Ile Glu Leu Ser Pro Tyr Leu Gln
145 150 155 160
Asn Arg Lys Val Val Ala Trp Ala Lys Gln His Gly Ile His Ile Thr
165 170 175
Ser Tyr Met Thr Leu Ala Tyr Gly Lys Ala Leu Lys Asp Glu Val Ile
180 185 190
Ala Arg Ile Ala Ala Lys His Asn Ala Thr Pro Ala Gln Val Ile Leu
195 200 205
Ala Trp Ala Met Gly Glu Gly Tyr Ser Val Ile Pro Ser Ser Thr Lys
210 215 220
Arg Lys Asn Leu Glu Ser Asn Leu Lys Ala Gln Asn Leu Gln Leu Asp
225 230 235 240
Ala Glu Asp Lys Lys Ala Ile Ala Ala Leu Asp Cys Asn Asp Arg Leu
245 250 255
Val Ser Pro Glu Gly Leu Ala Pro Glu Trp Asp
260 265
<210> 46
<211> 495
<212> PRT
<213> Escherichia coli
<400> 46
Met Asn Phe His His Leu Ala Tyr Trp Gln Asp Lys Ala Leu Ser Leu
1 5 10 15
Ala Ile Glu Asn Arg Leu Phe Ile Asn Gly Glu Tyr Thr Ala Ala Ala
20 25 30
Glu Asn Glu Thr Phe Glu Thr Val Asp Pro Val Thr Gln Ala Pro Leu
35 40 45
Ala Lys Ile Ala Arg Gly Lys Ser Val Asp Ile Asp Arg Ala Met Ser
50 55 60
Ala Ala Arg Gly Val Phe Glu Arg Gly Asp Trp Ser Leu Ser Ser Pro
65 70 75 80
Ala Lys Arg Lys Ala Val Leu Asn Lys Leu Ala Asp Leu Met Glu Ala
85 90 95
His Ala Glu Glu Leu Ala Leu Leu Glu Thr Leu Asp Thr Gly Lys Pro
100 105 110
Ile Arg His Ser Leu Arg Asp Asp Ile Pro Gly Ala Ala Arg Ala Ile
115 120 125
Arg Trp Tyr Ala Glu Ala Ile Asp Lys Val Tyr Gly Glu Val Ala Thr
130 135 140
Thr Ser Ser His Glu Leu Ala Met Ile Val Arg Glu Pro Val Gly Val
145 150 155 160
Ile Ala Ala Ile Val Pro Trp Asn Phe Pro Leu Leu Leu Thr Cys Trp
165 170 175
Lys Leu Gly Pro Ala Leu Ala Ala Gly Asn Ser Val Ile Leu Lys Pro
180 185 190
Ser Glu Lys Ser Pro Leu Ser Ala Ile Arg Leu Ala Gly Leu Ala Lys
195 200 205
Glu Ala Gly Leu Pro Asp Gly Val Leu Asn Val Val Thr Gly Phe Gly
210 215 220
His Glu Ala Gly Gln Ala Leu Ser Arg His Asn Asp Ile Asp Ala Ile
225 230 235 240
Ala Phe Thr Gly Ser Thr Arg Thr Gly Lys Gln Leu Leu Lys Asp Ala
245 250 255
Gly Asp Ser Asn Met Lys Arg Val Trp Leu Glu Ala Gly Gly Lys Ser
260 265 270
Ala Asn Ile Val Phe Ala Asp Cys Pro Asp Leu Gln Gln Ala Ala Ser
275 280 285
Ala Thr Ala Ala Gly Ile Phe Tyr Asn Gln Gly Gln Val Cys Ile Ala
290 295 300
Gly Thr Arg Leu Leu Leu Glu Glu Ser Ile Ala Asp Glu Phe Leu Ala
305 310 315 320
Leu Leu Lys Gln Gln Ala Gln Asn Trp Gln Pro Gly His Pro Leu Asp
325 330 335
Pro Ala Thr Thr Met Gly Thr Leu Ile Asp Cys Ala His Ala Asp Ser
340 345 350
Val His Ser Phe Ile Arg Glu Gly Glu Ser Lys Gly Gln Leu Leu Leu
355 360 365
Asp Gly Arg Asn Ala Gly Leu Ala Ala Ala Ile Gly Pro Thr Ile Phe
370 375 380
Val Asp Val Asp Pro Asn Ala Ser Leu Ser Arg Glu Glu Ile Phe Gly
385 390 395 400
Pro Val Leu Val Val Thr Arg Phe Thr Ser Glu Glu Gln Ala Leu Gln
405 410 415
Leu Ala Asn Asp Ser Gln Tyr Gly Leu Gly Ala Ala Val Trp Thr Arg
420 425 430
Asp Leu Ser Arg Ala His Arg Met Ser Arg Arg Leu Lys Ala Gly Ser
435 440 445
Val Phe Val Asn Asn Tyr Asn Asp Gly Asp Met Thr Val Pro Phe Gly
450 455 460
Gly Tyr Lys Gln Ser Gly Asn Gly Arg Asp Lys Ser Leu His Ala Leu
465 470 475 480
Glu Lys Phe Thr Glu Leu Lys Thr Ile Trp Ile Ser Leu Glu Ala
485 490 495
<210> 47
<211> 496
<212> PRT
<213> Klebsiella pneumoniae
<400> 47
Met Met Asn Phe Gln His Leu Ala Tyr Trp Gln Glu Lys Ala Lys Asn
1 5 10 15
Leu Ala Ile Glu Thr Arg Leu Phe Ile Asn Gly Glu Tyr Cys Ala Ala
20 25 30
Ala Asp Asn Thr Thr Phe Glu Thr Ile Asp Pro Ala Ala Gln Gln Thr
35 40 45
Leu Ala Gln Val Ala Arg Gly Lys Lys Ala Asp Val Glu Arg Ala Val
50 55 60
Lys Ala Ala Arg Gln Ala Phe Asp Asn Gly Asp Trp Ser Gln Ala Ser
65 70 75 80
Pro Ala Gln Arg Lys Ala Ile Leu Thr Arg Phe Ala Asn Leu Met Glu
85 90 95
Ala His Arg Glu Glu Leu Ala Leu Leu Glu Thr Leu Asp Thr Gly Lys
100 105 110
Pro Ile Arg His Ser Leu Arg Asp Asp Ile Pro Gly Ala Ala Arg Ala
115 120 125
Ile Arg Trp Tyr Ala Glu Ala Leu Asp Lys Val Tyr Gly Glu Val Ala
130 135 140
Pro Thr Gly Ser Asn Glu Leu Ala Met Ile Val Arg Glu Pro Ile Gly
145 150 155 160
Val Ile Ala Ala Val Val Pro Trp Asn Phe Pro Leu Leu Leu Ala Cys
165 170 175
Trp Lys Leu Gly Pro Ala Leu Ala Ala Gly Asn Ser Val Ile Leu Lys
180 185 190
Pro Ser Glu Lys Ser Pro Leu Thr Ala Leu Arg Leu Ala Gly Leu Ala
195 200 205
Lys Glu Ala Gly Leu Pro Asp Gly Val Leu Asn Val Val Ser Gly Phe
210 215 220
Gly His Glu Ala Gly Gln Ala Leu Ala Leu His Pro Asp Val Glu Val
225 230 235 240
Ile Thr Phe Thr Gly Ser Thr Arg Thr Gly Lys Gln Leu Leu Lys Asp
245 250 255
Ala Gly Asp Ser Asn Met Lys Arg Val Trp Leu Glu Ala Gly Gly Lys
260 265 270
Ser Ala Asn Ile Val Phe Ala Asp Cys Pro Asp Leu Gln Gln Ala Val
275 280 285
Arg Ala Thr Ala Gly Gly Ile Phe Tyr Asn Gln Gly Gln Val Cys Ile
290 295 300
Ala Gly Thr Arg Leu Leu Leu Glu Glu Ser Ile Ala Asp Glu Phe Leu
305 310 315 320
Ala Arg Leu Lys Ala Glu Ala Gln His Trp Gln Pro Gly Asn Pro Leu
325 330 335
Asp Pro Asp Thr Thr Met Gly Met Leu Ile Asp Asn Thr His Ala Asp
340 345 350
Asn Val His Ser Phe Ile Arg Gly Gly Glu Ser Gln Ser Thr Leu Phe
355 360 365
Leu Asp Gly Arg Lys Asn Pro Trp Pro Ala Ala Val Gly Pro Thr Ile
370 375 380
Phe Val Asp Val Asp Pro Ala Ser Thr Leu Ser Arg Glu Glu Ile Phe
385 390 395 400
Gly Pro Val Leu Val Val Thr Arg Phe Lys Ser Glu Glu Glu Ala Leu
405 410 415
Lys Leu Ala Asn Asp Ser Asp Tyr Gly Leu Gly Ala Ala Val Trp Thr
420 425 430
Arg Asp Leu Ser Arg Ala His Arg Met Ser Arg Arg Leu Lys Ala Gly
435 440 445
Ser Val Phe Val Asn Asn Tyr Asn Asp Gly Asp Met Thr Val Pro Phe
450 455 460
Gly Gly Tyr Lys Gln Ser Gly Asn Gly Arg Asp Lys Ser Leu His Ala
465 470 475 480
Leu Glu Lys Phe Thr Glu Leu Lys Thr Ile Trp Ile Ala Leu Glu Ser
485 490 495
<210> 48
<211> 519
<212> PRT
<213> Saccharomyces cerevisiae
<400> 48
Met Phe Ser Arg Ser Thr Leu Cys Leu Lys Thr Ser Ala Ser Ser Ile
1 5 10 15
Gly Arg Leu Gln Leu Arg Tyr Phe Ser His Leu Pro Met Thr Val Pro
20 25 30
Ile Lys Leu Pro Asn Gly Leu Glu Tyr Glu Gln Pro Thr Gly Leu Phe
35 40 45
Ile Asn Asn Lys Phe Val Pro Ser Lys Gln Asn Lys Thr Phe Glu Val
50 55 60
Ile Asn Pro Ser Thr Glu Glu Glu Ile Cys His Ile Tyr Glu Gly Arg
65 70 75 80
Glu Asp Asp Val Glu Glu Ala Val Gln Ala Ala Asp Arg Ala Phe Ser
85 90 95
Asn Gly Ser Trp Asn Gly Ile Asp Pro Ile Asp Arg Gly Lys Ala Leu
100 105 110
Tyr Arg Leu Ala Glu Leu Ile Glu Gln Asp Lys Asp Val Ile Ala Ser
115 120 125
Ile Glu Thr Leu Asp Asn Gly Lys Ala Ile Ser Ser Ser Arg Gly Asp
130 135 140
Val Asp Leu Val Ile Asn Tyr Leu Lys Ser Ser Ala Gly Phe Ala Asp
145 150 155 160
Lys Ile Asp Gly Arg Met Ile Asp Thr Gly Arg Thr His Phe Ser Tyr
165 170 175
Thr Lys Arg Gln Pro Leu Gly Val Cys Gly Gln Ile Ile Pro Trp Asn
180 185 190
Phe Pro Leu Leu Met Trp Ala Trp Lys Ile Ala Pro Ala Leu Val Thr
195 200 205
Gly Asn Thr Val Val Leu Lys Thr Ala Glu Ser Thr Pro Leu Ser Ala
210 215 220
Leu Tyr Val Ser Lys Tyr Ile Pro Gln Ala Gly Ile Pro Pro Gly Val
225 230 235 240
Ile Asn Ile Val Ser Gly Phe Gly Lys Ile Val Gly Glu Ala Ile Thr
245 250 255
Asn His Pro Lys Ile Lys Lys Val Ala Phe Thr Gly Ser Thr Ala Thr
260 265 270
Gly Arg His Ile Tyr Gln Ser Ala Ala Ala Gly Leu Lys Lys Val Thr
275 280 285
Leu Glu Leu Gly Gly Lys Ser Pro Asn Ile Val Phe Ala Asp Ala Glu
290 295 300
Leu Lys Lys Ala Val Gln Asn Ile Ile Leu Gly Ile Tyr Tyr Asn Ser
305 310 315 320
Gly Glu Val Cys Cys Ala Gly Ser Arg Val Tyr Val Glu Glu Ser Ile
325 330 335
Tyr Asp Lys Phe Ile Glu Glu Phe Lys Ala Ala Ser Glu Ser Ile Lys
340 345 350
Val Gly Asp Pro Phe Asp Glu Ser Thr Phe Gln Gly Ala Gln Thr Ser
355 360 365
Gln Met Gln Leu Asn Lys Ile Leu Lys Tyr Val Asp Ile Gly Lys Asn
370 375 380
Glu Gly Ala Thr Leu Ile Thr Gly Gly Glu Arg Leu Gly Ser Lys Gly
385 390 395 400
Tyr Phe Ile Lys Pro Thr Val Phe Gly Asp Val Lys Glu Asp Met Arg
405 410 415
Ile Val Lys Glu Glu Ile Phe Gly Pro Val Val Thr Val Thr Lys Phe
420 425 430
Lys Ser Ala Asp Glu Val Ile Asn Met Ala Asn Asp Ser Glu Tyr Gly
435 440 445
Leu Ala Ala Gly Ile His Thr Ser Asn Ile Asn Thr Ala Leu Lys Val
450 455 460
Ala Asp Arg Val Asn Ala Gly Thr Val Trp Ile Asn Thr Tyr Asn Asp
465 470 475 480
Phe His His Ala Val Pro Phe Gly Gly Phe Asn Ala Ser Gly Leu Gly
485 490 495
Arg Glu Met Ser Val Asp Ala Leu Gln Asn Tyr Leu Gln Val Lys Ala
500 505 510
Val Arg Ala Lys Leu Asp Glu
515
<210> 49
<211> 520
<212> PRT
<213> Saccharomyces cerevisiae
<400> 49
Met Leu Ser Arg Thr Arg Ala Ala Ala Pro Asn Ser Arg Ile Phe Thr
1 5 10 15
Arg Ser Leu Leu Arg Leu Tyr Ser Gln Ala Pro Leu Arg Val Pro Ile
20 25 30
Thr Leu Pro Asn Gly Phe Thr Tyr Glu Gln Pro Thr Gly Leu Phe Ile
35 40 45
Asn Gly Glu Phe Val Ala Ser Lys Gln Lys Lys Thr Phe Asp Val Ile
50 55 60
Asn Pro Ser Asn Glu Glu Lys Ile Thr Thr Val Tyr Lys Ala Met Glu
65 70 75 80
Asp Asp Val Asp Glu Ala Val Ala Ala Ala Lys Lys Ala Phe Glu Thr
85 90 95
Lys Trp Ser Ile Val Glu Pro Glu Val Arg Ala Lys Ala Leu Phe Asn
100 105 110
Leu Ala Asp Leu Val Glu Lys His Gln Glu Thr Leu Ala Ala Ile Glu
115 120 125
Ser Met Asp Asn Gly Lys Ser Leu Phe Cys Ala Arg Gly Asp Val Ala
130 135 140
Leu Val Ser Lys Tyr Leu Arg Ser Cys Gly Gly Trp Ala Asp Lys Ile
145 150 155 160
Tyr Gly Asn Val Ile Asp Thr Gly Lys Asn His Phe Thr Tyr Ser Ile
165 170 175
Lys Glu Pro Leu Gly Val Cys Gly Gln Ile Ile Pro Trp Asn Phe Pro
180 185 190
Leu Leu Met Trp Ser Trp Lys Ile Gly Pro Ala Leu Ala Thr Gly Asn
195 200 205
Thr Val Val Leu Lys Pro Ala Glu Thr Thr Pro Leu Ser Ala Leu Phe
210 215 220
Ala Ser Gln Leu Cys Gln Glu Ala Gly Ile Pro Ala Gly Val Val Asn
225 230 235 240
Ile Leu Pro Gly Ser Gly Arg Val Val Gly Glu Arg Leu Ser Ala His
245 250 255
Pro Asp Val Lys Lys Ile Ala Phe Thr Gly Ser Thr Ala Thr Gly Arg
260 265 270
His Ile Met Lys Val Ala Ala Asp Thr Val Lys Lys Val Thr Leu Glu
275 280 285
Leu Gly Gly Lys Ser Pro Asn Ile Val Phe Ala Asp Ala Asp Leu Asp
290 295 300
Lys Ala Val Lys Asn Ile Ala Phe Gly Ile Phe Tyr Asn Ser Gly Glu
305 310 315 320
Val Cys Cys Ala Gly Ser Arg Ile Tyr Ile Gln Asp Thr Val Tyr Glu
325 330 335
Glu Val Leu Gln Lys Leu Lys Asp Tyr Thr Glu Ser Leu Lys Val Gly
340 345 350
Asp Pro Phe Asp Glu Glu Val Phe Gln Gly Ala Gln Thr Ser Asp Lys
355 360 365
Gln Leu His Lys Ile Leu Asp Tyr Val Asp Val Ala Lys Ser Glu Gly
370 375 380
Ala Arg Leu Val Thr Gly Gly Ala Arg His Gly Ser Lys Gly Tyr Phe
385 390 395 400
Val Lys Pro Thr Val Phe Ala Asp Val Lys Glu Asp Met Arg Ile Val
405 410 415
Lys Glu Glu Val Phe Gly Pro Ile Val Thr Val Ser Lys Phe Ser Thr
420 425 430
Val Asp Glu Val Ile Ala Met Ala Asn Asp Ser Gln Tyr Gly Leu Ala
435 440 445
Ala Gly Ile His Thr Asn Asp Ile Asn Lys Ala Val Asp Val Ser Lys
450 455 460
Arg Val Lys Ala Gly Thr Val Trp Ile Asn Thr Tyr Asn Asn Phe His
465 470 475 480
Gln Asn Val Pro Phe Gly Gly Phe Gly Gln Ser Gly Ile Gly Arg Glu
485 490 495
Met Gly Glu Ala Ala Leu Ser Asn Tyr Thr Gln Thr Lys Ser Val Arg
500 505 510
Ile Ala Ile Asp Lys Pro Ile Arg
515 520
<210> 50
<211> 462
<212> PRT
<213> Escherichia coli
<400> 50
Met Thr Ile Thr Pro Ala Thr His Ala Ile Ser Ile Asn Pro Ala Thr
1 5 10 15
Gly Glu Gln Leu Ser Val Leu Pro Trp Ala Gly Ala Asp Asp Ile Glu
20 25 30
Asn Ala Leu Gln Leu Ala Ala Ala Gly Phe Arg Asp Trp Arg Glu Thr
35 40 45
Asn Ile Asp Tyr Arg Ala Glu Lys Leu Arg Asp Ile Gly Lys Ala Leu
50 55 60
Arg Ala Arg Ser Glu Glu Met Ala Gln Met Ile Thr Arg Glu Met Gly
65 70 75 80
Lys Pro Ile Asn Gln Ala Arg Ala Glu Val Ala Lys Ser Ala Asn Leu
85 90 95
Cys Asp Trp Tyr Ala Glu His Gly Pro Ala Met Leu Lys Ala Glu Pro
100 105 110
Thr Leu Val Glu Asn Gln Gln Ala Val Ile Glu Tyr Arg Pro Leu Gly
115 120 125
Thr Ile Leu Ala Ile Met Pro Trp Asn Phe Pro Leu Trp Gln Val Met
130 135 140
Arg Gly Ala Val Pro Ile Ile Leu Ala Gly Asn Gly Tyr Leu Leu Lys
145 150 155 160
His Ala Pro Asn Val Met Gly Cys Ala Gln Leu Ile Ala Gln Val Phe
165 170 175
Lys Asp Ala Gly Ile Pro Gln Gly Val Tyr Gly Trp Leu Asn Ala Asp
180 185 190
Asn Asp Gly Val Ser Gln Met Ile Lys Asp Ser Arg Ile Ala Ala Val
195 200 205
Thr Val Thr Gly Ser Val Arg Ala Gly Ala Ala Ile Gly Ala Gln Ala
210 215 220
Gly Ala Ala Leu Lys Lys Cys Val Leu Glu Leu Gly Gly Ser Asp Pro
225 230 235 240
Phe Ile Val Leu Asn Asp Ala Asp Leu Glu Leu Ala Val Lys Ala Ala
245 250 255
Val Ala Gly Arg Tyr Gln Asn Thr Gly Gln Val Cys Ala Ala Ala Lys
260 265 270
Arg Phe Ile Ile Glu Glu Gly Ile Ala Ser Ala Phe Thr Glu Arg Phe
275 280 285
Val Ala Ala Ala Ala Ala Leu Lys Met Gly Asp Pro Arg Asp Glu Glu
290 295 300
Asn Ala Leu Gly Pro Met Ala Arg Phe Asp Leu Arg Asp Glu Leu His
305 310 315 320
His Gln Val Glu Lys Thr Leu Ala Gln Gly Ala Arg Leu Leu Leu Gly
325 330 335
Gly Glu Lys Met Ala Gly Ala Gly Asn Tyr Tyr Pro Pro Thr Val Leu
340 345 350
Ala Asn Val Thr Pro Glu Met Thr Ala Phe Arg Glu Glu Met Phe Gly
355 360 365
Pro Val Ala Ala Ile Thr Ile Ala Lys Asp Ala Glu His Ala Leu Glu
370 375 380
Leu Ala Asn Asp Ser Glu Phe Gly Leu Ser Ala Thr Ile Phe Thr Thr
385 390 395 400
Asp Glu Thr Gln Ala Arg Gln Met Ala Ala Arg Leu Glu Cys Gly Gly
405 410 415
Val Phe Ile Asn Gly Tyr Cys Ala Ser Asp Ala Arg Val Ala Phe Gly
420 425 430
Gly Val Lys Lys Ser Gly Phe Gly Arg Glu Leu Ser His Phe Gly Leu
435 440 445
His Glu Phe Cys Asn Ile Gln Thr Val Trp Lys Asp Arg Ile
450 455 460
<210> 51
<211> 482
<212> PRT
<213> Escherichia coli
<400> 51
Met Lys Leu Asn Asp Ser Asn Leu Phe Arg Gln Gln Ala Leu Ile Asn
1 5 10 15
Gly Glu Trp Leu Asp Ala Asn Asn Gly Glu Ala Ile Asp Val Thr Asn
20 25 30
Pro Ala Asn Gly Asp Lys Leu Gly Ser Val Pro Lys Met Gly Ala Asp
35 40 45
Glu Thr Arg Ala Ala Ile Asp Ala Ala Asn Arg Ala Leu Pro Ala Trp
50 55 60
Arg Ala Leu Thr Ala Lys Glu Arg Ala Thr Ile Leu Arg Asn Trp Phe
65 70 75 80
Asn Leu Met Met Glu His Gln Asp Asp Leu Ala Arg Leu Met Thr Leu
85 90 95
Glu Gln Gly Lys Pro Leu Ala Glu Ala Lys Gly Glu Ile Ser Tyr Ala
100 105 110
Ala Ser Phe Ile Glu Trp Phe Ala Glu Glu Gly Lys Arg Ile Tyr Gly
115 120 125
Asp Thr Ile Pro Gly His Gln Ala Asp Lys Arg Leu Ile Val Ile Lys
130 135 140
Gln Pro Ile Gly Val Thr Ala Ala Ile Thr Pro Trp Asn Phe Pro Ala
145 150 155 160
Ala Met Ile Thr Arg Lys Ala Gly Pro Ala Leu Ala Ala Gly Cys Thr
165 170 175
Met Val Leu Lys Pro Ala Ser Gln Thr Pro Phe Ser Ala Leu Ala Leu
180 185 190
Ala Glu Leu Ala Ile Arg Ala Gly Val Pro Ala Gly Val Phe Asn Val
195 200 205
Val Thr Gly Ser Ala Gly Ala Val Gly Asn Glu Leu Thr Ser Asn Pro
210 215 220
Leu Val Arg Lys Leu Ser Phe Thr Gly Ser Thr Glu Ile Gly Arg Gln
225 230 235 240
Leu Met Glu Gln Cys Ala Lys Asp Ile Lys Lys Val Ser Leu Glu Leu
245 250 255
Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp Ala Asp Leu Asp Lys
260 265 270
Ala Val Glu Gly Ala Leu Ala Ser Lys Phe Arg Asn Ala Gly Gln Thr
275 280 285
Cys Val Cys Ala Asn Arg Leu Tyr Val Gln Asp Gly Val Tyr Asp Arg
290 295 300
Phe Ala Glu Lys Leu Gln Gln Ala Val Ser Lys Leu His Ile Gly Asp
305 310 315 320
Gly Leu Asp Asn Gly Val Thr Ile Gly Pro Leu Ile Asp Glu Lys Ala
325 330 335
Val Ala Lys Val Glu Glu His Ile Ala Asp Ala Leu Glu Lys Gly Ala
340 345 350
Arg Val Val Cys Gly Gly Lys Ala His Glu Arg Gly Gly Asn Phe Phe
355 360 365
Gln Pro Thr Ile Leu Val Asp Val Pro Ala Asn Ala Lys Val Ser Lys
370 375 380
Glu Glu Thr Phe Gly Pro Leu Ala Pro Leu Phe Arg Phe Lys Asp Glu
385 390 395 400
Ala Asp Val Ile Ala Gln Ala Asn Asp Thr Glu Phe Gly Leu Ala Ala
405 410 415
Tyr Phe Tyr Ala Arg Asp Leu Ser Arg Val Phe Arg Val Gly Glu Ala
420 425 430
Leu Glu Tyr Gly Ile Val Gly Ile Asn Thr Gly Ile Ile Ser Asn Glu
435 440 445
Val Ala Pro Phe Gly Gly Ile Lys Ala Ser Gly Leu Gly Arg Glu Gly
450 455 460
Ser Lys Tyr Gly Ile Glu Asp Tyr Leu Glu Ile Lys Tyr Met Cys Ile
465 470 475 480
Gly Leu
<210> 52
<211> 474
<212> PRT
<213> Escherichia coli
<400> 52
Met Gln His Lys Leu Leu Ile Asn Gly Glu Leu Val Ser Gly Glu Gly
1 5 10 15
Glu Lys Gln Pro Val Tyr Asn Pro Ala Thr Gly Asp Val Leu Leu Glu
20 25 30
Ile Ala Glu Ala Ser Ala Glu Gln Val Asp Ala Ala Val Arg Ala Ala
35 40 45
Asp Ala Ala Phe Ala Glu Trp Gly Gln Thr Thr Pro Lys Val Arg Ala
50 55 60
Glu Cys Leu Leu Lys Leu Ala Asp Val Ile Glu Glu Asn Gly Gln Val
65 70 75 80
Phe Ala Glu Leu Glu Ser Arg Asn Cys Gly Lys Pro Leu His Ser Ala
85 90 95
Phe Asn Asp Glu Ile Pro Ala Ile Val Asp Val Phe Arg Phe Phe Ala
100 105 110
Gly Ala Ala Arg Cys Leu Asn Gly Leu Ala Ala Gly Glu Tyr Leu Glu
115 120 125
Gly His Thr Ser Met Ile Arg Arg Asp Pro Leu Gly Val Val Ala Ser
130 135 140
Ile Ala Pro Trp Asn Tyr Pro Leu Met Met Ala Ala Trp Lys Leu Ala
145 150 155 160
Pro Ala Leu Ala Ala Gly Asn Cys Val Val Leu Lys Pro Ser Glu Ile
165 170 175
Thr Pro Leu Thr Ala Leu Lys Leu Ala Glu Leu Ala Lys Asp Ile Phe
180 185 190
Pro Ala Gly Val Ile Asn Ile Leu Phe Gly Arg Gly Lys Thr Val Gly
195 200 205
Asp Pro Leu Thr Gly His Pro Lys Val Arg Met Val Ser Leu Thr Gly
210 215 220
Ser Ile Ala Thr Gly Glu His Ile Ile Ser His Thr Ala Ser Ser Ile
225 230 235 240
Lys Arg Thr His Met Glu Leu Gly Gly Lys Ala Pro Val Ile Val Phe
245 250 255
Asp Asp Ala Asp Ile Glu Ala Val Val Glu Gly Val Arg Thr Phe Gly
260 265 270
Tyr Tyr Asn Ala Gly Gln Asp Cys Thr Ala Ala Cys Arg Ile Tyr Ala
275 280 285
Gln Lys Gly Ile Tyr Asp Thr Leu Val Glu Lys Leu Gly Ala Ala Val
290 295 300
Ala Thr Leu Lys Ser Gly Ala Pro Asp Asp Glu Ser Thr Glu Leu Gly
305 310 315 320
Pro Leu Ser Ser Leu Ala His Leu Glu Arg Val Gly Lys Ala Val Glu
325 330 335
Glu Ala Lys Ala Thr Gly His Ile Lys Val Ile Thr Gly Gly Glu Lys
340 345 350
Arg Lys Gly Asn Gly Tyr Tyr Tyr Ala Pro Thr Leu Leu Ala Gly Ala
355 360 365
Leu Gln Asp Asp Ala Ile Val Gln Lys Glu Val Phe Gly Pro Val Val
370 375 380
Ser Val Thr Pro Phe Asp Asn Glu Glu Gln Val Val Asn Trp Ala Asn
385 390 395 400
Asp Ser Gln Tyr Gly Leu Ala Ser Ser Val Trp Thr Lys Asp Val Gly
405 410 415
Arg Ala His Arg Val Ser Ala Arg Leu Gln Tyr Gly Cys Thr Trp Val
420 425 430
Asn Thr His Phe Met Leu Val Ser Glu Met Pro His Gly Gly Gln Lys
435 440 445
Leu Ser Gly Tyr Gly Lys Asp Met Ser Leu Tyr Gly Leu Glu Asp Tyr
450 455 460
Thr Val Val Arg His Val Met Val Lys His
465 470
<210> 53
<211> 480
<212> PRT
<213> Pseudomonas putida
<400> 53
Met Gln Leu Lys Asp Ala Gln Leu Phe Arg Gln Gln Ala Tyr Ile Asn
1 5 10 15
Gly Glu Trp Leu Asp Ala Asp Asn Gly Gln Thr Ile Lys Val Thr Asn
20 25 30
Pro Ala Thr Gly Glu Val Ile Gly Thr Val Pro Lys Met Gly Thr Ala
35 40 45
Glu Thr Arg Arg Ala Ile Glu Ala Ala Asp Lys Ala Leu Pro Ala Trp
50 55 60
Arg Ala Leu Thr Ala Lys Glu Arg Ser Ala Lys Leu Arg Arg Trp Phe
65 70 75 80
Glu Leu Met Ile Glu Asn Gln Asp Asp Leu Ala Arg Leu Met Thr Thr
85 90 95
Glu Gln Gly Lys Pro Leu Ala Glu Ala Lys Gly Glu Ile Ala Tyr Ala
100 105 110
Ala Ser Phe Ile Glu Trp Phe Ala Glu Glu Ala Lys Arg Ile Tyr Gly
115 120 125
Asp Thr Ile Pro Gly His Gln Pro Asp Lys Arg Leu Ile Val Ile Lys
130 135 140
Gln Pro Ile Gly Val Thr Ala Ala Ile Thr Pro Trp Asn Phe Pro Ala
145 150 155 160
Ala Met Ile Thr Arg Lys Ala Gly Pro Ala Leu Ala Ala Gly Cys Thr
165 170 175
Met Val Leu Lys Pro Ala Ser Gln Thr Pro Tyr Ser Ala Leu Ala Leu
180 185 190
Val Glu Leu Ala His Arg Ala Gly Ile Pro Ala Gly Val Leu Ser Val
195 200 205
Val Thr Gly Ser Ala Gly Glu Val Gly Gly Glu Leu Thr Gly Asn Ser
210 215 220
Leu Val Arg Lys Leu Ser Phe Thr Gly Ser Thr Glu Ile Gly Arg Gln
225 230 235 240
Leu Met Glu Glu Cys Ala Lys Asp Ile Lys Lys Val Ser Leu Glu Leu
245 250 255
Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp Ala Asp Leu Asp Lys
260 265 270
Ala Val Glu Gly Ala Ile Ile Ser Lys Tyr Arg Asn Asn Gly Gln Thr
275 280 285
Cys Val Cys Ala Asn Arg Ile Tyr Val Gln Asp Gly Val Tyr Asp Ala
290 295 300
Phe Ala Glu Lys Leu Ala Ala Ala Val Ala Lys Leu Lys Ile Gly Asn
305 310 315 320
Gly Leu Glu Glu Gly Thr Thr Thr Gly Pro Leu Ile Asp Gly Lys Ala
325 330 335
Val Ala Lys Val Gln Glu His Ile Glu Asp Ala Val Ser Lys Gly Ala
340 345 350
Lys Val Leu Ser Gly Gly Lys Leu Ile Glu Gly Asn Phe Phe Glu Pro
355 360 365
Thr Ile Leu Val Asp Val Pro Lys Thr Ala Ala Val Ala Lys Glu Glu
370 375 380
Thr Phe Gly Pro Leu Ala Pro Leu Phe Arg Phe Lys Asp Glu Ala Glu
385 390 395 400
Val Ile Ala Met Ser Asn Asp Thr Glu Phe Gly Leu Ala Ser Tyr Phe
405 410 415
Tyr Ala Arg Asp Met Ser Arg Val Phe Arg Val Ala Glu Ala Leu Glu
420 425 430
Tyr Gly Met Val Gly Ile Asn Thr Gly Leu Ile Ser Asn Glu Val Ala
435 440 445
Pro Phe Gly Gly Ile Lys Ala Ser Gly Leu Gly Arg Glu Gly Ser Lys
450 455 460
Tyr Gly Ile Glu Asp Tyr Leu Glu Ile Lys Tyr Leu Cys Ile Ser Val
465 470 475 480
<210> 54
<211> 479
<212> PRT
<213> Escherichia coli
<400> 54
Met Ser Val Pro Val Gln His Pro Met Tyr Ile Asp Gly Gln Phe Val
1 5 10 15
Thr Trp Arg Gly Asp Ala Trp Ile Asp Val Val Asn Pro Ala Thr Glu
20 25 30
Ala Val Ile Ser Arg Ile Pro Asp Gly Gln Ala Glu Asp Ala Arg Lys
35 40 45
Ala Ile Asp Ala Ala Glu Arg Ala Gln Pro Glu Trp Glu Ala Leu Pro
50 55 60
Ala Ile Glu Arg Ala Ser Trp Leu Arg Lys Ile Ser Ala Gly Ile Arg
65 70 75 80
Glu Arg Ala Ser Glu Ile Ser Ala Leu Ile Val Glu Glu Gly Gly Lys
85 90 95
Ile Gln Gln Leu Ala Glu Val Glu Val Ala Phe Thr Ala Asp Tyr Ile
100 105 110
Asp Tyr Met Ala Glu Trp Ala Arg Arg Tyr Glu Gly Glu Ile Ile Gln
115 120 125
Ser Asp Arg Pro Gly Glu Asn Ile Leu Leu Phe Lys Arg Ala Leu Gly
130 135 140
Val Thr Thr Gly Ile Leu Pro Trp Asn Phe Pro Phe Phe Leu Ile Ala
145 150 155 160
Arg Lys Met Ala Pro Ala Leu Leu Thr Gly Asn Thr Ile Val Ile Lys
165 170 175
Pro Ser Glu Phe Thr Pro Asn Asn Ala Ile Ala Phe Ala Lys Ile Val
180 185 190
Asp Glu Ile Gly Leu Pro Arg Gly Val Phe Asn Leu Val Leu Gly Arg
195 200 205
Gly Glu Thr Val Gly Gln Glu Leu Ala Gly Asn Pro Lys Val Ala Met
210 215 220
Val Ser Met Thr Gly Ser Val Ser Ala Gly Glu Lys Ile Met Ala Thr
225 230 235 240
Ala Ala Lys Asn Ile Thr Lys Val Cys Leu Glu Leu Gly Gly Lys Ala
245 250 255
Pro Ala Ile Val Met Asp Asp Ala Asp Leu Glu Leu Ala Val Lys Ala
260 265 270
Ile Val Asp Ser Arg Val Ile Asn Ser Gly Gln Val Cys Asn Cys Ala
275 280 285
Glu Arg Val Tyr Val Gln Lys Gly Ile Tyr Asp Gln Phe Val Asn Arg
290 295 300
Leu Gly Glu Ala Met Gln Ala Val Gln Phe Gly Asn Pro Ala Glu Arg
305 310 315 320
Asn Asp Ile Ala Met Gly Pro Leu Ile Asn Ala Ala Ala Leu Glu Arg
325 330 335
Val Glu Gln Lys Val Ala Arg Ala Val Glu Glu Gly Ala Arg Val Ala
340 345 350
Phe Gly Gly Lys Ala Val Glu Gly Lys Gly Tyr Tyr Tyr Pro Pro Thr
355 360 365
Leu Leu Leu Asp Val Arg Gln Glu Met Ser Ile Met His Glu Glu Thr
370 375 380
Phe Gly Pro Val Leu Pro Val Val Ala Phe Asp Thr Leu Glu Asp Ala
385 390 395 400
Ile Ser Met Ala Asn Asp Ser Asp Tyr Gly Leu Thr Ser Ser Ile Tyr
405 410 415
Thr Gln Asn Leu Asn Val Ala Met Lys Ala Ile Lys Gly Leu Lys Phe
420 425 430
Gly Glu Thr Tyr Ile Asn Arg Glu Asn Phe Glu Ala Met Gln Gly Phe
435 440 445
His Ala Gly Trp Arg Lys Ser Gly Ile Gly Gly Ala Asp Gly Lys His
450 455 460
Gly Leu His Glu Tyr Leu Gln Thr Gln Val Val Tyr Leu Gln Ser
465 470 475
<210> 55
<211> 512
<212> PRT
<213> Escherichia coli
<400> 55
Met Thr Asn Asn Pro Pro Ser Ala Gln Ile Lys Pro Gly Glu Tyr Gly
1 5 10 15
Phe Pro Leu Lys Leu Lys Ala Arg Tyr Asp Asn Phe Ile Gly Gly Glu
20 25 30
Trp Val Ala Pro Ala Asp Gly Glu Tyr Tyr Gln Asn Leu Thr Pro Val
35 40 45
Thr Gly Gln Leu Leu Cys Glu Val Ala Ser Ser Gly Lys Arg Asp Ile
50 55 60
Asp Leu Ala Leu Asp Ala Ala His Lys Val Lys Asp Lys Trp Ala His
65 70 75 80
Thr Ser Val Gln Asp Arg Ala Ala Ile Leu Phe Lys Ile Ala Asp Arg
85 90 95
Met Glu Gln Asn Leu Glu Leu Leu Ala Thr Ala Glu Thr Trp Asp Asn
100 105 110
Gly Lys Pro Ile Arg Glu Thr Ser Ala Ala Asp Val Pro Leu Ala Ile
115 120 125
Asp His Phe Arg Tyr Phe Ala Ser Cys Ile Arg Ala Gln Glu Gly Gly
130 135 140
Ile Ser Glu Val Asp Ser Glu Thr Val Ala Tyr His Phe His Glu Pro
145 150 155 160
Leu Gly Val Val Gly Gln Ile Ile Pro Trp Asn Phe Pro Leu Leu Met
165 170 175
Ala Ser Trp Lys Met Ala Pro Ala Leu Ala Ala Gly Asn Cys Val Val
180 185 190
Leu Lys Pro Ala Arg Leu Thr Pro Leu Ser Val Leu Leu Leu Met Glu
195 200 205
Ile Val Gly Asp Leu Leu Pro Pro Gly Val Val Asn Val Val Asn Gly
210 215 220
Ala Gly Gly Val Ile Gly Glu Tyr Leu Ala Thr Ser Lys Arg Ile Ala
225 230 235 240
Lys Val Ala Phe Thr Gly Ser Thr Glu Val Gly Gln Gln Ile Met Gln
245 250 255
Tyr Ala Thr Gln Asn Ile Ile Pro Val Thr Leu Glu Leu Gly Gly Lys
260 265 270
Ser Pro Asn Ile Phe Phe Ala Asp Val Met Asp Glu Glu Asp Ala Phe
275 280 285
Phe Asp Lys Ala Leu Glu Gly Phe Ala Leu Phe Ala Phe Asn Gln Gly
290 295 300
Glu Val Cys Thr Cys Pro Ser Arg Ala Leu Val Gln Glu Ser Ile Tyr
305 310 315 320
Glu Arg Phe Met Glu Arg Ala Ile Arg Arg Val Glu Ser Ile Arg Ser
325 330 335
Gly Asn Pro Leu Asp Ser Val Thr Gln Met Gly Ala Gln Val Ser His
340 345 350
Gly Gln Leu Glu Thr Ile Leu Asn Tyr Ile Asp Ile Gly Lys Lys Glu
355 360 365
Gly Ala Asp Val Leu Thr Gly Gly Arg Arg Lys Leu Leu Glu Gly Glu
370 375 380
Leu Lys Asp Gly Tyr Tyr Leu Glu Pro Thr Ile Leu Phe Gly Gln Asn
385 390 395 400
Asn Met Arg Val Phe Gln Glu Glu Ile Phe Gly Pro Val Leu Ala Val
405 410 415
Thr Thr Phe Lys Thr Met Glu Glu Ala Leu Glu Leu Ala Asn Asp Thr
420 425 430
Gln Tyr Gly Leu Gly Ala Gly Val Trp Ser Arg Asn Gly Asn Leu Ala
435 440 445
Tyr Lys Met Gly Arg Gly Ile Gln Ala Gly Arg Val Trp Thr Asn Cys
450 455 460
Tyr His Ala Tyr Pro Ala His Ala Ala Phe Gly Gly Tyr Lys Gln Ser
465 470 475 480
Gly Ile Gly Arg Glu Thr His Lys Met Met Leu Glu His Tyr Gln Gln
485 490 495
Thr Lys Cys Leu Leu Val Ser Tyr Ser Asp Lys Pro Leu Gly Leu Phe
500 505 510
<210> 56
<211> 463
<212> PRT
<213> Methanocaldococcus jannaschii
<400> 56
Met Phe Ile Asp Gly Lys Trp Ile Asn Arg Glu Asp Met Asp Val Ile
1 5 10 15
Asn Pro Tyr Ser Leu Glu Val Ile Lys Lys Ile Pro Ala Leu Ser Arg
20 25 30
Glu Glu Ala Lys Glu Ala Ile Asp Thr Ala Glu Lys Tyr Lys Glu Val
35 40 45
Met Lys Asn Leu Pro Ile Thr Lys Arg Tyr Asn Ile Leu Met Asn Ile
50 55 60
Ala Lys Gln Ile Lys Glu Lys Lys Glu Glu Leu Ala Lys Ile Leu Ala
65 70 75 80
Ile Asp Ala Gly Lys Pro Ile Lys Gln Ala Arg Val Glu Val Glu Arg
85 90 95
Ser Ile Gly Thr Phe Lys Leu Ala Ala Phe Tyr Val Lys Glu His Arg
100 105 110
Asp Glu Val Ile Pro Ser Asp Asp Arg Leu Ile Phe Thr Arg Arg Glu
115 120 125
Pro Val Gly Ile Val Gly Ala Ile Thr Pro Phe Asn Phe Pro Leu Asn
130 135 140
Leu Ser Ala His Lys Ile Ala Pro Ala Ile Ala Thr Gly Asn Val Ile
145 150 155 160
Val His His Pro Ser Ser Lys Ala Pro Leu Val Cys Ile Glu Leu Ala
165 170 175
Lys Ile Ile Glu Asn Ala Leu Lys Lys Tyr Asn Val Pro Leu Gly Val
180 185 190
Tyr Asn Leu Leu Thr Gly Ala Gly Glu Val Val Gly Asp Glu Ile Val
195 200 205
Val Asn Glu Lys Val Asn Met Ile Ser Phe Thr Gly Ser Ser Lys Val
210 215 220
Gly Glu Leu Ile Thr Lys Lys Ala Gly Phe Lys Lys Ile Ala Leu Glu
225 230 235 240
Leu Gly Gly Val Asn Pro Asn Ile Val Leu Lys Asp Ala Asp Leu Asn
245 250 255
Lys Ala Val Asn Ala Leu Ile Lys Gly Ser Phe Ile Tyr Ala Gly Gln
260 265 270
Val Cys Ile Ser Val Gly Met Ile Leu Val Asp Glu Ser Ile Ala Asp
275 280 285
Lys Phe Ile Glu Met Phe Val Asn Lys Ala Lys Val Leu Asn Val Gly
290 295 300
Asn Pro Leu Asp Glu Lys Thr Asp Val Gly Pro Leu Ile Ser Val Glu
305 310 315 320
His Ala Glu Trp Val Glu Lys Val Val Glu Lys Ala Ile Asp Glu Gly
325 330 335
Gly Lys Leu Leu Leu Gly Gly Lys Arg Asp Lys Ala Leu Phe Tyr Pro
340 345 350
Thr Ile Leu Glu Val Asp Arg Asp Asn Ile Leu Cys Lys Thr Glu Thr
355 360 365
Phe Ala Pro Val Ile Pro Ile Ile Arg Thr Asn Glu Glu Glu Met Ile
370 375 380
Asp Ile Ala Asn Ser Thr Glu Tyr Gly Leu His Ser Ala Ile Phe Thr
385 390 395 400
Asn Asp Ile Asn Lys Ser Leu Lys Phe Ala Glu Asn Leu Glu Phe Gly
405 410 415
Gly Val Val Ile Asn Asp Ser Ser Leu Phe Arg Gln Asp Asn Met Pro
420 425 430
Phe Gly Gly Val Lys Lys Ser Gly Leu Gly Arg Glu Gly Val Lys Tyr
435 440 445
Ala Met Glu Glu Met Ser Asn Ile Lys Thr Ile Ile Ile Ser Lys
450 455 460
<210> 57
<211> 488
<212> PRT
<213> Bacillus licheniformis
<400> 57
Met Ser Val Ala Ala Glu Ser Lys Thr Tyr Phe Asn Phe Ile Asn Gly
1 5 10 15
Arg Trp Val Lys Ala Glu Ser Gly Gly Met Glu Gln Ser Leu Asn Pro
20 25 30
Ala Asp Thr Arg Asp Ile Val Gly Leu Val Gln Lys Ser Ser Ile Glu
35 40 45
Asp Val Asp Arg Ala Val Glu Ala Ala Lys Gln Ala Lys Lys Ala Trp
50 55 60
Arg Lys Leu Ser Gly Ala Glu Arg Gly Gln Phe Leu Tyr Lys Ala Ala
65 70 75 80
Asp Ile Met Glu Gln Arg Leu Asp Glu Ile Ala Glu Cys Ala Thr Arg
85 90 95
Glu Met Gly Lys Thr Leu Pro Glu Ala Lys Gly Glu Thr Ala Arg Gly
100 105 110
Ile Ala Ile Leu Arg Tyr Tyr Ala Gly Glu Gly Leu Arg Lys Thr Gly
115 120 125
Asp Val Ile Pro Ser Thr Asp Ser Ser Ala Phe Met Tyr Thr Asp Arg
130 135 140
Val Pro Leu Gly Val Val Gly Val Ile Ser Pro Trp Asn Phe Pro Val
145 150 155 160
Ala Ile Pro Ile Trp Lys Met Ala Pro Ala Leu Ile Tyr Gly Asn Thr
165 170 175
Val Val Ile Lys Pro Ala Thr Glu Thr Ala Val Thr Cys Leu Lys Val
180 185 190
Ile Ser Cys Phe Glu Glu Ala Gly Ile Pro Ser Gly Val Val Asn Ala
195 200 205
Val Thr Gly Pro Gly Ser Ser Ala Gly Gln Arg Leu Ala Glu His Pro
210 215 220
Asp Val Asn Gly Ile Thr Phe Thr Gly Ser Asn Gln Thr Gly Lys Ile
225 230 235 240
Ile Gly Arg Thr Ala Phe Glu Arg Gly Ala Lys Tyr Gln Leu Glu Met
245 250 255
Gly Gly Lys Asn Pro Val Ile Val Ala Asp Asp Ala Asp Leu Asp Ile
260 265 270
Ala Val Glu Ala Val Ile Ser Gly Ala Phe Arg Ser Thr Gly Gln Lys
275 280 285
Cys Thr Ala Thr Ser Arg Val Ile Val Leu Asn Gly Val Tyr Asp Arg
290 295 300
Phe Lys Glu Lys Leu Leu Gln Gln Thr Lys Glu Ile Thr Ile Gly Asp
305 310 315 320
Ser Leu Lys Glu Asp Val Trp Met Gly Pro Ile Ala Asn Lys Gln Gln
325 330 335
Leu Asp Asn Cys Leu Ser Tyr Ile Ala Lys Gly Lys Gln Glu Gly Ala
340 345 350
Asp Leu Ile Phe Gly Gly Glu Arg Leu Ala Asp Gly Lys Tyr Glu Asn
355 360 365
Gly Tyr Tyr Ile Arg Pro Ala Ile Phe Asp Asn Val Thr Ser Gly Met
370 375 380
Thr Ile Ala Gln Glu Glu Ile Phe Gly Pro Val Ile Ala Leu Ile Lys
385 390 395 400
Ala Asp Thr Leu Glu Glu Ala Leu Glu Thr Ala Asn Asp Val Lys Phe
405 410 415
Gly Leu Ser Ala Ser Ile Phe Thr Gln Asn Ile Arg Arg Met Leu Ser
420 425 430
Phe Thr Asp Glu Ile Glu Ala Gly Leu Ile Arg Val Asn Ala Glu Ser
435 440 445
Ala Gly Val Glu Leu Gln Ala Pro Phe Gly Gly Val Lys Gln Ser Ser
450 455 460
Ser His Ser Arg Glu Gln Gly Glu Ala Ala Lys Glu Phe Phe Thr Ala
465 470 475 480
Val Lys Thr Val Phe Val Lys Pro
485
<210> 58
<211> 491
<212> PRT
<213> Pseudomonas sp.
<400> 58
Met Lys Gln Tyr Arg Asn Phe Val Asp Gly Lys Trp Val Glu Ser Ser
1 5 10 15
Lys Thr Phe Gln Asp Val Thr Pro Ile Asp Gly Ser Val Val Ala Val
20 25 30
Val His Glu Ala Asp Arg Asp Leu Val Asp Ala Ala Val Lys Ala Gly
35 40 45
His Arg Ala Leu Glu Gly Glu Trp Gly Arg Thr Thr Ala Ala Gln Arg
50 55 60
Val Asp Trp Leu Arg Arg Ile Ala Asn Glu Met Glu Arg Arg Gln Gln
65 70 75 80
Asp Phe Leu Asp Ala Glu Met Ala Asp Thr Gly Lys Pro Leu Ser Met
85 90 95
Ala Ala Thr Ile Asp Ile Pro Arg Gly Ile Ala Asn Phe Arg Asn Phe
100 105 110
Ala Asp Ile Leu Ala Thr Ala Pro Val Asp Ser His Arg Leu Asp Leu
115 120 125
Pro Asp Gly Ala Tyr Ala Leu Asn Tyr Ala Ala Arg Lys Pro Leu Gly
130 135 140
Val Val Gly Val Ile Ser Pro Trp Asn Leu Pro Leu Leu Leu Leu Thr
145 150 155 160
Trp Lys Val Ala Pro Ala Leu Ala Cys Gly Asn Ala Val Val Val Lys
165 170 175
Pro Ser Glu Asp Thr Pro Gly Thr Ala Thr Leu Leu Ala Glu Val Met
180 185 190
Glu Ala Val Gly Ile Pro Pro Gly Val Phe Asn Leu Val His Gly Phe
195 200 205
Gly Pro Asn Ser Ala Gly Glu Phe Ile Ser Gln His Pro Asp Ile Ser
210 215 220
Ala Ile Thr Phe Thr Gly Glu Ser Lys Thr Gly Ser Thr Ile Met Arg
225 230 235 240
Ala Ala Ala Glu Gly Val Lys Pro Val Ser Phe Glu Leu Gly Gly Lys
245 250 255
Asn Ala Ala Val Ile Phe Ala Asp Cys Asp Phe Glu Lys Met Leu Asp
260 265 270
Gly Met Met Arg Ala Leu Phe Leu Asn Ser Gly Gln Val Cys Leu Cys
275 280 285
Ser Glu Arg Val Tyr Val Glu Arg Pro Ile Phe Asp Arg Phe Cys Val
290 295 300
Ala Leu Ala Glu Arg Ile Lys Ala Leu Lys Val Asp Trp Pro His Glu
305 310 315 320
Thr Asp Thr Gln Met Gly Pro Leu Ile Ser Ser Lys His Arg Asp Lys
325 330 335
Val Leu Ser Tyr Phe Glu Leu Ala Arg Gln Glu Gly Ala Thr Phe Leu
340 345 350
Ala Gly Gly Gly Val Pro Arg Phe Gly Asp Glu Arg Asp Asn Gly Ala
355 360 365
Trp Val Glu Pro Thr Val Ile Ala Gly Leu Ser Asp Asp Ala Arg Val
370 375 380
Val Arg Glu Glu Ile Phe Gly Pro Ile Cys His Val Thr Pro Phe Asp
385 390 395 400
Ser Glu Ser Glu Val Ile Arg Arg Ala Asn Asp Thr Arg Tyr Gly Leu
405 410 415
Ala Ala Thr Ile Trp Thr Thr Asn Leu Ser Arg Ala His Arg Val Ser
420 425 430
Glu Leu Met Arg Val Gly Ile Ser Trp Val Asn Thr Trp Phe Leu Arg
435 440 445
Asp Leu Arg Thr Pro Phe Gly Gly Ala Gly Leu Ser Gly Ile Gly Arg
450 455 460
Glu Gly Gly Met His Ser Leu Asn Phe Tyr Ser Glu Leu Thr Asn Val
465 470 475 480
Cys Val Arg Ile Asp Lys Glu Ser Pro Asp Val
485 490
<210> 59
<211> 499
<212> PRT
<213> Escherichia coli
<400> 59
Met Thr Glu Pro His Val Ala Val Leu Ser Gln Val Gln Gln Phe Leu
1 5 10 15
Asp Arg Gln His Gly Leu Tyr Ile Asp Gly Arg Pro Gly Pro Ala Gln
20 25 30
Ser Glu Lys Arg Leu Ala Ile Phe Asp Pro Ala Thr Gly Gln Glu Ile
35 40 45
Ala Ser Thr Ala Asp Ala Asn Glu Ala Asp Val Asp Asn Ala Val Met
50 55 60
Ser Ala Trp Arg Ala Phe Val Ser Arg Arg Trp Ala Gly Arg Leu Pro
65 70 75 80
Ala Glu Arg Glu Arg Ile Leu Leu Arg Phe Ala Asp Leu Val Glu Gln
85 90 95
His Ser Glu Glu Leu Ala Gln Leu Glu Thr Leu Glu Gln Gly Lys Ser
100 105 110
Ile Ala Ile Ser Arg Ala Phe Glu Val Gly Cys Thr Leu Asn Trp Met
115 120 125
Arg Tyr Thr Ala Gly Leu Thr Thr Lys Ile Ala Gly Lys Thr Leu Asp
130 135 140
Leu Ser Ile Pro Leu Pro Gln Gly Ala Arg Tyr Gln Ala Trp Thr Arg
145 150 155 160
Lys Glu Pro Val Gly Val Val Ala Gly Ile Val Pro Trp Asn Phe Pro
165 170 175
Leu Met Ile Gly Met Trp Lys Val Met Pro Ala Leu Ala Ala Gly Cys
180 185 190
Ser Ile Val Ile Lys Pro Ser Glu Thr Thr Pro Leu Thr Met Leu Arg
195 200 205
Val Ala Glu Leu Ala Ser Glu Ala Gly Ile Pro Asp Gly Val Phe Asn
210 215 220
Val Val Thr Gly Ser Gly Ala Val Cys Gly Ala Ala Leu Thr Ser His
225 230 235 240
Pro His Val Ala Lys Ile Ser Phe Thr Gly Ser Thr Ala Thr Gly Lys
245 250 255
Gly Ile Ala Arg Thr Ala Ala Asp His Leu Thr Arg Val Thr Leu Glu
260 265 270
Leu Gly Gly Lys Asn Pro Ala Ile Val Leu Lys Asp Ala Asp Pro Gln
275 280 285
Trp Val Ile Glu Gly Leu Met Thr Gly Ser Phe Leu Asn Gln Gly Gln
290 295 300
Val Cys Ala Ala Ser Ser Arg Ile Tyr Ile Glu Ala Pro Leu Phe Asp
305 310 315 320
Thr Leu Val Ser Gly Phe Glu Gln Ala Val Lys Ser Leu Gln Val Gly
325 330 335
Pro Gly Met Ser Pro Val Ala Gln Ile Asn Pro Leu Val Ser Arg Ala
340 345 350
His Cys Asp Lys Val Cys Ser Phe Leu Asp Asp Ala Gln Ala Gln Gln
355 360 365
Ala Glu Leu Ile Arg Gly Ser Asn Gly Pro Ala Gly Glu Gly Tyr Tyr
370 375 380
Val Ala Pro Thr Leu Val Val Asn Pro Asp Ala Lys Leu Arg Leu Thr
385 390 395 400
Arg Glu Glu Val Phe Gly Pro Val Val Asn Leu Val Arg Val Ala Asp
405 410 415
Gly Glu Glu Ala Leu Gln Leu Ala Asn Asp Thr Glu Tyr Gly Leu Thr
420 425 430
Ala Ser Val Trp Thr Gln Asn Leu Ser Gln Ala Leu Glu Tyr Ser Asp
435 440 445
Arg Leu Gln Ala Gly Thr Val Trp Val Asn Ser His Thr Leu Ile Asp
450 455 460
Ala Asn Leu Pro Phe Gly Gly Met Lys Gln Ser Gly Thr Gly Arg Asp
465 470 475 480
Phe Gly Pro Asp Trp Leu Asp Gly Trp Cys Glu Thr Lys Ser Val Cys
485 490 495
Val Arg Tyr
<210> 60
<211> 492
<212> PRT
<213> Escherichia coli
<400> 60
Met Thr Leu Trp Ile Asn Gly Asp Trp Ile Thr Gly Gln Gly Ala Ser
1 5 10 15
Arg Val Lys Arg Asn Pro Val Ser Gly Glu Val Leu Trp Gln Gly Asn
20 25 30
Asp Ala Asp Ala Ala Gln Val Glu Gln Ala Cys Arg Ala Ala Arg Ala
35 40 45
Ala Phe Pro Arg Trp Ala Arg Leu Ser Phe Ala Glu Arg His Ala Val
50 55 60
Val Glu Arg Phe Ala Ala Leu Leu Glu Ser Asn Lys Ala Glu Leu Thr
65 70 75 80
Ala Ile Ile Ala Arg Glu Thr Gly Lys Pro Arg Trp Glu Ala Ala Thr
85 90 95
Glu Val Thr Ala Met Ile Asn Lys Ile Ala Ile Ser Ile Lys Ala Tyr
100 105 110
His Val Arg Thr Gly Glu Gln Arg Ser Glu Met Pro Asp Gly Ala Ala
115 120 125
Ser Leu Arg His Arg Pro His Gly Val Leu Ala Val Phe Gly Pro Tyr
130 135 140
Asn Phe Pro Gly His Leu Pro Asn Gly His Ile Val Pro Ala Leu Leu
145 150 155 160
Ala Gly Asn Thr Ile Ile Phe Lys Pro Ser Glu Leu Thr Pro Trp Ser
165 170 175
Gly Glu Ala Val Met Arg Leu Trp Gln Gln Ala Gly Leu Pro Pro Gly
180 185 190
Val Leu Asn Leu Val Gln Gly Gly Arg Glu Thr Gly Gln Ala Leu Ser
195 200 205
Ala Leu Glu Asp Leu Asp Gly Leu Leu Phe Thr Gly Ser Ala Asn Thr
210 215 220
Gly Tyr Gln Leu His Arg Gln Leu Ser Gly Gln Pro Glu Lys Ile Leu
225 230 235 240
Ala Leu Glu Met Gly Gly Asn Asn Pro Leu Ile Ile Asp Glu Val Ala
245 250 255
Asp Ile Asp Ala Ala Val His Leu Thr Ile Gln Ser Ala Phe Val Thr
260 265 270
Ala Gly Gln Arg Cys Thr Cys Ala Arg Arg Leu Leu Leu Lys Ser Gly
275 280 285
Ala Gln Gly Asp Ala Phe Leu Ala Arg Leu Val Ala Val Ser Gln Arg
290 295 300
Leu Thr Pro Gly Asn Trp Asp Asp Glu Pro Gln Pro Phe Ile Gly Gly
305 310 315 320
Leu Ile Ser Glu Gln Ala Ala Gln Gln Val Val Thr Ala Trp Gln Gln
325 330 335
Leu Glu Ala Met Gly Gly Arg Pro Leu Leu Ala Pro Arg Leu Leu Gln
340 345 350
Ala Gly Thr Ser Leu Leu Thr Pro Gly Ile Ile Glu Met Thr Gly Val
355 360 365
Ala Gly Val Pro Asp Glu Glu Val Phe Gly Pro Leu Leu Arg Val Trp
370 375 380
Arg Tyr Asp Thr Phe Asp Glu Ala Ile Arg Met Ala Asn Asn Thr Arg
385 390 395 400
Phe Gly Leu Ser Cys Gly Leu Val Ser Pro Glu Arg Glu Lys Phe Asp
405 410 415
Gln Leu Leu Leu Glu Ala Arg Ala Gly Ile Val Asn Trp Asn Lys Pro
420 425 430
Leu Thr Gly Ala Ala Ser Thr Ala Pro Phe Gly Gly Ile Gly Ala Ser
435 440 445
Gly Asn His Arg Pro Ser Ala Trp Tyr Ala Ala Asp Tyr Cys Ala Trp
450 455 460
Pro Met Ala Ser Leu Glu Ser Asp Ser Leu Thr Leu Pro Ala Thr Leu
465 470 475 480
Asn Pro Gly Leu Asp Phe Ser Asp Glu Val Val Arg
485 490
<210> 61
<211> 1358
<212> PRT
<213> Zea mays (maize)
<400> 61
Met Gly Lys Glu Ala Gly Ala Ala Glu Ser Ser Thr Val Val Leu Ala
1 5 10 15
Val Asn Gly Lys Arg Tyr Glu Ala Ala Gly Val Ala Pro Ser Thr Ser
20 25 30
Leu Leu Glu Phe Leu Arg Thr Gln Thr Pro Val Arg Gly Pro Lys Leu
35 40 45
Gly Cys Gly Glu Gly Gly Cys Gly Ala Cys Val Val Leu Val Ser Lys
50 55 60
Tyr Asp Pro Ala Thr Asp Glu Val Thr Glu Phe Ser Ala Ser Ser Cys
65 70 75 80
Leu Thr Leu Leu His Ser Val Asp Arg Cys Ser Val Thr Thr Ser Glu
85 90 95
Gly Ile Gly Asn Thr Arg Asp Gly Tyr His Pro Val Gln Gln Arg Leu
100 105 110
Ser Gly Phe His Ala Ser Gln Cys Gly Phe Cys Thr Pro Gly Met Cys
115 120 125
Met Ser Ile Phe Ser Ala Leu Val Lys Ala Asp Asn Lys Ser Asp Arg
130 135 140
Pro Asp Pro Pro Ala Gly Phe Ser Lys Ile Thr Thr Ser Glu Ala Glu
145 150 155 160
Lys Ala Val Ser Gly Asn Leu Cys Arg Cys Thr Gly Tyr Arg Pro Ile
165 170 175
Val Asp Thr Cys Lys Ser Phe Ala Ser Asp Val Asp Leu Glu Asp Leu
180 185 190
Gly Leu Asn Cys Phe Trp Lys Lys Gly Glu Glu Pro Ala Glu Val Ser
195 200 205
Arg Leu Pro Gly Tyr Asn Ser Gly Ala Val Cys Thr Phe Pro Glu Phe
210 215 220
Leu Lys Ser Glu Ile Lys Ser Thr Met Lys Gln Val Asn Asp Val Pro
225 230 235 240
Ile Ala Ala Ser Gly Asp Gly Trp Tyr His Pro Lys Ser Ile Glu Glu
245 250 255
Leu His Arg Leu Phe Asp Ser Ser Trp Phe Asp Asp Ser Ser Val Lys
260 265 270
Ile Val Ala Ser Asn Thr Gly Ser Gly Val Tyr Lys Asp Gln Asp Leu
275 280 285
Tyr Asp Lys Tyr Ile Asp Ile Lys Gly Ile Pro Glu Leu Ser Val Ile
290 295 300
Asn Lys Asn Asp Lys Ala Ile Glu Leu Gly Ser Val Val Ser Ile Ser
305 310 315 320
Lys Ala Ile Glu Val Leu Ser Asp Gly Asn Leu Val Phe Arg Lys Ile
325 330 335
Ala Asp His Leu Asn Lys Val Ala Ser Pro Phe Val Arg Asn Thr Ala
340 345 350
Thr Ile Gly Gly Asn Ile Met Met Ala Gln Arg Leu Pro Phe Glu Ser
355 360 365
Asp Val Ala Thr Val Leu Leu Ala Ala Gly Ser Thr Val Thr Val Gln
370 375 380
Val Ala Ser Lys Arg Leu Cys Phe Thr Leu Glu Glu Phe Leu Glu Gln
385 390 395 400
Pro Pro Cys Asp Ser Arg Thr Leu Leu Leu Ser Ile Phe Ile Pro Glu
405 410 415
Trp Gly Ser Asp Tyr Val Thr Phe Glu Thr Phe Arg Ala Ala Pro Arg
420 425 430
Pro Phe Gly Asn Ala Val Ser Tyr Val Asn Ser Ala Phe Leu Ala Arg
435 440 445
Thr Ser Gly Ser Leu Leu Ile Glu Asp Ile Cys Leu Ala Phe Gly Ala
450 455 460
Tyr Gly Val Asp His Ala Ile Arg Ala Lys Lys Val Glu Asp Phe Leu
465 470 475 480
Lys Gly Lys Ser Leu Ser Ser Phe Val Ile Leu Glu Ala Ile Lys Leu
485 490 495
Leu Lys Asp Thr Val Ser Pro Ser Glu Gly Thr Thr His His Glu Tyr
500 505 510
Arg Val Ser Leu Ala Val Ser Phe Leu Phe Ser Phe Leu Ser Ser Leu
515 520 525
Ala Asn Ser Ser Ser Ala Pro Ser Asn Ile Asp Thr Pro Asn Gly Ser
530 535 540
Tyr Thr His Glu Thr Gly Ser Asn Val Asp Ser Pro Glu Arg His Ile
545 550 555 560
Lys Val Asp Ser Asn Asp Leu Pro Ile Arg Ser Arg Gln Glu Met Val
565 570 575
Phe Ser Asp Glu Tyr Lys Pro Val Gly Lys Pro Ile Lys Lys Val Gly
580 585 590
Ala Glu Ile Gln Ala Ser Gly Glu Ala Val Tyr Val Asp Asp Ile Pro
595 600 605
Ala Pro Lys Asp Cys Leu Tyr Gly Ala Phe Ile Tyr Ser Thr His Pro
610 615 620
His Ala His Val Arg Ser Ile Asn Phe Lys Ser Ser Leu Ala Ser Gln
625 630 635 640
Lys Val Ile Thr Val Ile Thr Ala Lys Asp Ile Pro Ser Gly Gly Glu
645 650 655
Asn Ile Gly Ser Ser Phe Leu Met Gln Gly Glu Ala Leu Phe Ala Asp
660 665 670
Pro Ile Ala Glu Phe Ala Gly Gln Asn Ile Gly Val Val Ile Ala Glu
675 680 685
Thr Gln Arg Tyr Ala Asn Met Ala Ala Lys Gln Ala Val Val Glu Tyr
690 695 700
Ser Thr Glu Asn Leu Gln Pro Pro Ile Leu Thr Ile Glu Asp Ala Ile
705 710 715 720
Gln Arg Asn Ser Tyr Ile Gln Ile Pro Pro Phe Leu Ala Pro Lys Pro
725 730 735
Val Gly Asp Tyr Asn Lys Gly Met Ala Glu Ala Asp His Lys Ile Leu
740 745 750
Ser Ala Glu Val Lys Leu Glu Ser Gln Tyr Tyr Phe Tyr Met Glu Thr
755 760 765
Gln Ala Ala Leu Ala Ile Pro Asp Glu Asp Asn Cys Ile Thr Ile Tyr
770 775 780
Ser Ser Thr Gln Met Pro Glu Leu Thr Gln Asn Leu Ile Ala Arg Cys
785 790 795 800
Leu Gly Ile Pro Phe His Asn Val Arg Val Ile Ser Arg Arg Val Gly
805 810 815
Gly Gly Phe Gly Gly Lys Ala Met Lys Ala Thr His Thr Ala Cys Ala
820 825 830
Cys Ala Leu Ala Ala Phe Lys Leu Arg Arg Pro Val Arg Met Tyr Leu
835 840 845
Asp Arg Lys Thr Asp Met Ile Met Ala Gly Gly Arg His Pro Met Lys
850 855 860
Ala Lys Tyr Ser Val Gly Phe Lys Ser Asp Gly Lys Ile Thr Ala Leu
865 870 875 880
His Leu Asp Leu Gly Ile Asn Ala Gly Ile Ser Pro Asp Val Ser Pro
885 890 895
Leu Met Pro Arg Ala Ile Ile Gly Ala Leu Lys Lys Tyr Asn Trp Gly
900 905 910
Thr Leu Glu Phe Asp Thr Lys Val Cys Lys Thr Asn Val Ser Ser Lys
915 920 925
Ser Ala Met Arg Ala Pro Gly Asp Val Gln Gly Ser Phe Ile Ala Glu
930 935 940
Ala Ile Ile Glu His Val Ala Ser Ala Leu Ala Leu Asp Thr Asn Thr
945 950 955 960
Val Arg Arg Lys Asn Leu His Asp Phe Glu Ser Leu Glu Val Phe Tyr
965 970 975
Gly Glu Ser Ala Gly Glu Ala Ser Thr Tyr Ser Leu Val Ser Met Phe
980 985 990
Asp Lys Leu Ala Leu Ser Pro Glu Tyr Gln His Arg Ala Ala Met Ile
995 1000 1005
Glu Gln Phe Asn Ser Ser Asn Lys Trp Lys Lys Arg Gly Ile Ser
1010 1015 1020
Cys Val Pro Ala Thr Tyr Glu Val Asn Leu Arg Pro Thr Pro Gly
1025 1030 1035
Lys Val Ser Ile Met Asn Asp Gly Ser Ile Ala Val Glu Val Gly
1040 1045 1050
Gly Ile Glu Ile Gly Gln Gly Leu Trp Thr Lys Val Lys Gln Met
1055 1060 1065
Thr Ala Phe Gly Leu Gly Gln Leu Cys Pro Asp Gly Gly Glu Cys
1070 1075 1080
Leu Leu Asp Lys Val Arg Val Ile Gln Ala Asp Thr Leu Ser Leu
1085 1090 1095
Ile Gln Gly Gly Met Thr Ala Gly Ser Thr Thr Ser Glu Thr Ser
1100 1105 1110
Cys Glu Thr Val Arg Gln Ser Cys Val Ala Leu Val Glu Lys Leu
1115 1120 1125
Asn Pro Ile Lys Glu Ser Leu Glu Ala Lys Ser Asn Thr Val Glu
1130 1135 1140
Trp Ser Ala Leu Ile Ala Gln Ala Ser Met Ala Ser Val Asn Leu
1145 1150 1155
Ser Ala Gln Pro Tyr Trp Thr Pro Asp Pro Ser Phe Lys Ser Tyr
1160 1165 1170
Leu Asn Tyr Gly Ala Gly Thr Ser Glu Val Glu Val Asp Ile Leu
1175 1180 1185
Thr Gly Ala Thr Thr Ile Leu Arg Ser Asp Leu Val Tyr Asp Cys
1190 1195 1200
Gly Gln Ser Leu Asn Pro Ala Val Asp Leu Gly Gln Ile Glu Gly
1205 1210 1215
Cys Phe Val Gln Gly Ile Gly Phe Phe Thr Asn Glu Asp Tyr Lys
1220 1225 1230
Thr Asn Ser Asp Gly Leu Val Ile His Asp Gly Thr Trp Thr Tyr
1235 1240 1245
Lys Ile Pro Thr Val Asp Asn Ile Pro Lys Glu Phe Asn Val Glu
1250 1255 1260
Met Phe Asn Ser Ala Pro Asp Lys Lys Arg Val Leu Ser Ser Lys
1265 1270 1275
Ala Ser Gly Glu Pro Pro Leu Val Leu Ala Thr Ser Val His Cys
1280 1285 1290
Ala Met Arg Glu Ala Ile Arg Ala Ala Arg Lys Glu Phe Ser Val
1295 1300 1305
Ser Thr Ser Pro Ala Lys Ser Ala Val Thr Phe Gln Met Asp Val
1310 1315 1320
Pro Ala Thr Met Pro Val Val Lys Glu Leu Cys Gly Leu Asp Val
1325 1330 1335
Val Glu Arg Tyr Leu Glu Asn Val Ser Ala Ala Ser Ala Gly Pro
1340 1345 1350
Asn Thr Ala Lys Ala
1355
<210> 62
<211> 747
<212> DNA
<213> Caulobacter crescentus
<400> 62
atgtcctcag ccatctatcc cagcctgaag ggcaagcgcg tcgtcatcac cggcggcggc 60
tcgggcatcg gggccggcct caccgccggc ttcgcccgtc agggcgcgga ggtgatcttc 120
ctcgacatcg ccgacgagga ctccagggct cttgaggccg agctggccgg ctcgccgatc 180
ccgccggtct acaagcgctg cgacctgatg aacctcgagg cgatcaaggc ggtcttcgcc 240
gagatcggcg acgtcgacgt gctggtcaac aacgccggca atgacgaccg ccacaagctg 300
gccgacgtga ccggcgccta ttgggacgag cggatcaacg tcaacctgcg ccacatgctg 360
ttctgcaccc aggccgtcgc gccgggcatg aagaagcgtg gcggcggggc ggtgatcaac 420
ttcggttcga tcagctggca cctggggctt gaggacctcg tcctctacga aaccgccaag 480
gccggcatcg aaggcatgac ccgcgcgctg gcccgggagc tgggtcccga cgacatccgc 540
gtcacctgcg tggtgccggg caacgtcaag accaagcgcc aggagaagtg gtacacgccc 600
gaaggcgagg cccagatcgt ggcggcccaa tgcctgaagg gccgcatcgt cccggagaac 660
gtcgccgcgc tggtgctgtt cctggcctcg gatgacgcgt cgctctgcac cggccacgaa 720
tactggatcg acgccggctg gcgttga 747
<210> 63
<211> 870
<212> DNA
<213> Caulobacter crescentus
<400> 63
atgaccgctc aagtcacttg cgtatgggat ctgaaggcca cgttgggcga aggcccgatc 60
tggcatggcg acaccctgtg gttcgtcgac atcaagcagc gtaaaatcca caactaccac 120
cccgccaccg gcgagcgctt cagcttcgac gcgccggatc aggtgacctt cctcgcgccg 180
atcgtcggcg cgaccggctt tgtcgtcggt ctgaagaccg ggattcaccg cttccacccg 240
gccacgggct tcagcctgct gctcgaggtc gaggacgcgg cgctgaacaa ccgccccaac 300
gacgccacgg tcgacgcgca aggccgtctg tggttcggca ccatgcacga cggggaagag 360
aacaatagcg gctcgctcta tcggatggac ctcaccggcg tcgcccggat ggaccgcgac 420
atctgcatca ccaacggccc gtgcgtctcg cccgacggca agaccttcta ccacaccgac 480
accctggaaa agacgatcta cgccttcgac ctggccgagg acggcctgct gtcgaacaag 540
cgcgtcttcg tgcagttcgc cctgggcgac gatgtctatc cggacggttc ggtcgtcgat 600
tccgaaggct atctgtggac cgccctgtgg ggcggtttcg gcgcggtccg cttctcgccg 660
caaggcgacg ccgtgacgcg catcgaactg cccgccccca acgtcaccaa gccctgcttc 720
ggcgggcctg acctgaagac cctctatttc accaccgccc gcaagggcct gagcgacgag 780
accctggccc agtacccgct ggccggcggt gtgttcgccg ttccggtcga tgtggccggc 840
caaccccagc atgaggtccg ccttgtctaa 870
<210> 64
<211> 1968
<212> DNA
<213> Escherichia coli
<400> 64
atgtctgttc gcaatatttt tgctgacgag agccacgata tttacaccgt cagaacgcac 60
gccgatggcc cggacggcga actcccatta accgcagaga tgcttatcaa ccgcccgagc 120
ggggatctgt tcggtatgac catgaatgcc ggaatgggtt ggtctccgga cgagctggat 180
cgggacggta ttttactgct cagtacactc ggtggcttac gcggcgcaga cggtaaaccc 240
gtggcgctgg cgttgcacca ggggcattac gaactggaca tccagatgaa agcggcggcc 300
gaggttatta aagccaacca tgccctgccc tatgccgtgt acgtctccga tccttgtgac 360
gggcgtactc agggtacaac ggggatgttt gattcgctac cataccgaaa tgacgcatcg 420
atggtaatgc gccgccttat tcgctctctg cccgacgcga aagcagttat tggtgtggcg 480
agttgcgata aggggcttcc ggccaccatg atggcactcg ccgcgcagca caacatcgca 540
accgtgctgg tccccggcgg cgcgacgctg cccgcaaagg atggagaaga caacggcaag 600
gtgcaaacca ttggcgcacg cttcgccaat ggcgaattat ctctacagga cgcacgccgt 660
gcgggctgta aagcctgtgc ctcttccggc ggcggctgtc aatttttggg cactgccggg 720
acatctcagg tggtggccga aggattggga ctggcaatcc cacattcagc cctggcccct 780
tccggtgagc ctgtgtggcg ggagatcgcc agagcttccg cgcgagctgc gctgaacctg 840
agtcaaaaag gcatcaccac ccgggaaatt ctcaccgata aagcgataga gaatgcgatg 900
acggtccatg ccgcgttcgg tggttcaaca aacctgctgt tacacatccc ggcaattgct 960
caccaggcag gttgccatat cccgaccgtt gatgactgga tccgcatcaa caagcgcgtg 1020
ccccgactgg tgagcgtact gcctaatggc ccggtttatc atccaacggt caatgccttt 1080
atggcaggtg gtgtgccgga agtcatgttg catctgcgca gcctcggatt gttgcatgaa 1140
gacgttatga cggttaccgg cagcacgctg aaagaaaacc tcgactggtg ggagcactcc 1200
gaacggcgtc agcggttcaa gcaactcctg ctcgatcagg aacaaatcaa cgctgacgaa 1260
gtgatcatgt ctccgcagca agcaaaagcg cgcggattaa cctcaactat caccttcccg 1320
gtgggcaata ttgcgccaga aggttcggtg atcaaatcca ccgccattga cccctcgatg 1380
attgatgagc aaggtatcta ttaccataaa ggtgtggcga aggtttatct gtccgagaaa 1440
agtgcgattt acgatatcaa acatgacaag atcaaggcgg gcgatattct ggtcattatt 1500
ggcgttggac cttcaggtac agggatggaa gaaacctacc aggttaccag tgccctgaag 1560
catctgtcat acggtaagca tgtttcgtta atcaccgatg cacgtttctc gggcgtttct 1620
actggcgcgt gcatcggcca tgtggggcca gaagcgctgg ccggaggccc catcggtaaa 1680
ttacgcaccg gggatttaat tgaaattaaa attgattgtc gcgagcttca cggcgaagtc 1740
aatttcctcg gaacccgtag cgatgaacaa ttaccttcac aggaggaggc aactgcaata 1800
ttaaatgcca gacccagcca tcaggattta cttcccgatc ctgaattgcc agatgatacc 1860
cggctatggg caatgcttca ggccgtgagt ggtgggacat ggaccggttg tatttatgat 1920
gtaaacaaaa ttggcgcggc tttgcgcgat tttatgaata aaaactga 1968
<210> 65
<211> 1383
<212> DNA
<213> Escherichia coli
<400> 65
atgacgcaat taaccatgaa agacaaaatt ggctacgggc tgggagacac cgcctgcggc 60
ttcgtctggc aggccacgat gttcctgctg gcctatttct acaccgacgt cttcggcctg 120
tcggcgggga ttatgggcac gctgtttttg gtctcccgcg tgctcgacgc cgtcaccgac 180
ccgctgatgg ggctgctggt agaccgcacc cgcacgcggc acggccagtt ccgcccgttc 240
ctgctgtggg gggccatccc gttcggcatc gtctgcgtgc tgaccttcta cacgccggac 300
ttctccgcac agggcaagat catctacgcc tgcgtgacct acattctcct gaccctggtc 360
tacaccttcg ttaacgtgcc gtactgcgcc atgccgggcg tcatcaccgc cgacccgaaa 420
gagcgtcacg ccctgcagtc ctggcgcttc ttcctggcgg cggcgggctc gctcgctatc 480
agcggcatcg cgctgccgct ggtgagcatc atcggcaaag gggacgagca ggtgggctac 540
ttcggcgcca tgtgcgtgct ggggctgagc ggcgtggtgc tgctctacgt ctgcttcttc 600
acgaccaaag agcgctacac ctttgaggtg cagccgggct cgtcggtggc gaaagacctt 660
aagctgctgc tgggcaacag ccagtggcgc atcatgtgcg cgttcaagat gatggcgacc 720
tgctccaacg tggtgcgcgg cggggcgacg ctctacttcg tgaaatacgt gatggatcac 780
ccggagttgg cgacccagtt tttactttac ggcagcctcg ccaccatgtt cggctcgctt 840
tgctcctcac gcctgctggg ccgcttcgac cgcgtcaccg ccttcaagtg gatcatcgtc 900
gcctactcgc tgatcagcct gctgattttc gtcaccccgg cggagcacat cgcgctcatt 960
tttgccctca acatcctgtt cctgttcgtc tttaatacca ccacgccgct gcagtggctg 1020
atggcttctg acgtggtgga ctacgaggag agccgcagcg gtcgccgcct cgacgggctg 1080
gtgttctcca cctacctgtt cagcctgaag attggcctgg cgattggcgg ggcggtggtg 1140
ggctggatcc tggcgtacgt caactattcc gccagcagca gcgtgcagcc ggttgaggtg 1200
ctcaccacca tcaaaattct gttctgcgtg gtgccggtgg tgctctacgc gggcatgttc 1260
atcatgctgt cgctctacaa gctcaccgat gcccgcgtgg aggccatcag ccggcagctg 1320
attaagcacc gcgcggcgca gggcgaggcc gttcccgacg ccgcgacagc cgcatcccat 1380
taa 1383
<210> 66
<211> 1041
<212> DNA
<213> Escherichia coli
<400> 66
atgcagaaca tcatccgaaa aggaggaact atgaaggctg cagttgttac gaaggatcat 60
catgttgacg ttacgtataa aacactgcgc tcactgaaac atggcgaagc cctgctgaaa 120
atggagtgtt gtggtgtatg tcataccgat cttcatgtta agaatggcga ttttggtgac 180
aaaaccggcg taattctggg ccatgaaggc atcggtgtgg tggcagaagt gggtccaggt 240
gtcacctcat taaaaccagg cgatcgtgcc agcgtggcgt ggttctacga aggatgcggt 300
cattgcgaat actgtaacag tggtaacgaa acgctctgcc gttcagttaa aaatgccgga 360
tacagcgttg atggcgggat ggcggaagag tgcatcgtgg tcgccgatta cgcggtaaaa 420
gtgccagatg gtctggactc ggcggcggcc agcagcatta cctgtgcggg agtcaccacc 480
tacaaagccg ttaagctgtc aaaaattcgt ccagggcagt ggattgctat ctacggtctt 540
ggcggtctgg gtaacctcgc cctgcaatac gcgaagaatg tctttaacgc caaagtgatc 600
gccattgatg tcaatgatga gcagttaaaa ctggcaaccg aaatgggcgc agatttagcg 660
attaactcac acaccgaaga cgccgccaaa attgtgcagg agaaaactgg tggcgctcac 720
gctgcggtgg taacagcggt agctaaagct gcgtttaact cggcagttga tgctgtccgt 780
gcaggcggtc gtgttgtggc tgtcggtcta ccgccggagt ctatgagcct ggatatccca 840
cgtcttgtgc tggatggtat tgaagtggtc ggttcgctgg tcggcacgcg ccaggattta 900
actgaagcct tccagtttgc cgccgaaggt aaagtggtgc cgaaagtcgc cctgcgtccg 960
ttagcggaca tcaacaccat ctttactgag atggaagaag gcaaaatccg tggccgcatg 1020
gtgattgatt tccgtcacta a 1041
<210> 67
<211> 1164
<212> DNA
<213> Escherichia coli
<400> 67
atgaacaact ttaatctgca caccccaacc cgcattctgt ttggtaaagg cgcaatcgct 60
ggtttacgcg aacaaattcc tcacgatgct cgcgtattga ttacctacgg cggcggcagc 120
gtgaaaaaaa ccggcgttct cgatcaagtt ctggatgccc tgaaaggcat ggacgtgctg 180
gaatttggcg gtattgagcc aaacccggct tatgaaacgc tgatgaacgc cgtgaaactg 240
gttcgcgaac agaaagtgac tttcctgctg gcggttggcg gcggttctgt actggacggc 300
accaaattta tcgccgcagc ggctaactat ccggaaaata tcgatccgtg gcacattctg 360
caaacgggcg gtaaagagat taaaagcgcc atcccgatgg gctgtgtgct gacgctgcca 420
gcaaccggtt cagaatccaa cgcaggcgcg gtgatctccc gtaaaaccac aggcgacaag 480
caggcgttcc attctgccca tgttcagccg gtatttgccg tgctcgatcc ggtttatacc 540
tacaccctgc cgccgcgtca ggtggctaac ggcgtagtgg acgcctttgt acacaccgtg 600
gaacagtatg ttaccaaacc ggttgatgcc aaaattcagg accgtttcgc agaaggcatt 660
ttgctgacgc taatcgaaga tggtccgaaa gccctgaaag agccagaaaa ctacgatgtg 720
cgcgccaacg tcatgtgggc ggcgactcag gcgctgaacg gtttgattgg cgctggcgta 780
ccgcaggact gggcaacgca tatgctgggc cacgaactga ctgcgatgca cggtctggat 840
cacgcgcaaa cactggctat cgtcctgcct gcactgtgga atgaaaaacg cgataccaag 900
cgcgctaagc tgctgcaata tgctgaacgc gtctggaaca tcactgaagg ttccgatgat 960
gagcgtattg acgccgcgat tgccgcaacc cgcaatttct ttgagcaatt aggcgtgccg 1020
acccacctct ccgactacgg tctggacggc agctccatcc cggctttgct gaaaaaactg 1080
gaagagcacg gcatgaccca actgggcgaa aatcatgaca ttacgttgga tgtcagccgc 1140
cgtatatacg aagccgcccg ctaa 1164
<210> 68
<211> 1587
<212> DNA
<213> Pseudomonas putida
<400> 68
atggcttcgg tacacggcac cacatacgaa ctcttgcgac gtcaaggcat cgatacggtc 60
ttcggcaatc ctggctcgaa cgagctcccg tttttgaagg actttccaga ggactttcga 120
tacatcctgg ctttgcagga agcgtgtgtg gtgggcattg cagacggcta tgcgcaagcc 180
agtcggaagc cggctttcat taacctgcat tctgctgctg gtaccggcaa tgctatgggt 240
gcactcagta acgcctggaa ctcacattcc ccgctgatcg tcactgccgg ccagcagacc 300
agggcgatga ttggcgttga agctctgctg accaacgtcg atgccgccaa cctgccacga 360
ccacttgtca aatggagcta cgagcccgca agcgcagcag aagtccctca tgcgatgagc 420
agggctatcc atatggcaag catggcgcca caaggccctg tctatctttc ggtgccatat 480
gacgattggg ataaggatgc tgatcctcag tcccaccacc tttttgatcg ccatgtcagt 540
tcatcagtac gcctgaacga ccaggatctc gatattctgg tgaaagctct caacagcgca 600
tccaacccgg cgatcgtcct gggcccggac gtcgacgcag caaatgcgaa cgcagactgc 660
gtcatgttgg ccgaacgcct caaagctccg gtttgggttg cgccatccgc tccacgctgc 720
ccattcccta cccgtcatcc ttgcttccgt ggattgatgc cagctggcat cgcagcgatt 780
tctcagctgc tcgaaggtca cgatgtggtt ttggtaatcg gcgctccagt gttccgttac 840
caccaatacg acccaggtca atatctcaaa cctggcacgc gattgatttc ggtgacctgc 900
gacccgctcg aagctgcacg cgcgccaatg ggcgatgcga tcgtggcaga cattggtgcg 960
atggctagcg ctcttgccaa cttggttgaa gagagcagcc gccagctccc aactgcagct 1020
ccggaacccg cgaaggttga ccaagacgct ggccgacttc acccagagac agtgttcgac 1080
acactgaacg acatggcccc ggagaatgcg atttacctga acgagtcgac ttcaacgacc 1140
gcccaaatgt ggcagcgcct gaacatgcgc aaccctggta gctactactt ctgtgcagct 1200
ggcggactgg gcttcgccct gcctgcagca attggcgttc aactcgcaga acccgagcga 1260
caagtcatcg ccgtcattgg cgacggatcg gcgaactaca gcattagtgc gttgtggact 1320
gcagctcagt acaacatccc cactatcttc gtgatcatga acaacggcac ctacggtgcg 1380
ttgcgatggt ttgccggcgt tctcgaagca gaaaacgttc ctgggctgga tgtgccaggg 1440
atcgacttcc gcgcactcgc caagggctat ggtgtccaag cgctgaaagc cgacaacctt 1500
gagcagctca agggttcgct acaagaagcg ctttctgcca aaggcccggt acttatcgaa 1560
gtaagcaccg taagcccggt gaagtga 1587
<210> 69
<211> 1323
<212> DNA
<213> Escherichia coli
<400> 69
atgcaagcct attttgacca gctcgatcgc gttcgttatg aaggctcaaa atcctcaaac 60
ccgttagcat tccgtcacta caatcccgac gaactggtgt tgggtaagcg tatggaagag 120
cacttgcgtt ttgccgcctg ctactggcac accttctgct ggaacggggc ggatatgttt 180
ggtgtggggg cgtttaatcg tccgtggcag cagcctggtg aggcactggc gttggcgaag 240
cgtaaagcag atgtcgcatt tgagtttttc cacaagttac atgtgccatt ttattgcttc 300
cacgatgtgg atgtttcccc tgagggcgcg tcgttaaaag agtacatcaa taattttgcg 360
caaatggttg atgtcctggc aggcaagcaa gaagagagcg gcgtgaagct gctgtgggga 420
acggccaact gctttacaaa ccctcgctac ggcgcgggtg cggcgacgaa cccagatcct 480
gaagtcttca gctgggcggc aacgcaagtt gttacagcga tggaagcaac ccataaattg 540
ggcggtgaaa actatgtcct gtggggcggt cgtgaaggtt acgaaacgct gttaaatacc 600
gacttgcgtc aggagcgtga acaactgggc cgctttatgc agatggtggt tgagcataaa 660
cataaaatcg gtttccaggg cacgttgctt atcgaaccga aaccgcaaga accgaccaaa 720
catcaatatg attacgatgc cgcgacggtc tatggcttcc tgaaacagtt tggtctggaa 780
aaagagatta aactgaacat tgaagctaac cacgcgacgc tggcaggtca ctctttccat 840
catgaaatag ccaccgccat tgcgcttggc ctgttcggtt ctgtcgacgc caaccgtggc 900
gatgcgcaac tgggctggga caccgaccag ttcccgaaca gtgtggaaga gaatgcgctg 960
gtgatgtatg aaattctcaa agcaggcggt ttcaccaccg gtggtctgaa cttcgatgcc 1020
aaagtacgtc gtcaaagtac tgataaatat gatctgtttt acggtcatat cggcgcgatg 1080
gatacgatgg cactggcgct gaaaattgca gcgcgcatga ttgaagatgg cgagctggat 1140
aaacgcatcg cgcagcgtta ttccggctgg aatagcgaat tgggccagca aatcctgaaa 1200
ggccaaatgt cactggcaga tttagccaaa tatgctcagg aacatcattt gtctccggtg 1260
catcagagtg gtcgccagga acaactggaa aatctggtaa accattatct gttcgacaaa 1320
taa 1323
<210> 70
<211> 1455
<212> DNA
<213> Escherichia coli
<400> 70
atgtatatcg ggatagatct tggcacctcg ggcgtaaaag ttattttgct caacgagcag 60
ggtgaggtgg ttgctgcgca aacggaaaag ctgaccgttt cgcgcccgca tccactctgg 120
tcggaacaag acccggaaca gtggtggcag gcaactgatc gcgcaatgaa agctctgggc 180
gatcagcatt ctctgcagga cgttaaagca ttgggtattg ccggccagat gcacggagca 240
accttgctgg atgctcagca acgggtgtta cgccctgcca ttttgtggaa cgacgggcgc 300
tgtgcgcaag agtgcacttt gctggaagcg cgagttccgc aatcgcgggt gattaccggc 360
aacctgatga tgcccggatt tactgcgcct aaattgctat gggttcagcg gcatgagccg 420
gagatattcc gtcaaatcga caaagtatta ttaccgaaag attacttgcg tctgcgtatg 480
acgggggagt ttgccagcga tatgtctgac gcagctggca ccatgtggct ggatgtcgca 540
aagcgtgact ggagtgacgt catgctgcag gcttgcgact tatctcgtga ccagatgccc 600
gcattatacg aaggcagcga aattactggt gctttgttac ctgaagttgc gaaagcgtgg 660
ggtatggcga cggtgccagt tgtcgcaggc ggtggcgaca atgcagctgg tgcagttggt 720
gtgggaatgg ttgatgctaa tcaggcaatg ttatcgctgg ggacgtcggg ggtctatttt 780
gctgtcagcg aagggttctt aagcaagcca gaaagcgccg tacatagctt ttgccatgcg 840
ctaccgcaac gttggcattt aatgtctgtg atgctgagtg cagcgtcgtg tctggattgg 900
gccgcgaaat taaccggcct gagcaatgtc ccagctttaa tcgctgcagc tcaacaggct 960
gatgaaagtg ccgagccagt ttggtttctg ccttatcttt ccggcgagcg tacgccacac 1020
aataatcccc aggcgaaggg ggttttcttt ggtttgactc atcaacatgg ccccaatgaa 1080
ctggcgcgag cagtgctgga aggcgtgggt tatgcgctgg cagatggcat ggatgtcgtg 1140
catgcctgcg gtattaaacc gcaaagtgtt acgttgattg ggggcggggc gcgtagtgag 1200
tactggcgtc agatgctggc ggatatcagc ggtcagcagc tcgattaccg tacggggggg 1260
gatgtggggc cagcactggg cgcagcaagg ctggcgcaga tcgcggcgaa tccagagaaa 1320
tcgctcattg aattgttgcc gcaactaccg ttagaacagt cgcatctacc agatgcgcag 1380
cgttatgccg cttatcagcc acgacgagaa acgttccgtc gcctctatca gcaacttctg 1440
ccattaatgg cgtaa 1455
<210> 71
<211> 906
<212> DNA
<213> Escherichia coli
<400> 71
atgaaaaaat tcagcggcat tattccaccg gtatccagca cgtttcatcg tgacggaacc 60
cttgataaaa aggcaatgcg cgaagttgcc gacttcctga ttaataaagg ggtcgacggg 120
ctgttttatc tgggtaccgg tggtgaattt agccaaatga atacagccca gcgcatggca 180
ctcgccgaag aagctgtaac cattgtcgac gggcgagtgc cggtattgat tggcgtcggt 240
tccccttcca ctgacgaagc ggtcaaactg gcgcagcatg cgcaagccta cggcgctgat 300
ggtatcgtcg ccatcaaccc ctactactgg aaagtcgcac cacgaaatct tgacgactat 360
taccagcaga tcgcccgtag cgtcacccta ccggtgatcc tgtacaactt tccggatctg 420
acgggtcagg acttaacccc ggaaaccgtg acgcgtctgg ctctgcaaaa cgagaatatc 480
gttggcatca aagacaccat cgacagcgtt ggtcacttgc gtacgatgat caacacagtt 540
aagtcggtac gcccgtcgtt ttcggtattc tgcggttacg atgatcattt gctgaatacg 600
atgctgctgg gcggcgacgg tgcgataacc gccagcgcta actttgctcc ggaactctcc 660
gtcggcatct accgcgcctg gcgtgaaggc gatctggcga ccgctgcgac gctgaataaa 720
aaactactac aactgcccgc tatttacgcc ctcgaaacac cgtttgtctc actgatcaaa 780
tacagcatgc agtgtgtcgg gctgcctgta gagacatatt gcttaccacc gattcttgaa 840
gcatctgaag aagcaaaaga taaagtccac gtgctgctta ccgcgcaggg cattttacca 900
gtctga 906
<210> 72
<211> 909
<212> DNA
<213> Escherichia coli
<400> 72
atgccgcagt ccgcgttgtt cacgggaatc attccccctg tctccaccat ttttaccgcc 60
gacggccagc tcgataagcc gggcaccgcc gcgctgatcg acgatctgat caaagcaggc 120
gttgacggcc tgttcttcct gggcagcggt ggcgagttct cccagctcgg cgccgaagag 180
cgtaaagcca ttgcccgctt tgctatcgat catgtcgatc gtcgcgtgcc ggtgctgatc 240
ggcaccggcg gcaccaacgc ccgggaaacc atcgaactca gccagcacgc gcagcaggcg 300
ggcgcggacg gcatcgtggt gatcaacccc tactactgga aagtgtcgga agcgaacctg 360
atccgctatt tcgagcaggt ggccgacagc gtcacgctgc cggtgatgct ctataacttc 420
ccggcgctga ccgggcagga tctgactccg gcgctggtga aaaccctcgc cgactcgcgc 480
agcaatatta tcggcatcaa agacaccatc gactccgtcg cccacctgcg cagcatgatc 540
cataccgtca aaggtgccca tccgcacttc accgtgctct gcggctacga cgatcatctg 600
ttcaataccc tgctgctcgg cggcgacggg gcgatatcgg cgagcggcaa ctttgccccg 660
caggtgtcgg tgaatcttct gaaagcctgg cgcgacgggg acgtggcgaa agcggccggg 720
tatcatcaga ccttgctgca aattccgcag atgtatcagc tggatacgcc gtttgtgaac 780
gtgattaaag aggcgatcgt gctctgcggt cgtcctgtct ccacgcacgt gctgccgccc 840
gcctcgccgc tggacgagcc gcgcaaggcg cagctgaaaa ccctgctgca acagctcaag 900
ctttgctga 909
<210> 73
<211> 975
<212> DNA
<213> Escherichia coli
<400> 73
atgaagccgt ccgttatcct ctacaaagcc ttacctgatg atttactgca acgcctgcaa 60
gagcatttca ccgttcacca ggtggcaaac ctcagcccac aaaccgtcga acaaaatgca 120
gcaatttttg ccgaagctga aggtttactg ggttcaaacg agaatgtaaa tgccgcattg 180
ctggaaaaaa tgccgaaact gcgtgccaca tcaacgatct ccgtcggcta tgacaatttt 240
gatgtcgatg cgcttaccgc ccgaaaaatt ctgctgatgc acacgccaac cgtattaaca 300
gaaaccgtcg ccgatacgct gatggcgctg gtgttgtcta ccgctcgtcg ggttgtggag 360
gtagcagaac gggtaaaagc aggcgaatgg accgcgagca taggcccgga ctggtacggc 420
actgacgttc accataaaac actgggcatt gtcgggatgg gacggatcgg catggcgctg 480
gcacaacgtg cgcactttgg cttcaacatg cccatcctct ataacgcgcg ccgccaccat 540
aaagaagcag aagaacgctt caacgcccgc tactgcgatt tggatactct gttacaagag 600
tcagatttcg tttgcctgat cctgccgtta actgatgaga cgcatcatct gtttggcgca 660
gaacaattcg ccaaaatgaa atcctccgcc attttcatta atgccggacg tggcccggtg 720
gttgacgaaa atgcactgat cgcagcattg cagaaaggcg aaattcacgc tgccgggctg 780
gatgtcttcg aacaagagcc actgtccgta gattcgccgt tgctctcaat ggccaacgtc 840
gtcgcagtac cgcatattgg atctgccacc catgagacgc gttatggcat ggccgcctgt 900
gccgtggata atttgattga tgcgttacaa ggaaaggttg agaagaactg tgtgaatccg 960
cacgtcgcgg actaa 975
<210> 74
<211> 939
<212> DNA
<213> Escherichia coli
<400> 74
atggatatca tcttttatca cccaacgttc gatacccaat ggtggattga ggcactgcgc 60
aaagctattc ctcaggcaag agtcagagca tggaaaagcg gagataatga ctctgctgat 120
tatgctttag tctggcatcc tcctgttgaa atgctggcag ggcgcgatct taaagcggtg 180
ttcgcactcg gggccggtgt tgattctatt ttgagcaagc tacaggcaca ccctgaaatg 240
ctgaaccctt ctgttccact ttttcgcctg gaagataccg gtatgggcga gcaaatgcag 300
gaatatgctg tcagtcaggt gctgcattgg tttcgacgtt ttgacgatta tcgcatccag 360
caaaatagtt cgcattggca accgctgcct gaatatcatc gggaagattt taccatcggc 420
attttgggcg caggcgtact gggcagtaaa gttgctcaga gtctgcaaac ctggcgcttt 480
ccgctgcgtt gctggagtcg aacccgtaaa tcgtggcctg gcgtgcaaag ctttgccgga 540
cgggaagaac tgtctgcatt tctgagccaa tgtcgggtat tgattaattt gttaccgaat 600
acccctgaaa ccgtcggcat tattaatcaa caattactcg aaaaattacc ggatggcgcg 660
tatctcctca acctggcgcg tggtgttcat gttgtggaag atgacctgct cgcggcgctg 720
gatagcggca aagttaaagg cgcaatgttg gatgttttta atcgtgaacc cttaccgcct 780
gaaagtccgc tctggcaaca tccacgcgtg acgataacac cacatgtcgc cgcgattacc 840
cgtcccgctg aagctgtgga gtacatttct cgcaccattg cccagctcga aaaaggggag 900
agggtctgcg ggcaagtcga ccgcgcacgc ggctactaa 939
<210> 75
<211> 1434
<212> DNA
<213> Escherichia coli
<400> 75
atgtttaaga atgcatttgc taacctgcaa aaggtcggta aatcgctgat gctgccggta 60
tccgtactgc ctatcgcagg tattctgctg ggcgtcggtt ccgcgaattt cagctggctg 120
cccgccgttg tatcgcatgt tatggcagaa gcaggcggtt ccgtctttgc aaacatgcca 180
ctgatttttg cgatcggtgt cgccctcggc tttaccaata acgatggcgt atccgcgctg 240
gccgcagttg ttgcctatgg catcatggtt aaaaccatgg ccgtggttgc gccactggta 300
ctgcatttac ctgctgaaga aatcgcctct aaacacctgg cggatactgg cgtactcgga 360
gggattatct ccggtgcgat cgcagcgtac atgtttaacc gtttctaccg tattaagctg 420
cctgagtatc ttggcttctt tgccggtaaa cgctttgtgc cgatcatttc tggcctggct 480
gccatcttta ctggcgttgt gctgtccttc atttggccgc cgattggttc tgcaatccag 540
accttctctc agtgggctgc ttaccagaac ccggtagttg cgtttggcat ttacggtttc 600
atcgaacgtt gcctggtacc gtttggtctg caccacatct ggaacgtacc tttccagatg 660
cagattggtg aatacaccaa cgcagcaggt caggttttcc acggcgacat tccgcgttat 720
atggcgggtg acccgactgc gggtaaactg tctggtggct tcctgttcaa aatgtacggt 780
ctgccagctg ccgcaattgc tatctggcac tctgctaaac cagaaaaccg cgcgaaagtg 840
ggcggtatta tgatctccgc ggcgctgacc tcgttcctga ccggtatcac cgagccgatc 900
gagttctcct tcatgttcgt tgcgccgatc ctgtacatca tccacgcgat tctggcaggc 960
ctggcattcc caatctgtat tcttctgggg atgcgtgacg gtacgtcgtt ctcgcacggt 1020
ctgatcgact tcatcgttct gtctggtaac agcagcaaac tgtggctgtt cccgatcgtc 1080
ggtatcggtt atgcgattgt ttactacacc atcttccgcg tgctgattaa agcactggat 1140
ctgaaaacgc cgggtcgtga agacgcgact gaagatgcaa aagcgacagg taccagcgaa 1200
atggcaccgg ctctggttgc tgcatttggt ggtaaagaaa acattactaa cctcgacgca 1260
tgtattaccc gtctgcgcgt cagcgttgct gatgtgtcta aagtggatca ggccggcctg 1320
aagaaactgg gcgcagcggg cgtagtggtt gctggttctg gtgttcaggc gattttcggt 1380
actaaatccg ataacctgaa aaccgagatg gatgagtaca tccgtaacca ctaa 1434
<210> 76
<211> 510
<212> DNA
<213> Escherichia coli
<400> 76
atgggtttgt tcgataaact gaaatctctg gtttccgacg acaagaagga taccggaact 60
attgagatca ttgctccgct ctctggcgag atcgtcaata tcgaagacgt gccggatgtc 120
gtttttgcgg aaaaaatcgt tggtgatggt attgctatca aaccaacggg taacaaaatg 180
gtcgcgccag tagacggcac cattggtaaa atctttgaaa ccaaccacgc attctctatc 240
gaatctgata gcggcgttga actgttcgtc cacttcggta tcgacaccgt tgaactgaaa 300
ggcgaaggct tcaagcgtat tgctgaagaa ggtcagcgcg tgaaagttgg cgatactgtc 360
attgaatttg atctgccgct gctggaagag aaagccaagt ctaccctgac tccggttgtt 420
atctccaaca tggacgaaat caaagaactg atcaaactgt ccggtagcgt aaccgtgggt 480
gaaaccccgg ttatccgcat caagaagtaa 510
<210> 77
<211> 258
<212> DNA
<213> Escherichia coli
<400> 77
atgttccagc aagaagttac cattaccgct ccgaacggtc tgcacacccg ccctgctgcc 60
cagtttgtaa aagaagctaa gggcttcact tctgaaatta ctgtgacttc caacggcaaa 120
agcgccagcg cgaaaagcct gtttaaactg cagactctgg gcctgactca aggtaccgtt 180
gtgactatct ccgcagaagg cgaagacgag cagaaagcgg ttgaacatct ggttaaactg 240
atggcggaac tcgagtaa 258
<210> 78
<211> 1254
<212> DNA
<213> Escherichia coli
<400> 78
atgtactatt taaaaaacac aaacttttgg atgttcggtt tattcttttt cttttacttt 60
tttatcatgg gagcctactt cccgtttttc ccgatttggc tacatgacat caaccatatc 120
agcaaaagtg atacgggtat tatttttgcc gctatttctc tgttctcgct attattccaa 180
ccgctgtttg gtctgctttc tgacaaactc gggctgcgca aatacctgct gtggattatt 240
accggcatgt tagtgatgtt tgcgccgttc tttattttta tcttcgggcc actgttacaa 300
tacaacattt tagtaggatc gattgttggt ggtatttatc taggcttttg ttttaacgcc 360
ggtgcgccag cagtagaggc atttattgag aaagtcagcc gtcgcagtaa tttcgaattt 420
ggtcgcgcgc ggatgtttgg ctgtgttggc tgggcgctgt gtgcctcgat tgtcggcatc 480
atgttcacca tcaataatca gtttgttttc tggctgggct ctggctgtgc actcatcctc 540
gccgttttac tctttttcgc caaaacggat gcgccctctt ctgccacggt tgccaatgcg 600
gtaggtgcca accattcggc atttagcctt aagctggcac tggaactgtt cagacagcca 660
aaactgtggt ttttgtcact gtatgttatt ggcgtttcct gcacctacga tgtttttgac 720
caacagtttg ctaatttctt tacttcgttc tttgctaccg gtgaacaggg tacgcgggta 780
tttggctacg taacgacaat gggcgaatta cttaacgcct cgattatgtt ctttgcgcca 840
ctgatcatta atcgcatcgg tgggaaaaac gccctgctgc tggctggcac tattatgtct 900
gtacgtatta ttggctcatc gttcgccacc tcagcgctgg aagtggttat tctgaaaacg 960
ctgcatatgt ttgaagtacc gttcctgctg gtgggctgct ttaaatatat taccagccag 1020
tttgaagtgc gtttttcagc gacgatttat ctggtctgtt tctgcttctt taagcaactg 1080
gcgatgattt ttatgtctgt actggcgggc aatatgtatg aaagcatcgg tttccagggc 1140
gcttatctgg tgctgggtct ggtggcgctg ggcttcacct taatttccgt gttcacgctt 1200
agcggccccg gcccgctttc cctgctgcgt cgtcaggtga atgaagtcgc ttaa 1254
<210> 79
<211> 1545
<212> DNA
<213> Escherichia coli
<400> 79
atggatgtca ttaaaaagaa acattggtgg caaagcgacg cgctgaaatg gtcagtgcta 60
ggtctgctcg gcctgctggt gggttacctt gttgttttaa tgtacgcaca aggggaatac 120
ctgttcgcca ttaccacgct gatattgagt tcagcggggc tgtatatttt cgccaatcgt 180
aaagcctacg cctggcgcta tgtttacccg ggaatggctg gaatgggatt attcgtcctc 240
ttccctctgg tctgcaccat cgccattgcc ttcaccaact acagcagcac taaccagctg 300
acttttgaac gtgcgcagga agtgttgtta gatcgctcct ggcaagcagg caaaacctat 360
aactttggtc tttacccggc gggcgatgag tggcaactgg cgctcagcga cggcgaaacc 420
ggcaaaaatt acctctccga cgcttttaaa tttggcggcg agcaaaaact gcaactgaaa 480
gaaacgaccg cccagcccga aggcgaacgc gcgaatctgc gcgtgattac ccagaatcgt 540
caggcgctga gtgacattac cgccattctg ccggatggca acaaagtgat gatgagctcc 600
ctgcgccagt tttctggcac gcagccgctc tacacactcg acggtgacgg cacgttgacg 660
aataatcaga gcggcgtgaa atatcgtccg aataaccaaa ttggctttta ccagtccatt 720
accgccgacg gcaactgggg tgatgaaaag ctaagccccg gttacaccgt gaccaccggc 780
tggaaaaact ttacccgcgt ctttaccgac gaaggcattc agaaaccgtt cctcgccatt 840
ttcgtctgga ccgtggtgtt ctcgctgatc actgtctttt taacggtggc ggtcggcatg 900
gttctggcgt gtctggtgca gtgggaagcg ttgcgcggca aagcggtcta tcgcgtcctg 960
ctgattctgc cctacgcggt gccatcgttc atttcaatct tgattttcaa agggttgttt 1020
aaccagagct tcggtgaaat caacatgatg ttgagcgcgc tgtttggcgt gaagcccgcc 1080
tggttcagcg atccgaccac cgcccgcacg atgctaatta tcgtcaatac ctggctgggt 1140
tatccgtaca tgatgatcct ctgcatgggc ttgctgaaag cgattccgga cgatttgtat 1200
gaagcctcag caatggatgg cgcaggtccg ttccagaact tctttaagat tacgctgccg 1260
ctgctgatta aaccgctgac gccgctgatg atcgccagct tcgcctttaa ctttaacaac 1320
ttcgtgctga ttcaactgtt aaccaacggc ggcccggatc gtcttggcac gaccacgcca 1380
gccggttata ccgacctgct tgttaactac acctaccgca tcgcttttga aggcggcggg 1440
ggtcaggact tcggtctggc ggcagcaatt gccacgctga tcttcctgct ggtgggtgcg 1500
ctggcgatag tgaacctgaa agccacgcga atgaagtttg attaa 1545
<210> 80
<211> 891
<212> DNA
<213> Escherichia coli
<400> 80
atggcaatgg tccaaccgaa atcgcaaaaa gctcgtttat ttattactca cctgctactg 60
ctacttttta tcgcagcgat tatgttcccg ctgctgatgg tcgtcgctat ctcgctgcgt 120
cagggaaact ttgcgaccgg cagcctgatc ccggagcaaa tctcctggga tcactggaaa 180
ctggcgttag gttttagcgt tgaacaggct gatggtcgca ttacgccacc gccattcccg 240
gtactgctgt ggctgtggaa ctcggtaaag gtcgccggga tttccgcgat tggcattgtg 300
gcgctctcca ccacctgcgc ctacgctttc gcccgtatgc gctttccagg caaagcgacg 360
ctgctgaaag gaatgctgat tttccagatg ttcccggcag tactttcact ggtcgcgttg 420
tatgcgttgt ttgatcgtct gggtgagtac attccattca ttggcctgaa tactcacggc 480
ggcgtaattt tcgcgtatct gggtgggatt gcgctgcatg tctggaccat caaaggctat 540
ttcgaaacca tcgacagttc gctggaagaa gctgctgcgc tggatggtgc gacaccgtgg 600
caggccttcc gccttgtcct gttgccgctg tcagtaccga ttctggcggt ggtattcatc 660
ctgtcgttta tcgctgccat tactgaagtt ccggtcgcgt cgctgttact gcgtgacgta 720
aacagctaca ccctggccgt ggggatgcag caatacctca acccgcaaaa ctacctgtgg 780
ggtgactttg ccgccgctgc cgtgatgtct gcattaccga tcaccatcgt cttcttgctg 840
gctcaacgct ggctggtcaa cggcctgacg gcaggtggtg tgaaaggtta a 891
<210> 81
<211> 6546
<212> DNA
<213> Salmonella typhimurium
<400> 81
atgaatgcaa aagtttgggt tctgggcgac gcggtggtgg acctgctgcc ggagagcgaa 60
gggcgcctgc tgcagtgccc tggaggcgcg ccggctaacg tggcggtagg ggttgcccgc 120
cttggcggca acagcggatt tatcggcgcc gtcggcggtg acccgtttgg ccgctacatg 180
cgtcataccc tgcaacagga gcaggtcgac gtcagccata tgtatctcga cgatcagcac 240
cgcacgtcca ctgtggtcgt cgaccttgac gaccaggggg aacgcacctt tacctttatg 300
gtacgcccca gcgcggacct gttcctggtt gaagaagacc tgccacagtt tgccgccgga 360
cagtggttgc acgtctgctc catcgcgctc agcgccgagc ccagccgtag cactaccttc 420
gcggcgatgg agagcatcag gtctgccggc ggtcgggtca gctttgaccc taatattcgt 480
cccgatctct ggcaggatca ggctttgctg ctagcctgcc tcgatcgcgc tttgcacatg 540
gccaacgtgg taaagctatc ggaagaggag ctggtcttca tcagcagcag taatgattta 600
gcatacggaa tcgccagcgt aacggagcgc tatcagccag aattgctact ggtgacccgg 660
ggcaaagcgg gggtgcttgc cgcgtttcag cagaagttta cccatttcaa cgcccggcct 720
gtggccagcg tggacaccac cggcgcggga gacgcatttg tcgccggact gctcgccagc 780
cttgcggcta acgggatgcc aacggacatg accgcactgg aaccgacact cacgcttgca 840
cagacctgcg gcgccctggc caccacagcc aaaggtgcga tgaccgcctt gccttatcag 900
cgcgatctca accgtcagtt ttaatcctta aagccgcttt gcgcggctca ctttgttgca 960
tgcatcacat ttattaaacc ggtttagcat atttgtttta agaaaaacaa aggtcgggct 1020
taacatagcg cctaaaccgg tttagcaaaa attataattt tccattttta cttttgggat 1080
gccaacagca tgtacagaaa aagcacactt gcgatgctta tcgctttgct aaccagcgct 1140
gcctcagccc atgcgcaaac ggatataagc accattgaag cccgactcaa cgcgctggaa 1200
aaacgcctgc aggaggcaga aaacagggcg caaacggcgg aaaaccgcgc cggggcggcg 1260
gagaaaaaag ttcagcaact caccgcgcag cagcaaaaaa accagaactc gactcaggaa 1320
gtggctcagc gtaccgccag acttgagaaa aaagccgatg acaaaagcgg atttgagttt 1380
cacggttacg cccgctccgg cgtgataatg aatgattccg gcgccagcac caaatccgga 1440
gcctacataa cgccggcagg tgaaaccggc ggagctatcg gccgtctggg aaaccaggcc 1500
gatacctatg ttgaaatgaa tcttgaacat aagcagaccc tggataatgg ggccacgacc 1560
cgctttaagg tgatggtcgc cgacgggcaa acctcttata acgactggac tgcaagcacc 1620
agcgatctga acgttcgtca ggcctttgtc gaattgggta acctgccgac gttcgctggg 1680
ccatttaagg gctccaccct gtgggccggg aaacgtttcg accgcgacaa tttcgatatt 1740
cactggattg actctgatgt cgtgttcctc gccggtaccg gtggtggtat ctatgacgtg 1800
aagtggaacg acggcctgcg gagtaatttc tccctgtacg ggcgtaactt cggcgacatt 1860
gatgattcca gcaacagcgt gcagaactat atcctcacca tgaatcactt cgcaggtccg 1920
ctgcagatga tggtcagcgg tctgcgggcg aaggataacg acgagcgtaa agatagcaac 1980
ggcaatctgg caaaaggcga tgcggcaaac accggcgtgc atgcgctgct cggcctgcat 2040
aacgacagtt tctacggcct gcgcgacggt agcagtaaaa ccgctctgct ttatggtcat 2100
ggtctgggcg cagaggttaa aggtatcgga tctgatggcg cacttcgtcc gggagccgac 2160
acatggcgca ttgccagtta cggcaccacg ccgctcagcg aaaactggtc tgttgccccg 2220
gcaatgctgg cgcaacgcag taaagaccgc tatgccgatg gcgacagcta tcagtgggca 2280
acattcaacc tgcgtctgat tcaggcaatc aatcagaatt tcgctctcgc ctacgaaggc 2340
agctaccagt acatggatct taaacccgaa ggttataacg atcgtcaggc ggtgaacggt 2400
agcttctaca agctcacctt cgccccgaca tttaaggtcg gcagtatcgg tgatttcttc 2460
agtcgcccgg agattcgttt ctatacctcc tggatggact ggagcaaaaa actgaataat 2520
tacgccagcg acgacgccct gggcagtgac ggttttaact cgggcggcga atggtctttc 2580
ggtgtgcaga tggaaacctg gttctgacgc ttacgcctga tgacaggaat agccgggggt 2640
cagagcatct ttgtcacccc ggactcaact aagacgcaga aaaagcgctc ccgtgaacgc 2700
gggacgacaa cataaaaatg tttaagcctt aagagggtac tatggatttt gaacagattt 2760
cctgctcgct gcttccgctt cttggaggca aagaaaatat cgccagcgcc gcgcactgcg 2820
ccacgcgcct gcgcctggtg ctggtcgatg attcgctggc cgaccagcag gccatcggca 2880
aagttgaagg ggtgaagggc tgttttcgta atgccggaca gatgcagatt attttcggca 2940
ccggggtggt aaataaggtc tacgctgcct ttactcaggc ggcgggtatt agcgaatcca 3000
gcaaatcgga agccgccgac atcgcggcaa aaaagctcaa tccgttccag cgcatcgccc 3060
gcctgctatc aaacatcttc gtgccgataa tccctgccat cgtcgcctct ggtctgctga 3120
tgggcctgct gggaatggtc aaaacatacg gctgggttga cccgggcaac gccatctaca 3180
tcatgctgga tatgtgcagc tcggcggcat ttatcattct gccgattctg attggcttta 3240
ccgccgcccg cgaattcggc ggtaatcctt atctcggcgc gacgcttggc ggcattctga 3300
ctcatccagc gctgactaac gcctggggcg tggccgcggg tttccacacc atgaactttt 3360
tcggcttcga aattgccatg atcggctatc agggtacggt gttcccggta ctgctggcag 3420
tatggtttat gagcatcgtt gagaagcagt tgcgtcgcgc aatccccgat gccctggatt 3480
tgatcctgac gccgttcctg acggtgatta tatccggttt tatcgccctg ttgattatcg 3540
gcccggccgg tcgcgcactg ggcgacggta tctcgtttgt cctcagcacc ctgattagcc 3600
acgccggctg gctcgccggg ttactgtttg gcggtctcta ttcagttatc gtcattaccg 3660
gtattcatca cagcttccat gcggttgaag ccgggttgct gggcaatccc tccatcggcg 3720
tcaacttcct gctgccgatt tgggcgatgg ccaacgtcgc tcagggcgga gcctgtctgg 3780
cggtgtggtt caaaaccaaa gatgcaaaaa ttaaagccat tactctgccc tcggcgtttt 3840
ccgccatgct gggcatcacc gaggcggcga tttttggtat taacctgcgc tttgtgaagc 3900
catttattgc ggcgctgatt ggtggtgcgg cgggcggcgc atgggtggta tctgtacacg 3960
tctacatgac cgcggtcggc ttgacagcga tccccggcat ggccatcgtg caggccagtt 4020
cgctgttgaa ctacattatc gggatggtta tcgcctttgg cgtcgccttt acggtctccc 4080
tggttttgaa atacaaaacg gacgctgaat aatgtctctt ccatcacgac tgcctgcgat 4140
tttgcaggcc gtaatgcagg gccagccgcg cgcgctggcc gatagccact atccgcgctg 4200
gcaccatgcg ccggtcaccg ggctgatgaa cgaccccaac ggctttatcg aatttgccgg 4260
acgctatcat ctgttttatc agtggaaccc gctcgcctgc gatcatacgt ttaagtgctg 4320
ggcgcactgg agttccatcg atctgctgca ctggcagcat gagcccattg cgctgatgcc 4380
ggacgaagag tatgaccgta acggctgcta ctccggcagc gcggtggata acaacggtac 4440
gcttaccctg tgctataccg gcaacgtgaa gtttgccgag ggagggcgaa ccgcctggca 4500
atgcctggca acggaaaacg ctgacggcac cttccgcaaa atcggtccgg tcctgccgct 4560
gccggagggc tacaccggcc acgtgcgcga cccaaaagtc tggcgacacg aagacctgtg 4620
gtacatggtg ctgggcgcgc aggatcggca aaagcgcggc aaggtgctgc tgttcagctc 4680
tgcggatctc catcagtgga cgagtatggg tgaaatcgcc ggccacggca tcaatggcct 4740
cgacgacgtc ggctatatgt gggagtgccc ggatcttttt ccactcggcg accagcatat 4800
tctaatctgc tgtccgcagg ggattgcccg tgaggaagag tgctacctga acacctaccc 4860
ggcagtatgg atggcgggcg agtttgatta cgctgctggc gctttcagac acggcgaact 4920
gcacgaactg gacgccgggt ttgagttcta cgccccgcaa accatgctta ccagtgatgg 4980
ccgtcgtctg ctggtcggct ggatgggcgt gccggagggc gaagagatgc ttcagccgac 5040
cctgaacaac ggctggatcc atcagatgac ctgcctgcgt gagctggagt ttatcaacgg 5100
tcagctctat cagcgtccgc tacgggaact gagcgccctg cgcggtgaag cgaacggctg 5160
gtcggggaac gccctgccgc tggcaccgat ggaaatcgat ttgcaaaccc gcgggggcga 5220
tatgttgagc ctcgattttg gcggcgtatt aacccttgag tgcgatgcca gcggactccg 5280
cctggcccga cgcagtctcg ccagtgacga gatgcattat cgttactggc gcggaaacgt 5340
ccgctcgctg cgtgttttca tcgaccagtc gagcgtggag attttcataa acggcggtga 5400
aggggtgatg agcagccgct acttcccggc ctgctccggt cagctaacat tctccggcat 5460
cacgccggac gcattctgct actggccgct gcgaacttgc atggtagaat aagcgttttg 5520
cttcaggctc atggcgtcgt aatgaaaacc aaacgcgtaa ccattaaaga tatagccgaa 5580
caggctggcg tctccaaagc gaccgccagc ttggtactga atggtcgtgg caaggagctg 5640
cgcgtggcgc aggaaacgcg tgagcgcgta ctgtcgattg cccgtaagca tcactatcag 5700
ccaagcattc atgcccgctc gctgcgcaac aaccgcagcc acaccatcgg gctggtggtg 5760
ccggagatca ccaaccacgg ctttgcggtc tttgcccatg agctggagat gctgtgccgc 5820
gaggcgggcg tccagctgtt gatctcttgt actgatgaaa accccggtca ggagagcgtg 5880
gtggtcaata atatgattgc ccgccaggtc gacgggatga tcgtcgcttc ctgtatgcac 5940
aacgatgccg actatctcaa actcagccaa cagctgccag tggtgctgtt tgaccggtgc 6000
cccaatgaaa gcgcgctgcc gctggtaatg accgattcga ttaccccaac ggcggaactg 6060
atttcccgca tcgcgcctca gcatagcgat gagttctggt ttttaggcgg tcaggcgcgt 6120
ctgtcgccct cccgcgatcg tctgaccggg ttcacgcagg gtttggctca ggcgggtatt 6180
gccctgcgcc cggaatgggt gatcaacggc aattaccacc ccagctccgg ctatgagatg 6240
tttgccgcac tctgcgcgcg ccttgggcgg ccgcctaagg cgctattcac cgccgcctgc 6300
gggctgctcg aaggggttct gcgctatatg agccagcacc atttactcga ttccgatatt 6360
catctgacga gctttgacga tcactatctt tatgattcgc tgtcgctgcg tatcgacact 6420
gtccagcagg ataatcgcca gctggcctgg cactgctacg atctgataag ccagctgatc 6480
gagggcgata cgcccgaaac gctacaacgc tacctgcccg caaccctgca gtttcggcat 6540
cagtaa 6546
<210> 82
<211> 4885
<212> DNA
<213> Escherichia coli
<400> 82
ctatattgct gaaggtacag gcgtttccat aactatttgc tcgcgttttt tactcaggaa 60
gaaaatgcca aatagcaaca tcaggcagac aatacccgaa attgcgaaga aaactgtctg 120
gtagcctgcg tggtcaaaga gtatcccagt cggcgttgaa agcagcacaa tcccaagcga 180
actggcaatt tgaaaaccaa tcagaaagat cgtcgacgac aggcgcttat caaagtttgc 240
cacgctgtat ttgaagacgg atatgacaca aagtggaacc tcaatggcat gtaacagctt 300
cactaatgaa ataatccagg ggttaacgaa caacgcgcag gaaaggatac gcaacgccat 360
aatcacaaca ccgataagta atgcattttt tggccctacc cgattcacaa agaaaggaat 420
aatcgccatg cacagcgctt cgagtaccac ctggaatgag ttgagataac catacaggcg 480
cgttcctaca tcgtgtgatt cgaataaacc tgcataaaag acaggaaaga gttgttgatc 540
aaaaatgtta tagaaagacc acgtccccac aataaatatg acgaaaaccc agaagtttcg 600
atccttgaaa actgcgataa aatcctcttt ttttacccct cccgcatccg ccgctatgca 660
ctggtgatcc ttatctttaa aacgcatgtt gatcatcata aatacagcgc caaatagcga 720
gaccaaccag aagttgatat ggggactgat actaaaaaat ataccggcaa agaacgcgcc 780
aatagcatag ccaaaagatc cccaggcgcg cgctgttcca tattcgaaat gaaaatttcg 840
cgccattttt tcggtgaagc tgtcaagcaa accgcatccc gccagatacc ccaggccaaa 900
aaagagcgcc cccagaatta gacctacaga aaaattgctt tgcagtaacg gttcataaac 960
gtaaatcata aacggtccgg tcaagaccag aatgaaactc atacaccaga tgagcggttt 1020
cttcagaccg agtttatcct gaacgatgcc gtagaacatc ataaatagaa tgctggtaaa 1080
ctggttgacc gaataaagtg tacctaattc cgtccctgtt aatcctagat gtcctttcag 1140
ccaaatagcg tataacgacc accacagcga ccaggaaata aaaaagagaa atgagtaact 1200
ggatgcaaaa cgatagtacg catttctgaa tggaatattc agtgccataa ttacctgcct 1260
gtcgttaaaa aattcatgtc ctatttagag ataagagcgg cctcgccgtt tacttctcac 1320
tttccagttc ttgtcgacat ggcagcgctg tcattgcccc tttcgctgtt actgcaagcg 1380
ctccgcaacg ttgagcgaga tcgataattc gtcgcatttc tctctcatct gtagataatc 1440
ccgtagagga cagacctgtg agtaacccgg caacgaacgc atctcccgcc ccagtgctat 1500
cgacacaatt cacagacatt ccagcaaaat ggtggacttg tcctcgataa cagaccacca 1560
ccccttctgc acctttagtc accaacagca tggcgatctc atactctttt gccagggcgc 1620
atatatcccg atcgttctgt gtttttccac tgataagtcg ccattcttct tccgagagct 1680
tgacgacatc cgccagttgt agcgcctgcc gcaaacacaa gcggagcaaa tgctcgtctt 1740
gccatagatc ttcacgaata ttgggatcga agctgacaaa acctccggca tgccggatcg 1800
ccgtcatcgc agtaaatgcg ctggtacgcg aaggctcggc agacaacgca attgaacaga 1860
gatgtaacca ttcgccatgt cgccagcagg gcaagtctgt cgtctctaaa aaaagatcgg 1920
cactggggcg gaccataaac gtaaatgaac gttctccttg atcgttcaga tcgacaagca 1980
ccgtggatgt ccggtgccat tcatcttgct tcagatacgt gatatcgaca ccctcagtta 2040
gcagcgttct ttgcattaac gcaccaaaag gatcatcacc gacccgacct ataaacccac 2100
ttgttccgcc taatctggcg attcccaccg caacgttagc tggcgcgccg ccaggacaag 2160
gcagtagccg cccgtctgat tctggcaaga gatctacgac cgcatcccct aaaacccata 2220
ctttggctga catttttttc ccttaaattc atctgactta cgcatagtga taaacctctt 2280
tttcgcaaaa tcgtcatgga tttactaaaa catgcatatt cgatcacaaa acgtcatagt 2340
taacgttaac atttgtgata ttcatcgcat ttatgaaagt aagggacttt atttttataa 2400
aagttaacgt taacaattca ccaaatttgc ttaaccagga tgattaaaat gacgcaatct 2460
cgattgcatg cggcgcaaaa cgcactagca aaacttcacg agcgccgagg taacactttc 2520
tatccccatt ttcacctcgc gcctcctgcc gggtggatga acgatccaaa cggcctgatc 2580
tggtttaacg atcgttatca cgcgttttat caacatcacc cgatgagcga acactggggg 2640
ccaatgcact ggggacatgc caccagcgac gatatgatcc actggcagca tgagcctatt 2700
gcgctagcgc caggagacga gaatgacaaa gacgggtgtt tttcaggtag tgctgtcgat 2760
gacaatggtg tcctctcact tatctacacc ggacacgtct ggctcgatgg tgcaggtaat 2820
gacgatgcaa ttcgcgaagt acaatgtctg gctaccagtc gggatggtat tcatttcgag 2880
aaacagggtg tgatcctcac tccaccagaa ggcatcatgc acttccgcga tcctaaagtg 2940
tggcgtgaag ccgacacatg gtggatggta gtcggggcga aagacccagg caacacgggg 3000
cagatcctgc tttatcgcgg cagttcattg cgtgaatgga ctttcgatcg cgtactggcc 3060
cacgctgatg cgggtgaaag ctatatgtgg gaatgtccgg actttttcag ccttggcgat 3120
cagcattatc tgatgttttc cccgcaggga atgaatgccg agggatacag ttatcgaaat 3180
cgctttcaaa gtggcgtaat acccggaatg tggtcgccag gacgactttt tgcacaatcc 3240
gggcatttta ctgaacttga taacgggcat gacttttatg caccacaaag ctttgtagcg 3300
aaggatggtc ggcgtattgt tatcggctgg atggatatgt gggaatcgcc aatgccctca 3360
aaacgtgaag gctgggcagg ctgcatgacg ctggcgcgcg agctatcaga gagcaatggc 3420
aaactcctac aacgcccggt acacgaagct gagtcgttac gccagcagca tcaatctatc 3480
tctccccgca caatcagcaa taaatatgtt ttgcaggaaa acgcgcaagc agttgagatt 3540
cagttgcagt gggcgctgaa gaacagtgat gccgaacatt acggattaca gctcggcgct 3600
ggaatgcggc tgtatattga taaccaatct gagcgacttg ttttgtggcg gtattaccca 3660
cacgagaatt tagatggcta ccgtagtatt cccctcccgc agggtgacat gctcgcccta 3720
aggatattta tcgatacatc atccgtggaa gtatttatta acgacgggga ggcggtgatg 3780
agtagccgaa tatatccgca gccagaagaa cgggaactgt cgctctatgc ctcccacgga 3840
gtggctgtgc tgcaacatgg agcactctgg caactgggtt aacataatat caggtggaac 3900
aacggatcaa cagcgggcaa gggatccgcg tcactcttcc cccttcacga ccttcaataa 3960
tatgcaatgc agcttcccgc ccgataatgt catgtggaag ctgaattgtg gtcagcggcg 4020
gtaaaaacag atgcccgacg ccaaccagat tatcaaagcc cattacggcg acatcctgcg 4080
ggatacgtac ccccttcgcc aaaagaacct gataagccac aaaggctgcg cgatcgttac 4140
cacatatcag aacatcaaaa tctggtttgc ccgatttgaa gtgggcattg agtaaacttg 4200
cgagatcggt gtagtgatca tcacctgttg ccatgtgaaa ttgtttcacc tcagccagat 4260
ctcgtccagc atcacgccag gcctgctcaa atccctgccg acgataccct gttgccaacg 4320
cactttccgg tagccagaag cataacggtt gacgatagcc cgccgcgagc aaatgctgtg 4380
ttgattcata ttgtgcagtg taatcatcag ggatataact gggtaacgct gggtcatccg 4440
ccacacagtt cgccaataca atattttcac catacagaga ctcaggcagc gtgatatgtc 4500
gcagccccat tgtagtatag ataatgccat ccggacggtg ggcaagcagc tgacgtgccg 4560
cgcgggcagc gtcatcttca gaaaaaatat tgattaaaaa actattccag ccgaactcgc 4620
tggcggtttg ctcaatggca agcagaatat caacagagaa aggagtggta gcagtgtcct 4680
gcgccagcac ggcgagagtc gacggcttac gtccttgagc gcgcatctta cgggcggaaa 4740
gatcaggaac ataattcagg gtctggattg cctgcaatac gcggtcacgc gttgcaggac 4800
gcacagattc tgcattatgc atcacccggg agactgtcat catcgacact cccgccaggc 4860
gtgcgacatc ctttaatgaa gccat 4885
<210> 83
<211> 3771
<212> DNA
<213> Escherichia coli
<400> 83
atgaaaataa agaacattct actcaccctt tgcacctcac tcctgcttac caacgttgct 60
gcacacgcca aagaagtcaa aataggtatg gcgattgatg atctccgtct tgaacgctgg 120
caaaaagatc gagatatctt tgtgaaaaag gcagaatctc tcggcgcgaa agtatttgta 180
cagtctgcaa atggcaatga agaaacacaa atgtcgcaga ttgaaaacat gataaaccgg 240
ggtgtcgatg ttcttgtcat tattccgtat aacggtcagg tattaagtaa cgttgtaaaa 300
gaagccaaac aagaaggcat taaagtatta gcttacgacc gtatgattaa cgatgcggat 360
atcgattttt atatttcttt cgataacgaa aaagtcggtg aactgcaggc aaaagccctg 420
gtcgatattg ttccgcaagg taattacttc ctgatgggcg gctcgccggt agataacaac 480
gccaagctgt tccgcgccgg acaaatgaaa gtgttaaaac cttacgttga ttccggaaaa 540
attaaagtcg ttggtgacca atgggttgat ggctggttac cggaaaacgc attgaaaatt 600
atggaaaacg cgctaaccgc caataataac aaaattgatg ctgtagttgc ctcaaacgat 660
gccaccgcag gtggggcaat tcaggcatta agcgcgcaag gtttatcagg gaaagtagca 720
atctccggcc aggatgcgga tctcgcaggt attaaacgta ttgctgccgg tacgcaaact 780
atgacggtgt ataaacctat tacgttgttg gcaaatactg ccgcagaaat tgccgttgag 840
ttgggcaatg gtcaggaacc aaaagcagat accacactga ataatggcct gaaagatgtc 900
ccctcccgcc tcctgacacc gatcgatgtg aataaaaaca acatcaaaga tacggtaatt 960
aaagacggat tccacaaaga gagcgagctg taagcgttac gccccagcgc ggagcggggg 1020
cgtgatttct ctccatgccg cgtgaatgaa ttggcttagg tggagtcgtt atgccttatc 1080
tacttgaaat gaagaacatt accaaaacct tcggcagtgt gaaggcgatt gataacgtct 1140
gcttgcggtt gaatgctggc gaaatcgtct cactttgtgg ggaaaatggg tctggtaaat 1200
caacgctgat gaaagtgctg tgtggtattt atccccatgg ctcctacgaa ggcgaaatta 1260
tttttgcggg agaagagatt caggcgagtc acatccgcga taccgaacgc aaaggtatcg 1320
ccatcattca tcaggaattg gccctggtga aagaattgac cgtgctggaa aatatcttcc 1380
tgggtaacga aataacccac aatggcatta tggattatga cctgatgacg ctacgctgtc 1440
agaagctgct cgcacaggtc agtttatcca tttcacctga tacccgcgtt ggcgatttag 1500
ggcttgggca acaacaactg gttgaaattg ccaaggcact taataaacag gtgcgcttgt 1560
taattctcga tgaaccgaca gcctcattaa ctgagcagga aacgtcgatt ttactggata 1620
ttattcgcga tctacaacag cacggtatcg cctgtattta tatttcgcac aaactcaacg 1680
aagtcaaagc gatttccgat acgatttgcg ttattcgcga cggacagcac attggtacgc 1740
gtgatgctgc cggaatgagt gaagacgata ttatcaccat gatggtcggg cgagagttaa 1800
ccgcgcttta ccctaatgaa ccacatacca ccggagatga aatattacgt attgaacatc 1860
tgacggcatg gcatccggtt aatcgtcata ttaaacgagt taatgatgtc tcgttttccc 1920
tgaaacgtgg cgaaatattg ggtattgccg gactcgttgg tgccggacgt accgagacca 1980
ttcagtgcct gtttggtgtg tggcccggac aatgggaagg aaaaatttat attgatggca 2040
aacaggtaga tattcgtaac tgtcagcaag ccatcgccca ggggattgcg atggtccccg 2100
aagacagaaa gcgcgacggc atcgttccgg taatggcggt tggtaaaaat attaccctcg 2160
ccgcactcaa taaatttacc ggtggcatta gccagcttga tgacgcggca gagcaaaaat 2220
gtattctgga atcaatccag caactcaaag ttaaaacgtc gtcccccgac cttgctattg 2280
gacgtttgag cggcggcaat cagcaaaaag cgatcctcgc tcgctgtctg ttacttaacc 2340
cgcgcattct cattcttgat gaacccacca ggggtatcga tattggcgcg aaatacgaga 2400
tctacaaatt aattaaccaa ctcgtccagc agggtattgc cgttattgtc atctcttccg 2460
aattacctga agtgctcggc cttagcgatc gtgtactggt gatgcatgaa gggaaactaa 2520
aagccaacct gataaatcat aacctgactc aggagcaggt gatggaagcc gcattgagga 2580
gcgaacatca tgtcgaaaag caatccgtct gaagtgaaat tggccgtacc gacatccggt 2640
ggcttctccg ggctgaaatc actgaatttg caggtcttcg tgatgattgc agctatcatc 2700
gcaatcatgc tgttctttac ctggaccacc gatggtgcct acttaagcgc ccgtaacgtc 2760
tccaacctgt tacgccagac cgcgattacc ggcatcctcg cggtaggaat ggtgttcgtc 2820
ataatttctg ctgaaatcga cctttccgtc ggctcaatga tggggctgtt aggtggcgtc 2880
gcggcgattt gtgacgtctg gttaggctgg cctttgccac ttaccatcat tgtgacgctg 2940
gttctgggac tgcttctcgg tgcctggaac ggatggtggg tcgcgtaccg taaagtccct 3000
tcatttattg tcaccctcgc gggcatgttg gcatttcgcg gcatactcat tggcatcacc 3060
aacggcacga ctgtatcccc caccagcgcc gcgatgtcac aaattgggca aagctatctc 3120
cccgccagta ccggcttcat cattggcgcg cttggcttaa tggcttttgt tggttggcaa 3180
tggcgcggaa gaatgcgccg tcaggctttg ggtttacagt ctccggcctc taccgcagta 3240
gtcggtcgcc aggctttaac cgctatcatc gtattaggcg caatctggct gttgaatgat 3300
taccgtggcg ttcccactcc tgttctgctg ctgacgttgc tgttactcgg cggaatgttt 3360
atggcaacgc ggacggcatt tggacgacgc atttatgcca tcggcggcaa tctggaagca 3420
gcacgtctct ccgggattaa cgttgaacgc accaaacttg ccgtgttcgc gattaacgga 3480
ttaatggtag ccatcgccgg attaatcctt agttctcgac ttggcgctgg ttcaccttct 3540
gcgggaaata tcgccgaact ggacgcaatt gcagcatgcg tgattggcgg caccagcctg 3600
gctggcggtg tgggaagcgt tgccggagca gtaatggggg catttatcat ggcttcactg 3660
gataacggca tgagtatgat ggatgtaccg accttctggc agtatatcgt taaaggtgcg 3720
attctgttgc tggcagtatg gatggactcc gcaaccaaac gccgttcttg a 3771
<210> 84
<211> 1395
<212> DNA
<213> Escherichia coli
<400> 84
atgcctgacg ctaaaaaaca ggggcggtca aacaaggcaa tgacgttttt cgtctgcttc 60
cttgccgctc tggcgggatt actctttggc ctggatatcg gtgtaattgc tggcgcactg 120
ccgtttattg cagatgaatt ccagattact tcgcacacgc aagaatgggt cgtaagctcc 180
atgatgttcg gtgcggcagt cggtgcggtg ggcagcggct ggctctcctt taaactcggg 240
cgcaaaaaga gcctgatgat cggcgcaatt ttgtttgttg ccggttcgct gttctctgcg 300
gctgcgccaa acgttgaagt actgattctt tcccgcgttc tactggggct ggcggtgggt 360
gtggcctctt ataccgcacc gctgtacctc tctgaaattg cgccggaaaa aattcgtggc 420
agtatgatct cgatgtatca gttgatgatc actatcggga tcctcggtgc ttatctttct 480
gataccgcct tcagctacac cggtgcatgg cgctggatgc tgggtgtgat tatcatcccg 540
gcaattttgc tgctgattgg tgtcttcttc ctgccagaca gcccacgttg gtttgccgcc 600
aaacgccgtt ttgttgatgc cgaacgcgtg ctgctacgcc tgcgtgacac cagcgcggaa 660
gcgaaacgcg aactggatga aatccgtgaa agtttgcagg ttaaacagag tggctgggcg 720
ctgtttaaag agaacagcaa cttccgccgc gcggtgttcc ttggcgtact gttgcaggta 780
atgcagcaat tcaccgggat gaacgtcatc atgtattacg cgccgaaaat cttcgaactg 840
gcgggttata ccaacactac cgagcaaatg tgggggaccg tgattgtcgg cctgaccaac 900
gtacttgcca cctttatcgc aatcggcctt gttgaccgct ggggacgtaa accaacgcta 960
acgctgggct tcctggtgat ggctgctggc atgggcgtac tcggtacaat gatgcatatc 1020
ggtattcact ctccgtcggc gcagtatttc gccatcgcca tgctgctgat gtttattgtc 1080
ggttttgcca tgagtgccgg tccgctgatt tgggtactgt gctccgaaat tcagccgctg 1140
aaaggccgcg attttggcat cacctgctcc actgccacca actggattgc caacatgatc 1200
gttggcgcaa cgttcctgac catgctcaac acgctgggta acgccaacac cttctgggtg 1260
tatgcggctc tgaacgtact gtttatcctg ctgacattgt ggctggtacc ggaaaccaaa 1320
cacgtttcgc tggaacatat tgaacgtaat ctgatgaaag gtcgtaaact gcgcgaaata 1380
ggcgctcacg attaa 1395
<210> 85
<211> 1422
<212> DNA
<213> Zymomonas mobilis
<400> 85
atgagttctg aaagtagtca gggtctagtc acgcgactag ccctaatcgc tgctataggc 60
ggcttgcttt tcggttacga ttcagcggtt atcgctgcaa tcggtacacc ggttgatatc 120
cattttattg cccctcgtca cctgtctgct acggctgcgg cttccctttc tgggatggtc 180
gttgttgctg ttttggtcgg ttgtgttacc ggttctttgc tgtctggctg gattggtatt 240
cgcttcggtc gtcgcggcgg attgttgatg agttccattt gtttcgtcgc cgccggtttt 300
ggtgctgcgt taaccgaaaa attatttgga accggtggtt cggctttaca aattttttgc 360
tttttccggt ttcttgccgg tttaggtatc ggtgtcgttt caaccttgac cccaacctat 420
attgctgaaa ttcgtccgcc agacaaacgt ggtcagatgg tttctggtca gcagatggcc 480
attgtgacgg gtgctttaac cggttatatc tttacctggt tactggctca tttcggttct 540
atcgattggg ttaatgccag tggttggtgc tggtctccgg cttcagaagg cctgatcggt 600
attgccttct tattgctgct gttaaccgca ccggatacgc cgcattggtt ggtgatgaag 660
ggacgtcatt ccgaggctag caaaatcctt gctcgtctgg aaccgcaagc cgatcctaat 720
ctgacgattc aaaagattaa agctggcttt gataaagcca tggacaaaag cagcgcaggt 780
ttgtttgctt ttggtatcac cgttgttttt gccggtgtat ccgttgctgc cttccagcag 840
ttagtcggta ttaacgccgt gctgtattat gcaccgcaga tgttccagaa tttaggtttt 900
ggagctgata cggcattatt gcagaccatc tctatcggtg ttgtgaactt catcttcacc 960
atgattgctt cccgtgttgt tgaccgcttc ggccgtaaac ctctgcttat ttggggtgct 1020
ctcggtatgg ctgcaatgat ggctgtttta ggctgctgtt tctggttcaa agtcggtggt 1080
gttttgcctt tggcttctgt gcttctttat attgcagtct ttggtatgtc atggggccct 1140
gtctgctggg ttgttctgtc agaaatgttc ccgagttcca tcaagggcgc agctatgcct 1200
atcgctgtta ccggacaatg gttagctaat atcttggtta acttcctgtt taaggttgcc 1260
gatggttctc cagcattgaa tcagactttc aaccacggtt tctcctatct cgttttcgca 1320
gcattaagta tcttaggtgg cttgattgtt gctcgcttcg tgccggaaac caaaggtcgg 1380
agcctggatg aaatcgagga gatgtggcgc tcccagaagt ag 1422
<210> 86
<211> 984
<212> DNA
<213> Zymomonas mobilis
<400> 86
atggaaattg ttgcgattga catcggtgga acgcatgcgc gtttctctat tgcggaagta 60
agcaatggtc gggttctttc tcttggagaa gaaacaactt ttaaaacggc agaacatgct 120
agcttgcagt tagcttggga acgtttcggt gaaaaactgg gtcgtcctct gccacgtgcc 180
gcagctattg catgggctgg cccggttcat ggtgaagttt taaaacttac caataaccct 240
tgggtattaa gaccagctac tctgaatgaa aagctggaca tcgatacgca tgttctgatc 300
aatgacttcg gcgcggttgc ccacgcggtt gcgcatatgg attcttctta tctggatcat 360
atttgtggtc ctgatgaagc gcttcctagc gatggtgtta tcactattct tggtccggga 420
acgggcttgg gtgttgccca tctgttgcgg actgaaggcc gttatttcgt catcgaaact 480
gaaggcggtc atatcgactt tgctccgctt gacagacttg aagacaaaat tctggcacgt 540
ttacgtgaac gtttccgccg cgtttctatc gaacgcatta tttctggccc gggtcttggt 600
aatatctacg aagcactggc tgccattgaa ggcgttccgt tcagcttgct ggatgatatt 660
aaattatggc agatggcttt ggaaggtaaa gacaaccttg ctgaagccgc tttggatcgc 720
ttctgcttga gccttggcgc tatcgctggt gatcttgctt tggcacaggg tcgaaccagt 780
gttgttattg gcggtggtgt cggtcttcgt atcgcttccc atttgccaga atctggtttc 840
cgtcagcgct ttgtttcaaa aggacgcttt gaacgcgtca tgtccaagat tccggttaag 900
ttgattactt atccgcagcc tggactgttg ggtgcgcagc tgcctatgcc aacaaatatt 960
ctgaagttga ataatatttt ttaa 984
<210> 87
<211> 966
<212> DNA
<213> Escherichia coli
<400> 87
atgacaaagt atgcattagt cggtgatgtg ggcggcacca acgcacgtct tgctctgtgt 60
gatattgcca gtggtgaaat ctcgcaggct aagacctatt cagggcttga ttaccccagc 120
ctcgaagcgg tcattcgcgt ttatcttgaa gaacataagg tcgaggtgaa agacggctgt 180
attgccatcg cttgcccaat taccggtgac tgggtggcga tgaccaacca tacctgggcg 240
ttctcaattg ccgaaatgaa aaagaatctc ggttttagcc atctggaaat tattaacgat 300
tttaccgctg tatcgatggc gatcccgatg ctgaaaaaag agcatctgat tcagtttggt 360
ggcgcagaac cggtcgaagg taagcctatt gcggtttacg gtgccggaac ggggcttggg 420
gttgcgcatc tggtccatgt cgataagcgt tgggtaagct tgccaggcga aggcggtcac 480
gttgattttg cgccgaatag tgaagaagag gccattatcc tcgaaatatt gcgtgcggaa 540
attggtcatg tttcggcgga gcgcgtgctt tctggccctg ggctggtgaa tttgtatcgc 600
gcaattgtga aagctgacaa ccgcctgcca gaaaatctca agccaaaaga tattaccgaa 660
cgcgcgctgg ctgacagctg caccgattgc cgccgcgcat tgtcgctgtt ttgcgtcatt 720
atgggccgtt ttggcggcaa tctggcgctc aatctcggga catttggcgg cgtgtttatt 780
gcgggcggta tcgtgccgcg cttccttgag ttcttcaaag cctccggttt ccgtgccgca 840
tttgaagata aagggcgctt taaagaatat gtccatgata ttccggtgta tctcatcgtc 900
catgacaatc cgggccttct cggttccggt gcacatttac gccagacctt aggtcacatt 960
ctgtaa 966
<210> 88
<211> 2547
<212> DNA
<213> Escherichia coli
<400> 88
ttgtacctct atattgagac tctgaaacag agactggatg ccataaatca attgcgtgtg 60
gatcgcgcgc ttgctgctat ggggcctgca ttccaacagg tctacagtct actgccgaca 120
ttgttgcact atcaccatcc gctaatgccg ggttaccttg atggtaacgt tcccaaaggc 180
atttgccttt acacgcctga tgaaactcaa cgccactacc tgaacgagct tgaactgtat 240
cgtggaatgt cagtacagga tccgccgaaa ggtgagcttc caattactgg tgtatacacc 300
atgggcagca cctcgtccgt agggcaaagt tgttcctctg acctggatat ctgggtctgt 360
catcaatcct ggctcgatag cgaagagcgc caattgctac aacgtaaatg tagcctgctg 420
gaaaactggg ccgcctcgct gggtgtggaa gtcagcttct tcctgattga tgaaaaccgc 480
ttccgtcata atgaaagcgg cagcctgggg ggcgaagatt gtggctccac ccagcatata 540
ctgctgcttg acgaatttta tcgtaccgcc gtgcgtctcg ccggtaagcg tattctgtgg 600
aatatggtgc cgtgcgacga agaagagcat tacgacgact atgtgatgac gctttacgcg 660
cagggcgtgc tgacgccaaa tgaatggctg gatctcggtg gcttaagctc gctttctgct 720
gaagagtact ttggtgccag cctttggcag ctctacaaga gtatcgattc cccatacaaa 780
gcggtactga aaacactgct gctggaagcc tattcctggg aatacccgaa cccacgtctg 840
ctggcgaaag atatcaaaca gcgtttgcac gacggcgaga ttgtatcgtt tggtctcgat 900
ccatactgca tgatgctgga gcgtgttact gaatacctga cggcgattga agattttacc 960
cgtctggatt tagtacgtcg ctgcttctat ttaaaagtgt gcgaaaagct cagccgtgaa 1020
cgcgcctgcg taggctggcg tcgcgcagtg ttgagccagt tagtgagcga gtggggttgg 1080
gacgaagctc gtctggcaat gctcgataac cgcgctaact ggaagattga tcaggtgcgt 1140
gaggcgcaca acgagttgct cgacgcgatg atgcagagct accgtaatct gatccgcttt 1200
gcgcgtcgca ataaccttag cgtctccgcc agtccgcagg atatcggcgt gctgacgcgt 1260
aagctgtatg ccgcgtttga agcattacca ggtaaagtga cgctggtaaa cccgcagatt 1320
tcacccgatc tctcggaacc gaatctgacc tttatttatg tgccgccggg ccgggctaac 1380
cgttcaggtt ggtatctgta taaccgcgcg ccaaatattg agtcgatcat cagccatcag 1440
ccgctggaat ataaccgtta cctgaataaa ctggtggcgt gggcatggtt taacggcctg 1500
ctgacctcgc gcacccgttt gtatattaaa ggtaacggca ttgtcgattt gcctaagttg 1560
caggagatgg tcgccgacgt gtcgcaccat ttcccgctgc gcttacctgc accgacaccg 1620
aaggcgctct acagcccgtg tgagatccgc catctggcga ttatcgttaa cctggaatat 1680
gacccgacag cggcgttccg caatcaggtg gtgcatttcg atttccgtaa gctggatgtc 1740
ttcagctttg gcgagaatca aaattgcctg gtaggtagcg ttgacctgct gtaccgcaac 1800
tcgtggaacg aagtgcgtac gctgcacttc aacggcgagc aatcgatgat cgaagccctg 1860
aaaactattc tcggcaaaat gcatcaggac gccgcaccgc cagatagcgt ggaagtcttc 1920
tgttatagcc agcatctgcg cggcttaatt cgtactcgcg tgcagcaact ggtttctgag 1980
tgtattgaat tgcgtctttc cagcacccgc caggaaaccg ggcgtttcaa ggcgctgcgc 2040
gtttctggtc aaacctgggg gttgttcttc gaacgcctga atgtatcggt acagaaactg 2100
gaaaacgcca tcgagtttta tggcgcgatt tcgcataaca aactgcacgg cctgtcagtg 2160
caggttgaaa ccaatcacgt caaattaccg gcggtggtgg acggctttgc cagcgaaggg 2220
atcatccagt tctttttcga agaaacgcaa gacgagaatg gctttaatat ctacattctc 2280
gacgaaagca accgggttga ggtatatcac cactgcgaag gcagcaaaga ggagctggta 2340
cgtgacgtca gtcgcttcta ctcgtcatcg catgaccgtt ttacctacgg ctcaagcttc 2400
atcaacttca acctgccgca gttctatcag attgtgaagg ttgatggtcg tgaacaggtg 2460
attccgttcc gcacaaaatc tatcggtaac atgccgcctg ccaatcagga tcacgatacg 2520
ccgctattac agcaatattt ttcgtga 2547
<210> 89
<211> 210
<212> PRT
<213> Escherichia coli
<400> 89
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu
1 5 10 15
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His
20 25 30
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val
35 40 45
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr
50 55 60
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly
65 70 75 80
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala
85 90 95
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp
100 105 110
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr
115 120 125
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile
130 135 140
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His
145 150 155 160
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile
165 170 175
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu
180 185 190
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly
195 200 205
Thr Arg
210
<210> 90
<211> 210
<212> PRT
<213> Artificial
<220>
<223> Crp* mutant
<400> 90
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu
1 5 10 15
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His
20 25 30
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val
35 40 45
Ala Val Leu Ile Lys His Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr
50 55 60
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly
65 70 75 80
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala
85 90 95
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp
100 105 110
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr
115 120 125
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile
130 135 140
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His
145 150 155 160
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile
165 170 175
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu
180 185 190
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly
195 200 205
Thr Arg
210
<210> 91
<211> 210
<212> PRT
<213> Artificial
<220>
<223> Crp* mutant
<400> 91
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu
1 5 10 15
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His
20 25 30
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val
35 40 45
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Phe Tyr
50 55 60
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly
65 70 75 80
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala
85 90 95
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp
100 105 110
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr
115 120 125
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile
130 135 140
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His
145 150 155 160
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile
165 170 175
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu
180 185 190
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly
195 200 205
Thr Arg
210
<210> 92
<211> 210
<212> PRT
<213> Artificial
<220>
<223> Crp* mutant
<400> 92
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu
1 5 10 15
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His
20 25 30
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val
35 40 45
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr
50 55 60
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly
65 70 75 80
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala
85 90 95
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp
100 105 110
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr
115 120 125
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Asp Arg Ile
130 135 140
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His
145 150 155 160
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile
165 170 175
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu
180 185 190
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly
195 200 205
Thr Arg
210
<210> 93
<211> 210
<212> PRT
<213> Artificial
<220>
<223> Crp* mutant
<400> 93
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu
1 5 10 15
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His
20 25 30
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val
35 40 45
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr
50 55 60
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly
65 70 75 80
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala
85 90 95
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp
100 105 110
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr
115 120 125
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Asp Ile
130 135 140
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His
145 150 155 160
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile
165 170 175
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu
180 185 190
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly
195 200 205
Thr Arg
210
<210> 94
<211> 2932
<212> DNA
<213> Escherichia coli
<400> 94
atgcgaattg gcataccaag agaacggtta accaatgaaa cccgtgttgc agcaacgcca 60
aaaacagtgg aacagctgct gaaactgggt tttaccgtcg cggtagagag cggcgcgggt 120
caactggcaa gttttgacga taaagcgttt gtgcaagcgg gcgctgaaat tgtagaaggg 180
aatagcgtct ggcagtcaga gatcattctg aaggtcaatg cgccgttaga tgatgaaatt 240
gcgttactga atcctgggac aacgctggtg agttttatct ggcctgcgca gaatccggaa 300
ttaatgcaaa aacttgcgga acgtaacgtg accgtgatgg cgatggactc tgtgccgcgt 360
atctcacgcg cacaatcgct ggacgcacta agctcgatgg cgaacatcgc cggttatcgc 420
gccattgttg aagcggcaca tgaatttggg cgcttcttta ccgggcaaat tactgcggcc 480
gggaaagtgc caccggcaaa agtgatggtg attggtgcgg gtgttgcagg tctggccgcc 540
attggcgcag caaacagtct cggcgcgatt gtgcgtgcat tcgacacccg cccggaagtg 600
aaagaacaag ttcaaagtat gggcgcggaa ttcctcgagc tggattttaa agaggaagct 660
ggcagcggcg atggctatgc caaagtgatg tcggacgcgt tcatcaaagc ggaaatggaa 720
ctctttgccg cccaggcaaa agaggtcgat atcattgtca ccaccgcgct tattccaggc 780
aaaccagcgc cgaagctaat tacccgtgaa atggttgact ccatgaaggc gggcagtgtg 840
attgtcgacc tggcagccca aaacggcggc aactgtgaat acaccgtgcc gggtgaaatc 900
ttcactacgg aaaatggtgt caaagtgatt ggttataccg atcttccggg ccgtctgccg 960
acgcaatcct cacagcttta cggcacaaac ctcgttaatc tgctgaaact gttgtgcaaa 1020
gagaaagacg gcaatatcac tgttgatttt gatgatgtgg tgattcgcgg cgtgaccgtg 1080
atccgtgcgg gcgaaattac ctggccggca ccgccgattc aggtatcagc tcagccgcag 1140
gcggcacaaa aagcggcacc ggaagtgaaa actgaggaaa aatgtacctg ctcaccgtgg 1200
cgtaaatacg cgttgatggc gctggcaatc attctttttg gctggatggc aagcgttgcg 1260
ccgaaagaat tccttgggca cttcaccgtt ttcgcgctgg cctgcgttgt cggttattac 1320
gtggtgtgga atgtatcgca cgcgctgcat acaccgttga tgtcggtcac caacgcgatt 1380
tcagggatta ttgttgtcgg agcactgttg cagattggcc agggcggctg ggttagcttc 1440
cttagtttta tcgcggtgct tatagccagc attaatattt tcggtggctt caccgtgact 1500
cagcgcatgc tgaaaatgtt ccgcaaaaat taaggggtaa catatgtctg gaggattagt 1560
tacagctgca tacattgttg ccgcgatcct gtttatcttc agtctggccg gtctttcgaa 1620
acatgaaacg tctcgccagg gtaacaactt cggtatcgcc gggatggcga ttgcgttaat 1680
cgcaaccatt tttggaccgg atacgggtaa tgttggctgg atcttgctgg cgatggtcat 1740
tggtggggca attggtatcc gtctggcgaa gaaagttgaa atgaccgaaa tgccagaact 1800
ggtggcgatc ctgcatagct tcgtgggtct ggcggcagtg ctggttggct ttaacagcta 1860
tctgcatcat gacgcgggaa tggcaccgat tctggtcaat attcacctga cggaagtgtt 1920
cctcggtatc ttcatcgggg cggtaacgtt cacgggttcg gtggtggcgt tcggcaaact 1980
gtgtggcaag atttcgtcta aaccattgat gctgccaaac cgtcacaaaa tgaacctggc 2040
ggctctggtc gtttccttcc tgctgctgat tgtatttgtt cgcacggaca gcgtcggcct 2100
gcaagtgctg gcattgctga taatgaccgc aattgcgctg gtattcggct ggcatttagt 2160
cgcctccatc ggtggtgcag atatgccagt ggtggtgtcg atgctgaact cgtactccgg 2220
ctgggcggct gcggctgcgg gctttatgct cagcaacgac ctgctgattg tgaccggtgc 2280
gctggtcggt tcttcggggg ctatcctttc ttacattatg tgtaaggcga tgaaccgttc 2340
ctttatcagc gttattgcgg gtggtttcgg caccgacggc tcttctactg gcgatgatca 2400
ggaagtgggt gagcaccgcg aaatcaccgc agaagagaca gcggaactgc tgaaaaactc 2460
ccattcagtg atcattactc cggggtacgg catggcagtc gcgcaggcgc aatatcctgt 2520
cgctgaaatt actgagaaat tgcgcgctcg tggtattaat gtgcgtttcg gtatccaccc 2580
ggtcgcgggg cgtttgcctg gacatatgaa cgtattgctg gctgaagcaa aagtaccgta 2640
tgacatcgtg ctggaaatgg acgagatcaa tgatgacttt gctgataccg ataccgtact 2700
ggtgattggt gctaacgata cggttaaccc ggcggcgcag gatgatccga agagtccgat 2760
tgctggtatg cctgtgctgg aagtgtggaa agcgcagaac gtgattgtct ttaaacgttc 2820
gatgaacact ggctatgctg gtgtgcaaaa cccgctgttc ttcaaggaaa acacccacat 2880
gctgtttggt gacgccaaag ccagcgtgga tgcaatcctg aaagctctgt aa 2932
<210> 95
<211> 1401
<212> DNA
<213> Escherichia coli
<400> 95
atgccacatt cctacgatta cgatgccata gtaataggtt ccggccccgg cggcgaaggc 60
gctgcaatgg gcctggttaa gcaaggtgcg cgcgtcgcag ttatcgagcg ttatcaaaat 120
gttggcggcg gttgcaccca ctggggcacc atcccgtcga aagctctccg tcacgccgtc 180
agccgcatta tagaattcaa tcaaaaccca ctttacagcg accattcccg actgctccgc 240
tcttcttttg ccgatatcct taaccatgcc gataacgtga ttaatcaaca aacgcgcatg 300
cgtcagggat tttacgaacg taatcactgt gaaatattgc agggaaacgc tcgctttgtt 360
gacgagcata cgttggcgct ggattgcccg gacggcagcg ttgaaacact aaccgctgaa 420
aaatttgtta ttgcctgcgg ctctcgtcca tatcatccaa cagatgttga tttcacccat 480
ccacgcattt acgacagcga ctcaattctc agcatgcacc acgaaccgcg ccatgtactt 540
atctatggtg ctggagtgat cggctgtgaa tatgcgtcga tcttccgcgg tatggatgta 600
aaagtggatc tgatcaacac ccgcgatcgc ctgctggcat ttctcgatca agagatgtca 660
gattctctct cctatcactt ctggaacagt ggcgtagtga ttcgtcacaa cgaagagtac 720
gagaagatcg aaggctgtga cgatggtgtg atcatgcatc tgaagtcggg taaaaaactg 780
aaagctgact gcctgctcta tgccaacggt cgcaccggta ataccgattc gctggcgtta 840
cagaacattg ggctagaaac tgacagccgc ggacagctga aggtcaacag catgtatcag 900
accgcacagc cacacgttta cgcggtgggc gacgtgattg gttatccgag cctggcgtcg 960
gcggcctatg accaggggcg cattgccgcg caggcgctgg taaaaggcga agccaccgca 1020
catctgattg aagatatccc taccggtatt tacaccatcc cggaaatcag ctctgtgggc 1080
aaaaccgaac agcagctgac cgcaatgaaa gtgccatatg aagtgggccg cgcccagttt 1140
aaacatctgg cacgcgcaca aatcgtcggc atgaacgtgg gcacgctgaa aattttgttc 1200
catcgggaaa caaaagagat tctgggtatt cactgctttg gcgagcgcgc tgccgaaatt 1260
attcatatcg gtcaggcgat tatggaacag aaaggtggcg gcaacactat tgagtacttc 1320
gtcaacacca cctttaacta cccgacgatg gcggaagcct atcgggtagc tgcgttaaac 1380
ggtttaaacc gcctgtttta a 1401
<210> 96
<211> 1434
<212> DNA
<213> Streptococcus mutans
<400> 96
atggcaatga caaaacaata taaaaattat gtcaatggcg agtggaagct ttcagaaaat 60
gaaattaaaa tctacgaacc agccagtgga gctgaattgg gttcagttcc agcaatgagt 120
actgaagaag tagattatgt ttatgcttca gccaagaaag ctcaaccagc ttggcgagca 180
ctttcataca tagaacgtgc tgcctacctt cataaggtag cagatatttt gatgcgtgat 240
aaagaaaaaa taggtgctat tctttccaaa gaggttgcta aaggttataa atcagcagtc 300
agcgaagttg ttcgtactgc agaaatcatt aattatgcag ctgaagaagg tcttcgtatg 360
gaaggtgaag tccttgaagg cggcagtttt gaagcagcca gcaagaaaaa aattgccgtt 420
gttcgtcgtg aaccagtagg tcttgtatta gctatttcac catttaacta ccctgttaac 480
ttggcaggtt cgaaaattgc accggctctt attgcgggaa atgttattgc ttttaaacca 540
ccgacgcaag gatcaatctc agggctctta cttgctgaag catttgctga agctggactt 600
cctgcaggtg tctttaatac cattacaggt cgtggttctg aaattggaga ctatattgta 660
gaacatcaag ccgttaactt tatcaatttc actggttcaa caggaattgg cgaacgtatt 720
ggcaaaatgg ctggtatgcg tccgattatg cttgaactcg gtggaaaaga ttcagccatc 780
gttcttgaag atgcggacct tgaattgact gctaaaaata ttattgcagg tgcttttggt 840
tattcaggtc aacgctgtac agcagttaaa cgtgttcttg tgatggaaag tgttgctgat 900
gaactggtcg aaaaaatccg tgaaaaagtt cttgcattaa caattggtaa tccagaagac 960
gatgcagata ttacaccgtt gattgataca aaatcagctg attatgtaga aggtcttatt 1020
aatgatgcca atgataaagg agccactgcc cttactgaaa tcaaacgtga aggtaatctt 1080
atctgtccaa tcctctttga taaggtaacg acagatatgc gtcttgcttg ggaagaacca 1140
tttggtcctg ttcttccgat cattcgtgtg acatctgtag aagaagccat tgaaatttct 1200
aacaaatcgg aatatggact tcaggcttct atctttacaa atgatttccc acgcgctttt 1260
ggtattgctg agcagcttga agttggtaca gttcatatca ataataagac acagcgcggc 1320
acggacaact tcccattctt aggggctaaa aaatcaggtg caggtattca aggggtaaaa 1380
tattctattg aagctatgac aactgttaaa tccgtcgtat ttgatatcaa ataa 1434
<210> 97
<211> 996
<212> DNA
<213> Escherichia coli
<400> 97
atgactatca aagtaggtat caacggtttt ggccgtatcg gtcgcattgt tttccgtgct 60
gctcagaaac gttctgacat cgagatcgtt gcaatcaacg acctgttaga cgctgattac 120
atggcataca tgctgaaata tgactccact cacggccgtt tcgacggtac cgttgaagtg 180
aaagacggtc atctgatcgt taacggtaaa aaaatccgtg ttaccgctga acgtgatccg 240
gctaacctga aatgggacga agttggtgtt gacgttgtcg ctgaagcaac tggtctgttc 300
ctgactgacg aaactgctcg taaacacatc accgctggtg cgaagaaagt ggttatgact 360
ggtccgtcta aagacaacac tccgatgttc gttaaaggcg ctaacttcga caaatatgct 420
ggccaggaca tcgtttccaa cgcttcctgc accaccaact gcctggctcc gctggctaaa 480
gttatcaacg ataacttcgg catcatcgaa ggtctgatga ccaccgttca cgctactacc 540
gctactcaga aaaccgttga tggcccgtct cacaaagact ggcgcggcgg ccgcggcgct 600
tcccagaaca tcatcccgtc ctctaccggt gctgctaaag ctgtaggtaa agtactgcca 660
gaactgaatg gcaaactgac tggtatggcg ttccgcgttc cgaccccgaa cgtatctgta 720
gttgacctga ccgttcgtct ggaaaaagct gcaacttacg agcagatcaa agctgccgtt 780
aaagctgctg ctgaaggcga aatgaaaggc gttctgggct acaccgaaga tgacgtagta 840
tctaccgatt tcaacggcga agtttgcact tccgtgttcg atgctaaagc tggtatcgct 900
ctgaacgaca acttcgtgaa actggtatcc tggtacgaca acgaaaccgg ttactccaac 960
aaagttctgg acctgatcgc tcacatctcc aaataa 996
<210> 98
<211> 1650
<212> DNA
<213> Escherichia coli
<400> 98
atgaaaaaca tcaatccaac gcagaccgct gcctggcagg cactacagaa acacttcgat 60
gaaatgaaag acgttacgat cgccgatctt tttgctaaag acggcgatcg tttttctaag 120
ttctccgcaa ccttcgacga tcagatgctg gtggattact ccaaaaaccg catcactgaa 180
gagacgctgg cgaaattaca ggatctggcg aaagagtgcg atctggcggg cgcgattaag 240
tcgatgttct ctggcgagaa gatcaaccgc actgaaaacc gcgccgtgct gcacgtagcg 300
ctgcgtaacc gtagcaatac cccgattttg gttgatggca aagacgtaat gccggaagtc 360
aacgcggtgc tggagaagat gaaaaccttc tcagaagcga ttatttccgg tgagtggaaa 420
ggttataccg gcaaagcaat cactgacgta gtgaacatcg ggatcggcgg ttctgacctc 480
ggcccataca tggtgaccga agctctgcgt ccgtacaaaa accacctgaa catgcacttt 540
gtttctaacg tcgatgggac tcacatcgcg gaagtgctga aaaaagtaaa cccggaaacc 600
acgctgttct tggtagcatc taaaaccttc accactcagg aaactatgac caacgcccat 660
agcgcgcgtg actggttcct gaaagcggca ggtgatgaaa aacacgttgc aaaacacttt 720
gcggcgcttt ccaccaatgc caaagccgtt ggcgagtttg gtattgatac tgccaacatg 780
ttcgagttct gggactgggt tggcggccgt tactctttgt ggtcagcgat tggcctgtcg 840
attgttctct ccatcggctt tgataacttc gttgaactgc tttccggcgc acacgcgatg 900
gacaagcatt tctccaccac gcctgccgag aaaaacctgc ctgtactgct ggcgctgatt 960
ggcatctggt acaacaattt ctttggtgcg gaaactgaag cgattctgcc gtatgaccag 1020
tatatgcacc gtttcgcggc gtacttccag cagggcaata tggagtccaa cggtaagtat 1080
gttgaccgta acggtaacgt tgtggattac cagactggcc cgattatctg gggtgaacca 1140
ggcactaacg gtcagcacgc gttctaccag ctgatccacc agggaaccaa aatggtaccg 1200
tgcgatttca tcgctccggc tatcacccat aacccgctct ctgatcatca ccagaaactg 1260
ctgtctaact tcttcgccca gaccgaagcg ctggcgtttg gtaaatcccg cgaagtggtt 1320
gagcaggaat atcgtgatca gggtaaagat ccggcaacgc ttgactacgt ggtgccgttc 1380
aaagtattcg aaggtaaccg cccgaccaac tccatcctgc tgcgtgaaat cactccgttc 1440
agcctgggtg cgttgattgc gctgtatgag cacaaaatct ttactcaggg cgtgatcctg 1500
aacatcttca ccttcgacca gtggggcgtg gaactgggta aacagctggc gaaccgtatt 1560
ctgccagagc tgaaagatga taaagaaatc agcagccacg atagctcgac caatggtctg 1620
attaaccgct ataaagcgtg gcgcggttaa 1650
<210> 99
<211> 963
<212> DNA
<213> Escherichia coli
<400> 99
atgattaaga aaatcggtgt gttgacaagc ggcggtgatg cgccaggcat gaacgccgca 60
attcgcgggg ttgttcgttc tgcgctgaca gaaggtctgg aagtaatggg tatttatgac 120
ggctatctgg gtctgtatga agaccgtatg gtacagctag accgttacag cgtgtctgac 180
atgatcaacc gtggcggtac gttcctcggt tctgcgcgtt tcccggaatt ccgcgacgag 240
aacatccgcg ccgtggctat cgaaaacctg aaaaaacgtg gtatcgacgc gctggtggtt 300
atcggcggtg acggttccta catgggtgca atgcgtctga ccgaaatggg cttcccgtgc 360
atcggtctgc cgggcactat cgacaacgac atcaaaggca ctgactacac tatcggtttc 420
ttcactgcgc tgagcaccgt tgtagaagcg atcgaccgtc tgcgtgacac ctcttcttct 480
caccagcgta tttccgtggt ggaagtgatg ggccgttatt gtggagatct gacgttggct 540
gcggccattg ccggtggctg tgaattcgtt gtggttccgg aagttgaatt cagccgtgaa 600
gacctggtaa acgaaatcaa agcgggtatc gcgaaaggta aaaaacacgc gatcgtggcg 660
attaccgaac atatgtgtga tgttgacgaa ctggcgcatt tcatcgagaa agaaaccggt 720
cgtgaaaccc gcgcaactgt gctgggccac atccagcgcg gtggttctcc ggtgccttac 780
gaccgtattc tggcttcccg tatgggcgct tacgctatcg atctgctgct ggcaggttac 840
ggcggtcgtt gtgtaggtat ccagaacgaa cagctggttc accacgacat catcgacgct 900
atcgaaaaca tgaagcgtcc gttcaaaggt gactggctgg actgcgcgaa aaaactgtat 960
taa 963
<210> 100
<211> 1476
<212> DNA
<213> Escherichia coli
<400> 100
atggcggtaa cgcaaacagc ccaggcctgt gacctggtca ttttcggcgc gaaaggcgac 60
cttgcgcgtc gtaaattgct gccttccctg tatcaactgg aaaaagccgg tcagctcaac 120
ccggacaccc ggattatcgg cgtagggcgt gctgactggg ataaagcggc atataccaaa 180
gttgtccgcg aggcgctcga aactttcatg aaagaaacca ttgatgaagg tttatgggac 240
accctgagtg cacgtctgga tttttgtaat ctcgatgtca atgacactgc tgcattcagc 300
cgtctcggcg cgatgctgga tcaaaaaaat cgtatcacca ttaactactt tgccatgccg 360
cccagcactt ttggcgcaat ttgcaaaggg cttggcgagg caaaactgaa tgctaaaccg 420
gcacgcgtag tcatggagaa accgctgggg acgtcgctgg cgacctcgca ggaaatcaat 480
gatcaggttg gcgaatactt cgaggagtgc caggtttacc gtatcgacca ctatcttggt 540
aaagaaacgg tgctgaacct gttggcgctg cgttttgcta actccctgtt tgtgaataac 600
tgggacaatc gcaccattga tcatgttgag attaccgtgg cagaagaagt ggggatcgaa 660
gggcgctggg gctattttga taaagccggt cagatgcgcg acatgatcca gaaccacctg 720
ctgcaaattc tttgcatgat tgcgatgtct ccgccgtctg acctgagcgc agacagcatc 780
cgcgatgaaa aagtgaaagt actgaagtct ctgcgccgca tcgaccgctc caacgtacgc 840
gaaaaaaccg tacgcgggca atatactgcg ggcttcgccc agggcaaaaa agtgccggga 900
tatctggaag aagagggcgc gaacaagagc agcaatacag aaactttcgt ggcgatccgc 960
gtcgacattg ataactggcg ctgggccggt gtgccattct acctgcgtac tggtaaacgt 1020
ctgccgacca aatgttctga agtcgtggtc tatttcaaaa cacctgaact gaatctgttt 1080
aaagaatcgt ggcaggatct gccgcagaat aaactgacta tccgtctgca acctgatgaa 1140
ggcgtggata tccaggtact gaataaagtt cctggccttg accacaaaca taacctgcaa 1200
atcaccaagc tggatctgag ctattcagaa acctttaatc agacgcatct ggcggatgcc 1260
tatgaacgtt tgctgctgga aaccatgcgt ggtattcagg cactgtttgt acgtcgcgac 1320
gaagtggaag aagcctggaa atgggtagac tccattactg aggcgtgggc gatggacaat 1380
gatgcgccga aaccgtatca ggccggaacc tggggacccg ttgcctcggt ggcgatgatt 1440
acccgtgatg gtcgttcctg gaatgagttt gagtaa 1476
<210> 101
<211> 474
<212> PRT
<213> Escherichia coli
<400> 101
Met Ser Thr Glu Ile Lys Thr Gln Val Val Val Leu Gly Ala Gly Pro
1 5 10 15
Ala Gly Tyr Ser Ala Ala Phe Arg Cys Ala Asp Leu Gly Leu Glu Thr
20 25 30
Val Ile Val Glu Arg Tyr Asn Thr Leu Gly Gly Val Cys Leu Asn Val
35 40 45
Gly Cys Ile Pro Ser Lys Ala Leu Leu His Val Ala Lys Val Ile Glu
50 55 60
Glu Ala Lys Ala Leu Ala Glu His Gly Ile Val Phe Gly Glu Pro Lys
65 70 75 80
Thr Asp Ile Asp Lys Ile Arg Thr Trp Lys Glu Lys Val Ile Asn Gln
85 90 95
Leu Thr Gly Gly Leu Ala Gly Met Ala Lys Gly Arg Lys Val Lys Val
100 105 110
Val Asn Gly Leu Gly Lys Phe Thr Gly Ala Asn Thr Leu Glu Val Glu
115 120 125
Gly Glu Asn Gly Lys Thr Val Ile Asn Phe Asp Asn Ala Ile Ile Ala
130 135 140
Ala Gly Ser Arg Pro Ile Gln Leu Pro Phe Ile Pro His Glu Asp Pro
145 150 155 160
Arg Ile Trp Asp Ser Thr Asp Ala Leu Glu Leu Lys Glu Val Pro Glu
165 170 175
Arg Leu Leu Val Met Gly Gly Gly Ile Ile Gly Leu Glu Met Gly Thr
180 185 190
Val Tyr His Ala Leu Gly Ser Gln Ile Asp Val Val Glu Met Phe Asp
195 200 205
Gln Val Ile Pro Ala Ala Asp Lys Asp Ile Val Lys Val Phe Thr Lys
210 215 220
Arg Ile Ser Lys Lys Phe Asn Leu Met Leu Glu Thr Lys Val Thr Ala
225 230 235 240
Val Glu Ala Lys Glu Asp Gly Ile Tyr Val Thr Met Glu Gly Lys Lys
245 250 255
Ala Pro Ala Glu Pro Gln Arg Tyr Asp Ala Val Leu Val Ala Ile Gly
260 265 270
Arg Val Pro Asn Gly Lys Asn Leu Asp Ala Gly Lys Ala Gly Val Glu
275 280 285
Val Asp Asp Arg Gly Phe Ile Arg Val Asp Lys Gln Leu Arg Thr Asn
290 295 300
Val Pro His Ile Phe Ala Ile Gly Asp Ile Val Gly Gln Pro Met Leu
305 310 315 320
Ala His Lys Gly Val His Glu Gly His Val Ala Ala Glu Val Ile Ala
325 330 335
Gly Lys Lys His Tyr Phe Asp Pro Lys Val Ile Pro Ser Ile Ala Tyr
340 345 350
Thr Glu Pro Glu Val Ala Trp Val Gly Leu Thr Glu Lys Glu Ala Lys
355 360 365
Glu Lys Gly Ile Ser Tyr Glu Thr Ala Thr Phe Pro Trp Ala Ala Ser
370 375 380
Gly Arg Ala Ile Ala Ser Asp Cys Ala Asp Gly Met Thr Lys Leu Ile
385 390 395 400
Phe Asp Lys Glu Ser His Arg Val Ile Gly Gly Ala Ile Val Gly Thr
405 410 415
Asn Gly Gly Glu Leu Leu Gly Glu Ile Gly Leu Ala Ile Glu Met Gly
420 425 430
Cys Asp Ala Glu Asp Ile Ala Leu Thr Ile His Ala His Pro Thr Leu
435 440 445
His Glu Ser Val Gly Leu Ala Ala Glu Val Phe Glu Gly Ser Ile Thr
450 455 460
Asp Leu Pro Asn Pro Lys Ala Lys Lys Lys
465 470
<210> 102
<211> 474
<212> PRT
<213> Artificial
<220>
<223> Lpd mutant
<400> 102
Met Ser Thr Glu Ile Lys Thr Gln Val Val Val Leu Gly Ala Gly Pro
1 5 10 15
Ala Gly Tyr Ser Ala Ala Phe Arg Cys Ala Asp Leu Gly Leu Glu Thr
20 25 30
Val Ile Val Glu Arg Tyr Asn Thr Leu Gly Gly Val Cys Leu Asn Val
35 40 45
Gly Cys Ile Pro Ser Lys Ala Leu Leu His Val Ala Lys Val Ile Glu
50 55 60
Glu Ala Lys Ala Leu Ala Glu His Gly Ile Val Phe Gly Glu Pro Lys
65 70 75 80
Thr Asp Ile Asp Lys Ile Arg Thr Trp Lys Glu Lys Val Ile Asn Gln
85 90 95
Leu Thr Gly Gly Leu Ala Gly Met Ala Lys Gly Arg Lys Val Lys Val
100 105 110
Val Asn Gly Leu Gly Lys Phe Thr Gly Ala Asn Thr Leu Glu Val Glu
115 120 125
Gly Glu Asn Gly Lys Thr Val Ile Asn Phe Asp Asn Ala Ile Ile Ala
130 135 140
Ala Gly Ser Arg Pro Ile Gln Leu Pro Phe Ile Pro His Glu Asp Pro
145 150 155 160
Arg Ile Trp Asp Ser Thr Asp Ala Leu Glu Leu Lys Glu Val Pro Glu
165 170 175
Arg Leu Leu Val Met Gly Gly Gly Ile Ile Ala Leu Glu Met Ala Thr
180 185 190
Val Tyr His Ala Leu Gly Ser Gln Ile Asp Val Val Val Arg Lys His
195 200 205
Gln Val Ile Arg Ala Ala Asp Lys Asp Ile Val Lys Val Phe Thr Lys
210 215 220
Arg Ile Ser Lys Lys Phe Asn Leu Met Leu Glu Thr Lys Val Thr Ala
225 230 235 240
Val Glu Ala Lys Glu Asp Gly Ile Tyr Val Thr Met Glu Gly Lys Lys
245 250 255
Ala Pro Ala Glu Pro Gln Arg Tyr Asp Ala Val Leu Val Ala Ile Gly
260 265 270
Arg Val Pro Asn Gly Lys Asn Leu Asp Ala Gly Lys Ala Gly Val Glu
275 280 285
Val Asp Asp Arg Gly Phe Ile Arg Val Asp Lys Gln Leu Arg Thr Asn
290 295 300
Val Pro His Ile Phe Ala Ile Gly Asp Ile Val Gly Gln Pro Met Leu
305 310 315 320
Ala His Lys Gly Val His Glu Gly His Val Ala Ala Glu Val Ile Ala
325 330 335
Gly Lys Lys His Tyr Phe Asp Pro Lys Val Ile Pro Ser Ile Ala Tyr
340 345 350
Thr Glu Pro Glu Val Ala Trp Val Gly Leu Thr Glu Lys Glu Ala Lys
355 360 365
Glu Lys Gly Ile Ser Tyr Glu Thr Ala Thr Phe Pro Trp Ala Ala Ser
370 375 380
Gly Arg Ala Ile Ala Ser Asp Cys Ala Asp Gly Met Thr Lys Leu Ile
385 390 395 400
Phe Asp Lys Glu Ser His Arg Val Ile Gly Gly Ala Ile Val Gly Thr
405 410 415
Asn Gly Gly Glu Leu Leu Gly Glu Ile Gly Leu Ala Ile Glu Met Gly
420 425 430
Cys Asp Ala Glu Asp Ile Ala Leu Thr Ile His Ala His Pro Thr Leu
435 440 445
His Glu Ser Val Gly Leu Ala Ala Glu Val Phe Glu Gly Ser Ile Thr
450 455 460
Asp Leu Pro Asn Pro Lys Ala Lys Lys Lys
465 470
<210> 103
<211> 1548
<212> DNA
<213> Escherichia coli
<400> 103
atgacggacc atacaatgaa gaaaaacccc gtaagtatac cacacaccgt ctggtacgcc 60
gacgatatcc gccgcggaga acgcgaggcg gcagatgtgc tggggctcac actctatgag 120
ctgatgcttc gcgctggcga ggccgcattc caggtgtgtc gttcggcgta tcctgacgcc 180
cgccactggc tggtgctgtg cggtcatggt aataacggcg gcgatggcta cgtggtcgcg 240
cgactggcca aagcggtcgg cattgaggtc acgttgttgg cccaggagag cgacaaaccg 300
ttgccggaag aggccgcgct ggcacgcgaa gcatggttaa acgcgggtgg cgagatccat 360
gcttcgaata ttgtctggcc cgaatcggta gatctgattg ttgatgcgct gctcggtacc 420
ggtttgcggc aagcgccccg cgaatccatt agccagttaa tcgaccacgc taattcccat 480
cctgcgccga ttgtggcggt tgatatccct tccggcctgc tggctgaaac tggcgctacg 540
ccaggcgcgg tgatcaacgc cgatcacacc atcactttta ttgcgctgaa accaggcttg 600
ctcactggaa aagcgcggga tgttaccgga caactgcatt ttgactcact ggggctggat 660
agttggctgg caggtcagga gacgaaaatt cagcggtttt cagcagaaca actttctcac 720
tggctaaaac cgcgtcgccc gacttcgcat aaaggcgatc acgggcggct ggtaattatc 780
ggtggcgatc acggcacggc gggggctatt cgtatgacgg gggaagcggc gctgcgtgct 840
ggtgctggtt tagtccgagt actgacccgc agtgaaaaca ttgcgccgct gctgactgca 900
cgaccggaat tgatggtgca tgaactgacg atggactctc ttaccgaaag cctggaatgg 960
gccgatgtgg tggtgattgg tcccggtctg ggccagcaag agtgggggaa aaaagcactg 1020
caaaaagttg agaattttcg caaaccgatg ttgtgggatg ccgatgcatt gaacctgctg 1080
gcaatcaatc ccgataagcg tcacaatcgc gtgatcacgc cgcatcctgg cgaggccgca 1140
cggttgttag gctgttccgt cgctgaaatt gaaagtgacc gcttacattg cgccaaacgt 1200
ctggtacaac gttatggcgg cgtagcggtg ctgaaaggtg ccggaaccgt ggtcgccgcc 1260
catcctgacg ctttaggcat tattgatgcc ggaaatgcag gcatggcgag cggcggcatg 1320
ggcgatgtgc tctctggtat tattggcgca ttgcttgggc aaaaactgtc gccgtatgat 1380
gcagcctgtg caggctgtgt cgcgcacggt gcggcagctg acgtactggc ggcgcgtttt 1440
ggaacgcgcg ggatgctggc aaccgatctc ttttccacgc tacagcgtat tgttaacccg 1500
gaagtgactg ataaaaacca tgatgaatcg agtaattccg ctccctga 1548
<210> 104
<211> 50
<212> DNA
<213> Artificial
<220>
<223> xylAB deletion primer 1
<400> 104
acgacatcat ccatcacccg cggcattacc tgattatgga gttcaatatg 50
<210> 105
<211> 50
<212> DNA
<213> Artificial
<220>
<223> xylAB deletion primer 2
<400> 105
cccccacccg gtcaggcagg ggataacgtt tacgccatta atggcagaag 50
<210> 106
<211> 50
<212> DNA
<213> Artificial
<220>
<223> yjhH deletion primer 1
<400> 106
aatgcgcgaa gttgccgact tcctgattaa taaaggggtc gacgggctgt 50
<210> 107
<211> 50
<212> DNA
<213> Artificial
<220>
<223> yjhH deletion primer 2
<400> 107
gtaccgactt aactgtgttg atcatcgtac gcaagtgacc aacgctgtcg 50
<210> 108
<211> 50
<212> DNA
<213> Artificial
<220>
<223> yagE deletion primer 1
<400> 108
ggcggcacca acgcccggga aaccatcgaa ctcagccagc acgcgcagca 50
<210> 109
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of yagE 2
<400> 109
agcacggtga agtgcggatg ggcacctttg acggtatgga tcatgctgcg 50
<210> 110
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of yiaE 1
<400> 110
catatttcag gctaaggtga tcgccttatc agtgaatgga gagaagcatg 50
<210> 111
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of yiaE 2
<400> 111
tatcgggctt tactctacgc agtcgcggct tagtccgcga cgtgcggatt 50
<210> 112
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of ycdW 1
<400> 112
aacgataagt gcgaataaat ttcgcacaac gcttttcggg agtcagtatg 50
<210> 113
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of ycdW 2
<400> 113
ccaaggatag caggaatcct gatgctttat tagtagccgc gtgcgcggtc 50
<210> 114
<211> 100
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of ptsG 1
<400> 114
atgtttaaga atgcatttgc taacctgcaa aaggtcggta aatcgctgat gctgccggta 60
tccgtactgc ctatcgcagg tgtaggctgg agctgcttcg 100
<210> 115
<211> 100
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of ptsG 2
<400> 115
ttagtggtta cggatgtact catccatctc ggttttcagg ttatcggatt tagtaccgaa 60
aatcgcctga acaccagaac catatgaata tcctccttag 100
<210> 116
<211> 21
<212> DNA
<213>Artificial
<220>
<223> RBS120
<400> 116
atccggtata ggaggtatag a 21
<210> 117
<211> 100
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of udhA 1
<400> 117
ggtgcgcgcg tcgcagttat cgagcgttat caaaatgttg gcggcggttg cacccactgg 60
ggcaccatcc cgtcgaaagc catatgaata tcctccttag 100
<210> 118
<211> 100
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of udhA 2
<400> 118
cccagaatct cttttgtttc ccgatggaac aaaattttca gcgtgcccac gttcatgccg 60
acgatttgtg cgcgtgccag tgtaggctgg agctgcttcg 100
<210> 119
<211> 101
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of pfkA 1
<400> 119
gttcctcggt tctgcgcgtt tcccggaatt ccgcgacgag aacatccgcg ccgtggctat 60
cgaaaacctg aaaaaacgtg gtgtaggctg gagctgcttc g 101
<210> 120
<211> 102
<212> DNA
<213>Artificial
<220>
<223> Primer for deletion of pfkA 2
<400> 120
ggcctgataa gcgaagcgca tcaggcattt ttgcttctgt catcggtttc agggtaaagg 60
aatctgcctt tttccgaaat cacatatgaa tatcctcctt ag 102
<210> 121
<211> 83
<212> DNA
<213>Artificial
<220>
<223> PR01
<400> 121
acgttaaatc tatcaccgca agggataaat atctaacacc gtgcgtgttg acaattttac 60
ctctggcggt gataatggtt gca 83
<210> 122
<211> 714
<212> DNA
<213>Artificial
<220>
<223> CI857
<400> 122
atgagcacaa aaaagaaacc attaacacaa gagcagcttg aggacgcacg tcgccttaaa 60
gcaatttatg aaaaaaagaa aaatgaactt ggcttatccc aggaatctgt cgcagacaag 120
atggggatgg ggcagtcagg cgttggtgct ttatttaatg gcatcaatgc attaaatgct 180
tataacgccg cattgcttac aaaaattctc aaagttagcg ttgaagaatt tagcccttca 240
atcgccagag aaatctacga gatgtatgaa gcggttagta tgcagccgtc acttagaagt 300
gagtatgagt accctgtttt ttctcatgtt caggcaggga tgttctcacc taagcttaga 360
acctttacca aaggtgatgc ggagagatgg gtaagcacaa ccaaaaaagc cagtgattct 420
gcattctggc ttgaggttga aggtaattcc atgaccgcac caacaggctc caagccaagc 480
tttcctgacg gaatgttaat tctcgttgac cctgagcagg ctgttgagcc aggtgatttc 540
tgcatagcca gacttggggg tgatgagttt accttcaaga aactgatcag ggatagcggt 600
caggtgtttt tacaaccact aaacccacag tacccaatga tcccatgcaa tgagagttgt 660
tccgttgtgg ggaaagttat cgctagtcag tggcctgaag agacgtttgg ctga 714
<210> 123
<211> 34
<212> DNA
<213>Artificial
<220>
<223> RBS150
<400> 123
taccaactaa cgcacgttta agtaggaacc gtat 34
<210> 124
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Replacement of gapA by gapN
<400> 124
acgtgactga ttctaacaaa acattaacac caactggcaa aattttgtcc 50
<210> 125
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Replacement of gapA by gapN
<400> 125
aaaaaagagc gaccgaagtc gctcttttta gatcacagtg tcatctcaac 50
<210> 126
<211> 43
<212> DNA
<213>Artificial
<220>
<223> ptrc
<400> 126
gagctgttga caattaatca tccggctcgt ataatgtgtg gaa 43
<210> 127
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for replacement of the xylFGH promoter 1
<400> 127
ctaaaaattg gttacgttta tcgcggtgat tgttacttat taaaactgtc 50
<210> 128
<211> 50
<212> DNA
<213>Artificial
<220>
<223> Primer for replacement of the xylFGH promoter 2
<400> 128
tgagtagaat gttctttatt ttcatggtgt agggccttct gtagttagag 50
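
The entries above follow a fixed layout: a numeric identifier tag such as <210> (sequence number) or <211> (declared length), then, after <400>, residue lines in blocks of ten with a running count at the end of each line. A minimal sketch of how the declared lengths could be checked against the residues actually listed is given below. The function names parse_listing and length_mismatches are hypothetical and not part of the patent, and the sample text is simply two entries (116 and 119) copied from this listing.

```python
import re

# Two entries copied from the listing above, used only as sample input.
SAMPLE = """\
<210> 116
<211> 21
<212> DNA
<213>Artificial
<220>
<223> RBS120
<400> 116
atccggtata ggaggtatag a 21
<210> 119
<211> 101
<212> DNA
<213>Artificial
<220>
<223>Primer lacks pfkA 1
<400> 119
gttcctcggt tctgcgcgtt tcccggaatt ccgcgacgag aacatccgcg ccgtggctat 60
cgaaaacctg aaaaaacgtg gtgtaggctg gagctgcttc g 101
"""


def parse_listing(text):
    """Hypothetical helper: map sequence number -> declared length and residues."""
    records = {}
    seq_id = None
    declared = None
    in_sequence = False
    for raw in text.splitlines():
        line = raw.strip()
        tag_match = re.match(r"<(\d{3})>\s*(.*)", line)
        if tag_match:
            tag, value = tag_match.group(1), tag_match.group(2)
            if tag == "210":        # sequence number starts a new entry
                seq_id = int(value)
                in_sequence = False
            elif tag == "211":      # declared length
                declared = int(value)
            elif tag == "400":      # residue lines follow this tag
                records[seq_id] = {"declared": declared, "residues": ""}
                in_sequence = True
            continue
        if in_sequence and line:
            # Keep letters only: drops block spaces and the running count.
            records[seq_id]["residues"] += re.sub(r"[^a-z]", "", line.lower())
    return records


def length_mismatches(records):
    """Return the sequence numbers whose residue count differs from <211>."""
    return [
        seq_id
        for seq_id, rec in sorted(records.items())
        if len(rec["residues"]) != rec["declared"]
    ]


records = parse_listing(SAMPLE)
print(length_mismatches(records))
```

An empty list means every declared length agrees with its residue lines. The same check could be run over the full listing, assuming it has been saved as plain text in the layout shown here.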

Claims (15)

1. A microorganism genetically modified for the production of 2,4-dihydroxybutyric acid by conversion of xylose into 1,2,4-butanetriol, wherein the microorganism is further genetically modified for:
i) oxidizing 1,2,4-butanetriol into 2,4-dihydroxybutyraldehyde; and
ii) oxidizing 2,4-dihydroxybutyraldehyde into 2,4-dihydroxybutyric acid.
2. The microorganism according to claim 1, wherein the genetic modification i) is the overexpression of at least one gene encoding an oxidoreductase acting on the CH-OH group of donors.
3. The microorganism according to claim 2, wherein the oxidoreductase is selected from the group consisting of: alcohol dehydrogenases, lactaldehyde reductases, glyoxylate reductases, didehydrogluconate reductases, and any combination thereof.
4. The microorganism according to any one of claims 1 to 3, wherein the genetic modification ii) is the overexpression of at least one gene encoding an oxidoreductase acting on the aldehyde or oxo group of donors.
5. The microorganism according to claim 4, wherein the oxidoreductase is selected from the group consisting of: aldehyde dehydrogenases, aldehyde oxidases, and any combination thereof.
6. The microorganism according to any one of claims 1 to 5, wherein the genetic modification for converting xylose into 1,2,4-butanetriol is the overexpression of at least one of the following genes:
- a gene encoding a xylose dehydrogenase,
- a gene encoding a xylonolactonase,
- a gene encoding a xylonate dehydratase,
- a gene encoding a 3-deoxy-D-glycero-pentulosonate decarboxylase,
- a gene encoding a 1,2,4-butanetriol dehydrogenase, and
- any combination thereof.
7. The microorganism according to any one of claims 1 to 6, wherein the microorganism is further genetically modified for:
iii) providing reducing power and/or energy, from a carbon source other than xylose, for the production of 2,4-dihydroxybutyric acid and for the growth of the microorganism, and/or
iv) at least partially, preferably totally, inhibiting carbon catabolite repression.
8. The microorganism according to claim 7, wherein the genetic modification iii) is the deletion and/or attenuation of at least one of the following genes:
- a gene encoding a xylose isomerase,
- a gene encoding a xylulokinase,
- a gene encoding a 3-deoxy-D-glycero-pentulosonate aldolase,
- a gene encoding a keto-acid dehydrogenase, and
- any combination thereof.
9. The microorganism according to claim 7 or 8, wherein the genetic modification iv) is selected from at least one of the following:
- deletion of a gene encoding a glucose permease of the phosphotransferase system,
- deletion of a gene encoding a phosphocarrier Hpr protein,
- expression of a gene and/or operon involved in a sugar importer system, wherein the sugar is a carbon source other than xylose,
- expression of a gene encoding a xylose transporter,
- overexpression of a gene encoding a glucose symporter,
- overexpression of a gene encoding a glucose facilitator,
- overexpression of a gene encoding a glucokinase,
- modulation of the expression of a gene involved in cAMP levels, preferably of a gene encoding an adenylate cyclase,
- modulation of the expression of a gene encoding a CRP and/or a CRP-like protein,
- expression of a gene encoding a cAMP-independent CRP protein, and
- any combination thereof.
10. The microorganism according to any one of claims 1 to 9, comprising a further genetic modification of at least one gene involved in the production of NADPH as a source of reducing power, the modification being preferably selected from:
- overexpression of a gene or operon encoding a membrane-bound transhydrogenase,
- deletion or attenuation of a gene encoding a soluble transhydrogenase,
- overexpression of a gene encoding an NADPH-generating glyceraldehyde-3-phosphate dehydrogenase,
- deletion or attenuation of a gene encoding a phosphoglucose isomerase,
- deletion or attenuation of a gene encoding a phosphofructokinase,
- overexpression of a gene encoding a glucose-6-phosphate dehydrogenase (G6PD),
- overexpression of a mutant gene encoding a diaphorase capable of generating NADPH,
- overexpression of a gene encoding a bifunctional NAD(P)H-hydrate repair enzyme, and
- any combination thereof.
11. The microorganism according to any one of claims 1 to 10, wherein the bacterium belongs to the family Enterobacteriaceae, Clostridiaceae, Bacillaceae, Streptomycetaceae or Corynebacteriaceae.
12. The microorganism according to claim 11, wherein the Enterobacteriaceae bacterium is Escherichia coli (E. coli).
13. A method for producing 2,4-dihydroxybutyric acid, comprising:
a) culturing, in a culture medium comprising xylose, a genetically modified microorganism as defined in any one of claims 1 to 12, under fermentation conditions allowing the conversion of xylose into 2,4-dihydroxybutyric acid, and
b) recovering the 2,4-dihydroxybutyric acid from the culture medium.
14. the method for claim 13, wherein the culture medium includes the carbon source in addition to xylose.
15. the method for claim 14, wherein the carbon source in addition to xylose is to include the carbon hydrate of 3,6 or 12 carbon atoms Thing, it is preferably selected from the following group:Glycerine, glucose, galactolipin, fructose, lactose, maltose, sucrose and its any combinations.
CN201580080660.8A 2015-04-07 2015-04-07 Modified microorganism for optimized production of 2,4-dihydroxybutyric acid Active CN107690482B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/IB2015/000618 WO2016162712A1 (en) 2015-04-07 2015-04-07 Modified microorganism for the optimized production of 2,4-dihydroxyburyrate

Publications (2)

Publication Number Publication Date
CN107690482A true CN107690482A (en) 2018-02-13
CN107690482B CN107690482B (en) 2022-11-29

Family

ID=53762215

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580080660.8A Active CN107690482B (en) 2015-04-07 2015-04-07 Modified microorganism for optimized production of 2,4-dihydroxybutyric acid

Country Status (5)

Country Link
US (1) US10415062B2 (en)
EP (1) EP3280694B1 (en)
CN (1) CN107690482B (en)
BR (1) BR112017021255A2 (en)
WO (1) WO2016162712A1 (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11951140B2 (en) 2011-02-04 2024-04-09 Seed Health, Inc. Modulation of an individual's gut microbiome to address osteoporosis and bone disease
US11844720B2 (en) 2011-02-04 2023-12-19 Seed Health, Inc. Method and system to reduce the likelihood of dental caries and halitosis
US11951139B2 (en) 2015-11-30 2024-04-09 Seed Health, Inc. Method and system for reducing the likelihood of osteoporosis
US11998479B2 (en) 2011-02-04 2024-06-04 Seed Health, Inc. Method and system for addressing adverse effects on the oral microbiome and restoring gingival health caused by sodium lauryl sulphate exposure
US11826388B2 (en) 2013-12-20 2023-11-28 Seed Health, Inc. Topical application of Lactobacillus crispatus to ameliorate barrier damage and inflammation
US11980643B2 (en) 2013-12-20 2024-05-14 Seed Health, Inc. Method and system to modify an individual's gut-brain axis to provide neurocognitive protection
US12005085B2 (en) 2013-12-20 2024-06-11 Seed Health, Inc. Probiotic method and composition for maintaining a healthy vaginal microbiome
US11833177B2 (en) 2013-12-20 2023-12-05 Seed Health, Inc. Probiotic to enhance an individual's skin microbiome
US11969445B2 (en) 2013-12-20 2024-04-30 Seed Health, Inc. Probiotic composition and method for controlling excess weight, obesity, NAFLD and NASH
US11998574B2 (en) 2013-12-20 2024-06-04 Seed Health, Inc. Method and system for modulating an individual's skin microbiome
US11839632B2 (en) 2013-12-20 2023-12-12 Seed Health, Inc. Topical application of CRISPR-modified bacteria to treat acne vulgaris
KR101996767B1 (en) 2018-11-29 2019-07-04 씨제이제일제당 (주) cAMP receptor protein variant and method for producing L-amino acid using the same
KR101991207B1 (en) 2018-11-29 2019-06-19 씨제이제일제당 (주) cAMP receptor protein variant and method for producing L-amino acid using the same
KR101991206B1 (en) * 2018-11-29 2019-06-19 씨제이제일제당 (주) cAMP receptor protein variant and method for producing L-amino acid using the same
CN114008197A (en) * 2019-04-04 2022-02-01 布拉斯科公司 Metabolic engineering for simultaneous consumption of xylose and glucose to produce chemicals from second generation sugars
CN111593014B (en) * 2020-06-24 2022-07-05 江南大学 Method for co-production of 1, 3-propylene glycol and D-1,2, 4-butanetriol
CN112680484B (en) * 2021-01-15 2022-06-14 山东大学 Method for producing 3, 4-dihydroxybutyric acid by using double-bacterium co-culture system
DE102021101004B3 (en) 2021-01-19 2022-03-10 Technische Universität Dresden, Körperschaft des öffentlichen Rechts Process for the production of 2,4-dihydroxybutyrate or L-threonine using a microbial pathway
WO2023285585A2 (en) 2021-07-16 2023-01-19 Biosyntia Aps Microbial cell factories producing vitamin b compounds

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1167534A2 (en) * 2000-06-28 2002-01-02 Daiso Co., Ltd. Process for the preparation of optically active 1,2-diols by cultivating microorganisms
KR20120041827A (en) * 2010-08-27 2012-05-03 부산대학교 산학협력단 Recombinant microorganism transformed with puuc gene encoding 3-hydroxypropionaldehyde dehydrogenase and method of preparing 3-hydroxypropionic acid or co-preparing 1,3-propanediol and 3-hydroxypropionic acid therewith
CN104254612A (en) * 2012-04-26 2014-12-31 安迪苏法国联合股份有限公司 A method of production of 2,4-dihydroxybutyric acid
CN104471069A (en) * 2012-07-11 2015-03-25 安迪苏法国联合股份有限公司 Method for the preparation of 2,4-dihydroxybutyrate

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19735575A1 (en) 1997-08-09 1999-02-11 Schering Ag New di:hydroxy-butanal or -pentene derivatives
AU2004200948A1 (en) 1999-02-11 2004-04-01 Schering Ag Epothilon derivatives, method for the production and the use thereof as pharmaceuticals
ES2260297T3 (en) 2000-09-15 2006-11-01 Purac Biochem B.V. METHOD FOR PURIFICATION OF ALFA-HYDROXIACIDES AT INDUSTRIAL SCALE.
ATE393137T1 (en) 2001-05-07 2008-05-15 Cargill Inc METHOD FOR PRODUCING CARBOXYLIC ACIDS AND THEIR DERIVATIVES
FR2862068B1 (en) 2003-11-06 2007-10-12 Metabolic Explorer Sa OPTIMIZED MICROORGANISM STRAINS FOR NADPH CONSUMPTION BIOSYNTHESIS PATHWAYS
FR2864967B1 (en) 2004-01-12 2006-05-19 Metabolic Explorer Sa ADVANCED MICROORGANISM FOR THE PRODUCTION OF 1,2-PROPANEDIOL
US7709420B2 (en) 2006-01-04 2010-05-04 Halliburton Energy Services, Inc. Organophilic clays and methods for the preparation and use thereof
CN101512004A (en) 2006-07-19 2009-08-19 密歇根州州立大学托管委员会 Microbial synthesis of D-1,2,4-butanetriol
JP2010501516A (en) 2006-08-24 2010-01-21 エボニック デグサ ゲーエムベーハー Method for producing D, L-2-hydroxy-4-alkylthiobutyric acid
WO2008116852A1 (en) 2007-03-23 2008-10-02 Metabolic Explorer New micro-organisms for the production of 1,2-propanediol obtained by a combination of evolution and rational design.
US9200288B2 (en) 2010-04-27 2015-12-01 The Regents Of The University Of California Production of 1,4-butanediol by recombinant microorganisms
TWI500768B (en) 2010-07-05 2015-09-21 Metabolic Explorer Sa Method for the preparation of 1,3-propanediol from sucrose
FR2966150B1 (en) 2010-10-15 2012-10-12 Adisseo France Sas PROCESS FOR THE PREPARATION OF 2-HYDROXYBUTYROLACTONE
AR083468A1 (en) 2010-10-25 2013-02-27 Metabolic Explorer Sa INCREASING NADPH'S AVAILABILITY FOR METIONIN PRODUCTION
CN103270155B (en) 2010-10-28 2016-02-10 安迪苏法国联合股份有限公司 The production method of 2,4-dihydroxyl butyric acid
FR2974803B1 (en) 2011-05-06 2013-05-03 Roquette Freres PROCESS FOR THE PREPARATION OF A PARTIALLY PURIFIED GLYCOLIC ACID
EP2532751A1 (en) 2011-06-10 2012-12-12 Metabolic Explorer Use of inducible promoters in the fermentative production of 1,2-propanediol
AR086790A1 (en) 2011-06-29 2014-01-22 Metabolic Explorer Sa A MICROORGANISM FOR THE PRODUCTION OF METIONIN WITH IMPORTATION OF IMPROVED GLUCOSE
CN104903441A (en) 2012-11-09 2015-09-09 拉勒曼德匈牙利流动管理有限责任公司 Method for acetate consumption during ethanolic fermentation of cellulosic feedstocks
CN105682570B (en) 2013-11-04 2019-02-01 柯惠Lp公司 Surgical fasteners bringing device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KRIS NIÑO G ET AL: "Direct bioconversion of d-xylose to 1,2,4-butanetriol in an engineered Escherichia coli", 《PROCESS BIOCHEMISTRY》 *
XINGHUA LI ET AL: "Design and construction of a non-natural malate to 1,2,4-butanetriol pathway creates possibility to produce 1,2,4-butanetriol from glucose", 《SCI REP》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110157653A (en) * 2019-05-09 2019-08-23 南京工业大学 Recombinant escherichia coli for high-yield cyclic adenosine monophosphate and application of recombinant escherichia coli in cyclic adenosine monophosphate synthesis
CN110343653A (en) * 2019-07-30 2019-10-18 江南大学 Method for increasing 1,2,4-butanetriol yield by knocking out an Escherichia coli aldehyde dehydrogenase gene
CN110373371A (en) * 2019-07-30 2019-10-25 江南大学 Method for increasing 1,2,4-butanetriol yield by overexpressing a xylose transport protein gene, and application thereof
CN110591997A (en) * 2019-10-29 2019-12-20 南京工业大学 Genetic engineering bacterium for improving activity of xylonic acid dehydratase and construction method and application thereof
CN110591997B (en) * 2019-10-29 2023-07-07 南京工业大学 Genetically engineered bacterium for improving activity of xylitol dehydratase, and construction method and application thereof

Also Published As

Publication number Publication date
US10415062B2 (en) 2019-09-17
BR112017021255A2 (en) 2018-06-26
WO2016162712A1 (en) 2016-10-13
US20180100169A1 (en) 2018-04-12
EP3280694B1 (en) 2021-11-24
CN107690482B (en) 2022-11-29
EP3280694A1 (en) 2018-02-14

Similar Documents

Publication Publication Date Title
CN107690482B (en) Modified microorganism for optimized production of 2,4-dihydroxybutyric acid
CA2737428C (en) Bacterium capable of producing lactic acid, and method for producing lactic acid
CN107771214B (en) Modified microorganisms for optimized 2,4-dihydroxybutyric acid production with increased 2,4-dihydroxybutyric acid excretion
JP5140074B2 (en) Amino acid production method using glycerol
Olajuyin et al. Efficient production of succinic acid from Palmaria palmata hydrolysate by metabolically engineered Escherichia coli
KR101083136B1 (en) Microorganisms for producing l-amino acids and process for producing l-amino acids using them
CN112877272B (en) Escherichia coli engineering bacteria of N-acetylglucosamine and fermentation production method
JP6195668B2 (en) Microorganism having improved L-lysine production ability and method for producing L-lysine using the same
JP2013529924A (en) Method for preparing 1,3-propanediol from sucrose
MXPA02003634A (en) High yield protein expression system and methods.
CA2808140A1 (en) Improved glycolic acid fermentative production with a modified microorganism
US20150211035A1 (en) Biological method for producing cis-5-hydroxy-l-pipecolic acid
CN111748535B (en) Alanine dehydrogenase mutant and application thereof in fermentation production of L-alanine
KR102149044B1 (en) Method of producing 2-hydroxy gamma butyrolactone or 2,4-dihydroxybutanoic acid
JP3593125B2 (en) Recombinant cells that highly express heterologous genes integrated into chromosomes
DK2430152T3 (en) A microorganism with increased L-lysine productivity and method for producing L-lysine using the same
Gao et al. Fine‐tuning ethanol oxidation pathway enzymes and cofactor PQQ coordinates the conflict between fitness and acetic acid production by Acetobacter pasteurianus
KR20190097250A (en) Conversion of methylglyoxal to hydroxyacetone using a novel enzyme and its application
CN113122563A (en) Method for constructing R-3-aminobutyric acid production strain
Chiang et al. Pathway engineering of Escherichia coli for the glycerol-based production of recombinant endoglucanase
KR20200023450A (en) Microorganisms and Related Methods Having Stabilized Copy Numbers of Functional DNA Sequences
松原充 Fermentative production of 1-propanol from biomass using recombinant Escherichia coli
KR101254401B1 (en) Recombinant microorganism having enhanced xanthan productivity and method of producing xanthan using the same
CN116622667A (en) Mutant of NAD kinase and application thereof
CN112673016A (en) XYLR mutants for improved xylose utilization or improved glucose and xylose co-utilization

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant