实施例2棉花蛋白激酶类编码基因GhGPK1的克隆
克隆子Gh-D193去掉冗余DNA后,序列为SEQ ID No:3,序列分析表明该序列的编码的氨基酸序列属于促***原活化蛋白激酶,本文将克隆子Gh-D193编码的全长基因命名为GhGPK1,对应的蛋白命名为GPK1。
SEQ ID No:3
1 ACACTAAACA ACTTTTGTTG GGACTGGAAT ATCTTCACAA AAATAGAATC GTGCATAGAG
61 ACATCAAGGG AGCAAACATT CTTGTGGATA ACAAGGGTTG TATCAAACTT GCAGACTTTG
121 GTGCATCCAA AAAAGTTGTC GAGTTGGCTA CAATAAATGG AGCCAAGTCA ATGAAGGGAA
181 CTCCTTATTG GATGGCTCCT GAAGTTATTC TCCAAACTGG GCATAGCTTC TCTGCCGATA
241 TTTGGAGTGT TGGCTGT
GhGPK1全长基因的克隆
根据已经获得的Gh-D193基因片段,设计两条特异性引物,作为3’RACE的5’端特异性引物。
GhGPK1 GSP1:SEQ ID NO:4:
ACTTTTGTTGGGACTGGAAT AT
GhGPK1 GSP2:SEQ ID NO:5:
CAAAAATAGAATCGTGCATAGAG
实验步骤按试剂盒说明书操作(3’RACE System for Rapid AmplificationofcDNA Ends试剂盒购自invitrogen公司)
用SEQ ID NO:4与3’端引物AUAP(试剂盒自带),以mRNA逆转录的cDNA为模板进行第一轮PCR扩增。具体步骤如下:Ex Taq购自TAKARA,50μl PCR反应体系:5μl10×ExBuffer,3μl2.5mM的dNTP,2.0μl mRNA反转录的cDNA,1.0μl Ex Taq、10μM的引物SEQ IDNO:4和AUAP各2.0μl,以及35μl的双蒸水。PCR反应条件:94℃预变性5min,94℃变性30s,58℃退火30s,72℃延伸2min,33个循环后,72℃延伸10min。
所得的PCR产物用双馏水稀释50倍后取2.0μl作为模板,用SEQ ID NO:5与3’端引物AUAP进行第二轮PCR扩增,具体步骤如下:50μl PCR反应体系:5μl10×Ex Buffer,3μl2.5mM的dNTP,2.0μl稀释的第一轮PCR产物,1.0μl Ex Taq、10μM的引物SEQ ID NO:5和AUAP各2.0μl,以及35μl的双蒸水。PCR反应条件:94℃预变性5min,94℃变性30s,58℃退火30s,72℃延伸2min,33个循环后,72℃延伸10min。第二次PCR产物回收片段约为1600bp条带(Gel Extraction Kit购自OMEGA)连接于pGEM-T EasyVector,转化到大肠杆菌JM109(具体方法同上),随机挑取10个白色菌落于含有50μg/mL氨苄青霉素的LB液体培养基中培养,37℃培养过夜后加甘油至终浓度20%,-80℃保存备用。SEQ ID NO:5与3’端引物AUAP进行菌液PCR扩增,得到3个阳性克隆,送英潍捷基(上海)贸易有限公司测序测序,获得该基因的cDNA的3’端。
根据已经获得的GhGPK1基因片段,设计三条特异性引物,作为5’RACE的3’端特异性引物。
GhGPK1 GSP3:SEQ ID NO:6:
TATCGGCAGAGAAGCTATGC
GhGPK1 GSP4:SEQ ID NO:7:
GAATAACTTCAGGAGCCATCC
GhGPK1 GSP5:SEQ ID NO:8:
TTCCCTTCATTGACTTGGCTCC
实验步骤按试剂盒说明书操作(5’RACE System for Rapid AmplificationofcDNA Ends试剂盒购自invitrogen公司)。
用SEQ ID NO:7与5’通用引物AAP(试剂盒自带),以mRNA逆转录的cDNA(反转录引物SEQ ID NO:6)为模板进行第一轮PCR扩增,具体步骤如下:Ex Taq购自TAKARA,50μl PCR反应体系:5μl10×Ex Buffer,3μl2.5mM的dNTP,2.0μl mRNA反转录的cDNA,1.0μl Ex Taq、10μM的引物SEQ ID NO:7和AAP各2.0μl,以及35μl的双蒸水。PCR反应条件:94℃预变性5min,94℃变性30s,55℃退火30s,72℃延伸2min,33个循环后,72℃延伸10min。所得的PCR产物用双蒸水稀释50倍后取2.0μl作为模板,用SEQ ID NO:8与3’端引物AUAP进行第二轮PCR扩增,具体步骤如下:50μl PCR反应体系:5μl10×Ex Buffer,3μl2.5mM的dNTP,2.0μl稀释的第一轮PCR产物,1.0μl Ex Taq、10μM的引物SEQ ID NO:8和AUAP各2.0μl,以及35μl的双蒸水。PCR反应条件:94℃预变性5min,94℃变性30s,58℃退火30s,72℃延伸2min,33个循环后,72℃延伸10min。第二次PCR产物回收片段约为800bp条带(Gel Extraction Kit购自OMEGA)连接于pGEM-T EasyVector,转化到JM109(具体方法同上),随机挑取10个白色菌落于含有50μg/mL氨苄青霉素的LB液体培养基中培养,37℃培养过夜后加甘油至终浓度20%,-80℃保存备用。SEQ ID NO:8与3’端引物AUAP进行菌液PCR扩增(反应体系及反应条件同上),得到4个阳性克隆,送英潍捷基(上海)贸易有限公司测序测序,获得该基因的cDNA的5’端。
所得的5’RACE产物克隆测序后,与3’RACE产物测序结果拼接。获得GhGPK1全长cDNA序列SEQ ID NO:19。
SEQ ID NO:19:
1 ACCTCCAAAT CTTTAAGCAA ACGGATCTGA AGAAAGAAGA TTTTTGTACT CTAAATGCAG
61 GAGCTTGTTG GATCGGTTCG CCGGTCCTTT GTCTTTAGGT CTTCAACCTC CAGCGACGAT
121 GCCGGTGGAG GACTTGGAGG CTTTGTTGAG AAGATCGGCG CCAGCATTCG CAGATCGCGA
181 ATCGGCTTGT TCGCTAAGCC ACCTGCTCCA CCCGCTCTTC CTTCTGTTAA GAAAAGGGAC
241 GCTACGATCC GGTGGCGGAA GGGCGAGTTG ATTGGTTGTG GCGCCTTTGG TCGGGTTTAC
301 ATGGGGATGA ATCTTGACTC CGGCGAGTTA CTAGCTGTTA AACAGGTTTT GATAGCGGCA
361 AATGCTTCAA AGGAGAAAAC ACAGGCTCAT ATTAGAGAGC TCGAAGAAGA AGTGAAGCTT
421 CTACAGAATC TGTCACATCC AAACATTGTT AGATATTTGG GCACCGCAAG AGAGGACGAT
481 TCGTTGAATA TTCTATTGGA GTTTGTACCA GGTGGATCCA TTTCCTCACT TTTGGGGAAG
541 TTTGGATCTT TCCCCGAATC TGTTATAAGA ATGTACACTA AACAACTTTT GTTGGGACTG
601 GAATATCTTC ACAAAAATAG AATCGTGCAT AGAGACATCA AGGGAGCAAA CATTCTTGTG
661 GATAACAAGG GTTGTATCAA ACTTGCAGAC TTTGGTGCAT CCAAAAAAGT TGTCGAGTTG
721 GCTACAATAA ATGGAGCCAA GTCAATGAAG GGAACTCCTT ATTGGATGGC TCCTGAAGTT
781 ATTCTCCAAA CTGGGCATAG CTTCTCTGCC GATATTTGGA GTGTTGGCTG TACTGTGATT
841 GAGATGGCTA CTGGAAAGCC CCCTTGGAGC CAACAGTATC AGGAGGTTGC TGCTCTCTTT
901 CATATTGGGA CAACTAAATC TCATCCACCC ATCCCTGAGC ATCTCTCCCC TGAGGCTAAA
961 GACCTTTTGT TAAAATGCCT ACAGAAGGAA CCAGGACTGA GACCTAGTGC ATCAGACTTG
1021 CTTCAGCACC CTTTTGTCAC TGGGGACTAT CAGGAACCTC ATGCGGTGCT TCGTAGATCA
1081 ATTAGGGAAC CTGAAAATCT GGAGATGGCA TCTGGGGTGA ACCTGAGAAG CTCAATAAAT
1141 TCGGAGATCA GGTCAACCTG CACGGGTTTA AAGGACGTTT GTGAAATGGG TAGTGTGAGT
1201 TGCTCTACAG CATTTCTTGG GAAATTCTCA GAACCAGGAG CCTATTGGAG GGGAAGCAAT
1261 TGTGACAATA GCATGTGTGA GATAGATGAC AAAGATGATC TAGAATTTAA TTATTCTGTA
1321 AAGTTCTCCT CTGTTTTATC ATCTGCTGAC TTGAATAAAA GTTTCAATCC CATGTGTGAA
1381 CCCACTGAAG ACTGGCCACC CAAATTAGAT CAAAGTTCTG AGCTAAGCCG AAGTGGAGTA
1441 AACTTGTCTT TGGATGAAAC AATGGAAGCT GCTAGCACTC CTGGAATGTC TGGTAAGGAG
1501 GAGAATGGCT TCACTTTTCT CTGCGGACCT CCAACAGGTG ATGATGATGA AGAAGTTACA
1561 GAGTCAAAAA TTAGAGCCTT CCTGGATGAA AAGGCTCTGG AACTGAAGAA GCTGCAATCT
1621 CCTCTGTATG AACAGTTCTA CAACACATTG AATGGTAGTC TTCCACCTTC TGTTGGAACT
1681 GCAAATGGTG AAAATATTTT GAGCTTACCT CCTAAAAGTA GGTCACCTAA GTGGCTGCCT
1741 AGCAGAAGAC TCTCAGCAGT TGCTGATGCT GCCAACATGG TTAGCTCAAA GAGTCGTATG
1801 AATCATTTGT CAAATACTGC AGTTGTTCAT GACCGGACTT TACAAGAAAT TCAGCCACCG
1861 TCTGTTGAGG AATGGAAAGG GCAGGATATA ATTAGTCCGA GCATGAGCTT TTCTGAGAGA
1921 CAAAGGAGAT GGAAGGAAGA GCTTGACCAA GAGCTTGAGA GAAAGCGAGA GATGCTGCGG
1981 AAGACATCAT CTCCAAAGGA TAAGTTTCTA TTTGGACAAA GAGAACAAAT ACGGTCTCCA
2041 TTTCCTGGCA AGTAAATGTA GTATCCAACT TTACCTCTTT TGATGTTCCA TTGGGACTGT
2101 TCATTTCTGT ATTCTATTTT TGGTGAGAAT GGTGCCCAAT AATGTTACTC GGGCTTTATT
2161 TATGAGTGTC AGATACGAAT ATGTATTTGA TATGAGTATA TTTAATTTTT TTTCTTTTTT
2221 TTTTTTGCTT GTATAAAGTT CTTTCATATT TGTGGAAGAT ATTTGGGAGA GCCAAAAAAA
2281 AAAAAAAAAA AAA
根据GhGPK1全长cDNA序列设计一对引物如下:
GhGPKF:SEQ ID NO:9:
ATGCAGGAGCTTGTTGGATCGG
GhGPKR:SEQ ID NO:10:
TTACTTGCCA GGAAATGGAG ACCG
通过SEQ ID NO:9和SEQ ID NO:10来克隆GhGPK1全长。
采用TaKaRa的PrimeSTAR HS DNA聚合酶,以棉花的cDNA为模板进行PCR反应。50μlPCR反应体系:10μl5×PS Buffer,3μl2.5mM的dNTP,2.0μl cDNA,1.0μl PrimeSTAR、10μM的引物SEQ ID NO:9和SEQ ID NO:10各2.0μl,以及30μl的双蒸水。PCR反应条件:94℃预变性5min,94℃变性30s,58℃退火30s,72℃延伸2min,33个循环后,72℃延伸10min。
PCR扩增产物加A尾:PCR产物加2.5倍的无水乙醇,-20℃放置10分钟,离心,去上清,晾干,用21μl双蒸水溶解。加入2.5μl10×Ex Buffer,0.5μl5mM的dATP,2.5μl10×ExTaq。反应条件:70℃反应30分钟。将得到约1900bp的DNA片段回收(Omega回收试剂盒),连接至pGEM T-easy载体上,转化JM109(方法同上),随机挑取10个白色菌落于含有50μg/mL氨苄青霉素的LB液体培养基中培养,37℃培养过夜后加甘油至终浓度20%,-80℃保存备用。SEQID NO:9与SEQ ID NO:10进行菌液PCR扩增(反应体系及反应条件同上),得到4个阳性克隆,送至英潍捷基(上海)贸易有限公司测序,序列为SEQ ID NO:2。
GPK1蛋白的氨基酸序列:SEQ ID NO:1
1 MQELVGSVRR SFVFRSSTSS
21 DDAGGGLGGF VEKIGASIRR
41 SRIGLFAKPP APPALPSVKK
61 RDATIRWRKG ELIGCGAFGR
81 VYMGMNLDSG ELLAVKQVLI
101 AANASKEKTQ AHIRELEEEV
121 KLLQNLSHPN IVRYLGTARE
141 DDSLNILLEF VPGGSISSLL
161 GKFGSFPESV IRMYTKQLLL
181 GLEYLHKNRI VHRDIKGANI
201 LVDNKGCIKL ADFGASKKVV
221 ELATINGAKS MKGTPYWMAP
241 EVILQTGHSF SADIWSVGCT
261 VIEMATGKPP WSQQYQEVAA
281 LFHIGTTKSH PPIPEHLSPE
301 AKDLLLKCLQ KEPGLRPSAS
321 DLLQHPFVTG DYQEPHAVLR
341 RSIREPENLE MASGVNLRSS
361 INSEIRSTCT GLKDVCEMGS
381 VSCSTAFLGK FSEPGAYWRG
401 SNCDNSMCEI DDKDDLEFNY
421 SVKFSSVLSS ADLNKSFNPM
441 CEPTEDWPPK LDQSSELSRS
461 GVNLSLDETM EAASTPGMSG
481 KEENGFTFLC GPPTGDDDEE
501 VTESKIRAFL DEKALELKKL
521 QSPLYEQFYN TLNGSLPPSV
541 GTANGENILS LPPKSRSPKW
561 LPSRRLSAVA DAANMVSSKS
581 RMNHLSNTAV VHDRTLQEIQ
601 PPSVEEWKGQ DIISPSMSFS
621 ERQRRWKEEL DQELERKREM
641 LRKTSSPKDK FLFGQREQIR
661 SPFPGK*
GhGPK1编码基因的核苷酸序列:SEQ ID NO:2
1 ATGCAGGAGC TTGTTGGATC GGTTCGCCGG TCCTTTGTCT TTAGGTCTTC AACCTCCAGC
61 GACGATGCCG GTGGAGGACT TGGAGGCTTT GTTGAGAAGA TCGGCGCCAG CATTCGCAGA
121 TCGCGAATCG GCTTGTTCGC TAAGCCACCT GCTCCACCCG CTCTTCCTTC TGTTAAGAAA
181 AGGGACGCTA CGATCCGGTG GCGGAAGGGC GAGTTGATTG GTTGTGGCGC CTTTGGTCGG
241 GTTTACATGG GGATGAATCT TGACTCCGGC GAGTTACTAG CTGTTAAACA GGTTTTGATA
301 GCGGCAAATG CTTCAAAGGA GAAAACACAG GCTCATATTA GAGAGCTCGA AGAAGAAGTG
361 AAGCTTCTAC AGAATCTGTC ACATCCAAAC ATTGTTAGAT ATTTGGGCAC CGCAAGAGAG
421 GACGATTCGT TGAATATTCT ATTGGAGTTT GTACCAGGTG GATCCATTTC CTCACTTTTG
481 GGGAAGTTTG GATCTTTCCC CGAATCTGTT ATAAGAATGT ACACTAAACA ACTTTTGTTG
541 GGACTGGAAT ATCTTCACAA AAATAGAATC GTGCATAGAG ACATCAAGGG AGCAAACATT
601 CTTGTGGATA ACAAGGGTTG TATCAAACTT GCAGACTTTG GTGCATCCAA AAAAGTTGTC
661 GAGTTGGCTA CAATAAATGG AGCCAAGTCA ATGAAGGGAA CTCCTTATTG GATGGCTCCT
721 GAAGTTATTC TCCAAACTGG GCATAGCTTC TCTGCCGATA TTTGGAGTGT TGGCTGTACT
781 GTGATTGAGA TGGCTACTGG AAAGCCCCCT TGGAGCCAAC AGTATCAGGA GGTTGCTGCT
841 CTCTTTCATA TTGGGACAAC TAAATCTCAT CCACCCATCC CTGAGCATCT CTCCCCTGAG
901 GCTAAAGACC TTTTGTTAAA ATGCCTACAG AAGGAACCAG GACTGAGACC TAGTGCATCA
961 GACTTGCTTC AGCACCCTTT TGTCACTGGG GACTATCAGG AACCTCATGC GGTGCTTCGT
1021 AGATCAATTA GGGAACCTGA AAATCTGGAG ATGGCATCTG GGGTGAACCT GAGAAGCTCA
1081 ATAAATTCGG AGATCAGGTC AACCTGCACG GGTTTAAAGG ACGTTTGTGA AATGGGTAGT
1141 GTGAGTTGCT CTACAGCATT TCTTGGGAAA TTCTCAGAAC CAGGAGCCTA TTGGAGGGGA
1201 AGCAATTGTG ACAATAGCAT GTGTGAGATA GATGACAAAG ATGATCTAGA ATTTAATTAT
1261 TCTGTAAAGT TCTCCTCTGT TTTATCATCT GCTGACTTGA ATAAAAGTTT CAATCCCATG
1321 TGTGAACCCA CTGAAGACTG GCCACCCAAA TTAGATCAAA GTTCTGAGCT AAGCCGAAGT
1381 GGAGTAAACT TGTCTTTGGA TGAAACAATG GAAGCTGCTA GCACTCCTGG AATGTCTGGT
1441 AAGGAGGAGA ATGGCTTCAC TTTTCTCTGC GGACCTCCAA CAGGTGATGA TGATGAAGAA
1501 GTTACAGAGT CAAAAATTAG AGCCTTCCTG GATGAAAAGG CTCTGGAACT GAAGAAGCTG
1561 CAATCTCCTC TGTATGAACA GTTCTACAAC ACATTGAATG GTAGTCTTCC ACCTTCTGTT
1621 GGAACTGCAA ATGGTGAAAA TATTTTGAGC TTACCTCCTA AAAGTAGGTC ACCTAAGTGG
1681 CTGCCTAGCA GAAGACTCTC AGCAGTTGCT GATGCTGCCA ACATGGTTAG CTCAAAGAGT
1741 CGTATGAATC ATTTGTCAAA TACTGCAGTT GTTCATGACC GGACTTTACA AGAAATTCAG
1801 CCACCGTCTG TTGAGGAATG GAAAGGGCAG GATATAATTA GTCCGAGCAT GAGCTTTTCT
1861 GAGAGACAAA GGAGATGGAA GGAAGAGCTT GACCAAGAGC TTGAGAGAAA GCGAGAGATG
1921 CTGCGGAAGA CATCATCTCC AAAGGATAAG TTTCTATTTG GACAAAGAGA ACAAATACGG
1981 TCTCCATTTC CTGGCAAGTA A