BLASTX nr result
ID: Zingiber25_contig00029013
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00029013
         (1104 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...   168   4e-39
ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205...   163   1e-37
ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217...   162   2e-37
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    162   3e-37
gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]    160   1e-36
ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, p...   156   2e-35
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...   156   2e-35
ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229...   154   6e-35
ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [A...   150   7e-34
ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210...   145   4e-32
ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515...   139   3e-30
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...   137   6e-30
gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao]             136   2e-29
ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355...   135   2e-29
gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma ...   135   3e-29
emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]   134   6e-29
gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao]             134   8e-29
gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao]             133   1e-28
gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [...
133 1e-28 ref|XP_004300999.1| PREDICTED: uncharacterized protein LOC101305... 132 2e-28 >ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus] Length = 655 Score = 168 bits (425), Expect = 4e-39 Identities = 99/307 (32%), Positives = 155/307 (50%), Gaps = 15/307 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ P F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 121 RDFRKYDPQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 180 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 181 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 240 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + S F +GLR IR VR + TT EA+ A+ +++ + NK Sbjct: 241 FAPELVSNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDERQPRSFNK 300 Query: 744 RLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQK-TQAVEGTGFRVENKVRCSKC 920 S G+K+ + + G P + +P + Q+ G G + + C+ C Sbjct: 301 GSS--------SGQKRKVEQRTVG-VPQRNMRPGDSFRSFQQSSGGAGDTTQERPVCNTC 351 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVDL 1067 K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + + Sbjct: 352 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAEK 411 Query: 1068 DTAIITG 1088 ++TG Sbjct: 412 AGTVVTG 418 >ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205308, partial [Cucumis sativus] Length = 768 Score = 163 bits (412), Expect = 1e-37 Identities = 96/306 (31%), Positives = 152/306 (49%), Gaps = 15/306 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ P F G+ DP AE W+ S+E IF++M+ + R++CA F+LRD +WW Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 179 Query: 390 ARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 + D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 180 TMCMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 239 Query: 564 
FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + F +GLR IR VR + TT EA+ A+ ++++ + +K Sbjct: 240 FAPEFVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299 Query: 744 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 S Q ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 300 GTS-SSQKRKAEQRIVGVPQRNLRSGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 350 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVDL 1067 K H G CL GT C+ CK+ GH A CPL R P +G +FA ++ + + Sbjct: 351 GKRHLGHCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSSEGARPPQRGTIFATSRSEAEK 410 Query: 1068 DTAIIT 1085 ++T Sbjct: 411 VGIVVT 416 >ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217872 [Cucumis sativus] Length = 461 Score = 162 bits (411), Expect = 2e-37 Identities = 99/307 (32%), Positives = 150/307 (48%), Gaps = 15/307 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ P F G+ DP E W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 71 RDFRKYDPQTFDGSLEDPTKVELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 130 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 131 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQDFDMLSR 190 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + F +GLR IR VR + TT EA+ A M D I+ Sbjct: 191 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLA-MDISIGKDEIRPMSFD 249 Query: 744 RLSYQGQDQQEPGKKKTIPGQNSGK-QPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 + S GQ ++ + +P +N F+ Q Q+ G G + + C C Sbjct: 250 KGSSSGQKRKVEQRTVGVPQRNMRPGDSFRSFQ--------QSSGGAGDTTQERPVCDTC 301 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVDL 1067 K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + + Sbjct: 302 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAEK 361 Query: 1068 DTAIITG 1088 ++TG Sbjct: 362 AGTVVTG 368 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. 
melo] Length = 871 Score = 162 bits (409), Expect = 3e-37 Identities = 102/306 (33%), Positives = 152/306 (49%), Gaps = 14/306 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE Sbjct: 333 RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 392 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+ Sbjct: 393 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 452 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 740 F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A Sbjct: 453 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 510 Query: 741 KRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 R S GQ ++ + +P +N F+ + Q+ G R K C+ C Sbjct: 511 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 563 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVDLD 1070 K H G+CL GT CF C++ GH A CPL P +GRVFA + + + Sbjct: 564 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRPTGIAQNQGAGAPLQGRVFATNRTEAEKA 623 Query: 1071 TAIITG 1088 ++TG Sbjct: 624 GTVVTG 629 >gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. 
melo] Length = 429 Score = 160 bits (404), Expect = 1e-36 Identities = 101/306 (33%), Positives = 151/306 (49%), Gaps = 14/306 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE Sbjct: 61 RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 120 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+ Sbjct: 121 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 180 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 740 F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A Sbjct: 181 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 238 Query: 741 KRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 R S GQ ++ + +P +N F+ + Q+ G R K C+ C Sbjct: 239 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 291 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVDLD 1070 K H G+CL GT CF C++ GH A CPL P +GR FA + + + Sbjct: 292 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRAFATNRTEAEKA 351 Query: 1071 TAIITG 1088 ++TG Sbjct: 352 GTVVTG 357 >ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] gi|548837601|gb|ERM98293.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] Length = 366 Score = 156 bits (394), Expect = 2e-35 Identities = 86/271 (31%), Positives = 136/271 (50%), Gaps = 2/271 (0%) Frame = +3 Query: 201 QPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVW 380 +P+Y++FR+ P F+G +DP+ AE W+R++E I ++M+L + D + CA +L+ DAR+W Sbjct: 102 EPIYERFRKQHPPNFEGGSDPMEAEEWLRTVEGIVEYMRLGNGDSVACAASLLKKDARIW 161 Query: 381 WEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGR 560 W+ + T D+A +TW DF +VF KY++ R+ EF LRQG +V EY R+F+R Sbjct: 162 WDVIKQTRDVAAMTWADFVQVFNKKYYSEAIRSARVNEFTNLRQGKSTVTEYARQFDRLA 221 Query: 561 YFVPMITSQPVEELKHFTEGLRPAIRHDVRLS--RVTTFREAVDQALMSERDRNDMIKEA 734 F + + 
FTEGL I D+ +S R TT+ E + A R ++ Sbjct: 222 KFATDLVPTEFLRIHRFTEGLDSRISRDIAMSGVRATTYAEKDNTARWEARKASN--GGG 279 Query: 735 QNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCS 914 NKR Q + + K GK+P+ VE + C Sbjct: 280 DNKRKLPSNQHNEADKRNKIGSNNYKGKKPY---------------------VEYPL-CP 317 Query: 915 KCEKIHAGQCLTGTDACFMCKKSGHFARECP 1007 C + H G+C C+ C + GH+ ++CP Sbjct: 318 TCGRKHPGECRLKGKTCYKCGQPGHYKKDCP 348 >ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis sativus] Length = 804 Score = 156 bits (394), Expect = 2e-35 Identities = 95/307 (30%), Positives = 151/307 (49%), Gaps = 15/307 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ P F G+ DP AE W+ +ETIF +M+ + R++CA F+LRD +WW Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLFYVETIFIYMRCPEEHRVQCAAFLLRDRGIIWWRT 179 Query: 390 A--RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 180 TIRMLGGDVRQITWNQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSC 239 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + + F + LR IR R + TT EA+ A+ ++++ + +K Sbjct: 240 FAPKLVGNEQARAERFVKRLRDEIRGFARALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299 Query: 744 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 S GQ ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 300 GTS-SGQKRKAEQRIVGVPQRNLRPGDPFRNFQ--------QSSGGAGDTTREKPLCNTC 350 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVDL 1067 K H G+CL GT C+ CK+ GH A C L P +G +FA ++ + + Sbjct: 351 GKRHLGRCLMGTRVCYKCKQEGHMADRCRLRSTGAGQSSQGAGPPQRGTIFATSRSEAEK 410 Query: 1068 DTAIITG 1088 ++TG Sbjct: 411 AGTVVTG 417 >ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229796, partial [Cucumis sativus] Length = 338 Score = 154 bits (389), Expect = 6e-35 Identities = 93/270 (34%), Positives = 137/270 (50%), Gaps = 4/270 (1%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ P F G+ DP AE W+ 
S+ETIF++M+ + R++CA F+LRD +W Sbjct: 61 RDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWCRT 120 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D+ +TW FK FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 121 TMRMLGGDVMQITWDQFKNCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 180 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + F +GLR IR VR + TT EA+ A+ + D I+ Sbjct: 181 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLAVDMGIGK-DEIRPRSFD 239 Query: 744 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 920 + S GQ ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 240 KGSSSGQKRKAEQRTVGVPQRNLRPGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 291 Query: 921 EKIHAGQCLTGTDACFMCKKSGHFARECPL 1010 K H G+CL GT C+ CK+ GH A CPL Sbjct: 292 GKRHLGRCLMGTRVCYKCKQEGHMADRCPL 321 >ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda] gi|548853397|gb|ERN11396.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda] Length = 383 Score = 150 bits (380), Expect = 7e-34 Identities = 95/268 (35%), Positives = 132/268 (49%), Gaps = 14/268 (5%) Frame = +3 Query: 330 DRIRCAIFMLRDDARVWWEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 509 DR++CA +MLR DAR+WWE T D+ T+ W DFK VF KY+ EF L Sbjct: 8 DRVKCASYMLRKDARIWWEVVEQTKDVDTMNWDDFKRVFNEKYYNSAVLAAKVDEFTGLV 67 Query: 510 QGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRL-SR-VTTFREAV 683 QG L+V EY ++F+R F P + F EGL+P + DV + SR ++ + V Sbjct: 68 QGSLTVTEYAQKFDRLAKFAPDLVPTDRVRAHRFVEGLKPMVARDVEIVSRGQFSYAQVV 127 Query: 684 DQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKT 863 + AL +ER N + KE +R S +G KK+ GQ+ QP R + Sbjct: 128 EMALTAERSENKIWKENAARRESKKGGANSNDHKKR---GQDQSGQP--SQDKRYKSDND 182 Query: 864 QAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPL------LREPT 1025 Q G+ R N C KC K H G+C AC+ C K GH R CPL EP Sbjct: 183 QRFNGSSGR--NIPECPKCTKRHLGEC--RAKACYKCGKEGHIKRNCPLWGQTGNRAEPK 238 Query: 1026 K------GRVFAMTQEQVDLDTAIITGE 1091 K RVFA+TQ + + ++++G+ Sbjct: 239 
KDDKYVPARVFAITQAEAEASPSVVSGQ 266 >ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210300 [Cucumis sativus] Length = 623 Score = 145 bits (365), Expect = 4e-32 Identities = 87/264 (32%), Positives = 135/264 (51%), Gaps = 3/264 (1%) Frame = +3 Query: 213 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 389 + FR+ F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 107 RDFRKYDLQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 166 Query: 390 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 563 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG +++ EY + F+ Sbjct: 167 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTIEEYDQEFDMLSR 226 Query: 564 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 743 F P + F +GLR IR VR + TT EA+ A+ + D I+ + Sbjct: 227 FAPELVGNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGK-DEIRASSFD 285 Query: 744 RLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCE 923 + S GQ ++ + IP +N + P Q++ G R K C+ C Sbjct: 286 KGSSSGQKRKVEQRTVGIPQRN-----LRLGDPFCSFQQSSGEAGDTTR--EKPVCNTCG 338 Query: 924 KIHAGQCLTGTDACFMCKKSGHFA 995 K H G+CL GT C+ C++ GH A Sbjct: 339 KHHLGRCLMGTRVCYKCRQEGHMA 362 >ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515713 [Cicer arietinum] Length = 943 Score = 139 bits (349), Expect = 3e-30 Identities = 92/347 (26%), Positives = 160/347 (46%), Gaps = 17/347 (4%) Frame = +3 Query: 102 LLHEQNRIHGEQIQQILQAREQGSTPRRSAP-STQPVYKQFRELGPTEFKGTTDPIAAEG 278 L+ +Q + + + Q+ P P ++ ++ F +L P F G+ P+ A+ Sbjct: 10 LMMQQQAVTTSIMNHLAQSVGPAHPPPPPPPEASNRLFYDFHKLKPPAFLGSLVPLEAQS 69 Query: 279 WIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGARLTVDLA--TLTWTDFKEVFYG 452 W+ + IF ++ T+ D++ A ML+ +A WW+GA+ + A + W +F VF Sbjct: 70 WLDEMTKIFLVVRCTEEDKVAFATHMLQGEAENWWKGAKAYMISAGTPMNWENFCTVFLD 129 Query: 453 KYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVE--ELKHFTEGLR 626 KY + R + EF L+QGD+SVA+YV +FE F P + ++ F GL Sbjct: 130 KYIPMSIRKQKEFEFTHLQQGDMSVADYVAKFEELARFCAQAEYAPNDRWKINQFEWGLN 189 Query: 627 
PAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQ----GQDQQEPGKKKT 794 P I+ ++ +T++ V ++ + E + +N++L +Q + K KT Sbjct: 190 PEIKSNLAQLEITSYATLVHKSYIVEESLRSL---KENRQLKWQQRRDAPKSNQQLKVKT 246 Query: 795 IPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMC 974 P N GKQP P+ + + C KC + H G+CL G + CF C Sbjct: 247 SP--NKGKQPQNSVVPQARGPR---------------ECPKCGRSHPGECLYGKNICFWC 289 Query: 975 KKSGHFARECPLLR--------EPTKGRVFAMTQEQVDLDTAIITGE 1091 K GH +++CP + P GRV+ + ++ + +I GE Sbjct: 290 KTPGHLSQDCPQRKMKGLANSNGPLTGRVYTLNAKKTKGNNDLIAGE 336 >gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa] gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group] gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1230 Score = 137 bits (346), Expect = 6e-30 Identities = 94/290 (32%), Positives = 141/290 (48%), Gaps = 2/290 (0%) Frame = +3 Query: 216 QFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR 395 +F++L P F GT +P+ AE WI ++E F+ M TD ++I A +ML+ A WW+ + Sbjct: 73 EFQKLKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHK 132 Query: 396 LTV-DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 572 + + +TW FKE FY KYF + +EFLEL+QG+ SVAEY F R F P Sbjct: 133 KSYSERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAP 192 Query: 573 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLS 752 + + F GLR ++ V +T FRE V +A + E+ ++ E Sbjct: 193 EFVQTDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLEKGYHEQRIE------- 245 Query: 753 YQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIH 932 GQ Q+ K KT QN G+ F+ Q +K+ +G +C C+ H Sbjct: 246 -HGQPQK---KFKTNNPQNQGR--FRGNYSGQMQRKSSENQGR--------KCPICQGSH 291 Query: 933 AGQ-CLTGTDACFMCKKSGHFARECPLLREPTKGRVFAMTQEQVDLDTAI 1079 C CF C ++GH +CPLL++ K RV + TQ + T + Sbjct: 292 VPSICPNCWGRCFECGEAGHTRYQCPLLQK-GKNRVSSTTQPNTKVLTPV 340 >gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao] Length 
= 404 Score = 136 bits (342), Expect = 2e-29 Identities = 86/281 (30%), Positives = 124/281 (44%), Gaps = 10/281 (3%) Frame = +3 Query: 213 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 392 K+ R+LG F G D A+ WI + M L D ++ A +L AR WW Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMGLDDDMKLMVATRLLEKRARTWWNSV 173 Query: 393 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 572 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232 Query: 573 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 740 + ++ +F EGLR IR + + +E V AL +E+ +R K A+ Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTKFAKR 292 Query: 741 KRLSYQGQDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 902 + L + GK G S + PF +Q R A+ G+G + Sbjct: 293 RNLGMSSSQPVKRGKDSATSGSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSLGGF 352 Query: 903 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 1025 RC C H+G C G CF C ++GH CP L T Sbjct: 353 DRCRNCGNYHSGLC-RGPTRCFQCGQTGHIRSNCPQLGRAT 392 >ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355506807|gb|AES87949.1| Pol polyprotein [Medicago truncatula] Length = 745 Score = 135 bits (341), Expect = 2e-29 Identities = 93/332 (28%), Positives = 149/332 (44%), Gaps = 15/332 (4%) Frame = +3 Query: 138 IQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQ 317 +Q + QA Q A + + + F + P FKG DP A+ W++ +E IF MQ Sbjct: 13 LQAVAQAVGQQPNVNAGANAEARMLETFMKKNPPTFKGRCDPDGAQTWLKEIERIFRVMQ 72 Query: 318 LTDADRIRCAIFMLRDDARVWWEGARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAR 491 T+ ++R L ++A WW T+ + A +TW F+ F +YF D R + Sbjct: 73 CTEDQKVRFGTHQLAEEADDWWVALLPTLGQEGAVVTWAVFRREFLRRYFPEDVRGKKEI 132 Query: 492 EFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELK--HFTEGLRPAIRHDVRLSRVT 665 EFLEL+QG++SV EY +F F P T++ E + F GLRP I+ + ++ Sbjct: 133 EFLELKQGNMSVTEYAAKFVELSKFYPHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLR 192 Query: 666 TFREAVDQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPR 845 F++ V+ + E D K ++ G+ QQ K + P ++ 
+P+ Sbjct: 193 VFQDLVNSCRIYEEDTKAHYKVVNERK----GKGQQSRPKPYSAPADKGKQKMVDVRRPK 248 Query: 846 QQI----------QKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFA 995 ++ +K ++ VRC K + A C CF C GH + Sbjct: 249 KKDAAEIVYFNCGEKGHKSNACPEEIKKCVRCGKKGHVVA-DCNRTDIVCFNCNGEGHIS 307 Query: 996 RECPL-LREPTKGRVFAMTQEQVDLDTAIITG 1088 +C R PT GRVFA+T Q + + +I G Sbjct: 308 SQCTQPKRAPTTGRVFALTGTQTESEDRLIRG 339 >gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 135 bits (340), Expect = 3e-29 Identities = 85/281 (30%), Positives = 125/281 (44%), Gaps = 10/281 (3%) Frame = +3 Query: 213 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 392 K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW Sbjct: 50 KEARQLGCVSFTGELDATVAKDWINQVSKTLSDMRLDDDMKLMVATRLLEKRARTWWNSV 109 Query: 393 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 572 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 110 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 168 Query: 573 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 740 + ++ +F EGLR IR + ++ +E V AL +E+ +R + A+ Sbjct: 169 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLAIENRRIRTEFAKR 228 Query: 741 KRLSYQGQDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 902 + + GK I G S + PF +Q R A+ G+G Sbjct: 229 RNPGMSSSQPVKRGKDSAISGSTTSVSVTSPRPPFPPSQQRPSRFSRSAMTGSGRSFGGS 288 Query: 903 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 1025 RC C H+G C T CF C ++GH CP L T Sbjct: 289 DRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 328 >emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera] Length = 360 Score = 134 bits (337), Expect = 6e-29 Identities = 95/320 (29%), Positives = 141/320 (44%), Gaps = 2/320 (0%) Frame = +3 Query: 81 FLEGLTALLHEQNRIHGEQIQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTD 260 +L L L+ Q R G +Q Q S+ R S+ + F++LGP F G TD Sbjct: 69 YLGTLAGLVERQARAVGTNVQG------QSSSSRGSS------FDDFKKLGPPYFSGATD 116 Query: 261 PIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR-LTVDLATLTWTDFK 437 P AE WI +E F + ++ + A 
FML + WW R L D +TW F+ Sbjct: 117 PTEAEAWILKMEKFFGVIDCSEEQKASYAAFMLDKETDHWWRMTRRLLEDQGPITWRQFR 176 Query: 438 EVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTE 617 E FY KYF R + EF+ L QGD++VA+Y +F F P + + E+ F + Sbjct: 177 EAFYKKYFPDSVRRQKVGEFIRLEQGDMTVAQYEAKFTELSRFSPQLIATEEEKALKFQD 236 Query: 618 GLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTI 797 L+P +++ + + + E +Q +R+R+D Q +R S G++Q Sbjct: 237 XLKPYLKNKXSILXLGXYSEYREQ--QRKRNRSDGAHGNQXQRRSTSGRNQ--------- 285 Query: 798 PGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQ-CLTGTDACFMC 974 N GK Q ++G C C K H G+ C T ACF C Sbjct: 286 ---NKGKA-------------AQNLDGA---------CPTCGKKHGGRPCYRETGACFGC 320 Query: 975 KKSGHFARECPLLREPTKGR 1034 K GH R+CP R+ G+ Sbjct: 321 GKQGHLIRDCPENRKFITGK 340 >gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao] Length = 474 Score = 134 bits (336), Expect = 8e-29 Identities = 87/282 (30%), Positives = 131/282 (46%), Gaps = 11/282 (3%) Frame = +3 Query: 213 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 392 K+ R+LG T F G D AA+ WI + F M+L D ++ A +L AR WW Sbjct: 127 KEARQLGCTSFIGDLDATAAKDWITQVTETFVDMKLDDDMKLMVATRLLEKRARTWWSSV 186 Query: 393 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 572 + + + +LTW DF + F G+Y+T ++ REFL L+QG+L++ EY RF +VP Sbjct: 187 K-SRSITSLTWIDFLQEFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVP 245 Query: 573 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRND---MIKEAQNK 743 + ++ +F EGLR IR + ++ +E V AL +E+ N+ M E + Sbjct: 246 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKR 305 Query: 744 RLSYQGQDQQEPGKKKTIPGQN-------SGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 902 R Q K T +N S + P Q Q R + T + Sbjct: 306 RNPNVSSSQLPKRGKDTFASENTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGG 365 Query: 903 V-RCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 1025 + +C KC + H G+C CF C +SGH +CP L T Sbjct: 366 LNKCEKCGRYHVGEC--WGIRCFHCDQSGHIRSDCPQLGRAT 405 >gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao] Length = 426 Score = 133 bits (335), Expect = 1e-28 
Identities = 88/283 (31%), Positives = 128/283 (45%), Gaps = 12/283 (4%) Frame = +3 Query: 213 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 392 K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMKLNDDMKLMVATRLLEKRARTWWNSV 173 Query: 393 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 572 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232 Query: 573 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSE-------RDRNDMIKE 731 + ++ +F EGLR IR + ++ +E V AL +E R R + K Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLATENRRIRTEFAKR 292 Query: 732 AQNKRLSY-----QGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVE 896 +N +SY +G+D + T S + PF +Q R A+ G+G Sbjct: 293 -RNPGMSYSQSVKRGKD-SAISRSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSFG 350 Query: 897 NKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 1025 RC C H+G C T CF C ++GH CP L T Sbjct: 351 GSDRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 392 >gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [Prunus persica] Length = 590 Score = 133 bits (334), Expect = 1e-28 Identities = 96/324 (29%), Positives = 156/324 (48%), Gaps = 25/324 (7%) Frame = +3 Query: 156 AREQGSTPRRSAPSTQPVYKQFRELGPTE-FKGTTDPIAAEGWIRSLETIFDFMQLTDAD 332 A++ P+R ++ ++ +G T F GT DP AEGWI +E I + M + Sbjct: 13 AQDSARIPKRKLGRVLSIW--YQSIGFTSYFDGTGDPAVAEGWIERMERIMEVMAVPQDR 70 Query: 333 RIRCAIFMLRDDARVWWEGA-RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 509 R+ A F L +AR WWE R D + ++W F+ VF +Y+ + +EFL+L Sbjct: 71 RVLLASFFLIGNARHWWESIKRRYPDPSVISWPVFRAVFNSQYYPQAYQNLKMQEFLQLD 130 Query: 510 QGDLSVAEYVRRF-ERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVD 686 QG ++V EY ++F E +Y +P++ + ++ + FT+GL+ +IR V R+T F + V Sbjct: 131 QGLMTVLEYEKKFNELSKYCIPLVEDES-KKCQLFTKGLKASIRDIVISQRLTNFGDLVM 189 Query: 687 QALMSERDRNDMIKEAQ---NKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQ 857 A + E + M+ AQ +RL G Q K+ + +S + F+ +P Sbjct: 190 
SASLVE--SSQMMVRAQGEPRRRLFDLGGPSQGSSKRGSYSAGSSRGRSFRGFRPGISSS 247 Query: 858 -------------KTQAVEGTGFRVENKV------RCSKCEKIHAGQCLTGTDACFMCKK 980 AV G+G + + V +C+ C + H G C GT CF C + Sbjct: 248 GGSNRSGSFGSRLVGNAVRGSGRQSPSAVGGRRNPQCTVCGRYHTGTCRQGTTGCFHCGQ 307 Query: 981 SGHFARECPLLREPTKGRVFAMTQ 1052 GHF RECP+L + + V T+ Sbjct: 308 PGHFLRECPVLLQGGEATVTMPTE 331 >ref|XP_004300999.1| PREDICTED: uncharacterized protein LOC101305853 [Fragaria vesca subsp. vesca] Length = 327 Score = 132 bits (332), Expect = 2e-28 Identities = 90/265 (33%), Positives = 132/265 (49%), Gaps = 6/265 (2%) Frame = +3 Query: 228 LGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGARLTVD 407 LG F G TD + A+ WI +ET F + T+ +++R A F+L+D+ARVWW G D Sbjct: 69 LGAPSFLGGTDFLVADHWIEGMETYFTLITCTEIEKMRIATFLLKDEARVWWNGVERARD 128 Query: 408 LATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQ 587 + L+W F ++F KYF R +L EF+ L QG +SV +Y RF + F + + Sbjct: 129 VTALSWEGFVQLFREKYFPDTVREQLELEFIALVQGLMSVRDYKARFSQLYRFAREMDA- 187 Query: 588 PVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGQD 767 V + F GLR +R+ V R T EAV+ AL E++ + + EA+ R Sbjct: 188 -VALPRKFIRGLRHKLRNVVSSHRFATLAEAVESALAVEQE--EAMHEAEGLR------- 237 Query: 768 QQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQA---VEGTGFRVENKVRCSKCEKIH-- 932 GK K + G SG + A ++Q QA V R +RC +C+ + Sbjct: 238 -DVHGKGKAVAG-GSGSEGLHGASGKRQRTDQQALATVPAAPIRQVEPLRCYRCDGLGHI 295 Query: 933 AGQC-LTGTDACFMCKKSGHFAREC 1004 A +C T AC+ C + GH AREC Sbjct: 296 ARECHKRKTQACYSCGQVGHLAREC 320