BLASTX nr result
ID: Zingiber23_contig00028615
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00028615
         (1285 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...   168   5e-39
ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205...   163   2e-37
ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217...   162   2e-37
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    162   4e-37
gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]    160   1e-36
ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, p...   156   2e-35
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...   156   2e-35
ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229...   154   8e-35
ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [A...   152   3e-34
ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210...   145   5e-32
ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515...   139   3e-30
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...   137   7e-30
gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao]             136   2e-29
ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355...   135   3e-29
gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma ...   135   4e-29
emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]   134   8e-29
gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao]             134   1e-28
gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao]             133   1e-28
gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [...
133 2e-28 ref|XP_004300999.1| PREDICTED: uncharacterized protein LOC101305... 132 3e-28 >ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus] Length = 655 Score = 168 bits (425), Expect = 5e-39 Identities = 99/307 (32%), Positives = 155/307 (50%), Gaps = 15/307 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ P F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 121 RDFRKYDPQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 180 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 181 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 240 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + S F +GLR IR VR + TT EA+ A+ +++ + NK Sbjct: 241 FAPELVSNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDERQPRSFNK 300 Query: 545 RLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQK-TQAVEGTGFRVENKVRCSKC 369 S G+K+ + + G P + +P + Q+ G G + + C+ C Sbjct: 301 GSS--------SGQKRKVEQRTVG-VPQRNMRPGDSFRSFQQSSGGAGDTTQERPVCNTC 351 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVDL 222 K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + + Sbjct: 352 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAEK 411 Query: 221 DTAIITG 201 ++TG Sbjct: 412 AGTVVTG 418 >ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205308, partial [Cucumis sativus] Length = 768 Score = 163 bits (412), Expect = 2e-37 Identities = 96/306 (31%), Positives = 152/306 (49%), Gaps = 15/306 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ P F G+ DP AE W+ S+E IF++M+ + R++CA F+LRD +WW Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 179 Query: 899 ARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 + D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 180 TMCMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 239 Query: 725 
FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + F +GLR IR VR + TT EA+ A+ ++++ + +K Sbjct: 240 FAPEFVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299 Query: 545 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 S Q ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 300 GTS-SSQKRKAEQRIVGVPQRNLRSGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 350 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVDL 222 K H G CL GT C+ CK+ GH A CPL R P +G +FA ++ + + Sbjct: 351 GKRHLGHCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSSEGARPPQRGTIFATSRSEAEK 410 Query: 221 DTAIIT 204 ++T Sbjct: 411 VGIVVT 416 >ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217872 [Cucumis sativus] Length = 461 Score = 162 bits (411), Expect = 2e-37 Identities = 99/307 (32%), Positives = 150/307 (48%), Gaps = 15/307 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ P F G+ DP E W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 71 RDFRKYDPQTFDGSLEDPTKVELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 130 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 131 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQDFDMLSR 190 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + F +GLR IR VR + TT EA+ A M D I+ Sbjct: 191 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLA-MDISIGKDEIRPMSFD 249 Query: 545 RLSYQGQDQQEPGKKKTIPGQNSGK-QPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 + S GQ ++ + +P +N F+ Q Q+ G G + + C C Sbjct: 250 KGSSSGQKRKVEQRTVGVPQRNMRPGDSFRSFQ--------QSSGGAGDTTQERPVCDTC 301 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVDL 222 K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + + Sbjct: 302 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAEK 361 Query: 221 DTAIITG 201 ++TG Sbjct: 362 AGTVVTG 368 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. 
melo] Length = 871 Score = 162 bits (409), Expect = 4e-37 Identities = 102/306 (33%), Positives = 152/306 (49%), Gaps = 14/306 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE Sbjct: 333 RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 392 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+ Sbjct: 393 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 452 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 549 F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A Sbjct: 453 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 510 Query: 548 KRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 R S GQ ++ + +P +N F+ + Q+ G R K C+ C Sbjct: 511 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 563 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVDLD 219 K H G+CL GT CF C++ GH A CPL P +GRVFA + + + Sbjct: 564 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRPTGIAQNQGAGAPLQGRVFATNRTEAEKA 623 Query: 218 TAIITG 201 ++TG Sbjct: 624 GTVVTG 629 >gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. 
melo] Length = 429 Score = 160 bits (404), Expect = 1e-36 Identities = 101/306 (33%), Positives = 151/306 (49%), Gaps = 14/306 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE Sbjct: 61 RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 120 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+ Sbjct: 121 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 180 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 549 F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A Sbjct: 181 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 238 Query: 548 KRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 R S GQ ++ + +P +N F+ + Q+ G R K C+ C Sbjct: 239 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 291 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVDLD 219 K H G+CL GT CF C++ GH A CPL P +GR FA + + + Sbjct: 292 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRAFATNRTEAEKA 351 Query: 218 TAIITG 201 ++TG Sbjct: 352 GTVVTG 357 >ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] gi|548837601|gb|ERM98293.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] Length = 366 Score = 156 bits (394), Expect = 2e-35 Identities = 86/271 (31%), Positives = 136/271 (50%), Gaps = 2/271 (0%) Frame = -3 Query: 1088 QPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVW 909 +P+Y++FR+ P F+G +DP+ AE W+R++E I ++M+L + D + CA +L+ DAR+W Sbjct: 102 EPIYERFRKQHPPNFEGGSDPMEAEEWLRTVEGIVEYMRLGNGDSVACAASLLKKDARIW 161 Query: 908 WEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGR 729 W+ + T D+A +TW DF +VF KY++ R+ EF LRQG +V EY R+F+R Sbjct: 162 WDVIKQTRDVAAMTWADFVQVFNKKYYSEAIRSARVNEFTNLRQGKSTVTEYARQFDRLA 221 Query: 728 YFVPMITSQPVEELKHFTEGLRPAIRHDVRLS--RVTTFREAVDQALMSERDRNDMIKEA 555 F + + 
FTEGL I D+ +S R TT+ E + A R ++ Sbjct: 222 KFATDLVPTEFLRIHRFTEGLDSRISRDIAMSGVRATTYAEKDNTARWEARKASN--GGG 279 Query: 554 QNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCS 375 NKR Q + + K GK+P+ VE + C Sbjct: 280 DNKRKLPSNQHNEADKRNKIGSNNYKGKKPY---------------------VEYPL-CP 317 Query: 374 KCEKIHAGQCLTGTDACFMCKKSGHFARECP 282 C + H G+C C+ C + GH+ ++CP Sbjct: 318 TCGRKHPGECRLKGKTCYKCGQPGHYKKDCP 348 >ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis sativus] Length = 804 Score = 156 bits (394), Expect = 2e-35 Identities = 95/307 (30%), Positives = 151/307 (49%), Gaps = 15/307 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ P F G+ DP AE W+ +ETIF +M+ + R++CA F+LRD +WW Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLFYVETIFIYMRCPEEHRVQCAAFLLRDRGIIWWRT 179 Query: 899 A--RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 180 TIRMLGGDVRQITWNQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSC 239 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + + F + LR IR R + TT EA+ A+ ++++ + +K Sbjct: 240 FAPKLVGNEQARAERFVKRLRDEIRGFARALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299 Query: 545 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 S GQ ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 300 GTS-SGQKRKAEQRIVGVPQRNLRPGDPFRNFQ--------QSSGGAGDTTREKPLCNTC 350 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVDL 222 K H G+CL GT C+ CK+ GH A C L P +G +FA ++ + + Sbjct: 351 GKRHLGRCLMGTRVCYKCKQEGHMADRCRLRSTGAGQSSQGAGPPQRGTIFATSRSEAEK 410 Query: 221 DTAIITG 201 ++TG Sbjct: 411 AGTVVTG 417 >ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229796, partial [Cucumis sativus] Length = 338 Score = 154 bits (389), Expect = 8e-35 Identities = 93/270 (34%), Positives = 137/270 (50%), Gaps = 4/270 (1%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ P F G+ DP AE W+ 
S+ETIF++M+ + R++CA F+LRD +W Sbjct: 61 RDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWCRT 120 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D+ +TW FK FY K+F+ + R ++EFLEL+QG ++V EY + F+ Sbjct: 121 TMRMLGGDVMQITWDQFKNCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 180 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + F +GLR IR VR + TT EA+ A+ + D I+ Sbjct: 181 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLAVDMGIGK-DEIRPRSFD 239 Query: 545 RLSYQGQDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 369 + S GQ ++ + +P +N PF+ Q Q+ G G K C+ C Sbjct: 240 KGSSSGQKRKAEQRTVGVPQRNLRPGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 291 Query: 368 EKIHAGQCLTGTDACFMCKKSGHFARECPL 279 K H G+CL GT C+ CK+ GH A CPL Sbjct: 292 GKRHLGRCLMGTRVCYKCKQEGHMADRCPL 321 >ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda] gi|548853397|gb|ERN11396.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda] Length = 383 Score = 152 bits (384), Expect = 3e-34 Identities = 97/275 (35%), Positives = 135/275 (49%), Gaps = 14/275 (5%) Frame = -3 Query: 959 DRIRCAIFMLRDDARVWWEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 780 DR++CA +MLR DAR+WWE T D+ T+ W DFK VF KY+ EF L Sbjct: 8 DRVKCASYMLRKDARIWWEVVEQTKDVDTMNWDDFKRVFNEKYYNSAVLAAKVDEFTGLV 67 Query: 779 QGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRL-SR-VTTFREAV 606 QG L+V EY ++F+R F P + F EGL+P + DV + SR ++ + V Sbjct: 68 QGSLTVTEYAQKFDRLAKFAPDLVPTDRVRAHRFVEGLKPMVARDVEIVSRGQFSYAQVV 127 Query: 605 DQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKT 426 + AL +ER N + KE +R S +G KK+ GQ+ QP R + Sbjct: 128 EMALTAERSENKIWKENAARRESKKGGANSNDHKKR---GQDQSGQP--SQDKRYKSDND 182 Query: 425 QAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPL------LREPT 264 Q G+ R N C KC K H G+C AC+ C K GH R CPL EP Sbjct: 183 QRFNGSSGR--NIPECPKCTKRHLGEC--RAKACYKCGKEGHIKRNCPLWGQTGNRAEPK 238 Query: 263 K------GRVFAMTQEQVDLDTAIITGEDVSLDTT 177 K RVFA+TQ + + ++++G+ +TT Sbjct: 239 
KDDKYVPARVFAITQAEAEASPSVVSGQIPMANTT 273 >ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210300 [Cucumis sativus] Length = 623 Score = 145 bits (365), Expect = 5e-32 Identities = 87/264 (32%), Positives = 135/264 (51%), Gaps = 3/264 (1%) Frame = -3 Query: 1076 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 900 + FR+ F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW Sbjct: 107 RDFRKYDLQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 166 Query: 899 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 726 L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG +++ EY + F+ Sbjct: 167 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTIEEYDQEFDMLSR 226 Query: 725 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 546 F P + F +GLR IR VR + TT EA+ A+ + D I+ + Sbjct: 227 FAPELVGNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGK-DEIRASSFD 285 Query: 545 RLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCE 366 + S GQ ++ + IP +N + P Q++ G R K C+ C Sbjct: 286 KGSSSGQKRKVEQRTVGIPQRN-----LRLGDPFCSFQQSSGEAGDTTR--EKPVCNTCG 338 Query: 365 KIHAGQCLTGTDACFMCKKSGHFA 294 K H G+CL GT C+ C++ GH A Sbjct: 339 KHHLGRCLMGTRVCYKCRQEGHMA 362 >ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515713 [Cicer arietinum] Length = 943 Score = 139 bits (349), Expect = 3e-30 Identities = 92/347 (26%), Positives = 160/347 (46%), Gaps = 17/347 (4%) Frame = -3 Query: 1187 LLHEQNRIHGEQIQQILQAREQGSTPRRSAP-STQPVYKQFRELGPTEFKGTTDPIAAEG 1011 L+ +Q + + + Q+ P P ++ ++ F +L P F G+ P+ A+ Sbjct: 10 LMMQQQAVTTSIMNHLAQSVGPAHPPPPPPPEASNRLFYDFHKLKPPAFLGSLVPLEAQS 69 Query: 1010 WIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGARLTVDLA--TLTWTDFKEVFYG 837 W+ + IF ++ T+ D++ A ML+ +A WW+GA+ + A + W +F VF Sbjct: 70 WLDEMTKIFLVVRCTEEDKVAFATHMLQGEAENWWKGAKAYMISAGTPMNWENFCTVFLD 129 Query: 836 KYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVE--ELKHFTEGLR 663 KY + R + EF L+QGD+SVA+YV +FE F P + ++ F GL Sbjct: 130 KYIPMSIRKQKEFEFTHLQQGDMSVADYVAKFEELARFCAQAEYAPNDRWKINQFEWGLN 189 Query: 662 
PAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQ----GQDQQEPGKKKT 495 P I+ ++ +T++ V ++ + E + +N++L +Q + K KT Sbjct: 190 PEIKSNLAQLEITSYATLVHKSYIVEESLRSL---KENRQLKWQQRRDAPKSNQQLKVKT 246 Query: 494 IPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMC 315 P N GKQP P+ + + C KC + H G+CL G + CF C Sbjct: 247 SP--NKGKQPQNSVVPQARGPR---------------ECPKCGRSHPGECLYGKNICFWC 289 Query: 314 KKSGHFARECPLLR--------EPTKGRVFAMTQEQVDLDTAIITGE 198 K GH +++CP + P GRV+ + ++ + +I GE Sbjct: 290 KTPGHLSQDCPQRKMKGLANSNGPLTGRVYTLNAKKTKGNNDLIAGE 336 >gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa] gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group] gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1230 Score = 137 bits (346), Expect = 7e-30 Identities = 94/290 (32%), Positives = 141/290 (48%), Gaps = 2/290 (0%) Frame = -3 Query: 1073 QFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR 894 +F++L P F GT +P+ AE WI ++E F+ M TD ++I A +ML+ A WW+ + Sbjct: 73 EFQKLKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHK 132 Query: 893 LTV-DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 717 + + +TW FKE FY KYF + +EFLEL+QG+ SVAEY F R F P Sbjct: 133 KSYSERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAP 192 Query: 716 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLS 537 + + F GLR ++ V +T FRE V +A + E+ ++ E Sbjct: 193 EFVQTDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLEKGYHEQRIE------- 245 Query: 536 YQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIH 357 GQ Q+ K KT QN G+ F+ Q +K+ +G +C C+ H Sbjct: 246 -HGQPQK---KFKTNNPQNQGR--FRGNYSGQMQRKSSENQGR--------KCPICQGSH 291 Query: 356 AGQ-CLTGTDACFMCKKSGHFARECPLLREPTKGRVFAMTQEQVDLDTAI 210 C CF C ++GH +CPLL++ K RV + TQ + T + Sbjct: 292 VPSICPNCWGRCFECGEAGHTRYQCPLLQK-GKNRVSSTTQPNTKVLTPV 340 >gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao] Length 
= 404 Score = 136 bits (342), Expect = 2e-29 Identities = 86/281 (30%), Positives = 124/281 (44%), Gaps = 10/281 (3%) Frame = -3 Query: 1076 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 897 K+ R+LG F G D A+ WI + M L D ++ A +L AR WW Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMGLDDDMKLMVATRLLEKRARTWWNSV 173 Query: 896 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 717 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232 Query: 716 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 549 + ++ +F EGLR IR + + +E V AL +E+ +R K A+ Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTKFAKR 292 Query: 548 KRLSYQGQDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 387 + L + GK G S + PF +Q R A+ G+G + Sbjct: 293 RNLGMSSSQPVKRGKDSATSGSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSLGGF 352 Query: 386 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 264 RC C H+G C G CF C ++GH CP L T Sbjct: 353 DRCRNCGNYHSGLC-RGPTRCFQCGQTGHIRSNCPQLGRAT 392 >ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355506807|gb|AES87949.1| Pol polyprotein [Medicago truncatula] Length = 745 Score = 135 bits (341), Expect = 3e-29 Identities = 93/332 (28%), Positives = 149/332 (44%), Gaps = 15/332 (4%) Frame = -3 Query: 1151 IQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQ 972 +Q + QA Q A + + + F + P FKG DP A+ W++ +E IF MQ Sbjct: 13 LQAVAQAVGQQPNVNAGANAEARMLETFMKKNPPTFKGRCDPDGAQTWLKEIERIFRVMQ 72 Query: 971 LTDADRIRCAIFMLRDDARVWWEGARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAR 798 T+ ++R L ++A WW T+ + A +TW F+ F +YF D R + Sbjct: 73 CTEDQKVRFGTHQLAEEADDWWVALLPTLGQEGAVVTWAVFRREFLRRYFPEDVRGKKEI 132 Query: 797 EFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELK--HFTEGLRPAIRHDVRLSRVT 624 EFLEL+QG++SV EY +F F P T++ E + F GLRP I+ + ++ Sbjct: 133 EFLELKQGNMSVTEYAAKFVELSKFYPHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLR 192 Query: 623 TFREAVDQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPR 444 F++ V+ + E D K ++ G+ QQ K + P ++ 
+P+ Sbjct: 193 VFQDLVNSCRIYEEDTKAHYKVVNERK----GKGQQSRPKPYSAPADKGKQKMVDVRRPK 248 Query: 443 QQI----------QKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFA 294 ++ +K ++ VRC K + A C CF C GH + Sbjct: 249 KKDAAEIVYFNCGEKGHKSNACPEEIKKCVRCGKKGHVVA-DCNRTDIVCFNCNGEGHIS 307 Query: 293 RECPL-LREPTKGRVFAMTQEQVDLDTAIITG 201 +C R PT GRVFA+T Q + + +I G Sbjct: 308 SQCTQPKRAPTTGRVFALTGTQTESEDRLIRG 339 >gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 135 bits (340), Expect = 4e-29 Identities = 85/281 (30%), Positives = 125/281 (44%), Gaps = 10/281 (3%) Frame = -3 Query: 1076 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 897 K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW Sbjct: 50 KEARQLGCVSFTGELDATVAKDWINQVSKTLSDMRLDDDMKLMVATRLLEKRARTWWNSV 109 Query: 896 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 717 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 110 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 168 Query: 716 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 549 + ++ +F EGLR IR + ++ +E V AL +E+ +R + A+ Sbjct: 169 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLAIENRRIRTEFAKR 228 Query: 548 KRLSYQGQDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 387 + + GK I G S + PF +Q R A+ G+G Sbjct: 229 RNPGMSSSQPVKRGKDSAISGSTTSVSVTSPRPPFPPSQQRPSRFSRSAMTGSGRSFGGS 288 Query: 386 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 264 RC C H+G C T CF C ++GH CP L T Sbjct: 289 DRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 328 >emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera] Length = 360 Score = 134 bits (337), Expect = 8e-29 Identities = 95/320 (29%), Positives = 141/320 (44%), Gaps = 2/320 (0%) Frame = -3 Query: 1208 FLEGLTALLHEQNRIHGEQIQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTD 1029 +L L L+ Q R G +Q Q S+ R S+ + F++LGP F G TD Sbjct: 69 YLGTLAGLVERQARAVGTNVQG------QSSSSRGSS------FDDFKKLGPPYFSGATD 116 Query: 1028 PIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR-LTVDLATLTWTDFK 852 P AE WI +E F + ++ 
+ A FML + WW R L D +TW F+ Sbjct: 117 PTEAEAWILKMEKFFGVIDCSEEQKASYAAFMLDKETDHWWRMTRRLLEDQGPITWRQFR 176 Query: 851 EVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTE 672 E FY KYF R + EF+ L QGD++VA+Y +F F P + + E+ F + Sbjct: 177 EAFYKKYFPDSVRRQKVGEFIRLEQGDMTVAQYEAKFTELSRFSPQLIATEEEKALKFQD 236 Query: 671 GLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGQDQQEPGKKKTI 492 L+P +++ + + + E +Q +R+R+D Q +R S G++Q Sbjct: 237 XLKPYLKNKXSILXLGXYSEYREQ--QRKRNRSDGAHGNQXQRRSTSGRNQ--------- 285 Query: 491 PGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQ-CLTGTDACFMC 315 N GK Q ++G C C K H G+ C T ACF C Sbjct: 286 ---NKGKA-------------AQNLDGA---------CPTCGKKHGGRPCYRETGACFGC 320 Query: 314 KKSGHFARECPLLREPTKGR 255 K GH R+CP R+ G+ Sbjct: 321 GKQGHLIRDCPENRKFITGK 340 >gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao] Length = 474 Score = 134 bits (336), Expect = 1e-28 Identities = 87/282 (30%), Positives = 131/282 (46%), Gaps = 11/282 (3%) Frame = -3 Query: 1076 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 897 K+ R+LG T F G D AA+ WI + F M+L D ++ A +L AR WW Sbjct: 127 KEARQLGCTSFIGDLDATAAKDWITQVTETFVDMKLDDDMKLMVATRLLEKRARTWWSSV 186 Query: 896 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 717 + + + +LTW DF + F G+Y+T ++ REFL L+QG+L++ EY RF +VP Sbjct: 187 K-SRSITSLTWIDFLQEFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVP 245 Query: 716 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRND---MIKEAQNK 546 + ++ +F EGLR IR + ++ +E V AL +E+ N+ M E + Sbjct: 246 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKR 305 Query: 545 RLSYQGQDQQEPGKKKTIPGQN-------SGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 387 R Q K T +N S + P Q Q R + T + Sbjct: 306 RNPNVSSSQLPKRGKDTFASENTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGG 365 Query: 386 V-RCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 264 + +C KC + H G+C CF C +SGH +CP L T Sbjct: 366 LNKCEKCGRYHVGEC--WGIRCFHCDQSGHIRSDCPQLGRAT 405 >gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao] Length = 426 Score = 133 bits (335), Expect = 1e-28 
Identities = 88/283 (31%), Positives = 128/283 (45%), Gaps = 12/283 (4%) Frame = -3 Query: 1076 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 897 K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMKLNDDMKLMVATRLLEKRARTWWNSV 173 Query: 896 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 717 + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232 Query: 716 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSE-------RDRNDMIKE 558 + ++ +F EGLR IR + ++ +E V AL +E R R + K Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLATENRRIRTEFAKR 292 Query: 557 AQNKRLSY-----QGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVE 393 +N +SY +G+D + T S + PF +Q R A+ G+G Sbjct: 293 -RNPGMSYSQSVKRGKD-SAISRSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSFG 350 Query: 392 NKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 264 RC C H+G C T CF C ++GH CP L T Sbjct: 351 GSDRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 392 >gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [Prunus persica] Length = 590 Score = 133 bits (334), Expect = 2e-28 Identities = 96/324 (29%), Positives = 156/324 (48%), Gaps = 25/324 (7%) Frame = -3 Query: 1133 AREQGSTPRRSAPSTQPVYKQFRELGPTE-FKGTTDPIAAEGWIRSLETIFDFMQLTDAD 957 A++ P+R ++ ++ +G T F GT DP AEGWI +E I + M + Sbjct: 13 AQDSARIPKRKLGRVLSIW--YQSIGFTSYFDGTGDPAVAEGWIERMERIMEVMAVPQDR 70 Query: 956 RIRCAIFMLRDDARVWWEGA-RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 780 R+ A F L +AR WWE R D + ++W F+ VF +Y+ + +EFL+L Sbjct: 71 RVLLASFFLIGNARHWWESIKRRYPDPSVISWPVFRAVFNSQYYPQAYQNLKMQEFLQLD 130 Query: 779 QGDLSVAEYVRRF-ERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVD 603 QG ++V EY ++F E +Y +P++ + ++ + FT+GL+ +IR V R+T F + V Sbjct: 131 QGLMTVLEYEKKFNELSKYCIPLVEDES-KKCQLFTKGLKASIRDIVISQRLTNFGDLVM 189 Query: 602 QALMSERDRNDMIKEAQ---NKRLSYQGQDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQ 432 A + E + M+ AQ +RL G Q K+ + +S + F+ +P Sbjct: 190 
SASLVE--SSQMMVRAQGEPRRRLFDLGGPSQGSSKRGSYSAGSSRGRSFRGFRPGISSS 247 Query: 431 -------------KTQAVEGTGFRVENKV------RCSKCEKIHAGQCLTGTDACFMCKK 309 AV G+G + + V +C+ C + H G C GT CF C + Sbjct: 248 GGSNRSGSFGSRLVGNAVRGSGRQSPSAVGGRRNPQCTVCGRYHTGTCRQGTTGCFHCGQ 307 Query: 308 SGHFARECPLLREPTKGRVFAMTQ 237 GHF RECP+L + + V T+ Sbjct: 308 PGHFLRECPVLLQGGEATVTMPTE 331 >ref|XP_004300999.1| PREDICTED: uncharacterized protein LOC101305853 [Fragaria vesca subsp. vesca] Length = 327 Score = 132 bits (332), Expect = 3e-28 Identities = 90/265 (33%), Positives = 132/265 (49%), Gaps = 6/265 (2%) Frame = -3 Query: 1061 LGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGARLTVD 882 LG F G TD + A+ WI +ET F + T+ +++R A F+L+D+ARVWW G D Sbjct: 69 LGAPSFLGGTDFLVADHWIEGMETYFTLITCTEIEKMRIATFLLKDEARVWWNGVERARD 128 Query: 881 LATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQ 702 + L+W F ++F KYF R +L EF+ L QG +SV +Y RF + F + + Sbjct: 129 VTALSWEGFVQLFREKYFPDTVREQLELEFIALVQGLMSVRDYKARFSQLYRFAREMDA- 187 Query: 701 PVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGQD 522 V + F GLR +R+ V R T EAV+ AL E++ + + EA+ R Sbjct: 188 -VALPRKFIRGLRHKLRNVVSSHRFATLAEAVESALAVEQE--EAMHEAEGLR------- 237 Query: 521 QQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQA---VEGTGFRVENKVRCSKCEKIH-- 357 GK K + G SG + A ++Q QA V R +RC +C+ + Sbjct: 238 -DVHGKGKAVAG-GSGSEGLHGASGKRQRTDQQALATVPAAPIRQVEPLRCYRCDGLGHI 295 Query: 356 AGQC-LTGTDACFMCKKSGHFAREC 285 A +C T AC+ C + GH AREC Sbjct: 296 ARECHKRKTQACYSCGQVGHLAREC 320