BLASTX nr result
ID: Zingiber24_contig00030072
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00030072
         (1070 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...    164   7e-38
ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205...    161   5e-37
ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217...    157   5e-36
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]     156   2e-35
ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, p...    155   3e-35
gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]     154   6e-35
ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229...    152   2e-34
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...    150   8e-34
ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [A...    145   3e-32
ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210...    143   1e-31
ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515...    136   2e-29
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...    136   2e-29
gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao]              135   2e-29
emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]    135   2e-29
gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma ...    135   4e-29
gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao]              134   8e-29
gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao]              133   1e-28
gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [...    133   1e-28
ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355...    132   2e-28
gb|EOX98886.1| Gag protease polyprotein [Theobroma cacao]              132   2e-28

>ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus]
          Length = 655

 Score = 164 bits (414), Expect = 7e-38
 Identities = 97/299 (32%), Positives = 151/299 (50%), Gaps = 15/299 (5%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ P F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW
Sbjct: 121 RDFRKYDPQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 180

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+
Sbjct: 181 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 240

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + S F +GLR IR VR + TT EA+ A+ +++ + NK
Sbjct: 241 FAPELVSNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDERQPRSFNK 300

Query: 330 RLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQK-TQAVEGTGFRVENKVRCSKC 154
           S G+K+ + + G P + +P + Q+ G G + + C+ C
Sbjct: 301 GSS--------SGQKRKVEQRTVG-VPQRNMRPGDSFRSFQQSSGGAGDTTQERPVCNTC 351

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVD 10
           K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + +
Sbjct: 352 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAE 410

>ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205308, partial [Cucumis sativus]
          Length = 768

 Score = 161 bits (407), Expect = 5e-37
 Identities = 95/299 (31%), Positives = 149/299 (49%), Gaps = 15/299 (5%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ P F G+ DP AE W+ S+E IF++M+ + R++CA F+LRD +WW
Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 179

Query: 684 ARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           + D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+
Sbjct: 180 TMCMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 239

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + F +GLR IR VR + TT EA+ A+ ++++ + +K
Sbjct: 240 FAPEFVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299

Query: 330 RLSYQGRDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 154
           S + + E + +P +N PF+ Q Q+ G G K C+ C
Sbjct: 300 GTSSSQKRKAEQ-RIVGVPQRNLRSGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 350

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVD 10
           K H G CL GT C+ CK+ GH A CPL R P +G +FA ++ + +
Sbjct: 351 GKRHLGHCLMGTRVCYKCKQEGHMADRCPLRSTGAGQSSEGARPPQRGTIFATSRSEAE 409

>ref|XP_004152998.1| PREDICTED: uncharacterized protein LOC101217872 [Cucumis sativus]
          Length = 461

 Score = 157 bits (398), Expect = 5e-36
 Identities = 94/299 (31%), Positives = 149/299 (49%), Gaps = 15/299 (5%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ P F G+ DP E W+ S+ETIF++M+ + R++CA F+LRD +WW
Sbjct: 71  RDFRKYDPQTFDGSLEDPTKVELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 130

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+
Sbjct: 131 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQDFDMLSR 190

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + F +GLR IR VR + TT EA+ A+ ++++ + +K
Sbjct: 191 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLAMDISIGKDEIRPMSFDK 250

Query: 330 RLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQK-TQAVEGTGFRVENKVRCSKC 154
           S G+K+ + + G P + +P + Q+ G G + + C C
Sbjct: 251 GSS--------SGQKRKVEQRTVG-VPQRNMRPGDSFRSFQQSSGGAGDTTQERPVCDTC 301

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPLL-----------REPTKGRVFAMTQEQVD 10
           K H G+CL GT C+ CK+ GH A CPL R P +G +FA + + +
Sbjct: 302 GKRHLGRCLMGTRVCYKCKQEGHMADRCPLRSTGAGSSSQGERPPQRGTIFATNRSEAE 360

>gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]
          Length = 871

 Score = 156 bits (394), Expect = 2e-35
 Identities = 99/298 (33%), Positives = 148/298 (49%), Gaps = 14/298 (4%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE
Sbjct: 333 RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 392

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+
Sbjct: 393 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 452

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 334
           F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A
Sbjct: 453 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 510

Query: 333 KRLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 154
           R S G+ ++ + +P +N F+ + Q+ G R K C+ C
Sbjct: 511 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 563

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVD 10
           K H G+CL GT CF C++ GH A CPL P +GRVFA + + +
Sbjct: 564 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRPTGIAQNQGAGAPLQGRVFATNRTEAE 621

>ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda]
 gi|548837601|gb|ERM98293.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda]
          Length = 366

 Score = 155 bits (391), Expect = 3e-35
 Identities = 88/272 (32%), Positives = 137/272 (50%), Gaps = 3/272 (1%)
 Frame = -3

Query: 873 QPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVW 694
           +P+Y++FR+ P F+G +DP+ AE W+R++E I ++M+L + D + CA +L+ DAR+W
Sbjct: 102 EPIYERFRKQHPPNFEGGSDPMEAEEWLRTVEGIVEYMRLGNGDSVACAASLLKKDARIW 161

Query: 693 WEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGR 514
           W+ + T D+A +TW DF +VF KY++ R+ EF LRQG +V EY R+F+R
Sbjct: 162 WDVIKQTRDVAAMTWADFVQVFNKKYYSEAIRSARVNEFTNLRQGKSTVTEYARQFDRLA 221

Query: 513 YFVPMITSQPVEELKHFTEGLRPAIRHDVRLS--RVTTFREAVDQALMSERDRNDMIKEA 340
           F + + FTEGL I D+ +S R TT+ E + A R ++
Sbjct: 222 KFATDLVPTEFLRIHRFTEGLDSRISRDIAMSGVRATTYAEKDNTARWEARKASN--GGG 279

Query: 339 QNKRLSYQGRDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRC 163
           NKR E K+ I N GK+P+ VE + C
Sbjct: 280 DNKR-KLPSNQHNEADKRNKIGSNNYKGKKPY---------------------VEYPL-C 316

Query: 162 SKCEKIHAGQCLTGTDACFMCKKSGHFARECP 67
           C + H G+C C+ C + GH+ ++CP
Sbjct: 317 PTCGRKHPGECRLKGKTCYKCGQPGHYKKDCP 348

>gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]
          Length = 429

 Score = 154 bits (389), Expect = 6e-35
 Identities = 98/298 (32%), Positives = 147/298 (49%), Gaps = 14/298 (4%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ PT F G+ DP A+ W+ SLETIF +M+ + +++CA+FML D WWE
Sbjct: 61  RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWET 120

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D++ +TW FKE FY K+F+ R +EFL L QGD++V +Y F+
Sbjct: 121 TERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSR 180

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQAL-MSERDRNDMIKEAQN 334
           F P + + F GLR I+ VR R T +A+ A+ +S ++R + K A
Sbjct: 181 FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTA-- 238

Query: 333 KRLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 154
           R S G+ ++ + +P +N F+ + Q+ G R K C+ C
Sbjct: 239 GRGSTSGQKRKAEQQPVPVPQRN-----FRPGGEFRSFQQKPFEAGEAAR--GKPLCTTC 291

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPL----------LREPTKGRVFAMTQEQVD 10
           K H G+CL GT CF C++ GH A CPL P +GR FA + + +
Sbjct: 292 GKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRAFATNRTEAE 349

>ref|XP_004173928.1| PREDICTED: uncharacterized protein LOC101229796, partial [Cucumis sativus]
          Length = 338

 Score = 152 bits (385), Expect = 2e-34
 Identities = 92/270 (34%), Positives = 137/270 (50%), Gaps = 4/270 (1%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ P F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +W
Sbjct: 61  RDFRKYDPQTFDGSLEDPTKAELWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWCRT 120

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D+ +TW FK FY K+F+ + R ++EFLEL+QG ++V EY + F+
Sbjct: 121 TMRMLGGDVMQITWDQFKNCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSR 180

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + F +GLR IR VR + TT EA+ A+ + D I+
Sbjct: 181 FAPELVGNEQARADRFVKGLRDEIRDFVRALKPTTQAEALRLAVDMGIGK-DEIRPRSFD 239

Query: 330 RLSYQGRDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 154
           + S G+ ++ + +P +N PF+ Q Q+ G G K C+ C
Sbjct: 240 KGSSSGQKRKAEQRTVGVPQRNLRPGDPFRSFQ--------QSSGGAGDTTREKPLCNTC 291

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPL 64
           K H G+CL GT C+ CK+ GH A CPL
Sbjct: 292 GKRHLGRCLMGTRVCYKCKQEGHMADRCPL 321

>ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis sativus]
          Length = 804

 Score = 150 bits (379), Expect = 8e-34
 Identities = 92/299 (30%), Positives = 147/299 (49%), Gaps = 15/299 (5%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ P F G+ DP AE W+ +ETIF +M+ + R++CA F+LRD +WW
Sbjct: 120 RDFRKYDPQTFDGSLEDPTKAELWLFYVETIFIYMRCPEEHRVQCAAFLLRDRGIIWWRT 179

Query: 684 A--RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG ++V EY + F+
Sbjct: 180 TIRMLGGDVRQITWNQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTVEEYDQEFDMLSC 239

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + + F + LR IR R + TT EA+ A+ ++++ + +K
Sbjct: 240 FAPKLVGNEQARAERFVKRLRDEIRGFARALKPTTQAEALRLAVDMSIGKDEIQARSSDK 299

Query: 330 RLSYQGRDQQEPGKKKTIPGQN-SGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKC 154
           S G+ ++ + +P +N PF+ Q Q+ G G K C+ C
Sbjct: 300 GTS-SGQKRKAEQRIVGVPQRNLRPGDPFRNFQ--------QSSGGAGDTTREKPLCNTC 350

Query: 153 EKIHAGQCLTGTDACFMCKKSGHFARECPL-----------LREPTKGRVFAMTQEQVD 10
           K H G+CL GT C+ CK+ GH A C L P +G +FA ++ + +
Sbjct: 351 GKRHLGRCLMGTRVCYKCKQEGHMADRCRLRSTGAGQSSQGAGPPQRGTIFATSRSEAE 409

>ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda]
 gi|548853397|gb|ERN11396.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda]
          Length = 383

 Score = 145 bits (366), Expect = 3e-32
 Identities = 94/259 (36%), Positives = 126/259 (48%), Gaps = 14/259 (5%)
 Frame = -3

Query: 744 DRIRCAIFMLRDDARVWWEGARLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 565
           DR++CA +MLR DAR+WWE T D+ T+ W DFK VF KY+ EF L
Sbjct: 8   DRVKCASYMLRKDARIWWEVVEQTKDVDTMNWDDFKRVFNEKYYNSAVLAAKVDEFTGLV 67

Query: 564 QGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRL-SR-VTTFREAV 391
           QG L+V EY ++F+R F P + F EGL+P + DV + SR ++ + V
Sbjct: 68  QGSLTVTEYAQKFDRLAKFAPDLVPTDRVRAHRFVEGLKPMVARDVEIVSRGQFSYAQVV 127

Query: 390 DQALMSERDRNDMIKEAQNKRLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKT 211
           + AL +ER N + KE +R S +G KK+ GQ+ QP R +
Sbjct: 128 EMALTAERSENKIWKENAARRESKKGGANSNDHKKR---GQDQSGQP--SQDKRYKSDND 182

Query: 210 QAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPL------LREPT 49
           Q G+ R N C KC K H G+C AC+ C K GH R CPL EP
Sbjct: 183 QRFNGSSGR--NIPECPKCTKRHLGEC--RAKACYKCGKEGHIKRNCPLWGQTGNRAEPK 238

Query: 48  K------GRVFAMTQEQVD 10
           K RVFA+TQ + +
Sbjct: 239 KDDKYVPARVFAITQAEAE 257

>ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210300 [Cucumis sativus]
          Length = 623

 Score = 143 bits (361), Expect = 1e-31
 Identities = 86/264 (32%), Positives = 135/264 (51%), Gaps = 3/264 (1%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTT-DPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEG 685
           + FR+ F G+ DP AE W+ S+ETIF++M+ + R++CA F+LRD +WW
Sbjct: 107 RDFRKYDLQTFDGSLEDPTKAEMWLSSVETIFNYMRCPEEHRVQCAAFLLRDRGIIWWRT 166

Query: 684 AR--LTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRY 511
           L D+ +TW FK+ FY K+F+ + R ++EFLEL+QG +++ EY + F+
Sbjct: 167 TMRMLGGDVRQITWDQFKDCFYTKFFSANLRDAKSQEFLELKQGHMTIEEYDQEFDMLSR 226

Query: 510 FVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNK 331
           F P + F +GLR IR VR + TT EA+ A+ + D I+ +
Sbjct: 227 FAPELVGNEQARADRFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGK-DEIRASSFD 285

Query: 330 RLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCE 151
           + S G+ ++ + IP +N + P Q++ G R K C+ C
Sbjct: 286 KGSSSGQKRKVEQRTVGIPQRN-----LRLGDPFCSFQQSSGEAGDTTR--EKPVCNTCG 338

Query: 150 KIHAGQCLTGTDACFMCKKSGHFA 79
           K H G+CL GT C+ C++ GH A
Sbjct: 339 KHHLGRCLMGTRVCYKCRQEGHMA 362

>ref|XP_004489079.1| PREDICTED: uncharacterized protein LOC101515713 [Cicer arietinum]
          Length = 943

 Score = 136 bits (342), Expect = 2e-29
 Identities = 90/336 (26%), Positives = 156/336 (46%), Gaps = 17/336 (5%)
 Frame = -3

Query: 972 LLHEQNRIHGEQIQQILQAREQGSTPRRSAP-STQPVYKQFRELGPTEFKGTTDPIAAEG 796
           L+ +Q + + + Q+ P P ++ ++ F +L P F G+ P+ A+
Sbjct: 10  LMMQQQAVTTSIMNHLAQSVGPAHPPPPPPPEASNRLFYDFHKLKPPAFLGSLVPLEAQS 69

Query: 795 WIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGARLTVDLA--TLTWTDFKEVFYG 622
           W+ + IF ++ T+ D++ A ML+ +A WW+GA+ + A + W +F VF
Sbjct: 70  WLDEMTKIFLVVRCTEEDKVAFATHMLQGEAENWWKGAKAYMISAGTPMNWENFCTVFLD 129

Query: 621 KYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVE--ELKHFTEGLR 448
           KY + R + EF L+QGD+SVA+YV +FE F P + ++ F GL
Sbjct: 130 KYIPMSIRKQKEFEFTHLQQGDMSVADYVAKFEELARFCAQAEYAPNDRWKINQFEWGLN 189

Query: 447 PAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGR----DQQEPGKKKT 280
           P I+ ++ +T++ V ++ + E + +N++L +Q R + K KT
Sbjct: 190 PEIKSNLAQLEITSYATLVHKSYIVEESLRSL---KENRQLKWQQRRDAPKSNQQLKVKT 246

Query: 279 IPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMC 100
           P N GKQP P+ + + C KC + H G+CL G + CF C
Sbjct: 247 SP--NKGKQPQNSVVPQARGPR---------------ECPKCGRSHPGECLYGKNICFWC 289

Query: 99  KKSGHFARECPLLR--------EPTKGRVFAMTQEQ 16
           K GH +++CP + P GRV+ + ++
Sbjct: 290 KTPGHLSQDCPQRKMKGLANSNGPLTGRVYTLNAKK 325

>gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa]
 gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group]
 gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group]
          Length = 1230

 Score = 136 bits (342), Expect = 2e-29
 Identities = 92/282 (32%), Positives = 138/282 (48%), Gaps = 3/282 (1%)
 Frame = -3

Query: 858 QFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR 679
           +F++L P F GT +P+ AE WI ++E F+ M TD ++I A +ML+ A WW+ +
Sbjct: 73  EFQKLKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHK 132

Query: 678 LTV-DLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + +TW FKE FY KYF + +EFLEL+QG+ SVAEY F R F P
Sbjct: 133 KSYSERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAP 192

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLS 322
           + + F GLR ++ V +T FRE V +A + E+ ++
Sbjct: 193 EFVQTDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLEKGYHE----------- 241

Query: 321 YQGRDQQEPGKK-KTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKI 145
           Q + +P KK KT QN G+ F+ Q +K+ +G +C C+
Sbjct: 242 -QRIEHGQPQKKFKTNNPQNQGR--FRGNYSGQMQRKSSENQGR--------KCPICQGS 290

Query: 144 HAGQ-CLTGTDACFMCKKSGHFARECPLLREPTKGRVFAMTQ 22
           H C CF C ++GH +CPLL++ K RV + TQ
Sbjct: 291 HVPSICPNCWGRCFECGEAGHTRYQCPLLQK-GKNRVSSTTQ 331

>gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao]
          Length = 404

 Score = 135 bits (341), Expect = 2e-29
 Identities = 86/281 (30%), Positives = 124/281 (44%), Gaps = 10/281 (3%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 682
           K+ R+LG F G D A+ WI + M L D ++ A +L AR WW
Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMGLDDDMKLMVATRLLEKRARTWWNSV 173

Query: 681 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP
Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 334
           + ++ +F EGLR IR + + +E V AL +E+ +R K A+
Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVIGREPHKEVVQMALRAEKLATENRRIRTKFAKR 292

Query: 333 KRLSYQGRDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 172
           + L + GK G S + PF +Q R A+ G+G +
Sbjct: 293 RNLGMSSSQPVKRGKDSATSGSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSLGGF 352

Query: 171 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 49
           RC C H+G C G CF C ++GH CP L T
Sbjct: 353 DRCRNCGNYHSGLC-RGPTRCFQCGQTGHIRSNCPQLGRAT 392

>emb|CAN66987.1| hypothetical protein VITISV_044466 [Vitis vinifera]
          Length = 360

 Score = 135 bits (341), Expect = 2e-29
 Identities = 96/320 (30%), Positives = 141/320 (44%), Gaps = 2/320 (0%)
 Frame = -3

Query: 993 FLEGLTALLHEQNRIHGEQIQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTD 814
           +L L L+ Q R G +Q Q S+ R S+ + F++LGP F G TD
Sbjct: 69  YLGTLAGLVERQARAVGTNVQG------QSSSSRGSS------FDDFKKLGPPYFSGATD 116

Query: 813 PIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGAR-LTVDLATLTWTDFK 637
           P AE WI +E F + ++ + A FML + WW R L D +TW F+
Sbjct: 117 PTEAEAWILKMEKFFGVIDCSEEQKASYAAFMLDKETDHWWRMTRRLLEDQGPITWRQFR 176

Query: 636 EVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELKHFTE 457
           E FY KYF R + EF+ L QGD++VA+Y +F F P + + E+ F +
Sbjct: 177 EAFYKKYFPDSVRRQKVGEFIRLEQGDMTVAQYEAKFTELSRFSPQLIATEEEKALKFQD 236

Query: 456 GLRPAIRHDVRLSRVTTFREAVDQALMSERDRNDMIKEAQNKRLSYQGRDQQEPGKKKTI 277
           L+P +++ + + + E +Q +R+R+D Q +R S GR+Q
Sbjct: 237 XLKPYLKNKXSILXLGXYSEYREQ--QRKRNRSDGAHGNQXQRRSTSGRNQ--------- 285

Query: 276 PGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVENKVRCSKCEKIHAGQ-CLTGTDACFMC 100
           N GK Q ++G C C K H G+ C T ACF C
Sbjct: 286 ---NKGKA-------------AQNLDGA---------CPTCGKKHGGRPCYRETGACFGC 320

Query: 99  KKSGHFARECPLLREPTKGR 40
           K GH R+CP R+ G+
Sbjct: 321 GKQGHLIRDCPENRKFITGK 340

>gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao]
          Length = 665

 Score = 135 bits (339), Expect = 4e-29
 Identities = 85/281 (30%), Positives = 125/281 (44%), Gaps = 10/281 (3%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 682
           K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW
Sbjct: 50  KEARQLGCVSFTGELDATVAKDWINQVSKTLSDMRLDDDMKLMVATRLLEKRARTWWNSV 109

Query: 681 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP
Sbjct: 110 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 168

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSER----DRNDMIKEAQN 334
           + ++ +F EGLR IR + ++ +E V AL +E+ +R + A+
Sbjct: 169 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLAIENRRIRTEFAKR 228

Query: 333 KRLSYQGRDQQEPGKKKTIPGQ------NSGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 172
           + + GK I G S + PF +Q R A+ G+G
Sbjct: 229 RNPGMSSSQPVKRGKDSAISGSTTSVSVTSPRPPFPPSQQRPSRFSRSAMTGSGRSFGGS 288

Query: 171 VRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 49
           RC C H+G C T CF C ++GH CP L T
Sbjct: 289 DRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 328

>gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao]
          Length = 426

 Score = 134 bits (336), Expect = 8e-29
 Identities = 88/283 (31%), Positives = 128/283 (45%), Gaps = 12/283 (4%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 682
           K+ R+LG F G D A+ WI + M+L D ++ A +L AR WW
Sbjct: 114 KEARQLGCVSFTGELDATVAKDWINQVSETLSDMKLNDDMKLMVATRLLEKRARTWWNSV 173

Query: 681 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + TW+DF F G+YFT ++ REFL L+QG+L+V EY RF +VP
Sbjct: 174 K-SRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELMLYVP 232

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSE-------RDRNDMIKE 343
           + ++ +F EGLR IR + ++ +E V AL +E R R + K
Sbjct: 233 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLATENRRIRTEFAKR 292

Query: 342 AQNKRLSY-----QGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVE 178
           +N +SY +G+D + T S + PF +Q R A+ G+G
Sbjct: 293 -RNPGMSYSQSVKRGKD-SAISRSTTSISVTSPRPPFPPSQQRPSRFSRSAMTGSGKSFG 350

Query: 177 NKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 49
           RC C H+G C T CF C ++GH CP L T
Sbjct: 351 GSDRCRNCGNYHSGLCREPT-RCFQCGQTGHIRSNCPRLGRAT 392

>gb|EOY19679.1| Gag protease polyprotein [Theobroma cacao]
          Length = 474

 Score = 133 bits (335), Expect = 1e-28
 Identities = 87/282 (30%), Positives = 131/282 (46%), Gaps = 11/282 (3%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 682
           K+ R+LG T F G D AA+ WI + F M+L D ++ A +L AR WW
Sbjct: 127 KEARQLGCTSFIGDLDATAAKDWITQVTETFVDMKLDDDMKLMVATRLLEKRARTWWSSV 186

Query: 681 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + + +LTW DF + F G+Y+T ++ REFL L+QG+L++ EY RF +VP
Sbjct: 187 K-SRSITSLTWIDFLQEFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVP 245

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRND---MIKEAQNK 331
           + ++ +F EGLR IR + ++ +E V AL +E+ N+ M E +
Sbjct: 246 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKR 305

Query: 330 RLSYQGRDQQEPGKKKTIPGQN-------SGKQPFKQAQPRQQIQKTQAVEGTGFRVENK 172
           R Q K T +N S + P Q Q R + T +
Sbjct: 306 RNPNVSSSQLPKRGKDTFASENTVSVPVISPRPPLSQLQQRPPRFNRSGMSSTSEKSFGG 365

Query: 171 V-RCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 49
           + +C KC + H G+C CF C +SGH +CP L T
Sbjct: 366 LNKCEKCGRYHVGEC--WGIRCFHCDQSGHIRSDCPQLGRAT 405

>gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [Prunus persica]
          Length = 590

 Score = 133 bits (334), Expect = 1e-28
 Identities = 96/324 (29%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
 Frame = -3

Query: 918 AREQGSTPRRSAPSTQPVYKQFRELGPTE-FKGTTDPIAAEGWIRSLETIFDFMQLTDAD 742
           A++ P+R ++ ++ +G T F GT DP AEGWI +E I + M +
Sbjct: 13  AQDSARIPKRKLGRVLSIW--YQSIGFTSYFDGTGDPAVAEGWIERMERIMEVMAVPQDR 70

Query: 741 RIRCAIFMLRDDARVWWEGA-RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELR 565
           R+ A F L +AR WWE R D + ++W F+ VF +Y+ + +EFL+L
Sbjct: 71  RVLLASFFLIGNARHWWESIKRRYPDPSVISWPVFRAVFNSQYYPQAYQNLKMQEFLQLD 130

Query: 564 QGDLSVAEYVRRF-ERGRYFVPMITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVD 388
           QG ++V EY ++F E +Y +P++ + ++ + FT+GL+ +IR V R+T F + V
Sbjct: 131 QGLMTVLEYEKKFNELSKYCIPLVEDES-KKCQLFTKGLKASIRDIVISQRLTNFGDLVM 189

Query: 387 QALMSERDRNDMIKEAQ---NKRLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQ 217
           A + E + M+ AQ +RL G Q K+ + +S + F+ +P
Sbjct: 190 SASLVE--SSQMMVRAQGEPRRRLFDLGGPSQGSSKRGSYSAGSSRGRSFRGFRPGISSS 247

Query: 216 -------------KTQAVEGTGFRVENKV------RCSKCEKIHAGQCLTGTDACFMCKK 94
           AV G+G + + V +C+ C + H G C GT CF C +
Sbjct: 248 GGSNRSGSFGSRLVGNAVRGSGRQSPSAVGGRRNPQCTVCGRYHTGTCRQGTTGCFHCGQ 307

Query: 93  SGHFARECPLLREPTKGRVFAMTQ 22
           GHF RECP+L + + V T+
Sbjct: 308 PGHFLRECPVLLQGGEATVTMPTE 331

>ref|XP_003605752.1| Pol polyprotein [Medicago truncatula]
 gi|355506807|gb|AES87949.1| Pol polyprotein [Medicago truncatula]
          Length = 745

 Score = 132 bits (333), Expect = 2e-28
 Identities = 91/324 (28%), Positives = 145/324 (44%), Gaps = 15/324 (4%)
 Frame = -3

Query: 936 IQQILQAREQGSTPRRSAPSTQPVYKQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQ 757
           +Q + QA Q A + + + F + P FKG DP A+ W++ +E IF MQ
Sbjct: 13  LQAVAQAVGQQPNVNAGANAEARMLETFMKKNPPTFKGRCDPDGAQTWLKEIERIFRVMQ 72

Query: 756 LTDADRIRCAIFMLRDDARVWWEGARLTV--DLATLTWTDFKEVFYGKYFTVDNRTRLAR 583
           T+ ++R L ++A WW T+ + A +TW F+ F +YF D R +
Sbjct: 73  CTEDQKVRFGTHQLAEEADDWWVALLPTLGQEGAVVTWAVFRREFLRRYFPEDVRGKKEI 132

Query: 582 EFLELRQGDLSVAEYVRRFERGRYFVPMITSQPVEELK--HFTEGLRPAIRHDVRLSRVT 409
           EFLEL+QG++SV EY +F F P T++ E + F GLRP I+ + ++
Sbjct: 133 EFLELKQGNMSVTEYAAKFVELSKFYPHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLR 192

Query: 408 TFREAVDQALMSERDRNDMIKEAQNKRLSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPR 229
           F++ V+ + E D K ++ G+ QQ K + P ++ +P+
Sbjct: 193 VFQDLVNSCRIYEEDTKAHYKVVNERK----GKGQQSRPKPYSAPADKGKQKMVDVRRPK 248

Query: 228 QQI----------QKTQAVEGTGFRVENKVRCSKCEKIHAGQCLTGTDACFMCKKSGHFA 79
           ++ +K ++ VRC K + A C CF C GH +
Sbjct: 249 KKDAAEIVYFNCGEKGHKSNACPEEIKKCVRCGKKGHVVA-DCNRTDIVCFNCNGEGHIS 307

Query: 78  RECPL-LREPTKGRVFAMTQEQVD 10
           +C R PT GRVFA+T Q +
Sbjct: 308 SQCTQPKRAPTTGRVFALTGTQTE 331

>gb|EOX98886.1| Gag protease polyprotein [Theobroma cacao]
          Length = 467

 Score = 132 bits (332), Expect = 2e-28
 Identities = 87/283 (30%), Positives = 133/283 (46%), Gaps = 12/283 (4%)
 Frame = -3

Query: 861 KQFRELGPTEFKGTTDPIAAEGWIRSLETIFDFMQLTDADRIRCAIFMLRDDARVWWEGA 682
           K+ R+LG T F G D AA+ WI + F M+L D ++ A +L AR WW
Sbjct: 121 KEARQLGCTSFVGDLDATAAKDWITQVTETFVDMKLDDDMKLMVATRLLEKRARTWWSSV 180

Query: 681 RLTVDLATLTWTDFKEVFYGKYFTVDNRTRLAREFLELRQGDLSVAEYVRRFERGRYFVP 502
           + + + LTW DF + F G+Y+T ++ REFL L+QG+L++ EY RF +VP
Sbjct: 181 K-SRSITPLTWIDFLQEFDGQYYTYFHQKEKKREFLSLQQGNLTIEEYEARFNELMSYVP 239

Query: 501 MITSQPVEELKHFTEGLRPAIRHDVRLSRVTTFREAVDQALMSERDRND---MIKEAQNK 331
           + ++ +F EGLR IR + ++ +E V AL +E+ N+ M E +
Sbjct: 240 DLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALRAEKLTNENRRMRAEFAKR 299

Query: 330 R--------LSYQGRDQQEPGKKKTIPGQNSGKQPFKQAQPRQQIQKTQAVEGTGFRVEN 175
           R L +G+D ++P S + PF Q Q R + T +
Sbjct: 300 RNPNVSSIQLPKRGKDTSASESTVSVP-VISPRPPFSQLQQRPPRFSRSGMSSTSEKSFG 358

Query: 174 KV-RCSKCEKIHAGQCLTGTDACFMCKKSGHFARECPLLREPT 49
           + +C KC + H G+C CF C + GH +CP L T
Sbjct: 359 GLNKCEKCGRYHVGEC--WGIRCFHCDQPGHIRSDCPQLGRAT 399