BLASTX nr result
ID: Akebia25_contig00005535
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00005535 (2444 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002263775.2| PREDICTED: uncharacterized protein LOC100247... 343 2e-91 emb|CBI38444.3| unnamed protein product [Vitis vinifera] 343 2e-91 emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera] 342 6e-91 ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222... 338 9e-90 ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 337 2e-89 ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu... 334 1e-88 ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A... 332 5e-88 ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627... 331 1e-87 ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]... 329 3e-87 ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805... 328 9e-87 ref|XP_004300713.1| PREDICTED: uncharacterized protein LOC101309... 324 1e-85 gb|AAR96007.1| transposase-like protein [Musa acuminata] 323 2e-85 ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618... 311 9e-82 gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis] 306 4e-80 ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobr... 304 1e-79 ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobr... 304 1e-79 ref|XP_002513602.1| protein dimerization, putative [Ricinus comm... 304 1e-79 ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215... 302 4e-79 ref|XP_006573373.1| PREDICTED: uncharacterized protein LOC102669... 302 5e-79 ref|XP_003553157.1| PREDICTED: uncharacterized protein LOC100793... 301 1e-78 >ref|XP_002263775.2| PREDICTED: uncharacterized protein LOC100247282 [Vitis vinifera] Length = 672 Score = 343 bits (881), Expect = 2e-91 Identities = 194/620 (31%), Positives = 323/620 (52%), Gaps = 20/620 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 475 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C VP DV Q + Sbjct: 9 WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 634 + K TP K + + + A P S+ H +T ++ Sbjct: 69 STPKKQKTPKKTKVDLAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 124 Query: 635 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 793 K+ +D D+ VA F N++ A +S + MV AIAE G Y P++ L Sbjct: 125 QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 184 Query: 794 CTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 973 + L++ + DV + ++D W TGC+++ D W+D + + PKG +FLK+ Sbjct: 185 RSTLMEKVKCDVNDCCKKLRDGWRATGCTILCDCWSDGRTKSLVVFSVTCPKGTLFLKSV 244 Query: 974 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1153 + S +L ++L SV+ E+G ENVVQ++ ++A+ Y L+M +Y ++ C S Sbjct: 245 DISGHADDAHYLYELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 304 Query: 1154 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVS 1333 + +LEDI K+ EW+ +V ++A+ I Y+Y + L +MR +T +E+ RP +RFV+ Sbjct: 305 FCIDKMLEDISKQ-EWVSTVLEEAKTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 363 Query: 1334 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 1513 +F+ L+SI+ E+NL+LM +W +R P ++ + ++ FW E +SV EP Sbjct: 364 NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDSQNVKSLLYLERFWKSAHEAVSVSEP 423 Query: 1514 LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 1693 L+ VLR+VDG+ GY+YE +ER + ++ + NS KY+ +W++ + + N + +H Sbjct: 424 LVKVLRIVDGDMPAMGYIYEGIERAKIAIKGYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 483 Query: 1694 AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 1870 AAAAFLNPS+ Y K + +R+G + M + + + + +Y L Sbjct: 484 AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 542 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 +I+ WW G E+P L++ AIRILSQPCSS CG NWS+FE TKK N+ Sbjct: 543 EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 602 Query: 2051 LSPDILEDLVYTRMNSKMMA 2110 + + L DLV N + A Sbjct: 603 MELEKLNDLVLVHCNLHLQA 622 >emb|CBI38444.3| unnamed protein product [Vitis vinifera] Length = 712 Score = 343 bits (881), Expect = 2e-91 Identities = 194/620 (31%), Positives = 323/620 (52%), Gaps = 20/620 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 475 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C VP DV Q + Sbjct: 49 WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 108 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 634 + K TP K + + + A P S+ H +T ++ Sbjct: 109 STPKKQKTPKKTKVDLAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 164 Query: 635 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 793 K+ +D D+ VA F N++ A +S + MV AIAE G Y P++ L Sbjct: 165 QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 224 Query: 794 CTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 973 + L++ + DV + ++D W TGC+++ D W+D + + PKG +FLK+ Sbjct: 225 RSTLMEKVKCDVNDCCKKLRDGWRATGCTILCDCWSDGRTKSLVVFSVTCPKGTLFLKSV 284 Query: 974 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1153 + S +L ++L SV+ E+G ENVVQ++ ++A+ Y L+M +Y ++ C S Sbjct: 285 DISGHADDAHYLYELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 344 Query: 1154 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVS 1333 + +LEDI K+ EW+ +V ++A+ I Y+Y + L +MR +T +E+ RP +RFV+ Sbjct: 345 FCIDKMLEDISKQ-EWVSTVLEEAKTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 403 Query: 1334 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 1513 +F+ L+SI+ E+NL+LM +W +R P ++ + ++ FW E +SV EP Sbjct: 404 NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDSQNVKSLLYLERFWKSAHEAVSVSEP 463 Query: 1514 LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 1693 L+ VLR+VDG+ GY+YE +ER + ++ + NS KY+ +W++ + + N + +H Sbjct: 464 LVKVLRIVDGDMPAMGYIYEGIERAKIAIKGYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 523 Query: 1694 AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 1870 AAAAFLNPS+ Y K + +R+G + M + + + + +Y L Sbjct: 524 AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 582 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 +I+ WW G E+P L++ AIRILSQPCSS CG NWS+FE TKK N+ Sbjct: 583 EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 642 Query: 2051 LSPDILEDLVYTRMNSKMMA 2110 + + L DLV N + A Sbjct: 643 MELEKLNDLVLVHCNLHLQA 662 >emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera] Length = 926 Score = 342 bits (876), Expect = 6e-91 Identities = 194/620 (31%), Positives = 320/620 (51%), Gaps = 20/620 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 475 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C VP DV Q + Sbjct: 263 WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 322 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 634 + K TP K + + + A P S+ H +T ++ Sbjct: 323 STPKKQKTPKKTKVDXAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 378 Query: 635 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 793 K+ +D D+ VA F N++ A +S + MV AIAE G Y P++ L Sbjct: 379 QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 438 Query: 794 CTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 973 + L++ + DV + ++D W TGC+++ D W+D + PKG +FLK+ Sbjct: 439 RSTLMEKVKCDVNDCCKKLRDGWRXTGCTILCDCWSDGRTKSLXVFSVTCPKGTLFLKSV 498 Query: 974 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1153 + S +L ++L SV+ E+G ENVVQ++ ++A+ Y L+M +Y ++ C S Sbjct: 499 DISGHADDAHYLFELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 558 Query: 1154 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVS 1333 + +LEDI K+ EW+ +V ++A I Y+Y + L +MR +T +E+ RP +RFV+ Sbjct: 559 FCIDKMLEDISKQ-EWVSTVLEEANTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 617 Query: 1334 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 1513 +F+ L+SI+ E+NL+LM +W +R P + + ++ FW E +SV EP Sbjct: 618 NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDAQNVKSLLYLERFWKSAHEAVSVSEP 677 Query: 1514 LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 1693 L+ VLR+VDG+ GY+YE +ER + ++ + NS KY+ +W++ + + N + +H Sbjct: 678 LVKVLRIVDGDMPAMGYIYEGIERAKIAIKXYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 737 Query: 1694 AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 1870 AAAAFLNPS+ Y K + +R+G + M + + + + +Y L Sbjct: 738 AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 796 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 +I+ WW G E+P L++ AIRILSQPCSS CG NWS+FE TKK N+ Sbjct: 797 EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 856 Query: 2051 LSPDILEDLVYTRMNSKMMA 2110 + + L DLV N + A Sbjct: 857 MELEKLNDLVLVHCNLHLQA 876 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 338 bits (866), Expect = 9e-90 Identities = 197/613 (32%), Positives = 326/613 (53%), Gaps = 18/613 (2%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 475 WE+ + + + C +C++++ GG+ R+K HL++ + +DI C VP DV+ Sbjct: 9 WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGAD--------SIVSPILSCPDSSTVLHQTTLATI 631 + K P K + +T+ +++ + S +CP + L + I Sbjct: 69 STPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPI 128 Query: 632 YN--KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKL 805 + K+ KD D+ VA F N+I A +S + MV AIAE+G Y P++ L + L Sbjct: 129 DDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTL 188 Query: 806 VQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSD 985 + + D+ +D W TGC+++ D+W+D + F+ + KG +FLK+ + S Sbjct: 189 LDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISG 248 Query: 986 KGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQ 1165 +L D+L ++I E+G ENVVQI+ + + Y L+M +Y ++ CVS+ V Sbjct: 249 HEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVN 308 Query: 1166 LLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMM 1345 +LEDI K +EW+ +V ++A++I Y+Y + + L MR +T KE+ RP +RFV++F+ Sbjct: 309 QMLEDISK-IEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLS 367 Query: 1346 LQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLITV 1525 L+SI+ +E+NL+ M EW +R P + I ++ FW E I++ EPLI + Sbjct: 368 LRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRI 427 Query: 1526 LRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAA 1705 LR+VDG+ GY++E +ER + E++ + N KY+ +WE + + N + +H AAA Sbjct: 428 LRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAA 487 Query: 1706 FLNPSLMYDGKIKYEQPDIRDGMNYVVESM--VGPNEMDDFAAQLLLYNGKSPKLFNTLS 1879 FLNPS+ Y+ K + IR+G + M ++M+ NG+ L + Sbjct: 488 FLNPSVFYNPNFKIDL-RIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQG-ALGTDFA 545 Query: 1880 ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTC-GRNWSAFEVAKTKKINKLS 2056 IL P WW G E+P L++ A+RILSQPCSS C G NWS FE +KK ++ Sbjct: 546 ILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAE 605 Query: 2057 PDILEDLVYTRMN 2095 + L DLV+ + N Sbjct: 606 QEKLTDLVFVQCN 618 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 337 bits (864), Expect = 2e-89 Identities = 197/613 (32%), Positives = 325/613 (53%), Gaps = 18/613 (2%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 475 WE+ + + + C +C++++ GG+ R+K HL++ + +DI C VP DV+ Sbjct: 9 WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGAD--------SIVSPILSCPDSSTVLHQTTLATI 631 + K P K + +T+ +++ + S +CP + L + I Sbjct: 69 STPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPI 128 Query: 632 YN--KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKL 805 + K+ KD D+ VA F N+I A +S + MV AIAE+G Y P++ L + L Sbjct: 129 DDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTL 188 Query: 806 VQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSD 985 + + D+ +D W TGC+++ D+W+D + F+ + KG +FLK+ + S Sbjct: 189 LDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISG 248 Query: 986 KGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQ 1165 +L D+L ++I E+G ENVVQI+ + + Y L+M +Y ++ CVS+ V Sbjct: 249 HEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVN 308 Query: 1166 LLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMM 1345 +LEDI K +EW+ +V ++A++I Y+Y + + L MR +T KE+ RP +RFV++F+ Sbjct: 309 QMLEDISK-IEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLS 367 Query: 1346 LQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLITV 1525 L+SI+ +E+NL+ M EW +R P + I ++ FW E I++ EPLI + Sbjct: 368 LRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRI 427 Query: 1526 LRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAA 1705 LR+VDG+ GY++E +ER + E++ + N KY+ +WE + + N + +H AAA Sbjct: 428 LRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAA 487 Query: 1706 FLNPSLMYDGKIKYEQPDIRDGMNYVVESM--VGPNEMDDFAAQLLLYNGKSPKLFNTLS 1879 FLNPS Y+ K + IR+G + M ++M+ NG+ L + Sbjct: 488 FLNPSXFYNPNFKIDL-RIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQG-ALGTDFA 545 Query: 1880 ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTC-GRNWSAFEVAKTKKINKLS 2056 IL P WW G E+P L++ A+RILSQPCSS C G NWS FE +KK ++ Sbjct: 546 ILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAE 605 Query: 2057 PDILEDLVYTRMN 2095 + L DLV+ + N Sbjct: 606 QEKLTDLVFVQCN 618 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 334 bits (856), Expect = 1e-88 Identities = 191/624 (30%), Positives = 320/624 (51%), Gaps = 22/624 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 481 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C VP+DV+ + + Sbjct: 9 WEHCVLVDATRQKVRCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVR----NHI 64 Query: 482 CGKDLTPYKKRKTATCSTDNERNGADSIVSPI--------------LSCPD----SSTVL 607 TP K++ TD NG D+ S +CP Sbjct: 65 QSILSTPKKQKTPKKQKTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPT 124 Query: 608 HQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHS 787 Q + N+K + D+ +A F N+IA A +S + M A+AE G Y P+ Sbjct: 125 SQPVVDDAQNEKQNN-ADKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFE 183 Query: 788 TLCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLK 967 L + L++ + D+ ++ +D W TGC+++ D W+D + I PKG +FLK Sbjct: 184 KLRSSLLEKVKGDIHDWYRKYRDDWKETGCTILCDGWSDGRTKSVIVFSVTCPKGTLFLK 243 Query: 968 NFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQC 1147 + + S +L ++L S++ E+G ENV+Q++ ++ + Y L+M +Y ++ C Sbjct: 244 SVDISGHENDANYLFELLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPC 303 Query: 1148 VSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRF 1327 S+ V +LEDI K+ EW+ +V ++A I Y+Y + L +MR +T +E+ RP +R+ Sbjct: 304 ASYCVNKMLEDISKQ-EWVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRY 362 Query: 1328 VSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVL 1507 VS+++ L++I+ E+NL+ M EW +R P + + + FW E +S+ Sbjct: 363 VSNYLSLRAIVIQEDNLKHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSIS 422 Query: 1508 EPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHK 1687 EPLI +LR+VDG+ GY+YE +ER + ++ + KY+ +WE+ + + N + Sbjct: 423 EPLIKILRIVDGDMPAMGYIYEVLERAKVSIKAYYKGIEDKYMPIWEIIDRRWNIQLHSP 482 Query: 1688 IHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKL 1864 +HAAAAFLNPS+ Y+ K + +R+G + M + + + + +Y L Sbjct: 483 LHAAAAFLNPSIFYNQNFKIDL-RMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGAL 541 Query: 1865 FNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKI 2044 +I+ P WW G E+P L++VAIR+LSQPCSS C NWS FE TKK Sbjct: 542 GTDFAIMGRTLNSPGDWWAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKR 601 Query: 2045 NKLSPDILEDLVYTRMNSKMMAYY 2116 NK + L DLV+ N + A Y Sbjct: 602 NKAELEKLNDLVFVHCNLWLQAIY 625 >ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] gi|548861623|gb|ERN18994.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] Length = 863 Score = 332 bits (851), Expect = 5e-88 Identities = 195/620 (31%), Positives = 326/620 (52%), Gaps = 20/620 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 481 WE+ + + + C +C++++ GG+ R+K HL++ + +DI C+ VP DV+ L + Sbjct: 195 WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVL 254 Query: 482 CG--KDLTPYKKRKTATCSTDNERNGADS-------------IVSPILSCPDSSTVLHQT 616 K TP K + T ++ + + A P L P S Q Sbjct: 255 NTPRKQKTPKKPKIEQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPS-GQP 313 Query: 617 TLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLC 796 L +K ++ D+ +A F N+I + +S + MV AIA+ G Y P++ L Sbjct: 314 ILDDSQRQKQEE-ADKKIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLR 372 Query: 797 TKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFE 976 T L++ + ++ + +D W +GC++M D WTD + I P+G +FLK+ + Sbjct: 373 TTLLEKVKVEITDSYKTYRDEWRESGCTIMSDGWTDGRSKFLIVFSVACPRGTLFLKSVD 432 Query: 977 RSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSH 1156 S +L ++L SV+ E+G E +VQ++ ++A+ Y L+ +YP ++ C S+ Sbjct: 433 ASAHVDDAHYLFELLESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASY 492 Query: 1157 GVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSH 1336 + +LEDI K+ EW+ +V ++A+ I Y+Y ++ L LM+ +T KE+ R +RFV+H Sbjct: 493 CIDRMLEDISKQ-EWVSTVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTH 551 Query: 1337 FMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPL 1516 F+ L+SI+ E+NL+ M EW ++ + + +I FW +EV+++ EPL Sbjct: 552 FLSLRSIVIHEDNLKHMFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPL 611 Query: 1517 ITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHA 1696 I VLR+VDG+ GY+YE +ER + ++ + KY+ +WE+ + + N + +HA Sbjct: 612 IKVLRIVDGDMPAMGYIYEGIERAKVAIKAYYKGSEDKYMPIWEIIDRRWNLQLHSPLHA 671 Query: 1697 AAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMD--DFAAQLLLYNGKSPKLFN 1870 AAAFLNP++ Y+ K + IR+G + + MV N+ D + + +Y L N Sbjct: 672 AAAFLNPAIFYNPSFKIDSK-IRNGFHEAMMKMV-LNDKDKMELTKETPMYINAHGALGN 729 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 +++ P WW G EVP+L++ AIRILSQPCSS C NW FE TKK N+ Sbjct: 730 DFAMMARTLNTPGDWWAGYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNR 789 Query: 2051 LSPDILEDLVYTRMNSKMMA 2110 L + DLVY N + A Sbjct: 790 LEQEKFNDLVYVHCNLRFQA 809 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 331 bits (848), Expect = 1e-87 Identities = 186/627 (29%), Positives = 321/627 (51%), Gaps = 17/627 (2%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 481 WE+ + + + C +C++++ GG+ R+K HL++ + +DI C+ VP+DV+ + Sbjct: 9 WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRDHIQRIL 68 Query: 482 CGKDLTPYKKRKTATCSTDNERNGADSIVSPI-----------LSCPDSSTVLHQTTLAT 628 KR +T N + + S I SCP ++ Sbjct: 69 SIPKKQKNPKRPKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQP 128 Query: 629 IYN---KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCT 799 I + K+ +D D+ +A F N+I A +S + MV AIAE G Y P++ L + Sbjct: 129 IVDDTQKQRQDDTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRS 188 Query: 800 KLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFER 979 L++ + D+++ ++ W TGC+++ D W+D + + PKG +FLK+ + Sbjct: 189 TLLEKVKVDIDDCCKKYREEWKETGCTILCDNWSDERTKSLVVFSVACPKGTLFLKSVDV 248 Query: 980 SDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHG 1159 S + FL ++L SV+ ++G ENV+Q++ ++A+ Y L+M +Y ++ C ++ Sbjct: 249 SGHEEDATFLFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYC 308 Query: 1160 VQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHF 1339 + +LEDI K+ EW+ V ++A+ I Y Y + L +MR T +E+ RP +RFV+++ Sbjct: 309 IDKMLEDISKQ-EWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANY 367 Query: 1340 MMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLI 1519 + L+SI+ EENL+ M EW +R P + I ++ FW EV+SV EPL+ Sbjct: 368 LSLRSIVIHEENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLV 427 Query: 1520 TVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAA 1699 +LR+VDG+ GY+YE +ER + ++ + KY+ +W++ + + N + +HAA Sbjct: 428 KILRIVDGDMPAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDIIDRRWNMQLHSPLHAA 487 Query: 1700 AAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLS 1879 AAFLNPS+ Y+ K + +++ + + + +Y L + Sbjct: 488 AAFLNPSIFYNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFA 547 Query: 1880 ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSP 2059 +L K P WW G E+P L++ AIRILSQPCSS NWS FE KK NK+ Sbjct: 548 VLGRKLNAPGDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEM 607 Query: 2060 DILEDLVYTRMNSKMMAYYNELEMRDK 2140 + DL++ N ++ A Y + + K Sbjct: 608 EKFNDLLFVHCNLRLQAIYRSRDGKSK 634 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 329 bits (844), Expect = 3e-87 Identities = 186/620 (30%), Positives = 319/620 (51%), Gaps = 20/620 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 481 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C VP+DV+ + Sbjct: 9 WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRDHIQTIL 68 Query: 482 CG--KDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 634 K TP K + + D + + S S L S+ H +T ++ Sbjct: 69 NSPKKQKTPKKPKVDKAVANDQQNS---SSASGGLHLNHGSSGQHGSTCPSLLFPRPSPS 125 Query: 635 --------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHST 790 K+ ++ D+ +A F N+I A +S + MV AIA+ G Y P++ Sbjct: 126 EQPAVDDGQKQKQEDADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYEN 185 Query: 791 LCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKN 970 L + L++ + D+ + +D W TGC+++ D+W+D + F+ PKG +FLK+ Sbjct: 186 LRSTLLEKVKGDIHDCYKKYRDEWKETGCTILCDSWSDGRTKSFVIFSVTCPKGTLFLKS 245 Query: 971 FERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCV 1150 + S +L ++L SV+ E+G ENV+Q++ + A+ Y L+M +Y ++ C Sbjct: 246 VDVSGHEDDASYLFELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCA 305 Query: 1151 SHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFV 1330 S+ + +LEDI K+ EW+ V ++A+ IV Y+Y + + +MR +T +E+ RP +RFV Sbjct: 306 SYCINKMLEDISKQ-EWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFV 364 Query: 1331 SHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLE 1510 ++++ L+SI+ E+NL+ M EW +R + I ++ FW E +SV E Sbjct: 365 ANYLTLRSIIIQEDNLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSE 424 Query: 1511 PLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKI 1690 PL+ +LR+VDG+ GY+YE +ER + ++ + KY+ +W++ + + N + + Sbjct: 425 PLVKILRIVDGDMPAMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDIIDRRWNMQLHSPL 484 Query: 1691 HAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFN 1870 HAAAAFLNPS+ Y+ K + +++ + + + +Y L Sbjct: 485 HAAAAFLNPSIFYNPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGT 544 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 +I+ P WW G E+P L++VAIRILSQPCSS C NWS FE TKK NK Sbjct: 545 DFAIMGRTLNAPGDWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNK 604 Query: 2051 LSPDILEDLVYTRMNSKMMA 2110 + + DLV+ N + A Sbjct: 605 VELEKFNDLVFVHCNLCLQA 624 >ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine max] gi|571487050|ref|XP_006590550.1| PREDICTED: uncharacterized protein LOC100805582 isoform X2 [Glycine max] Length = 675 Score = 328 bits (840), Expect = 9e-87 Identities = 185/630 (29%), Positives = 318/630 (50%), Gaps = 20/630 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 475 WE+ + K + C +C++++ GG+ R+K HL++ + +DI C VP DV+ + Sbjct: 9 WEHCVLVDATKQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQSIL 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNERN---------------GADSIVSPILSCPDSSTVLH 610 + K TP K++ + ++N G + P L P+ S Sbjct: 69 SAPKKPKTPKKQKTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQ 128 Query: 611 QTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHST 790 L K+ +D DR +A F N+I A +S + MV A+A+ G Y P++ Sbjct: 129 P--LEHDAQKQKQDDADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEK 186 Query: 791 LCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKN 970 L + L++ + D+ +D W TGC+++ D W+D + PKG +FLK+ Sbjct: 187 LRSTLLEKVKADIHSDYKKYRDEWKETGCTVLCDNWSDGRTGSLAVFSVACPKGTLFLKS 246 Query: 971 FERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCV 1150 + S +L ++L SV+ E+GAENVVQ++ + ++ Y C L++ RY ++ CV Sbjct: 247 VDVSGHENDSTYLFELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCV 306 Query: 1151 SHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFV 1330 ++ + +LEDI ++ +W+ +V ++A+ I Y+Y + L +MR +T KE+ RP +RFV Sbjct: 307 AYCIDKMLEDIGRQ-DWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFV 365 Query: 1331 SHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLE 1510 ++F+ L+SI+ E+N++ M EW R P + I ++ S FW E +SV E Sbjct: 366 TNFLSLKSIVMQEDNIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSE 425 Query: 1511 PLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKI 1690 PL+ LR+VDG+ GY+YE +ER + ++ + KY+ +W++ + + N + + Sbjct: 426 PLVKCLRMVDGDMPAMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDIIDRRWNMQIHSSL 485 Query: 1691 HAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFN 1870 HAAAAFLNPS+ Y+ K + ++ + + + +L Y L Sbjct: 486 HAAAAFLNPSISYNPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGT 545 Query: 1871 TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 2050 ++L P WW G E+P L+K A+RILSQPCSS NWS FE +K N+ Sbjct: 546 DFAVLGRTLNAPGDWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNR 605 Query: 2051 LSPDILEDLVYTRMNSKMMAYYNELEMRDK 2140 + + +LV+ N + + E + + Sbjct: 606 VELEKFSELVFVHSNLWLQTIFKRREAKSE 635 >ref|XP_004300713.1| PREDICTED: uncharacterized protein LOC101309161 [Fragaria vesca subsp. vesca] Length = 677 Score = 324 bits (830), Expect = 1e-85 Identities = 183/618 (29%), Positives = 325/618 (52%), Gaps = 23/618 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 475 WE+ + K + C +C++++ GG+ R+K HL++ + +DI C VP DV+ L+ Sbjct: 9 WEHCVLVDATKQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHILSIL 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNER---------------NGADSIVSPILS--CPDSSTV 604 K TP K + + ++ NG + P L CP ++ Sbjct: 69 ETPKKQKTPKKPKVDKAALANGQQISSSASGDFHPTHVSNGQNGSTCPSLLFLCPSPTS- 127 Query: 605 LHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNH 784 Q + + +K +DL D+ VA F N+I A +S + MV A+AE G +Y P++ Sbjct: 128 --QEPVDDVQKQK-QDLADKTVAVFFFHNSIPFSAARSIYYREMVDAVAECGGNYKAPSY 184 Query: 785 STLCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFL 964 L + L++ D+ + +D W TGC+++ ++W+D ++ + PKG +FL Sbjct: 185 EVLRSTLLEKVNSDIHDRYKKYRDEWKETGCTILCESWSDGRNKSLVIFSVTYPKGTLFL 244 Query: 965 KNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQ 1144 K+ + S +L ++L SV+ E+G E+VVQI+ + +S Y L+MG+Y ++ Sbjct: 245 KSVDVSGHEDDTTYLFELLESVVLEVGVEDVVQIITDTSSSYIYAGRLLMGKYSSLFWSP 304 Query: 1145 CVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSR 1324 C S+ + +LEDI K+ EW+ V ++A+ I +++ + L +MR + +E+ RP +R Sbjct: 305 CASYCINKILEDIGKQ-EWVCIVLEEARTITNFIGSHGWTLSMMRKFAGGRELVRPKINR 363 Query: 1325 FVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISV 1504 FV++F+ L+SI+ E+N++ M PEW + +R P + + ++ FW +E +++ Sbjct: 364 FVTNFLNLRSIVIQEDNIKHMFSHPEWVSSASSRRPEAQAVKSLLYVERFWQHAQEAVTI 423 Query: 1505 LEPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIH 1684 EPL+ +LR+VDG+ GY+YE +E + ++ + KY+ +W++ + + + + Sbjct: 424 AEPLVKILRIVDGDMPAMGYIYEGIESAKIAIKTYYKGIEEKYMPIWDIIDRRWSMQLHS 483 Query: 1685 KIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNE-MDDFAAQLLLYNGKSPK 1861 +HAAAA LNPS+ Y+ K + +R+G + M +E + + +Y Sbjct: 484 SLHAAAASLNPSIFYNPNFKIDS-RMRNGFQETMLRMASTHEDKMEITKEHPVYVTAQGA 542 Query: 1862 LFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKK 2041 L + +I+ P WW G E+P L++ A+RILSQPCSS C NWS FE KK Sbjct: 543 LGSDFAIMGRTLNAPGDWWAGYGYEIPTLQRYALRILSQPCSSHWCCWNWSTFESIHAKK 602 Query: 2042 INKLSPDILEDLVYTRMN 2095 ++ P+ +DLV+ N Sbjct: 603 HSRTEPENFDDLVFVHCN 620 >gb|AAR96007.1| transposase-like protein [Musa acuminata] Length = 670 Score = 323 bits (828), Expect = 2e-85 Identities = 191/621 (30%), Positives = 325/621 (52%), Gaps = 23/621 (3%) Frame = +2 Query: 311 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 481 WE+ + + + C +C +++ GG+ R+K HL++ + +DI C+ VP+DV+ L H++ Sbjct: 9 WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRNL-IHSI 67 Query: 482 CGKDLTPYKKRKTATCSTDNERNG---ADSIVSPILSCPDSSTVLHQTTLATIY------ 634 TP K++ D+ NG + S S + S+ H +T ++ Sbjct: 68 L---TTPRKQKAPKKLKIDHTANGPQHSSSSASGYNAKNAGSSGQHGSTCPSLLLPLPSP 124 Query: 635 ---------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHS 787 K+ D D +A F N+I A +S + AM+ AIA+ G Y P + Sbjct: 125 GAQPTANDAQKQKYDNADNKIALFFFHNSIPFSASKSIYYQAMIDAIADCGAGYKPPTYE 184 Query: 788 TLCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLK 967 L + L++ ++++ E +KD W TGC+++ D W+D + + + SPKG FLK Sbjct: 185 GLRSTLLEKVKEEINENHRKLKDEWKDTGCTILSDNWSDGRSKSLLVLSVASPKGTQFLK 244 Query: 968 NFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQC 1147 + S + +L ++L SVI E+GAENVVQ++ ++A+ Y L++ +YP ++ C Sbjct: 245 LVDISSRADDAYYLFELLDSVIMEVGAENVVQVITDSATSYTYAAGLLLKKYPSLFWFPC 304 Query: 1148 VSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRF 1327 S+ ++ +LEDI K +EW+ + ++ + I ++ L LM+ T +E+ RP +RF Sbjct: 305 ASYSIEKMLEDISK-LEWVSTTLEETRTIARFICSDGWILSLMKKLTGGRELVRPKVARF 363 Query: 1328 VSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVL 1507 ++HF+ L+SI+ E++L+ +W +R P I ++ FW E+I + Sbjct: 364 MTHFLTLRSIVNQEDDLKHFFSHADWLSSVHSRRPDALAIKSLLYLERFWKSAHEIIGMS 423 Query: 1508 EPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHK 1687 EPL+ +LRLVDG+ GY+YE +ER + ++ KY+ V E+ E + + Sbjct: 424 EPLLKLLRLVDGDMPAMGYIYEGIERAKMAIKAFYKGCEEKYMSVLEIIERRWSMHCHSH 483 Query: 1688 IHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMD--DFAAQLLLYNGKSPK 1861 +HAAAAFLNPS+ YD K++ ++R+G + + M P E D + +Y Sbjct: 484 LHAAAAFLNPSIFYDPSFKFD-VNMRNGFHAAMWKMF-PEENDRIELIKDQPVYIKAQGA 541 Query: 1862 LFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKK 2041 L + +I+ P WW G E+P+L++ A+RILSQPCSS NWSAFE TK Sbjct: 542 LGSKFAIMGRTLNSPGDWWATYGYEIPVLQRAAVRILSQPCSSYWFKWNWSAFENIYTKN 601 Query: 2042 INKLSPDILEDLVYTRMNSKM 2104 ++ + L DLV+ N ++ Sbjct: 602 HTRMELEKLNDLVFVHCNLRL 622 >ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis] Length = 764 Score = 311 bits (797), Expect = 9e-82 Identities = 188/604 (31%), Positives = 310/604 (51%), Gaps = 6/604 (0%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAVCG 487 WEYAE L G V CKFC + GGI+R+K HLSR + + C+ V +DV + Sbjct: 97 WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 156 Query: 488 KD---LTPY-KKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 655 K+ TP KK++ A + S++ P + T + + +++ Sbjct: 157 KEDVKETPIGKKQRVAEAKPVGIVCSSKSLMPLETPSPVTKVFATMTPMGNS-SLNNQEN 215 Query: 656 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 835 +R +A F N + +S S+ M+ A+ + G ++ P+ L T + + +V Sbjct: 216 AERSIALFFFENKLDFAVARSSSYQQMIDAVGKCGPGFTGPSAEALKTMWLDRIKSEVNV 275 Query: 836 YVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1015 +++ W++TGC+++ DTWTD K IN + SP FLK+ + S K +L D Sbjct: 276 QSKDIEKEWAMTGCTIIADTWTDNKSKALINFLVSSPSRTFFLKSVDTSSNFKNTKYLAD 335 Query: 1016 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1195 + SVI +IG ENVVQI+++++ Y V + I+ Y I+ C S + ++LE+ + +V Sbjct: 336 IFDSVIQDIGPENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQSLNIILEE-FSKV 394 Query: 1196 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEEN 1375 +W+ AQ I ++Y + L LM+ +T E+ R +++VS+F+ LQSIL+ Sbjct: 395 DWVNRCILQAQTISKFIYNNASMLDLMKKFTGGLELIRTGITKYVSNFLSLQSILKQRSR 454 Query: 1376 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 1552 L+ M SPE+ S + P + ++E FW EE +++ EP + VLR V G Sbjct: 455 LKHMFNSPEYSTSSPYANKPQSLSCISIVEDNDFWRAVEESVAISEPFLKVLREVSGGKP 514 Query: 1553 TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 1732 G +YE M R + +R + D K ++ + G + +H+AAAFLNPS+ Y+ Sbjct: 515 AVGSIYELMTRAKESIRTYYIMDENKCKIFLDIVDRNWRGQLHSPLHSAAAFLNPSIQYN 574 Query: 1733 GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 1912 +IK+ D N + + + P+ D Q+L ++ S L++ + P + Sbjct: 575 PEIKFLGSIKEDFFNVLEKLLPTPDTRRDITTQILTFSRASGMFGCKLAMEARETVPPGL 634 Query: 1913 WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 2092 WWE G P+L++VAIRILSQ CSS + R+WS F+ ++K NK+ + L DLVY Sbjct: 635 WWEQYGDSAPVLQRVAIRILSQVCSSFSFERHWSTFQQIHSEKRNKIDKETLNDLVYISY 694 Query: 2093 NSKM 2104 N K+ Sbjct: 695 NLKL 698 >gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis] Length = 694 Score = 306 bits (783), Expect = 4e-80 Identities = 193/605 (31%), Positives = 315/605 (52%), Gaps = 7/605 (1%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 475 WEYAE L G V CKFC + GGI+R+K HLSR + + C+ V +DV +A+ Sbjct: 24 WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 83 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 655 K+ + KK+K + + + ++VS + P + T +A + ++ Sbjct: 84 KEDVKETSSTKKQKLVEVKSPGNVSASKALVSTDTTSPVAKVFPAVTPVAPP-SLNSQEN 142 Query: 656 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 835 +R +A F N + G +S S+ MV AIA+ G ++ P+ TL T ++ + ++ Sbjct: 143 AERSIALFFFENKLDFGIARSSSYQLMVDAIAKCGPGFTGPSAETLKTTWLERIKSEMSL 202 Query: 836 YVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1015 +++ W TGC+++ DTWTD K IN + SP F K+ + S K L D Sbjct: 203 QSKDIEKEWMTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASAYFKNMKCLAD 262 Query: 1016 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1195 + SVI + G +NVVQ++++++ Y V + I+ Y I+ CVS + L+LE+ + +V Sbjct: 263 LFDSVIQDFGPDNVVQVIMDSSFNYTGVANHILQNYSTIFVSPCVSQCLNLILEE-FSKV 321 Query: 1196 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEEN 1375 +W+ Q I ++Y + L LM+ YT +E+ R ++ VS F+ LQSIL+ + Sbjct: 322 DWVNRCILQGQTISKFIYNSASMLDLMKKYTGGQELIRTGITKSVSSFLSLQSILKQKSR 381 Query: 1376 LRLMIVSPEWRDMSDN-RSPLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 1552 L+ M SPE+ S P + ++E + FW EE +++ EP + VLR V G Sbjct: 382 LKHMFNSPEYCTNSLYVNKPQSISCISIVEDSDFWRAVEESVAISEPFLKVLREVAGGKP 441 Query: 1553 TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 1732 G +YE M R + +R + D K ++ + K + +H+AAAFLNPS+ Y+ Sbjct: 442 AVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPSIQYN 501 Query: 1733 GKIKYEQPDIRDGMNYVVESMVGPNEM-DDFAAQLLLYNGKSPKLFNTLSILMMKKAHPR 1909 +IK+ I++ V+E ++ EM D +Q+ + +L++ P Sbjct: 502 PEIKF-LSSIKEDFFKVLEKLLPLPEMRRDITSQIFTFTKAMSMFGCSLAMEARDVVSPG 560 Query: 1910 VWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTR 2089 +WWE G P+L++VAIRILSQ CSS T R+WSAF+ ++K NK+ + L DLVY Sbjct: 561 LWWEQYGDSAPVLQRVAIRILSQVCSSFTFERHWSAFQQIHSEKRNKIDRETLNDLVYIN 620 Query: 2090 MNSKM 2104 N K+ Sbjct: 621 YNLKL 625 >ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobroma cacao] gi|508776178|gb|EOY23434.1| HAT transposon superfamily isoform 4 [Theobroma cacao] Length = 682 Score = 304 bits (779), Expect = 1e-79 Identities = 185/604 (30%), Positives = 308/604 (50%), Gaps = 6/604 (0%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 475 WEYAE L G V CKFC + GGI+R+K HLSR + + C+ V +DV +A+ Sbjct: 13 WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAILSS 72 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 655 K+ + KK+K A + + I+ P+ + + V T+ + ++ Sbjct: 73 KEEIKETSSVKKQKIAEARSPGNISTCSKII-PLEASSPVAKVFPATSPIAPPSLNSQEN 131 Query: 656 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 835 V+R +A F N + +S S+ AM+ A+ +FG ++ P+ TL T ++ + +V Sbjct: 132 VERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCL 191 Query: 836 YVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1015 + + W+ TGC+++ DTWTD K IN + SP F K+ + S K L D Sbjct: 192 QSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLAD 251 Query: 1016 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1195 + SVI + G ENVVQI+++++ Y +++ I+ Y I+ C S + L+LE+ + +V Sbjct: 252 LFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLILEE-FSKV 310 Query: 1196 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEEN 1375 +W+ AQ + ++Y + L LM+ +T E+E+ R ++ VS F+ LQS+L+ Sbjct: 311 DWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSR 370 Query: 1376 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 1552 L+ M SPE+ S + P + ++E FW +E +++ EP + VLR V G Sbjct: 371 LKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKP 430 Query: 1553 TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 1732 G +YE M R + +R + D K ++ + K + +H+A AFLNPS+ Y+ Sbjct: 431 AVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYN 490 Query: 1733 GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 1912 +IK+ D + + + P D Q+ + L++ P + Sbjct: 491 QEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGL 550 Query: 1913 WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 2092 WWE G P+L++VAIRILSQ CS+ T R+WS F+ ++K NK+ +IL DLVY Sbjct: 551 WWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINY 610 Query: 2093 NSKM 2104 N ++ Sbjct: 611 NLRL 614 >ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|590673575|ref|XP_007038932.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|508776176|gb|EOY23432.1| HAT transposon superfamily isoform 2 [Theobroma cacao] gi|508776177|gb|EOY23433.1| HAT transposon superfamily isoform 2 [Theobroma cacao] Length = 678 Score = 304 bits (779), Expect = 1e-79 Identities = 185/604 (30%), Positives = 308/604 (50%), Gaps = 6/604 (0%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 475 WEYAE L G V CKFC + GGI+R+K HLSR + + C+ V +DV +A+ Sbjct: 9 WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAILSS 68 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 655 K+ + KK+K A + + I+ P+ + + V T+ + ++ Sbjct: 69 KEEIKETSSVKKQKIAEARSPGNISTCSKII-PLEASSPVAKVFPATSPIAPPSLNSQEN 127 Query: 656 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 835 V+R +A F N + +S S+ AM+ A+ +FG ++ P+ TL T ++ + +V Sbjct: 128 VERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCL 187 Query: 836 YVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1015 + + W+ TGC+++ DTWTD K IN + SP F K+ + S K L D Sbjct: 188 QSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLAD 247 Query: 1016 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1195 + SVI + G ENVVQI+++++ Y +++ I+ Y I+ C S + L+LE+ + +V Sbjct: 248 LFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLILEE-FSKV 306 Query: 1196 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEEN 1375 +W+ AQ + ++Y + L LM+ +T E+E+ R ++ VS F+ LQS+L+ Sbjct: 307 DWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSR 366 Query: 1376 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 1552 L+ M SPE+ S + P + ++E FW +E +++ EP + VLR V G Sbjct: 367 LKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKP 426 Query: 1553 TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 1732 G +YE M R + +R + D K ++ + K + +H+A AFLNPS+ Y+ Sbjct: 427 AVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYN 486 Query: 1733 GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 1912 +IK+ D + + + P D Q+ + L++ P + Sbjct: 487 QEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGL 546 Query: 1913 WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 2092 WWE G P+L++VAIRILSQ CS+ T R+WS F+ ++K NK+ +IL DLVY Sbjct: 547 WWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINY 606 Query: 2093 NSKM 2104 N ++ Sbjct: 607 NLRL 610 >ref|XP_002513602.1| protein dimerization, putative [Ricinus communis] gi|223547510|gb|EEF49005.1| protein dimerization, putative [Ricinus communis] Length = 688 Score = 304 bits (778), Expect = 1e-79 Identities = 188/606 (31%), Positives = 315/606 (51%), Gaps = 7/606 (1%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 475 WEYAE L G V CKFC + GGI+R+K HLSR + + C+ V +DV +A+ Sbjct: 18 WEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 77 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 655 K+ + KK++ A + ++V+ + S ++ V T + + +++ Sbjct: 78 KEDIKEPSSAKKQRPAEAKSPAHIYATKALVN-VESVAPAAKVYPTVTSISPPSLSNQEN 136 Query: 656 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 835 +R +A F N + +SPS+ M++AI + G ++ P+ L T ++ + +V Sbjct: 137 AERSIALFFFENKLDFSVARSPSYQLMIEAIEKCGPGFTGPSAEILKTTWLERIKSEVSL 196 Query: 836 YVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1015 + + + W+ TGC+++ DTWTD K IN SP F K+ + S K L D Sbjct: 197 QLKDTEKEWTTTGCTIIADTWTDNKSRALINFFVSSPSRTFFHKSVDASSYFKNTKCLAD 256 Query: 1016 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1195 + SVI + GAENVVQI+++++ Y V + I+ Y I+ C S + L+LED + +V Sbjct: 257 LFDSVIQDFGAENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQCLNLILED-FSKV 315 Query: 1196 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEEN 1375 +W+ AQ + ++Y ++ L LM+ +T +E+ + ++ VS F+ LQS+L+ Sbjct: 316 DWVNRCISQAQTLSKFIYNNSSMLDLMKKFTGGQELIKTGITKSVSSFLSLQSMLKQRPR 375 Query: 1376 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 1552 L+LM S E+ S S P + ++E FW EE +++ EP + VLR V G Sbjct: 376 LKLMFSSNEYSANSSYSSKPQSIACITIVEDGDFWRAVEECVAITEPFLKVLREVSGGKP 435 Query: 1553 TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 1732 G +YE M R + +R + D K ++ + K + +H+AAAFLNP + Y+ Sbjct: 436 AVGSIYELMTRAKESIRTYYIMDESKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPCVQYN 495 Query: 1733 GKIKYEQPDIRDGMNYVVESMV-GPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPR 1909 +IK+ +I++ V+E ++ P+ D Q+ ++ S L++ P Sbjct: 496 PEIKF-LVNIKEDFFKVIEKLLPTPDMRRDITNQIFIFTRASGMFGCNLAMEARDTVAPG 554 Query: 1910 VWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTR 2089 +WWE G P+L++VAIRILSQ CS+ T R+W+ F ++K NK+ + L DLVY Sbjct: 555 LWWEQYGDSAPVLQRVAIRILSQVCSTFTFERHWNTFRQIHSEKRNKIDKETLNDLVYIN 614 Query: 2090 MNSKMM 2107 N K+M Sbjct: 615 YNLKLM 620 >ref|XP_004145979.1| PREDICTED: uncharacterized protein LOC101215128, partial [Cucumis sativus] Length = 685 Score = 302 bits (774), Expect = 4e-79 Identities = 193/608 (31%), Positives = 310/608 (50%), Gaps = 10/608 (1%) Frame = +2 Query: 311 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 475 WEYAE L G V CKFC + GGI+R+K HLSR R + C+ V +DV +A+ Sbjct: 13 WEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILAT 72 Query: 476 AVCGKDLTPYKKRKTATCSTDNERNGAD---SIVSPILSCPDSSTVLHQTTLA--TIYNK 640 K+ + KK+K A T S+VS P + T +A +++N Sbjct: 73 REEIKEASTGKKQKLAEVKTVESVPSISMCKSVVSIETPSPVAKVFPTVTPMAPPSLHNH 132 Query: 641 KDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSR 820 ++ ++ +A F N + +S S+ M+ AI + G ++ P+ TL T ++ + Sbjct: 133 EN---AEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTGPSAETLKTTWLERIK 189 Query: 821 KDVEEYVSNVKDSWSLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTG 1000 +V +++ W+ TGC++++DTWTD K IN + SP F K+ + S K Sbjct: 190 TEVSLQSKDIEKEWTTTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSVDASTYFKNT 249 Query: 1001 VFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLED 1180 LGD+ SVI + G ENVVQI+++++ Y + I+ Y I+ C S + +LE+ Sbjct: 250 KCLGDLFDSVIQDFGHENVVQIIMDSSLNYSGTANHILQTYGTIFVSPCASQCLNSILEE 309 Query: 1181 IYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIRRPCNSRFVSHFMMLQSIL 1360 + +V+W+ AQ I ++Y ++ L LMR +T +E+ R S+ VS F+ LQSIL Sbjct: 310 -FSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGGQELIRTGISKPVSSFLSLQSIL 368 Query: 1361 EVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVD 1540 + L+ M SP++ S P + +IE FW EE +++ EP + VLR V Sbjct: 369 KQRSRLKHMFNSPDYTTNSYANKPQSISCIAIIEDNDFWRAVEECVAISEPFLRVLREVC 428 Query: 1541 GEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPS 1720 G G +YE M R + +R + D +K ++ + K + +HAAAAFLNPS Sbjct: 429 GGKPAVGCIYELMTRAKESIRTYYIMDEIKCKTFLDIVDRKWRDQLHSPLHAAAAFLNPS 488 Query: 1721 LMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKA 1900 + Y+ +IK+ D N + + + P D Q+ + + +L++ Sbjct: 489 IQYNPEIKFLTSIKEDFFNVLEKLLPLPEMRRDITNQIFTFTKANGMFGCSLAMEARDTV 548 Query: 1901 HPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLV 2080 P +WWE G P+L++VAIRILSQ CS+ + R+WS F+ ++K NK+ + L DLV Sbjct: 549 SPWLWWEQFGDSAPVLQRVAIRILSQVCSTFSFERHWSMFQQIHSEKRNKIDKETLNDLV 608 Query: 2081 YTRMNSKM 2104 Y N K+ Sbjct: 609 YINYNLKL 616 >ref|XP_006573373.1| PREDICTED: uncharacterized protein LOC102669318 [Glycine max] Length = 816 Score = 302 bits (773), Expect = 5e-79 Identities = 190/656 (28%), Positives = 320/656 (48%), Gaps = 42/656 (6%) Frame = +2 Query: 311 WEYAEDL----KGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHA 478 W+Y L VC FC K GGI R K HL + G A + P V+ L + Sbjct: 23 WKYCHSLVEGDTNTIVCNFCGKITKGGITRAKQHLIGKSGNVAACKKTPPNVVEELKEYM 82 Query: 479 VCGKDLTPYKKRKTAT--------------CSTDNERNGADSI---VSPILSC------- 586 K T Y + C E ADS S C Sbjct: 83 ATKKSGTTYSTSGSGNMANIRDFEFGEPIGCDGSEEDEFADSCNAAASAKTKCGTKKGPM 142 Query: 587 ------PDSST------VLHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFV 730 P+++ +L Q + +K + V + +A+ + ++ I+ SF Sbjct: 143 DKFCKNPENAINRRKMEMLRQMNIRESMDKNEVLKVHQHIARFWYQAGLSFNLIKLKSFE 202 Query: 731 AMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDMK 910 MV AI ++G +P++ + L++ + E + ++ W GC++M D WTD K Sbjct: 203 NMVAAIGQYGPHLPIPSYHDIRVPLLKKEVEYTENLMKGHREQWVKYGCTIMSDAWTDRK 262 Query: 911 DVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKY 1090 C IN + S G +FLK+ + SD KTG L ++L ++++E+G ENVVQ+V +N S Y Sbjct: 263 QRCIINFLINSQAGTMFLKSVDGSDFVKTGEKLFELLDAIVEEVGEENVVQVVTDNGSNY 322 Query: 1091 ECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALK 1270 V L+ + HIY C +H + L+LEDI K + I+ A +V ++Y +++ L Sbjct: 323 VLVGKLLEEKRKHIYWTPCAAHCIDLMLEDIGK-LPLIRKTIRRAINLVGFIYAHSSTLS 381 Query: 1271 LMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKIT 1450 L+R +T ++E+ R +RF + ++ L+ + + + N+R M S EW ++ P ++ Sbjct: 382 LLRNFTNKRELVRHAITRFATSYLTLERLHKEKANIRKMFTSDEWTLNKLSKEPKGKEAA 441 Query: 1451 QMIESTTFWSRGEEVISVLEPLITVLRLVDGEGSTA-GYLYEAMERVRTELRQHCNSDSL 1627 +++ +FW+ + V+ PL+ VLRLVDGE A GY+YEAM++ + + + N++ Sbjct: 442 KVVLMPSFWNSVVYTLKVMAPLVKVLRLVDGERKPAMGYIYEAMDKAKETIMKSFNNNES 501 Query: 1628 KYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN 1807 KY V+E+ + + N + +HAAA FLNP YD ++ +G+ ++ ++ Sbjct: 502 KYKDVFEIIDKRWNCQLHRPLHAAAHFLNPEFFYDNTDLEFDFEVTNGLFECIKKLIPQF 561 Query: 1808 EMDD-FAAQLLLYNGKSPKLFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPC 1984 ++ +L LY + + ++ K P WW G + P L+K+AI+ILS C Sbjct: 562 DVQQKILTELHLYKIGADHFGSDFAMAQRKTHSPTYWWRMFGSQTPNLQKLAIKILSLTC 621 Query: 1985 SSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRMNSKMMAYYNELEMRDKFAIS 2152 S+S C RNWS FE +KK N+L L DLV+ + N ++ YN + D +++ Sbjct: 622 SASGCERNWSVFEQIHSKKRNRLEHKRLHDLVFVKYNQQLKQRYNARDEIDPISLN 677 >ref|XP_003553157.1| PREDICTED: uncharacterized protein LOC100793012 [Glycine max] Length = 816 Score = 301 bits (770), Expect = 1e-78 Identities = 190/657 (28%), Positives = 322/657 (49%), Gaps = 43/657 (6%) Frame = +2 Query: 311 WEYAEDL----KGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV-QALAFH 475 W+Y L VC FC K GGI R K HL + G ++A C P +V + L + Sbjct: 23 WKYCHSLVEGDTNTIVCNFCGKITKGGITRAKQHLIGKSG-NVAACKKTPPNVIEELKEY 81 Query: 476 AVCGKDLTPYKKRKTAT--------------CSTDNERNGADSI---VSPILSC------ 586 K T Y + C E ADS S C Sbjct: 82 MATKKSGTTYSTSGSGNMANIRDFEFGEPIGCDGSEEDEFADSCNAAASAKTKCGTKKGP 141 Query: 587 -------PDSST------VLHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSF 727 P+++ +L Q + +K + V + +A+ + ++ I+ SF Sbjct: 142 MDKFCKNPENAINRRKMEMLRQMNIRESMDKNEVLKVHQHIARFWYQAGLSFNLIKLKSF 201 Query: 728 VAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEEYVSNVKDSWSLTGCSLMLDTWTDM 907 MV AI ++G +P++ + L++ + E + ++ W GC++M D WTD Sbjct: 202 ENMVAAIGQYGPHLPIPSYHDIRVPLLKKEVEYTENLMKGHREQWVKYGCTIMSDAWTDQ 261 Query: 908 KDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASK 1087 K C IN + S G +FLK+ + SD KTG L ++L ++++E+G ENVVQ+V +N S Sbjct: 262 KQRCIINFLINSQAGTMFLKSVDDSDFVKTGEKLFELLDAIVEEVGEENVVQVVTDNGSN 321 Query: 1088 YECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTAL 1267 Y L+ + HIY C +H + L+LEDI K + I+ A +V ++Y +++ L Sbjct: 322 YVLAGKLLEEKRKHIYWTPCAAHCIDLMLEDIGK-LPLIRKTIRRAINLVGFIYAHSSTL 380 Query: 1268 KLMRVYTAEKEIRRPCNSRFVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKI 1447 L+R +T ++E+ R +RF + ++ L+ + + + N+R M S EW ++ P ++ Sbjct: 381 SLLRNFTNKRELVRHAITRFATSYLTLERLHKEKANIRKMFTSDEWTLNKLSKEPKGKEA 440 Query: 1448 TQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGSTA-GYLYEAMERVRTELRQHCNSDS 1624 +++ +FW+ + V+ PL+ VLRLVDGE A GY+YEAM++ + + + N++ Sbjct: 441 AKVVLMPSFWNSVVYTLKVMAPLVKVLRLVDGERKPAMGYIYEAMDKAKETIMKSFNNNE 500 Query: 1625 LKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGP 1804 KY V+E+ + + N + +HAAA FLNP YD ++ +G+ ++ ++ Sbjct: 501 SKYKDVFEIIDKRWNCQLHRPLHAAAHFLNPEFFYDNTDLEFDFEVTNGLFECIKKLIPQ 560 Query: 1805 NEMDD-FAAQLLLYNGKSPKLFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQP 1981 ++ +L LY + + ++ K P WW G + P L+K+AI+ILS Sbjct: 561 FDVQQKILTELHLYKIGADHFGSDFAMAQRKTHSPTYWWRMFGSQTPNLQKLAIKILSLT 620 Query: 1982 CSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRMNSKMMAYYNELEMRDKFAIS 2152 CS+S C RNWS FE +KK N+L L DLV+ + N ++ YN + D +++ Sbjct: 621 CSASGCERNWSVFEQIHSKKRNRLEHKRLHDLVFVKYNQQLKQRYNARDEIDPISLN 677