BLASTX nr result

ID: Akebia27_contig00001104 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00001104
         (2377 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263775.2| PREDICTED: uncharacterized protein LOC100247...   343   2e-91
emb|CBI38444.3| unnamed protein product [Vitis vinifera]              343   2e-91
emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera]   341   8e-91
ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222...   337   1e-89
ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   336   3e-89
ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu...   333   2e-88
ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A...   332   6e-88
ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627...   330   2e-87
ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]...   328   5e-87
ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805...   327   2e-86
ref|XP_004300713.1| PREDICTED: uncharacterized protein LOC101309...   323   2e-85
gb|AAR96007.1| transposase-like protein [Musa acuminata]              323   3e-85
ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618...   310   1e-81
gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]     307   2e-80
ref|XP_006573373.1| PREDICTED: uncharacterized protein LOC102669...   303   2e-79
ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobr...   303   2e-79
ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobr...   303   2e-79
ref|XP_002513602.1| protein dimerization, putative [Ricinus comm...   303   2e-79
ref|XP_003553157.1| PREDICTED: uncharacterized protein LOC100793...   302   4e-79
ref|XP_006594368.1| PREDICTED: uncharacterized protein LOC102669...   302   5e-79

>ref|XP_002263775.2| PREDICTED: uncharacterized protein LOC100247282 [Vitis vinifera]
          Length = 672

 Score =  343 bits (880), Expect = 2e-91
 Identities = 194/620 (31%), Positives = 323/620 (52%), Gaps = 20/620 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 1996
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C  VP DV  Q  +  
Sbjct: 9    WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 1837
            +   K  TP K +     +     + A     P       S+  H +T   ++       
Sbjct: 69   STPKKQKTPKKTKVDLAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 124

Query: 1836 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 1678
                    K+ +D  D+ VA  F  N++   A +S  +  MV AIAE G  Y  P++  L
Sbjct: 125  QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 184

Query: 1677 CTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 1498
             + L++  + DV +    ++D W  TGC+++ D W+D +    +      PKG +FLK+ 
Sbjct: 185  RSTLMEKVKCDVNDCCKKLRDGWRATGCTILCDCWSDGRTKSLVVFSVTCPKGTLFLKSV 244

Query: 1497 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1318
            + S       +L ++L SV+ E+G ENVVQ++ ++A+ Y     L+M +Y  ++   C S
Sbjct: 245  DISGHADDAHYLYELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 304

Query: 1317 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVS 1138
              +  +LEDI K+ EW+ +V ++A+ I  Y+Y +   L +MR +T  +E+ RP  +RFV+
Sbjct: 305  FCIDKMLEDISKQ-EWVSTVLEEAKTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 363

Query: 1137 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 958
            +F+ L+SI+  E+NL+LM    +W     +R P ++ +  ++    FW    E +SV EP
Sbjct: 364  NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDSQNVKSLLYLERFWKSAHEAVSVSEP 423

Query: 957  LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 778
            L+ VLR+VDG+    GY+YE +ER +  ++ + NS   KY+ +W++ + + N  +   +H
Sbjct: 424  LVKVLRIVDGDMPAMGYIYEGIERAKIAIKGYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 483

Query: 777  AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 601
            AAAAFLNPS+ Y    K +   +R+G    +  M   + +  +   +  +Y      L  
Sbjct: 484  AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 542

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              +I+         WW   G E+P L++ AIRILSQPCSS  CG NWS+FE   TKK N+
Sbjct: 543  EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 602

Query: 420  LSPDILEDLVYTRMNSKMMA 361
            +  + L DLV    N  + A
Sbjct: 603  MELEKLNDLVLVHCNLHLQA 622


>emb|CBI38444.3| unnamed protein product [Vitis vinifera]
          Length = 712

 Score =  343 bits (880), Expect = 2e-91
 Identities = 194/620 (31%), Positives = 323/620 (52%), Gaps = 20/620 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 1996
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C  VP DV  Q  +  
Sbjct: 49   WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 108

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 1837
            +   K  TP K +     +     + A     P       S+  H +T   ++       
Sbjct: 109  STPKKQKTPKKTKVDLAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 164

Query: 1836 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 1678
                    K+ +D  D+ VA  F  N++   A +S  +  MV AIAE G  Y  P++  L
Sbjct: 165  QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 224

Query: 1677 CTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 1498
             + L++  + DV +    ++D W  TGC+++ D W+D +    +      PKG +FLK+ 
Sbjct: 225  RSTLMEKVKCDVNDCCKKLRDGWRATGCTILCDCWSDGRTKSLVVFSVTCPKGTLFLKSV 284

Query: 1497 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1318
            + S       +L ++L SV+ E+G ENVVQ++ ++A+ Y     L+M +Y  ++   C S
Sbjct: 285  DISGHADDAHYLYELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 344

Query: 1317 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVS 1138
              +  +LEDI K+ EW+ +V ++A+ I  Y+Y +   L +MR +T  +E+ RP  +RFV+
Sbjct: 345  FCIDKMLEDISKQ-EWVSTVLEEAKTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 403

Query: 1137 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 958
            +F+ L+SI+  E+NL+LM    +W     +R P ++ +  ++    FW    E +SV EP
Sbjct: 404  NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDSQNVKSLLYLERFWKSAHEAVSVSEP 463

Query: 957  LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 778
            L+ VLR+VDG+    GY+YE +ER +  ++ + NS   KY+ +W++ + + N  +   +H
Sbjct: 464  LVKVLRIVDGDMPAMGYIYEGIERAKIAIKGYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 523

Query: 777  AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 601
            AAAAFLNPS+ Y    K +   +R+G    +  M   + +  +   +  +Y      L  
Sbjct: 524  AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 582

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              +I+         WW   G E+P L++ AIRILSQPCSS  CG NWS+FE   TKK N+
Sbjct: 583  EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 642

Query: 420  LSPDILEDLVYTRMNSKMMA 361
            +  + L DLV    N  + A
Sbjct: 643  MELEKLNDLVLVHCNLHLQA 662


>emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera]
          Length = 926

 Score =  341 bits (875), Expect = 8e-91
 Identities = 194/620 (31%), Positives = 320/620 (51%), Gaps = 20/620 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV--QALAFH 1996
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C  VP DV  Q  +  
Sbjct: 263  WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVSCTEVPNDVRDQIQSIL 322

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 1837
            +   K  TP K +     +     + A     P       S+  H +T   ++       
Sbjct: 323  STPKKQKTPKKTKVDXAANGQQNSSSASGDFHP----NHGSSGQHGSTCPLLFPRPSPSE 378

Query: 1836 -------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTL 1678
                    K+ +D  D+ VA  F  N++   A +S  +  MV AIAE G  Y  P++  L
Sbjct: 379  QPAVDDEQKQKQDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKL 438

Query: 1677 CTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNF 1498
             + L++  + DV +    ++D W  TGC+++ D W+D +           PKG +FLK+ 
Sbjct: 439  RSTLMEKVKCDVNDCCKKLRDGWRXTGCTILCDCWSDGRTKSLXVFSVTCPKGTLFLKSV 498

Query: 1497 ERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVS 1318
            + S       +L ++L SV+ E+G ENVVQ++ ++A+ Y     L+M +Y  ++   C S
Sbjct: 499  DISGHADDAHYLFELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 558

Query: 1317 HGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVS 1138
              +  +LEDI K+ EW+ +V ++A  I  Y+Y +   L +MR +T  +E+ RP  +RFV+
Sbjct: 559  FCIDKMLEDISKQ-EWVSTVLEEANTITHYIYSHAWILNMMRKFTGGRELIRPRITRFVT 617

Query: 1137 HFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEP 958
            +F+ L+SI+  E+NL+LM    +W     +R P  + +  ++    FW    E +SV EP
Sbjct: 618  NFLSLRSIVVQEDNLKLMFSHMDWMSSVYSRRPDAQNVKSLLYLERFWKSAHEAVSVSEP 677

Query: 957  LITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIH 778
            L+ VLR+VDG+    GY+YE +ER +  ++ + NS   KY+ +W++ + + N  +   +H
Sbjct: 678  LVKVLRIVDGDMPAMGYIYEGIERAKIAIKXYYNSIEEKYMPIWDIIDRRWNVQLHSPLH 737

Query: 777  AAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKLFN 601
            AAAAFLNPS+ Y    K +   +R+G    +  M   + +  +   +  +Y      L  
Sbjct: 738  AAAAFLNPSIFYGPNFKVDL-RMRNGFQEAMRKMATEDRDKIEITKEHPIYINAQGALGT 796

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              +I+         WW   G E+P L++ AIRILSQPCSS  CG NWS+FE   TKK N+
Sbjct: 797  EFAIMGRTLNAAGDWWAGYGYEIPTLQRAAIRILSQPCSSHWCGWNWSSFEALHTKKRNR 856

Query: 420  LSPDILEDLVYTRMNSKMMA 361
            +  + L DLV    N  + A
Sbjct: 857  MELEKLNDLVLVHCNLHLQA 876


>ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus]
          Length = 673

 Score =  337 bits (864), Expect = 1e-89
 Identities = 197/613 (32%), Positives = 326/613 (53%), Gaps = 18/613 (2%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 1996
            WE+    +  + +  C +C++++ GG+ R+K HL++ + +DI  C  VP DV+       
Sbjct: 9    WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGAD--------SIVSPILSCPDSSTVLHQTTLATI 1840
            +   K   P K +     +T+ +++ +         S      +CP +   L  +    I
Sbjct: 69   STPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPI 128

Query: 1839 YN--KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKL 1666
             +  K+ KD  D+ VA  F  N+I   A +S  +  MV AIAE+G  Y  P++  L + L
Sbjct: 129  DDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTL 188

Query: 1665 VQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSD 1486
            +   + D+       +D W  TGC+++ D+W+D +   F+ +     KG +FLK+ + S 
Sbjct: 189  LDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISG 248

Query: 1485 KGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQ 1306
                  +L D+L ++I E+G ENVVQI+ +  + Y     L+M +Y  ++   CVS+ V 
Sbjct: 249  HEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVN 308

Query: 1305 LLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMM 1126
             +LEDI K +EW+ +V ++A++I  Y+Y + + L  MR +T  KE+ RP  +RFV++F+ 
Sbjct: 309  QMLEDISK-IEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLS 367

Query: 1125 LQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLITV 946
            L+SI+ +E+NL+ M    EW     +R P  + I  ++    FW    E I++ EPLI +
Sbjct: 368  LRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRI 427

Query: 945  LRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAA 766
            LR+VDG+    GY++E +ER + E++ + N    KY+ +WE  + + N  +   +H AAA
Sbjct: 428  LRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAA 487

Query: 765  FLNPSLMYDGKIKYEQPDIRDGMNYVVESM--VGPNEMDDFAAQLLLYNGKSPKLFNTLS 592
            FLNPS+ Y+   K +   IR+G    +  M     ++M+         NG+   L    +
Sbjct: 488  FLNPSVFYNPNFKIDL-RIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQG-ALGTDFA 545

Query: 591  ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTC-GRNWSAFEVAKTKKINKLS 415
            IL      P  WW   G E+P L++ A+RILSQPCSS  C G NWS FE   +KK ++  
Sbjct: 546  ILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAE 605

Query: 414  PDILEDLVYTRMN 376
             + L DLV+ + N
Sbjct: 606  QEKLTDLVFVQCN 618


>ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis
            sativus]
          Length = 673

 Score =  336 bits (862), Expect = 3e-89
 Identities = 197/613 (32%), Positives = 325/613 (53%), Gaps = 18/613 (2%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 1996
            WE+    +  + +  C +C++++ GG+ R+K HL++ + +DI  C  VP DV+       
Sbjct: 9    WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGIL 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGAD--------SIVSPILSCPDSSTVLHQTTLATI 1840
            +   K   P K +     +T+ +++ +         S      +CP +   L  +    I
Sbjct: 69   STPKKQKAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPI 128

Query: 1839 YN--KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKL 1666
             +  K+ KD  D+ VA  F  N+I   A +S  +  MV AIAE+G  Y  P++  L + L
Sbjct: 129  DDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTL 188

Query: 1665 VQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSD 1486
            +   + D+       +D W  TGC+++ D+W+D +   F+ +     KG +FLK+ + S 
Sbjct: 189  LDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLVISVTCSKGTLFLKSVDISG 248

Query: 1485 KGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQ 1306
                  +L D+L ++I E+G ENVVQI+ +  + Y     L+M +Y  ++   CVS+ V 
Sbjct: 249  HEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVN 308

Query: 1305 LLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMM 1126
             +LEDI K +EW+ +V ++A++I  Y+Y + + L  MR +T  KE+ RP  +RFV++F+ 
Sbjct: 309  QMLEDISK-IEWVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLS 367

Query: 1125 LQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLITV 946
            L+SI+ +E+NL+ M    EW     +R P  + I  ++    FW    E I++ EPLI +
Sbjct: 368  LRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRI 427

Query: 945  LRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAA 766
            LR+VDG+    GY++E +ER + E++ + N    KY+ +WE  + + N  +   +H AAA
Sbjct: 428  LRIVDGDMPAMGYIFEGIERAKVEIKTYYNGFEDKYMPIWETIDRRWNLQLHTTLHTAAA 487

Query: 765  FLNPSLMYDGKIKYEQPDIRDGMNYVVESM--VGPNEMDDFAAQLLLYNGKSPKLFNTLS 592
            FLNPS  Y+   K +   IR+G    +  M     ++M+         NG+   L    +
Sbjct: 488  FLNPSXFYNPNFKIDL-RIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQG-ALGTDFA 545

Query: 591  ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTC-GRNWSAFEVAKTKKINKLS 415
            IL      P  WW   G E+P L++ A+RILSQPCSS  C G NWS FE   +KK ++  
Sbjct: 546  ILGRTINAPGDWWSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAE 605

Query: 414  PDILEDLVYTRMN 376
             + L DLV+ + N
Sbjct: 606  QEKLTDLVFVQCN 618


>ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis]
            gi|223549490|gb|EEF50978.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 670

 Score =  333 bits (854), Expect = 2e-88
 Identities = 191/624 (30%), Positives = 320/624 (51%), Gaps = 22/624 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 1990
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C  VP+DV+    + +
Sbjct: 9    WEHCVLVDATRQKVRCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVR----NHI 64

Query: 1989 CGKDLTPYKKRKTATCSTDNERNGADSIVSPI--------------LSCPD----SSTVL 1864
                 TP K++      TD   NG D+  S                 +CP          
Sbjct: 65   QSILSTPKKQKTPKKQKTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPT 124

Query: 1863 HQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHS 1684
             Q  +    N+K  +  D+ +A  F  N+IA  A +S  +  M  A+AE G  Y  P+  
Sbjct: 125  SQPVVDDAQNEKQNN-ADKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFE 183

Query: 1683 TLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLK 1504
             L + L++  + D+ ++    +D W  TGC+++ D W+D +    I      PKG +FLK
Sbjct: 184  KLRSSLLEKVKGDIHDWYRKYRDDWKETGCTILCDGWSDGRTKSVIVFSVTCPKGTLFLK 243

Query: 1503 NFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQC 1324
            + + S       +L ++L S++ E+G ENV+Q++ ++ + Y     L+M +Y  ++   C
Sbjct: 244  SVDISGHENDANYLFELLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPC 303

Query: 1323 VSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRF 1144
             S+ V  +LEDI K+ EW+ +V ++A  I  Y+Y +   L +MR +T  +E+ RP  +R+
Sbjct: 304  ASYCVNKMLEDISKQ-EWVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRY 362

Query: 1143 VSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVL 964
            VS+++ L++I+  E+NL+ M    EW     +R P  + +   +    FW    E +S+ 
Sbjct: 363  VSNYLSLRAIVIQEDNLKHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSIS 422

Query: 963  EPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHK 784
            EPLI +LR+VDG+    GY+YE +ER +  ++ +      KY+ +WE+ + + N  +   
Sbjct: 423  EPLIKILRIVDGDMPAMGYIYEVLERAKVSIKAYYKGIEDKYMPIWEIIDRRWNIQLHSP 482

Query: 783  IHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN-EMDDFAAQLLLYNGKSPKL 607
            +HAAAAFLNPS+ Y+   K +   +R+G    +  M   + +  +   +  +Y      L
Sbjct: 483  LHAAAAFLNPSIFYNQNFKIDL-RMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGAL 541

Query: 606  FNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKI 427
                +I+      P  WW   G E+P L++VAIR+LSQPCSS  C  NWS FE   TKK 
Sbjct: 542  GTDFAIMGRTLNSPGDWWAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKR 601

Query: 426  NKLSPDILEDLVYTRMNSKMMAYY 355
            NK   + L DLV+   N  + A Y
Sbjct: 602  NKAELEKLNDLVFVHCNLWLQAIY 625


>ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda]
            gi|548861623|gb|ERN18994.1| hypothetical protein
            AMTR_s00061p00028660 [Amborella trichopoda]
          Length = 863

 Score =  332 bits (850), Expect = 6e-88
 Identities = 195/620 (31%), Positives = 326/620 (52%), Gaps = 20/620 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 1990
            WE+    +  + +  C +C++++ GG+ R+K HL++ + +DI  C+ VP DV+ L    +
Sbjct: 195  WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVL 254

Query: 1989 CG--KDLTPYKKRKTATCSTDNERNGADS-------------IVSPILSCPDSSTVLHQT 1855
                K  TP K +   T ++ +  + A                  P L  P  S    Q 
Sbjct: 255  NTPRKQKTPKKPKIEQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPS-GQP 313

Query: 1854 TLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLC 1675
             L     +K ++  D+ +A  F  N+I   + +S  +  MV AIA+ G  Y  P++  L 
Sbjct: 314  ILDDSQRQKQEE-ADKKIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLR 372

Query: 1674 TKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFE 1495
            T L++  + ++ +     +D W  +GC++M D WTD +    I      P+G +FLK+ +
Sbjct: 373  TTLLEKVKVEITDSYKTYRDEWRESGCTIMSDGWTDGRSKFLIVFSVACPRGTLFLKSVD 432

Query: 1494 RSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSH 1315
             S       +L ++L SV+ E+G E +VQ++ ++A+ Y     L+  +YP ++   C S+
Sbjct: 433  ASAHVDDAHYLFELLESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASY 492

Query: 1314 GVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSH 1135
             +  +LEDI K+ EW+ +V ++A+ I  Y+Y ++  L LM+ +T  KE+ R   +RFV+H
Sbjct: 493  CIDRMLEDISKQ-EWVSTVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTH 551

Query: 1134 FMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPL 955
            F+ L+SI+  E+NL+ M    EW     ++    + +  +I    FW   +EV+++ EPL
Sbjct: 552  FLSLRSIVIHEDNLKHMFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPL 611

Query: 954  ITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHA 775
            I VLR+VDG+    GY+YE +ER +  ++ +      KY+ +WE+ + + N  +   +HA
Sbjct: 612  IKVLRIVDGDMPAMGYIYEGIERAKVAIKAYYKGSEDKYMPIWEIIDRRWNLQLHSPLHA 671

Query: 774  AAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMD--DFAAQLLLYNGKSPKLFN 601
            AAAFLNP++ Y+   K +   IR+G +  +  MV  N+ D  +   +  +Y      L N
Sbjct: 672  AAAFLNPAIFYNPSFKIDSK-IRNGFHEAMMKMV-LNDKDKMELTKETPMYINAHGALGN 729

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              +++      P  WW   G EVP+L++ AIRILSQPCSS  C  NW  FE   TKK N+
Sbjct: 730  DFAMMARTLNTPGDWWAGYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNR 789

Query: 420  LSPDILEDLVYTRMNSKMMA 361
            L  +   DLVY   N +  A
Sbjct: 790  LEQEKFNDLVYVHCNLRFQA 809


>ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis]
          Length = 674

 Score =  330 bits (846), Expect = 2e-87
 Identities = 186/627 (29%), Positives = 321/627 (51%), Gaps = 17/627 (2%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 1990
            WE+    +  + +  C +C++++ GG+ R+K HL++ + +DI  C+ VP+DV+      +
Sbjct: 9    WEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRDHIQRIL 68

Query: 1989 CGKDLTPYKKRKTATCSTDNERNGADSIVSPI-----------LSCPDSSTVLHQTTLAT 1843
                     KR     +T N +  + S    I            SCP         ++  
Sbjct: 69   SIPKKQKNPKRPKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQP 128

Query: 1842 IYN---KKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCT 1672
            I +   K+ +D  D+ +A  F  N+I   A +S  +  MV AIAE G  Y  P++  L +
Sbjct: 129  IVDDTQKQRQDDTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRS 188

Query: 1671 KLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFER 1492
             L++  + D+++     ++ W  TGC+++ D W+D +    +      PKG +FLK+ + 
Sbjct: 189  TLLEKVKVDIDDCCKKYREEWKETGCTILCDNWSDERTKSLVVFSVACPKGTLFLKSVDV 248

Query: 1491 SDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHG 1312
            S   +   FL ++L SV+ ++G ENV+Q++ ++A+ Y     L+M +Y  ++   C ++ 
Sbjct: 249  SGHEEDATFLFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYC 308

Query: 1311 VQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHF 1132
            +  +LEDI K+ EW+  V ++A+ I  Y Y +   L +MR  T  +E+ RP  +RFV+++
Sbjct: 309  IDKMLEDISKQ-EWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANY 367

Query: 1131 MMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLEPLI 952
            + L+SI+  EENL+ M    EW     +R P  + I  ++    FW    EV+SV EPL+
Sbjct: 368  LSLRSIVIHEENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLV 427

Query: 951  TVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAA 772
             +LR+VDG+    GY+YE +ER +  ++ +      KY+ +W++ + + N  +   +HAA
Sbjct: 428  KILRIVDGDMPAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDIIDRRWNMQLHSPLHAA 487

Query: 771  AAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLS 592
            AAFLNPS+ Y+   K +          +++      +  +   +  +Y      L    +
Sbjct: 488  AAFLNPSIFYNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFA 547

Query: 591  ILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSP 412
            +L  K   P  WW   G E+P L++ AIRILSQPCSS     NWS FE    KK NK+  
Sbjct: 548  VLGRKLNAPGDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEM 607

Query: 411  DILEDLVYTRMNSKMMAYYNELEMRDK 331
            +   DL++   N ++ A Y   + + K
Sbjct: 608  EKFNDLLFVHCNLRLQAIYRSRDGKSK 634


>ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]
            gi|508777206|gb|EOY24462.1| HAT transposon superfamily
            [Theobroma cacao]
          Length = 674

 Score =  328 bits (842), Expect = 5e-87
 Identities = 186/620 (30%), Positives = 319/620 (51%), Gaps = 20/620 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 1990
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C  VP+DV+      +
Sbjct: 9    WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRDHIQTIL 68

Query: 1989 CG--KDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIY------- 1837
                K  TP K +     + D + +   S  S  L     S+  H +T  ++        
Sbjct: 69   NSPKKQKTPKKPKVDKAVANDQQNS---SSASGGLHLNHGSSGQHGSTCPSLLFPRPSPS 125

Query: 1836 --------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHST 1681
                     K+ ++  D+ +A  F  N+I   A +S  +  MV AIA+ G  Y  P++  
Sbjct: 126  EQPAVDDGQKQKQEDADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYEN 185

Query: 1680 LCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKN 1501
            L + L++  + D+ +     +D W  TGC+++ D+W+D +   F+      PKG +FLK+
Sbjct: 186  LRSTLLEKVKGDIHDCYKKYRDEWKETGCTILCDSWSDGRTKSFVIFSVTCPKGTLFLKS 245

Query: 1500 FERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCV 1321
             + S       +L ++L SV+ E+G ENV+Q++ + A+ Y     L+M +Y  ++   C 
Sbjct: 246  VDVSGHEDDASYLFELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCA 305

Query: 1320 SHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFV 1141
            S+ +  +LEDI K+ EW+  V ++A+ IV Y+Y +   + +MR +T  +E+ RP  +RFV
Sbjct: 306  SYCINKMLEDISKQ-EWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFV 364

Query: 1140 SHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLE 961
            ++++ L+SI+  E+NL+ M    EW     +R    + I  ++    FW    E +SV E
Sbjct: 365  ANYLTLRSIIIQEDNLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSE 424

Query: 960  PLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKI 781
            PL+ +LR+VDG+    GY+YE +ER +  ++ +      KY+ +W++ + + N  +   +
Sbjct: 425  PLVKILRIVDGDMPAMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDIIDRRWNMQLHSPL 484

Query: 780  HAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFN 601
            HAAAAFLNPS+ Y+   K +          +++      +  +   +  +Y      L  
Sbjct: 485  HAAAAFLNPSIFYNPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGT 544

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              +I+      P  WW   G E+P L++VAIRILSQPCSS  C  NWS FE   TKK NK
Sbjct: 545  DFAIMGRTLNAPGDWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNK 604

Query: 420  LSPDILEDLVYTRMNSKMMA 361
            +  +   DLV+   N  + A
Sbjct: 605  VELEKFNDLVFVHCNLCLQA 624


>ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine
            max] gi|571487050|ref|XP_006590550.1| PREDICTED:
            uncharacterized protein LOC100805582 isoform X2 [Glycine
            max]
          Length = 675

 Score =  327 bits (838), Expect = 2e-86
 Identities = 185/630 (29%), Positives = 318/630 (50%), Gaps = 20/630 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 1996
            WE+    +  K +  C +C++++ GG+ R+K HL++ + +DI  C  VP DV+    +  
Sbjct: 9    WEHCVLVDATKQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQSIL 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERN---------------GADSIVSPILSCPDSSTVLH 1861
            +   K  TP K++       + ++N               G +    P L  P+ S    
Sbjct: 69   SAPKKPKTPKKQKTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQ 128

Query: 1860 QTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHST 1681
               L     K+ +D  DR +A  F  N+I   A +S  +  MV A+A+ G  Y  P++  
Sbjct: 129  P--LEHDAQKQKQDDADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEK 186

Query: 1680 LCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKN 1501
            L + L++  + D+       +D W  TGC+++ D W+D +           PKG +FLK+
Sbjct: 187  LRSTLLEKVKADIHSDYKKYRDEWKETGCTVLCDNWSDGRTGSLAVFSVACPKGTLFLKS 246

Query: 1500 FERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCV 1321
             + S       +L ++L SV+ E+GAENVVQ++ + ++ Y C   L++ RY  ++   CV
Sbjct: 247  VDVSGHENDSTYLFELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCV 306

Query: 1320 SHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFV 1141
            ++ +  +LEDI ++ +W+ +V ++A+ I  Y+Y +   L +MR +T  KE+ RP  +RFV
Sbjct: 307  AYCIDKMLEDIGRQ-DWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFV 365

Query: 1140 SHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVLE 961
            ++F+ L+SI+  E+N++ M    EW      R P  + I  ++ S  FW    E +SV E
Sbjct: 366  TNFLSLKSIVMQEDNIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSE 425

Query: 960  PLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKI 781
            PL+  LR+VDG+    GY+YE +ER +  ++ +      KY+ +W++ + + N  +   +
Sbjct: 426  PLVKCLRMVDGDMPAMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDIIDRRWNMQIHSSL 485

Query: 780  HAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFN 601
            HAAAAFLNPS+ Y+   K +          ++   +   +  +   +L  Y      L  
Sbjct: 486  HAAAAFLNPSISYNPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGT 545

Query: 600  TLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINK 421
              ++L      P  WW   G E+P L+K A+RILSQPCSS     NWS FE    +K N+
Sbjct: 546  DFAVLGRTLNAPGDWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNR 605

Query: 420  LSPDILEDLVYTRMNSKMMAYYNELEMRDK 331
            +  +   +LV+   N  +   +   E + +
Sbjct: 606  VELEKFSELVFVHSNLWLQTIFKRREAKSE 635


>ref|XP_004300713.1| PREDICTED: uncharacterized protein LOC101309161 [Fragaria vesca
            subsp. vesca]
          Length = 677

 Score =  323 bits (829), Expect = 2e-85
 Identities = 183/618 (29%), Positives = 325/618 (52%), Gaps = 23/618 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQ--ALAFH 1996
            WE+    +  K +  C +C++++ GG+ R+K HL++ + +DI  C  VP DV+   L+  
Sbjct: 9    WEHCVLVDATKQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHILSIL 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNER---------------NGADSIVSPILS--CPDSSTV 1867
                K  TP K +       + ++               NG +    P L   CP  ++ 
Sbjct: 69   ETPKKQKTPKKPKVDKAALANGQQISSSASGDFHPTHVSNGQNGSTCPSLLFLCPSPTS- 127

Query: 1866 LHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNH 1687
              Q  +  +  +K +DL D+ VA  F  N+I   A +S  +  MV A+AE G +Y  P++
Sbjct: 128  --QEPVDDVQKQK-QDLADKTVAVFFFHNSIPFSAARSIYYREMVDAVAECGGNYKAPSY 184

Query: 1686 STLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFL 1507
              L + L++    D+ +     +D W  TGC+++ ++W+D ++   +      PKG +FL
Sbjct: 185  EVLRSTLLEKVNSDIHDRYKKYRDEWKETGCTILCESWSDGRNKSLVIFSVTYPKGTLFL 244

Query: 1506 KNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQ 1327
            K+ + S       +L ++L SV+ E+G E+VVQI+ + +S Y     L+MG+Y  ++   
Sbjct: 245  KSVDVSGHEDDTTYLFELLESVVLEVGVEDVVQIITDTSSSYIYAGRLLMGKYSSLFWSP 304

Query: 1326 CVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSR 1147
            C S+ +  +LEDI K+ EW+  V ++A+ I +++  +   L +MR +   +E+ RP  +R
Sbjct: 305  CASYCINKILEDIGKQ-EWVCIVLEEARTITNFIGSHGWTLSMMRKFAGGRELVRPKINR 363

Query: 1146 FVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISV 967
            FV++F+ L+SI+  E+N++ M   PEW   + +R P  + +  ++    FW   +E +++
Sbjct: 364  FVTNFLNLRSIVIQEDNIKHMFSHPEWVSSASSRRPEAQAVKSLLYVERFWQHAQEAVTI 423

Query: 966  LEPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIH 787
             EPL+ +LR+VDG+    GY+YE +E  +  ++ +      KY+ +W++ + + +  +  
Sbjct: 424  AEPLVKILRIVDGDMPAMGYIYEGIESAKIAIKTYYKGIEEKYMPIWDIIDRRWSMQLHS 483

Query: 786  KIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNE-MDDFAAQLLLYNGKSPK 610
             +HAAAA LNPS+ Y+   K +   +R+G    +  M   +E   +   +  +Y      
Sbjct: 484  SLHAAAASLNPSIFYNPNFKIDS-RMRNGFQETMLRMASTHEDKMEITKEHPVYVTAQGA 542

Query: 609  LFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKK 430
            L +  +I+      P  WW   G E+P L++ A+RILSQPCSS  C  NWS FE    KK
Sbjct: 543  LGSDFAIMGRTLNAPGDWWAGYGYEIPTLQRYALRILSQPCSSHWCCWNWSTFESIHAKK 602

Query: 429  INKLSPDILEDLVYTRMN 376
             ++  P+  +DLV+   N
Sbjct: 603  HSRTEPENFDDLVFVHCN 620


>gb|AAR96007.1| transposase-like protein [Musa acuminata]
          Length = 670

 Score =  323 bits (827), Expect = 3e-85
 Identities = 191/621 (30%), Positives = 325/621 (52%), Gaps = 23/621 (3%)
 Frame = -2

Query: 2160 WEYA---EDLKGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAV 1990
            WE+    +  + +  C +C +++ GG+ R+K HL++ + +DI  C+ VP+DV+ L  H++
Sbjct: 9    WEHCVLVDATRQKVRCNYCHREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRNL-IHSI 67

Query: 1989 CGKDLTPYKKRKTATCSTDNERNG---ADSIVSPILSCPDSSTVLHQTTLATIY------ 1837
                 TP K++       D+  NG   + S  S   +    S+  H +T  ++       
Sbjct: 68   L---TTPRKQKAPKKLKIDHTANGPQHSSSSASGYNAKNAGSSGQHGSTCPSLLLPLPSP 124

Query: 1836 ---------NKKDKDLVDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHS 1684
                      K+  D  D  +A  F  N+I   A +S  + AM+ AIA+ G  Y  P + 
Sbjct: 125  GAQPTANDAQKQKYDNADNKIALFFFHNSIPFSASKSIYYQAMIDAIADCGAGYKPPTYE 184

Query: 1683 TLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLK 1504
             L + L++  ++++ E    +KD W  TGC+++ D W+D +    + +   SPKG  FLK
Sbjct: 185  GLRSTLLEKVKEEINENHRKLKDEWKDTGCTILSDNWSDGRSKSLLVLSVASPKGTQFLK 244

Query: 1503 NFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQC 1324
              + S +     +L ++L SVI E+GAENVVQ++ ++A+ Y     L++ +YP ++   C
Sbjct: 245  LVDISSRADDAYYLFELLDSVIMEVGAENVVQVITDSATSYTYAAGLLLKKYPSLFWFPC 304

Query: 1323 VSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRF 1144
             S+ ++ +LEDI K +EW+ +  ++ + I  ++      L LM+  T  +E+ RP  +RF
Sbjct: 305  ASYSIEKMLEDISK-LEWVSTTLEETRTIARFICSDGWILSLMKKLTGGRELVRPKVARF 363

Query: 1143 VSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKITQMIESTTFWSRGEEVISVL 964
            ++HF+ L+SI+  E++L+      +W     +R P    I  ++    FW    E+I + 
Sbjct: 364  MTHFLTLRSIVNQEDDLKHFFSHADWLSSVHSRRPDALAIKSLLYLERFWKSAHEIIGMS 423

Query: 963  EPLITVLRLVDGEGSTAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHK 784
            EPL+ +LRLVDG+    GY+YE +ER +  ++        KY+ V E+ E + +      
Sbjct: 424  EPLLKLLRLVDGDMPAMGYIYEGIERAKMAIKAFYKGCEEKYMSVLEIIERRWSMHCHSH 483

Query: 783  IHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPNEMD--DFAAQLLLYNGKSPK 610
            +HAAAAFLNPS+ YD   K++  ++R+G +  +  M  P E D  +      +Y      
Sbjct: 484  LHAAAAFLNPSIFYDPSFKFD-VNMRNGFHAAMWKMF-PEENDRIELIKDQPVYIKAQGA 541

Query: 609  LFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKK 430
            L +  +I+      P  WW   G E+P+L++ A+RILSQPCSS     NWSAFE   TK 
Sbjct: 542  LGSKFAIMGRTLNSPGDWWATYGYEIPVLQRAAVRILSQPCSSYWFKWNWSAFENIYTKN 601

Query: 429  INKLSPDILEDLVYTRMNSKM 367
              ++  + L DLV+   N ++
Sbjct: 602  HTRMELEKLNDLVFVHCNLRL 622


>ref|XP_006490683.1| PREDICTED: uncharacterized protein LOC102618477 [Citrus sinensis]
          Length = 764

 Score =  310 bits (795), Expect = 1e-81
 Identities = 188/604 (31%), Positives = 309/604 (51%), Gaps = 6/604 (0%)
 Frame = -2

Query: 2160 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHAVCG 1984
            WEYAE L G  V CKFC +   GGI+R+K HLSR   + +  C+ V +DV       +  
Sbjct: 97   WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 156

Query: 1983 KD---LTPY-KKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 1816
            K+    TP  KK++ A          + S++      P +      T +    +  +++ 
Sbjct: 157  KEDVKETPIGKKQRVAEAKPVGIVCSSKSLMPLETPSPVTKVFATMTPMGNS-SLNNQEN 215

Query: 1815 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 1636
             +R +A  F  N +     +S S+  M+ A+ + G  ++ P+   L T  +   + +V  
Sbjct: 216  AERSIALFFFENKLDFAVARSSSYQQMIDAVGKCGPGFTGPSAEALKTMWLDRIKSEVNV 275

Query: 1635 YVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1456
               +++  W +TGC+++ DTWTD K    IN +  SP    FLK+ + S   K   +L D
Sbjct: 276  QSKDIEKEWAMTGCTIIADTWTDNKSKALINFLVSSPSRTFFLKSVDTSSNFKNTKYLAD 335

Query: 1455 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1276
            +  SVI +IG ENVVQI+++++  Y  V + I+  Y  I+   C S  + ++LE+ + +V
Sbjct: 336  IFDSVIQDIGPENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQSLNIILEE-FSKV 394

Query: 1275 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEEN 1096
            +W+      AQ I  ++Y   + L LM+ +T   E+ R   +++VS+F+ LQSIL+    
Sbjct: 395  DWVNRCILQAQTISKFIYNNASMLDLMKKFTGGLELIRTGITKYVSNFLSLQSILKQRSR 454

Query: 1095 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 919
            L+ M  SPE+   S   + P +     ++E   FW   EE +++ EP + VLR V G   
Sbjct: 455  LKHMFNSPEYSTSSPYANKPQSLSCISIVEDNDFWRAVEESVAISEPFLKVLREVSGGKP 514

Query: 918  TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 739
              G +YE M R +  +R +   D  K     ++ +    G +   +H+AAAFLNPS+ Y+
Sbjct: 515  AVGSIYELMTRAKESIRTYYIMDENKCKIFLDIVDRNWRGQLHSPLHSAAAFLNPSIQYN 574

Query: 738  GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 559
             +IK+      D  N + + +  P+   D   Q+L ++  S      L++   +   P +
Sbjct: 575  PEIKFLGSIKEDFFNVLEKLLPTPDTRRDITTQILTFSRASGMFGCKLAMEARETVPPGL 634

Query: 558  WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 379
            WWE  G   P+L++VAIRILSQ CSS +  R+WS F+   ++K NK+  + L DLVY   
Sbjct: 635  WWEQYGDSAPVLQRVAIRILSQVCSSFSFERHWSTFQQIHSEKRNKIDKETLNDLVYISY 694

Query: 378  NSKM 367
            N K+
Sbjct: 695  NLKL 698


>gb|EXB72473.1| hypothetical protein L484_011475 [Morus notabilis]
          Length = 694

 Score =  307 bits (786), Expect = 2e-80
 Identities = 193/605 (31%), Positives = 316/605 (52%), Gaps = 7/605 (1%)
 Frame = -2

Query: 2160 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 1996
            WEYAE L G  V CKFC +   GGI+R+K HLSR   + +  C+ V +DV    +A+   
Sbjct: 24   WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 83

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 1816
                K+ +  KK+K     +    + + ++VS   + P +      T +A   +   ++ 
Sbjct: 84   KEDVKETSSTKKQKLVEVKSPGNVSASKALVSTDTTSPVAKVFPAVTPVAPP-SLNSQEN 142

Query: 1815 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 1636
             +R +A  F  N +  G  +S S+  MV AIA+ G  ++ P+  TL T  ++  + ++  
Sbjct: 143  AERSIALFFFENKLDFGIARSSSYQLMVDAIAKCGPGFTGPSAETLKTTWLERIKSEMSL 202

Query: 1635 YVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1456
               +++  W+ TGC+++ DTWTD K    IN +  SP    F K+ + S   K    L D
Sbjct: 203  QSKDIEKEWMTTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASAYFKNMKCLAD 262

Query: 1455 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1276
            +  SVI + G +NVVQ++++++  Y  V + I+  Y  I+   CVS  + L+LE+ + +V
Sbjct: 263  LFDSVIQDFGPDNVVQVIMDSSFNYTGVANHILQNYSTIFVSPCVSQCLNLILEE-FSKV 321

Query: 1275 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEEN 1096
            +W+       Q I  ++Y   + L LM+ YT  +E+ R   ++ VS F+ LQSIL+ +  
Sbjct: 322  DWVNRCILQGQTISKFIYNSASMLDLMKKYTGGQELIRTGITKSVSSFLSLQSILKQKSR 381

Query: 1095 LRLMIVSPEWRDMSDN-RSPLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 919
            L+ M  SPE+   S     P +     ++E + FW   EE +++ EP + VLR V G   
Sbjct: 382  LKHMFNSPEYCTNSLYVNKPQSISCISIVEDSDFWRAVEESVAISEPFLKVLREVAGGKP 441

Query: 918  TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 739
              G +YE M R +  +R +   D  K     ++ + K    +   +H+AAAFLNPS+ Y+
Sbjct: 442  AVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPSIQYN 501

Query: 738  GKIKYEQPDIRDGMNYVVESMVGPNEM-DDFAAQLLLYNGKSPKLFNTLSILMMKKAHPR 562
             +IK+    I++    V+E ++   EM  D  +Q+  +         +L++       P 
Sbjct: 502  PEIKF-LSSIKEDFFKVLEKLLPLPEMRRDITSQIFTFTKAMSMFGCSLAMEARDVVSPG 560

Query: 561  VWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTR 382
            +WWE  G   P+L++VAIRILSQ CSS T  R+WSAF+   ++K NK+  + L DLVY  
Sbjct: 561  LWWEQYGDSAPVLQRVAIRILSQVCSSFTFERHWSAFQQIHSEKRNKIDRETLNDLVYIN 620

Query: 381  MNSKM 367
             N K+
Sbjct: 621  YNLKL 625


>ref|XP_006573373.1| PREDICTED: uncharacterized protein LOC102669318 [Glycine max]
          Length = 816

 Score =  303 bits (777), Expect = 2e-79
 Identities = 190/656 (28%), Positives = 321/656 (48%), Gaps = 42/656 (6%)
 Frame = -2

Query: 2160 WEYAEDL----KGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDVQALAFHA 1993
            W+Y   L        VC FC K   GGI R K HL  + G   A   + P  V+ L  + 
Sbjct: 23   WKYCHSLVEGDTNTIVCNFCGKITKGGITRAKQHLIGKSGNVAACKKTPPNVVEELKEYM 82

Query: 1992 VCGKDLTPYKKRKTAT--------------CSTDNERNGADSI---VSPILSC------- 1885
               K  T Y    +                C    E   ADS     S    C       
Sbjct: 83   ATKKSGTTYSTSGSGNMANIRDFEFGEPIGCDGSEEDEFADSCNAAASAKTKCGTKKGPM 142

Query: 1884 ------PDSST------VLHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSFV 1741
                  P+++       +L Q  +    +K +   V + +A+ +    ++   I+  SF 
Sbjct: 143  DKFCKNPENAINRRKMEMLRQMNIRESMDKNEVLKVHQHIARFWYQAGLSFNLIKLKSFE 202

Query: 1740 AMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDMK 1561
             MV AI ++G    +P++  +   L++   +  E  +   ++ W+  GC++M D WTD K
Sbjct: 203  NMVAAIGQYGPHLPIPSYHDIRVPLLKKEVEYTENLMKGHREQWVKYGCTIMSDAWTDRK 262

Query: 1560 DVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASKY 1381
              C IN +  S  G +FLK+ + SD  KTG  L ++L ++++E+G ENVVQ+V +N S Y
Sbjct: 263  QRCIINFLINSQAGTMFLKSVDGSDFVKTGEKLFELLDAIVEEVGEENVVQVVTDNGSNY 322

Query: 1380 ECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTALK 1201
              V  L+  +  HIY   C +H + L+LEDI K +  I+     A  +V ++Y +++ L 
Sbjct: 323  VLVGKLLEEKRKHIYWTPCAAHCIDLMLEDIGK-LPLIRKTIRRAINLVGFIYAHSSTLS 381

Query: 1200 LMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKIT 1021
            L+R +T ++E+ R   +RF + ++ L+ + + + N+R M  S EW     ++ P  ++  
Sbjct: 382  LLRNFTNKRELVRHAITRFATSYLTLERLHKEKANIRKMFTSDEWTLNKLSKEPKGKEAA 441

Query: 1020 QMIESTTFWSRGEEVISVLEPLITVLRLVDGEGSTA-GYLYEAMERVRTELRQHCNSDSL 844
            +++   +FW+     + V+ PL+ VLRLVDGE   A GY+YEAM++ +  + +  N++  
Sbjct: 442  KVVLMPSFWNSVVYTLKVMAPLVKVLRLVDGERKPAMGYIYEAMDKAKETIMKSFNNNES 501

Query: 843  KYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGPN 664
            KY  V+E+ + + N  +   +HAAA FLNP   YD        ++ +G+   ++ ++   
Sbjct: 502  KYKDVFEIIDKRWNCQLHRPLHAAAHFLNPEFFYDNTDLEFDFEVTNGLFECIKKLIPQF 561

Query: 663  EMDD-FAAQLLLYNGKSPKLFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQPC 487
            ++      +L LY   +    +  ++   K   P  WW   G + P L+K+AI+ILS  C
Sbjct: 562  DVQQKILTELHLYKIGADHFGSDFAMAQRKTHSPTYWWRMFGSQTPNLQKLAIKILSLTC 621

Query: 486  SSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRMNSKMMAYYNELEMRDKFAIS 319
            S+S C RNWS FE   +KK N+L    L DLV+ + N ++   YN  +  D  +++
Sbjct: 622  SASGCERNWSVFEQIHSKKRNRLEHKRLHDLVFVKYNQQLKQRYNARDEIDPISLN 677


>ref|XP_007038933.1| HAT transposon superfamily isoform 4 [Theobroma cacao]
            gi|508776178|gb|EOY23434.1| HAT transposon superfamily
            isoform 4 [Theobroma cacao]
          Length = 682

 Score =  303 bits (777), Expect = 2e-79
 Identities = 185/604 (30%), Positives = 307/604 (50%), Gaps = 6/604 (0%)
 Frame = -2

Query: 2160 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 1996
            WEYAE L G  V CKFC +   GGI+R+K HLSR   + +  C+ V +DV    +A+   
Sbjct: 13   WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAILSS 72

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 1816
                K+ +  KK+K A   +    +    I+ P+ +    + V   T+     +   ++ 
Sbjct: 73   KEEIKETSSVKKQKIAEARSPGNISTCSKII-PLEASSPVAKVFPATSPIAPPSLNSQEN 131

Query: 1815 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 1636
            V+R +A  F  N +     +S S+ AM+ A+ +FG  ++ P+  TL T  ++  + +V  
Sbjct: 132  VERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCL 191

Query: 1635 YVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1456
               + +  W  TGC+++ DTWTD K    IN +  SP    F K+ + S   K    L D
Sbjct: 192  QSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLAD 251

Query: 1455 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1276
            +  SVI + G ENVVQI+++++  Y  +++ I+  Y  I+   C S  + L+LE+ + +V
Sbjct: 252  LFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLILEE-FSKV 310

Query: 1275 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEEN 1096
            +W+      AQ +  ++Y   + L LM+ +T E+E+ R   ++ VS F+ LQS+L+    
Sbjct: 311  DWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSR 370

Query: 1095 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 919
            L+ M  SPE+   S   + P +     ++E   FW   +E +++ EP + VLR V G   
Sbjct: 371  LKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKP 430

Query: 918  TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 739
              G +YE M R +  +R +   D  K     ++ + K    +   +H+A AFLNPS+ Y+
Sbjct: 431  AVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYN 490

Query: 738  GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 559
             +IK+      D    + + +  P    D   Q+  +          L++       P +
Sbjct: 491  QEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGL 550

Query: 558  WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 379
            WWE  G   P+L++VAIRILSQ CS+ T  R+WS F+   ++K NK+  +IL DLVY   
Sbjct: 551  WWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINY 610

Query: 378  NSKM 367
            N ++
Sbjct: 611  NLRL 614


>ref|XP_007038931.1| HAT transposon superfamily isoform 2 [Theobroma cacao]
            gi|590673575|ref|XP_007038932.1| HAT transposon
            superfamily isoform 2 [Theobroma cacao]
            gi|508776176|gb|EOY23432.1| HAT transposon superfamily
            isoform 2 [Theobroma cacao] gi|508776177|gb|EOY23433.1|
            HAT transposon superfamily isoform 2 [Theobroma cacao]
          Length = 678

 Score =  303 bits (777), Expect = 2e-79
 Identities = 185/604 (30%), Positives = 307/604 (50%), Gaps = 6/604 (0%)
 Frame = -2

Query: 2160 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 1996
            WEYAE L G  V CKFC +   GGI+R+K HLSR   + +  C+ V +DV    +A+   
Sbjct: 9    WEYAEKLDGNKVRCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAILSS 68

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 1816
                K+ +  KK+K A   +    +    I+ P+ +    + V   T+     +   ++ 
Sbjct: 69   KEEIKETSSVKKQKIAEARSPGNISTCSKII-PLEASSPVAKVFPATSPIAPPSLNSQEN 127

Query: 1815 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 1636
            V+R +A  F  N +     +S S+ AM+ A+ +FG  ++ P+  TL T  ++  + +V  
Sbjct: 128  VERSIALFFFENKLDFSVARSSSYQAMIDAVGKFGPGFTGPSVETLKTMWLERIKSEVCL 187

Query: 1635 YVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1456
               + +  W  TGC+++ DTWTD K    IN +  SP    F K+ + S   K    L D
Sbjct: 188  QSKDTEKEWATTGCTIIADTWTDNKSRALINFLVSSPSRTFFHKSVDASSYFKNTKCLAD 247

Query: 1455 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1276
            +  SVI + G ENVVQI+++++  Y  +++ I+  Y  I+   C S  + L+LE+ + +V
Sbjct: 248  LFDSVIQDFGPENVVQIIMDSSFNYTGISNHILQNYGTIFVSPCASQCLNLILEE-FSKV 306

Query: 1275 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEEN 1096
            +W+      AQ +  ++Y   + L LM+ +T E+E+ R   ++ VS F+ LQS+L+    
Sbjct: 307  DWVNRCILQAQTLSKFLYNNASMLDLMKKFTGEQELIRTGITKSVSSFLSLQSMLKQRSR 366

Query: 1095 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 919
            L+ M  SPE+   S   + P +     ++E   FW   +E +++ EP + VLR V G   
Sbjct: 367  LKHMFNSPEYSTNSSYANKPQSISCIAIVEDNDFWRAVDECVAISEPFLKVLREVSGGKP 426

Query: 918  TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 739
              G +YE M R +  +R +   D  K     ++ + K    +   +H+A AFLNPS+ Y+
Sbjct: 427  AVGSIYELMTRAKESIRTYYIMDEGKCKTFLDIVDRKWRDQLHSPLHSAGAFLNPSIQYN 486

Query: 738  GKIKYEQPDIRDGMNYVVESMVGPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPRV 559
             +IK+      D    + + +  P    D   Q+  +          L++       P +
Sbjct: 487  QEIKFLGSIKEDFFKVLEKLLPTPELRRDITNQIFTFTRAKGMFACNLAMEARDTVSPGL 546

Query: 558  WWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRM 379
            WWE  G   P+L++VAIRILSQ CS+ T  R+WS F+   ++K NK+  +IL DLVY   
Sbjct: 547  WWEQFGDSAPVLQRVAIRILSQVCSTFTFERHWSTFQQIHSEKRNKIDKEILNDLVYINY 606

Query: 378  NSKM 367
            N ++
Sbjct: 607  NLRL 610


>ref|XP_002513602.1| protein dimerization, putative [Ricinus communis]
            gi|223547510|gb|EEF49005.1| protein dimerization,
            putative [Ricinus communis]
          Length = 688

 Score =  303 bits (776), Expect = 2e-79
 Identities = 188/606 (31%), Positives = 314/606 (51%), Gaps = 7/606 (1%)
 Frame = -2

Query: 2160 WEYAEDLKGRFV-CKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV----QALAFH 1996
            WEYAE L G  V CKFC +   GGI+R+K HLSR   + +  C+ V +DV    +A+   
Sbjct: 18   WEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSKGVNPCSKVRDDVTDRVRAIIAS 77

Query: 1995 AVCGKDLTPYKKRKTATCSTDNERNGADSIVSPILSCPDSSTVLHQTTLATIYNKKDKDL 1816
                K+ +  KK++ A   +        ++V+ + S   ++ V    T  +  +  +++ 
Sbjct: 78   KEDIKEPSSAKKQRPAEAKSPAHIYATKALVN-VESVAPAAKVYPTVTSISPPSLSNQEN 136

Query: 1815 VDRMVAQAFIMNNIALGAIQSPSFVAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEE 1636
             +R +A  F  N +     +SPS+  M++AI + G  ++ P+   L T  ++  + +V  
Sbjct: 137  AERSIALFFFENKLDFSVARSPSYQLMIEAIEKCGPGFTGPSAEILKTTWLERIKSEVSL 196

Query: 1635 YVSNVKDSWLLTGCSLMLDTWTDMKDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGD 1456
             + + +  W  TGC+++ DTWTD K    IN    SP    F K+ + S   K    L D
Sbjct: 197  QLKDTEKEWTTTGCTIIADTWTDNKSRALINFFVSSPSRTFFHKSVDASSYFKNTKCLAD 256

Query: 1455 MLLSVIDEIGAENVVQIVVNNASKYECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEV 1276
            +  SVI + GAENVVQI+++++  Y  V + I+  Y  I+   C S  + L+LED + +V
Sbjct: 257  LFDSVIQDFGAENVVQIIMDSSFNYTGVANHILQNYGTIFVSPCASQCLNLILED-FSKV 315

Query: 1275 EWIQSVFDDAQLIVDYMYKYTTALKLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEEN 1096
            +W+      AQ +  ++Y  ++ L LM+ +T  +E+ +   ++ VS F+ LQS+L+    
Sbjct: 316  DWVNRCISQAQTLSKFIYNNSSMLDLMKKFTGGQELIKTGITKSVSSFLSLQSMLKQRPR 375

Query: 1095 LRLMIVSPEWRDMSDNRS-PLTEKITQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGS 919
            L+LM  S E+   S   S P +     ++E   FW   EE +++ EP + VLR V G   
Sbjct: 376  LKLMFSSNEYSANSSYSSKPQSIACITIVEDGDFWRAVEECVAITEPFLKVLREVSGGKP 435

Query: 918  TAGYLYEAMERVRTELRQHCNSDSLKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYD 739
              G +YE M R +  +R +   D  K     ++ + K    +   +H+AAAFLNP + Y+
Sbjct: 436  AVGSIYELMTRAKESIRTYYIMDESKCKTFLDIVDRKWRDQLHSPLHSAAAFLNPCVQYN 495

Query: 738  GKIKYEQPDIRDGMNYVVESMV-GPNEMDDFAAQLLLYNGKSPKLFNTLSILMMKKAHPR 562
             +IK+   +I++    V+E ++  P+   D   Q+ ++   S      L++       P 
Sbjct: 496  PEIKF-LVNIKEDFFKVIEKLLPTPDMRRDITNQIFIFTRASGMFGCNLAMEARDTVAPG 554

Query: 561  VWWEYNGGEVPLLRKVAIRILSQPCSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTR 382
            +WWE  G   P+L++VAIRILSQ CS+ T  R+W+ F    ++K NK+  + L DLVY  
Sbjct: 555  LWWEQYGDSAPVLQRVAIRILSQVCSTFTFERHWNTFRQIHSEKRNKIDKETLNDLVYIN 614

Query: 381  MNSKMM 364
             N K+M
Sbjct: 615  YNLKLM 620


>ref|XP_003553157.1| PREDICTED: uncharacterized protein LOC100793012 [Glycine max]
          Length = 816

 Score =  302 bits (774), Expect = 4e-79
 Identities = 190/657 (28%), Positives = 323/657 (49%), Gaps = 43/657 (6%)
 Frame = -2

Query: 2160 WEYAEDL----KGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV-QALAFH 1996
            W+Y   L        VC FC K   GGI R K HL  + G ++A C   P +V + L  +
Sbjct: 23   WKYCHSLVEGDTNTIVCNFCGKITKGGITRAKQHLIGKSG-NVAACKKTPPNVIEELKEY 81

Query: 1995 AVCGKDLTPYKKRKTAT--------------CSTDNERNGADSI---VSPILSC------ 1885
                K  T Y    +                C    E   ADS     S    C      
Sbjct: 82   MATKKSGTTYSTSGSGNMANIRDFEFGEPIGCDGSEEDEFADSCNAAASAKTKCGTKKGP 141

Query: 1884 -------PDSST------VLHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSF 1744
                   P+++       +L Q  +    +K +   V + +A+ +    ++   I+  SF
Sbjct: 142  MDKFCKNPENAINRRKMEMLRQMNIRESMDKNEVLKVHQHIARFWYQAGLSFNLIKLKSF 201

Query: 1743 VAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDM 1564
              MV AI ++G    +P++  +   L++   +  E  +   ++ W+  GC++M D WTD 
Sbjct: 202  ENMVAAIGQYGPHLPIPSYHDIRVPLLKKEVEYTENLMKGHREQWVKYGCTIMSDAWTDQ 261

Query: 1563 KDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASK 1384
            K  C IN +  S  G +FLK+ + SD  KTG  L ++L ++++E+G ENVVQ+V +N S 
Sbjct: 262  KQRCIINFLINSQAGTMFLKSVDDSDFVKTGEKLFELLDAIVEEVGEENVVQVVTDNGSN 321

Query: 1383 YECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTAL 1204
            Y     L+  +  HIY   C +H + L+LEDI K +  I+     A  +V ++Y +++ L
Sbjct: 322  YVLAGKLLEEKRKHIYWTPCAAHCIDLMLEDIGK-LPLIRKTIRRAINLVGFIYAHSSTL 380

Query: 1203 KLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKI 1024
             L+R +T ++E+ R   +RF + ++ L+ + + + N+R M  S EW     ++ P  ++ 
Sbjct: 381  SLLRNFTNKRELVRHAITRFATSYLTLERLHKEKANIRKMFTSDEWTLNKLSKEPKGKEA 440

Query: 1023 TQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGSTA-GYLYEAMERVRTELRQHCNSDS 847
             +++   +FW+     + V+ PL+ VLRLVDGE   A GY+YEAM++ +  + +  N++ 
Sbjct: 441  AKVVLMPSFWNSVVYTLKVMAPLVKVLRLVDGERKPAMGYIYEAMDKAKETIMKSFNNNE 500

Query: 846  LKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGP 667
             KY  V+E+ + + N  +   +HAAA FLNP   YD        ++ +G+   ++ ++  
Sbjct: 501  SKYKDVFEIIDKRWNCQLHRPLHAAAHFLNPEFFYDNTDLEFDFEVTNGLFECIKKLIPQ 560

Query: 666  NEMDD-FAAQLLLYNGKSPKLFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQP 490
             ++      +L LY   +    +  ++   K   P  WW   G + P L+K+AI+ILS  
Sbjct: 561  FDVQQKILTELHLYKIGADHFGSDFAMAQRKTHSPTYWWRMFGSQTPNLQKLAIKILSLT 620

Query: 489  CSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRMNSKMMAYYNELEMRDKFAIS 319
            CS+S C RNWS FE   +KK N+L    L DLV+ + N ++   YN  +  D  +++
Sbjct: 621  CSASGCERNWSVFEQIHSKKRNRLEHKRLHDLVFVKYNQQLKQRYNARDEIDPISLN 677


>ref|XP_006594368.1| PREDICTED: uncharacterized protein LOC102669187 [Glycine max]
          Length = 816

 Score =  302 bits (773), Expect = 5e-79
 Identities = 190/657 (28%), Positives = 323/657 (49%), Gaps = 43/657 (6%)
 Frame = -2

Query: 2160 WEYAEDL----KGRFVCKFCRKDYPGGIARVKSHLSRQQGRDIAICNSVPEDV-QALAFH 1996
            W+Y   L        VC FC K   GGI R K HL  + G ++A C   P +V + L  +
Sbjct: 23   WKYCHSLVEGDTNTIVCNFCGKITKGGITRAKQHLIGKSG-NVAACKKTPPNVIEELKEY 81

Query: 1995 AVCGKDLTPYKKRKTAT--------------CSTDNERNGADSI---VSPILSC------ 1885
                K  T Y    +                C    E   ADS     S    C      
Sbjct: 82   MATKKSGTTYSTSGSGNMANIRDFEFGEPIGCDGSEEDEFADSCNAAASAKTKCGTKRGP 141

Query: 1884 -------PDSST------VLHQTTLATIYNKKDKDLVDRMVAQAFIMNNIALGAIQSPSF 1744
                   P+++       +L Q  +    +K +   V + +A+ +    ++   I+  SF
Sbjct: 142  MDKFCKNPENAINRRKMEMLRQMNIRESMDKNEVLKVHQHIARFWYQAGLSFNLIKLKSF 201

Query: 1743 VAMVKAIAEFGTSYSLPNHSTLCTKLVQDSRKDVEEYVSNVKDSWLLTGCSLMLDTWTDM 1564
              MV AI ++G    +P++  +   L++   +  E  +   ++ W+  GC++M D WTD 
Sbjct: 202  ENMVAAIGQYGPHLPIPSYHDIRVPLLKKEVEYTENLMKGHREQWVKYGCTIMSDAWTDR 261

Query: 1563 KDVCFINVVAYSPKGAVFLKNFERSDKGKTGVFLGDMLLSVIDEIGAENVVQIVVNNASK 1384
            K  C IN +  S  G +FLK+ + SD  KTG  L ++L ++++E+G ENVVQ+V +N S 
Sbjct: 262  KQRCIINFLINSQAGTMFLKSVDGSDFVKTGEKLFELLDAIVEEVGEENVVQVVTDNGSN 321

Query: 1383 YECVTDLIMGRYPHIYKIQCVSHGVQLLLEDIYKEVEWIQSVFDDAQLIVDYMYKYTTAL 1204
            Y     L+  +  HIY   C +H + L+LEDI K +  I+     A  +V ++Y +++ L
Sbjct: 322  YVLAGKLLEEKRKHIYWTPCAAHCIDLMLEDIGK-LPLIRKTIRRAINLVGFIYAHSSTL 380

Query: 1203 KLMRVYTAEKEIKRPCNSRFVSHFMMLQSILEVEENLRLMIVSPEWRDMSDNRSPLTEKI 1024
             L+R +T ++E+ R   +RF + ++ L+ + + + N+R M  S EW     ++ P  ++ 
Sbjct: 381  SLLRNFTNKRELVRHAITRFATSYLTLERLHKEKANIRKMFTSDEWTLNKLSKEPKGKEA 440

Query: 1023 TQMIESTTFWSRGEEVISVLEPLITVLRLVDGEGSTA-GYLYEAMERVRTELRQHCNSDS 847
             +++   +FW+     + V+ PL+ VLRLVDGE   A GY+YEAM++ +  + +  N++ 
Sbjct: 441  AKVVLMPSFWNSVVYTLKVMAPLVKVLRLVDGERKPAMGYIYEAMDKAKETIMKSFNNNE 500

Query: 846  LKYLKVWELFESKRNGDMIHKIHAAAAFLNPSLMYDGKIKYEQPDIRDGMNYVVESMVGP 667
             KY  V+E+ + + N  +   +HAAA FLNP   YD        ++ +G+   ++ ++  
Sbjct: 501  SKYKDVFEIIDKRWNCQLHRPLHAAAHFLNPEFFYDNTDLEFDFEVTNGLFECIKKLIPQ 560

Query: 666  NEMDD-FAAQLLLYNGKSPKLFNTLSILMMKKAHPRVWWEYNGGEVPLLRKVAIRILSQP 490
             ++      +L LY   +    +  ++   K   P  WW   G + P L+K+AI+ILS  
Sbjct: 561  FDVQQKILTELHLYKIGADHFGSDFAMAQRKTHSPTYWWRMFGSQTPNLQKLAIKILSLT 620

Query: 489  CSSSTCGRNWSAFEVAKTKKINKLSPDILEDLVYTRMNSKMMAYYNELEMRDKFAIS 319
            CS+S C RNWS FE   +KK N+L    L DLV+ + N ++   YN  +  D  +++
Sbjct: 621  CSASGCERNWSVFEQIHSKKRNRLEHKRLHDLVFVKYNQQLKQRYNARDEIDPISLN 677

