BLASTX nr result

ID: Achyranthes23_contig00004457 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00004457
         (1875 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     159   5e-36
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   145   6e-32
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   140   2e-30
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...   135   5e-29
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   135   8e-29
ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, part...   119   3e-24
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...   118   1e-23
ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   117   2e-23
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   117   2e-23
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   117   2e-23
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   117   2e-23
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   116   4e-23
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   115   5e-23
ref|XP_004505507.1| PREDICTED: uncharacterized protein LOC101513...   113   3e-22
ref|XP_002300422.2| hypothetical protein POPTR_0001s38590g [Popu...   112   4e-22
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...   111   9e-22
gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca...   110   2e-21
ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Popu...   110   2e-21
ref|XP_004487168.1| PREDICTED: protein gar2-like [Cicer arietinum]    109   5e-21
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...   108   6e-21

>gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]
          Length = 443

 Score =  159 bits (401), Expect = 5e-36
 Identities = 140/452 (30%), Positives = 214/452 (47%), Gaps = 42/452 (9%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MD+KG +WVG++YQKFEAMC+EVEE+MY+DT KYVE+QV+TVG +VK+FYSDVMQDLLPP
Sbjct: 1    MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANGP-----EQLVKDFSVAVTQEDSLNYV 599
            SS DSEK          DS   IS K  +     P     EQL++  ++ VT +    Y+
Sbjct: 61   SSQDSEKVSLCGFIGKQDSDDGISKKPNVAKKEKPAKADDEQLIR--TLKVTSDSKDVYL 118

Query: 600  SSCCPIYC-VIKPCKP--------LSNDSAEEKSEEAKFQSQVTMALSDRATSE---AKD 743
            +    + C V   C+P         SN  + +K  +    S   +++++  + +     +
Sbjct: 119  APSIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLIPPE 178

Query: 744  LDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEK 923
              C     + LS   S      N   E+   +  + + K+  V+ED SS  I       +
Sbjct: 179  TSCAITREKHLSRPLSSYSEFVNEIHEISLDQ--TGTTKAPSVNEDTSSDSIVESCDEIE 236

Query: 924  DSLGICKDMMECY-GGGAIINVKKI-------TILSDGDMTTDIDGHFSTDSRLSLLKAD 1079
            +S     D+   +     II VK +        + S G ++   +G +++    + L + 
Sbjct: 237  NSSECMADLSSSFHASSEIILVKSVGYDGNEMDVPSGGGLSEQANGDYTSKCSSNSLAST 296

Query: 1080 LSTMITSNVEDTIEVDKELSM-----FDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKL 1244
              +       +    D+++ +     FD + +     SE       E      Q DKVKL
Sbjct: 297  GGSSQNEEARNDKYADEDVFVSLPRKFD-DWNLNITESEIATEHGTE---TIQQRDKVKL 352

Query: 1245 EESCVLV---EKNIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEE----- 1400
            EE+CVLV   E +I  + G +  + YKKKI+D L  R  S  K   E L   Y +     
Sbjct: 353  EETCVLVNEDELHILPQRGGK-WRPYKKKIRDALYSRMRSARKEEYEQLVLQYGDNKKLN 411

Query: 1401 ----QCGKHTMIEGELRKSETIESFESDWELL 1484
                +    T+I  E +K   ++S ES+WELL
Sbjct: 412  QDFGEALAPTLIVKERKKLPHLDSCESEWELL 443


>ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina]
            gi|567908905|ref|XP_006446766.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549376|gb|ESR60005.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549377|gb|ESR60006.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
          Length = 416

 Score =  145 bits (366), Expect = 6e-32
 Identities = 132/427 (30%), Positives = 196/427 (45%), Gaps = 17/427 (3%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MDLKG +WVG +YQKFEAMC+EVEE+MY+DT KYVE+QV+TVG TVKKFYSDV++DLLPP
Sbjct: 1    MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 435  SSLDSEK-AQDPVSTVDHDSKVSISMKTKL----NDANGPEQLVKDFSVAVTQED-SLNY 596
             S+D  K A      ++ ++ V I  K K+       N   + + + S+A T  D     
Sbjct: 61   PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGG 120

Query: 597  VSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDCTKDSRESL 776
              S C  +      +P   D+      +  F    +     R+      +   K S+E  
Sbjct: 121  GQSFCRFHIEDTSFQPSLGDTL-----KGVFSDAYSKEYDIRSGHNQSSICMQKISKE-- 173

Query: 777  SNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEKDSLGICKDMME 956
             NLP    +    + E   L   S S + L   ++ S  ++          +  CK   E
Sbjct: 174  DNLPPSEMSGAGPHME-RGLRRASSSCELLDKIQEVSDDQVVVDPTPVTTEVASCKSFEE 232

Query: 957  CYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITSN-----VEDTIE 1121
             Y      +      L+      + D   +  S  S L A+L+ + T++     V   + 
Sbjct: 233  IYDELEKASKGASGALTSSPAAKNCDESENAHSSCSSLSAELNGICTNDGVVSLVGSFVN 292

Query: 1122 VDKELSMFDIETSSPFLSSEFRKCESN---EEGFPDGQ-LDKVKLEESCVLV--EKNIFS 1283
             D + S F     S +  S     ESN   E+G+   Q +D +++EE+CVLV  ++  F 
Sbjct: 293  EDVQPSEFPDPGRSDY--STVDATESNIDVEQGYETVQRVDNIQVEETCVLVNGDELCFV 350

Query: 1284 KTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEGELRKSETIESF 1463
                   + YKKKI+D +S R  S  K   + LA WY E   K      E++   +    
Sbjct: 351  PCREGKHRPYKKKIQDAISSRMRSTRKHEYKQLAVWYNED-EKSKQQNAEMKGKPSHGYC 409

Query: 1464 ESDWELL 1484
            E +WELL
Sbjct: 410  ELEWELL 416


>ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus
            sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X2 [Citrus
            sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X3 [Citrus
            sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X4 [Citrus
            sinensis]
          Length = 416

 Score =  140 bits (353), Expect = 2e-30
 Identities = 130/427 (30%), Positives = 190/427 (44%), Gaps = 17/427 (3%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MDLKG +WVG +YQKFEAMC+EVEE+MY+DT KYVE+QV+TVG TVKKFYSDV++DLLPP
Sbjct: 1    MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 435  SSLDSEK-AQDPVSTVDHDSKVSISMKTKLNDAN-----GPEQLVKDFSVAVTQEDSLNY 596
             S+D  K A      ++ ++ V I  K K+           EQL +        +     
Sbjct: 61   PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMKVNNEQLSESSLATTDLDKGAGG 120

Query: 597  VSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDCTKDSRESL 776
              S C  +      +P   ++      +  F          R+      +   K S+E  
Sbjct: 121  GQSFCRFHIEDTSFQPSLGNTL-----KGVFSDAYPKEYDIRSGHNQSSICMQKISKE-- 173

Query: 777  SNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEKDSLGICKDMME 956
             NLP    +    + E   L   S S + L   ++ S  ++     S    +  CK   E
Sbjct: 174  DNLPPSEMSGAGPHME-RGLRRASSSCELLDKIQEVSDDQVVVDPTSVTTEVASCKSFEE 232

Query: 957  CYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITSN-----VEDTIE 1121
             Y      +      L+      + D   S  S  S L A+L+ + T++     V   + 
Sbjct: 233  IYDELEKASKGASGALTSSPAAKNCDESESAHSSCSSLSAELNGICTNDGVVSLVGSFVN 292

Query: 1122 VDKELSMFDIETSSPFLSSEFRKCESN---EEGFPDGQ-LDKVKLEESCVLV--EKNIFS 1283
             D + S F     S +  S     ESN   E+G+   Q +D +++EE+CVLV  ++  F 
Sbjct: 293  EDVQPSEFPDPGRSDY--STVDATESNIDVEQGYETVQRVDNIQVEETCVLVNGDELCFV 350

Query: 1284 KTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEGELRKSETIESF 1463
                   +  KKKI+D +S R  S  K   + LA WY E   K      E +   +    
Sbjct: 351  PCREDKHRPCKKKIQDAISSRMRSTRKHEYKQLAVWYNED-EKSKQQNAETKGKPSHGYC 409

Query: 1464 ESDWELL 1484
            E +WELL
Sbjct: 410  ELEWELL 416


>gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700922|gb|EOX92818.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 397

 Score =  135 bits (341), Expect = 5e-29
 Identities = 132/440 (30%), Positives = 202/440 (45%), Gaps = 27/440 (6%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYS----DV 413
            MI MDLKG +WVG +Y+KFEAMC+EVEEVMY+DT KYVE++V+TVG +VKKFYS    DV
Sbjct: 1    MISMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDV 60

Query: 414  MQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLN-----DA--NGPEQLVKDFSVAV 572
            MQDLL PSSL+  KA   V+  D   ++      K N     DA     EQL +D  V  
Sbjct: 61   MQDLLLPSSLEPMKA---VAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIA 117

Query: 573  TQEDSLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDC 752
               ++  +V S C ++ V    +  S    E  S +      ++   ++R T    +++ 
Sbjct: 118  DVNENAAHVPSSCQLHMVDNIFESCSGSFVERASSDL-----LSGEHNNRCTLNKTNVEH 172

Query: 753  TKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPA----VSHSE 920
               +  S     S A  + N +G +       C + +   + + S  +IPA    VS  E
Sbjct: 173  LLPAETS-----SEAGCVENEFGRMSSF----CGNAN--ANHEVSCHQIPATLTPVSVEE 221

Query: 921  KDSLGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITS 1100
             D    C  + E        +     IL DG     I           + K ++    +S
Sbjct: 222  DD----CDSIEESSNEIKSASDSVPEILPDGLHLVGI-----------VEKNEMEMRCSS 266

Query: 1101 NVEDTIEVDKELSMF-DIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLVE--K 1271
            ++ ++ E + +L+   D   SS     E    +         QLDK++++ESC +V   +
Sbjct: 267  SIIESEESNGKLNWTKDASGSSTVGRKEIETVQ---------QLDKIRVDESCFMVNGAE 317

Query: 1272 NIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEG------- 1430
              F        K+Y++KI+D +S R  S  K+  E L  WY +        EG       
Sbjct: 318  LHFHPQREGKHKTYQRKIRDAISSRMRSARKKEYEQLPLWYGDDVKSDQDSEGSSTSALT 377

Query: 1431 --ELRKSETIESFESDWELL 1484
              + R++   +  +S+WELL
Sbjct: 378  REDTRRTLNHDDLDSEWELL 397


>ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca
            subsp. vesca]
          Length = 389

 Score =  135 bits (339), Expect = 8e-29
 Identities = 133/438 (30%), Positives = 193/438 (44%), Gaps = 25/438 (5%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDL 425
            MI MD+KG +WVG +Y+KFE+MC+EVEE MYEDT K+VEDQV+TVG++VKKFY+DVMQDL
Sbjct: 1    MITMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDL 60

Query: 426  LPPSSLDSEKAQDPVSTVDHDSKVSIS----MKTKLNDANGPEQLVKDFSVAVTQEDSLN 593
            L  SSLD +        V+H S V  S     K K +   G E++  D  V       ++
Sbjct: 61   LCDSSLDRDDVSAGGFPVEHYSDVDNSKSKIRKKKEHVKAGVEEVKGDSEVISAVLKDVD 120

Query: 594  YV---------SSCCPI--YCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAK 740
            +           SC      C    C    +       +    ++ +   L    T+  K
Sbjct: 121  HTGLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIKDRLPGANTAVGK 180

Query: 741  DLDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSE 920
            D      SRESLS+         + +    R   C   D+ +   +        ++S S 
Sbjct: 181  DF-----SRESLSS--------CSEFSNEDRDTSCDQPDEVITPSKPPEGMRCDSMSES- 226

Query: 921  KDSLGICKDMMECYGGGAIINVKKITIL----SDGDMTTDIDGHFSTDSRLSLLKADLST 1088
                 +  +  +C G    +N +   ++    SDG    ++      DS +  L  +L+ 
Sbjct: 227  ----CVVANASQCTGDDVSVNCQSSDMIVLDNSDGKRWNEL-----LDSSIGGLSTELNG 277

Query: 1089 MITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLV- 1265
               +   D IE        +I T    +                 Q DK KLEE+CV+V 
Sbjct: 278  GSINPSMDAIE-------SNIGTHGTEIIQ---------------QSDKPKLEETCVMVS 315

Query: 1266 -EKNIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHT--MIEG-- 1430
             E   F    +   K YKKKI    + R  S  K+  E LA W+    G HT  ++EG  
Sbjct: 316  GEDLHFVHHTVANYKPYKKKIPKAFTSRTSSARKQEYEQLALWH----GHHTKSILEGGE 371

Query: 1431 ELRKSETIESFESDWELL 1484
            E +KS T +  ES+WE+L
Sbjct: 372  ESKKSPTHDFCESEWEIL 389


>ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella]
            gi|482562952|gb|EOA27142.1| hypothetical protein
            CARUB_v10023243mg, partial [Capsella rubella]
          Length = 436

 Score =  119 bits (299), Expect = 3e-24
 Identities = 129/457 (28%), Positives = 210/457 (45%), Gaps = 35/457 (7%)
 Frame = +3

Query: 219  NQFKLLVDHMIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKK 398
            N++K +V    +MD KG  WVG++YQKFEAMC+EVEE++ +DT KYVE+QV TVG++VKK
Sbjct: 1    NKYKDVVGFGEIMDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVHTVGNSVKK 60

Query: 399  FYSDVMQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANGPEQLVK-DFSVAVT 575
            F SDV+QDLLP           PVS ++  + V  S K K   AN   + VK +  V   
Sbjct: 61   FCSDVVQDLLPDDDSVGSGKPLPVSMLNEYAPV-CSFKKKRESANRKTRDVKQEEEVTEG 119

Query: 576  QED--SLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQ-------SQVTMALSDRAT 728
            ++D  ++N        Y +    +  S      +    + Q       SQ+T     + +
Sbjct: 120  KKDGCAMNLRGLDADDYDICTSPRQYSYGGPYRRGRVGRKQIFKKEELSQITRPYIQKDS 179

Query: 729  SEAKDLDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSK---EI 899
            S    +   +  ++ +  + S   +++  +   V+ +  + +  SL +   A  K   E 
Sbjct: 180  SNLTMVHSAR-VKDDVGTVNS--SSLSMAHSGRVKDDVGTVNSSSLSMVHSARIKADVET 236

Query: 900  PAVSHSEKDSLGICKDMMECYGGGAIINVKKITILS-----DGDMTTDIDGHFSTDSRL- 1061
               S S    +       EC       N   +T+++     D ++ T+I+   +  + + 
Sbjct: 237  VKSSDSRPGEIERLISKKECQKDDRTDNQHGLTMVNSVRSKDSEIRTEIEHSLTVVNSVR 296

Query: 1062 ----SLLKADLSTMITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQL 1229
                 +L +  ++++T +  +  +  KE SM   E SS  +S +  K E  +       L
Sbjct: 297  SQDSEILPSVATSLLTGSSNEFRKETKEDSM---EASSSSVSEQ--KSEILQ------HL 345

Query: 1230 DKVKLEESCVLVEKNIF-----SKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLA-AW 1391
                +EESC+LV+++ F      K      K Y KKI+D +S R     ++  + LA  W
Sbjct: 346  SGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKRLARQW 404

Query: 1392 YEE------QCGKHTMIEGELRKSETIESFESDWELL 1484
            Y E      +CG     E E R SE     ES+WELL
Sbjct: 405  YAEDVENGSECGDDPKPEDENRSSE-----ESEWELL 436


>gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 343

 Score =  118 bits (295), Expect = 1e-23
 Identities = 118/385 (30%), Positives = 180/385 (46%), Gaps = 18/385 (4%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYS----DV 413
            MI MDLKG +WVG +Y+KFEAMC+EVEEVMY+DT KYVE++V+TVG +VKKFYS    DV
Sbjct: 1    MISMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDV 60

Query: 414  MQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLN-----DA--NGPEQLVKDFSVAV 572
            MQDLL PSSL+  KA   V+  D   ++      K N     DA     EQL +D  V  
Sbjct: 61   MQDLLLPSSLEPMKA---VAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIA 117

Query: 573  TQEDSLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDC 752
               ++  +V S C ++ V    +  S    E  S +      ++   ++R T    +++ 
Sbjct: 118  DVNENAAHVPSSCQLHMVDNIFESCSGSFVERASSDL-----LSGEHNNRCTLNKTNVEH 172

Query: 753  TKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPA----VSHSE 920
               +  S     S A  + N +G +       C + +   + + S  +IPA    VS  E
Sbjct: 173  LLPAETS-----SEAGCVENEFGRMSSF----CGNAN--ANHEVSCHQIPATLTPVSVEE 221

Query: 921  KDSLGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITS 1100
             D    C  + E        +     IL DG     I           + K ++    +S
Sbjct: 222  DD----CDSIEESSNEIKSASDSVPEILPDGLHLVGI-----------VEKNEMEMRCSS 266

Query: 1101 NVEDTIEVDKELSMF-DIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLVE--K 1271
            ++ ++ E + +L+   D   SS     E    +         QLDK++++ESC +V   +
Sbjct: 267  SIIESEESNGKLNWTKDASGSSTVGRKEIETVQ---------QLDKIRVDESCFMVNGAE 317

Query: 1272 NIFSKTGLQVQKSYKKKIKDVLSPR 1346
              F        K+Y++KI+D +S R
Sbjct: 318  LHFHPQREGKHKTYQRKIRDAISSR 342


>ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum
            tuberosum]
          Length = 420

 Score =  117 bits (293), Expect = 2e-23
 Identities = 129/444 (29%), Positives = 198/444 (44%), Gaps = 34/444 (7%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MDLKG +WVGD+YQKFEAMC+E+E+ MY+DT +YVE+QV+TVG +VK+FYSDV+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANG-----PEQLVKDFSVAVTQEDSLN-Y 596
             ++D  K      +++  +   IS K K     G      ++L+ D  V   +  S   Y
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVY 120

Query: 597  VSSCCPIYCVIKPCKPLSNDS---------AEEKSEEAKFQSQVTMALSDRATSEAKDLD 749
                  I  +++   P S  S         A + S ++K +    +A SD  T     L 
Sbjct: 121  RRQSVGIKEIVRDNHPPSKKSDALCLVSGNAIKLSSDSKVRGGFEVA-SDHMTM-TSPLA 178

Query: 750  CTKDSRESLSNLPSIAENITNNYGEVVRLE-PCSCSDKSLPV----HEDASSKEIPAVSH 914
              K  R S      ++ +I         +    + SD+SL V       A  +   +V  
Sbjct: 179  SVK-GRSSAETGKEVSNHIIKTDVSAAGISINVAASDRSLSVDCVGQNQADLRNTSSVGD 237

Query: 915  SEKDS--LGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLST 1088
             + DS   G CK++                    GD    I  + + D+ ++  + +   
Sbjct: 238  LQSDSHDRGTCKELA-------------------GDTGLKISSN-TGDNNIASEEINNIA 277

Query: 1089 MITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQ-LDKVKLEESCVLV 1265
             I+SN  D     +E++    E S    S    K +  E      +  D+ KLEE+CVLV
Sbjct: 278  KISSNTGDNNITGEEINESCKERSDKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLV 337

Query: 1266 EKNIFSKTGLQV-QKSYKKKIKDVLSPRRWSKGKRNQEHLAAWY---------EEQCGKH 1415
            E          V QKSYKKK++ V S ++ S  ++  E L A +         EE+  + 
Sbjct: 338  EAEKLHVPQESVKQKSYKKKLRQVFSMKKKST-RKEYEQLGALHGDQQPNLEPEEKPMQV 396

Query: 1416 TMIEGELRK-SETIESFESDWELL 1484
                  ++K S   +  ES+WELL
Sbjct: 397  LSKNSNMKKLSSADDHSESEWELL 420


>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
           gi|223535579|gb|EEF37247.1| hypothetical protein
           RCOM_0553590 [Ricinus communis]
          Length = 490

 Score =  117 bits (293), Expect = 2e-23
 Identities = 57/89 (64%), Positives = 72/89 (80%)
 Frame = +3

Query: 255 MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
           MDLKG SWVG++YQKFEAMC+EVEEVMY+DT KYVE+QV+TVG +VK+FYSDVMQDLLPP
Sbjct: 1   MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60

Query: 435 SSLDSEKAQDPVSTVDHDSKVSISMKTKL 521
           SS+D+ K       ++  + + I MK K+
Sbjct: 61  SSVDAAKGAGVDVPLELYADLGIYMKPKV 89


>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum
            tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X2 [Solanum
            tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X3 [Solanum
            tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X4 [Solanum
            tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X5 [Solanum
            tuberosum]
          Length = 421

 Score =  117 bits (292), Expect = 2e-23
 Identities = 129/445 (28%), Positives = 198/445 (44%), Gaps = 35/445 (7%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MDLKG +WVGD+YQKFEAMC+E+E+ MY+DT +YVE+QV+TVG +VK+FYSDV+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANG-----PEQLVKDFSVAVTQEDSLN-Y 596
             ++D  K      +++  +   IS K K     G      ++L+ D  V   +  S   Y
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVY 120

Query: 597  VSSCCPIYCVIKPCKPLSNDS---------AEEKSEEAKFQSQVTMALSDRATSEAKDLD 749
                  I  +++   P S  S         A + S ++K +    +A SD  T     L 
Sbjct: 121  RRQSVGIKEIVRDNHPPSKKSDALCLVSGNAIKLSSDSKVRGGFEVA-SDHMTM-TSPLA 178

Query: 750  CTKDSRESLSNLPSIAENITNNYGEVVRLE-PCSCSDKSLPV----HEDASSKEIPAVSH 914
              K  R S      ++ +I         +    + SD+SL V       A  +   +V  
Sbjct: 179  SVK-GRSSAETGKEVSNHIIKTDVSAAGISINVAASDRSLSVDCVGQNQADLRNTSSVGD 237

Query: 915  SEKDS---LGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLS 1085
             + DS    G CK++                    GD    I  + + D+ ++  + +  
Sbjct: 238  LQSDSHADRGTCKELA-------------------GDTGLKISSN-TGDNNIASEEINNI 277

Query: 1086 TMITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQ-LDKVKLEESCVL 1262
              I+SN  D     +E++    E S    S    K +  E      +  D+ KLEE+CVL
Sbjct: 278  AKISSNTGDNNITGEEINESCKERSDKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVL 337

Query: 1263 VEKNIFSKTGLQV-QKSYKKKIKDVLSPRRWSKGKRNQEHLAAWY---------EEQCGK 1412
            VE          V QKSYKKK++ V S ++ S  ++  E L A +         EE+  +
Sbjct: 338  VEAEKLHVPQESVKQKSYKKKLRQVFSMKKKST-RKEYEQLGALHGDQQPNLEPEEKPMQ 396

Query: 1413 HTMIEGELRK-SETIESFESDWELL 1484
                   ++K S   +  ES+WELL
Sbjct: 397  VLSKNSNMKKLSSADDHSESEWELL 421


>ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum
            lycopersicum]
          Length = 421

 Score =  117 bits (292), Expect = 2e-23
 Identities = 124/448 (27%), Positives = 191/448 (42%), Gaps = 38/448 (8%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MDLKG +WVGD+YQKFEAMC+E+E+ MY+DT +YVE+QV+TVG +VK+FYSDV+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANG-----PEQLVKDFSVAVTQEDSLN-Y 596
             ++D  K      +++  +   IS K K     G      ++L+ D  V   +  S   Y
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISKKLKAQLKGGHPRVINKELIDDTQVIKGKSKSGGVY 120

Query: 597  VSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDCTKDSRESL 776
                  +  +++      N    +KS+     S  T+ LS  +       +   D     
Sbjct: 121  RRQSVGMKEIVR-----DNHPPSKKSDALCLVSGNTIKLSSDSKVRG-GFEVASDHMTMT 174

Query: 777  SNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEKDSLGICKDMME 956
            S L S+              +    ++    V       E+PA   S   +       ++
Sbjct: 175  SPLASV--------------KGLKSTETGKEVSNHIIKTEVPAAGISINIAASDTSLSVD 220

Query: 957  CYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITSNVEDTIEVDKE- 1133
            C G             S GD+ +  D H    +R   L  D    I+SN  D     KE 
Sbjct: 221  CVGQN---QADLRNTFSVGDLQS--DSHVDRGTRKE-LAGDTGLKISSNTGDNNIASKEV 274

Query: 1134 --LSMFDIETSSPFLSSEFRK--CESNEEGF----PD------------GQLDKVKLEES 1253
              ++     T    ++ E  K  C++  +      PD             + D+ KLEE+
Sbjct: 275  NNIAKISSNTDDNNIAGEEIKESCKARSDKSCSPPPDKYDLIESDVEIVERYDEPKLEET 334

Query: 1254 CVLVE-KNIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEG 1430
            CVLVE + +    G   +KSYKKK++ V S ++ S  +   E L A Y +Q       E 
Sbjct: 335  CVLVEAEKLHVPQGSVKRKSYKKKLRQVFSMKKKST-RTEYEQLGALYGDQQPNLQPEEK 393

Query: 1431 EL----------RKSETIESFESDWELL 1484
            ++          + S   +  ES+WELL
Sbjct: 394  QMQVLSKNSNPKKLSSADDHSESEWELL 421


>ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp.
            lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein
            ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata]
          Length = 418

 Score =  116 bits (290), Expect = 4e-23
 Identities = 124/441 (28%), Positives = 207/441 (46%), Gaps = 31/441 (7%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MD KG  WVG++YQKFEAMC+EVEE++ +DT KYVE+QV+TVG++VKKF SDV+QDLLP 
Sbjct: 1    MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANGPEQLVK-DFSVAVTQED--SLNYVSS 605
             S+DS K   PVS + H+     S K K +  N   + VK +  V   ++D  +  +   
Sbjct: 61   DSVDSGKPL-PVSML-HEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKFRGL 118

Query: 606  CCPIYCVIKPCKPLSNDSAEEKSEEAKFQ-------SQVTMALSDRATSEAKDLDCTKDS 764
                Y +    +  S      ++   + Q       SQVT     + +S    +   +  
Sbjct: 119  DADDYDICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRPYMQKDSSSLSMVHSAR-V 177

Query: 765  RESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEKDSLGICK 944
            ++ +  + S   +++  +   V+ +  + +  SL +   A  K+      S     G  +
Sbjct: 178  KDDVGTVNS--SSLSMVHSARVKDDVGTVNSSSLTMVHSARIKDDVGTVKSSDSPPGEVE 235

Query: 945  DMM---ECYGGGAIINVKKITIL-----SDGDMTTDID-GHFSTDSRLSLLKADLSTMIT 1097
             ++   EC       N + +T++     +D ++  D + G     S+ S ++  ++T + 
Sbjct: 236  KLIYKKECQKDDKTKNQQSLTVVNSVKRNDSEIRIDNEHGLMGDSSQDSEIQPSVATSLA 295

Query: 1098 SNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLVEKNI 1277
            +  +D     ++ +  D +TSS  +S +  K E  +       L    +EESC+LV+++ 
Sbjct: 296  AGSDDC----RKETNVDTKTSSSSVSEQ--KSEILQ------PLSGRSVEESCILVDRDE 343

Query: 1278 F-----SKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLA-AWYEE------QCGKHTM 1421
            F      K      K Y KKI+D +S R     ++  + LA  WY E      +CG    
Sbjct: 344  FHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKRLARQWYAEDVENGRECGDDPK 402

Query: 1422 IEGELRKSETIESFESDWELL 1484
               E +  E     ES+WELL
Sbjct: 403  PLEENQSPE-----ESEWELL 418


>ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana]
            gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2|
            expressed protein [Arabidopsis thaliana]
            gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1|
            uncharacterized protein AT2G31130 [Arabidopsis thaliana]
          Length = 419

 Score =  115 bits (289), Expect = 5e-23
 Identities = 132/447 (29%), Positives = 202/447 (45%), Gaps = 37/447 (8%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MD KG  WVG++YQKFEAMC+EVEE++ +DT KYVE+QV+TVG++VKKF SDV+ DLLP 
Sbjct: 1    MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60

Query: 435  SSLDSEKAQDPVSTVDHD-------SKVSISMKTKLNDANGPEQLVK----DFSVAVTQE 581
             S+DS K   PVS + H+        K   SM  K  D    +++ +     F+  +   
Sbjct: 61   ESVDSGKPL-PVSML-HEYAPVYSFKKKKDSMNRKTKDVTQEQEVTEGKKDGFAKKLRGL 118

Query: 582  DSLNYVSSCCP-IYCVIKPCK--PLSNDSAEEKSEEAK-----FQSQVTMALSDRATSEA 737
            D+ +Y     P  Y    P +   +      +K E ++      Q  +T +LS   ++  
Sbjct: 119  DADDYDICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQKDLT-SLSMVHSARV 177

Query: 738  KDLDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHS 917
            KD D    +  SLS + S    + ++ G V        +  SL +   AS K+      S
Sbjct: 178  KD-DLGTVNSSSLSMVHS--ARVNDDVGTV--------NSSSLSMVHHASMKDDVGTVKS 226

Query: 918  EKDSLGICKDMM---ECYGGGAIINVKKITILS---DGDMTTDIDGHFSTDSRLSLLKAD 1079
                 G  + ++   +C       N + +T+++     D    +D      +  S+   D
Sbjct: 227  SDSPPGEVEKLISKKKCQKDDKAKNQQSLTVVNSVKSNDSEVIVDNEHGLSADKSVRSQD 286

Query: 1080 LSTMITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCV 1259
            L    +       E D      ++ETSS  +S    K E  +       L    +EESC+
Sbjct: 287  LEIQPSLATSLPAESDDCRKETNVETSSSSVSEP--KSEILQ------HLSGRSVEESCI 338

Query: 1260 LVEKNIF-----SKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLA-AWYEE------Q 1403
            LV+++ F      K      K Y KKI+D +S R     ++  + LA  WY E      +
Sbjct: 339  LVDRDEFHSVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKRLARQWYAEDVENGRE 397

Query: 1404 CGKHTMIEGELRKSETIESFESDWELL 1484
            CG +     E + SE     ES+WELL
Sbjct: 398  CGDNPKPIEENQSSE-----ESEWELL 419


>ref|XP_004505507.1| PREDICTED: uncharacterized protein LOC101513718 isoform X1 [Cicer
            arietinum] gi|502143898|ref|XP_004505508.1| PREDICTED:
            uncharacterized protein LOC101513718 isoform X2 [Cicer
            arietinum]
          Length = 396

 Score =  113 bits (282), Expect = 3e-22
 Identities = 108/438 (24%), Positives = 188/438 (42%), Gaps = 25/438 (5%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDL 425
            MI MD+KG +WVG++YQKFE MC+E E++MYEDT +Y+E+Q+++VG +VKK YSD++ DL
Sbjct: 1    MITMDVKGITWVGNIYQKFENMCLEAEDMMYEDTAEYIENQMQSVGASVKKLYSDIVGDL 60

Query: 426  LPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANGPEQLVKD----FSVAVTQEDSLN 593
            LP  +   ++ +D     D  +   +  K        P ++ K+     ++  T  DS  
Sbjct: 61   LPSIACGLDEKEDSELPADQCTDAGLRKK--------PVKIFKERPAKANIKQTTVDS-- 110

Query: 594  YVSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDCTKDSRES 773
                        +  K   ND   + S +   ++ V    S R  +  K  +    SR+ 
Sbjct: 111  ------------RIDKNADNDCIHDASYDGTCKTDVLFKSSSR--NSVKKSNFISRSRQY 156

Query: 774  LSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPAVSHSEKDSLGICKDMM 953
            + ++     +I +N G  V   P +    +  +  + +S E    +         C+ + 
Sbjct: 157  VGSM-----DIKSNLG--VDENPVNEKMAATKISNEITSAETKIFNEITSAETDTCRPLQ 209

Query: 954  EC--------YGGGAII----NVKKITILSDGDMTTDIDGHFSTDSRLSLLKAD------ 1079
             C           GA +    + +  ++ S+ D   +I+   +      L++        
Sbjct: 210  RCEISNEDQNQNHGARVSKPASAEVTSLASEADHCNEIENACTEQFPYVLVQVKSAEEKQ 269

Query: 1080 ---LSTMITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEE 1250
                S+       D   +D+ +   D   S   LS         E+G    Q D +KLEE
Sbjct: 270  IGMSSSCGPFGERDDFSMDRTVQSDDCSNSMVVLSYP-------EQGKKSMQEDHLKLEE 322

Query: 1251 SCVLVEKNIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEG 1430
            +CV+V  +   + G  ++ + K + +   S  + S  K+  E LA W+E  C   T+   
Sbjct: 323  TCVMVTGDDLQEIG-NLKTNKKTRRRQGFSLSKKSARKQEYEELAIWHENNCLDKTLHHK 381

Query: 1431 ELRKSETIESFESDWELL 1484
            +L         ES+WELL
Sbjct: 382  KLLVPSV---SESEWELL 396


>ref|XP_002300422.2| hypothetical protein POPTR_0001s38590g [Populus trichocarpa]
            gi|550349191|gb|EEE85227.2| hypothetical protein
            POPTR_0001s38590g [Populus trichocarpa]
          Length = 469

 Score =  112 bits (281), Expect = 4e-22
 Identities = 125/489 (25%), Positives = 213/489 (43%), Gaps = 79/489 (16%)
 Frame = +3

Query: 255  MDL--KGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLL 428
            MDL  KG +W G++YQKFE MC EV+ V+ +D FK+VE+QV +VG+ +KK YSD + DL+
Sbjct: 1    MDLRSKGMAWAGNIYQKFETMCHEVDNVVNKDAFKFVENQVHSVGENMKKIYSDAVHDLI 60

Query: 429  PPSSLDSEKAQDPVST------------VDHDSKVSISMKTKLNDANGPEQLVKDFSVAV 572
            PP  +D  K + P +             ++ D + +   K    +    + + K     V
Sbjct: 61   PP-LVDPAKCEAPAAATIGAYIIKTVIGIEDDHEYASRAKHSPAELGDHDPMTKQLGKDV 119

Query: 573  TQEDSLNYVSSCCPIYCVIKPCKPLSNDSAE----EKSEEAKFQSQVTMALSDRATSE-- 734
             +    N ++              ++ +S E     +SE A     VT   SD +T E  
Sbjct: 120  WELQVANQLT--------------ITGNSEETIEGAESESALGVDDVTTETSDVSTEENS 165

Query: 735  AKDLDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHE------------ 878
             K+  C  +  ES+++        ++ + +      C   DK  PV              
Sbjct: 166  VKENSCGPEELESITHGDKEPFEASSEFPDFSSENACGFLDKVSPVTSVPGEAFQCPQDV 225

Query: 879  -----------------DASSKEIPAVSHSEKDSLGI------CKDMME--CYGGGAIIN 983
                              +SS+   +V  SEK+++ +      C    E  C  G  + N
Sbjct: 226  GTVCDSSAGDSYSANVIVSSSQMSFSVVSSEKEAVEMEIVSPSCSIFKESHCLPGNPLDN 285

Query: 984  VKKITILSDGDMTTDIDGHFSTDSRLSLLK----ADLSTMITSNVE-DTIEVDKELSMFD 1148
            +    ++S G+   D+ GH S  S++ L      +D S +++S+    T+      +   
Sbjct: 286  I-TTKLISCGN-PFDVAGHDSDSSKMLLSSTSSHSDSSVVLSSSTSAPTVSCKINGAEMG 343

Query: 1149 IETSSPFLSSEFRKC--ESNEEGFPDGQL------DKVKLEESCVLVEKNIFSKTGLQVQ 1304
            + +S+  LS     C  +S  E   + ++      + VKL+ESCV+V+ +   +   + +
Sbjct: 344  LASSNSVLSLVSIGCSDDSAIEDLTESEMENIDLSENVKLDESCVIVDNSFLYEVSRRNR 403

Query: 1305 --KSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEGELRKSETIE------- 1457
              +SYKKKI+D  S ++  +  +  E LA W+ +  G  TM + EL  S TI        
Sbjct: 404  RLRSYKKKIQDAFSSKK--RLTKEYEQLAIWFGDLDGHDTM-QHELSSSTTITLDPQTNW 460

Query: 1458 SFESDWELL 1484
              +S+WELL
Sbjct: 461  RQDSEWELL 469


>gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 341

 Score =  111 bits (278), Expect = 9e-22
 Identities = 115/383 (30%), Positives = 175/383 (45%), Gaps = 16/383 (4%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYS----DV 413
            MI MDLKG +WVG +Y+KFEAMC+EVEEVMY+DT KYVE++V+TVG +VKKFYS    DV
Sbjct: 1    MISMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDV 60

Query: 414  MQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLN-----DA--NGPEQLVKDFSVAV 572
            MQDLL PSSL+  KA   V+  D   ++      K N     DA     EQL +D  V  
Sbjct: 61   MQDLLLPSSLEPMKA---VAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIA 117

Query: 573  TQEDSLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMALSDRATSEAKDLDC 752
               ++  +V S C ++ V    +  S    E  S +      ++   ++R T    +++ 
Sbjct: 118  DVNENAAHVPSSCQLHMVDNIFESCSGSFVERASSDL-----LSGEHNNRCTLNKTNVEH 172

Query: 753  TKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDKSLPVHEDASSKEIPA----VSHSE 920
               +  S     S A  + N +G +       C + +   + + S  +IPA    VS  E
Sbjct: 173  LLPAETS-----SEAGCVENEFGRMSSF----CGNAN--ANHEVSCHQIPATLTPVSVEE 221

Query: 921  KDSLGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITS 1100
             D    C  + E        +     IL DG     I           + K ++    +S
Sbjct: 222  DD----CDSIEESSNEIKSASDSVPEILPDGLHLVGI-----------VEKNEMEMRCSS 266

Query: 1101 NVEDTIEVDKELSMF-DIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLVEKNI 1277
            ++ ++ E + +L+   D   SS     E    +         QLDK++++ESC +V    
Sbjct: 267  SIIESEESNGKLNWTKDASGSSTVGRKEIETVQ---------QLDKIRVDESCFMVNGAE 317

Query: 1278 FSKTGLQVQKSYKKKIKDVLSPR 1346
                  +  K    +I+D +S R
Sbjct: 318  LHFHPQREGKHKTYQIRDAISSR 340


>gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao]
           gi|508700926|gb|EOX92822.1| Uncharacterized protein
           isoform 5 [Theobroma cacao]
          Length = 334

 Score =  110 bits (276), Expect = 2e-21
 Identities = 71/156 (45%), Positives = 92/156 (58%), Gaps = 11/156 (7%)
 Frame = +3

Query: 246 MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYS----DV 413
           MI MDLKG +WVG +Y+KFEAMC+EVEEVMY+DT KYVE++V+TVG +VKKFYS    DV
Sbjct: 1   MISMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDV 60

Query: 414 MQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLN-----DA--NGPEQLVKDFSVAV 572
           MQDLL PSSL+  KA   V+  D   ++      K N     DA     EQL +D  V  
Sbjct: 61  MQDLLLPSSLEPMKA---VAASDLPVEIYAETLKKPNVGLKEDAIQGDSEQLTEDSEVIA 117

Query: 573 TQEDSLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEE 680
              ++  +V S C ++ V    +  S    E  S +
Sbjct: 118 DVNENAAHVPSSCQLHMVDNIFESCSGSFVERASSD 153


>ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Populus trichocarpa]
            gi|550317324|gb|EEE99961.2| hypothetical protein
            POPTR_0019s11960g [Populus trichocarpa]
          Length = 442

 Score =  110 bits (275), Expect = 2e-21
 Identities = 125/470 (26%), Positives = 191/470 (40%), Gaps = 57/470 (12%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYE---------------------------- 341
            M +MDLKG +WVGD+Y KFEA  +EVEE+M E                            
Sbjct: 1    MTIMDLKGITWVGDIYLKFEARLLEVEEIMREAAEFEWPARAVQFPPKLQMLGCCGCCFG 60

Query: 342  -DTFKYVEDQVRTVGDTVKKFYSDVMQDLLPPSSLDSEKAQDPVSTVDHDSKVSISMK-- 512
             +  KYVE+Q++TV + V+KFYSDVMQDL  P S D          VD  + V I MK  
Sbjct: 61   QEAVKYVENQMQTVSNNVRKFYSDVMQDLCSPDSEDPANGAVSKFPVDSGADVGIYMKPE 120

Query: 513  ----TKLNDANGPEQLVKDFSVAVTQEDSLNYVSSCCPIYCVIKPCKPLSNDSAEEKSEE 680
                 K   A+ PEQL +D  +           S C P+   I   +     S    S +
Sbjct: 121  DGMEEKCGKADDPEQLAEDPKMTADSG------SDCLPLRRRITVRRISRQHSKGSLSNK 174

Query: 681  AKFQSQVTMALSDRATSEAKDLDCTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSDK 860
            +   +      ++ + +E      T  S +  SN+    +N+  +  +  RL    C + 
Sbjct: 175  SNLDTDKNSNCNNVSPNEIS--GTTTLSSKFSSNVELSDQNLEASCDQTARLATPGCVEV 232

Query: 861  SLPVHEDASSKEIPAVSHSEKDSLGICKDMMECYGGGAIINVKKITILSDGDMTTDIDGH 1040
            +     + S  EI   S      +   K  ++      ++N+            T+   H
Sbjct: 233  TDHFSMEESKNEIKNAS-KHVPEISFNKPSLD------MVNI------------TETGRH 273

Query: 1041 FSTDSRLS---LLKAD----LSTMITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCES 1199
              TDSR S   LL+      +S    S +E     + + + F  E      S E+   ES
Sbjct: 274  EGTDSRPSSRNLLEESNGVCISNEFVSMIESAANGNMQTNKFAYEEDFVSNSDEW-GIES 332

Query: 1200 NEEG--FPDG----QLDKVKLEESCVLVEKNIFSKTGLQVQKSYKKKIKDVLSPRRWSKG 1361
            +E+G    +G    + DK +LEE CVLV  + F     + +    KKI+DV   R+ S  
Sbjct: 333  DEDGTIIDEGMEIIRADKARLEEVCVLVNVDEFHHVPREGKNRPYKKIRDVFRSRKRSVM 392

Query: 1362 KRNQEHLAAWYEEQCGKH---------TMIEGELRKSETIESFESDWELL 1484
            K  ++  A    +   K          T+   E  +S + +  ES+WEL+
Sbjct: 393  KEYEQLAAQCSSDSKSKEEESITSLMPTLSIKEANRSLSHDPSESEWELV 442


>ref|XP_004487168.1| PREDICTED: protein gar2-like [Cicer arietinum]
          Length = 365

 Score =  109 bits (272), Expect = 5e-21
 Identities = 103/422 (24%), Positives = 194/422 (45%), Gaps = 9/422 (2%)
 Frame = +3

Query: 246  MIVMDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDL 425
            M+ MD  G +WV ++YQKFE M +E+++ ++E+TF+Y+E+Q+  VG++VKKFYSDVMQ++
Sbjct: 1    MMTMDATGIAWVCNIYQKFENMFLELQDTLFEETFQYIENQMEIVGESVKKFYSDVMQEI 60

Query: 426  LPPSSLDSEKAQDPVSTVDHDSKVSISMKTKLNDANGPEQLVKDFSVAVTQEDSLNYVSS 605
            LP SS  +     PV +V+  +   +   TK +     E  ++ ++   T+  S+N+   
Sbjct: 61   LPQSSFSA-----PVMSVEQYTGAGL---TKKSFQASREITIRAYTEQSTENSSVNHDIG 112

Query: 606  CCPIYCVIKPCKPLSNDSAEEKSEEAKFQSQVTMAL-SDRATSE---AKDLDCTKDSRES 773
               +Y   + C     +SAE KS  +  ++Q  M + +   T+E   +K   C       
Sbjct: 113  NDAVYA--ESCGKQCVESAEVKSNLSSDENQQNMKMVASNTTTEVALSKTDTCISSQSCE 170

Query: 774  LSNLPSIAENITN--NYGEVVRLEPC-SCSDKSLPVHEDASSKEIPAVSHSEKDSLGICK 944
            ++N+    E   +  +Y EV        C ++S     + +S  +  V  +E++ +    
Sbjct: 171  IANVNQNHEATVSKTDYAEVTNFASVEDCCNESENASTEQNSNAMELVESTEENEIN--- 227

Query: 945  DMMECYGGGAIINVKKITILSDGDMTTDIDGHFSTDSRLSLLKADLSTMITSNVEDTIEV 1124
                 +   A  +  +++ +  G M  D   H    S +++   + S++   N +  +E 
Sbjct: 228  --TSYFSSDAFEDAHELSTI--GAMQLDDCSH----STITVSHPESSSLDIENFDAAMEK 279

Query: 1125 DKELSMFDIETSSPFLSSEFRKCESNEEGFPDGQLDKVKLEESCVLVEKNIFSKT--GLQ 1298
            D ++   D                           D+++ +E+CV++ K+ +      + 
Sbjct: 280  DHKIIHQD---------------------------DELQFDETCVMITKDEYQSVPEAIV 312

Query: 1299 VQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEGELRKSETIESFESDWE 1478
              K+ KKK +   S  + S  K+  E LA          T +E + +KS   +  ES+WE
Sbjct: 313  NLKTSKKKWRQPFSLSKKSARKQEYEELALL--------TFVEHK-QKSSAADISESEWE 363

Query: 1479 LL 1484
            LL
Sbjct: 364  LL 365


>ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus]
          Length = 379

 Score =  108 bits (271), Expect = 6e-21
 Identities = 117/434 (26%), Positives = 194/434 (44%), Gaps = 24/434 (5%)
 Frame = +3

Query: 255  MDLKGGSWVGDLYQKFEAMCVEVEEVMYEDTFKYVEDQVRTVGDTVKKFYSDVMQDLLPP 434
            MD+KG +WVG LY+KFE MC+EVE+++ +DT KYVE+QV  VG +VK+FYSDVMQD LPP
Sbjct: 1    MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60

Query: 435  SSLDSEKAQDPVSTVDHDSKVSI----SMKTKLNDANGPEQLVKDFSVAVTQEDSLNYVS 602
            S L  EK     S +++   V I    +M  K+  +   E+   + S      D+   ++
Sbjct: 61   SELSDEKVAVCNSALENYENVVICKKPTMGMKIERSKFSEEKSNENSKVTA--DAKRDIA 118

Query: 603  SCCP--------IYCVIKPCKPLSN---DSAEEKSEEAKFQSQVTMALSDRATSEAKDLD 749
               P        +Y V  P    +    D    K ++     ++ +   +  T   K L 
Sbjct: 119  CKLPRGHNHANYLYLVSSPYSAANRAQIDGYSRKKDDENIHHKIDLDGRESTTRGCKSL- 177

Query: 750  CTKDSRESLSNLPSIAENITNNYGEVVRLEPCSCSD-----KSLPVHEDASSKEIPAVSH 914
                +  S +NL    EN  ++   ++  +  + S+     +++ V +   +  + + + 
Sbjct: 178  ----TETSPTNLEKKYENDASSCCTILNRKSEASSELAGNMETMLVKDTRCNSVMQSANE 233

Query: 915  SEKDSLGICKDMMECYGGGAIINVKKIT-ILSDGDMTTDIDGHFSTDSRLSLLKADLSTM 1091
            +E  +  I  D        AI++ +K T +LS GD + ++DG   +DS            
Sbjct: 234  TEIKTDNILPDT----PSSAIVDTEKETRLLSYGDSSAELDGR--SDS------------ 275

Query: 1092 ITSNVEDTIEVDKELSMFDIETSSPFLSSEFRKCESNEEGFPD-GQLDKVKL-EESCVLV 1265
                           S+ DIE                E+G  +  Q D+ KL EE+CVLV
Sbjct: 276  --------------WSLDDIEL---------------EQGTHNIQQADETKLDEEACVLV 306

Query: 1266 E-KNIFSKTGLQVQKSYKKKIKDVLSPRRWSKGKRNQEHLAAWYEEQCGKHTMIEGELRK 1442
            +  ++      +V++ + KKI    S  + SK K+  + LA  +    G     + E +K
Sbjct: 307  KGDDLHFDFNEEVKQRHYKKIAGAFSFTKKSKRKQEYKELAMKHGYGFGTIPNQQDE-QK 365

Query: 1443 SETIESFESDWELL 1484
                +  E DW+LL
Sbjct: 366  LTAEDVLEQDWQLL 379

