BLASTX nr result

ID: Rehmannia22_contig00005887

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00005887
         (1393 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004239298.1| PREDICTED: uncharacterized protein LOC101253...   421   e-115
ref|XP_006343189.1| PREDICTED: uncharacterized protein LOC102581...   417   e-114
ref|XP_006438862.1| hypothetical protein CICLE_v10031153mg [Citr...   387   e-105
ref|XP_006438861.1| hypothetical protein CICLE_v10031153mg [Citr...   387   e-105
ref|XP_006483007.1| PREDICTED: myosin-9-like isoform X1 [Citrus ...   387   e-105
ref|XP_002267942.2| PREDICTED: uncharacterized protein LOC100260...   385   e-104
gb|EPS73766.1| hypothetical protein M569_00990, partial [Genlise...   381   e-103
gb|EXB81215.1| hypothetical protein L484_013156 [Morus notabilis]     379   e-102
ref|XP_004157632.1| PREDICTED: uncharacterized LOC101205430, par...   378   e-102
ref|XP_004140652.1| PREDICTED: uncharacterized protein LOC101205...   378   e-102
ref|XP_006483008.1| PREDICTED: myosin-9-like isoform X2 [Citrus ...   375   e-101
gb|EOX99658.1| Uncharacterized protein isoform 6 [Theobroma caca...   372   e-100
gb|EOX99655.1| Uncharacterized protein isoform 3 [Theobroma cacao]    372   e-100
gb|EOX99654.1| Uncharacterized protein isoform 2 [Theobroma cacao]    372   e-100
gb|EOX99653.1| Uncharacterized protein isoform 1 [Theobroma cacao]    372   e-100
gb|EOX99656.1| Uncharacterized protein isoform 4 [Theobroma caca...   370   e-100
ref|XP_002512652.1| conserved hypothetical protein [Ricinus comm...   370   e-100
ref|XP_002511106.1| conserved hypothetical protein [Ricinus comm...   358   4e-96
gb|EOY22608.1| Uncharacterized protein isoform 1 [Theobroma cacao]    357   5e-96
ref|XP_004145144.1| PREDICTED: uncharacterized protein LOC101221...   353   7e-95

>ref|XP_004239298.1| PREDICTED: uncharacterized protein LOC101253187 [Solanum
            lycopersicum]
          Length = 551

 Score =  421 bits (1082), Expect = e-115
 Identities = 240/419 (57%), Positives = 290/419 (69%), Gaps = 10/419 (2%)
 Frame = -3

Query: 1229 KMRQWSSESG--------GGSPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXAS 1074
            ++R+WSSESG        G S  R  H RSSS +G+SNIKRT                AS
Sbjct: 11   QLRKWSSESGAPMAALTVGSSSPR--HGRSSSITGMSNIKRTQNVAAKAAAQRLAQVMAS 68

Query: 1073 QAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNG-ANNGSGGVRPVIPSPK 897
            QAA                     RF+           V T   +++ S  + PVI S K
Sbjct: 69   QAATGNDDDEDGDDDLGF------RFSAPPPPSFSRSKVSTAANSSSDSNAINPVIQSAK 122

Query: 896  IS-RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSN 720
            ++ RSSS A+  N+++ +   RSTS GRP++              ++T         P+N
Sbjct: 123  LNTRSSSPALARNIVEELPSLRSTSAGRPTVPSRPPPSIPSTQQPVRTPSPIPPIDPPTN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
            + REK+FSPDL ++NLKD GD RA+SALRDELDMLQEENEN+L KLR+AE S E AEARV
Sbjct: 183  KLREKRFSPDLRQVNLKDTGDHRAASALRDELDMLQEENENLLGKLRVAETSYEEAEARV 242

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            KELEKQVAALGEGVSLEAKLLSRKEA+LRQREAALK+AK AK+G+D E++SL SNV+ AK
Sbjct: 243  KELEKQVAALGEGVSLEAKLLSRKEASLRQREAALKDAKQAKNGIDAELASLHSNVQKAK 302

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            +EAA  V+QL+G+ESEVKALRSMTQRM+LTQ+EME+VVLKRCWLARYWGLA Q GIC DI
Sbjct: 303  DEAAAAVDQLQGSESEVKALRSMTQRMILTQNEMEDVVLKRCWLARYWGLATQFGICADI 362

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSDLTGEGNIESM 3
            A SKHEYWSS APLP E+V+SAGQKAKE+C EKGD+NP+ GK V+D +DLTGEGNIESM
Sbjct: 363  AASKHEYWSSFAPLPFELVISAGQKAKEECLEKGDDNPEMGKFVQDLNDLTGEGNIESM 421


>ref|XP_006343189.1| PREDICTED: uncharacterized protein LOC102581164 [Solanum tuberosum]
          Length = 551

 Score =  417 bits (1072), Expect = e-114
 Identities = 239/419 (57%), Positives = 287/419 (68%), Gaps = 10/419 (2%)
 Frame = -3

Query: 1229 KMRQWSSESG--------GGSPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXAS 1074
            ++R+WSSE+G        G S  R  H RSSS SG+SNIKRT                AS
Sbjct: 11   QLRKWSSETGAPMAALTVGSSSPR--HGRSSSISGMSNIKRTQNVAAKAAAQRLAQVMAS 68

Query: 1073 QAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNG-ANNGSGGVRPVIPSPK 897
            QAA                     RF            V T   +++ S  + P I S K
Sbjct: 69   QAATGNDDDEDGDDDLGF------RFAAPPPPTFSRSKVSTAANSSSDSNAINPAIQSAK 122

Query: 896  IS-RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSN 720
            ++ RSSS A+  N ++ +   RSTS GRP++              +KT         P+N
Sbjct: 123  LNTRSSSPALARNFVEELPSLRSTSAGRPTVPSRPPPSIPSTQQPVKTPSPIPPIDPPTN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
            + REK+FSPDL ++NLKD GD RA+SALRDE DMLQEENEN+L KLR+AE S E AEARV
Sbjct: 183  KLREKRFSPDLRQVNLKDTGDHRAASALRDEFDMLQEENENLLGKLRVAETSYEEAEARV 242

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            KELEKQVAALGEGVSLEAKLLSRKEA+LRQREAALK+AK AK+G+ VE++SL SNV+ AK
Sbjct: 243  KELEKQVAALGEGVSLEAKLLSRKEASLRQREAALKDAKQAKNGIAVELASLHSNVQKAK 302

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            ++AA  V+QL+GTESEVK+LRSMTQRM+LTQ+EME+VVLKRCWLARYWGLA Q GIC DI
Sbjct: 303  DDAAAAVDQLQGTESEVKSLRSMTQRMILTQNEMEDVVLKRCWLARYWGLATQFGICADI 362

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSDLTGEGNIESM 3
            A SKHEYWSS APLP E+V+SAGQKAKE+C EKGD+NP+RGK V+D +DLTGEGNIESM
Sbjct: 363  AASKHEYWSSFAPLPFELVISAGQKAKEECLEKGDDNPERGKFVQDLNDLTGEGNIESM 421


>ref|XP_006438862.1| hypothetical protein CICLE_v10031153mg [Citrus clementina]
            gi|557541058|gb|ESR52102.1| hypothetical protein
            CICLE_v10031153mg [Citrus clementina]
          Length = 547

 Score =  387 bits (995), Expect = e-105
 Identities = 231/420 (55%), Positives = 271/420 (64%), Gaps = 13/420 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----PARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVX 1059
            RQW SESGG S     PAR  H RSSS+SG+S+IKR                 ASQ A  
Sbjct: 13   RQWGSESGGTSSPAMSPARHHHARSSSSSGLSSIKRNQNVAAKAAAQRLAQVMASQTA-- 70

Query: 1058 XXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGA----NNGSGGVRPVIPSPKIS 891
                               R++              NGA    N G+   +P + S +I+
Sbjct: 71   -------DDDEDDDDDLGFRYSAPPPLALSRSR-NVNGASIAGNAGAASTKPAVTSSRIN 122

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXS---LKTXXXXXXXXXPSN 720
            RS S A+  N++D     RSTS GRPSM                 L+T         P N
Sbjct: 123  RSPSPALGRNVVDEPTSVRSTSAGRPSMSHCAAAAPPVPQNKPLPLRTAVSLPPIDPPKN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
              RE++        NLKD GD+R +S LRDELDMLQEENENIL KLRL EE CE AEARV
Sbjct: 183  LHREQR------NFNLKDNGDQREASVLRDELDMLQEENENILNKLRLEEERCEEAEARV 236

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            +ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S ++  K
Sbjct: 237  RELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQNKDEVDKEIVSLRSELENTK 296

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            ++ A V++QLR  +SEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D+
Sbjct: 297  DDTAAVLQQLRAADSEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADV 356

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKG-DNNPQRGKLVEDFSDLTGEGNIESM 3
            A+SK+EYWSSLAPLP E+V+SAGQKAKE+CWEKG D+N +R KLV D +DLTGEGNIESM
Sbjct: 357  AVSKYEYWSSLAPLPFEVVISAGQKAKEECWEKGDDDNEKRSKLVVDVNDLTGEGNIESM 416


>ref|XP_006438861.1| hypothetical protein CICLE_v10031153mg [Citrus clementina]
            gi|557541057|gb|ESR52101.1| hypothetical protein
            CICLE_v10031153mg [Citrus clementina]
          Length = 470

 Score =  387 bits (995), Expect = e-105
 Identities = 231/420 (55%), Positives = 271/420 (64%), Gaps = 13/420 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----PARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVX 1059
            RQW SESGG S     PAR  H RSSS+SG+S+IKR                 ASQ A  
Sbjct: 13   RQWGSESGGTSSPAMSPARHHHARSSSSSGLSSIKRNQNVAAKAAAQRLAQVMASQTA-- 70

Query: 1058 XXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGA----NNGSGGVRPVIPSPKIS 891
                               R++              NGA    N G+   +P + S +I+
Sbjct: 71   -------DDDEDDDDDLGFRYSAPPPLALSRSR-NVNGASIAGNAGAASTKPAVTSSRIN 122

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXS---LKTXXXXXXXXXPSN 720
            RS S A+  N++D     RSTS GRPSM                 L+T         P N
Sbjct: 123  RSPSPALGRNVVDEPTSVRSTSAGRPSMSHCAAAAPPVPQNKPLPLRTAVSLPPIDPPKN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
              RE++        NLKD GD+R +S LRDELDMLQEENENIL KLRL EE CE AEARV
Sbjct: 183  LHREQR------NFNLKDNGDQREASVLRDELDMLQEENENILNKLRLEEERCEEAEARV 236

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            +ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S ++  K
Sbjct: 237  RELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQNKDEVDKEIVSLRSELENTK 296

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            ++ A V++QLR  +SEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D+
Sbjct: 297  DDTAAVLQQLRAADSEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADV 356

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKG-DNNPQRGKLVEDFSDLTGEGNIESM 3
            A+SK+EYWSSLAPLP E+V+SAGQKAKE+CWEKG D+N +R KLV D +DLTGEGNIESM
Sbjct: 357  AVSKYEYWSSLAPLPFEVVISAGQKAKEECWEKGDDDNEKRSKLVVDVNDLTGEGNIESM 416


>ref|XP_006483007.1| PREDICTED: myosin-9-like isoform X1 [Citrus sinensis]
          Length = 547

 Score =  387 bits (994), Expect = e-105
 Identities = 231/420 (55%), Positives = 272/420 (64%), Gaps = 13/420 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----PARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVX 1059
            RQW SESGG S     PAR  H RSSS+SG+S+IKR                 ASQ A  
Sbjct: 13   RQWGSESGGTSSPAMSPARHHHARSSSSSGLSSIKRNQNVAAKAAAQRLAQVMASQTA-- 70

Query: 1058 XXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGA----NNGSGGVRPVIPSPKIS 891
                               R++              NGA    N G+   +P + S +I+
Sbjct: 71   -------DDDEDDDDDLGFRYSAPPPLALSRSR-NVNGASIAGNAGATSTKPAVTSSRIN 122

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXS---LKTXXXXXXXXXPSN 720
            RS S A+  N++D     RSTS GRPSM                 L+T         P N
Sbjct: 123  RSPSPALGRNVVDEPTSVRSTSAGRPSMSHCAAAAPPVPQNKPLPLRTAVSLPPIDPPKN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
              RE++       LNLKD GD+R +S LRDELDMLQEENENIL KLRL EE CE AEARV
Sbjct: 183  LHREQR------NLNLKDNGDQREASVLRDELDMLQEENENILNKLRLEEERCEEAEARV 236

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            +ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S ++  K
Sbjct: 237  RELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQNKDEVDKEIVSLRSELENTK 296

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            ++ A V++QLR  +SEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D+
Sbjct: 297  DDTAAVLQQLRAADSEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICVDV 356

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKG-DNNPQRGKLVEDFSDLTGEGNIESM 3
            A+SK+EYWSSLAPLP E+V+SAGQKAKE+CWEKG D+N +R KL+ D +DLTGEGNIESM
Sbjct: 357  AVSKYEYWSSLAPLPFEVVISAGQKAKEECWEKGDDDNEKRSKLLVDINDLTGEGNIESM 416


>ref|XP_002267942.2| PREDICTED: uncharacterized protein LOC100260846 isoform 1 [Vitis
            vinifera] gi|296088170|emb|CBI35662.3| unnamed protein
            product [Vitis vinifera]
          Length = 553

 Score =  385 bits (988), Expect = e-104
 Identities = 224/415 (53%), Positives = 275/415 (66%), Gaps = 8/415 (1%)
 Frame = -3

Query: 1223 RQWSSESGGG-------SPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAA 1065
            RQWSSESG         SP+   H RS+SA+GISNIKRT                ASQ A
Sbjct: 13   RQWSSESGATGTSSPAMSPSLYHHSRSASATGISNIKRTQNFAAKAAAQRLAQVMASQTA 72

Query: 1064 VXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKISRS 885
                                  F+                 N+G    +P +P+ +++RS
Sbjct: 73   DDDEDDEDDDLGFRYSAPPPPAFSRT--------------VNSG----KPAVPASRVTRS 114

Query: 884  SSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNRQREK 705
             S  +  N ++     RSTS GRPSM              L+T         P NRQ+EK
Sbjct: 115  PSPGLGRNFVEETPSVRSTSAGRPSMSLNAIPLVSPSRAPLRTPVPIPPIEPP-NRQKEK 173

Query: 704  KFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKELEK 525
            +FS ++G  N KD GD+R +SALRDE+DMLQEENENIL+KLRL EE C+ AEARV+ELEK
Sbjct: 174  RFSSNVGHFNPKDTGDQREASALRDEVDMLQEENENILDKLRLEEERCKDAEARVRELEK 233

Query: 524  QVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKNEAAT 345
            QVAALGEGVSLEAKLLSRKEAALRQREAALK+AK ++DG D E++ L+S ++ AK+ A  
Sbjct: 234  QVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQSRDGEDEEIAFLRSELENAKDRAGA 293

Query: 344  VVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMSKH 165
            V++QL G +SEVKALRSMTQRMVLTQ EMEEVVLKRCWLARYWGLAA+ GIC DIA+SKH
Sbjct: 294  VLDQLHGAKSEVKALRSMTQRMVLTQKEMEEVVLKRCWLARYWGLAARHGICADIAVSKH 353

Query: 164  EYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQ-RGKLVEDFSDLTGEGNIESM 3
            E+WSSLAPLP E+V+SAGQKAKE+ W +G+++P+ R KLV+D +DLTG+GNIESM
Sbjct: 354  EHWSSLAPLPFEVVISAGQKAKEE-WRRGEDDPETRSKLVQDLNDLTGDGNIESM 407


>gb|EPS73766.1| hypothetical protein M569_00990, partial [Genlisea aurea]
          Length = 405

 Score =  381 bits (978), Expect = e-103
 Identities = 235/416 (56%), Positives = 266/416 (63%), Gaps = 5/416 (1%)
 Frame = -3

Query: 1235 MEKMRQWSSESGGGSPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVXX 1056
            MEKMRQWS+E  G SPAR  H RSSS S ISNIKRT                ASQ+A   
Sbjct: 1    MEKMRQWSAEPAGASPARVQHGRSSSVSSISNIKRTQNYAAKAAAQRLAQVMASQSAADN 60

Query: 1055 XXXXXXXXXXXXXXXXXLR--FNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKISRSS 882
                              +                K +GANN   G++P IPSPKI RS+
Sbjct: 61   DEDDDDEYDADDYSLLRFKPPLPLSLSSASRPPVNKISGANNAIAGIKPPIPSPKIDRST 120

Query: 881  SDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNR-QREK 705
            SD V     + + P+RSTSTG+ S                KT         P NR QREK
Sbjct: 121  SDTVLQLQQEEIPPSRSTSTGKSSTSIKTASSLPPFRPPFKTPAPIPPTTDPPNRRQREK 180

Query: 704  KFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKELEK 525
            KFS        +D    RASSALRDELD+LQEENE+ILEKLRL EESCEAAEARVKELEK
Sbjct: 181  KFS--------QDSAGSRASSALRDELDILQEENESILEKLRLTEESCEAAEARVKELEK 232

Query: 524  QVAALGEGVSLEAKLLSRKEAALRQREAALK-EAKVAKDGVDVEMSSLQSNVKIAKNEAA 348
            QVA LGEGV+LEAKLLSRKEAALR+REAAL+ EAKVAKDG DVEM SL+S++K AK EAA
Sbjct: 233  QVATLGEGVTLEAKLLSRKEAALRRREAALREEAKVAKDGTDVEMESLRSDLKTAKKEAA 292

Query: 347  TVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMSK 168
             V E L GT+SEVKALRS            EEVVLKRCWLARYWGLA +LGIC DIA+SK
Sbjct: 293  AVYEHLHGTKSEVKALRS------------EEVVLKRCWLARYWGLAMELGICEDIAVSK 340

Query: 167  HEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSD-LTGEGNIESM 3
            +EYWSSLAPLP+E+VLSAGQKAKE+C +KG       ++ +DFSD LTGEGNIESM
Sbjct: 341  YEYWSSLAPLPVEVVLSAGQKAKEECRKKG------YRIADDFSDHLTGEGNIESM 390


>gb|EXB81215.1| hypothetical protein L484_013156 [Morus notabilis]
          Length = 464

 Score =  379 bits (973), Expect = e-102
 Identities = 231/419 (55%), Positives = 273/419 (65%), Gaps = 12/419 (2%)
 Frame = -3

Query: 1223 RQWSSESG----------GGSPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXAS 1074
            RQW+SESG            SPAR  H RSSS+SGISNIKRT                AS
Sbjct: 13   RQWTSESGTTIPASQSSPAMSPARNRHARSSSSSGISNIKRTQNFAAKAAAQRLAQVMAS 72

Query: 1073 QAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKI 894
            Q A                     R++           VK       SG  +P +PS K 
Sbjct: 73   QTAADEDEDEDDGDLGF-------RYSAPPPLSLSRT-VK-------SGATKPAVPSAKT 117

Query: 893  SRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNR- 717
            +RS S ++  N ++    ARSTSTGRPS+              L+T         P N  
Sbjct: 118  TRSPSPSLAQNFVEETPSARSTSTGRPSIRPAPLAPPNKTT--LRTAVSMPPTETPVNNW 175

Query: 716  QREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVK 537
            Q++ +F  + G    KD GD+  +SALRDELDMLQEENENIL+KLR  EE  E AEARV+
Sbjct: 176  QKDYRFLSETGLYKSKDSGDQNEASALRDELDMLQEENENILDKLRHEEERHEVAEARVR 235

Query: 536  ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKN 357
            ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK +K  VD E+ SL+S V  AK+
Sbjct: 236  ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQSKGVVDKEIVSLRSEVANAKD 295

Query: 356  EAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIA 177
             AAT+V+QL+G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GICPDIA
Sbjct: 296  AAATIVQQLQGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKHGICPDIA 355

Query: 176  MSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGD-NNPQRGKLVEDFSDLTGEGNIESM 3
            ++K+E+WSSLAPLP E+V+SAGQKAKE+C EKGD +  +R KLV+D +DLTGEGNIESM
Sbjct: 356  VTKYEHWSSLAPLPFEVVVSAGQKAKEECREKGDADTEKRSKLVQDLNDLTGEGNIESM 414


>ref|XP_004157632.1| PREDICTED: uncharacterized LOC101205430, partial [Cucumis sativus]
          Length = 415

 Score =  378 bits (971), Expect = e-102
 Identities = 234/417 (56%), Positives = 269/417 (64%), Gaps = 10/417 (2%)
 Frame = -3

Query: 1223 RQWSSESG---GG------SPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQ 1071
            RQWSSESG   GG      SPARG H RSSS SGISNIKRT                ASQ
Sbjct: 13   RQWSSESGTTGGGPASPAMSPARGHHSRSSSVSGISNIKRTQNFAAKAAAQRLAQVMASQ 72

Query: 1070 AAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKIS 891
             A                     R++             +   NNGS   R   PS K +
Sbjct: 73   TA---------DDDDDDQDDLGFRYSAPPPISL------SRNVNNGS---RLAAPSAKTT 114

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNRQR 711
            RS S  +  N L+     RSTSTGR S+              L+T         P+  QR
Sbjct: 115  RSPSPGLARNFLEDTSSVRSTSTGRSSISHHSLPVAPPKTT-LRTATSMPPLDPPT--QR 171

Query: 710  EKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKEL 531
            +K+FS D  R + KD G++R +SALRDELD+LQEENENILEKLRL EE C+ AE RV+EL
Sbjct: 172  DKRFSSDTVRFSTKDSGNQREASALRDELDILQEENENILEKLRLEEERCKEAETRVREL 231

Query: 530  EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKNEA 351
            EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAK +K G D E+ SL+S VK AK E 
Sbjct: 232  EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKQSKGGGDKEIESLKSEVKKAKEET 291

Query: 350  ATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMS 171
             +VV+ L G E +VKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC DIA++
Sbjct: 292  TSVVQHLHGVEHDVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICMDIAVT 351

Query: 170  KHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQ-RGKLVEDFSDLTGEGNIESM 3
            K+E+WSSLAPLP EIV+SAGQKAKE+  +KGD +P+ R  LV D SDLTGEGNIESM
Sbjct: 352  KYEHWSSLAPLPFEIVISAGQKAKEEFSQKGDLDPESRSNLVPDISDLTGEGNIESM 408


>ref|XP_004140652.1| PREDICTED: uncharacterized protein LOC101205430 [Cucumis sativus]
          Length = 535

 Score =  378 bits (971), Expect = e-102
 Identities = 234/417 (56%), Positives = 269/417 (64%), Gaps = 10/417 (2%)
 Frame = -3

Query: 1223 RQWSSESG---GG------SPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQ 1071
            RQWSSESG   GG      SPARG H RSSS SGISNIKRT                ASQ
Sbjct: 13   RQWSSESGTTGGGPASPAMSPARGHHSRSSSVSGISNIKRTQNFAAKAAAQRLAQVMASQ 72

Query: 1070 AAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKIS 891
             A                     R++             +   NNGS   R   PS K +
Sbjct: 73   TA---------DDDDDDQDDLGFRYSAPPPISL------SRNVNNGS---RLAAPSAKTT 114

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNRQR 711
            RS S  +  N L+     RSTSTGR S+              L+T         P+  QR
Sbjct: 115  RSPSPGLARNFLEDTSSVRSTSTGRSSISHHSLPVAPPKTT-LRTATSMPPLDPPT--QR 171

Query: 710  EKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKEL 531
            +K+FS D  R + KD G++R +SALRDELD+LQEENENILEKLRL EE C+ AE RV+EL
Sbjct: 172  DKRFSSDTVRFSTKDSGNQREASALRDELDILQEENENILEKLRLEEERCKEAETRVREL 231

Query: 530  EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKNEA 351
            EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAK +K G D E+ SL+S VK AK E 
Sbjct: 232  EKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKQSKGGGDKEIESLKSEVKKAKEET 291

Query: 350  ATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMS 171
             +VV+ L G E +VKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC DIA++
Sbjct: 292  TSVVQHLHGVEHDVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICMDIAVT 351

Query: 170  KHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQ-RGKLVEDFSDLTGEGNIESM 3
            K+E+WSSLAPLP EIV+SAGQKAKE+  +KGD +P+ R  LV D SDLTGEGNIESM
Sbjct: 352  KYEHWSSLAPLPFEIVISAGQKAKEEFSQKGDLDPESRSNLVPDISDLTGEGNIESM 408


>ref|XP_006483008.1| PREDICTED: myosin-9-like isoform X2 [Citrus sinensis]
          Length = 543

 Score =  375 bits (962), Expect = e-101
 Identities = 228/420 (54%), Positives = 269/420 (64%), Gaps = 13/420 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----PARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVX 1059
            RQW SESGG S     PAR  H RSSS+SG+S+IKR                 ASQ A  
Sbjct: 13   RQWGSESGGTSSPAMSPARHHHARSSSSSGLSSIKRNQNVAAKAAAQRLAQVMASQTA-- 70

Query: 1058 XXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGA----NNGSGGVRPVIPSPKIS 891
                               R++              NGA    N G+   +P + S +I+
Sbjct: 71   -------DDDEDDDDDLGFRYSAPPPLALSRSR-NVNGASIAGNAGATSTKPAVTSSRIN 122

Query: 890  RSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXS---LKTXXXXXXXXXPSN 720
            RS S A+  N++D     RSTS GRPSM                 L+T         P N
Sbjct: 123  RSPSPALGRNVVDEPTSVRSTSAGRPSMSHCAAAAPPVPQNKPLPLRTAVSLPPIDPPKN 182

Query: 719  RQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARV 540
              RE++       LNLKD GD+R +S LRDELDMLQEENENIL KLRL EE CE AEARV
Sbjct: 183  LHREQR------NLNLKDNGDQREASVLRDELDMLQEENENILNKLRLEEERCEEAEARV 236

Query: 539  KELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAK 360
            +ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S ++  K
Sbjct: 237  RELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQNKDEVDKEIVSLRSELENTK 296

Query: 359  NEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDI 180
            ++ A V++QLR  +SEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+     D+
Sbjct: 297  DDTAAVLQQLRAADSEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKY----DV 352

Query: 179  AMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKG-DNNPQRGKLVEDFSDLTGEGNIESM 3
            A+SK+EYWSSLAPLP E+V+SAGQKAKE+CWEKG D+N +R KL+ D +DLTGEGNIESM
Sbjct: 353  AVSKYEYWSSLAPLPFEVVISAGQKAKEECWEKGDDDNEKRSKLLVDINDLTGEGNIESM 412


>gb|EOX99658.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508707763|gb|EOX99659.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 464

 Score =  372 bits (956), Expect = e-100
 Identities = 226/421 (53%), Positives = 265/421 (62%), Gaps = 14/421 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----------PARGS--HVRSSSASGISNIKRTXXXXXXXXXXXXXXX 1083
            R+WSS+SG GS           PAR    H RSSSA+GIS+IKRT               
Sbjct: 13   RRWSSDSGSGSTGAAVDSPTLSPARHQPHHSRSSSATGISSIKRTQNFAAKAAAQRLAQV 72

Query: 1082 XASQAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPS 903
             ASQ                       R++             T  A  G  G +  + S
Sbjct: 73   MASQTT-------DDDDDENDGDDLGFRYSAPPPLALSRNVNAT--ATTGGAGNKAAMNS 123

Query: 902  PKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPS 723
             +I RS S A+  N L+     RSTS GR  +             +            P 
Sbjct: 124  TRIGRSPSPALARNFLEEAPTVRSTSAGRSPVSLRVAPPVPPPSKTSLRTAVSLPSEPPK 183

Query: 722  NRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEAR 543
            N+Q EK+F+ D+G  N KD GD+  +SALRDELDMLQEENEN+L+KLR  EE C+  EAR
Sbjct: 184  NQQPEKRFASDIG-FNSKDTGDQHEASALRDELDMLQEENENVLDKLRHEEEQCKDVEAR 242

Query: 542  VKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIA 363
            V+ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S V+ A
Sbjct: 243  VRELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQTKDVVDTEILSLRSEVENA 302

Query: 362  KNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPD 183
            K+E   V+ QL G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D
Sbjct: 303  KDEVTAVIRQLHGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGICAD 362

Query: 182  IAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDN-NPQRGKLVEDFSDLTGEGNIES 6
            IA+SK+EYWSSLAPLP E+V+SAGQKAKE+  EKGD+ N +R KLVED +DLTGEGNIES
Sbjct: 363  IALSKYEYWSSLAPLPFEVVVSAGQKAKEEFSEKGDDENEKRSKLVEDLNDLTGEGNIES 422

Query: 5    M 3
            M
Sbjct: 423  M 423


>gb|EOX99655.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 475

 Score =  372 bits (956), Expect = e-100
 Identities = 226/421 (53%), Positives = 265/421 (62%), Gaps = 14/421 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----------PARGS--HVRSSSASGISNIKRTXXXXXXXXXXXXXXX 1083
            R+WSS+SG GS           PAR    H RSSSA+GIS+IKRT               
Sbjct: 13   RRWSSDSGSGSTGAAVDSPTLSPARHQPHHSRSSSATGISSIKRTQNFAAKAAAQRLAQV 72

Query: 1082 XASQAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPS 903
             ASQ                       R++             T  A  G  G +  + S
Sbjct: 73   MASQTT-------DDDDDENDGDDLGFRYSAPPPLALSRNVNAT--ATTGGAGNKAAMNS 123

Query: 902  PKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPS 723
             +I RS S A+  N L+     RSTS GR  +             +            P 
Sbjct: 124  TRIGRSPSPALARNFLEEAPTVRSTSAGRSPVSLRVAPPVPPPSKTSLRTAVSLPSEPPK 183

Query: 722  NRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEAR 543
            N+Q EK+F+ D+G  N KD GD+  +SALRDELDMLQEENEN+L+KLR  EE C+  EAR
Sbjct: 184  NQQPEKRFASDIG-FNSKDTGDQHEASALRDELDMLQEENENVLDKLRHEEEQCKDVEAR 242

Query: 542  VKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIA 363
            V+ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S V+ A
Sbjct: 243  VRELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQTKDVVDTEILSLRSEVENA 302

Query: 362  KNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPD 183
            K+E   V+ QL G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D
Sbjct: 303  KDEVTAVIRQLHGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGICAD 362

Query: 182  IAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDN-NPQRGKLVEDFSDLTGEGNIES 6
            IA+SK+EYWSSLAPLP E+V+SAGQKAKE+  EKGD+ N +R KLVED +DLTGEGNIES
Sbjct: 363  IALSKYEYWSSLAPLPFEVVVSAGQKAKEEFSEKGDDENEKRSKLVEDLNDLTGEGNIES 422

Query: 5    M 3
            M
Sbjct: 423  M 423


>gb|EOX99654.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 511

 Score =  372 bits (956), Expect = e-100
 Identities = 226/421 (53%), Positives = 265/421 (62%), Gaps = 14/421 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----------PARGS--HVRSSSASGISNIKRTXXXXXXXXXXXXXXX 1083
            R+WSS+SG GS           PAR    H RSSSA+GIS+IKRT               
Sbjct: 13   RRWSSDSGSGSTGAAVDSPTLSPARHQPHHSRSSSATGISSIKRTQNFAAKAAAQRLAQV 72

Query: 1082 XASQAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPS 903
             ASQ                       R++             T  A  G  G +  + S
Sbjct: 73   MASQTT-------DDDDDENDGDDLGFRYSAPPPLALSRNVNAT--ATTGGAGNKAAMNS 123

Query: 902  PKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPS 723
             +I RS S A+  N L+     RSTS GR  +             +            P 
Sbjct: 124  TRIGRSPSPALARNFLEEAPTVRSTSAGRSPVSLRVAPPVPPPSKTSLRTAVSLPSEPPK 183

Query: 722  NRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEAR 543
            N+Q EK+F+ D+G  N KD GD+  +SALRDELDMLQEENEN+L+KLR  EE C+  EAR
Sbjct: 184  NQQPEKRFASDIG-FNSKDTGDQHEASALRDELDMLQEENENVLDKLRHEEEQCKDVEAR 242

Query: 542  VKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIA 363
            V+ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S V+ A
Sbjct: 243  VRELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQTKDVVDTEILSLRSEVENA 302

Query: 362  KNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPD 183
            K+E   V+ QL G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D
Sbjct: 303  KDEVTAVIRQLHGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGICAD 362

Query: 182  IAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDN-NPQRGKLVEDFSDLTGEGNIES 6
            IA+SK+EYWSSLAPLP E+V+SAGQKAKE+  EKGD+ N +R KLVED +DLTGEGNIES
Sbjct: 363  IALSKYEYWSSLAPLPFEVVVSAGQKAKEEFSEKGDDENEKRSKLVEDLNDLTGEGNIES 422

Query: 5    M 3
            M
Sbjct: 423  M 423


>gb|EOX99653.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 567

 Score =  372 bits (956), Expect = e-100
 Identities = 226/421 (53%), Positives = 265/421 (62%), Gaps = 14/421 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----------PARGS--HVRSSSASGISNIKRTXXXXXXXXXXXXXXX 1083
            R+WSS+SG GS           PAR    H RSSSA+GIS+IKRT               
Sbjct: 13   RRWSSDSGSGSTGAAVDSPTLSPARHQPHHSRSSSATGISSIKRTQNFAAKAAAQRLAQV 72

Query: 1082 XASQAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPS 903
             ASQ                       R++             T  A  G  G +  + S
Sbjct: 73   MASQTT-------DDDDDENDGDDLGFRYSAPPPLALSRNVNAT--ATTGGAGNKAAMNS 123

Query: 902  PKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPS 723
             +I RS S A+  N L+     RSTS GR  +             +            P 
Sbjct: 124  TRIGRSPSPALARNFLEEAPTVRSTSAGRSPVSLRVAPPVPPPSKTSLRTAVSLPSEPPK 183

Query: 722  NRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEAR 543
            N+Q EK+F+ D+G  N KD GD+  +SALRDELDMLQEENEN+L+KLR  EE C+  EAR
Sbjct: 184  NQQPEKRFASDIG-FNSKDTGDQHEASALRDELDMLQEENENVLDKLRHEEEQCKDVEAR 242

Query: 542  VKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIA 363
            V+ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S V+ A
Sbjct: 243  VRELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQTKDVVDTEILSLRSEVENA 302

Query: 362  KNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPD 183
            K+E   V+ QL G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D
Sbjct: 303  KDEVTAVIRQLHGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGICAD 362

Query: 182  IAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDN-NPQRGKLVEDFSDLTGEGNIES 6
            IA+SK+EYWSSLAPLP E+V+SAGQKAKE+  EKGD+ N +R KLVED +DLTGEGNIES
Sbjct: 363  IALSKYEYWSSLAPLPFEVVVSAGQKAKEEFSEKGDDENEKRSKLVEDLNDLTGEGNIES 422

Query: 5    M 3
            M
Sbjct: 423  M 423


>gb|EOX99656.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707761|gb|EOX99657.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 422

 Score =  370 bits (951), Expect = e-100
 Identities = 225/420 (53%), Positives = 264/420 (62%), Gaps = 14/420 (3%)
 Frame = -3

Query: 1223 RQWSSESGGGS-----------PARGS--HVRSSSASGISNIKRTXXXXXXXXXXXXXXX 1083
            R+WSS+SG GS           PAR    H RSSSA+GIS+IKRT               
Sbjct: 13   RRWSSDSGSGSTGAAVDSPTLSPARHQPHHSRSSSATGISSIKRTQNFAAKAAAQRLAQV 72

Query: 1082 XASQAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPS 903
             ASQ                       R++             T  A  G  G +  + S
Sbjct: 73   MASQTT-------DDDDDENDGDDLGFRYSAPPPLALSRNVNAT--ATTGGAGNKAAMNS 123

Query: 902  PKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPS 723
             +I RS S A+  N L+     RSTS GR  +             +            P 
Sbjct: 124  TRIGRSPSPALARNFLEEAPTVRSTSAGRSPVSLRVAPPVPPPSKTSLRTAVSLPSEPPK 183

Query: 722  NRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEAR 543
            N+Q EK+F+ D+G  N KD GD+  +SALRDELDMLQEENEN+L+KLR  EE C+  EAR
Sbjct: 184  NQQPEKRFASDIG-FNSKDTGDQHEASALRDELDMLQEENENVLDKLRHEEEQCKDVEAR 242

Query: 542  VKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIA 363
            V+ELEKQVAALGEGVSLEAKLLSRKEAALRQREAALK+AK  KD VD E+ SL+S V+ A
Sbjct: 243  VRELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKDAKQTKDVVDTEILSLRSEVENA 302

Query: 362  KNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPD 183
            K+E   V+ QL G ESEVKALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC D
Sbjct: 303  KDEVTAVIRQLHGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGICAD 362

Query: 182  IAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDN-NPQRGKLVEDFSDLTGEGNIES 6
            IA+SK+EYWSSLAPLP E+V+SAGQKAKE+  EKGD+ N +R KLVED +DLTGEGNIES
Sbjct: 363  IALSKYEYWSSLAPLPFEVVVSAGQKAKEEFSEKGDDENEKRSKLVEDLNDLTGEGNIES 422


>ref|XP_002512652.1| conserved hypothetical protein [Ricinus communis]
            gi|223548613|gb|EEF50104.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 506

 Score =  370 bits (950), Expect = e-100
 Identities = 226/423 (53%), Positives = 267/423 (63%), Gaps = 16/423 (3%)
 Frame = -3

Query: 1223 RQWSSES---GGG-------SPARGSHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXAS 1074
            RQWSSES   G G       SP R  H RSSS SGIS+IKR                 AS
Sbjct: 13   RQWSSESSNPGTGPSSPAAMSPGRHHHARSSSVSGISSIKRNQNFAAKAAAQRLAQVMAS 72

Query: 1073 QAAVXXXXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVRPVIPSPKI 894
            Q A                     R N            + N                  
Sbjct: 73   QTADDDEEDDLGFRYSAPPPFSLSRNNNPTKPAAVPSSTRIN------------------ 114

Query: 893  SRSSSDAVTGNLLD-GVLPARSTSTGRPSMXXXXXXXXXXXXXS---LKTXXXXXXXXXP 726
            +RSSS ++  NL+D      RSTSTGR SM             S   L+T         P
Sbjct: 115  NRSSSPSLARNLVDESPSSVRSTSTGRSSMSLKTAPPPMPPPPSKGSLRTAVSLPPLEPP 174

Query: 725  SNRQRE-KKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAE 549
             N Q++ K+F  D+G LN KD GD+R +SALRDELDMLQEENEN+L+KLRL E+ C+ AE
Sbjct: 175  KNGQKDGKRFLTDVGLLNSKDTGDQREASALRDELDMLQEENENMLQKLRLEEDRCKEAE 234

Query: 548  ARVKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVK 369
             RV+ELEKQVAALGEGVSLEAKLLSRKEA+LRQREAALK+AK  ++ +D E+SS++S V+
Sbjct: 235  TRVRELEKQVAALGEGVSLEAKLLSRKEASLRQREAALKDAK-QRNVIDKEISSIRSEVE 293

Query: 368  IAKNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGIC 189
             AK EA   V QL G ESE+KAL+ MTQRM+LTQ EMEEVVLKRCWLARYWGLAA+ GIC
Sbjct: 294  NAKEEATAAVRQLHGAESELKALQLMTQRMILTQKEMEEVVLKRCWLARYWGLAARYGIC 353

Query: 188  PDIAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKG-DNNPQRGKLVEDFSDLTGEGNI 12
            PD+A+SKHEYWSSLAPLP E+V+SAGQKAKE+CWEKG D+N ++ K+V+D SDLTGEGNI
Sbjct: 354  PDVALSKHEYWSSLAPLPFEVVVSAGQKAKEECWEKGDDSNEKKSKIVQDLSDLTGEGNI 413

Query: 11   ESM 3
            ESM
Sbjct: 414  ESM 416


>ref|XP_002511106.1| conserved hypothetical protein [Ricinus communis]
            gi|223550221|gb|EEF51708.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 555

 Score =  358 bits (918), Expect = 4e-96
 Identities = 216/410 (52%), Positives = 257/410 (62%), Gaps = 3/410 (0%)
 Frame = -3

Query: 1223 RQWS-SESGGGSPARG-SHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVXXXX 1050
            RQWS S +G  SPA   +H  S   +G+S IKRT                ASQ A     
Sbjct: 13   RQWSGSSTGSSSPAMSPAHPSSRLGTGMSTIKRTQNVAAKAAAQRLAQVMASQTA----- 67

Query: 1049 XXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNGSGGVR-PVIPSPKISRSSSDA 873
                            RF+              +  NN +  +  P I   + +RS S A
Sbjct: 68   ----DDDDDEDDDLGFRFSAPPPPAPSSFSNNNHSGNNNNNSITAPSISLARPNRSPSPA 123

Query: 872  VTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNRQREKKFSP 693
            +  N  + V   RS+S GRPS+             S++T         PSNR REK+F+ 
Sbjct: 124  LGRNFAEHVPSVRSSSAGRPSISVRTGTLVPPTKSSIRTPISIPAIEPPSNRSREKRFTS 183

Query: 692  DLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKELEKQVAA 513
            D+G+L LKD GD+R +SALRDELDMLQEENE IL+KLRL EE  E AEAR +ELEKQVAA
Sbjct: 184  DVGQLKLKDAGDQREASALRDELDMLQEENEVILDKLRLTEERREEAEARARELEKQVAA 243

Query: 512  LGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKNEAATVVEQ 333
            LGEGVSLEAKLLSRKEAALRQREAALK AK AK G D E+++L+S ++  K  AA  VEQ
Sbjct: 244  LGEGVSLEAKLLSRKEAALRQREAALKAAKQAKGGKDEEIAALRSELENLKEGAAVAVEQ 303

Query: 332  LRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMSKHEYWS 153
             R  ESE KALR+MTQRM+LTQ EMEEVVLKRCWLARYW LA Q GIC DIA +KHE+WS
Sbjct: 304  FREAESEAKALRTMTQRMILTQEEMEEVVLKRCWLARYWALAVQHGICSDIAGTKHEHWS 363

Query: 152  SLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSDLTGEGNIESM 3
            +LAPLP E+V+SAGQKAKE+    G ++P RGK V D SDL+GEGNIESM
Sbjct: 364  ALAPLPFEVVISAGQKAKEE--SLGGDDPDRGKSVRDLSDLSGEGNIESM 411


>gb|EOY22608.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 561

 Score =  357 bits (917), Expect = 5e-96
 Identities = 217/414 (52%), Positives = 261/414 (63%), Gaps = 7/414 (1%)
 Frame = -3

Query: 1223 RQWS---SESGGGSPARG-SHVRSSSASGISNIKRTXXXXXXXXXXXXXXXXASQAAVXX 1056
            RQWS   S SG  SPA   S +   +A G+S IKRT                ASQ     
Sbjct: 13   RQWSGGSSSSGSSSPAHPQSRLHPGAAGGMSTIKRTQNVAAKAAAQRLAQVMASQTP--- 69

Query: 1055 XXXXXXXXXXXXXXXXXLRFNXXXXXXXXXXPVKTNGANNG-SGGVRPVIPSPKISRSSS 879
                              RF            V T+ +N+  +    P I   + +RS S
Sbjct: 70   -------DDDEEDDDLGFRFGGPP--------VPTSFSNSSLNHSTLPAISVTRPNRSPS 114

Query: 878  DAVTGNLLDGVLPARSTSTGRP--SMXXXXXXXXXXXXXSLKTXXXXXXXXXPSNRQREK 705
             A+  N ++     RSTS GRP  SM             S++T         P NR R+K
Sbjct: 115  PALGRNFVEHAPSVRSTSAGRPAISMRSTAPTLMPPSRTSVRTPVTIPPIDPP-NRSRDK 173

Query: 704  KFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAAEARVKELEK 525
            +F+ D+G+L  KD GD+R +SALRDELDMLQEENEN+L+KLR AEE  E  EAR +ELEK
Sbjct: 174  RFTADVGQLKAKDTGDQREASALRDELDMLQEENENLLDKLRSAEERREEGEARARELEK 233

Query: 524  QVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNVKIAKNEAAT 345
            QVA+LGEGVSLEAKLLSRKEAALRQREAALK AK  KDG + E+++L+S ++  K+ AAT
Sbjct: 234  QVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQTKDGREEEIAALRSELENLKDGAAT 293

Query: 344  VVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGICPDIAMSKH 165
             VEQL   +SE KALRSMTQRM+LTQ EMEEVVLKRCWLARYWGLA Q GIC DIA+SKH
Sbjct: 294  AVEQLHEAKSETKALRSMTQRMILTQEEMEEVVLKRCWLARYWGLAVQHGICADIAVSKH 353

Query: 164  EYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSDLTGEGNIESM 3
            EYWS+LAPLP E+V+SAGQKAKE+ W++G  +P R KLV D +DLTGEGNIESM
Sbjct: 354  EYWSALAPLPFEVVVSAGQKAKEEAWDRGGGDPDRSKLVRDLNDLTGEGNIESM 407


>ref|XP_004145144.1| PREDICTED: uncharacterized protein LOC101221393 [Cucumis sativus]
            gi|449473860|ref|XP_004154004.1| PREDICTED:
            uncharacterized protein LOC101206186 [Cucumis sativus]
            gi|449518639|ref|XP_004166344.1| PREDICTED:
            uncharacterized LOC101206186 [Cucumis sativus]
          Length = 551

 Score =  353 bits (907), Expect = 7e-95
 Identities = 191/303 (63%), Positives = 225/303 (74%)
 Frame = -3

Query: 911  IPSPKISRSSSDAVTGNLLDGVLPARSTSTGRPSMXXXXXXXXXXXXXSLKTXXXXXXXX 732
            I  P+I+RS S A+  N+++ V   RSTSTGRPSM              LKT        
Sbjct: 109  ISGPRINRSPSPALGRNIVEIVPQVRSTSTGRPSMSVRVNPNVPPSKQPLKTSVSIPPIE 168

Query: 731  XPSNRQREKKFSPDLGRLNLKDVGDRRASSALRDELDMLQEENENILEKLRLAEESCEAA 552
             PSNR  +++F+ D+G+   KD GD+R +SALRDELDMLQEENENILEKLRLAEE  E A
Sbjct: 169  PPSNRIGDRRFASDIGQAKSKDAGDQREASALRDELDMLQEENENILEKLRLAEEKREEA 228

Query: 551  EARVKELEKQVAALGEGVSLEAKLLSRKEAALRQREAALKEAKVAKDGVDVEMSSLQSNV 372
            EAR + LEKQVA LGEGVSLEAKLLSRKEAALRQREAALK A+  KD  + E+++L+S +
Sbjct: 229  EARARMLEKQVATLGEGVSLEAKLLSRKEAALRQREAALKAAQPTKDSRNEELAALRSEI 288

Query: 371  KIAKNEAATVVEQLRGTESEVKALRSMTQRMVLTQHEMEEVVLKRCWLARYWGLAAQLGI 192
            +  K E+    EQLR  ESE KALR MTQRMVLTQ EMEEVVLKRCWLARYWGLA Q GI
Sbjct: 289  ENLKEESVAATEQLREAESEAKALRVMTQRMVLTQEEMEEVVLKRCWLARYWGLAVQYGI 348

Query: 191  CPDIAMSKHEYWSSLAPLPLEIVLSAGQKAKEQCWEKGDNNPQRGKLVEDFSDLTGEGNI 12
            C DIA+SKHEYWSSLAPLP E+V+SAGQKAKE+   +G N+  R KL++D +DL+GEGNI
Sbjct: 349  CADIAISKHEYWSSLAPLPFEVVISAGQKAKEE--PEGRNDQDRSKLIQDINDLSGEGNI 406

Query: 11   ESM 3
            ESM
Sbjct: 407  ESM 409

